Open Language Data Initiative

Dataset card

Description

Seed data in Spanish (Latin American).

License

CC-BY-SA-4.0

Attribution

@inproceedings{wmt24-seed-spanish,
    title = "Spanish Corpus and Provenance with Computer-Aided Translation for the {WMT24} {OLDI} Shared Task",
    author = "Jose Cols",
    booktitle = "Proceedings of the Ninth Conference on Machine Translation",
    month = nov,
    year = "2024",
    address = "Miami, USA",
    publisher = "Association for Computational Linguistics"
}

Language codes

Additional language information

Workflow

The data was translated from English by ten professional translators, all native speakers of the target language and proficient in English. All of the data was reviewed by a team of three translators.

Additional guidelines