The contents of this card can be edited in the source repository.
Data card
Description
FLORES+ in Genoese Ligurian
License
CC-BY-SA-4.0
Attribution
@article{nllb-22,
title = {No Language Left Behind: Scaling Human-Centered Machine Translation},
author = {{NLLB Team} and Costa-jussà, Marta R. and Cross, James and Çelebi, Onur and Elbayad, Maha and Heafield, Kenneth and Heffernan, Kevin and Kalbassi, Elahe and Lam, Janice and Licht, Daniel and Maillard, Jean and Sun, Anna and Wang, Skyler and Wenzek, Guillaume and Youngblood, Al and Akula, Bapi and Barrault, Loic and Mejia-Gonzalez, Gabriel and Hansanti, Prangthip and Hoffman, John and Jarrett, Semarley and Sadagopan, Kaushik Ram and Rowe, Dirk and Spruit, Shannon and Tran, Chau and Andrews, Pierre and Ayan, Necip Fazil and Bhosale, Shruti and Edunov, Sergey and Fan, Angela and Gao, Cynthia and Goswami, Vedanuj and Guzmán, Francisco and Koehn, Philipp and Mourachko, Alexandre and Ropers, Christophe and Saleem, Safiyyah and Schwenk, Holger and Wang, Jeff},
year = {2022},
eprint = {arXiv:1902.01382},
}
Language codes
- ISO 639-3: lij
- ISO 15924: Latn
- Glottocode: geno1240
Additional language information
The data is in the Genoese dialect, using traditional spelling as codified in the following reference dictionaries:
- S. Lusito (2022) Dizionario italiano-genovese, Programma, ISBN 9788866438205.
- The Council for Ligurian Linguistic Heritage’s online Italian-Genoese dictionary.
- E. Autelli, S. Lusito, C. Konecny, F. Toso (2018-2021) GEPHRAS.
Reference grammar:
- F. Toso (1997) Grammatica del genovese: varietà urbana e di koinè, Le Mani, ISBN 9788880120728.
Workflow
This data was released as part of the FLORES-200 dataset. It has undergone minor spelling and syntactic fixes following community feedback and additional quality assessment. Please refer to the paper for further information.