Open Language Data Initiative


The contents of this card can be edited in the source repository.

Dataset card for Khakas

Description

FLORES+ in Khakas.

License

CC-BY-SA-4.0

Attribution

See the list of the contributors below

Language codes

Additional language information

The Khakas language (Khakas tili, хакас тілі) is the ethnic language of the Khakas, the indigenous population of the Republic of Khakassia located in southern Siberia. It belongs to the Siberian subsubgroup of the Turkic language family. Khakas is the result of the historical consolidation of dialects. The Kachin and Sagai dialects were chosen as the base of the Khakas literary language based on the greater number of their native speakers. Until 1917, the scientific and official literature used the names the language of the Abakan or Minusinsk Tatars, the language of the Abakan or Yenisei Turks [Донидзе, 1997, p. 459]. The appearance of the name "Khakas tili" (Khakas language) is connected with the adoption in 1917 at the II Congress of the Khakas people on the initiative of S. D. Maynagashev decided to use the ethnonym "Khakasy", which is common to all. "The Congress unanimously decided to return to the people their ancient self–name - Khakasy. The resolutions of the 1918 Uyezd Congress of Soviets confirmed the popular decision" [Кызласов, 1994, p. 8].

N. A. Baskakov attributed the Khakas language to the Eastern Hunnic branch, the Uighur group, within which it, together with Kamasin, Chulym, Shorsky, Sary-Uighur and the northern dialects of the Altaic language, forms a special Khakas subgroup [Баскаков, 1975, p. 3].

According to the All-Russian Population Census of 2020, the total number of native speakers of the Khakas language in Russia is 61,365 people, of whom approximately 44% indicated proficiency in the Khakas language as their mother tongue, which is 14% lower than the 2010 census (58%) [Borgoyakova, 2025, p. 46]. This trajectory aligns with the classification of Khakas as a “Definitely Endangered” language by UNESCO [Moseley, 2010]. Such a transition poses a significant threat to intergenerational language transfer and complicates the preservation of linguistic heritage.

REFERENCES

  1. Донидзе Г. И. Хакасский язык //Языки мира. Тюркские языки. – 1997. – С. 459.
  2. Кызласов Л. Р., Кызласов И. Л. Ключевые вопросы истории хакасов //Земля Сибирская (Страницы истории и современность): сборник статей. Абакан–Москва: Эвтектика. – 1994.
  3. Баскаков Н. А. Грамматика хакасского языка. – Nauka, 1975. – №. 1.
  4. Borgoyakova T. G., Guseynova A. V. Ethnic and Linguistic Policy and Sociolinguistic Variability of Language Shift (Example of Republic of Khakassia) //Humanities and social sciences. – 2025. – №. 3 (122). – С. 44-53.
  5. Moseley C. (ed.). Atlas of the World's Languages in Danger. – Unesco, 2010.

Workflow

The FLORES+ dev and devtest datasets were translated into Khakas from Russian by the staff of KhRILLH (Khakas Research Institute of Language, Literature and History, ХакНИИЯЛИ) in 2024-2025. After translation, the texts were proofread twice by different editors. Irina Maksimovna Chebochakova (the most competent specialist in Russian-Khakass translation) was responsible for the final version.

LLM (Gemini, GPT, or other) were not used in the translation. The texts were validated to not contain homoglyphs.

Contributors

Director: Mainagasheva Nina Semyonovna, Director of KhRILLH, Candidate of Philological Sciences, [email protected]

Chief Editor: Chebochakova Irina Maksimovna, Head of the Language Sector of KhRILLH, Candidate of Philological Sciences, [email protected]

Developer: Adeshkin Vasily Ivanovich, Researcher at KhRILLH, graduate of MIPT, [email protected]

Translators:

  1. Abdina Raisa Petrovna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  2. Abumova Olga Dmitrievna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  3. Beloglazov Pyotr Egorovich, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  4. Burnakova Klara Nikolaevna, Senior Researcher at KhRILLH, Doctor of Philological Sciences
  5. Kirginekov Roman Gennadievich, Researcher at KhRILLH
  6. Kicheeva Kristina Vladimirovna, Researcher at KhRILLH
  7. Kyzlasov Artyom Samuilovich, Leading Researcher at KhRILLH, Candidate of Philological Sciences
  8. Kyzlasova Inga Ludovikovna, Associate Professor of the Department of Khakas Philology, KHSU, Candidate of Philological Sciences
  9. Mainagasheva Natalia Vladimirovna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  10. Mainagasheva Nina Semyonovna, Director of KhRILLH, Candidate of Philological Sciences
  11. Subrakova Viya Vasilyevna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  12. Torokova Evgeniya Semyonovna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  13. Chaptykova Yulia Innokentievna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  14. Chebodaeva Larisa Ilyinichna, Leading Researcher at KhRILLH, Candidate of Philological Sciences
  15. Chebochakova Irina Maksimovna, Head of the Language Sector of KhRILLH, Candidate of Philological Sciences
  16. Cheltygmasheva Larisa Viktorovna, Senior Researcher at KhRILLH, Candidate of Philological Sciences
  17. Chertykova Maria Dmitrievna, Leading researcher at KhRILLH, Doctor of Philological Sciences
  18. Chistobayeva Nadezhda Stepanovna, Head of the Folklore Sector of KhRILLH, Candidate of Philological Sciences
  19. Chochieva Alena Sergeevna, Head of the Manuscript Collection at KhRILLH, Candidate of Philological Sciences
  20. Shagdurova Olga Yurievna, Leading Researcher at KhRILLH, Candidate of Philological Sciences

Additional guidelines

For the translations, the following alphabet was used: абвгдежзийклмнопрстуфхцчшщъыьэюяёіғңҷӧӱ. It is strictly fixed and fully Cyrillic, unlike some other versions that could be found on the Internet.

Below is the detailed list of characters:

'а' \u0430 CYRILLIC SMALL LETTER A
'б' \u0431 CYRILLIC SMALL LETTER BE
'в' \u0432 CYRILLIC SMALL LETTER VE
'г' \u0433 CYRILLIC SMALL LETTER GHE
'д' \u0434 CYRILLIC SMALL LETTER DE
'е' \u0435 CYRILLIC SMALL LETTER IE
'ж' \u0436 CYRILLIC SMALL LETTER ZHE
'з' \u0437 CYRILLIC SMALL LETTER ZE
'и' \u0438 CYRILLIC SMALL LETTER I
'й' \u0439 CYRILLIC SMALL LETTER SHORT I
'к' \u043a CYRILLIC SMALL LETTER KA
'л' \u043b CYRILLIC SMALL LETTER EL
'м' \u043c CYRILLIC SMALL LETTER EM
'н' \u043d CYRILLIC SMALL LETTER EN
'о' \u043e CYRILLIC SMALL LETTER O
'п' \u043f CYRILLIC SMALL LETTER PE
'р' \u0440 CYRILLIC SMALL LETTER ER
'с' \u0441 CYRILLIC SMALL LETTER ES
'т' \u0442 CYRILLIC SMALL LETTER TE
'у' \u0443 CYRILLIC SMALL LETTER U
'ф' \u0444 CYRILLIC SMALL LETTER EF
'х' \u0445 CYRILLIC SMALL LETTER HA
'ц' \u0446 CYRILLIC SMALL LETTER TSE
'ч' \u0447 CYRILLIC SMALL LETTER CHE
'ш' \u0448 CYRILLIC SMALL LETTER SHA
'щ' \u0449 CYRILLIC SMALL LETTER SHCHA
'ъ' \u044a CYRILLIC SMALL LETTER HARD SIGN
'ы' \u044b CYRILLIC SMALL LETTER YERU
'ь' \u044c CYRILLIC SMALL LETTER SOFT SIGN
'э' \u044d CYRILLIC SMALL LETTER E
'ю' \u044e CYRILLIC SMALL LETTER YU
'я' \u044f CYRILLIC SMALL LETTER YA
'ё' \u0451 CYRILLIC SMALL LETTER IO
'і' \u0456 CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
'ғ' \u0493 CYRILLIC SMALL LETTER GHE WITH STROKE
'ң' \u04a3 CYRILLIC SMALL LETTER EN WITH DESCENDER
'ҷ' \u04b7 CYRILLIC SMALL LETTER CHE WITH DESCENDER
'ӧ' \u04e7 CYRILLIC SMALL LETTER O WITH DIAERESIS
'ӱ' \u04f1 CYRILLIC SMALL LETTER U WITH DIAERESIS