Корпус национальных мордовских языков: принципы разработки и перспективы функционирования/ действия

Translated title of the contribution: National Corpus of the Mordvin Languages: Principles of Development and Perspectives of its Functionality/Usability

Research output: Conference materialsPaper


This paper addresses the issue of a national corpus for language documentation of the Moksha and Erzya literary languages in coordination with dialect archives comprising over 80 years of fieldwork (inclusive Shoksha, Karatai).
It shows necessary development in computer-assisted research tools and ongoing research aligned with a consistent and systematic open research project.
Original languageRussian
Publication statusAccepted/In press - 2020
MoE publication typeNot Eligible

Fields of Science

  • 6121 Languages
  • Erzya language
  • Moksha language
  • Karatai
  • Shoksha
  • Terukhan
  • Dialect texts
  • Literary texts
  • Normalization
  • Language diversity
  • universal dependencies
  • Giellalt
  • HFST
  • language documentation
  • Mordvin
  • Uralic
  • Shallow-transfer translation machine
  • lexica
  • morphology
  • syntax
  • language text corpora
  • open-source software


Experimental Treebanking for the Minority Moksha Language and Finite-State Descriptions

Rueter, J., Levina, M. & Kabaeva, N.

07/12/2018 → …

Project: Other project

Experimental Treebanking for Minority Languages with Finite-State Descriptions

Rueter, J., Tyers, F. M., Klementeva, J. & Erina, O.

01/10/2017 → …

Project: Other project

Cite this