Fieldwork and Early Literary Texts

Project: Research project

Project Details

Description (abstract)

The digital representation of minority language fieldwork and early literary texts for searchable corpora. Provides original, possible translations and normalized text with annotation for lemma, part of speech and other possible morphological analyses. Additional golden-standard annotation for syntax universal dependencies is forthcoming. This is incremental for syntactic research and language technological development.
StatusNot started


  • Suomalais-Ugrilainen Seura: €6,000.00
  • Suomalais-Ugrilainen Seura / Société Finno-Ougrienne: €3,100.00
  • Suomalais-Ugrilainen Seura / Société Finno-Ougrienne: €9,000.00

Fields of Science

  • 6121 Languages
  • Erzya language
  • Moksha language
  • Komi-Zyrian
  • dialect
  • Heikki Paasonen
  • Fieldwork
  • Folklore
  • Uotila
  • Mordvin languages
  • digitization
  • korp search
  • open-source
  • German translations
  • Russian translations
  • annotation
  • morphology
  • meta-data
  • Giellatekno
  • Kielipankki
  • Moksha Mordvin

    Rueter, J. M., 31 Mar 2023, The Uralic Languages. Abondolo, D. & Valijärvi, R-L. (eds.). 2nd Edition ed. Abingdon: Routledge, 46 p. (Routledge Language Family Series).

    Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

  • SUS Fieldwork: SUST 77 v0.1, Korp

    Translated title of the contribution: Finno-Ugrian Society Fieldwork: SUST 77 v0.1, KorpRueter, J., Erina, O. & Axelson, E., Aug 2022

    Research output: Non-textual formSoftwareScientific

  • UD_Moksha-JR 2.6

    Rueter, J., Nivre, J., Zeman, D., Kabaeva, N. & Levina, M., 15 May 2020

    Research output: Non-textual formSoftwareScientificpeer-review

    Open Access