Fieldwork and Early Literary Texts

Project Details


The digital representation of minority language fieldwork and early literary texts for searchable corpora. Provides original, possible translations and normalized text with annotation for lemma, part of speech and other possible morphological analyses. Additional golden-standard annotation for syntax universal dependencies is forthcoming. This is incremental for syntactic research and language technological development.
StatusNot started


  • Suomalais-Ugrilainen Seura: €6,000.00
  • Suomalais-Ugrilainen Seura / Société Finno-Ougrienne: €9,000.00

Fields of Science

  • 6121 Languages
  • Erzya language
  • Moksha language
  • Komi-Zyrian
  • dialect
  • Heikki Paasonen
  • Fieldwork
  • Folklore
  • Uotila
  • Mordvin languages
  • digitization
  • korp search
  • open-source
  • German translations
  • Russian translations
  • annotation
  • morphology
  • meta-data
  • Giellatekno
  • Kielipankki


Experimental Treebanking for Minority Languages with Finite-State Descriptions

Rueter, J., Tyers, F. M., Klementeva, J. & Erina, O.

01/10/2017 → …

Project: Other project

Komi  morfologisk  analyseprogram

Rueter, J., Trosterud, T. & Gerstenberg, C.

01/01/2010 → …

Project: Research project

Creation of Morphological Parsers for Minority Finno-Ugrian Languages

Rueter, J., Salo, M., Rantakaulio, T., Kuprina, J., Blumberga, R. & Soosaar, S.


Project: Research project

Research Output

Келу, келу, акша келу!

Translated title of the contribution: A birch, a birch, a white birch!Rueter, J. M., 2019

Research output: Non-textual formSoftwareScientific


Rueter, J. M., 2018, (Submitted) The Uralic Languages. Routledge

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

Koltansaamen mediawiki-sanakirja

Translated title of the contribution: Skolt Sami mediawiki dictionaryHämäläinen, M. K., Rueter, J. M. & Lehtinen, M. (ed.), 2017

Research output: Non-textual formSoftwareScientific


  • 9 Oral presentation
  • 3 Organisation and participation in conferences, workshops, courses, seminars
  • 2 Invited talk
  • 2 Academic visit to other institution

On Editing Dictionaries for Uralic Languages in an Online Environment

Khalid Alnajjar (Speaker), Mika Hämäläinen (Speaker), Jack Rueter (Speaker)
10 Jan 2020

Activity: Talk or presentation typesOral presentation

NorthEuraLex 0.9

Johannes Dellert (Advisor), Jack Rueter (Advisor), Olga Erina (Consultant), Elena Klementieva (Consultant)
Apr 2019 → …

Activity: Consultancy typesConsultancy


Kindred People's Day Conference 2019

Jack Rueter (Speaker)
11 Oct 2019

Activity: Talk or presentation typesOral presentation