SUS Fieldwork: SUST 77 v0.1, Korp

Research output: Non-textual form, Software, Scientific


This Korp publication contains the first 25 Erzya-language folklore texts of the Finno-Ugrian Society's SUST 77. The texts consist of (1) original fieldwork transliterations, (2) automatically normalized Cyrillic text, and (3) original German-language translations. The texts have been aligned so that the normalized versions of the original transliterations are used in the search engine. The word forms have been analyzed using a finite-state analyzer developed by Jack Rueter in collaboration with universities in Tromsø, Norway; Saransk, Mordovia; and Turku, Finland. Subsequently, the words have been disambiguated and the sentences analyzed using a constraint-grammar description developed with the same consultancies and collaborations. In the search engine, one can locate words by string and by the feature values of the morphological analyses. Once a word form is located and selected, important bibliographical information is shown in the right margin along with a word analysis and a dependency tree representation. The entire sentence is also shown there in the original transliteration along with its German-language translation. The bibliographical information includes the identifying text number, the page range of the text, and the page where the sentence is found, on the one hand, and the speaker name, locale, and date, on the other. This version demonstrates the present state of normalization, disambiguated finite-state morphological analyses, and constraint-grammar syntactic analyses, with a conversion to the feature and dependency marking used in the Universal Dependencies project. The analyzer has been designed to recognize dialect variation and may therefore recognize forms beyond those covered by the normalization.
Translated title of the publication: Finno-Ugrian Society Fieldwork: SUST 77 v0.1, Korp
Media of output: Online
Status: Published - Aug 2022
Ministry of Education and Culture (OKM) publication type: Not applicable


Mordwinische Volksdichtung. Gesamm. von H. Paasonen. Hrsg. und übers. von Paavo Ravila. I. Band. SUST LXXVII. 1938. XXVI + 509 p. (partial)


  • 6121 Languages
