On the questions in developing computational infrastructure for Komi-Permyak

Jack Rueter, Niko Partanen, Larisa Ponomareva

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

Abstrakti

There are two main written Komi varieties, Permyak and Zyrian. These are mutually intelligible but derive from different parts of the same Komi dialect continuum,representing the varieties prominent in the vicinity and in the cities of Syktyvkar and Kudymkar,respectively. Hence,they share a vast number of features, as well as the majority of their lexicon, yet the overlap in their dialects is very complex. This paper evaluates the degree of difference in these written varieties based on changes required for computational resources in the description of these languages when adapted fromthe Komi-Zyrian original. Primarily these changes include the FST architecture, but we are also looking at its application to the Universal Dependencies annotation scheme in the morphologies of the two languages.
Alkuperäiskielienglanti
OtsikkoProceedings of the Sixth International Workshop on Computational Linguistics of Uralic Languages
ToimittajatTommi A. Pirinen, Francis M. Tyers, Michael Rießler
Sivumäärä11
JulkaisupaikkaStroudsburg
KustantajaThe Association for Computational Linguistics
Julkaisupäivä2020
Sivut15–25
ISBN (elektroninen)978-1-952148-00-2
TilaJulkaistu - 2020
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaInternational Workshop on Computational Linguistics of Uralic Languages - Universität Wien, Vienna, Itävalta
Kesto: 10 tammikuuta 202011 tammikuuta 2020
Konferenssinumero: 6
https://iwclul.univie.ac.at/

Tieteenalat

  • 6121 Kielitieteet

Siteeraa tätä