Survey of Uralic Universal Dependencies development

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKonferensbidragVetenskapligPeer review

Sammanfattning

This paper attempts to evaluate some of the systematic differences in Uralic Universal Depen- dencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.
Originalspråkengelska
Titel på gästpublikationThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019) : Proceedings
Antal sidor7
UtgivningsortStroudsburg
FörlagThe Association for Computational Linguistics
Utgivningsdatum2019
Artikelnummer78
ISBN (elektroniskt)978-1-950737-66-6
StatusPublicerad - 2019
MoE-publikationstypA4 Artikel i en konferenspublikation
EvenemangWorkshop on Universal Dependencies - Paris, Frankrike
Varaktighet: 29 aug 201930 aug 2019
Konferensnummer: 3
https://syntaxfest.github.io/syntaxfest19/program.html

Vetenskapsgrenar

  • 6121 Språkvetenskaper

Citera det här

Rueter, J., & Partanen, N. (2019). Survey of Uralic Universal Dependencies development. I ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings [78] Stroudsburg: The Association for Computational Linguistics.
Rueter, Jack ; Partanen, Niko. / Survey of Uralic Universal Dependencies development. ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings. Stroudsburg : The Association for Computational Linguistics, 2019.
@inproceedings{b5a470f949d64509a1f811beadd07f6b,
title = "Survey of Uralic Universal Dependencies development",
abstract = "This paper attempts to evaluate some of the systematic differences in Uralic Universal Depen- dencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.",
keywords = "6121 Languages, Treebanks, Uralic languages, universal dependencies, Komi-Zyrian, Erzya, North Sami, Finnish, Hungarian, Estonian, Morphological annotaton, syntax, parts-of-speech, Spoken language, Literary language",
author = "Jack Rueter and Niko Partanen",
year = "2019",
language = "English",
booktitle = "ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019)",
publisher = "The Association for Computational Linguistics",
address = "United States",

}

Rueter, J & Partanen, N 2019, Survey of Uralic Universal Dependencies development. i ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings., 78, The Association for Computational Linguistics, Stroudsburg, Workshop on Universal Dependencies, Paris, Frankrike, 29/08/2019.

Survey of Uralic Universal Dependencies development. / Rueter, Jack; Partanen, Niko.

ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings. Stroudsburg : The Association for Computational Linguistics, 2019. 78.

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKonferensbidragVetenskapligPeer review

TY - GEN

T1 - Survey of Uralic Universal Dependencies development

AU - Rueter, Jack

AU - Partanen, Niko

PY - 2019

Y1 - 2019

N2 - This paper attempts to evaluate some of the systematic differences in Uralic Universal Depen- dencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.

AB - This paper attempts to evaluate some of the systematic differences in Uralic Universal Depen- dencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.

KW - 6121 Languages

KW - Treebanks

KW - Uralic languages

KW - universal dependencies

KW - Komi-Zyrian

KW - Erzya

KW - North Sami

KW - Finnish

KW - Hungarian

KW - Estonian

KW - Morphological annotaton

KW - syntax

KW - parts-of-speech

KW - Spoken language

KW - Literary language

M3 - Conference contribution

BT - ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019)

PB - The Association for Computational Linguistics

CY - Stroudsburg

ER -

Rueter J, Partanen N. Survey of Uralic Universal Dependencies development. I ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings. Stroudsburg: The Association for Computational Linguistics. 2019. 78