Survey of Uralic Universal Dependencies development

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

This paper attempts to evaluate some of the systematic differences in Uralic Universal Dependencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.
Original languageEnglish
Title of host publicationThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019) : Proceedings
Number of pages7
Place of PublicationStroudsburg
PublisherThe Association for Computational Linguistics
Publication date2019
Article number78
ISBN (Electronic)978-1-950737-66-6
Publication statusPublished - 2019
MoE publication typeA4 Article in conference proceedings
EventWorkshop on Universal Dependencies - Paris, France
Duration: 29 Aug 201930 Aug 2019
Conference number: 3
https://syntaxfest.github.io/syntaxfest19/program.html

Fields of Science

  • 6121 Languages
  • Treebanks
  • Uralic languages
  • universal dependencies
  • Komi-Zyrian
  • Erzya
  • North Sami
  • Finnish
  • Hungarian
  • Estonian
  • Morphological annotaton
  • syntax
  • parts-of-speech
  • Spoken language
  • Literary language

Cite this

Rueter, J., & Partanen, N. (2019). Survey of Uralic Universal Dependencies development. In ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings [78] Stroudsburg: The Association for Computational Linguistics.
Rueter, Jack ; Partanen, Niko. / Survey of Uralic Universal Dependencies development. ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings. Stroudsburg : The Association for Computational Linguistics, 2019.
@inproceedings{b5a470f949d64509a1f811beadd07f6b,
title = "Survey of Uralic Universal Dependencies development",
abstract = "This paper attempts to evaluate some of the systematic differences in Uralic Universal Dependencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.",
keywords = "6121 Languages, Treebanks, Uralic languages, universal dependencies, Komi-Zyrian, Erzya, North Sami, Finnish, Hungarian, Estonian, Morphological annotaton, syntax, parts-of-speech, Spoken language, Literary language",
author = "Jack Rueter and Niko Partanen",
year = "2019",
language = "English",
booktitle = "ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019)",
publisher = "The Association for Computational Linguistics",
address = "United States",

}

Rueter, J & Partanen, N 2019, Survey of Uralic Universal Dependencies development. in ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings., 78, The Association for Computational Linguistics, Stroudsburg, Workshop on Universal Dependencies, Paris, France, 29/08/2019.

Survey of Uralic Universal Dependencies development. / Rueter, Jack; Partanen, Niko.

ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings. Stroudsburg : The Association for Computational Linguistics, 2019. 78.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

TY - GEN

T1 - Survey of Uralic Universal Dependencies development

AU - Rueter, Jack

AU - Partanen, Niko

PY - 2019

Y1 - 2019

N2 - This paper attempts to evaluate some of the systematic differences in Uralic Universal Dependencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.

AB - This paper attempts to evaluate some of the systematic differences in Uralic Universal Dependencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.

KW - 6121 Languages

KW - Treebanks

KW - Uralic languages

KW - universal dependencies

KW - Komi-Zyrian

KW - Erzya

KW - North Sami

KW - Finnish

KW - Hungarian

KW - Estonian

KW - Morphological annotaton

KW - syntax

KW - parts-of-speech

KW - Spoken language

KW - Literary language

M3 - Conference contribution

BT - ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019)

PB - The Association for Computational Linguistics

CY - Stroudsburg

ER -

Rueter J, Partanen N. Survey of Uralic Universal Dependencies development. In ThirdWorkshop on Universal Dependencies (UDW, SyntaxFest 2019): Proceedings. Stroudsburg: The Association for Computational Linguistics. 2019. 78