Survey of Uralic Universal Dependencies development

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

This paper attempts to evaluate some of the systematic differences in Uralic Universal Dependencies treebanks from a perspective that would help to introduce reasonable improvements in treebank annotation consistency within this language family. The study finds that the coverage of Uralic languages in the project is already relatively high, and the majority of typically Uralic features are already present and can be discussed on the basis of existing treebanks. Some of the idiosyncrasies found in individual treebanks stem from language-internal grammar traditions, and could be a target for harmonization in later phases.
Original languageEnglish
Title of host publicationThird Workshop on Universal Dependencies (UDW, SyntaxFest 2019) : Proceedings
Number of pages7
Place of PublicationStroudsburg
PublisherThe Association for Computational Linguistics
Publication date2019
Article number78
ISBN (Electronic)978-1-950737-66-6
Publication statusPublished - 2019
MoE publication typeA4 Article in conference proceedings
EventWorkshop on Universal Dependencies - Paris, France
Duration: 29 Aug 201930 Aug 2019
Conference number: 3
https://syntaxfest.github.io/syntaxfest19/program.html

Fields of Science

  • 6121 Languages
  • Treebanks
  • Uralic languages
  • universal dependencies
  • Komi-Zyrian
  • Erzya
  • North Sami
  • Finnish
  • Hungarian
  • Estonian
  • Morphological annotaton
  • syntax
  • parts-of-speech
  • Spoken language
  • Literary language

Cite this