Variation in Universal Dependencies annotation: A token-based typological case study on adpossessive constructions

Kaius Sinnemäki, Viljami Lauri Juhana Haakana

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review

Abstract

In this paper we present a method for identifying and analyzing adnominal possessive constructions in 66 Universal Dependencies treebanks. We classify adpossessive constructions in terms of their morphological type (locus of marking) and present a workflow for detecting and analyzing them typologically. Based on a preliminary evaluation, the algorithm works fairly reliably in adpossessive constructions that are morphologically marked. However, it performs rather poorly in adpossessive constructions that are not marked morphologically, so-called zero-marked constructions, because of difficulties in identifying these constructions with the current annotation. We also discuss different types of variation in annotation in different treebanks for the same language and for treebanks of closely related languages. The research focuses on one well-circumscribed and universal construction in the hope of generating more interest in using UD for cross-linguistic comparison and for contributing towards developing yet more consistent annotation of constructions in the UD annotation scheme.
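
The classification by locus of marking described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm; it only assumes that candidate possessors are nominal dependents attached with `nmod` or `nmod:poss`, that dependent marking shows up as `Case=Gen` on the possessor, and that head marking shows up as UD possessor-agreement features ending in `[psor]` on the possessed noun. The function names, the `row` helper, and the two-word sample sentence are hypothetical and hand-written for illustration, not taken from any treebank.

```python
def row(*cols):
    """Join the ten CoNLL-U columns of one token with tabs (hypothetical helper)."""
    return "\t".join(cols)

def parse_conllu(text):
    """Parse one CoNLL-U sentence into token dicts; skips comments, ranges, empty nodes."""
    tokens = []
    for line in text.strip().splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges (1-2) and empty nodes (1.1)
            continue
        feats = {} if cols[5] == "_" else dict(f.split("=", 1) for f in cols[5].split("|"))
        tokens.append({"id": int(cols[0]), "form": cols[1], "upos": cols[3],
                       "feats": feats, "head": int(cols[6]), "deprel": cols[7]})
    return tokens

def classify_adpossessives(tokens):
    """Return (possessor, possessed, locus) triples under the assumptions stated above."""
    by_id = {t["id"]: t for t in tokens}
    results = []
    for dep in tokens:
        if dep["deprel"] not in ("nmod", "nmod:poss"):
            continue
        head = by_id.get(dep["head"])
        if head is None or head["upos"] not in ("NOUN", "PROPN"):
            continue
        # Assumption: genitive case on the dependent signals dependent marking.
        dep_marked = dep["feats"].get("Case") == "Gen"
        # Assumption: layered [psor] features on the head signal head marking.
        head_marked = any(key.endswith("[psor]") for key in head["feats"])
        if dep_marked and head_marked:
            locus = "double-marked"
        elif dep_marked:
            locus = "dependent-marked"
        elif head_marked:
            locus = "head-marked"
        else:
            locus = "zero-marked"
        results.append((dep["form"], head["form"], locus))
    return results

# Hand-written toy sentence: a genitive possessor attached to a noun.
SAMPLE = "\n".join([
    "# text = Annan kirja",
    row("1", "Annan", "Anna", "PROPN", "_", "Case=Gen|Number=Sing", "2", "nmod:poss", "_", "_"),
    row("2", "kirja", "kirja", "NOUN", "_", "Case=Nom|Number=Sing", "0", "root", "_", "_"),
])

print(classify_adpossessives(parse_conllu(SAMPLE)))
```

The sketch ignores adpositional marking and treats every plain `nmod` on a noun as a candidate, so its "zero-marked" bucket will also catch non-possessive nominal modifiers. That is one way to see why zero-marked constructions are hard to isolate from the annotation alone.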
Original language: English
Title of host publication: Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020)
Editors: Marie-Catherine de Marneffe, Miryam de Lhoneux, Joakim Nivre, Sebastian Schuster
Number of pages: 10
Place of Publication: Stroudsburg
Publisher: The Association for Computational Linguistics
Publication date: 2020
Pages: 158-167
ISBN (Electronic): 978-1-952148-48-4
Publication status: Published - 2020
MoE publication type: A4 Article in conference proceedings
Event: Workshop on Universal Dependencies - Online event
Duration: 13 Dec 2020 → 13 Dec 2020
Conference number: 4

Fields of Science

  • 6121 Languages
  • 113 Computer and information sciences
