Numerals and what counts

Jack Rueter, Niko Partanen, Tommi A Pirinen

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKonferensbidragVetenskapligPeer review


This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between individual treebanks, and aim to highlight some areas where further development work and systematization could prove beneficial. At the same time, the Universal Dependencies project already offers a wide range of conventions to mark nuanced variation in numerals and counting expressions, and the harmonization of conventions between different languages could be the next step to take. The discussion here makes specific reference to Universal Dependencies version 2.8, and some differences found may already have been harmonized in version 2.9. Regardless of whether this takes place or not, we believe that the study still forms an important documentation of this period in the project.
Titel på värdpublikationFifth Workshop on Universal Dependencies : Proceedings
RedaktörerMiryam de Lhoneux, Reut Tsarfaty
Antal sidor9
FörlagThe Association for Computational Linguistics
Utgivningsdatumdec. 2021
ISBN (elektroniskt)978-1-955917-17-9
StatusPublicerad - dec. 2021
MoE-publikationstypA4 Artikel i en konferenspublikation
EvenemangWorkshop on Universal Dependencies: UDW, SyntaxFest 2021 - [Online event], Sofia
Varaktighet: 21 mars 202225 mars 2022
Konferensnummer: 6


  • 6121 Språkvetenskaper

Citera det här