Numerals and what counts

Jack Rueter, Niko Partanen, Tommi A Pirinen

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu


This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between individual treebanks, and aim to highlight some areas where further development work and systematization could prove beneficial. At the same time, the Universal Dependencies project already offers a wide range of conventions to mark nuanced variation in numerals and counting expressions, and the harmonization of conventions between different languages could be the next step to take. The discussion here makes specific reference to Universal Dependencies version 2.8, and some differences found may already have been harmonized in version 2.9. Regardless of whether this takes place or not, we believe that the study still forms an important documentation of this period in the project.
OtsikkoFifth Workshop on Universal Dependencies : Proceedings
ToimittajatMiryam de Lhoneux, Reut Tsarfaty
KustantajaThe Association for Computational Linguistics
Julkaisupäiväjouluk. 2021
ISBN (elektroninen)978-1-955917-17-9
TilaJulkaistu - jouluk. 2021
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaWorkshop on Universal Dependencies: UDW, SyntaxFest 2021 - [Online event], Sofia
Kesto: 21 maalisk. 202225 maalisk. 2022
Konferenssinumero: 6


  • 6121 Kielitieteet

Siteeraa tätä