Universal dependencies for Turkish

Umut Sulubacak, Memduh Gökırmak, Francis Tyers, Çağrı Çöltekin, Joakim Nivre, Gülşen Eryiğit

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

The Universal Dependencies (UD) project was conceived after the substantial recent interest in unifying annotation schemes across languages. With its own annotation principles and abstract inventory for parts of speech, morphosyntactic features and dependency relations, UD aims to facilitate multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. This paper presents the Turkish IMST-UD Treebank, the first Turkish treebank to be in a UD release. The IMST-UD Treebank was automatically converted from the IMST Treebank, which was also recently released. We describe this conversion procedure in detail, complete with mapping tables. We also present our evaluation of the parsing performances of both versions of the IMST Treebank. Our findings suggest that the UD framework is at least as viable for Turkish as the original annotation framework of the IMST Treebank.
Original languageEnglish
Title of host publicationProceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
EditorsYuji Matsumoto, Rashmi Prasad
Number of pages11
Place of PublicationOsaka, Japan
PublisherThe Association for Computational Linguistics
Publication dateDec 2016
Pages3444-3454
ISBN (Electronic)978-4-87974-702-0
Publication statusPublished - Dec 2016
MoE publication typeA4 Article in conference proceedings
EventInternational Conference on Computational Linguistics - Osaka, Japan
Duration: 11 Dec 201616 Dec 2016
Conference number: 26

Fields of Science

  • 6121 Languages
  • 113 Computer and information sciences

Cite this