Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams

Daria Kormacheva, Lidia Pivovarova, Mihail Kopotev

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKonferensbidragVetenskapligPeer review

Sammanfattning

This paper focuses on automatic determination of the distributional preferences of words in Russian. We present the comparison of six different measures for collocation extraction, part of which are widely known, while others are less prominent or new. For these metrics we evaluate the semantic stability of automatically obtained bigrams beginning with single-token prepositions. Manual annotation of the first 100 bigrams and comparison with the dictionary of multi-word expressions are used as evaluation measures. Finally, in order to present error analysis, two prepositions are investigated in some details.
Originalspråkengelska
Titel på gästpublikationProceedings : Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014)
RedaktörerVerena Henrich, Erhard Hinrichs
Antal sidor7
UtgivningsortTübingen
FörlagUniversity of Tübingen
Utgivningsdatum2014
Sidor27-33
StatusPublicerad - 2014
MoE-publikationstypA4 Artikel i en konferenspublikation
EvenemangWorkshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations - Tübingen, Tyskland
Varaktighet: 11 aug 201415 aug 2014
Konferensnummer: CCLCC 2014

Vetenskapsgrenar

  • 6121 Språkvetenskaper

Projekt

COLLOCATIONS, COLLIGATIONS AND CORPORA (CoCoCo)

Kopotev, M., Yangarber, R., Kormacheva, D., Pivovarova, L. & Pierce, M.

01/09/2012 → …

Projekt: Forskningsprojekt

Citera det här

Kormacheva, D., Pivovarova, L., & Kopotev, M. (2014). Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams. I V. Henrich, & E. Hinrichs (Red.), Proceedings: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014) (s. 27-33). University of Tübingen. http://www.sfs.uni-tuebingen.de/~vhenrich/cclcc_2014/