Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

Kuvaus

This paper focuses on automatic determination of the distributional preferences of words in Russian. We present the comparison of six different measures for collocation extraction, part of which are widely known, while others are less prominent or new. For these metrics we evaluate the semantic stability of automatically obtained bigrams beginning with single-token prepositions. Manual annotation of the first 100 bigrams and comparison with the dictionary of multi-word expressions are used as evaluation measures. Finally, in order to present error analysis, two prepositions are investigated in some details.
Alkuperäiskielienglanti
OtsikkoProceedings : Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014)
ToimittajatVerena Henrich, Erhard Hinrichs
Sivumäärä7
JulkaisupaikkaTübingen
KustantajaUniversity of Tübingen
Julkaisupäivä2014
Sivut27-33
TilaJulkaistu - 2014
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaWorkshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations - Tübingen, Saksa
Kesto: 11 elokuuta 201415 elokuuta 2014
Konferenssinumero: CCLCC 2014

Lisätietoja


Volume:
Proceeding volume:

Tieteenalat

  • 6121 Kielitieteet

Lainaa tätä

Kormacheva, D., Pivovarova, L., & Kopotev, M. (2014). Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams. teoksessa V. Henrich, & E. Hinrichs (Toimittajat), Proceedings: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014) (Sivut 27-33). Tübingen: University of Tübingen.
Kormacheva, Daria ; Pivovarova, Lidia ; Kopotev, Mihail. / Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams. Proceedings: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014). Toimittaja / Verena Henrich ; Erhard Hinrichs. Tübingen : University of Tübingen, 2014. Sivut 27-33
@inproceedings{db7e98b06de8493abd107168bd38a42c,
title = "Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams",
abstract = "This paper focuses on automatic determination of the distributional preferences of words in Russian. We present the comparison of six different measures for collocation extraction, part of which are widely known, while others are less prominent or new. For these metrics we evaluate the semantic stability of automatically obtained bigrams beginning with single-token prepositions. Manual annotation of the first 100 bigrams and comparison with the dictionary of multi-word expressions are used as evaluation measures. Finally, in order to present error analysis, two prepositions are investigated in some details.",
keywords = "6121 Languages",
author = "Daria Kormacheva and Lidia Pivovarova and Mihail Kopotev",
note = "Volume: Proceeding volume:",
year = "2014",
language = "English",
pages = "27--33",
editor = "Verena Henrich and Erhard Hinrichs",
booktitle = "Proceedings",
publisher = "University of T{\"u}bingen",
address = "Germany",

}

Kormacheva, D, Pivovarova, L & Kopotev, M 2014, Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams. julkaisussa V Henrich & E Hinrichs (toim), Proceedings: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014). University of Tübingen, Tübingen, Sivut 27-33, Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations, Tübingen, Saksa, 11/08/2014.

Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams. / Kormacheva, Daria; Pivovarova, Lidia; Kopotev, Mihail.

Proceedings: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014). toim. / Verena Henrich; Erhard Hinrichs. Tübingen : University of Tübingen, 2014. s. 27-33.

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

TY - GEN

T1 - Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams

AU - Kormacheva, Daria

AU - Pivovarova, Lidia

AU - Kopotev, Mihail

N1 - Volume: Proceeding volume:

PY - 2014

Y1 - 2014

N2 - This paper focuses on automatic determination of the distributional preferences of words in Russian. We present the comparison of six different measures for collocation extraction, part of which are widely known, while others are less prominent or new. For these metrics we evaluate the semantic stability of automatically obtained bigrams beginning with single-token prepositions. Manual annotation of the first 100 bigrams and comparison with the dictionary of multi-word expressions are used as evaluation measures. Finally, in order to present error analysis, two prepositions are investigated in some details.

AB - This paper focuses on automatic determination of the distributional preferences of words in Russian. We present the comparison of six different measures for collocation extraction, part of which are widely known, while others are less prominent or new. For these metrics we evaluate the semantic stability of automatically obtained bigrams beginning with single-token prepositions. Manual annotation of the first 100 bigrams and comparison with the dictionary of multi-word expressions are used as evaluation measures. Finally, in order to present error analysis, two prepositions are investigated in some details.

KW - 6121 Languages

M3 - Conference contribution

SP - 27

EP - 33

BT - Proceedings

A2 - Henrich, Verena

A2 - Hinrichs, Erhard

PB - University of Tübingen

CY - Tübingen

ER -

Kormacheva D, Pivovarova L, Kopotev M. Automatic Collocation Extraction and Classification of Automatically Obtained Bigrams. julkaisussa Henrich V, Hinrichs E, toimittajat, Proceedings: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014). Tübingen: University of Tübingen. 2014. s. 27-33