An efficient any language approach for the integration of phrases in document retrieval

Antoine Doucet, Helena Ahonen-Myka

Research output: Contribution to journal, Article, Scientific, peer-reviewed

Description

In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.
Original language: English
Journal: Language Resources and Evaluation
Volume: 44
Issue: 1-2
Pages: 159-180
Number of pages: 22
ISSN: 1574-020X
DOI (permanent link): 10.1007/s10579-009-9102-3
Status: Published - 2010
Ministry of Education and Culture (OKM) publication type: A1 Original article in a scientific journal, peer-reviewed

Fields of science

  • 113 Computer and information sciences

Cite this

@article{9b692f66938b4a39b7dff4d234c865cb,
title = "An efficient any language approach for the integration of phrases in document retrieval",
abstract = "In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.",
keywords = "113 Computer and information sciences, Multiword expressions, document retrieval, endogenous resources",
author = "Antoine Doucet and Helena Ahonen-Myka",
year = "2010",
doi = "10.1007/s10579-009-9102-3",
language = "English",
volume = "44",
pages = "159--180",
journal = "Language Resources and Evaluation",
issn = "1574-020X",
publisher = "Springer",
number = "1-2",

}

An efficient any language approach for the integration of phrases in document retrieval. / Doucet, Antoine; Ahonen-Myka, Helena.

In: Language Resources and Evaluation, Vol. 44, No. 1-2, 2010, pp. 159-180.

TY - JOUR
T1 - An efficient any language approach for the integration of phrases in document retrieval
AU - Doucet, Antoine
AU - Ahonen-Myka, Helena
PY - 2010
Y1 - 2010
N2 - In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.
AB - In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.
KW - 113 Computer and information sciences
KW - Multiword expressions, document retrieval, endogenous resources
U2 - 10.1007/s10579-009-9102-3
DO - 10.1007/s10579-009-9102-3
M3 - Article
VL - 44
SP - 159
EP - 180
JO - Language Resources and Evaluation
JF - Language Resources and Evaluation
SN - 1574-020X
IS - 1-2
ER -