An efficient any language approach for the integration of phrases in document retrieval

Antoine Doucet, Helena Ahonen-Myka

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.
Original language: English
Journal: Language Resources and Evaluation
Volume: 44
Issue number: 1-2
Pages (from-to): 159-180
Number of pages: 22
ISSN: 1574-020X
DOI: 10.1007/s10579-009-9102-3
Publication status: Published - 2010
MoE publication type: A1 Journal article-refereed

Fields of Science

  • 113 Computer and information sciences
  • Multiword expressions, document retrieval, endogenous resources

Cite this

@article{9b692f66938b4a39b7dff4d234c865cb,
title = "An efficient any language approach for the integration of phrases in document retrieval",
abstract = "In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.",
keywords = "113 Computer and information sciences, Multiword expressions, document retrieval, endogenous resources",
author = "Antoine Doucet and Helena Ahonen-Myka",
year = "2010",
doi = "10.1007/s10579-009-9102-3",
language = "English",
volume = "44",
pages = "159--180",
journal = "Language Resources and Evaluation",
issn = "1574-020X",
publisher = "Springer",
number = "1-2",
}

An efficient any language approach for the integration of phrases in document retrieval. / Doucet, Antoine; Ahonen-Myka, Helena.

In: Language Resources and Evaluation, Vol. 44, No. 1-2, 2010, p. 159-180.


TY  - JOUR
T1  - An efficient any language approach for the integration of phrases in document retrieval
AU  - Doucet, Antoine
AU  - Ahonen-Myka, Helena
PY  - 2010
Y1  - 2010
N2  - In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.
AB  - In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.
KW  - 113 Computer and information sciences
KW  - Multiword expressions, document retrieval, endogenous resources
U2  - 10.1007/s10579-009-9102-3
DO  - 10.1007/s10579-009-9102-3
M3  - Article
VL  - 44
SP  - 159
EP  - 180
JO  - Language Resources and Evaluation
JF  - Language Resources and Evaluation
SN  - 1574-020X
IS  - 1-2
ER  -