Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning

Lionel Nicolas, Verena Lyding, Claudia Borg, Karen Fort, Katerina Zdravkova, Iztok Kosem, Jaka Čibej, Špela Arhar Holdt, Alice Millour, Alexander König, Christos Rodosthenous, Federico Sangati, Anisia Katinskaia, Anabela Barreiro, Lavinia Aparaschivei, Yaakov HaCohen-Kerner

Tutkimustuotos: KonferenssimateriaalitKonferenssiesitysvertaisarvioitu

Abstrakti

We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved. We present the approach by explaining its core paradigm that consists in pairing specific types of LRs with specific exercises, by detailing both its strengths and challenges, and by discussing how much these challenges have been addressed at present. Accordingly, we also report on on-going proof-of-concept efforts aiming at developing the first prototypical implementation of the approach in order to correct and extend an LR called ConceptNet based on the input crowdsourced from language learners. We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation of this generic approach. Finally, we exemplify how it can be used in several language learning scenarios to produce a multitude of NLP resources and how it can therefore alleviate the long-standing NLP issue of the lack of LRs.
Alkuperäiskielienglanti
Sivut268
TilaJulkaistu - toukokuuta 2020
OKM-julkaisutyyppiEi sovellu
TapahtumaLanguage Resources and Evaluation Conference - [LREC 2020 was cancelled]
Kesto: 11 toukokuuta 202016 toukokuuta 2020
Konferenssinumero: 12
https://lrec2020.lrec-conf.org/

Konferenssi

KonferenssiLanguage Resources and Evaluation Conference
LyhennettäLREC 2020
Ajanjakso11/05/202016/05/2020
Muu12th Edition of its Language Resources and Evaluation Conference was cancelled due to Covid 19 pandemic.
www-osoite

Tieteenalat

  • 6160 Muut humanistiset tieteet

Siteeraa tätä