Abstrakti
We present the first version of the longitudinal Revita Learner Corpus (ReLCo), for Russian. In contrast to traditional learner corpora, ReLCo is collected and annotated fully automatically, while students perform exercises using the Revita language-learning platform. The corpus currently contains 8 422 sentences exhibiting several types of errors—grammatical, lexical, orthographic, etc.—which were committed by learners during practice and were automatically annotated by Revita. The corpus provides valuable information about patterns of learner errors and can be used as a language resource for a number of research tasks, while its creation is much cheaper and faster than for traditional learner corpora. A crucial advantage of ReLCo that it grows continually while learners practice with Revita, which opens the possibility of creating an unlimited learner resource with longitudinal data collected over time. We make the pilot version of the Russian ReLCo publicly available.
Alkuperäiskieli | englanti |
---|---|
Sivut | 379 |
Sivumäärä | 384 |
Tila | Julkaistu - 2020 |
OKM-julkaisutyyppi | Ei sovellu |
Tapahtuma | Language Resources and Evaluation Conference - [LREC 2020 was cancelled] Kesto: 11 toukok. 2020 → 16 toukok. 2020 Konferenssinumero: 12 https://lrec2020.lrec-conf.org/ |
Konferenssi
Konferenssi | Language Resources and Evaluation Conference |
---|---|
Lyhennettä | LREC 2020 |
Ajanjakso | 11/05/2020 → 16/05/2020 |
Muu | 12th Edition of its Language Resources and Evaluation Conference was cancelled due to Covid 19 pandemic. |
www-osoite |
Tieteenalat
- 6160 Muut humanistiset tieteet
Projektit
-
LLL: Language Learning Lab
Yangarber, R., Katinskaia, A., Hou, J., Furlan, G. & Kylliäinen, I. P.
Projekti: Tutkimusprojekti
-
Revita: Language learning and AI
Yangarber, R., Katinskaia, A., Hou, J., Furlan, G. & Kylliäinen, I. P.
Projekti: Tutkimusprojekti