Abstrakti
This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applications, where correctness is often mission-critical. The shared task was conducted with a newly constructed dataset of 4000 model outputs labeled by 5 annotators each, spanning 3 NLP tasks: machine translation, paraphrase generation and definition modeling.The shared task was tackled by a total of 58 different users grouped in 42 teams, out of which 26 elected to write a system description paper; collectively, they submitted over 300 prediction sets on both tracks of the shared task. We observe a number of key trends in how this approach was tackled---many participants rely on a handful of model, and often rely either on synthetic data for fine-tuning or zero-shot prompting strategies. While a majority of the teams did outperform our proposed baseline system, the performances of top-scoring systems are still consistent with a random handling of the more challenging items.
| Alkuperäiskieli | englanti |
|---|---|
| Otsikko | Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) |
| Toimittajat | Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá |
| Sivumäärä | 15 |
| Julkaisupaikka | Stroudsburg |
| Kustantaja | The Association for Computational Linguistics |
| Julkaisupäivä | 1 kesäk. 2024 |
| Sivut | 1979-1993 |
| ISBN (elektroninen) | 979-8-89176-107-0 |
| DOI - pysyväislinkit | |
| Tila | Julkaistu - 1 kesäk. 2024 |
| OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisuussa |
| Tapahtuma | International Workshop on Semantic Evaluation - Mexico City, Meksiko Kesto: 20 kesäk. 2024 → 21 kesäk. 2024 Konferenssinumero: 18 |
Tieteenalat
- 6121 Kielitieteet
- 113 Tietojenkäsittely- ja informaatiotieteet
Projektit
- 1 Päättynyt
-
Uncertainty-aware neural language models
Tiedemann, J. (Projektinjohtaja), Celikkanat, H. (Osallistuja), Virpioja, S. P. (Osallistuja) & Vazquez , R. (Osallistuja)
Academy of Finland, Suomen Akatemia Projektilaskutus
01/01/2022 → 01/10/2025
Projekti: Tutkimusprojekti
Palkinnot
-
Best paper Award
Varjonen, S. (Vastaanottaja), 7 jouluk. 2011
Palkinto: Palkinnot ja kunnianosoitukset
Siteeraa tätä
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver