Abstract
We conduct a manual error analysis of the CoNLL-SIGMORPHON 2017 Shared Task on Morphological Reinflection. In this task, systems are given a word in citation form (e.g., hug) and asked to produce the corresponding inflected form (e.g., the simple past hugged). This design lets us analyze errors much like we might analyze children's production errors. We propose an error taxonomy and use it to annotate errors made by the top two systems across twelve languages. Many of the observed errors are related to inflectional patterns sensitive to inherent linguistic properties such as animacy or affect; many others are failures to predict truly unpredictable inflectional behaviors. We also find nearly one quarter of the residual "errors" reflect errors in the gold data. © 2019 Association for Computational Linguistics.
| Original language | English |
|---|---|
| Title of host publication | CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference |
| Editors | Mohit Bansal, Aline Villavicencio |
| Number of pages | 12 |
| Place of Publication | Stroudsburg, PA |
| Publisher | The Association for Computational Linguistics |
| Publication date | 2019 |
| Pages | 140-151 |
| ISBN (Electronic) | 978-1-950737-72-7 |
| Publication status | Published - 2019 |
| MoE publication type | A4 Article in conference proceedings |
| Event | Conference on Computational Natural Language Learning - Hong Kong, Hong Kong Duration: 3 Nov 2019 → 4 Nov 2019 Conference number: 23 https://www.conll.org/2019 |
Fields of Science
- Animacy
- Error taxonomy
- Linguistic properties
- Errors
- 113 Computer and information sciences
- 6121 Languages
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver