Projects per year
Abstract
We describe the design, the evaluation setup, and the results of the DiscoMT 2015 shared task, which included two subtasks, relevant to both the machine translation (MT) and the discourse communities: (i) pronoun-focused translation, a practical MT task, and (ii) cross-lingual pronoun prediction, a classification task that requires no specific MT expertise and is interesting as a machine learning task in its own right. We focused on the English–French language pair, for which MT output is generally of high quality, but has visible issues with pronoun translation due
to differences in the pronoun systems of the two languages. Six groups participated in the pronoun-focused translation task and eight groups in the cross-lingual pronoun prediction task
to differences in the pronoun systems of the two languages. Six groups participated in the pronoun-focused translation task and eight groups in the cross-lingual pronoun prediction task
Original language | Other/Unknown |
---|---|
Title of host publication | Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), |
Number of pages | 16 |
Publisher | The Association for Computational Linguistics |
Publication date | 1 Sept 2015 |
Pages | 1-16 |
ISBN (Print) | 978-1-94163-32-7 |
Publication status | Published - 1 Sept 2015 |
Externally published | Yes |
MoE publication type | A4 Article in conference proceedings |
Event | Workshop on Discourse in Machine Translation (DiscoMT), - Lisbon, Portugal Duration: 17 Sept 2015 → 17 Sept 2015 Conference number: 2 |
Fields of Science
- 6121 Languages
Projects
- 1 Finished
-
DiscoMT: Discourse-Oriented Statistical Machine Translation
Tiedemann, J. (Project manager), Hardmeier, C. (Principal Investigator), Loáiciga, S. (Participant) & Scherrer, Y. (Participant)
01/01/2012 → 31/12/2017
Project: Research project
Datasets
-
DiscoMT 2015 Shared Task on Pronoun Translation
Tiedemann, J. (Creator), LINDAT/CLARIN, 31 Jan 2016
DOI: http://hdl.handle.net/11372/LRT-1611, http://hdl.handle.net/11372/LRT-1611
Dataset