OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora

Pierre Lison, Jörg Tiedemann, Milen Kouylekov

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Original languageEnglish
Title of host publicationProceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018)
EditorsNicoletta Calzolari, Choukri Khalid, Cieri Christopher, Declerck Thierry, Goggi Sara, Hasida Koiti, Isahara Hitoshi, Maegaard Bente, Mariani Joseph, Mazo Hélène, Moreno Asuncion, Odijk Jan, Piperidis Stelios, Tokunaga Takenobu
Number of pages7
Place of PublicationParis
PublisherEuropean Language Resources Association (ELRA)
Publication date2018
Pages1742-1748
ISBN (Electronic)979-10-95546-00-9
Publication statusPublished - 2018
MoE publication typeA4 Article in conference proceedings
EventLanguage Resources and Evaluation Conference - Miyazaki, Japan
Duration: 7 May 201812 May 2018
Conference number: 11

Fields of Science

  • 113 Computer and information sciences
  • language technology
  • Natural language processing
  • 6121 Languages
  • computational linguistics

Projects

OPUS: The Open Parallel Corpus

Tiedemann, J.

01/06/2004 → …

Project: Research project

Datasets

OPUS

Tiedemann, J. (Creator), University of Helsinki, 2017

Dataset

Cite this

Lison, P., Tiedemann, J., & Kouylekov, M. (2018). OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora. In N. Calzolari, C. Khalid, C. Christopher, D. Thierry, G. Sara, H. Koiti, I. Hitoshi, M. Bente, M. Joseph, M. Hélène, M. Asuncion, O. Jan, P. Stelios, & T. Takenobu (Eds.), Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018) (pp. 1742-1748). European Language Resources Association (ELRA). http://www.lrec-conf.org/proceedings/lrec2018/summaries/294.html