The University of Helsinki submissions to the WMT19 news translation task

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

Kuvaus

In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.
Alkuperäiskielienglanti
OtsikkoFourth Conference of Conference on Machine Translation : Proceedings of the Conference
Sivumäärä12
JulkaisupaikkaStroudsburg
KustantajaAssociation for Computational Linguistics
Julkaisupäivä1 elokuuta 2019
Sivut611-622
ISBN (elektroninen)978-1-950737-27-7
TilaJulkaistu - 1 elokuuta 2019
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaFourth Conference on Machine Translation: WMT19 - Florence, Italia
Kesto: 1 elokuuta 20192 elokuuta 2019
Konferenssinumero: 4

Tieteenalat

  • 113 Tietojenkäsittely- ja informaatiotieteet
  • 6121 Kielitieteet

Lainaa tätä

Talman, A., Sulubacak, U., Vazquez , R., Scherrer, Y., Virpioja, S., Raganato, A., ... Tiedemann, J. (2019). The University of Helsinki submissions to the WMT19 news translation task. teoksessa Fourth Conference of Conference on Machine Translation: Proceedings of the Conference (Sivut 611-622). Stroudsburg: Association for Computational Linguistics.
Talman, Aarne ; Sulubacak, Umut ; Vazquez , Raul ; Scherrer, Yves ; Virpioja, Sami ; Raganato, Alessandro ; Hurskainen, Arvi ; Tiedemann, Jörg. / The University of Helsinki submissions to the WMT19 news translation task. Fourth Conference of Conference on Machine Translation: Proceedings of the Conference. Stroudsburg : Association for Computational Linguistics, 2019. Sivut 611-622
@inproceedings{8326a39257134d458dae56c457b37630,
title = "The University of Helsinki submissions to the WMT19 news translation task",
abstract = "In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.",
keywords = "113 Computer and information sciences, 6121 Languages",
author = "Aarne Talman and Umut Sulubacak and Raul Vazquez and Yves Scherrer and Sami Virpioja and Alessandro Raganato and Arvi Hurskainen and J{\"o}rg Tiedemann",
year = "2019",
month = "8",
day = "1",
language = "English",
pages = "611--622",
booktitle = "Fourth Conference of Conference on Machine Translation",
publisher = "Association for Computational Linguistics",
address = "International",

}

Talman, A, Sulubacak, U, Vazquez , R, Scherrer, Y, Virpioja, S, Raganato, A, Hurskainen, A & Tiedemann, J 2019, The University of Helsinki submissions to the WMT19 news translation task. julkaisussa Fourth Conference of Conference on Machine Translation: Proceedings of the Conference. Association for Computational Linguistics, Stroudsburg, Sivut 611-622, Fourth Conference on Machine Translation, Florence, Italia, 01/08/2019.

The University of Helsinki submissions to the WMT19 news translation task. / Talman, Aarne; Sulubacak, Umut; Vazquez , Raul; Scherrer, Yves; Virpioja, Sami; Raganato, Alessandro; Hurskainen, Arvi; Tiedemann, Jörg.

Fourth Conference of Conference on Machine Translation: Proceedings of the Conference. Stroudsburg : Association for Computational Linguistics, 2019. s. 611-622.

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

TY - GEN

T1 - The University of Helsinki submissions to the WMT19 news translation task

AU - Talman, Aarne

AU - Sulubacak, Umut

AU - Vazquez , Raul

AU - Scherrer, Yves

AU - Virpioja, Sami

AU - Raganato, Alessandro

AU - Hurskainen, Arvi

AU - Tiedemann, Jörg

PY - 2019/8/1

Y1 - 2019/8/1

N2 - In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.

AB - In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.

KW - 113 Computer and information sciences

KW - 6121 Languages

M3 - Conference contribution

SP - 611

EP - 622

BT - Fourth Conference of Conference on Machine Translation

PB - Association for Computational Linguistics

CY - Stroudsburg

ER -

Talman A, Sulubacak U, Vazquez R, Scherrer Y, Virpioja S, Raganato A et al. The University of Helsinki submissions to the WMT19 news translation task. julkaisussa Fourth Conference of Conference on Machine Translation: Proceedings of the Conference. Stroudsburg: Association for Computational Linguistics. 2019. s. 611-622