The MeMAD Submission to the WMT18 Multimodal Translation Task

Stig-Arne Grönroos, Benoit Huet, Mikko Kurimo, Jorma Laaksonen, Bernard Merialdo, Phu Pham, Mats Sjöberg, Umut Sulubacak, Jörg Tiedemann, Raphaël Troncy, Juan Raúl Vázquez Carrillo

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review

Abstract

This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task.

We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice.

We have the top-scoring system for both English-to-German and English-to-French, according to the automatic metrics for flickr18.

Our experiments show that the effect of the visual features in our system is small. Our largest gains come from the quality of the underlying text-only NMT system. We find that appropriate use of additional data is effective.
Original language: English
Title of host publication: Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers
Editors: Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor
Number of pages: 9
Place of publication: Stroudsburg
Publisher: Association for Computational Linguistics
Publication date: 1 Nov 2018
Pages: 603-611
ISBN (Electronic): 978-1-948087-81-0
Publication status: Published - 1 Nov 2018
MoE publication type: A4 Article in conference proceedings
Event: Conference on Machine Translation - Brussels, Belgium
Duration: 31 Oct 2018 to 1 Nov 2018
Conference number: 3

Fields of Science

  • 113 Computer and information sciences
  • 6121 Languages

Cite this

Grönroos, S.-A., Huet, B., Kurimo, M., Laaksonen, J., Merialdo, B., Pham, P., ... Vázquez Carrillo, J. R. (2018). The MeMAD Submission to the WMT18 Multimodal Translation Task. In O. Bojar, R. Chatterjee, C. Federmann, M. Fishel, Y. Graham, B. Haddow, M. Huck, A. J. Yepes, P. Koehn, C. Monz, M. Negri, A. Névéol, M. Neves, M. Post, L. Specia, M. Turchi, ... K. Verspoor (Eds.), Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers (pp. 603-611). Stroudsburg: Association for Computational Linguistics.
Grönroos, Stig-Arne ; Huet, Benoit ; Kurimo, Mikko ; Laaksonen, Jorma ; Merialdo, Bernard ; Pham, Phu ; Sjöberg, Mats ; Sulubacak, Umut ; Tiedemann, Jörg ; Troncy, Raphaël ; Vázquez Carrillo, Juan Raúl. / The MeMAD Submission to the WMT18 Multimodal Translation Task. Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers. editor / Ondřej Bojar ; Rajen Chatterjee ; Christian Federmann ; Mark Fishel ; Yvette Graham ; Barry Haddow ; Matthias Huck ; Antonio Jimeno Yepes ; Philipp Koehn ; Christof Monz ; Matteo Negri ; Aurélie Névéol ; Mariana Neves ; Matt Post ; Lucia Specia ; Marco Turchi ; Karin Verspoor. Stroudsburg : Association for Computational Linguistics, 2018. pp. 603-611
@inproceedings{d755e4010d364e62aedbf0eab2f2ee8a,
title = "The MeMAD Submission to the WMT18 Multimodal Translation Task",
abstract = "This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice. We have the top-scoring system for both English-to-German and English-to-French, according to the automatic metrics for flickr18. Our experiments show that the effect of the visual features in our system is small. Our largest gains come from the quality of the underlying text-only NMT system. We find that appropriate use of additional data is effective.",
keywords = "113 Computer and information sciences, 6121 Languages",
author = "Stig-Arne Gr{\"o}nroos and Benoit Huet and Mikko Kurimo and Jorma Laaksonen and Bernard Merialdo and Phu Pham and Mats Sj{\"o}berg and Umut Sulubacak and J{\"o}rg Tiedemann and Rapha{\"e}l Troncy and {V{\'a}zquez Carrillo}, {Juan Ra{\'u}l}",
year = "2018",
month = "11",
day = "1",
language = "English",
pages = "603--611",
editor = "Ondřej Bojar and Rajen Chatterjee and Christian Federmann and Mark Fishel and Yvette Graham and Barry Haddow and Matthias Huck and Yepes, {Antonio Jimeno} and Philipp Koehn and Christof Monz and Matteo Negri and Aur{\'e}lie N{\'e}v{\'e}ol and Mariana Neves and Matt Post and Lucia Specia and Marco Turchi and Karin Verspoor",
booktitle = "Proceedings of the Third Conference on Machine Translation (WMT)",
publisher = "Association for Computational Linguistics",
address = "International",

}

Grönroos, S-A, Huet, B, Kurimo, M, Laaksonen, J, Merialdo, B, Pham, P, Sjöberg, M, Sulubacak, U, Tiedemann, J, Troncy, R & Vázquez Carrillo, JR 2018, The MeMAD Submission to the WMT18 Multimodal Translation Task. in O Bojar, R Chatterjee, C Federmann, M Fishel, Y Graham, B Haddow, M Huck, AJ Yepes, P Koehn, C Monz, M Negri, A Névéol, M Neves, M Post, L Specia, M Turchi & K Verspoor (eds), Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers. Association for Computational Linguistics, Stroudsburg, pp. 603-611, Conference on Machine Translation, Brussels, Belgium, 31/10/2018.

The MeMAD Submission to the WMT18 Multimodal Translation Task. / Grönroos, Stig-Arne; Huet, Benoit; Kurimo, Mikko; Laaksonen, Jorma; Merialdo, Bernard; Pham, Phu; Sjöberg, Mats; Sulubacak, Umut; Tiedemann, Jörg; Troncy, Raphaël; Vázquez Carrillo, Juan Raúl.

Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers. ed. / Ondřej Bojar; Rajen Chatterjee; Christian Federmann; Mark Fishel; Yvette Graham; Barry Haddow; Matthias Huck; Antonio Jimeno Yepes; Philipp Koehn; Christof Monz; Matteo Negri; Aurélie Névéol; Mariana Neves; Matt Post; Lucia Specia; Marco Turchi; Karin Verspoor. Stroudsburg : Association for Computational Linguistics, 2018. p. 603-611.


TY - GEN

T1 - The MeMAD Submission to the WMT18 Multimodal Translation Task

AU - Grönroos, Stig-Arne

AU - Huet, Benoit

AU - Kurimo, Mikko

AU - Laaksonen, Jorma

AU - Merialdo, Bernard

AU - Pham, Phu

AU - Sjöberg, Mats

AU - Sulubacak, Umut

AU - Tiedemann, Jörg

AU - Troncy, Raphaël

AU - Vázquez Carrillo, Juan Raúl

PY - 2018/11/1

Y1 - 2018/11/1

N2 - This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice. We have the top-scoring system for both English-to-German and English-to-French, according to the automatic metrics for flickr18. Our experiments show that the effect of the visual features in our system is small. Our largest gains come from the quality of the underlying text-only NMT system. We find that appropriate use of additional data is effective.

AB - This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice. We have the top-scoring system for both English-to-German and English-to-French, according to the automatic metrics for flickr18. Our experiments show that the effect of the visual features in our system is small. Our largest gains come from the quality of the underlying text-only NMT system. We find that appropriate use of additional data is effective.

KW - 113 Computer and information sciences

KW - 6121 Languages

M3 - Conference contribution

SP - 603

EP - 611

BT - Proceedings of the Third Conference on Machine Translation (WMT)

A2 - Bojar, Ondřej

A2 - Chatterjee, Rajen

A2 - Federmann, Christian

A2 - Fishel, Mark

A2 - Graham, Yvette

A2 - Haddow, Barry

A2 - Huck, Matthias

A2 - Yepes, Antonio Jimeno

A2 - Koehn, Philipp

A2 - Monz, Christof

A2 - Negri, Matteo

A2 - Névéol, Aurélie

A2 - Neves, Mariana

A2 - Post, Matt

A2 - Specia, Lucia

A2 - Turchi, Marco

A2 - Verspoor, Karin

PB - Association for Computational Linguistics

CY - Stroudsburg

ER -

Grönroos S-A, Huet B, Kurimo M, Laaksonen J, Merialdo B, Pham P et al. The MeMAD Submission to the WMT18 Multimodal Translation Task. In Bojar O, Chatterjee R, Federmann C, Fishel M, Graham Y, Haddow B, Huck M, Yepes AJ, Koehn P, Monz C, Negri M, Névéol A, Neves M, Post M, Specia L, Turchi M, Verspoor K, editors, Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers. Stroudsburg: Association for Computational Linguistics. 2018. p. 603-611