Abstract
In this paper, we propose a multilingual encoder-decoder architecture that obtains multilingual sentence representations by incorporating an intermediate attention bridge shared across all languages. That is, we train the model with language-specific encoders and decoders that are connected via self-attention with a shared layer that we call the attention bridge. This layer exploits the semantics from each language for performing translation and develops into a language-independent meaning representation that can efficiently be used for transfer learning. We present a new framework for the efficient development of multilingual NMT using this model and scheduled training. We have tested the approach systematically with a multi-parallel data set. We show that the model achieves substantial improvements over strong bilingual models and that it also works well for zero-shot translation, which demonstrates its capacity for abstraction and transfer learning.
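The abstract does not spell out how the shared layer is computed, so the following is only a minimal sketch under stated assumptions: it treats the attention bridge as a structured self-attention layer that maps a variable number of encoder states to a fixed number of attention heads. The class name `AttentionBridge`, the weight matrices `w1` and `w2`, the `tanh` scoring function, and all dimensions are hypothetical choices for illustration, and the weights are random rather than trained.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    """Small random matrix; stands in for trained parameters or encoder output."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class AttentionBridge:
    """Assumed shared layer: maps n encoder states to a fixed number of heads."""

    def __init__(self, hidden_dim, attn_dim, num_heads, seed=0):
        rng = random.Random(seed)
        self.w1 = random_matrix(attn_dim, hidden_dim, rng)  # hypothetical projection
        self.w2 = random_matrix(num_heads, attn_dim, rng)   # hypothetical head scorer

    def __call__(self, states):
        # states: n x hidden_dim, produced by any language-specific encoder
        projected = matmul(self.w1, transpose(states))       # attn_dim x n
        activated = [[math.tanh(v) for v in row] for row in projected]
        scores = matmul(self.w2, activated)                  # num_heads x n
        weights = [softmax(row) for row in scores]           # each row sums to 1
        return matmul(weights, states)                       # num_heads x hidden_dim


# Two "sentences" of different length yield representations of the same size.
rng = random.Random(1)
bridge = AttentionBridge(hidden_dim=8, attn_dim=4, num_heads=3)
m_short = bridge(random_matrix(5, 8, rng))
m_long = bridge(random_matrix(12, 8, rng))
print(len(m_short), len(m_short[0]), len(m_long), len(m_long[0]))
```

Under these assumptions, the output size depends only on the number of heads and the hidden dimension, not on the sentence length or the source language. That is the property that would let one bridge sit between every encoder and every decoder.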
Original language | English |
---|---|
Title of host publication | The 4th Workshop on Representation Learning for NLP (RepL4NLP-2019) : Proceedings of the Workshop |
Editors | Isabelle Augenstein, Spandana Gella, Sebastian Ruder, Katharina Kann, Burcu Can, Johannes Welbl, Alexis Conneau, Xiang Ren, Marek Rei |
Number of pages | 7 |
Place of publication | Stroudsburg |
Publisher | The Association for Computational Linguistics |
Publication date | 2019 |
Pages | 33-39 |
ISBN (electronic) | 978-1-950737-35-2 |
Status | Published - 2019 |
MoE publication type | A4 Article in conference proceedings |
Event | Workshop on Representation Learning for NLP - Florence, Italy. Duration: 2 Aug 2019 → 2 Aug 2019. Conference number: 4 |
Fields of Science
- 6121 Languages
- 113 Computer and information sciences
- FoTran: Found in Translation - Natural Language Understanding with Cross-Lingual Grounding
  Tiedemann, J., Celikkanat, H., Raganato, A., Silfverberg, M., Sulubacak, U., Vazquez, R., Apidianaki, M., Attieh, J., Aulamo, M., Boggia, M., De Gibert Bonet, O., Grönroos, S., Mickus, T., Scherrer, Y., Sjöblom, E. I., Talman, A., Virpioja, S. P., Yli-Jyrä, A. & Zosa, E.
  01/09/2018 → 29/02/2024
  Project: EU Horizon 2020: European Research Council: Consolidator Grant (H2020-ERC-COG)
- NLUxLG: NLU with Cross-Lingual Grounding
  Tiedemann, J., Talman, A., Raganato, A. & Celikkanat, H.
  01/01/2018 → 31/12/2019
  Project: Research project