Projekt per år
Sammanfattning
Various studies show that pretrained language models such as BERT cannot straightforwardly replace encoders in neural machine translation despite their enormous success in other tasks. This is even more astonishing considering the similarities between the architectures. This paper sheds some light on the embedding spaces they create, using average cosine similarity, contextuality metrics and measures for representational similarity for comparison, revealing that BERT and NMT encoder representations look significantly different from one another. In order to address this issue, we propose a supervised transformation from one into the other using explicit alignment and fine-tuning. Our results demonstrate the need for such a transformation to improve the applicability of BERT in MT.
Originalspråk | engelska |
---|---|
Titel på värdpublikation | Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing : Student Research Workshop |
Redaktörer | Jad Kabbara, Haitao Lin, Amandalynne Paullada, Jannis Vamvas |
Antal sidor | 11 |
Utgivningsort | Stroudsburg |
Förlag | The Association for Computational Linguistics |
Utgivningsdatum | aug. 2021 |
Sidor | 337-347 |
ISBN (tryckt) | 978-1-954085-55-8 |
DOI | |
Status | Publicerad - aug. 2021 |
MoE-publikationstyp | A4 Artikel i en konferenspublikation |
Evenemang | Annual Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing - Bangkok [Online event] Varaktighet: 5 aug. 2021 → 6 aug. 2021 Konferensnummer: 59/11 |
Vetenskapsgrenar
- 113 Data- och informationsvetenskap
- 6121 Språkvetenskaper
Projekt
- 1 Slutfört
-
FoTran: Found in Translation - Natural Language Understanding with Cross-Lingual Grounding
Tiedemann, J., Attieh, J., Aulamo, M., Boggia, M., Celikkanat, H., De Gibert Bonet, O., Grönroos, S., Mickus, T., Raganato, A., Scherrer, Y., Silfverberg, M., Sjöblom, E. I., Talman, A., Vazquez , R., Virpioja, S. P., Yli-Jyrä, A., Zosa, E., Celikkanat, H., Raganato, A., Silfverberg, M., Sulubacak, U. & Vazquez , R.
01/09/2018 → 31/03/2024
Projekt: EU Horizon 2020: European Research Council: Consolidator Grant (H2020-ERC-COG)