Abstract
Contextualized word representations encode rich information about syntax and semantics, alongside specificities of each context of use. While contextual variation does not always reflect actual meaning shifts, it can still reduce the similarity of embeddings for word instances having the same meaning. We explore the imprint of two specific linguistic alternations, namely passivization and negation, on the representations generated by neural models trained with two different objectives: masked language modeling and translation. Our exploration methodology is inspired by an approach previously proposed for removing societal biases from word vectors. We show that passivization and negation leave their traces on the representations, and that neutralizing this information leads to more similar embeddings for words that should preserve their meaning in the transformation. We also find clear differences in how the respective features generalize across datasets.
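The neutralization idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes one common debiasing recipe: estimate a single feature direction from paired embeddings, then project it out. The abstract does not say which procedure the authors used, so this is not their implementation. All names (`feature_direction`, `neutralize`, `mean_cosine`) and the synthetic "active"/"passive" vectors are hypothetical and serve only as an illustration.

```python
import numpy as np


def feature_direction(emb_a, emb_b):
    """Estimate a unit feature direction from paired embeddings (rows are pairs)."""
    # Centre each pair on its own mean so only the within-pair offset remains.
    centre = (emb_a + emb_b) / 2.0
    offsets = np.vstack([emb_a - centre, emb_b - centre])
    # The first right-singular vector is the dominant axis of those offsets.
    _, _, vt = np.linalg.svd(offsets, full_matrices=False)
    return vt[0]


def neutralize(emb, direction):
    """Remove the component of each row of `emb` that lies along `direction`."""
    direction = direction / np.linalg.norm(direction)
    return emb - np.outer(emb @ direction, direction)


def mean_cosine(a, b):
    """Average row-wise cosine similarity between two matrices."""
    num = np.sum(a * b, axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1)
    return float(np.mean(num / den))


# Synthetic stand-in data (assumption): the same "word meaning" vector appears
# in two contexts, and the second context adds a constant shift on one axis.
rng = np.random.default_rng(0)
dim, n_pairs = 16, 200
shared = rng.normal(size=(n_pairs, dim))
shift = np.zeros(dim)
shift[0] = 3.0
active = shared + 0.05 * rng.normal(size=(n_pairs, dim))
passive = shared + shift + 0.05 * rng.normal(size=(n_pairs, dim))

v = feature_direction(active, passive)
before = mean_cosine(active, passive)
after = mean_cosine(neutralize(active, v), neutralize(passive, v))
print(f"mean cosine before: {before:.3f}, after: {after:.3f}")
```

On this toy data the paired vectors become more similar once the estimated direction is removed, which mirrors the qualitative claim in the abstract. Real contextualized embeddings would replace the synthetic arrays, and a single direction may not capture the whole feature.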
Original language | English |
---|---|
Title of host publication | Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP |
Editors | Afra Alishahi, Yonatan Belinkov, Grzegorz Chrupała, Dieuwke Hupkes, Yuval Pinter, Hassan Sajjad |
Number of pages | 13 |
Place of publication | Stroudsburg |
Publisher | The Association for Computational Linguistics |
Publication date | 20 Nov 2020 |
Pages | 136-148 |
ISBN (electronic) | 978-1-952148-86-6 |
DOI | |
Status | Published - 20 Nov 2020 |
MoE publication type | A4 Article in conference proceedings |
Event | BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP - Online event; Duration: 20 Nov 2020 → 20 Nov 2020; Conference number: 3 |
Fields of science
- 6121 Languages
- 113 Computer and information sciences
Projects
- 1 Finished
- FoTran: Found in Translation - Natural Language Understanding with Cross-Lingual Grounding
Tiedemann, J., Attieh, J., Aulamo, M., Boggia, M., Celikkanat, H., De Gibert Bonet, O., Grönroos, S., Mickus, T., Raganato, A., Scherrer, Y., Silfverberg, M., Sjöblom, E. I., Talman, A., Vazquez, R., Virpioja, S. P., Yli-Jyrä, A., Zosa, E. & Sulubacak, U.
01/09/2018 → 31/03/2024
Project: EU Horizon 2020: European Research Council: Consolidator Grant (H2020-ERC-COG)