Abstract
We analyze the learning dynamics of neural language and translation models using Loss Change Allocation (LCA), an indicator that enables a fine-grained analysis of parameter updates when optimizing for the loss function. In other words, we can observe the contributions of different network components at training time. In this article, we systematically study masked language modeling, causal language modeling, and machine translation. We show that the choice of training objective leads to distinctive optimization procedures, even when performed on comparable Transformer architectures. We demonstrate how the various Transformer parameters are used during training, supporting that the feed-forward components of each layer are the main contributors to the optimization procedure. Finally, we find that the learning dynamics are not affected by data size and distribution but rather determined by the learning objective.
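As a rough illustration of the idea behind LCA, the sketch below attributes the loss change of each training step to individual parameters as the product of that parameter's gradient and its update, then sums these over training. It is a minimal first-order sketch on a hypothetical two-parameter linear model, not the setup used in the paper: the toy data and the names `loss`, `grad`, and `train_with_lca` are assumptions made for this example, and a more accurate approximation of the loss change along each update may be used in practice.

```python
# Toy data for a two-parameter linear model y = w*x + b (hypothetical example).
XS = [-2.0, -1.0, 0.0, 1.0, 2.0]
YS = [-3.0, -1.0, 1.0, 3.0, 5.0]  # generated by y = 2x + 1


def loss(theta):
    """Mean squared error of the linear model with parameters theta = [w, b]."""
    w, b = theta
    return sum((w * x + b - y) ** 2 for x, y in zip(XS, YS)) / len(XS)


def grad(theta):
    """Gradient of the mean squared error with respect to [w, b]."""
    w, b = theta
    n = len(XS)
    gw = sum(2.0 * (w * x + b - y) * x for x, y in zip(XS, YS)) / n
    gb = sum(2.0 * (w * x + b - y) for x, y in zip(XS, YS)) / n
    return [gw, gb]


def train_with_lca(steps=300, lr=0.01):
    """Run plain gradient descent and accumulate a first-order LCA per parameter."""
    theta = [0.0, 0.0]
    lca = [0.0, 0.0]
    start = loss(theta)
    for _ in range(steps):
        g = grad(theta)
        new_theta = [t - lr * gi for t, gi in zip(theta, g)]
        for i in range(len(theta)):
            # First-order share of this step's loss change owed to parameter i.
            lca[i] += g[i] * (new_theta[i] - theta[i])
        theta = new_theta
    return lca, loss(theta) - start


lca, loss_change = train_with_lca()
print("per-parameter LCA [w, b]:", lca)
print("sum of LCA:", sum(lca), "actual loss change:", loss_change)
```

A negative value means the parameter's updates reduced the loss. Summing the values over the parameters of one component (for example, all feed-forward weights of a layer) gives that component's contribution, which is the kind of comparison the abstract describes.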
Original language | English |
---|---|
Title of host publication | Proceedings of the 29th International Conference on Computational Linguistics |
Editors | Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, et al. |
Number of pages | 13 |
Place of publication | Gyeongju |
Publisher | International Committee on Computational Linguistics |
Publication date | Oct 2022 |
Pages | 4788-4800 |
Status | Published - Oct 2022 |
MoEC publication type | A4 Article in conference proceedings |
Event | International Conference on Computational Linguistics - Gyeongju, Republic of Korea (South Korea) Duration: 12 Oct 2022 → 17 Oct 2022 Conference number: 29 https://coling2022.org/ |
Publication series
Name | International conference on computational linguistics |
---|---|
Publisher | International Committee on Computational Linguistics |
Number | 1 |
Volume | 29 |
ISSN (print) | 2951-2093 |
Fields of science
- 6121 Languages
- 113 Computer and information sciences
Projects
- 1 Finished
- FoTran: Found in Translation - Natural Language Understanding with Cross-Lingual Grounding
Tiedemann, J., Attieh, J., Aulamo, M., Boggia, M., Celikkanat, H., De Gibert Bonet, O., Grönroos, S., Mickus, T., Raganato, A., Scherrer, Y., Silfverberg, M., Sjöblom, E. I., Talman, A., Vazquez, R., Virpioja, S. P., Yli-Jyrä, A., Zosa, E. & Sulubacak, U.
01/09/2018 → 31/03/2024
Project: EU Horizon 2020: European Research Council: Consolidator Grant (H2020-ERC-COG)