Abstrakti
This paper presents a method for linking models for aligning linguistic etymological data with models for phylogenetic inference from population genetics. We begin with a large database of genetically related words—sets of cognates—from languages in a language family. We process the cognate sets to obtain a complete alignment of the data. We use the alignments as input to a model developed for phylogenetic reconstruction in population genetics. This is achieved via a natural novel projection of the linguistic data onto genetic primitives. As a result, we induce phylogenies based on aligned linguistic data. We place the method in the context of those reported in the literature, and illustrate its operation on data from the Uralic language family, which results in family trees that are very close to the “true” (expected) phylogenies.
Alkuperäiskieli | englanti |
---|---|
Otsikko | The 54th Annual Meeting of the Association for Computational Linguistics : Proceedings of the 7th Workshop on Cognitive Aspects of Computational Language Learning |
Sivumäärä | 11 |
Julkaisupaikka | Stroudsburg, PA |
Kustantaja | The Association for Computational Linguistics |
Julkaisupäivä | 2016 |
Sivut | 27-37 |
ISBN (painettu) | 978-1-945626-07-4 |
Tila | Julkaistu - 2016 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisuussa |
Tapahtuma | Cognitive Aspects of Computational Language Learning - Berlin, Saksa Kesto: 11 elok. 2016 → 11 elok. 2016 Konferenssinumero: 7 |
Tieteenalat
- 113 Tietojenkäsittely- ja informaatiotieteet
Projektit
-
LLL: Language Learning Lab
Yangarber, R. (Projektinjohtaja), Katinskaia, A. (Osallistuja), Hou, J. (Osallistuja), Furlan, G. (Osallistuja) & Kylliäinen, I. P. (Osallistuja)
Projekti: Tutkimusprojekti
-
Revita: Language learning and AI
Yangarber, R. (Projektinjohtaja), Katinskaia, A. (Osallistuja), Hou, J. (Osallistuja), Furlan, G. (Osallistuja) & Kylliäinen, I. P. (Osallistuja)
Projekti: Tutkimusprojekti