Lects in Helsinki Finnish - a probabilistic component modeling approach

Olli Kuparinen, Jaakko Peltonen, Liisa Mustanoja, Unni Leino, Jenni Santaharju

Forskningsoutput: TidskriftsbidragArtikelVetenskapligPeer review


This article examines Finnish lects spoken in Helsinki from the 1970s to the 2010s with a probabilistic model called Latent Dirichlet Allocation. The model searches for underlying components based on the linguistic features used in the interviews. Several coherent lects were discovered as components in the data, which counters the results of previous studies that report only weak co-variation between features that are assumed to present the same lect. The speakers, however, are not categorical in their linguistic behavior and tend to use more than one lect in their speech. This implies that the lects should not be considered in parallel with seemingly uniform linguistic systems such as languages, but as partial systems that constitute a network.
TidskriftLanguage Variation and Change
Sidor (från-till)1-26
Antal sidor26
StatusPublicerad - 17 maj 2021
MoE-publikationstypA1 Tidskriftsartikel-refererad


  • 6121 Språkvetenskaper

Citera det här