Lects in Helsinki Finnish - a probabilistic component modeling approach

Olli Kuparinen, Jaakko Peltonen, Liisa Mustanoja, Unni Leino, Jenni Santaharju

Research output: Contribution to journal › Article › Scientific › peer-review


This article examines Finnish lects spoken in Helsinki from the 1970s to the 2010s with a probabilistic model called Latent Dirichlet Allocation. The model searches for underlying components based on the linguistic features used in the interviews. Several coherent lects were discovered as components in the data, which runs counter to the results of previous studies that report only weak co-variation between features that are assumed to represent the same lect. The speakers, however, are not categorical in their linguistic behavior and tend to use more than one lect in their speech. This implies that the lects should not be treated as parallel to seemingly uniform linguistic systems such as languages, but as partial systems that constitute a network.
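As a rough illustration of how Latent Dirichlet Allocation can recover components from feature usage and assign each speaker a mixture of them, here is a minimal sketch. Everything in it is an assumption made for illustration: the toy data, the variant ids, the number of components, the prior values, and the use of a collapsed Gibbs sampler. The record above does not state the article's feature set, model settings, or inference method.

```python
import random


def fit_lda(docs, n_components, n_vocab, alpha=0.1, beta=0.01,
            n_iter=200, seed=0):
    """Fit LDA with a collapsed Gibbs sampler (illustrative, not optimized).

    docs: one list of variant ids per speaker (one id per observed token).
    Returns (theta, phi): speaker-by-component and component-by-variant
    probability tables.
    """
    rng = random.Random(seed)
    doc_comp = [[0] * n_components for _ in docs]
    comp_word = [[0] * n_vocab for _ in range(n_components)]
    comp_total = [0] * n_components
    assign = []

    # Start from a random component assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(n_components)
            zs.append(z)
            doc_comp[d][z] += 1
            comp_word[z][w] += 1
            comp_total[z] += 1
        assign.append(zs)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assign[d][i]
                doc_comp[d][z] -= 1
                comp_word[z][w] -= 1
                comp_total[z] -= 1
                # Resample its component given all other assignments.
                weights = [
                    (doc_comp[d][k] + alpha)
                    * (comp_word[k][w] + beta)
                    / (comp_total[k] + n_vocab * beta)
                    for k in range(n_components)
                ]
                z = rng.choices(range(n_components), weights=weights)[0]
                assign[d][i] = z
                doc_comp[d][z] += 1
                comp_word[z][w] += 1
                comp_total[z] += 1

    # Smoothed point estimates from the final counts.
    theta = [
        [(c + alpha) / (len(doc) + n_components * alpha) for c in doc_comp[d]]
        for d, doc in enumerate(docs)
    ]
    phi = [
        [(comp_word[k][w] + beta) / (comp_total[k] + n_vocab * beta)
         for w in range(n_vocab)]
        for k in range(n_components)
    ]
    return theta, phi


# Hypothetical toy data: 4 speakers, 4 variant ids (0..3).
# Speakers 0 and 1 mostly use variants 0 and 1, speaker 2 uses 2 and 3,
# and speaker 3 mixes both sets.
docs = [
    [0, 1] * 10,
    [0, 0, 1] * 6,
    [2, 3] * 10,
    [0, 1, 2, 3] * 5,
]
theta, phi = fit_lda(docs, n_components=2, n_vocab=4)

for speaker, mixture in enumerate(theta):
    print(speaker, [round(p, 2) for p in mixture])
```

Each row of `theta` is one speaker's mixture over components, so a speaker who draws on more than one lect shows up as a row with weight spread across several components rather than concentrated in one. Each row of `phi` gives the variants that characterize a component.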
Original language: English
Article number: 0954394521000041
Journal: Language Variation and Change
Pages (from-to): 1-26
Number of pages: 26
Publication status: Published - 17 May 2021
MoE publication type: A1 Journal article-refereed

Fields of Science

  • 6121 Languages
