Optimization-based planning of speech articulation using general Tau Theory

Benjamin Elie, Juraj Šimko, Alice Turk

Forskningsoutput: TidskriftsbidragArtikelVetenskapligPeer review

Sammanfattning

This paper presents a model of speech articulation planning and generation based on General Tau Theory and Optimal Control Theory. Because General Tau Theory assumes that articulatory targets are always reached, the model accounts for speech variation via context-dependent articulatory targets. Targets are chosen via the optimization of a composite objective function. This function models three different task requirements: maximal intelligibility, minimal articulatory effort and minimal utterance duration. The paper shows that systematic phonetic variability can be reproduced by adjusting the weights assigned to each task requirement. Weights can be adjusted globally to simulate different speech styles, and can be adjusted locally to simulate different levels of prosodic prominence. The solution of the optimization procedure contains Tau equation parameter values for each articulatory movement, namely position of the articulator at the movement offset, movement duration, and a parameter which relates to the shape of the movement's velocity profile. The paper presents simulations which illustrate the ability of the model to predict or reproduce several well-known characteristics of speech. These phenomena include close-to-symmetric velocity profiles for articulatory movement, variation related to speech rate, centralization of unstressed vowels, lengthening of stressed vowels, lenition of unstressed lingual stop consonants, and coarticulation of stop consonants.

Originalspråkengelska
Artikelnummer103083
TidskriftSpeech Communication
Volym160
ISSN0167-6393
DOI
StatusPublicerad - maj 2024
MoE-publikationstypA1 Tidskriftsartikel-refererad

Bibliografisk information

Publisher Copyright:
© 2024 The Author(s)

Vetenskapsgrenar

  • 6121 Språkvetenskaper

Citera det här