Abstract
Prosodic patterns—and linguistic structures in general— are hierarchical in nature, providing for efficient means for en- coding information in temporally constrained situations where communicative events occur. However, there are no theoreti- cal frameworks that are capable of representing the full extent of linguistic behaviour in a cohesive way that could capture the paradigmatic and syntagmatic links between the organizational levels present in everyday speech.
Here we propose a novel theoretical and modelling account of perception and production of prosodic patterns in speech communication, derived from the influential Predictive Processing theory of neural implementation of perception and action based on a hierarchical system of generative models producing progressively more detailed probabilistic predictions of future events. The framework provides a conceptualization of the hierarchical organization of speech prosody as well as a principled way of unifying speech perception and production by postulating a single processing hierarchy shared by both modalities. We discuss the possible implications of the theory for prosodic analysis of speech communication, including conversational setting. In addition, we outline a viable computational implementation in the form of a machine learning architecture that can be used as a testbed for generating and evaluating predictions brought forth by the theory.
Here we propose a novel theoretical and modelling account of perception and production of prosodic patterns in speech communication, derived from the influential Predictive Processing theory of neural implementation of perception and action based on a hierarchical system of generative models producing progressively more detailed probabilistic predictions of future events. The framework provides a conceptualization of the hierarchical organization of speech prosody as well as a principled way of unifying speech perception and production by postulating a single processing hierarchy shared by both modalities. We discuss the possible implications of the theory for prosodic analysis of speech communication, including conversational setting. In addition, we outline a viable computational implementation in the form of a machine learning architecture that can be used as a testbed for generating and evaluating predictions brought forth by the theory.
Original language | English |
---|---|
Title of host publication | Proceedings of Speech Prosody 2022 |
Place of Publication | Baixas |
Publisher | ISCA - International Speech Communication Association |
Publication date | 24 May 2022 |
DOIs | |
Publication status | Published - 24 May 2022 |
MoE publication type | A4 Article in conference proceedings |
Event | Speech Prosody 2022 - Lisbon, Portugal Duration: 23 May 2022 → 26 May 2022 Conference number: 11 http://labfon.letras.ulisboa.pt/sp2022/index.html |
Publication series
Name | Speech prosody |
---|---|
Publisher | International Speech Communication Association |
ISSN (Electronic) | 2333-2042 |
Fields of Science
- 6161 Phonetics
- 6121 Languages