Abstract
Prosodic patterns, and linguistic structures in general, are hierarchical in nature, providing efficient means of encoding information in the temporally constrained situations where communicative events occur. However, there are no theoretical frameworks capable of representing the full extent of linguistic behaviour in a cohesive way that captures the paradigmatic and syntagmatic links between the organizational levels present in everyday speech.
Here we propose a novel theoretical and modelling account of the perception and production of prosodic patterns in speech communication. The account is derived from the influential Predictive Processing theory, which describes the neural implementation of perception and action in terms of a hierarchical system of generative models that produce progressively more detailed probabilistic predictions of future events. The framework provides a conceptualization of the hierarchical organization of speech prosody as well as a principled way of unifying speech perception and production by postulating a single processing hierarchy shared by both modalities. We discuss the possible implications of the theory for prosodic analysis of speech communication, including conversational settings. In addition, we outline a viable computational implementation in the form of a machine learning architecture that can be used as a testbed for generating and evaluating predictions brought forth by the theory.
Original language | English |
---|---|
Title of host publication | Proceedings of Speech Prosody 2022 |
Place of publication | Baixas |
Publisher | ISCA - International Speech Communication Association |
Publication date | 24 May 2022 |
DOI | |
Status | Published - 24 May 2022 |
MoE publication type | A4 Article in conference proceedings |
Event | Speech Prosody 2022 - Lisbon, Portugal. Duration: 23 May 2022 → 26 May 2022. Conference number: 11. http://labfon.letras.ulisboa.pt/sp2022/index.html |
Publication series
Name | Speech prosody |
---|---|
Publisher | International Speech Communication Association |
ISSN (electronic) | 2333-2042 |
Fields of science
- 6161 Phonetics
- 6121 Languages