Abstract
Forced alignment is an effective process to speed up linguistic research. However, most forced aligners are language-dependent, and under-resourced languages rarely have enough resources to train an acoustic model for an aligner. We present a new Finnish grapheme-based forced aligner and demonstrate its performance by aligning multiple Uralic languages and English as an unrelated language. We show that even a simple non-expert created grapheme-to-phoneme mapping can result in useful word alignments.
Original language | English |
---|---|
Title of host publication | Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa) |
Editors | Simon Dobnik, Lilja Øvrelid |
Number of pages | 6 |
Place of Publication | Linköping |
Publisher | Linköping University Electronic Press |
Publication date | 1 May 2021 |
Pages | 345-350 |
ISBN (Electronic) | 978-91-7929-614-8 |
Publication status | Published - 1 May 2021 |
MoE publication type | A4 Article in conference proceedings |
Event | Nordic Conference on Computational Linguistics - [Online event], Reykjavik, Iceland Duration: 31 May 2021 → 2 Jun 2021 Conference number: 23 https://nodalida2021.github.io/index.html |
Publication series
Name | Linköping Electronic Conference Proceedings |
---|---|
Publisher | Linköping University Electronic Press |
Number | 78 |
ISSN (Print) | 1650-3686 |
ISSN (Electronic) | 1650-3740 |
Name | NEALT Proceedings Series |
---|---|
Publisher | University of Tartu |
Number | 45 |
ISSN (Print) | 1736-8197 |
ISSN (Electronic) | 1736-6305 |
Fields of Science
- 113 Computer and information sciences
- 6121 Languages