Sammanfattning
HFST - the Helsinki Finite-State Technology toolkit was launched in 2009 (Lindén & al, 2009) and has since been used for developing a number of rule-based morphologies for processing natural language. To promote the uptake of the toolkit a training environment for linguists to learn how to use HFST has been designed in Jupyter. This paper presents an overview of the training environment and some of the recent features that have been added to HFST to keep the run-time size of the transducer reasonably small despite exceptions and negative constraints that need to be added during practical FST development.
Originalspråk | engelska |
---|---|
Titel på värdpublikation | Rule-Based Language Technology |
Redaktörer | Arvi Hurskainen, Kimmo Koskenniemi, Tommi Pirinen |
Antal sidor | 10 |
Utgivningsort | Tartu |
Förlag | Northern European Association for Language Technology |
Utgivningsdatum | 2023 |
Sidor | 60-69 |
Status | Publicerad - 2023 |
MoE-publikationstyp | A3 Del av bok eller annan forskningsbok |
Publikationsserier
Namn | NEALT Monograph Series |
---|---|
Förlag | Northern European Association for Language Technology (NEALT) |
Nummer | 2[1] |
ISSN (elektroniskt) | 1736-6291 |
Vetenskapsgrenar
- 6121 Språkvetenskaper
- 113 Data- och informationsvetenskap
Utrustning
-
CLARIN - Finländska språkresurser i gemensamt bruk
Linden, K. (Chef)
Avdelningen för digital humanioraUtrustning/facilitet: Coordination office