HFST Training Environment and Recent Additions

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKapitelVetenskapligPeer review

Sammanfattning

HFST - the Helsinki Finite-State Technology toolkit was launched in 2009 (Lindén & al, 2009) and has since been used for developing a number of rule-based morphologies for processing natural language. To promote the uptake of the toolkit a training environment for linguists to learn how to use HFST has been designed in Jupyter. This paper presents an overview of the training environment and some of the recent features that have been added to HFST to keep the run-time size of the transducer reasonably small despite exceptions and negative constraints that need to be added during practical FST development.
Originalspråkengelska
Titel på värdpublikationRule-Based Language Technology
RedaktörerArvi Hurskainen, Kimmo Koskenniemi, Tommi Pirinen
Antal sidor10
UtgivningsortTartu
FörlagNorthern European Association for Language Technology
Utgivningsdatum2023
Sidor60-69
StatusPublicerad - 2023
MoE-publikationstypA3 Del av bok eller annan forskningsbok

Publikationsserier

NamnNEALT Monograph Series
FörlagNorthern European Association for Language Technology (NEALT)
Nummer2[1]
ISSN (elektroniskt)1736-6291

Vetenskapsgrenar

  • 6121 Språkvetenskaper
  • 113 Data- och informationsvetenskap

Citera det här