CLARIN and free open source finite-state tools

    Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

    Abstract

    A new emerging European research infrastructure called CLARIN and a related project called HFST are briefly described. HFST has built a programming interface on top of some existing open source finite-state packages such as SFST and OpenFST. In order to verify its utility, HFST has built open source tools on top of this HFST interface. These tools create lexical transducers, compile morphophonological two-level rules and combine them into a transducer lexicon. The tools have been tested against independently created full-scale lexicons and rules for Northern Sámi and Lule Sámi languages which have more complicated lexical and morphophonological structure than most other European languages.
    Original languageFinnish
    Title of host publicationFinite-State Methods and Natural Language Processing : Post-proceedings of the 7th International Workshop FSMNLP 2008
    EditorsJakub Piskorski, Bruce Watson, Anssi Yli-Jyrä
    Number of pages11
    PublisherIOS PRESS
    Publication date2009
    Pages3-13
    ISBN (Print)978-1-58603-975-2
    Publication statusPublished - 2009
    MoE publication typeA3 Book chapter

    Publication series

    NameFrontiers in Artificial Intelligence and Applications
    PublisherIOS Press
    Volume191
    ISSN (Print)0922-6389

    Fields of Science

    • 612 Languages and Literature
    • research infrastructures
    • morphological analysis
    • finite-state methods
    • 113 Computer and information sciences
    • finite-state transducers
    • software architecture

    Cite this