Improving Finite-State Spell-Checker Suggestions with Part of Speech N-Grams

Tommi Pirinen, Miikka Silfverberg, Krister Linden

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    Abstract

    In this paper we demonstrate a finite-state implementation of context-aware spell checking utilizing an N-gram based part of speech (POS) tagger to rerank the suggestions from a simple edit-distance based spell-checker. We demonstrate the benefits of context-aware spell-checking for English and Finnish and introduce modifications that are necessary to make traditional N-gram models work for morphologically more complex languages, such as Finnish.
    Original languageEnglish
    Title of host publicationComputational Linguistics and Intelligent Text Processing : 13th International Conference, CICLing 2012
    EditorsAlexander Gelbukh
    Number of pages11
    Place of PublicationDelhi, India
    Publication date9 Mar 2012
    Publication statusPublished - 9 Mar 2012
    MoE publication typeA4 Article in conference proceedings
    EventInternational Conference on Intelligent Text Processing and Computational Linguistics - New Delhi, India
    Duration: 11 Mar 201217 Mar 2012

    Fields of Science

    • 6121 Languages
    • Language technology
    • HFST
    • spelling suggestions

    Cite this