Morpheme Segmentation Gold Standards for Finnish and English

    Research output: Chapter in Book/Report/Conference proceedingChapterScientific

    Abstract

    This document describes Hutmegs, the Helsinki University of Technology Morphological Evaluation Gold Standard package, which contains gold-standard morphological segmentations for 1.4 million Finnish and 120 000 English words. The Gold Standards comprise surface-string, or allomorph, segmentations of word forms, as well as deep-level, or morpheme, segmentations of the words.
    Original languageEnglish
    Title of host publicationPublications in Computer and Information Science : Report A77
    Number of pages33
    PublisherHelsinki University of Technology
    Publication date2004
    Publication statusPublished - 2004
    MoE publication typeB2 Book chapter

    Fields of Science

    • 6121 Languages
    • Morphology
    • Gold Standard

    Cite this