HFST - Helsinki Finite-State Technology

Project Details

Description

The goal is to create a high-performing, maintainable and modifiable set of tools for morphological analysis and generation according to the principles of open source software. The Helsinki Finite-State Transducer toolkit is intended for processing natural language. The toolkit is demonstrated by wide-coverage implementations of a number of languages of varying morphological complexity.
StatusActive
Effective start/end date01/01/2005 → …

Fields of Science

  • 113 Computer and information sciences
  • 612 Languages and Literature

Projects

FinnTreeBank: A Finnish text corpus with morphological and dependency syntactic annotation

Voutilainen, A., Purtonen, T. K., Muhonen, K. & Kumlander, M.

01/05/2010 → …

Project: Research project

FinnWordNet - A Finnish WordNet

Linden, K., Niemi, J., Hyvärinen, M., Muhonen, K. & Pääkkö, P.

01/01/2010 → …

Project: Research project

FIN-CLARIN - Kielipankki

Linden, K., Piitulainen, J., Niemi, J., Lennes, M., Bartis, I., Westerlund, H., Drobac, S., Axelson, E. & Kauppinen, P.

05/06/2006 → …

Project: Other project

Research Output

Nettidigisanakirja suomi-udmurtti

Translated title of the contribution: Nettidigisanit Finnish-UdmurtRueter, J. M., Saarinen, S., Koivunen, T. & Johnson, R., Aug 2016

Research output: Non-textual formSoftwareScientific

Open Access

Finite-State Methods and Models in Natural Language Processing

Yli-Jyrä, A. M., Kornai, A. & Sakarovitch, J., 2011, In : Natural Language Engineering. 17, 2, p. 141-144 4 p.

Research output: Contribution to journalReview ArticleScientificpeer-review

Open Access
File

Building and Using Existing Hunspell Dictionaries and TEX Hyphenators as Finite-State Automata

Pirinen, T. & Linden, K., Oct 2010, Proceedings of International Multiconference on Computer Science and Information Technology: Computational Linguistics—Applications (CLA'10 ). Ganzha, M. & Paprzycki, M. (eds.). Wisla, Poland, Vol. 5. p. 477–484 8 p. (Proceedings of the International Multiconference on Computer Science and Information Technology).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

File

Activities

Bruce Watson

Anssi Yli-Jyrä (Host)

3 Jun 201810 Jun 2018

Activity: Hosting a visitor typesAcademic visit at UH

The Power of Constraint Grammars Revisited

Anssi Yli-Jyrä (Speaker)

11 May 2017

Activity: Talk or presentation typesOral presentation

Edward Gibson

Anssi Yli-Jyrä (Host)

28 Nov 20164 Dec 2016

Activity: Hosting a visitor typesAcademic visit at UH