Projekt per år
Projektinformation
Beskrivning (abstrakt)
The hearts of the European speak 50 - 100 languages that are too important to be ignored in research, content production or education. The current application is concerned with the lack of resources needed when we implement our shared vision of a multilingual society. Due to a similar concern, the EU Commission is funding the preparatory phase of the “Common Lan- guage Resource and Language Technology Research Infrastructure (CLARIN)”, a pan-European initiative that aims to establish an interoperable and integrated research infrastructure.
Finland is one of the main partners of CLARIN, and the current application would form a significant national contribution to its success. The proposed research would be based on two particular strengths of the Finnish research: language-independent finite-state technology and open-source technology.
• Finite-state technology is very useful in natural language processing because with it lin- guistic rules can be compiled into efficient models in computer.
• The use of open source technology ensures the widest applicability of the language tech- nology infrastructure.
The purpose of the proposed basic research is to create a renewed theory of compilation of linguistic knowledge into finite-state models. The commercial grammar formalisms for finite- state morphology are based on complicated 10-20-year old algorithms that have not been adopted in free and open-source software. The applicant’s recent results indicate that more parsimonious computational equipment could improve the elegancy and generality of the modeled linguistic formalisms.
Language resource building needs practical methods. The proposed research would establish new solutions to the long-standing space explosion problem of compilers, and develop descrip- tive means for complex grammatical phenomena. The reduced need for hi-tech equipment means that language-independent modeling formalisms can become freely available, which empowers any language community to build e.g. morphological models for themselves.
Multi-lingual morphological models based on finite-state technology will be in a significant role in CLARIN, and they would enable competitive language research and advanced language technology applications. With the morphological models, computers can have a capability to translate texts from one dialect to another, facilitate reading of foreign language texts, aid in language learning, assist in natural language queries from e.g. internet, and improve the qual- ity of common content production. These applications help multi-lingual education to reduce inequality, poverty and insecurity in the society and can make a better future for our children.
Akronym | MORFCOMPILERS |
---|---|
Status | Slutfört |
Gällande start-/slutdatum | 01/01/2009 → 31/12/2011 |
Finansiering
- Unknown funder: 195 450,00 €
Vetenskapsgrenar
- 612 Språk och litteratur
- 113 Data- och informationsvetenskap
-
YK:n vuosituhatjulistus ja African kieliteknologia. Tiekartta bantukielten resurssien kehittämiseksi
Killian, D. (Deltagare)
01/01/2010 → …
Projekt: Forskningsprojekt
-
HFST - Helsinki Finite-State Technology
Linden, K. (Principal Investigator), Koskenniemi, K. (Deltagare), Yli-Jyrä, A. (Deltagare), Hulden, M. (Deltagare), Silfverberg, M. (Deltagare), Pirinen, T. (Deltagare), Axelson, E. (Deltagare), Hardwick, S. (Deltagare), Niemi, J. (Annan) & Hurskainen, A. (Deltagare)
01/01/2005 → …
Projekt: Forskningsprojekt
-
FIELDSYNERGY-TRIAL: Rationalizing Parallel Linguistic Description and Computational Modeling
Yli-Jyrä, A. (Deltagare)
01/01/2011 → 01/01/2011
Projekt: Forskningsprojekt
-
Finite-State Methods and Models in Natural Language Processing
Yli-Jyrä, A. M., Kornai, A. & Sakarovitch, J., 2011, I: Natural Language Engineering. 17, 2, s. 141-144 4 s.Forskningsoutput: Tidskriftsbidrag › Översiktsartikel › Peer review
Öppen tillgångFil -
Conversions between crisply bipartite and unambiguous automata
Yli-Jyrä, A. M., 3 maj 2010, Weighted Automata: Theory and Applications, Leipzig, May 3-7, 2010. Droste, M., Gastin, P., Kuich, W. & Vogler, H. (red.). Leipzig: Universität Leipzig, 1 s.Forskningsoutput: Kapitel i bok/rapport/konferenshandling › Konferensbidrag › Vetenskaplig
-
Finite-State Methods and Natural Language Processing: 8th International Workshop, FSMNLP 2009, Pretoria, South Africa, July 2009, Revised Selected Papers
Yli-Jyrä, A. M. (Redaktör), Kornai, A. (Redaktör), Sakarovitch, J. (Redaktör) & Watson, B. (Redaktör), 2010, Berlin Heidelberg: Springer-Verlag. 147 s. (Lecture Notes in Artificial Intelligence; vol. 6062, nr. 6062)Forskningsoutput: Bok/rapport › Antologi eller special utgåva › Vetenskaplig › Peer review
Aktiviteter
- 1 Handledare eller bihandledare av doktorsavhandling
-
Supervisor of doctoral thesis in Language Technology / Miikka Silfverberg
Linden, B. K. J. (Handledare) & Yli-Jyrä, A. (Handledare)
1 jan. 2010 → 22 okt. 2016Aktivitet: Examinationstyper › Handledare eller bihandledare av doktorsavhandling