Projekteja vuodessa
Projektin yksityiskohdat
Kuvaus (abstrakti)
The key interest underlying our proposal is to build African language resources. Their availability will enhance the use of African languages in language technology. This work will take place in close partnership with the currently developing European infrastructure (CLARIN) which will thus benefit from a better representation of African languages with their specific needs based on typological properties of these languages (in this case from the Bantu group). Capacity-building in the relevant sector, both in Africa and in Helsinki, is an essential feature of our project.
In terms of methodology, two major perspectives are represented in the project. Linguistic ques- tions concern mainly the building of resources and tools for African languages, in this case specifically Bantu languages. Our focus will be on languages of wider distribution with few resources, such as Oshikwanyama, Otjiherero, Setswana, and possibly others (Kinyarwanda, Bemba, Chichewa, etc.).
The identification of target languages is based on criteria such as the availability of material and the urgency with which computational linguistic should be carried out. The former is mainly a sci- entific constraint, depending on the availability of language descriptions, formal grammars and lex- ical databases. The second is largely determined by political factors, interest of the members of the language communities, government policies, and availability of local counterparts, competence and willingness to cooperate.
The main linguistic task will consist in assessing relevant reference material, electronic corpora, and other language resources. At the same time, some effort will be dedicated to networking ac- tivities with scholars working on the relevant African languages. From the computational side, an innovative approach (“pointwise weighted finite-state”) is pursued in the project. In practical terms, computational linguists will scrutinise the hypothesis that available reference material (much of which is generative, rule-based) can be successfully exploited for improving FS methods and a constraint- based understanding of the relevant language properties.
The scientific results concern mostly the methodological questions outlined in the previous para- graph. At the same time, this initiative will be of immediate relevance to language practitioners in African countries and thus open new paths to improved educational chances, one of the crucial factors for sustainable development of human resources in Africa.
Akronyymi | LT4AFRICA |
---|---|
Tila | Päättynyt |
Todellinen alku/loppupvm | 01/01/2010 → 31/12/2010 |
Rahoitus
- Suomen Akatemia: 47 300,00 €
Tieteenalat
- 6121 Kielitieteet
- bantukielet
- morfologia
- leksikaalinen tooni
- kielten dokumentointi
- kieliteknologiaresurssit
- 113 Tietojenkäsittely- ja informaatiotieteet
- äärellistilaiset menetelmät
- 519 Yhteiskuntamaantiede, talousmaantiede
- Afrikka
-
HFST - Helsinki Finite-State Technology
Linden, K., Koskenniemi, K., Yli-Jyrä, A., Hulden, M., Silfverberg, M., Pirinen, T., Axelson, E., Hardwick, S., Niemi, J. & Hurskainen, A.
01/01/2005 → …
Projekti: Tutkimusprojekti
-
Graph-Based Representation of Cross-Lingual Alignments
Abend, O., Ronning, M. & Miles, G.
01/01/2018 → 17/01/2020
Projekti: Tutkimusprojekti
-
FIELDSYNERGY-TRIAL: Rationalizing Parallel Linguistic Description and Computational Modeling
01/01/2011 → 01/01/2011
Projekti: Tutkimusprojekti
-
On Practical Realisation of Autosegmental Representations in Lexical Transducers of Tonal Bantu Languages
Yli-Jyrä, A., 2020, Proceedings of the Language Technologies for All (LT4All). Adda, G., Choukri, K., Kasinskaite-Buddeberg, I., Mariani, J., Mazo, H. & Sakriani, S. (toim.). Paris: European Language Resources Association (ELRA), s. 346-349 4 SivumääräTutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussa › Konferenssiartikkeli › Tieteellinen › vertaisarvioitu
Open accessTiedosto -
Optimal Kornai-Karttunen Codes for Restricted Autosegmental Representations
Yli-Jyrä, A. M., 2019, Tokens of Meaning: Papers in Honor of Lauri Karttunen. Condoravdi, C. & Holloway King, T. (toim.). Stanford: Center for the Study of Language and Information (CSLI)Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussa › Kirjan luku tai artikkeli › Tieteellinen › vertaisarvioitu
Aktiviteetit
-
University of Umea, Department of Computer Science
Anssi Yli-Jyrä (Vieraileva tutkija), Frank Drewes (Muu rooli) & Henrik Björklund (Muu rooli)
4 marrask. 2019 → 6 marrask. 2019Aktiviteetti: Ulkoisessa instituutiossa vierailun tyypit › Akateeminen vierailu toiseen organisaatioon
-
The Rachel and Selim Benin School of Engineering and Computer Science, The Hebrew University of Jerusalem, Israel
Anssi Yli-Jyrä (Vieraileva tutkija)
10 jouluk. 2019 → 19 jouluk. 2019Aktiviteetti: Ulkoisessa instituutiossa vierailun tyypit › Akateeminen vierailu toiseen organisaatioon
-
Adam Jardine
Anssi Yli-Jyrä (Isäntä)
30 heinäk. 2018 → 5 elok. 2018Aktiviteetti: Vierailijan isännöinnin tyypit › Isännöity akateeminen vierailu Helsingin yliopistossa