Project Details
Description (abstract)
The key interest underlying our proposal is to build African language resources. Their availability will enhance the use of African languages in language technology. This work will take place in close partnership with the European infrastructure currently under development (CLARIN), which will thus benefit from a better representation of African languages and of their specific needs, which arise from the typological properties of these languages (in this case from the Bantu group). Capacity-building in the relevant sector, both in Africa and in Helsinki, is an essential feature of our project.
In terms of methodology, two major perspectives are represented in the project. Linguistic questions concern mainly the building of resources and tools for African languages, in this case specifically Bantu languages. Our focus will be on languages of wider distribution with few resources, such as Oshikwanyama, Otjiherero, Setswana, and possibly others (Kinyarwanda, Bemba, Chichewa, etc.).
The identification of target languages is based on criteria such as the availability of material and the urgency with which computational linguistic work should be carried out. The former is mainly a scientific constraint, depending on the availability of language descriptions, formal grammars and lexical databases. The latter is largely determined by political factors: the interest of members of the language communities, government policies, and the availability, competence and willingness to cooperate of local counterparts.
The main linguistic task will consist in assessing relevant reference material, electronic corpora, and other language resources. At the same time, some effort will be dedicated to networking activities with scholars working on the relevant African languages. From the computational side, an innovative approach (“pointwise weighted finite-state”) is pursued in the project. In practical terms, computational linguists will scrutinise the hypothesis that available reference material (much of which is generative, rule-based) can be successfully exploited for improving finite-state (FS) methods and a constraint-based understanding of the relevant language properties.
The scientific results concern mostly the methodological questions outlined in the previous paragraph. At the same time, this initiative will be of immediate relevance to language practitioners in African countries and thus open new paths to improved educational chances, one of the crucial factors for sustainable development of human resources in Africa.
Acronym | LT4AFRICA |
---|---|
Status | Finished |
Effective start/end date | 01/01/2010 → 31/12/2010 |
Funding
- Suomen Akatemia: €47,300.00
Fields of Science
- 6121 Languages
- Bantu languages
- morphology
- grammatical tone
- language documentation
- 113 Computer and information sciences
- finite-state methods
- 519 Social and economic geography
- African countries
- language development
- Millennium Development Goals
-
HFST - Helsinki Finite-State Technology
Linden, K., Koskenniemi, K., Yli-Jyrä, A., Hulden, M., Silfverberg, M., Pirinen, T., Axelson, E., Hardwick, S., Niemi, J. & Hurskainen, A.
01/01/2005 → …
Project: Research project
-
Graph-Based Representation of Cross-Lingual Alignments
Abend, O., Ronning, M. & Miles, G.
01/01/2018 → 17/01/2020
Project: Research project
-
FIELDSYNERGY-TRIAL: Rationalizing Parallel Linguistic Description and Computational Modeling
01/01/2011 → 01/01/2011
Project: Research project
-
On Practical Realisation of Autosegmental Representations in Lexical Transducers of Tonal Bantu Languages
Yli-Jyrä, A., 2020, Proceedings of the Language Technologies for All (LT4All). Adda, G., Choukri, K., Kasinskaite-Buddeberg, I., Mariani, J., Mazo, H. & Sakriani, S. (eds.). Paris: European Language Resources Association (ELRA), p. 346-349 4 p.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Open Access
-
Optimal Kornai-Karttunen Codes for Restricted Autosegmental Representations
Yli-Jyrä, A. M., 2019, Tokens of Meaning: Papers in Honor of Lauri Karttunen. Condoravdi, C. & Holloway King, T. (eds.). Stanford: Center for the Study of Language and Information (CSLI)
Research output: Chapter in Book/Report/Conference proceeding › Chapter › Scientific › peer-review
-
University of Umea, Department of Computer Science
Anssi Yli-Jyrä (Visiting researcher), Frank Drewes (Other role) & Henrik Björklund (Other role)
4 Nov 2019 → 6 Nov 2019
Activity: Visiting an external institution types › Academic visit to other institution
-
The Rachel and Selim Benin School of Engineering and Computer Science, The Hebrew University of Jerusalem, Israel
Anssi Yli-Jyrä (Visiting researcher)
10 Dec 2019 → 19 Dec 2019
Activity: Visiting an external institution types › Academic visit to other institution
-
Adam Jardine
Anssi Yli-Jyrä (Host)
30 Jul 2018 → 5 Aug 2018
Activity: Hosting a visitor types › Academic visit at UH