Projects per year
Organisation profile
Organisation Profile
Language technology is a multidisciplinary field. It often comes with the label computational linguistics, natural language processing (NLP) or natural language engineering (NLE). In language technology we study methods and develop models and tools for processing human language. This includes models for natural language understanding and human language generation also across languages. In Helsinki we focus on
- Cross-lingual NLP including machine translation
- NLP for languages with a rich morphology
- NLP for low-resource languages and in the humanities
Activities and news from our research group are available at our website.
Fields of Science
- 113 Computer and information sciences
- language technology
- natural language processing
- natural language engineering
- 6121 Languages
- computational linguistics
- language technology
International and National Collaboration
Profiles
-
Mikko Aulamo
- Department of Digital Humanities - Doctoral Researcher
- Language Technology
- Doctoral Programme in Language Studies
Person: U1 Research and teaching staff, Doctoral Researcher
-
Mathias Creutz
- Department of Digital Humanities - Senior University Lecturer, Title of Docent
- Language Technology
Person: U3 Research and teaching staff
-
Ona De Gibert Bonet, PhD Student
- Department of Digital Humanities - Doctoral Researcher
- Language Technology
- Doctoral Programme in Language Studies
Person: U1 Research and teaching staff, Doctoral Researcher
Equipment
-
HTB Helsinki Term Bank for the Arts and Sciences
Onikki-Rantajääskö, T. (Manager), Kanner, A. O. (Operator), Laxström, N. M. (Operator), Enqvist, E. J. (Other) & Kettunen, H. (Other)
Department of Finnish, Finno-Ugrian and Scandinavian StudiesFacility/equipment: Database
-
nVidia GTX Titan X GPU Workstation
Yli-Jyrä, A. (Manager)
Language TechnologyFacility/equipment: Equipment
-
nVidia RTX 2080Ti GPU for a Workstation
Yli-Jyrä, A. (Manager)
Language TechnologyFacility/equipment: Equipment
-
Easy Language for accessible workplace
Onikki-Rantajääskö, T. (Project manager), Katinskaia, A. (Participant), Vanhatalo, U. (Participant), Vu Anh, D. (Participant) & Yangarber, R. (Participant)
Innovaatiorahoituskeskus Business Finland
01/10/2024 → 31/05/2025
Project: Business Finland
-
Automatic Classification and Analysis of Texts from Egyptian Antiquity
Jauhiainen, T. (Project manager), Henriksson, E. (Participant), Jauhiainen, H. (Participant) & Vierros, M. (Participant)
01/01/2024 → 30/11/2029
Project: Foundations (Private Foundations, Non-Profit Foundations, Charitable Trusts)
-
GreenNLP: Green NLP - controlling the carbon footprint in sustainable language technology
Tiedemann, J. (Project manager), Attieh, J. (Participant), Nieminen, T. J. (Participant) & Štefánik, M. (Participant)
Suomen Akatemia Projektilaskutus
01/01/2023 → 31/12/2025
Project: Research Council of Finland: Targeted Academy Project
-
High Performance Language Technologies
Tiedemann, J. (Project manager), Aulamo, M. (Participant), De Gibert Bonet, O. (Participant), Grönroos, S.-A. (Participant), Ji, S. (Participant), Mickus, T. (Participant), Vahtola, T. (Participant), Vazquez , R. (Participant) & Virpioja, S. P. (Participant)
Charles University in Prague Faculty of Science Department of Teaching and Didactics of Biology
01/09/2022 → 31/12/2025
Project: EU Horizon Europe: Innovation actions (HORIZON-IA)
-
Uncertainty-aware neural language models
Tiedemann, J. (Project manager), Celikkanat, H. (Participant), Virpioja, S. P. (Participant) & Vazquez , R. (Participant)
Academy of Finland, Suomen Akatemia Projektilaskutus
01/01/2022 → 01/10/2025
Project: Research project
-
Analyzing the Effect of Linguistic Instructions on Paraphrase Generation
Vahtola, T., Hu, S., Creutz, M., Vulić, I., Korhonen, A. & Tiedemann, J., 3 Mar 2025, Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025). p. 755-766 12 p.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
-
Automatic detection of place and time for Greek texts in Egypt
Jauhiainen, T., Henriksson, E., Vierros, M. & Jauhiainen, H., 2025, (Accepted/In press) Proceedings of the Thirteenth International Congress of Egyptologists (ICE XIII). (Egyptologische Uitgaven).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
-
How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM
Ji, S. & Chen, P., 2025, Proceedings of the 31st International Conference on Computational Linguistics. Rambow, O., Wanner, L., Apidianaki, M., Al-Khalifa, H., Di Eugenio, B. & Schockaert, S. (eds.). Stroudsburg: Association for Computational Linguistics (ACL), p. 2575-2581 7 p. (International Conference on Computational Linguistics).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Open AccessFile -
Neo-Assyrian Imperial Religion Counts: A Quantitative Approach to the Affiliations of Kings and Queens with Their Gods and Goddesses
Gansell, A., Alstola, T., Jauhiainen, H. & Svärd, S., 30 Jan 2025, In: Journal of Ancient Near Eastern Religions. 24, 2, p. 236-274 39 p.Research output: Contribution to journal › Article › Scientific › peer-review
Open AccessFile -
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives
Li, Z., Ji, S., Mickus, T., Segonne, V. & Tiedemann, J., 1 Nov 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Al-Onaizan, Y., Bansal, M. & Chen, Y.-N. (eds.). Kerrville: The Association for Computational Linguistics, p. 15882-15894 13 p.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Open AccessFile
Activities
-
Data Sources for Automatic Classification and Analysis of Texts from Egyptian Antiquity
Jauhiainen, T. (Speaker)
11 Dec 2024Activity: Talk or presentation types › Oral presentation
-
Machine-Readable Texts in Egyptology: Current State and Challenges
Jauhiainen, H. (Speaker)
9 Jul 2024Activity: Talk or presentation types › Oral presentation
-
Words /iri/ and /uru/ and the Origins of Emesal
Sahala, A. (Speaker)
8 Jul 2024Activity: Talk or presentation types › Oral presentation
-
Low Saxon corpus-based dialectometry
Siewert, J. (Speaker)
17 Jul 2024Activity: Talk or presentation types › Oral presentation
File -
69th Rencontre Assyriologique Internationale
Svärd, S. (Poster Presentation), Alstola, T. (Poster Presentation), Sahala, A. (Poster Presentation) & Valk, J. (Poster Presentation)
8 Jul 2024 → 12 Jul 2024Activity: Participating in or organising an event types › Organisation and participation in conferences, workshops, courses, seminars
File
Prizes
-
August Ahlqvistin, Yrjö Wichmannin, Kai Donnerin ja Artturi Kanniston rahastojen väitöskirjapalkinto
Kuparinen, O. V. (Recipient), 14 Mar 2022
Prize: Prizes and awards
-
Best paper award at DHN 2020
Mäkelä, E. (Recipient), Lagus, K. (Recipient), Lahti, L. (Recipient), Säily, T. (Recipient), Tolonen, M. (Recipient), Hämäläinen, M. (Recipient), Kaislaniemi, S. (Recipient) & Nevalainen, T. (Recipient), 23 Oct 2020
Prize: Prizes and awards
-
-
-
Datasets
-
Murreviikko: an Annotated and Normalized Corpus of Dialectal Finnish Tweets
Kuparinen, O. V. (Creator), Zenodo, 2023
Dataset
-
OcWikiAnnot: Annotated Wikipedia Corpus of Occitan
Miletic Haddad, A. (Creator), Zenodo, 20 Apr 2023
DOI: 10.5281/zenodo.7777340, https://doi.org/10.5281/zenodo.7777340
Dataset
-
OcWikiDisc: a Corpus of Wikipedia Talk Pages in Occitan
Miletic Haddad, A. (Creator) & Scherrer, Y. (Creator), Zenodo, 14 Sept 2022
DOI: 10.5281/zenodo.7079580, https://doi.org/10.5281/zenodo.7079580
Dataset
-
Machine-readable Finnish-Karelian bilingual translation dictionary
Rantakaulio, T. (Creator), Alnajjar, K. (Creator), Hämäläinen, M. (Creator), Pirinen, F. (Creator) & Rueter, J. (Creator), Zenodo, 3 Jan 2022
Dataset
-
Machine-readable Finnish-Livvi bilingual translation dictionary
Rantakaulio, T. (Creator), Alnajjar, K. (Creator), Hämäläinen, M. (Creator), Rueter, J. (Creator) & Pirinen, F. (Creator), Zenodo, 3 Jan 2022
Dataset
Press/Media
-
-
Språk(teknologi) är nyckeln till intelligens och rättvisa
20/01/2022
1 Media contribution
Press/Media: Press / Media
-
芬兰研究人员正在教人工智能讲流利的芬兰语方言
Hämäläinen, M., Alnajjar, K., Rueter, J. & Partanen, N.
10/01/2022
1 item of Media coverage
Press/Media: Press / Media
-
Inteligência artificial identifica 23 dialetos em finlandês
Hämäläinen, M., Alnajjar, K., Rueter, J. & Partanen, N.
17/12/2021
1 item of Media coverage
Press/Media: Press / Media
-
Researchers teach artificial intelligence to be fluent in Finnish dialects
Hämäläinen, M., Alnajjar, K., Partanen, N. & Rueter, J.
16/12/2021
1 Media contribution
Press/Media: Press / Media