20022019

Research output per year

If you made any changes in Pure these will be visible here soon.

Publications

2019

A Report on the Third VarDial Evaluation Campaign

Zampieri, M., Malmasi, S., Scherrer, Y., Samardžic, T., Tyers, F., Silfverberg, M. P., Klyueva, N., Pan, T-L., Huang, C-R., Ionescu, R. T., Butnaru, A. & Jauhiainen, T. S., 2019, Proceedings of the . Zampieri, M., Nakov, P., Malmasi, S., Ljubešić, N., Tiedemann, J. & Ali, A. (eds.). Stroudsburg: The Association for Computational Linguistics, p. 1-16 16 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientific

Open Access
File

Automatic Language Identification in Texts: A Survey

Jauhiainen, T., Lui, M., Zampieri, M., Baldwin, T. & Lindén, K., 25 Aug 2019, In : Journal of Artificial Intelligence Research. 65, p. 675-782 108 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., 30 Apr 2019, Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2019) . Stroudsburg: The Association for Computational Linguistics, p. 178-187 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Language and Dialect Identification of Cuneiform Texts

Jauhiainen, T. S., Jauhiainen, H. A., Alstola, T. & Linden, B. K. J., 30 Apr 2019, Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2019) . Stroudsburg: The Association for Computational Linguistics, p. 89-98 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Language identification in texts

Jauhiainen, T., 18 May 2019, Helsinki: University of Helsinki.

Research output: ThesisDoctoral ThesisCollection of Articles

Open Access

Language Model Adaptation for Language and Dialect Identification of Text

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., Sep 2019, In : Natural Language Engineering. 25, 5, p. 561-583 23 p., 135132491900038.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Suomenkielisen tekoälyn kehittämisohjelma – esiselvitys

Jauhiainen, T. (ed.), Lennes, M. (ed.) & Marttila, T. (ed.), 2019, 43 p.

Research output: Book/ReportCommissioned reportProfessional

Open Access
File

Wanca in Korp: Text corpora for underresourced Uralic languages

Jauhiainen, H., Jauhiainen, T. & Linden, K., 2019, Proceedings of the Research data and humanities (RDHUM) 2019 conference : data, methods and tools. Jantunen, J. H., Brunni, S., Kunnas, N., Palviainen, S. & Västi, K. (eds.). Oulu: University of Oulu, p. 21-40 20 p. (Studia Humaniora Ouluensia; no. 17).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2018

HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., Aug 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) . Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: The Association for Computational Linguistics, p. 137-144 8 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

HeLI-based Experiments in Swiss German Dialect Identification

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., Aug 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) . Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: The Association for Computational Linguistics, p. 254-262 9 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Iterative Language Model Adaptation for Indo-Aryan Language Identification

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., Aug 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) . Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: The Association for Computational Linguistics, p. 66-75 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2017

Evaluating HeLI with non-linear mappings

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 2017, Fourth Workshop on NLP for Similar Languages, Varieties and Dialects - Proceedings of the Workshop. Stroudsburg: The Association for Computational Linguistics, p. 102-108 7 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Evaluation of language identification methods using 285 languages

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 2017, 21st Nordic Conference of Computational Linguistics: Proceedings of the Conference. Tiedemann, J. (ed.). Linköping: Linköping University Electronic Press, p. 183-191 9 p. (Linkping Electronic Conference Proceedings; no. 31).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2016

HeLI, a Word-Based Backoff Method for Language Identification

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 2016, Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects: VarDial3, Osaka, Japan, December 12 2016. p. 153-162 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2015

Discriminating similar languages with token-based backoff

Jauhiainen, T., Jauhiainen, H. & Linden, K., 2015, Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects. The Association for Computational Linguistics, p. 44-51 8 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Language Set Identification in Noisy Synthetic Multilingual Documents

Jauhiainen, T. S., Linden, K. & Jauhiainen, H. A., 2015, Computational Linguistics and Intelligent Text Processing. Gelbukh, A. (ed.). Springer International Publishing AG, Vol. Part I. p. 633-643 11 p. (Lecture Notes in Computer Science; vol. 9041).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

The Finno-Ugric Languages and the Internet project

Jauhiainen, H., Jauhiainen, T. & Linden, K., 15 Jan 2015, First International Workshop on Computational Linguistics for Uralic Languages: Proceedings of the Workshop. Pirinen, T., Tyers, F. & Trosterud, T. (eds.). Tromsø: Septentrio Academic Publishing, Vol. 2. p. 87–98 12 p. (Septentrio Conference Series; vol. 2015, no. 2).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2010

Tekstin kielen automaattinen tunnistaminen

Jauhiainen, T., 2 Nov 2010, 118 p.

Research output: ThesisMaster's thesisTheses

2002

Adaptive Dialogue Systems - Interaction with Interact

Jokinen, K., Kerminen, A., Kaipainen, M., Jauhiainen, T., Wilcock, G., Turunen, M., Hakulinen, J., Kuusisto, J. & Lagus, K., 2002, Proceedings of the 3rd SIGdial Workshop on Discourse and Dialogue. p. 64-73

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review