If you made any changes in Pure these will be visible here soon.

Publications 1987 2019

2019

Aššur and His Friends: A Statistical Analysis of Neo-Assyrian Texts

Alstola, T., Zaia, S., Sahala, A., Jauhiainen, H., Svärd, S. & Linden, K., 2019, In : Journal of Cuneiform Studies. 71, p. 159-180 22 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Automatic Language Identification in Texts: A Survey

Jauhiainen, T., Lui, M., Zampieri, M., Baldwin, T. & Lindén, K., 13 May 2019, (Accepted/In press) In : Journal of Artificial Intelligence Research. 97 p.

Research output: Contribution to journalArticleScientificpeer-review

File

Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., 30 Apr 2019, Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2019) . Stroudsburg: Association for Computational Linguistics, p. 178-187 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Improving OCR of historical newspapers and journals published in Finland

Drobac, S., Kauppinen, P. & Linden, K., 2019, (Accepted/In press) Proceedings of the 3nd International Conference on Digital Access to Textual Cultural Heritage. ACM

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Language and Dialect Identification of Cuneiform Texts

Jauhiainen, T. S., Jauhiainen, H. A., Alstola, T. & Linden, B. K. J., 30 Apr 2019, Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2019) . Stroudsburg: Association for Computational Linguistics, p. 89-98 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Language Model Adaptation for Language and Dialect Identification of Text

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 17 Apr 2019, (Accepted/In press) In : Natural Language Engineering.

Research output: Contribution to journalArticleScientificpeer-review

2018

Challenges of transformation of research data into open data: The perspective of social sciences and humanities

Kelli, A., Mets, T., Vider, K., Värv, A., Jonsson, L., Lindén, K. & Birtonas, R., Sep 2018, In : International Journal of Technology Management & Sustainable Development. 17, 3, p. 227-251 25 p.

Research output: Contribution to journalArticleScientificpeer-review

FinnTransFrame: translating frames in the FinnFrameNet project

Lindén, K., Haltia, H., Laine, A., Luukkonen, J., Piitulainen, J. & Väisänen, N., 22 Nov 2018, In : Language Resources and Evaluation. 31 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

From Evaluating to Forecasting Performance: How to Turn Information Retrieval, Natural Language Processing and Recommender Systems into Predictive Sciences

Ferro, N., Fuhr, N., Grefenstette, G., Konstan, J. A., Castells, P., Daly, E. M., Declerck, T., Ekstrand, M. D., Geyer, W., Gonzalo, J., Kuflik, T., Lindén, K., Magnini, B., Nie, J-Y., Perego, R., Shapira, B., Soboroff, I., Tintarev, N., Verspoor, K., Willemsen, M. C. & 1 othersZobel, J., 2018, In : Dagstuhl manifestos. 7, 1, p. 96-139 44 p.

Research output: Contribution to journalArticleScientific

Open Access
File

HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., Aug 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) . Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: Association for Computational Linguistics, p. 137-144 8 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

HeLI-based Experiments in Swiss German Dialect Identification

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., Aug 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) . Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: Association for Computational Linguistics, p. 254-262 9 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Implementation of an Open Science Policy in the context of management of CLARIN language resources: a need for changes?

Kelli, A., Lindén, K., Vider, K., Labropoulou, P., Ketzan, E., Kamocki, P. & Stranák, P., 16 May 2018, Selected papers from the CLARIN Annual Conference 2017, Budapest, 18–20 September 2017. Piasecki, M. (ed.). Linköping: Linköping University Electronic Press, Vol. 147. p. 102-111 10 p. 009. (Linköping Electronic Conference Proceedings; no. 147).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Iterative Language Model Adaptation for Indo-Aryan Language Identification

Jauhiainen, T. S., Jauhiainen, H. A. & Linden, B. K. J., Aug 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) . Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: Association for Computational Linguistics, p. 66-75 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Keeleressursside loomise ja kasutamisega seonduvaid isikuandmete kaitse küsimusi

Kelli, A., Vider, K., Kull, I., Siil, T., Lindén, K., Tavast, A., Värv, A., Ginter, C. & Meister, E., 2018, In : Eesti rakenduslingvistika ühingu aastaraamat. 14, p. 77-94 18 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Rethinking Summarization and Storytelling for Modern Social Multimedia

Rudinac, S., Chua, T-S., Diaz-Ferreyra, N., Friedland, G., Gornostaja, T., Huet, B., Kaptein, R., Linden, K., Moens, M-F., Peltonen, J., Redi, M., Schedl, M., Shamma, D. A., Smeaton, A. & Xie, L., 2018, p. 141-153. 13 p.

Research output: Conference materialsOther conference materialResearch

Open Access

Rethinking Summarization and Storytelling for Modern Social Multimedia

Rudinac, S., Chua, T-S., Diaz-Ferreyra, N., Friedland, G., Gornostaja, T., Huet, B., Kaptein, R., Lindén, K., Moens, M-F., Peltonen, J., Redi, M., Schedl, M., Shamma, D. A., Smeaton, A. & Xie, L., 28 Jan 2018, MultiMedia Modeling: 24th International Conference, MMM 2018, Bangkok, Thailand, February 5-7, 2018, Proceedings, Part I. Schoeffmann, K., Chalidabhongse, T. H., Ngo, C. W., Aramvith, S., O'Connor, N. E., Ho, Y-S., Gabbouj, M. & Elgammal, A. (eds.). Cham: Springer International Publishing AG, Vol. 1. p. 632-644 13 p. (LNCS; no. 10704).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Semantic Domains in Akkadian Text

Svärd, S. S., Jauhiainen, H. A., Linden, B. K. J. & Sahala, A. J. A., 7 Aug 2018, CyberResearch on the Ancient Near East and Neighboring Regions: Case Studies on Archaeological Data, Objects, Texts, and Digital Archiving. Juloux, V. B., Gansell, A. R. & di Ludovico, A. (eds.). Leiden: Brill, p. 224-256 33 p. (Digital Biblical Studies; no. 2).

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

Open Access
File

The Dagstuhl Perspectives Workshop on Performance Modeling and Prediction

Ferro, N., Fuhr, N., Grefenstette, G., Konstan, J. A., Castells, P., Daly, E. M., Declerck, T., Ekstrand, M. D., Geyer, W., Gonzalo, J., Kuflik, T., Lindén, K., Magnini, B., Nie, J-Y., Perego, R., Shapira, B., Soboroff, I., Tintarev, N., Verspoor, K., Willemsen, M. C. & 1 othersZobel, J., Jun 2018, In : SIGIR Forum. 52, 1, p. 91-101 11 p.

Research output: Contribution to journalArticleScientific

Open Access
File

Vad får AI för bild av mänskligheten?

Linden, B. K. J., 26 Nov 2018, In : Yliopistolainen : Helsingin yliopiston henkilöstölehti. 4/2018, p. 9 1 p.

Research output: Contribution to journalEditorialProfessional

Open Access
2017

Evaluating HeLI with non-linear mappings

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 2017, Fourth Workshop on NLP for Similar Languages, Varieties and Dialects - Proceedings of the Workshop. Stroudsburg: Association for Computational Linguistics, p. 102-108 7 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Evaluation of language identification methods using 285 languages

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 2017, 21st Nordic Conference of Computational Linguistics: Proceedings of the Conference. Tiedemann, J. (ed.). Linköping: Linköping University Electronic Press, p. 183-191 9 p. (Linkping Electronic Conference Proceedings; no. 31).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

FinnFN 1.0: The Finnish frame semantic database

Lindén, K., Haltia, H., Luukkonen, J., Laine, A., Roivainen, H. & Väisänen, N., 14 Aug 2017, In : Nordic journal of linguistics. 40, 3, p. 1-25 25 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

OCR and post-correction of historical Finnish texts

Drobac, S., Kauppinen, P. S. & Linden, B. K. J., 2017, Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden. Tiedemann, J. (ed.). Linköping: Linköping University Electronic Press, p. 70-76 7 p. (Linköping Electronic Conference Proceedings; no. 131).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2016

Data-Driven Spelling Correction using Weighted Finite-State Methods

Silfverberg, M., Kauppinen, P. & Linden, K., 12 Aug 2016, The 54th Annual Meeting of the Association for Computational Linguistics: Proceedings of the SIGFSM Workshop on Statistical NLP and Weighted Automata. Stroudsburg, PA: ACL, p. 51-59 9 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Digitaalsete keeleressursside loomist ja kasutamist määrav õiguslik raamistik Eestis ja selle ühildumine CLARIN-i infrastruktuuriga

Kelli, A., Vider, K., Pisuke, H. & Linden, B. K. J., 2016, In : Eesti rakenduslingvistika ühingu aastaraamat. 12, p. 81-98 18 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Eduskunnan täysistuntovideoiden ja keskustelupöytäkirjojen automaattinen kohdistaminen AaltoASR-työkalulla

Mansikkaniemi, A., Kurimo, M., Lennes, M. E. & Linden, B. K. J., 23 May 2016.

Research output: Conference materialsAbstractResearchpeer-review

Open Access

FinnPos: an open-source morphological tagging and lemmatization toolkit for Finnish

Silfverberg, M., Ruokolainen, T., Linden, K. & Kurimo, M., Dec 2016, In : Language Resources and Evaluation. 50, 4, p. 863-878 16 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

HeLI, a Word-Based Backoff Method for Language Identification

Jauhiainen, T. S., Linden, B. K. J. & Jauhiainen, H. A., 2016, Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects: VarDial3, Osaka, Japan, December 12 2016. p. 153-162 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

In-Document Adaptation for a Human Guided Automatic Transcription Service

Mansikkaniemi, A., Kurimo, M. & Linden, B. K. J., 13 Aug 2016, Speech and Computer : 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings. Ronzhin, A., Potapova, R. & Németh, G. (eds.). Cham: Springer International Publishing AG, p. 395-402 8 p. (Lecture Notes in Computer Science; vol. 9811).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

The Regulatory and Contractual Framework as an Integral Part of the CLARIN Infrastructure

Kelli, A., Vider, K. & Lindén, K., 11 Apr 2016, Selected Papers from the CLARIN Annual Conference 2015, October 14–16, 2015, Wroclaw, Poland: NEALT Proceedings Series. DeSmedt, K. (ed.). Linköping: Linköping University Electronic Press, Vol. 123. p. 13-24 12 p. 002. (Linköping Electronic Conference Proceedings; vol. 123).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

The strategic impact of META-NET on the regional, national and international level

Rehm, G., Uszkoreit, H., Ananiadou, S., Bel, N., Bieleviciene, A., Borin, L., Branco, A., Budin, G., Calzolari, N., Daelemans, W., Garabik, R., Grobelnik, M., Garcia-Mateo, C., van Genabith, J., Hajic, J., Hernaez, I., Judge, J., Koeva, S., Krek, S., Krstev, C. & 24 othersLinden, K., Magnini, B., Mariani, J., McNaught, J., Melero, M., Monachini, M., Moreno, A., Odijk, J., Ogrodniczuk, M., Pezik, P., Piperidis, S., Przepiorkowski, A., Rognvaldsson, E., Rosner, M., Pedersen, B. S., Skadina, I., De Smedt, K., Tadic, M., Thompson, P., Tufis, D., Varadi, T., Vasiljevs, A., Vider, K. & Zabarskaite, J., Jun 2016, In : Language Resources and Evaluation. 50, 2, p. 351-374 24 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File
2015

Automated Lossless Hyper-Minimization for Morphological Analyzers

Drobac, S., Silfverberg, M. & Linden, K., 22 Apr 2015, Proceedings of the 12th International Conference on Finite-State Methods and Natural Language Processing 2015: Collected papers. Hanneforth, T. & Wurm, C. (eds.). ACL, 5 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionProfessional

Open Access
File

Discriminating similar languages with token-based backoff

Jauhiainen, T., Jauhiainen, H. & Linden, K., 2015, Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects. Association for Computational Linguistics, p. 44-51 8 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Extracting Semantic Frames using hfst-pmatch

Hardwick, S., Silfverberg, M. & Linden, K., May 2015, Proceedings of the 20th Nordic Conference of Computational Linguistics: NODALIDA 2015. Magyesi, B. (ed.). Lingköping: Linköping University Electronic Press, p. 305-308 4 p. 109:042. (Linköping Electronic Conference Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Language Set Identification in Noisy Synthetic Multilingual Documents

Jauhiainen, T. S., Linden, K. & Jauhiainen, H. A., 2015, Computational Linguistics and Intelligent Text Processing. Gelbukh, A. (ed.). Springer International Publishing AG, Vol. Part I. p. 633-643 11 p. (Lecture Notes in Computer Science; vol. 9041).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Regulatory and Contractual Framework as an Integral Part of CLARIN Infrastructure: the Estonian and Finnish Perspectives

Kelli, A., Vider, K. & Linden, K., Oct 2015, CLARIN Annual Conference 2015: Book of Abstracts . De Smedt, K. (ed.). Wrocław, Poland: CLARIN ERIC, p. 32-36 5 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionProfessional

Open Access
File

The Finno-Ugric Languages and the Internet project

Jauhiainen, H., Jauhiainen, T. & Linden, K., 15 Jan 2015, First International Workshop on Computational Linguistics for Uralic Languages: Proceedings of the Workshop. Pirinen, T., Tyers, F. & Trosterud, T. (eds.). Tromsø: Septentrio Academic Publishing, Vol. 2. p. 87–98 12 p. (Septentrio Conference Series; vol. 2015, no. 2).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Tutkimuksen haasteet digitaalisessa ajassa

Linden, K., 2015, In : Kansalliskirjasto. 57, p. 41-42 2 p.

Research output: Contribution to journalArticleScientific

Using HFST—Helsinki Finite-State Technology for Recognizing Semantic Frames

Lindén, K., Hardwick, S., Silfverberg, M. & Axelson, E., 9 Dec 2015, Systems and Frameworks for Computational Morphology. Springer International Publishing AG, p. 124-136 13 p. (Communications in Computer and Information Science; vol. 537).

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

Open Access
File
2014

Accelerated Estimation of Conditional Random Fields using a Pseudo-Likelihood-inspired Perceptron Variant

Ruokolainen, T., Silfverberg, M., Kurimo, M. & Linden, K., 26 Apr 2014, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2014. ACL, 5 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Analyzing and Improving the Quality of a Historical News Collection using Language Technology and Statistical Machine Learning Methods

Kettunen, K., Honkela, T., Linden, K., Kauppinen, P., Pääkkönen, T. & Kervinen, J., 16 Aug 2014, IFLA World Library and Information Congress Proceedings: 80th IFLA General Conference and Assembly. Lyon, France: IFLA, 23 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

CLARA: A New Generation of Researchers in Common Language Resources and Their Applications

Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadina, Bolette Pedersen, Costanza Navarretta, Núria Bel, Krister Linden, Marketa Lopatkova, Jan Hajic, Gisle andersen and Przemyslaw Lenkiewicz, 26 May 2014, Proceedings of LREC 2014. Reykjavik, Iceland: European Language Resources Association (ELRA), 9 p. #410

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Heuristic Hyper-minimization of Finite State Lexicons

Drobac, S., Linden, K., Pirinen, T. & Silfverberg, M., 26 May 2014, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). Reykjavik, Iceland: European Language Resources Association (ELRA), Vol. 9. 6 p. #784

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

HFST-SweNER – A New NER Resource for Swedish

Kokkinakis, D., Niemi, J., Hardwick, S., Linden, K. & Borin, L., 26 May 2014, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). Calzolari, N., Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). Reykjavik, Iceland: European Language Resources Association (ELRA), 7 p. #391

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Part-of-Speech Tagging using Conditional Random Fields: Exploiting Sub-Label Dependencies for Improved Accuracy

Silfverberg, M., Ruokolainen, T., Linden, K. & Kurimo, M., 22 Jun 2014, Unknown host publication.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

State-of-the-Art in Weighted Finite-State Spell-Checking

Pirinen, T. & Linden, K., 6 Apr 2014, Computational Linguistics and Intelligent Text Processing: 15th International Conference, CICLing 2014, Kathmandu, Nepal, April 6-12, 2014, Proceedings, Part II. Gelbukh, A. (ed.). Berlin Heidelberg: Springer-Verlag, Vol. 2. p. 519-532 14 p. (Lecture Notes in Computer Science; vol. 8404).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

The Strategic Impact of META-NET on the Regional, National and International Level

Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audrone Bieleviciene, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen Garcia-Mateo, Josef Van Genabith, Jan Hajic, Inma Hernaez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John Mcnaught, Maite Melero, Monica Monachini, Asuncion Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pezik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Michael Rosner, Bolette Sandford Pedersen, Inguna Skadina, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiș, Tamás Váradi, Andrejs Vasiljevs, Kadri Vider, Jolanta Zabarskaite, 26 May 2014, Proceedings of LREC 2014. Reykjavik, Iceland: European Language Resources Association (ELRA), p. 1517-1524 8 p. #405

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2013

Baltic and Nordic Parts of the European Linguistic Infrastructure

Skadina, I., Vasiljevs, A., Borin, L., Linden, K., Losnegaard, G., Olsen, S., Pedersen, B., Rozis, R. & De Smedt, K., 20 May 2013, Proceedings of NODALIDA 2013. Linköping University Electronic Press, 16 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

File