Centre for Preservation and Digisation

  • Finland

Publications 2006 2018

Filter
Conference contribution
2017

Improving Optical Character Recognition of Finnish Historical Newspapers with a Combination of Fraktur & Antiqua Models and Image Preprocessing

Koistinen, J. M. O., Kettunen, K. T. & Pääkkönen, T. A., May 2017, Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden. Tiedeman, J. (ed.). Linköping University Electronic Press, p. 277 283 p. (Linköping Electronic Conference Proceedings; vol. 131)(NEALT Proceedings Series; vol. 29).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Names, Right or Wrong: Named Entities in an OCRed Historical Finnish Newspaper

Kettunen, K. T. & Ruokolainen, T. P., 1 Jun 2017, Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage. New York: ACM, p. 181-186 6 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

New Multi-language digitised Newspapers and Journals from Finland Available as Data Exports for Nordic Researchers

Pääkkönen, T. A. & Kervinen, J., 14 Mar 2017, DHN 2017 - Digital humaniora i Norden: Digital humanities in the Nordic countries. Brodén, D. (ed.). Göteborg: University of Göteborg, p. 94-96 2 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionProfessional

Open Access

Tagging Named Entities in 19th Century and Modern Finnish Newspaper Material with a Finnish Semantic Tagger

Kettunen, K. T. & Löfberg, L., May 2017, Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden. Tiedemann, J. (ed.). Linköping: Linköping University Electronic Press, p. 29-36 8 p. (Linköping Electronic Conference Proceedings; vol. 131)(NEALT Proceedings Series; vol. 29).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Tagging Named Entities in 19th century Finnish Newspaper Material with a Variety of Tools

Kettunen, K. T. & Ruokolainen, T. P., 14 Mar 2017, DHN 2017 - Digital humaniora i Norden: Digital humanities in the Nordic countries. Broden, D. (ed.). Göteborg: University of Göteborg, p. 68-72 4 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
2016

Between Diachrony and Synchrony: Evaluation of Lexical Quality of a Digitized Historical Finnish Newspaper and Journal Collection with Morphological Analyzers

Kettunen, K. T., Pääkkönen, T. A. & Koistinen, J. M. O., 2016, Human Language Technologies – The Baltic Perspective: Proceedings of the 7th International Conference: Human Language Technologies – The Baltic Perspective (Baltic HLT 2016). Amsterdam: IOS PRESS, p. 122-129 8 p. (Frontiers in Artificial Intelligence and Applications; no. 289).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Contracts Enabling Collaboration of The National Library of Finland with Media Houses in Electronic Deposit

Karppinen, P., Kaukonen, M., Pääkkönen, T. & Sorjonen, M., 15 Aug 2016, Unknown host publication. 8 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionProfessional

Open Access

Measuring Lexical Quality of a Historical Finnish Newspaper Collection – Analysis of Garbled OCR Data with Basic Language Technology Tools and Means

Kettunen, K. T. & Pääkkönen, T. A., May 2016, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association (ELRA), p. 956-961 6 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Modern Tools for Old Content - in Search of Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910

Kettunen, K. T., Mäkelä, E., Kuokkala, J. M., Ruokolainen, T. P. & Niemi, J. A., Sep 2016, LWDA 2016 Lernen, Wissen, Daten, Analysen 2016 Proceedings of the Conference "Lernen, Wissen, Daten, Analysen". Aachen: CEUR Workshop Proceedings, (CEUR Workshop Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2015

Targeted Query Expansions as a Method for Searching: Mixed Quality Digitized Cultural Heritage Documents

Keskustalo, H., Kettunen, K. T., Kumpulainen, S., Ferro, N., Silvello, G., Järvelin, A., Kekäläinen, J., Arvola, P., Saastamoinen, M., Sormunen, E. & Järvelin, K., 2015, iConference 2015 Proceedings. iSchools, 7 p. (iConference).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
2014

Analyzing and Improving the Quality of a Historical News Collection using Language Technology and Statistical Machine Learning Methods

Kettunen, K., Honkela, T., Linden, K., Kauppinen, P., Pääkkönen, T. & Kervinen, J., 16 Aug 2014, IFLA World Library and Information Congress Proceedings: 80th IFLA General Conference and Assembly. Lyon, France: IFLA, 23 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2006

Analysis of EU Languages Through Text Compression

Kettunen, K., Sadeniemi, M., Lindh-Knuutila, T. & Honkela, T., 2006, Unknown host publication. Salakoski, T., Ginter, F., Pyysalo, S. & Pahikkala, T. (eds.). p. 99-109 11 p. (Lecture Notes in Computer Science).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review