
Publications 2009–2019

2019

Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks

Tiedemann, J. & Scherrer, Y., 1 Jun 2019, Proceedings of the 3rd Workshop on Evaluating Vector Space Representations for NLP. Rogers, A., Drozd, A., Rumshisky, A. & Goldberg, Y. (eds.). Stroudsburg: Association for Computational Linguistics, p. 35-42 8 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Multilingual NMT with a language-independent attention bridge

Vazquez Carrillo, J. R., Raganato, A., Tiedemann, J. & Creutz, M., 2019, The 4th Workshop on Representation Learning for NLP (RepL4NLP-2019): Proceedings of the Workshop. Augenstein, I., Gella, S., Ruder, S., Kann, K., Can, B., Welbl, J., Conneau, A., Ren, X. & Rei, M. (eds.). Stroudsburg: Association for Computational Linguistics, p. 33-39 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations

Talman, A., Suni, A., Celikkanat, H., Kakouros, S., Tiedemann, J. & Vainio, M., 9 Aug 2019, (Accepted/In press) In: Nordic Conference of Computational Linguistics. 9 p.

Research output: Contribution to journal; Conference article; Scientific; peer-review

Open Access

Revisiting NMT for normalization of early English letters

Hämäläinen, M., Säily, T., Rueter, J., Tiedemann, J. & Mäkelä, E., 2019, Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. Alex, B., Degaetano-Ortlieb, S., Kazantseva, A., Reiter, N. & Szpakowicz, S. (eds.). Stroudsburg: Association for Computational Linguistics, p. 71–75 5 p. (ACL Anthology; no. W19-25).

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Sentence Embeddings in NLI with Iterative Refinement Encoders

Talman, A. J., Yli-Jyrä, A. & Tiedemann, J., 31 Jul 2019, In: Natural Language Engineering. 25, 4, p. 467-482 16 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Open Access

The University of Helsinki submissions to the WMT19 news translation task

Talman, A., Sulubacak, U., Vazquez, R., Scherrer, Y., Virpioja, S., Raganato, A., Hurskainen, A. & Tiedemann, J., 1 Aug 2019, Fourth Conference on Machine Translation: Proceedings of the Conference. Stroudsburg: Association for Computational Linguistics, p. 611-622 12 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

What do Language Representations Really Represent?

Bjerva, J., Östling, R., Han Veiga, M., Tiedemann, J. & Augenstein, I., 2019, In: Computational Linguistics. 8 p.

Research output: Contribution to journal; Article; Scientific; peer-review

2018

An Analysis of Encoder Representations in Transformer-Based Machine Translation

Raganato, A. & Tiedemann, J., 2018, Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. Linzen, T., Chrupała, G. & Alishahi, A. (eds.). Stroudsburg: Association for Computational Linguistics, p. 287-297 11 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Creating a Dataset for Multilingual Fine-grained Emotion-detection Using Gamification-based Annotation

Öhman, E. S., Tiedemann, J., Honkela, T. U. & Kajava, K., 31 Oct 2018, Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Stroudsburg: Association for Computational Linguistics, p. 24-30 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Detecting hospital-acquired infections: A document classification approach using support vector machines and gradient tree boosting

Ehrentraut, C., Ekholm, M., Tanushi, H., Tiedemann, J. & Dalianis, H., Mar 2018, In: Health Informatics Journal. 24, 1, p. 24-42 19 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Open Access

Emerging Language Spaces Learned From Massively Multilingual Corpora

Tiedemann, J., 2018, Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018). Mäkelä, E., Tolonen, M. & Tuominen, J. (eds.). Helsinki: CEUR Workshop Proceedings, Vol. 2084. p. 188-197 (CEUR Workshop Proceedings).

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign

Zampieri, M., Malmasi, S., Nakov, P., Ali, A., Shon, S., Glass, J., Scherrer, Y., Samardžić, T., Ljubešić, N., Tiedemann, J., van der Lee, C., Grondelaers, S., Oostdijk, N., Speelman, D., van den Bosch, A., Kumar, R., Lahiri, B. & Jain, M., 2018, Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects. Zampieri, M., Nakov, P., Ljubešić, N., Tiedemann, J., Malmasi, S. & Ali, A. (eds.). Santa Fe: Association for Computational Linguistics, p. 1-17 17 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific

Open Access

Normalizing early English letters to Present-day English spelling

Hämäläinen, M., Säily, T., Rueter, J., Tiedemann, J. & Mäkelä, E., 2018, Proceedings of the 2nd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. Alex, B., Degaetano-Ortlieb, S., Feldman, A., Kazantseva, A., Reiter, N. & Szpakowicz, S. (eds.). Stroudsburg, PA: Association for Computational Linguistics, p. 87-96 10 p. (ACL Anthology; no. W18-45).

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora

Lison, P., Tiedemann, J. & Kouylekov, M., 2018, Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018). Calzolari, N., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. & Tokunaga, T. (eds.). Paris: European Language Resources Association (ELRA), p. 1742-1748 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language

Bozovic, P., Erjavec, T., Tiedemann, J., Ljubesic, N. & Gorjanc, V., 2018, Proceedings of the conference on Language Technologies & Digital Humanities 2018. Fišer, D. & Pančur, A. (eds.). Ljubljana: Ljubljana University Press, p. 24-28 5 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018)

Zampieri, M. (ed.), Nakov, P. (ed.), Ljubesic, N. (ed.), Tiedemann, J. (ed.), Malmasi, S. (ed.) & Ali, A. (ed.), 2018, Stroudsburg: Association for Computational Linguistics.

Research output: Book/Report; Anthology or special issue; Scientific; peer-review

Open Access

The MeMAD Submission to the IWSLT 2018 Speech Translation Task

Sulubacak, U., Tiedemann, J., Rouhe, A., Grönroos, S.-A. & Kurimo, M., 30 Oct 2018, Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT 2018). Turchi, M., Niehues, J. & Federico, M. (eds.). Bruges, p. 89-94 6 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Professional

Open Access

The MeMAD Submission to the WMT18 Multimodal Translation Task

Grönroos, S.-A., Huet, B., Kurimo, M., Laaksonen, J., Merialdo, B., Pham, P., Sjöberg, M., Sulubacak, U., Tiedemann, J., Troncy, R. & Vázquez Carrillo, J. R., 1 Nov 2018, Proceedings of the Third Conference on Machine Translation (WMT): Shared Task Papers. Bojar, O., Chatterjee, R., Federmann, C., Fishel, M., Graham, Y., Haddow, B., Huck, M., Yepes, A. J., Koehn, P., Monz, C., Negri, M., Névéol, A., Neves, M., Post, M., Specia, L., Turchi, M. & Verspoor, K. (eds.). Stroudsburg: Association for Computational Linguistics, p. 603-611 9 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

The University of Helsinki submissions to the WMT18 news task

Raganato, A., Scherrer, Y., Nieminen, T., Hurskainen, A. & Tiedemann, J., 2018, Proceedings of the Third Conference on Machine Translation: Shared Task Papers. Bojar, O., Chatterjee, R., Federmann, C., Fishel, M., Graham, Y., Haddow, B., Huck, M., Yepes, A. J., Koehn, P., Monz, C., Negri, M., Névéol, A., Neves, M., Post, M., Specia, L., Turchi, M. & Verspoor, K. (eds.). Stroudsburg: Association for Computational Linguistics, p. 488-495 8 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

The University of Helsinki Submission to the WMT19 Parallel Corpus Filtering Task

Vazquez, R., Sulubacak, U. & Tiedemann, J., 29 Jul 2018, Proceedings of the Fourth Conference on Machine Translation: Volume 2: Shared Task Papers. Stroudsburg: Association for Computational Linguistics, p. 992-998 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

2017

Bootstrapping a Dependency Parser for Maltese - A Real-World Test Case

Tiedemann, J. & van der Plas, L., 2017, From Semantics to Dialectometry: Festschrift in honor of John Nerbonne. Wieling, M., Kroon, M., Van Noord, G. & Bouma, G. (eds.). Milton Keynes: College Publications, p. 355-365 11 p. (Tributes; no. 32).

Research output: Chapter in Book/Report/Conference proceeding; Chapter; Scientific

Open Access

Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF

Shao, Y., Hardmeier, C., Tiedemann, J. & Nivre, J., 1 Nov 2017, The Eighth International Joint Conference on Natural Language Processing: Proceedings of the Conference, Vol. 1 (Long Papers). Taipei: Asian Federation of Natural Language Processing, p. 173-183 11 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Continuous multilinguality with language vectors

Östling, R. & Tiedemann, J., 1 Apr 2017, 15th Conference of the European Chapter of the Association for Computational Linguistics: Proceedings of Conference, volume 2: Short Papers. Stroudsburg: Association for Computational Linguistics, p. 644-649 6 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Cross-Lingual Dependency Parsing for Closely Related Languages: Helsinki’s Submission to VarDial 2017

Tiedemann, J., 1 Apr 2017, Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial’2017): Proceedings of the Workshop. Stroudsburg: Association for Computational Linguistics, p. 131-136 6 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Findings of the 2017 DiscoMT Shared Task on Cross-lingual Pronoun Prediction

Loáiciga, S., Stymne, S., Nakov, P., Hardmeier, C., Tiedemann, J., Cettolo, M. & Versley, Y., 2017, Discourse in Machine Translation (DiscoMT 2017): Proceedings of the Workshop. Stroudsburg: Association for Computational Linguistics, p. 1-16 16 p.

Research output: Chapter in Book/Report/Conference proceeding; Chapter; Scientific

Open Access

Findings of the VarDial Evaluation Campaign 2017

Zampieri, M., Malmasi, S., Ljubešić, N., Nakov, P., Ali, A., Tiedemann, J., Scherrer, Y. & Aepli, N., 1 Apr 2017, Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects. Stroudsburg: Association for Computational Linguistics, p. 1-15 15 p.

Research output: Chapter in Book/Report/Conference proceeding; Chapter; Scientific

Open Access

Large aligned treebanks for syntax-based machine translation

Kotzé, G., Vandeghinste, V., Martens, S. & Tiedemann, J., 2017, In: Language Resources and Evaluation. 51, 2, p. 249-282 34 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Neural Machine Translation with Extended Context

Tiedemann, J. & Scherrer, Y., 2017, Proceedings of the Third Workshop on Discourse in Machine Translation. Stroudsburg: Association for Computational Linguistics, p. 82-92 11 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa)

Tiedemann, J. (ed.), 1 May 2017, Gothenburg: Linköping University Electronic Press. 337 p. (Linköping Electronic Conference Proceedings; no. 131)

Research output: Book/Report; Anthology or special issue; Scientific; peer-review

Open Access

Rule-based Machine Translation from English to Finnish

Hurskainen, A. & Tiedemann, J., 2017, Proceedings of the Second Conference on Machine Translation (WMT2017). Copenhagen, Denmark: Association for Computational Linguistics, p. 323-329 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation

Tjong Kim Sang, E., Bollmann, M., Boschker, R., Casacuberta, F., Dietz, F., Dipper, S., Domingo, M., van der Goot, R., van Koppen, M., Ljubešić, N., Östling, R., Petran, F., Pettersson, E., Scherrer, Y., Schraagen, M., Sevens, L., Tiedemann, J., Vanallemeersch, T. & Zervanou, K., 2017, In: Computational Linguistics in the Netherlands. 7, p. 53-64 12 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Open Access

The Helsinki Neural Machine Translation System

Östling, R., Scherrer, Y., Tiedemann, J. & Nieminen, T., 2017, Proceedings of the Second Conference on Machine Translation (WMT2017). Stroudsburg: Association for Computational Linguistics, p. 338-347 10 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

2016

A Linear Baseline Classifier for Cross-Lingual Pronoun Prediction

Tiedemann, J., 1 Aug 2016, The 54th Annual Meeting of the Association for Computational Linguistics: Proceedings of the First Conference on Machine Translation (WMT). Stroudsburg: The Association for Computational Linguistics, p. 616-619 4 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Climbing Mont BLEU: The Strange World of Reachable High-BLEU Translations

Smith, A., Hardmeier, C. & Tiedemann, J., 2016, In: Baltic Journal of Modern Computing. 4, 2, p. 269-281 13 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task

Malmasi, S., Zampieri, M., Ljubešić, N., Nakov, P., Ali, A. & Tiedemann, J., 1 Dec 2016, Third Workshop on NLP for Similar Languages, Varieties and Dialects: Proceedings of the Workshop. The COLING 2016 Organizing Committee, p. 1-14 14 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific

Open Access

Efficient word alignment with Markov Chain Monte Carlo

Östling, R. & Tiedemann, J., 1 Oct 2016, In: The Prague Bulletin of Mathematical Linguistics. 106, p. 125-146 22 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Open Access

Finding Alternative Translations in a Large Corpus of Movie Subtitles

Tiedemann, J., 2016, Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016). Calzolari, N., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J. & Piperidis, S. (eds.). p. 3518-3522 5 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction

Guillou, L., Hardmeier, C., Nakov, P., Stymne, S., Tiedemann, J., Versley, Y., Cettolo, M., Webber, B. & Popescu-Belis, A., 1 Aug 2016, The 54th Annual Meeting of the Association for Computational Linguistics: Proceedings of the First Conference on Machine Translation (WMT). Stroudsburg: The Association for Computational Linguistics, p. 525-542 18 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific

Open Access

OpenSubtitles2015: Extracting Large Parallel Corpora from Movie and TV Subtitles

Lison, P. & Tiedemann, J., 2016, Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016). Calzolari, N., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J. & Piperidis, S. (eds.).

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

OPUS -- Parallel Corpora for Everyone

Tiedemann, J., 2016, In: Baltic Journal of Modern Computing. p. 384 1 p.

Research output: Contribution to journal; Article; Scientific

Open Access

Phrase-Based SMT for Finnish with More Data, Better Models and Alternative Alignment and Translation Tools

Tiedemann, J., Cap, F., Kanerva, J., Ginter, F., Stymne, S., Östling, R. & Weller-Di Marco, M., 1 Aug 2016, The 54th Annual Meeting of the Association for Computational Linguistics: Proceedings of the First Conference on Machine Translation (WMT). Stroudsburg: The Association for Computational Linguistics, p. 391-398 8 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

Synthetic Treebanking for Cross-Lingual Dependency Parsing

Tiedemann, J. & Agić, Ž., Jan 2016, In: Journal of Artificial Intelligence Research. 55, p. 209-248 40 p.

Research output: Contribution to journal; Article; Scientific; peer-review

Open Access

Tagging Ingush - Language Technology For Low-Resource Languages Using Resources From Linguistic Field Work

Tiedemann, J., Nichols, J. & Sprouse, R., 1 Dec 2016, Language Technology Resources and Tools for Digital Humanities (LT4DH): Proceedings of the Workshop. Osaka, p. 148-155 8 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

The Challenges of Multi-dimensional Sentiment Analysis Across Languages

Öhman, E., Honkela, T. & Tiedemann, J., 1 Dec 2016, Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media (PEOPLES): Proceedings of the Workshop. Osaka, Japan: The COLING 2016 Organizing Committee, p. 138-142 5 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Open Access

2015

Baseline Models for Pronoun Prediction and Pronoun-Aware Translation

Tiedemann, J., 1 Sep 2015, Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT). The Association for Computational Linguistics, p. 108-114 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Boosting English-Chinese Machine Transliteration via High Quality Alignment and Multilingual Resources

Shao, Y., Tiedemann, J. & Nivre, J., 1 Jul 2015, Proceedings of the Fifth Named Entity Workshop, joint with 53rd ACL and the 7th IJCNLP. p. 56-60 5 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Cross-Lingual Dependency Parsing with Universal Dependencies and Predicted PoS Labels

Tiedemann, J., 1 Aug 2015, Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015). p. 340-349 10 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Improving the Cross-Lingual Projection of Syntactic Dependencies

Tiedemann, J., 1 May 2015, Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015). p. 191-199 9 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Morphological Segmentation and OPUS for Finnish-English Machine Translation

Tiedemann, J., Ginter, F. & Kanerva, J., 1 Sep 2015, Proceedings of the Tenth Workshop on Statistical Machine Translation. The Association for Computational Linguistics, p. 177-183 7 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Part-of-Speech Driven Cross-Lingual Pronoun Prediction with Feed-Forward Neural Networks

Callin, J., Hardmeier, C. & Tiedemann, J., 1 Sep 2015, Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT). The Association for Computational Linguistics, p. 59-64 6 p.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review