If you made any changes in Pure these will be visible here soon.

Personal profile

Description of research and teaching

Curriculum vitae

I work as professor of language technology at the Department of Digital Humanities at the University of Helsinki. My main research interest is in cross-lingual NLP and machine translation.

  • Since August 2015: Professor of Language Technology at the Department of Digital Humanities / HELDIG (formerly at the Department of Modern Languages), University of Helsinki
  • September 2014 – July 2015: Senior Researcher at the Department of Linguistics and Philology, Uppsala University
  • September 2009 – August 2014: Visiting Professor at the Department of Linguistics and Philology, Uppsala University
  • September 2004 – August 2009: PostDoc researcher at the Department of Information Science/Humanities Computing (Informatiekunde), University of Groningen
  • January 2004 – August 2004: Lecturer in computational linguistics and coordinator for the language technology programme, Department of Linguistics and Philology, Uppsala University
  • 2000 – 2003: Ph.D. research at the Department of Linguistics, Uppsala University
  • 2001 – 2002: Visiting Ph.D. student, Division of Informatics, Edinburgh University, UK
  • 1997 – 1999: Research assistent, Department of Linguistics, Uppsala University
  • 1991 – 1997: Masters in Computer Science (Diplom für Informatik), “Otto-von-Guericke” University, Magdeburg, Germany

Recent Projects

Resources and Tools

  • OPUS – a collection of freely available parallel corpora and tools
  • fiskmö translator – a translation demo for the Nordic languages
  • efmaral and eflomal – tools for efficient word alignment
  • WMT en-fi 20162017: official MT test sets for Finnish-English
  • HNMT – the Helsinki Neural Machine Translation system
  • Lingua::Align – a toolbox for tree-to-tree alignment
  • Uplug – a toolbox for processing parallel corpora
  • Lingua::Ident::Blacklists – language identifier for related languages
  • Docent – a document-level SMT decoder
  • pdf2xml – a converter for PDF documents
  • subalign – tools for converting and aligning movie subtitles
  • Helsinki-NLP at github and bitbucket

Active PhD Students

Former PhD Students

Education/Academic qualification

Computational Linguistics, PhD, Uppsala University

20002003

Computer Science, M.Sc., Otto Von Guericke University, Magdeburg

19911997

Keywords

  • 6121 Languages
  • Computational Linguistics
  • machine translation
  • 113 Computer and information sciences
  • language technology
  • machine learning
  • Natural language processing
  • Artificial intelligence

International and National Collaboration Publications and projects within past five years.

Selected Publications 2009 2019

Emerging Language Spaces Learned From Massively Multilingual Corpora

Tiedemann, J., 2018, Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018). Mäkelä, E., Tolonen, M. & Tuominen, J. (eds.). Helsinki: CEUR Workshop Proceedings, Vol. 2084. p. 188-197 (CEUR Workshop Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File

Synthetic Treebanking for Cross-Lingual Dependency Parsing

Tiedemann, J. & Agi, Z., Jan 2016, In : Journal of Artificial Intelligence Research. 55, p. 209-248 40 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Efficient word alignment with Markov Chain Monte Carlo

Östling, R. & Tiedemann, J., 1 Oct 2016, In : The Prague Bulletin of Mathematical Linguistics. 106, p. 125-146 22 p.

Research output: Contribution to journalArticleScientificpeer-review

Open Access
File

Parallel Data, Tools and Interfaces in OPUS

Tiedemann, J., 1 May 2012, Unknown host publication. Calzolari, N., Choukri, K., Declerck, T., Dogan, M. U., Maegaard, B., Mariani, J., Odijk, J. & Piperidis, S. (eds.). p. 2214-2218 5 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Bitext Alignment

Tiedemann, J. & Hirst, G. (ed.), 2011, San Rafael, California (USA): Morgan & Claypool. (Synthesis Lectures on Human Language Technologies)

Research output: Book/ReportBookScientificpeer-review

Selected Projects 2004 2023

Selected Activities 2007 2019

What is the meaning of life? I found it in translation ...

Jörg Tiedemann (Speaker)
12 Dec 2018

Activity: Talk or presentation typesOral presentation

FoTran 2018: Found in translation

Jörg Tiedemann (Scientific Committee Chair), Hande Celikkanat (Member of organizing committee), Juan Raul Vazquez Carrillo (Member of organizing committee)
28 Sep 2018

Activity: Participating in or organising an event typesOrganisation and participation in conferences, workshops, courses, seminars