The Open Parallel Corpus

Project Details

Description

A large collection of freely available parallel corpora and associated tools and interfaces
Short title: OPUS
Acronym: OPUS
Status: Active
Effective start/end date: 01/06/2004 → …

Fields of Science

  • 113 Computer and information sciences
  • parallel corpora
  • machine translation
  • 6121 Languages
  • computational linguistics
  • language technology
  • natural language processing
  • corpus linguistics

Research Output

  • 7 Conference contribution
  • 1 Chapter

OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora

Lison, P., Tiedemann, J. & Kouylekov, M., 2018, Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018). Calzolari, N., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. & Tokunaga, T. (eds.). Paris: European Language Resources Association (ELRA), p. 1742-1748 7 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Scientific > peer-review

Open Access

Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language

Bozovic, P., Erjavec, T., Tiedemann, J., Ljubesic, N. & Gorjanc, V., 2018, Proceedings of the conference on Language Technologies & Digital Humanities 2018. Fišer, D. & Pančur, A. (eds.). Ljubljana: Ljubljana University Press, p. 24-28 5 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Scientific > peer-review

Open Access

Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus

Skadiņš, R., Tiedemann, J., Rozis, R. & Deksne, D., 1 May 2014, Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC-2014). p. 1850-1855 6 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Scientific > peer-review

Activities

  • 1 Organisation and participation in conferences, workshops, courses, seminars
  • 1 Academic visit at UH

Johannes Graën

Jörg Tiedemann (Host)

18 Nov 2019 → 22 Nov 2019

Activity: Hosting a visitor types > Academic visit at UH

The 19th Annual Conference of the European Association for Machine Translation (EAMT2016)

Jörg Tiedemann (Poster Presentation)

30 May 2016 → 1 Jun 2016

Activity: Participating in or organising an event types > Organisation and participation in conferences, workshops, courses, seminars