The Impact of Copyright and Personal Data Laws on the Creation and Use of Models for Language Technologies

Aleksei Kelli, Arvi Tavast, Krister Lindén, Kadri Vider, Ramunas Birštonas, Penny Labropoulou, Irene Kull, Gaabriel Tavits, Age Värv, Pavel Stranák

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review


The authors address the legal issues relating to the creation and use of language models. The article begins with an explanation of the development of language technologies. The authors analyse the technological process within the framework of copyright, related rights and personal data protection law. The authors also cover commercial use of language models. The authors’ main argument is that legal restrictions applicable to language data containing copyrighted material and personal data usually do not apply to language models. Language models are generally not considered derivative works. Given the wide range of language models, however, this position is not absolute.
Original language: English
Title of host publication: Selected Papers from the CLARIN Annual Conference 2019
Number of pages: 13
Publisher: Linköping University Electronic Press
Publication date: 3 Jul 2020
ISBN (Electronic): 978-91-7929-807-4
Publication status: Published - 3 Jul 2020
MoE publication type: A4 Article in conference proceedings

Publication series

Name: Linköping Electronic Conference Proceedings
Publisher: Linköping University Press
ISSN (Electronic): 1650-3740
