Plenary debates of the parliament of Finland as linked open data and in parla-CLARIN markup

Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela, Eero Hyvönen

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

Abstrakti

This paper presents a knowledge graph created by transforming the plenary debates of the Parliament of Finland (1907-) into Linked Open Data (LOD). The data, totaling over 900 000 speeches, with automatically created semantic annotations and rich ontology-based metadata, are published in a Linked Open Data Service and are used via a SPARQL API and as data dumps. The speech data is part of larger LOD publication FinnParla that also includes prosopographical data about the politicians. The data is being used for studying parliamentary language and culture in Digital Humanities in several universities. To serve a wider variety of users, the entirety of this data was also produced using Parla-CLARIN markup. We present the first publication of all Finnish parliamentary debates as data. Technical novelties in our approach include the use of both Parla-CLARIN and an RDF schema developed for representing the speeches, integration of the data to a new Parliament of Finland Ontology for deeper data analyses, and enriching the data with a variety of external national and international data sources.

Alkuperäiskielienglanti
Otsikko3rd Conference on Language, Data and Knowledge, LDK 2021
ToimittajatDagmar Gromann, Gilles Serasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo, Barbara Heinisch
Sivumäärä17
KustantajaSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Julkaisupäivä1 elok. 2021
Sivut1-17
Artikkeli no8
ISBN (elektroninen)978-3-95977-199-3
DOI - pysyväislinkit
TilaJulkaistu - 1 elok. 2021
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
Tapahtuma3rd Conference on Language, Data and Knowledge, LDK 2021 - Zaragoza, Espanja
Kesto: 1 syysk. 20213 syysk. 2021

Julkaisusarja

NimiOpenAccess Series in Informatics
Vuosikerta93
ISSN (painettu)2190-6807

Lisätietoja

Publisher Copyright:
© Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela, and Eero Hyvönen; licensed under Creative Commons License CC-BY 4.0

Tieteenalat

  • 113 Tietojenkäsittely- ja informaatiotieteet

Siteeraa tätä