A data-driven approach to the changing vocabulary of the ‘nation’ in English, Dutch, Swedish and Finnish newspapers, 1750-1950

Tutkimustuotos: KonferenssimateriaalitKonferenssiabstraktiTutkimusvertaisarvioitu

Kuvaus

This project aims to mine two centuries worth of digitised newspapers in four languages, and to propose a methodologically sound, reusable approach to carry out quality historical research on the changing vocabulary relating to nationhood. The newspapers stem from different sources and countries, and are available in different formats. Massive digitized newspaper collections are increasingly used to address historical questions through mining textual data. They are more seldom used for comparative projects cross linguistic and national boundaries. In this paper, we address the methodological challenges the use of newspapers from different political contexts, languages and datasets poses, and lay out our approach to tackle a comparative study for the Netherlands, Finland, Sweden, and the UK.
Alkuperäiskielienglanti
TilaJulkaistu - heinäkuuta 2019
OKM-julkaisutyyppiEi sovellu
TapahtumaDigital Humanities 2019 - Utrecht, Alankomaat
Kesto: 9 heinäkuuta 201912 heinäkuuta 2019

Konferenssi

KonferenssiDigital Humanities 2019
MaaAlankomaat
KaupunkiUtrecht
Ajanjakso09/07/201912/07/2019

Tieteenalat

  • 6160 Muut humanistiset tieteet
  • 113 Tietojenkäsittely- ja informaatiotieteet

Lainaa tätä

@conference{6f1940b7132f47c1aff3227620948a0c,
title = "A data-driven approach to the changing vocabulary of the ‘nation’ in English, Dutch, Swedish and Finnish newspapers, 1750-1950",
abstract = "This project aims to mine two centuries worth of digitised newspapers in four languages, and to propose a methodologically sound, reusable approach to carry out quality historical research on the changing vocabulary relating to nationhood. The newspapers stem from different sources and countries, and are available in different formats. Massive digitized newspaper collections are increasingly used to address historical questions through mining textual data. They are more seldom used for comparative projects cross linguistic and national boundaries. In this paper, we address the methodological challenges the use of newspapers from different political contexts, languages and datasets poses, and lay out our approach to tackle a comparative study for the Netherlands, Finland, Sweden, and the UK.",
keywords = "6160 Other humanities, 113 Computer and information sciences",
author = "Simon Hengchen and Ruben Ros and Jani Marjanen",
year = "2019",
month = "7",
language = "English",
note = "Digital Humanities 2019 ; Conference date: 09-07-2019 Through 12-07-2019",

}

A data-driven approach to the changing vocabulary of the ‘nation’ in English, Dutch, Swedish and Finnish newspapers, 1750-1950. / Hengchen, Simon; Ros, Ruben; Marjanen, Jani.

2019. Abstraktin lähde: Digital Humanities 2019, Utrecht, Alankomaat.

Tutkimustuotos: KonferenssimateriaalitKonferenssiabstraktiTutkimusvertaisarvioitu

TY - CONF

T1 - A data-driven approach to the changing vocabulary of the ‘nation’ in English, Dutch, Swedish and Finnish newspapers, 1750-1950

AU - Hengchen, Simon

AU - Ros, Ruben

AU - Marjanen, Jani

PY - 2019/7

Y1 - 2019/7

N2 - This project aims to mine two centuries worth of digitised newspapers in four languages, and to propose a methodologically sound, reusable approach to carry out quality historical research on the changing vocabulary relating to nationhood. The newspapers stem from different sources and countries, and are available in different formats. Massive digitized newspaper collections are increasingly used to address historical questions through mining textual data. They are more seldom used for comparative projects cross linguistic and national boundaries. In this paper, we address the methodological challenges the use of newspapers from different political contexts, languages and datasets poses, and lay out our approach to tackle a comparative study for the Netherlands, Finland, Sweden, and the UK.

AB - This project aims to mine two centuries worth of digitised newspapers in four languages, and to propose a methodologically sound, reusable approach to carry out quality historical research on the changing vocabulary relating to nationhood. The newspapers stem from different sources and countries, and are available in different formats. Massive digitized newspaper collections are increasingly used to address historical questions through mining textual data. They are more seldom used for comparative projects cross linguistic and national boundaries. In this paper, we address the methodological challenges the use of newspapers from different political contexts, languages and datasets poses, and lay out our approach to tackle a comparative study for the Netherlands, Finland, Sweden, and the UK.

KW - 6160 Other humanities

KW - 113 Computer and information sciences

UR - https://hengchen.net/dh2019

M3 - Abstract

ER -