The Talk of Norway: a richly annotated corpus of the Norwegian parliament, 1998–2016

Emanuele Lapponi; Martin G. Søyland; Erik Velldal; Stephan Oepen

Journal ArticleOPEN ACCESS

The Talk of Norway: a richly annotated corpus of the Norwegian parliament, 1998–2016

Language Resources and Evaluation (2018) 52(3) 873-893

DOI: 10.1007/s10579-018-9411-5

31Citations

29Readers

Abstract

In this work we present the Talk of Norway (ToN) data set, a collection of Norwegian Parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata harvested from different sources, and augmented with language type, sentence, token, lemma, part-of-speech, and morphological feature annotations. We also present a pilot study on party classification in the Norwegian Parliament, carried out in the context of a cross-faculty collaboration involving researchers from both Political Science and Computer Science. Our initial experiments demonstrate how the linguistic and institutional annotations in ToN can be used to gather insights on how different aspects of the political process affect classification.

Author supplied keywords

Cite

CITATION STYLE

APA

Lapponi, E., Søyland, M. G., Velldal, E., & Oepen, S. (2018). The Talk of Norway: a richly annotated corpus of the Norwegian parliament, 1998–2016. Language Resources and Evaluation, 52(3), 873–893. https://doi.org/10.1007/s10579-018-9411-5

The Talk of Norway: a richly annotated corpus of the Norwegian parliament, 1998–2016

Abstract

Author supplied keywords

Cite

Register to see more suggestions