Using natural language generation to bootstrap missing Wikipedia articles: A human-centric perspective

Lucie Aimée Kaffee; Pavlos Vougiouklis; Elena Simperl; Philipp Cimiano

Journal ArticleOPEN ACCESS

Using natural language generation to bootstrap missing Wikipedia articles: A human-centric perspective

Semantic Web (2022) 13(2) 163-194

DOI: 10.3233/SW-210431

8Citations

27Readers

Abstract

Nowadays natural language generation (NLG) is used in everything from news reporting and chatbots to social media management. Recent advances in machine learning have made it possible to train NLG systems that seek to achieve human-level performance in text writing and summarisation. In this paper, we propose such a system in the context of Wikipedia and evaluate it with Wikipedia readers and editors. Our solution builds upon the ArticlePlaceholder, a tool used in 14 under-resourced Wikipedia language versions, which displays structured data from the Wikidata knowledge base on empty Wikipedia pages. We train a neural network to generate an introductory sentence from the Wikidata triples shown by the ArticlePlaceholder, and explore how Wikipedia users engage with it. The evaluation, which includes an automatic, a judgement-based, and a task-based component, shows that the summary sentences score well in terms of perceived fluency and appropriateness for Wikipedia, and can help editors bootstrap new articles. It also hints at several potential implications of using NLG solutions in Wikipedia at large, including content quality, trust in technology, and algorithmic transparency.

Author supplied keywords

Cite

CITATION STYLE

APA

Kaffee, L. A., Vougiouklis, P., Simperl, E., & Cimiano, P. (2022). Using natural language generation to bootstrap missing Wikipedia articles: A human-centric perspective. Semantic Web, 13(2), 163–194. https://doi.org/10.3233/SW-210431

Using natural language generation to bootstrap missing Wikipedia articles: A human-centric perspective

Abstract

Author supplied keywords

Cite

Register to see more suggestions