Topic Modeling for Exploring Cancer-Related Coverage in Journalistic Texts

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Topic modeling has been used for many applications, but has not been applied to science and health communication research yet. In this paper, using topic modeling for this novel domain is explored, by investigating the coverage of cancer in news items from the New York Times since 1970 with the Latent Dirichlet Allocation (LDA) model. Content analysis of cancer in print media has been performed before, but at a much smaller scope and with manual rather than computational analysis. We collected 45.684 articles concerning cancer via the New York Times API to build the LDA model upon. Our results show a predominance of breast cancer in news articles as compared with other types of cancer, similar to previous studies. Additionally, our topic model shows 6 distinct topics: research on cancer, lifestyle and mortality, the healthcare system, business and insurance issues regarding cancer treatment, environmental politics and American politics on cancer-related policies. Since topic modeling is a computational technique, the model has more difficulty with understanding the meaning of the analyzed text than (most) humans. Therefore, future research will be set up to let the public contribute to analysis of a topic model.

Author supplied keywords

Cite

CITATION STYLE

APA

Hariman, N., de Vries, M., & Smeets, I. (2019). Topic Modeling for Exploring Cancer-Related Coverage in Journalistic Texts. In Communications in Computer and Information Science (Vol. 1021, pp. 43–51). Springer. https://doi.org/10.1007/978-3-030-31978-6_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free