Additive regularization for topic modeling in sociological studies of user-generated texts

Murat Apishev; Sergei Koltcov; Olessia Koltsova; Sergey Nikolenko; Konstantin Vorontsov

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10061 LNAI 169-184

DOI: 10.1007/978-3-319-62434-1_14

14 Citations

9 Readers

Abstract

Social studies of the Internet have adopted large-scale text mining for unsupervised discovery of topics related to specific subjects. Additive regularization of topic models (ARTM), a recently developed approach to topic modeling, provides fast inference and, through a wide variety of possible regularizers, more control over the topics than developing LDA extensions does. We apply ARTM to mining ethnic-related content from the Russian-language blogosphere, introduce a new combined regularizer, and compare models derived from ARTM with LDA. Human evaluations show that ARTM is better for mining topics on specific subjects, finding more relevant topics of higher or comparable quality. We also include a detailed analysis of how to tune regularization coefficients in ARTM models.

Cite

Apishev, M., Koltcov, S., Koltsova, O., Nikolenko, S., & Vorontsov, K. (2017). Additive regularization for topic modeling in sociological studies of user-generated texts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10061 LNAI, pp. 169–184). Springer Verlag. https://doi.org/10.1007/978-3-319-62434-1_14
