Topic modeling in Twitter: Aggregating tweets by conversations

95Citations
Citations of this article
154Readers
Mendeley users who have this article in their library.

Abstract

We propose a new pooling technique for topic modeling in Twitter, which groups together tweets occurring in the same user-to-user conversation. Under this scheme, tweets and their replies are aggregated into a single document and the users who posted them are considered co-authors. To compare this new scheme against existing ones, we train topic models using Latent Dirichlet Allocation (LDA) and the Author-Topic Model (ATM) on datasets consisting of tweets pooled according to the different methods. Using the underlying categories of the tweets in this dataset as a noisy ground truth, we show that this new technique outperforms other pooling methods in terms of clustering quality and document retrieval.

Cite

CITATION STYLE

APA

Alvarez-Melis, D., & Saveski, M. (2016). Topic modeling in Twitter: Aggregating tweets by conversations. In Proceedings of the 10th International Conference on Web and Social Media, ICWSM 2016 (pp. 519–522). AAAI Press. https://doi.org/10.1609/icwsm.v10i1.14817

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free