LeadLag LDA: Estimating Topic Specific Leads and Lags of Information Outlets

5Citations
Citations of this article
34Readers
Mendeley users who have this article in their library.

Abstract

Identifying which outlet in social media leads the rest in disseminating novel information on specific topics is an interesting challenge for information analysts and social scientists. In this work, we hypothesize that novel ideas are disseminated through the creation and propagation of new or newly emphasized key words, and therefore lead/lag of outlets can be estimated by tracking word usage across these outlets. First, we demonstrate the validaty of our hypothesis by showing that a simple TF-IDF based nearest-neighbors approach can recover generally accepted lead/lag behavior on the outlets pair of ACM journal articles and conference papers. Next, we build a new topic model called LeadLag LDA that estimates the lead/lag of the outlets on specific topics. We validate the topic model using the lead/lag results from the TF-IDF nearest neighbors approach. Finally, we present results from our model on two different outlet pairs of blogs vs. news media and grant proposals vs. research publications that reveal interesting patterns.

Cite

CITATION STYLE

APA

Nallapati, R., Shi, X., McFarland, D., Leskovec, J., & Jurafsky, D. (2011). LeadLag LDA: Estimating Topic Specific Leads and Lags of Information Outlets. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, ICWSM 2011 (pp. 558–561). AAAI Press. https://doi.org/10.1609/icwsm.v5i1.14147

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free