Topic and trend detection in text collections using latent dirichlet allocation

84Citations
Citations of this article
109Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Algorithms that enable the process of automatically mining distinct topics in document collections have become increasingly important due to their applications in many fields and the extensive growth of the number of documents in various domains. In this paper, we propose a generative model based on latent Dirichlet allocation that integrates the temporal ordering of the documents into the generative process in an iterative fashion. The document collection is divided into time segments where the discovered topics in each segment is propagated to influence the topic discovery in the subsequent time segments. Our experimental results on a collection of academic papers from CiteSeer repository show that segmented topic model can effectively detect distinct topics and their evolution over time. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Bolelli, L., Ertekin, Ş., & Giles, C. L. (2009). Topic and trend detection in text collections using latent dirichlet allocation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5478 LNCS, pp. 776–780). https://doi.org/10.1007/978-3-642-00958-7_84

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free