Latent dirichlet allocation

ISSN: 10495258
202Citations
Citations of this article
464Readers
Mendeley users who have this article in their library.

Abstract

We propose a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams [6], and TTof- rnarm's aspect model, also known as probabilistic latent semantic indexing (pLSI) [3]. In the context of text modeling, our model posits that each document is generated as a mixture of topics, where the continuous-valued mixture proportions are distributed as a latent Dirichlet random variable. Inference and learning are carried out efficiently via variational algorithms. We present empirical results on applications of this model to problems in text modeling, collaborative filtering, and text classification.

Cite

CITATION STYLE

APA

Blei, D. M., Ng, A. Y., & Jordan, M. T. (2002). Latent dirichlet allocation. In Advances in Neural Information Processing Systems. Neural information processing systems foundation.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free