Supervised topic models

ArXiv: 1003.0783
621Citations
Citations of this article
1.4kReaders
Mendeley users who have this article in their library.

Abstract

We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents. The model accommodates a variety of response types. We derive a maximum-likelihood procedure for parameter estimation, which relies on variational approximations to handle intractable posterior expectations. Prediction problems motivate this research: we use the fitted model to predict response values for new documents. We test sLDA on two real-world problems: movie ratings predicted from reviews, and web page popularity predicted from text descriptions. We illustrate the benefits of sLDA versus modern regularized regression, as well as versus an unsupervised LDA analysis followed by a separate regression.

Cite

CITATION STYLE

APA

Blei, D. M., & McAuliffe, J. D. (2008). Supervised topic models. In Advances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference. Neural Information Processing Systems.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free