Bayesian checking for topic models

David Mimno; David Blei

Conference Proceedings

Bayesian checking for topic models

EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (2011) 227-237

70Citations

275Readers

Abstract

Real document collections do not fit the independence assumptions asserted by most statistical topic models, but how badly do they violate them? We present a Bayesian method for measuring how well a topic model fits a corpus. Our approach is based on posterior predictive checking, a method for diagnosing Bayesian models in user-defined ways. Our method can identify where a topic model fits the data, where it falls short, and in which directions it might be improved. © 2011 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Mimno, D., & Blei, D. (2011). Bayesian checking for topic models. In EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 227–237).

Bayesian checking for topic models

Abstract

Cite

Register to see more suggestions