Abstract
Real document collections do not fit the independence assumptions asserted by most statistical topic models, but how badly do they violate them? We present a Bayesian method for measuring how well a topic model fits a corpus. Our approach is based on posterior predictive checking, a method for diagnosing Bayesian models in user-defined ways. Our method can identify where a topic model fits the data, where it falls short, and in which directions it might be improved. © 2011 Association for Computational Linguistics.
Cite
CITATION STYLE
Mimno, D., & Blei, D. (2011). Bayesian checking for topic models. In EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 227–237).
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.