Abstract
Topic models jointly learn topics and document-level topic distribution. Extrinsic evaluation of topic models tends to focus exclusively on topic-level evaluation, e.g. by assessing the coherence of topics. We demonstrate that there can be large discrepancies between topic- and document-level model quality, and that basing model evaluation on topic-level analysis can be highly misleading. We propose a method for automatically predicting topic model quality based on analysis of document-level topic allocations, and provide empirical evidence for its robustness.
Cite
CITATION STYLE
Bhatia, S., Lau, J. H., & Baldwin, T. (2017). An automatic approach for document-level topic model evaluation. In CoNLL 2017 - 21st Conference on Computational Natural Language Learning, Proceedings (pp. 206–215). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/k17-1022
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.