Incorporating probabilistic knowledge into topic models

Liang Yao; Yin Zhang; Baogang Wei; Hongze Qian; Yibing Wang

Conference Proceedings

Incorporating probabilistic knowledge into topic models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9078 586-597

DOI: 10.1007/978-3-319-18032-8_46

23Citations

13Readers

Get full text

Abstract

Probabilistic Topic Models could be used to extract low-dimension aspects from document collections. However, such models without any human knowledge often produce aspects that are not interpretable. In recent years, a number of knowledge-based models have been proposed, which allow the user to input prior knowledge of the domain to produce more coherent and meaningful topics. In this paper, we incorporate human knowledge in the form of probabilistic knowledge base into topic models. By combining latent Dirichlet allocation, a widely used topic model with Probase, a large-scale probabilistic knowledge base, we improve the semantic coherence significantly. Our evaluation results will demonstrate the effectiveness of our method.

Author supplied keywords

Cite

CITATION STYLE

APA

Yao, L., Zhang, Y., Wei, B., Qian, H., & Wang, Y. (2015). Incorporating probabilistic knowledge into topic models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9078, pp. 586–597). Springer Verlag. https://doi.org/10.1007/978-3-319-18032-8_46

Incorporating probabilistic knowledge into topic models

Abstract

Author supplied keywords

Cite

Register to see more suggestions