Scale-Invariant Infinite Hierarchical Topic Model

Abstract

Hierarchical topic models have been employed to organize a large number of diverse topics from corpora into a latent tree structure. However, existing models yield fragmented topics with overlapping themes whose expected probability becomes exponentially smaller along the depth of the tree. To solve this intrinsic problem, we propose a scale-invariant infinite hierarchical topic model (ihLDA). The ihLDA adaptively adjusts the topic creation to make the expected topic probability decay considerably slower than that in existing models. Thus, it facilitates the estimation of deeper topic structures encompassing diverse topics in a corpus. Furthermore, the ihLDA extends a widely used tree-structured prior (Adams et al., 2010) in a hierarchical Bayesian way, which enables drawing an infinite topic tree from the base tree while efficiently sampling the topic assignments for the words. Experiments demonstrate that the ihLDA has better topic uniqueness and hierarchical diversity than existing approaches, including state-of-the-art neural models.
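
The exponential decay mentioned in the abstract can be illustrated with a small calculation on the tree-structured stick-breaking prior of Adams et al. (2010), which the ihLDA extends. This is a minimal sketch under stated assumptions, not the ihLDA itself: the function name is hypothetical, the stopping proportion is taken as nu ~ Beta(1, alpha) with a constant alpha at every depth (a simplification), and the branching proportion as psi ~ Beta(1, gamma). Because the sticks are independent, the expected mass of the first-child node at each depth factorizes into a product of expectations.

```python
def expected_first_child_mass(depth, alpha=1.0, gamma=1.0):
    """Expected prior mass of the node reached by always taking the first child.

    Assumptions (illustrative, not from the paper):
      nu  ~ Beta(1, alpha): probability of stopping at a node
      psi ~ Beta(1, gamma): share of the passed-down mass given to the first child
    """
    stop = 1.0 / (1.0 + alpha)          # E[nu]
    pass_down = alpha / (1.0 + alpha)   # E[1 - nu]
    first_child = 1.0 / (1.0 + gamma)   # E[psi]
    # Each extra level multiplies the expectation by the same factor < 1,
    # so the expected mass shrinks geometrically with depth.
    return stop * (pass_down * first_child) ** depth


if __name__ == "__main__":
    for d in range(5):
        print(d, expected_first_child_mass(d))
```

Under these assumptions every additional level multiplies the expected mass by alpha / ((1 + alpha)(1 + gamma)), which is the kind of depth-wise shrinkage the abstract says the ihLDA is designed to slow down.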

Cite

Eshima, S., & Mochihashi, D. (2023). Scale-Invariant Infinite Hierarchical Topic Model. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 11731–11746). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.745
