LDA+: An Extended LDA Model for Topic Hierarchy and Discovery

Amani Drissi; Ahmed Khemiri; Salma Sassi; Anis Tissaoui; Richard Chbeir; Abderrazek Jemai

Conference Proceedings

LDA+: An Extended LDA Model for Topic Hierarchy and Discovery

Communications in Computer and Information Science (2022) 1716 CCIS 14-26

DOI: 10.1007/978-981-19-8234-7_2

0Citations

3Readers

Get full text

Abstract

The success of topic modeling algorithms depends on their ability to analyze, index and classify large text corpora. These algorithms could be classified into two groups where the first one is oriented to classify textual corpus according to their dominant topics such as LDA, LSA and PLSA which are the most known techniques. The second group is dedicated to extract the relationships among the generated topics like HLDA, PAM and CTM. However, each algorithm among these groups is dedicated to a single task and there is no technique that makes it possible to carry out several analyses on textual corpus at the same time. In order to cope with this problem, we propose here a new technique based on LDA topic modeling to automatically classify a large text corpora according to their relevant topics, discover new topics (sub-topics) based on the extracted ones and hierarchy the generated topics in order to analyse data more deeply. Experiments have been conducted to measure the performance of our solution compared to the existing techniques. The results obtained are more than satisfactory.

Author supplied keywords

Cite

CITATION STYLE

APA

Drissi, A., Khemiri, A., Sassi, S., Tissaoui, A., Chbeir, R., & Jemai, A. (2022). LDA+: An Extended LDA Model for Topic Hierarchy and Discovery. In Communications in Computer and Information Science (Vol. 1716 CCIS, pp. 14–26). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-19-8234-7_2

LDA+: An Extended LDA Model for Topic Hierarchy and Discovery

Abstract

Author supplied keywords

Cite

Register to see more suggestions