Hierarchical Summarization of Text Documents Using Topic Modeling and Formal Concept Analysis

10Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Availability of large collection of text documents triggers the need for large-scale text summarization. Identification of topics and organization of documents are important for analysis and exploration of textual data. This study is a part of the growing body of research on large-scale text summarization. It is an experimental approach and it differs from earlier works in the sense that it generates topic hierarchy which simultaneously provides a hierarchical structure to the document corpus. The documents are labeled with topics using latent Dirichlet allocation (LDA) and are automatically organized in a lattice structure using formal concept analysis (FCA). This groups the semantically related documents together. The lattice is further converted to a tree for better visualization and easy exploration of data. To signify the effectiveness of the approach, we have carried out the experiment and evaluation on 20 Newsgroup Dataset. Results depict that the presented method is considerably successful in forming topic hierarchy.

Cite

CITATION STYLE

APA

Akhtar, N., Javed, H., & Ahmad, T. (2019). Hierarchical Summarization of Text Documents Using Topic Modeling and Formal Concept Analysis. In Advances in Intelligent Systems and Computing (Vol. 839, pp. 21–33). Springer Verlag. https://doi.org/10.1007/978-981-13-1274-8_2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free