Text document topical recursive clustering and automatic labeling of a hierarchy of document clusters

6Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The overwhelming amount of textual documents available nowadays highlights the need for information organization and discovery. Effectively organizing documents into a hierarchy of topics and subtopics makes it easier for users to browse the documents. This paper borrows community mining from social network analysis to generate a hierarchy of topically coherent document clusters. It focuses on giving the document clusters descriptive labels. We propose to use betweenness centrality measure in networks of co-occurring terms to label the clusters. We also incorporate keyphrase extraction and automatic titling in cluster labeling. The results show that the cluster labeling method utilizing KEA to extract keyphrases from the documents generates the best labels overall comparing to other methods and baselines. © Springer-Verlag 2013.

Cite

CITATION STYLE

APA

Li, X., Chen, J., & Zaiane, O. (2013). Text document topical recursive clustering and automatic labeling of a hierarchy of document clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7819 LNAI, pp. 197–208). https://doi.org/10.1007/978-3-642-37456-2_17

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free