An efficient ranking-centered density-based document clustering method

Wathsala Anupama Mohotti; Richi Nayak

Conference Proceedings

An efficient ranking-centered density-based document clustering method

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10939 LNAI 439-451

DOI: 10.1007/978-3-319-93040-4_35

5Citations

6Readers

Get full text

Abstract

Document clustering is a popular method for discovering useful information from text data. This paper proposes an innovative hybrid document clustering method based on the novel concepts of ranking, density and shared neighborhood. We utilize ranked documents generated from a search engine to effectively build a graph of shared relevant documents. The high density regions in the graph are processed to form initial clusters. The clustering decisions are further refined using the shared neighborhood information. Empirical analysis shows that the proposed method is able to produce accurate and efficient solution as compared to relevant benchmarking methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Mohotti, W. A., & Nayak, R. (2018). An efficient ranking-centered density-based document clustering method. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10939 LNAI, pp. 439–451). Springer Verlag. https://doi.org/10.1007/978-3-319-93040-4_35

An efficient ranking-centered density-based document clustering method

Abstract

Author supplied keywords

Cite

Register to see more suggestions