Comparison of hierarchic agglomerative clustering methods for document retrieval

This article is free to access.

Abstract

This paper compares four hierarchic agglomerative clustering methods for document retrieval: single linkage, complete linkage, group average and Ward. The methods are used to cluster seven document test collections for which queries and relevance judgements are available. Several retrieval strategies are described for searching the clustered document files that the four methods produce. These searches suggest that the group average method is the most suitable for document clustering; however, searches of the unclustered document collections, and of a simpler type of clustered file based on pairs of nearest neighbours, usually give better retrieval effectiveness than searches of the clustered collections.
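To illustrate how the four methods named in the abstract differ, here is a minimal pure-Python sketch of agglomerative clustering. It is not the authors' implementation. The abstract does not say which similarity measure or algorithm the paper uses, so the Euclidean distance, the toy two-dimensional "document" vectors, and the function names `cluster_distance` and `agglomerate` are all assumptions made for this example. The brute-force search for the closest pair is only practical for very small inputs.

```python
from itertools import combinations
from math import dist


def centroid(points):
    """Mean vector of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def cluster_distance(a, b, method):
    """Distance between clusters a and b under the chosen criterion.

    Assumption: Euclidean distance between vectors (not taken from the paper).
    """
    pairwise = [dist(p, q) for p in a for q in b]
    if method == "single":      # closest pair of members
        return min(pairwise)
    if method == "complete":    # farthest pair of members
        return max(pairwise)
    if method == "average":     # group average: mean over all cross-cluster pairs
        return sum(pairwise) / len(pairwise)
    if method == "ward":        # increase in within-cluster sum of squares on merging
        weight = len(a) * len(b) / (len(a) + len(b))
        return weight * dist(centroid(a), centroid(b)) ** 2
    raise ValueError(f"unknown method: {method}")


def agglomerate(points, method):
    """Merge the two closest clusters repeatedly; return the merge history."""
    clusters = [[tuple(p)] for p in points]   # start with one cluster per document
    merges = []
    while len(clusters) > 1:
        # Brute-force search over all cluster pairs for the smallest distance.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: cluster_distance(clusters[ij[0]], clusters[ij[1]], method),
        )
        merges.append((sorted(clusters[i]), sorted(clusters[j])))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges


# Hypothetical toy data standing in for document vectors.
docs = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0), (6.5, 0.0), (20.0, 0.0)]
for method in ("single", "complete", "average", "ward"):
    print(method, agglomerate(docs, method))
```

All four methods share the same merge loop. They differ only in how the distance between two clusters is defined, and this is what gives each method's hierarchy its own shape.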

Cite

El-Hamdouchi, A., & Willett, P. (1989). Comparison of hierarchic agglomerative clustering methods for document retrieval. Computer Journal, 32(3), 220–227. https://doi.org/10.1093/comjnl/32.3.220
