Abstract
This paper considers the use of the single linkage, complete linkage, group average and Ward hierarchic agglomerative clustering methods for document retrieval. The methods are used to cluster seven document test collections for which queries and relevance judgements are available. Several retrieval strategies are described for searching the clustered document files that result from the four methods. These searches suggest that the group average method is the most suitable for document clustering purposes; however, searches of the unclustered document collections, and of a simpler type of clustered file (based on pairs of nearest neighbours), usually result in better levels of retrieval effectiveness than searches of the clustered collections.
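The four methods named above differ only in how the distance between two clusters is defined. The following is a minimal illustrative sketch, not the authors' implementation: it assumes Euclidean distance over toy term-frequency vectors (the abstract does not state which similarity coefficient was used), and the names `docs`, `METHODS`, `cluster_distance` and `agglomerate` are hypothetical.

```python
from itertools import combinations
from math import dist

# Hypothetical toy term-frequency vectors, one row per document.
docs = [
    [2.0, 1.0, 0.0, 0.0],
    [3.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 2.0, 1.0],
    [0.0, 0.0, 1.0, 2.0],
    [1.0, 0.0, 0.0, 3.0],
]

METHODS = ("single", "complete", "average", "ward")


def centroid(cluster, vectors):
    """Mean vector of the documents whose indices are in `cluster`."""
    dims = len(vectors[0])
    return [sum(vectors[i][d] for i in cluster) / len(cluster) for d in range(dims)]


def cluster_distance(a, b, vectors, method):
    """Inter-cluster distance under one of the four linkage criteria."""
    pairwise = [dist(vectors[i], vectors[j]) for i in a for j in b]
    if method == "single":      # closest pair of members
        return min(pairwise)
    if method == "complete":    # farthest pair of members
        return max(pairwise)
    if method == "average":     # mean over all cross-cluster pairs
        return sum(pairwise) / len(pairwise)
    if method == "ward":        # increase in within-cluster sum of squares
        weight = len(a) * len(b) / (len(a) + len(b))
        return weight * dist(centroid(a, vectors), centroid(b, vectors)) ** 2
    raise ValueError(f"unknown method: {method}")


def agglomerate(vectors, method, n_clusters):
    """Naively merge the two closest clusters until n_clusters remain."""
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > n_clusters:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: cluster_distance(
                clusters[p[0]], clusters[p[1]], vectors, method
            ),
        )
        clusters[a] = clusters[a] + clusters[b]  # a < b, so deleting b is safe
        del clusters[b]
    return clusters


for method in METHODS:
    print(method, agglomerate(docs, method, n_clusters=2))
```

This naive version recomputes every inter-cluster distance at each step, so it is suitable only for small illustrations; it says nothing about the retrieval strategies or effectiveness results reported in the paper.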
Cite
El-Hamdouchi, A., & Willett, P. (1989). Comparison of hierarchic agglomerative clustering methods for document retrieval. Computer Journal, 32(3), 220–227. https://doi.org/10.1093/comjnl/32.3.220