Clustering transactions using large items

Ke Wang; Chu Xu; Bing Liu

Conference ProceedingsOPEN ACCESS

Clustering transactions using large items

International Conference on Information and Knowledge Management, Proceedings (1999) 483-490

DOI: 10.1145/319950.320054

147Citations

35Readers

Abstract

In traditional data clustering, similarity of a cluster of objects is measured by pairwise similarity of objects in that cluster. We argue that such measures are not appropriate for transactions that are sets of items. We propose the notion of large items, i.e., items contained in some minimum fraction of transactions in a cluster, to measure the similarity of a cluster of transactions. The intuition of our clustering criterion is that there should be many large items within a cluster and little overlapping of such items across clusters. We discuss the rationale behind our approach and its implication on providing a better solution to the clustering problem. We present a clustering algorithm based on the new clustering criterion and evaluate its effectiveness.

Cite

CITATION STYLE

APA

Wang, K., Xu, C., & Liu, B. (1999). Clustering transactions using large items. In International Conference on Information and Knowledge Management, Proceedings (pp. 483–490). ACM. https://doi.org/10.1145/319950.320054

Clustering transactions using large items

Abstract

Cite

Register to see more suggestions