Clustering transactions with an unbalanced hierarchical product structure

Min Tzu Wang; Ping Yu Hsu; K. C. Lin; Shiuann Shuoh Chen

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4654 LNCS, pp. 251–261 (2007)

DOI: 10.1007/978-3-540-74553-2_23

Abstract

Datasets extracted from large retail stores are often sparse: they contain a huge number of items and transactions, yet each transaction contains only a few items. Such data lead to basket analysis with extremely low item support, customer clustering with large intra-cluster distances, and transaction classification with huge classification trees. A similarity measure that counts the depth of the least common ancestor of two items, normalized by the depth of the concept tree, lifts the limitation of binary equality. However, it produces counterintuitive results when the concept hierarchy is unbalanced, since two items in deeper subtrees are very likely to have a higher similarity than two items in shallower subtrees. To solve this issue, the research proposes calculating the distance between two items by counting the edge traversals needed to link them. The method is straightforward yet achieves better performance on retail store data when the concept hierarchy is unbalanced. © Springer-Verlag Berlin Heidelberg 2007.
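The contrast between the two measures can be illustrated with a small sketch. The abstract does not give the paper's exact formulas, so the code below is only one plausible reading: `lca_similarity` divides the depth of the least common ancestor by the depth of the whole tree, and `edge_distance` counts the edges on the path joining two items. The toy hierarchy `PARENT` and all function names are hypothetical and chosen purely for illustration.

```python
from typing import Dict, List, Optional

# Hypothetical unbalanced concept hierarchy (child -> parent; the root maps to None).
# The "food" branch is deeper than the "toys" branch.
PARENT: Dict[str, Optional[str]] = {
    "root": None,
    "food": "root",
    "toys": "root",
    "dairy": "food",
    "milk": "dairy",
    "whole_milk": "milk",
    "skim_milk": "milk",
    "doll": "toys",
    "ball": "toys",
}


def path_to_root(item: str, parent: Dict[str, Optional[str]]) -> List[str]:
    """Return the nodes from `item` up to the root, inclusive."""
    path = [item]
    while parent[path[-1]] is not None:
        path.append(parent[path[-1]])
    return path


def depth(item: str, parent: Dict[str, Optional[str]]) -> int:
    """Number of edges between `item` and the root."""
    return len(path_to_root(item, parent)) - 1


def least_common_ancestor(a: str, b: str, parent: Dict[str, Optional[str]]) -> str:
    """Deepest node that lies on both root paths."""
    ancestors_of_a = set(path_to_root(a, parent))
    # Walking upward from b, the first shared node is the deepest one.
    for node in path_to_root(b, parent):
        if node in ancestors_of_a:
            return node
    raise ValueError("items do not share a root")


def lca_similarity(a: str, b: str, parent: Dict[str, Optional[str]]) -> float:
    """Assumed form of the baseline: depth of the LCA / depth of the tree."""
    tree_depth = max(depth(node, parent) for node in parent)
    return depth(least_common_ancestor(a, b, parent), parent) / tree_depth


def edge_distance(a: str, b: str, parent: Dict[str, Optional[str]]) -> int:
    """Assumed form of the proposal: edges traversed to link a and b."""
    lca = least_common_ancestor(a, b, parent)
    return depth(a, parent) + depth(b, parent) - 2 * depth(lca, parent)


# Two sibling pairs, one in the deep branch and one in the shallow branch.
print(lca_similarity("whole_milk", "skim_milk", PARENT))  # 0.75
print(lca_similarity("doll", "ball", PARENT))             # 0.25
print(edge_distance("whole_milk", "skim_milk", PARENT))   # 2
print(edge_distance("doll", "ball", PARENT))              # 2
```

In this toy tree both pairs are siblings, yet the depth-based measure rates the pair in the deeper branch three times as similar, while the edge count treats the two pairs identically. This is the kind of imbalance effect the abstract describes, under the assumptions stated above.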

Cite

Wang, M. T., Hsu, P. Y., Lin, K. C., & Chen, S. S. (2007). Clustering transactions with an unbalanced hierarchical product structure. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4654 LNCS, pp. 251–261). Springer Verlag. https://doi.org/10.1007/978-3-540-74553-2_23
