Efficiently mining maximal diverse frequent itemsets

Dingming Wu; Dexin Luo; Christian S. Jensen; Joshua Zhexue Huang

Conference Proceedings

Efficiently mining maximal diverse frequent itemsets

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11447 LNCS 191-207

DOI: 10.1007/978-3-030-18579-4_12

2Citations

16Readers

Get full text

Abstract

Given a database of transactions, where each transaction is a set of items, maximal frequent itemset mining aims to find all itemsets that are frequent, meaning that they consist of items that co-occur in transactions more often than a given threshold, and that are maximal, meaning that they are not contained in other frequent itemsets. Such itemsets are the most interesting ones in a meaningful sense. We study the problem of efficiently finding such itemsets with the added constraint that only the top-k most diverse ones should be returned. An itemset is diverse if its items belong to many different categories according to a given hierarchy of item categories. We propose a solution that relies on a purposefully designed index structure called the FP*-tree and an accompanying bound-based algorithm. An extensive experimental study offers insight into the performance of the solution, indicating that it is capable of outperforming an existing method by orders of magnitude and of scaling to large databases of transactions.

Author supplied keywords

Cite

CITATION STYLE

APA

Wu, D., Luo, D., Jensen, C. S., & Huang, J. Z. (2019). Efficiently mining maximal diverse frequent itemsets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11447 LNCS, pp. 191–207). Springer Verlag. https://doi.org/10.1007/978-3-030-18579-4_12

Efficiently mining maximal diverse frequent itemsets

Abstract

Author supplied keywords

Cite

Register to see more suggestions