Finding top-k fuzzy frequent itemsets from databases

Haifeng Li; Yue Wang; Ning Zhang; Yuejin Zhang

Conference Proceedings

Finding top-k fuzzy frequent itemsets from databases

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10387 LNCS 22-30

DOI: 10.1007/978-3-319-61845-6_3

0Citations

1Readers

Get full text

Abstract

Frequent itemset mining is an important in data mining. Fuzzy data mining can more accurately describe the mining results in frequent itemset mining. Nevertheless, frequent itemsets are redundant for the users. A better way is to show the top-k results accordingly. In this paper, we define the score of fuzzy frequent itemset and propose the problem of top-k fuzzy frequent itemset mining, which, to the best of our knowledge, has never been focused on before. To address this problem, we employ a data structure named TopKFFITree to store the superset of the mining results, which has a significantly reduced size in comparison to all the fuzzy frequent itemsets. Then, we present an algorithm named TopK-FFI to build and maintain the data structure. In this algorithm, we employ a method to prune most of the fuzzy frequent itemsets immediately based on the monotony of itemset score. Theoretical analysis and experimental studies over 4 datasets demonstrate that our proposed algorithm can efficiently decrease the runtime and memory cost, and significantly outperform the naive algorithm Top-k-FFI-Miner.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, H., Wang, Y., Zhang, N., & Zhang, Y. (2017). Finding top-k fuzzy frequent itemsets from databases. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10387 LNCS, pp. 22–30). Springer Verlag. https://doi.org/10.1007/978-3-319-61845-6_3

Finding top-k fuzzy frequent itemsets from databases

Abstract

Author supplied keywords

Cite

Register to see more suggestions