An efficient algorithm for deriving frequent itemsets from lossless condensed representation

Jian Tao Huang; Yi Pei Lai; Chieh Lo; Cheng Wei Wu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019), Vol. 11606 LNAI, pp. 216-229

DOI: 10.1007/978-3-030-22999-3_20

3 Citations

5 Readers

Abstract

Mining frequent itemsets (abbr. FIs) from dense databases usually generates a large number of itemsets, causing the mining algorithms to suffer from long execution times and high memory usage. The frequent closed itemsets (abbr. FCIs) are a lossless condensed representation of the FIs. Mining only the FCIs reduces execution time and memory usage. Moreover, with correct methods, the complete information of the FIs can be derived from the FCIs. Although many studies have presented efficient approaches for mining FCIs, few have developed efficient algorithms for deriving FIs from FCIs. In view of this, we propose a novel algorithm called DFI-Growth for efficiently deriving FIs from FCIs. We also propose two strategies, named maximum support selection and maximum support replacement, to guarantee that all the FIs and their supports are correctly derived by DFI-Growth. To the best of our knowledge, DFI-Growth is the first tree-based, pattern-growth algorithm for deriving FIs from FCIs. Experiments show that DFI-Growth is superior to the most advanced deriving algorithm [12] in terms of both execution time and memory consumption.
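To make the "lossless" claim concrete, the following is a minimal sketch of a naive derivation baseline. It is not DFI-Growth and does not reproduce the paper's tree structure or its two strategies. It relies only on a standard property of closed itemsets: every non-empty subset of an FCI is frequent, and the support of an itemset equals the largest support among the FCIs that contain it. The function name `derive_frequent_itemsets` and the toy data are assumptions made for illustration.

```python
from itertools import combinations


def derive_frequent_itemsets(closed_itemsets):
    """Naive baseline (not DFI-Growth): enumerate every non-empty subset of
    each FCI and keep the largest support seen for that subset.

    closed_itemsets: dict mapping frozenset of items -> support count.
    Returns a dict mapping frozenset of items -> derived support count.
    """
    frequent = {}
    for closed, support in closed_itemsets.items():
        items = sorted(closed)
        for size in range(1, len(items) + 1):
            for subset in combinations(items, size):
                key = frozenset(subset)
                # The true support of a subset is the maximum support
                # among all closed supersets, so keep the larger value.
                if support > frequent.get(key, 0):
                    frequent[key] = support
    return frequent


# Hypothetical toy database: {a,b,c}, {a,b}, {a,c}, {a}; minimum support 1.
# Its frequent closed itemsets and their supports:
fcis = {
    frozenset("a"): 4,
    frozenset("ab"): 2,
    frozenset("ac"): 2,
    frozenset("abc"): 1,
}

fis = derive_frequent_itemsets(fcis)
for itemset, support in sorted(fis.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print("".join(sorted(itemset)), support)
```

In this toy case the four FCIs expand to seven FIs. The subset enumeration is exponential in the length of each FCI and revisits shared subsets repeatedly, which is the kind of cost a dedicated deriving algorithm aims to avoid.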

Cite

Huang, J. T., Lai, Y. P., Lo, C., & Wu, C. W. (2019). An efficient algorithm for deriving frequent itemsets from lossless condensed representation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11606 LNAI, pp. 216–229). Springer Verlag. https://doi.org/10.1007/978-3-030-22999-3_20
