A high-performance algorithm for frequent itemset mining

Jun Feng Qu; Mengchi Liu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7418 LNCS 71-82

DOI: 10.1007/978-3-642-32281-5_8

2 Citations

3 Readers

Abstract

Frequent itemsets, also called frequent patterns, carry important information about databases, and mining them efficiently is a core problem in data mining. Pattern growth approaches, such as the classic FP-Growth algorithm and the efficient FPgrowth* algorithm, can solve the problem. These approaches mine frequent itemsets by recursively constructing conditional databases, which are usually represented by prefix-trees. The three major costs of such approaches are prefix-tree traversal, support counting, and prefix-tree construction. This paper presents a novel pattern growth algorithm called BFP-growth in which the three costs are greatly reduced. We compare the costs of BFP-growth, FP-Growth, and FPgrowth*, and show that those of BFP-growth are the lowest. Experimental data show that BFP-growth outperforms not only FP-Growth and FPgrowth* but also several well-known algorithms, including dEclat and LCM, which are among the fastest algorithms, on various databases. © 2012 Springer-Verlag.
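
To make the pattern growth idea in the abstract concrete, the following is a minimal sketch of recursive conditional-database mining. It is not BFP-growth, FP-Growth, or FPgrowth*: it assumes conditional databases are stored as plain Python lists rather than prefix-trees, so it illustrates only the recursion and the support-counting step, not the cost reductions the paper describes. The function name `pattern_growth` and its parameters are hypothetical and chosen for illustration.

```python
from collections import Counter


def pattern_growth(transactions, min_support, suffix=()):
    """Return {frozenset(itemset): support} for all frequent itemsets.

    Illustrative only: conditional databases are plain lists here,
    whereas the algorithms named in the abstract use prefix-trees.
    """
    # Support counting: number of transactions that contain each item.
    counts = Counter(item for t in transactions for item in set(t))
    # Fix an order on the frequent items so each itemset is generated once.
    frequent = sorted(i for i, c in counts.items() if c >= min_support)

    results = {}
    for idx, item in enumerate(frequent):
        results[frozenset(suffix + (item,))] = counts[item]

        # Conditional database of `item`: the transactions that contain it,
        # restricted to frequent items that come later in the order.
        later = set(frequent[idx + 1:])
        conditional = [
            [x for x in t if x in later]
            for t in transactions
            if item in t
        ]
        conditional = [t for t in conditional if t]

        # Recurse: frequent items in the conditional database extend the
        # current itemset.
        results.update(
            pattern_growth(conditional, min_support, suffix + (item,))
        )
    return results


if __name__ == "__main__":
    db = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
    for itemset, support in pattern_growth(db, min_support=3).items():
        print(sorted(itemset), support)
```

Each recursive call rebuilds and rescans a conditional database, which is where the traversal, counting, and construction costs mentioned in the abstract arise in tree-based implementations.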

Cite

Qu, J. F., & Liu, M. (2012). A high-performance algorithm for frequent itemset mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7418 LNCS, pp. 71–82). https://doi.org/10.1007/978-3-642-32281-5_8
