A fast APRIORI implementation

Ferenc Bodon

Conference Proceedings

A fast APRIORI implementation

Bodon F

Edited by

Goethals B
Zaki M

Proc. of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations (FIMI'03) (2003) 90

N/ACitations

42Readers

Abstract

The efficiency of frequent itemset mining algorithms is determined mainly by three factors: the way candidates are generated, the data structure that is used and the implemen- tation details. Most papers focus on the first factor, some describe the underlying data structures, but implementa- tion details are almost always neglected. In this paper we show that the effect of implementation can be more impor- tant than the selection of the algorithm. Ideas that seem to be quite promising, may turn out to be ineffective if we descend to the implementation level. We theoretically and experimentally analyze APRIORI which is the most established algorithm for frequent item- set mining. Several implementations of the algorithm have been put forward in the last decade. Although they are im- plementations of the very same algorithm, they display large differences in running time and memory need. In this pa- per we describe an implementation of APRIORI that out- performs all implementations known to us. We analyze, the- oretically and experimentally, the principal data structure of our solution. This data structure is the main factor in the efficiency of our implementation. Moreover, we present a simple modification of APRIORI that appears to be faster than the original algorithm.

Cite

CITATION STYLE

APA

Bodon, F. (2003). A fast APRIORI implementation. In B. Goethals & M. J. Zaki (Eds.), Proc. of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations (FIMI’03) (Vol. 90). Melbourne, Florida, USA.

A fast APRIORI implementation

Abstract

Cite

Register to see more suggestions