Searching frequent itemsets by clustering data: Towards a parallel approach using mapreduce

Maria Malek; Hubert Kadima

Conference Proceedings

Searching frequent itemsets by clustering data: Towards a parallel approach using mapreduce

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7652 LNCS 251-258

DOI: 10.1007/978-3-642-38333-5_26

9Citations

9Readers

Get full text

Abstract

We propose a new algorithm for searching frequent itemsets in large data bases. The idea is to start searching from a set of representative examples instead of testing the 1-itemset,the k-itemset and so on. A clustering algorithm is firstly applied in order to cluster the transactions into k clusters. The set of the k representative examples will be used as the starting point for searching frequent itemsets. Each cluster is represented by the most representative example. We show some preliminary results and we then propose a parallel version of this algorithm based on the MapReduce Framework. © 2013 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Malek, M., & Kadima, H. (2013). Searching frequent itemsets by clustering data: Towards a parallel approach using mapreduce. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7652 LNCS, pp. 251–258). https://doi.org/10.1007/978-3-642-38333-5_26

Searching frequent itemsets by clustering data: Towards a parallel approach using mapreduce

Abstract

Author supplied keywords

Cite

Register to see more suggestions