PPS: Parallel pincer search for mining frequent itemsets based on spark

Krishan Kumar Sethi; Ramesh Dharavath; Samuel Nyakotey

Conference Proceedings

PPS: Parallel pincer search for mining frequent itemsets based on spark

Advances in Intelligent Systems and Computing (2018) 614 351-363

DOI: 10.1007/978-3-319-60618-7_35

1Citations

3Readers

Get full text

Abstract

Association rule mining is one of prominent techniques to discover the relation between data items of a transactional data. The process of mining has been simplified by considering only the frequent itemsets. Pincer search is one of the frequent itemset mining method which combines top-down and bottom-up search techniques to get the benefits of both. Top-down approach in Pincer search reduces the number of candidates in pass of iterations and saves a lot of computing resources. In this work, we present a Parallel Pincer Search (PPS) which is based on distributed implementation on Spark framework. We have converted the search algorithm according to the Spark framework to make it run in parallel. Spark provides a lot of features for the iterative algorithm such as in-memory execution, efficient data structure, better fault tolerant method, etc. We implemented the PPS on a Spark cluster with multiple datasets and analysed the performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Sethi, K. K., Dharavath, R., & Nyakotey, S. (2018). PPS: Parallel pincer search for mining frequent itemsets based on spark. In Advances in Intelligent Systems and Computing (Vol. 614, pp. 351–363). Springer Verlag. https://doi.org/10.1007/978-3-319-60618-7_35

PPS: Parallel pincer search for mining frequent itemsets based on spark

Abstract

Author supplied keywords

Cite

Register to see more suggestions