PPS: Parallel pincer search for mining frequent itemsets based on Spark

Abstract

Association rule mining is one of the prominent techniques for discovering relations between data items in transactional data. The mining process is simplified by considering only the frequent itemsets. Pincer search is a frequent itemset mining method that combines top-down and bottom-up search to get the benefits of both. The top-down search in Pincer search reduces the number of candidates in each pass and saves considerable computing resources. In this work, we present Parallel Pincer Search (PPS), a distributed implementation on the Spark framework. We have adapted the search algorithm to the Spark framework so that it runs in parallel. Spark provides many features suited to iterative algorithms, such as in-memory execution, efficient data structures, and better fault tolerance. We implemented PPS on a Spark cluster, ran it on multiple datasets, and analysed the performance.
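
To make the two search directions concrete, here is a minimal single-machine sketch in plain Python. It is not the authors' PPS code and does not use Spark. It is also a simplified variant of Pincer search: it keeps the full set of frequent itemsets at each level, which avoids the recovery step of the original algorithm. The function names (`support`, `update_mfcs`, `pincer_sketch`), the `min_count` parameter, and the toy transactions are all assumptions made for illustration.

```python
from itertools import combinations


def support(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def update_mfcs(mfcs, infrequent):
    """Top-down step: split every MFCS element that contains an infrequent itemset."""
    for s in infrequent:
        for m in [m for m in mfcs if s <= m]:
            mfcs.remove(m)
            for item in s:
                smaller = m - {item}
                # Keep only itemsets not already covered by another element.
                if smaller and not any(smaller <= other for other in mfcs):
                    mfcs.add(smaller)
    return mfcs


def pincer_sketch(transactions, min_count):
    """Return the maximal frequent itemsets (simplified Pincer-style search)."""
    transactions = [frozenset(t) for t in transactions]
    items = frozenset().union(*transactions)
    mfcs = {items}   # maximal frequent candidate set, starts as one big itemset
    mfs = set()      # maximal frequent itemsets confirmed so far
    candidates = {frozenset([i]) for i in items}
    k = 1
    while candidates:
        # Top-down: an MFCS element that turns out to be frequent is maximal.
        for m in list(mfcs):
            if m not in mfs and support(m, transactions) >= min_count:
                mfs.add(m)
        # Bottom-up: subsets of a known maximal itemset need no counting.
        frequent, infrequent = set(), set()
        for c in candidates:
            if any(c <= m for m in mfs) or support(c, transactions) >= min_count:
                frequent.add(c)
            else:
                infrequent.add(c)
        update_mfcs(mfcs, infrequent)
        # Apriori-style join and prune to build the next level of candidates.
        k += 1
        joined = {a | b for a in frequent for b in frequent if len(a | b) == k}
        candidates = {c for c in joined
                      if all(frozenset(s) in frequent for s in combinations(c, k - 1))}
    # Classify any MFCS elements produced by the final update.
    mfs |= {m for m in mfcs if support(m, transactions) >= min_count}
    return mfs


toy = [{"a", "b", "c", "d"}, {"a", "b", "c", "d"}, {"a", "b", "c"}, {"a", "e"}, {"e"}]
print(sorted(sorted(m) for m in pincer_sketch(toy, min_count=2)))
# prints [['a', 'b', 'c', 'd'], ['e']]
```

On the toy data, the top-down side confirms `{a, b, c, d}` as frequent at the start of the third pass, so the size-3 and size-4 candidates are accepted without scanning the transactions again. In a distributed setting such as the one the abstract describes, `support` is presumably the step that would be spread across partitions; the sketch leaves it as an ordinary function.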

Cite

Sethi, K. K., Dharavath, R., & Nyakotey, S. (2018). PPS: Parallel pincer search for mining frequent itemsets based on spark. In Advances in Intelligent Systems and Computing (Vol. 614, pp. 351–363). Springer Verlag. https://doi.org/10.1007/978-3-319-60618-7_35
