Association rule mining is one of prominent techniques to discover the relation between data items of a transactional data. The process of mining has been simplified by considering only the frequent itemsets. Pincer search is one of the frequent itemset mining method which combines top-down and bottom-up search techniques to get the benefits of both. Top-down approach in Pincer search reduces the number of candidates in pass of iterations and saves a lot of computing resources. In this work, we present a Parallel Pincer Search (PPS) which is based on distributed implementation on Spark framework. We have converted the search algorithm according to the Spark framework to make it run in parallel. Spark provides a lot of features for the iterative algorithm such as in-memory execution, efficient data structure, better fault tolerant method, etc. We implemented the PPS on a Spark cluster with multiple datasets and analysed the performance.
CITATION STYLE
Sethi, K. K., Dharavath, R., & Nyakotey, S. (2018). PPS: Parallel pincer search for mining frequent itemsets based on spark. In Advances in Intelligent Systems and Computing (Vol. 614, pp. 351–363). Springer Verlag. https://doi.org/10.1007/978-3-319-60618-7_35
Mendeley helps you to discover research relevant for your work.