Efficient and scalable mining of frequent subgraphs using distributed graph processing systems

Tongtong Wang; Hao Huang; Wei Lu; Zhe Peng; Xiaoyong Du

Conference Proceedings

Efficient and scalable mining of frequent subgraphs using distributed graph processing systems

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10827 LNCS 891-907

DOI: 10.1007/978-3-319-91452-7_57

2Citations

4Readers

Get full text

Abstract

Mining frequent subgraphs in large scale graph data sets helps reveal underlying knowledge. Since the mining approaches in centralized systems are often bottlenecked on calculation capacity, many parallelized solutions based on the MapReduce framework are proposed to scale out the mining process, which usually extracts frequent subgraphs in an iterative way. Nonetheless, the efficiency and scalability of these MapReduce based approaches are still bounded by the communication cost for passing the intermediate results and the unbalanced workload after a few iterations. In this paper, we propose an efficient and scalable framework for frequent subgraph mining by using distributed graph processing systems. It adopts a message-passing-free scheme among workers to reduce the communication cost, and utilizes a task scheduler to dynamically balance the workload. Experimental results on both synthetic and real-world data sets verify the efficacy of our proposed framework.

Cite

CITATION STYLE

APA

Wang, T., Huang, H., Lu, W., Peng, Z., & Du, X. (2018). Efficient and scalable mining of frequent subgraphs using distributed graph processing systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10827 LNCS, pp. 891–907). Springer Verlag. https://doi.org/10.1007/978-3-319-91452-7_57

Efficient and scalable mining of frequent subgraphs using distributed graph processing systems

Abstract

Cite

Register to see more suggestions