SSRW: A scalable algorithm for estimating graphlet statistics based on random walk

Abstract

Graphlet statistics are widely used in social network analysis, bioinformatics, and information security. Counting graphlets exactly is challenging, however, because the number of subgraphs grows exponentially with the graph size, so sampling algorithms are commonly used to estimate graphlet statistics within a reasonable time. Existing sampling algorithms do not scale to large graphlets; for example, they may get stuck when estimating graphlets with more than five nodes. To address this issue, we propose a highly scalable algorithm, Scalable subgraph Sampling via Random Walk (SSRW), for estimating graphlet counts and concentrations. SSRW samples graphlets by generating new nodes from the neighbors of previously visited nodes instead of fixed ones. Thanks to this flexibility, SSRW can generate any k-graphlet in a unified way and estimate k-graphlet statistics efficiently even for large k. Our extensive experiments on estimating the counts and concentrations of {4,5,6,7}-graphlets show that SSRW is scalable, accurate, and fast.
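
The sampling idea in the abstract, drawing each new node from the neighbors of any previously visited node, can be illustrated with a minimal sketch. This is not the authors' SSRW implementation: it only shows the node-expansion step on a toy graph and omits the sampling-probability weights that an unbiased estimator of counts or concentrations would need. The function names `sample_connected_nodes` and `induced_edges`, the adjacency-dict representation, and the toy graph are all assumptions made for illustration.

```python
import random


def sample_connected_nodes(adj, k, rng):
    """Grow a connected set of k nodes (illustrative, not the paper's SSRW).

    Each new node is drawn uniformly from the neighbors of ANY node
    chosen so far, rather than only from the most recent node.
    Returns None if the start node's component has fewer than k nodes.
    """
    start = rng.choice(sorted(adj))
    chosen = [start]
    chosen_set = {start}
    while len(chosen) < k:
        # Candidate nodes: unvisited neighbors of every chosen node.
        frontier = sorted({v for u in chosen for v in adj[u]} - chosen_set)
        if not frontier:
            return None
        nxt = rng.choice(frontier)
        chosen.append(nxt)
        chosen_set.add(nxt)
    return chosen


def induced_edges(adj, nodes):
    """Edges of the subgraph induced by `nodes`, each as (u, v) with u < v."""
    node_set = set(nodes)
    return sorted(
        (u, v) for u in node_set for v in adj[u] if v in node_set and u < v
    )


# Hypothetical toy graph as an undirected adjacency dict.
adj = {
    0: [1, 2],
    1: [0, 2, 3],
    2: [0, 1, 3],
    3: [1, 2, 4],
    4: [3],
}

rng = random.Random(0)
nodes = sample_connected_nodes(adj, 4, rng)
edges = induced_edges(adj, nodes)
print(nodes, edges)
```

Because the candidate set is rebuilt from all chosen nodes at every step, the same loop works for any k, which is the flexibility the abstract attributes to sampling from previously visited nodes. Turning such samples into count or concentration estimates would additionally require correcting for how likely each subgraph is to be drawn, which this sketch does not attempt.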

Cite

Yang, C., Lyu, M., Li, Y., Zhao, Q., & Xu, Y. (2018). SSRW: A scalable algorithm for estimating graphlet statistics based on random walk. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10827 LNCS, pp. 272–288). Springer Verlag. https://doi.org/10.1007/978-3-319-91452-7_18
