In order to improve the efficiency of the PageRank algorithm, parallelizing methods, especially the ones based on MapReduce, interest many researchers during the past several years. Previous implementations of the PageRank algorithm on MapReduce ignore the characteristic of locality in distributed systems which is very important to reduce the I/O and network costs. In this paper, we explore the locality property and propose a new method for fast PageRank computation by supporting a subgraph as an input record for map functions. Graph partitioning techniques and a message grouping method are employed to guarantee the efficiency of communication among different subgraphs. Experiments show that our method is significantly more efficient than previous approaches without accuracy loss. The key idea to change the granularity of basic processing units from edges to subgraphs can benefit many other parallelizing algorithms for graph processing. © 2013 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Li, Q., Wang, W., Wang, P., Dai, K., Wang, Z., Wang, Y., & Sun, W. (2013). Combination of in-memory graph computation with mapreduce: A subgraph-centric method of pagerank. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7923 LNCS, pp. 173–178). Springer Verlag. https://doi.org/10.1007/978-3-642-38562-9_18
Mendeley helps you to discover research relevant for your work.