PageRank is a classic measure of node importance in large graphs and has been applied in numerous areas, including data mining, Web algorithms, recommendation systems, load balancing, search, and identifying connectivity structures. Computing PageRank for large graphs is challenging, which has motivated the study of distributed algorithms for computing it. Previously, little work has been devoted to distributed PageRank algorithms with provable complexity and accuracy guarantees. Given a graph G with n nodes, and modeling the distributed computation with the well-known congested clique model, the state-of-the-art algorithm takes O(√(log n)) communication rounds to approximate the PageRank value of each node in G with probability at least 1 − 1/n. In this paper, we present improved distributed algorithms for computing PageRank. In particular, our algorithm takes O(log log n) rounds (a significant improvement over O(√(log n)) rounds) to approximate the PageRank values with probability at least 1 − 1/n. Moreover, under a reasonable assumption, our algorithm also reduces the edge bandwidth (i.e., the maximum size of a message that can be exchanged through an edge during a communication round) by an O(log n) factor compared with the state-of-the-art algorithm. Finally, we show that our algorithm can be adapted to efficiently compute a variant of PageRank, namely the batch one-hop Personalized PageRanks, in O(log log n) communication rounds.
CITATION STYLE
Luo, S. (2019). Distributed PageRank computation: An improved theoretical study. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 4496–4503). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33014496