Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach

Meihao Liao; Rong Hua Li; Qiangqiang Dai; Guoren Wang

Conference ProceedingsOPEN ACCESS

Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach

Proceedings of the ACM SIGMOD International Conference on Management of Data (2022) 2048-2061

DOI: 10.1145/3514221.3526140

10Citations

8Readers

Get full text

Abstract

Computing the personalized PageRank vector is a fundamental problem in graph analysis. In this paper, we propose several novel algorithms to efficiently compute the personalized PageRank vector with a decay factor α based on an interesting connection between the personalized PageRank values and the weights of random spanning forests of the graph. Such a connection is derived based on a newly-developed matrix forest theorem on graphs. Based on this, we present an efficient spanning forest sampling algorithm via simulating loop-erased α-random walks to estimate the personalized PageRank vector. Compared to all existing methods, a striking feature of our approach is that its performance is insensitive w.r.t. (with respect to) the parameter α. As a consequence, our algorithm is often much faster than the state-of-the-art algorithms when α is small, which is the demanding case for many graph analysis tasks. We show that our technique can significantly improve the efficiency of the state-of-the-art algorithms for answering two well-studied personalized PageRank queries, including single source query and single target query. Extensive experiments on seven large real-world graphs demonstrate the efficiency of the proposed method.

Author supplied keywords

Cite

CITATION STYLE

APA

Liao, M., Li, R. H., Dai, Q., & Wang, G. (2022). Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2048–2061). Association for Computing Machinery. https://doi.org/10.1145/3514221.3526140

Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach

Abstract

Author supplied keywords

Cite

Register to see more suggestions