Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach

10Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Computing the personalized PageRank vector is a fundamental problem in graph analysis. In this paper, we propose several novel algorithms to efficiently compute the personalized PageRank vector with a decay factor α based on an interesting connection between the personalized PageRank values and the weights of random spanning forests of the graph. Such a connection is derived based on a newly-developed matrix forest theorem on graphs. Based on this, we present an efficient spanning forest sampling algorithm via simulating loop-erased α-random walks to estimate the personalized PageRank vector. Compared to all existing methods, a striking feature of our approach is that its performance is insensitive w.r.t. (with respect to) the parameter α. As a consequence, our algorithm is often much faster than the state-of-the-art algorithms when α is small, which is the demanding case for many graph analysis tasks. We show that our technique can significantly improve the efficiency of the state-of-the-art algorithms for answering two well-studied personalized PageRank queries, including single source query and single target query. Extensive experiments on seven large real-world graphs demonstrate the efficiency of the proposed method.

Author supplied keywords

Cite

CITATION STYLE

APA

Liao, M., Li, R. H., Dai, Q., & Wang, G. (2022). Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2048–2061). Association for Computing Machinery. https://doi.org/10.1145/3514221.3526140

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free