Abstract
Estimating the relevance or proximity between vertices in a network is a fundamental building block of network analysis and is useful in a wide range of important applications such as network-aware searches and network structure prediction. In this paper, we(1)propose to use top-kshortest-path distanceas a relevance measure, and(2)design an efficient indexing scheme for answering top-kdistance queries. Although many indexing methods have been developed for standard (top-1) distance queries, no methods can be directly applied to top-kdistance. Therefore, we develop a new framework for top-kdistance queries based on2-hop coverand then present an efficient indexing algorithm based on the recently proposedpruned landmark labelingscheme. The scalability, efficiency and robustness of our method are demonstrated in extensive experimental results. It can construct indices from large graphs comprising millions of vertices and tens of millions of edges within a reasonable running time. Having obtained the indices, we can compute the top-kdistances within a few microseconds, six orders of magnitude faster than existing methods, which require a few seconds to compute these distances. Moreover, we demonstrate the usefulness of top-kdistance as a relevance measure by applying them to link prediction, the most fundamental problem in graph data mining. We emphasize that the proposed indexing method enables the first use of top-k distance for such tasks.
Author supplied keywords
Cite
CITATION STYLE
Akiba, T., Hayashi, T., Nori, N., Iwata, Y., & Yoshida, Y. (2016). Fast top-k distance retrieval for network structural proximity features. Transactions of the Japanese Society for Artificial Intelligence, 31(2). https://doi.org/10.1527/tjsai.B-F71
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.