Abstract
To apply link prediction methods into large-scale complex network, this paper designs and implements a parallel link prediction algorithm based on MapReduce, which includes nine similarity Indices via local information. The parallel link prediction algorithm has a time complexity of O(N) in sparse networks. First, the paper verifies the validity of the algorithm on public datasets, increase in the extraction factor, recall ascends, and precision descends. The experimental results on ten large-scale datasets of variety network types show that the parallel link prediction algorithm is more effective than traditional ones, and its running time decreases with more compute units. The upper and lower bounds of AUC (area under a receiver operating characteristic curve) are proposed. The experimental results show the median of the upper and lower bounds are close to the real value of AUC, which focuses on whether prediction score is zero rather than the actual score value. The network average clustering coefficient has the greatest impact on AUC among most topological features and AUC rises as the network average clustering coefficient increases. © Copyright 2012, Institute of Software, the Chinese Academy of Science. All right reserved.
Author supplied keywords
Cite
CITATION STYLE
Rao, J., Wu, B., & Dong, Y. X. (2012). Parallel link prediction in complex network using mapreduce. Ruan Jian Xue Bao/Journal of Software, 23(12), 3175–3186. https://doi.org/10.3724/SP.J.1001.2012.04206
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.