Adaptive and parallel data acquisition from online big graphs

Zidu Yin; Kun Yue; Hao Wu; Yingjie Su

Conference Proceedings

Adaptive and parallel data acquisition from online big graphs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10827 LNCS 323-331

DOI: 10.1007/978-3-319-91452-7_21

3Citations

2Readers

Get full text

Abstract

Acquisition of contents from online big graphs (OBGs) like linked Web pages, social networks and knowledge graphs, is critical as data infrastructure for Web applications and massive data analysis. However, effective data acquisition is challenging due to the massive, heterogeneous, dynamically evolving properties of OBGs with unknown global topological structures. In this paper, we give an adaptive and parallel approach for effective data acquisition from OBGs. We adopt the ideas of Quasi Monte Carlo (QMC) and branch & bound methods to propose an adaptive Web-scale sampling algorithm for parallel data collection implemented upon Spark. Experimental results show the effectiveness and efficiency of our method.

Author supplied keywords

Cite

CITATION STYLE

APA

Yin, Z., Yue, K., Wu, H., & Su, Y. (2018). Adaptive and parallel data acquisition from online big graphs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10827 LNCS, pp. 323–331). Springer Verlag. https://doi.org/10.1007/978-3-319-91452-7_21

Adaptive and parallel data acquisition from online big graphs

Abstract

Author supplied keywords

Cite

Register to see more suggestions