Sampling representative users from large social networks

14Citations
Citations of this article
31Readers
Mendeley users who have this article in their library.

Abstract

Finding a subset of users to statistically represent the original social network is a fundamental issue in Social Network Analysis (SNA). The problem has not been extensively studied in existing literature. In this paper, we present a formal definition of the problem of sampling representative users from social network. We propose two sampling models and theoretically prove their NP-hardness. To efficiently solve the two models, we present an efficient algorithm with provable approximation guarantees. Experimental results on two datasets show that the proposed models for sampling representative users significantly outperform (+6%-23% in terms of Precision® 100) several alternative methods using authority or structure information only. The proposed algorithms are also effective in terms of time complexity. Only a few seconds are needed to sampling 300 representative users from a network of 100,000 users. All data and codes are publicly available.

Cite

CITATION STYLE

APA

Tang, J., Zhang, C., Cai, K., Zhang, L., & Su, Z. (2015). Sampling representative users from large social networks. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 304–310). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9202

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free