Research in social network analysis and statistical relational learning has produced a number of methods for learning relational models from large-scale network data. Unfortunately, these methods have been developed under the unrealistic assumption of full data access. In practice, however, the data are often collected by crawling the network, due to proprietary access, limited resources, and privacy concerns. While prior studies have examined the impact of network crawling on the structural characteristics of the resulting samples, this work presents the first empirical study designed to assess the impact of widely used network crawlers on the estimation of peer effects. Our experiments demonstrate that the estimates obtained from network samples collected by existing crawlers can be quite inaccurate, unless a significant portion of the network is crawled. Meanwhile, motivated by recent advances in partial network crawling, we develop crawl-aware relational methods that provide accurate estimates of peer effects with statistical guarantees from partial crawls.
CITATION STYLE
Yang, J., Ribeiro, B., & Neville, J. (2017). Should we be confident in peer effects estimated from partial crawls of social networks? In Proceedings of the 11th International Conference on Web and Social Media, ICWSM 2017 (pp. 708–711). AAAI Press.
Mendeley helps you to discover research relevant for your work.