Practical Methods for Imputing Follower Count Dynamics

1Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Microblogging sites have become important data sources for studying network dynamics and information transmission. Both areas of study, however, require accurate counts of indegree, or follower counts; unfortunately, collection of complete time series on follower counts can be limited by application programming interface constraints, system failures, or temporal constraints. In addition, there is almost always a time difference between the point at which follower counts are queried and the time a user posts a tweet. Here, we consider the use of three classes of simple, easily implemented methods for follower imputation: polynomial functions, splines, and generalized linear models. We evaluate the performance of each method via a case study of accounts from 236 health organizations during the 2014 Ebola outbreak. For accurate interpolation and extrapolation, we find that negative binomial regression, modeled separately for each account, using time as an interval variable, accurately recovers missing values while retaining narrow prediction intervals.

Cite

CITATION STYLE

APA

Ben Gibson, C., Sutton, J., Vos, S. K., & Butts, C. T. (2023). Practical Methods for Imputing Follower Count Dynamics. Sociological Methods and Research, 52(1), 412–437. https://doi.org/10.1177/0049124120926210

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free