Collect Ethically: Reduce Bias in Twitter Datasets

1Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The Twitter platform is appealing to researchers due to the ease of obtaining data and the ability to analyze and produce results rapidly. However, sampling Twitter data for research purposes needs to be regulated to produce unbiased results. In this paper, factors that lead to sampling bias are addressed, case studies that have been encountered are presented, and an approach is proposed to reduce sampling bias and flaws in datasets collected from Twitter. Then, experiments are conducted on two case studies, and a larger dataset is achieved by following the proposed guideline. The results indicate that using multiple Twitter application programming interfaces (APIs) for data collection is the best way to obtain a randomly sampled dataset.

Cite

CITATION STYLE

APA

Alkulaib, L., Alhamadani, A., Ji, T., & Lu, C. T. (2020). Collect Ethically: Reduce Bias in Twitter Datasets. In Communications in Computer and Information Science (Vol. 1070 CCIS, pp. 106–114). Springer. https://doi.org/10.1007/978-3-030-46140-9_11

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free