MapReduce and spark-based analytic framework using social media data for earlier flu outbreak detection

3Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Influenza and flu can be serious problems, and can lead to death, as hundred thousands of people die every year due to seasonal flu. An early warning may help to prevent the spread of flu in the population. This kind of warning can be achieved by using social media data and big data tools and techniques. In this paper, a MapReduce and Spark-based analytic framework (MRSAF) using Twitter data is presented for faster flu outbreak detection. Different analysis cases are implemented using Apache Spark, Hadoop Systems and Hadoop Eco Systems to predict flu trends in different locations using Twitter data. The data was collected using a developed crawler which works together with the Twitter API to stream and filter the tweets based on flurelated keywords. The crawler is also designed to pre-process and clean the unintended attributes of the retrieved tweets. The results of the proposed solution show a strong relationship with the weekly Center for Disease Control and Prevention (CDC) reports.

Cite

CITATION STYLE

APA

Al Essa, A., & Faezipour, M. (2017). MapReduce and spark-based analytic framework using social media data for earlier flu outbreak detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10357 LNAI, pp. 246–257). Springer Verlag. https://doi.org/10.1007/978-3-319-62701-4_19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free