Big data processing using Hadoop and Spark: The case of meteorology data

Eslam Hussein; Ronewa Sadiki; Yahlieel Jafta; Muhammad Mujahid Sungay; Olasupo Ajayi; Antoine Bagula

Conference Proceedings

Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (2020) 311 LNICST 180-185

DOI: 10.1007/978-3-030-41593-8_13

Abstract

Meteorology is a branch of science that can be leveraged to gain useful insight into many phenomena that have significant impacts on our daily lives, such as precipitation, cyclones, thunderstorms and climate change. It is a highly data-driven field that involves large datasets of images captured by both radar and satellite, and thus requires efficient technologies for storing, processing and mining these datasets to find hidden patterns. Different big data tools and ecosystems, most of them integrating Hadoop and Spark, have been designed to address big data issues. However, despite the field's importance, only a few works have applied these tools and ecosystems to meteorology problems. This paper proposes a precipitation data processing system that builds upon the Cloudera ecosystem to analyse large datasets of images as a classification problem, and evaluates its performance. The system can be used as a replacement for machine learning techniques when the classification problem consists of finding zones of high, moderate and low precipitation in satellite images.
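The abstract does not say how zones are assigned without machine learning. One plausible reading is fixed thresholding of pixel intensities, and the sketch below illustrates only that idea. The threshold values, the 0-255 intensity scale and all function names are assumptions made for illustration; they are not taken from the paper. It uses plain Python rather than Spark so that it runs on its own.

```python
from collections import Counter

# Hypothetical thresholds on an assumed 0-255 intensity scale.
# The paper's abstract does not state actual values.
LOW_MAX = 85
MODERATE_MAX = 170


def classify_pixel(value):
    """Map one pixel intensity to a precipitation class by fixed thresholds."""
    if value <= LOW_MAX:
        return "low"
    if value <= MODERATE_MAX:
        return "moderate"
    return "high"


def classify_image(image):
    """Classify every pixel of a 2D grid given as a list of rows."""
    return [[classify_pixel(v) for v in row] for row in image]


def zone_counts(image):
    """Count pixels per class: a per-pixel map followed by a reduce."""
    return Counter(label for row in classify_image(image) for label in row)


# Tiny made-up grid standing in for a satellite image.
sample = [
    [10, 90, 200],
    [85, 170, 255],
]
labels = classify_image(sample)
counts = zone_counts(sample)
```

Because each pixel is classified independently and the counts are a simple aggregation, this kind of step would map naturally onto a distributed map-and-reduce job, which is presumably where an ecosystem such as Hadoop and Spark becomes useful for large image collections.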

Cite

Hussein, E., Sadiki, R., Jafta, Y., Sungay, M. M., Ajayi, O., & Bagula, A. (2020). Big data processing using Hadoop and Spark: The case of meteorology data. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (Vol. 311 LNICST, pp. 180–185). Springer. https://doi.org/10.1007/978-3-030-41593-8_13
