Big data processing using Hadoop and Spark: The case of meteorology data

Abstract

Meteorology is a branch of science that can be leveraged to gain useful insight into many phenomena that have significant impacts on our daily lives, such as precipitation, cyclones, thunderstorms, and climate change. It is a highly data-driven field that involves large datasets of images captured from both radar and satellite, and thus requires efficient technologies for storing, processing, and mining these datasets to find hidden patterns. Different big data tools and ecosystems, most of them integrating Hadoop and Spark, have been designed to address big data issues. However, despite the importance of the field, few works have applied these tools and ecosystems to meteorological problems. This paper proposes and evaluates the performance of a precipitation data processing system that builds upon the Cloudera ecosystem to analyse large datasets of images as a classification problem. The system can be used as a replacement for machine learning techniques when the classification problem consists of finding zones of high, moderate, and low precipitation in satellite images.

Cite

Hussein, E., Sadiki, R., Jafta, Y., Sungay, M. M., Ajayi, O., & Bagula, A. (2020). Big data processing using Hadoop and Spark: The case of meteorology data. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (Vol. 311 LNICST, pp. 180–185). Springer. https://doi.org/10.1007/978-3-030-41593-8_13
