Application of Machine Learning Algorithms to Handle Missing Values in Precipitation Data

4Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The paper presents two approaches to filling gaps in precipitation based on classification (Support-Vector Machines) and regression (EM, Random Forests, k-Nearest Neighbors) machine learning algorithms as well as the pattern-driven methodology. These methods are among of the most powerful tools for data mining in a wide range of research areas including meteorology and climatology due to the presence of a large amount of temporal and spatial observations. When collecting observations from weather stations, there are a lot of missing records. Data processing algorithms are often very sensitive to the presence of incomplete data, so missing values should be firstly imputed and only after that the complete samples can be analyzed. The possibility of a correct filling data even for high missing levels based on suggested methods is demonstrated. The observations in Potsdam and Elista for about 60 years were used. Also, comparison of various algorithms for data imputation taking into account different missing levels is presented. The proposed methodology can be successfully used for real-time data processing of information flows.

Cite

CITATION STYLE

APA

Gorshenin, A., Lebedeva, M., Lukina, S., & Yakovleva, A. (2019). Application of Machine Learning Algorithms to Handle Missing Values in Precipitation Data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11965 LNCS, pp. 563–577). Springer. https://doi.org/10.1007/978-3-030-36614-8_43

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free