Application of Machine Learning Algorithms to Handle Missing Values in Precipitation Data

Andrey Gorshenin; Mariia Lebedeva; Svetlana Lukina; Alina Yakovleva

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11965 LNCS 563-577

DOI: 10.1007/978-3-030-36614-8_43

Citations: 4

Readers: 14

Abstract

The paper presents two approaches to filling gaps in precipitation data, based on classification (Support Vector Machines) and regression (EM, Random Forests, k-Nearest Neighbors) machine learning algorithms, as well as a pattern-driven methodology. These methods are among the most powerful data mining tools in a wide range of research areas, including meteorology and climatology, where large volumes of temporal and spatial observations are available. Observations collected from weather stations often contain many missing records. Data processing algorithms are often very sensitive to incomplete data, so missing values should be imputed first, and only then can the complete samples be analyzed. The paper demonstrates that the suggested methods can fill gaps correctly even at high missing levels. Observations from Potsdam and Elista spanning about 60 years were used. A comparison of various imputation algorithms at different missing levels is also presented. The proposed methodology can be successfully used for real-time processing of information flows.
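To make the idea of comparing imputation quality at different missing levels concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a synthetic, smooth stand-in for daily precipitation (real records are far more intermittent), values missing completely at random, and a plain k-nearest-neighbors rule that uses only the day index as the feature. The function names `knn_impute`, `mask_at_random`, and `rmse` are hypothetical.

```python
import math
import random


def knn_impute(series, k=3):
    """Fill None entries with the mean of the k nearest observed values in time."""
    observed = [(i, v) for i, v in enumerate(series) if v is not None]
    if not observed:
        raise ValueError("series has no observed values")
    filled = list(series)
    for i, v in enumerate(series):
        if v is None:
            # Nearest neighbors are measured by distance in the day index (assumption).
            nearest = sorted(observed, key=lambda p: abs(p[0] - i))[:k]
            filled[i] = sum(val for _, val in nearest) / len(nearest)
    return filled


def mask_at_random(series, missing_level, rng):
    """Hide a given fraction of entries (missing completely at random)."""
    n_missing = int(round(missing_level * len(series)))
    hidden = set(rng.sample(range(len(series)), n_missing))
    return [None if i in hidden else v for i, v in enumerate(series)], hidden


def rmse(truth, filled, indices):
    """Root-mean-square error over the artificially hidden positions only."""
    return math.sqrt(sum((truth[i] - filled[i]) ** 2 for i in indices) / len(indices))


rng = random.Random(0)
# Synthetic stand-in for daily precipitation in mm: seasonal signal plus noise, clipped at 0.
truth = [
    max(0.0, 2.0 + 2.0 * math.sin(2 * math.pi * d / 365) + rng.gauss(0, 0.5))
    for d in range(730)
]

for level in (0.1, 0.3, 0.5):
    gappy, hidden = mask_at_random(truth, level, rng)
    filled = knn_impute(gappy, k=5)
    print(f"missing level {level:.0%}: RMSE = {rmse(truth, filled, sorted(hidden)):.3f} mm")
```

Hiding known values and scoring the reconstruction against them is one common way to compare imputation algorithms. Any other regressor could be swapped in for `knn_impute` under the same masking and scoring loop.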

Citation (APA)

Gorshenin, A., Lebedeva, M., Lukina, S., & Yakovleva, A. (2019). Application of Machine Learning Algorithms to Handle Missing Values in Precipitation Data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11965 LNCS, pp. 563–577). Springer. https://doi.org/10.1007/978-3-030-36614-8_43
