The Effect of Splitting of Raw Data into Training and Test Subsets on the Accuracy of Predicting Spatial Distribution by a Multilayer Perceptron

12Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The paper discusses the influence of various methods for splitting raw data into test and training subsets on the accuracy of the prediction of the spatial distribution of the variable for the model based on a multilayer perceptron. A comparison of models was performed taking into account both spatial heterogeneity and the spread of values of the modeled variable with a completely random splitting option. The study was based on the results obtained during soil screening of urbanized territories in the Russian subarctic zone, Novy Urengoy city and Noyabrsk city. The spatial distribution of the chemical element chromium (Cr) in the surface layer of the soil was modeled. For both surveyed areas, the models in which a controlled splitting method was used for training demonstrated more accurate results than models trained using completely random splitting.

Author supplied keywords

Cite

CITATION STYLE

APA

Baglaeva, E. M., Sergeev, A. P., Shichkin, A. V., & Buevich, A. G. (2020). The Effect of Splitting of Raw Data into Training and Test Subsets on the Accuracy of Predicting Spatial Distribution by a Multilayer Perceptron. Mathematical Geosciences, 52(1), 111–121. https://doi.org/10.1007/s11004-019-09813-9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free