In this work, we analyze the training samples for discovering what kind of samples are more appropriate to train the back-propagation algorithm. To do this, we propose a Gaussian function in order to identify three types of samples: Border, Safe and Average samples. Experiments on sixteen two-class imbalanced data sets where carried out, and a non-parametrical statistical test was applied. In addition, we employ the SMOTE as classification performance reference, i.e., to know whether the studied methods are competitive with respect to SMOTE performance. Experimental results show that the best samples to train the back-propagation are the average samples and the worst are the safe samples.
CITATION STYLE
Alejo, R., Monroy-De-Jesús, J., Pacheco-Sánchez, J. H., Valdovinos, R. M., Antonio-Velázquez, J. A., & Marcial-Romero, J. R. (2015). Analysing the safe, average and border samples on two-class imbalance problems in the back-propagation domain. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9423, pp. 699–707). Springer Verlag. https://doi.org/10.1007/978-3-319-25751-8_84
Mendeley helps you to discover research relevant for your work.