Bootstrap re-sampling for unbalanced data in supervised learning

Georges Dupret; Masato Koda

Journal Article

Bootstrap re-sampling for unbalanced data in supervised learning

European Journal of Operational Research (2001) 134(1) 141-156

DOI: 10.1016/S0377-2217(00)00244-7

49Citations

83Readers

Get full text

Abstract

This paper presents a technical framework to assess the impact of re-sampling on the ability of a supervised learning to correctly learn a classification problem. We use the bootstrap expression of the prediction error to identify the optimal re-sampling proportions in binary classification experiments using artificial neural networks. Based on Bayes decision rule and the a priori distribution of the objective data, an estimate for the optimal re-sampling proportion is derived as well as upper and lower bounds for the exact optimal proportion. The analytical considerations to extend the present method to cross-validation and multiple classes are also illustrated. © 2001 Elsevier Science B.V.

Author supplied keywords

Cite

CITATION STYLE

APA

Dupret, G., & Koda, M. (2001). Bootstrap re-sampling for unbalanced data in supervised learning. European Journal of Operational Research, 134(1), 141–156. https://doi.org/10.1016/S0377-2217(00)00244-7

Bootstrap re-sampling for unbalanced data in supervised learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions