Bootstrap re-sampling for unbalanced data in supervised learning

49Citations
Citations of this article
83Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper presents a technical framework to assess the impact of re-sampling on the ability of a supervised learning to correctly learn a classification problem. We use the bootstrap expression of the prediction error to identify the optimal re-sampling proportions in binary classification experiments using artificial neural networks. Based on Bayes decision rule and the a priori distribution of the objective data, an estimate for the optimal re-sampling proportion is derived as well as upper and lower bounds for the exact optimal proportion. The analytical considerations to extend the present method to cross-validation and multiple classes are also illustrated. © 2001 Elsevier Science B.V.

Cite

CITATION STYLE

APA

Dupret, G., & Koda, M. (2001). Bootstrap re-sampling for unbalanced data in supervised learning. European Journal of Operational Research, 134(1), 141–156. https://doi.org/10.1016/S0377-2217(00)00244-7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free