Bootstrap re-sampling for unbalanced data in supervised learning

0.00 Avg rating—0 Votes

Article ID:	iaor20021600
Country:	Netherlands
Volume:	134
Issue:	1
Start Page Number:	141
End Page Number:	156
Publication Date:	Oct 2001
Journal:	European Journal of Operational Research
Authors:	Koda Masato, Dupret Georges
Keywords:	neural networks, simulation

Abstract:

This paper presents a technical framework to assess the impact of re-sampling on the ability of a supervised learning to correctly learn a classification problem. We use the bootstrap expression of the prediction error to identify the optimal re-sampling proportions in binary classification experiments using artificial neural networks. Based on Bayes decision rule and the a priori distribution of the objective data, an estimate for the optimal re-sampling proportion is derived as well as upper and lower bounds for the exact optimal proportion. The analytical considerations to extend the present method to cross-validation and multiple classes are also illustrated.

Reviews

Required fields are marked *. Your email address will not be published.