Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance.

  • Mazurowski M
  • Habas P
  • Zurada J
 et al. 

This study investigates the effect of class imbalance in training data when developing neural network classifiers for computer-aided medical diagnosis. The investigation is performed in the presence of other characteristics typical of medical data, namely small training sample size, large number of features, and correlations between features. Two methods of neural network training are explored: classical backpropagation (BP) and particle swarm optimization (PSO) with clinically relevant training criteria. An experimental study is performed using simulated data, and the conclusions are further validated on real clinical data for breast cancer diagnosis. The results show that classifier performance deteriorates with even modest class imbalance in the training data. Further, BP is shown to be generally preferable to PSO for imbalanced training data, especially with a small data sample and a large number of features. Finally, no clear preference is found between oversampling and the no-compensation approach, and some guidance is provided regarding the proper selection.
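To make the oversampling option concrete, here is a minimal Python sketch of random oversampling with replacement, in which minority-class cases are duplicated until every class matches the size of the largest one. The abstract does not specify how oversampling was carried out in the study, so this is a generic illustration rather than the authors' procedure; the function name `oversample_minority` and the toy data are hypothetical.

```python
import random
from collections import Counter


def oversample_minority(samples, labels, seed=0):
    """Randomly duplicate cases of smaller classes (with replacement)
    until every class has as many cases as the largest class.

    Generic illustration only; not the procedure used in the study.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())

    out_samples, out_labels = list(samples), list(labels)
    for cls, n in counts.items():
        # All original cases belonging to this class.
        pool = [s for s, y in zip(samples, labels) if y == cls]
        # Draw the missing number of cases from the pool, with replacement.
        for _ in range(target - n):
            out_samples.append(rng.choice(pool))
            out_labels.append(cls)
    return out_samples, out_labels


# Toy imbalanced set: 8 negative cases, 2 positive cases (made-up values).
features = [[float(i)] for i in range(10)]
labels = [0] * 8 + [1] * 2

balanced_x, balanced_y = oversample_minority(features, labels)
print(Counter(balanced_y))
```

The "no compensation" alternative simply trains on `features` and `labels` as they are. Oversampling of this kind should be applied to the training set only, so that duplicated cases do not leak into the data used for evaluation.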

Author-supplied keywords

  • Algorithms
  • Artificial Intelligence
  • Automatic Data Processing
  • Breast Neoplasms
  • Breast Neoplasms: classification
  • Breast Neoplasms: diagnosis
  • Computer Simulation
  • Computer-Assisted
  • Computer-Assisted: methods
  • Decision Making
  • Diagnosis
  • Feedback
  • Humans
  • Neural Networks (Computer)
  • ROC Curve


  • Maciej A. Mazurowski

  • Piotr A. Habas

  • Jacek M. Zurada

  • Joseph Y. Lo

  • Jay A. Baker

  • Georgia D. Tourassi
