K Means Cluster Based Undersampling Ensemble for Imbalanced Data Classification

  • Subbulaxmi S
  • et al.
Citations of this article
Mendeley users who have this article in their library.
Get full text


Imbalanced data classification is a critical and challenging problem in both data mining and machine learning. Imbalanced data classification problems present in many application areas like rare medical diagnosis, risk management, fault-detection, etc. The traditional classification algorithms yield poor results in imbalanced classification problems. In this paper, K-Means cluster based undersampling ensemble algorithm is proposed to solve the imbalanced data classification problem. The proposed method combines K-Means cluster based undersampling and boosting method. The experimental results show that the proposed algorithm outperforms the other sampling ensemble algorithms of previous studies.




Subbulaxmi, S. S., & Arumugam, G. (2020). K Means Cluster Based Undersampling Ensemble for Imbalanced Data Classification. International Journal of Engineering and Advanced Technology, 9(3), 2074–2079. https://doi.org/10.35940/ijeat.c5188.029320

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free