GABoost: A clustering based undersampling algorithm for highly imbalanced datasets using genetic algorithm

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data sets that have imbalanced class distribution is a challenging problem for many application domains. Learning from imbalanced data can’t be done efficiently using current data mining and machine learning tasks. Instead of merely using those algorithms we have to consider some other techniques to learn from those data set. One solution is to develop some preprocessing methods to balance the data sets and combine it with some existing algorithm. In this paper, we propose a new hybrid clustering based undersampling technique using genetic algorithm and AdaBoost, which is called GABoost, for learning from imbalanced data. This algorithm is an attractive alternative for SMOTEBoost, RUSBoost, CUSBoost. Based on the experimental results obtained from 44 imbalanced datasets we strongly recommend GABoost as a striking alternative for improving the performance of the learned classification model which is built using highly imbalanced dataset.

Cite

CITATION STYLE

APA

Ajilisa, O. A., Jagathyraj, V. P., & Sabu, M. K. (2019). GABoost: A clustering based undersampling algorithm for highly imbalanced datasets using genetic algorithm. In Advances in Intelligent Systems and Computing (Vol. 939, pp. 235–246). Springer Verlag. https://doi.org/10.1007/978-3-030-16681-6_24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free