GABoost: A clustering based undersampling algorithm for highly imbalanced datasets using genetic algorithm

O. A. Ajilisa; V. P. Jagathyraj; M. K. Sabu

Conference Proceedings

GABoost: A clustering based undersampling algorithm for highly imbalanced datasets using genetic algorithm

Advances in Intelligent Systems and Computing (2019) 939 235-246

DOI: 10.1007/978-3-030-16681-6_24

2Citations

3Readers

Get full text

Abstract

Data sets that have imbalanced class distribution is a challenging problem for many application domains. Learning from imbalanced data can’t be done efficiently using current data mining and machine learning tasks. Instead of merely using those algorithms we have to consider some other techniques to learn from those data set. One solution is to develop some preprocessing methods to balance the data sets and combine it with some existing algorithm. In this paper, we propose a new hybrid clustering based undersampling technique using genetic algorithm and AdaBoost, which is called GABoost, for learning from imbalanced data. This algorithm is an attractive alternative for SMOTEBoost, RUSBoost, CUSBoost. Based on the experimental results obtained from 44 imbalanced datasets we strongly recommend GABoost as a striking alternative for improving the performance of the learned classification model which is built using highly imbalanced dataset.

Author supplied keywords

Cite

CITATION STYLE

APA

Ajilisa, O. A., Jagathyraj, V. P., & Sabu, M. K. (2019). GABoost: A clustering based undersampling algorithm for highly imbalanced datasets using genetic algorithm. In Advances in Intelligent Systems and Computing (Vol. 939, pp. 235–246). Springer Verlag. https://doi.org/10.1007/978-3-030-16681-6_24

GABoost: A clustering based undersampling algorithm for highly imbalanced datasets using genetic algorithm

Abstract

Author supplied keywords

Cite

Register to see more suggestions