Imbalanced Data Classification Algorithm Based on Integrated Sampling and Ensemble Learning

Yan Han; Mingxiang He; Qixian Lu

Conference Proceedings

Advances in Intelligent Systems and Computing (2019) 834 615-624

DOI: 10.1007/978-981-13-5841-8_64

Abstract

To alleviate the impact of imbalanced data on support vector machines (SVM), a classification method that integrates hybrid sampling with ensemble learning is proposed. First, the imbalance ratio of the data is reduced by the ADASYN-NCL hybrid sampling method, which combines adaptive synthetic oversampling (ADASYN) with undersampling by the neighborhood cleaning rule (NCL). Then, the AdaBoost framework is used to apply different weight adjustments to misclassified minority-class and majority-class samples, and several classifiers are selectively integrated to obtain better classification performance. Finally, 10 imbalanced data sets from the KEEL database are used as test objects, with F-value and G-mean as the evaluation indicators that verify the performance of the classification algorithm. The experimental results show that the algorithm has certain advantages in classifying imbalanced data sets.
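The abstract names F-value and G-mean as evaluation indicators but does not define them. The sketch below assumes the definitions commonly used for imbalanced data: the F-value is the weighted harmonic mean of precision and recall on the minority (positive) class, and the G-mean is the geometric mean of the true positive rate and the true negative rate. The function names, the `beta` parameter, and the toy labels are illustrative assumptions, not taken from the paper.

```python
import math


def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, FP, TN, FN, treating `positive` as the minority class."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == positive and p == positive:
            tp += 1
        elif t != positive and p == positive:
            fp += 1
        elif t == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def f_value(tp, fp, fn, beta=1.0):
    """F-measure on the positive class (assumed definition; beta=1 gives F1)."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = beta * beta * precision + recall
    if denom == 0:
        return 0.0
    return (1 + beta * beta) * precision * recall / denom


def g_mean(tp, fp, tn, fn):
    """Geometric mean of sensitivity (TPR) and specificity (TNR)."""
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


# Toy example (made-up labels): 4 minority samples, 16 majority samples.
y_true = [1, 1, 1, 1] + [0] * 16
# The classifier finds 3 of 4 minority samples and raises 2 false alarms.
y_pred = [1, 1, 1, 0] + [1, 1] + [0] * 14

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
print(f"accuracy={accuracy:.3f}  F-value={f_value(tp, fp, fn):.3f}  "
      f"G-mean={g_mean(tp, fp, tn, fn):.3f}")
```

In this toy case accuracy is 0.85 while the F-value is about 0.667, which illustrates why accuracy alone is a poor indicator when the classes are imbalanced: the majority class dominates the accuracy figure, whereas both indicators above penalize errors on the minority class.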

Cite

Han, Y., He, M., & Lu, Q. (2019). Imbalanced Data Classification Algorithm Based on Integrated Sampling and Ensemble Learning. In Advances in Intelligent Systems and Computing (Vol. 834, pp. 615–624). Springer Verlag. https://doi.org/10.1007/978-981-13-5841-8_64
