Imbalanced Data Classification Algorithm Based on Integrated Sampling and Ensemble Learning

1Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In order to alleviate the impact of imbalanced data on support vector machine (SVM), an integrated hybrid sampling imbalanced data classification method is proposed. First, the imbalance rate of imbalanced data is reduced by the ADASYN-NCL (Adaptive Synthetic Sampling Technique—Domain Cleanup Rule Downsampling Method) hybrid sampling method. Then, the AdaBoost algorithm framework is used to give different weight adjustments to the misclassification of minority and majority classes, and selectively integrate several classifiers to obtain better classification. Finally, use the 10 sets of imbalanced data in the KEEL database as test objects, and F-value and G-mean are used as evaluation indicators to verify the performance of the classification algorithm. The experimental results show that the classification algorithm has certain advantages for the classification effect of imbalanced data sets.

Cite

CITATION STYLE

APA

Han, Y., He, M., & Lu, Q. (2019). Imbalanced Data Classification Algorithm Based on Integrated Sampling and Ensemble Learning. In Advances in Intelligent Systems and Computing (Vol. 834, pp. 615–624). Springer Verlag. https://doi.org/10.1007/978-981-13-5841-8_64

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free