A stacking ensemble model with SMOTE for improved imbalanced classification on credit data

5Citations
Citations of this article
44Readers
Mendeley users who have this article in their library.

Abstract

This research is based on a significant problem in credit risk analysis in the banking sector caused by class imbalance. We face the problem of the model’s inability to accurately identify risks in the ‘‘Charged Off’’ class. As a solution, we propose a stacked ensemble approach that utilizes synthetic minority over-sampling technique (SMOTE) to balance the class distribution. Experiments were conducted by applying SMOTE to the training data before training the credit model using gradient boosting (XGBoost) and random forest (RF) algorithms in a single ensemble. The results show significant improvements in precision, recall, and F1-score after applying SMOTE on the unbalanced classes. The updated model achieved a striking accuracy rate of 0,97 on resampled training data. This re-search clearly identifies the problem of class imbalance as a major challenge in credit risk analysis. The application of SMOTE in a stacked ensemble was found to be effective in improving model performance, making a valuable contribution to the development of more reliable credit models for better risk management and revenue generation in financial institutions.

Cite

CITATION STYLE

APA

Alamsyah, N., Budiman, Yoga, T. P., & Alamsyah, R. Y. R. (2024). A stacking ensemble model with SMOTE for improved imbalanced classification on credit data. Telkomnika (Telecommunication Computing Electronics and Control), 22(3), 657–664. https://doi.org/10.12928/TELKOMNIKA.v22i3.25921

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free