During the last few years, classification task in machine learning is commonly used by various real-life applications. One of the common applications is credit scoring systems where the ability to accurately predict creditworthy or non-creditworthy applicants is critically important because incorrect predictions can cause major financial loss. In this paper, we aim to focus on skewed data distribution issue faced by credit scoring system. To reduce the imbalance between the classes, we apply preprocessing on the dataset which makes combined use of random re-sampling and dimensionality reduction. Experimental results on Australian and German credit datasets with the presented preprocessing technique has shown significant performance improvement in terms of AUC and F-measure.
CITATION STYLE
Salunkhe, U. R., & Mali, S. N. (2018). A hybrid approach for preprocessing of imbalanced data in credit scoring systems. In Advances in Intelligent Systems and Computing (Vol. 673, pp. 87–95). Springer Verlag. https://doi.org/10.1007/978-981-10-7245-1_10
Mendeley helps you to discover research relevant for your work.