Classification of large imbalanced credit client data with cluster based SVM

2Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Credit client scoring on medium sized data sets can be accomplished by means of Support Vector Machines (SVM), a powerful and robust machine learning method. However, real life credit client data sets are usually huge, containing up to hundred thousands of records, with good credit clients vastly outnumbering the defaulting ones. Such data pose severe computational barriers for SVM and other kernel methods, especially if all pairwise data point similarities are requested. Hence, methods which avoid extensive training on the complete data are in high demand. A possible solution is clustering as preprocessing and classification on the more informative resulting data like cluster centers. Clustering variants which avoid the computation of all pairwise similarities robustly filter useful information from the large imbalanced credit client data set, especially when used in conjunction with a symbolic cluster representation. Subsequently, we construct credit client clusters representing both client classes, which are then used for training a non standard SVM adaptable to our imbalanced class set sizes. We also show that SVM trained on symbolic cluster centers result in classification models, which outperform traditional statistical models as well as SVM trained on all our original data. © 2012 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Stecking, R., & Schebesch, K. B. (2012). Classification of large imbalanced credit client data with cluster based SVM. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 443–451). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-642-24466-7_45

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free