Credit evaluation with a data mining approach based on gradient boosting decision tree

6Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In recent years, credit evaluation has become an issue of increasing concern for financial institutions. However, since most research focuses on the risk classification process, the problem of data imbalance is ignored. In real data sets, there are often far more users with good credit than users with bad credit, and the imbalance of data often easily leads to a decline in the classification performance of the model. Therefore, previous research is very limited in practical application scenarios. In this paper, we establish a new integration method for credit evaluation, which is classified into three steps: First, data preprocessing. Before inputting samples into the model, we take a series of preprocessing steps, such as missing data processing, data dimensionality reduction. Secondly, in view of the imbalance problem, the data is divided into multiple clusters using an unsupervised clustering algorithm, and the SMOTE method is used to generate minority samples in the clusters whose ratio exceeds the threshold. Finally, the GBDT2NN and Factorization Machine methods are integrated to classify the samples. In order to verify the effectiveness of this method, we use the Kaggle competition data set for verification. The results show that this method is better than other algorithms in the field of credit evaluation in terms of recall rate and AUC value.

References Powered by Scopus

Going deeper with convolutions

39960Citations
N/AReaders
Get full text

XGBoost: A scalable tree boosting system

33424Citations
N/AReaders
Get full text

SMOTE: Synthetic minority over-sampling technique

22773Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Detection of multiple metal ions in water with a fluorescence sensor based on carbon quantum dots assisted by stepwise prediction and machine learning

23Citations
N/AReaders
Get full text

Credit Evaluation of SMEs Based on GBDT-CNN-LR Hybrid Integrated Model

7Citations
N/AReaders
Get full text

Continual three-way decisions via knowledge transfer

2Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Liu, Z., & Zhang, Y. (2021). Credit evaluation with a data mining approach based on gradient boosting decision tree. In Journal of Physics: Conference Series (Vol. 1848). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1848/1/012034

Readers over time

‘21‘22‘23‘2402468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

100%

Readers' Discipline

Tooltip

Computer Science 5

83%

Engineering 1

17%

Save time finding and organizing research with Mendeley

Sign up for free
0