Peer to peer lending risk analysis based on embedded technique and stacking ensemble learning

9Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Peer to peer lending is famous for easy and fast loans from complicated traditional lending institutions. Therefore, big data and machine learning are needed for credit risk analysis, especially for potential defaulters. However, data imbalance and high computation have a terrible effect on machine learning prediction performance. This paper proposes a stacking ensemble learning with features selection based on embedded techniques (gradient boosted trees (GBDT), random forest (RF), adaptive boosting (AdaBoost), extra gradient boosting (XGBoost), light gradient boosting machine (LGBM), and decision tree (DT)) to predict the credit risk of individual borrowers on peer to peer (P2P) lending. The stacking ensemble model is created from a stack of meta-learners used in feature selection. The feature selection+ stacking model produces an average of 94.54% accuracy and 69.10 s execution time. RF meta-learner+Stacking ensemble is the best classification model, and the LGBM meta-learner+stacking ensemble is the fastest execution time. Based on experimental results, this paper showed that the credit risk prediction for P2P lending could be improved using the stacking ensemble model in addition to proper feature selection.

Cite

CITATION STYLE

APA

Munsarif, M., Sam’an, M., & Safuan. (2022). Peer to peer lending risk analysis based on embedded technique and stacking ensemble learning. Bulletin of Electrical Engineering and Informatics, 11(6), 3483–3489. https://doi.org/10.11591/eei.v11i6.3927

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free