Modelling loss given default in peer-to-peer lending using random forests

5Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Modelling credit risk in peer-to-peer (P2P) lending is increasingly important due to the rapid growth of P2P platforms’ user bases. To support decisionmaking on granting P2P loans, diverse machine learning methods have been used in P2P credit risk models. However, such models have been limited to loan default prediction, without considering the financial impact of the loans. Loss given default (LGD) is used in modelling consumer credit risk to address this issue. Earlier approaches to modelling LGDin P2P lending tended to usemultivariate linear regression methods in order to identify the determinants of P2P loans’ credit risk. Here, we showthat thesemethods are not effective enough to process complex features present in P2P lending data.We propose a novel decision support system to LGD modeling in P2P lending. To reduce the problem of overfitting, the system uses random forest (RF) learning in two stages. First, extremely risky loans with LGD = 1 are identified using classification RF. Second, the LGD of the remaining P2P loans is predicted using regression RF. Thus, the non-normal distribution of the LGD values can be effectively modelled. We demonstrate that the proposed system is effective for the benchmark of P2P Lending Club platform as other methods currently used in LGD modelling are outperformed.

Cite

CITATION STYLE

APA

Papoušková, M., & Hajek, P. (2019). Modelling loss given default in peer-to-peer lending using random forests. In Smart Innovation, Systems and Technologies (Vol. 142, pp. 133–141). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-13-8311-3_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free