Oversampling techniques for bankruptcy prediction: Novel features from a transaction dataset

Tuong Le; Mi Young Lee; Jun Ryeol Park; Sung Wook Baik

Journal ArticleOPEN ACCESS

Oversampling techniques for bankruptcy prediction: Novel features from a transaction dataset

Symmetry (2018) 10(4)

DOI: 10.3390/sym10040079

65Citations

124Readers

Abstract

In recent years, weakened by the fall of economic growth, many enterprises fell into the crisis caused by financial difficulties. Bankruptcy prediction, a machine learning model, is a great utility for financial institutions, fund managers, lenders, governments, and economic stakeholders. Due to the number of bankrupt companies compared to that of non-bankrupt companies, bankruptcy prediction faces the problem of imbalanced data. This study first presents the bankruptcy prediction framework. Then, five oversampling techniques are used to deal with imbalance problems on the experimental dataset which were collected from Korean companies in two years from 2016 to 2017. Experimental results show that using oversampling techniques to balance the dataset in the training stage can enhance the performance of the bankruptcy prediction. The best overall Area Under the Curve (AUC) of this framework can reach 84.2%. Next, the study extracts more features by combining the financial dataset with transaction dataset to increase the performance for bankruptcy prediction and achieves 84.4% AUC.

Author supplied keywords

Cite

CITATION STYLE

APA

Le, T., Lee, M. Y., Park, J. R., & Baik, S. W. (2018). Oversampling techniques for bankruptcy prediction: Novel features from a transaction dataset. Symmetry, 10(4). https://doi.org/10.3390/sym10040079

Oversampling techniques for bankruptcy prediction: Novel features from a transaction dataset

Abstract

Author supplied keywords

Cite

Register to see more suggestions