Improving Imbalanced Data Classification in Auto Insurance by the Data Level Approaches

9Citations
Citations of this article
34Readers
Mendeley users who have this article in their library.

Abstract

Predicting the frequency of insurance claims has become a significant challenge due to the imbalanced datasets since the number of occurring claims is usually significantly lower than the number of non-occurring claims. As a result, classification models tend to have a limited ability to predict the occurrence of claims. So, in this paper, we’ll use various data level approaches to try to solve the imbalanced data problem in the insurance industry. We developed 32 machine learning models for predicting insurance claims occurrence {(under-sampling, over-sampling, the combination of over-and under-sampling (hybrid), and SMOTE) × (three Decision tree models, three boosting models, and two bagging models) = 32}, and we compared the models’ accuracies, sensitivities, and specificities to comprehend the prediction performance of the built models. The dataset contains 81628 claims, each of which is a car insurance claim. There were 5714 claims that occurred and 75914 claims that didn’t occur. According to the findings, the AdaBoost classifier with oversampling and the hybrid method had the most accurate predictions, with a sensitivity of 92.94%, a specificity of 99.82%, and an accuracy of 99.4%. And with a sensitivity of 92.48%, a specificity of 99.63%, and an accuracy of 99.1%, respectively. This paper confirmed that When analyzing imbalanced data, the AdaBoost classifier, whether using oversampling or the hybrid process, could generate more accurate models than other boosting models, Decision tree models, and bagging models.

Cite

CITATION STYLE

APA

Hanafy, M., & Ming, R. (2021). Improving Imbalanced Data Classification in Auto Insurance by the Data Level Approaches. International Journal of Advanced Computer Science and Applications, 12(6), 493–499. https://doi.org/10.14569/IJACSA.2021.0120656

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free