Modeling insurance fraud detection using imbalanced data classification

Amira Kamil Ibrahim Hassan; Ajith Abraham

Conference Proceedings

Modeling insurance fraud detection using imbalanced data classification

Advances in Intelligent Systems and Computing (2016) 419 117-127

DOI: 10.1007/978-3-319-27400-3_11

42Citations

62Readers

Get full text

Abstract

This paper proposes an innovative insurance fraud detection method to deal with the imbalanced data distribution. The idea is based on building insurance fraud detection models using Decision tree (DT), Support vector machine (SVM) and Artificial Neural Network (ANN), on data partitions derived from under-sampling (with-replacement and without-replacement) of the majority class and merging it with the minority class. Throughout the paper, ten-fold cross validation method of testing is used. Its originality lies in the use of several partitioning under-sampling approaches and choosing the best. Results from a publicly available automobile insurance fraud detection data set demonstrate that DT performs slightly better than other algorithms, so DT model was used to compare between different partitioning-under-sampling approaches. Empirical results illustrate that the proposed model gave better results.

Author supplied keywords

Cite

CITATION STYLE

APA

Hassan, A. K. I., & Abraham, A. (2016). Modeling insurance fraud detection using imbalanced data classification. In Advances in Intelligent Systems and Computing (Vol. 419, pp. 117–127). Springer Verlag. https://doi.org/10.1007/978-3-319-27400-3_11

Modeling insurance fraud detection using imbalanced data classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions