Sampling methods in handling imbalanced data for Indonesia health insurance dataset

1Citations
Citations of this article
30Readers
Mendeley users who have this article in their library.

Abstract

Health insurance fraud is one of the most frequently occurring fraudulent acts and has become a concern for every insurance. According to data from The Indonesian General Insurance Association or Asosiasi Asuransi Umum Indonesia (AAUI), the private insurance industry suffered losses up to billions rupiah throughout 2018 due to the fraudulent acts commited by the perpetrators. The problem in with the number of frauds in Indonesia is that the current system is highly vulnerable and they is still done manually. The other problem from this detection is imbalance data which often occurs in fraudulent cases. In this research, we used a sampling methods using several machine learning as the baseline. The result shows that the instance hardness thresholding algorithm and extreme gradient boosting gives the best performance for all the case. It shows the method can reduced the bias and can achieve better generalization.

Cite

CITATION STYLE

APA

Kurniadi, F. I., Purwandari, K., Wulandari, A., & Permai, S. D. (2024). Sampling methods in handling imbalanced data for Indonesia health insurance dataset. IAES International Journal of Artificial Intelligence, 13(1), 348–357. https://doi.org/10.11591/ijai.v13.i1.pp348-357

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free