New hybrid ensemble method for anomaly detection in data science

Amina Mohamed Elmahalwy; Hayam M. Mousa; Khalid M. Amin

Journal ArticleOPEN ACCESS

New hybrid ensemble method for anomaly detection in data science

International Journal of Electrical and Computer Engineering (2023) 13(3) 3498-3508

DOI: 10.11591/ijece.v13i3.pp3498-3508

1Citations

25Readers

Abstract

Anomaly detection is a significant research area in data science. Anomaly detection is used to find unusual points or uncommon events in data streams. It is gaining popularity not only in the business world but also in different of other fields, such as cyber security, fraud detection for financial systems, and healthcare. Detecting anomalies could be useful to find new knowledge in the data. This study aims to build an effective model to protect the data from these anomalies. We propose a new hyper ensemble machine learning method that combines the predictions from two methodologies the outcomes of isolation forest-k-means and random forest using a voting majority. Several available datasets, including KDD Cup-99, Credit Card, Wisconsin Prognosis Breast Cancer (WPBC), Forest Cover, and Pima, were used to evaluate the proposed method. The experimental results exhibit that our proposed model gives the highest realization in terms of receiver operating characteristic performance, accuracy, precision, and recall. Our approach is more efficient in detecting anomalies than other approaches. The highest accuracy rate achieved is 99.9%, compared to accuracy without a voting method, which achieves 97%.

Author supplied keywords

Cite

CITATION STYLE

APA

Elmahalwy, A. M., Mousa, H. M., & Amin, K. M. (2023). New hybrid ensemble method for anomaly detection in data science. International Journal of Electrical and Computer Engineering, 13(3), 3498–3508. https://doi.org/10.11591/ijece.v13i3.pp3498-3508

New hybrid ensemble method for anomaly detection in data science

Abstract

Author supplied keywords

Cite

Register to see more suggestions