Imbalanced Classification in Diabetics Using Ensembled Machine Learning

9Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Diabetics is one of the world’s most common diseases which are caused by continued high levels of blood sugar. The risk of diabetics can be lowered if the diabetic is found at the early stage. In recent days, several machine learning models were developed to predict the diabetic presence at an early stage. In this paper, we propose an embedded-based machine learning model that combines the split-vote method and instance duplication to leverage an imbalanced dataset called PIMA Indian to increase the prediction of diabetics. The proposed method uses both the concept of over-sampling and under-sampling along with model weighting to increase the performance of classification. Different measures such as Accuracy, Precision, Recall, and F1-Score are used to evaluate the model. The results we obtained using K-Nearest Neighbor (kNN), Naïve Bayes (NB), Support Vector Machines (SVM), Random Forest (RF), Logistic Regression (LR), and Decision Trees (DT) were 89.32%, 91.44%, 95.78%, 89.3%, 81.76%, and 80.38% respectively. The SVM model is more efficient than other models which are 21.38% more than exiting machine learning-based works.

Cite

CITATION STYLE

APA

Kumar, M. S., Khan, M. Z., Rajendran, S., Noor, A., Dass, A. S., & Prabhu, J. (2022). Imbalanced Classification in Diabetics Using Ensembled Machine Learning. Computers, Materials and Continua, 72(3), 4397–4409. https://doi.org/10.32604/cmc.2022.025865

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free