Comparative Analysis of Homogeneous and Heterogeneous Ensembles for Diabetes Classification Optimization

  • Maulana M
  • Muljono M
  • Meindiawan E
N/ACitations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Diabetes mellitus is a chronic disease with an increasing prevalence worldwide, including in Indonesia, reaching 11.7% by 2023. Early prediction of this disease is essential for more effective management. This study aims to develop a diabetes mellitus prediction model using an ensemble learning approach, including homogeneous (boosting and bagging) and heterogeneous (stacking and blending) techniques. In this study, the boosting algorithm using AdaBoost with Random Forest as the base estimator showed the highest accuracy of 98%, with balanced precision and recall. The bagging technique, which also uses Random Forest as the base estimator, achieved 97% accuracy, although slightly lower than boosting. The stacking technique, which combines XGBoost, Gradient Boosting, and Random Forest as base learners, with Random Forest as the meta-model, yields similar accuracy of 98%, but with lower prediction error, demonstrating its ability to cope with more complex data. Blending, which uses a similar approach but with training on the entire dataset, gave 98% accuracy with shorter processing time and more efficient memory usage than stacking.

Cite

CITATION STYLE

APA

Maulana, M. N., Muljono, M., & Meindiawan, E. P. A. (2025). Comparative Analysis of Homogeneous and Heterogeneous Ensembles for Diabetes Classification Optimization. Sinkron, 9(1), 512–521. https://doi.org/10.33395/sinkron.v9i1.14439

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free