Machine learning for predictive analyses in health: An example of an application to predict death in the elderly in São Paulo, Brazil

22Citations
Citations of this article
105Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This study aims to present the stages related to the use of machine learning algorithms for predictive analyses in health. An application was performed in a database of elderly residents in the city of São Paulo, Brazil, who participated in the Health, Well-Being, and Aging Study (SABE) (n = 2,808). The outcome variable was the occurrence of death within five years of the elder’s entry into the study (n = 423), and the predictors were 37 variables related to the elder’s demographic, socioeconomic, and health profile. The application was organized according to the following stages: division of data in training (70%) and testing (30%), pre-processing of the predictors, learning, and assessment of the models. The learning stage used 5 algorithms to adjust the models: logistic regression with and without penalization, neural networks, gradient boosted trees, and random forest. The algorithms’ hyperparameters were optimized by 10-fold cross-validation to select those corresponding to the best models. For each algorithm, the best model was assessed in test data via area under the ROC curve (AUC) and related measures. All the models presented AUC ROC greater than 0.70. For the three models with the highest AUC ROC (neural networks and logistic regression with LAS-SO penalization and without penalization, respectively), quality measures of the predicted probability were also assessed. The expectation is that with the increased availability of data and trained human capital, it will be possible to develop predictive machine learning models with the potential to help health professionals make the best decisions.

Author supplied keywords

Cite

CITATION STYLE

APA

dos Santos, H. G., do Nascimento, C. F., Izbicki, R., Duarte, Y. A. de O., & Filho, A. D. P. C. (2019). Machine learning for predictive analyses in health: An example of an application to predict death in the elderly in São Paulo, Brazil. Cadernos de Saude Publica, 35(7). https://doi.org/10.1590/0102-311x00050818

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free