CaliForest

Yubin Park; Joyce C. Ho

Conference ProceedingsOPEN ACCESS

CaliForest

ACM CHIL 2020 - Proceedings of the 2020 ACM Conference on Health, Inference, and Learning (2020) 40-50

DOI: 10.1145/3368555.3384461

8Citations

8Readers

Abstract

Real-world predictive models in healthcare should be evaluated in terms of discrimination, the ability to differentiate between high and low risk events, and calibration, or the accuracy of the risk estimates. Unfortunately, calibration is often neglected and only discrimination is analyzed. Calibration is crucial for personalized medicine as they play an increasing role in the decision making process. Since random forest is a popular model for many healthcare applications, we propose CaliForest, a new calibrated random forest. Unlike existing calibration methodologies, CaliForest utilizes the out-of-bag samples to avoid the explicit construction of a calibration set. We evaluated CaliForest on two risk prediction tasks obtained from the publicly-available MIMIC-III database. Evaluation on these binary prediction tasks demonstrates that CaliForest can achieve the same discriminative power as random forest while obtaining a better-calibrated model evaluated across six different metrics. CaliForest will be published on the standard Python software repository and the code will be openly available on Github.

Author supplied keywords

Cite

CITATION STYLE

APA

Park, Y., & Ho, J. C. (2020). CaliForest. In ACM CHIL 2020 - Proceedings of the 2020 ACM Conference on Health, Inference, and Learning (pp. 40–50). Association for Computing Machinery, Inc. https://doi.org/10.1145/3368555.3384461

CaliForest

Abstract

Author supplied keywords

Cite

Register to see more suggestions