An Ensemble Random Forest Algorithm for Privacy Preserving Distributed Medical Data Mining

5Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

A voluminous amount of data is generated because of the inexorably widespread proliferation of electronic data maintained using the electronic health records (EHRs). Medical health facilities have great potential to discern patterns from this data and utilize them in diagnosing specific diseases or predicting the outbreak of an epidemic. This discerning of patterns might reveal sensitive information about individuals, and this information is vulnerable to misuse. This is, however, a challenging task to share such sensitive data as it compromises the privacy of patients. In this paper, a random forestbased distributed data mining approach is proposed. Performance of the proposed model is evaluated using accuracy, f-measure, and Kappa statistics analyses. Experimental results reveal that the proposed model is efficient and scalable enough in both performance and accuracy within the imbalanced data and also in maintaining the privacy by sharing only useful healthcare knowledge in the form of local models without revealing and sharing sensitive data.

Cite

CITATION STYLE

APA

Hassan, M., Butt, M. A., & Zaman, M. (2021). An Ensemble Random Forest Algorithm for Privacy Preserving Distributed Medical Data Mining. International Journal of E-Health and Medical Communications, 12(6). https://doi.org/10.4018/IJEHMC.20211101.oa8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free