A voluminous amount of data is generated because of the inexorably widespread proliferation of electronic data maintained using the electronic health records (EHRs). Medical health facilities have great potential to discern patterns from this data and utilize them in diagnosing specific diseases or predicting the outbreak of an epidemic. This discerning of patterns might reveal sensitive information about individuals, and this information is vulnerable to misuse. This is, however, a challenging task to share such sensitive data as it compromises the privacy of patients. In this paper, a random forestbased distributed data mining approach is proposed. Performance of the proposed model is evaluated using accuracy, f-measure, and Kappa statistics analyses. Experimental results reveal that the proposed model is efficient and scalable enough in both performance and accuracy within the imbalanced data and also in maintaining the privacy by sharing only useful healthcare knowledge in the form of local models without revealing and sharing sensitive data.
CITATION STYLE
Hassan, M., Butt, M. A., & Zaman, M. (2021). An Ensemble Random Forest Algorithm for Privacy Preserving Distributed Medical Data Mining. International Journal of E-Health and Medical Communications, 12(6). https://doi.org/10.4018/IJEHMC.20211101.oa8
Mendeley helps you to discover research relevant for your work.