Systematic Review on Missing Data Imputation Techniques with Machine Learning Algorithms for Healthcare

34Citations
Citations of this article
104Readers
Mendeley users who have this article in their library.

Abstract

Missing data is one of the most common issues encountered in data cleaning process especially when dealing with medical dataset. A real collected dataset is prone to be incomplete, inconsistent, noisy and redundant due to potential reasons such as human errors, instrumental failures, and adverse death. Therefore, to accurately deal with incomplete data, a sophisticated algorithm is proposed to impute those missing values. Many machine learning algorithms have been applied to impute missing data with plausible values. However, among all machine learning imputation algorithms, KNN algorithm has been widely adopted as an imputation for missing data due to its robustness and simplicity and it is also a promising method to outperform other machine learning methods. This paper provides a comprehensive review of different imputation techniques used to replace the missing data. The goal of the review paper is to bring specific attention to potential improvements to existing methods and provide readers with a better grasps of imputation technique trends.

Cite

CITATION STYLE

APA

Ismail, A. R., Abidin, N. Z., & Maen, M. K. (2022, March 1). Systematic Review on Missing Data Imputation Techniques with Machine Learning Algorithms for Healthcare. Journal of Robotics and Control (JRC). Department of Electrical Engineering, Universitas Muhammadiyah Yogyakarta. https://doi.org/10.18196/jrc.v3i2.13133

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free