Machine learning-based missing value imputation method for clinical datasets

M. Mostafizur Rahman; D. N. Davis

Conference Proceedings

Lecture Notes in Electrical Engineering (2013) 229 LNEE 245-257

DOI: 10.1007/978-94-007-6190-2_19

42 Citations

93 Readers

Abstract

Missing value imputation is one of the biggest tasks of data pre-processing in data mining. Most medical datasets are incomplete, and simply removing the incomplete cases from the original datasets can cause more problems than it solves. A suitable imputation method can help to produce good quality datasets for better analysis of clinical trials. In this paper we explore the use of machine learning techniques as missing value imputation methods for incomplete cardiovascular data. Mean/mode imputation, fuzzy unordered rule induction algorithm imputation, decision tree imputation and other machine learning algorithms are used to impute the missing values, and the final datasets are classified using decision tree, fuzzy unordered rule induction, KNN and K-means clustering. The experiment shows that final classifier performance is improved when the fuzzy unordered rule induction algorithm is used to predict missing attribute values for K-means clustering, and that in most cases the machine learning techniques perform better than the standard mean imputation technique. © 2013 Springer Science+Business Media Dordrecht.
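The contrast the abstract draws between mean imputation and learner-based imputation can be illustrated with a minimal sketch. This is not the authors' method: the paper uses fuzzy unordered rule induction and decision trees as imputers, whereas the sketch below swaps in a simple k-nearest-neighbour predictor as a stand-in because it fits in a few lines of standard-library Python. The toy records (age, systolic blood pressure), the function names `mean_impute` and `knn_impute`, and the choice of `k=2` are all hypothetical, and the sketch assumes only one column contains missing values.

```python
from statistics import mean

# Hypothetical toy records: [age, systolic_bp]; None marks a missing value.
ROWS = [
    [30, 110.0],
    [32, 114.0],
    [60, 150.0],
    [62, 154.0],
    [61, None],
]

def mean_impute(rows, col):
    """Baseline: fill missing entries in `col` with the mean of observed values."""
    fill = mean([r[col] for r in rows if r[col] is not None])
    return [[fill if (i == col and v is None) else v for i, v in enumerate(r)]
            for r in rows]

def knn_impute(rows, col, k=2):
    """Stand-in ML imputer: predict a missing `col` value from the k complete
    rows that are closest on the remaining columns (assumed fully observed)."""
    complete = [r for r in rows if r[col] is not None]

    def dist(a, b):
        # Squared Euclidean distance over every column except the target.
        return sum((a[i] - b[i]) ** 2 for i in range(len(a)) if i != col)

    out = []
    for r in rows:
        r = list(r)
        if r[col] is None:
            nearest = sorted(complete, key=lambda c: dist(r, c))[:k]
            r[col] = mean([c[col] for c in nearest])
        out.append(r)
    return out

mean_filled = mean_impute(ROWS, col=1)
knn_filled = knn_impute(ROWS, col=1, k=2)
print(mean_filled[4][1])  # 132.0: the global mean ignores the patient's age
print(knn_filled[4][1])   # 152.0: predicted from the two patients of similar age
```

In this toy case the mean fills the 61-year-old's record with a value typical of nobody in the data, while the predictor uses the other attribute to produce a value consistent with similar records. Whether that translates into better downstream classification is the empirical question the paper examines.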

Cite

Rahman, M. M., & Davis, D. N. (2013). Machine learning-based missing value imputation method for clinical datasets. In Lecture Notes in Electrical Engineering (Vol. 229 LNEE, pp. 245–257). Springer Verlag. https://doi.org/10.1007/978-94-007-6190-2_19
