Machine learning-based missing value imputation method for clinical datasets

42Citations
Citations of this article
93Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Missing value imputation is one of the biggest tasks of data pre-processing when performing data mining. Most medical datasets are usually incomplete. Simply removing the incomplete cases from the original datasets can bring more problems than solutions. A suitable method for missing value imputation can help to produce good quality datasets for better analysing clinical trials. In this paper we explore the use of a machine learning technique as a missing value imputation method for incomplete cardiovascular data. Mean/mode imputation, fuzzy unordered rule induction algorithm imputation, decision tree imputation and other machine learning algorithms are used as missing value imputation and the final datasets are classified using decision tree, fuzzy unordered rule induction, KNN and K-Mean clustering. The experiment shows that final classifier performance is improved when the fuzzy unordered rule induction algorithm is used to predict missing attribute values for K-Mean clustering and in most cases, the machine learning techniques were found to perform better than the standard mean imputation technique. © 2013 Springer Science+Business Media Dordrecht.

Cite

CITATION STYLE

APA

Rahman, M. M., & Davis, D. N. (2013). Machine learning-based missing value imputation method for clinical datasets. In Lecture Notes in Electrical Engineering (Vol. 229 LNEE, pp. 245–257). Springer Verlag. https://doi.org/10.1007/978-94-007-6190-2_19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free