This paper proposes a method for substituting missing values that is based on an evolutionary algorithm for clustering. Missing values substitution has been traditionally assessed by some measures of the prediction capability of imputation methods. Although this evaluation is useful, it does not allow inferring the influence of imputed values in the ultimate modeling task (e.g., in classification). In this sense, alternative approaches to the so called prediction capability evaluation are needed. Therefore, we here also discuss the influence of imputed values in the classification task. Preliminary results obtained in a bioinformatics data set illustrate that the proposed imputation algorithm can insert less classification bias than three state of the art algorithms (i.e., KNNimpute, SKNN and IKNN). Finally, we illustrate that better prediction results do not necessarily imply in less classification bias. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
De A. Silva, J., & Hruschka, E. R. (2009). An evolutionary algorithm for missing values substitution in classification tasks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5572 LNAI, pp. 195–202). https://doi.org/10.1007/978-3-642-02319-4_23
Mendeley helps you to discover research relevant for your work.