This paper describes the application of a genetic algorithm to nearest-neighbour based imputation of sample data into a census data dataset. The genetic algorithm optimises the selection and weights of variables used for measuring distance. The results show that the measure of fit can be improved by selecting imputation variables using a genetic algorithm. The percentage of variance explained in the goal variables increases compared to a simple selection of imputation variables. This quantitative approach to the selection of imputation variables does not deny the importance of expertise. Human expertise is still essential in defining the optional set of imputation variables. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Simkin, S., Verwaart, T., & Vrolijk, H. (2005). Application of a genetic algorithm to nearest neighbour classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3533 LNAI, pp. 544–546). Springer Verlag. https://doi.org/10.1007/11504894_73
Mendeley helps you to discover research relevant for your work.