A hybrid GP-KNN imputation for symbolic regression with missing values

Baligh Al-Helali; Qi Chen; Bing Xue; Mengjie Zhang

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11320 LNAI 345-357

DOI: 10.1007/978-3-030-03991-2_33

26 Citations

14 Readers

Abstract

In data science, missingness is a serious challenge when dealing with real-world data sets. Although many imputation approaches have been proposed to tackle missing values in machine learning, most studies focus on classification rather than regression. To the best of our knowledge, no study has investigated the use of imputation methods when performing symbolic regression on incomplete real-world data sets. In this work, we propose a new imputation method called GP-KNN, a hybrid method that combines two concepts: Genetic Programming Imputation (GPI) and K-Nearest Neighbour (KNN). GP-KNN considers both feature and instance relevance. The experimental results show that the proposed method performs better than state-of-the-art imputation methods in most of the considered cases with respect to both imputation accuracy and symbolic regression performance.
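
To make the instance-relevance idea concrete, the following is a minimal sketch of plain KNN imputation only. It is not the GP-KNN method of the paper, whose details the abstract does not give. The function name `knn_impute`, the use of `None` for missing entries, the per-feature-averaged Euclidean distance, and the toy data are all assumptions made for illustration.

```python
import math


def knn_impute(rows, k=2):
    """Fill None entries with the mean of that feature over the k nearest rows.

    Hypothetical helper for illustration; not taken from the paper.
    """

    def distance(a, b):
        # Compare only features observed in both rows, and average the squared
        # differences so rows sharing few features are not favoured.
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    imputed = [list(r) for r in rows]  # copy so the input is left unchanged
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate donors: other rows in which feature j is observed.
            donors = [
                (distance(row, other), other[j])
                for m, other in enumerate(rows)
                if m != i and other[j] is not None
            ]
            donors.sort(key=lambda d: d[0])
            nearest = [v for _, v in donors[:k]]
            if nearest:  # leave the gap if no donor exists
                imputed[i][j] = sum(nearest) / len(nearest)
    return imputed


# Toy data (assumed): the second row is missing its third feature.
data = [
    [1.0, 2.0, 3.0],
    [1.1, 2.1, None],
    [0.9, 1.9, 2.8],
    [8.0, 9.0, 10.0],
]
filled = knn_impute(data, k=2)
```

Here the missing entry is filled from the two rows closest to it on the observed features, so the distant fourth row does not contribute. A hybrid along the lines the abstract describes would additionally use a model over features (the GP part) rather than a simple neighbour mean.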

Citation (APA)

Al-Helali, B., Chen, Q., Xue, B., & Zhang, M. (2018). A hybrid GP-KNN imputation for symbolic regression with missing values. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11320 LNAI, pp. 345–357). Springer Verlag. https://doi.org/10.1007/978-3-030-03991-2_33
