Missing data imputation via denoising autoencoders: The untold story

Adriana Fonseca Costa; Miriam Seoane Santos; Jastin Pompeu Soares; Pedro Henriques Abreu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018), vol. 11191 LNCS, pp. 87–98

DOI: 10.1007/978-3-030-01768-2_8

30 Citations

39 Readers

Abstract

Missing data is the absence of information in a dataset; because it directly influences classification performance, it cannot be neglected. Over the years, several studies have presented imputation strategies to deal with the three missing data mechanisms: Missing Completely At Random, Missing At Random, and Missing Not At Random. However, no studies have examined the influence of all three mechanisms on recent high-performance Artificial Intelligence techniques such as Deep Learning. The goal of this work is to compare state-of-the-art imputation techniques with a Stacked Denoising Autoencoders approach. To that end, the missing data mechanisms were synthetically generated in 6 different ways, 8 different imputation techniques were implemented, and 33 complete datasets from different open source repositories were selected. The results showed that Support Vector Machines imputation ensures the best classification performance, while Multiple Imputation by Chained Equations performs better in terms of imputation quality.
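
To make the three mechanisms concrete, the following is a minimal sketch of how missingness could be injected synthetically into one feature of a complete dataset. It is not the authors' procedure: the abstract does not describe their 6 generation schemes, so the "lowest values go missing" rule, the function names, the 20% rate, and the toy Gaussian data are all assumptions made for illustration. Missing Completely At Random (MCAR) ignores the data, Missing At Random (MAR) depends on another, always-observed feature, and Missing Not At Random (MNAR) depends on the value that is itself removed.

```python
import math
import random


def lowest_fraction(values, rate):
    """Return the indices of the round(rate * n) smallest values."""
    k = round(rate * len(values))
    order = sorted(range(len(values)), key=lambda i: values[i])
    return set(order[:k])


def mcar_rows(rows, rate, rng):
    """MCAR: each row loses the target value with the same probability."""
    return {i for i in range(len(rows)) if rng.random() < rate}


def mar_rows(rows, rate, driver=1):
    """MAR: the target goes missing where an always-observed column is lowest."""
    return lowest_fraction([r[driver] for r in rows], rate)


def mnar_rows(rows, rate, target=0):
    """MNAR: the target goes missing where its own value is lowest."""
    return lowest_fraction([r[target] for r in rows], rate)


def apply_missing(rows, missing, target=0):
    """Copy the data and set the target column to NaN in the selected rows."""
    out = [list(r) for r in rows]
    for i in missing:
        out[i][target] = math.nan
    return out


# Hypothetical complete dataset: 200 rows, 3 Gaussian features.
rng = random.Random(0)
data = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(200)]
rate = 0.2  # assumed missing rate, not taken from the paper

mcar = apply_missing(data, mcar_rows(data, rate, rng))
mar = apply_missing(data, mar_rows(data, rate))
mnar = apply_missing(data, mnar_rows(data, rate))
```

Each resulting dataset could then be passed to an imputation method and compared against `data`, which is the kind of evaluation the abstract describes. Thresholding on the lowest values is only one of several possible ways to simulate MAR and MNAR.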

Citation (APA)

Costa, A. F., Santos, M. S., Soares, J. P., & Abreu, P. H. (2018). Missing data imputation via denoising autoencoders: The untold story. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11191 LNCS, pp. 87–98). Springer Verlag. https://doi.org/10.1007/978-3-030-01768-2_8
