One relevant problem in data quality is the presence of missing data. In cases where missing data are abundant, effective ways to deal with these absences could improve the performance of machine learning algorithms. Missing data can be treated using imputation. Imputation methods replace the missing data by values estimated from the available data. This paper presents Corai, an imputation algorithm which is an adaption of Co-training, a multi-view semi-supervised learning algorithm. The comparison of Corai with other imputation methods found in the literature in three data sets from UCI with different levels of missingness inserted into up to three attributes, shows that Corai tends to perform well in data sets at greater percentages of missingness and number of attributes with missing values. © 2008 Springer Berlin Heidelberg.
CITATION STYLE
Matsubara, E. T., Prati, R. C., Batista, G. E. A. P. A., & Monard, M. C. (2008). Missing value imputation using a semi-supervised rank aggregation approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5249 LNAI, pp. 217–226). Springer Verlag. https://doi.org/10.1007/978-3-540-88190-2_27
Mendeley helps you to discover research relevant for your work.