Abstract
In KDD procedure, to fill in missing data typically requires a very large investment of time and energy - often 80% to 90% of a data analysis project is spent in making the data reliable enough so that the results can be trustful. In this paper, we propose a SVM regression based algorithm for filling in missing data, i.e. set the decision attribute (output attribute) as the condition attribute (input attribute) and the condition attribute as the decision attribute, then use SVM regression to predict the condition attribute values. SARS data set experimental results show that SVM regression method has the highest precision. The method with which the value of the example that has the minimum distance to the example with missing value will be taken to fill in the missing values takes the second place, and the mean and median methods have lower precision. © Springer-Verlag Berlin Heidelberg 2005.
Cite
CITATION STYLE
Feng, H., Chen, G., Yin, C., Yang, B., & Chen, Y. (2005). A SVM regression based approach to filling in missing values. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3683 LNAI, pp. 581–587). https://doi.org/10.1007/11553939_83
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.