Using classifier-based nominal imputation to improve machine learning

Xiaoyuan Su; Russell Greiner; Taghi M. Khoshgoftaar; Amri Napolitano

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6634 LNAI(PART 1) 124-135

DOI: 10.1007/978-3-642-20841-6_11

Abstract

Many learning algorithms perform poorly when the training data are incomplete. One standard approach first imputes the missing values, then gives the completed data to the learning algorithm. However, imputation is especially problematic when the features are nominal. This work presents "classifier-based nominal imputation" (CNI), an easy-to-implement and effective technique that treats nominal imputation as classification: it learns a classifier for each feature (mapping the other features of an instance to the predicted value of that feature), then uses that classifier to predict the missing values of that feature. Our empirical results show that learners that preprocess their incomplete training data with CNI, using support vector machine or decision tree learners as the imputation classifier, have significantly higher predictive accuracy than learners that (1) do not use preprocessing, (2) use baseline imputation techniques, or (3) use the CNI preprocessor with other classification algorithms. This improvement is especially apparent when the base learner is instance-based. CNI is also found to be helpful for other base learners, such as naïve Bayes and decision trees, on incomplete nominal data. © 2011 Springer-Verlag.
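The per-feature scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. To stay self-contained it swaps in a simple nearest-neighbour vote (Hamming distance) for the support vector machine and decision tree learners the abstract reports as working best. The names `cni_impute`, `knn_predict`, and `hamming`, the use of `None` to encode missing values, and the choice to train only on rows where the target feature is observed are all assumptions made for illustration.

```python
from collections import Counter

MISSING = None  # assumption: missing nominal values are encoded as None


def hamming(a, b):
    """Count positions where two rows disagree; a missing value never matches."""
    return sum(1 for x, y in zip(a, b) if x is MISSING or y is MISSING or x != y)


def knn_predict(train_x, train_y, query, k=3):
    """Stand-in classifier (not from the paper): majority label of the k nearest rows."""
    ranked = sorted(range(len(train_x)), key=lambda i: hamming(train_x[i], query))
    votes = Counter(train_y[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


def cni_impute(rows, predict=knn_predict):
    """Fill each missing value by classifying it from the row's other features."""
    rows = [list(r) for r in rows]       # work on copies; the input is not modified
    completed = [list(r) for r in rows]
    for j in range(len(rows[0])):
        def others(r):
            # All features of a row except feature j
            return r[:j] + r[j + 1:]

        # Assumption: train only on rows where feature j is observed
        known = [r for r in rows if r[j] is not MISSING]
        if not known:
            continue  # nothing to learn from for this feature
        train_x = [others(r) for r in known]
        train_y = [r[j] for r in known]

        # Predict feature j wherever it is missing, from the original data
        for i, r in enumerate(rows):
            if r[j] is MISSING:
                completed[i][j] = predict(train_x, train_y, others(r))
    return completed


# Hypothetical toy data: columns are outlook, temperature, rain
data = [
    ["sunny", "hot", "no"],
    ["sunny", "hot", "no"],
    ["rainy", "cool", "yes"],
    ["rainy", "cool", "yes"],
    ["sunny", None, "no"],
    ["rainy", "cool", None],
]
completed = cni_impute(data)
```

Any function with the same `(train_x, train_y, query)` signature can be passed as `predict`, so a decision tree or SVM wrapper could replace the nearest-neighbour stand-in. The completed data would then be handed to the base learner, as in the standard impute-then-learn approach the abstract describes.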

Cite

Su, X., Greiner, R., Khoshgoftaar, T. M., & Napolitano, A. (2011). Using classifier-based nominal imputation to improve machine learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6634 LNAI, pp. 124–135). Springer Verlag. https://doi.org/10.1007/978-3-642-20841-6_11
