Online feature selection and classification with incomplete data

Habil Kalkan

Journal ArticleOPEN ACCESS

Online feature selection and classification with incomplete data

Kalkan H

Turkish Journal of Electrical Engineering and Computer Sciences (2014) 22(6) 1625-1636

DOI: 10.3906/elk-1301-181

2Citations

7Readers

Abstract

This paper presents a classification system in which learning, feature selection, and classification for incomplete data are simultaneously carried out in an online manner. Learning is conducted on a predefined model including the class-dependent mean vectors and correlation coefficients, which are obtained by incrementally processing the incoming observations with missing features. A nearest neighbor with a Gaussian mixture model, whose parameters are also estimated from the trained model, is used for classification. When a testing observation is received, the algorithm discards the missing attributes on the observation and ranks the available features by performing feature selection on the model that has been trained so far. The developed algorithm is tested on a benchmark dataset. The effect of missing features for online feature selection and classification are discussed and presented. The algorithm easily converges to the stable state of feature selection with similar accuracy results as those when using the complete and incomplete feature set with up to 50% missing data.

Author supplied keywords

Cite

CITATION STYLE

APA

Kalkan, H. (2014). Online feature selection and classification with incomplete data. Turkish Journal of Electrical Engineering and Computer Sciences, 22(6), 1625–1636. https://doi.org/10.3906/elk-1301-181

Online feature selection and classification with incomplete data

Abstract

Author supplied keywords

Cite

Register to see more suggestions