Regular data classification techniques are based mainly on two strong assumptions: (1) the existence of a reasonably large labeled set of data to be used in training; and (2) future input data instances conform to the distribution of the training set, i. e. data distribution is stationary along time. However, in the case of data stream classification, both of the aforementioned assumptions are difficult to satisfy. In this paper, we present a graph-based semi-supervised approach that extends the static classifier based on the K-associated Optimal Graph to perform online semi-supervised classification tasks. In order to learn from labeled and unlabeled patterns, here we adapt the optimal graph construction to simultaneously spread the labels in the training set. The sparse, disconnected nature of the proposed graph structure gives flexibility to cope with non-stationary classification. Experimental comparison between the proposed method and three state-of-the-art ensemble classification methods is provided and promising results have been obtained. © 2012 The Brazilian Computer Society.
CITATION STYLE
Bertini, J. R., Lopes, A. de A., & Zhao, L. (2012). Partially labeled data stream classification with the semi-supervised K-associated graph. Journal of the Brazilian Computer Society, 18(4), 299–310. https://doi.org/10.1007/s13173-012-0072-8
Mendeley helps you to discover research relevant for your work.