Stream data are always fast, real-time, infinite and change over time, in this paper, we propose a semi-supervised learning based ensemble classifier for solving recurring data concept drift problem. Our baseline classifiers group both labeled and unlabeled instances as the training points to obtain better learning efficiency from limited data samples, historical information are kept as part of weight decision factor when building the ensemble classifier, which helps keeping classifier ensemble set in a reasonable range without losing those repeated features. The empirical study shows that our new approach outperforms the general ensemble model and is suitable for recurring massive stream data classification. © 2014 Springer International Publishing.
CITATION STYLE
Zhang, B., Chen, D., Zu, Q., Mao, Y., Pan, Y., & Zhang, X. (2014). A new semi-supervised learning based ensemble classifier for recurring data stream. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8351 LNCS, pp. 759–765). Springer Verlag. https://doi.org/10.1007/978-3-319-09265-2_77
Mendeley helps you to discover research relevant for your work.