Active learning by clustering for drifted data stream classification

Jakub Zgraja; João Gama; Michał Woźniak

Conference Proceedings, Open Access

Communications in Computer and Information Science (2019) 967 80-90

DOI: 10.1007/978-3-030-14880-5_7

Abstract

In data stream classifier learning, we usually assume that the labels of all incoming examples are available without delay and can be used to update the employed predictive model. Unfortunately, this assumption of access to all class labels is naive, and it requires a relatively high labeling budget. Methods that can train data stream classifiers on partially labeled data are therefore highly desirable. Among them, active learning [1] seems a promising direction: it focuses on selecting only the most valuable learning examples to be labeled and used to produce an accurate predictive model. However, when designing such a system, we have to ensure that the chosen active learning strategy can handle changes in the data distribution and adapt to them quickly. In this work, we focus on novel active learning strategies designed to tackle such changes effectively. We propose a novel active learning method for data stream classifiers based on a query-by-clustering approach. Experimental evaluation demonstrates the usefulness of the proposed approach for reducing the labeling cost of classifiers for drifting data streams.
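The abstract does not spell out the query-by-clustering procedure, so the following is only a generic illustration of the idea, not the authors' algorithm. It assumes one plausible reading: cluster an incoming chunk, ask the oracle for the label of a single representative per cluster, and propagate that label to the rest of the cluster. The names `query_by_clustering`, `kmeans`, and `oracle`, the farthest-point initialization, and the use of k-means itself are all assumptions made for this sketch; drift detection and classifier updating are left out.

```python
def dist2(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(points, centroids):
    """Index of the closest centroid for every point."""
    return [min(range(len(centroids)), key=lambda c: dist2(p, centroids[c]))
            for p in points]

def init_centroids(points, k):
    """Deterministic farthest-point initialization (an assumption of this sketch)."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(dist2(p, c) for c in centroids)))
    return centroids

def kmeans(points, k, iters=10):
    """Plain Lloyd's k-means; returns final centroids and cluster assignments."""
    centroids = init_centroids(points, k)
    for _ in range(iters):
        assign = nearest(points, centroids)
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old centroid if a cluster went empty
                centroids[c] = tuple(sum(col) / len(members)
                                     for col in zip(*members))
    return centroids, nearest(points, centroids)

def query_by_clustering(chunk, oracle, k):
    """Label a whole chunk using at most k oracle queries.

    For each cluster, the example closest to the centroid is sent to the
    oracle, and its label is copied to every other member of that cluster.
    """
    centroids, assign = kmeans(chunk, k)
    labels = [None] * len(chunk)
    queries = 0
    for c in range(k):
        members = [i for i, a in enumerate(assign) if a == c]
        if not members:
            continue
        rep = min(members, key=lambda i: dist2(chunk[i], centroids[c]))
        label = oracle(chunk[rep])  # the only paid labeling request
        queries += 1
        for i in members:
            labels[i] = label
    return labels, queries

# Hypothetical chunk: two well-separated groups, oracle labels by position.
chunk = [(0, 0), (0, 1), (1, 0), (1, 1),
         (10, 10), (10, 11), (11, 10), (11, 11)]
labels, queries = query_by_clustering(chunk, lambda p: int(p[0] > 5), k=2)
print(labels, queries)
```

In this toy chunk, eight examples are labeled at the cost of two oracle queries. In a streaming setting, the propagated labels would then be passed to an incremental classifier, and the clustering would be repeated on each new chunk so that the queried representatives follow the changing distribution.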

Cite

Zgraja, J., Gama, J., & Woźniak, M. (2019). Active learning by clustering for drifted data stream classification. In Communications in Computer and Information Science (Vol. 967, pp. 80–90). Springer Verlag. https://doi.org/10.1007/978-3-030-14880-5_7
