An efficient method of building an ensemble of classifiers in streaming data

Joung Woo Ryu; Mehmed M. Kantardzic; Myung Won Kim; A. Ra Khil

Conference Proceedings

An efficient method of building an ensemble of classifiers in streaming data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7678 LNCS 122-133

DOI: 10.1007/978-3-642-35542-4_11

9Citations

16Readers

Get full text

Abstract

To efficiently refine a classifier in streaming data such as sensor data and web log data we have to decide whether each streaming unlabeled datum is selected or not. The exiting methods refine a classifier based on a regular time interval. They refine a classifier even if the classification accuracy of the classifier is high. Also it uses a classifier even if the classification accuracy is low. In this paper, our ensemble method selects data in an online process that should be labeled. The selected data are used to build new classifiers of an ensemble. Our selection methodology uses training data that are applied to generate an ensemble of classifiers over streaming data. We compared the results of our ensemble approach and of a conventional ensemble approach where new classifiers for an ensemble are periodically generated. In experiments with ten benchmark data sets including three real streaming data sets, our ensemble approach generated 12.9% new classifiers for the chunk-based ensemble approach using partially labeled samples, and used an average of 10% labeled samples for the ten data sets. In all the experiments, our ensemble approach produced comparable classification accuracy. We showed that our approach can efficiently maintain the performance of an ensemble over streaming data. © Springer-Verlag 2012.

Author supplied keywords

Cite

CITATION STYLE

APA

Ryu, J. W., Kantardzic, M. M., Kim, M. W., & Khil, A. R. (2012). An efficient method of building an ensemble of classifiers in streaming data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7678 LNCS, pp. 122–133). https://doi.org/10.1007/978-3-642-35542-4_11

An efficient method of building an ensemble of classifiers in streaming data

Abstract

Author supplied keywords

Cite

Register to see more suggestions