A classifier using online bagging ensemble method for big data stream learning

Abstract

In big data stream classification under concept drift, an ensemble that combines multiple weak learners can achieve better generalization performance than a single learner. In this paper, we present EoBag, an efficient classifier that uses the online bagging ensemble method for big data stream learning. The classifier introduces an efficient online resampling mechanism for the training instances and uses a robust coding method based on error-correcting output codes, which reduces the correlation between the base classifiers and increases the diversity of the ensemble. A dynamic updating model based on classification performance reduces unnecessary update operations and improves learning efficiency. We also implement a parallel version of EoBag, which runs faster than the serial version while achieving almost the same classification performance. Finally, we compare classification performance and resource usage with other state-of-the-art algorithms on both artificial and real data sets. The results show that the proposed algorithm achieves better accuracy and more practical resource usage for big data stream classification.
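
For readers unfamiliar with online bagging, the following is a minimal Python sketch of the classic scheme from Oza and Russell, in which each base learner trains on every incoming instance k times, with k drawn from a Poisson(1) distribution. It is not the resampling mechanism of the paper, which the abstract does not specify; it only illustrates the general idea of resampling a stream without storing it. The names `RunningMeanLearner`, `OnlineBagging`, and `poisson1` are hypothetical and chosen for illustration, and the base learner is deliberately trivial.

```python
import math
import random
from collections import Counter, defaultdict


def poisson1(rng):
    """Draw k ~ Poisson(1) using Knuth's multiplication method."""
    threshold = math.exp(-1.0)
    k, p = 0, rng.random()
    while p > threshold:
        k += 1
        p *= rng.random()
    return k


class RunningMeanLearner:
    """Hypothetical incremental base learner: nearest class mean on one numeric feature."""

    def __init__(self):
        self.count = defaultdict(int)
        self.total = defaultdict(float)

    def learn(self, x, y):
        # Update the running sum and count for class y.
        self.count[y] += 1
        self.total[y] += x

    def predict(self, x):
        if not self.count:
            return None  # no training data seen yet
        return min(self.count, key=lambda c: abs(x - self.total[c] / self.count[c]))


class OnlineBagging:
    """Classic online bagging (assumed here, not the paper's variant)."""

    def __init__(self, n_models=10, seed=0):
        self.rng = random.Random(seed)
        self.models = [RunningMeanLearner() for _ in range(n_models)]

    def learn(self, x, y):
        # Each model sees the instance k ~ Poisson(1) times, which
        # approximates bootstrap sampling on an unbounded stream.
        for model in self.models:
            for _ in range(poisson1(self.rng)):
                model.learn(x, y)

    def predict(self, x):
        # Majority vote over the models that have seen any data.
        votes = Counter(m.predict(x) for m in self.models)
        votes.pop(None, None)
        return votes.most_common(1)[0][0] if votes else None
```

Because each instance is processed once and then discarded, memory use depends on the number of base learners and not on the length of the stream. The differing Poisson draws give each learner a different effective training set, which is the source of ensemble diversity in this basic scheme.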

Cite

Lv, Y., Peng, S., Yuan, Y., Wang, C., Yin, P., Liu, J., & Wang, C. (2019). A classifier using online bagging ensemble method for big data stream learning. Tsinghua Science and Technology, 24(4), 379–388. https://doi.org/10.26599/TST.2018.9010119
