Abstract
With the current increasing volume and dimensionality of data, traditional data classification algorithms are unable to satisfy the demands of practical classification applications of data streams. To deal with noise and concept drift in data streams, we propose an ensemble classification algorithm based on attribute reduction and a sliding window in this paper. Using mutual information, an approximate attribute reduction algorithm based on rough sets is used to reduce data dimensionality and increase the diversity of reduced results in the algorithm. A double-threshold concept drift detection method and a three-stage sliding window control strategy are introduced to improve the performance of the algorithm when dealing with both noise and concept drift. The classification precision is further improved by updating the base classifiers and their nonlinear weights. Experiments on synthetic datasets and actual datasets demonstrate the performance of the algorithm in terms of classification precision, memory use, and time efficiency.
Author supplied keywords
Cite
CITATION STYLE
Chen, Y., Li, O., Sun, Y., & Li, F. (2018). Ensemble classification of data streams based on attribute reduction and a sliding window. Applied Sciences (Switzerland), 8(4). https://doi.org/10.3390/app8040620
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.