A Genetic Programming based boosting ensemble method for the classification of distributed streaming data is proposed. The approach handles flows of data coming from multiple locations by building a global model obtained by the aggregation of the local models coming from each node. A main characteristics of the algorithm presented is its adaptability in presence of concept drift. Changes in data can cause serious deterioration of the ensemble performance. Our approach is able to discover changes by adopting a strategy based on self-similarity of the ensemble behavior, measured by its fractal dimension, and to revise itself by promptly restoring classification accuracy. Experimental results on a synthetic data set show the validity of the approach in maintaining an accurate and up-to-date GP ensemble. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Folino, G., Pizzuti, C., & Spezzano, G. (2007). Mining distributed evolving data streams using fractal GP ensembles. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4445 LNCS, pp. 160–169). Springer Verlag. https://doi.org/10.1007/978-3-540-71605-1_15
Mendeley helps you to discover research relevant for your work.