Mining distributed evolving data streams using fractal GP ensembles

Gianluigi Folino; Clara Pizzuti; Giandomenico Spezzano

Conference Proceedings

Mining distributed evolving data streams using fractal GP ensembles

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4445 LNCS 160-169

DOI: 10.1007/978-3-540-71605-1_15

19Citations

29Readers

Get full text

Abstract

A Genetic Programming based boosting ensemble method for the classification of distributed streaming data is proposed. The approach handles flows of data coming from multiple locations by building a global model obtained by the aggregation of the local models coming from each node. A main characteristics of the algorithm presented is its adaptability in presence of concept drift. Changes in data can cause serious deterioration of the ensemble performance. Our approach is able to discover changes by adopting a strategy based on self-similarity of the ensemble behavior, measured by its fractal dimension, and to revise itself by promptly restoring classification accuracy. Experimental results on a synthetic data set show the validity of the approach in maintaining an accurate and up-to-date GP ensemble. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Folino, G., Pizzuti, C., & Spezzano, G. (2007). Mining distributed evolving data streams using fractal GP ensembles. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4445 LNCS, pp. 160–169). Springer Verlag. https://doi.org/10.1007/978-3-540-71605-1_15

Mining distributed evolving data streams using fractal GP ensembles

Abstract

Cite

Register to see more suggestions