Ensemble Clustering for Novelty Detection in Data Streams

Kemilly Dearo Garcia; Elaine Ribeiro de Faria; Cláudio Rebelo de Sá; João Mendes-Moreira; Charu C. Aggarwal; André C.P.L.F. de Carvalho; Joost N. Kok

Conference Proceedings

Ensemble Clustering for Novelty Detection in Data Streams

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11828 LNAI 460-470

DOI: 10.1007/978-3-030-33778-0_34

1Citations

9Readers

Get full text

Abstract

In data streams new classes can appear over time due to changes in the data statistical distribution. Consequently, models can become outdated, which requires the use of incremental learning algorithms capable of detecting and learning the changes over time. However, when a single classification model is used for novelty detection, there is a risk that its bias may not be suitable for new data distributions. A solution could be the combination of several models into an ensemble. Besides, because models can only be updated when labeled data arrives, we propose two unsupervised ensemble approaches: one combining clustering partitions using the same clustering technique; and other using different clustering techniques. We compare the performance of the proposed methods with well known novelty detection algorithms. The methods were tested on datasets commonly used in the novelty detection literature. The experimental results show that proposed ensembles have competitive performance for novelty detection in data streams.

Author supplied keywords

Cite

CITATION STYLE

APA

Garcia, K. D., de Faria, E. R., de Sá, C. R., Mendes-Moreira, J., Aggarwal, C. C., de Carvalho, A. C. P. L. F., & Kok, J. N. (2019). Ensemble Clustering for Novelty Detection in Data Streams. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11828 LNAI, pp. 460–470). Springer. https://doi.org/10.1007/978-3-030-33778-0_34

Ensemble Clustering for Novelty Detection in Data Streams

Abstract

Author supplied keywords

Cite

Register to see more suggestions