Clustering Data Streams: A Complex Network Approach

Sandy Porto; Marcos G. Quiles

Conference Proceedings

Clustering Data Streams: A Complex Network Approach

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11619 LNCS 52-65

DOI: 10.1007/978-3-030-24289-3_5

2Citations

5Readers

Get full text

Abstract

Clustering data streams is an interesting and challenging problem. Although several solutions have been proposed in the literature, some drawbacks remain. For instance, how to deal effectively with the offline process for partitioning the micro-clusters into macro-clusters is still an open problem. Typically, the k-means algorithm is considered in this phase, which despite precise results, require a mandatory user-defined parameter k, that defines the number of expected clusters. In this paper, we propose a new clustering method for data stream, named Prototype Networks. This method takes the complex network structure to represent the set of micro-clusters. This approach has proven to be advantageous mainly because these networks have an inherent community structure. As a consequence, the offline phase might be easily handled by a community detection algorithm, such as Infomap. The communities detected represents the cluster structure of the data assuming that the network construction was designed for this purpose. Computer experiments demonstrated the feasibility of the proposed approach. Moreover, the proposed method can detect automatically the number of clusters in evolving scenarios, which is a useful feature when dealing with data streams with concept drift.

Author supplied keywords

Cite

CITATION STYLE

APA

Porto, S., & Quiles, M. G. (2019). Clustering Data Streams: A Complex Network Approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11619 LNCS, pp. 52–65). Springer Verlag. https://doi.org/10.1007/978-3-030-24289-3_5

Clustering Data Streams: A Complex Network Approach

Abstract

Author supplied keywords

Cite

Register to see more suggestions