Clustering Data Streams: A Complex Network Approach

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Clustering data streams is an interesting and challenging problem. Although several solutions have been proposed in the literature, some drawbacks remain. For instance, how to deal effectively with the offline process for partitioning the micro-clusters into macro-clusters is still an open problem. Typically, the k-means algorithm is considered in this phase, which despite precise results, require a mandatory user-defined parameter k, that defines the number of expected clusters. In this paper, we propose a new clustering method for data stream, named Prototype Networks. This method takes the complex network structure to represent the set of micro-clusters. This approach has proven to be advantageous mainly because these networks have an inherent community structure. As a consequence, the offline phase might be easily handled by a community detection algorithm, such as Infomap. The communities detected represents the cluster structure of the data assuming that the network construction was designed for this purpose. Computer experiments demonstrated the feasibility of the proposed approach. Moreover, the proposed method can detect automatically the number of clusters in evolving scenarios, which is a useful feature when dealing with data streams with concept drift.

Cite

CITATION STYLE

APA

Porto, S., & Quiles, M. G. (2019). Clustering Data Streams: A Complex Network Approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11619 LNCS, pp. 52–65). Springer Verlag. https://doi.org/10.1007/978-3-030-24289-3_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free