Many data mining applications analyze structured data that span across many tables and accumulate in time. Incremental mining methods have been devised to adapt patterns to new tuples. However, they have been designed for data in one table only. We propose a method for incremental clustering on multiple interrelated streams - a multi-table stream: its components are streams that reference each other, arrive at different speeds and have attributes of a priori unknown value ranges. Our approach encompasses solutions for the maintenance of cach-es and sliding windows over the individual streams, the propagation of foreign keys across streams, the transformation of all streams into a single-table stream, and an incremental clustering algorithm that operates over that stream. We evaluate our method on two real datasets and show that it approximates well the performance of an ideal method that possesses unlimited resources and knows the future. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Siddiqui, Z. F., & Spiliopoulou, M. (2009). Combining multiple interrelated streams for incremental clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5566 LNCS, pp. 535–552). https://doi.org/10.1007/978-3-642-02279-1_38
Mendeley helps you to discover research relevant for your work.