FIDS: Monitoring frequent items over distributed data streams

Robert Fuller; Mehmed Kantardzic

Conference Proceedings

FIDS: Monitoring frequent items over distributed data streams

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4571 LNAI 464-478

DOI: 10.1007/978-3-540-73499-4_35

1Citations

2Readers

Get full text

Abstract

Many applications require the discovery of items which have occur frequently within multiple distributed data streams. Past solutions for this problem either require a high degree of error tolerance or can only provide results periodically. In this paper we introduce a new algorithm designed for continuously tracking frequent items over distributed data streams providing either exact or approximate answers. We tested the efficiency of our method using two real-world data sets. The results indicated significant reduction in communication cost when compared to naïve approaches and an existing efficient algorithm called Top-K Monitoring. Since our method does not rely upon approximations to reduce communication overhead and is explicitly designed for tracking frequent items, our method also shows increased quality in its tracking results. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Fuller, R., & Kantardzic, M. (2007). FIDS: Monitoring frequent items over distributed data streams. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4571 LNAI, pp. 464–478). Springer Verlag. https://doi.org/10.1007/978-3-540-73499-4_35

FIDS: Monitoring frequent items over distributed data streams

Abstract

Cite

Register to see more suggestions