An evaluation of streaming algorithms for distinct counting over a sliding window

Abstract

Counting the number of distinct elements in a data stream (distinct counting) is a fundamental aggregation task in database query processing, query optimization, and network monitoring. On a stream, an aggregate is often needed over only the most recent elements, which leads to the problem of distinct counting over a "sliding window" of the stream. We present a detailed experimental study of the performance of different algorithms for distinct counting over a sliding window. We observe that the performance of an algorithm depends on the basic method used, as well as on aspects such as the hash function, the mix of queries and updates, and the method used to boost accuracy. We compare the performance of prominent algorithms and evaluate the influence of these factors, leading to practical recommendations for implementation. To the best of our knowledge, this is the first detailed experimental study of distinct counting over a sliding window.
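
To make the problem statement concrete, the following is a minimal sketch of an exact baseline for distinct counting over a count-based sliding window. It is not one of the algorithms evaluated in the article, which this abstract does not name. The class name `ExactWindowDistinct`, its methods, and the choice of a window measured in stream positions are all assumptions made for illustration. The sketch stores one entry per distinct element in the window, which is the memory cost that approximate streaming algorithms aim to avoid.

```python
from collections import OrderedDict


class ExactWindowDistinct:
    """Exact distinct count over the last `window` stream positions.

    Hypothetical baseline for illustration only; memory grows with the
    number of distinct elements currently in the window.
    """

    def __init__(self, window: int):
        if window <= 0:
            raise ValueError("window must be positive")
        self.window = window
        self.clock = 0  # position of the most recent element
        # Maps element -> most recent position, ordered oldest first.
        self.last_seen = OrderedDict()

    def update(self, item) -> None:
        """Process one stream element."""
        self.clock += 1
        # Remove and re-insert so the item moves to the most-recent end.
        self.last_seen.pop(item, None)
        self.last_seen[item] = self.clock
        self._expire()

    def _expire(self) -> None:
        """Drop elements whose latest occurrence fell out of the window."""
        cutoff = self.clock - self.window  # positions <= cutoff are expired
        while self.last_seen:
            _, position = next(iter(self.last_seen.items()))
            if position > cutoff:
                break
            self.last_seen.popitem(last=False)

    def query(self) -> int:
        """Return the number of distinct elements in the current window."""
        return len(self.last_seen)


if __name__ == "__main__":
    counter = ExactWindowDistinct(window=3)
    for element in ["a", "b", "a", "c", "d"]:
        counter.update(element)
        print(element, counter.query())
```

Each update and query here takes amortized constant time, so the interesting trade-off for the approximate methods is memory and accuracy rather than speed against this baseline.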

Citation (APA)

Singh, S. A., & Tirthapura, S. (2015). An evaluation of streaming algorithms for distinct counting over a sliding window. Frontiers in ICT, 2(NOV). https://doi.org/10.3389/fict.2015.00023
