Lossy Conservative Update (LCU) Sketch: Succinct Approximate Count Storage

2Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we propose a variant of the conservative-update Count-Min sketch to further reduce the overestimation error incurred. Inspired by ideas from lossy counting, we divide a stream of items into multiple windows, and decrement certain counts in the sketch at window boundaries. We refer to this approach as a lossy conservative update (LCU). The reduction in overestimation error of counts comes at the cost of introducing under-estimation error in counts. However, in our intrinsic evaluations, we show that the reduction in overestimation is much greater than the under-estimation error introduced by our method LCU. We apply our LCU framework to scale distributional similarity computations to web-scale corpora. We show that this technique is more efficient in terms of memory, and time, and more robust than conservative update with Count-Min (CU) sketch on this task.

Cite

CITATION STYLE

APA

Goyal, A., & Daumé, H. (2011). Lossy Conservative Update (LCU) Sketch: Succinct Approximate Count Storage. In Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011 (pp. 878–883). AAAI Press. https://doi.org/10.1609/aaai.v25i1.7976

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free