Cache-, hash- and space-efficient Bloom filters

Felix Putze; Peter Sanders; Johannes Singler

Conference Proceedings

Cache-, hash- and space-efficient Bloom filters

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4525 LNCS 108-121

DOI: 10.1007/978-3-540-72845-0_9

60Citations

62Readers

Get full text

Abstract

A Bloom filter is a very compact data structure that supports approximate membership queries on a set, allowing false positives. We propose several new variants of Bloom filters and replacements with similar functionality. All of them have a better cache-efficiency and need less hash bits than regular Bloom filters. Some use SIMD functionality, while the others provide an even better space efficiency. As a consequence, we get a more flexible trade-off between false positive rate, space-efficiency, cache-efficiency, hash-efficiency, and computational effort. We analyze the efficiency of Bloom filters and the proposed replacements in detail, in terms of the false positive rate, the number of expected cache-misses, and the number of required hash bits. We also describe and experimentally evaluate the performance of highly-tuned implementations. For many settings, our alternatives perform better than the methods proposed so far. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Putze, F., Sanders, P., & Singler, J. (2007). Cache-, hash- and space-efficient Bloom filters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4525 LNCS, pp. 108–121). Springer Verlag. https://doi.org/10.1007/978-3-540-72845-0_9

Cache-, hash- and space-efficient Bloom filters

Abstract

Cite

Register to see more suggestions