Optimizing I/O costs of multi-dimensional queries using bitmap indices

8Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Bitmap indices are efficient data structures for processing complex, multi-dimensional queries in data warehouse applications and scientific data analysis. For high-cardinality attributes, a common approach is to build bitmap indices with binning. This technique partitions the attribute values into a number of ranges, called bins, and uses bitmap vectors to represent bins (attribute ranges) rather than distinct values. In order to yield exact query answers, parts of the original data values have to be read from disk for checking against the query constraint. This process is referred to as candidate check and usually dominates the total query processing time. In this paper we study several strategies for optimizing the candidate check cost for multi-dimensional queries. We present an efficient candidate check algorithm based on attribute value distribution, query distribution as well as query selectivity with respect to each dimension. We also show that re-ordering the dimensions during query evaluation can be used to reduce I/O costs. We tested our algorithm on data with various attribute value distributions and query distributions. Our approach shows a significant improvement over traditional binning strategies for bitmap indices. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Rotem, D., Stockinger, K., & Wu, K. (2005). Optimizing I/O costs of multi-dimensional queries using bitmap indices. In Lecture Notes in Computer Science (Vol. 3588, pp. 220–229). Springer Verlag. https://doi.org/10.1007/11546924_22

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free