Physical column organization in in-memory column stores

5Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Cost models are an essential part of database systems, as they are the basis of query performance optimization. Disk based systems are well understood and sophisticated models exist to compare various data structures and to estimate query costs based on disk IO operations. Cost models for in-memory databases shift the focus from disk IOs to main memory accesses and CPU costs. However, modeling memory accesses is fundamentally different and common models do not apply anymore. In this work, we examine the plan operations scan with equality selection, scan with range selection, positional lookup and insert in inmemory column stores regarding different physical column organizations. We consider uncompressed columns, bit compressed and dictionary encoded columns with sorted and unsorted dictionaries. Furthermore, we discuss tree indices on columns and dictionaries and present a detailed parameter evaluation, considering the number of distinct values, value skewness and value disorder. Finally, we present and evaluate a cost model based on cache misses for estimating the runtime of the discussed plan operations. © Springer-Verlag 2013.

Cite

CITATION STYLE

APA

Schwalb, D., Faust, M., Krueger, J., & Plattner, H. (2013). Physical column organization in in-memory column stores. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7826 LNCS, pp. 48–63). https://doi.org/10.1007/978-3-642-37450-0_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free