Workload-independent data-driven vertical partitioning

Nikita Bobrov; George Chernishev; Boris Novikov

Conference Proceedings

Workload-independent data-driven vertical partitioning

Communications in Computer and Information Science (2017) 767 275-284

DOI: 10.1007/978-3-319-67162-8_27

3Citations

3Readers

Get full text

Abstract

Vertical partitioning is a well-explored area of automatic physical database design. The classic approach is as follows: Derive an optimal vertical partitioning scheme for a given database and a workload. The workload describes queries, their frequencies, and involved attributes. In this paper we identify a novel class of vertical partitioning algorithms. The algorithms of this class do not rely on knowledge of the workload, but instead use data properties that are contained in the workload itself. We propose such algorithm that uses a logical scheme represented by functional dependencies, which are derived from stored data. In order to discover functional dependencies we use TANE — a popular functional dependency extraction algorithm. We evaluate our algorithm using an industrial DBMS (PostgreSQL) on number of workloads. We compare the performance of an unpartitioned configuration with partitions produced by our algorithm and several state-of-the-art workload-aware algorithms.

Author supplied keywords

Cite

CITATION STYLE

APA

Bobrov, N., Chernishev, G., & Novikov, B. (2017). Workload-independent data-driven vertical partitioning. In Communications in Computer and Information Science (Vol. 767, pp. 275–284). Springer Verlag. https://doi.org/10.1007/978-3-319-67162-8_27

Workload-independent data-driven vertical partitioning

Abstract

Author supplied keywords

Cite

Register to see more suggestions