Algorithms for the database layout problem

Gagan Aggarwal; Tomás Feder; Rajeev Motwani; Rina Panigrahy; An Zhu

Conference Proceedings

Algorithms for the database layout problem

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3363 LNCS 189-203

DOI: 10.1007/978-3-540-30570-5_13

3Citations

3Readers

Get full text

Abstract

We present a formal analysis of the database layout problem, i.e., the problem of determining how database objects such as tables and indexes are assigned to disk drives. Optimizing this layout has a direct impact on the I/O performance of the entire system. The traditional approach of striping each object across all available disk drives is aimed at optimizing I/O parallelism; however, it is suboptimal when queries co-access two or more database objects, e.g., during a merge join of two tables, due to the increase in random disk seeks. We adopt an existing model, which takes into account both the benefit of I/O parallelism and the overhead due to random disk accesses, in the context of a query workload which includes co-access of database objects. The resulting optimization problem is intractable in general and we employ techniques from approximation algorithms to present provable performance guarantees. We show that while optimally exploiting I/O parallelism alone suggests uniformly striping data objects (even for heterogeneous files and disks), optimizing random disk access alone would assign each data object to a single disk drive. This confirms the intuition that the two effects are in tension with each other. We provide approximation algorithms in an attempt to optimize the trade-off between the two effects. We show that our algorithm achieves the best possible approximation ratio. © 2005 Springer-Verlag.

Cite

CITATION STYLE

APA

Aggarwal, G., Feder, T., Motwani, R., Panigrahy, R., & Zhu, A. (2005). Algorithms for the database layout problem. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3363 LNCS, pp. 189–203). https://doi.org/10.1007/978-3-540-30570-5_13

Algorithms for the database layout problem

Abstract

Cite

Register to see more suggestions