Optimal parallel I/O for range queries through replication

Keith Frikken; Mikhail Atallah; Sunil Prabhakar; Rei Safavi-Naini

Conference Proceedings

Optimal parallel I/O for range queries through replication

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2453 669-678

DOI: 10.1007/3-540-46146-9_66

16Citations

2Readers

Get full text

Abstract

In this paper we study the problem of declustering two dimensional datasets with replication over parallel devices to improve range query performance. The related problem of declustering without replication has been well studied. It has been established that strictly optimal declustering schemes do not exist if data is not replicated. In addition to the usual problem of identifying a good allocation, the replicated version of the problem needs to address the issue of identifying a good retrieval schedule for a given query. We address both problems in this paper. An efficient algorithm for finding a lowest cost retrieval schedule is developed. This algorithm works for any query, not just range queries. Two replicated placement schemes are presented - one that results in a strictly optimal allocation, and another that guarantees a retrieval cost that is either optimal or 1 more than the optimal for any range query.

Cite

CITATION STYLE

APA

Frikken, K., Atallah, M., Prabhakar, S., & Safavi-Naini, R. (2002). Optimal parallel I/O for range queries through replication. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2453, pp. 669–678). Springer Verlag. https://doi.org/10.1007/3-540-46146-9_66

Optimal parallel I/O for range queries through replication

Abstract

Cite

Register to see more suggestions