Optimizing the execution of multiple data analysis queries on parallel and distributed environments

Henrique Andrade; Tahsin Kurc; Alan Sussman; Joel Saltz

Journal Article

IEEE Transactions on Parallel and Distributed Systems (2004) 15(6) 520-532

DOI: 10.1109/TPDS.2004.11

14 Citations

13 Readers

Abstract

This paper investigates techniques for efficiently executing multiquery workloads from data- and computation-intensive applications in parallel and/or distributed computing environments. In this context, we describe a database optimization framework that supports data and computation reuse, query scheduling, and active semantic caching to speed up the evaluation of multiquery workloads. Its most striking feature is the ability to optimize the execution of queries in the presence of application-specific constructs by employing a customizable data and computation reuse model. Furthermore, we discuss how the proposed optimization model is flexible enough to work efficiently irrespective of the underlying parallel/distributed environment. To evaluate the proposed optimization techniques, we present experimental evidence using real data analysis applications. For this purpose, a common implementation of the queries under study was provided according to the database optimization framework and deployed on three distinct experimental configurations: a shared-memory multiprocessor, a cluster of workstations, and a distributed computational Grid-like environment.

APA

Andrade, H., Kurc, T., Sussman, A., & Saltz, J. (2004). Optimizing the execution of multiple data analysis queries on parallel and distributed environments. IEEE Transactions on Parallel and Distributed Systems, 15(6), 520–532. https://doi.org/10.1109/TPDS.2004.11
