Hydra: Extending shared address programming for accelerator clusters


Abstract

This work extends shared-address programming to accelerator clusters by pursuing a simple form of shared-address programming, named HYDRA, in which the programmer only specifies the parallel regions in the program. We present a fully automatic translation system that generates an MPI + accelerator program from a HYDRA program. Our mechanism ensures scalability of the generated program by optimizing data placement and transfer to and from the limited, discrete memories of accelerator devices. We also present a compiler design built on a high-level IR to support multiple accelerator architectures. Evaluation results demonstrate the scalability of the translated programs on five well-known benchmarks. On average, HYDRA gains a 24.54x speedup over single-accelerator performance when running on a 64-node Intel Xeon Phi cluster and a 27.56x speedup when running on a 64-node NVIDIA GPU cluster.
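The abstract does not show HYDRA's actual directive syntax, but the core translation step it describes — mapping a programmer-marked parallel region onto MPI ranks so that each node only allocates and transfers its own slice of data to accelerator memory — can be illustrated with a hedged sketch. The function below is a hypothetical example (not HYDRA's API) of the block partitioning of an iteration space that such a translator might compute per rank:

```python
# Hypothetical illustration, NOT HYDRA's actual API: one way a
# source-to-source translator could split a parallel region's
# iteration space into per-rank blocks, so each MPI rank transfers
# only its slice of the arrays to its accelerator's discrete memory.

def block_partition(n_iterations, n_ranks):
    """Return a list of (start, end) iteration bounds, one per rank.

    Remainder iterations are spread over the lowest-numbered ranks,
    keeping block sizes within one iteration of each other.
    """
    base, rem = divmod(n_iterations, n_ranks)
    bounds = []
    start = 0
    for rank in range(n_ranks):
        size = base + (1 if rank < rem else 0)
        bounds.append((start, start + size))
        start += size
    return bounds

# Example: 100 iterations distributed over 8 ranks.
print(block_partition(100, 8))
```

Each rank would then launch its accelerator kernel over only its `(start, end)` range, which is the property that lets data placement and transfer scale with the number of nodes.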

Citation (APA)

Sakdhnagool, P., Sabne, A., & Eigenmann, R. (2016). Hydra: Extending shared address programming for accelerator clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9519, pp. 140–155). Springer Verlag. https://doi.org/10.1007/978-3-319-29778-1_9
