Algorithmic skeleton framework for the orchestration of GPU computations

Abstract

The Graphics Processing Unit (GPU) is gaining popularity as a co-processor to the Central Processing Unit (CPU). However, harnessing its capabilities is a non-trivial exercise that requires good knowledge of parallel programming, all the more so as the complexity of these applications continues to rise. Languages such as StreamIt [1] and Lime [2] have addressed the offloading of composed computations to GPUs. However, to the best of our knowledge, no such support exists at library level. To this end, we propose Marrow, an algorithmic skeleton framework for the orchestration of OpenCL computations. Marrow expands the set of skeletons currently available for GPU computing and enables their combination, through nesting, into complex structures. Moreover, it introduces optimizations that overlap communication with computation, thus conjoining programming simplicity with performance gains in many application scenarios. We evaluated the framework from a performance perspective, comparing it against hand-tuned OpenCL programs. The results are favourable, indicating that Marrow's skeletons are both flexible and efficient in the context of GPU computing. © 2013 Springer-Verlag.
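As a rough illustration of what skeleton nesting and communication/computation overlap mean in practice, the sketch below is a minimal, conceptual C++ example only. The Map and Pipeline helpers and their signatures are assumptions made for this illustration, not Marrow's actual API; host-side lambdas stand in for OpenCL kernels, and host threads merely mimic the overlap that the framework achieves with host-device transfers.

// Conceptual sketch only: illustrates skeleton nesting and overlapping
// execution of independent chunks; not Marrow's real API.
#include <functional>
#include <future>
#include <iostream>
#include <vector>

// A "skeleton" here is simply a function over a data chunk.
using Chunk    = std::vector<float>;
using Skeleton = std::function<Chunk(Chunk)>;

// Map skeleton: applies an element-wise function (stand-in for a GPU kernel).
Skeleton Map(std::function<float(float)> f) {
    return [f](Chunk in) {
        for (auto& x : in) x = f(x);
        return in;
    };
}

// Pipeline skeleton: nests two skeletons, feeding one's output to the next.
Skeleton Pipeline(Skeleton first, Skeleton second) {
    return [first, second](Chunk in) { return second(first(in)); };
}

int main() {
    // Nested composition: a pipeline of two maps (conceptually, two kernels).
    Skeleton pipe = Pipeline(Map([](float x) { return x * 2.0f; }),
                             Map([](float x) { return x + 1.0f; }));

    // Launch chunks asynchronously so that, conceptually, the "transfer" of
    // one chunk overlaps the "computation" of another.
    std::vector<Chunk> chunks = {{1, 2, 3}, {4, 5, 6}, {7, 8, 9}};
    std::vector<std::future<Chunk>> results;
    for (const auto& c : chunks)
        results.push_back(std::async(std::launch::async, pipe, c));

    for (auto& r : results) {
        for (float x : r.get()) std::cout << x << ' ';
        std::cout << '\n';
    }
}

In the framework the abstract describes, the composed stages orchestrate OpenCL kernels and the overlap concerns host-device communication; the sketch only conveys the compositional style, not the underlying mechanism.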

Citation (APA)

Marques, R., Paulino, H., Alexandre, F., & Medeiros, P. D. (2013). Algorithmic skeleton framework for the orchestration of GPU computations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8097 LNCS, pp. 874–885). https://doi.org/10.1007/978-3-642-40047-6_86
