Performance Portability Strategies for Grid C++ Expression Templates

Peter A. Boyle; M. A. Clark; Carleton Detar; Meifeng Lin; Verinder Rana; Alejandro Vaquero Avilés-Casco

Conference ProceedingsOPEN ACCESS

Performance Portability Strategies for Grid C++ Expression Templates

EPJ Web of Conferences (2018) 175

DOI: 10.1051/epjconf/201817509006

5Citations

7Readers

Abstract

One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code.

Cite

CITATION STYLE

APA

Boyle, P. A., Clark, M. A., Detar, C., Lin, M., Rana, V., & Avilés-Casco, A. V. (2018). Performance Portability Strategies for Grid C++ Expression Templates. In EPJ Web of Conferences (Vol. 175). EDP Sciences. https://doi.org/10.1051/epjconf/201817509006

Performance Portability Strategies for Grid C++ Expression Templates

Abstract

Cite

Register to see more suggestions