Performance-portable many-core plasma simulations: Porting PIConGPU to OpenPower and beyond

10Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

With the appearance of the heterogeneous platform Open- Power, many-core accelerator devices have been coupled with Power host processors for the first time. Towards utilizing their full potential, it is worth investigating performance portable algorithms that allow to choose the best-fitting hardware for each domain-specific compute task. Suiting even the high level of parallelism on modern GPGPUs, our presented approach relies heavily on abstract meta-programming techniques, which are essential to focus on fine-grained tuning rather than code porting. With this in mind, the CUDA-based open-source plasma simulation code PIConGPU is currently being abstracted to support the heterogeneous OpenPower platform using our fast porting interface cupla, which wraps the abstract parallel C++11 kernel acceleration library Alpaka. We demonstrate how PIConGPU can benefit from the tunable kernel execution strategies of the Alpaka library, achieving portability and performance with single-source kernels on conventional CPUs, Power8 CPUs and NVIDIA GPUs.

Cite

CITATION STYLE

APA

Zenker, E., Widera, R., Huebl, A., Juckeland, G., Knüpfer, A., Nagel, W. E., & Bussmann, M. (2016). Performance-portable many-core plasma simulations: Porting PIConGPU to OpenPower and beyond. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9945 LNCS, pp. 293–301). Springer Verlag. https://doi.org/10.1007/978-3-319-46079-6_21

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free