Using mixed precision algorithm for LINPACK benchmark on AMD GPU

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

LINPACK is a de facto benchmark for supercomputers. Nowadays, the CPU and GPU heterogenous cluster becomes an important trendy of supercomputers. Because of high performance of mixed precision algorithm, we had developed a mixed precision high performance LINPACK software package GHPL on NVIDIA GPU cluster. In this paper, we will introduce the recent work about porting and optimizing GHPL on AMD GPU. On AMD GPU platform, we implemented a hybrid of CPU and GPU GEMM function by ACML-GPU and GotoBLAS library. According to our results, the speedup of GHPL over HPL was 3.21. In addition, we would point out the limitations of ACML-GPU library.

Cite

CITATION STYLE

APA

Zhang, X., Zhang, Y., & Wang, L. (2013). Using mixed precision algorithm for LINPACK benchmark on AMD GPU. In Lecture Notes in Earth System Sciences (Vol. 0, pp. 555–560). Springer International Publishing. https://doi.org/10.1007/978-3-642-16405-7_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free