Using mixed precision algorithm for LINPACK benchmark on AMD GPU

Xianyi Zhang; Yunquan Zhang; Lei Wang

Book Chapter

Using mixed precision algorithm for LINPACK benchmark on AMD GPU

Springer International Publishing, (2013), 555-560

DOI: 10.1007/978-3-642-16405-7_34

0Citations

1Readers

Get full text

Abstract

LINPACK is a de facto benchmark for supercomputers. Nowadays, the CPU and GPU heterogenous cluster becomes an important trendy of supercomputers. Because of high performance of mixed precision algorithm, we had developed a mixed precision high performance LINPACK software package GHPL on NVIDIA GPU cluster. In this paper, we will introduce the recent work about porting and optimizing GHPL on AMD GPU. On AMD GPU platform, we implemented a hybrid of CPU and GPU GEMM function by ACML-GPU and GotoBLAS library. According to our results, the speedup of GHPL over HPL was 3.21. In addition, we would point out the limitations of ACML-GPU library.

Cite

CITATION STYLE

APA

Zhang, X., Zhang, Y., & Wang, L. (2013). Using mixed precision algorithm for LINPACK benchmark on AMD GPU. In Lecture Notes in Earth System Sciences (Vol. 0, pp. 555–560). Springer International Publishing. https://doi.org/10.1007/978-3-642-16405-7_34

Using mixed precision algorithm for LINPACK benchmark on AMD GPU

Abstract

Cite

Register to see more suggestions