LINPACK is the de facto benchmark for supercomputers. Heterogeneous clusters that combine CPUs and GPUs have become an important trend in supercomputing. Because mixed precision algorithms offer high performance, we previously developed GHPL, a mixed precision High Performance LINPACK software package for NVIDIA GPU clusters. In this paper, we describe our recent work on porting GHPL to AMD GPUs and optimizing it there. On the AMD GPU platform, we implemented a hybrid CPU and GPU GEMM function using the ACML-GPU and GotoBLAS libraries. In our results, GHPL achieved a speedup of 3.21 over HPL. In addition, we point out the limitations of the ACML-GPU library.
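As an illustration of the general idea behind a mixed precision linear solver, the following is a minimal sketch of iterative refinement: the matrix is factorized in low precision, and the solution is then corrected using residuals computed in double precision. This is not the GHPL code. The function names (`f32`, `lu_factor_f32`, `lu_solve_f32`, `mixed_precision_solve`) are hypothetical, single precision is emulated by rounding through the `struct` module, and none of the GPU or BLAS aspects of the paper are represented.

```python
import struct


def f32(x):
    """Round a Python float (double precision) to the nearest float32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def lu_factor_f32(a):
    """LU factorization with partial pivoting, rounding each result to float32."""
    n = len(a)
    lu = [[f32(v) for v in row] for row in a]
    perm = list(range(n))  # perm[i] = original index of the row now at position i
    for k in range(n):
        # Pick the row with the largest pivot candidate in column k.
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        if lu[p][k] == 0.0:
            raise ValueError("matrix is singular")
        lu[k], lu[p] = lu[p], lu[k]
        perm[k], perm[p] = perm[p], perm[k]
        for i in range(k + 1, n):
            lu[i][k] = f32(lu[i][k] / lu[k][k])  # multiplier, stored in L
            for j in range(k + 1, n):
                lu[i][j] = f32(lu[i][j] - f32(lu[i][k] * lu[k][j]))
    return lu, perm


def lu_solve_f32(lu, perm, b):
    """Solve A x = b from the low-precision factors (forward, then back substitution)."""
    n = len(lu)
    y = [f32(b[perm[i]]) for i in range(n)]
    for i in range(n):  # forward substitution with unit lower-triangular L
        for j in range(i):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
    for i in reversed(range(n)):  # back substitution with U
        for j in range(i + 1, n):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
        y[i] = f32(y[i] / lu[i][i])
    return y


def residual(a, x, b):
    """Compute r = b - A x in double precision."""
    return [bi - sum(aij * xj for aij, xj in zip(row, x)) for row, bi in zip(a, b)]


def mixed_precision_solve(a, b, tol=1e-12, max_iter=20):
    """Low-precision LU solve followed by double-precision iterative refinement."""
    lu, perm = lu_factor_f32(a)
    x = lu_solve_f32(lu, perm, b)
    b_norm = max(abs(v) for v in b)
    for _ in range(max_iter):
        r = residual(a, x, b)
        if max(abs(v) for v in r) <= tol * b_norm:
            break
        d = lu_solve_f32(lu, perm, r)  # correction reuses the cheap factors
        x = [xi + di for xi, di in zip(x, d)]
    return x


if __name__ == "__main__":
    A = [[4.0, 1.0, 2.0], [1.0, 5.0, 3.0], [2.0, 3.0, 6.0]]
    b = [12.0, 20.0, 26.0]  # chosen so that the exact solution is [1, 2, 3]
    print(mixed_precision_solve(A, b))
```

The expensive O(n³) factorization runs only once, in the lower precision, while each refinement step costs O(n²). This is why approaches of this kind can be attractive on hardware where single precision arithmetic is much faster than double precision. Convergence is only expected when the matrix is reasonably well conditioned.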
CITATION STYLE
Zhang, X., Zhang, Y., & Wang, L. (2013). Using mixed precision algorithm for LINPACK benchmark on AMD GPU. In Lecture Notes in Earth System Sciences (Vol. 0, pp. 555–560). Springer International Publishing. https://doi.org/10.1007/978-3-642-16405-7_34