Abstract
In hardware-aware high performance computing, block- asynchronous iteration and mixed precision iterative refinement are two techniques that may be used to leverage the computing power of SIMD accelerators like GPUs in the iterative solution of linear equation systems. Although they use a very different approach for this purpose, they share the basic idea of compensating the convergence properties of an inferior numerical algorithm by a more efficient usage of the provided computing power. In this paper, we analyze the potential of combining both techniques. Therefore, we derive a mixed precision iterative refinement algorithm using a block-asynchronous iteration as an error correction solver, and compare its performance with a pure implementation of a block-asynchronous iteration and an iterative refinement method using double precision for the error correction solver. For matrices from the University of Florida Matrix collection, we report the convergence behaviour and provide the total solver runtime using different GPU architectures. © 2012 Springer-Verlag.
Author supplied keywords
Cite
CITATION STYLE
Anzt, H., Luszczek, P., Dongarra, J., & Heuveline, V. (2012). GPU-accelerated asynchronous error correction for mixed precision iterative refinement. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7484 LNCS, pp. 908–919). https://doi.org/10.1007/978-3-642-32820-6_89
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.