In parallel computing the data distribution may have a significant impact in the application performance and accuracy. These effects can be observed using the parallel matrix-vector multiplication routine from PBLAS with different grid configurations in data distribution. Matrix-vector multiplication is an especially important operation once it is widely used in numerical simulation (e.g., iterative solvers for linear systems of equations). This paper presents a mathematical background of error propagation in elementary operations and proposes benchmarks to show how different grid configurations based on the two dimensional cyclic block distribution impacts accuracy and performance using parallel matrix-vector operations. The experimental results validate the theoretical findings. © 2011 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Rocker, B., Kolberg, M., & Heuveline, V. (2011). The impact of data distribution in accuracy and performance of parallel linear algebra subroutines. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6449 LNCS, pp. 394–407). https://doi.org/10.1007/978-3-642-19328-6_36
Mendeley helps you to discover research relevant for your work.