Efficient sparse matrix-vector multiplication on cache-based GPUs

István Z. Reguly; Mike Giles

Conference Proceedings

2012 Innovative Parallel Computing, InPar 2012 (2012)

DOI: 10.1109/InPar.2012.6339602

38 Citations

41 Readers

Abstract

Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studies have shown that it is a bandwidth-limited operation on current hardware. On cache-based architectures the main factors that influence performance are spatial locality in accessing the matrix, and temporal locality in re-using the elements of the vector. © 2012 IEEE.
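To make the two kinds of locality concrete, here is a minimal CPU-side sketch of sparse matrix-vector multiplication. It assumes compressed sparse row (CSR) storage, which the abstract does not name; the type CsrMatrix and the function spmv_csr are hypothetical names chosen for illustration and are not taken from the paper. The matrix arrays are read once, in order (spatial locality), while the vector x is read indirectly through the column indices, so any reuse of its elements depends on the cache (temporal locality). Each stored nonzero costs two floating-point operations against at least one value and one index read from memory, which is consistent with the operation being bandwidth-limited.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CSR container (assumption: the abstract names no format).
struct CsrMatrix {
    std::size_t rows = 0;
    std::size_t cols = 0;
    std::vector<std::size_t> row_ptr;  // rows + 1 offsets into col_idx/values
    std::vector<std::size_t> col_idx;  // column index of each stored nonzero
    std::vector<double> values;        // value of each stored nonzero
};

// Computes y = A * x for a CSR matrix A.
std::vector<double> spmv_csr(const CsrMatrix& a, const std::vector<double>& x) {
    assert(a.row_ptr.size() == a.rows + 1);
    assert(x.size() == a.cols);

    std::vector<double> y(a.rows, 0.0);
    for (std::size_t i = 0; i < a.rows; ++i) {
        double sum = 0.0;
        // values[k] and col_idx[k] are streamed contiguously: spatial locality.
        for (std::size_t k = a.row_ptr[i]; k < a.row_ptr[i + 1]; ++k) {
            // x is gathered through col_idx: reuse across rows is temporal
            // locality and only pays off if the element is still cached.
            sum += a.values[k] * x[a.col_idx[k]];
        }
        y[i] = sum;
    }
    return y;
}
```

On a GPU the same loops would be distributed over threads, but the access pattern stays the same: streamed matrix data and gathered vector data.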

Author supplied keywords

autotuning
cache performance
conjugate gradient method
finite element method
sparse matrix-vector multiplication

Cite

Reguly, I. Z., & Giles, M. (2012). Efficient sparse matrix-vector multiplication on cache-based GPUs. In 2012 Innovative Parallel Computing, InPar 2012. IEEE Computer Society. https://doi.org/10.1109/InPar.2012.6339602
