Efficient sparse matrix-vector multiplication on cache-based GPUs

38Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Sparse matrix-vector multiplication is an integral part of many scientific algorithms. Several studies have shown that it is a bandwidth-limited operation on current hardware. On cache-based architectures the main factors that influence performance are spatial locality in accessing the matrix, and temporal locality in re-using the elements of the vector. © 2012 IEEE.

Cite

CITATION STYLE

APA

Eguly, I. R., & Giles, M. (2012). Efficient sparse matrix-vector multiplication on cache-based GPUs. In 2012 Innovative Parallel Computing, InPar 2012. IEEE Computer Society. https://doi.org/10.1109/InPar.2012.6339602

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free