An improved sparse matrix-vector multiply based on recursive sparse blocks layout

4Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The Recursive Sparse Blocks (RSB) is a sparse matrix layout designed for coarse grained parallelism and reduced cache misses when operating with matrices, which are larger than a computer's cache. By laying out the matrix in sparse, non overlapping blocks, we allow for the shared memory parallel execution of transposed SParse Matrix-Vector multiply (SpMV), with higher efficiency than the traditional Compressed Sparse Rows (CSR) format. In this note we cover two issues. First, we propose two improvements to our original approach. Second, we look at the performance of standard and transposed shared memory parallel SpMV for unsymmetric matrices, using the proposed approach. We find that our implementation's performance is competitive with that of both the highly optimized, proprietary Intel MKL Sparse BLAS library's CSR routines, and the Compressed Sparse Blocks (CSB) research prototype. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Martone, M., Paprzycki, M., & Filippone, S. (2012). An improved sparse matrix-vector multiply based on recursive sparse blocks layout. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7116 LNCS, pp. 606–613). https://doi.org/10.1007/978-3-642-29843-1_69

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free