Parallel sparse matrix-vector multiplication using accelerators

Hiroshi Maeda; Daisuke Takahashi

Conference Proceedings

Parallel sparse matrix-vector multiplication using accelerators

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9787 3-18

DOI: 10.1007/978-3-319-42108-7_1

4Citations

2Readers

Get full text

Abstract

Sparse matrix-vector multiplication (SpMV) is an essential computational kernel for many applications such as scientific computing. Recently, the number of computing systems equipped with NVIDIA’s GPU and Intel’s Xeon Phi coprocessor based on the MIC architecture has been increasing. Therefore, the importance of effective algorithms for SpMV in these systems is increasing. To the best of our knowledge, while previous studies have reported CPU and GPU implementations of SpMV for a cluster andMIC implementations for a single node, implementations of SpMV for theMIC cluster have not yet been reported. In this paper, we implemented and evaluated parallel SpMV on a GPU cluster and a MIC cluster. As shown by the results, the implementation for MIC achieved relatively high performance in some matrices with a single process, but it could not achieve higher performance than other implementations with 64 MPI processes. Therefore, we implemented and evaluated the single SpMV kernel to improve the performance of parallel SpMV.

Author supplied keywords

Cite

CITATION STYLE

APA

Maeda, H., & Takahashi, D. (2016). Parallel sparse matrix-vector multiplication using accelerators. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9787, pp. 3–18). Springer Verlag. https://doi.org/10.1007/978-3-319-42108-7_1

Parallel sparse matrix-vector multiplication using accelerators

Abstract

Author supplied keywords

Cite

Register to see more suggestions