The present paper discusses scalable implementations of sparse matrix-vector products, which are crucial for high performance solutions of large-scale linear equations, on a cc-NUMA machine SGI Altix3700. Three storage formats for sparse matrices are evaluated, and scalability is attained by implementations considering the page allocation mechanism of the NUMA machine. Influences of the cache/memory bus architectures on the optimum choice of the storage format are examined, and scalable converters between storage formats shown to facilitate exploitation of storage formats of higher performance.
CITATION STYLE
Kotakemori, H., Hasegawa, H., Kajiyama, T., Nukada, A., Suda, R., & Nishida, A. (2008). Performance Evaluation of Parallel Sparse Matrix–Vector Products on SGI Altix3700 (pp. 153–163). https://doi.org/10.1007/978-3-540-68555-5_13
Mendeley helps you to discover research relevant for your work.