Performance prediction and ranking of SpMV kernels on GPU architectures

Abstract

Predicting the runtime of a sparse matrix-vector multiplication (SpMV) for different sparse matrix formats and thread mappings allows the most appropriate format and thread mapping to be selected dynamically for a given matrix. This paper introduces two new, generally applicable performance models for SpMV, one for linear and one for non-linear relationships, based on machine learning techniques. This approach replaces the common practice of manually developing an explicit performance model from empirical data for each new architecture or format. The two new models are compared to an existing explicit performance model on different GPUs. Results show that the two machine learning techniques outperform the explicit performance model in the quality of the performance predictions, the ranking of the alternatives, and the adaptability to other formats and architectures.
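The paper does not publish code here; the following is a minimal sketch of the general idea described in the abstract: train one linear and one non-linear regression model to predict SpMV runtime per format from matrix features, then rank the formats by predicted runtime. The feature set, the format list, the synthetic training data, and the choice of scikit-learn estimators are illustrative assumptions, not the authors' actual models.

```python
# Sketch only (not the authors' code): rank SpMV formats by predicted runtime.
import numpy as np
from scipy.sparse import random as sparse_random
from sklearn.linear_model import LinearRegression
from sklearn.ensemble import GradientBoostingRegressor

FORMATS = ["CSR", "ELL", "COO", "HYB"]  # candidate formats (illustrative)

def features(m):
    """Structural features of a sparse matrix used as model input (assumed feature set)."""
    nnz_per_row = np.diff(m.tocsr().indptr)
    return [m.shape[0], m.shape[1], m.nnz,
            nnz_per_row.mean(), nnz_per_row.std(), nnz_per_row.max()]

# Hypothetical training set: matrices plus measured SpMV runtimes per format.
rng = np.random.default_rng(0)
train_mats = [sparse_random(1000, 1000, density=d, random_state=i, format="csr")
              for i, d in enumerate(rng.uniform(0.001, 0.05, size=40))]
X = np.array([features(m) for m in train_mats])
# One runtime column per format; here synthetic placeholders, in practice benchmark measurements.
y = np.column_stack([X[:, 2] * 1e-9 * (k + 1) + rng.normal(0, 1e-6, len(X))
                     for k in range(len(FORMATS))])

# Fit one linear and one non-linear model per format.
linear = [LinearRegression().fit(X, y[:, k]) for k in range(len(FORMATS))]
nonlinear = [GradientBoostingRegressor().fit(X, y[:, k]) for k in range(len(FORMATS))]

def rank_formats(m, models):
    """Return formats ordered by predicted SpMV runtime (fastest first)."""
    x = np.array([features(m)])
    preds = {fmt: mod.predict(x)[0] for fmt, mod in zip(FORMATS, models)}
    return sorted(preds, key=preds.get)

test = sparse_random(2000, 2000, density=0.01, random_state=99, format="csr")
print("linear model ranking:    ", rank_formats(test, linear))
print("non-linear model ranking:", rank_formats(test, nonlinear))
```

In this sketch the ranking, not only the absolute runtime prediction, is what drives format selection, mirroring the paper's evaluation of both prediction quality and ranking quality.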

Cite (APA)

Lehnert, C., Berrendorf, R., Ecker, J. P., & Mannuss, F. (2016). Performance prediction and ranking of SpMV kernels on GPU architectures. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9833 LNCS, pp. 90–102). Springer Verlag. https://doi.org/10.1007/978-3-319-43659-3_7
