Performance evaluation of sparse matrix multiplication kernels on Intel Xeon Phi

Erik Saule; Kamer Kaya; Ümit V. Çatalyürek

Conference Proceedings

Performance evaluation of sparse matrix multiplication kernels on Intel Xeon Phi

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8384 LNCS(PART 1) 559-570

DOI: 10.1007/978-3-642-55224-3_52

56Citations

86Readers

Get full text

Abstract

Intel Xeon Phi is a recently released high-performance coprocessor which features 61 cores each supporting 4 hardware threads with 512-bit wide SIMD registers achieving a peak theoretical performance of 1Tflop/s in double precision. Its design differs from classical modern processors; it comes with a large number of cores, the 4-way hyperthreading capability allows many applications to saturate the massive memory bandwidth, and its large SIMD capabilities allow to reach high computation throughput. The core of many scientific applications involves the multiplication of a large, sparse matrix with a single or multiple dense vectors which are not compute-bound but memory-bound. In this paper, we investigate the performance of the Xeon Phi coprocessor for these sparse linear algebra kernels. We highlight the important hardware details and show that Xeon Phi's sparse kernel performance is very promising and even better than that of cutting-edge CPUs and GPUs. © 2014 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Saule, E., Kaya, K., & Çatalyürek, Ü. V. (2014). Performance evaluation of sparse matrix multiplication kernels on Intel Xeon Phi. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8384 LNCS, pp. 559–570). Springer Verlag. https://doi.org/10.1007/978-3-642-55224-3_52

Performance evaluation of sparse matrix multiplication kernels on Intel Xeon Phi

Abstract

Author supplied keywords

Cite

Register to see more suggestions