A parallel sparse tensor benchmark suite on CPUs and GPUs

1Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

Tensor computations present significant performance challenges that impact a wide spectrum of applications. Efforts on improving the performance of tensor computations include exploring data layout, execution scheduling, and parallelism in common tensor kernels. This work presents a benchmark suite for arbitrary-order sparse tensor kernels using state-of-the-art tensor formats: coordinate (COO) and hierarchical coordinate (HiCOO). It demonstrates a set of reference tensor kernel implementations and some observations on Intel CPUs and NVIDIA GPUs. The full paper can be referred to at http://arxiv.org/abs/2001.00660.

Cite

CITATION STYLE

APA

Li, J., Lakshminarasimhan, M., Wu, X., Li, A., Olschanowsky, C., & Barker, K. (2020). A parallel sparse tensor benchmark suite on CPUs and GPUs. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP (pp. 403–404). Association for Computing Machinery. https://doi.org/10.1145/3332466.3374513

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free