Sparse matrix-matrix multiplication (SpGEMM) is widely used in many scientific and deep learning applications. The highly irregular structure of SpGEMM limits its performance and efficiency on conventional computation platforms, and has thus motivated a large body of specialized hardware designs. Existing SpGEMM accelerators support only specific, rigid execution dataflows, such as inner-/outer-product or row-based schemes. Each dataflow is optimized only for certain sparse patterns and fails to generalize with robust performance to the widely diverse SpGEMM workloads across various domains. We propose Spada, a combination of three novel techniques that enable SpGEMM accelerators to adapt efficiently to various sparse patterns. First, we describe a window-based adaptive dataflow that can be flexibly configured into different modes to best match the data distribution and realize different reuse benefits. Second, our hardware architecture efficiently supports this dataflow template, with flexible, fast, and low-cost reconfigurability and effective load-balancing features. Finally, we use a profiling-guided approach to detect the sparse pattern and determine the optimal dataflow mode, based on the key observation that nearby matrix regions exhibit similar sparse patterns. Our evaluation demonstrates that Spada matches or exceeds the best of three state-of-the-art SpGEMM accelerators, while avoiding the performance degradation the others suffer when the data distribution and dataflow mismatch. It achieves an average 1.44× speedup across a wide range of sparse matrices and compressed neural network models.
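As a point of reference for the dataflow families the abstract contrasts, below is a minimal software sketch of row-based (Gustavson) SpGEMM over CSR arrays. The function name spgemm_row_based and the plain-list CSR layout are illustrative assumptions, not Spada's interface, and the sketch does not model the window-based adaptation or hardware reconfigurability described above.

# Minimal sketch (illustrative, not Spada's hardware dataflow): a software
# reference for row-based (Gustavson) SpGEMM over CSR arrays.
def spgemm_row_based(A_indptr, A_indices, A_data,
                     B_indptr, B_indices, B_data):
    """Compute C = A @ B, producing C in CSR form row by row.

    Each output row C[i] is a linear combination of the rows of B selected
    by the nonzeros of A[i]; the accumulator below is the structure that
    window-based schemes restrict to a column window to improve reuse.
    """
    C_indptr, C_indices, C_data = [0], [], []
    for i in range(len(A_indptr) - 1):
        acc = {}  # sparse accumulator for output row i
        for jj in range(A_indptr[i], A_indptr[i + 1]):
            k, a = A_indices[jj], A_data[jj]
            # Merge A[i, k] * B[k, :] into the accumulator.
            for kk in range(B_indptr[k], B_indptr[k + 1]):
                j = B_indices[kk]
                acc[j] = acc.get(j, 0.0) + a * B_data[kk]
        for j in sorted(acc):  # emit row i in column order
            C_indices.append(j)
            C_data.append(acc[j])
        C_indptr.append(len(C_indices))
    return C_indptr, C_indices, C_data

# Example: A = [[1, 0], [2, 3]], B = [[0, 4], [5, 0]] in CSR form.
print(spgemm_row_based([0, 1, 3], [0, 0, 1], [1.0, 2.0, 3.0],
                       [0, 1, 2], [1, 0], [4.0, 5.0]))
# -> ([0, 1, 3], [1, 0, 1], [4.0, 15.0, 8.0])

The accumulator is where the reuse trade-off shows up: row-based execution reuses rows of B efficiently only under certain nonzero distributions of A, which is why a cheap profiling pass over a few nearby regions can predict which dataflow mode will perform best.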
CITATION
Li, Z., Li, J., Chen, T., Niu, D., Zheng, H., Xie, Y., & Gao, M. (2023). Spada: Accelerating Sparse Matrix Multiplication with Adaptive Dataflow. In International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS (Vol. 2, pp. 747–761). Association for Computing Machinery. https://doi.org/10.1145/3575693.3575706