Communication bounds for convolutional neural networks

Abstract

Convolutional neural networks (CNNs) are important in a wide variety of machine learning tasks and applications, so optimizing their performance is essential. Moving words of data between levels of a memory hierarchy or between processors on a network is much more expensive than performing arithmetic, so minimizing communication is critical to optimizing performance. In this paper, we present new lower bounds on data movement for mixed-precision convolutions in both single-processor and parallel distributed-memory models, as well as algorithms that outperform current implementations such as Im2Col. We obtain performance figures using GEMMINI, a machine learning accelerator, where our tiling provides improvements between 13% and 150% over a vendor-supplied algorithm.

Cite

Chen, A., Demmel, J., Dinh, G., Haberle, M., & Holtz, O. (2022). Communication bounds for convolutional neural networks. In Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2022. Association for Computing Machinery, Inc. https://doi.org/10.1145/3539781.3539784
