Boosting the performance of CNN accelerators with dynamic fine-grained channel gating

36Citations
Citations of this article
68Readers
Mendeley users who have this article in their library.

Abstract

This paper proposes a new fine-grained dynamic pruning technique for CNN inference, named channel gating, and presents an accelerator architecture that can effectively exploit the dynamic sparsity. Intuitively, channel gating identifies the regions in the feature map of each CNN layer that contribute less to the classification result and turns off a subset of channels for computing the activations in these less important regions. Unlike static network pruning, which removes redundant weights or neurons prior to inference, channel gating exploits dynamic sparsity specific to each input at run time and in a structured manner. To maximize compute savings while minimizing accuracy loss, channel gating learns the gating thresholds together with weights automatically through training. Experimental results show that the proposed approach can significantly speed up state-of-the-art networks with a marginal accuracy loss, and enable a trade-off between performance and accuracy. This paper also shows that channel gating can be supported with a small set of extensions to a CNN accelerator, and implements a prototype for quantized ResNet-18 models. The accelerator shows an average speedup of 2.3 for ImageNet when the theoretical FLOP reduction is 2.8, indicating that the hardware can effectively exploit the dynamic sparsity exposed by channel gating.

Cite

CITATION STYLE

APA

Hua, W., Zhou, Y., De Sa, C., Zhang, Z., & Suh, G. E. (2019). Boosting the performance of CNN accelerators with dynamic fine-grained channel gating. In Proceedings of the Annual International Symposium on Microarchitecture, MICRO (pp. 139–150). IEEE Computer Society. https://doi.org/10.1145/3352460.3358283

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free