GRLC: Grid-based run-length compression for energy-efficient CNN accelerator

Yoonho Park; Yesung Kang; Sunghoon Kim; Eunji Kwon; Seokhyeong Kang

Conference Proceedings, Open Access

ACM International Conference Proceeding Series (2020)

DOI: 10.1145/3370748.3406576

Abstract

Convolutional neural networks (CNNs) require a large amount of off-chip DRAM access, which accounts for most of their energy consumption. Compressing feature maps can reduce the energy consumed by DRAM access. However, previous compression methods achieve a poor compression ratio when the feature maps are either extremely sparse or dense. To improve the compression ratio efficiently, we exploit the spatial correlation and the distribution of non-zero activations in output feature maps. In this work, we propose grid-based run-length compression (GRLC) and implement hardware for it. Compared with a previous compression method [1], GRLC reduces DRAM access by 11% and energy consumption by 5% on average across VGG-16, ExtractionNet and ResNet-18.
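
For readers unfamiliar with run-length coding of activations, the following is a minimal sketch of plain zero run-length encoding applied to a flattened feature-map row. It is not the GRLC scheme, whose grid layout and hardware details are not given in this abstract. The function names `zero_rle_encode` and `zero_rle_decode` and the `max_run` parameter (standing in for a fixed-width run field) are illustrative assumptions.

```python
from typing import List, Tuple


def zero_rle_encode(values: List[int], max_run: int = 15) -> List[Tuple[int, int]]:
    """Encode a sequence as (zero_run, value) pairs.

    Each pair means: `zero_run` zeros, followed by one literal `value`.
    `max_run` is a hypothetical cap modelling a fixed-width run field.
    """
    pairs: List[Tuple[int, int]] = []
    run = 0
    for v in values:
        if v == 0 and run < max_run:
            run += 1  # extend the current run of zeros
        else:
            # Either a non-zero value, or a zero that overflows the run field
            # and is therefore stored as the literal value of the pair.
            pairs.append((run, v))
            run = 0
    if run:
        # Trailing zeros: store run - 1 zeros plus one literal zero.
        pairs.append((run - 1, 0))
    return pairs


def zero_rle_decode(pairs: List[Tuple[int, int]]) -> List[int]:
    """Invert zero_rle_encode."""
    out: List[int] = []
    for run, v in pairs:
        out.extend([0] * run)
        out.append(v)
    return out


if __name__ == "__main__":
    row = [0, 0, 3, 0, 0, 0, 0, 5, 0, 0]
    encoded = zero_rle_encode(row)
    print(encoded)
    assert zero_rle_decode(encoded) == row
```

In this simple scheme a fully dense row needs one pair per value, so the run field is pure overhead, and a very long run of zeros is split into several pairs once it exceeds `max_run`. This may help illustrate why basic run-length coding can struggle at both extremes of sparsity, though the abstract does not state the cause for the methods it compares against.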

Cite

CITATION STYLE

APA

Park, Y., Kang, Y., Kim, S., Kwon, E., & Kang, S. (2020). GRLC: Grid-based run-length compression for energy-efficient CNN accelerator. In ACM International Conference Proceeding Series. Association for Computing Machinery. https://doi.org/10.1145/3370748.3406576

References Powered by Scopus

Deep residual learning for image recognition

ImageNet Large Scale Visual Recognition Challenge

Notes on continuous stochastic phenomena.

Cited by Powered by Scopus

An Overview of Energy-Efficient Hardware Accelerators for On-Device Deep-Neural-Network Training

Zero and Narrow-Width Value-Aware Compression for Quantized Convolutional Neural Networks

Variable Precision Multiplier for CNN Accelerators Based on Booth Algorithm
