Deep Neural Networks (DNNs) are widely applied in mobile applications that demand real-time operation and large amounts of memory. This poses a new challenge for the low-power, efficient implementation of diverse applications, such as speech recognition and image classification, on embedded edge devices. This work presents a hardware-based DNN compression approach that addresses the limited memory resources of edge devices. We propose a new entropy-based compression algorithm for encoding DNN weights, together with a real-time decoding method and an efficient dedicated hardware implementation. The proposed approach significantly reduces the required DNN weight memory (approximately 70% and 63% for AlexNet and VGG19, respectively) while allowing the decoding of one weight per clock cycle. Results show a high compression ratio compared with well-known lossless compression algorithms. The proposed hardware decoder enables the efficient implementation of large DNNs on low-power edge devices with limited memory resources.
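The abstract does not detail the specific entropy coder or the hardware decoder, so the sketch below only illustrates the general idea of entropy-coding DNN weights: it assumes a simple Huffman-style code over 8-bit quantized weights and estimates the resulting memory reduction. The helper names (`huffman_code_lengths`, `compression_ratio`) and the toy weight distribution are hypothetical and are not taken from the paper.

```python
# Hypothetical sketch: entropy (Huffman-style) coding of quantized DNN weights.
# The paper's actual encoding scheme and hardware decoder are not reproduced here;
# this only illustrates why entropy coding of weights can reduce memory.
import heapq
from collections import Counter

def huffman_code_lengths(symbol_counts):
    """Return {symbol: code_length_in_bits} for a Huffman code over symbol_counts."""
    if len(symbol_counts) == 1:                  # degenerate case: a single symbol
        return {next(iter(symbol_counts)): 1}
    # Each heap entry: (count, tie_breaker, [symbols in this subtree])
    heap = [(c, i, [s]) for i, (s, c) in enumerate(symbol_counts.items())]
    heapq.heapify(heap)
    lengths = {s: 0 for s in symbol_counts}
    tie = len(heap)
    while len(heap) > 1:
        c1, _, s1 = heapq.heappop(heap)
        c2, _, s2 = heapq.heappop(heap)
        for s in s1 + s2:                        # merging adds one bit to every leaf below
            lengths[s] += 1
        heapq.heappush(heap, (c1 + c2, tie, s1 + s2))
        tie += 1
    return lengths

def compression_ratio(weights_8bit):
    """Estimate memory saving from Huffman-coding 8-bit quantized weights."""
    counts = Counter(weights_8bit)
    lengths = huffman_code_lengths(counts)
    coded_bits = sum(counts[s] * lengths[s] for s in counts)
    raw_bits = 8 * len(weights_8bit)
    return 1.0 - coded_bits / raw_bits

if __name__ == "__main__":
    import random
    # Toy skewed distribution: quantized weights cluster around the zero point (128),
    # as is typical for trained DNN layers, which is what makes entropy coding effective.
    weights = [min(255, max(0, int(random.gauss(128, 10)))) for _ in range(100_000)]
    print(f"estimated memory reduction: {compression_ratio(weights):.1%}")
```

In a hardware realization such as the one the paper targets, the decoder must emit one weight per clock cycle, which constrains the choice of code and lookup structure; the software sketch above makes no attempt to model that constraint.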
CITATION STYLE
Malach, T., Greenberg, S., & Haiut, M. (2020). Hardware-based real-time deep neural network lossless weights compression. IEEE Access, 8, 205051–205060. https://doi.org/10.1109/ACCESS.2020.3037254