DNN feature map compression using learned representation over GF(2)

Denis Gudovskiy; Alec Hodgkinson; Luca Rigazio

Conference Proceedings (Open Access)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11132 LNCS 502-516

DOI: 10.1007/978-3-030-11018-5_41

Abstract

In this paper, we introduce a method to compress intermediate feature maps of deep neural networks (DNNs) to decrease memory storage and bandwidth requirements during inference. Unlike previous works, the proposed method converts fixed-point activations into vectors over GF(2), the smallest finite field, and then applies nonlinear dimensionality reduction (NDR) layers embedded into the DNN. Such an end-to-end learned representation finds more compact feature maps by exploiting quantization redundancies within the fixed-point activations along the channel or spatial dimensions. We apply the proposed network architectures, derived from modified SqueezeNet and MobileNetV2, to the tasks of ImageNet classification and PASCAL VOC object detection. Compared to prior approaches, our experiments show a factor-of-2 decrease in memory requirements with minor degradation in accuracy, while adding only bitwise computations.
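The first step named in the abstract, converting fixed-point activations into vectors over GF(2), can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `to_gf2` and `from_gf2`, the 8-bit unsigned quantization, and the (channels, height, width) layout are assumptions made for illustration, and the learned NDR layers are not shown. It only demonstrates that a quantized activation tensor can be expanded into bit planes along the channel dimension and recovered exactly.

```python
import numpy as np

def to_gf2(x_q: np.ndarray, bits: int = 8) -> np.ndarray:
    """Expand unsigned fixed-point activations of shape (C, H, W)
    into binary bit planes of shape (C * bits, H, W).

    Hypothetical helper: the layout and bit width are assumptions.
    """
    c, h, w = x_q.shape
    shifts = np.arange(bits, dtype=np.int64).reshape(1, bits, 1, 1)
    # Extract bit k of every activation: (x >> k) & 1, least significant bit first.
    planes = (x_q.astype(np.int64)[:, None, :, :] >> shifts) & 1
    return planes.reshape(c * bits, h, w).astype(np.uint8)

def from_gf2(b: np.ndarray, bits: int = 8) -> np.ndarray:
    """Inverse of to_gf2: pack bit planes back into integer activations."""
    cb, h, w = b.shape
    weights = (1 << np.arange(bits, dtype=np.int64)).reshape(1, bits, 1, 1)
    planes = b.reshape(cb // bits, bits, h, w).astype(np.int64)
    return (planes * weights).sum(axis=1)

# Example: a toy 2-channel, 3x3 feature map quantized to 8 bits.
x = (np.arange(18, dtype=np.uint8) * 7).reshape(2, 3, 3)
b = to_gf2(x, bits=8)          # binary tensor with 16 channels
x_rec = from_gf2(b, bits=8)    # lossless reconstruction of x
```

Over GF(2), addition corresponds to XOR and multiplication to AND, which is why operations on the expanded binary tensor can be carried out with bitwise computations only.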

Citation (APA)

Gudovskiy, D., Hodgkinson, A., & Rigazio, L. (2019). DNN feature map compression using learned representation over GF(2). In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11132 LNCS, pp. 502–516). Springer Verlag. https://doi.org/10.1007/978-3-030-11018-5_41
