Short floating-point representation for convolutional neural network inference

Abstract

Convolutional neural networks (CNNs) are widely used in computer vision tasks, and there have been many efforts to implement CNNs on ASICs or FPGAs for power-constrained environments. Instead of the fixed-point representation commonly used in prior work, this letter proposes a short floating-point representation for CNNs. The short floating-point representation is based on the standard floating-point representation, but it has a much smaller bit width and omits special cases such as Not-a-Number (NaN) and infinity. Contrary to common belief, the short floating-point representation can yield low-complexity computation logic, because the exponent allows the operands of the multiplier to be shortened. The exponent also reduces the total word length, which in turn reduces the SRAM area. Experimental results show that the short floating-point representation with an 8-bit total width incurs less than a 1-percentage-point drop in top-5 accuracy, without retraining, on very deep CNNs of up to 152 layers, and yields more than a 60% area reduction in an ASIC implementation.
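As a concrete illustration of the idea, the sketch below encodes and decodes such a short floating-point value and multiplies two encoded values using only the short significands plus an exponent addition. The abstract does not fix the bit allocation, so the 1-bit sign / 4-bit exponent / 3-bit mantissa split, the IEEE-style bias, and the flush-to-zero and saturation behavior are assumptions for illustration only, not necessarily the letter's actual format.

# Minimal Python sketch of an 8-bit short floating-point format.
# ASSUMED layout: 1-bit sign, 4-bit exponent, 3-bit mantissa, IEEE-style bias;
# no NaN or infinity codes, overflows saturate, underflows flush to zero.
# The letter's real format may differ from these assumptions.

EXP_BITS, MAN_BITS = 4, 3               # plus a 1-bit sign = 8 bits total
BIAS = 2 ** (EXP_BITS - 1) - 1          # assumed bias = 7

def encode(x: float) -> int:
    """Round a Python float to the nearest short-float code (0..255)."""
    if x == 0.0:
        return 0
    sign = 1 if x < 0 else 0
    m, e = abs(x), BIAS
    while m >= 2.0:                     # normalize significand into [1, 2)
        m, e = m / 2.0, e + 1
    while m < 1.0:
        m, e = m * 2.0, e - 1
    frac = round((m - 1.0) * (1 << MAN_BITS))
    if frac == (1 << MAN_BITS):         # rounding carried into the exponent
        frac, e = 0, e + 1
    if e <= 0:                          # underflow: flush to zero
        e, frac = 0, 0
    if e >= (1 << EXP_BITS):            # overflow: saturate (no infinity code)
        e, frac = (1 << EXP_BITS) - 1, (1 << MAN_BITS) - 1
    return (sign << (EXP_BITS + MAN_BITS)) | (e << MAN_BITS) | frac

def decode(code: int) -> float:
    """Convert an 8-bit short-float code back to a Python float."""
    sign = (code >> (EXP_BITS + MAN_BITS)) & 1
    e = (code >> MAN_BITS) & ((1 << EXP_BITS) - 1)
    frac = code & ((1 << MAN_BITS) - 1)
    if e == 0 and frac == 0:
        return 0.0
    value = (1.0 + frac / (1 << MAN_BITS)) * 2.0 ** (e - BIAS)
    return -value if sign else value

def short_mul(a: int, b: int) -> float:
    """Multiply two short-float codes. Only the short 4-bit significands feed
    the multiplier array and the exponents are simply added, which is the
    low-complexity property the abstract points out. The product is returned
    as a float, standing in for a wider accumulator."""
    mag_mask = (1 << (EXP_BITS + MAN_BITS)) - 1
    if (a & mag_mask) == 0 or (b & mag_mask) == 0:
        return 0.0                      # either operand is zero
    sign = ((a ^ b) >> (EXP_BITS + MAN_BITS)) & 1
    ea = (a >> MAN_BITS) & ((1 << EXP_BITS) - 1)
    eb = (b >> MAN_BITS) & ((1 << EXP_BITS) - 1)
    sig_a = (1 << MAN_BITS) | (a & ((1 << MAN_BITS) - 1))   # implicit leading 1
    sig_b = (1 << MAN_BITS) | (b & ((1 << MAN_BITS) - 1))
    prod = sig_a * sig_b                # small 4x4-bit multiplication
    scale = 2.0 ** (ea + eb - 2 * BIAS - 2 * MAN_BITS)
    return -prod * scale if sign else prod * scale

if __name__ == "__main__":
    for x in (0.75, -1.5, 3.1415, 100.0):
        c = encode(x)
        print(f"{x:9.4f} -> 0x{c:02x} -> {decode(c):9.4f}")
    print("0.75 * -1.5 =", short_mul(encode(0.75), encode(-1.5)))

Saturating instead of reserving NaN and infinity codes mirrors the abstract's point that dropping those special cases keeps the arithmetic logic simple; in this assumed format the demo multiply of 0.75 and -1.5 comes out exactly as -1.125.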

Citation (APA)

Kang, H. J. (2019). Short floating-point representation for convolutional neural network inference. IEICE Electronics Express, 16(2), 1–11. https://doi.org/10.1587/elex.15.20180909
