PermLSTM: A High Energy-Efficiency LSTM Accelerator Architecture

Abstract

Pruning and quantization are two commonly used approaches to accelerating LSTM (Long Short-Term Memory) models. However, traditional linear quantization usually suffers from the vanishing-gradient problem, and existing pruning methods either produce undesired irregular sparsity or incur a large indexing overhead. To alleviate the vanishing-gradient problem, this work proposes a normalized linear quantization approach, which first normalizes operands regionally and then quantizes them within a local min-max range. To avoid irregular sparsity and large indexing overhead, this work adopts permuted block diagonal mask matrices to generate the sparse model. Because the resulting sparse model is highly regular, the positions of the non-zero weights can be obtained by a simple calculation, avoiding the large indexing overhead. Based on the sparse LSTM model generated with the permuted block diagonal mask matrices, this paper also proposes PermLSTM, a high energy-efficiency accelerator that comprehensively exploits the sparsity of weights, activations, and products in the matrix-vector multiplications, resulting in a 55.1% reduction in power consumption. The accelerator has been realized on an Arria-10 FPGA running at 150 MHz and achieves 2.19×∼24.4× higher energy efficiency than other previously reported FPGA-based LSTM accelerators.
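As a rough illustration of the two model-side ideas summarized above, the following is a minimal NumPy sketch: one reading of "normalize regionally, then quantize in a local min-max range", and a permuted block-diagonal 0/1 mask whose non-zero block positions follow from a simple index calculation. The function names, block size, bit width, and modular permutation are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of regional min-max quantization and a permuted
# block-diagonal sparsity mask. All names, block sizes, and the bit
# width are illustrative assumptions, not the paper's implementation.
import numpy as np


def normalized_linear_quantize(w, block=16, bits=8):
    """Normalize each region of w to its local min-max range, then
    quantize it linearly on that local grid (fake quantization)."""
    q = np.empty_like(w)
    levels = 2 ** bits - 1
    rows, cols = w.shape
    for r in range(0, rows, block):
        for c in range(0, cols, block):
            region = w[r:r + block, c:c + block]
            lo, hi = region.min(), region.max()
            scale = (hi - lo) / levels if hi > lo else 1.0
            # Round onto the grid spanned by the local min-max range.
            q[r:r + block, c:c + block] = (
                np.round((region - lo) / scale) * scale + lo
            )
    return q


def permuted_block_diagonal_mask(rows, cols, block, perm):
    """Build a permuted block-diagonal mask: block-row i keeps only
    block-column perm[i], so non-zero positions follow from i alone."""
    mask = np.zeros((rows, cols), dtype=np.float32)
    for i, j in enumerate(perm):
        mask[i * block:(i + 1) * block, j * block:(j + 1) * block] = 1.0
    return mask


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((64, 64)).astype(np.float32)

    # A simple modular permutation: the non-zero block column of
    # block-row i is (i + shift) % n_blocks, so no index storage is needed.
    n_blocks, shift = 64 // 16, 1
    perm = [(i + shift) % n_blocks for i in range(n_blocks)]

    W_sparse = W * permuted_block_diagonal_mask(64, 64, 16, perm)
    W_q = normalized_linear_quantize(W_sparse, block=16, bits=8)
    print("kept weights:", int((W_sparse != 0).sum()), "of", W.size)
```

Because each block-row keeps exactly one dense block whose column index is a simple function of the row index, non-zero weights can be located by calculation rather than by stored indices, which is the indexing-overhead saving the abstract refers to.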

Citation (APA)

Zheng, Y., Yang, H., Jia, Y., & Huang, Z. (2021). PermLSTM: A high energy-efficiency LSTM accelerator architecture. Electronics (Switzerland), 10(8). https://doi.org/10.3390/electronics10080882
