In recent years, convolutional neural networks (CNNs) have become widely adopted for computer vision tasks. FPGAs have been adequately explored as a promising hardware accelerator for CNNs owing to their high performance, energy efficiency, and reconfigurability. However, previous FPGA methods, which are based on the conventional convolutional algorithm, are often bounded by the computational capability of FPGAs. This paper first introduces four convolution algorithms: 6-loop algorithm, general matrix-matrix multiplication (GEMM), Winograd algorithm, and fast Fourier transform (FFT) algorithm. Then, we present the implementations of these algorithms at home and abroad, and also introduce their corresponding optimization techniques.
CITATION STYLE
Lu, L., Zheng, S., Xiao, Q., Chen, D., & Liang, Y. (2019). Accelerating convolutional neural networks on FPGAs. Scientia Sinica Informationis. Science Press (China). https://doi.org/10.1360/N112018-00291
Mendeley helps you to discover research relevant for your work.