An FPGA-Based Convolutional Neural Network Coprocessor

4Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

In this paper, an FPGA-based convolutional neural network coprocessor is proposed. The coprocessor has a 1D convolutional computation unit PE in row stationary (RS) streaming mode and a 3D convolutional computation unit PE chain in pulsating array structure. The coprocessor can flexibly control the number of PE array openings according to the number of output channels of the convolutional layer. In this paper, we design a storage system with multilevel cache, and the global cache uses multiple broadcasts to distribute data to local caches and propose an image segmentation method that is compatible with the hardware architecture. The proposed coprocessor implements the convolutional and pooling layers of the VGG16 neural network model, in which the activation value, weight value, and bias value are quantized using 16-bit fixed-point quantization, with a peak computational performance of 316.0 GOP/s and an average computational performance of 62.54 GOP/s at a clock frequency of 200 MHz and a power consumption of about 9.25 W.

Cite

CITATION STYLE

APA

Qiu, C., Wang, X., Zhao, T., Li, Q., Wang, B., & Wang, H. (2021). An FPGA-Based Convolutional Neural Network Coprocessor. Wireless Communications and Mobile Computing, 2021. https://doi.org/10.1155/2021/3768724

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free