FPGA Implementation of a Deep Learning Acceleration Core Architecture for Image Target Detection

Xu Yang; Chen Zhuang; Wenquan Feng; Zhe Yang; Qiang Wang

Journal Article (Open Access)

Applied Sciences (Switzerland) (2023) 13(7)

DOI: 10.3390/app13074144

Abstract

Because Field Programmable Gate Arrays (FPGAs) are flexible and easy to deploy, a growing number of studies develop and optimize target detection algorithms based on Convolutional Neural Network (CNN) models on FPGAs. Most of these studies, however, concentrate on improving the performance of the core algorithm or on optimizing the hardware structure. Few address a unified architecture design with corresponding optimization techniques for the algorithm model, which leaves overall model performance inefficient. The essential reason is that these studies do not treat arithmetic power, speed, and resource usage consistently. To solve this problem, we propose an FPGA-based deep learning acceleration core architecture designed for target detection algorithms with CNN models. It uses multi-channel parallelization of the CNN model to improve arithmetic power, task scheduling and pipelining of intensive computation to meet the algorithm’s data bandwidth requirements, and a unified treatment of the speed and area of the orchestrated computation matrix to save hardware resources. The proposed framework achieves 14 Frames Per Second (FPS) inference performance on the TinyYolo model at 5 Giga Operations Per Second (GOPS), with a 30% higher running clock frequency, 2–4 times higher arithmetic power, and 28% higher Digital Signal Processing (DSP) resource utilization efficiency, while using less than 25% of the FPGA's resources.
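
As a rough illustration of what multi-channel parallelization of a convolution layer can mean, the sketch below tiles a convolution over output and input channels in software. It is not the authors' architecture: the function names, the tile sizes `t_out` and `t_in`, and the use of NumPy are assumptions made only for illustration. Each tile's multiply-accumulates are independent of one another, so they are the kind of work an accelerator could map onto parallel hardware units.

```python
import numpy as np


def conv2d_reference(x, w):
    """Direct convolution, stride 1, no padding.

    x: (C_in, H, W) input feature maps; w: (C_out, C_in, K, K) kernels.
    """
    c_out, _, k, _ = w.shape
    h_out, w_out = x.shape[1] - k + 1, x.shape[2] - k + 1
    y = np.zeros((c_out, h_out, w_out))
    for m in range(c_out):
        for i in range(h_out):
            for j in range(w_out):
                y[m, i, j] = np.sum(x[:, i:i + k, j:j + k] * w[m])
    return y


def conv2d_channel_tiled(x, w, t_out=4, t_in=2):
    """Same convolution, processed in tiles of t_out output and t_in input channels.

    t_out and t_in are hypothetical parallelism factors, not values from the paper.
    """
    c_out, c_in, k, _ = w.shape
    h_out, w_out = x.shape[1] - k + 1, x.shape[2] - k + 1
    y = np.zeros((c_out, h_out, w_out))
    for m0 in range(0, c_out, t_out):          # tile over output channels
        for n0 in range(0, c_in, t_in):        # tile over input channels
            w_tile = w[m0:m0 + t_out, n0:n0 + t_in]   # (<=t_out, <=t_in, K, K)
            x_tile = x[n0:n0 + t_in]                  # (<=t_in, H, W)
            for i in range(h_out):
                for j in range(w_out):
                    window = x_tile[:, i:i + k, j:j + k]
                    # All multiply-accumulates of this tile are independent;
                    # partial sums over input-channel tiles accumulate into y.
                    y[m0:m0 + t_out, i, j] += np.tensordot(
                        w_tile, window, axes=([1, 2, 3], [0, 1, 2])
                    )
    return y
```

Comparing `conv2d_channel_tiled(x, w)` with `conv2d_reference(x, w)` using `np.allclose` on random inputs is a simple check that the tiling changes only the order of computation, not the result.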

Yang, X., Zhuang, C., Feng, W., Yang, Z., & Wang, Q. (2023). FPGA Implementation of a Deep Learning Acceleration Core Architecture for Image Target Detection. Applied Sciences (Switzerland), 13(7). https://doi.org/10.3390/app13074144
