Trident-YOLO: Improving the precision and speed of mobile device object detection

36Citations
Citations of this article
38Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper introduce an efficient object detection network named Trident-You Only Look Once (YOLO), which is designed for mobile devices with limited computing power. The new architecture is improved based on YOLO v4-tiny. The authors redesign the network structure and propose a trident feature pyramid network (Trident-FPN), which can improve the precision and recall of lightweight object detection. Specifically, Trident-FPN increases the computational complexity by only a small amount of floating point operations per second (FLOPs) and obtains a multi-scale feature map of the model, which significantly lightweight object detection performance. To enlarge the receptive field of the network with the fewest FLOPs, this paper redesign the receptive field block (RFB) and spatial pyramid pooling (SPP) layer and propose tinier cross-stage partial RFBs and smaller cross-stage partial SPPs. This paper present extensive experiments, and Trident-YOLO shows strong performance compared to that of other popular models on the PASCAL VOC and MS COCO. On the MS COCO and PASCAL VOC 2007 test sets, the mean average precision (mAP) of Trident-YOLO improved by 4.5% and 5.0%, respectively. Trident-YOLO also reduce the network size by more than 54.4% compared to YOLO v4-tiny. With a 23.7% FLOP reduction, the FPS is improved by 1.9 on an Nvidia Jetson Xavier NX.

Cite

CITATION STYLE

APA

Wang, G., Ding, H., Li, B., Nie, R., & Zhao, Y. (2022). Trident-YOLO: Improving the precision and speed of mobile device object detection. IET Image Processing, 16(1), 145–157. https://doi.org/10.1049/ipr2.12340

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free