A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices

0Citations
Citations of this article
33Readers
Mendeley users who have this article in their library.

Abstract

The rapid development and wide utilization of object detection techniques have aroused requirements for both accuracy and speed of object detectors. In this work, we propose a compression-compilation co-design framework to achieve real-time YOLOv4 inference on mobile devices. We propose a novel fine-grained structured pruning, which maintain high accuracy while achieving high hardware parallelism. Our pruned YOLOv4 achieves 48.9 mAP and 17 FPS inference speed on an off-the-shelf Samsung Galaxy S20 smartphone, which is 5.5× faster than the original state-of-the-art detector YOLOv4.

Cite

CITATION STYLE

APA

Cai, Y., Yuan, G., Li, H., Niu, W., Li, Y., Tang, X., … Wang, Y. (2021). A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 18, pp. 15997–16000). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i18.17992

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free