Fully motion-aware network for video object detection

Shiyao Wang; Yucong Zhou; Junjie Yan; Zhidong Deng

Conference ProceedingsOPEN ACCESS

Fully motion-aware network for video object detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11217 LNCS 557-573

DOI: 10.1007/978-3-030-01261-8_33

36Citations

242Readers

Abstract

Video objection detection is challenging in the presence of appearance deterioration in certain video frames. One of typical solutions is to enhance per-frame features through aggregating neighboring frames. But the features of objects are usually not spatially calibrated across frames due to motion from object and camera. In this paper, we propose an end-to-end model called fully motion-aware network (MANet), which jointly calibrates the features of objects on both pixel-level and instance-level in a unified framework. The pixel-level calibration is flexible in modeling detailed motion while the instance-level calibration captures more global motion cues in order to be robust to occlusion. To our best knowledge, MANet is the first work that can jointly train the two modules and dynamically combine them according to the motion patterns. It achieves leading performance on the large-scale ImageNet VID dataset.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, S., Zhou, Y., Yan, J., & Deng, Z. (2018). Fully motion-aware network for video object detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11217 LNCS, pp. 557–573). Springer Verlag. https://doi.org/10.1007/978-3-030-01261-8_33

Fully motion-aware network for video object detection

Abstract

Author supplied keywords

Cite

Register to see more suggestions