Most multi-scale detectors suffer from small-size false positives because their low-level features have small receptive fields and weak semantic capabilities. This paper demonstrates that independent predictions from different feature layers on the same region are beneficial for reducing false positives. We propose a novel lightweight previewer block, which previews the objectness probability for the potential regression region of each prior box, using stronger features with larger receptive fields and more contextual information for better predictions. This previewer block is generic and can be easily implemented in multi-scale detectors such as SSD, RFBNet, and MS-CNN. Extensive experiments on the PASCAL VOC and KITTI pedestrian benchmarks show the superiority of the proposed method.
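As a rough illustration of why a second, independent prediction on the same region can suppress false positives, the toy sketch below combines a class confidence from a low-level layer with an objectness score previewed from a higher-level layer. The abstract does not specify how the two predictions are combined, so the product rule, the threshold, the names `Prior`, `fuse`, and `keep_detections`, and all score values are assumptions made purely for illustration, not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass
class Prior:
    """One prior box carrying two independent scores (values are hypothetical)."""
    name: str
    class_conf: float   # class confidence from the low-level detection layer
    objectness: float   # objectness previewed from a higher-level layer


def fuse(prior: Prior) -> float:
    # Assumed fusion rule (not taken from the paper): multiply the two scores,
    # so a box must be supported by both layers to keep a high score.
    return prior.class_conf * prior.objectness


def keep_detections(priors, threshold=0.5):
    # Keep only the priors whose fused score reaches the (assumed) threshold.
    return [p.name for p in priors if fuse(p) >= threshold]


priors = [
    # Both layers agree that an object is present.
    Prior("pedestrian", class_conf=0.90, objectness=0.95),
    # The low-level layer is confident, but the previewed objectness is low.
    Prior("texture_patch", class_conf=0.85, objectness=0.10),
]

kept = keep_detections(priors)
print(kept)
```

In this toy setup the second prior would pass a threshold on the low-level confidence alone, but its low previewed objectness pulls the fused score below the threshold, which is the kind of small-size false positive the previewer is meant to filter out.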
CITATION STYLE
Fu, Z., Jin, Z., Qi, G. J., Shen, C., Jiang, R., Chen, Y., & Hua, X. S. (2018). Previewer for multi-scale object detector. In MM 2018 - Proceedings of the 2018 ACM Multimedia Conference (pp. 265–273). Association for Computing Machinery, Inc. https://doi.org/10.1145/3240508.3240544