Multi-scale Spatially-Asymmetric Recalibration for Image Classification

2Citations
Citations of this article
149Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Convolution is spatially-symmetric, i.e., the visual features are independent of its position in the image, which limits its ability to utilize contextual cues for visual recognition. This paper addresses this issue by introducing a recalibration process, which refers to the surrounding region of each neuron, computes an importance value and multiplies it to the original neural response. Our approach is named multi-scale spatially-asymmetric recalibration (MS-SAR), which extracts visual cues from surrounding regions at multiple scales, and designs a weighting scheme which is asymmetric in the spatial domain. MS-SAR is implemented in an efficient way, so that only small fractions of extra parameters and computations are required. We apply MS-SAR to several popular building blocks, including the residual block and the densely-connected block, and demonstrate its superior performance in both CIFAR and ILSVRC2012 classification tasks.

Cite

CITATION STYLE

APA

Wang, Y., Xie, L., Qiao, S., Zhang, Y., Zhang, W., & Yuille, A. L. (2018). Multi-scale Spatially-Asymmetric Recalibration for Image Classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11217 LNCS, pp. 523–539). Springer Verlag. https://doi.org/10.1007/978-3-030-01261-8_31

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free