Regional Gating Neural Networks for Multi-label Image Classification

18Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

This paper proposes a novel deep learning framework for multi-label image classification, namely regional gating neural networks (RGNN). The motivation is two folds. First, global image features (including CNN based features) ignore the underlying context information among different objects in an image. Consequently, people attempt to use information from objectness regions. However, current objectness region proposal algorithms usually produce several thousand region candidates, including many classification irrelevant or even noisy regions. This leads to the second problem: how to select useful contextual regions for image classification. RGNN is an end-to-end deep learning framework that can automatically select contextual region features with specially designed gate units, which are then fused for classification. Because the gate units and the classifier are integrated in the same deep neural network pipeline, we can learn parameters of the network simultaneously. We evaluate the proposed method on PASCAL VOC 2007/2012 and MS-COCO benchmarks, and results show that RGNN is superior to existing state-of-the-art methods.

Cite

CITATION STYLE

APA

Zhao, R. W., Li, J., Chen, Y., Liu, J. M., Jiang, Y. G., & Xue, X. (2016). Regional Gating Neural Networks for Multi-label Image Classification. In British Machine Vision Conference 2016, BMVC 2016 (Vol. 2016-September, pp. 72.1-72.12). British Machine Vision Conference, BMVC. https://doi.org/10.5244/C.30.72

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free