Double Attention for Multi-Label Image Classification

16Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Multi-label image classification is an essential task in image processing. How to improve the correlation between labels by learning multi-scale features from images is a very challenging problem. We propose a Double Attention Network (DAN) to improve the correlation between image feature regions and labels, as well as between labels and labels. Firstly, the dynamic learning strategy is used to extract the multi-scale features of the image to solve the problem of inconsistent scale of objects in the image. Secondly, in order to improve the correlation between the image feature regions and the labels, we use the spatial attention module to focus on the important regions of the image to learn their salient features, while we use the channel attention module to model the correlation between the channels to improve the correlation between the labels. Finally, the output features of two attention modules are fused as one multi-label image classification model. Experiments on MS-COCO 2014 dataset, Pascal VOC 2007 dataset and NUS-WIDE dataset demonstrate that our model is significantly better than the state-of-the-art models. Besides, visualization analyses show that our model has a strong ability for image salient feature learning and label correlation capturing.

References Powered by Scopus

Deep residual learning for image recognition

174383Citations
N/AReaders
Get full text

ImageNet: A Large-Scale Hierarchical Image Database

51043Citations
N/AReaders
Get full text

Microsoft COCO: Common objects in context

28871Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Feature learning network with transformer for multi-label image classification

31Citations
N/AReaders
Get full text

Double Attention Based on Graph Attention Network for Image Multi-Label Classification

29Citations
N/AReaders
Get full text

RAWNEXT: SPEAKER VERIFICATION SYSTEM FOR VARIABLE-DURATION UTTERANCES WITH DEEP LAYER AGGREGATION AND EXTENDED DYNAMIC SCALING POLICIES

22Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhao, H., Zhou, W., Hou, X., & Zhu, H. (2020). Double Attention for Multi-Label Image Classification. IEEE Access, 8, 225539–225550. https://doi.org/10.1109/ACCESS.2020.3044446

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

100%

Readers' Discipline

Tooltip

Computer Science 3

60%

Biochemistry, Genetics and Molecular Bi... 1

20%

Engineering 1

20%

Save time finding and organizing research with Mendeley

Sign up for free