A Saliency Detection and Gram Matrix Transform-Based Convolutional Neural Network for Image Emotion Classification

6Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

Using the convolutional neural network (CNN) method for image emotion recognition is a research hotspot of deep learning. Previous studies tend to use visual features obtained from a global perspective and ignore the role of local visual features in emotional arousal. Moreover, the CNN shallow feature maps contain image content information; such maps obtained from shallow layers directly to describe low-level visual features may lead to redundancy. In order to enhance image emotion recognition performance, an improved CNN is proposed in this work. Firstly, the saliency detection algorithm is used to locate the emotional region of the image, which is served as the supplementary information to conduct emotion recognition better. Secondly, the Gram matrix transform is performed on the CNN shallow feature maps to decrease the redundancy of image content information. Finally, a new loss function is designed by using hard labels and probability labels of image emotion category to reduce the influence of image emotion subjectivity. Extensive experiments have been conducted on benchmark datasets, including FI (Flickr and Instagram), IAPSsubset, ArtPhoto, and Abstract. The experimental results show that compared with the existing approaches, our method has a good application prospect.

Cite

CITATION STYLE

APA

Deng, Z., Zhu, Q., He, P., Zhang, D., & Luo, Y. (2021). A Saliency Detection and Gram Matrix Transform-Based Convolutional Neural Network for Image Emotion Classification. Security and Communication Networks, 2021. https://doi.org/10.1155/2021/6854586

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free