SBMYv3: Improved MobYOLOv3 a BAM attention-based approach for obscene image and video detection

11Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Countless cybercrime instances have shown the need for detecting and blocking obscene material from social media sites. Deep learning methods (DLMs) outperformed in recognizing obscene content flooded on many online platforms. However, these contemporary DLMs primarily treat the recognition of obscene content as a simple task of binary classification, rather than focusing on the labelling of obscene areas. Hence, many of these methods could not pay attention to the fact that misclassification samples are so diverse. Therefore, this paper focuses on two aspects (i) developing a deep learning model that could classify and label the obscene portion, and (ii) generating a labelled obscene image dataset with a wide variety of obscene samples to minimize the risks of inaccurate recognition. We have proposed a method named S3Pooling based bottleneck attention module (BAM) embedded MobileNetV2-YOLOv3 (SBMYv3) for automatic detection of obscene content using an attention mechanism and a suitable pooling strategy. The key contributions of our article are: (i) generation of a well-labelled obscene image dataset with a variety of augmentation strategies using Pix-2-Pix GAN (ii) modifications to the backend architecture of YOLOv3 using MobileNetV2 and BAM to ensure focused and accurate feature extraction, and (iii) selection of an optimal pooling strategy, that is, S3Pooling strategy, while taking the design of the feature extractor into account. The proposed SBMYv3 model outperformed other state-of-the-art models with 99.26% testing accuracy, 99.39% recall, 99.13% precision, and 99.13% IoU values respectively.

Cite

CITATION STYLE

APA

Samal, S., Zhang, Y. D., Gadekallu, T. R., Nayak, R., & Balabantaray, B. K. (2023). SBMYv3: Improved MobYOLOv3 a BAM attention-based approach for obscene image and video detection. Expert Systems, 40(6). https://doi.org/10.1111/exsy.13230

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free