VisioDECT: a novel approach to drone detection using CBAM-integrated YOLO and GELAN-E models

4Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Unmanned aerial vehicles have revolutionized logistics, environmental monitoring, and aerial surveillance. Their widespread use has created security concerns, specifically regarding illegal spying, smuggling, and hazardous substance movement. Maintaining public safety and protecting sensitive locations requires effective drone detection and payload assessment. Our article proposes a vision-based system for real-time drone identification and classification utilizing YOLOv5, YOLOv8, and GELAN-E deep learning models, enhanced with novel attention mechanisms and interpretability techniques. By integrating the Convolutional Block Attention Module into the YOLO architecture, which is named AttnYOLO, the system enhances feature extraction and focuses on the most relevant regions in an image. This improvement in spatial and channel attention significantly boosts detection performance, particularly for small and occluded drones. Additionally, we employ Gradient-weighted Class Activation Mapping (EigenCAM) for visualizing model focus during detection, increasing the system’s transparency and interpretability. VisioDECT comprises 20,924 annotated photographs of six drone models in overcast, sunny, and evening circumstances. Under cloudy conditions, DenseNet201 achieved 100% classification accuracy, while DarkNet53 and InceptionV3 reached 99.99% and 99.9%, respectively. In evening scenarios, InceptionV3 had 100% accuracy, followed by DarkNet53 with 99.98%. Our proposed model, GELAN-E, excelled in detection and classification. In overcast settings, GELAN-E outperformed YOLOv8 with an accuracy of 0.988, a recall of 0.994, and a mAP50-95 score of 0.688. For evening conditions, GELAN-E achieved a higher mAP50-95 score of 0.642 compared to YOLOv8. These results demonstrate that the inclusion of attention mechanisms, along with visual interpretability, enhances drone detection performance, particularly in low-light and challenging environments, making this system ideal for real-time drone detection in civilian and military applications.

Cite

CITATION STYLE

APA

Sakib Bin Islam, M., Chowdhury, M. E. H., Hasan-Zia, M., Kashem, S. B. A., Majid, M. E., Ansaruddin Kunju, A. K., … Nashbat, M. (2025). VisioDECT: a novel approach to drone detection using CBAM-integrated YOLO and GELAN-E models. Neural Computing and Applications, 37(24), 20181–20204. https://doi.org/10.1007/s00521-025-11448-3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free