An end-to-end steel surface defect detection approach via Swin transformer

28Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Different from most current studies using convolutional neural network (CNN), a deep learning detection method for steel plate surface defects based on Transformer is researched. This paper presents a process, network structure and detection method for steel strip surface defect detection. A Swin Transformer was used to extract hierarchical features in the detection system. Then feature pyramid networks (FPN) was used to fuse the above features to form multi-scale feature maps, and region proposal network (RPN) was adopted to generate the generate region of interest (ROI) of defects. Finally, the ROI head was used to generate defect category information and its precise location. Here, ablation experiments were conducted to explore the impact of different backbone networks and the number of stages of Swin Transformer and FPN on target detection performance to prove the rationality and efficiency of the network. On the NEU-DET dataset, the proposed algorithm achieved 81.1% mAP, which is higher than that of classic CNN detection methods such as Faster R-CNN, SSD, Yolo v3 and RepPoints.

Cite

CITATION STYLE

APA

Tang, B., Song, Z. K., Sun, W., & Wang, X. D. (2023). An end-to-end steel surface defect detection approach via Swin transformer. IET Image Processing, 17(5), 1334–1345. https://doi.org/10.1049/ipr2.12715

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free