Object detection of prohibited items in X-ray security inspection is challenging because of serious overlap, disorderly background, and high throughput. In the past few years, a variety of deep learning algorithms have been proposed and achieved satisfactory results. However, the performance of these algorithms relies heavily on the specified datasets. Moreover, establishing a large-scale X-ray image dataset by manually collecting and labeling images is prohibitively expensive and time consuming. In this paper, we propose a text-driven framework for synthesizing X-ray security inspection images based on Generative Adversarial Networks (GAN). First, a conditional GAN is developed to generate natural images of prohibited items from class labels. Second, an improved model based on a pix-to-pix GAN is implemented to convert natural images into X-ray images. Third, another HD pix-to-pixel GAN is responsible for producing high-resolution benign background images, which are subsequently fused with the generated images of prohibited items to create X-ray inspection images. Finally, the proposed method is evaluated using SOTA object detection algorithms, such as YOLO-v5, and achieving 4.6% promotion for mAP0.5 and 15.9% promotion for mAP0.5-0.95. The experimental results demonstrate that our image synthesis framework can effectively augment the datasets of prohibited items and improve the detection performance of deep learning algorithms during X-ray security screening.
CITATION STYLE
Liu, J., & Lin, T. H. (2023). A Framework for the Synthesis of X-Ray Security Inspection Images Based on Generative Adversarial Networks. IEEE Access, 11, 63751–63760. https://doi.org/10.1109/ACCESS.2023.3288087
Mendeley helps you to discover research relevant for your work.