Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

Raz Lapid; Almog Dubin; Moshe Sipper

Journal ArticleOPEN ACCESS

Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

Mathematics (2024) 12(22)

DOI: 10.3390/math12223451

3Citations

8Readers

Get full text

Abstract

Adaptive adversarial attacks, where adversaries tailor their strategies with full knowledge of defense mechanisms, pose significant challenges to the robustness of adversarial detectors. In this paper, we introduce RADAR (Robust Adversarial Detection via Adversarial Retraining), an approach designed to fortify adversarial detectors against such adaptive attacks while preserving the classifier’s accuracy. RADAR employs adversarial training by incorporating adversarial examples—crafted to deceive both the classifier and the detector—into the training process. This dual optimization enables the detector to learn and adapt to sophisticated attack scenarios. Comprehensive experiments on CIFAR-10, SVHN, and ImageNet datasets demonstrate that RADAR substantially enhances the detector’s ability to accurately identify adaptive adversarial attacks without degrading classifier performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Lapid, R., Dubin, A., & Sipper, M. (2024). Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors. Mathematics, 12(22). https://doi.org/10.3390/math12223451

Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

Abstract

Author supplied keywords

Cite

Register to see more suggestions