Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

3Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Adaptive adversarial attacks, where adversaries tailor their strategies with full knowledge of defense mechanisms, pose significant challenges to the robustness of adversarial detectors. In this paper, we introduce RADAR (Robust Adversarial Detection via Adversarial Retraining), an approach designed to fortify adversarial detectors against such adaptive attacks while preserving the classifier’s accuracy. RADAR employs adversarial training by incorporating adversarial examples—crafted to deceive both the classifier and the detector—into the training process. This dual optimization enables the detector to learn and adapt to sophisticated attack scenarios. Comprehensive experiments on CIFAR-10, SVHN, and ImageNet datasets demonstrate that RADAR substantially enhances the detector’s ability to accurately identify adaptive adversarial attacks without degrading classifier performance.

Cite

CITATION STYLE

APA

Lapid, R., Dubin, A., & Sipper, M. (2024). Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors. Mathematics, 12(22). https://doi.org/10.3390/math12223451

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free