Aliasing and adversarial robust generalization of CNNs

Abstract

Many commonly well-performing convolutional neural network models have been shown to be susceptible to input data perturbations, indicating low model robustness. To reveal model weaknesses, adversarial attacks are specifically optimized to generate small, barely perceptible image perturbations that flip the model prediction. Robustness against attacks can be gained by using adversarial examples during training, which in most cases reduces the measurable model attackability. Unfortunately, this technique can lead to robust overfitting, which results in non-robust models. In this paper, we analyze adversarially trained, robust models in the context of a specific network operation, the downsampling layer, and provide evidence that robust models have learned to downsample more accurately and suffer significantly less from downsampling artifacts, a.k.a. aliasing, than baseline models. In the case of robust overfitting, we observe a strong increase in aliasing and propose a novel early stopping approach based on the measurement of aliasing.
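The core idea is that aliasing arises when a feature map is downsampled without first removing frequencies above the new Nyquist limit, so the amount of aliasing can be scored by comparing a naive strided downsampling against an ideal, band-limited one. The PyTorch sketch below illustrates that comparison; it is an illustrative approximation under stated assumptions, not the paper's exact measurement. The function name aliasing_measure, the FFT-crop construction, and the mean-squared-error score are choices made for this example.

import torch


def aliasing_measure(feature_map: torch.Tensor, factor: int = 2) -> torch.Tensor:
    """Score the aliasing introduced by naive strided downsampling of a
    feature map, relative to an ideal band-limited downsampling obtained by
    cropping the Fourier spectrum. Higher values mean more aliasing.

    feature_map: tensor of shape (N, C, H, W), with H and W divisible by `factor`.
    """
    n, c, h, w = feature_map.shape

    # Naive downsampling: keep every `factor`-th sample, as a strided layer
    # without a preceding low-pass filter effectively does.
    naive = feature_map[..., ::factor, ::factor]

    # Ideal downsampling: centre the spectrum, crop it to the new Nyquist
    # band, and transform back. This discards the high frequencies that
    # would otherwise fold back (alias) into the low band.
    spec = torch.fft.fftshift(torch.fft.fft2(feature_map), dim=(-2, -1))
    h2, w2 = h // factor, w // factor
    top, left = (h - h2) // 2, (w - w2) // 2
    cropped = spec[..., top:top + h2, left:left + w2]
    ideal = torch.fft.ifft2(torch.fft.ifftshift(cropped, dim=(-2, -1))).real
    ideal = ideal / (factor * factor)  # re-scale after shrinking the spectrum

    # Aliasing score: mean squared deviation between the two downsamplings.
    return (naive - ideal).pow(2).mean()

In the spirit of the aliasing-based early stopping the paper proposes, one could log such a score on a fixed batch of feature maps after each epoch of adversarial training and stop once it begins to climb sharply; the authors' actual measurement and stopping criterion differ in detail.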

Citation (APA)

Grabinski, J., Keuper, J., & Keuper, M. (2022). Aliasing and adversarial robust generalization of CNNs. Machine Learning, 111(11), 3925–3951. https://doi.org/10.1007/s10994-022-06222-8
