Abstract
Learning-based classifiers are susceptible to adversarial examples. Existing defence methods are mostly designed for individual classifiers. Recent studies have shown that adversarial robustness can be increased by promoting diversity across an ensemble of models. In this paper, we propose an adversarial defence that encourages ensemble diversity in learning high-level feature representations, together with gradient dispersion, during simultaneous training of deep ensemble networks. We perform extensive evaluations under white-box and black-box attacks, including transferred examples and adaptive attacks. Our approach achieves a significant gain of up to 52% in adversarial robustness compared with the baseline and the state-of-the-art method on image benchmarks with complex data scenes. The proposed approach complements the defence paradigm of adversarial training and can further boost its performance. The source code is available at https://github.com/ALIS-Lab/AAAI2021-PDD.
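The abstract does not give the form of the gradient dispersion term, so the following is only a rough sketch of the general idea and not the paper's loss. It assumes that dispersion can be approximated by penalising the mean pairwise cosine similarity between the input gradients of the ensemble members. The function names and the toy gradient vectors are hypothetical.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def gradient_alignment_penalty(member_grads):
    """Mean pairwise cosine similarity of the members' input gradients.

    Assumption for illustration: a lower value means the members' gradients
    point in more dispersed directions, so a perturbation crafted against one
    member is less likely to transfer to the others.
    """
    pairs = list(combinations(member_grads, 2))
    if not pairs:
        return 0.0
    return sum(cosine_similarity(u, v) for u, v in pairs) / len(pairs)


# Toy input gradients for a three-member ensemble (hypothetical values).
aligned = [[1.0, 0.0], [2.0, 0.0], [3.0, 0.0]]
dispersed = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]

aligned_penalty = gradient_alignment_penalty(aligned)
dispersed_penalty = gradient_alignment_penalty(dispersed)
```

In a simultaneous training setup, a term of this kind would typically be added, with a weighting coefficient, to the sum of the members' classification losses. How the paper actually combines its terms is documented in the linked repository, not here.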
Cite
Huang, B., Ke, Z., Wang, Y., Wang, W., Shen, L., & Liu, F. (2021). Adversarial Defence by Diversified Simultaneous Training of Deep Ensembles. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 9A, pp. 7823–7831). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i9.16955