Recent developments establish the vulnerability of deep reinforcement learning to policy manipulation attacks. In this work, we propose a technique for mitigating such attacks based on the addition of noise to the parameter space of deep reinforcement learners during training. We experimentally verify the effect of parameter-space noise in reducing the transferability of adversarial examples, and demonstrate the promising performance of this technique in mitigating the impact of whitebox and blackbox attacks at both test and training time.
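The core idea, adding zero-mean Gaussian noise directly to the network's parameters rather than to observations or actions, can be sketched as follows. This is a minimal illustrative example, not the paper's implementation: the tiny linear Q-network, the `sigma` value, and the one-hot state encoding are all assumptions made for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

class NoisyQNetwork:
    """Tiny linear Q-network with parameter-space noise (illustrative sketch)."""

    def __init__(self, n_states, n_actions, sigma=0.1):
        # Parameter matrix: one row per state feature, one column per action.
        self.weights = rng.normal(0.0, 0.1, size=(n_states, n_actions))
        self.sigma = sigma  # assumed std-dev of the Gaussian parameter noise

    def q_values(self, state_onehot, perturb=True):
        w = self.weights
        if perturb:
            # Perturb the parameters themselves with zero-mean Gaussian noise;
            # the perturbation is resampled on each call.
            w = w + rng.normal(0.0, self.sigma, size=w.shape)
        return state_onehot @ w

    def act(self, state_onehot, perturb=True):
        # Greedy action under the (possibly perturbed) parameters.
        return int(np.argmax(self.q_values(state_onehot, perturb)))

net = NoisyQNetwork(n_states=4, n_actions=2)
state = np.eye(4)[0]  # one-hot encoding of state 0
action = net.act(state)  # exploration via parameter-space noise
```

During training, actions are selected with the perturbed parameters, while learning updates are applied to the unperturbed weights; the resulting state-dependent exploration is what the paper leverages to reduce adversarial transferability.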
Behzadan, V., & Munir, A. (2018). Mitigation of policy manipulation attacks on deep Q-networks with parameter-space noise. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11094 LNCS, pp. 406–417). Springer Verlag. https://doi.org/10.1007/978-3-319-99229-7_34