Audio-domain position-independent backdoor attack via unnoticeable triggers

Cong Shi; Tianfang Zhang; Zhuohang Li; Huy Phan; Tianming Zhao; Yan Wang; Jian Liu; Bo Yuan; Yingying Chen

Conference ProceedingsOPEN ACCESS

Audio-domain position-independent backdoor attack via unnoticeable triggers

Proceedings of the Annual International Conference on Mobile Computing and Networking, MOBICOM (2022) 583-595

DOI: 10.1145/3495243.3560531

29Citations

16Readers

Get full text

Abstract

Deep learning models have become key enablers of voice user interfaces. With the growing trend of adopting outsourced training of these models, backdoor attacks, stealthy yet effective training-phase attacks, have gained increasing attention. They inject hidden trigger patterns through training set poisoning and overwrite the model's predictions in the inference phase. Research in backdoor attacks has been focusing on image classification tasks, while there have been few studies in the audio domain. In this work, we explore the severity of audio-domain backdoor attacks and demonstrate their feasibility under practical scenarios of voice user interfaces, where an adversary injects (plays) an unnoticeable audio trigger into live speech to launch the attack. To realize such attacks, we consider jointly optimizing the audio trigger and the target model in the training phase, deriving a position-independent, unnoticeable, and robust audio trigger. We design new data poisoning techniques and penalty-based algorithms that inject the trigger into randomly generated temporal positions in the audio input during training, rendering the trigger resilient to any temporal position variations. We further design an environmental sound mimicking technique to make the trigger resemble unnoticeable situational sounds and simulate played over-The-Air distortions to improve the trigger's robustness during the joint optimization process. Extensive experiments on two important applications (i.e., speech command recognition and speaker recognition) demonstrate that our attack can achieve an average success rate of over 99% under both digital and physical attack settings.

Author supplied keywords

Cite

CITATION STYLE

APA

Shi, C., Zhang, T., Li, Z., Phan, H., Zhao, T., Wang, Y., … Chen, Y. (2022). Audio-domain position-independent backdoor attack via unnoticeable triggers. In Proceedings of the Annual International Conference on Mobile Computing and Networking, MOBICOM (pp. 583–595). Association for Computing Machinery. https://doi.org/10.1145/3495243.3560531

Audio-domain position-independent backdoor attack via unnoticeable triggers

Abstract

Author supplied keywords

Cite

Register to see more suggestions