Generating Textual Adversaries with Minimal Perturbation

Abstract

Many word-level adversarial attack approaches for textual data have been proposed in recent studies. However, because the search space formed by combinations of candidate words is massive, existing approaches struggle to preserve the semantics of a text when crafting its adversarial counterpart. In this paper, we develop a novel attack strategy that finds adversarial texts with high similarity to the original texts while introducing minimal perturbation. The rationale is that adversarial texts with small perturbations are expected to better preserve the semantic meaning of the original texts. Experiments show that, compared with state-of-the-art attack approaches, our approach achieves higher success rates and lower perturbation rates on four benchmark datasets.
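
The perturbation rate mentioned in the abstract is commonly understood as the fraction of words changed between the original and the adversarial text. The sketch below illustrates that reading only; it is not the authors' implementation. The function name `perturbation_rate`, the whitespace tokenization, and the equal-length (substitution-only) assumption are all hypothetical choices made for this example.

```python
def perturbation_rate(original: str, adversarial: str) -> float:
    """Fraction of word positions that differ between two texts.

    Assumption: word-level substitution only, so both texts split
    into the same number of whitespace-separated tokens.
    """
    orig_tokens = original.split()
    adv_tokens = adversarial.split()
    if len(orig_tokens) != len(adv_tokens):
        raise ValueError("texts must have the same number of words")
    if not orig_tokens:
        return 0.0
    # Count positions where the adversarial word differs from the original.
    changed = sum(o != a for o, a in zip(orig_tokens, adv_tokens))
    return changed / len(orig_tokens)


# One of four words is substituted.
rate = perturbation_rate("the movie was great", "the film was great")
```

Under this definition, a lower rate means fewer substituted words, which is the sense in which the abstract ties small perturbation to better semantic preservation.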

Cite

Zhao, X., Xu, D., Zhang, L., & Yuan, S. (2022). Generating Textual Adversaries with Minimal Perturbation. In Findings of the Association for Computational Linguistics: EMNLP 2022 (pp. 4628–4635). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-emnlp.57
