Training on only perfect Standard English corpora predisposes pre-trained neural networks to discriminate against minorities from nonstandard linguistic backgrounds (e.g., African American Vernacular English, Colloquial Singapore English, etc.). We perturb the inflectional morphology of words to craft plausible and semantically similar adversarial examples that expose these biases in popular NLP models, e.g., BERT and Transformer, and show that adversarially fine-tuning them for a single epoch significantly improves robustness without sacrificing performance on clean data.
CITATION STYLE
Tan, S., Joty, S., Kan, M. Y., & Socher, R. (2020). It’s morphin’ time! combating linguistic discrimination with inflectional perturbations. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 2920–2935). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-main.263
Mendeley helps you to discover research relevant for your work.