Existing bias mitigation methods require social-group-specific word pairs (e.g., “man” - “woman”) for each social attribute (e.g., gender), restricting the bias mitigation to only one specified social attribute. Further, this constraint renders such methods impractical and costly for mitigating bias in understudied and/or unmarked social groups. We propose that the Stereotype Content Model (SCM) - a theoretical framework developed in social psychology for understanding the content of stereotyping - can help debiasing efforts to become social-group-agnostic by capturing the underlying connection between bias and stereotypes. SCM proposes that the content of stereotypes map to two psychological dimensions of warmth and competence. Using only pairs of terms for these two dimensions (e.g., warmth: “genuine” - “fake”; competence: “smart” - “stupid”), we perform debiasing with established methods on both pretrained word embeddings and large language models. We demonstrate that our social-group-agnostic, SCM-based debiasing technique performs comparably to group-specific debiasing on multiple bias benchmarks, but has theoretical and practical advantages over existing approaches.
CITATION STYLE
Omrani, A., Ziabari, A. S., Yu, C., Golazizian, P., Kennedy, B., Atari, M., … Dehghani, M. (2023). Social-Group-Agnostic Bias Mitigation via the Stereotype Content Model. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 4123–4139). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.227
Mendeley helps you to discover research relevant for your work.