Bias in word embeddings


Abstract

Word embeddings are a widely used set of natural language processing techniques that map words to vectors of real numbers. These vectors are used to improve the quality of generative and predictive models. Recent studies demonstrate that word embeddings contain and amplify biases present in data, such as stereotypes and prejudice. In this study, we provide a complete overview of bias in word embeddings. We develop a new technique for bias detection for gendered languages and use it to compare bias in embeddings trained on Wikipedia and on political social media data. We investigate bias diffusion and prove that existing biases are transferred to further machine learning models. We test two techniques for bias mitigation and show that the generally proposed methodology for debiasing models at the embeddings level is insufficient. Finally, we employ biased word embeddings and illustrate that they can be used for the detection of similar biases in new data. Given that word embeddings are widely used by commercial companies, we discuss the challenges and required actions towards fair algorithmic implementations and applications.
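The abstract describes detecting bias by examining how word vectors relate to one another. As a minimal sketch of the general idea (not the authors' method), one common approach measures a word's projection onto a "gender direction" built from a gendered word pair. All vectors and values below are invented toy data for illustration; real embeddings would come from a model such as word2vec or GloVe.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

# Toy 4-dimensional embeddings (hypothetical values chosen to
# exhibit a stereotypical pattern for demonstration only).
emb = {
    "he":       [0.9,  0.1, 0.0, 0.2],
    "she":      [-0.9, 0.1, 0.0, 0.2],
    "engineer": [0.5,  0.6, 0.1, 0.1],
    "nurse":    [-0.5, 0.6, 0.1, 0.1],
}

# A simple gender direction: the difference of a gendered word pair.
gender_dir = [a - b for a, b in zip(emb["he"], emb["she"])]

def gender_bias(word):
    """Signed association with the gender direction:
    positive -> closer to 'he', negative -> closer to 'she'."""
    return cosine(emb[word], gender_dir)

for w in ("engineer", "nurse"):
    print(w, round(gender_bias(w), 3))
```

In these toy vectors, "engineer" projects toward "he" and "nurse" toward "she"; a nonzero projection for an occupation word is the kind of signal bias-detection methods quantify at scale.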

Citation (APA)

Papakyriakopoulos, O., Hegelich, S., Serrano, J. C. M., & Marco, F. (2020). Bias in word embeddings. In FAT* 2020 - Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (pp. 446–457). Association for Computing Machinery, Inc. https://doi.org/10.1145/3351095.3372843
