Joint Multiclass Debiasing of Word Embeddings

Radomir Popović; Florian Lemmerich; Markus Strohmaier

Conference Proceedings

Joint Multiclass Debiasing of Word Embeddings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12117 LNAI 79-89

DOI: 10.1007/978-3-030-59491-6_8

4Citations

9Readers

Get full text

Abstract

Bias in Word Embeddings has been a subject of recent interest, along with efforts for its reduction. Current approaches show promising progress towards debiasing single bias dimensions such as gender or race. In this paper, we present a joint multiclass debiasing approach that is capable of debiasing multiple bias dimensions simultaneously. In that direction, we present two approaches, HardWEAT and SoftWEAT, that aim to reduce biases by minimizing the scores of the Word Embeddings Association Test (WEAT). We demonstrate the viability of our methods by debiasing Word Embeddings on three classes of biases (religion, gender and race) in three different publicly available word embeddings and show that our concepts can both reduce or even completely eliminate bias, while maintaining meaningful relationships between vectors in word embeddings. Our work strengthens the foundation for more unbiased neural representations of textual data.

Author supplied keywords

Cite

CITATION STYLE

APA

Popović, R., Lemmerich, F., & Strohmaier, M. (2020). Joint Multiclass Debiasing of Word Embeddings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12117 LNAI, pp. 79–89). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-59491-6_8

Joint Multiclass Debiasing of Word Embeddings

Abstract

Author supplied keywords

Cite

Register to see more suggestions