Contrastive representation learning has gained much attention due to its superior performance in learning representations from both image and sequential data. However, the learned representations could potentially lead to performance disparities in downstream tasks, such as increased silencing of underrepresented groups in toxicity comment classification. In light of this challenge, in this work, we study learning fair representations that satisfy a notion of fairness known as equalized odds for text classification via contrastive learning. Specifically, we first theoretically analyze the connections between learning representations with a fairness constraint and conditional supervised contrastive objectives, and then propose to use conditional supervised contrastive objectives to learn fair representations for text classification. We conduct experiments on two text datasets to demonstrate the effectiveness of our approaches in balancing the trade-offs between task performance and bias mitigation among existing baselines for text classification. Furthermore, we also show that the proposed methods are stable in different hyperparameter settings.
CITATION STYLE
Chi, J., Shand, W., Yu, Y., Chang, K. W., Zhao, H., & Tian, Y. (2022). Conditional Supervised Contrastive Learning for Fair Text Classification. In Findings of the Association for Computational Linguistics: EMNLP 2022 (pp. 2736–2756). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-emnlp.199
Mendeley helps you to discover research relevant for your work.