A Comparative Study on the Impact of Model Compression Techniques on Fairness in Language Models

20Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Compression techniques for deep learning have become increasingly popular, particularly in settings where latency and memory constraints are imposed. Several methods, such as pruning, distillation, and quantization, have been adopted for compressing models, each providing distinct advantages. However, existing literature demonstrates that compressing deep learning models could affect their fairness. Our analysis involves a comprehensive evaluation of pruned, distilled, and quantized language models, which we benchmark across a range of intrinsic and extrinsic metrics for measuring bias in text classification. We also investigate the impact of using multilingual models and evaluation measures. Our findings highlight the significance of considering both the pre-trained model and the chosen compression strategy in developing equitable language technologies. The results also indicate that compression strategies can have an adverse effect on fairness measures.

Cite

CITATION STYLE

APA

Ramesh, K., Pandit, S., Chavan, A., & Sitaram, S. (2023). A Comparative Study on the Impact of Model Compression Techniques on Fairness in Language Models. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 15762–15782). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.878

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free