FedID: Federated Interactive Distillation for Large-Scale Pretraining Language Models

Abstract

The growing concerns and regulations surrounding the protection of user data privacy have necessitated decentralized training paradigms. To this end, federated learning (FL) is widely studied in user-related natural language processing (NLP). However, it suffers from several critical limitations, including extensive communication overhead, an inability to handle heterogeneity, and vulnerability to white-box inference attacks. Federated distillation (FD) has been proposed to alleviate these limitations, but its performance is degraded by confirmation bias. To tackle this issue, we propose Federated Interactive Distillation (FedID), which utilizes a small amount of labeled data retained by the server to further rectify the local models during knowledge transfer. Additionally, based on the GLUE benchmark, we develop a benchmarking framework across multiple tasks with diverse data distributions to contribute to the research of FD in the NLP community. Experiments show that our proposed FedID framework achieves the best results in homogeneous and heterogeneous federated scenarios. The code for this paper is available at: https://github.com/maxinge8698/FedID.
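To make the idea concrete, below is a minimal, single-process PyTorch sketch of a federated-distillation round in which a small labeled set held by the server is used to rectify each local model after distillation from the ensemble of client predictions. It illustrates the general mechanism described in the abstract only; the function names, the loaders (`public_loader`, `server_labeled_loader`), and the exact aggregation and rectification schedule are assumptions for illustration and are not taken from the official FedID implementation linked above.

```python
# Illustrative sketch of federated distillation with server-side rectification.
# All names (client_models, public_loader, server_labeled_loader) are hypothetical.
import torch
import torch.nn.functional as F

def federated_distillation_round(client_models, optimizers,
                                 public_loader, server_labeled_loader,
                                 temperature=2.0):
    """One simulated communication round:
    1) clients exchange predictions on a shared unlabeled transfer set,
    2) each client distills from the ensemble (average) teacher,
    3) the server's small labeled set further corrects each local model,
       counteracting confirmation bias."""
    # --- Distillation on the public (unlabeled) transfer set ---
    for x_pub in public_loader:  # assumed to yield batches of inputs
        with torch.no_grad():
            logits = [model(x_pub) for model in client_models]
            teacher = torch.stack(logits).mean(dim=0)  # ensemble teacher

        for model, opt in zip(client_models, optimizers):
            student = model(x_pub)
            kd_loss = F.kl_div(
                F.log_softmax(student / temperature, dim=-1),
                F.softmax(teacher / temperature, dim=-1),
                reduction="batchmean",
            ) * temperature ** 2
            opt.zero_grad()
            kd_loss.backward()
            opt.step()

    # --- Rectification with the server-held labeled data ---
    for x_lab, y_lab in server_labeled_loader:
        for model, opt in zip(client_models, optimizers):
            ce_loss = F.cross_entropy(model(x_lab), y_lab)
            opt.zero_grad()
            ce_loss.backward()
            opt.step()
```

In this simplified view, only predictions on the transfer set and the small labeled set are shared, so model parameters never leave the clients; the rectification step is what distinguishes the interactive variant from plain federated distillation in this sketch.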

Citation (APA)

Ma, X., Liu, J., Wang, J., & Zhang, X. (2023). FedID: Federated Interactive Distillation for Large-Scale Pretraining Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) (pp. 8566–8577). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-main.529
