Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation

Shuhao Gu; Yang Feng; Wanying Xie

Conference Proceedings

Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation

NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (2021) 3942-3952

DOI: 10.18653/v1/2021.naacl-main.308

19Citations

82Readers

Get full text

Abstract

Domain Adaptation is widely used in practical applications of neural machine translation, which aims to achieve good performance on both general domain and in-domain data. However, the existing methods for domain adaptation usually suffer from catastrophic forgetting, large domain divergence, and model explosion. To address these three problems, we propose a method of “divide and conquer” which is based on the importance of neurons or parameters for the translation model. In this method, we first prune the model and only keep the important neurons or parameters, making them responsible for both general-domain and in-domain translation. Then we further train the pruned model supervised by the original whole model with knowledge distillation. Last we expand the model to the original size and fine-tune the added parameters for the in-domain translation. We conducted experiments on different language pairs and domains and the results show that our method can achieve significant improvements compared with several strong baselines.

Cite

CITATION STYLE

APA

Gu, S., Feng, Y., & Xie, W. (2021). Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 3942–3952). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.308

Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation

Abstract

Cite

Register to see more suggestions