Transfer learning has been shown to be an effective technique for enhancing the performance of low-resource neural machine translation (NMT). This is typically achieved either by fine-tuning a child model from a pretrained parent model or by utilizing the output of the parent model during the training of the child model. However, these methods do not make use of the parent knowledge during child inference, which may limit the translation performance. In this paper, we propose a k-Nearest-Neighbor Transfer Learning (kNN-TL) approach for low-resource NMT, which leverages the parent knowledge throughout the entire development process of the child model. Our approach includes a parent-child representation alignment method, which ensures consistency in the output representations between the two models, and a child-aware datastore construction method that improves inference efficiency by selectively distilling the parent datastore based on relevance to the child model. Experimental results on four low-resource translation tasks show that kNN-TL outperforms strong baselines. Extensive analyses further demonstrate the effectiveness of our approach. Code and scripts are freely available at https://github.com/NLP2CT/kNN-TL.
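To make the retrieval-based idea behind the abstract concrete, the following is a minimal sketch in the spirit of kNN-MT, on which kNN-TL builds: a datastore maps decoder hidden states (keys) to target tokens (values), the k nearest entries to the current decoding state are turned into a retrieval distribution, and that distribution is interpolated with the NMT model's own prediction. The function names (`knn_probs`, `interpolate`, `build_child_aware_datastore`), the distance-radius pruning heuristic, and all hyperparameters are illustrative assumptions, not the authors' implementation; the official code is at the repository linked above.

```python
import numpy as np

def knn_probs(query, keys, values, vocab_size, k=8, temperature=10.0):
    """Turn the k nearest datastore entries into a distribution over the vocabulary."""
    # Squared L2 distance from the current decoder hidden state (query) to every stored key.
    dists = np.sum((keys - query) ** 2, axis=1)
    nearest = np.argsort(dists)[:k]
    # Softmax over negative distances: closer neighbors receive more probability mass.
    weights = np.exp(-dists[nearest] / temperature)
    weights /= weights.sum()
    probs = np.zeros(vocab_size)
    for idx, w in zip(nearest, weights):
        probs[values[idx]] += w  # values[idx] is the target token stored with that key
    return probs

def interpolate(model_probs, retrieved_probs, lam=0.3):
    """kNN-MT style interpolation of the model and retrieval distributions."""
    return lam * retrieved_probs + (1.0 - lam) * model_probs

def build_child_aware_datastore(parent_keys, parent_values, child_queries, radius):
    """Keep only parent entries close to some child-side query (an illustrative
    stand-in for the paper's child-aware selective distillation)."""
    keep = []
    for i, key in enumerate(parent_keys):
        dists = np.sum((child_queries - key) ** 2, axis=1)
        if dists.min() <= radius:
            keep.append(i)
    return parent_keys[keep], parent_values[keep]
```

In this sketch, the retrieval distribution is only useful if the child model's decoder states live in the same representation space as the parent-built datastore keys, which is the role the paper assigns to parent-child representation alignment; the pruning step mirrors the motivation for child-aware datastore construction, namely that a smaller, child-relevant datastore makes nearest-neighbor search at inference time cheaper.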
CITATION STYLE
Liu, S., Liu, X., Wong, D. F., Li, Z., Jiao, W., Chao, L. S., & Zhang, M. (2023). kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 1878–1891). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.105