Multi-agent mutual learning at sentence-level and token-level for neural machine translation

7Citations
Citations of this article
68Readers
Mendeley users who have this article in their library.

Abstract

Mutual learning, where multiple agents learn collaboratively and teach one another, has been shown to be an effective way to distill knowledge for image classification tasks. In this paper, we extend mutual learning to the machine translation task and operate at both the sentence-level and the token-level. Firstly, we co-train multiple agents by using the same parallel corpora. After convergence, each agent selects and learns its poorly predicted tokens from other agents. The poorly predicted tokens are determined by the acceptance-rejection sampling algorithm. Our experiments show that sequential mutual learning at the sentence-level and the token-level improves the results cumulatively. Absolute improvements compared to strong baselines are obtained on various translation tasks. On the IWSLT’14 German-English task, we get a new state-of-the-art BLEU score of 37.0. We also report a competitive result, 29.9 BLEU score, on the WMT’14 English-German task.

Cite

CITATION STYLE

APA

Liao, B., Gao, Y., & Ney, H. (2020). Multi-agent mutual learning at sentence-level and token-level for neural machine translation. In Findings of the Association for Computational Linguistics Findings of ACL: EMNLP 2020 (pp. 1715–1724). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.findings-emnlp.155

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free