Imitation learning for non-autoregressive neural machine translation


Abstract

Non-autoregressive translation (NAT) models have achieved impressive inference speedups. A potential issue with existing NAT algorithms, however, is that decoding is conducted in parallel, without directly considering previous context. In this paper, we propose an imitation learning framework for non-autoregressive machine translation that retains the fast translation speed while achieving translation performance comparable to its autoregressive counterpart. We conduct experiments on the IWSLT16, WMT14, and WMT16 datasets. Our proposed model achieves a significant speedup over autoregressive models while keeping translation quality comparable. By sampling sentence length in parallel at inference time, we achieve 31.85 BLEU on WMT16 Ro→En and 30.68 BLEU on IWSLT16 En→De.
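The abstract mentions sampling sentence length in parallel at inference time. The sketch below illustrates that general idea only: a small beam of candidate target lengths is decoded independently (one parallel NAT pass each) and the best-scoring hypothesis is kept. The functions nat_decode and score_candidate are hypothetical stand-ins, not the paper's actual model or reranker.

# Minimal, runnable sketch of length-parallel decoding for a NAT model.
# NOTE: nat_decode and score_candidate are stand-in stubs (assumptions),
# not the authors' implementation.

from typing import Callable, List, Sequence, Tuple


def length_parallel_decode(
    src_tokens: Sequence[str],
    nat_decode: Callable[[Sequence[str], int], List[str]],
    score_candidate: Callable[[Sequence[str], List[str]], float],
    length_beam: int = 4,
) -> List[str]:
    """Decode one source sentence under several candidate target lengths
    and return the highest-scoring hypothesis."""
    # Center candidate lengths around the source length (a common heuristic;
    # the actual length predictor is model-specific).
    base_len = len(src_tokens)
    candidate_lengths = [
        max(1, base_len + offset)
        for offset in range(-(length_beam // 2), length_beam - length_beam // 2)
    ]

    # Each length is decoded independently, so this loop is trivially
    # parallelizable (batched on a GPU in practice).
    hypotheses: List[Tuple[float, List[str]]] = []
    for tgt_len in candidate_lengths:
        hyp = nat_decode(src_tokens, tgt_len)
        hypotheses.append((score_candidate(src_tokens, hyp), hyp))

    # Keep the best-scoring hypothesis (e.g., by model log-probability).
    return max(hypotheses, key=lambda pair: pair[0])[1]


if __name__ == "__main__":
    # Toy stand-ins so the sketch runs end to end.
    def toy_nat_decode(src, tgt_len):
        return [f"tok{i}" for i in range(tgt_len)]

    def toy_score(src, hyp):
        # Prefer hypotheses whose length matches the source length.
        return -abs(len(hyp) - len(src))

    print(length_parallel_decode("a toy source sentence".split(),
                                 toy_nat_decode, toy_score))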

Citation (APA)

Wei, B., Wang, M., Zhou, H., Lin, J., & Sun, X. (2020). Imitation learning for non-autoregressive neural machine translation. In ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 1304–1312). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p19-1125
