Effective Use of Target-side Context for Neural Machine Translation

Hideya Mino; Hitoshi Ito; Isao Goto; Ichiro Yamada; Takenobu Tokunaga

Conference ProceedingsOPEN ACCESS

Effective Use of Target-side Context for Neural Machine Translation

COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (2020) 4483-4494

DOI: 10.5715/jnlp.28.731

1Citations

69Readers

Abstract

Through the progress made in a sentence-level neural machine translation (NMT), a context-aware NMT has been rapidly developed to exploit previous sentences as context. Recent work in the context-aware NMT incorporates source- or target-side contexts. In contrast to the source-side context, the target-side context causes a gap between training that utilizes a ground truth sentence and inference using a machine-translated sentence as context. This gap leads to translation quality deteriorating because the translation model is trained with only the ground truth data that cannot be used in the inference. In this paper, we propose sampling both the ground truth and the machine-translated previous sentences of the target-side for the context-aware NMT. The proposed method can make the translation model robust against mistakes and biases made at the inference. Models using our proposed approach show improvements over models using the previous approaches in English ↔ Japanese and English ↔ German translation tasks.

Cite

CITATION STYLE

APA

Mino, H., Ito, H., Goto, I., Yamada, I., & Tokunaga, T. (2020). Effective Use of Target-side Context for Neural Machine Translation. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 4483–4494). Association for Computational Linguistics (ACL). https://doi.org/10.5715/jnlp.28.731

Effective Use of Target-side Context for Neural Machine Translation

Abstract

Cite

Register to see more suggestions