Abstract
In this paper, we improve the attention or alignment accuracy of neural machine translation by utilizing the alignments of training sentence pairs. We simply compute the distance between the machine attentions and the “true” alignments, and minimize this cost in the training procedure. Our experiments on large-scale Chinese-to-English task show that our model improves both translation and alignment qualities significantly over the large-vocabulary neural machine translation system, and even beats a state-of-the-art traditional syntax-based system.
Cite
CITATION STYLE
Mi, H., Wang, Z., & Ittycheriah, A. (2016). Supervised attentions for neural machine translation. In EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 2283–2288). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d16-1249
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.