Latent alignment objectives such as CTC and AXE significantly improve non-autoregressive machine translation models. Can they improve autoregressive models as well? We explore the possibility of training autoregressive machine translation models with latent alignment objectives, and observe that, in practice, this approach results in degenerate models. We provide a theoretical explanation for these empirical results, and prove that latent alignment objectives are incompatible with teacher forcing.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Haviv, A., Vassertail, L., & Levy, O. (2021). Can Latent Alignments Improve Autoregressive Machine Translation? In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 2637–2641). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.209