Can Latent Alignments Improve Autoregressive Machine Translation?

Adi Haviv; Lior Vassertail; Omer Levy

Conference ProceedingsOPEN ACCESS

Can Latent Alignments Improve Autoregressive Machine Translation?

NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (2021) 2637-2641

DOI: 10.18653/v1/2021.naacl-main.209

6Citations

66Readers

Abstract

Latent alignment objectives such as CTC and AXE significantly improve non-autoregressive machine translation models. Can they improve autoregressive models as well? We explore the possibility of training autoregressive machine translation models with latent alignment objectives, and observe that, in practice, this approach results in degenerate models. We provide a theoretical explanation for these empirical results, and prove that latent alignment objectives are incompatible with teacher forcing.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Haviv, A., Vassertail, L., & Levy, O. (2021). Can Latent Alignments Improve Autoregressive Machine Translation? In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 2637–2641). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.209

Readers' Seniority

PhD / Post grad / Masters / Doc 15

65%

Researcher 5

22%

Lecturer / Post doc 2

Professor / Associate Prof. 1

Readers' Discipline

Computer Science 19

73%

Linguistics 4

15%

Engineering 2

Neuroscience 1

Can Latent Alignments Improve Autoregressive Machine Translation?

Abstract

References Powered by Scopus

Neural machine translation of rare words with subword units

Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks

End-to-end non-autoregressive neural machine translation with connectionist temporal classification

Cited by Powered by Scopus

O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning

Hierarchical Latent Alignment for Non-Autoregressive Generation under High Compression Ratio

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline