A comparative study on reordering constraints in statistical machine translation

Richard Zens; Hermann Ney

Conference ProceedingsOPEN ACCESS

A comparative study on reordering constraints in statistical machine translation

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2003) 2003-July

DOI: 10.3115/1075096.1075115

78Citations

121Readers

Abstract

In statistical machine translation, the generation of a translation hypothesis is computationally expensive. If arbitrary word-reorderings are permitted, the search problem is NP-hard. On the other hand, if we restrict the possible word-reorderings in an appropriate way, we obtain a polynomial-time search algorithm. In this paper, we compare two different reordering constraints, namely the ITG constraints and the IBM constraints. This comparison includes a theoretical discussion on the permitted number of reorderings for each of these constraints. We show a connection between the ITG constraints and the since 1870 known Schröder numbers. We evaluate these constraints on two tasks: the Verbmobil task and the Canadian Hansards task. The evaluation consists of two parts: First, we check how many of the Viterbi alignments of the training corpus satisfy each of these constraints. Second, we restrict the search to each of these constraints and compare the resulting translation hypotheses. The experiments will show that the baseline ITG constraints are not sufficient on the Canadian Hansards task. Therefore, we present an extension to the ITG constraints. These extended ITG constraints increase the alignment coverage from about 87% to 96%.

Cite

CITATION STYLE

APA

Zens, R., & Ney, H. (2003). A comparative study on reordering constraints in statistical machine translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2003-July). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1075096.1075115

A comparative study on reordering constraints in statistical machine translation

Abstract

Cite

Register to see more suggestions