A non-contiguous Tree Sequence Alignment-based Model for Statistical Machine Translation

19Citations
Citations of this article
90Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The tree sequence based translation model allows the violation of syntactic boundaries in a rule to capture non-syntactic phrases, where a tree sequence is a contiguous sequence of sub-trees. This paper goes further to present a translation model based on non-contiguous tree sequence alignment, where a non-contiguous tree sequence is a sequence of sub-trees and gaps. Compared with the contiguous tree sequence-based model, the proposed model can well handle non-contiguous phrases with any large gaps by means of non-contiguous tree sequence alignment. An algorithm targeting the non-contiguous constituent decoding is also proposed. Experimental results on the NIST MT-05 Chinese-English translation task show that the proposed model statistically significantly outperforms the baseline systems.

Cite

CITATION STYLE

APA

Sun, J., Zhang, M., & Tan, C. L. (2009). A non-contiguous Tree Sequence Alignment-based Model for Statistical Machine Translation. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (pp. 914–922). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1690219.1690275

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free