Sub-sentence division for tree-based machine translation

Hao Xiong; Wenwen Xu; Haitao Mi; Yang Liu; Qun Liu

Conference ProceedingsOPEN ACCESS

Sub-sentence division for tree-based machine translation

ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (2009) 137-140

DOI: 10.3115/1667583.1667626

10Citations

91Readers

Abstract

Tree-based statistical machine translation models have made significant progress in recent years, especially when replacing 1-best trees with packed forests. However, as the parsing accuracy usually goes down dramatically with the increase of sentence length, translating long sentences often takes long time and only produces degenerate translations. We propose a new method named subsentence division that reduces the decoding time and improves the translation quality for tree-based translation. Our approach divides long sentences into several sub-sentences by exploiting tree structures. Large-scale experiments on the NIST 2008 Chinese-to-English test set show that our approach achieves an absolute improvement of 1.1 BLEU points over the baseline system in 50% less time. © 2009 ACL and AFNLP.

Cite

CITATION STYLE

APA

Xiong, H., Xu, W., Mi, H., Liu, Y., & Liu, Q. (2009). Sub-sentence division for tree-based machine translation. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (pp. 137–140). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1667583.1667626

Sub-sentence division for tree-based machine translation

Abstract

Cite

Register to see more suggestions