Dependency-based Chinese-English statistical machine translation

Xiaodong Shi; Yidong Chen; Nfeng Jia

Conference Proceedings

Dependency-based Chinese-English statistical machine translation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4394 LNCS 385-396

DOI: 10.1007/978-3-540-70939-8_34

3Citations

4Readers

Get full text

Abstract

We present a Chinese-English Statistical Machine Translation (SMT) system based on dependency tree mappings. We use a state-of-the-art dependency parser to parse the English translation of the Penn Chinese Treebank to make it bilingual and then learn a tree-to-tree dependency mapping model. We also train a phrase-based translation model and collect a bilingual phrase lexicon to bootstrap a treelet translation model. For decoding, we use the same dependency parser on Chinese, using a log-linear framework to integrate the learned translation model with a variety of dependency tree based probability models, and then find the best English dependency tree by dynamic programming. Finally the English tree is flattened to produce the translation. We evaluate our system on the 863 and NIST 2005 Chinese-English MT test data and find that the dependency-based model significantly outperforms Caravan, our phrase-based SMT system which participated in NIST 2006 and IWSLT 2006. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Shi, X., Chen, Y., & Jia, N. (2007). Dependency-based Chinese-English statistical machine translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4394 LNCS, pp. 385–396). Springer Verlag. https://doi.org/10.1007/978-3-540-70939-8_34

Dependency-based Chinese-English statistical machine translation

Abstract

Cite

Register to see more suggestions