Syntax-based simultaneous translation through prediction of unseen syntactic constituents

Yusuke Oda; Graham Neubig; Sakriani Sakti; Tomoki Toda; Satoshi Nakamura

Conference ProceedingsOPEN ACCESS

Syntax-based simultaneous translation through prediction of unseen syntactic constituents

ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (2015) 1 198-207

DOI: 10.3115/v1/p15-1020

24Citations

114Readers

Abstract

Simultaneous translation is a method to reduce the latency of communication through machine translation (MT) by dividing the input into short segments before performing translation. However, short segments pose problems for syntaxbased translation methods, as it is difficult to generate accurate parse trees for sub-sentential segments. In this paper, we perform the first experiments applying syntax-based SMT to simultaneous translation, and propose two methods to prevent degradations in accuracy: a method to predict unseen syntactic constituents that help generate complete parse trees, and a method that waits for more input when the current utterance is not enough to generate a fluent translation. Experiments on English-Japanese translation show that the proposed methods allow for improvements in accuracy, particularly with regards to word order of the target sentences.

Cite

CITATION STYLE

APA

Oda, Y., Neubig, G., Sakti, S., Toda, T., & Nakamura, S. (2015). Syntax-based simultaneous translation through prediction of unseen syntactic constituents. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 198–207). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-1020

Syntax-based simultaneous translation through prediction of unseen syntactic constituents

Abstract

Cite

Register to see more suggestions