Syntax-based simultaneous translation through prediction of unseen syntactic constituents

24Citations
Citations of this article
114Readers
Mendeley users who have this article in their library.

Abstract

Simultaneous translation is a method to reduce the latency of communication through machine translation (MT) by dividing the input into short segments before performing translation. However, short segments pose problems for syntaxbased translation methods, as it is difficult to generate accurate parse trees for sub-sentential segments. In this paper, we perform the first experiments applying syntax-based SMT to simultaneous translation, and propose two methods to prevent degradations in accuracy: a method to predict unseen syntactic constituents that help generate complete parse trees, and a method that waits for more input when the current utterance is not enough to generate a fluent translation. Experiments on English-Japanese translation show that the proposed methods allow for improvements in accuracy, particularly with regards to word order of the target sentences.

Cite

CITATION STYLE

APA

Oda, Y., Neubig, G., Sakti, S., Toda, T., & Nakamura, S. (2015). Syntax-based simultaneous translation through prediction of unseen syntactic constituents. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 198–207). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-1020

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free