Abstract
Simultaneous translation is a method to reduce the latency of communication through machine translation (MT) by dividing the input into short segments before performing translation. However, short segments pose problems for syntaxbased translation methods, as it is difficult to generate accurate parse trees for sub-sentential segments. In this paper, we perform the first experiments applying syntax-based SMT to simultaneous translation, and propose two methods to prevent degradations in accuracy: a method to predict unseen syntactic constituents that help generate complete parse trees, and a method that waits for more input when the current utterance is not enough to generate a fluent translation. Experiments on English-Japanese translation show that the proposed methods allow for improvements in accuracy, particularly with regards to word order of the target sentences.
Cite
CITATION STYLE
Oda, Y., Neubig, G., Sakti, S., Toda, T., & Nakamura, S. (2015). Syntax-based simultaneous translation through prediction of unseen syntactic constituents. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 198–207). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-1020
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.