NTT's machine translation systems for WMT19 robustness task

Abstract

This paper describes NTT's submission to the WMT19 robustness task. The task focuses mainly on translating noisy text (e.g., posts on Twitter), which poses different difficulties from typical translation tasks such as news translation. Our submission combined several techniques, including the use of a synthetic corpus, domain adaptation, and a placeholder mechanism, and significantly improved over the previous baseline. Experimental results revealed that the placeholder mechanism, which temporarily replaces non-standard tokens such as emojis and emoticons with special placeholder tokens during translation, improves translation accuracy even on noisy text.
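
The placeholder mechanism described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the regular expression, the `<PHn>` token format, and the function names `mask` and `unmask` are all assumptions made for illustration, and the translation step itself is left out. The idea is only that non-standard tokens are swapped for placeholders before translation and restored afterwards.

```python
import re

# Assumption: a rough pattern for "non-standard tokens". The emoji ranges and
# emoticon shapes are illustrative only, not the rules used in the paper.
NON_STANDARD = re.compile(
    r"[\U0001F300-\U0001FAFF\u2600-\u27BF]"  # common emoji code point blocks
    r"|[:;=][-']?[)(DP](?!\w)"               # simple emoticons such as :) ;-( :D
)

# Hypothetical placeholder format: <PH0>, <PH1>, ...
PLACEHOLDER = re.compile(r"<PH(\d+)>")


def mask(text):
    """Replace each non-standard token with a numbered placeholder.

    Returns the masked text and the list of original tokens, in order.
    """
    originals = []

    def _sub(match):
        originals.append(match.group(0))
        return f" <PH{len(originals) - 1}> "

    masked = NON_STANDARD.sub(_sub, text)
    # Collapse the extra whitespace introduced around the placeholders.
    return " ".join(masked.split()), originals


def unmask(text, originals):
    """Restore original tokens; unknown placeholders are left untouched."""

    def _sub(match):
        index = int(match.group(1))
        return originals[index] if index < len(originals) else match.group(0)

    return PLACEHOLDER.sub(_sub, text)


if __name__ == "__main__":
    source = "great game \U0001F602 see you :)"
    masked, originals = mask(source)
    print(masked)  # great game <PH0> see you <PH1>
    # A real system would translate `masked` here; the placeholders are
    # expected to pass through the translation model unchanged.
    print(unmask(masked, originals))
```

Because each placeholder is numbered, the tokens can be restored correctly even if the translation reorders them. Note that this sketch separates placeholders with spaces, so an emoticon attached directly to a word comes back as its own token.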

Cite

Murakami, S., Morishita, M., Hirao, T., & Nagata, M. (2019). NTT’s machine translation systems for WMT19 robustness task. In WMT 2019 - 4th Conference on Machine Translation, Proceedings of the Conference (Vol. 2, pp. 544–551). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5365
