As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical Translation

6Citations
Citations of this article
58Readers
Mendeley users who have this article in their library.

Abstract

Mistranslated numbers have the potential to cause serious effects, such as financial loss or medical misinformation. In this work we develop comprehensive assessments of the robustness of neural machine translation systems to numerical text via behavioural testing. We explore a variety of numerical translation capabilities a system is expected to exhibit and design effective test examples to expose system underperformance. We find that numerical mistranslation is a general issue: major commercial systems and state-of-the-art research models fail on many of our test examples, for high- and low-resource languages. Our tests reveal novel errors that have not previously been reported in NMT systems, to the best of our knowledge. Lastly, we discuss strategies to mitigate numerical mistranslation.

Cite

CITATION STYLE

APA

Wang, J., Xu, C., Guzmán, F., El-Kishky, A., Rubinstein, B. I. P., & Cohn, T. (2021). As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical Translation. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 4711–4717). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.415

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free