Robust neural machine translation with ASR errors

10Citations
Citations of this article
78Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In many practical applications, neural machine translation systems have to deal with the input from automatic speech recognition (ASR) systems which may contain a certain number of errors. This leads to two problems which degrade translation performance. One is the discrepancy between the training and testing data and the other is the translation error caused by the input errors may ruin the whole translation. In this paper, we propose a method to handle the two problems so as to generate robust translation to ASR errors. First, we simulate ASR errors in the training data so that the data distribution in the training and test is consistent. Second, we focus on ASR errors on homophone words and words with similar pronunciation and make use of their pronunciation information to help the translation model to recover from the input errors. Experiments on two Chinese-English data sets show that our method is more robust to input errors and can outperform the strong Transformer baseline significantly.

Cite

CITATION STYLE

APA

Xue, H., Feng, Y., Gu, S., & Chen, W. (2020). Robust neural machine translation with ASR errors. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 15–23). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.autosimtrans-1.3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free