Robust neural machine translation with ASR errors

Haiyang Xue; Yang Feng; Shuhao Gu; Wei Chen

Conference Proceedings

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2020) 15-23

DOI: 10.18653/v1/2020.autosimtrans-1.3

Abstract

In many practical applications, neural machine translation systems have to deal with input from automatic speech recognition (ASR) systems, which may contain a certain number of errors. This leads to two problems that degrade translation performance. One is the discrepancy between the training and test data; the other is that translation errors caused by the input errors may ruin the whole translation. In this paper, we propose a method that handles both problems so as to generate translations that are robust to ASR errors. First, we simulate ASR errors in the training data so that the training and test data distributions are consistent. Second, we focus on ASR errors involving homophones and words with similar pronunciation, and use their pronunciation information to help the translation model recover from the input errors. Experiments on two Chinese-English data sets show that our method is more robust to input errors and significantly outperforms the strong Transformer baseline.
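The abstract does not say how ASR errors are simulated, so the following is only an illustrative sketch of the general idea, not the authors' procedure. It assumes a small hand-written homophone table (`HOMOPHONES`), a per-token substitution probability `p`, and a hypothetical helper `inject_homophone_noise`; a real system would presumably derive the table from a pronunciation lexicon.

```python
import random

# Hypothetical toy table mapping a token to same-sounding alternatives.
# This is an assumption for illustration, not data from the paper.
HOMOPHONES = {
    "他": ["她", "它"],
    "她": ["他", "它"],
    "在": ["再"],
    "再": ["在"],
}


def inject_homophone_noise(tokens, table, p=0.1, rng=None):
    """Return a copy of `tokens` where each token that has homophones
    in `table` is replaced by one of them with probability `p`."""
    rng = rng or random.Random()
    noisy = []
    for tok in tokens:
        candidates = table.get(tok)
        # Only tokens with known homophones can be corrupted.
        if candidates and rng.random() < p:
            noisy.append(rng.choice(candidates))
        else:
            noisy.append(tok)
    return noisy


if __name__ == "__main__":
    source = ["他", "在", "家"]
    # Seeded generator so the corruption is reproducible across runs.
    print(inject_homophone_noise(source, HOMOPHONES, p=0.5, rng=random.Random(0)))
```

Applying such a function to the source side of the training corpus, while leaving the target side untouched, would expose the model to noisy inputs paired with clean translations, which is the kind of train/test consistency the abstract describes.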

Cite

Xue, H., Feng, Y., Gu, S., & Chen, W. (2020). Robust neural machine translation with ASR errors. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 15–23). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.autosimtrans-1.3
