Augmenting translation models with simulated acoustic confusions for improved spoken language translation

18Citations
Citations of this article
81Readers
Mendeley users who have this article in their library.

Abstract

We propose a novel technique for adapting text-based statistical machine translation to deal with input from automatic speech recognition in spoken language translation tasks. We simulate likely misrecognition errors using only a source language pronunciation dictionary and language model (i.e., without an acoustic model), and use these to augment the phrase table of a standard MT system. The augmented system can thus recover from recognition errors during decoding using synthesized phrases. Using the outputs of five different English ASR systems as input, we find consistent and significant improvements in translation quality. Our proposed technique can also be used in conjunction with lattices as ASR output, leading to further improvements. © 2014 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Tsvetkov, Y., Metze, F., & Dyer, C. (2014). Augmenting translation models with simulated acoustic confusions for improved spoken language translation. In 14th Conference of the European Chapter of the Association for Computational Linguistics 2014, EACL 2014 (pp. 616–625). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/e14-1065

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free