Tag Assisted Neural Machine Translation of Film Subtitles

1Citations
Citations of this article
48Readers
Mendeley users who have this article in their library.

Abstract

We implemented a neural machine translation system that uses automatic sequence tagging to improve the quality of translation. Instead of operating on unannotated sentence pairs, our system uses pre-trained tagging systems to add linguistic features to source and target sentences. Our proposed neural architecture learns a combined embedding of tokens and tags in the encoder, and simultaneous token and tag prediction in the decoder. Compared to a baseline with unannotated training, this architecture increased the BLEU score of German to English film subtitle translation outputs by 1.61 points using named entity tags; however, the BLEU score decreased by 0.38 points using part-of-speech tags. This demonstrates that certain token-level tag outputs from off-the-shelf tagging systems can improve the output of neural translation systems using our combined embedding and simultaneous decoding extensions.

Cite

CITATION STYLE

APA

Siekmeier, A., Lee, W. K., Kwon, H., & Lee, J. H. (2021). Tag Assisted Neural Machine Translation of Film Subtitles. In IWSLT 2021 - 18th International Conference on Spoken Language Translation, Proceedings (pp. 255–262). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.iwslt-1.30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free