Attention-based Multimodal Neural Machine Translation

Po Yao Huang; Frederick Liu; Sz Rung Shiang; Jean Oh; Chris Dyer

Conference ProceedingsOPEN ACCESS

Attention-based Multimodal Neural Machine Translation

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2016) 2 639-645

DOI: 10.18653/v1/w16-2360

144Citations

167Readers

Abstract

We present a novel neural machine translation (NMT) architecture associating visual and textual features for translation tasks with multiple modalities. Transformed global and regional visual features are concatenated with text to form attendable sequences which are dissipated over parallel long short-term memory (LSTM) threads to assist the encoder generating a representation for attention-based decoding. Experiments show that the proposed NMT outperform the text-only baseline.

Cite

CITATION STYLE

APA

Huang, P. Y., Liu, F., Shiang, S. R., Oh, J., & Dyer, C. (2016). Attention-based Multimodal Neural Machine Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2, pp. 639–645). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-2360

Attention-based Multimodal Neural Machine Translation

Abstract

Cite

Register to see more suggestions