Choosing What to Mask: More Informed Masking for Multimodal Machine Translation

3Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

Pre-trained language models have achieved remarkable results on several NLP tasks. Most of them adopt masked language modeling to learn representations by randomly masking tokens and predicting them based on their context. However, this random selection of tokens to be masked is inefficient to learn some language patterns as it may not consider linguistic information that can be helpful for many NLP tasks, such as multimodal machine translation (MMT). Hence, we propose three novel masking strategies for cross-lingual visual pre-training - more informed visual masking, more informed textual masking, and more informed visual and textual masking - each one focusing on learning different linguistic patterns. We apply them to Vision Translation Language Modelling for video subtitles (Sato et al., 2022) and conduct extensive experiments on the Portuguese-English MMT task. The results show that our masking approaches yield significant improvements over the original random masking strategy for downstream MMT performance. Our models outperform the MMT baseline and we achieve state-of-the-art accuracy (52.70 in terms of BLEU score) on the How2 dataset, indicating that more informed masking helps in acquiring an understanding of specific language structures and has great potential for language understanding.

Cite

CITATION STYLE

APA

Sato, J., Caseli, H., & Specia, L. (2023). Choosing What to Mask: More Informed Masking for Multimodal Machine Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 4, pp. 244–253). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-srw.35

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free