A Study on Multilingual Transfer Learning in Neural Machine Translation: Finding the Balance Between Languages

Adrien Bardet; Fethi Bougares; Loïc Barrault

Conference Proceedings

A Study on Multilingual Transfer Learning in Neural Machine Translation: Finding the Balance Between Languages

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11816 LNAI 59-70

DOI: 10.1007/978-3-030-31372-2_5

1Citations

8Readers

Get full text

Abstract

Transfer learning is an interesting approach to tackle the low resource languages machine translation problem. Transfer learning, as a machine learning algorithm, requires to make several choices such as selecting the training data and more particularly language pairs and their available quantity and quality. Other important choices must be made during the preprocessing step, like selecting data to learn subword units, the subsequent model’s vocabulary. It is still unclear how to optimize this transfer. In this paper, we analyse the impact of such early choices on the performance of the systems. We show that systems performance are depending on quantity of available data and proximity of the involved languages as well as the protocol used to determined the subword units model and consequently the vocabulary. We also propose a multilingual approach to transfer learning involving a universal encoder. This multilingual approach is comparable to a multi-source transfer learning setup where the system learns from multiple languages before the transfer. We analyse subword units distribution across different languages and show that, once again, preprocessing choices impact systems overall performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Bardet, A., Bougares, F., & Barrault, L. (2019). A Study on Multilingual Transfer Learning in Neural Machine Translation: Finding the Balance Between Languages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11816 LNAI, pp. 59–70). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-31372-2_5

A Study on Multilingual Transfer Learning in Neural Machine Translation: Finding the Balance Between Languages

Abstract

Author supplied keywords

Cite

Register to see more suggestions