Abstract
We describe the NRC-CNRC systems submitted to the AmericasNLP shared task on machine translation. We submitted systems translating from Spanish into Wixárika, Nahuatl, Rarámuri, and Guaraní. Our best neural machine translation systems used multilingual pretraining, ensembling, finetuning, training on parts of the development data, and subword regularization. We also submitted translation memory systems as a strong baseline.
Cite
CITATION STYLE
Knowles, R., Stewart, D., Larkin, S., & Littell, P. (2021). NRC-CNRC Machine Translation Systems for the 2021 AmericasNLP Shared Task. In Proceedings of the 1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021 (pp. 224–233). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.americasnlp-1.25
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.