Investigating techniques for a deeper understanding of Neural Machine Translation (NMT) systems through data filtering and fine-tuning strategies

0Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In the context of this biomedical shared task, we have implemented data filters to enhance the selection of relevant training data for finetuning from the available training data sources. Specifically, we have employed textometric analysis to detect repetitive segments within the test set, which we have then used for refining the training data used to fine-tune the mBart-50 baseline model. Through this approach, we aim to achieve several objectives: developing a practical fine-tuning strategy for training biomedical in-domain fr<>en models, defining criteria for filtering in-domain training data, and comparing model predictions, finetuning data in accordance with the test set to gain a deeper insight into the functioning of Neural Machine Translation (NMT) systems.

Cite

CITATION STYLE

APA

Zhu, L., Zimina-Poirot, M., Bénard, M., Namdar, B., Ballier, N., Wisniewski, G., & Yunès, J. B. (2023). Investigating techniques for a deeper understanding of Neural Machine Translation (NMT) systems through data filtering and fine-tuning strategies. In Conference on Machine Translation - Proceedings (pp. 275–281). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.wmt-1.28

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free