Abstract
This study explores an LLM's ability to learn new languages using explanations found in a grammar book$\unicode{x2014}$a process we term "explicit learning." To rigorously assess this ability, we design controlled translation experiments between English and constructed languages generated$\unicode{x2014}$by specific cryptographic means$\unicode{x2014}$out of Latin or French. Contrary to previous studies, our results demonstrate that LLMs do possess a measurable capacity for explicit learning. This ability, however, diminishes as the complexity of the linguistic phenomena to be learned increases. Supervised fine-tuning on ad hoc chains of thought significantly enhances LLM performance but struggles to generalize to typologically novel or more complex linguistic features. These findings point to the need for more diverse training sets and alternative fine-tuning strategies to further improve explicit learning by LLMs, benefiting low-resource languages typically described in grammar books but lacking extensive corpora.
Cite
CITATION STYLE
Marmonier, M., Bawden, R., & Sagot, B. (2025). Explicit Learning and the LLM in Machine Translation (pp. 31360–31410). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2025.emnlp-main.1599
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.