Instruction Fine-Tuning (IFT) is a powerful paradigm that strengthens the zero-shot capabilities of Large Language Models (LLMs), but in doing so induces new evaluation metric requirements. We show LLM-based metrics to be well adapted to these requirements, and leverage them to conduct an investigation of task-specialization strategies, quantifying the tradeoffs that emerge in practical industrial settings. Our findings offer practitioners actionable insights for real-world IFT model deployment.
Citation:
Faysse, M., Viaud, G., Hudelot, C., & Colombo, P. (2023). Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 9033–9048). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.559