Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

Eleftheria Briakou; Sweta Agrawal; Joel Tetreault; Marine Carpuat

Conference ProceedingsOPEN ACCESS

Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (2021) 1321-1336

DOI: 10.18653/v1/2021.emnlp-main.100

21Citations

59Readers

Abstract

While the field of style transfer (ST) has been growing rapidly, it has been hampered by a lack of standardized practices for automatic evaluation. In this paper, we evaluate leading ST automatic metrics on the oft-researched task of formality style transfer. Unlike previous evaluations, which focus solely on English, we expand our focus to Brazilian-Portuguese, French, and Italian, making this work the first multilingual evaluation of metrics in ST. We outline best practices for automatic evaluation in (formality) style transfer and identify several models that correlate well with human judgments and are robust across languages. We hope that this work will help accelerate development in ST, where human evaluation is often challenging to collect.

Cite

CITATION STYLE

APA

Briakou, E., Agrawal, S., Tetreault, J., & Carpuat, M. (2021). Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer. In EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 1321–1336). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.emnlp-main.100

Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

Abstract

Cite

Register to see more suggestions