HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation

Qianyu He; Yikai Zhang; Jiaqing Liang; Yuncheng Huang; Yanghua Xiao; Yunwen Chen

Conference ProceedingsOPEN ACCESS

HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 1 12557-12572

DOI: 10.18653/v1/2023.acl-long.702

0Citations

9Readers

Abstract

Similes play an imperative role in creative writing such as story and dialogue generation. Proper evaluation metrics are like a beacon guiding the research of simile generation (SG). However, it remains under-explored as to what criteria should be considered, how to quantify each criterion into metrics, and whether the metrics are effective for comprehensive, efficient, and reliable SG evaluation. To address the issues, we establish HAUSER, a holistic and automatic evaluation system for the SG task, which consists of five criteria from three perspectives and automatic metrics for each criterion. Through extensive experiments, we verify that our metrics are significantly more correlated with human ratings from each perspective compared with prior automatic metrics. Resources of HAUSER are publicly available at https://github.com/Abbey4799/HAUSER.

Cite

CITATION STYLE

APA

He, Q., Zhang, Y., Liang, J., Huang, Y., Xiao, Y., & Chen, Y. (2023). HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 12557–12572). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.702

HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation

Abstract

Cite

Register to see more suggestions