A graph-theoretic summary evaluation for ROUGE

29Citations
Citations of this article
126Readers
Mendeley users who have this article in their library.

Abstract

ROUGE is one of the first and most widely used evaluation metrics for text summarization. However, its assessment merely relies on surface similarities between peer and model summaries. Consequently, ROUGE is unable to fairly evaluate summaries including lexical variations and paraphrasing. We propose a graph-based approach adopted into ROUGE to evaluate summaries based on both lexical and semantic similarities. Experiment results over TAC AESOP datasets show that exploiting the lexico-semantic similarity of the words used in summaries would significantly help ROUGE correlate better with human judgments.

Cite

CITATION STYLE

APA

ShafieiBavani, E., Ebrahimi, M., Wong, R., & Chen, F. (2018). A graph-theoretic summary evaluation for ROUGE. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 (pp. 762–767). Association for Computational Linguistics. https://doi.org/10.18653/v1/d18-1085

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free