Neural text summarization: A critical evaluation

Wojciech Kryściński; Nitish Shirish Keskar; Bryan McCann; Caiming Xiong; Richard Socher

Conference Proceedings, Open Access

EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (2019) 540-551

DOI: 10.18653/v1/d19-1051

224 Citations

386 Readers

Abstract

Text summarization aims at compressing long documents into a shorter form that conveys the most important parts of the original document. Despite increased interest in the community and notable research effort, progress on benchmark datasets has stagnated. We critically evaluate key ingredients of the current research setup: datasets, evaluation metrics, and models, and highlight three primary shortcomings: 1) automatically collected datasets leave the task underconstrained and may contain noise detrimental to training and evaluation, 2) the current evaluation protocol is weakly correlated with human judgment and does not account for important characteristics such as factual correctness, 3) models overfit to layout biases of current datasets and offer limited diversity in their outputs.

Cite

APA

Kryściński, W., Keskar, N. S., McCann, B., Xiong, C., & Socher, R. (2019). Neural text summarization: A critical evaluation. In EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 540–551). Association for Computational Linguistics. https://doi.org/10.18653/v1/d19-1051
