Abstractive Text Summarization Using the BRIO Training Paradigm

Citations: 2
Mendeley readers: 12

Abstract

Summary sentences produced by abstractive summarization models may be coherent and comprehensive, but they lack control and rely heavily on reference summaries. The BRIO training paradigm assumes a non-deterministic distribution over candidate summaries to reduce the model's dependence on reference summaries and to improve model performance during inference. This paper presents a straightforward but effective technique for improving abstractive summaries: fine-tuning pre-trained language models and then training them with the BRIO paradigm. We build a text summarization dataset for Vietnamese, called VieSum. We perform experiments with abstractive summarization models trained with the BRIO paradigm on the CNNDM and VieSum datasets. The results show that the models, trained on basic hardware, outperform all existing abstractive summarization models, especially for Vietnamese.

Citation (APA)

Lam, K. N., Doan, T. G., Pham, K. T., & Kalita, J. (2023). Abstractive text summarization using the BRIO training paradigm. In Findings of the Association for Computational Linguistics: ACL 2023 (pp. 92–99). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.7
