Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization

5Citations
Citations of this article
51Readers
Mendeley users who have this article in their library.

Abstract

This paper explores three simple data manipulation techniques (synthesis, augmentation, curriculum) for improving abstractive summarization models without the need for any additional data. We introduce a method of data synthesis with paraphrasing, a data augmentation technique with sample mixing, and curriculum learning with two new difficulty metrics based on specificity and abstractiveness. We conduct experiments to show that these three techniques can help improve abstractive summarization across two summarization models and two different small datasets. Furthermore, we show that these techniques can improve performance when applied in isolation and when combined.

Cite

CITATION STYLE

APA

Magooda, A., & Litman, D. (2021). Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization. In Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (pp. 2043–2052). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-emnlp.175

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free