Text Augmentation Using Dataset Reconstruction for Low-Resource Classification

Adir Rahamim; Guy Uziel; Esther Goldbraich; Ateret Anaby-Tavor

Conference Proceedings

Text Augmentation Using Dataset Reconstruction for Low-Resource Classification

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 7389-7402

DOI: 10.18653/v1/2023.findings-acl.466

11Citations

20Readers

Get full text

Abstract

In the deployment of real-world text classification models, label scarcity is a common problem. As the number of classes increases, this problem becomes even more complex. One way to address this problem is by applying text augmentation methods. One of the more prominent methods involves using the text-generation capabilities of language models. We propose Text AUgmentation by Dataset Reconstruction (TAU-DR), a novel method of data augmentation for text classification. We conduct experiments on several multi-class datasets, showing that our approach improves the current state-of-the-art techniques for data augmentation.

Cite

CITATION STYLE

APA

Rahamim, A., Uziel, G., Goldbraich, E., & Anaby-Tavor, A. (2023). Text Augmentation Using Dataset Reconstruction for Low-Resource Classification. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 7389–7402). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.466

Text Augmentation Using Dataset Reconstruction for Low-Resource Classification

Abstract

Cite

Register to see more suggestions