CIAug: Equipping Interpolative Augmentation with Curriculum Learning

Abstract

Interpolative data augmentation has proven to be effective for NLP tasks. Despite its merits, the sample selection process in mixup is random, which may hinder generalization and slow convergence. We propose CIAug, a novel curriculum-based learning method that builds upon mixup. It leverages the relative position of samples in hyperbolic embedding space as a complexity measure to gradually mix up increasingly difficult and diverse samples over the course of training. CIAug achieves state-of-the-art results over existing interpolative augmentation methods on 10 benchmark datasets across 4 languages in text classification and named-entity recognition tasks. It also converges and reaches benchmark F1 scores 3 times faster. We empirically analyze the various components of CIAug and evaluate its robustness against adversarial attacks.
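To illustrate the idea summarized above (curriculum-ordered mixup, with position in hyperbolic space acting as a difficulty measure), the sketch below shows one plausible reading of the pipeline: score samples by their Poincaré-ball distance from an anchor, admit an increasing fraction of the easiest samples as training progresses, and apply standard mixup within the admitted pool. The anchor point, the linear pacing schedule, the one-hot label assumption, and all function names are assumptions made for illustration; this is not the authors' exact implementation.

```python
import torch

def poincare_distance(u, v, eps=1e-5):
    # Geodesic distance between points u and v inside the unit Poincare ball.
    sq_dist = torch.sum((u - v) ** 2, dim=-1)
    norm_u = torch.clamp(1 - torch.sum(u ** 2, dim=-1), min=eps)
    norm_v = torch.clamp(1 - torch.sum(v ** 2, dim=-1), min=eps)
    return torch.acosh(1 + 2 * sq_dist / (norm_u * norm_v))

def curriculum_mixup_batch(embeddings, labels, anchor, step, total_steps, alpha=0.2):
    """Order samples by hyperbolic distance to an (assumed) anchor as a difficulty
    proxy, and mix only within the easiest fraction, which grows with training.
    `labels` are assumed to be one-hot so they can be linearly interpolated."""
    # Difficulty: hyperbolic distance of each sample's embedding from the anchor.
    difficulty = poincare_distance(embeddings, anchor)
    order = torch.argsort(difficulty)                      # easy -> hard
    frac = min(1.0, 0.2 + 0.8 * step / total_steps)        # linear pacing schedule (assumed)
    pool = order[: max(2, int(frac * len(order)))]         # currently admissible samples
    # Standard mixup within the admissible pool.
    partner = pool[torch.randperm(len(pool))]
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    mixed_x = lam * embeddings[pool] + (1 - lam) * embeddings[partner]
    mixed_y = lam * labels[pool].float() + (1 - lam) * labels[partner].float()
    return mixed_x, mixed_y
```

As a usage note, one would call `curriculum_mixup_batch` once per training step on (hyperbolic) sample embeddings, so that early steps mix only nearby, easier samples and later steps draw from the full, more diverse pool.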

Citation (APA)

Sawhney, R., Soun, R., Pandit, S., Thakkar, M., Malaviya, S., & Pinter, Y. (2022). CIAug: Equipping Interpolative Augmentation with Curriculum Learning. In NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 1758–1764). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.naacl-main.127
