Abstract
Good generalization is an important quality of well-trained, robust neural networks; however, networks typically struggle when faced with samples outside the training distribution. Mixup is a technique that improves generalization, reduces memorization, and increases adversarial robustness. We apply Manifold Mixup, a variant of Mixup, to the sentence classification problem and present results along with an ablation study. Our method outperforms CNN, LSTM, and vanilla BERT models in generalization.
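The abstract only names the technique, so as a rough illustration: Manifold Mixup interpolates hidden representations and their labels for random pairs of examples, using a coefficient lambda drawn from a Beta(alpha, alpha) distribution. The sketch below is a minimal, hypothetical PyTorch version applied to pooled sentence vectors (e.g. BERT's [CLS] embedding); the alpha value, the in-batch pairing, and the linear classifier head are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch of Manifold Mixup on sentence representations.
# Assumptions (not from the paper): alpha = 0.4, mixing happens on the
# pooled 768-d [CLS] vector, and pairs come from an in-batch permutation.
import torch
import torch.nn.functional as F

def manifold_mixup(hidden, labels, num_classes, alpha=0.4):
    """Interpolate hidden states and one-hot labels with lambda ~ Beta(alpha, alpha)."""
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    perm = torch.randperm(hidden.size(0))      # random pairing within the batch
    targets = F.one_hot(labels, num_classes).float()
    mixed_hidden = lam * hidden + (1.0 - lam) * hidden[perm]
    mixed_targets = lam * targets + (1.0 - lam) * targets[perm]
    return mixed_hidden, mixed_targets

# Usage: mix the sentence vectors, then train the classifier head on the
# soft (interpolated) targets with a cross-entropy-style loss.
hidden = torch.randn(8, 768)                   # stand-in for BERT [CLS] vectors
labels = torch.randint(0, 2, (8,))             # binary sentence labels
head = torch.nn.Linear(768, 2)                 # illustrative classifier head
mixed_hidden, mixed_targets = manifold_mixup(hidden, labels, num_classes=2)
log_probs = F.log_softmax(head(mixed_hidden), dim=-1)
loss = -(mixed_targets * log_probs).sum(dim=-1).mean()
loss.backward()
```

Note that in the general Manifold Mixup formulation the layer at which mixing happens is typically chosen at random per batch; for brevity, this sketch mixes only the final pooled representation.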
Jindal, A., Gnaneshwar, D., Sawhney, R., & Shah, R. R. (2020). Leveraging BERT with Mixup for Sentence Classification. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 13829–13830). AAAI Press.