Conditional Set Generation Using SEQ2SEQ Models

7 citations · 18 Mendeley readers

Abstract

Conditional set generation learns a mapping from an input sequence of tokens to a set. Several NLP tasks, such as entity typing and dialogue emotion tagging, are instances of set generation. SEQ2SEQ models, a popular choice for set generation, treat a set as a sequence and do not fully leverage its key properties, namely order-invariance and cardinality. We propose a novel algorithm for effectively sampling informative orders over the combinatorial space of label orders. We jointly model the set cardinality and output by prepending the set size and taking advantage of the autoregressive factorization used by SEQ2SEQ models. Our method is a model-independent data augmentation approach that endows any SEQ2SEQ model with the signals of order-invariance and cardinality. Training a SEQ2SEQ model on this augmented data (without any additional annotations) yields an average relative improvement of 20% on four benchmark datasets across various models: BART-base, T5-11B, and GPT3-175B.
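To make the augmentation idea concrete, here is a minimal sketch of how one might construct training targets that carry both signals the abstract describes: a prepended set-size token and multiple label orders per example. The function name, the `<size-k>` token format, and the use of uniform-random order sampling are all assumptions for illustration; the paper's method samples *informative* orders rather than uniform ones.

```python
import itertools
import random

def augment_set_targets(labels, num_orders=3, seed=0):
    """Build augmented SEQ2SEQ target strings for one label set.

    Each target prepends the set cardinality as a special token and
    linearizes the labels in a sampled order, so the model sees both
    the set size and several orderings of the same set.

    NOTE: uniform-random order sampling is a simplification; the
    paper samples informative orders, not uniform ones. Enumerating
    all permutations is only feasible for small sets, which is the
    typical regime for tasks like entity typing.
    """
    rng = random.Random(seed)
    perms = list(itertools.permutations(labels))
    rng.shuffle(perms)
    size_token = f"<size-{len(labels)}>"  # hypothetical token format
    return [" ".join((size_token,) + p) for p in perms[:num_orders]]
```

For example, `augment_set_targets(["person", "artist", "musician"], num_orders=2)` produces two targets, each beginning with `<size-3>` and listing the same three labels in a different order; a SEQ2SEQ model trained on such targets receives the order-invariance and cardinality signals without any extra annotation.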

Citation (APA)

Madaan, A., Rajagopal, D., Tandon, N., Yang, Y., & Bosselut, A. (2022). Conditional Set Generation Using SEQ2SEQ Models. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 (pp. 4874–4896). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.emnlp-main.324
