Unobserved Local Structures Make Compositional Generalization Hard

15Citations
Citations of this article
34Readers
Mendeley users who have this article in their library.

Abstract

While recent work has shown that sequence-to-sequence models struggle to generalize to new compositions (termed compositional generalization), little is known on what makes compositional generalization hard on a particular test instance. In this work, we investigate the factors that make generalization to certain test instances challenging. We first substantiate that some examples are more difficult than others by showing that different models consistently fail or succeed on the same test instances. Then, we propose a criterion for the difficulty of an example: a test instance is hard if it contains a local structure that was not observed at training time. We formulate a simple decision rule based on this criterion and empirically show it predicts instance-level generalization well across 5 different semantic parsing datasets, substantially better than alternative decision rules. Last, we show local structures can be leveraged for creating difficult adversarial compositional splits and also to improve compositional generalization under limited training budgets by strategically selecting examples for the training set.

References Powered by Scopus

Long Short-Term Memory

77926Citations
N/AReaders
Get full text

Incorporating copying mechanism in sequence-to-sequence learning

995Citations
N/AReaders
Get full text

Building a semantic parser overnight

267Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Diverse Demonstrations Improve In-context Compositional Generalization

37Citations
N/AReaders
Get full text

Entity Tracking in Language Models

13Citations
N/AReaders
Get full text

What's the Meaning of Superhuman Performance in Today's NLU?

9Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Bogin, B., Gupta, S., & Berant, J. (2022). Unobserved Local Structures Make Compositional Generalization Hard. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 (pp. 2731–2747). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.emnlp-main.175

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 13

81%

Researcher 2

13%

Lecturer / Post doc 1

6%

Readers' Discipline

Tooltip

Computer Science 15

79%

Linguistics 2

11%

Neuroscience 1

5%

Engineering 1

5%

Save time finding and organizing research with Mendeley

Sign up for free