Active Example Selection for In-Context Learning

Yiming Zhang; Shi Feng; Chenhao Tan

Conference Proceedings

Active Example Selection for In-Context Learning

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 (2022) 9134-9148

DOI: 10.18653/v1/2022.emnlp-main.622

104Citations

85Readers

Get full text

Abstract

With a handful of demonstration examples, large-scale language models show strong capability to perform various tasks by in-context learning from these examples, without any fine-tuning. We demonstrate that in-context learning performance can be highly unstable across samples of examples, indicating the idiosyncrasies of how language models acquire information. We formulate example selection for in-context learning as a sequential decision problem, and propose a reinforcement learning algorithm for identifying generalizable policies to select demonstration examples. For GPT-2, our learned policies demonstrate strong abilities of generalizing to unseen tasks in training, with a 5.8% improvement on average. Examples selected from our learned policies can even achieve a small improvement on GPT-3 Ada. However, the improvement diminishes on larger GPT-3 models, suggesting emerging capabilities of large language models.

Cite

CITATION STYLE

APA

Zhang, Y., Feng, S., & Tan, C. (2022). Active Example Selection for In-Context Learning. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 (pp. 9134–9148). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.emnlp-main.622

Active Example Selection for In-Context Learning

Abstract

Cite

Register to see more suggestions