Jointly fine-tuning a Pre-trained Language Model (PLM) on a pre-defined set of tasks with in-context instructions has been shown to improve its generalization performance, allowing us to build a universal language model that can be deployed across task boundaries. In this work, we explore for the first time whether this attractive property of in-context instruction learning can be extended to a scenario in which tasks are fed to the target PLM in a sequential manner. The primary objective of so-called lifelong in-context instruction learning is to improve the target PLM's instance- and task-level generalization performance as it observes more tasks. DYNAINST, the proposed method for lifelong in-context instruction learning, achieves noticeable improvements in both types of generalization, nearly reaching the upper-bound performance obtained through joint training.
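To make the sequential setup concrete, the following is a minimal Python sketch of the evaluation protocol the abstract describes, not the paper's DYNAINST implementation. All names here (Task, fine_tune, evaluate, lifelong_loop) are hypothetical stand-ins; the sketch only illustrates how tasks arrive one at a time and how instance- and task-level generalization would be measured after each update.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Task:
    name: str
    instruction: str              # in-context instruction defining the task
    train_examples: List[str]     # labeled instances used for fine-tuning
    test_examples: List[str]      # held-out instances of the same task

def lifelong_loop(
    fine_tune: Callable[[Task], None],
    evaluate: Callable[[List[Task]], float],
    task_stream: List[Task],
    unseen_tasks: List[Task],
) -> None:
    """Feed tasks to the PLM one at a time; after each update, report
    instance-level generalization (held-out instances of tasks seen so
    far) and task-level generalization (entirely unseen tasks)."""
    seen: List[Task] = []
    for task in task_stream:
        fine_tune(task)                    # update the PLM on this task only
        seen.append(task)
        instance_acc = evaluate(seen)      # instance-level generalization
        task_acc = evaluate(unseen_tasks)  # task-level generalization
        print(f"after {task.name}: instance={instance_acc:.3f}, task={task_acc:.3f}")

if __name__ == "__main__":
    # Placeholder callables so the protocol structure can be exercised.
    lifelong_loop(
        fine_tune=lambda t: None,
        evaluate=lambda tasks: 0.0,
        task_stream=[Task("t1", "classify the sentiment", [], [])],
        unseen_tasks=[],
    )
```

Joint training, by contrast, would fine-tune on all tasks at once; the abstract treats its performance as the upper bound that the sequential learner is measured against.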
CITATION STYLE
Mok, J., Do, J., Lee, S., Taghavi, T., Yu, S., & Yoon, S. (2023). Large-scale Lifelong Learning of In-context Instructions and How to Tackle It. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 12573–12589). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.703