Parameter Efficient Multi-task Fine-tuning by Learning to Transfer Token-wise Prompts

Abstract

Prompt tuning has proven successful on a variety of tasks by training only a small number of parameters while keeping large pre-trained language models (PLMs) frozen. However, it remains an open question how to generate suitable prompts for individual examples and how to extend prompt tuning to multi-task learning by leveraging cross-task features. To address these challenges, we propose token-wise prompt tuning (TPT), in which a bank of finer-grained soft prompt tokens is built for multi-task learning using a memory network. Tokens are retrieved from the bank for each input example and assembled into an instance-dependent prompt. Extensive experiments on 14 datasets show that models enhanced with TPT perform far better than fully fine-tuned models and achieve state-of-the-art results while tuning only 0.035% of the parameters.
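The retrieve-and-assemble step described in the abstract can be pictured with a small sketch. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the bank is queried, so the mean-pooled query, the scaled dot-product scoring, the hard top-k selection, and every name below (`PromptTokenBank`, `mean_pool`, `prepend_prompt`) are hypothetical choices made only to show the general idea.

```python
import math
import random


def mean_pool(embeddings):
    """Average a list of token vectors into one query vector (assumed query form)."""
    dim = len(embeddings[0])
    return [sum(vec[i] for vec in embeddings) / len(embeddings) for i in range(dim)]


class PromptTokenBank:
    """Hypothetical bank of soft prompt tokens shared across tasks.

    Each slot holds a key (used for matching) and a token (the soft prompt
    vector). In a real system both would be trainable; here they are random.
    """

    def __init__(self, num_slots, dim, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        self.keys = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_slots)]
        self.tokens = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_slots)]

    def scores(self, query):
        """Scaled dot-product similarity between the query and every key."""
        scale = math.sqrt(self.dim)
        return [sum(q * k for q, k in zip(query, key)) / scale for key in self.keys]

    def retrieve(self, query, prompt_len):
        """Return the prompt_len best-matching tokens and their slot indices."""
        scores = self.scores(query)
        ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
        chosen = ranked[:prompt_len]
        return [self.tokens[i] for i in chosen], chosen


def prepend_prompt(prompt_tokens, input_embeddings):
    """Assemble the instance-dependent prompt in front of the input embeddings."""
    return prompt_tokens + input_embeddings


# Toy usage: a 5-token example with 8-dimensional embeddings.
rng = random.Random(1)
example = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(5)]
bank = PromptTokenBank(num_slots=16, dim=8)
prompt, slot_ids = bank.retrieve(mean_pool(example), prompt_len=4)
model_input = prepend_prompt(prompt, example)
```

Because the bank is shared, two examples from different tasks can draw on overlapping slots, which is one plausible way cross-task features could be reused. A trainable version would likely replace the hard top-k selection with a differentiable read so that gradients reach the bank.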

Cite

Wu, M., Liu, W., Xu, J., Lv, C., Ling, Z. X., Li, T., … Huang, X. (2023). Parameter Efficient Multi-task Fine-tuning by Learning to Transfer Token-wise Prompts. In Findings of the Association for Computational Linguistics: EMNLP 2023 (pp. 8734–8746). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-emnlp.584
