Abstract
Prompt tuning has proven successful on various tasks by adding a small number of trainable parameters while keeping large pre-trained language models (PLMs) frozen. However, two questions remain open: how to generate suitable prompts for individual examples, and how to extend prompt tuning to multi-task learning by leveraging cross-task features. To address these challenges, we propose token-wise prompt tuning (TPT), in which a memory network builds a bank of finer-grained soft prompt tokens for multi-task learning. For each input example, tokens are retrieved from the bank and assembled into an instance-dependent prompt. Extensive experimental results on 14 datasets demonstrate that models enhanced by TPT perform far better than fully fine-tuned models and achieve state-of-the-art results while tuning only 0.035% of the parameters.
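The retrieve-and-assemble step described above can be illustrated with a minimal sketch. The abstract does not specify how tokens are scored or combined, so everything here is an assumption made for illustration: the key/value layout of the bank, the dot-product scoring, the top-k selection, the softmax weighting, and the names `build_bank` and `retrieve_prompt` are hypothetical and are not taken from the paper.

```python
import math
import random


def build_bank(num_tokens, dim, seed=0):
    """Create a hypothetical bank of soft prompt tokens as (keys, values).

    Assumption: each token has a key used for matching and a value used
    in the prompt. In a real system these would be trained parameters.
    """
    rng = random.Random(seed)
    keys = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_tokens)]
    values = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_tokens)]
    return keys, values


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def retrieve_prompt(query, keys, values, prompt_len):
    """Assemble an instance-dependent prompt for one input example.

    Scores every bank token against the query (assumed: dot product),
    keeps the prompt_len best-matching tokens, and scales each selected
    value vector by its softmax weight over the selected tokens.
    Returns (indices, weights, prompt).
    """
    scores = [dot(query, k) for k in keys]
    top = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:prompt_len]
    max_score = max(scores[i] for i in top)  # subtract for numerical stability
    exps = [math.exp(scores[i] - max_score) for i in top]
    total = sum(exps)
    weights = [e / total for e in exps]
    prompt = [[w * v for v in values[i]] for i, w in zip(top, weights)]
    return top, weights, prompt


if __name__ == "__main__":
    keys, values = build_bank(num_tokens=16, dim=4)
    query = [0.5, -1.0, 0.25, 2.0]  # stand-in for an encoded input example
    indices, weights, prompt = retrieve_prompt(query, keys, values, prompt_len=5)
    print("selected bank tokens:", indices)
    print("prompt length:", len(prompt))
```

In this sketch different inputs select different bank tokens, which is what makes the prompt instance-dependent; sharing one bank across tasks is one plausible way cross-task features could be reused, though the abstract does not describe the actual mechanism.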
Cite
Wu, M., Liu, W., Xu, J., Lv, C., Ling, Z. X., Li, T., … Huang, X. (2023). Parameter Efficient Multi-task Fine-tuning by Learning to Transfer Token-wise Prompts. In Findings of the Association for Computational Linguistics: EMNLP 2023 (pp. 8734–8746). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-emnlp.584