SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts

Joon Young Choi; Junho Kim; Jun Hyung Park; Wing Lam Mok; Sang Keun Lee

Conference Proceedings

EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (2023) 14306-14316

DOI: 10.18653/v1/2023.emnlp-main.884

Abstract

Prompt tuning has emerged as a successful parameter-efficient alternative to the full fine-tuning of language models. However, prior works on prompt tuning often utilize long soft prompts of up to 100 tokens to improve performance, overlooking the inefficiency associated with extended inputs. In this paper, we propose a novel prompt tuning method SMoP (Sparse Mixture-of-Prompts) that utilizes short soft prompts for efficient training and inference while maintaining performance gains typically induced from longer soft prompts. To achieve this, SMoP employs a gating mechanism to train multiple short soft prompts specialized in handling different subsets of the data, providing an alternative to relying on a single long soft prompt to cover the entire data. Experimental results demonstrate that SMoP outperforms baseline methods while reducing training and inference costs. We release our code at https://github.com/jyjohnchoi/SMoP.
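The gating idea in the abstract can be illustrated with a minimal sketch. It is not taken from the released code; it only assumes what the abstract states (several short soft prompts, with a gate choosing among them per input). The mean pooling of input embeddings, the single linear gate, the top-1 selection, the scaling of the chosen prompt by its gate probability, and all function and variable names are assumptions made for illustration.

```python
import math


def mean_pool(token_embeddings):
    """Average a list of equal-length token vectors into one vector."""
    dim = len(token_embeddings[0])
    count = len(token_embeddings)
    return [sum(tok[d] for tok in token_embeddings) / count for d in range(dim)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route(token_embeddings, gate_weights):
    """Pick one prompt index for this input (assumed top-1 routing).

    gate_weights has one row per soft prompt; each row has the embedding size.
    Returns (selected_index, gate_probabilities).
    """
    pooled = mean_pool(token_embeddings)
    scores = [sum(w * x for w, x in zip(row, pooled)) for row in gate_weights]
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return best, probs


def prepend_prompt(token_embeddings, prompts, gate_weights):
    """Prepend the selected short soft prompt to the input embeddings.

    Assumption: the chosen prompt is scaled by its gate probability, a common
    way to let the gate receive a training signal in sparse mixtures.
    """
    index, probs = route(token_embeddings, gate_weights)
    scaled = [[probs[index] * v for v in tok] for tok in prompts[index]]
    return index, scaled + token_embeddings
```

With k prompts of length l, each input is extended by only l tokens rather than k * l, which is the source of the efficiency argument: the total prompt capacity grows with k while the per-input sequence length does not.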

Citation (APA)

Choi, J. Y., Kim, J., Park, J. H., Mok, W. L., & Lee, S. K. (2023). SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 14306–14316). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.884
