Variance-reduced First-order Meta-learning for Natural Language Processing Tasks

10 citations · 59 Mendeley readers

Abstract

First-order meta-learning algorithms have been widely used in practice to learn initial model parameters that can be quickly adapted to new tasks, owing to their efficiency and effectiveness. However, existing studies find that the meta-learner can overfit to specific task adaptations when the tasks are heterogeneous, leading to significantly degraded performance. In Natural Language Processing (NLP) applications, datasets are often diverse and each task has its own characteristics. Therefore, to address this overfitting issue when applying first-order meta-learning to NLP applications, we propose to reduce the variance of the gradient estimator used in task adaptation. To this end, we develop a variance-reduced first-order meta-learning algorithm. The core of our algorithm is a novel variance reduction term added to the gradient estimate during task adaptation. Experiments on two NLP applications, few-shot text classification and multi-domain dialog state tracking, demonstrate the superior performance of our proposed method.
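To make the core idea concrete, the following is a minimal, hypothetical sketch of what variance-reduced task adaptation inside a first-order meta-learning loop could look like. It assumes a Reptile-style outer update and an SVRG-style control variate as the variance reduction term; the paper's actual formulation may differ, and all function names, the toy regression task, and hyperparameters below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

# Toy setup: meta-learn an initialization for 1-D linear regression tasks.
# Hypothetical sketch only: Reptile-style outer loop whose inner
# (task-adaptation) steps use an SVRG-like control variate as the
# variance reduction term. The paper's exact correction may differ.

rng = np.random.default_rng(0)

def sample_task():
    """Each task is y = w * x + noise, with its own slope w."""
    w = rng.normal(loc=2.0, scale=1.0)
    def batch(n=8):
        x = rng.normal(size=n)
        y = w * x + 0.1 * rng.normal(size=n)
        return x, y
    return batch

def grad(theta, x, y):
    """Gradient of 0.5 * mean((theta * x - y)^2) with respect to theta."""
    return np.mean((theta * x - y) * x)

def adapt(theta0, batch, inner_steps=5, lr=0.1):
    """Task adaptation with an SVRG-style variance-reduced gradient."""
    # Anchor gradient computed once at the meta-initialization theta0.
    x_full, y_full = batch(n=64)
    g_anchor = grad(theta0, x_full, y_full)

    theta = theta0
    for _ in range(inner_steps):
        x, y = batch()
        # Variance-reduced estimate: stochastic gradient at theta,
        # corrected by its value at the anchor point theta0.
        g = grad(theta, x, y) - grad(theta0, x, y) + g_anchor
        theta -= lr * g
    return theta

# Reptile-style outer loop: move the initialization toward adapted parameters.
theta_meta = 0.0
meta_lr = 0.1
for step in range(200):
    batch = sample_task()
    theta_task = adapt(theta_meta, batch)
    theta_meta += meta_lr * (theta_task - theta_meta)

print(f"meta-learned initialization: {theta_meta:.3f} (mean task slope = 2.0)")
```

In this sketch, the correction term `grad(theta, x, y) - grad(theta0, x, y) + g_anchor` keeps the inner-loop gradient estimate unbiased while lowering its variance on the sampled mini-batch, which is the role the paper attributes to its variance reduction term during task adaptation.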

Citation (APA)

Wang, L., Huang, K., Ma, T., Gu, Q., & Huang, J. (2021). Variance-reduced First-order Meta-learning for Natural Language Processing Tasks. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 2609–2615). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.206
