LEXFIT: Lexical fine-tuning of pretrained language models

Abstract

Transformer-based language models (LMs) pretrained on large text collections implicitly store a wealth of lexical semantic knowledge, but it is non-trivial to extract that knowledge effectively from their parameters. Inspired by prior work on semantic specialization of static word embedding (WE) models, we show that it is possible to expose and enrich lexical knowledge from the LMs, that is, to specialize them to serve as effective and universal “decontextualized” word encoders even when fed input words “in isolation” (i.e., without any context). Their transformation into such word encoders is achieved through a simple and efficient lexical fine-tuning procedure (termed LEXFIT) based on dual-encoder network structures. Further, we show that LEXFIT can yield effective word encoders even with limited lexical supervision and, via cross-lingual transfer, in different languages without any readily available external knowledge. Our evaluation over four established, structurally different lexical-level tasks in 8 languages indicates the superiority of LEXFIT-based WEs over standard static WEs (e.g., fastText) and WEs from vanilla LMs. Other extensive experiments and ablation studies further profile the LEXFIT framework, and indicate best practices and performance variations across LEXFIT variants, languages, and lexical tasks, also directly questioning the usefulness of traditional WE models in the era of large neural models.
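The dual-encoder idea described above can be illustrated with a toy sketch. This is not the paper's actual architecture (LEXFIT fine-tunes pretrained Transformer LMs); the hash-based `toy_encode` function below is a hypothetical stand-in for feeding a single word, without any context, through an LM and pooling its subword representations. The margin-based objective mirrors the general shape of contrastive lexical fine-tuning: pull word pairs linked by lexical supervision (e.g., synonyms) together and push unrelated pairs apart.

```python
import math
import random

DIM = 16  # toy embedding dimensionality

def toy_encode(word):
    # Deterministic toy "encoder": a stand-in for encoding a word
    # "in isolation" with a pretrained LM and pooling its states.
    rng = random.Random(word)
    return [rng.uniform(-1.0, 1.0) for _ in range(DIM)]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def margin_loss(anchor, positive, negative, margin=0.5):
    # Contrastive sketch of the dual-encoder objective: the similarity
    # of a supervised (synonym) pair should exceed that of an unrelated
    # pair by at least `margin`; otherwise a positive loss is incurred.
    pos = cosine(toy_encode(anchor), toy_encode(positive))
    neg = cosine(toy_encode(anchor), toy_encode(negative))
    return max(0.0, margin - pos + neg)

loss = margin_loss("car", "automobile", "banana")
```

In the actual framework, gradients from such a loss would update the LM's parameters so that its decontextualized word encodings align with the lexical supervision; the toy encoder here is frozen and serves only to show the scoring and loss computation.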

Citation (APA)

Vulić, I., Ponti, E. M., Korhonen, A., & Glavaš, G. (2021). LEXFIT: Lexical fine-tuning of pretrained language models. In ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 5269–5283). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.acl-long.410
