How Does Fine-tuning Affect the Geometry of Embedding Space: A Case Study on Isotropy

Abstract

It is widely accepted that fine-tuning pre-trained language models usually brings about performance improvements in downstream tasks. However, there are limited studies on the reasons behind this effectiveness, particularly from the viewpoint of structural changes in the embedding space. To fill this gap, in this paper we analyze the extent to which the isotropy of the embedding space changes after fine-tuning. We demonstrate that, even though isotropy is a desirable geometrical property, fine-tuning does not necessarily result in isotropy enhancements. Moreover, local structures in pre-trained contextual word representations (CWRs), such as those encoding token types or frequency, undergo a massive change during fine-tuning. Our experiments show dramatic growth in the number of elongated directions in the embedding space, which, in contrast to pre-trained CWRs, carry the essential linguistic knowledge in the fine-tuned embedding space, making existing isotropy enhancement methods ineffective.
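
The isotropy discussed in the abstract is commonly quantified in this line of work with the eigenvector-based measure of Mu and Viswanath (2018): I(W) = min_c Z(c) / max_c Z(c), where Z(c) = sum_w exp(c . w) and c ranges over the eigenvectors of W^T W; values near 1 indicate a more isotropic space. The sketch below is a minimal illustration of that measure, not the authors' released code, and the mean-centering step is an added assumption.

import numpy as np

def isotropy_score(embeddings: np.ndarray) -> float:
    """Isotropy I(W) of an (n_tokens, dim) embedding matrix; 1.0 = perfectly isotropic."""
    # Mean-centering is an added preprocessing assumption, not part of the original measure.
    W = embeddings - embeddings.mean(axis=0, keepdims=True)
    # Candidate unit directions c are the eigenvectors of W^T W.
    _, eigvecs = np.linalg.eigh(W.T @ W)
    # Partition function Z(c) = sum_w exp(c . w), evaluated for every eigenvector at once.
    Z = np.exp(W @ eigvecs).sum(axis=0)
    return float(Z.min() / Z.max())

# Toy check: a spherical Gaussian cloud is nearly isotropic, while adding a strong
# common direction (as anisotropic CWR spaces exhibit) drives the score toward zero.
rng = np.random.default_rng(0)
isotropic = rng.normal(size=(1000, 64))
anisotropic = isotropic + 5.0 * rng.normal(size=(1000, 1))  # broadcast adds a shared direction
print(isotropy_score(isotropic))    # relatively high
print(isotropy_score(anisotropic))  # near zero
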

Cite

CITATION STYLE

APA

Rajaee, S., & Pilehvar, M. T. (2021). How Does Fine-tuning Affect the Geometry of Embedding Space: A Case Study on Isotropy. In Findings of the Association for Computational Linguistics: EMNLP 2021 (pp. 3042–3049). Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.findings-emnlp.261
