Abstract
In this paper, we describe our method for detecting lexical semantic change, i.e., changes in word sense over time. We examine semantic differences between specific words in two corpora drawn from different time periods, for English, German, Latin, and Swedish. Our method was created for SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection. We ranked 1st in Sub-task 1 (binary change detection) and 4th in Sub-task 2 (ranked change detection). The method is fully unsupervised and language-independent. It consists of preparing a semantic vector space for each corpus, earlier and later; computing a linear transformation between the earlier and later spaces, using Canonical Correlation Analysis and Orthogonal Transformation; and measuring the cosine similarity between the transformed vector of the target word from the earlier corpus and the vector of the same word in the later corpus.
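The alignment and comparison described above can be illustrated with a minimal sketch. It is not the authors' implementation: it covers only the orthogonal-transformation variant (solved here as an orthogonal Procrustes problem via SVD) and omits Canonical Correlation Analysis. It assumes the two embedding spaces are already trained and given as plain dictionaries from word to NumPy vector, and that all shared vocabulary words serve as anchor pairs. The function names `orthogonal_map`, `cosine`, and `change_scores` are hypothetical.

```python
import numpy as np

def orthogonal_map(X_early, X_late):
    """Orthogonal W minimizing ||X_early @ W - X_late||_F (Procrustes solution)."""
    # SVD of the cross-covariance matrix gives the optimal rotation U @ Vt.
    U, _, Vt = np.linalg.svd(X_early.T @ X_late)
    return U @ Vt

def cosine(u, v):
    """Cosine similarity between two 1-D vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def change_scores(early, late, targets):
    """Map each target word to the cosine between its aligned early vector
    and its late vector. Lower values suggest more semantic change.

    `early` and `late` are assumed to be dicts: word -> 1-D NumPy vector
    of equal dimensionality in both spaces (an assumption of this sketch).
    """
    # Assumption: every word present in both spaces is used as an anchor.
    shared = sorted(set(early) & set(late))
    X = np.stack([early[w] for w in shared])
    Y = np.stack([late[w] for w in shared])
    W = orthogonal_map(X, Y)
    return {w: cosine(early[w] @ W, late[w]) for w in targets}
```

Under these assumptions, a ranking of target words (as in Sub-task 2) could be obtained by sorting the scores in ascending order; a binary decision (as in Sub-task 1) would additionally need a threshold, which this sketch does not choose.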
Cite
Pražák, O., Přibáň, P., Taylor, S., & Sido, J. (2020). UWB at SemEval-2020 Task 1: Lexical Semantic Change Detection. In 14th International Workshops on Semantic Evaluation, SemEval 2020 - co-located 28th International Conference on Computational Linguistics, COLING 2020, Proceedings (pp. 246–254). International Committee for Computational Linguistics. https://doi.org/10.18653/v1/2020.semeval-1.30