Abstract
This paper describes SChME (Semantic Change Detection with Model Ensemble), a method used in SemEval-2020 Task 1 on unsupervised detection of lexical semantic change. SChME uses a model ensemble combining signals of distributional models (word embeddings) and word frequency models where each model casts a vote indicating the probability that a word suffered semantic change according to that feature. More specifically, we combine cosine distance of word vectors combined with a neighborhood-based metric we named Mapped Neighborhood Distance (MAP), and a word frequency differential metric as input signals to our model. Additionally, we explore alignment-based methods to investigate the importance of the landmarks used in this process. Our results show evidence that the number of landmarks used for alignment has a direct impact on the predictive performance of the model. Moreover, we show that languages that suffer less semantic change tend to benefit from using a large number of landmarks, whereas languages with more semantic change benefit from a more careful choice of landmark number for alignment.
Cite
CITATION STYLE
Gruppi, M., Adalı, S., & Chen, P. Y. (2020). SChME at SemEval-2020 Task 1: A Model Ensemble for Detecting Lexical Semantic Change. In 14th International Workshops on Semantic Evaluation, SemEval 2020 - co-located 28th International Conference on Computational Linguistics, COLING 2020, Proceedings (pp. 105–111). International Committee for Computational Linguistics. https://doi.org/10.18653/v1/2020.semeval-1.11
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.