Word Sense Induction (WSI) is the task of identifying the different senses (uses) of a target word in a given text. This paper focuses on the unsupervised estimation of the free parameters of a graph-based WSI method, and explores the use of eight Graph Connectivity Measures (GCM) that assess the degree of connectivity in a graph. Given a target word and a set of parameters, the GCM evaluate the connectivity of the produced clusters, which correspond to subgraphs of the initial (unclustered) graph. Each parameter setting is assigned a score according to one of the GCM, and the highest-scoring setting is then selected. Our evaluation on the nouns of the SemEval-2007 WSI task (SWSI) shows that: (1) every GCM estimates a set of parameters that significantly outperforms the worst-performing parameter setting in both SWSI evaluation schemes, (2) every GCM estimates a set of parameters that outperforms the Most Frequent Sense (MFS) baseline by a statistically significant amount in the supervised evaluation scheme, and (3) two of the measures estimate a set of parameters that performs comparably to a set of parameters estimated in a supervised manner.
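The selection procedure described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the eight measures, so edge density is used here only as a stand-in connectivity measure, and averaging the per-cluster scores is likewise an assumption. The function names (`edge_density`, `score_setting`, `select_setting`) and the setting labels `"p1"` and `"p2"` are hypothetical.

```python
def edge_density(nodes, edges):
    """Fraction of possible undirected edges present among `nodes`.

    Stand-in connectivity measure (assumption, not taken from the paper).
    """
    nodes = set(nodes)
    n = len(nodes)
    if n < 2:
        return 0.0
    # Keep only edges of the initial graph with both endpoints in the cluster.
    internal = {
        frozenset((u, v))
        for u, v in edges
        if u in nodes and v in nodes and u != v
    }
    return len(internal) / (n * (n - 1) / 2)


def score_setting(clusters, edges):
    """Score one parameter setting as the mean connectivity of its clusters.

    Averaging over clusters is an assumption made for this sketch.
    """
    if not clusters:
        return 0.0
    return sum(edge_density(c, edges) for c in clusters) / len(clusters)


def select_setting(clusterings, edges):
    """Return the parameter setting whose clustering scores highest."""
    return max(clusterings, key=lambda s: score_setting(clusterings[s], edges))


# Toy initial (unclustered) graph for one target word.
edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d"), ("d", "e")]

# Hypothetical parameter settings mapped to the clusters they produce.
clusterings = {
    "p1": [{"a", "b", "c"}, {"d", "e"}],
    "p2": [{"a", "b", "c", "d", "e"}],
}

best = select_setting(clusterings, edges)
print(best)  # p1
```

In this toy graph, setting `p1` splits the vertices into two fully connected clusters, so it scores higher than `p2`, which keeps everything in one sparsely connected cluster. No labelled data enters the selection, which is what makes the tuning unsupervised.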
Korkontzelos, I., Klapaftis, I., & Manandhar, S. (2009). Graph Connectivity Measures for Unsupervised Parameter Tuning of Graph-Based Sense Induction Systems. In NAACL HLT 2009 - Unsupervised and Minimally Supervised Learning of Lexical Semantics, Proceedings of the Workshop (pp. 36–44). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1641968.1641973