We address the problem of unknown word sense detection: the identification of corpus occurrences that are not covered by a given sense inventory. We model this as an instance of outlier detection, using a simple nearest neighbor-based approach to measuring the resemblance of a new item to a training set. In combination with a method that alleviates data sparseness by sharing training data across lemmas, the approach achieves a precision of 0.77 and recall of 0.82. © 2006 Association for Computational Linguistics.
CITATION STYLE
Erk, K. (2006). Unknown word sense detection as outlier detection. In HLT-NAACL 2006 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings of the Main Conference (pp. 128–135). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220835.1220852
Mendeley helps you to discover research relevant for your work.