It is widely accepted that tagging text with semantic information would improve the quality of lexical learning in corpus-based NLP methods. However available on-line taxonomies are rather entangled and introduce an unnecessary level of ambiguity. The noise produced by the redundant number of tags often overrides the advantage of semantic tagging. In this paper we propose an automatic method to select from WordNet a subset of domain-appropriate categories that effectively reduce the overambiguity of WordNet, and help at identifying and categorise relevant language patterns in a more compact way. The method is evaluated against a manually tagged corpus, SEMCOR.
CITATION STYLE
Cucchiarelli, A., & Velardi, P. (1997). Automatic selection of class labels from a thesaurus for an effective semantic tagging of corpora. In 5th Conference on Applied Natural Language Processing, ANLP 1997 - Proceedings (pp. 380–387). Association for Computational Linguistics (ACL). https://doi.org/10.3115/974557.974612
Mendeley helps you to discover research relevant for your work.