We present a generic approach for semantic based classification of text documents to pre-defined categories. The proposed technique is applied to the domain of patent analytics for the purpose of classifying a collection of patent documents to one or many nodes in a user-defined taxonomy. The proposed approach is a multi-step process consisting of noun extraction, word sense disambiguation, semantic relatedness computation between pair of words using WordNet and confidence score computation. The proposed algorithm resulted in good accuracy on experimental dataset and can be easily adapted and customized to other domains other the patent landscape analysis domain discussed in this paper. © 2009 Springer.
CITATION STYLE
Sureka, A., Mirajkar, P. P., Teli, P. N., Agarwal, G., & Bose, S. K. (2009). Semantic based text classification of patent documents to a user-defined taxonomy. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5678 LNAI, pp. 644–651). https://doi.org/10.1007/978-3-642-03348-3_67
Mendeley helps you to discover research relevant for your work.