Semantic based text classification of patent documents to a user-defined taxonomy

Ashish Sureka; Pranav Prabhakar Mirajkar; Prasanna Nagesh Teli; Girish Agarwal; Sumit Kumar Bose

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5678 LNAI 644-651

DOI: 10.1007/978-3-642-03348-3_67

Abstract

We present a generic approach for semantic based classification of text documents to pre-defined categories. The proposed technique is applied to the domain of patent analytics for the purpose of classifying a collection of patent documents to one or more nodes in a user-defined taxonomy. The proposed approach is a multi-step process consisting of noun extraction, word sense disambiguation, semantic relatedness computation between pairs of words using WordNet, and confidence score computation. The proposed algorithm achieved good accuracy on an experimental dataset and can be easily adapted and customized to domains other than the patent landscape analysis domain discussed in this paper. © 2009 Springer.
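
The steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the tiny hypernym graph below is hypothetical data standing in for WordNet, noun extraction is reduced to a lexicon lookup, word sense disambiguation is skipped (each word has a single sense here), and the path-based relatedness measure, the averaging rule for the confidence score, and the 0.5 threshold are all assumptions chosen for illustration.

```python
# Toy hypernym graph standing in for WordNet (hypothetical data, child -> parent).
HYPERNYM = {
    "battery": "device",
    "electrode": "device",
    "sensor": "device",
    "device": "artifact",
    "polymer": "substance",
    "solvent": "substance",
    "substance": "matter",
    "artifact": "entity",
    "matter": "entity",
}


def ancestors(word):
    """Return the chain [word, parent, grandparent, ...] up to the root."""
    chain = [word]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain


def relatedness(a, b):
    """Path-based score 1 / (1 + edges between a and b); 0.0 if unconnected."""
    chain_a, chain_b = ancestors(a), ancestors(b)
    depth_b = {w: i for i, w in enumerate(chain_b)}
    for i, w in enumerate(chain_a):
        if w in depth_b:  # first shared ancestor on the way up from a
            return 1.0 / (1 + i + depth_b[w])
    return 0.0


def extract_nouns(text):
    """Stand-in for noun extraction: keep tokens known to the toy lexicon."""
    known = set(HYPERNYM) | set(HYPERNYM.values())
    tokens = [t.strip(".,;:()").lower() for t in text.split()]
    return [t for t in tokens if t in known]


def confidence(doc_nouns, node_terms):
    """Mean, over document nouns, of the best relatedness to any node term."""
    if not doc_nouns or not node_terms:
        return 0.0
    best = [max(relatedness(n, t) for t in node_terms) for n in doc_nouns]
    return sum(best) / len(best)


def classify(text, taxonomy, threshold=0.5):
    """Return every taxonomy node whose confidence score meets the threshold."""
    nouns = extract_nouns(text)
    scores = {node: confidence(nouns, terms) for node, terms in taxonomy.items()}
    return {node: s for node, s in scores.items() if s >= threshold}


# Hypothetical user-defined taxonomy: node name -> descriptive terms.
TAXONOMY = {
    "energy storage": ["battery", "electrode"],
    "materials": ["polymer", "solvent"],
}

result = classify("A battery with a coated electrode and a sensor.", TAXONOMY)
```

Because `classify` returns every node above the threshold, a document can be assigned to one or several nodes, which mirrors the multi-label setting the abstract describes. A fuller version would replace the toy graph with real WordNet lookups and add a disambiguation step before relatedness is computed.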

Citation (APA)

Sureka, A., Mirajkar, P. P., Teli, P. N., Agarwal, G., & Bose, S. K. (2009). Semantic based text classification of patent documents to a user-defined taxonomy. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5678 LNAI, pp. 644–651). https://doi.org/10.1007/978-3-642-03348-3_67
