Semantic based category-keywords list enrichment for document classification

Upasana Pandey; S. Chakraverty; Richa Mihani; Ruchika Arya; Sonali Rathee; Richa K. Sharma

Conference Proceedings

Semantic based category-keywords list enrichment for document classification

Advances in Intelligent and Soft Computing (2012) 166 AISC(VOL. 1) 297-309

DOI: 10.1007/978-3-642-30157-5_30

0Citations

2Readers

Get full text

Abstract

In this paper we present a text categorization technique that extracts semantic features of documents to generate a compact set of keywords and uses the information obtained from those keywords to perform text classification. The algorithm reduces the dimensionality of the document representation using overlapping semantics. Later, a keyword-category relationship matrix computes the extent of membership of the documents for various input predefined categories. The category of the document is then derived from membership metrics. Also, Wikipedia is used for the purpose of category lists enrichment. The proposed work has shown a new direction towards document classification for web applications. © 2012 Springer-Verlag GmbH.

Author supplied keywords

Cite

CITATION STYLE

APA

Pandey, U., Chakraverty, S., Mihani, R., Arya, R., Rathee, S., & Sharma, R. K. (2012). Semantic based category-keywords list enrichment for document classification. In Advances in Intelligent and Soft Computing (Vol. 166 AISC, pp. 297–309). https://doi.org/10.1007/978-3-642-30157-5_30

Semantic based category-keywords list enrichment for document classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions