In this paper we present a text categorization technique that extracts semantic features of documents to generate a compact set of keywords and uses the information obtained from those keywords to perform text classification. The algorithm reduces the dimensionality of the document representation using overlapping semantics. Later, a keyword-category relationship matrix computes the extent of membership of the documents for various input predefined categories. The category of the document is then derived from membership metrics. Also, Wikipedia is used for the purpose of category lists enrichment. The proposed work has shown a new direction towards document classification for web applications. © 2012 Springer-Verlag GmbH.
CITATION STYLE
Pandey, U., Chakraverty, S., Mihani, R., Arya, R., Rathee, S., & Sharma, R. K. (2012). Semantic based category-keywords list enrichment for document classification. In Advances in Intelligent and Soft Computing (Vol. 166 AISC, pp. 297–309). https://doi.org/10.1007/978-3-642-30157-5_30
Mendeley helps you to discover research relevant for your work.