A knowledge-based semantic kernel for text classification

Jamal Abdul Nasir; Asim Karim; George Tsatsaronis; Iraklis Varlamis

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 7024 LNCS 261-266

DOI: 10.1007/978-3-642-24583-1_25

Abstract

In text classification, documents are typically represented in a vector space using the "Bag of Words" (BOW) approach. Despite its ease of use, the BOW representation cannot handle synonymy and polysemy, and it ignores semantic relatedness between words. In this paper, we overcome these shortcomings of the BOW approach by embedding a known WordNet-based measure of semantic relatedness between pairs of words, namely Omiotis, into a semantic kernel. The suggested measure incorporates the TF-IDF weighting scheme, thus creating a semantic kernel that combines semantic and statistical information from text. Empirical evaluation on real data sets demonstrates that our approach achieves improved classification accuracy over the standard BOW representation when Omiotis is embedded in four different classifiers. © 2011 Springer-Verlag.
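The general idea of a semantic kernel can be illustrated with a minimal sketch. It assumes the common form k(d_i, d_j) = d_i^T S d_j, where d_i is a TF-IDF-weighted term vector and S is a term-term relatedness matrix; the abstract does not give the exact formulation used in the paper, so this form is an assumption. The vocabulary, document vectors, and relatedness values below are made up for illustration and are not Omiotis scores.

```python
import numpy as np

# Toy vocabulary and TF-IDF-style document vectors (illustrative values only).
vocab = ["car", "automobile", "banana"]
docs = np.array([
    [1.0, 0.0, 0.0],  # d0 mentions only "car"
    [0.0, 1.0, 0.0],  # d1 mentions only "automobile"
    [0.0, 0.0, 1.0],  # d2 mentions only "banana"
])

# Hypothetical symmetric term-term relatedness matrix.
# These numbers are invented; they are NOT real Omiotis scores.
S = np.array([
    [1.0, 0.9, 0.0],
    [0.9, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])


def bow_kernel(X):
    """Plain BOW kernel: dot products between document vectors."""
    return X @ X.T


def semantic_kernel(X, S):
    """Assumed semantic kernel: dot products smoothed by term relatedness."""
    return X @ S @ X.T


def normalize(K):
    """Cosine-style normalization so every document has self-similarity 1."""
    d = np.sqrt(np.diag(K))
    return K / np.outer(d, d)


K_bow = normalize(bow_kernel(docs))
K_sem = normalize(semantic_kernel(docs, S))

print("BOW similarity (car vs automobile):     ", K_bow[0, 1])
print("Semantic similarity (car vs automobile):", K_sem[0, 1])
```

Under BOW, the "car" and "automobile" documents share no terms and get zero similarity; with the relatedness matrix they become similar, while the unrelated "banana" document stays at zero. For the result to be a valid kernel, S should be positive semidefinite, which holds for this toy matrix but has to be checked or enforced for a real relatedness measure. A Gram matrix of this kind can be handed to any classifier that accepts precomputed kernels.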

Cite

Nasir, J. A., Karim, A., Tsatsaronis, G., & Varlamis, I. (2011). A knowledge-based semantic kernel for text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7024 LNCS, pp. 261–266). https://doi.org/10.1007/978-3-642-24583-1_25
