Improved single-label text categorization by instance filtration

0Citations
Citations of this article
N/AReaders
Mendeley users who have this article in their library.

Abstract

Machine learning classifiers are widely used for text categorization however a classifier misclassifies some of the instances into a category that is relevant to their actual category. The categorization ability of a classifier can be improved by filtering dataset with better classifier and removing such category for misclassified instances. In this paper we proposed a two level approach where level-1 filters instances according to their likelihood in each category and reduce training dataset to top ranked 't' categories and their instances whereas level-2 classifier is used to classify instances with filtered training set. We employed Naïve Bayes, SVM and KNN as machine learning classifiers. Experimental evaluations on standard reuters-21578 and 20 Newsgroups datasets showed improved categorization effectiveness as measured by accuracy, precision, recall and f-measure protocols.

Cite

CITATION STYLE

APA

Khan, K. (2015). Improved single-label text categorization by instance filtration. In Proceedings of the 12th European, Mediterranean and Middle Eastern Conference on Information Systems, EMCIS 2015. European and Mediterranean Conference on Information Systems.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free