A novel feature selection technique for text classification

Abstract

In this paper, a new feature selection technique called Term-Class Weight-Inverse-Class Frequency is proposed for text classification. The technique selects the most discriminating features with respect to each class. Consequently, the number of features it selects is always a multiple of the number of classes present in the collection. Document vectors are built using varying numbers of selected features. The effectiveness of the technique is demonstrated through a series of experiments on two benchmark text corpora, Reuters-21578 and TDT2, using a KNN classifier. In addition, a comparative analysis against state-of-the-art techniques on the same datasets indicates that the proposed technique outperforms several of them.
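The per-class selection described in the abstract can be illustrated with a minimal sketch. The abstract does not give the Term-Class Weight-Inverse-Class Frequency formula, so the score used here (the share of a term's occurrences that fall in a class) is only a stand-in, and the function names `class_term_scores`, `select_features`, and `vectorize` are hypothetical. The sketch also assumes that a term already chosen for one class is skipped for the others, so that the feature count comes out as exactly m times the number of classes.

```python
from collections import Counter, defaultdict


def class_term_scores(docs, labels):
    """Stand-in score, NOT the paper's formula: the fraction of a term's
    total occurrences that fall inside each class."""
    per_class = defaultdict(Counter)
    total = Counter()
    for tokens, label in zip(docs, labels):
        per_class[label].update(tokens)
        total.update(tokens)
    return {
        c: {t: n / total[t] for t, n in counts.items()}
        for c, counts in per_class.items()
    }


def select_features(scores, m):
    """Pick m terms per class in round-robin order (assumption: terms
    already taken are skipped), giving m * n_classes features when each
    class has enough candidate terms."""
    # Rank terms per class by descending score; ties broken alphabetically.
    ranked = {c: sorted(s, key=lambda t: (-s[t], t)) for c, s in scores.items()}
    selected, chosen = [], set()
    for _ in range(m):
        for c in sorted(ranked):
            for t in ranked[c]:
                if t not in chosen:
                    chosen.add(t)
                    selected.append(t)
                    break
    return selected


def vectorize(tokens, features):
    """Represent a document as term counts over the selected features only."""
    counts = Counter(tokens)
    return [counts[t] for t in features]


# Toy corpus (made up for illustration): two classes, two documents each.
docs = [
    ["oil", "price", "oil"],
    ["oil", "export", "price"],
    ["goal", "team", "price"],
    ["team", "goal", "coach"],
]
labels = ["energy", "energy", "sport", "sport"]

scores = class_term_scores(docs, labels)
features = select_features(scores, m=2)  # 2 terms per class, 2 classes
vector = vectorize(["oil", "oil", "goal", "price"], features)
```

With two classes and m = 2, `features` holds four terms, and the shared term "price" is not selected because it scores lower than the class-specific terms. The resulting vectors could then be passed to any KNN implementation.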

Guru, D. S., Ali, M., & Suhil, M. (2019). A novel feature selection technique for text classification. In Advances in Intelligent Systems and Computing (Vol. 813, pp. 721–733). Springer Verlag. https://doi.org/10.1007/978-981-13-1498-8_63
