A survey on filter techniques for feature selection in text mining

15Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A large portion of a document is usually covered by irrelevant features. Instead of identifying actual context of the document, such features increase dimensions in the representation model and computational complexity of underlying algorithm, and hence adversely affect the performance. It necessitates a requirement of relevant feature selection in the given feature space. In this context, feature selection plays a key role in removing irrelevant features from the original feature space. Feature selection methods are broadly categorized into three groups: filter, wrapper, and embedded. Filter methods are widely used in text mining because of their simplicity, computational complexity, and efficiency. In this article, we provide a brief survey of filter feature selection methods along with some of the recent developments in this area. Keywords Text mining, Text categorization, Text clustering, Feature extraction, Feature selection, Filter methods

Cite

CITATION STYLE

APA

Bharti, K. K., & Singh, P. K. (2014). A survey on filter techniques for feature selection in text mining. In Advances in Intelligent Systems and Computing (Vol. 236, pp. 1545–1559). Springer Verlag. https://doi.org/10.1007/978-81-322-1602-5_154

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free