A survey on filter techniques for feature selection in text mining

Kusum Kumari Bharti; Pramod Kumar Singh

Conference Proceedings

A survey on filter techniques for feature selection in text mining

Advances in Intelligent Systems and Computing (2014) 236 1545-1559

DOI: 10.1007/978-81-322-1602-5_154

15Citations

24Readers

Get full text

Abstract

A large portion of a document is usually covered by irrelevant features. Instead of identifying actual context of the document, such features increase dimensions in the representation model and computational complexity of underlying algorithm, and hence adversely affect the performance. It necessitates a requirement of relevant feature selection in the given feature space. In this context, feature selection plays a key role in removing irrelevant features from the original feature space. Feature selection methods are broadly categorized into three groups: filter, wrapper, and embedded. Filter methods are widely used in text mining because of their simplicity, computational complexity, and efficiency. In this article, we provide a brief survey of filter feature selection methods along with some of the recent developments in this area. Keywords Text mining, Text categorization, Text clustering, Feature extraction, Feature selection, Filter methods

Author supplied keywords

Cite

CITATION STYLE

APA

Bharti, K. K., & Singh, P. K. (2014). A survey on filter techniques for feature selection in text mining. In Advances in Intelligent Systems and Computing (Vol. 236, pp. 1545–1559). Springer Verlag. https://doi.org/10.1007/978-81-322-1602-5_154

A survey on filter techniques for feature selection in text mining

Abstract

Author supplied keywords

Cite

Register to see more suggestions