A large portion of a document is usually covered by irrelevant features. Instead of identifying actual context of the document, such features increase dimensions in the representation model and computational complexity of underlying algorithm, and hence adversely affect the performance. It necessitates a requirement of relevant feature selection in the given feature space. In this context, feature selection plays a key role in removing irrelevant features from the original feature space. Feature selection methods are broadly categorized into three groups: filter, wrapper, and embedded. Filter methods are widely used in text mining because of their simplicity, computational complexity, and efficiency. In this article, we provide a brief survey of filter feature selection methods along with some of the recent developments in this area. Keywords Text mining, Text categorization, Text clustering, Feature extraction, Feature selection, Filter methods
CITATION STYLE
Bharti, K. K., & Singh, P. K. (2014). A survey on filter techniques for feature selection in text mining. In Advances in Intelligent Systems and Computing (Vol. 236, pp. 1545–1559). Springer Verlag. https://doi.org/10.1007/978-81-322-1602-5_154
Mendeley helps you to discover research relevant for your work.