A Filter Based Feature Selection for Imbalanced Text Classification

K. Swarnalatha; D. S. Guru; Basavaraj S. Anami; N. Vinay Kumar

Conference Proceedings

A Filter Based Feature Selection for Imbalanced Text Classification

Communications in Computer and Information Science (2019) 1037 194-205

DOI: 10.1007/978-981-13-9187-3_18

1Citations

1Readers

Get full text

Abstract

In this work, a text classification method through a filter type feature selection for imbalanced data is addressed. The model initially clusters the documents associated with a class through a hierarchical clustering there by accomplishing a balanced or near balanced class. Later, a filter type feature selection is recommended to choose the most discriminative features for text classification. Subsequently, the documents are stored in the form of interval valued data. For classification purpose, a suitable symbolic classifier is recommended. The experimentation is done with two standard benchmarking datasets viz., Reuters 21578 and TDT2. The experimental results obtained from the proposed model are better in terms of f-measure when compared to the available models.

Author supplied keywords

Cite

CITATION STYLE

APA

Swarnalatha, K., Guru, D. S., Anami, B. S., & Kumar, N. V. (2019). A Filter Based Feature Selection for Imbalanced Text Classification. In Communications in Computer and Information Science (Vol. 1037, pp. 194–205). Springer Verlag. https://doi.org/10.1007/978-981-13-9187-3_18

A Filter Based Feature Selection for Imbalanced Text Classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions