K-means and wordnet based feature selection combined with extreme learning machines for text classification

Rajendra Kumar Roul; Sanjay Kumar Sahay

Conference Proceedings

K-means and wordnet based feature selection combined with extreme learning machines for text classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9581 103-112

DOI: 10.1007/978-3-319-28034-9_13

6Citations

7Readers

Get full text

Abstract

The incredible increase of online documents in digital form on the Web, has renewed the interest in text classification. The aim of text classification is to classify text documents into a set of pre-defined categories. But the poor quality of features selection, extremely high dimensional feature space and complexity of natural languages become the roadblock for this classification process. To address these issues, here we propose a k-means clustering based feature selection for text classification. Bi-Normal Separation (BNS) combine with Wordnet and cosine-similarity helps to form a quality and reduce feature vector to train the Extreme Learning Machine (ELM) and Multi-layer Extreme Learning Machine (ML-ELM) classifiers. For experimental purpose, 20-Newsgroups and DMOZ datasets have been used. The empirical results on these two bench- mark datasets demonstrate the applicability, efficiency and effectiveness of our approach using ELM and ML-ELM as the classifiers over state-of-the-art classifiers.

Author supplied keywords

Cite

CITATION STYLE

APA

Roul, R. K., & Sahay, S. K. (2016). K-means and wordnet based feature selection combined with extreme learning machines for text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9581, pp. 103–112). Springer Verlag. https://doi.org/10.1007/978-3-319-28034-9_13

K-means and wordnet based feature selection combined with extreme learning machines for text classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions