A method of dimensionality reduction by selection of components in principal component analysis for text classification

Yangwu Zhang; Guohe Li; Heng Zong

Journal ArticleOPEN ACCESS

A method of dimensionality reduction by selection of components in principal component analysis for text classification

Filomat (2018) 32(5) 1499-1506

DOI: 10.2298/FIL1805499Z

6Citations

9Readers

Abstract

Dimensionality reduction, including feature extraction and selection, is one of the key points for text classification. In this paper, we propose a mixed method of dimensionality reduction constructed by principal components analysis and the selection of components. Principal components analysis is a method of feature extraction. Not all of the components in principal component analysis contribute to classification, because PCA objective is not a form of discriminant analysis (see, e.g. Jolliffe, 2002). In this context, we present a function of components selection, which returns the useful components for classification by the indicators of the performances on the different subsets of the components. Compared to traditional methods of feature selection, SVM classifiers trained on selected components show improved classification performance and a reduction in computational overhead.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhang, Y., Li, G., & Zong, H. (2018). A method of dimensionality reduction by selection of components in principal component analysis for text classification. Filomat, 32(5), 1499–1506. https://doi.org/10.2298/FIL1805499Z

A method of dimensionality reduction by selection of components in principal component analysis for text classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions