Feature selection method based on improved document frequency

7Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

Feature selection is an important part of the process of text classification, there is a direct impact on the quality of feature selection because of the evaluation function. Document frequency (DF) is one of several commonly methods used feature selection, its shortcomings is the lack of theoretical basis on function construction, itwill tend to select high-frequency words in selecting. To solve the problem, we put forward a improved algorithm named DFMcombined withclass distribution of characteristics and realize the algorithm with programming, DFM were compared with some feature selection method commonly used with experimental using support vector machine, as text classification .The results show that, when feature selection, the DFM methods performance is stable at work andis better than other methodsin classification results.

Cite

CITATION STYLE

APA

Zheng, W., & Feng, G. (2014). Feature selection method based on improved document frequency. Telkomnika (Telecommunication Computing Electronics and Control), 12(4), 905–910. https://doi.org/10.12928/TELKOMNIKA.v12i4.536

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free