Improved Feature Weight Algorithm and Its Application to Text Classification


Abstract

Text preprocessing is a key problem in pattern recognition and plays an important role in text classification. It has two pivotal steps: feature selection and feature weighting. Because the preprocessing results directly affect a classifier's accuracy and performance, choosing appropriate algorithms for feature selection and feature weighting can greatly improve classifier performance. Based on Gini Index theory, this paper proposes an Improved Gini Index algorithm that constructs a new feature selection and feature weighting function. The experimental results show that this algorithm effectively improves classifier performance. The algorithm is also applied to a sensitive information identification system, where it achieves higher precision and recall than traditional algorithms and effectively identifies sensitive information on the Internet.
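The abstract does not give the improved function itself, so the following is only a minimal sketch of the general idea behind Gini-based feature scoring, not the paper's Improved Gini Index algorithm. It assumes the textbook purity form, score(t) = sum over classes c of P(c | t)^2, estimated from document frequencies. The function name `gini_term_scores` and the toy corpus are hypothetical and chosen for illustration.

```python
from collections import Counter, defaultdict


def gini_term_scores(docs, labels):
    """Score each term by sum_c P(c | t)^2, estimated from document counts.

    This is the generic Gini purity measure (an assumption for illustration),
    not the improved function proposed in the paper.
    """
    # term -> Counter mapping class label to number of documents containing the term
    term_class_df = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        for term in set(tokens):  # count each term once per document
            term_class_df[term][label] += 1

    scores = {}
    for term, per_class in term_class_df.items():
        total = sum(per_class.values())
        # 1.0 means the term occurs in one class only; lower means it is spread out
        scores[term] = sum((n / total) ** 2 for n in per_class.values())
    return scores


# Hypothetical toy corpus: tokenized documents with class labels.
docs = [
    ["cheap", "pills", "offer"],
    ["meeting", "agenda", "offer"],
    ["cheap", "offer", "now"],
    ["agenda", "notes"],
]
labels = ["spam", "ham", "spam", "ham"]

scores = gini_term_scores(docs, labels)
for term in sorted(scores, key=scores.get, reverse=True):
    print(f"{term}: {scores[term]:.3f}")
```

In this sketch a term that appears in only one class scores 1.0, while a term spread across classes (here "offer") scores lower. Terms seen in a single document also score 1.0, which is one reason a plain purity score is usually combined with frequency information before it is used for selection or weighting.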


Citation (APA)

Shang, S., Shi, M., Shang, W., & Hong, Z. (2016). Improved Feature Weight Algorithm and Its Application to Text Classification. Mathematical Problems in Engineering, 2016. https://doi.org/10.1155/2016/7819626
