Measures of rule quality for feature selection in text categorization

Elena Montañés; Javier Fernández; Irene Díaz; Elías F. Combarro; José Ranilla

Journal Article

Measures of rule quality for feature selection in text categorization

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2810 589-598

DOI: 10.1007/978-3-540-45231-7_54

12Citations

10Readers

Get full text

Abstract

Text Categorization is the process of assigning documents to a set of previously fixed categories. A lot of research is going on with the goal of automating this time-consuming task. Several different algorithms have been applied, and Support Vector Machines have shown very good results. In this paper we propose a new family of measures taken from the Machine Learning environment to apply them to feature reduction task. The experiments are performed on two different corpus (Reuters and Ohsumed). The results show that the new family of measures performs better than the traditional Information Theory measures. © Springer-Verlag Berlin Heidelberg 2003.

Cite

CITATION STYLE

APA

Montañés, E., Fernández, J., Díaz, I., Combarro, E. F., & Ranilla, J. (2003). Measures of rule quality for feature selection in text categorization. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2810, 589–598. https://doi.org/10.1007/978-3-540-45231-7_54

Measures of rule quality for feature selection in text categorization

Abstract

Cite

Register to see more suggestions