Text Categorization is the process of assigning documents to a set of previously fixed categories. A lot of research is going on with the goal of automating this time-consuming task. Several different algorithms have been applied, and Support Vector Machines have shown very good results. In this paper we propose a new family of measures taken from the Machine Learning environment to apply them to feature reduction task. The experiments are performed on two different corpus (Reuters and Ohsumed). The results show that the new family of measures performs better than the traditional Information Theory measures. © Springer-Verlag Berlin Heidelberg 2003.
CITATION STYLE
Montañés, E., Fernández, J., Díaz, I., Combarro, E. F., & Ranilla, J. (2003). Measures of rule quality for feature selection in text categorization. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2810, 589–598. https://doi.org/10.1007/978-3-540-45231-7_54
Mendeley helps you to discover research relevant for your work.