Abstract
This paper introduces a novel approach for improving the reliability of ICA-based text classifiers, attempting to make the most of the independent components of the text data. In this framework, two issues are adressed: firstly, a relative relevance measure for category assignment is presented. And secondly, a reliability control process is included in the classifier, avoiding the classification of documents belonging to none of the categories defined during the training stage. The experiments have been conducted on a journalistic-style text corpus in Catalan, achieving encouraging results in terms of rejection accuracy. However, similar results are obtained when comparing the proposed relevance measure to the classic magnitude-based technique for category assignment.
Cite
CITATION STYLE
Sevillano, X., Alías, F., & Socoró, J. C. (2004). Reliability in ICA-based text classification. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3195, 1213–1220. https://doi.org/10.1007/978-3-540-30110-3_153
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.