Improving text classification performance with incremental background knowledge

Catarina Silva; Bernardete Ribeiro

Conference Proceedings

Improving text classification performance with incremental background knowledge

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5768 LNCS(PART 1) 923-931

DOI: 10.1007/978-3-642-04274-4_95

4Citations

4Readers

Get full text

Abstract

Text classification is generally the process of extracting interesting and non-trivial information and knowledge from text. One of the main problems with text classification systems is the lack of labeled data, as well as the cost of labeling unlabeled data. Thus, there is a growing interest in exploring the use of unlabeled data as a way to improve classification performance in text classification. The ready availability of this kind of data in most applications makes it an appealing source of information. In this work we propose an Incremental Background Knowledge (IBK) technique to introduce unlabeled data into the training set by expanding it using initial classifiers to deliver oracle decisions. The defined incremental SVM margin-based method was tested in the Reuters-21578 benchmark showing promising results. © 2009 Springer Berlin Heidelberg.

Cite

CITATION STYLE

APA

Silva, C., & Ribeiro, B. (2009). Improving text classification performance with incremental background knowledge. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5768 LNCS, pp. 923–931). https://doi.org/10.1007/978-3-642-04274-4_95

Improving text classification performance with incremental background knowledge

Abstract

Cite

Register to see more suggestions