Learning with unlabeled data for text categorization using bootstrapping and feature projection techniques

Youngjoong Ko; Jungyun Seo

Conference Proceedings

Learning with unlabeled data for text categorization using bootstrapping and feature projection techniques

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2004) 255-262

DOI: 10.3115/1218955.1218988

25Citations

101Readers

Get full text

Abstract

A wide range of supervised learning algorithms has been applied to Text Categorization. However, the supervised learning approaches have some problems. One of them is that they require a large, often prohibitive, number of labeled training documents for accurate learning. Generally, acquiring class labels for training data is costly, while gathering a large quantity of unlabeled data is cheap. We here propose a new automatic text categorization method for learning from only unlabeled data using a bootstrapping framework and a feature projection technique. From results of our experiments, our method showed reasonably comparable performance compared with a supervised method. If our method is used in a text categorization task, building text categorization systems will become significantly faster and less expensive.

Cite

CITATION STYLE

APA

Ko, Y., & Seo, J. (2004). Learning with unlabeled data for text categorization using bootstrapping and feature projection techniques. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 255–262). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1218955.1218988

Learning with unlabeled data for text categorization using bootstrapping and feature projection techniques

Abstract

Cite

Register to see more suggestions