PE-PUC: A graph based PU-learning approach for text classification

Shuang Yu; Chunping Li

Conference Proceedings

PE-PUC: A graph based PU-learning approach for text classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4571 LNAI 574-584

DOI: 10.1007/978-3-540-73499-4_43

12Citations

7Readers

Get full text

Abstract

This paper presents a novel solution for the problem of building text classifier using positive documents (P) and unlabeled documents (U). Here, the unlabeled documents are mixed with positive and negative documents. This problem is also called PU-Learning. The key feature of PU-Learning is that there is no negative document for training. Recently, several approaches have been proposed for solving this problem. Most of them are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. Generally speaking, these existing approaches do not perform well when the size of P is small. In this paper, we propose a new approach aiming at improving the system when the size of P is small. This approach combines the graph-based semi-supervised learning method with the two-step method. Experiments indicate that our proposed method performs well especially when the size of P is small. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Yu, S., & Li, C. (2007). PE-PUC: A graph based PU-learning approach for text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4571 LNAI, pp. 574–584). Springer Verlag. https://doi.org/10.1007/978-3-540-73499-4_43

PE-PUC: A graph based PU-learning approach for text classification

Abstract

Cite

Register to see more suggestions