We study classification when most of the data is unlabeled and only a small fraction is labeled: the semi-supervised learning setting. Blum and Mitchell's co-training [1] is a popular semi-supervised algorithm for situations with multiple independent views of the entities to be classified. Web-page classification is one such multi-view situation: one view describes a page by the words that occur on it, another by the words in the hyperlinks that point to it. In co-training, two learners each form a model from the labeled data and then incrementally label small subsets of the unlabeled data for each other. Each learner then re-estimates its model from the labeled data and the pseudo-labels provided by the other learner. Although some analysis of the algorithm's performance exists [1], the computation it performs is still not well understood. We propose that each view in co-training is effectively performing incremental EM, as postulated by Neal and Hinton [3], combined with a Bayesian classifier. This analysis suggests improvements over the core co-training algorithm. We introduce variations that converge to the maximum attainable classification accuracy faster than the core algorithm, thereby increasing learning efficiency. We empirically verify our claims on a number of data sets in the context of belief network learning. © Springer-Verlag Berlin Heidelberg 2004.
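The co-training loop described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: a toy nearest-centroid classifier stands in for the Bayesian learners, each example has two scalar "views", and in each round each view pseudo-labels its single most confident unlabeled example for the other; the data, pool sizes, and round count are illustrative assumptions.

```python
# Hedged sketch of Blum and Mitchell-style co-training with a toy
# nearest-centroid classifier per view (illustrative, not the paper's code).

def centroid_fit(points, labels):
    """Fit a per-class mean (a stand-in for a per-view model)."""
    sums, counts = {}, {}
    for x, y in zip(points, labels):
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}

def centroid_predict(model, x):
    """Return (nearest class, confidence margin over the runner-up)."""
    ranked = sorted(model, key=lambda y: abs(x - model[y]))
    best = ranked[0]
    margin = abs(x - model[ranked[1]]) - abs(x - model[best])
    return best, margin

def co_train(L1, L2, y, U1, U2, rounds=5):
    """Co-training: each view pseudo-labels confident unlabeled examples
    for the other view, then both models are re-estimated."""
    L1, L2, y = list(L1), list(L2), list(y)
    U1, U2 = list(U1), list(U2)
    for _ in range(rounds):
        if not U1:
            break
        m1 = centroid_fit(L1, y)
        m2 = centroid_fit(L2, y)
        # View 1 labels its most confident unlabeled example.
        i = max(range(len(U1)), key=lambda j: centroid_predict(m1, U1[j])[1])
        lbl, _ = centroid_predict(m1, U1[i])
        L1.append(U1.pop(i)); L2.append(U2.pop(i)); y.append(lbl)
        if not U1:
            break
        # View 2 does the same with its own (pre-update) model.
        j = max(range(len(U2)), key=lambda k: centroid_predict(m2, U2[k])[1])
        lbl, _ = centroid_predict(m2, U2[j])
        L1.append(U1.pop(j)); L2.append(U2.pop(j)); y.append(lbl)
    # Final re-estimation from labeled data plus pseudo-labels.
    return centroid_fit(L1, y), centroid_fit(L2, y), y
```

Under the paper's perspective, the re-estimation step inside this loop is where each view performs an incremental EM-style update from the pseudo-labels supplied by the other view.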
Aminian, M. (2004). Co-training from an incremental EM perspective. Lecture Notes in Computer Science, 3177, 765–773. https://doi.org/10.1007/978-3-540-28651-6_114