Evaluating classification performance with only positive and unlabeled samples

Siamak Hajizadeh; Zili Li; Rolf P.B.J. Dollevoet; David M.J. Tax

Conference ProceedingsOPEN ACCESS

Evaluating classification performance with only positive and unlabeled samples

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8621 LNCS 233-242

DOI: 10.1007/978-3-662-44415-3_24

6Citations

18Readers

Abstract

Testing binary classifiers usually requires a test set with labeled positive and negative examples. In many real-world applications however, some positive objects are manually labeled while negative objects are not labeled explicitly. For instance in the detection of defects in a large collection of objects, the most obvious defects are normally found with ease, while normal-looking objects may just be ignored. In this situation, datasets will consist of only positive and unlabeled samples. Here we propose a measure to estimate the performance of a classifier with test sets lacking labeled negative examples. Experiments are performed to show the effect of several criteria on the accuracy of our estimation, including that of the assumption of "random sampling of the labeled positives". We put the measure into use for classification of real-world defect detection data with no available validation sets. © 2014 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Hajizadeh, S., Li, Z., Dollevoet, R. P. B. J., & Tax, D. M. J. (2014). Evaluating classification performance with only positive and unlabeled samples. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8621 LNCS, pp. 233–242). Springer Verlag. https://doi.org/10.1007/978-3-662-44415-3_24

Evaluating classification performance with only positive and unlabeled samples

Abstract

Author supplied keywords

Cite

Register to see more suggestions