Classification from positive and unlabeled data based on likelihood invariance for measurement

Takeshi Yoshida; Takashi Washio; Takahito Ohshiro; Masateru Taniguchi

Journal Article (Open Access)

Intelligent Data Analysis (2021) 25(1) 57-79

DOI: 10.3233/IDA-194980

Abstract

We propose novel approaches for classification from positive and unlabeled data (PUC) based on the maximum likelihood principle. They are particularly suited to measurement tasks in which the class prior of the target object in each measurement is unknown and differs significantly from the class prior used for training, while the likelihood function representing the observation process is invariant across the training and measurement stages. Our PUCs work effectively without estimating the class priors of the unlabeled objects. First, we present an approach called Naive Likelihood PUC (NL-PUC), which applies the maximum likelihood principle in a nontrivial but rather straightforward manner. An extended version, called Enhanced Likelihood PUC (EL-PUC), employs an algorithm that iteratively improves the likelihood estimation of the positive class, which is advantageous when labeled positive data are scarce. These characteristics are demonstrated both theoretically and experimentally. Moreover, the practicality of our PUCs is demonstrated in a real application to single-molecule measurement.
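The setting described in the abstract, in which the class prior shifts while the likelihood stays invariant, can be illustrated with a small sketch. This is not NL-PUC or EL-PUC, whose details are not given here. It only shows, via Bayes' rule, why a posterior tied to the training prior can mislead at measurement time while the likelihood ratio is unaffected. The Gaussian class-conditional densities, their parameters, and the two prior values are assumptions chosen purely for illustration.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a 1-D normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


# Hypothetical class-conditional likelihoods p(x | y). They are assumed
# to be the same in the training and measurement stages (invariance).
def lik_pos(x):
    return gaussian_pdf(x, 2.0, 1.0)


def lik_neg(x):
    return gaussian_pdf(x, 0.0, 1.0)


def posterior_pos(x, prior_pos):
    """Bayes' rule: p(y=+ | x) for a given class prior p(y=+)."""
    num = prior_pos * lik_pos(x)
    den = num + (1.0 - prior_pos) * lik_neg(x)
    return num / den


def likelihood_ratio(x):
    """p(x | +) / p(x | -); it does not depend on the class prior."""
    return lik_pos(x) / lik_neg(x)


x = 1.2
train_posterior = posterior_pos(x, 0.5)     # assumed training prior
measure_posterior = posterior_pos(x, 0.05)  # assumed measurement prior
ratio = likelihood_ratio(x)

print(f"posterior with training prior:    {train_posterior:.3f}")
print(f"posterior with measurement prior: {measure_posterior:.3f}")
print(f"likelihood ratio (prior-free):    {ratio:.3f}")
```

For the same observation, the posterior falls on opposite sides of 0.5 under the two priors, whereas the likelihood ratio is identical in both stages. This is the kind of invariance that a likelihood-based approach can rely on.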

Cite (APA)

Yoshida, T., Washio, T., Ohshiro, T., & Taniguchi, M. (2021). Classification from positive and unlabeled data based on likelihood invariance for measurement. Intelligent Data Analysis, 25(1), 57–79. https://doi.org/10.3233/IDA-194980
