Binary Classification with Positive Labeling Sources

2Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

To create a large amount of training labels for machine learning models effectively and efficiently, researchers have turned to Weak Supervision (WS), which uses programmatic labeling sources rather than manual annotation. Existing works of WS for binary classification typically assume the presence of labeling sources that are able to assign both positive and negative labels to data in roughly balanced proportions. However, for many tasks of interest where there is a minority positive class, negative examples could be too diverse for developers to generate indicative labeling sources. Thus, in this work, we study the application of WS on binary classification tasks with positive labeling sources only. We propose WEAPO, a simple yet competitive WS method for producing training labels without negative labeling sources. On 10 benchmark datasets, we show WEAPO achieves the highest averaged performance in terms of both the quality of synthesized labels and the performance of the final classifier supervised with these labels. We incorporated the implementation of WEAPO into WRENCH, an existing benchmarking platform.

Cite

CITATION STYLE

APA

Zhang, J., Wang, Y., Yang, Y., Luo, Y., & Ratner, A. (2022). Binary Classification with Positive Labeling Sources. In International Conference on Information and Knowledge Management, Proceedings (pp. 4672–4676). Association for Computing Machinery. https://doi.org/10.1145/3511808.3557552

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free