CCLM: Class-Conditional Label Noise Modelling


Abstract

The performance of deep neural networks depends heavily on the quality and volume of the training data. However, cost-effective labelling processes such as crowdsourcing and web crawling often produce data with noisy (i.e., wrong) labels. Making models robust to this label noise is thus of prime importance. A common approach is to model the label noise via the distribution of per-sample losses. However, the robustness of these methods depends heavily on how accurately the training set is divided into clean and noisy samples. In this work, we pursue this research direction, highlight the problem of treating the loss distribution globally, and propose a class-conditional approach to splitting the clean and noisy samples. We apply our approach to the popular DivideMix algorithm and show that the local, per-class treatment of the loss distribution outperforms the global one. We validate our hypothesis on two popular benchmark datasets and show substantial improvements over the baseline experiments. We further analyze the effectiveness of the proposal using two metrics: Noise Division Accuracy and Classiness.
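The core idea, fitting a separate mixture model to each class's loss distribution instead of one global mixture over all samples, can be sketched as follows. This is an illustrative Python sketch and not the authors' implementation: the function name class_conditional_split, the GMM hyperparameters, and the 0.5 threshold are assumptions, loosely following the DivideMix convention of fitting a two-component Gaussian Mixture Model to per-sample losses and treating the low-mean component as clean.

```python
import numpy as np
from sklearn.mixture import GaussianMixture


def class_conditional_split(losses, labels, num_classes, clean_threshold=0.5):
    """Split the training set into clean/noisy samples, per class.

    For each class, fit a two-component 1-D GMM to that class's
    per-sample losses; the component with the lower mean is taken
    to hold the clean samples (low loss -> likely correct label).
    Hypothetical sketch: hyperparameters and threshold are assumptions.
    """
    clean_mask = np.zeros(len(losses), dtype=bool)
    for c in range(num_classes):
        idx = np.flatnonzero(labels == c)
        if idx.size < 2:
            # Too few samples to fit a 2-component mixture; keep them.
            clean_mask[idx] = True
            continue
        class_losses = losses[idx].reshape(-1, 1)
        gmm = GaussianMixture(n_components=2, max_iter=100, reg_covar=5e-4)
        gmm.fit(class_losses)
        # The lower-mean component is assumed to model clean samples.
        clean_component = int(np.argmin(gmm.means_.ravel()))
        p_clean = gmm.predict_proba(class_losses)[:, clean_component]
        clean_mask[idx] = p_clean > clean_threshold
    return clean_mask
```

The motivation for the per-class fit, as the abstract suggests, is that a single global mixture conflates label noise with class difficulty: inherently harder classes have higher average losses, so a global split tends to misclassify their clean samples as noisy, whereas fitting each class separately normalizes for that difficulty.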

Cite (APA)

Tatjer, A., Nagarajan, B., Marques, R., & Radeva, P. (2023). CCLM: Class-Conditional Label Noise Modelling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 14062 LNCS, pp. 3–14). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-36616-1_1
