Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain

Piyapat Saranrittichai; Chaithanya Kumar Mummadi; Claudia Blaiotta; Mauricio Munoz; Volker Fischer

Conference Proceedings

Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13685 LNCS 294-309

DOI: 10.1007/978-3-031-19806-9_17

3Citations

20Readers

Get full text

Abstract

Shortcut learning occurs when a deep neural network overly relies on spurious correlations in the training dataset in order to solve downstream tasks. Prior works have shown how this impairs the compositional generalization capability of deep learning models. To address this problem, we propose a novel approach to mitigate shortcut learning in uncontrolled target domains. Our approach extends the training set with an additional dataset (the source domain), which is specifically designed to facilitate learning independent representations of basic visual factors. We benchmark our idea on synthetic target domains where we explicitly control shortcut opportunities as well as real-world target domains. Furthermore, we analyze the effect of different specifications of the source domain and the network architecture on compositional generalization. Our main finding is that leveraging data from a source domain is an effective way to mitigate shortcut learning. By promoting independence across different factors of variation in the learned representations, networks can learn to consider only predictive factors and ignore potential shortcut factors during inference.

Cite

CITATION STYLE

APA

Saranrittichai, P., Mummadi, C. K., Blaiotta, C., Munoz, M., & Fischer, V. (2022). Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13685 LNCS, pp. 294–309). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-19806-9_17

Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain

Abstract

Cite

Register to see more suggestions