Invariant Information Bottleneck for Domain Generalization

Abstract

Invariant risk minimization (IRM) has recently emerged as a promising paradigm for domain generalization. However, its loss function is difficult to optimize for nonlinear classifiers, and the original optimization objective can fail in the presence of pseudo-invariant features and geometric skews. Inspired by IRM, in this paper we propose a novel formulation for domain generalization, dubbed invariant information bottleneck (IIB). IIB minimizes invariant risks for nonlinear classifiers while simultaneously mitigating the impact of pseudo-invariant features and geometric skews. Specifically, we first present a novel formulation of invariant causal prediction in terms of mutual information. We then adopt a variational formulation of the mutual information to derive a tractable loss function for nonlinear classifiers. To overcome the failure modes of IRM, we additionally minimize the mutual information between the inputs and their representations. IIB significantly outperforms IRM on synthetic datasets where pseudo-invariant features and geometric skews occur, demonstrating the effectiveness of the proposed formulation in overcoming these failure modes. Furthermore, experiments on DomainBed show that IIB outperforms 13 baselines by 0.9% on average across 7 real-world datasets.
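To make the abstract's description concrete, one plausible instantiation of the objective (inferred from the abstract, not quoted from the paper; Z = Φ(X) is the learned representation, E the environment variable, and λ, β are hypothetical trade-off weights) is:

    \min_{\Phi}\; -\,I(\Phi(X);\, Y) \;+\; \lambda\, I(Y;\, E \mid \Phi(X)) \;+\; \beta\, I(X;\, \Phi(X))

Here I(Y; E | Φ(X)) = 0 encodes the invariant-causal-prediction condition Y ⊥ E | Φ(X), while the β-weighted bottleneck term I(X; Φ(X)) compresses the representation, which is the mechanism the abstract credits with suppressing pseudo-invariant features and geometric skews.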
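The abstract does not spell out the variational treatment of I(X; Z); a minimal sketch of one standard choice (a VIB-style Gaussian encoder with a standard-normal prior, under which the expected KL term upper-bounds I(X; Z); all names and layer sizes below are illustrative, not the paper's architecture) might look like:

import torch
import torch.nn as nn

class StochasticEncoder(nn.Module):
    """Maps x to a Gaussian q(z|x); E_x[KL(q(z|x) || N(0, I))] upper-bounds I(X; Z)."""
    def __init__(self, in_dim: int, z_dim: int):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU())
        self.mu = nn.Linear(128, z_dim)
        self.log_var = nn.Linear(128, z_dim)

    def forward(self, x):
        h = self.net(x)
        mu, log_var = self.mu(h), self.log_var(h)
        # Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I).
        z = mu + torch.randn_like(mu) * (0.5 * log_var).exp()
        # Closed-form KL between q(z|x) = N(mu, sigma^2) and the prior N(0, I).
        kl = 0.5 * (mu.pow(2) + log_var.exp() - log_var - 1).sum(dim=1)
        return z, kl.mean()

In training, the kl term would be added to the classification loss with a weight beta (e.g., loss = cross_entropy(classifier(z), y) + beta * kl), playing the role of the β I(X; Z) bottleneck term above.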

Cite (APA)

Li, B., Shen, Y., Wang, Y., Zhu, W., Reed, C., Li, D., … Zhao, H. (2022). Invariant Information Bottleneck for Domain Generalization. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 7399–7407). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i7.20703
