Simplified Computation and Interpretation of Fisher Matrices in Incremental Learning with Deep Neural Networks

Abstract

Important recent advances in the domain of incremental or continual learning with DNNs, such as Elastic Weight Consolidation (EWC) or Incremental Moment Matching (IMM), rely on a quantity termed the Fisher information matrix (FIM). While the results obtained in this way are very promising, the use of the FIM relies on the assumptions that (a) the FIM can be approximated by its diagonal, and (b) the FIM diagonal entries are related to the variance of a DNN parameter in the context of Bayesian neural networks. In addition, the FIM is notoriously difficult to compute in automatic differentiation (AD) frameworks like TensorFlow, and existing implementations require excessive memory as a result. We present the Matrix of SQuares (MaSQ), which is computed similarly to the FIM, but whose use in EWC-like algorithms follows directly from the calculus of derivatives and requires no additional assumptions. Additionally, MaSQ computation in AD frameworks is much simpler and more memory-efficient than FIM computation. When using MaSQ together with EWC, we show performance superior or equal to FIM/EWC on a variety of benchmark tasks.
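To make the idea concrete, below is a minimal sketch (not the authors' code) of how a diagonal, squared-gradient importance matrix in the spirit of MaSQ could be accumulated in TensorFlow and then used in an EWC-style quadratic penalty. The names model, loss_fn and dataset, and the per-batch averaging, are placeholder assumptions; the paper's exact estimator (e.g. per-sample vs. per-batch gradients) may differ.

```python
import tensorflow as tf

def squared_gradient_diagonal(model, loss_fn, dataset):
    """Accumulate squared gradients of the task loss w.r.t. each parameter
    over a dataset, yielding MaSQ-like diagonal importance weights.
    (Sketch only: assumes every trainable variable receives a gradient.)"""
    accum = [tf.zeros_like(v) for v in model.trainable_variables]
    n_batches = 0
    for x, y in dataset:
        with tf.GradientTape() as tape:
            loss = loss_fn(y, model(x, training=False))
        grads = tape.gradient(loss, model.trainable_variables)
        accum = [a + tf.square(g) for a, g in zip(accum, grads)]
        n_batches += 1
    return [a / n_batches for a in accum]

def ewc_style_penalty(model, anchor_params, importance, lam=1.0):
    """Quadratic penalty lam/2 * sum_i F_i * (theta_i - theta_i*)^2,
    where F_i is either the FIM diagonal or a MaSQ-like diagonal."""
    penalty = 0.0
    for v, v_star, f in zip(model.trainable_variables, anchor_params, importance):
        penalty += tf.reduce_sum(f * tf.square(v - v_star))
    return 0.5 * lam * penalty
```

The practical point reflected in this sketch is that only ordinary gradients of the training loss are squared and accumulated, so no label sampling or log-likelihood gradients, as required for the usual FIM estimate, are involved.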

Citation (APA)

Gepperth, A., & Wiech, F. (2019). Simplified Computation and Interpretation of Fisher Matrices in Incremental Learning with Deep Neural Networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11728 LNCS, pp. 481–494). Springer Verlag. https://doi.org/10.1007/978-3-030-30484-3_39
