Continual learning (sequential learning of tasks) is challenging for deep neural networks, mainly because of catastrophic forgetting, the tendency for accuracy on previously trained tasks to drop when new tasks are learned. Although several biologically-inspired techniques have been proposed for mitigating catastrophic forgetting, they typically require additional memory and/or computational overhead. Here, we propose a novel regularization approach that combines neuronal activation-based importance measurement with neuron state-dependent learning mechanisms to alleviate catastrophic forgetting in both task-Aware and task-Agnostic scenarios. We introduce a neuronal state-dependent mechanism driven by neuronal activity traces and selective learning rules, with storage requirements for regularization parameters that grow slower with network size-compared to schemes that calculate weight importance, whose storage grows quadratically. The proposed model, NEO, is able to achieve performance comparable to other state-of-The-Art regularization based approaches to catastrophic forgetting, while operating with a reduced memory overhead.
CITATION STYLE
Daram, A., & Kudithipudi, D. (2023). NEO: Neuron State Dependent Mechanisms for Efficient Continual Learning. In ACM International Conference Proceeding Series (pp. 11–19). Association for Computing Machinery. https://doi.org/10.1145/3584954.3584960
Mendeley helps you to discover research relevant for your work.