Multi-layer network utilizing rewarded spike time dependent plasticity to learn a foraging task

Pavel Sanda; Steven Skorheim; Maxim Bazhenov

Journal Article (Open Access)

PLoS Computational Biology (2017) 13(9)

DOI: 10.1371/journal.pcbi.1005705

Abstract

Neural networks with a single plastic layer employing reward-modulated spike-time-dependent plasticity (STDP) are capable of learning simple foraging tasks. Here we demonstrate advanced pattern discrimination and continuous learning in a network of spiking neurons with multiple plastic layers. The network utilized both reward-modulated and non-reward-modulated STDP and implemented multiple mechanisms for homeostatic regulation of synaptic efficacy, including heterosynaptic plasticity, gain control, output balancing, activity normalization of rewarded STDP, and hard limits on synaptic strength. We found that adding a hidden layer of neurons employing non-rewarded STDP created neurons that responded to specific combinations of inputs and thus performed basic classification of the input patterns. When combined with a following layer of neurons implementing rewarded STDP, the network was able to learn to discriminate between rewarding patterns and patterns designated as punishing, despite the absence of labeled training data. Synaptic noise allowed for trial-and-error learning that helped to identify the goal-oriented strategies that were effective in solving the task. The study predicts a critical set of properties of a spiking neuronal network with STDP that was sufficient to solve a complex foraging task involving pattern classification and decision making.
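
The distinction the abstract draws between rewarded and non-rewarded STDP, together with hard limits on synaptic strength, can be illustrated with a minimal sketch. This is not the authors' model: the pair-based exponential kernel, the scalar reward signal, and every name and parameter value below (`stdp_kernel`, `rewarded_update`, `unrewarded_update`, `lr`, `tau_ms`, `w_min`, `w_max`) are assumptions chosen for illustration only.

```python
import math


def stdp_kernel(dt_ms, a_plus=1.0, a_minus=1.0, tau_ms=20.0):
    """Pair-based STDP kernel (assumed form), with dt_ms = t_post - t_pre.

    Pre-before-post (dt_ms > 0) yields a positive trace (potentiation);
    post-before-pre (dt_ms < 0) yields a negative trace (depression).
    """
    if dt_ms > 0:
        return a_plus * math.exp(-dt_ms / tau_ms)
    if dt_ms < 0:
        return -a_minus * math.exp(dt_ms / tau_ms)
    return 0.0


def clip(weight, w_min=0.0, w_max=1.0):
    """Hard limits on synaptic strength (hypothetical bounds)."""
    return min(w_max, max(w_min, weight))


def unrewarded_update(weight, dt_ms, lr=0.1):
    """Non-rewarded STDP: the trace changes the weight directly."""
    return clip(weight + lr * stdp_kernel(dt_ms))


def rewarded_update(weight, dt_ms, reward, lr=0.1):
    """Rewarded STDP: the trace is gated by a scalar reward signal.

    reward = +1 (reward), -1 (punishment), 0 (no feedback, no change).
    """
    return clip(weight + lr * reward * stdp_kernel(dt_ms))
```

In this sketch, the same causal spike pairing strengthens a synapse when followed by reward and weakens it when followed by punishment, whereas the non-rewarded rule depends on spike timing alone. The other homeostatic mechanisms named in the abstract (heterosynaptic plasticity, gain control, output balancing, activity normalization) are not modeled here.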

Cite

Sanda, P., Skorheim, S., & Bazhenov, M. (2017). Multi-layer network utilizing rewarded spike time dependent plasticity to learn a foraging task. PLoS Computational Biology, 13(9). https://doi.org/10.1371/journal.pcbi.1005705
