Learning continuous-time working memory tasks with on-policy neural reinforcement learning

Zambrano, Davide; Roelfsema, Pieter; Bohte, Sander

doi:10.1016/j.neucom.2020.11.072

D. Zambrano (Davide), P.R. Roelfsema (Pieter) and S.M. Bohte (Sander)

2021-05-13

Learning continuous-time working memory tasks with on-policy neural reinforcement learning

Neurocomputing , Volume 461 p. 635- 656

An animals’ ability to learn how to make decisions based on sensory evidence is often well described by Reinforcement Learning (RL) frameworks. These frameworks, however, typically apply to event-based representations and lack the explicit and fine-grained notion of time needed to study psychophysically relevant measures like reaction times and psychometric curves. Here, we develop and use a biologically plausible continuous-time RL scheme of CT-AuGMEnT (Continuous-Time Attention-Gated MEmory Tagging) to study these behavioural quantities. We show how CT-AuGMEnT implements on-policy SARSA learning as a biologically plausible form of reinforcement learning with working memory units using ‘attentional’ feedback. We show that the CT-AuGMEnT model efficiently learns tasks in continuous time and can learn to accumulate relevant evidence through time. This allows the model to link task difficulty to psychophysical measurements such as accuracy and reaction-times. We further show how the implementation of a separate accessory network for feedback allows the model to learn continuously, also in case of significant transmission delays between the network's feedforward and feedback layers and even when the accessory network is randomly initialized. Our results demonstrate that CT-AuGMEnT represents a fully time-continuous biologically plausible end-to-end RL model for learning to integrate evidence and make decisions.

Additional Metadata
Keywords	Reinforcement learning, Neural networks, Working memory, Selective attention, Continuous-time SARSA
Persistent URL	doi.org/10.1016/j.neucom.2020.11.072
Journal	Neurocomputing
Project	Deep Spiking Vision: Better, Faster, Cheaper
Organisation	Machine Learning
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Zambrano, D., Roelfsema, P., & Bohte, S. (2021). Learning continuous-time working memory tasks with on-policy neural reinforcement learning. Neurocomputing, 461, 635–656. doi:10.1016/j.neucom.2020.11.072

View at Publisher

Free Full Text ( Final Version , 3mb )

Learning continuous-time working memory tasks with on-policy neural reinforcement learning

Publication

Publication

Address

CWI researchers

Questions or comments?

Learning continuous-time working memory tasks with on-policy neural reinforcement learning

Publication

Publication

Workflow

Workflow

Add Content