Attention-Gated Brain Propagation: How the brain can implement reward-based error backpropagation

Pozzi, Isabella; Bohte, Sander; Roelfsema, Pieter

I. Pozzi (Isabella), S.M. Bohte (Sander) and P.R. Roelfsema (Pieter)

2020-12-06

Presented at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada (December 2020), Virtual, Online

Much recent work has focused on biologically plausible variants of supervised learning algorithms. However, there is no teacher in the motor cortex that instructs the motor neurons, and learning in the brain depends on reward and punishment. We demonstrate a biologically plausible reinforcement learning scheme for deep networks with an arbitrary number of layers. The network chooses an action by selecting a unit in the output layer and uses feedback connections to assign credit to the units in successively lower layers that are responsible for this action. After the choice, the network receives reinforcement and there is no teacher correcting the errors. We show how the new learning scheme – Attention-Gated Brain Propagation (BrainProp) – is mathematically equivalent to error backpropagation, for one output unit at a time. We demonstrate successful learning of deep fully connected, convolutional and locally connected networks on classical and hard image-classification benchmarks: MNIST, CIFAR10, CIFAR100 and Tiny ImageNet. BrainProp achieves an accuracy that is equivalent to that of standard error backpropagation, and better than that of state-of-the-art biologically inspired learning schemes. Additionally, the trial-and-error nature of learning is associated with limited additional training time, so that BrainProp is slower by a factor of 1 to 3.5. Our results thereby provide new insights into how deep learning may be implemented in the brain.
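
The scheme summarized in the abstract (select one output unit, receive a reward, and propagate credit from that unit alone) can be illustrated with a minimal sketch. This is not the authors' implementation. The two-layer architecture, the tanh hidden units, the epsilon-greedy selection rule, the 0/1 reward, and the names `forward`, `select_action` and `update` are all assumptions made for illustration. The update shown is plain gradient descent on the squared reward prediction error of the chosen unit, which is one way to read "error backpropagation, for one output unit at a time".

```python
import math
import random

def forward(x, W1, W2):
    """Two-layer network: tanh hidden layer, linear output (one value per action)."""
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    q = [sum(w * hj for w, hj in zip(row, h)) for row in W2]
    return h, q

def select_action(q, epsilon, rng):
    """Pick a single output unit; epsilon-greedy is an assumed exploration rule."""
    if rng.random() < epsilon:
        return rng.randrange(len(q))
    return max(range(len(q)), key=lambda k: q[k])

def update(x, target, W1, W2, lr, epsilon, rng):
    """One trial: act, receive a reward, and assign credit from the chosen unit only."""
    h, q = forward(x, W1, W2)
    a = select_action(q, epsilon, rng)
    reward = 1.0 if a == target else 0.0   # assumed 0/1 reward, no teacher signal
    delta = reward - q[a]                  # prediction error of the chosen unit
    # Feedback from the selected output unit gates the hidden-layer updates.
    # It is computed before W2 changes so both layers use the same forward pass.
    fb = [W2[a][j] * (1.0 - h[j] ** 2) for j in range(len(h))]
    for j in range(len(h)):
        W2[a][j] += lr * delta * h[j]      # only the chosen unit's weights change
        for i in range(len(x)):
            W1[j][i] += lr * delta * fb[j] * x[i]
    return a, reward
```

Calling `update` repeatedly on (input, label) pairs with a small `epsilon` gives a trial-and-error training loop. Weights into the output units that were not selected are left untouched on each trial, which is the sense in which credit is assigned for one output unit at a time in this sketch.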

Additional Metadata
Conference	34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada
Organisation	Machine Learning
Citation	Pozzi, I., Bohte, S., & Roelfsema, P. (2020). Attention-Gated Brain Propagation: How the brain can implement reward-based error backpropagation. In Advances in Neural Information Processing Systems.

See Also
software|data	BrainProp, I. Pozzi (Isabella)
