A rapid and efficient learning rule for biological neural circuits

The dominant view in neuroscience is that changes in synaptic weights underlie learning. It is unclear, however, how the brain is able to determine which synapses should change, and by how much. This uncertainty stands in sharp contrast to deep learning, where changes in weights are explicitly engineered to optimize performance. However, the main tool for that, backpropagation, has two problems. One is neuro-science related: it is not biologically plausible. The other is inherent: networks trained with this rule tend to forget old tasks when learning new ones. Here we introduce the Dendritic Gated Network (DGN), a variant of the Gated Linear Network, which offers a biologically plausible alternative to backpropagation. DGNs combine dendritic ‘gating’ (whereby interneurons target dendrites to shape neuronal responses) with local learning rules to yield provably efficient performance. They are significantly more data efficient than conventional artificial networks, and are highly resistant to forgetting. Consequently, they perform well on a variety of tasks, in some cases better than backpropagation. Importantly, DGNs have structural and functional similarities to the cerebellum, a link that we strengthen by using in vivo two-photon calcium imaging to show that single interneurons suppress activity in individual dendritic branches of Purkinje cells, a key feature of the model. Thus, DGNs leverage targeted dendritic inhibition and local learning – two features ubiquitous in the brain – to achieve fast and efficient learning.

[1]  L. F. Abbott,et al.  Credit Assignment Through Broadcasting a Global Error Vector , 2021, NeurIPS.

[2]  Jinsook Kim,et al.  Molecular Layer Interneurons: Key Elements of Cerebellar Network Computation and Behavior , 2020, Neuroscience.

[3]  D. Budden,et al.  Gated Linear Networks , 2019, AAAI.

[4]  Sercan O. Arik,et al.  TabNet: Attentive Interpretable Tabular Learning , 2019, AAAI.

[5]  Siavash Golkar,et al.  A biologically plausible neural network for local supervision in cortical microcircuits , 2020, ArXiv.

[6]  Joel Veness,et al.  A Combinatorial Perspective on Transfer Learning , 2020, NeurIPS.

[7]  Christian K. Machens,et al.  Biological credit assignment through dynamic inversion of feedforward networks , 2020, NeurIPS.

[8]  Bernd Kuhn,et al.  Dendritic coincidence detection in Purkinje neurons of awake mice , 2020, bioRxiv.

[9]  P. Latham,et al.  Kernelized information bottleneck leads to biologically plausible 3-factor Hebbian learning in deep networks , 2020, NeurIPS.

[10]  Tor Lattimore,et al.  Gaussian Gated Linear Networks , 2020, NeurIPS.

[11]  Reza Shadmehr,et al.  Population coding in the cerebellum and its implications for learning from error , 2020, bioRxiv.

[12]  Adam Santoro,et al.  Backpropagation and the brain , 2020, Nature Reviews Neuroscience.

[13]  Richard Naud,et al.  Burst-dependent synaptic plasticity can coordinate learning in hierarchical circuits , 2020, Nature Neuroscience.

[14]  Court Hull,et al.  Prediction signals in the cerebellum: Beyond supervised motor learning , 2020, eLife.

[15]  Julie L. Lefebvre,et al.  Morphological pseudotime ordering and fate mapping reveal diversification of cerebellar inhibitory interneurons , 2020, Nature Communications.

[16]  Online Learning in Contextual Bandits using Gated Linear Networks , 2020, NeurIPS.

[17]  Jeffrey R. Powell,et al.  Transgenic Aedes aegypti Mosquitoes Transfer Genes into a Natural Population , 2019, Scientific Reports.

[18]  C. Pedroarena,et al.  Short-term plasticity at Purkinje to deep cerebellar nuclear neuron synapses supports a slow gain-control mechanism enabling scaled linear encoding over second-long time windows , 2019, bioRxiv.

[19]  J. J. Macklin,et al.  High-performance calcium sensors for imaging activity in neuronal populations and microcompartments , 2019, Nature Methods.

[20]  Bastiaan S. Veeling,et al.  Putting An End to End-to-End: Gradient-Isolated Learning of Representations , 2019, NeurIPS.

[21]  M. Häusser,et al.  Predictive and reactive reward signals conveyed by climbing fiber inputs to cerebellar Purkinje cells , 2019, Nature Neuroscience.

[22]  Peter C. Humphreys,et al.  Deep Learning without Weight Transport , 2019, NeurIPS.

[23]  James C. R. Whittington,et al.  Theories of Error Back-Propagation in the Brain , 2019, Trends in Cognitive Sciences.

[24]  Arild Nøkland,et al.  Training Neural Networks with Local Error Signals , 2019, ICML.

[25]  Michael Eickenberg,et al.  Greedy Layerwise Learning Can Scale to ImageNet , 2018, ICML.

[26]  Yoshua Bengio,et al.  Dendritic cortical microcircuits approximate the backpropagation algorithm , 2018, NeurIPS.

[27]  George J. Augustine,et al.  Graded Control of Climbing-Fiber-Mediated Plasticity and Learning by Inhibition in the Cerebellum , 2018, Neuron.

[28]  J. Christie,et al.  Inhibition gates supralinear Ca2+ signaling in Purkinje cell dendrites during practiced movements , 2018, eLife.

[29]  Roy V. Sillitoe,et al.  Molecular layer interneurons shape the spike activity of cerebellar Purkinje cells , 2018, Scientific Reports.

[30]  Jennifer L Raymond,et al.  Computational Principles of Supervised Learning in the Cerebellum. , 2018, Annual review of neuroscience.

[31]  Yee Whye Teh,et al.  Progress & Compress: A scalable framework for continual learning , 2018, ICML.

[32]  Pieter R. Roelfsema,et al.  Control of synaptic plasticity in deep cortical networks , 2018, Nature Reviews Neuroscience.

[33]  Tor Lattimore,et al.  Online Learning with Gated Linear Networks , 2017, ArXiv.

[34]  George J Augustine,et al.  Serial processing of kinematic signals by cerebellar circuitry during voluntary whisking , 2017, Nature Communications.

[35]  Devika Narain,et al.  A cerebellar mechanism for learning prior distributions of time intervals , 2017, Nature Communications.

[36]  Ben Deverett,et al.  Cerebellar granule cells acquire a widespread predictive feedback signal during motor learning , 2017, Nature Neuroscience.

[37]  L. Luo,et al.  Cerebellar granule cells encode the expectation of reward , 2017, Nature.

[38]  Surya Ganguli,et al.  Continual Learning Through Synaptic Intelligence , 2017, ICML.

[39]  Timothy P Lillicrap,et al.  Deep Learning with Dynamic Spiking Neurons and Fixed Feedback Weights , 2017, Neural Computation.

[40]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[41]  Christoph H. Lampert,et al.  iCaRL: Incremental Classifier and Representation Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Timothy P Lillicrap,et al.  Towards deep learning with segregated dendrites , 2016, eLife.

[43]  Mario Dipoppa,et al.  Suite2p: beyond 10,000 neurons with standard two-photon microscopy , 2016, bioRxiv.

[44]  T. Ohshima,et al.  Stimulated emission from nitrogen-vacancy centres in diamond , 2016, Nature Communications.

[45]  Colin J. Akerman,et al.  Random synaptic feedback weights support error backpropagation for deep learning , 2016, Nature Communications.

[46]  Arild Nøkland,et al.  Direct Feedback Alignment Provides Learning in Deep Neural Networks , 2016, NIPS.

[47]  R. Tremblay,et al.  GABAergic Interneurons in the Neocortex: From Cellular Properties to Circuits , 2016, Neuron.

[48]  Amiram Grinvald,et al.  Accurate spike estimation from noisy calcium signals for ultrafast three-dimensional imaging of large neuronal populations in vivo , 2016, Nature Communications.

[49]  D. Wolpert,et al.  Computations underlying sensorimotor learning , 2016, Current Opinion in Neurobiology.

[50]  A. Hall,et al.  Adaptive Switching Circuits , 2016 .

[51]  C. Sotelo Molecular Layer Interneurons of the Cerebellum: Developmental and Morphological Aspects , 2015, The Cerebellum.

[52]  M. Häusser,et al.  Simultaneous all-optical manipulation and recording of neural circuit activity with cellular resolution in vivo , 2014, Nature Methods.

[53]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[54]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[55]  Reza Shadmehr,et al.  A memory of errors in sensorimotor learning , 2014, Science.

[56]  Yan Yang,et al.  Duration of complex-spikes grades Purkinje cell plasticity and cerebellar motor learning , 2014, Nature.

[57]  Nicolas Brunel,et al.  A Cerebellar Learning Model of Vestibulo-Ocular Reflex Adaptation in Wild-Type and Mutant Mice , 2014, The Journal of Neuroscience.

[58]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[59]  Arnd Roth,et al.  Structured Connectivity in Cerebellar Inhibitory Networks , 2014, Neuron.

[60]  Yoshua Bengio,et al.  An Empirical Investigation of Catastrophic Forgeting in Gradient-Based Neural Networks , 2013, ICLR.

[61]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[62]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[63]  Jia Liu,et al.  The Hierarchical Brain Network for Face Recognition , 2013, PloS one.

[64]  Daryl M. Gohl,et al.  Layered reward signaling through octopamine and dopamine in Drosophila , 2012, Nature.

[65]  G. Rubin,et al.  A subset of dopamine neurons signals reward for odour memory in Drosophila , 2012, Nature.

[66]  Michael Häusser,et al.  Dendritic Calcium Signaling Triggered by Spontaneous and Sensory-Evoked Climbing Fiber Input to Cerebellar Purkinje Cells In Vivo , 2011, The Journal of Neuroscience.

[67]  Kamran Khodakhah,et al.  The Role of Interneurons in Shaping Purkinje Cell Responses in the Cerebellar Cortex , 2011, The Journal of Neuroscience.

[68]  Alain Marty,et al.  Interneurons of the cerebellar cortex toggle Purkinje cells between up and down states , 2010, Proceedings of the National Academy of Sciences.

[69]  P. Dean,et al.  The cerebellar microcircuit as an adaptive filter: experimental and computational evidence , 2010, Nature Reviews Neuroscience.

[70]  Nan Zheng,et al.  Synaptic Inhibition, Excitation, and Plasticity in Neurons of the Cerebellar Nuclei , 2010, The Cerebellum.

[71]  William Wisden,et al.  Synaptic inhibition of Purkinje cells mediates consolidation of vestibulo-cerebellar motor learning , 2009, Nature Neuroscience.

[72]  Kamran Khodakhah,et al.  The Linear Computational Algorithm of Cerebellar Purkinje Cells , 2006, The Journal of Neuroscience.

[73]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[74]  S. Arber,et al.  A Developmental Switch in the Response of DRG Neurons to ETS Transcription Factor Signaling , 2005, PLoS biology.

[75]  Michael Häusser,et al.  Feed‐forward inhibition shapes the spike output of cerebellar Purkinje cells , 2005, The Journal of physiology.

[76]  E. Boyden,et al.  Cerebellum-dependent learning: the role of multiple plasticity mechanisms. , 2004, Annual review of neuroscience.

[77]  Rich Caruana,et al.  Multitask Learning , 1997, Machine Learning.

[78]  M. Kawato,et al.  A hierarchical neural-network model for control and learning of voluntary movement , 2004, Biological Cybernetics.

[79]  Doris Y. Tsao,et al.  Faces and objects in macaque cerebral cortex , 2003, Nature Neuroscience.

[80]  Cornelius Schwarz,et al.  Efficacy and short-term plasticity at GABAergic synapses between Purkinje and cerebellar nuclei neurons. , 2003, Journal of neurophysiology.

[81]  B. Barbour,et al.  Properties of Unitary Granule Cell→Purkinje Cell Synapses in Adult Rat Cerebellar Slices , 2002, The Journal of Neuroscience.

[82]  John D. Storey A direct approach to false discovery rates , 2002 .

[83]  D. Linden,et al.  Rapid, synaptically driven increases in the intrinsic excitability of cerebellar deep nuclear neurons , 2000, Nature Neuroscience.

[84]  Stefan Schaal,et al.  Locally Weighted Projection Regression : An O(n) Algorithm for Incremental Real Time Learning in High Dimensional Space , 2000 .

[85]  R. French Catastrophic forgetting in connectionist networks , 1999, Trends in Cognitive Sciences.

[86]  D. Linden,et al.  Polarity of Long-Term Synaptic Gain Change Is Related to Postsynaptic Spike Firing at a Cerebellar Inhibitory Synapse , 1998, Neuron.

[87]  Masao Ito Cerebellar learning in the vestibulo–ocular reflex , 1998, Trends in Cognitive Sciences.

[88]  D. Wolpert,et al.  Internal models in the cerebellum , 1998, Trends in Cognitive Sciences.

[89]  C. Pouzat,et al.  Developmental Regulation of Basket/Stellate Cell→Purkinje Cell Synapses in the Cerebellum , 1997, The Journal of Neuroscience.

[90]  M. Häusser,et al.  Tonic Synaptic Inhibition Modulates Neuronal Output Pattern and Spatiotemporal Synaptic Integration , 1997, Neuron.

[91]  B. R. Sastry,et al.  Postsynaptic mechanisms underlying long-term depression of GABAergic transmission in neurons of the deep cerebellar nuclei. , 1996, Journal of neurophysiology.

[92]  Anthony V. Robins,et al.  Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[93]  W. N. Ross,et al.  IPSPs strongly inhibit climbing fiber-activated [Ca2+]i increases in the dendrites of cerebellar Purkinje neurons , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[94]  T. Sejnowski,et al.  Learning and memory in the vestibulo-ocular reflex. , 1995, Annual review of neuroscience.

[95]  Francis Crick,et al.  The recent excitement about neural networks , 1989, Nature.

[96]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .

[97]  James A. Anderson,et al.  Neurocomputing: Foundations of Research , 1988 .

[98]  Stephen Grossberg,et al.  The ART of adaptive pattern recognition by a self-organizing neural network , 1987, Computer.

[99]  Stephen Grossberg,et al.  Competitive Learning: From Interactive Activation to Adaptive Resonance , 1987, Cogn. Sci..

[100]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[101]  Masao Ito,et al.  Long-lasting depression of parallel fiber-Purkinje cell transmission induced by conjunctive stimulation of parallel fibers and climbing fibers in the cerebellar cortex , 1982, Neuroscience Letters.

[102]  F. A. Miles,et al.  Plasticity in the vestibulo-ocular reflex: a new hypothesis. , 1981, Annual review of neuroscience.

[103]  R. Llinás,et al.  Electrophysiological properties of in vitro Purkinje cell somata in mammalian cerebellar slices. , 1980, The Journal of physiology.

[104]  D. Armstrong,et al.  Activity patterns of cerebellar cortical neurones and climbing fibre afferents in the awake cat. , 1979, The Journal of physiology.

[105]  D. Robinson Adaptive gain control of vestibuloocular reflex by the cerebellum. , 1976, Journal of neurophysiology.

[106]  D. Harriman CEREBELLAR CORTEX, CYTOLOGY AND ORGANIZATION , 1974 .

[107]  J. Albus A Theory of Cerebellar Function , 1971 .

[108]  D. Marr A theory of cerebellar cortex , 1969, The Journal of physiology.

[109]  J ECCLES Functional Meaning of the Patterns of Synaptic Connections in the Cerebellum , 1965, Perspectives in biology and medicine.

[110]  J. Eccles,et al.  POSTSYNAPTIC INHIBITION OF CEREBELLAR PURKINJE CELLS. , 1964, Journal of neurophysiology.

[111]  Henry J. Kelley,et al.  Gradient Theory of Optimal Flight Paths , 1960 .