3 3 A Computational Model of Avian Song Learning

Oscine song learning has an auditory phase during which a tutor song is learned and a sensorimotor phase of successive improvement that leads to adult song. A theoretical framework for song learning is presented based on the hypothesis that the primary role of the anteriorforebrainpathway of the song system is to transform an auditory template to a motor program by a form of reinforcement learning. This framework was tested by building a network model of the song-learning system including a model of the syrinx, the avian vocal organ. The model replicated the spectral envelopes of the syllables from zebra finch songs after several hundred trials of learning. The performance of the model was even better when trained on songs generated by another model having the same architecture. Experiments are proposed to further test the biological plausibility of the hypothesis, which may lead to a more detailed model of the song-learning system. Other types of sensorimotor learning based on mimicry could be implemented with a similar type of computational model. In comparison with our understanding of the preprogrammed central pattern generators found in many invertebrates and lower vertebrates responsible for complex motor behaviors (Cohen, Rossignol, and Grillner, 1988; Harris-Wanick et al., 1992; Kristan, 1992), much less is known about the representation of motor patterns acquired through experience in humans and other vertebrates, such as walking, riding a bicycle, or talking. Singing in oscine birds is a favorable system for studying the acquisition of complex motor patterns. Much is known about the ethology of birdsong learning and the influence of early auditory learning (Marler, 1963; Konishi, 1965; Marler, 1991; Catchpole and Slater, 1995). The major brain nuclei involved in song control and learning have been identified, as schematically shown in figure 33.1 (Nottebohm, Stokes, and Leonard, 1976; Bottjer et al., 1989). New data are accumulating from lesion and recording experiments on these nuclei (for reviews, see Konishi, 1985; Doupe, 1993; Margoliash, 1997; Bottjer and Arnold, 1997). KENJI DOYA Kawato Dynamic Brain Project, ERATO, Japan Science and Technology Corporation, Kyoto, Japan, and Howard Hughes Medical Institute, Salk Institute for Biological Studies, LaJolla, Calif. TERRENCE J. SEJNOWSKI Department of Biology, University of California, San Diego, and Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, Calif. The primary goal of this chapter is to present a theoretical framework and a working model for song learning based on recent experimental findings. Specifically, we focus on the function of the anteriorforebrain pathway, which is not involved in song production in adult birds, but is necessary for song learning in young birds (Bottjer, Miesner, and Arnold, 1984). Our main hypothesis is that the anterior forebrain pathway works as a reinforcement learning system that is similar to the adaptive critic architecture proposed by Barto, Sutton, and Anderson (1983). The song template is a key concept in birdsong learning. A young male bird listens to a tutor song during the critical period and memorizes a template of the song; later, the bird learns to sing the stored song by comparing its own vocalization to the song template using auditory feedback (Konishi, 1965). However, it is still an open question how the song is encoded and where in the bird's brain the song template is stored. Recent experiments in zebra finch suggest that the song control system has a hierarchical organization: HVc, the high vocal center, is involved in producing a sequence of syllables, whereas its downstream nucleus RA is responsible for the subsyllabic components (Vu, Mazurek, and Kuo, 1994; Yu and Margoliash, 1996). If we assume that a song is learned in such a hierarchical fashion, the problem of song learning can be decomposed into the following three subproblems: 1. Sensory encoding: How to encode the acoustic features of syllables in such a way that they are reliably recognized. 2. Sequential memory: How to organize the network so that syllable sequences are stably memorized and reproduced. 3. Motor decoding: How to find the motor command patterns needed to replicate the acoustic features of each syllable. Existing experimental evidence does not provide straightforward solutions to these problems. A computational approach could help in exploring the biological solutions and, in particular, in providing functional + Anterior Forebrain Pathway respiratory system FIGURE 33.1 Schematic diagram of the major songbird brain nuclei involved in song control. The thinner arrows show the direct motor control pathway, and the thicker arrows show the anterior forebrain pathway. Abbreviations: Uva, nucleus uvaeformis of thalamus; NIf, nucleus interface of neostriatum; L, field L of forebrain; HVc, high vocal center (formerly called hyperstriatum ventrale, pars caudale); RA, robust nucleus of archistriatum; DM, dorsomedial part of nucleus intercollicularis; nXIIts, tracheosyringeal part of hypoglossal nucleus; AVT, ventral area of Tsai of midbrain; X, area X of lobus parolfactorius; DLM, medial part of dorsolateral nucleus of thalamus; LMAN, lateral magnocellular nucleus of anterior neostriatum. constraints on the organization of the learning system. For example, theories of unsupervised learning (von der Malsburg, 1973; Amari, 1977; Linsker, 1986; Bell and Sejnowski, 1995) suggest several possible solutions to sensory encoding problems. Studies of associative memory networks (Fukushima, 1973; Sompolinsky and Kanter, 1986; Dehaene, Changeux, and Nadal, 1987; Amari, 1988; Morita, 1996) provide constraints on representation and architectures that enable stable storage of temporal sequences. There have been extensive studies on the "inverse problem" of finding the control input for a nonlinear system to realize a given target output (Miller, Sutton, and Werbos, 1990; Gullapalli, 1995). In this chapter, we propose a working hypothesis for the functions subserved by song-related brain nuclei in songbirds (figure 33.1), with an emphasis on the role of anterior forebrain pathway in solving the motor decoding problem. Figure 33.2 illustrates various schemes for solving inverse problems using neural networks. In the first scheme (figure 33.2a), the desired output is converted to a desired motor command by an inverse model of the motor system that enables replication of the desired output in one shot. Although attractive as a model of vocal learning in other species like humans, this is not an appropriate model for vocal learning in songbirds because they require many repetitions of singing trials with auditory feedback. Another possible scheme is error correction learning (figure 33.213) that uses a linear approximation of the inverse model to convert motor output error into the motor command error for incremental learning of the control network. The problem is that the learning schemes proposed to date either use a biologically implausible algorithm (Jordan and Rumelhart, 1992) or assume the preexistence of an approximate inverse model (Kawato, Furukawa, and Suzuki, 1987; Kawato, 1990). Furthermore, in order to calculate the error in the acoustic output, a replica of the target output, or the tutor song, has to be available. The third scheme (figure 33.2~) is based on the paradigm of reinforcement learning (Sutton and Barto, 1998). It does not use an inverse model and uses a critic that evaluates the motor output by comparing the present vocal output with the tutor song. Learning is based on the correlation between stochastic changes in the motor command and the increase or decrease in the evaluation (Barto, Sutton, and Anderson, 1983; Gullapalli, 1995). There is no need to maintain a replica of the tutor song. Activation levels of auditory neurons that have selective tuning to the tutor song can be used as the

[1]  M. Konishi The role of auditory feedback in the control of vocalization in the white-crowned sparrow. , 1965, Zeitschrift fur Tierpsychologie.

[2]  F. Nottebohm,et al.  Central control of song in the canary, Serinus canarius , 1976, The Journal of comparative neurology.

[3]  J. S. McCasland,et al.  Interaction between auditory and motor activities in an avian song control nucleus. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[4]  A. Arnold,et al.  Evidence for a catecholaminergic projection to area X in the zebra finch , 1981, The Journal of comparative neurology.

[5]  John H. Brackenbury,et al.  2 – The Structural Basis of Voice Production and Its Relationship to Sound Characteristics , 1982 .

[6]  D. Margoliash Acoustic parameters underlying the responses of song-specific neurons in the white-crowned sparrow , 1983, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[7]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[8]  A. Arnold,et al.  Forebrain lesions disrupt development but not maintenance of song in passerine birds. , 1984, Science.

[9]  M. Konishi,et al.  Birdsong: from behavior to neuron. , 1985, Annual review of neuroscience.

[10]  Richard M. Casey,et al.  Theoretical models of the avian syrinx , 1985 .

[11]  Masakazu Konishi,et al.  Neuronal growth, atrophy and death in a sexually dimorphic song nucleus in the zebra finch brain , 1985, Nature.

[12]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[13]  R. Linsker,et al.  From basic network principles to neural architecture , 1986 .

[14]  D Margoliash,et al.  Preference for autogenous song by auditory neurons in a song system nucleus of the white-crowned sparrow , 1986, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[15]  Kanter,et al.  Temporal association in asymmetric neural networks. , 1986, Physical review letters.

[16]  S. Nowicki,et al.  Vocal tract resonances in oscine bird sound production: evidence from birdsongs in a helium atmosphere , 1987, Nature.

[17]  S Dehaene,et al.  Neural networks that learn temporal sequences by selection. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[18]  S. Rossignol,et al.  Neural Control of Rhythmic Movements in Vertebrates , 1988 .

[19]  S.-i. Amari Statistical neurodynamics of various versions of correlation associative memory , 1988, IEEE 1988 International Conference on Neural Networks.

[20]  Fernando Nottebohm,et al.  Organization of the zebra finch song control system: I. Representation of syringeal muscles in the hypoglossal nucleus , 1988, The Journal of comparative neurology.

[21]  Sandra A. Brown,et al.  Axonal connections of a forebrain nucleus involved with vocal learning in zebra finches , 1989, The Journal of comparative neurology.

[22]  H. Williams Multiple Representations and Auditory‐Motor Interactions in the Avian Song System , 1989 .

[23]  H. Williams Models for song learning in the zebra finch: fathers or others? , 1990, Animal Behaviour.

[24]  E. Nordeen,et al.  Selective impairment of song learning following lesions of a forebrain nucleus in the juvenile zebra finch. , 1990, Behavioral and neural biology.

[25]  F. Nottebohm,et al.  A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[26]  P. Marler Song-learning behavior: The interface with neuroethology , 1991, Trends in Neurosciences.

[27]  D. Vicario Organization of the zebra finch song control system: Functional organization of outputs from nucleus robustus archistriatalis , 1991, The Journal of comparative neurology.

[28]  R. Mooney,et al.  Two distinct inputs to an avian song nucleus activate different glutamate receptor subtypes on individual neurons. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[29]  D. Vicario Neural mechanisms of vocal production in songbirds , 1991, Current Opinion in Neurobiology.

[30]  A. Doupe,et al.  Song-selective auditory circuits in the vocal control system of the zebra finch. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Nozomu Saito,et al.  NMDA receptors participate differentially in two different synaptic inputs in neurons of the zebra finch robust nucleus of the archistriatum in vitro , 1991, Neuroscience Letters.

[32]  R. Mooney,et al.  Synaptic basis for developmental plasticity in a birdsong nucleus , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[33]  Neuronal basis of behavior , 1992, Current Opinion in Neurobiology.

[34]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[35]  D. Margoliash,et al.  Temporal and harmonic combination-sensitive neurons in the zebra finch's HVc , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[36]  R. Harris-Warrick In: Dynamic Biological Networks: The Stomatogastric Nervous System , 1992 .

[37]  A. Doupe A neural circuit specialized for vocal learning , 1993, Current Opinion in Neurobiology.

[38]  S. Volman,et al.  Development of neural selectivity for birdsong during vocal learning , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[39]  R. Nicoll,et al.  NMDA-receptor-dependent synaptic plasticity: multiple forms and mechanisms , 1993, Trends in Neurosciences.

[40]  W. Schultz,et al.  Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[41]  J. Wild Descending projections of the songbird nucleus robustus archistriatalis , 1993, The Journal of comparative neurology.

[42]  H. Williams,et al.  Temporal patterning of song production: participation of nucleus uvaeformis of the thalamus. , 1993, Journal of neurobiology.

[43]  Joel L. Davis,et al.  A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement , 1994 .

[44]  J. Deuchars,et al.  Temporal and spatial properties of local circuits in neocortex , 1994, Trends in Neurosciences.

[45]  D Margoliash,et al.  Global synchronous response to autogenous song in zebra finch HVc. , 1994, Journal of neurophysiology.

[46]  Gregory F Ball,et al.  Characterization and localization of D1 dopamine receptors in the sexually dimorphic vocal control nucleus, area X, and the basal ganglia of European starlings. , 1994, Journal of neurobiology.

[47]  K. P. Unnikrishnan,et al.  Alopex: A Correlation-Based Learning Algorithm for Feedforward and Recurrent Neural Networks , 1994, Neural Computation.

[48]  Terrence J. Sejnowski,et al.  A Novel Reinforcement Model of Birdsong Vocalization Learning , 1994, NIPS.

[49]  E.C.L. Vu,et al.  Identification of a forebrain motor programming network for the learned song of zebra finches , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[50]  K. Fox,et al.  Critical period control in sensory cortex , 1994, Current Opinion in Neurobiology.

[51]  Frank Johnson,et al.  Topographic organization of a forebrain pathway involved with vocal learning in zebra finches , 1995, The Journal of comparative neurology.

[52]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[53]  P. Slater,et al.  Bird Song: Biological Themes and Variations , 1995 .

[54]  Vijaykumar Gullapalli,et al.  Direct associative reinforcement learning methods for dynamic systems control , 1995, Neurocomputing.

[55]  M S Lewicki,et al.  Mechanisms underlying the sensitivity of songbird forebrain neurons to temporal order. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[56]  G. E. Vates,et al.  Feedback circuitry within a song-learning pathway. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Franz Goller,et al.  Implications for lateralization of bird song from unilateral gating of bilateral motor patterns , 1995, Nature.

[58]  M. Lewicki Intracellular Characterization of Song-Specific Neurons in the Zebra Finch Auditory Forebrain , 1996, The Journal of Neuroscience.

[59]  A. C. Yu,et al.  Temporal Hierarchical Control of Singing in Birds , 1996, Science.

[60]  Masahiko Morita,et al.  Memory and Learning of Sequential Patterns by Nonmonotone Neural Networks , 1996, Neural Networks.

[61]  A. Arnold,et al.  Developmental plasticity in neural circuits for a learned behavior. , 1997, Annual review of neuroscience.