Disentangling the Roles of Approach, Activation and Valence in Instrumental and Pavlovian Responding

Hard-wired, Pavlovian, responses elicited by predictions of rewards and punishments exert significant benevolent and malevolent influences over instrumentally-appropriate actions. These influences come in two main groups, defined along anatomical, pharmacological, behavioural and functional lines. Investigations of the influences have so far concentrated on the groups as a whole; here we take the critical step of looking inside each group, using a detailed reinforcement learning model to distinguish effects to do with value, specific actions, and general activation or inhibition. We show a high degree of sophistication in Pavlovian influences, with appetitive Pavlovian stimuli specifically promoting approach and inhibiting withdrawal, and aversive Pavlovian stimuli promoting withdrawal and inhibiting approach. These influences account for differences in the instrumental performance of approach and withdrawal behaviours. Finally, although losses are as informative as gains, we find that subjects neglect losses in their instrumental learning. Our findings argue for a view of the Pavlovian system as a constraint or prior, facilitating learning by alleviating computational costs that come with increased flexibility.

[1]  B. Skinner,et al.  Some quantitative properties of anxiety , 1941 .

[2]  Frederick Mosteller,et al.  Stochastic Models for Learning , 1956 .

[3]  M. Hamilton A RATING SCALE FOR DEPRESSION , 1960, Journal of neurology, neurosurgery, and psychiatry.

[4]  John Garcia,et al.  Relation of cue to consequence in avoidance learning , 1966 .

[5]  R. Rescorla,et al.  Two-process learning theory: Relationships between Pavlovian conditioning and instrumental learning. , 1967, Psychological review.

[6]  D. R. Williams,et al.  Auto-maintenance in the pigeon: sustained pecking despite contingent non-reinforcement. , 1969, Journal of the experimental analysis of behavior.

[7]  C. Spielberger,et al.  STAI manual for the State-trait anxiety inventory ("self-evaluation questionnaire") , 1970 .

[8]  M. Seligman On the generality of the laws of learning , 1970 .

[9]  J. Overmier,et al.  On insirumental response interaction as explaining the influences of Pavlovian CS+s upon avoidance behavior , 1971 .

[10]  J. Gray,et al.  The psychology of fear and stress , 1971 .

[11]  R. Bolles,et al.  Freezing as an avoidance response: Another look at the operant-respondent distinction☆ , 1973 .

[12]  R. W. Schulz The Psychology of Fear and Stress , 1974 .

[13]  Kumpati S. Narendra,et al.  Learning Automata - A Survey , 1974, IEEE Trans. Syst. Man Cybern..

[14]  W. Timberlake,et al.  Auto-Shaping in Rats to the Presentation of Another Rat Predicting Food , 1975, Science.

[15]  S. Iversen,et al.  5-Hydroxytryptamine and punishment , 1977, Nature.

[16]  J. Pearce,et al.  Inhibitory interactions between appetitive and aversive stimuli. , 1977 .

[17]  J. Pearce,et al.  Inhibitory interactions between appetitive and aversive stimuli. , 1977 .

[18]  P. Holland Conditioned stimulus as a determinant of the form of the Pavlovian conditioned response. , 1977, Journal of experimental psychology. Animal behavior processes.

[19]  C. Carter,et al.  Differential effects of central serotonin manipulation on hyperactive and stereotyped behaviour. , 1978, Life sciences.

[20]  M. Åsberg,et al.  A New Depression Scale Designed to be Sensitive to Change , 1979, British Journal of Psychiatry.

[21]  R. Bolles,et al.  On the Ability of Prey to Recognize Predators , 1980 .

[22]  S. Lea,et al.  Contemporary Animal Learning Theory, Anthony Dickinson. Cambridge University Press, Cambridge (1981), xii, +177 pp. £12.50 hardback, £3.95 paperback , 1981 .

[23]  F. Masterson,et al.  Species-specific defense reactions and avoidance learning , 1982, The Pavlovian Journal of Biological Science.

[24]  R. Wise Neuroleptics and operant behavior: The anhedonia hypothesis , 1982, Behavioral and Brain Sciences.

[25]  P. Lovibond Facilitation of instrumental behavior by a Pavlovian appetitive conditioned stimulus. , 1983, Journal of experimental psychology. Animal behavior processes.

[26]  K. Hollis,et al.  The biological function of Pavlovian conditioning: the best defense is a good offense. , 1984, Journal of experimental psychology. Animal behavior processes.

[27]  J. Gray The neuropsychology of anxiety. , 1985, Issues in mental health nursing.

[28]  W. Hershberger An approach through the looking-glass , 1986 .

[29]  P. Soubrié Reconciling the role of central serotonin neurons in human and animal behavior , 1986, Behavioral and Brain Sciences.

[30]  A. Beck,et al.  An inventory for measuring clinical anxiety: psychometric properties. , 1988, Journal of consulting and clinical psychology.

[31]  D. Kahneman,et al.  Experimental Tests of the Endowment Effect and the Coase Theorem , 1990, Journal of Political Economy.

[32]  G. E. Alexander,et al.  Functional architecture of basal ganglia circuits: neural substrates of parallel processing , 1990, Trends in Neurosciences.

[33]  J. Deakin,et al.  5-HT and mechanisms of defence , 1991, Journal of psychopharmacology.

[34]  M. T. Shipley,et al.  Columnar organization in the midbrain periaqueductal gray: modules for emotional expression? , 1994, Trends in Neurosciences.

[35]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[36]  L. Houck,et al.  Foundations of Animal Behavior: Classic Papers with Commentaries , 1996 .

[37]  P. Pini Addiction , 1996, The Lancet.

[38]  J. Pearce,et al.  Acquisition of Conditioned Inhibition in Rats is Impaired by Ablation of Serotoninergic Pathways , 1996, The European journal of neuroscience.

[39]  W. Schultz,et al.  A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task , 1999, Neuroscience.

[40]  S. Ikemoto,et al.  The role of nucleus accumbens dopamine in motivated behavior: a unifying interpretation with special reference to reward-seeking , 1999, Brain Research Reviews.

[41]  J. Mirenowicz,et al.  Dissociation of Pavlovian and instrumental incentive learning under dopamine antagonists. , 2000, Behavioral neuroscience.

[42]  K. Berridge,et al.  Intra-Accumbens Amphetamine Increases the Conditioned Incentive Salience of Sucrose Reward: Enhancement of Reward “Wanting” without Enhanced “Liking” or Response Reinforcement , 2000, The Journal of Neuroscience.

[43]  Douglas W. Jones,et al.  Genotype Influences In Vivo Dopamine Transporter Availability in Human Striatum , 2000, Neuropsychopharmacology.

[44]  J. Horvitz Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events , 2000, Neuroscience.

[45]  A. Dickinson,et al.  Involvement of the central nucleus of the amygdala and nucleus accumbens core in mediating Pavlovian influences on instrumental behaviour , 2001, The European journal of neuroscience.

[46]  J. Endicott,et al.  Reliability and validity of a structured interview guide for the Hamilton Anxiety Rating Scale (SIGH‐A) , 2001, Depression and anxiety.

[47]  D. Weinberger,et al.  Serotonergic dysfunction, negative mood states, and response to alcohol. , 2001, Alcoholism, clinical and experimental research.

[48]  D. Kupfer,et al.  Amphetamine-induced dopamine release in human ventral striatum correlates with euphoria , 2001, Biological Psychiatry.

[49]  W. Schultz,et al.  Dopamine responses comply with basic assumptions of formal learning theory , 2001, Nature.

[50]  G. Ainslie Breakdown of will , 2001 .

[51]  Sham M. Kakade,et al.  Opponent interactions between serotonin and dopamine , 2002, Neural Networks.

[52]  M. El-Sabaawi Breakdown of Will , 2002 .

[53]  A. Heinz,et al.  Dopaminergic dysfunction in alcoholism and schizophrenia – psychopathological and behavioral correlates , 2002, European Psychiatry.

[54]  A. Dagher,et al.  Alcohol promotes dopamine release in the human nucleus accumbens , 2003, Synapse.

[55]  Karl J. Friston,et al.  Temporal Difference Models and Reward-Related Learning in the Human Brain , 2003, Neuron.

[56]  S. Killcross,et al.  Coordination of actions and habits in the medial prefrontal cortex of rats. , 2003, Cerebral cortex.

[57]  P. Corr,et al.  A two-dimensional neuropsychology of defense: fear/anxiety and defensive distance , 2004, Neuroscience & Biobehavioral Reviews.

[58]  Karl J. Friston,et al.  Dissociable Roles of Ventral and Dorsal Striatum in Instrumental Conditioning , 2004, Science.

[59]  Michael J. Frank,et al.  By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism , 2004, Science.

[60]  R. Beninger,et al.  The use of extinction to investigate the nature of neuroleptic-induced avoidance deficits , 2004, Psychopharmacology.

[61]  S. Maier,et al.  Inescapable shock activates serotonergic neurons in all raphe nuclei of rat , 2004, Behavioural Brain Research.

[62]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[63]  S. Maier,et al.  Effect of number of tailshocks on learned helplessness and activation of serotonergic and noradrenergic neurons in the rat , 2005, Behavioural Brain Research.

[64]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[65]  B. Balleine,et al.  Double Dissociation of Basolateral and Central Amygdala Lesions on the General and Outcome-Specific Forms of Pavlovian-Instrumental Transfer , 2005, The Journal of Neuroscience.

[66]  Michael J. Frank,et al.  Dynamic Dopamine Modulation in the Basal Ganglia: A Neurocomputational Account of Cognitive Deficits in Medicated and Nonmedicated Parkinsonism , 2005, Journal of Cognitive Neuroscience.

[67]  B. Balleine Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits , 2005, Physiology & Behavior.

[68]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[69]  P. Glimcher,et al.  Midbrain Dopamine Neurons Encode a Quantitative Reward Prediction Error Signal , 2005, Neuron.

[70]  H. Seung,et al.  JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 2005, 84, 581–617 NUMBER 3(NOVEMBER) LINEAR-NONLINEAR-POISSON MODELS OF PRIMATE CHOICE DYNAMICS , 2022 .

[71]  P. Glimcher,et al.  JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 2005, 84, 555–579 NUMBER 3(NOVEMBER) DYNAMIC RESPONSE-BY-RESPONSE MODELS OF MATCHING BEHAVIOR IN RHESUS MONKEYS , 2022 .

[72]  D. Kumaran,et al.  Frames, Biases, and Rational Decision-Making in the Human Brain , 2006, Science.

[73]  W. Hauber,et al.  Inactivation of the ventral tegmental area abolished the general excitatory influence of Pavlovian cues on instrumental performance. , 2006, Learning & memory.

[74]  Peter Dayan,et al.  Non-commercial Research and Educational Use including without Limitation Use in Instruction at Your Institution, Sending It to Specific Colleagues That You Know, and Providing a Copy to Your Institution's Administrator. All Other Uses, Reproduction and Distribution, including without Limitation Comm , 2022 .

[75]  M. Bouton Learning and Behavior: A Contemporary Synthesis , 2006 .

[76]  E. Vaadia,et al.  Midbrain dopamine neurons encode decisions for future action , 2006, Nature Neuroscience.

[77]  P. Dayan,et al.  Tonic dopamine: opportunity costs and the control of response vigor , 2007, Psychopharmacology.

[78]  Q. Huys Reinforcers and control : towards a computational aetiology of depression , 2007 .

[79]  J. Wickens,et al.  Striatal contributions to reward and decision making: making sense of regional variations in a reiterated processing matrix. , 2007, Annals of the New York Academy of Sciences.

[80]  K. Lesch,et al.  Long story short: the serotonin transporter in emotion regulation and social cognition , 2007, Nature Neuroscience.

[81]  Michael J. Frank,et al.  Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning , 2007, Proceedings of the National Academy of Sciences.

[82]  M. Roesch,et al.  Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards , 2007, Nature Neuroscience.

[83]  J. Krakauer,et al.  Why Don't We Move Faster? Parkinson's Disease, Movement Vigor, and Implicit Motivation , 2007, The Journal of Neuroscience.

[84]  J. Wickens,et al.  Striatal Contributions to Reward and Decision Making , 2007 .

[85]  Peter Dayan,et al.  Serotonin, Inhibition, and Negative Mood , 2007, PLoS Comput. Biol..

[86]  B. Balleine,et al.  The Neural Mechanisms Underlying the Influence of Pavlovian Cues on Human Decision Making , 2008, The Journal of Neuroscience.

[87]  Brian Knutson,et al.  Anticipatory affect: neural correlates and consequences for choice , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[88]  P. Dayan,et al.  Human Pavlovian–Instrumental Transfer , 2008, The Journal of Neuroscience.

[89]  K. Berridge,et al.  Mesolimbic Dopamine in Desire and Dread: Enabling Motivation to Be Generated by Localized Glutamate Disruptions in Nucleus Accumbens , 2008, The Journal of Neuroscience.

[90]  B. Sahakian,et al.  Acute Tryptophan Depletion in Healthy Volunteers Enhances Punishment Prediction but Does not Affect Reward Prediction , 2008, Neuropsychopharmacology.

[91]  T. Robbins,et al.  Serotoninergic regulation of emotional and behavioural control processes , 2008, Trends in Cognitive Sciences.

[92]  M. Andrés Learning and behavior: A contemporary synthesis , 2008 .

[93]  Karl J. Friston,et al.  Bayesian model selection for group studies , 2009, NeuroImage.

[94]  Sylvia M. L. Cox,et al.  Striatal Dopamine Responses to Intranasal Cocaine Self-Administration in Humans , 2009, Biological Psychiatry.

[95]  Karl J. Friston,et al.  Bayesian model selection for group studies (vol 46, pg 1005, 2009) , 2009 .

[96]  Michael X. Cohen,et al.  Dorsal Striatal–midbrain Connectivity in Humans Predicts How Reinforcements Are Used to Guide Decisions , 2009, Journal of Cognitive Neuroscience.

[97]  T. Robbins,et al.  Approach and avoidance learning in patients with major depression and healthy controls: relation to anhedonia , 2009, Psychological Medicine.

[98]  R. Palmiter,et al.  Dopamine Is Necessary for Cue-Dependent Fear Conditioning , 2009, The Journal of Neuroscience.

[99]  M. Ungless,et al.  Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli , 2009, Proceedings of the National Academy of Sciences.

[100]  M. Frank,et al.  Genetic contributions to avoidance-based decisions: striatal D2 receptor polymorphisms , 2009, Neuroscience.

[101]  I. Toni,et al.  On the neural control of social emotional behavior. , 2009, Social cognitive and affective neuroscience.

[102]  P. Dayan,et al.  Serotonin in affective control. , 2009, Annual review of neuroscience.

[103]  T. Robbins,et al.  Reconciling the Role of Serotonin in Behavioral Inhibition and Aversion: Acute Tryptophan Depletion Abolishes Punishment-Induced Inhibition in Humans , 2009, The Journal of Neuroscience.

[104]  A. Hama Predictably Irrational: The Hidden Forces That Shape Our Decisions , 2010 .

[105]  Colin Camerer,et al.  Pavlovian Processes in Consumer Choice: The Physical Presence of a Good Increases Willingness-to-Pay , 2010 .

[106]  S. Nakanishi,et al.  Distinct Roles of Synaptic Transmission in Direct and Indirect Striatal Pathways to Reward and Aversive Behavior , 2010, Neuron.

[107]  W. Hauber,et al.  The role of nucleus accumbens dopamine in outcome encoding in instrumental and Pavlovian conditioning , 2010, Neurobiology of Learning and Memory.

[108]  K. Berridge,et al.  Desire and Dread from the Nucleus Accumbens: Cortical Glutamate and Subcortical GABA Differentially Generate Motivation and Hedonic Impact in the Rat , 2010, PloS one.

[109]  Murtaza Z Mogri,et al.  Supporting Online Material Materials and Methods Som Text Figs. S1 to S8 References Cell Type–specific Loss of Bdnf Signaling Mimics Optogenetic Control of Cocaine Reward , 2022 .

[110]  S. Haber,et al.  The Reward Circuit: Linking Primate Anatomy and Human Imaging , 2010, Neuropsychopharmacology.

[111]  M. Frank,et al.  Neurogenetics and Pharmacology of Learning, Motivation, and Cognition , 2011, Neuropsychopharmacology.

[112]  N. Daw,et al.  Serotonin and Dopamine: Unifying Affective, Activational, and Decision Functions , 2011, Neuropsychopharmacology.

[113]  P. Dayan,et al.  Opponency Revisited: Competition and Cooperation Between Dopamine and Serotonin , 2010, Neuropsychopharmacology.

[114]  Dirk Ifenthaler,et al.  Stochastic Models of Learning , 2012 .