Bonsai Trees in Your Head: How the Pavlovian System Sculpts Goal-Directed Choices by Pruning Decision Trees

When planning a series of actions, it is usually infeasible to consider all potential future sequences; instead, one must prune the decision tree. Provably optimal pruning is, however, still computationally ruinous and the specific approximations humans employ remain unknown. We designed a new sequential reinforcement-based task and showed that human subjects adopted a simple pruning strategy: during mental evaluation of a sequence of choices, they curtailed any further evaluation of a sequence as soon as they encountered a large loss. This pruning strategy was Pavlovian: it was reflexively evoked by large losses and persisted even when overwhelmingly counterproductive. It was also evident above and beyond loss aversion. We found that the tendency towards Pavlovian pruning was selectively predicted by the degree to which subjects exhibited sub-clinical mood disturbance, in accordance with theories that ascribe Pavlovian behavioural inhibition, via serotonin, a role in mood disorders. We conclude that Pavlovian behavioural inhibition shapes highly flexible, goal-directed choices in a manner that may be important for theories of decision-making in mood disorders.
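The pruning strategy described above can be illustrated with a minimal sketch. This is not the authors' task or model: the toy tree, the reward values, the function name `best_value`, and the hard threshold `prune_below` are all hypothetical, and pruning is made deterministic here purely for simplicity. The sketch compares full evaluation of a small decision tree with an evaluation that stops looking ahead as soon as a large loss is encountered.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy environment: state -> list of (next_state, reward).
Transitions = Dict[str, List[Tuple[str, float]]]

TOY: Transitions = {
    "start": [("a", -70.0), ("b", -20.0)],  # path via "a" begins with a large loss
    "a": [("a_end", 140.0)],                # ...but is followed by a large gain
    "b": [("b_end", 20.0)],
    "a_end": [],
    "b_end": [],
}


def best_value(
    state: str,
    depth: int,
    transitions: Transitions,
    prune_below: Optional[float] = None,
) -> float:
    """Return the best summed reward reachable within `depth` steps.

    If `prune_below` is set, any transition whose reward is at or below
    that threshold is not evaluated further: the branch is valued at the
    immediate reward alone (an assumed, simplified form of pruning).
    """
    if depth == 0 or not transitions[state]:
        return 0.0
    values = []
    for next_state, reward in transitions[state]:
        if prune_below is not None and reward <= prune_below:
            # Large loss encountered: curtail evaluation of this sequence.
            values.append(reward)
        else:
            future = best_value(next_state, depth - 1, transitions, prune_below)
            values.append(reward + future)
    return max(values)


full = best_value("start", 2, TOY)                        # 70.0
pruned = best_value("start", 2, TOY, prune_below=-70.0)   # 0.0
print(full, pruned)
```

In this toy tree the pruned evaluation never discovers the gain that lies behind the large loss, so it settles for the inferior path. This is the kind of situation in which the abstract describes pruning as counterproductive.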
