Goal-directed control and its antipodes
[1] P. Dayan,et al. A Bayesian formulation of behavioral control , 2009, Cognition.
[2] Peter Dayan,et al. Values and Actions in Aversion , 2009 .
[3] P. Dayan,et al. Serotonin in affective control. , 2009, Annual review of neuroscience.
[4] Alex S. Taylor,et al. Machine intelligence , 2009, CHI.
[5] P. Dayan,et al. Flexible shaping: How learning in small steps helps , 2009, Cognition.
[6] E. Rolls,et al. The orbitofrontal cortex and beyond: From affect to decision-making , 2008, Progress in Neurobiology.
[7] John R. Anderson,et al. The acquisition of robust and flexible cognitive skills. , 2008, Journal of experimental psychology. General.
[8] David Silver,et al. Achieving Master Level Play in 9 × 9 Computer Go , 2008, AAAI.
[9] B. Balleine,et al. Calculating Consequences: Brain Systems That Encode the Causal Effects of Actions , 2008, The Journal of Neuroscience.
[10] B. Balleine,et al. The Neural Mechanisms Underlying the Influence of Pavlovian Cues on Human Decision Making , 2008, The Journal of Neuroscience.
[11] John R. Anderson,et al. Solving the credit assignment problem: explicit and implicit learning of action sequences with probabilistic outcomes , 2008, Psychological research.
[12] K. Berridge,et al. Emotional environments retune the valence of appetitive versus fearful functions in nucleus accumbens , 2008, Nature Neuroscience.
[13] John R. Anderson,et al. A central circuit of the mind , 2008, Trends in Cognitive Sciences.
[14] S. Lammel,et al. Unique Properties of Mesoprefrontal Neurons within a Dual Mesocorticolimbic Dopamine System , 2008, Neuron.
[15] B. Everitt,et al. Cocaine Seeking Habits Depend upon Dopamine-Dependent Serial Connectivity Linking the Ventral with the Dorsal Striatum , 2008, Neuron.
[16] P. Dayan,et al. Human Pavlovian–Instrumental Transfer , 2008, The Journal of Neuroscience.
[17] M. Roesch,et al. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards , 2007, Nature Neuroscience.
[18] J. O'Doherty,et al. Lights, Camembert, Action! The Role of Human Orbitofrontal Cortex in Encoding Stimuli, Rewards, and Choices , 2007, Annals of the New York Academy of Sciences.
[19] M. D’Esposito,et al. Functional Magnetic Resonance Imaging Evidence for a Hierarchical Organization of the Prefrontal Cortex , 2007, J. Cogn. Neurosci..
[20] P. Glimcher,et al. The neural correlates of subjective value during intertemporal choice , 2007, Nature Neuroscience.
[21] E. Koechlin,et al. Anterior Prefrontal Function and the Limits of Human Decision-Making , 2007, Science.
[22] G. Buzsáki,et al. Forward and reverse hippocampal place-cell sequences during ripples , 2007, Nature Neuroscience.
[23] M Kawato,et al. Internal models for motor control. , 2007, Novartis Foundation symposium.
[24] Peter Dayan,et al. Bilinearity, Rules, and Prefrontal Cortex , 2007, Frontiers Comput. Neurosci..
[25] Q. Huys. Reinforcers and control : towards a computational aetiology of depression , 2007 .
[26] C. Summerfield,et al. An information theoretical approach to prefrontal executive function , 2007, Trends in Cognitive Sciences.
[27] Vivian V. Valentin,et al. Determining the Neural Substrates of Goal-Directed Learning in the Human Brain , 2007, The Journal of Neuroscience.
[28] P. Dayan,et al. Tonic dopamine: opportunity costs and the control of response vigor , 2007, Psychopharmacology.
[29] D. Kahneman,et al. Frames and brains: elicitation and control of response tendencies , 2007, Trends in Cognitive Sciences.
[30] Peter Dayan,et al. [Title missing in source] .
[31] E. Vaadia,et al. Midbrain dopamine neurons encode decisions for future action , 2006, Nature Neuroscience.
[32] John R. Anderson,et al. From recurrent choice to skill learning: a reinforcement-learning model. , 2006, Journal of experimental psychology. General.
[33] B. Balleine,et al. Parallel incentive processing: an integrated view of amygdala function , 2006, Trends in Neurosciences.
[34] David J. Foster,et al. Reverse replay of behavioural sequences in hippocampal place cells during the awake state , 2006, Nature.
[35] Michael J. Frank,et al. Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia , 2006, Neural Computation.
[36] F. Velde,et al. Neural blackboard architectures of combinatorial structures in cognition , 2006 .
[37] B. Balleine. Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits , 2005, Physiology & Behavior.
[38] P. Dayan,et al. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.
[39] Jonathan D. Cohen,et al. Prefrontal cortex and flexible cognitive control: rules without symbols. , 2005, Proceedings of the National Academy of Sciences of the United States of America.
[40] L. Squire. Memory systems of the brain: A brief history and current perspective , 2004, Neurobiology of Learning and Memory.
[41] Samuel M. McClure,et al. Separate Neural Systems Value Immediate and Delayed Monetary Rewards , 2004, Science.
[42] John R Anderson,et al. An integrated theory of the mind. , 2004, Psychological review.
[43] R. Hertwig,et al. Decisions from Experience and the Effect of Rare Events in Risky Choice , 2004, Psychological science.
[44] Jonathan Evans. In two minds: dual-process accounts of reasoning , 2003, Trends in Cognitive Sciences.
[45] S. Killcross,et al. Coordination of actions and habits in the medial prefrontal cortex of rats. , 2003, Cerebral cortex.
[46] Albert K. Lee,et al. Memory of Sequential Experience in the Hippocampus during Slow Wave Sleep , 2002, Neuron.
[47] John R. Anderson,et al. Why do children learn to say “Broke”? A model of learning the past tense without feedback , 2002, Cognition.
[48] K. Berridge,et al. Positive and Negative Motivation in Nucleus Accumbens Shell: Bivalent Rostrocaudal Gradients for GABA-Elicited Eating, Taste “Liking”/“Disliking” Reactions, Place Preference/Avoidance, and Fear , 2002, The Journal of Neuroscience.
[49] H. Pashler. STEVENS' HANDBOOK OF EXPERIMENTAL PSYCHOLOGY , 2002 .
[50] D. Kahneman,et al. Heuristics and Biases: The Psychology of Intuitive Judgment , 2002 .
[51] D. Kahneman,et al. Representativeness revisited: Attribute substitution in intuitive judgment. , 2002 .
[52] Sham M. Kakade,et al. Opponent interactions between serotonin and dopamine , 2002, Neural Networks.
[53] Rajesh P. N. Rao,et al. Probabilistic Models of the Brain: Perception and Neural Function , 2002 .
[54] Isaac Meilijson,et al. Evolution of Reinforcement Learning in Uncertain Environments: A Simple Explanation for Complex Foraging Behaviors , 2002, Adapt. Behav..
[55] Michael J. Frank,et al. Interactions between frontal cortex and basal ganglia in working memory: A computational model , 2001, Cognitive, affective & behavioral neuroscience.
[56] K. Berridge,et al. Fear and Feeding in the Nucleus Accumbens Shell: Rostrocaudal Segregation of GABA-Elicited Defensive Behavior Versus Eating Behavior , 2001, The Journal of Neuroscience.
[57] J. Driver,et al. Control of Cognitive Processes: Attention and Performance XVIII , 2000 .
[58] K. Stanovich,et al. Heuristics and Biases: Individual Differences in Reasoning: Implications for the Rationality Debate? , 2002 .
[59] Nikolaus R. McFarland,et al. Striatonigrostriatal Pathways in Primates Form an Ascending Spiral from the Shell to the Dorsolateral Striatum , 2000, The Journal of Neuroscience.
[60] D. Joel,et al. The connections of the dopaminergic system with the striatum in rats and primates: an analysis with respect to the functional and compartmental organization of the striatum , 2000, Neuroscience.
[61] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[62] W. Schultz,et al. A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task , 1999, Neuroscience.
[63] A. Miyake,et al. Models of Working Memory: Mechanisms of Active Maintenance and Executive Control , 1999 .
[64] Jonathan D. Cohen,et al. A Biologically Based Computational Model of Working Memory , 1999 .
[65] Peter Dayan. Recurrent sampling models for the Helmholtz machine , 1999 .
[66] D M Wolpert,et al. Multiple paired forward and inverse models for motor control , 1998, Neural Networks.
[67] C. Lebiere,et al. The Atomic Components of Thought , 1998 .
[68] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.
[69] J. March. Learning to be risk averse. , 1996 .
[70] B. McNaughton,et al. Replay of Neuronal Firing Sequences in Rat Hippocampus During Sleep Following Spatial Experience , 1996, Science.
[71] P. Dayan,et al. A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of Neuroscience.
[72] Michael I. Jordan,et al. An internal model for sensorimotor integration. , 1995, Science.
[73] Geoffrey E. Hinton,et al. The Helmholtz Machine , 1995, Neural Computation.
[74] P. Goldman-Rakic,et al. Modulation of memory fields by dopamine D1 receptors in prefrontal cortex , 1995, Nature.
[75] Geoffrey E. Hinton,et al. The "wake-sleep" algorithm for unsupervised neural networks. , 1995, Science.
[76] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[77] Richard S. Sutton,et al. TD Models: Modeling the World at a Mixture of Time Scales , 1995, ICML.
[78] S. Epstein. Integration of the cognitive and the psychodynamic unconscious. , 1994, The American psychologist.
[79] Allen Newell,et al. Mechanisms of Skill Acquisition and the Law of Practice , 1993 .
[80] Peter Dayan,et al. Improving Generalization for Temporal Difference Learning: The Successor Representation , 1993, Neural Computation.
[81] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 1992, Machine Learning.
[82] J. Deakin,et al. 5-HT and mechanisms of defence , 1991, Journal of psychopharmacology.
[83] A. Newell. Unified Theories of Cognition , 1990 .
[84] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.
[85] Richard S. Sutton,et al. Learning and Sequential Decision Making , 1989 .
[86] G. Logan. Toward an instance theory of automatization. , 1988 .
[87] John McCarthy,et al. SOME PHILOSOPHICAL PROBLEMS FROM THE STANDPOINT OF ARTIFICIAL INTELLIGENCE , 1987 .
[88] A. Dickinson. Actions and habits: the development of behavioural autonomy , 1985 .
[89] R. Shiffrin,et al. Automatic and controlled processing revisited. , 1984, Psychological review.
[90] John R. Anderson. Acquisition of cognitive skill. , 1982 .
[91] M. Seligman,et al. Learned helplessness: Theory and evidence. , 1976 .
[92] A. Tversky,et al. BELIEF IN THE LAW OF SMALL NUMBERS , 1971, Psychological Bulletin.
[93] D. R. Williams,et al. Auto-maintenance in the pigeon: sustained pecking despite contingent non-reinforcement. , 1969, Journal of the experimental analysis of behavior.
[94] K. Breland,et al. The misbehavior of organisms. , 1961 .
[95] E. R. Crossman. A THEORY OF THE ACQUISITION OF SPEED-SKILL , 1959 .
[96] Colin Camerer,et al. Neuroeconomics: decision making and the brain , 2008 .
[97] Jonathan D. Cohen,et al. On the Control of Control: The Role of Dopamine in Regulating Prefrontal Function and Working Memory , 2007 .
[98] Emanuel Todorov,et al. Optimal Control Theory , 2006 .
[99] M. Wilson,et al. Temporally Structured Replay of Awake Hippocampal Ensemble Activity during Rapid Eye Movement Sleep , 2001, Neuron.
[100] M. Just,et al. Computational modeling of high‐level cognition and brain function , 1999, Human brain mapping.
[101] S. Chaiken,et al. Dual-process theories in social psychology , 1999 .
[102] D E Kieras,et al. A computational theory of executive cognitive processes and multiple-task performance: Part 1. Basic mechanisms. , 1997, Psychological review.
[103] S. Sloman. The empirical case for two systems of reasoning. , 1996 .
[104] A. Barto. Adaptive Critics and the Basal Ganglia , 1995 .
[105] Joel L. Davis,et al. Models of Information Processing in the Basal Ganglia , 1995 .
[106] Mitsuo Kawato,et al. A forward-inverse optics model of reciprocal connections between visual cortical areas , 1993 .
[107] M. Just,et al. A capacity theory of comprehension: Individual differences in working memory , 1992 .
[108] D. J. Felleman,et al. Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.
[109] P. Goldman-Rakic,et al. Preface: Cerebral Cortex Has Come of Age , 1991 .
[110] C. Watkins. Learning from delayed rewards , 1989 .
[111] K. R. Hammond. Human judgment and social policy , 1980 .
[112] Walter Schneider,et al. Controlled and Automatic Human Information Processing: 1. Detection, Search, and Attention. , 1977 .
[113] R. Bolles. Species-specific defense reactions and avoidance learning. , 1970 .
[114] Paul M. Fitts,et al. Perceptual-Motor Skill Learning , 1964 .
[115] A. W. Melton. Categories of Human Learning , 1964 .