论文信息 - Dopamine: generalization and bonuses - 字舞流文

Dopamine: generalization and bonuses

Peter Dayan | Sham M. Kakade | S. Kakade | P. Dayan

[1] Sham M. Kakade,et al. Opponent interactions between serotonin and dopamine , 2002, Neural Networks.

[2] Roland E. Suri,et al. TD models of reward predictive responses in dopamine neurons , 2002, Neural Networks.

[3] Ronen I. Brafman,et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..

[4] W. Schultz,et al. Dopamine responses comply with basic assumptions of formal learning theory , 2001, Nature.

[5] Christopher C. Pack,et al. Temporal dynamics of a neural solution to the aperture problem in visual area MT of macaque brain , 2001, Nature.

[6] Peter Dayan,et al. Motivated Reinforcement Learning , 2001, NIPS.

[7] David S. Touretzky,et al. Behavioral considerations suggest an average reward TD model of the dopamine system , 2000, Neurocomputing.

[8] W. Schultz,et al. Modifications of reward expectation-related neuronal activity during learning in primate orbitofrontal cortex. , 2000, Journal of neurophysiology.

[9] W. Schultz,et al. Reward-related neuronal activity during go-nogo task performance in primate orbitofrontal cortex. , 2000, Journal of neurophysiology.

[10] J. Hollerman,et al. Reward processing in primate orbitofrontal cortex and basal ganglia. , 2000, Cerebral cortex.

[11] E. Rolls. The orbitofrontal cortex and reward. , 2000, Cerebral cortex.

[12] S. Ikemoto,et al. The role of nucleus accumbens dopamine in motivated behavior: a unifying interpretation with special reference to reward-seeking , 1999, Brain Research Reviews.

[13] L. Peltonen,et al. Association between novelty seeking and the type 4 dopamine receptor gene in a large Finnish cohort sample. , 1999, The American journal of psychiatry.

[14] G. Schoenbaum,et al. Orbitofrontal Cortex and Representation of Incentive Value in Associative Learning , 1999, The Journal of Neuroscience.

[15] Jonathan D. Cohen,et al. Cognition and control in schizophrenia: a computational model of dopamine and prefrontal function , 1999, Biological Psychiatry.

[16] W. Schultz,et al. A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task , 1999, Neuroscience.

[17] A. Paterson,et al. Dopamine D4 Receptor Gene: Novelty or Nonsense? , 1999, Neuropsychopharmacology.

[18] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.

[19] S. Gerhand. THE PREFRONTAL CORTEX—EXECUTIVE AND COGNITIVE FUNCTIONS. , 1999 .

[20] P. Redgrave,et al. Is the short-latency dopamine response too short to signal reward error? , 1999, Trends in Neurosciences.

[21] F. Guarraci,et al. An electrophysiological characterization of ventral tegmental area dopaminergic neurons during differential pavlovian fear conditioning in the awake rabbit , 1999, Behavioural Brain Research.

[22] G. Schoenbaum,et al. Neural Encoding in Orbitofrontal Cortex and Basolateral Amygdala during Olfactory Discrimination Learning , 1999, The Journal of Neuroscience.

[23] J. Pearce,et al. The Influence of Background Stimuli on Summation in Autoshaping , 1999 .

[24] P. Holland,et al. Amygdala circuitry in attentional and representational processes , 1999, Trends in Cognitive Sciences.

[25] T. Robbins,et al. The prefrontal cortex: Executive and cognitive functions. , 1998 .

[26] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[27] J. Gray,et al. Dopamine's role. , 1997, Science.

[28] J. Horvitz,et al. Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat , 1997, Brain Research.

[29] P. Holland,et al. The Role of an Amygdalo-Nigrostriatal Pathway in Associative Learning , 1997, The Journal of Neuroscience.

[30] C. Gallistel,et al. Toward a neurobiology of temporal cognition: advances and challenges , 1997, Current Opinion in Neurobiology.

[31] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.

[32] P. Holland,et al. Neurotoxic Lesions of Basolateral, But Not Central, Amygdala Interfere with Pavlovian Second-Order Conditioning and Reinforcer Devaluation Effects , 1996, The Journal of Neuroscience.

[33] V. Brown,et al. Covert Orienting of Attention in the Rat and the Role of Striatal Dopamine , 1996, The Journal of Neuroscience.

[34] M. Bardo,et al. Psychobiology of novelty seeking and drug seeking behavior , 1996, Behavioural Brain Research.

[35] P. Dayan,et al. A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[36] W. Schultz,et al. Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.

[37] D. Signorini,et al. Neural networks , 1995, The Lancet.

[38] Peter Dayan,et al. Bee foraging in uncertain environments using predictive hebbian learning , 1995, Nature.

[39] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[40] P. Goldman-Rakic,et al. Modulation of memory fields by dopamine Dl receptors in prefrontal cortex , 1995, Nature.

[41] W. Schultz,et al. Importance of unpredictability for reward responses in primate dopamine neurons. , 1994, Journal of neurophysiology.

[42] P. Kalivas,et al. Involvement of dopamine and excitatory amino acid transmission in novelty-induced motor activity. , 1994, The Journal of pharmacology and experimental therapeutics.

[43] J. Salamone. The involvement of nucleus accumbens dopamine in appetitive and aversive motivation , 1994, Behavioural Brain Research.

[44] W. Schultz,et al. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[45] W. Schultz. Activity of dopamine neurons in the behaving primate , 1992 .

[46] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[47] W. Schultz,et al. Dopamine neurons of the monkey midbrain: contingencies of responses to stimuli eliciting immediate behavioral reactions. , 1990, Journal of neurophysiology.

[48] Stephen Grossberg,et al. Neural dynamics of adaptive timing and temporal discrimination during associative learning , 1989, Neural Networks.

[49] S. Grossberg,et al. Neural dynamics of attentionally modulated Pavlovian conditioning: Conditioned reinforcement, inhibition, and opponent processing , 1987, Psychobiology.

[50] R M Church,et al. Properties of the Internal Clock a , 1984, Annals of the New York Academy of Sciences.

[51] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[52] R. Solomon,et al. An opponent-process theory of motivation. I. Temporal dynamics of affect. , 1974, Psychological review.

[53] K. Breland,et al. The misbehavior of organisms. , 1961 .

[54] Jr. William Rush Dunton,et al. THE AMERICAN JOURNAL OF PSYCHIATRY , 1944 .

[55] B. Skinner,et al. Principles of Behavior , 1944 .

[56] Bruno A. Olshausen,et al. Book Review , 2003, Journal of Cognitive Neuroscience.

[57] E. Rolls,et al. Abstract reward and punishment representations in the human orbitofrontal cortex , 2001, Nature Neuroscience.

[58] Kenji Doya,et al. Reinforcement Learning in Continuous Time and Space , 2000, Neural Computation.

[59] Peter Dayan,et al. Dopamine Bonuses , 2000, NIPS.

[60] D. Helmeste,et al. Dopamine D4 receptors. , 2000, Japanese journal of pharmacology.

[61] G. Schoenbaum,et al. Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning , 1998, Nature Neuroscience.

[62] Alexander J. Smola,et al. Neural Information Processing Systems , 1997, NIPS 1997.

[63] T. Nokes,et al. Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task , 1996 .

[64] H. V. Van Tol. The dopamine D4 receptor. , 1996, NIDA research monograph.

[65] Joel L. Davis,et al. Adaptive Critics and the Basal Ganglia , 1995 .

[66] Joel L. Davis,et al. A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement , 1994 .

[67] C. M. Gibbs,et al. Associative transfer and stimulus selection in classical conditioning of the rabbit's nictitating membrane response to serial compound CSs. , 1979, Journal of experimental psychology. Animal behavior processes.

[68] R. Solomon,et al. An Opponent-Process Theory of Motivation , 1978 .

[69] R. Rescorla. A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .

[70] W. F. Prokasy,et al. Classical conditioning II: Current research and theory. , 1972 .