Dopamine: generalization and bonuses
暂无分享,去创建一个
[1] Sham M. Kakade,et al. Opponent interactions between serotonin and dopamine , 2002, Neural Networks.
[2] Roland E. Suri,et al. TD models of reward predictive responses in dopamine neurons , 2002, Neural Networks.
[3] Ronen I. Brafman,et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..
[4] W. Schultz,et al. Dopamine responses comply with basic assumptions of formal learning theory , 2001, Nature.
[5] Christopher C. Pack,et al. Temporal dynamics of a neural solution to the aperture problem in visual area MT of macaque brain , 2001, Nature.
[6] Peter Dayan,et al. Motivated Reinforcement Learning , 2001, NIPS.
[7] David S. Touretzky,et al. Behavioral considerations suggest an average reward TD model of the dopamine system , 2000, Neurocomputing.
[8] W. Schultz,et al. Modifications of reward expectation-related neuronal activity during learning in primate orbitofrontal cortex. , 2000, Journal of neurophysiology.
[9] W. Schultz,et al. Reward-related neuronal activity during go-nogo task performance in primate orbitofrontal cortex. , 2000, Journal of neurophysiology.
[10] J. Hollerman,et al. Reward processing in primate orbitofrontal cortex and basal ganglia. , 2000, Cerebral cortex.
[11] E. Rolls. The orbitofrontal cortex and reward. , 2000, Cerebral cortex.
[12] S. Ikemoto,et al. The role of nucleus accumbens dopamine in motivated behavior: a unifying interpretation with special reference to reward-seeking , 1999, Brain Research Reviews.
[13] L. Peltonen,et al. Association between novelty seeking and the type 4 dopamine receptor gene in a large Finnish cohort sample. , 1999, The American journal of psychiatry.
[14] G. Schoenbaum,et al. Orbitofrontal Cortex and Representation of Incentive Value in Associative Learning , 1999, The Journal of Neuroscience.
[15] Jonathan D. Cohen,et al. Cognition and control in schizophrenia: a computational model of dopamine and prefrontal function , 1999, Biological Psychiatry.
[16] W. Schultz,et al. A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task , 1999, Neuroscience.
[17] A. Paterson,et al. Dopamine D4 Receptor Gene: Novelty or Nonsense? , 1999, Neuropsychopharmacology.
[18] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[19] S. Gerhand. THE PREFRONTAL CORTEX—EXECUTIVE AND COGNITIVE FUNCTIONS. , 1999 .
[20] P. Redgrave,et al. Is the short-latency dopamine response too short to signal reward error? , 1999, Trends in Neurosciences.
[21] F. Guarraci,et al. An electrophysiological characterization of ventral tegmental area dopaminergic neurons during differential pavlovian fear conditioning in the awake rabbit , 1999, Behavioural Brain Research.
[22] G. Schoenbaum,et al. Neural Encoding in Orbitofrontal Cortex and Basolateral Amygdala during Olfactory Discrimination Learning , 1999, The Journal of Neuroscience.
[23] J. Pearce,et al. The Influence of Background Stimuli on Summation in Autoshaping , 1999 .
[24] P. Holland,et al. Amygdala circuitry in attentional and representational processes , 1999, Trends in Cognitive Sciences.
[25] T. Robbins,et al. The prefrontal cortex: Executive and cognitive functions. , 1998 .
[26] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[27] J. Gray,et al. Dopamine's role. , 1997, Science.
[28] J. Horvitz,et al. Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat , 1997, Brain Research.
[29] P. Holland,et al. The Role of an Amygdalo-Nigrostriatal Pathway in Associative Learning , 1997, The Journal of Neuroscience.
[30] C. Gallistel,et al. Toward a neurobiology of temporal cognition: advances and challenges , 1997, Current Opinion in Neurobiology.
[31] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.
[32] P. Holland,et al. Neurotoxic Lesions of Basolateral, But Not Central, Amygdala Interfere with Pavlovian Second-Order Conditioning and Reinforcer Devaluation Effects , 1996, The Journal of Neuroscience.
[33] V. Brown,et al. Covert Orienting of Attention in the Rat and the Role of Striatal Dopamine , 1996, The Journal of Neuroscience.
[34] M. Bardo,et al. Psychobiology of novelty seeking and drug seeking behavior , 1996, Behavioural Brain Research.
[35] P. Dayan,et al. A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.
[36] W. Schultz,et al. Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.
[37] D. Signorini,et al. Neural networks , 1995, The Lancet.
[38] Peter Dayan,et al. Bee foraging in uncertain environments using predictive hebbian learning , 1995, Nature.
[39] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[40] P. Goldman-Rakic,et al. Modulation of memory fields by dopamine Dl receptors in prefrontal cortex , 1995, Nature.
[41] W. Schultz,et al. Importance of unpredictability for reward responses in primate dopamine neurons. , 1994, Journal of neurophysiology.
[42] P. Kalivas,et al. Involvement of dopamine and excitatory amino acid transmission in novelty-induced motor activity. , 1994, The Journal of pharmacology and experimental therapeutics.
[43] J. Salamone. The involvement of nucleus accumbens dopamine in appetitive and aversive motivation , 1994, Behavioural Brain Research.
[44] W. Schultz,et al. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.
[45] W. Schultz. Activity of dopamine neurons in the behaving primate , 1992 .
[46] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[47] W. Schultz,et al. Dopamine neurons of the monkey midbrain: contingencies of responses to stimuli eliciting immediate behavioral reactions. , 1990, Journal of neurophysiology.
[48] Stephen Grossberg,et al. Neural dynamics of adaptive timing and temporal discrimination during associative learning , 1989, Neural Networks.
[49] S. Grossberg,et al. Neural dynamics of attentionally modulated Pavlovian conditioning: Conditioned reinforcement, inhibition, and opponent processing , 1987, Psychobiology.
[50] R M Church,et al. Properties of the Internal Clock a , 1984, Annals of the New York Academy of Sciences.
[51] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[52] R. Solomon,et al. An opponent-process theory of motivation. I. Temporal dynamics of affect. , 1974, Psychological review.
[53] K. Breland,et al. The misbehavior of organisms. , 1961 .
[54] Jr. William Rush Dunton,et al. THE AMERICAN JOURNAL OF PSYCHIATRY , 1944 .
[55] B. Skinner,et al. Principles of Behavior , 1944 .
[56] Bruno A. Olshausen,et al. Book Review , 2003, Journal of Cognitive Neuroscience.
[57] E. Rolls,et al. Abstract reward and punishment representations in the human orbitofrontal cortex , 2001, Nature Neuroscience.
[58] Kenji Doya,et al. Reinforcement Learning in Continuous Time and Space , 2000, Neural Computation.
[59] Peter Dayan,et al. Dopamine Bonuses , 2000, NIPS.
[60] D. Helmeste,et al. Dopamine D4 receptors. , 2000, Japanese journal of pharmacology.
[61] G. Schoenbaum,et al. Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning , 1998, Nature Neuroscience.
[62] Alexander J. Smola,et al. Neural Information Processing Systems , 1997, NIPS 1997.
[63] T. Nokes,et al. Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task , 1996 .
[64] H. V. Van Tol. The dopamine D4 receptor. , 1996, NIDA research monograph.
[65] Joel L. Davis,et al. Adaptive Critics and the Basal Ganglia , 1995 .
[66] Joel L. Davis,et al. A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement , 1994 .
[67] C. M. Gibbs,et al. Associative transfer and stimulus selection in classical conditioning of the rabbit's nictitating membrane response to serial compound CSs. , 1979, Journal of experimental psychology. Animal behavior processes.
[68] R. Solomon,et al. An Opponent-Process Theory of Motivation , 1978 .
[69] R. Rescorla. A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .
[70] W. F. Prokasy,et al. Classical conditioning II: Current research and theory. , 1972 .