论文信息 - Using Transitional Proximity for Faster Reinforcement Learning - 字舞流文

Using Transitional Proximity for Faster Reinforcement Learning

Andrew McCallum | A. McCallum

[1] Steven Douglas Whitehead,et al. Reinforcement learning for the adaptive control of perception and action , 1992 .

[2] Ming Tan,et al. Cost-Sensitive Reinforcement Learning for Adaptive Classification and Control , 1991, AAAI.

[3] Long Ji Lin,et al. Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[4] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.