Reinforcement Learning in Situated Agents: Theoretical and Practical Solutions
[1] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[2] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[3] H. Kesten. Accelerated Stochastic Approximation , 1958 .
[4] Mark D. Pendrith,et al. Actual Return Reinforcement Learning versus Temporal Differences: Some Theoretical and Experimental Results , 1996, ICML.
[5] Richard S. Sutton,et al. Adapting Bias by Gradient Descent: An Incremental Version of Delta-Bar-Delta , 1992, AAAI.
[6] Rodney A. Brooks,et al. Intelligence Without Reason , 1991, IJCAI.
[7] George N. Saridis,et al. Learning Applied to Successive Approximation Algorithms , 1970, IEEE Trans. Syst. Sci. Cybern..
[8] Harold J. Kushner,et al. Stochastic approximation methods for constrained and unconstrained systems , 1978 .
[9] Robert A. Jacobs,et al. Increased rates of convergence through learning rate adaptation , 1987, Neural Networks.
[10] Richard S. Sutton,et al. Goal Seeking Components for Adaptive Intelligence: An Initial Assessment. , 1981 .
[11] John Moody,et al. Learning rate schedules for faster stochastic gradient search , 1992, Neural Networks for Signal Processing II Proceedings of the 1992 IEEE Workshop.