An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm
暂无分享,去创建一个
[1] Vijay R. Konda,et al. OnActor-Critic Algorithms , 2003, SIAM J. Control. Optim..
[2] T. Moon,et al. Mathematical Methods and Algorithms for Signal Processing , 1999 .
[3] H. He,et al. Efficient Reinforcement Learning Using Recursive Least-Squares Methods , 2011, J. Artif. Intell. Res..
[4] Shigenobu Kobayashi,et al. Reinforcement Learning in POMDPs with Function Approximation , 1997, ICML.
[5] Stefan Schaal,et al. Reinforcement Learning for Humanoid Robotics , 2003 .
[6] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[7] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[8] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[9] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[10] Shun-ichi Amari,et al. Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.