Policy Learning for Motor Skills
暂无分享,去创建一个
[1] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[2] Stefan Schaal,et al. Computational approaches to motor learning by imitation. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.
[3] Jun Nakanishi,et al. Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.
[4] S. Schaal. Dynamic Movement Primitives -A Framework for Motor Control in Humans and Humanoid Robotics , 2006 .
[5] Jin Yu,et al. Natural Actor-Critic for Road Traffic Optimisation , 2006, NIPS.
[6] Bruno Siciliano,et al. Modeling and Control of Robot Manipulators , 1995 .
[7] L. Siciliano. Modelling and Control of Robot Manipulators , 2000 .
[8] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[9] Andrew Zisserman,et al. Advances in Neural Information Processing Systems (NIPS) , 2007 .
[10] Geoffrey E. Hinton,et al. Using Expectation-Maximization for Reinforcement Learning , 1997, Neural Computation.
[11] Sham M. Kakade,et al. A Natural Policy Gradient , 2001, NIPS.
[12] Jun Nakanishi,et al. A unifying methodology for the control of robotic systems , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[13] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[14] Douglas Aberdeen,et al. POMDPs and Policy Gradients , 2006 .
[15] Stefan Schaal,et al. Learning Operational Space Control , 2006, Robotics: Science and Systems.
[16] Stefan Schaal,et al. Reinforcement Learning for Humanoid Robotics , 2003 .
[17] Douglas Aberdeen,et al. Policy-Gradient Algorithms for Partially Observable Markov Decision Processes , 2003 .