State-Regularized Policy Search for Linearized Dynamical Systems
暂无分享,去创建一个
Jan Peters | Hany Abdulsamad | Gerhard Neumann | Oleg Arenz | Jan Peters | G. Neumann | Hany Abdulsamad | O. Arenz
[1] Sergey Levine,et al. Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics , 2014, NIPS.
[2] Jan Peters,et al. A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.
[3] Marc Toussaint,et al. Robot trajectory optimization using approximate inference , 2009, ICML '09.
[4] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.
[5] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[6] David Q. Mayne,et al. Differential dynamic programming , 1972, The Mathematical Gazette.
[7] Andrew G. Barto,et al. Robot Weightlifting By Direct Policy Search , 2001, IJCAI.
[8] Anind K. Dey,et al. Modeling Interaction via the Principle of Maximum Causal Entropy , 2010, ICML.
[9] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[10] Yuval Tassa,et al. Synthesis and stabilization of complex behaviors through online trajectory optimization , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[11] Sergey Levine,et al. Guided Policy Search , 2013, ICML.
[12] R Bellman,et al. DYNAMIC PROGRAMMING AND LAGRANGE MULTIPLIERS. , 1956, Proceedings of the National Academy of Sciences of the United States of America.
[13] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..
[14] Marc Toussaint,et al. On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference , 2012, Robotics: Science and Systems.
[15] Jan Peters,et al. Robust policy updates for stochastic optimal control , 2014, 2014 IEEE-RAS International Conference on Humanoid Robots.
[16] E. Todorov,et al. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..