Path integral-based stochastic optimal control for rigid body dynamics
暂无分享,去创建一个
Stefan Schaal | Evangelos Theodorou | Jonas Buchli | Evangelos A. Theodorou | S. Schaal | E. Theodorou | J. Buchli
[1] W. Fleming. Exit probabilities and optimal stochastic control , 1977 .
[2] B. Øksendal. Stochastic Differential Equations , 1985 .
[3] W. Fleming,et al. Controlled Markov processes and viscosity solutions , 1992 .
[4] Robert F. Stengel,et al. Optimal Control and Estimation , 1994 .
[5] S. Shreve,et al. Stochastic differential equations , 1955, Mathematical Proceedings of the Cambridge Philosophical Society.
[6] C. Atkeson,et al. Minimax differential dynamic programming: application to a biped walking robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).
[7] Emanuel Todorov,et al. Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems , 2004, ICINCO.
[8] H. Kappen. Linear theory for control of nonlinear stochastic systems. , 2004, Physical review letters.
[9] H. Kappen. Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.
[10] Weiwei Li,et al. An Iterative Optimal Control and Estimation Design for Nonlinear Stochastic System , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.
[11] J. Peters,et al. Using Reward-weighted Regression for Reinforcement Learning of Task Space Control , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.
[12] Stefan Schaal,et al. Reinforcement learning by reward-weighted regression for operational space control , 2007, ICML '07.
[13] Christopher G. Atkeson,et al. Random Sampling of States in Dynamic Programming , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[14] B. Balaji. Estimation of indirectly observable Langevin states: path integral solution using statistical physics methods , 2008 .
[15] Stefan Schaal,et al. 2008 Special Issue: Reinforcement learning of motor skills with policy gradients , 2008 .
[16] B. Balaji. Universal nonlinear filtering using Feynman path integrals II: the continuous-continuous model with additive noise , 2007, 0708.1663.