Lyapunov-Constrained Action Sets for Reinforcement Learning
暂无分享,去创建一个
[1] R. E. Kalman,et al. Control System Analysis and Design Via the “Second Method” of Lyapunov: II—Discrete-Time Systems , 1960 .
[2] Francis L. Merat,et al. Introduction to robotics: Mechanics and control , 1987, IEEE J. Robotics Autom..
[3] W. Grantham,et al. Lyapunov optimal feedback control of a nonlinear inverted pendulum , 1989 .
[4] Daniel E. Koditschek,et al. Exact robot navigation using artificial potential functions , 1992, IEEE Trans. Robotics Autom..
[5] Roderic A. Grupen,et al. Robust Reinforcement Learning in Motion Planning , 1993, NIPS.
[6] Roderic A. Grupen,et al. The applications of harmonic functions to robotics , 1993, J. Field Robotics.
[7] Gerald Tesauro,et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.
[8] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[9] Richard S. Sutton,et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding , 1995, NIPS.
[10] Paul E. Utgoff,et al. On integrating apprentice learning and reinforcement learning , 1996 .
[11] Gary Boone,et al. Efficient reinforcement learning: model-based Acrobot control , 1997, Proceedings of International Conference on Robotics and Automation.
[12] Gary Boone,et al. Minimum-time control of the Acrobot , 1997, Proceedings of International Conference on Robotics and Automation.
[13] Ronald E. Parr,et al. Hierarchical control and learning for markov decision processes , 1998 .
[14] Stuart J. Russell,et al. Bayesian Q-Learning , 1998, AAAI/IAAI.
[15] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[16] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[17] Doina Precup,et al. Temporal abstraction in reinforcement learning , 2000, ICML 2000.
[18] Gerald DeJong,et al. Hidden Strengths and Limitations: An Empirical Investigation of Reinforcement Learning , 2000, ICML.
[19] Jude W. Shavlik,et al. Creating Advice-Taking Reinforcement Learners , 1998, Machine Learning.