暂无分享,去创建一个
Martin A. Riedmiller | Thomas Lampe | Olivier Pietquin | Nicolas Heess | Todd Hester | Fumin Wang | Bilal Piot | Jonathan Scholz | Matej Vecerík | Thomas Rothörl | N. Heess | Matej Vecerík | T. Lampe | Bilal Piot | Todd Hester | Thomas Rothörl | O. Pietquin | Jonathan Scholz | Fumin Wang | Thomas Lampe
[1] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .
[2] E. Todorov,et al. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..
[3] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[4] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[5] Siddhartha S. Srinivasa,et al. Imitation learning for locomotion and manipulation , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.
[6] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[7] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[8] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.
[9] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[10] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[11] Matthieu Geist,et al. A Cascaded Supervised Learning Approach to Inverse Reinforcement Learning , 2013, ECML/PKDD.
[12] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[13] Robert E. Schapire,et al. A Game-Theoretic Approach to Apprenticeship Learning , 2007, NIPS.
[14] Sergey Levine,et al. Guided Policy Search , 2013, ICML.
[15] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[16] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[17] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[18] Sergey Levine,et al. Continuous Deep Q-Learning with Model-based Acceleration , 2016, ICML.
[19] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[20] Michael H. Bowling,et al. Apprenticeship learning using linear programming , 2008, ICML '08.
[21] Tom Schaul,et al. Deep Q-learning From Demonstrations , 2017, AAAI.