Data-Efficient Generalization of Robot Skills with Contextual Policy Search
暂无分享,去创建一个
Jan Peters | Gerhard Neumann | Marc Peter Deisenroth | Andras Gabor Kupcsik | Jan Peters | M. Deisenroth | G. Neumann | A. Kupcsik
[1] Jun Nakanishi,et al. Learning Movement Primitives , 2005, ISRR.
[2] Jeff G. Schneider,et al. Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning , 1996, NIPS.
[3] Zoubin Ghahramani,et al. Sparse Gaussian Processes using Pseudo-inputs , 2005, NIPS.
[4] Christoph H. Lampert,et al. Movement templates for learning of hitting and batting , 2010, 2010 IEEE International Conference on Robotics and Automation.
[5] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[6] Jan Peters,et al. Policy Search for Motor Primitives in Robotics , 2008, NIPS 2008.
[7] Carl E. Rasmussen,et al. Learning to Control a Low-Cost Manipulator using Data-Efficient Reinforcement Learning , 2011, Robotics: Science and Systems.
[8] Darwin G. Caldwell,et al. Robot motor skill coordination with EM-based Reinforcement Learning , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[9] Stefan Schaal,et al. 2008 Special Issue: Reinforcement learning of motor skills with policy gradients , 2008 .
[10] Gerhard Neumann,et al. Variational Inference for Policy Search in changing situations , 2011, ICML.
[11] Stefan Schaal,et al. Reinforcement learning of motor skills in high dimensions: A path integral approach , 2010, 2010 IEEE International Conference on Robotics and Automation.
[12] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[13] Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.
[14] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.
[15] Pieter Abbeel,et al. Using inaccurate models in reinforcement learning , 2006, ICML.
[16] Jan Peters,et al. Reinforcement Learning to Adjust Robot Movements to New Situations , 2010, IJCAI.
[17] Jeff G. Schneider,et al. Autonomous helicopter control using reinforcement learning policy search methods , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).
[18] Christopher G. Atkeson,et al. A comparison of direct and model-based reinforcement learning , 1997, Proceedings of International Conference on Robotics and Automation.
[19] Peter Stone,et al. Policy gradient reinforcement learning for fast quadrupedal locomotion , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.
[20] Carl E. Rasmussen,et al. Robust Filtering and Smoothing with Gaussian Processes , 2012, IEEE Transactions on Automatic Control.
[21] Jan Peters,et al. Hierarchical Relative Entropy Policy Search , 2014, AISTATS.
[22] Jun Nakanishi,et al. Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.