Hierarchical reinforcement learning of multiple grasping strategies with human instructions
[1] Oliver Kroemer,et al. Combining active learning and reactive control for robot grasping , 2010, Robotics Auton. Syst..
[2] Andreas Krause,et al. Advances in Neural Information Processing Systems (NIPS) , 2014 .
[3] Masashi Sugiyama,et al. Hierarchical Policy Search via Return-Weighted Density Estimation , 2017, AAAI.
[4] Stefan Schaal,et al. STOMP: Stochastic trajectory optimization for motion planning , 2011, 2011 IEEE International Conference on Robotics and Automation.
[5] Markus Vincze,et al. Learning grasps with topographic features , 2015, Int. J. Robotics Res..
[6] Doina Precup,et al. The Option-Critic Architecture , 2016, AAAI.
[7] Sergey Levine,et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..
[8] M. Spivak. A comprehensive introduction to differential geometry , 1979 .
[9] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..
[10] Peter K. Allen,et al. Data-driven grasping , 2011, Auton. Robots.
[11] Jan Peters,et al. Hierarchical Relative Entropy Policy Search , 2014, AISTATS.
[12] C. Rasmussen,et al. Gaussian Process Priors with Uncertain Inputs - Application to Multiple-Step Ahead Time Series Forecasting , 2002, NIPS.
[13] Pieter Abbeel,et al. An Algorithmic Perspective on Imitation Learning , 2018, Found. Trends Robotics.
[14] R. Howe,et al. Human grasp choice and robotic grasp analysis , 1990 .
[15] Pieter Abbeel,et al. Stochastic Neural Networks for Hierarchical Reinforcement Learning , 2016, ICLR.
[16] Jun Nakanishi,et al. Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.
[17] Marc Peter Deisenroth,et al. Efficient reinforcement learning using Gaussian processes , 2010 .
[18] Abhinav Gupta,et al. Supersizing self-supervision: Learning to grasp from 50K tries and 700 robot hours , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).
[19] John F. Canny,et al. Planning optimal grasps , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.
[20] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[21] Jan Peters,et al. Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[22] Honglak Lee,et al. Deep learning for detecting robotic grasps , 2013, Int. J. Robotics Res..
[23] Robert Platt,et al. Using Geometry to Detect Grasp Poses in 3D Point Clouds , 2015, ISRR.
[24] Andreas Krause,et al. Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.
[25] Oliver Kroemer,et al. Active reward learning with a novel acquisition function , 2015, Auton. Robots.
[26] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[27] David Hsu,et al. Learning Dynamic Robot-to-Human Object Handover from Human Feedback , 2016, ISRR.
[28] Danica Kragic,et al. Data-Driven Grasp Synthesis—A Survey , 2013, IEEE Transactions on Robotics.
[29] Siddhartha S. Srinivasa,et al. CHOMP: Covariant Hamiltonian optimization for motion planning , 2013, Int. J. Robotics Res..
[30] Paul J. Besl,et al. A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..
[31] Richard M. Murray,et al. A Mathematical Introduction to Robotic Manipulation , 1994 .
[32] Alexander Fabisch,et al. Active contextual policy search , 2014, J. Mach. Learn. Res..
[33] Agathe Girard,et al. Prediction at an Uncertain Input for Gaussian Processes and Relevance Vector Machines Application to Multiple-Step Ahead Time-Series Forecasting , 2002 .
[34] Vijay Kumar,et al. Robotic grasping and contact: a review , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).
[35] Christopher K. I. Williams,et al. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .
[36] Stefan Schaal,et al. Reinforcement learning by reward-weighted regression for operational space control , 2007, ICML '07.
[37] Jan Peters,et al. A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.
[38] Danica Kragic,et al. Classical grasp quality evaluation: New algorithms and theory , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[39] Peter Auer,et al. Using Confidence Bounds for Exploitation-Exploration Trade-offs , 2003, J. Mach. Learn. Res..
[40] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[41] Ales Leonardis,et al. One-shot learning and generation of dexterous grasps for novel objects , 2016, Int. J. Robotics Res..
[42] Jan Peters,et al. Experiments with Hierarchical Reinforcement Learning of Multiple Grasping Policies , 2016, ISER.
[43] Sergey Levine,et al. Learning Hand-Eye Coordination for Robotic Grasping with Large-Scale Data Collection , 2016, ISER.
[44] Kate Saenko,et al. Grasp Pose Detection in Point Clouds , 2017, Int. J. Robotics Res..
[45] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[46] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[47] Ai Poh Loh,et al. Model-based contextual policy search for data-efficient generalization of robot skills , 2017, Artif. Intell..
[48] Robert Platt,et al. Localizing Handle-Like Grasp Affordances in 3D Point Clouds , 2014, ISER.