Learning and generalization of complex tasks from unstructured demonstrations

We present a novel method for segmenting demonstrations, recognizing repeated skills, and generalizing complex tasks from unstructured demonstrations. This method combines many of the advantages of recent automatic segmentation methods for learning from demonstration into a single principled, integrated framework. Specifically, we use the Beta Process Autoregressive Hidden Markov Model (BP-AR-HMM) and Dynamic Movement Primitives (DMPs) to learn and generalize a multi-step task on the PR2 mobile manipulator, and we demonstrate the potential of our framework to build a large library of skills over time.
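To make the second component concrete, the sketch below rolls out a single discrete Dynamic Movement Primitive (DMP): a canonical system drives a learned forcing term that perturbs a spring-damper attractor toward a goal, which is what lets a segmented skill be replayed with new start and goal positions. This is a minimal illustration with standard textbook gains, not the authors' implementation; the function name, basis-function construction, and parameter values are assumptions.

```python
import numpy as np

def dmp_rollout(y0, g, weights, tau=1.0, dt=0.01,
                alpha_z=25.0, beta_z=6.25, alpha_x=3.0):
    """Integrate one discrete DMP (illustrative sketch).

    Canonical system:       tau * dx/dt = -alpha_x * x
    Transformation system:  tau * dz/dt = alpha_z*(beta_z*(g - y) - z) + f(x)*(g - y0)
                            tau * dy/dt = z
    f(x) is a normalized weighted sum of Gaussian basis functions, gated by x
    so the forcing term vanishes as the movement completes.
    """
    n_basis = len(weights)
    # Place basis centers along the canonical variable's exponential decay.
    centers = np.exp(-alpha_x * np.linspace(0.0, 1.0, n_basis))
    widths = 1.0 / (np.diff(centers, append=centers[-1] / 2.0) ** 2 + 1e-8)

    x, y, z = 1.0, y0, 0.0
    trajectory = []
    for _ in range(int(tau / dt)):
        psi = np.exp(-widths * (x - centers) ** 2)
        f = (psi @ weights) / (psi.sum() + 1e-10) * x
        # Euler integration of the transformation and canonical systems.
        z += dt / tau * (alpha_z * (beta_z * (g - y) - z) + f * (g - y0))
        y += dt / tau * z
        x += dt / tau * (-alpha_x * x)
        trajectory.append(y)
    return np.array(trajectory)
```

With zero weights the forcing term vanishes and the rollout is a critically damped reach from `y0` to `g`; in a full system the weights would be fit to each skill segment produced by the BP-AR-HMM, so the same primitive generalizes across goals.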
