Learning and generalization of motor skills by learning from demonstration

We provide a general approach for learning robotic motor skills from human demonstration. To represent an observed movement, a non-linear differential equation is learned such that it reproduces this movement. Based on this representation, we build a library of movements by labeling each recorded movement according to task and context (e.g., grasping, placing, and releasing). Our differential equation is formulated such that generalization can be achieved simply by adapting a start and a goal parameter in the equation to the desired position values of a movement. For object manipulation, we present how our framework extends to the control of gripper orientation and finger position. The feasibility of our approach is demonstrated in simulation as well as on the Sarcos dextrous robot arm. The robot learned a pick-and-place operation and a water-serving task and could generalize these tasks to novel situations.

[1]  J.S. Yuan,et al.  Closed-loop manipulator control using quaternion feedback , 1988, IEEE J. Robotics Autom..

[2]  Gillian M. Hayes,et al.  Imitative Learning Mechanisms in Robots and Humans , 1996 .

[3]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[4]  Jun Nakanishi,et al.  Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[5]  Jun Nakanishi,et al.  Movement imitation with nonlinear dynamical systems in humanoid robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[6]  K. Dautenhahn,et al.  Imitation in Animals and Artifacts , 2002 .

[7]  Jun Nakanishi,et al.  Control, Planning, Learning, and Imitation with Dynamic Movement Primitives , 2003 .

[8]  Stefan Schaal,et al.  Computational approaches to motor learning by imitation. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[9]  Brett R Fajen,et al.  Behavioral dynamics of steering, obstacle avoidance, and route selection. , 2003, Journal of experimental psychology. Human perception and performance.

[10]  Roland Siegwart,et al.  Robot learning from demonstration , 2004, Robotics Auton. Syst..

[11]  Christopher W. Geib,et al.  Object Action Complexes as an Interface for Planning and Robot Control , 2006 .

[12]  Jun Nakanishi,et al.  Experimental Evaluation of Task Space Position/Orientation Control Towards Compliant Control for Humanoid Robots , 2007 .

[13]  S. Schaal,et al.  Combining dynamic movement primitives and potential fields for online obstacle avoidance , 2008 .

[14]  Jun Nakanishi,et al.  Operational Space Control: A Theoretical and Empirical Comparison , 2008, Int. J. Robotics Res..

[15]  Ales Ude,et al.  Perceiving Objects and Movements to Generate Actions on a Humanoid Robot , 2008 .

[16]  Stefan Schaal,et al.  Movement reproduction and obstacle avoidance with dynamic movement primitives and potential fields , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[17]  Stefan Schaal,et al.  Biologically-inspired dynamical systems for movement generation: Automatic real-time goal adaptation and obstacle avoidance , 2009, 2009 IEEE International Conference on Robotics and Automation.