Learning to Serve: An Experimental Study for a New Learning From Demonstrations Framework

Learning from demonstrations is an easy and intuitive way to show examples of successful behavior to a robot. However, the fact that humans optimize or take advantage of their body and not of the robot, usually called the embodiment problem in robotics, often prevents industrial robots from executing the task in a straightforward way. The shown movements often do not or cannot utilize the degrees of freedom of the robot efficiently, and moreover can suffer from excessive execution errors. In this letter, we explore a variety of solutions that address these shortcomings. In particular, we learn sparse movement primitive parameters from several demonstrations of a successful table tennis serve. The number of parameters learned using our procedure is independent of the degrees of freedom of the robot. Moreover, they can be ranked according to their importance in the regression task. Learning few parameters, which are ranked, is a desirable feature to combat the curse of dimensionality in reinforcement learning. Real robot experiments on the Barrett WAM for a table tennis serve using the learned movement primitives show that the representation can capture successfully the style of the movement with few parameters.

[1]  Jan Peters,et al.  Noname manuscript No. (will be inserted by the editor) Policy Search for Motor Primitives in Robotics , 2022 .

[2]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[3]  Christoph H. Lampert,et al.  Movement templates for learning of hitting and batting , 2010, 2010 IEEE International Conference on Robotics and Automation.

[4]  Jan Peters,et al.  A new trajectory generation framework in robotic table tennis , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[5]  Aude Billard,et al.  Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[6]  Russell L. Anderson,et al.  A Robot Ping-Pong Player: Experiments in Real-Time Intelligent Control , 1988 .

[7]  Jan Peters,et al.  Optimizing robot striking movement primitives with Iterative Learning Control , 2015, 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids).

[8]  Jan Peters,et al.  Online optimal trajectory generation for robot table tennis , 2018, Robotics Auton. Syst..

[9]  Marina Bosch,et al.  A Robot Ping Pong Player Experiment In Real Time Intelligent Control , 2016 .

[10]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[11]  Aude Billard,et al.  Learning control Lyapunov function to ensure stability of dynamical system-based robot reaching motions , 2014, Robotics Auton. Syst..

[12]  Bernhard Schölkopf,et al.  Learning optimal striking points for a ping-pong playing robot , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Oliver Kroemer,et al.  Learning to select and generalize striking movements in robot table tennis , 2012, AAAI Fall Symposium: Robots Learning Interactively from Human Teachers.

[14]  Oliver Kroemer,et al.  Probabilistic movement primitives for coordination of multiple human–robot collaborative tasks , 2017, Auton. Robots.

[15]  Michael I. Jordan,et al.  Multi-task feature selection , 2006 .

[16]  Jan Peters,et al.  Probabilistic Movement Primitives , 2013, NIPS.

[17]  Bernhard Schölkopf,et al.  Jointly learning trajectory generation and hitting point prediction in robot table tennis , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).

[18]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[19]  Jan Peters,et al.  A biomimetic approach to robot table tennis , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  Manfred Morari,et al.  Model predictive control: Theory and practice - A survey , 1989, Autom..

[21]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[22]  Yoshikazu Hayakawa,et al.  Analysis of effects of rebounds and aerodynamics for trajectory of table tennis ball , 2010, Proceedings of SICE Annual Conference 2010.