Jointly learning trajectory generation and hitting point prediction in robot table tennis

This paper proposes a combined learning framework for a table tennis robot. In a typical robot table tennis setup, a single striking point is predicted for the robot on the basis of the ball's initial state. Subsequently, the desired Cartesian racket state and the desired joint states at the striking time are determined. Finally, robot joint trajectories are generated. Instead of predicting a single striking point, we propose to construct a ball trajectory prediction map, which predicts the ball's entire rebound trajectory using the ball's initial state. We construct as well a robot trajectory generation map, which predicts the robot joint movement pattern and the movement duration using the Cartesian racket trajectories without the need of inverse kinematics, where a correlation function is used to adapt these joint movement parameters according to the ball flight trajectory. With joint movement parameters, we can directly generate joint trajectories. Additionally, we introduce a reinforcement learning approach to modify robot joint trajectories such that the robot can return balls well. We validate this new framework in both the simulated and the real robotic systems and illustrate that a seven degree-of-freedom Barrett WAM robot performs well.

[1]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[2]  Ole Ravn,et al.  Ping-pong robotics with high-speed vision system , 2012, 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV).

[3]  Fumio Miyazaki,et al.  A learning approach to robotic table tennis , 2005, IEEE Transactions on Robotics.

[4]  Jan Peters,et al.  Reinforcement Learning to Adjust Robot Movements to New Situations , 2010, IJCAI.

[5]  De Xu,et al.  Visual Measurement and Prediction of Ball Trajectory for Table Tennis Robot , 2010, IEEE Transactions on Instrumentation and Measurement.

[6]  Andrew W. Moore,et al.  Locally Weighted Learning , 1997, Artificial Intelligence Review.

[7]  De Xu,et al.  Control system design for a 5-DOF table tennis robot , 2010, 2010 11th International Conference on Control Automation Robotics & Vision.

[8]  Jan Peters,et al.  Learning table tennis with a Mixture of Motor Primitives , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[9]  Jan Peters,et al.  A biomimetic approach to robot table tennis , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  De Xu,et al.  Adding Active Learning to LWR for Ping-Pong Playing Robot , 2013, IEEE Transactions on Control Systems Technology.

[11]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[12]  Bernhard Schölkopf,et al.  Learning optimal striking points for a ping-pong playing robot , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Qiang Huang,et al.  Design of a humanoid ping-pong player robot with redundant joints , 2013, 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[14]  Jan Peters,et al.  A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.