Online optimal trajectory generation for robot table tennis

Abstract In highly dynamic tasks that involve moving targets, planning is necessary to figure out when, where and how to intercept the target. In robotic table tennis in particular, motion planning can be very challenging due to time constraints, dimension of the search space and joint limits. Conventional planning algorithms often rely on a fixed virtual hitting plane to construct robot striking trajectories. These algorithms, however, generate restrictive strokes and can result in unnatural strategies when compared with human playing. In this paper, we introduce a new trajectory generation framework for robotic table tennis that does not involve a fixed hitting plane. A free-time optimal control approach is used to derive two different trajectory optimizers. The resulting two algorithms, Focused Player and Defensive Player, encode two different play-styles. We evaluate their performance in simulation and in our robot table tennis platform with a high speed cable-driven seven DOF robot arm. The algorithms return the balls with a higher probability to the opponent’s court when compared with a virtual hitting plane based method. Moreover, both can be run online and the trajectories can be corrected with new ball observations.

[1]  Fumio Miyazaki,et al.  A learning approach to robotic table tennis , 2005, IEEE Transactions on Robotics.

[2]  M. Powell A Direct Search Optimization Method That Models the Objective and Constraint Functions by Linear Interpolation , 1994 .

[3]  Jason L. Speyer,et al.  Primer on Optimal Control Theory , 2010 .

[4]  Russell L. Anderson,et al.  A Robot Ping-Pong Player: Experiments in Real-Time Intelligent Control , 1988 .

[5]  Jian Chu,et al.  Impedance Control and its Effects on a Humanoid Robot Playing Table Tennis , 2012 .

[6]  Ole Ravn,et al.  Ping-pong robotics with high-speed vision system , 2012, 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV).

[7]  H. Schättler,et al.  Geometric Optimal Control: Theory, Methods and Examples , 2012 .

[8]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[9]  Qiang Huang,et al.  Design of a humanoid ping-pong player robot with redundant joints , 2013, 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[10]  H. W. Sorenson,et al.  Kalman filtering : theory and application , 1985 .

[11]  Dominique Sugny Geometric Optimal Control Theory in spin systems , 2010 .

[12]  De Xu,et al.  Adding Active Learning to LWR for Ping-Pong Playing Robot , 2013, IEEE Transactions on Control Systems Technology.

[13]  Alexander Dietrich,et al.  Catching flying balls with a mobile humanoid: System overview and design considerations , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[14]  T. I. James Tsay,et al.  Self-Learning for a Humanoid Robotic Ping-Pong Player , 2011, Adv. Robotics.

[15]  L. S. Pontryagin,et al.  Mathematical Theory of Optimal Processes , 1962 .

[16]  Marina Bosch,et al.  A Robot Ping Pong Player Experiment In Real Time Intelligent Control , 2016 .

[17]  M. Spong,et al.  Robot Modeling and Control , 2005 .

[18]  De Xu,et al.  Trajectory prediction of spinning ball for ping-pong player robot , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Jan Peters,et al.  A biomimetic approach to robot table tennis , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  Christoph H. Lampert,et al.  Real-time detection of colored objects in multiple camera streams with off-the-shelf hardware components , 2012, Journal of Real-Time Image Processing.

[21]  Peters Jan,et al.  Jointly learning trajectory generation and hitting point prediction in robot table tennis , 2016 .

[22]  Yongsheng Zhao,et al.  Model Based Motion State Estimation and Trajectory Prediction of Spinning Ball for Ping-Pong Robots using Expectation-Maximization Algorithm , 2017, J. Intell. Robotic Syst..

[23]  A Nakashima,et al.  Modeling of rebound phenomenon of a rigid ball with friction and elastic effects , 2010, Proceedings of the 2010 American Control Conference.

[24]  Daniel Liberzon,et al.  Calculus of Variations and Optimal Control Theory: A Concise Introduction , 2012 .

[25]  Juan A. Méndez,et al.  Ping-pong player prototype , 2003, IEEE Robotics Autom. Mag..

[26]  Russell L. Andersson Robot Ping-Pong , 2003 .

[27]  R. Shumway,et al.  AN APPROACH TO TIME SERIES SMOOTHING AND FORECASTING USING THE EM ALGORITHM , 1982 .

[28]  Bernhard Schölkopf,et al.  Learning optimal striking points for a ping-pong playing robot , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[29]  Leslie Pack Kaelbling,et al.  Tracking the spin on a ping pong ball with the quaternion Bingham filter , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[30]  Jun Nakanishi,et al.  Spatio-temporal stiffness optimization with switching dynamics , 2017, Auton. Robots.

[31]  Gerd Hirzinger,et al.  Trajectory planning for optimal robot catching in real-time , 2011, 2011 IEEE International Conference on Robotics and Automation.

[32]  Yongsheng Zhao,et al.  Real-Time Spin Estimation of Ping-Pong Ball Using Its Natural Brand , 2015, IEEE Transactions on Instrumentation and Measurement.

[33]  Aude Billard,et al.  Learning Motion Dynamics to Catch a Flying Object , 2010 .

[34]  Jan Peters,et al.  A new trajectory generation framework in robotic table tennis , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[35]  Oliver Kroemer,et al.  Learning to select and generalize striking movements in robot table tennis , 2012, AAAI Fall Symposium: Robots Learning Interactively from Human Teachers.

[36]  George M. Siouris,et al.  Applied Optimal Control: Optimization, Estimation, and Control , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[37]  J. Koenderink Q… , 2014, Les noms officiels des communes de Wallonie, de Bruxelles-Capitale et de la communaute germanophone.