Control of Musculoskeletal Systems Using Learned Dynamics Models

Controlling musculoskeletal systems, especially robots actuated by pneumatic artificial muscles, is a challenging task due to nonlinearities, hysteresis effects, massive actuator delay, and unobservable dependencies such as temperature. Despite such difficulties, muscular systems offer many beneficial properties to achieve human-comparable performance in uncertain and fast-changing tasks. For example, muscles are backdrivable and provide variable stiffness while offering high forces to reach high accelerations. In addition, the embodied intelligence deriving from the compliance might reduce the control demands for specific tasks. In this letter, we address the problem of how to accurately control musculoskeletal robots. To address this issue, we propose to learn probabilistic forward dynamics models using Gaussian processes and, subsequently, to employ these models for control. However, Gaussian processes dynamics models cannot be set up for our musculoskeletal robot as for traditional motor-driven robots because of unclear state composition, etc. We hence empirically study and discuss in detail how to tune these approaches to complex musculoskeletal robots and their specific challenges. Moreover, we show that our model can be used to accurately control an antagonistic pair of pneumatic artificial muscles for a trajectory tracking task while considering only one-step-ahead predictions of the forward model and incorporating model uncertainty.

[1]  Carl E. Rasmussen,et al.  Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[3]  Neil D. Lawrence,et al.  Variational Inference for Latent Variables and Uncertain Inputs in Gaussian Processes , 2016, J. Mach. Learn. Res..

[4]  Bertrand Tondu,et al.  Modelling of the McKibben artificial muscle: A review , 2012 .

[5]  Koh Hosoda,et al.  Anthropomorphic Muscular–Skeletal Robotic Upper Limb for Understanding Embodied Intelligence , 2012, Adv. Robotics.

[6]  Duy Nguyen-Tuong,et al.  Optimizing Long-term Predictions for Model-based Policy Search , 2017, CoRL.

[7]  Daniel G. Sbarbaro-Hofer,et al.  Multivariable Generalized Minimum Variance Control Based on Artificial Neural Networks and Gaussian Process Models , 2004, ISNN.

[8]  F. Zajac Muscle and tendon: properties, models, scaling, and application to biomechanics and motor control. , 1989, Critical reviews in biomedical engineering.

[9]  Ching-Ping Chou,et al.  Static and dynamic characteristics of McKibben pneumatic artificial muscles , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[10]  S. Billings Nonlinear System Identification: NARMAX Methods in the Time, Frequency, and Spatio-Temporal Domains , 2013 .

[11]  Duy Nguyen-Tuong,et al.  Local Gaussian Process Regression for Real Time Online Model Learning , 2008, NIPS.

[12]  Michael Günther,et al.  Intelligence by mechanics , 2007, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[13]  James Hensman,et al.  Identification of Gaussian Process State Space Models , 2017, NIPS.

[14]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[15]  Zoubin Ghahramani,et al.  Sparse Gaussian Processes using Pseudo-inputs , 2005, NIPS.

[16]  Jan Peters,et al.  Model Learning with Local Gaussian Process Regression , 2009, Adv. Robotics.

[17]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[18]  Carl E. Rasmussen,et al.  Gaussian Process Training with Input Noise , 2011, NIPS.

[19]  Jan Peters,et al.  Model learning for robot control: a survey , 2011, Cognitive Processing.

[20]  Krister Svanberg,et al.  A Class of Globally Convergent Optimization Methods Based on Conservative Convex Separable Approximations , 2002, SIAM J. Optim..

[21]  Jan Peters,et al.  A lightweight robotic arm with pneumatic muscles for robot learning , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[22]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[23]  Pierre Lopez,et al.  Modeling and control of McKibben artificial muscle robot actuators , 2000 .

[24]  Jan Peters,et al.  Stability of Controllers for Gaussian Process Dynamics , 2017, J. Mach. Learn. Res..

[25]  Juš Kocijan,et al.  System Identification with GP Models , 2016 .