A framework for learning biped locomotion with dynamical movement primitives

This article summarizes our framework for learning biped locomotion using dynamical movement primitives based on nonlinear oscillators. Our ultimate goal is to establish a design principle of a controller in order to achieve natural humanlike locomotion. We suggest dynamical movement primitives as a central pattern generator (CPG) of a biped robot, an approach we have previously proposed for learning and encoding complex human movements. Demonstrated trajectories are learned through movement primitives by locally weighted regression, and the frequency of the learned trajectories is adjusted automatically by a frequency adaptation algorithm based on phase resetting and entrainment of coupled oscillators. Numerical simulations and experimental implementation on a physical robot demonstrate the effectiveness of the proposed locomotion controller. Furthermore, we demonstrate that phase resetting contributes to robustness against external perturbations and environmental changes by numerical simulations and experiments.

[1]  M. Kawato Transient and steady state phase response curves of limit cycle oscillators , 1982 .

[2]  Yoshiki Kuramoto,et al.  Chemical Oscillations, Waves, and Turbulence , 1984, Springer Series in Synergetics.

[3]  Jessica K. Hodgins,et al.  Biped gait transitions , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[4]  Katsuhisa Furuta,et al.  Juggling control using neural oscillators , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[5]  S. Schaal,et al.  A Kendama Learning Robot Based on Bi-directional Theory , 1996, Neural Networks.

[6]  Masami Ito,et al.  A mathematical model of adaptation in rhythmic motion to environmental changes , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[7]  Mitsuo Kawato,et al.  A tennis serve and upswing learning robot based on bi-directional theory , 1998, Neural Networks.

[8]  Yasuo Kuniyoshi,et al.  Three dimensional bipedal stepping motion using neural oscillators-towards humanoid motion in the real world , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[9]  Christopher G. Atkeson,et al.  Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.

[10]  Matthew M. Williamson,et al.  Neural control of rhythmic arm movements , 1998, Neural Networks.

[11]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[12]  Jun Nakanishi,et al.  Self-organizing control of urban traffic signal network , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[13]  Kazuo Tsuchiya,et al.  Adaptive gait pattern control of a quadruped locomotion robot , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[14]  Shinya Kotosaka,et al.  Synchronized Robot Drumming by Neural Oscillator , 2001 .

[15]  Auke Jan Ijspeert,et al.  A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander , 2001, Biological Cybernetics.

[16]  Daniel E. Koditschek,et al.  Phase Regulation of Decentralized Cyclic Robotic Systems , 2002, Int. J. Robotics Res..

[17]  Jun Nakanishi,et al.  Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[18]  장윤희,et al.  Y. , 2003, Industrial and Labor Relations Terms.

[19]  Yasuhiro Fukuoka,et al.  Adaptive Dynamic Walking of a Quadruped Robot on Irregular Terrain Based on Biological Concepts , 2003, Int. J. Robotics Res..

[20]  Stefan Schaal,et al.  Reinforcement Learning for Humanoid Robotics , 2003 .

[21]  Shinya Aoi,et al.  Locomotion control of a biped locomotion robot using nonlinear oscillators , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[22]  Taiga Yamasaki,et al.  Possible functional roles of phase resetting during walking , 2003, Biological Cybernetics.

[23]  Jun Morimoto,et al.  Minimax differential dynamic programming: application to a biped walking robot , 2003, SICE 2003 Annual Conference (IEEE Cat. No.03TH8734).

[24]  Hiroshi Shimizu,et al.  Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment , 1991, Biological Cybernetics.

[25]  Kiyotoshi Matsuoka,et al.  Sustained oscillations generated by mutually inhibiting neurons with adaptation , 1985, Biological Cybernetics.

[26]  Jun Morimoto,et al.  An empirical exploration of a neural oscillator for biped locomotion control , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[27]  Jun Morimoto,et al.  Learning from demonstration and adaptation of biped locomotion , 2004, Robotics Auton. Syst..

[28]  Kazunori Hase,et al.  Computational evolution of human bipedal walking by a neuro-musculo-skeletal model , 1999, Artificial Life and Robotics.