Robot Program 59. Robot Programming by Demonstration

Robot programming by demonstration (PbD) has become a central topic of robotics that spans across general research areas such as human-robot interaction, machine learning, machine vision and motor control. Robot PbD started about 30 years ago, and has grown importantly during the past decade. The rationale for moving from purely preprogrammed robots to very flexible user-based interfaces for training robots to perform a task is threefold. First and foremost, PbD, also referred to as imitation learning, is a powerful mechanism for reducing the complexity of search spaces for learning. When observing either good or bad examples, one can reduce the search for a possible solution, by either starting the search from the observed good solution (local optima), or conversely, by eliminating from the search space what is known as a bad solution. Imitation learning is, thus, a powerful tool for enhancing and accelerating learning in both animals and artifacts. Second, imitation learning offers an implicit means of training a machine, such that explicit and tedious programming of a task by a human user can be minimized or eliminated (Fig. 59.1). Imitation learning is thus a natural means of interacting with a machine that would be accessible to lay people. Third, studying and modeling the coupling of perception and action, which is at the core of imitation learning, helps us to understand the mechanisms by which the self-organization of perception and action could arise during development. The reciprocal interaction of perception and action could explain how competence in motor control can be grounded in rich structure of perceptual variables, and vice versa, how the processes of perception can develop as means to create successful actions. PbD promises were thus multiple. On the one hand, one hoped that it would make learning faster, in contrast to tedious reinforcement learning methods or trials-and-error learning. On the other hand, one expected that the methods , being user-friendly, would enhance the application of robots in human daily environments. Recent progresses in the field, which we review in this chapter, show that the field has made a leap forward during the past decade toward these goals. In addition, we anticipate that these promises may be fulfilled very soon. This chapter Sect. 59.1 presents a brief historical overview of robot Programming by Demonstration (PbD), introducing several issues that will be discussed later in this chapter. Section Sect. 59.2 reviews engineering approaches to robot PbD with an emphasis …

[1]  J. Piaget Play, dreams and imitation in childhood , 1951 .

[2]  J. Maunsell,et al.  Functional properties of neurons in middle temporal visual area of the macaque monkey. II. Binocular interactions and sensitivity to binocular disparity. , 1983, Journal of neurophysiology.

[3]  Jean-Claude Latombe,et al.  An Approach to Automatic Robot Programming Based on Inductive Learning , 1984 .

[4]  Anthony Levas,et al.  A user-friendly high-level robot teaching system , 1984, ICRA.

[5]  Gerald DeJong,et al.  Explanation-based manipulator learning: Acquisition of planning ability through observation , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[6]  Christopher G. Atkeson,et al.  Using Local Models to Control Movement , 1989, NIPS.

[7]  Andrew W. Moore,et al.  Fast, Robust Adaptive Control by Learning only Forward Models , 1991, NIPS.

[8]  Katsushi Ikeuchi,et al.  Towards an assembly plan from observation. I. Assembly task recognition using face-contact relations (polyhedral objects) , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[9]  Ales Ude,et al.  Trajectory generation from noisy positions of object features for teaching robot paths , 1993, Robotics Auton. Syst..

[10]  Dana H. Ballard,et al.  Recognizing teleoperated manipulations , 1993, [1993] Proceedings IEEE International Conference on Robotics and Automation.

[11]  Sheng Liu,et al.  Teaching and learning of deburring robots using neural networks , 1993, [1993] Proceedings IEEE International Conference on Robotics and Automation.

[12]  Yangsheng Xu,et al.  Hidden Markov model approach to skill learning and its application to telerobotics , 1993, IEEE Trans. Robotics Autom..

[13]  Masayuki Inaba,et al.  Learning by watching: extracting reusable task knowledge from visual observation of human performance , 1994, IEEE Trans. Robotics Autom..

[14]  Ales Ude,et al.  Acquisition of Elementary Robot Skills from Human Demonstration , 1995 .

[15]  Kerstin Dautenhahn,et al.  Getting to know each other - Artificial social intelligence for autonomous robots , 1995, Robotics Auton. Syst..

[16]  Stefan Schaal,et al.  From Isolation to Cooperation: An Alternative View of a System of Experts , 1995, NIPS.

[17]  Avinash C. Kak,et al.  Automatic learning of assembly tasks using a DataGlove system , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[18]  Katsushi Ikeuchi,et al.  A robot system that observes and replicates grasping tasks , 1995, Proceedings of IEEE International Conference on Computer Vision.

[19]  Geir Hovland,et al.  Skill acquisition from human demonstration using a hidden Markov model , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[20]  Nathan Delson,et al.  Robot programming by human demonstration: adaptation and inconsistency in constrained motion , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[21]  N. Kambhatla Local models and Gaussian mixture models for statistical data processing , 1996 .

[22]  P. Bakker,et al.  Robot See, Robot Do : An Overview of Robot Imitation , 1996 .

[23]  Rüdiger Dillmann,et al.  Building elementary robot skills from human demonstration , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[24]  S. K. Tso,et al.  Hidden Markov model for intelligent extraction of robot trajectory command from demonstrated trajectories , 1996, Proceedings of the IEEE International Conference on Industrial Technology (ICIT'96).

[25]  Yangsheng Xu,et al.  Online, interactive learning of gestures for human/robot interfaces , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[26]  Yangsheng Xu,et al.  Human action learning via hidden Markov model , 1997, IEEE Trans. Syst. Man Cybern. Part A.

[27]  Sorin Moga,et al.  From Perception-Action Loops to Imitation Processes: A Bottom-Up Approach of Learning by Imitation , 1998, Appl. Artif. Intell..

[28]  Christopher G. Atkeson,et al.  Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.

[29]  Michael A. Arbib,et al.  Modeling parietal-premotor interactions in primate control of grasping , 1998, Neural Networks.

[30]  D M Wolpert,et al.  Multiple paired forward and inverse models for motor control , 1998, Neural Networks.

[31]  Aude Billard,et al.  Grounding communication in autonomous robots: An experimental study , 1998, Robotics Auton. Syst..

[32]  Balaraman Ravindran,et al.  Improved Switching among Temporally Abstract Actions , 1998, NIPS.

[33]  B. Scassellati Imitation and mechanisms of joint attention: a developmental structure for building social skills on a humanoid robot , 1999 .

[34]  S. Schaal,et al.  Segmentation of endpoint trajectories does not imply segmented control , 1999, Experimental Brain Research.

[35]  Aude Billard,et al.  DRAMA, a Connectionist Architecture for Control and Learning in Autonomous Robots , 1999, Adapt. Behav..

[36]  J. Mazziotta,et al.  Cortical mechanisms of human imitation. , 1999, Science.

[37]  Perry Y. Li,et al.  Passive velocity field control of mechanical manipulators , 1995, IEEE Trans. Robotics Autom..

[38]  G. Rizzolatti,et al.  Resonance behaviors and mirror neurons. , 1999, Archives italiennes de biologie.

[39]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[40]  Stefan Schaal,et al.  Nonparametric Regression for Learning Nonlinear Transformations , 2000 .

[41]  W. Prinz,et al.  Compatibility between Observed and Executed Finger Movements: Comparing Symbolic, Spatial, and Imitative Cues , 2000, Brain and Cognition.

[42]  Richard A. Volz,et al.  Acquiring robust, force-based assembly skills from human demonstration , 2000, IEEE Trans. Robotics Autom..

[43]  Kerstin Dautenhahn,et al.  Of hummingbirds and helicopters: An algebraic framework for interdisciplinary studies of imitation a , 2000 .

[44]  Michael A. Arbib,et al.  Synthetic brain imaging: grasping, mirror neurons and imitation , 2000, Neural Networks.

[45]  Mohammed Yeasin,et al.  Toward automatic robot programming: learning human skill from visual data , 2000, IEEE Trans. Syst. Man Cybern. Part B.

[46]  H. Bekkering,et al.  Imitation of gestures in children is goal-directed. , 2000, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[47]  H. Kozima,et al.  A Robot that Learns to Communicate with Human Caregivers , 2001 .

[48]  Abderrahmane Kheddar,et al.  Teleoperation based on the hidden robot concept , 2001, IEEE Trans. Syst. Man Cybern. Part A.

[49]  G. Rizzolatti,et al.  Neurophysiological mechanisms underlying the understanding and imitation of action , 2001, Nature Reviews Neuroscience.

[50]  Tetsuo Ono,et al.  Development of an Interactive Humanoid Robot "Robovie" - An interdisciplinary approach , 2001, ISRR.

[51]  Cristina P. Santos,et al.  Control of movement time and sequential action through attractor dynamics: A simulation study demonstrating object interception and coordination , 2001 .

[52]  Aude Billard,et al.  Learning human arm movements by imitation: : Evaluation of a biologically inspired connectionist architecture , 2000, Robotics Auton. Syst..

[53]  A. Meltzoff,et al.  A PET Exploration of the Neural Mechanisms Involved in Reciprocal Imitation , 2002, NeuroImage.

[54]  Gillian M. Hayes,et al.  Imitation as a dual-route process featuring predictive and learning components: a biologically plausible computational model , 2002 .

[55]  Yoshihiko Nakamura,et al.  Acquiring Motion Elements for Bidirectional Computation of Motion Recognition and Generation , 2002, ISER.

[56]  Jun Nakanishi,et al.  Movement imitation with nonlinear dynamical systems in humanoid robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[57]  Aude Billard,et al.  Imitation: a means to enhance learning of a synthetic protolanguage in autonomous robots , 2002 .

[58]  Katsushi Ikeuchi,et al.  Task analysis based on observing hands and objects by vision , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[59]  Maja J. Matarić,et al.  Sensory-motor primitives as a basis for imitation: linking perception to action and biology to robotics , 2002 .

[60]  Cynthia Breazeal,et al.  Recognition of Affective Communicative Intent in Robot-Directed Speech , 2002, Auton. Robots.

[61]  Michael A. Arbib,et al.  Schema design and implementation of the grasp-related mirror neuron system , 2002, Biological Cybernetics.

[62]  Rainer Stiefelhagen,et al.  Pointing gesture recognition based on 3D-tracking of face, hands and head orientation , 2003, ICMI '03.

[63]  Yoshihiko Nakamura,et al.  Keyframe compression and decompression for time series data based on the continuous hidden Markov model , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[64]  Stefan Schaal,et al.  Reinforcement Learning for Humanoid Robotics , 2003 .

[65]  Tomomasa Sato,et al.  Robot imitation of human motion based on qualitative description from multiple measurement of human and environmental data , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[66]  Katsushi Ikeuchi,et al.  Extraction of essential interactions through multiple observations of human demonstrations , 2003, IEEE Trans. Ind. Electron..

[67]  Katsu Yamane,et al.  Dynamics Filter - concept and implementation of online motion Generator for human figures , 2000, IEEE Trans. Robotics Autom..

[68]  Monica N. Nicolescu,et al.  Natural methods for robot task learning: instructive demonstrations, generalization and practice , 2003, AAMAS '03.

[69]  S. Bocionek,et al.  Robot programming by Demonstration (RPD): Supporting the induction by human interaction , 1996, Machine Learning.

[70]  Gordon Cheng,et al.  Learning tasks from observation and practice , 2004, Robotics Auton. Syst..

[71]  Jianwei Zhang,et al.  Self-valuing learning and generalization with application in visually guided grasping of complex objects , 2004, Robotics Auton. Syst..

[72]  Andrew W. Moore,et al.  Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.

[73]  Rüdiger Dillmann,et al.  Teaching and learning of robot tasks via observation of human performance , 2004, Robotics Auton. Syst..

[74]  Aaron Hertzmann,et al.  Style-based inverse kinematics , 2004, SIGGRAPH 2004.

[75]  Masayuki Inaba,et al.  Development of an imitation behavior in humanoid Kenta with reinforcement learning algorithm based on the attention during imitation , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[76]  Minoru Asada,et al.  Periodic nonlinear principal component neural networks for humanoid motion segmentation, generalization, and generation , 2004, ICPR 2004.

[77]  Stefano Caselli,et al.  Leveraging on a virtual environment for robot programming by demonstration , 2004, Robotics Auton. Syst..

[78]  Yasuo Kuniyoshi,et al.  Embodied basis of invariant features in execution and perception of whole-body dynamic actions - knacks and focuses of Roll-and-Rise motion , 2004, Robotics Auton. Syst..

[79]  Jun Tani,et al.  Joint attention between a humanoid robot and users in imitation game , 2004 .

[80]  Peter Ford Dominey,et al.  Robot command, interrogation and teaching via social interaction , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[81]  Cynthia Breazeal,et al.  Learning From and About Others: Towards Using Imitation to Bootstrap the Social Understanding of Others by Robots , 2005, Artificial Life.

[82]  Rüdiger Dillmann,et al.  Learning sequential constraints of tasks from user demonstrations , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[83]  Stefano Caselli,et al.  Trajectory clustering and stochastic approximation for robot programming by demonstration , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[84]  Masayuki Inaba,et al.  Intent imitation using wearable motion capturing system with on-line teaching of task attention , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[85]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[86]  Rüdiger Dillmann,et al.  Towards Cognitive Robots: Building Hierarchical Task Representations of Manipulations from Human Demonstration , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[87]  Frédéric Kaplan,et al.  Learning to Interpret Pointing Gestures: Experiments with Four-Legged Autonomous Robots , 2005, Biomimetic Neural Learning for Intelligent Robots.

[88]  Stefan Schaal,et al.  Incremental Online Learning in High Dimensions , 2005, Neural Computation.

[89]  Monica N. Nicolescu,et al.  Task Learning through Imitation and Human-robot Interaction , 2005 .

[90]  Aude Billard,et al.  Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM , 2005, ICML.

[91]  Aude Billard,et al.  Goal-Directed Imitation in a Humanoid Robot , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[92]  Aude Billard,et al.  Dynamic updating of distributed neural representations using forward models , 2006, Biological Cybernetics.

[93]  Danica Kragic,et al.  Learning Task Models from Multiple Human Demonstrations , 2006, ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication.

[94]  Aude Billard,et al.  Learning Dynamical System Modulation for Constrained Reaching Tasks , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[95]  Ignazio Infantino,et al.  A cognitive framework for imitation learning , 2006, Robotics Auton. Syst..

[96]  T. Belpraeme,et al.  Imitation and Social Learning in Robots, Humans and Animals: Behavioural, Social and Communicative Dimensions , 2006 .

[97]  Estela Bicho,et al.  Goal-directed imitation for robots: A bio-inspired approach to action understanding and skill learning , 2006, Robotics Auton. Syst..

[98]  Andrea Lockerd Thomaz,et al.  Using perspective taking to learn from ambiguous demonstrations , 2006, Robotics Auton. Syst..

[99]  Aude Billard,et al.  Biologically Inspired Multimodal Integration: Interferences in a Human-Robot Interaction Game , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[100]  Aude Billard,et al.  Teaching a Humanoid Robot to Recognize and Reproduce Social Cues , 2006, ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication.

[101]  Stefano Caselli,et al.  Robust trajectory learning and approximation for robot programming by demonstration , 2006, Robotics Auton. Syst..

[102]  Michael A. Arbib,et al.  Mirror neurons and imitation: A computationally guided review , 2006, Neural Networks.

[103]  Gordon Cheng,et al.  Dexterous Skills Transfer by Extending Human Body Schema to a Robotic Hand , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[104]  Matthew W. Hoffman,et al.  A probabilistic model of gaze imitation and shared attention , 2006, Neural Networks.

[105]  Yoon-Kwon Hwang,et al.  Self-learning control of cooperative motion for a humanoid robot , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[106]  Chrystopher L. Nehaniv,et al.  Teaching robots by moulding behavior and scaffolding the environment , 2006, HRI '06.

[107]  Jun Tani,et al.  Dynamic and interactive generation of object handling behaviors by a small humanoid robot using a dynamic neural network model , 2006, Neural Networks.

[108]  Masaki Ogino,et al.  Interaction rule learning with a human partner based on an imitation faculty with a simple visuo-motor mapping , 2006, Robotics Auton. Syst..

[109]  Yoshihiko Nakamura,et al.  Stochastic Model of Imitating a New Observed Motion Based on the Acquired Motion Primitives , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[110]  Tony Belpaeme,et al.  A computational model of intention reading in imitation , 2006, Robotics Auton. Syst..

[111]  Tamim Asfour,et al.  Imitation Learning of Dual-Arm Manipulation Tasks in Humanoid Robots , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[112]  Masayuki Inaba,et al.  Situation Recognition and Behavior Induction based on Geometric Symbol Representation of Multimodal Sensorimotor Patterns , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[113]  Yiannis Demiris,et al.  Hierarchical attentive multiple models for execution and recognition of actions , 2006, Robotics Auton. Syst..

[114]  Raymond H. Cuijpers,et al.  Goals and means in action observation: A computational approach , 2006, Neural Networks.

[115]  Aude Billard,et al.  Parallel and distributed neural models of the ideomotor principle: An investigation of imitative cortical pathways , 2006, Neural Networks.

[116]  Chrystopher L. Nehaniv,et al.  Evaluation of robot imitation attempts: comparison of the system's and the human's perspectives , 2006, HRI '06.

[117]  Aude Billard,et al.  Incremental learning of gestures by imitation in a humanoid robot , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[118]  Aude Billard,et al.  Reinforcement learning for imitating constrained reaching movements , 2007, Adv. Robotics.

[119]  Aude Billard,et al.  Special Issue on Robot Learning by Observation, Demonstration, and Imitation , 2007, IEEE Trans. Syst. Man Cybern. Part B.

[120]  Aude Billard,et al.  What is the Teacher"s Role in Robot Programming by Demonstration? - Toward Benchmarks for Improved Learning , 2007 .

[121]  Aude Billard,et al.  On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[122]  Aude Billard,et al.  Learning of Gestures by Imitation in a Humanoid Robot , 2007 .

[123]  Rüdiger Dillmann,et al.  Incremental Learning of Tasks From User Demonstrations, Past Experiences, and Vocal Comments , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[124]  Yoshihiko Nakamura,et al.  Mimesis Scheme using a Monocular Vision System on a Humanoid Robot , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[125]  Tamim Asfour,et al.  Imitation Learning of Dual-Arm Manipulation Tasks in Humanoid Robots , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.