A Fast, Robust, and Incremental Model for Learning High-Level Concepts From Human Motions by Imitation

Social robots are becoming a companion in everyday life. To be well accepted by humans, they should efficiently understand meanings of their partners’ motions and body language and respond accordingly. Learning concepts by imitation brings them this ability in a user-friendly way. This paper presents a fast and robust model for incremental learning of concepts by imitation (ILoCI). In ILoCI, observed multimodal spatiotemporal demonstrations are incrementally abstracted and generalized based on their perceptual and functional similarities during the imitation. Perceptually similar demonstrations are abstracted by a dynamic model of the mirror neuron system. The functional similarities of demonstrations are also learned through a limited number of interactions with the teacher. Incremental relearning of acquired concepts together through memory rehearsal enables the learner to gradually extract and utilize the common structural relations among demonstrations to expedite the learning process especially at the initial stages. Performance of ILoCI is assessed using a standard benchmark dataset and a human–robot interaction task in which a humanoid robot learns to abstract teacher's hand motions during imitation. Its performance is also evaluated on occluded observations that are probable in real environments. The results show efficiency of ILoCI in concept acquisition, recognition, prediction, and generation in addition to its robustness to occlusions and high variability in observations.

[1]  Bence P. Ölveczky,et al.  Motor circuits are required to encode a sensory model for imitative learning , 2012, Nature Neuroscience.

[2]  Stefan Wermter,et al.  Towards integrating learning by demonstration and learning by instruction in a multimodal robotic , 2003 .

[3]  Jun Tani,et al.  On-line Imitative Interaction with a Humanoid Robot Using a Dynamic Neural Network Model of a Mirror System , 2004, Adapt. Behav..

[4]  Stefan Schaal,et al.  Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.

[5]  Dana Kulic,et al.  Incremental Learning, Clustering and Hierarchy Formation of Whole Body Motion Patterns using Adaptive Hidden Markov Chains , 2008, Int. J. Robotics Res..

[6]  Kuu-Young Young,et al.  Intention Learning From Human Demonstration , 2011, J. Inf. Sci. Eng..

[7]  Tetsuya Ogata,et al.  Experience Based Imitation Using RNNPB , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Dana Kulic,et al.  Online Segmentation and Clustering From Continuous Observation of Whole Body Motions , 2009, IEEE Transactions on Robotics.

[9]  Chrystopher L. Nehaniv,et al.  Three Sources of Information in Social Learning , 2002 .

[10]  Mark Steedman,et al.  Object-Action Complexes: Grounded abstractions of sensory-motor processes , 2011, Robotics Auton. Syst..

[11]  Majid Nili Ahmadabadi,et al.  Inverse Kinematics Based Human Mimicking System using Skeletal Tracking Technology , 2017, J. Intell. Robotic Syst..

[12]  Dana Kulic,et al.  Incremental learning of full body motion primitives and their sequencing through human motion observation , 2012, Int. J. Robotics Res..

[13]  Jun Tani,et al.  Emergence of Functional Hierarchy in a Multiple Timescale Neural Network Model: A Humanoid Robot Experiment , 2008, PLoS Comput. Biol..

[14]  Aude Billard,et al.  Incremental motion learning with locally modulated dynamical systems , 2015, Robotics Auton. Syst..

[15]  Fabian Chersi,et al.  Learning Through Imitation: a Biological Approach to Robotics , 2012, IEEE Transactions on Autonomous Mental Development.

[16]  Henk Nijmeijer,et al.  Robot Programming by Demonstration , 2010, SIMPAR.

[17]  H. Moradi,et al.  Hierarchical concept learning based on functional similarity of actions , 2013, 2013 First RSI/ISM International Conference on Robotics and Mechatronics (ICRoM).

[18]  Yoshihiko Nakamura,et al.  Segmentation, Memorization, Recognition and Abstraction of Humanoid Motions Based on Correlations and Associative Memory , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[19]  E. Torres-Jara,et al.  Challenges for Robot Manipulation in Human Environments , 2006 .

[20]  Charles C. Kemp,et al.  Challenges for robot manipulation in human environments [Grand Challenges of Robotics] , 2007, IEEE Robotics & Automation Magazine.

[21]  Ana Paiva,et al.  Bidirectional Learning of Handwriting Skill in Human-Robot Interaction , 2015, HRI.

[22]  Shigeki Sugano,et al.  Imitating others by composition of primitive actions: A neuro-dynamic model , 2012, Robotics Auton. Syst..

[23]  Yoshihiko Nakamura,et al.  Embodied Symbol Emergence Based on Mimesis Theory , 2004, Int. J. Robotics Res..

[24]  Jochen J. Steil,et al.  Open-source benchmarking for learned reaching motion generation in robotics , 2015, Paladyn J. Behav. Robotics.

[25]  Majid Nili Ahmadabadi,et al.  Conceptual Imitation Learning Based on Perceptual and Functional Characteristics of Action , 2013, IEEE Transactions on Autonomous Mental Development.

[26]  Cynthia Breazeal,et al.  Learning From and About Others: Towards Using Imitation to Bootstrap the Social Understanding of Others by Robots , 2005, Artificial Life.

[27]  Eren Erdal Aksoy,et al.  Model-free incremental learning of the semantics of manipulation actions , 2015, Robotics Auton. Syst..

[28]  Michael A. Arbib,et al.  Mirror neurons and imitation: A computationally guided review , 2006, Neural Networks.

[29]  Jun Tani,et al.  Codevelopmental Learning Between Human and Humanoid Robot Using a Dynamic Neural-Network Model , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[30]  G. Rizzolatti,et al.  Hearing Sounds, Understanding Actions: Action Representation in Mirror Neurons , 2002, Science.

[31]  Stefan Wermter,et al.  Towards multimodal neural robot learning , 2004, Robotics Auton. Syst..

[32]  Tetsuya Ogata,et al.  Prediction and imitation of other's motions by reusing own forward-inverse model in robots , 2009, 2009 IEEE International Conference on Robotics and Automation.

[33]  A. Meltzoff,et al.  Explaining Facial Imitation: A Theoretical Model. , 1997, Early development & parenting.

[34]  Darwin G. Caldwell,et al.  Learning adaptive movements from demonstration and self-guided exploration , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[35]  Shigeki Sugano,et al.  Open-End Human Robot Interaction from the Dynamical Systems Perspective: Mutual Adaptation and Incremental Learning , 2004, International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems.

[36]  José Santos-Victor,et al.  Abstraction Levels for Robotic Imitation: Overview and Computational Approaches , 2010, From Motor Learning to Interaction Learning in Robots.

[37]  Aude Billard,et al.  Handbook of Robotics Chapter 59 : Robot Programming by Demonstration , 2007 .

[38]  Oliver Kroemer,et al.  Interaction primitives for human-robot cooperation tasks , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Aude Billard,et al.  Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM , 2005, ICML.

[40]  Majid Nili Ahmadabadi,et al.  Conceptual Imitation Learning in a Human-Robot Interaction Paradigm , 2012, TIST.

[41]  Raymond H. Cuijpers,et al.  Generalisation of action sequences in RNNPB networks with mirror properties , 2009, ESANN 2009.

[42]  Aude Billard,et al.  Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[43]  Hossein Mobahi,et al.  A BIOLOGICALLY INSPIRED METHOD FOR CONCEPTUAL IMITATION USING REINFORCEMENT LEARNING , 2007, Appl. Artif. Intell..

[44]  T. Zentall,et al.  Categorization, concept learning, and behavior analysis: an introduction. , 2002, Journal of the experimental analysis of behavior.

[45]  Jun Tani,et al.  Self-organization of distributedly represented multiple behavior schemata in a mirror system: reviews of robot experiments using RNNPB , 2004, Neural Networks.

[46]  Minho Lee,et al.  Goal-oriented behavior sequence generation based on semantic commands using multiple timescales recurrent neural network with initial state correction , 2014, Neurocomputing.

[47]  Aleksandar Jevtic,et al.  Applying Ant Colony Optimization algorithms for high-level behavior learning and reproduction from demonstrations , 2015, Robotics Auton. Syst..

[48]  Benjamin Fonooni,et al.  Cognitive Interactive Robot Learning , 2014 .

[49]  L. Nadel,et al.  The medial temporal region and memory consolidation: A new hypothesis , 2014 .

[50]  A. Jonker Origins of the modern mind. Three stages in the evolution of culture and cognition , 1998 .

[51]  K. Dautenhahn,et al.  The Mirror System, Imitation, and the Evolution of Language , 1999 .

[52]  Jun Tani,et al.  Dynamic and interactive generation of object handling behaviors by a small humanoid robot using a dynamic neural network model , 2006, Neural Networks.

[53]  Hossein Hajimirsadeghi Conceptual imitation learning based on functional effects of action , 2011, 2011 IEEE EUROCON - International Conference on Computer as a Tool.

[54]  Tetsuya Ogata,et al.  Experience-based imitation using RNNPB , 2007, Adv. Robotics.