Learning Distributed Object Pushing: Individual Learning and Distributed Cooperation Protocol

In this paper we study learning in cooperative object pushing systems. The proposed approach is based on the idea of learning individual skills and then mapping the required cooperative behaviors on the learned skills. The control point for each individual robot is chosen so as to simplify the design of the reinforcement signal and reduce the role of delayed reward on individual learning in addition to make cooperative protocol simpler. A fuzzy Q-learning system for learning individual object pushing is proposed, together with a method for coordination among the robots to push the object cooperatively. The coordination method takes into account the dynamics of the object and the learned individual skills. The idea of the coordination protocol is based on the notion of ability and active regions defined in this paper. In effect, the cooperation protocol is mapped to a Q-value based policy. Simulation results supportively show that the robots learn individual and cooperative object pushing efficiently

[1]  Hamid R. Berenji,et al.  Learning and tuning fuzzy logic controllers through reinforcements , 1992, IEEE Trans. Neural Networks.

[2]  Kazuhiro Kosuge,et al.  Motion control of multiple autonomous mobile robots handling a large object in coordination , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[3]  Majid Nili Ahmadabadi,et al.  A "constrain and move" approach to distributed object manipulation , 2001, IEEE Trans. Robotics Autom..

[4]  Majid Nili Ahmadabadi,et al.  Distributed object transportation on a desired path based on Constrain and Move strategy , 2005, Robotics Auton. Syst..

[5]  Eiji Nakano,et al.  Cooperating multiple behavior-based robots for object manipulation , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[6]  Majid Nili Ahmadabadi,et al.  A multiple robot system for cooperative object transportation with various requirements on task performing , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[7]  Bruce Randall Donald,et al.  Moving furniture with teams of autonomous robots , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[8]  Lionel Jouffe,et al.  Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[9]  Bruce Randall Donald,et al.  Information Invariants for Distributed Manipulation , 1995, Int. J. Robotics Res..

[10]  Eiji Nakano,et al.  Solving function distribution and behavior design problem for cooperative object handling by multiple mobile robots , 2003, IEEE Trans. Syst. Man Cybern. Part A.

[11]  Kevin M. Lynch,et al.  Locally controllable manipulation by stable pushing , 1999, IEEE Trans. Robotics Autom..

[12]  Eiji Nakano,et al.  Realizing cooperative object manipulation using multiple behaviour-based robots , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[13]  Lynne E. Parker,et al.  ALLIANCE: an architecture for fault tolerant, cooperative control of heterogeneous mobile robots , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).