论文信息 - Distributed form closure for convex planar objects through reinforcement learning with local information

Distributed form closure for convex planar objects through reinforcement learning with local information

Many real world applications would involve grasp of large objects in unstructured environments. Agent-based approach to multi-robot grasp of objects would prove useful under the above circumstances. In this paper, the problem of form closure grasp for planar convex objects by multiple robots is tackled. Contrary to the previous approaches, no a priori information about the shape of the object is assumed, and the robots are not allowed to fully communicate among themselves. A distributed multi-agent based approach using Q-learning is proposed. The state space, action set and learning algorithm are formulated. The results are verified through simulations using a developed Q-learning test bed.

Majid Nili Ahmadabadi | Babak Nadjar Araabi | Farrokh Janabi-Sharifi | Amir Hossein Elahibakhsh

[1] Yoshihiko Nakamura,et al. Robustness of power grasp , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[2] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[3] Olivier Buffet,et al. Multi-Agent Systems by Incremental Gradient Reinforcement Learning , 2001, IJCAI.

[4] Jeffrey C. Trinkle,et al. On the stability and instantaneous velocity of grasped frictionless objects , 1992, IEEE Trans. Robotics Autom..

[5] Vijay Kumar,et al. Decentralized control of cooperating mobile manipulators , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[6] Tsuneo Yoshikawa,et al. Passive and active closures by constraining mechanisms , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[7] Vijay Kumar,et al. Cooperative Transport of Planar Objects by Multiple Mobile Robots Using Object Closure , 2002, ISER.

[8] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[9] Daniela Rus. Coordinated Manipulation of Objects in a Plane , 1997, Algorithmica.

[10] Van-Duc Nguyen,et al. Constructing Force- Closure Grasps , 1988, Int. J. Robotics Res..

[11] Olivier Buffet,et al. Incremental reinforcement learning for designing multi-agent systems , 2001, AGENTS '01.

[12] Kazuhiro Kosuge,et al. Decentralized control of multiple robots handling an object , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[13] M. N. Ahmadabadi,et al. Experimental Analysis of Knowledge Based Multiagent Credit Assignment , 2004 .