Knowledge-Based Multiagent Credit Assignment: A Study on Task Type and Critic Information
[1] Sachiyo Arai,et al. Multi-agent reinforcement learning for crane control problem: designing rewards for conflict resolution, 1999, Proceedings. Fourth International Symposium on Autonomous Decentralized Systems. - Integration of Heterogeneous Systems -.
[2] Shigenobu Kobayashi,et al. Rationality of Reward Sharing in Multi-agent Reinforcement Learning , 1999, PRIMA.
[3] Sachiyo Arai,et al. Multi-agent reinforcement learning for planning and scheduling multiple goals , 2000, Proceedings Fourth International Conference on MultiAgent Systems.
[4] Peter Stone,et al. Layered learning in multiagent systems - a winning approach to robotic soccer , 2000, Intelligent robotics and autonomous agents.
[5] Mitsuo Kawato,et al. Inter-module credit assignment in modular reinforcement learning , 2003, Neural Networks.
[6] Andrew W. Moore,et al. Distributed Value Functions , 1999, ICML.
[7] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[8] Maja J. Mataric,et al. Using Communication to Reduce Locality in Multi-Robot Learning , 1997, AAAI/IAAI.
[9] Craig Boutilier,et al. Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.
[10] John H. Holland,et al. Properties of the Bucket Brigade , 1985, ICGA.
[11] M. N. Ahmadabadi,et al. Experimental Analysis of Knowledge Based Multiagent Credit Assignment, 2004.
[12] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[13] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[14] Michael P. Georgeff,et al. Commitment and Effectiveness of Situated Agents , 1991, IJCAI.
[15] Wayne Wobcke,et al. Multi-Agent Reinforcement Learning with Vicarious Rewards , 1999, Electron. Trans. Artif. Intell..
[16] Majid Nili Ahmadabadi,et al. A new approach to credit assignment in a team of cooperative Q-learning agents , 2002, IEEE International Conference on Systems, Man and Cybernetics.
[17] J. W. Sander. On the Value Distribution of Arithmetic Functions, 1997.
[18] Kagan Tumer,et al. An Introduction to Collective Intelligence , 1999, ArXiv.
[19] Shigenobu Kobayashi,et al. Rationality of reward sharing in multi-agent reinforcement learning , 1999, New Generation Computing.
[20] Sandip Sen,et al. Learning in multiagent systems, 1999.
[21] Pradeep K. Khosla,et al. The necessity of average rewards in cooperative multirobot learning , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).
[22] Sachiyo Arai,et al. Experience-Based Reinforcement Learning to Acquire Effective Behavior in a Multi-agent Domain , 2000, PRICAI.
[23] Majid Nili Ahmadabadi,et al. Distributed form closure for convex planar objects through reinforcement learning with local information , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).
[24] John J. Grefenstette,et al. Credit assignment in rule discovery systems based on genetic algorithms , 1988, Machine Learning.
[25] Mitsuo Kawato,et al. Multiple Model-Based Reinforcement Learning , 2002, Neural Computation.