论文信息 - Effective Methods for Reinforcement Learning in Large Multi-Agent Domains (Leistungsfähige Verfahren für das Reinforcement Lernen in komplexen Multi-Agenten-Umgebungen)

Effective Methods for Reinforcement Learning in Large Multi-Agent Domains (Leistungsfähige Verfahren für das Reinforcement Lernen in komplexen Multi-Agenten-Umgebungen)

Summary Robotic soccer requires the ability of individually acting agents to cooperate. The simulation league of RoboCup therefore offers an ideal testbed for evaluating multi-agent methods. In this paper we discuss how Reinforcement Learning (RL) methods can be succesfully applied within the scenario of learning to cooperatively score a goal. Due to the complexity of the task, enhanced methods of learning have to be applied. We discuss several approaches from literature and also present an own approach. All approaches are evaluated on a discretized version of robotic soccer, which we call gridworld soccer.

Martin A. Riedmiller | Daniel Withopf | D. Withopf

[1] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..

[2] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.

[3] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[4] Martin A. Riedmiller,et al. Using Machine Learning Techniques in Complex Multi-Agent Domains , 2003 .

[5] Sridhar Mahadevan,et al. Decision-Theoretic Planning with Concurrent Temporally Extended Actions , 2001, UAI.

[6] Balaraman Ravindran,et al. SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes , 2003, IJCAI.

[7] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[8] Thomas G. Dietterich. The MAXQ Method for Hierarchical Reinforcement Learning , 1998, ICML.

[9] David Andre,et al. State abstraction for programmable reinforcement learning agents , 2002, AAAI/IAAI.