Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets
[1] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method, 2005, ECML.
[2] Stefan Schaal, et al. Natural Actor-Critic, 2003, Neurocomputing.
[3] Ann Nowé, et al. Coordinated exploration in multi-agent reinforcement learning: an application to load-balancing, 2005, AAMAS '05.
[4] Martin A. Riedmiller, et al. A direct adaptive method for faster backpropagation learning: the RPROP algorithm, 1993, IEEE International Conference on Neural Networks.
[5] Martin Lauer, et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems, 2000, ICML.
[6] Neil Immerman, et al. The Complexity of Decentralized Control of Markov Decision Processes, 2000, UAI.
[7] Pierre Geurts, et al. Tree-Based Batch Mode Reinforcement Learning, 2005, J. Mach. Learn. Res.
[8] Ronen I. Brafman, et al. Learning to Coordinate Efficiently: A Model-based Approach, 2003, J. Artif. Intell. Res.
[9] Claudia V. Goldman, et al. Solving Transition Independent Decentralized Markov Decision Processes, 2004, J. Artif. Intell. Res.
[10] Craig Boutilier, et al. Sequential Optimality and Coordination in Multiagent Systems, 1999, IJCAI.
[11] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.
[12] François Charpillet, et al. Coordination through mutual notification in cooperative multiagent reinforcement learning, 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004).
[13] Martin A. Riedmiller, et al. Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies, 2008, AAMAS.
[14] Michael Pinedo, et al. Scheduling: Theory, Algorithms, and Systems, 1994.
[15] Nancy Paterson. The Library, 1912, Leonardo.
[16] Victor R. Lesser, et al. Decentralized Markov decision processes with event-driven interactions, 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004).
[17] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[18] Olivier Buffet, et al. Shaping multi-agent systems with gradient reinforcement learning, 2007, Autonomous Agents and Multi-Agent Systems.
[19] Thomas G. Dietterich. What is machine learning?, 2020, Archives of Disease in Childhood.
[20] Claudia V. Goldman, et al. Optimizing information exchange in cooperative multi-agent systems, 2003, AAMAS '03.