Learning a Partial Behavior for a Competitive Robotic Soccer Agent
[1] Kurt Hornik, et al. Multilayer feedforward networks are universal approximators, 1989, Neural Networks.
[2] Martin A. Riedmiller, et al. CBR for State Value Function Approximation in Reinforcement Learning, 2005, ICCBR.
[3] Ian Frank, et al. Soccer Server: A Tool for Research on Multiagent Systems, 1998, Appl. Artif. Intell.
[4] Pierre Geurts, et al. Tree-Based Batch Mode Reinforcement Learning, 2005, J. Mach. Learn. Res.
[5] Geoffrey J. Gordon, et al. Approximate solutions to Markov decision processes, 1999.
[6] Peter Stone, et al. Progress in Learning 3 vs. 2 Keepaway, 2003, RoboCup.
[7] Holger Schoener, et al. Active Learning with Neural Networks, 2007.
[8] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.
[9] Oliver Obst, et al. Qualitative Velocity and Ball Interception, 2002, KI.
[10] Hiroaki Kitano, et al. RoboCup-2001: The Fifth Robotic Soccer World Championships, 2002, AI Mag.
[11] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.
[12] Manuela M. Veloso, et al. The CMUnited-99 Champion Simulator Team, 2000, AI Mag.
[13] Martin A. Riedmiller, et al. A direct adaptive method for faster backpropagation learning: the RPROP algorithm, 1993, IEEE International Conference on Neural Networks.
[14] Hamidreza Chitsaz, et al. The Fifth Robotic Soccer World Championships, 2002.
[15] Andrew Y. Ng, et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping, 1999, ICML.