XCS with computed prediction in multistep environments
暂无分享,去创建一个
[1] Gerald Tesauro,et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.
[2] Bernard Widrow,et al. Adaptive switching circuits , 1988 .
[3] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[4] Stewart W. Wilson. Classifier Systems for Continuous Payoff Environments , 2004, GECCO.
[5] Doina Precup,et al. A Convergent Form of Approximate Policy Iteration , 2002, NIPS.
[6] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[7] Stewart W. Wilson. Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.
[8] Stewart W. Wilson. Mining Oblique Data with XCS , 2000, IWLCS.
[9] Sebastian Thrun,et al. Issues in Using Function Approximation for Reinforcement Learning , 1999 .
[10] M. Colombetti,et al. An extension to the XCS classifier system for stochastic environments , 1999 .
[11] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.
[12] Stewart W. Wilson. Classifiers that approximate functions , 2002, Natural Computing.
[13] Martin V. Butz,et al. An algorithmic description of XCS , 2000, Soft Comput..
[14] Richard S. Sutton,et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding , 1995, NIPS.