Paper information - XCS with computed prediction in multistep environments

XCS with computed prediction in multistep environments

XCSF extends the typical concept of learning classifier systems through the introduction of computed classifier prediction. Initial results show that XCSF's computed prediction can be used to evolve accurate piecewise linear approximations of simple functions. In this paper, we take XCSF one step further and apply it to typical reinforcement learning problems involving delayed rewards. In essence, we use XCSF as a method of generalized (linear) reinforcement learning to evolve piecewise linear approximations of the payoff surfaces of typical multistep problems. Our results show that XCSF can easily evolve optimal and near optimal solutions for problems introduced in the literature to test linear reinforcement learning methods.
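To make the idea of computed prediction concrete, here is a minimal sketch of how a single classifier might compute its prediction as a linear function of the input and adjust its weights with a normalized delta rule (Widrow-Hoff). It is an illustration under stated assumptions, not the paper's implementation: the function names `predict` and `update`, the constant input `x0`, the learning rate `eta`, and the toy payoff `3 + 2 * s` are all hypothetical. In a multistep problem the target would instead be a discounted payoff estimate rather than a known function value.

```python
def predict(weights, state, x0=1.0):
    """Computed prediction: a linear function of the augmented input."""
    inputs = [x0] + list(state)  # x0 is an assumed constant bias input
    return sum(w * x for w, x in zip(weights, inputs))


def update(weights, state, target, eta=0.2, x0=1.0):
    """One normalized delta-rule (Widrow-Hoff) step toward the target payoff."""
    inputs = [x0] + list(state)
    error = target - predict(weights, state, x0)
    norm = sum(x * x for x in inputs)  # always > 0 because x0 != 0
    return [w + (eta / norm) * error * x for w, x in zip(weights, inputs)]


# Hypothetical linear payoff over one input: P(s) = 3 + 2 * s.
weights = [0.0, 0.0]
for step in range(5000):
    s = (step % 11) / 10.0  # cycle through s = 0.0, 0.1, ..., 1.0
    weights = update(weights, [s], 3.0 + 2.0 * s)
```

Because the toy target is exactly linear and noise-free, the weights settle close to the intercept and slope of the payoff. A full system would hold many such classifiers, each covering a region of the input space, which is what yields a piecewise linear approximation overall.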

Daniele Loiacono | David E. Goldberg | Stewart W. Wilson | Pier Luca Lanzi

[1] Gerald Tesauro, et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play, 1994, Neural Computation.

[2] Bernard Widrow, et al. Adaptive switching circuits, 1988.

[3] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.

[4] Stewart W. Wilson. Classifier Systems for Continuous Payoff Environments, 2004, GECCO.

[5] Doina Precup, et al. A Convergent Form of Approximate Policy Iteration, 2002, NIPS.

[6] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.

[7] Stewart W. Wilson. Classifier Fitness Based on Accuracy, 1995, Evolutionary Computation.

[8] Stewart W. Wilson. Mining Oblique Data with XCS, 2000, IWLCS.

[9] Sebastian Thrun, et al. Issues in Using Function Approximation for Reinforcement Learning, 1999.

[10] M. Colombetti, et al. An extension to the XCS classifier system for stochastic environments, 1999.

[11] Andrew W. Moore, et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function, 1994, NIPS.

[12] Stewart W. Wilson. Classifiers that approximate functions, 2002, Natural Computing.

[13] Martin V. Butz, et al. An algorithmic description of XCS, 2000, Soft Comput.

[14] Richard S. Sutton, et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding, 1995, NIPS.