Comparison of reinforcement algorithms on discrete functions: learnability, time complexity, and scaling
暂无分享,去创建一个
[1] Michael I. Jordan,et al. Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..
[2] A G Barto,et al. Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.
[3] R. J. Williams,et al. On the use of backpropagation in associative reinforcement learning , 1988, IEEE 1988 International Conference on Neural Networks.
[4] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[5] Esther Levin,et al. Accelerated Learning in Layered Neural Networks , 1988, Complex Syst..
[6] Charles W. Anderson,et al. Strategy Learning with Multilayer Connectionist Representations , 1987 .
[7] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.
[8] J. Peng,et al. Reinforcement learning algorithms as function optimizers , 1989, International 1989 Joint Conference on Neural Networks.