Generalization and Scaling in Reinforcement Learning
[1] Richard S. Sutton, et al. Temporal credit assignment in reinforcement learning, 1984.
[2] P. Anandan, et al. Pattern-recognizing stochastic learning automata, 1985, IEEE Transactions on Systems, Man, and Cybernetics.
[3] A. G. Barto, et al. Learning by statistical cooperation of self-interested neuron-like computing elements, 1985, Human Neurobiology.
[4] Geoffrey E. Hinton, et al. Learning representations by back-propagating errors, 1986, Nature.
[5] Charles W. Anderson, et al. Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning), 1986.
[6] Geoffrey E. Hinton, et al. Learning representations by back-propagating errors, 1986, Nature.
[7] D. Ackley. A connectionist machine for genetic hillclimbing, 1987.
[8] David H. Ackley. Associative Learning via Inhibitory Search, 1988, NIPS.
[9] Robert B. Allen. Developing agent models with a neural reinforcement technique, 1989, Conference Proceedings, IEEE International Conference on Systems, Man and Cybernetics.