Learning to predict by the methods of temporal differences
暂无分享,去创建一个
[1] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..
[2] J. Gillis,et al. Matrix Iterative Analysis , 1961 .
[3] Ian H. Witten,et al. An Adaptive Optimal Controller for Discrete-Time Markov Environments , 1977, Inf. Control..
[4] A G Barto,et al. Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.
[5] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .
[6] Thomas G. Dietterich,et al. Learning to Predict Sequences , 1985 .
[7] J. Hopfield,et al. The Logic of Limax Learning , 1985 .
[8] A G Barto,et al. Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.
[9] Geoffrey E. Hinton,et al. A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..
[10] J. Christensen. Learning static evaluation functions by linear regression , 1986 .
[11] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[12] John H. Holland,et al. Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .
[13] S. Thomas Alexander,et al. Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.
[14] R. Sutton,et al. Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: Response topography, neuronal firing, and interstimulus intervals , 1986, Behavioural Brain Research.
[15] Charles W. Anderson,et al. Strategy Learning with Multilayer Connectionist Representations , 1987 .
[16] E. Kehoe,et al. Temporal primacy overrides prior training in serial compound conditioning of the rabbit’s nictitating membrane response , 1987 .
[17] A. Klopf. A neuronal model of classical conditioning , 1988 .
[18] Eric V. Denardo,et al. Dynamic Programming: Models and Applications , 2003 .
[19] S. Hampson,et al. Disjunctive models of Boolean category learning , 1987, Biological Cybernetics.