Kernel-Based Reinforcement Learning
暂无分享,去创建一个
[1] John N. Tsitsiklis,et al. Analysis of Temporal-Diffference Learning with Function Approximation , 1996, NIPS.
[2] P. S. Sastry,et al. A reinforcement learning neural network for adaptive control of Markov chains , 1997, IEEE Trans. Syst. Man Cybern. Part A.
[3] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[4] John C. Platt,et al. Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .
[5] Gavin C. Cawley,et al. Improved sparse least-squares support vector machines , 2002, Neurocomputing.
[6] Johan A. K. Suykens,et al. Weighted least squares support vector machines: robustness and sparse approximation , 2002, Neurocomputing.
[7] Nello Cristianini,et al. Kernel Methods for Pattern Analysis , 2003, ICTAI.
[8] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[9] Gary William Flake,et al. Efficient SVM Regression Training with SMO , 2002, Machine Learning.
[10] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[11] John N. Tsitsiklis,et al. Feature-based methods for large scale dynamic programming , 2004, Machine Learning.
[12] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.