Approximate dynamic programming with Gaussian processes
暂无分享,去创建一个
[1] Carl E. Rasmussen,et al. Model-Based Reinforcement Learning with Continuous States and Actions , 2008, ESANN.
[2] C. Rasmussen,et al. Gaussian Process Priors with Uncertain Inputs - Application to Multiple-Step Ahead Time Series Forecasting , 2002, NIPS.
[3] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[4] Thomas G. Dietterich. Adaptive computation and machine learning , 1998 .
[5] Tom Minka,et al. A family of algorithms for approximate Bayesian inference , 2001 .
[6] J. Kocijan,et al. Gaussian process model based predictive control , 2004, Proceedings of the 2004 American Control Conference.
[7] Stefan Schaal,et al. Robot Learning From Demonstration , 1997, ICML.
[8] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[9] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[10] Carl E. Rasmussen,et al. Bayesian Monte Carlo , 2002, NIPS.
[11] Nicholas K. Jong,et al. Kernel-Based Models for Reinforcement Learning , 2006 .
[12] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[13] Carl E. Rasmussen,et al. Gaussian Processes in Reinforcement Learning , 2003, NIPS.
[14] Liming Xiang,et al. Kernel-Based Reinforcement Learning , 2006, ICIC.
[15] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[16] Christopher G. Atkeson,et al. Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming , 1993, NIPS.
[17] Shie Mannor,et al. Reinforcement learning with Gaussian processes , 2005, ICML.
[18] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.
[19] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.
[20] Dimitri P. Bertsekas,et al. Dynamic programming and optimal control, 3rd Edition , 2005 .