Barycentric Interpolators for Continuous Space and Time Reinforcement Learning
[1] Rajesh Sharma, et al. Asymptotic analysis, 1986.
[2] Stephen M. Omohundro, et al. Efficient Algorithms with Neural Network Behavior, 1987, Complex Syst.
[3] Michael I. Jordan, et al. Advances in Neural Information Processing Systems 30, 1995.
[4] G. Barles, et al. Convergence of approximation schemes for fully nonlinear second order equations, 1990, 29th IEEE Conference on Decision and Control.
[5] W. Fleming, et al. Controlled Markov processes and viscosity solutions, 1992.
[6] G. Barles. Solutions de viscosité des équations de Hamilton-Jacobi, 1994.
[7] Geoffrey J. Gordon. Stable Function Approximation in Dynamic Programming, 1995, ICML.
[8] Scott Davies, et al. Multidimensional Triangulation and Interpolation for Reinforcement Learning, 1996, NIPS.
[9] Rémi Munos, et al. A Convergent Reinforcement Learning Algorithm in the Continuous Case Based on a Finite Difference Method, 1997, IJCAI.
[10] Rémi Munos, et al. A General Convergence Method for Reinforcement Learning in the Continuous Case, 1998, ECML.
[11] H. Kushner. Numerical Methods for Stochastic Control Problems in Continuous Time, 2000.
[12] B. Craven. Control and optimization, 2019, Mathematical Modelling of the Human Cardiovascular System.