A Convergent Reinforcement Learning Algorithm in the Continuous Case Based on a Finite Difference Method
暂无分享,去创建一个
[1] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .
[2] G. Barles,et al. Convergence of approximation schemes for fully nonlinear second order equations , 1990, 29th IEEE Conference on Decision and Control.
[3] R. Emi Munos,et al. A Convergent Reinforcement Learning Algorithm in the Continuous Case : the Finite-element Reinforcement Learning , 1997 .
[4] G. Barles,et al. Comparison principle for dirichlet-type Hamilton-Jacobi equations and singular perturbations of degenerated elliptic equations , 1990 .
[5] G. Barles,et al. Convergence of approximation schemes for fully nonlinear second order equations , 1991 .
[6] Geoffrey J. Gordon. Stable Function Approximation in Dynamic Programming , 1995, ICML.
[7] Rémi Munos,et al. A General Convergence Method for Reinforcement Learning in the Continuous Case , 1998, ECML.
[8] P. Lions,et al. User’s guide to viscosity solutions of second order partial differential equations , 1992, math/9207212.
[9] G. Barles. Solutions de viscosité des équations de Hamilton-Jacobi , 1994 .
[10] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .
[11] Andrew W. Moore,et al. The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces , 2004, Machine Learning.
[12] W. Fleming,et al. Controlled Markov processes and viscosity solutions , 1992 .
[13] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.
[14] Marianne Akian. Méthodes multigrilles en contrôle stochastique , 1990 .
[15] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.
[16] Rajesh Sharma,et al. Asymptotic analysis , 1986 .
[17] Rémi Munos,et al. A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning , 1996, ICML.
[18] R. Lathe. Phd by thesis , 1988, Nature.
[19] M. James. Controlled markov processes and viscosity solutions , 1994 .