Adaptive Choice of Grid and Time in Reinforcement Learning
暂无分享,去创建一个
[1] M. Falcone. A numerical approach to the infinite horizon problem of deterministic control theory , 1987 .
[2] Eberhard Bänsch,et al. Local mesh refinement in 2 and 3 dimensions , 1991, IMPACT Comput. Sci. Eng..
[3] Andrew W. Moore,et al. The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces , 2004, Machine Learning.
[4] Stephan Pareigis,et al. Lernen der Lösung der Bellman-Gleichung durch Beobachtung von kontinuierlichen Prozessen , 1996 .
[5] Stephan Pareigis,et al. Multi-Grid Methods for Reinforcement Learning in Controlled Diffusion Processes , 1996, NIPS.
[6] L. Grüne. An adaptive grid scheme for the discrete Hamilton-Jacobi-Bellman equation , 1997 .