Speeding-up Reinforcement Learning with Multi-step Actions
暂无分享,去创建一个
[1] Stephan Pareigis,et al. Adaptive Choice of Grid and Time in Reinforcement Learning , 1997, NIPS.
[2] Ronald E. Parr,et al. Hierarchical control and learning for markov decision processes , 1998 .
[3] Richard S. Sutton,et al. Roles of Macro-Actions in Accelerating Reinforcement Learning , 1998 .
[4] Doina Precup,et al. Using Options for Knowledge Transfer in Reinforcement Learning , 1999 .
[5] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[6] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..