Learning to Control at Multiple Time Scales
[1] Doina Precup, et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning, 1999, Artif. Intell.
[2] Martin A. Riedmiller, et al. High Quality Thermostat Control by Reinforcement Learning - A Case Study, 1998.