Deictic Option Schemas
暂无分享,去创建一个
Balaraman Ravindran | Andrew G. Barto | Vimal Mathew | A. Barto | Balaraman Ravindran | Vimala Mathew
[1] Dana H. Ballard,et al. Learning to perceive and act by trial and error , 1991, Machine Learning.
[2] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .
[3] Sandip Sen,et al. Proceedings of the fifth international conference on Autonomous agents , 2001 .
[4] Robert Givan,et al. Equivalence notions and model minimization in Markov decision processes , 2003, Artif. Intell..
[5] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[6] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[7] Balaraman Ravindran,et al. Relativized Options: Choosing the Right Transformation , 2003, ICML.
[8] Philip E. Agre,et al. The dynamic structure of everyday life , 1988 .
[9] L. Kaelbling,et al. Learning with Deictic Representation , 2002 .
[10] B. Habibi,et al. Pengi : An Implementation of A Theory of Activity , 1998 .
[11] Balaraman Ravindran,et al. SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes , 2003, IJCAI.
[12] Sridhar Mahadevan,et al. A reinforcement learning model of selective visual attention , 2001, AGENTS '01.