Paper Information - Deictic Option Schemas

Deictic Option Schemas

Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich, complex environments. In this article we present a hierarchical reinforcement learning framework that employs aspects of deictic representation. We also present a Bayesian algorithm for learning the correct representation for a given sub-problem, and we empirically validate it on a complex game environment.
