Causal Graph Based Decomposition of Factored MDPs
暂无分享,去创建一个
[1] R. Bellman. A Markovian Decision Process , 1957 .
[2] Richard Fikes,et al. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.
[3] David Harel,et al. Statecharts: A Visual Formalism for Complex Systems , 1987, Sci. Comput. Program..
[4] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .
[5] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[6] Michael O. Duff,et al. Reinforcement Learning Methods for Continuous-Time Markov Decision Problems , 1994, NIPS.
[7] Sebastian Thrun,et al. Finding Structure in Reinforcement Learning , 1994, NIPS.
[8] Thomas Dean,et al. Decomposition Techniques for Planning in Stochastic Domains , 1995, IJCAI.
[9] Craig Boutilier,et al. Exploiting Structure in Policy Construction , 1995, IJCAI.
[10] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..
[11] Pattie Maes,et al. Emergent Hierarchical Control Structures: Learning Reactive/Hierarchical Relationships in Reinforcement Environments , 1996 .
[12] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[13] Robert Givan,et al. Model Minimization in Markov Decision Processes , 1997, AAAI/IAAI.
[14] Milos Hauskrecht,et al. Hierarchical Solution of Markov Decision Processes using Macro-actions , 1998, UAI.
[15] Ronald E. Parr,et al. Hierarchical control and learning for markov decision processes , 1998 .
[16] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[17] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[18] Thomas G. Dietterich. State Abstraction in MAXQ Hierarchical Reinforcement Learning , 1999, NIPS.
[19] Jesse Hoey,et al. SPUDD: Stochastic Planning using Decision Diagrams , 1999, UAI.
[20] Michael Kearns,et al. Efficient Reinforcement Learning in Factored MDPs , 1999, IJCAI.
[21] Daphne Koller,et al. Active Learning for Parameter Estimation in Bayesian Networks , 2000, NIPS.
[22] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[23] Andrew G. Barto,et al. Automated State Abstraction for Options using the U-Tree Algorithm , 2000, NIPS.
[24] Andrew G. Barto,et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density , 2001, ICML.
[25] Carlos Guestrin,et al. Max-norm Projections for Factored MDPs , 2001, IJCAI.
[26] Sridhar Mahadevan,et al. Continuous-Time Hierarchical Reinforcement Learning , 2001, ICML.
[27] Bernhard Hengst,et al. Discovering Hierarchy in Reinforcement Learning with HEXQ , 2002, ICML.
[28] Andrew G. Barto,et al. PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning , 2002, ICML.
[29] Tommi S. Jaakkola,et al. Unsupervised Active Learning in Large Domains , 2002, UAI.
[30] Zhengzhu Feng,et al. Symbolic heuristic search for factored Markov decision processes , 2002, AAAI/IAAI.
[31] Shie Mannor,et al. Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning , 2002, ECML.
[32] Shlomo Zilberstein,et al. Symbolic Generalization for On-line Planning , 2002, UAI.
[33] Nuttapong Chentanez,et al. Intrinsically Motivated Reinforcement Learning , 2004, NIPS.
[34] Shie Mannor,et al. Dynamic abstraction in reinforcement learning via clustering , 2004, ICML.
[35] Malte Helmert,et al. A Planning Heuristic Based on Causal Graph Analysis , 2004, ICAPS.
[36] A. Barto,et al. An algebraic approach to abstraction in reinforcement learning , 2004 .
[37] Andrew G. Barto,et al. Using relative novelty to identify useful temporal abstractions in reinforcement learning , 2004, ICML.
[38] Alicia P. Wolfe,et al. Identifying useful subgoals in reinforcement learning by local graph partitioning , 2005, ICML.
[39] Andrew G. Barto,et al. A causal approach to hierarchical decomposition of factored MDPs , 2005, ICML.
[40] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[41] Kevin Murphy,et al. Active Learning of Causal Bayes Net Structure , 2006 .