Spatio-Temporal Abstractions in Reinforcement Learning Through Neural Encoding
暂无分享,去创建一个
Shie Mannor | Nir Baram | Tom Zahavy | Shie Mannor | Tom Zahavy | Nir Baram
[1] J. Peng,et al. Efficient Learning and Planning Within the Dyna Framework , 1993, IEEE International Conference on Neural Networks.
[2] Shie Mannor,et al. Model selection in markovian processes , 2013, KDD.
[3] J. MacQueen. Some methods for classification and analysis of multivariate observations , 1967 .
[4] Shie Mannor,et al. Graying the black box: Understanding DQNs , 2016, ICML.
[5] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[6] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[7] Andrew G. Barto,et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density , 2001, ICML.
[8] Andrew G. Barto,et al. Skill Characterization Based on Betweenness , 2008, NIPS.
[9] Michael I. Jordan,et al. Reinforcement Learning with Soft State Aggregation , 1994, NIPS.
[10] Stuart J. Russell,et al. Markovian State and Action Abstractions for MDPs via Hierarchical MCTS , 2016, IJCAI.
[11] Doina Precup,et al. Learning Options in Reinforcement Learning , 2002, SARA.
[12] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[13] Shie Mannor,et al. Time-regularized interrupting options , 2014, ICML 2014.
[14] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[15] Gerald Tesauro,et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.
[16] Shie Mannor,et al. Adaptive Skills Adaptive Partitions (ASAP) , 2016, NIPS.
[17] Benjamin Pitzer,et al. Towards perceptual shared autonomy for robotic mobile manipulation , 2011, 2011 IEEE International Conference on Robotics and Automation.
[18] Rajat Raina,et al. Efficient sparse coding algorithms , 2006, NIPS.
[19] Craig Boutilier,et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..
[20] Ulrike von Luxburg,et al. A tutorial on spectral clustering , 2007, Stat. Comput..
[21] Amy McGovern. Autonomous Discovery of Abstractions through Interaction with an Environment , 2002, SARA.