暂无分享,去创建一个
Razvan Pascanu | Yujia Li | Oriol Vinyals | Daan Wierstra | Nicolas Heess | Lars Buesing | Sébastien Racanière | Theophane Weber | David P. Reichert | Peter W. Battaglia | Oriol Vinyals | N. Heess | Daan Wierstra | T. Weber | Lars Buesing | P. Battaglia | Yujia Li | Razvan Pascanu | Sébastien Racanière | S. Racanière
[1] Eric Horvitz,et al. Reasoning about beliefs and actions under computational resource constraints , 1987, Int. J. Approx. Reason..
[2] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.
[3] Stuart J. Russell,et al. Principles of Metareasoning , 1989, Artif. Intell..
[4] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[5] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[6] Rémi Coulom,et al. Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search , 2006, Computers and Games.
[7] Rémi Munos,et al. Optimistic Planning of Deterministic Systems , 2008, EWRL.
[8] Bart De Schutter,et al. Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .
[9] Rémi Munos,et al. Optimistic Optimization of Deterministic Functions , 2011, NIPS 2011.
[10] Michael L. Littman,et al. Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes , 2012, ICAPS.
[11] David Tolpin,et al. Selecting Computations: Theory and Applications , 2012, UAI.
[12] Yoshua Bengio,et al. Deep Learning of Representations , 2013, Handbook on Neural Information Processing.
[13] Yoshua Bengio,et al. Deep Learning of Representations: Looking Forward , 2013, SLSP.
[14] Yuval Tassa,et al. Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.
[15] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[16] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.
[17] Ross A. Knepper,et al. DeepMPC: Learning Deep Latent Features for Model Predictive Control , 2015, Robotics: Science and Systems.
[18] Joelle Pineau,et al. Conditional Computation in Neural Networks for faster models , 2015, ArXiv.
[19] Alex Graves,et al. Adaptive Computation Time for Recurrent Neural Networks , 2016, ArXiv.
[20] Jitendra Malik,et al. Learning Visual Predictive Models of Physics for Playing Billiards , 2015, ICLR.
[21] Alex Graves,et al. Strategic Attentive Writer for Learning Macro-Actions , 2016, NIPS.
[22] Pieter Abbeel,et al. Value Iteration Networks , 2016, NIPS.
[23] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[24] Razvan Pascanu,et al. Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.
[25] Razvan Pascanu,et al. Metacontrol for Adaptive Imagination-Based Optimization , 2017, ICLR.
[26] Tom Schaul,et al. The Predictron: End-To-End Learning and Planning , 2016, ICML.
[27] Razvan Pascanu,et al. Imagination-Augmented Agents for Deep Reinforcement Learning , 2017, NIPS.
[28] Sergey Levine,et al. Deep visual foresight for planning robot motion , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).