Meta-learning for Predictive Knowledge Architectures: A Case Study Using TIDBD on a Sensor-rich Robotic Arm
暂无分享,去创建一个
Patrick M. Pilarski | Johannes Günther | Michael Rory Dawson | Nadia M. Ady | Alex Kearney | M. R. Dawson | P. Pilarski | Alex Kearney | J. Günther
[1] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.
[2] Patrick M. Pilarski,et al. Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding , 2017, 2017 International Conference on Rehabilitation Robotics (ICORR).
[3] Patrick M. Pilarski,et al. Adaptive artificial limbs: a real-time approach to prediction and anticipation , 2013, IEEE Robotics & Automation Magazine.
[4] Patrick M. Pilarski,et al. Intelligent laser welding through representation, prediction, and control learning: An architecture with deep neural networks and reinforcement learning , 2016 .
[5] Patrick M. Pilarski,et al. Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning , 2019, ArXiv.
[6] Richard S. Sutton,et al. Multi-timescale nexting in a reinforcement learning robot , 2011, Adapt. Behav..
[7] A. Dickinson,et al. Neuronal coding of prediction errors. , 2000, Annual review of neuroscience.
[8] Adam M White,et al. DEVELOPING A PREDICTIVE APPROACH TO KNOWLEDGE , 2015 .
[9] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[10] Patrick M. Pilarski,et al. Predictions , Surprise , and Predictions of Surprise in General Value Function Architectures , 2018 .
[11] Guang-Hong Yang,et al. Fault detection for linear stochastic systems with sensor stuck faults , 2012 .
[12] Sergey Levine,et al. Self-Supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[13] Patrick M. Pilarski,et al. Introspective Agents: Confidence Measures for General Value Functions , 2016, AGI.
[14] Marco C. Bettoni,et al. Made-Up Minds: A Constructivist Approach to Artificial Intelligence , 1993, IEEE Expert.
[15] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[16] Patrick M. Pilarski,et al. A Collaborative Approach to the Simultaneous Multi-joint Control of a Prosthetic Arm , 2015, 2015 IEEE International Conference on Rehabilitation Robotics (ICORR).