暂无分享,去创建一个
Martin A. Riedmiller | Roland Hafner | Rico Jonschkowski | Jonathan Scholz | Roland Hafner | Rico Jonschkowski | Jonathan Scholz
[1] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[2] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .
[3] Terrence J. Sejnowski,et al. Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.
[4] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[5] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[6] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[7] Marc Toussaint,et al. Learning Grounded Relational Symbols from Continuous Data for Abstract Reasoning , 2013 .
[8] David Wingate,et al. A Physics-Based Model Prior for Object-Oriented MDPs , 2014, ICML.
[9] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[11] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.
[12] Oliver Brock,et al. Learning state representations with robotic priors , 2015, Auton. Robots.
[13] Andrew Zisserman,et al. Spatial Transformer Networks , 2015, NIPS.
[14] Byron Boots,et al. Learning to Filter with Predictive State Inference Machines , 2015, ICML.
[15] Sergey Levine,et al. Backprop KF: Learning Discriminative Deterministic State Estimators , 2016, NIPS.
[16] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[17] Dieter Fox,et al. SE3-nets: Learning rigid body motion using deep neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[18] Stefano Ermon,et al. Label-Free Supervision of Neural Networks with Physics and Domain Knowledge , 2016, AAAI.
[19] Oliver Brock,et al. End-to-End Learnable Histogram Filters , 2017 .
[20] Razvan Pascanu,et al. Learning to Navigate in Complex Environments , 2016, ICLR.
[21] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.