论文信息 - PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

We propose position-velocity encoders (PVEs) which learn---without supervision---to encode images to positions and velocities of task-relevant objects. PVEs encode a single image into a low-dimensional position state and compute the velocity state from finite differences in position. In contrast to autoencoders, position-velocity encoders are not trained by image reconstruction, but by making the position-velocity representation consistent with priors about interacting with the physical world. We applied PVEs to several simulated control tasks from pixels and achieved promising preliminary results.

[1] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.

[2] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .

[3] Terrence J. Sejnowski,et al. Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[4] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.

[5] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[6] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7] Marc Toussaint,et al. Learning Grounded Relational Symbols from Continuous Data for Abstract Reasoning , 2013 .

[8] David Wingate,et al. A Physics-Based Model Prior for Object-Oriented MDPs , 2014, ICML.

[9] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.

[12] Oliver Brock,et al. Learning state representations with robotic priors , 2015, Auton. Robots.

[13] Andrew Zisserman,et al. Spatial Transformer Networks , 2015, NIPS.

[14] Byron Boots,et al. Learning to Filter with Predictive State Inference Machines , 2015, ICML.

[15] Sergey Levine,et al. Backprop KF: Learning Discriminative Deterministic State Estimators , 2016, NIPS.

[16] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[17] Dieter Fox,et al. SE3-nets: Learning rigid body motion using deep neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[18] Stefano Ermon,et al. Label-Free Supervision of Neural Networks with Physics and Domain Knowledge , 2016, AAAI.

[19] Oliver Brock,et al. End-to-End Learnable Histogram Filters , 2017 .

[20] Razvan Pascanu,et al. Learning to Navigate in Complex Environments , 2016, ICLR.

[21] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.