Navigation and planning in latent maps

We investigate the problem of agents planning paths to efficiently traverse environments only observed through the agent’s limited and local sensors such as laser range finders. We use a probabilistic sequential latent-variable model with an explicit global map. The entire model is trained only from local sensor readings, leveraging recent advances in approximate Bayesian inference. The generative nature of the model allows us to plan traversals of the real environment solely based on simulations within the learnt model. Further, we can exploit the Euclidean geometry imposed on our map to make use of well-known path planning algorithms. We showcase this ability by reliably and efficiently finding optimal paths in random mazes.

[1]  Nils J. Nilsson,et al.  A Formal Basis for the Heuristic Determination of Minimum Cost Paths , 1968, IEEE Trans. Syst. Sci. Cybern..

[2]  Kevin P. Murphy,et al.  Bayesian Map Learning in Dynamic Environments , 1999, NIPS.

[3]  Sebastian Thrun,et al.  Path Planning for Autonomous Vehicles in Unknown Semi-structured Environments , 2010, Int. J. Robotics Res..

[4]  Daan Wierstra,et al.  Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[5]  Richard E. Turner,et al.  Neural Adaptive Sequential Monte Carlo , 2015, NIPS.

[6]  Uri Shalit,et al.  Deep Kalman Filters , 2015, ArXiv.

[7]  Philip H. S. Torr,et al.  Playing Doom with SLAM-Augmented Deep Reinforcement Learning , 2016, ArXiv.

[8]  John J. Leonard,et al.  Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age , 2016, IEEE Transactions on Robotics.

[9]  Honglak Lee,et al.  Control of Memory, Active Perception, and Action in Minecraft , 2016, ICML.

[10]  Ole Winther,et al.  Sequential Neural Models with Stochastic Layers , 2016, NIPS.

[11]  Maximilian Karl,et al.  Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data , 2016, ICLR.

[12]  Yee Whye Teh,et al.  Filtering Variational Objectives , 2017, NIPS.

[13]  Vladlen Koltun,et al.  Semi-parametric Topological Memory for Navigation , 2018, ICLR.

[14]  Fabio Viola,et al.  Generative Temporal Models with Spatial Memory for Partially Observed Environments , 2018, ICML.

[15]  Ruslan Salakhutdinov,et al.  Neural Map: Structured Memory for Deep Reinforcement Learning , 2017, ICLR.

[16]  Xueyan Jiang,et al.  Metrics for Deep Generative Models , 2017, AISTATS.

[17]  Rahul Sukthankar,et al.  Cognitive Mapping and Planning for Visual Navigation , 2017, International Journal of Computer Vision.