论文信息 - Learning long-range vision for an offroad robot

Learning long-range vision for an offroad robot

Teaching a robot to perceive and navigate in an unstructured natural world is a difficult task. Without learning, navigation systems are short-range and extremely limited. With learning, the robot can be taught to classify terrain at longer distances, but these classifiers can be fragile as well, leading to extremely conservative planning. A robust, high-level learning-based perception system for a mobile robot needs to continually learn and adapt as it explores new environments. To do this, a strong feature representation is necessary that can encode meaningful, discriminative patterns as well as invariance to irrelevant transformations. A simple realtime classifier can then be trained on those features to predict the traversability of the current terrain. One such method for learning a feature representation is discussed in detail in this work. Dimensionality reduction by learning an invariant mapping (DrLIM) is a weakly supervised method for learning a similarity measure over a domain. Given a set of training samples and their pairwise relationships, which can be arbitrarily defined, DrLIM can be used to learn a function that is invariant to complex transformations of the inputs such as shape distortion and rotation. The main contribution of this work is a self-supervised learning process for long-range vision that is able to accurately classify complex terrain, permitting improved strategic planning. As a mobile robot moves through offroad environments, it learns traversability from a stereo obstacle detector. The learning architecture is composed of a static feature extractor, trained offline for a general yet discriminative feature representation, and an adaptive online classifier. This architecture reduces the effect of concept drift by allowing the online classifier to quickly adapt to very few training samples without overtraining. After experiments with several different learned feature extractors, we conclude that unsupervised or weakly supervised learning methods are necessary for training general feature representations for natural scenes. The process was developed and tested on the LAGR mobile robot as part of a fully autonomous vision-based navigation system.

Yann LeCun | Raia Hadsell | Yann LeCun | R. Hadsell

[1] D. Hubel,et al. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[2] Richard O. Duda,et al. Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[3] A. Meltzoff,et al. Intermodal matching by human neonates , 1979, Nature.

[4] J. Baird,et al. The locus of environmental attention , 1981 .

[5] Edward H. Adelson,et al. PYRAMID METHODS IN IMAGE PROCESSING. , 1984 .

[6] Takeo Kanade,et al. Vision and Navigation for the Carnegie-Mellon Navlab , 1987 .

[7] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .

[8] D.J. Kriegman,et al. Stereo vision and navigation in buildings for mobile robots , 1989, IEEE Trans. Robotics Autom..

[9] M. Turk,et al. Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[10] R. Axel,et al. A novel multigene family may encode odorant receptors: A molecular basis for odor recognition , 1991, Cell.

[11] Steven A. Shafer,et al. Anatomy of a color histogram , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12] Joachim M. Buhmann,et al. Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[13] Yann LeCun,et al. Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..

[14] Dean A. Pomerleau,et al. Knowledge-Based Training of Artificial Neural Networks for Autonomous Robot Driving , 1993 .

[15] T. Bower,et al. Learning and Intermodal Transfer of Information in Newborns , 1994 .

[16] Charles E. Thorpe,et al. Vision-based neural network road and intersection detection and traversal , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[17] David G. Lowe,et al. Similarity Metric Learning for a Variable-Kernel Classifier , 1995, Neural Computation.

[18] Martial Hebert,et al. Mapping and positioning for a prototype lunar rover , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[19] Pierrick Grandjean,et al. Fast cross-country navigation on fair terrains , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[20] Erann Gat,et al. Mars microrover navigation: performance evaluation and enhancement , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[21] Yann LeCun,et al. Transformation Invariance in Pattern Recognition-Tangent Distance and Tangent Propagation , 1996, Neural Networks: Tricks of the Trade.

[22] Ah Chung Tsoi,et al. Face recognition: a convolutional neural-network approach , 1997, IEEE Trans. Neural Networks.

[23] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24] David J. Kriegman,et al. Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[25] Anil K. Jain,et al. Object detection using gabor filters , 1997, Pattern Recognit..

[26] Marilena Vendittelli,et al. Fuzzy maps: A new tool for mobile robot perception and planning , 1997, J. Field Robotics.

[27] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[28] Bernhard Schölkopf,et al. Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[29] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .

[30] Hyeonjoon Moon,et al. The FERET verification testing protocol for face recognition algorithms , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[31] Sebastian Thrun,et al. Learning Metric-Topological Maps for Indoor Mobile Robot Navigation , 1998, Artif. Intell..

[32] Aleix M. Martinez,et al. The AR face database , 1998 .

[33] Alonzo Kelly,et al. Stereo Vision Enhancements for Low-Cost Outdoor Autonomous Vehicles , 1998 .

[34] T. Poggio,et al. Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[35] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[36] Pascal Vincent,et al. A Neural Support Vector Network architecture with adaptive kernels , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[37] Narendra Ahuja,et al. Face recognition using kernel eigenfaces , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[38] J. Tenenbaum,et al. A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[39] Illah R. Nourbakhsh,et al. Appearance-Based Obstacle Detection with Monocular Color Vision , 2000, AAAI/IAAI.

[40] Roberto Manduchi,et al. Terrain perception for DEMO III , 2000, Proceedings of the IEEE Intelligent Vehicles Symposium 2000 (Cat. No.00TH8511).

[41] S T Roweis,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[42] Reid G. Simmons,et al. Recent progress in local and global traversability for planetary rovers , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[43] Yasushi Yagi,et al. Reactive Visual Navigation Based on Omnidirectional Sensing – Path Following and Collision Avoidance , 2001, J. Intell. Robotic Syst..

[44] Mikhail Belkin,et al. Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[45] Michael I. Jordan,et al. On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[46] Antonio Torralba,et al. Statistical Context Priming for Object Detection , 2001, ICCV.

[47] Tommy Chang,et al. Road detection and tracking for autonomous mobile robots , 2002, SPIE Defense + Commercial Sensing.

[48] Geoffrey E. Hinton,et al. Stochastic Neighbor Embedding , 2002, NIPS.

[49] Avinash C. Kak,et al. Vision for Mobile Robot Navigation: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[50] Larry Matthies,et al. Stereo vision and rover navigation software for planetary exploration , 2002, Proceedings, IEEE Aerospace Conference.

[51] R.J. Marks,et al. Implicit learning in autoencoder novelty assessment , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[52] Antonio Torralba,et al. Depth Estimation from Image Structure , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[53] James P. Ostrowski,et al. Visual motion planning for mobile robots , 2002, IEEE Trans. Robotics Autom..

[54] Martial Hebert,et al. Training Object Detection Models with Weakly Labeled Data , 2002, BMVC.

[55] Ben Southall,et al. Stereo perception on an off-road vehicle , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[56] Aleix M. Martínez,et al. Recognizing Imprecisely Localized, Partially Occluded, and Expression Variant Faces from a Single Sample per Class , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[57] Nicolas Le Roux,et al. Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering , 2003, NIPS.

[58] Antonio Torralba,et al. Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes , 2003, NIPS.

[59] G. Ulivi,et al. Indoor robot navigation using log-polar local maps , 2003 .

[60] Martial Hebert,et al. Toward generating labeled maps from color and range data for robot navigation , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[61] Martial Hebert,et al. Where and when to look: how to extend the myopic planning horizon , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[62] Sven Behnke,et al. Local Multiresolution Path Planning , 2003, RoboCup.

[63] Anthony Stentz,et al. Online adaptive rough-terrain navigation vegetation , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[64] Geoffrey E. Hinton,et al. Neighbourhood Components Analysis , 2004, NIPS.

[65] Kilian Q. Weinberger,et al. Learning a kernel matrix for nonlinear dimensionality reduction , 2004, ICML.

[66] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[67] Martial Hebert,et al. Classifier fusion for outdoor obstacle detection , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[68] Gerald Tesauro,et al. Practical issues in temporal difference learning , 1992, Machine Learning.

[69] Kunihiko Fukushima,et al. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[70] Yann LeCun,et al. Synergistic Face Detection and Pose Estimation with Energy-Based Models , 2004, J. Mach. Learn. Res..

[71] Martial Hebert,et al. Natural terrain classification using 3-d ladar data , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[72] Antonio Torralba,et al. Contextual Models for Object Detection Using Boosted Random Fields , 2004, NIPS.

[73] Martial Hebert,et al. Enabling learning from large datasets: applying active learning to mobile robotics , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[74] Y. LeCun,et al. Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[75] Ashutosh Saxena,et al. High speed obstacle avoidance using monocular vision and reinforcement learning , 2005, ICML.

[76] Martial Hebert,et al. Semi-Supervised Self-Training of Object Detection Models , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[77] Antonio Torralba,et al. Describing Visual Scenes using Transformed Dirichlet Processes , 2005, NIPS.

[78] M. Leordeanu,et al. Unsupervised learning of object features from video sequences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[79] Martial Hebert,et al. A hierarchical field framework for unified context-based classification , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[80] Jitendra Malik,et al. Efficient shape matching using shape contexts , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[81] Sebastian Thrun,et al. Adaptive Road Following using Self-Supervised Learning and Reverse Optical Flow , 2005, Robotics: Science and Systems.

[82] Antonio Torralba,et al. Learning hierarchical models of scenes, objects, and parts , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[83] Larry H. Matthies,et al. Stereo-Based Tree Traversability Analysis for Autonomous Off-Road Navigation , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[84] Alexei A. Efros,et al. Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[85] Yann LeCun,et al. Off-Road Obstacle Avoidance through End-to-End Learning , 2005, NIPS.

[86] Roberto Manduchi,et al. Obstacle Detection and Terrain Classification for Autonomous Off-Road Navigation , 2005, Auton. Robots.

[87] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[88] Yann LeCun,et al. Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[89] Sebastian Scherer,et al. Learning obstacle avoidance parameters from operator behavior , 2006, J. Field Robotics.

[90] Robert C. Bolles,et al. Outdoor Mapping and Navigation Using Stereo Vision , 2006, ISER.

[91] Alonzo Kelly,et al. Toward Reliable Off Road Autonomous Vehicles Operating in Challenging Environments , 2006, Int. J. Robotics Res..

[92] Yoshua Bengio,et al. Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[93] Alexei A. Efros,et al. Putting Objects in Perspective , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[94] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[95] Yann LeCun,et al. Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[96] Alexei A. Efros,et al. Recovering Surface Layout from an Image , 2007, International Journal of Computer Vision.

[97] James M. Rehg,et al. Traversability classification using unsupervised on-line visual learning for outdoor robot navigation , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[98] Sebastian Thrun,et al. A Self-Supervised Terrain Roughness Estimator for Off-Road Autonomous Driving , 2006, UAI.

[99] S. Chopra,et al. On-Line Learning of Long-Range Obstacle Detection for Off-Road Robots , 2006 .

[100] Alberto Broggi,et al. A decision network based frame-work for visual off-road path detection problem , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[101] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[102] Sebastian Thrun,et al. Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.

[103] Sebastian Thrun,et al. Self-supervised Monocular Road Detection in Desert Terrain , 2006, Robotics: Science and Systems.

[104] Michael Happold,et al. Enhancing Supervised Terrain Classification with Predictive Unsupervised Learning , 2006, Robotics: Science and Systems.

[105] Cordelia Schmid,et al. Combining Regions and Patches for Object Class Localization , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[106] J. Andrew Bagnell,et al. Improving robot navigation through self‐supervised online learning , 2006, J. Field Robotics.

[107] Gregory Z. Grudic,et al. Outdoor Path Labeling Using Polynomial Mahalanobis Distance , 2006, Robotics: Science and Systems.

[108] Eric Krotkov,et al. The DARPA LAGR program: Goals, challenges, methodology, and phase I results , 2006, J. Field Robotics.

[109] Gang Hua,et al. Discriminant Embedding for Local Image Descriptors , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[110] Andrew E. Johnson,et al. Computer Vision on Mars , 2007, International Journal of Computer Vision.

[111] Urs A. Muller,et al. SPEED-RANGE DILEMMAS FOR VISION-BASED NAVIGATION IN UNSTRUCTURED TERRAIN , 2007 .

[112] Marc'Aurelio Ranzato,et al. A Unified Energy-Based Framework for Unsupervised Learning , 2007, AISTATS.

[113] Marc'Aurelio Ranzato,et al. Sparse Feature Learning for Deep Belief Networks , 2007, NIPS.

[114] Pietro Perona,et al. Dimensionality Reduction Using Automatic Supervision for Vision-Based Terrain Learning , 2007, Robotics: Science and Systems.

[115] Urs A. Muller,et al. A multi-range vision strategy for autonomous offroad navigation , 2007 .

[116] Matthew A. Brown,et al. Learning Local Image Descriptors , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[117] Yoshua Bengio,et al. Scaling learning algorithms towards AI , 2007 .

[118] Marc'Aurelio Ranzato,et al. Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[119] Eric Krotkov,et al. The DARPA PerceptOR evaluation experiments , 2007, Auton. Robots.

[120] Gregory Z. Grudic,et al. Online Learning of Multiple Perceptual Models for Navigation in Unknown Terrain , 2007, FSR.

[121] M. Maimone,et al. Overview of the Mars Exploration Rovers ’ Autonomous Mobility and Vision Capabilities , 2007 .

[122] Yann LeCun,et al. Mapping and planning under uncertainty in mobile robots with long-range perception , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[123] M. Andermann,et al. Embodied Information Processing: Vibrissa Mechanics and Texture Features Shape Micromotions in Actively Sensing Rats , 2008, Neuron.

[124] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[125] Jason Weston,et al. Deep learning via semi-supervised embedding , 2008, ICML '08.

[126] C. Stachniss,et al. Online Learning for Offroad Robots: Using Spatial Label Propagation to Learn Long-Range Traversability , 2008 .

[127] Heng Tao Shen,et al. Principal Component Analysis , 2009, Encyclopedia of Biometrics.