An approach to learning mobile robot navigation

This paper describes an approach to learning an indoor robot navigation task through trial-and-error. A mobile robot, equipped with visual, ultrasonic and laser sensors, learns to servo to a designated target object. In less than ten minutes of operation time, the robot is able to navigate to a marked target object in an office environment. The central learning mechanism is the explanation-based neural network learning algorithm (EBNN). EBNN initially learns function purely inductively using neural network representations. With increasing experience, EBNN employs domain knowledge to explain and to analyze training data in order to generalize in a more knowledgeable way. Here EBNN is applied in the context of reinforcement learning, which allows the robot to learn control using dynamic programming.

[1]  Tom M. Mitchell,et al.  Becoming Increasingly Reactive , 1990, AAAI.

[2]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[3]  Michael I. Jordan,et al.  MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .

[4]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[5]  Micha Sharir,et al.  Planning, geometry, and complexity of robot motion , 1986 .

[6]  Sebastian Thrun,et al.  Integrating Inductive Neural Network Learning and Explanation-Based Learning , 1993, IJCAI.

[7]  Vijaykumar Gullapalli,et al.  Reinforcement learning and its application to control , 1992 .

[8]  Raymond J. Mooney,et al.  Theory Refinement with Noisy Data , 1991 .

[9]  Bernard Widrow,et al.  The basic ideas in neural networks , 1994, CACM.

[10]  Paul S. Rosenbloom,et al.  Knowledge Level and Inductive Uses of Chunking (EBL) , 1990, AAAI.

[11]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[12]  Tom M. Mitchell,et al.  Explanation-based learning for mobile-robot perception , 1997 .

[13]  Yann LeCun,et al.  Tangent Prop - A Formalism for Specifying Selected Invariances in an Adaptive Network , 1991, NIPS.

[14]  Sebastian Thrun,et al.  Learning One More Thing , 1994, IJCAI.

[15]  TesauroGerald Practical Issues in Temporal Difference Learning , 1992 .

[16]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[17]  V. Gullapalli,et al.  Acquiring robot skills via reinforcement learning , 1994, IEEE Control Systems.

[18]  R. Masuoka,et al.  Noise robustness of EBNN learning , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[19]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[20]  Michael Hucka,et al.  Learning in Tele-autonomous Systems using Soar , 1989 .

[21]  Michael J. Pazzani,et al.  A Knowledge-intensive Approach to Learning Relational Concepts , 1991, ML.

[22]  Philip D. Wasserman,et al.  Neural computing - theory and practice , 1989 .

[23]  Sebastian Thrun,et al.  Explanation-Based Neural Network Learning for Robot Control , 1992, NIPS.

[24]  Francesco Bergadano,et al.  Guiding induction with domain theories , 1990 .

[25]  Gerald Tesauro,et al.  Practical Issues in Temporal Difference Learning , 1992, Mach. Learn..

[26]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[27]  Sridhar Mahadevan,et al.  Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..

[28]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[29]  Richard S. Sutton,et al.  Integrated Modeling and Control Based on Reinforcement Learning and Dynamic Programming , 1990, NIPS 1990.

[30]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[31]  Rodney A. Brooks,et al.  Learning to Coordinate Behaviors , 1990, AAAI.

[32]  Lawrence Birnbaum,et al.  Proceedings of the eighth international workshop on Machine learning , 1991 .

[33]  T. Michael Knasel,et al.  Robotics and autonomous systems , 1988, Robotics Auton. Syst..

[34]  Dean Pomerleau,et al.  ALVINN, an autonomous land vehicle in a neural network , 2015 .

[35]  WidrowBernard,et al.  The basic ideas in neural networks , 1994 .

[36]  Sebastian Thrun,et al.  Learning to Play the Game of Chess , 1994, NIPS.