Automatic Programming of Behavior-Based Robots Using Reinforcement Learning

[1]  Leslie Pack Kaelbling,et al.  Learning in embedded systems , 1993 .

[2]  Steven D. Whitehead,et al.  A Complexity Analysis of Cooperative Mechanisms in Reinforcement Learning , 1991, AAAI.

[3]  Long-Ji Lin,et al.  Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[4]  Lambert E. Wixson,et al.  Scaling Reinforcement Learning Techniques via Modularity , 1991, ML.

[5]  Satinder P. Singh,et al.  Transfer of Learning Across Compositions of Sequential Tasks , 1991, ML.

[6]  Long-Ji Lin,et al.  Self-improving reactive agents: case studies of reinforcement learning frameworks , 1991 .

[7]  Benjamin Kuipers,et al.  Learning hill-climbing functions as a strategy for generating behaviors in a mobile robot , 1991 .

[8]  David R. Pierce,et al.  Learning a Set of Primitive Actions with an Uninterpreted Sensorimotor Apparatus , 1991, ML.

[9]  Gary L. Drescher,et al.  Made-up minds - a constructivist approach to artificial intelligence , 1991 .

[10]  Dana H. Ballard,et al.  Active Perception and Reinforcement Learning , 1990, Neural Computation.

[11]  Rodney A. Brooks,et al.  Learning to Coordinate Behaviors , 1990, AAAI.

[12]  Jonathan H. Connell,et al.  Minimalist mobile robotics - a colony-style architecture for an artificial creature , 1990, Perspectives in artificial intelligence.

[13]  Andrew K. C. Wong,et al.  Performance Analysis of a Probabilistic Inductive Learning System , 1990, ML.

[14]  Claude Sammut,et al.  Is Learning Rate a Good Performance Criterion for Learning? , 1990, ML Workshop.

[15]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[16]  Alan D. Christiansen,et al.  Learning reliable manipulation strategies without initial physical models , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[17]  Rodney A. Brooks,et al.  The Behavior Language: User's Guide , 1990 .

[18]  Ming Tan,et al.  Cost-Sensitive Concept Learning of Sensor Use in Approach and Recognition , 1989, ML.

[19]  Ronald L. Rivest,et al.  Inference of finite automata using homing sequences , 1989, STOC '89.

[20]  C. Watkins.  Learning from delayed rewards , 1989 .

[21]  Rodney A. Brooks,et al.  A Robust Layered Control System for a Mobile Robot , 1986 .

[22]  Hans P. Moravec,et al.  High resolution maps from wide angle sonar , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[23]  Richard S. Sutton,et al.  Temporal credit assignment in reinforcement learning , 1984 .

[24]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[25]  Tom M. Mitchell,et al.  Generalization as Search , 1982 .