论文信息 - Automatic Programming of Behavior-Based Robots Using Reinforcement Learning - 字舞流文

Automatic Programming of Behavior-Based Robots Using Reinforcement Learning

Sridhar Mahadevan | Jonathan H. Connell | S. Mahadevan | J. Connell

[1] Leslie Pack Kaelbling,et al. Learning in embedded systems , 1993 .

[2] Steven D. Whitehead,et al. A Complexity Analysis of Cooperative Mechanisms in Reinforcement Learning , 1991, AAAI.

[3] Long Ji Lin,et al. Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[4] Lambert E. Wixson,et al. Scaling Reinforcement Learning Techniques via Modularity , 1991, ML.

[5] Satinder P. Singh,et al. Transfer of Learning Across Compositions of Sequentail Tasks , 1991, ML.

[6] Long-Ji Lin,et al. Self-improving reactive agents: case studies of reinforcement learning frameworks , 1991 .

[7] Benjamin Kuipers,et al. Learning hill-climbing functions as a strategy for generating behaviors in a mobile robot , 1991 .

[8] David R. Pierce,et al. Learning a Set of Primitive Actions with an Uninterpreted Sensorimotor Apparatus , 1991, ML.

[9] Gary L. Drescher,et al. Made-up minds - a constructivist approach to artificial intelligence , 1991 .

[10] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.

[11] Rodney A. Brooks,et al. Learning to Coordinate Behaviors , 1990, AAAI.

[12] Jonathan H. Connell,et al. Minimalist mobile robotics - a colony-style architecture for an artificial creature , 1990, Perspectives in artificial intelligence.

[13] Andrew K. C. Wong,et al. Performance Analysis of a Probabilistic Inductive Learning System , 1990, ML.

[14] Claude Sammut,et al. Is Learning Rate a Good Performance Criterion for Learning? , 1990, ML Workshop.

[15] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[16] Alan D. Christiansen,et al. Learning reliable manipulation strategies without initial physical models , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[17] Rodney A. Brooks,et al. The Behavior Language: User''s Guide , 1990 .

[18] Ming Tan,et al. Cost-Sensitive Concept Learning of Sensor Use in Approach ad Recognition , 1989, ML.

[19] Ronald L. Rivest,et al. Inference of finite automata using homing sequences , 1989, STOC '89.

[20] C. Watkins. Learning from delayed rewards , 1989 .

[21] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .

[22] Hans P. Moravec,et al. High resolution maps from wide angle sonar , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[23] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .

[24] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[25] Tom M. Mitchell,et al. Generalization as Search , 2002 .