Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
 Richard S. Sutton, et al. Dyna, an integrated architecture for learning, planning, and reacting, 1990, SGAR.
 Richard S. Sutton, et al. Planning by Incremental Dynamic Programming, 1991, ML.
 Richard E. Korf, et al. Real-Time Heuristic Search, 1990, Artif. Intell.
 Michael C. Mozer, et al. Discovering the Structure of a Reactive Environment by Exploration, 1990, Neural Computation.
 Stuart J. Russell. Execution Architectures and Compilation, 1989, IJCAI.
 Richard S. Sutton, et al. Sequential Decision Problems and Neural Networks, 1989, NIPS.
 J. W. Moore. Learning and Sequential Decision Making, 1989.
 Robert E. Schapire, et al. A new approach to unsupervised learning in deterministic environments, 1990.
 Charles W. Anderson, et al. Strategy Learning with Multilayer Connectionist Representations, 1987.
 Paul J. Werbos, et al. Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research, 1987, IEEE Transactions on Systems, Man, and Cybernetics.
 Geoffrey E. Hinton, et al. Schemata and Sequential Thought Processes in PDP Models, 1986.
 Richard S. Sutton, et al. Temporal credit assignment in reinforcement learning, 1984.
 Richard S. Sutton, et al. Neuronlike adaptive elements that can solve difficult learning control problems, 1983, IEEE Transactions on Systems, Man, and Cybernetics.
 D. Dennett. Why the Law of Effect will not Go Away, 1975.
 R. Howard. Dynamic Programming and Markov Processes, 1960.
 W. H. F. Barnes. The Nature of Explanation, 1944, Nature.