Integrated Modeling and Control Based on Reinforcement Learning and Dynamic Programming
[1] Richard E. Korf, et al. Real-Time Heuristic Search, 1990, Artif. Intell.
[2] Dana H. Ballard, et al. A Role for Anticipation in Reactive Systems that Learn, 1989, ML Workshop.
[3] Richard S. Sutton, et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming, 1990, ML.
[4] James L. McClelland, et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations, 1986.
[5] Richard S. Sutton, et al. Temporal credit assignment in reinforcement learning, 1984.
[6] Paul J. Werbos, et al. Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research, 1987, IEEE Transactions on Systems, Man, and Cybernetics.
[7] R. Bellman. Dynamic programming, 1957, Science.
[8] Dimitri P. Bertsekas, et al. Dynamic Programming: Deterministic and Stochastic Models, 1987.