B-Learning: A Reinforcement Learning Algorithm, Comparison with Dynamic Programming
[1] P. Villon, et al. A real-time optimal control algorithm for water treatment plants, 1993, System Modelling and Optimization.
[2] Long Ji Lin, et al. Programming Robots Using Reinforcement Learning and Teaching, 1991, AAAI.
[3] Ben J. A. Kröse, et al. Learning from delayed rewards, 1995, Robotics Auton. Syst.
[4] C.W. Anderson, et al. Learning to control an inverted pendulum using neural networks, 1989, IEEE Control Systems Magazine.
[5] Richard S. Sutton, et al. Neuronlike adaptive elements that can solve difficult learning control problems, 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[6] Stéphane Canu, et al. B-Learning: A Reinforcement Learning Variant for the Control of a Plant, 1994.