B-Learning: A Reinforcement Learning Algorithm, Comparison with Dynamic Programming

In this paper we present a Reinforcement Learning method, B-Learning, for the control of a water production plant. B-Learning and Dynamic Programming are compared from both theoretical and performance points of view. It is shown that Reinforcement-based neural control can lead to results comparable in quality to those of Dynamic Programming-based control, while being less computationally expensive.