Neural dynamic optimization for control systems. I. Background

The paper presents neural dynamic optimization (NDO) as a method of optimal feedback control for nonlinear multi-input-multi-output (MIMO) systems. The main feature of NDO is that it enables neural networks to approximate the optimal feedback solution whose existence dynamic programming (DP) justifies, thereby reducing the complexities of computation and storage problems of the classical methods such as DP. This paper mainly describes the background and motivations for the development of NDO, while the two other subsequent papers of this topic present the theory of NDO and demonstrate the method with several applications including control of autonomous vehicles and of a robot arm, respectively.

[1]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[2]  Arthur E. Bryson,et al.  Dynamic Optimization , 1998 .

[3]  F. Fairman Introduction to dynamic systems: Theory, models and applications , 1979, Proceedings of the IEEE.

[4]  Gene F. Franklin,et al.  Feedback Control of Dynamic Systems , 1986 .

[5]  Thomas Kailath,et al.  Linear Systems , 1980 .

[6]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[7]  Stuart E. Dreyfus,et al.  Applied Dynamic Programming , 1965 .

[8]  Robert F. Stengel,et al.  Optimal Control and Estimation , 1994 .

[9]  Stephen P. Boyd,et al.  Linear controller design: limits of performance , 1991 .

[10]  Weiping Li,et al.  Applied Nonlinear Control , 1991 .

[11]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[12]  Gene F. Franklin,et al.  Digital control of dynamic systems , 1980 .

[13]  A. Isidori Nonlinear Control Systems: An Introduction , 1986 .

[14]  Bernard Widrow,et al.  Neural dynamic optimization for control systems.II. Theory , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[15]  R. Bellman Dynamic programming. , 1957, Science.

[16]  Bernard Widrow,et al.  30 years of adaptive neural networks: perceptron, Madaline, and backpropagation , 1990, Proc. IEEE.

[17]  Frank L. Lewis,et al.  Optimal Control , 1986 .

[18]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[19]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[20]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[21]  C. H. Sequin,et al.  Fault tolerance in artificial neural networks , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[22]  Robert E. Kalaba,et al.  Dynamic Programming and Modern Control Theory , 1966 .