First-Order Gradient Descent Training of Adaptive Discrete-Time Dynamic Networks

Abstract: This paper describes the training of discrete-time dynamic systems with adaptive parameters (recurrent neural networks) using first-order gradient descent algorithms. To facilitate the explanation of these algorithms, a standard representation of a discrete-time dynamic system is defined. Any differentiable discrete-time dynamic system may be put into this standard representation and trained using a gradient descent algorithm. Using the standard representation, we describe two general types of learning algorithms: the first is based upon the discrete-time Euler-Lagrange equations, and the second upon a recursive update of the output gradients. Both epochwise and on-line versions of these algorithms are presented. When the dynamic system is implemented by a neural network, the epochwise algorithm based on the Euler-Lagrange equations is equivalent to backpropagation-through-time, and the on-line method based on the recursive equation is the same as recursive backpropagation. The epochwise versions of the two algorithms are shown to be equivalent, while the on-line versions are shown to be approximately equivalent.
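To make the abstract's terminology concrete, the following is a minimal sketch of the epochwise, Euler-Lagrange-style gradient computation (backpropagation-through-time) for a small recurrent network with state update x(k+1) = tanh(W x(k) + U u(k)) and output y(k) = V x(k+1), trained on a sum-of-squares output error. The system, the symbols W, U, V, x, u, d, and the network form are illustrative assumptions for this sketch, not the paper's standard representation or notation.

```python
# Hedged sketch: epochwise gradient descent via a backward (adjoint) recursion,
# i.e., backpropagation-through-time for an assumed small recurrent network.
import numpy as np

rng = np.random.default_rng(0)
n_x, n_u, n_y, T = 3, 2, 1, 20               # state, input, output sizes; epoch length

W = rng.normal(scale=0.3, size=(n_x, n_x))   # recurrent weights (assumed parameters)
U = rng.normal(scale=0.3, size=(n_x, n_u))   # input weights
V = rng.normal(scale=0.3, size=(n_y, n_x))   # output weights
u = rng.normal(size=(T, n_u))                # input sequence (made-up data)
d = rng.normal(size=(T, n_y))                # desired output sequence (made-up data)

def forward(W, U, V):
    """Run the dynamics over one epoch; return states, pre-activations, outputs."""
    x = np.zeros((T + 1, n_x))
    a = np.zeros((T, n_x))
    y = np.zeros((T, n_y))
    for k in range(T):
        a[k] = W @ x[k] + U @ u[k]
        x[k + 1] = np.tanh(a[k])
        y[k] = V @ x[k + 1]
    return x, a, y

def bptt_gradients(W, U, V):
    """Epochwise gradients of the summed squared error via the backward recursion."""
    x, a, y = forward(W, U, V)
    e = y - d                                 # output errors at each time step
    gW, gU, gV = np.zeros_like(W), np.zeros_like(U), np.zeros_like(V)
    lam = np.zeros(n_x)                       # adjoint (costate) carried backward in time
    for k in reversed(range(T)):
        gV += np.outer(e[k], x[k + 1])
        # error reaching x(k+1): direct output error plus error from later time steps
        delta = (V.T @ e[k] + lam) * (1.0 - np.tanh(a[k]) ** 2)
        gW += np.outer(delta, x[k])
        gU += np.outer(delta, u[k])
        lam = W.T @ delta                     # propagate one step further back
    return gW, gU, gV

# One first-order gradient descent step per epoch (learning rate is arbitrary here).
lr = 0.05
gW, gU, gV = bptt_gradients(W, U, V)
W, U, V = W - lr * gW, U - lr * gU, V - lr * gV
```

The on-line (recursive backpropagation) counterpart described in the abstract would instead update a running sensitivity of the state with respect to the parameters at every time step, allowing a parameter update after each output error rather than once per epoch; the two approaches coincide exactly only in the epochwise setting, as the abstract states.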