Paper Information - An EM Based Training Algorithm for Recurrent Neural Networks

An EM Based Training Algorithm for Recurrent Neural Networks

Recurrent neural networks serve as black-box models for nonlinear dynamical system identification and time series prediction. Training of recurrent networks typically minimizes the quadratic difference between the network output and an observed time series. This implicitly assumes that the dynamics of the underlying system are deterministic, which is not a realistic assumption in many cases. In contrast, state-space models allow for noise in both the internal state transitions and the mapping from internal states to observations. Here, we consider recurrent networks as nonlinear state-space models and suggest a training algorithm based on Expectation-Maximization. A nonlinear transfer function for the hidden neurons leads to an intractable inference problem. We investigate the use of a particle smoother to approximate the E-step and simultaneously estimate the expectations required in the M-step. The method is demonstrated on a synthetic data set and on a time series prediction task arising in radiation therapy, where the goal is to predict the motion of a lung tumor during respiration.
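To make the state-space view concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical one-neuron recurrent network with a tanh transfer function, Gaussian noise on both the state transition and the observation, and runs a plain bootstrap particle filter (forward pass only) to approximate the filtered state means. The paper's particle smoother and the EM parameter updates are not reproduced here; the names `simulate`, `particle_filter`, and the parameters `w`, `c`, `q`, `r` are illustrative assumptions.

```python
import math
import random


def simulate(w, c, q, r, T, rng):
    """Sample a trajectory from a one-neuron stochastic recurrent network.

    State:       x_t = tanh(w * x_{t-1}) + Gaussian noise with std q
    Observation: y_t = c * x_t + Gaussian noise with std r
    (Illustrative model; all parameter names are assumptions.)
    """
    x, xs, ys = 0.0, [], []
    for _ in range(T):
        x = math.tanh(w * x) + rng.gauss(0.0, q)
        xs.append(x)
        ys.append(c * x + rng.gauss(0.0, r))
    return xs, ys


def particle_filter(ys, w, c, q, r, n_particles, rng):
    """Bootstrap particle filter: approximate E[x_t | y_1..y_t] for each t."""
    particles = [0.0] * n_particles
    means = []
    for y in ys:
        # Propagate every particle through the noisy state transition.
        particles = [math.tanh(w * p) + rng.gauss(0.0, q) for p in particles]
        # Weight particles by the (unnormalised) Gaussian observation likelihood.
        weights = [math.exp(-0.5 * ((y - c * p) / r) ** 2) for p in particles]
        total = sum(weights)
        if total == 0.0:
            # All weights underflowed: fall back to uniform weights.
            weights = [1.0] * n_particles
            total = float(n_particles)
        weights = [wt / total for wt in weights]
        # Weighted average of the particles estimates the filtered mean.
        means.append(sum(wt * p for wt, p in zip(weights, particles)))
        # Multinomial resampling to counter weight degeneracy.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return means


rng = random.Random(0)
xs, ys = simulate(w=1.5, c=1.0, q=0.1, r=0.2, T=50, rng=rng)
means = particle_filter(ys, w=1.5, c=1.0, q=0.1, r=0.2, n_particles=200, rng=rng)
```

In an EM scheme of the kind the abstract describes, weighted particle averages like these would stand in for the intractable expectations of the E-step. A smoother would additionally condition each state on the full observation sequence rather than only on past observations.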
