Learning Sequential Structure in Simple Recurrent Networks

We explore a network architecture introduced by Elman (1988) for predicting successive elements of a sequence. The network uses the pattern of activation over a set of hidden units from time step t-1, together with element t, to predict element t+1. When the network is trained with strings from a particular finite-state grammar, it can learn to be a perfect finite-state recognizer for the grammar. Cluster analyses of the hidden-layer patterns of activation show that they encode prediction-relevant information about the entire path traversed through the grammar. We illustrate the phases of learning with cluster analyses performed at different points during training.
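
To make the architecture concrete, the following is a minimal sketch of such a simple recurrent network in Python with NumPy. All specifics are illustrative assumptions rather than details from the paper: the layer sizes, the one-hot symbol coding, the logistic activation, and the use of SciPy's hierarchical clustering as a stand-in for the cluster analyses described above. In Elman's formulation, the hidden pattern from time step t-1 is copied to a bank of "context" units that feed the hidden layer; the recurrent weight matrix below plays that role.

    # Minimal simple recurrent network (SRN) forward pass: an illustrative
    # sketch, not the authors' implementation. Sizes and names are assumed.
    import numpy as np
    from scipy.cluster.hierarchy import linkage

    rng = np.random.default_rng(0)

    n_symbols = 7    # one input/output unit per grammar symbol (assumed)
    n_hidden = 15    # hidden-layer size (assumed)

    # Weights: input-to-hidden, context-to-hidden, hidden-to-output.
    W_xh = rng.normal(scale=0.1, size=(n_hidden, n_symbols))
    W_hh = rng.normal(scale=0.1, size=(n_hidden, n_hidden))
    W_hy = rng.normal(scale=0.1, size=(n_symbols, n_hidden))

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def step(x_t, h_prev):
        """Combine element t (x_t) with the hidden pattern from time
        step t-1 (h_prev) to predict element t+1."""
        h_t = sigmoid(W_xh @ x_t + W_hh @ h_prev)  # new hidden/context pattern
        y_t = sigmoid(W_hy @ h_t)                  # activations over successors
        return h_t, y_t

    # Run the (untrained) network over one string of symbol indices.
    string = [0, 2, 4, 1]               # hypothetical symbol sequence
    h = np.zeros(n_hidden)              # empty context at the string's start
    hidden_trace = []
    for s in string:
        x = np.eye(n_symbols)[s]        # one-hot coding of element t
        h, y = step(x, h)
        hidden_trace.append(h.copy())   # collected for later cluster analysis

    # Hidden patterns gathered over many strings can be grouped by
    # hierarchical clustering, analogous to the analyses described above.
    Z = linkage(np.vstack(hidden_trace), method="average")

After training on the prediction task, clustering such hidden-layer patterns is what reveals the grammar's node, and eventually path, structure; the untrained run above only shows where those patterns come from.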