Sequence learning through PIPE and automatic task decomposition

Analog gradient-based recurrent neural nets can learn complex prediction tasks. Most, however, tend to fail in the case of long minimal time lags between relevant training events. Discrete methods such as search in a space of event-memorizing programs, on the other hand, are not necessarily affected by long time lags at all: we show that discrete "Probabilistic Incremental Program Evolution" (PIPE) can solve several long time lag tasks that have been successfully solved by only one analog method ("Long Short-Term Memory" — LSTM). In fact, sometimes PIPE even outperforms LSTM. Existing discrete methods, however, cannot easily deal with problems whose solutions exhibit comparatively high algorithmic complexity. We overcome this drawback by introducing filtering, a novel, general, data-driven divide-and-conquer technique for automatic task decomposition that is not limited to any particular learning method. We compare PIPE plus filtering to various analog recurrent net methods.
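
To make the discrete program-search idea concrete, the following is a minimal, illustrative sketch of a PIPE-style loop: candidate programs are sampled from a probability model, evaluated on the task, and the model's probabilities are shifted toward the best program found so far. The flat instruction list, the toy instruction set, and the placeholder fitness function are simplifying assumptions made for this sketch only; PIPE itself samples program trees from a probabilistic prototype tree and uses the task's actual training data for evaluation.

```python
# Illustrative PIPE-style search loop (simplified sketch, not the paper's implementation):
# sample candidate programs from a probability model, evaluate them,
# and shift the model's probabilities toward the best program found.
import random

SYMBOLS = ["READ", "STORE", "RECALL", "OUTPUT0", "OUTPUT1"]  # toy instruction set (assumed)
PROGRAM_LEN = 6
LEARNING_RATE = 0.2

def sample_program(probs):
    """Draw one program: each position picks a symbol according to its distribution."""
    return [random.choices(SYMBOLS, weights=p)[0] for p in probs]

def fitness(program):
    """Placeholder score; a real task would run the program on training sequences."""
    return sum(1 for s in program if s in ("STORE", "RECALL"))

def adapt(probs, best):
    """Increase the probability of the best program's symbols, then renormalize."""
    for pos, sym in enumerate(best):
        probs[pos][SYMBOLS.index(sym)] += LEARNING_RATE
        total = sum(probs[pos])
        probs[pos] = [w / total for w in probs[pos]]

# Start from a uniform distribution over symbols at every program position.
probs = [[1.0 / len(SYMBOLS)] * len(SYMBOLS) for _ in range(PROGRAM_LEN)]
best, best_fit = None, float("-inf")
for generation in range(50):
    candidates = [sample_program(probs) for _ in range(20)]
    gen_best = max(candidates, key=fitness)
    if fitness(gen_best) > best_fit:
        best, best_fit = gen_best, fitness(gen_best)
    adapt(probs, best)

print(best, best_fit)
```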