Improving the Generalization Performance of Multi-Layer-Perceptrons with Population-Based Incremental Learning

Based on Population-Based Incremental Learning (PBIL), we present a new approach for the evolution of neural network architectures and their corresponding weights. The main idea is to use a probability vector rather than bit strings to represent a population of networks in each generation. We show that crucial aspects of neural network training can be integrated effectively into the PBIL framework. First, a quasi-Newton method for local weight optimization is integrated, and the moving-average update rule of PBIL is extended to continuous parameters in order to transmit the best network to the next generation. Second, and more importantly, we incorporate cross-validation to focus the evolution towards networks with optimal generalization performance. A comparison with standard pruning and stopped-training algorithms shows that our approach effectively finds small networks with increased generalization ability.
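The mechanism named in the abstract, a probability vector nudged by a moving average toward the best sampled individual, is the standard PBIL loop. The following is a minimal sketch of that baseline over bit strings, with illustrative names (learning_rate, n_samples, pbil) that are assumptions, not taken from the paper; the paper's extensions (continuous weight parameters, quasi-Newton refinement, cross-validation-based fitness) would plug into the fitness evaluation and update steps.

```python
import numpy as np

rng = np.random.default_rng(0)

def pbil(fitness, n_bits, n_generations=100, n_samples=20, learning_rate=0.1):
    """Baseline PBIL: a probability vector replaces an explicit population.

    Each generation, a small population is sampled from the vector, and the
    vector is shifted toward the best individual by a moving-average update.
    """
    p = np.full(n_bits, 0.5)            # probability that each bit is 1
    best, best_fit = None, -np.inf
    for _ in range(n_generations):
        # Sample a population of bit strings from the probability vector.
        pop = (rng.random((n_samples, n_bits)) < p).astype(int)
        fits = np.array([fitness(x) for x in pop])
        winner = pop[fits.argmax()]
        if fits.max() > best_fit:
            best, best_fit = winner.copy(), fits.max()
        # Moving-average update toward the generation's best individual.
        p = (1.0 - learning_rate) * p + learning_rate * winner
    return best, best_fit

# Toy usage: maximize the number of ones ("OneMax").
best, fit = pbil(fitness=lambda x: x.sum(), n_bits=32)
print(best, fit)
```

For the continuous extension mentioned in the abstract, the same moving-average rule could plausibly be applied to real-valued quantities, e.g. shifting a mean weight vector toward the weights of the best locally optimized network; the exact update scheme used by the authors is specified in the paper body, not reproduced here.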