Paper information - Genetic programming using a minimum description length principle

Genetic programming using a minimum description length principle

This paper introduces a Minimum Description Length (MDL) principle to define fitness functions in Genetic Programming (GP). In traditional (Koza-style) GP, the size of trees was usually controlled by user-defined parameters, such as the maximum number of nodes and the maximum tree depth. Large tree sizes meant that the time needed to measure their fitness often dominated total processing time. To overcome this difficulty, we introduce a method for controlling tree growth that uses an MDL principle. We first choose a "decision tree" representation for the GP chromosomes, and then show how an MDL principle can be used to define GP fitness functions. We then apply the MDL-based fitness functions to some practical problems. Using our implemented system "STROGANOFF", we show how MDL-based fitness functions can be applied successfully to pattern recognition problems. The results demonstrate that our approach generalizes better than conventional neural networks.
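As a rough illustration of the idea in the abstract, the sketch below scores a candidate tree with a generic two-part MDL criterion: one term for the cost of the remaining errors and one for the cost of the model itself. The specific formula, the function name `mdl_fitness`, and the sample numbers are assumptions chosen for illustration; the abstract does not state the exact fitness definition used in STROGANOFF.

```python
import math

def mdl_fitness(residuals, num_params):
    """Generic two-part MDL score (lower is better).

    Assumed form: 0.5 * N * log(MSE) + 0.5 * k * log(N),
    where N is the number of samples and k the number of parameters.
    """
    n = len(residuals)
    mse = sum(r * r for r in residuals) / n
    mse = max(mse, 1e-12)  # guard against log(0) on a perfect fit
    error_cost = 0.5 * n * math.log(mse)         # cost of encoding the errors
    model_cost = 0.5 * num_params * math.log(n)  # cost of encoding the tree
    return error_cost + model_cost

# Hypothetical comparison: a small tree versus a much larger tree
# whose error is only marginally lower.
small = mdl_fitness([0.1] * 100, num_params=6)
large = mdl_fitness([0.099] * 100, num_params=60)
print(small < large)
```

Under this assumed criterion the smaller tree receives the better (lower) score, because the larger tree's slight gain in accuracy does not pay for its extra parameters. This is the sense in which an MDL-based fitness can limit tree growth without a hard cap on node count or depth.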

[1] John R. Koza, et al. Genetic programming - on the programming of computers by means of natural selection, 1993, Complex adaptive systems.

[2] A. G. Ivakhnenko, et al. Polynomial Theory of Complex Systems, 1971, IEEE Trans. Syst. Man Cybern.

[3] J. Rissanen, et al. Modeling By Shortest Data Description, 1978, Autom.

[4] Ronald L. Rivest, et al. Inferring Decision Trees Using the Minimum Description Length Principle, 1989, Inf. Comput.

[5] Martin Anthony, et al. Computational Learning Theory, 1992.

[6] John R. Koza, et al. Genetic programming: a paradigm for genetically breeding populations of computer programs to solve problems, 1990.

[7] Terrence J. Sejnowski, et al. Analysis of hidden units in a layered network trained to classify sonar targets, 1988, Neural Networks.

[8] Manoel Fernando Tenorio, et al. Self-organizing network for optimum supervised learning, 1990, IEEE Trans. Neural Networks.