How Far Is It from Here to There? A Distance That Is Coherent with GP Operators

The distance between pairs of individuals is a useful concept in the study of evolutionary algorithms. It is particularly useful to define a distance which is coherent with, i.e. related to, the action of a particular operator. We present the first formal, general definition of this operator-distance coherence. We also propose a new distance function, based on the multi-step transition probability (MSTP), that is coherent with any GP operator for which the one-step transition probability (1STP) between individuals can be defined. We give an algorithm for 1STP in the case of subtree mutation. Because MSTP is useful in GP investigations, but impractical to compute, we evaluate a variety of means to approximate it. We show that some syntactic distance measures give good approximations, and attempt to combine them to improve the approximation using a GP symbolic regression method. We conclude that 1STP itself is a sufficient indicator of MSTP for subtree mutation.

[1]  Riccardo Poli,et al.  Geometric landscape of homologous crossover for syntactic trees , 2005, 2005 IEEE Congress on Evolutionary Computation.

[2]  Michael O'Neill,et al.  A Fine-Grained View of GP Locality with Binary Decision Diagrams as Ant Phenotypes , 2010, PPSN.

[3]  Paul M. B. Vitányi,et al.  Clustering by compression , 2003, IEEE Transactions on Information Theory.

[4]  Una-May O’Reilly Using a distance metric on genetic programs to understand genetic operators , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[5]  Wolfgang Banzhaf,et al.  Explicit Control of Diversity and Effective Variation Distance in Linear Genetic Programming , 2002, EuroGP.

[6]  Neil D. Lawrence,et al.  Missing Data in Kernel PCA , 2006, ECML.

[7]  Marc Sebban,et al.  Learning Stochastic Tree Edit Distance , 2006, ECML.

[8]  Krzysztof Krawiec,et al.  Approximating geometric crossover in semantic space , 2009, GECCO.

[9]  Leonardo Vanneschi,et al.  A Study of Fitness Distance Correlation as a Difficulty Measure in Genetic Programming , 2005, Evolutionary Computation.

[10]  Anikó Ekárt,et al.  A Metric for Genetic Programs and Fitness Sharing , 2000, EuroGP.

[11]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[12]  Robert Schaefer Parallel Problem Solving from Nature - PPSN XI, 11th International Conference, Kraków, Poland, September 11-15, 2010. Proceedings, Part II , 2010, PPSN.

[13]  Leonardo Vanneschi,et al.  Crossover-Based Tree Distance in Genetic Programming , 2008, IEEE Transactions on Evolutionary Computation.

[14]  Leonardo Vanneschi,et al.  Theory and practice for efficient genetic programming , 2004 .

[15]  Michael O'Neill,et al.  Semantic Aware Crossover for Genetic Programming: The Case for Real-Valued Function Regression , 2009, EuroGP.

[16]  K. Chellapilla,et al.  Investigating the influence of depth and degree of genotypic change on fitness in genetic programming , 1999 .

[17]  Una-May O'Reilly,et al.  Computational complexity analysis of simple genetic programming on two problems modeling isolated program semantics , 2010, FOGA '11.

[18]  Tim Jones Evolutionary Algorithms, Fitness Landscapes and Search , 1995 .