Linkage learning using the maximum spanning tree of the dependency graph

The goal of linkage learning in evolutionary algorithms is to identify the interactions between variables of a problem. Knowing the linkage information helps search algorithms to find the optimum solution efficiently and reliably in hard problems. This paper presents a simple approach for linkage learning based on the graph theory. A graph is used as the structure to keep the the pairwise dependencies between variables of the problem. We call this graph 'the underlying dependency graph of the problem'. Then maximum spanning tree (MST) of the dependency graph is found. It is shown that MST contains all the necessary linkage if the dependency graph is built upon enough population. In this approach, pairwise dependencies calculated based on a perturbation based identification method, are used as the variable dependencies. The proposed approach has the advantage of being capable of learning the linkage without the need for the costly fit-to-data evaluations for model search. It is parameter-less and the algorithm description is simple. The proposed technique is tested on several benchmark problems and it is shown to be able to compete with similar approaches. Based on the experimental results it can successfully find the linkage groups in polynomial number of fitness evaluations.

[1]  Dirk Thierens,et al.  Mixing in Genetic Algorithms , 1993, ICGA.

[2]  David E. Goldberg,et al.  Dependency Structure Matrix Analysis: Offline Utility of the Dependency Structure Matrix Genetic Algorithm , 2004, GECCO.

[3]  Heinz Mühlenbein,et al.  FDA -A Scalable Evolutionary Algorithm for the Optimization of Additively Decomposed Functions , 1999, Evolutionary Computation.

[4]  David E. Goldberg,et al.  Hierarchical Bayesian Optimization Algorithm , 2006, Scalable Optimization via Probabilistic Modeling.

[5]  Masaharu Munetomo,et al.  Identifying Linkage Groups by Nonlinearity/Non-monotonicity Detection , 1999 .

[6]  David E. Goldberg,et al.  Convergence time for the linkage learning genetic algorithm , 2004, IEEE Congress on Evolutionary Computation.

[7]  Ivo Everts,et al.  Extended Compact Genetic Algorithm , 2004 .

[8]  Masaharu Munetomo,et al.  Linkage identification based on epistasis measures to realize efficient genetic algorithms , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[9]  Petr Posík,et al.  Parameter-less local optimizer with linkage identification for deterministic order-k decomposable problems , 2011, GECCO '11.

[10]  Prabhas Chongstitvatana,et al.  Direct and explicit building blocks identification and composition algorithm , 2009, 2009 IEEE Congress on Evolutionary Computation.

[11]  Hillol Kargupta,et al.  The performance of the gene expression messy genetic algorithm on real test functions , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[12]  Amin Nikanjam,et al.  A new DSM clustering algorithm for linkage groups identification , 2010, GECCO '10.

[13]  J. A. Lozano,et al.  Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[14]  D. Goldberg,et al.  A Survey of Linkage Learning Techniques in Genetic and Evolutionary Algorithms , 2007 .

[15]  David E. Goldberg,et al.  Dependency Structure Matrix, Genetic Algorithms, and Effective Recombination , 2009, Evolutionary Computation.

[16]  Heinz Mühlenbein,et al.  The Equation for Response to Selection and Its Use for Prediction , 1997, Evolutionary Computation.