Regularized continuous estimation of distribution algorithms

Regularization is a well-known technique in statistics for model estimation which is used to improve the generalization ability of the estimated model. Some of the regularization methods can also be used for variable selection that is especially useful in high-dimensional problems. This paper studies the use of regularized model learning in estimation of distribution algorithms (EDAs) for continuous optimization based on Gaussian distributions. We introduce two approaches to the regularized model estimation and analyze their effect on the accuracy and computational complexity of model learning in EDAs. We then apply the proposed algorithms to a number of continuous optimization functions and compare their results with other Gaussian distribution-based EDAs. The results show that the optimization performance of the proposed RegEDAs is less affected by the increase in the problem size than other EDAs, and they are able to obtain significantly better optimization values for many of the functions in high-dimensional settings.

[1]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[2]  Michèle Sebag,et al.  Extending Population-Based Incremental Learning to Continuous Search Spaces , 1998, PPSN.

[3]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[4]  Xiaoping Li,et al.  Estimation of distribution algorithm for permutation flow shops with total flowtime minimization , 2011, Comput. Ind. Eng..

[5]  Dirk Thierens,et al.  Adaptive variance scaling in continuous multi-objective estimation-of-distribution algorithms , 2007, GECCO '07.

[6]  Concha Bielza,et al.  Learning an L1-Regularized Gaussian Bayesian Network in the Equivalence Class Space , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  Concha Bielza,et al.  Estimation of Distribution Algorithms as Logistic Regression Regularizers of Microarray Classifiers , 2009, Methods of Information in Medicine.

[8]  Martin Pelikan,et al.  Scalable Optimization via Probabilistic Modeling: From Algorithms to Applications (Studies in Computational Intelligence) , 2006 .

[9]  Eleazar Eskin,et al.  Multi-marker tagging single nucleotide polymorphism selection using estimation of distribution algorithms , 2010, Artif. Intell. Medicine.

[10]  Bradley Efron,et al.  Maximum Likelihood and Decision Theory , 1982 .

[11]  Marcus Gallagher,et al.  On the importance of diversity maintenance in estimation of distribution algorithms , 2005, GECCO '05.

[12]  Matteo Matteucci,et al.  Introducing ℓ1-regularized logistic regression in Markov Networks based EDAs , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[13]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[14]  Dirk Thierens,et al.  Expanding from Discrete to Continuous Estimation of Distribution Algorithms: The IDEA , 2000, PPSN.

[15]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[16]  J. A. Lozano,et al.  Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[17]  Xin Yao,et al.  Unified eigen analysis on multivariate Gaussian based estimation of distribution algorithms , 2008, Inf. Sci..

[18]  Nikolaus Hansen,et al.  The CMA Evolution Strategy: A Comparing Review , 2006, Towards a New Evolutionary Computation.

[19]  Jiadong Yang,et al.  Effective structure learning for EDA via L1-regularizedbayesian networks , 2010, GECCO '10.

[20]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[21]  Concha Bielza,et al.  Mateda-2.0: A MATLAB package for the implementation and analysis of estimation of distribution algorithms , 2010 .

[22]  José Antonio Lozano,et al.  Scatter Search in software testing, comparison and collaboration with Estimation of Distribution Algorithms , 2006, Eur. J. Oper. Res..

[23]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[24]  Shih-Hsin Chen,et al.  Addressing the advantages of using ensemble probabilistic models in Estimation of Distribution Algorithms for scheduling problems , 2013 .

[25]  C. Stein Inadmissibility of the Usual Estimator for the Mean of a Multivariate Normal Distribution , 1956 .

[26]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[27]  K. Strimmer,et al.  Statistical Applications in Genetics and Molecular Biology A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics , 2011 .

[28]  Pedro Larrañaga,et al.  Optimization in Continuous Domains by Learning and Simulation of Gaussian Networks , 2000 .

[29]  Pedro Larrañaga,et al.  Protein Folding in Simplified Models With Estimation of Distribution Algorithms , 2008, IEEE Transactions on Evolutionary Computation.

[30]  Qingfu Zhang,et al.  A Hybrid Estimation of Distribution Algorithm for CDMA Cellular System Design , 2008, Int. J. Comput. Intell. Appl..

[31]  Petr Posík Preventing Premature Convergence in a Simple EDA Via Global Step Size Setting , 2008, PPSN.

[32]  Pedro Larrañaga,et al.  Mathematical modelling of UMDAc algorithm with tournament selection. Behaviour on linear and quadratic functions , 2002, Int. J. Approx. Reason..

[33]  Anne Auger,et al.  EEDA : A New Robust Estimation of Distribution Algorithms , 2004 .

[34]  J. A. Lozano,et al.  Towards a New Evolutionary Computation: Advances on Estimation of Distribution Algorithms (Studies in Fuzziness and Soft Computing) , 2006 .

[35]  Chen Fang,et al.  An effective estimation of distribution algorithm for the multi-mode resource-constrained project scheduling problem , 2012, Comput. Oper. Res..

[36]  Qingfu Zhang,et al.  On the convergence of a class of estimation of distribution algorithms , 2004, IEEE Transactions on Evolutionary Computation.

[37]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[38]  T. Richardson,et al.  Estimation of a covariance matrix with zeros , 2005, math/0508268.

[39]  Concha Bielza,et al.  Mateda-2.0: Estimation of Distribution Algorithms in MATLAB , 2010 .

[40]  Concha Bielza,et al.  Optimal row and column ordering to improve table interpretation using estimation of distribution algorithms , 2011, J. Heuristics.

[41]  Heinz Mühlenbein,et al.  Schemata, Distributions and Graphical Models in Evolutionary Optimization , 1999, J. Heuristics.

[42]  Raphael T. Haftka,et al.  A double-distribution statistical algorithm for composite laminate optimization , 2004 .

[43]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[44]  Peter A. N. Bosman,et al.  Matching inductive search bias and problem structure in continuous Estimation-of-Distribution Algorithms , 2008, Eur. J. Oper. Res..

[45]  Olivier Ledoit,et al.  A well-conditioned estimator for large-dimensional covariance matrices , 2004 .

[46]  Marcus Gallagher,et al.  Convergence analysis of UMDAC with finite populations: a case study on flat landscapes , 2009, GECCO '09.

[47]  Pedro Larrañaga,et al.  Feature Subset Selection by Bayesian network-based optimization , 2000, Artif. Intell..

[48]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[49]  R. Tibshirani,et al.  Sparse estimation of a covariance matrix. , 2011, Biometrika.

[50]  M. Bilodeau,et al.  Theory of multivariate statistics , 1999 .

[51]  Petr Pos ´ ik Preventing Premature Convergence in a Simple EDA Via Global Step Size Setting , 2008 .

[52]  Arturo Hernández Aguirre,et al.  The gaussian polytree EDA for global optimization , 2011, GECCO '11.

[53]  Mark W. Schmidt,et al.  Learning Graphical Model Structure Using L1-Regularization Paths , 2007, AAAI.

[54]  David E. Goldberg,et al.  Bayesian Optimization Algorithm, Population Sizing, and Time to Convergence , 2000, GECCO.

[55]  Concha Bielza,et al.  Peakbin Selection in Mass Spectrometry Data Using a Consensus Approach with Estimation of Distribution Algorithms , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[56]  David E. Goldberg,et al.  Real-Coded Bayesian Optimization Algorithm: Bringing the Strength of BOA into the Continuous World , 2004, GECCO.

[57]  G. Bortolan,et al.  The problem of linguistic approximation in clinical decision making , 1988, Int. J. Approx. Reason..

[58]  H. Mühlenbein,et al.  From Recombination of Genes to the Estimation of Distributions I. Binary Parameters , 1996, PPSN.

[59]  H. Akaike A new look at the statistical model identification , 1974 .

[60]  Concha Bielza,et al.  A review on probabilistic graphical models in evolutionary computation , 2012, J. Heuristics.

[61]  C. Mallows Some Comments on Cp , 2000, Technometrics.

[62]  T. N. Sriram Asymptotics in Statistics–Some Basic Concepts , 2002 .

[63]  Franz Rothlauf,et al.  Behaviour of UMDA/sub c/ with truncation selection on monotonous functions , 2005, 2005 IEEE Congress on Evolutionary Computation.

[64]  Max Henrion,et al.  Propagating uncertainty in bayesian networks by probabilistic logic sampling , 1986, UAI.

[65]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[66]  D. Goldberg,et al.  BOA: the Bayesian optimization algorithm , 1999 .

[67]  Concha Bielza,et al.  A review of estimation of distribution algorithms in bioinformatics , 2008, BioData Mining.

[68]  Siddhartha Shakya,et al.  DEUM : a framework for an estimation of distribution algorithm based on Markov random fields , 2006 .

[69]  David Maxwell Chickering,et al.  Dependency Networks for Inference, Collaborative Filtering, and Data Visualization , 2000, J. Mach. Learn. Res..

[70]  C. L. Mallows Some comments on C_p , 1973 .

[71]  A. K. Ziver,et al.  Estimation of distribution algorithms for nuclear reactor fuel management optimisation , 2006 .

[72]  Pedro Larrañaga,et al.  Side chain placement using estimation of distribution algorithms , 2007, Artif. Intell. Medicine.

[73]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[74]  G. Schwarz Estimating the Dimension of a Model , 1978 .