论文信息 - Studies in Continuous Black-box Optimization

Studies in Continuous Black-box Optimization

We present a collection of novel, state-of-the-art algorithms for solving problems in the class of continuous black-box optimization. Natural Evolution Strategies are a family of algorithms that constitutes a general-purpose approach. Maintaining a parameterized distribution on the set of solution candidates, the natural gradient is used to update the distribution's parameters in the direction of higher expected fitness. A collection of techniques have been introduced that addresses issues of convergence, robustness, computational complexity and algorithm speed. We also demonstrated how the principle of artificial curiosity can guide exploration in the context of costly optimization, introducing a response surface method that estimates the interestingness of each candidate point using Gaussian process regression. The results show best published performance on various standard benchmarks, as well as competitive performance on others.

Tom Schaul | T. Schaul

[1] Martin A. Riedmiller,et al. A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[2] Tom Schaul,et al. Coherence Progress: A Measure of Interestingness Based on Fixed Compressors , 2011, AGI.

[3] Bernhard Sendhoff,et al. Three dimensional evolutionary aerodynamic design optimization with CMA-ES , 2005, GECCO '05.

[4] Tom Schaul,et al. A Natural Evolution Strategy for Multi-objective Optimization , 2010, PPSN.

[5] Petros Koumoutsakos,et al. Optimization based on bacterial chemotaxis , 2002, IEEE Trans. Evol. Comput..

[6] C. D. Gelatt,et al. Optimization by Simulated Annealing , 1983, Science.

[7] Tom Schaul,et al. Fitness Expectation Maximization , 2008, PPSN.

[8] J. A. Lozano,et al. Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[9] Anne Auger,et al. BBOB 2009: Comparison Tables of All Algorithms on All Noiseless Functions , 2010 .

[10] Tom Schaul,et al. Artificial curiosity for autonomous space exploration , 2011 .

[11] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[12] E. Cartan,et al. Sur la représentation géométrique des systèmes matériels non holonomes , 1929 .

[13] Kalyanmoy Deb,et al. A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[14] Andreas Krause,et al. Nonmyopic active learning of Gaussian processes: an exploration-exploitation approach , 2007, ICML '07.

[15] Darren Robinson,et al. A hybrid CMA-ES and HDE optimisation algorithm with application to solar energy potential , 2009, Appl. Soft Comput..

[16] J. Schmidhuber,et al. Frontier Search , 2009 .

[17] David A. Cohn,et al. Active Learning with Statistical Models , 1996, NIPS.

[18] Peter A. N. Bosman,et al. Learning Probabilistic Tree Grammars for Genetic Programming , 2004, PPSN.

[19] Ingo Rechenberg,et al. Evolutionsstrategie : Optimierung technischer Systeme nach Prinzipien der biologischen Evolution , 1973 .

[20] Raymond Ros,et al. A Simple Modification in CMA-ES Achieving Linear Time and Space Complexity , 2008, PPSN.

[21] Tom Schaul,et al. Exploring parameter space in reinforcement learning , 2010, Paladyn J. Behav. Robotics.

[22] J. Doye,et al. Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms , 1997, cond-mat/9803344.

[23] Tom Schaul,et al. Exponential natural evolution strategies , 2010, GECCO '10.

[24] H. Jaap van den Herik,et al. Solving Go on Small Boards , 2003, J. Int. Comput. Games Assoc..

[25] Hans-Georg Beyer,et al. The Theory of Evolution Strategies , 2001, Natural Computing Series.

[26] Garrison W. Cottrell,et al. Learning Mackey-Glass from 25 Examples, Plus or Minus 2 , 1993, NIPS.

[27] Tom Schaul,et al. Scalable Neural Networks for Board Games , 2009, ICANN.

[28] Anne Auger,et al. Real-Parameter Black-Box Optimization Benchmarking 2009: Noiseless Functions Definitions , 2009 .

[29] Mohamed Chetouani,et al. Optimizing feature complementarity by evolution strategy: Application to automatic speaker verification , 2009, Speech Commun..

[30] Risto Miikkulainen,et al. Evolving Neural Networks to Play Go , 2004, Applied Intelligence.

[31] Chih-Wen Liu,et al. Non-smooth/non-convex economic dispatch by a novel hybrid differential evolution algorithm , 2007 .

[32] Christian Igel,et al. Empirical evaluation of the improved Rprop learning algorithms , 2003, Neurocomputing.

[33] Lehel Csató,et al. Sparse On-Line Gaussian Processes , 2002, Neural Computation.

[34] Lin Wu,et al. A Scalable Machine Learning Approach to Go , 2006, NIPS.

[35] Oscar Cordón,et al. An experimental study on the applicability of evolutionary algorithms to craniofacial superimposition in forensic identification , 2009, Inf. Sci..

[36] Richard E. Korf,et al. Frontier search , 2005, JACM.

[37] Risto Miikkulainen,et al. Accelerated Neural Evolution through Cooperatively Coevolved Synapses , 2008, J. Mach. Learn. Res..

[38] W. J. Studden,et al. Theory Of Optimal Experiments , 1972 .

[39] Jürgen Leitner,et al. Evolving ANNs for Spacecraft Rendezvous and Docking , 2010 .

[40] Jürgen Schmidhuber,et al. Simple Algorithmic Principles of Discovery, Subjective Beauty, Selective Attention, Curiosity and Creativity , 2007, ALT.

[41] Jürgen Schmidhuber,et al. Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes , 2008, ABiALS.

[42] Anne Auger,et al. Log-Linear Convergence and Divergence of the Scale-Invariant (1+1)-ES in Noisy Environments , 2011, Algorithmica.

[43] Pierre Baldi,et al. Bayesian surprise attracts human attention , 2005, Vision Research.

[44] Shun-ichi Amari,et al. Why natural gradient? , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[45] Julian Togelius,et al. Countering Poisonous Inputs with Memetic Neuroevolution , 2008, PPSN.

[46] Kaoru Iwamoto,et al. Go for Beginners , 1976 .

[47] Nir Oren,et al. Evolving Neural Networks for the Capture Game , 2002 .

[48] Tom Schaul,et al. Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients , 2010, ICANN.

[49] Pedro Larrañaga,et al. Estimation of Distribution Algorithms , 2002, Genetic Algorithms and Evolutionary Computation.

[50] Tom Schaul,et al. A scalable neural network architecture for board games , 2008, 2008 IEEE Symposium On Computational Intelligence and Games.

[51] Huaiyu Zhu. On Information and Sufficiency , 1997 .

[52] María D. Jaraíz-Simón,et al. A Differential Evolution Based Algorithm to Optimize the Radio Network Design Problem , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[53] Raymond Ros,et al. Real-Parameter Black-Box Optimization Benchmarking 2009: Experimental Setup , 2009 .

[54] Takuji Nishimura,et al. Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[55] Terrence J. Sejnowski,et al. Temporal Difference Learning of Position Evaluation in the Game of Go , 1993, NIPS.

[56] Richard S. Sutton,et al. Reinforcement Learning of Local Shape in the Game of Go , 2007, IJCAI.

[57] David J. C. MacKay,et al. Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[58] John R. Koza,et al. Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[59] Tom Schaul,et al. Curiosity-driven optimization , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[60] Donald R. Jones,et al. Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[61] Risto Miikkulainen,et al. Incremental Evolution of Complex General Behavior , 1997, Adapt. Behav..

[62] Timothy F. Havel,et al. Derivatives of the Matrix Exponential and Their Computation , 1995 .

[63] Mauro Birattari,et al. Swarm Intelligence , 2012, Lecture Notes in Computer Science.

[64] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[65] Dave Cliff,et al. Tracking the Red Queen: Measurements of Adaptive Progress in Co-Evolutionary Simulations , 1995, ECAL.

[66] G. Box,et al. On the Experimental Attainment of Optimum Conditions , 1951 .

[67] Richard K. Belew,et al. Methods for Competitive Co-Evolution: Finding Opponents Worth Beating , 1995, ICGA.

[68] Dirk P. Kroese,et al. The Cross Entropy Method: A Unified Approach To Combinatorial Optimization, Monte-carlo Simulation (Information Science and Statistics) , 2004 .

[69] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[70] David A. Cohn,et al. Neural Network Exploration Using Optimal Experiment Design , 1993, NIPS.

[71] Anne Auger,et al. Identification of the isotherm function in chromatography using CMA-ES , 2007, 2007 IEEE Congress on Evolutionary Computation.

[72] Simon M. Lucas,et al. Coevolution versus self-play temporal difference learning for acquiring position evaluation in small-board go , 2005, IEEE Transactions on Evolutionary Computation.

[73] Hans-Paul Schwefel,et al. TWO-PHASE NOZZLE AND HOLLOW CORE JET EXPERIMENTS. , 1970 .

[74] Petros Koumoutsakos,et al. A Method for Handling Uncertainty in Evolutionary Optimization With an Application to Feedback Control of Combustion , 2009, IEEE Transactions on Evolutionary Computation.

[75] Lothar Thiele,et al. Multiobjective Optimization Using Evolutionary Algorithms - A Comparative Case Study , 1998, PPSN.

[76] Tobias Pfingsten,et al. Bayesian Active Learning for Sensitivity Analysis , 2006, ECML.

[77] A. P. Wieland,et al. Evolving neural network controllers for unstable systems , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[78] Andrew W. Moore,et al. Memory-based Stochastic Optimization , 1995, NIPS.

[79] John H. Holland,et al. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[80] Jenq-Neng Hwang,et al. Query-based learning applied to partially trained multilayer perceptrons , 1991, IEEE Trans. Neural Networks.

[81] Luca Maria Gambardella,et al. Assessment of neural networks training strategies for histomorphometric analysis of synchrotron radiation medical images , 2010 .

[82] Alex Graves,et al. Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[83] Carl E. Rasmussen,et al. Gaussian process dynamic programming , 2009, Neurocomputing.

[84] W. K. Hastings,et al. Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[85] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.

[86] Anne Auger,et al. Convergence results for the (1, lambda)-SA-ES using the theory of phi-irreducible Markov chains , 2005, Theor. Comput. Sci..

[87] Stefan Roth,et al. Covariance Matrix Adaptation for Multi-objective Optimization , 2007, Evolutionary Computation.

[88] David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[89] Dennis Weyland,et al. A Rigorous Analysis of the Harmony Search Algorithm: How the Research Community can be Misled by a "Novel" Methodology , 2010, Int. J. Appl. Metaheuristic Comput..

[90] Sham M. Kakade,et al. A Natural Policy Gradient , 2001, NIPS.

[91] Dirk V. Arnold,et al. Improving Evolution Strategies through Active Covariance Matrix Adaptation , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[92] Marco Locatelli,et al. Bayesian Algorithms for One-Dimensional Global Optimization , 1997, J. Glob. Optim..

[93] Alex Lubberts and Risto Miikkulainen. Co-Evolving a Go-Playing Neural network , 2001 .

[94] H. P. Schwefel,et al. Numerische Optimierung von Computermodellen mittels der Evo-lutionsstrategie , 1977 .

[95] Tom Schaul,et al. Q-Error as a Selection Mechanism in Modular Reinforcement-Learning Systems , 2011, IJCAI.

[96] Michael Finkel,et al. Solving computationally-demanding reliability-based design problems in hydrogeology. , 2008 .

[97] Nikolaus Hansen,et al. Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[98] Risto Miikkulainen,et al. Competitive Coevolution through Evolutionary Complexification , 2011, J. Artif. Intell. Res..

[99] Christian Igel,et al. Evolutionary tuning of multiple SVM parameters , 2005, ESANN.

[100] N. Metropolis,et al. Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[101] Isao Ono,et al. Bidirectional Relation between CMA Evolution Strategies and Natural Evolution Strategies , 2010, PPSN.

[102] Shun-ichi Amari,et al. Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[103] Tom Schaul,et al. Stochastic search using the natural gradient , 2009, ICML '09.

[104] Edmondo Minisci,et al. Comparative study on the application of evolutionary optimization techniques to orbit transfer maneuvers , 2008 .

[105] John A. Nelder,et al. A Simplex Method for Function Minimization , 1965, Comput. J..

[106] J. Shepherd,et al. Modeling morphology evolution and mechanical behavior during thermo-mechanical processing of semi-crystalline polymers , 2006 .

[107] Pierre Baldi,et al. The Principled Design of Large-Scale Recursive Neural Network Architectures--DAG-RNNs and the Protein Structure Prediction Problem , 2003, J. Mach. Learn. Res..

[108] Tom Schaul,et al. Efficient natural evolution strategies , 2009, GECCO.

[109] A. Auger. Convergence results for the ( 1 , )-SA-ES using the theory of-irreducible Markov chains , 2005 .

[110] Tom Schaul,et al. High dimensions and heavy tails for natural evolution strategies , 2011, GECCO '11.

[111] Tom Schaul,et al. Episodic Reinforcement Learning by Logistic Reward-Weighted Regression , 2008, ICANN.

[112] Jeff G. Schneider,et al. Covariant Policy Search , 2003, IJCAI.

[113] E. Miguez,et al. An application of an evolution strategy in power distribution system planning , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[114] Christian Igel,et al. Gradient-Based Adaptation of General Gaussian Kernels , 2005, Neural Computation.

[115] Lih-Yuan Deng,et al. The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation, and Machine Learning , 2006, Technometrics.

[116] Cajo J. F. ter Braak,et al. A Markov Chain Monte Carlo version of the genetic algorithm Differential Evolution: easy Bayesian computing for real parameter spaces , 2006, Stat. Comput..

[117] Nando de Freitas,et al. An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[118] Jürgen Schmidhuber,et al. Curious model-building control systems , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[119] Kenneth O. Stanley,et al. Generating large-scale neural networks through discovering geometric regularities , 2007, GECCO '07.

[120] Jürgen Schmidhuber,et al. Developmental robotics, optimal artificial curiosity, creativity, music, and the fine arts , 2006, Connect. Sci..

[121] Tom Schaul,et al. Towards Practical Universal Search , 2010, AGI 2010.

[122] Risto Miikkulainen,et al. Evolving a Roving Eye for Go , 2004, GECCO.

[123] John E. Dennis,et al. Optimization Using Surrogate Objectives on a Helicopter Test Example , 1998 .

[124] Donald R. Jones,et al. A Taxonomy of Global Optimization Methods Based on Response Surfaces , 2001, J. Glob. Optim..

[125] Jürgen Schmidhuber,et al. Multidimensional Recurrent Neural Networks , 2007 .

[126] Hans-Paul Schwefel,et al. Evolution strategies – A comprehensive introduction , 2002, Natural Computing.

[127] Dirk Thierens,et al. Enhancing the Performance of Maximum-Likelihood Gaussian EDAs Using Anticipated Mean Shift , 2008, PPSN.

[128] X. Pang,et al. Neural network design for J function approximation in dynamic programming , 1998, adap-org/9806001.

[129] Klaus Obermayer,et al. Gaussian process regression: active data selection and test point rejection , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[130] T. Munich,et al. Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[131] Tom Schaul,et al. Natural Evolution Strategies , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[132] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[133] Jan Peters,et al. Machine Learning for motor skills in robotics , 2008, Künstliche Intell..

[134] James Foulds. Learning to Play the Game of Go , 2006 .

[135] S. Hochreiter,et al. REINFORCEMENT DRIVEN INFORMATION ACQUISITION IN NONDETERMINISTIC ENVIRONMENTS , 1995 .

[136] Julian Togelius,et al. Ontogenetic and Phylogenetic Reinforcement Learning , 2009, Künstliche Intell..

[137] Corso Elvezia. Probabilistic Incremental Program Evolution , 1997 .

[138] K. Chaloner,et al. Bayesian Experimental Design: A Review , 1995 .

[139] Christian Igel,et al. Registration of bone structures in 3D ultrasound and CT data: Comparison of different optimization strategies , 2005 .

[140] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .

[141] Yi Zhang,et al. Exploration and Exploitation in Adaptive Filtering Based on Bayesian Active Learning , 2003, ICML.

[142] Rainer Storn,et al. Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..