论文信息 - Designing neural networks through neuroevolution

Designing neural networks through neuroevolution

Much of recent machine learning has focused on deep learning, in which neural network weights are trained through variants of stochastic gradient descent. An alternative approach comes from the field of neuroevolution, which harnesses evolutionary algorithms to optimize neural networks, inspired by the fact that natural brains themselves are the products of an evolutionary process. Neuroevolution enables important capabilities that are typically unavailable to gradient-based approaches, including learning neural network building blocks (for example activation functions), hyperparameters, architectures and even the algorithms for learning themselves. Neuroevolution also differs from deep learning (and deep reinforcement learning) by maintaining a population of solutions during search, enabling extreme exploration and massive parallelization. Finally, because neuroevolution research has (until recently) developed largely in isolation from gradient-based neural network research, it has developed many unique and effective techniques that should be effective in other machine learning areas too. This Review looks at several key aspects of modern neuroevolution, including large-scale computing, the benefits of novelty and diversity, the power of indirect encoding, and the field’s contributions to meta-learning and architecture search. Our hope is to inspire renewed interest in the field as it meets the potential of the increasing computation available today, to highlight how many of its ideas can provide an exciting resource for inspiration and hybridization to the deep learning, deep reinforcement learning and machine learning communities, and to explain how neuroevolution could prove to be a critical tool in the long-term pursuit of artificial general intelligence.Deep neural networks have become very successful at certain machine learning tasks partly due to the widely adopted method of training called backpropagation. An alternative way to optimize neural networks is by using evolutionary algorithms, which, fuelled by the increase in computing power, offers a new range of capabilities and modes of learning.

[1] F. Attneave,et al. The Organization of Behavior: A Neuropsychological Theory , 1949 .

[2] J. Knott. The organization of behavior: A neuropsychological theory , 1951 .

[3] A. Turing. The chemical basis of morphogenesis , 1952, Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences.

[4] B. Underwood,et al. Fate of first-list associations in transfer theory. , 1959, Journal of experimental psychology.

[5] Boris Polyak. Some methods of speeding up the convergence of iteration methods , 1964 .

[6] A. Lindenmayer. Mathematical models for cellular interactions in development. I. Filaments with one-sided inputs. , 1968, Journal of theoretical biology.

[7] Kenneth Alan De Jong,et al. An analysis of the behavior of a class of genetic adaptive systems. , 1975 .

[8] H. Meinhardt. Models of biological pattern formation , 1982 .

[9] R. Dawkins,et al. The extended phenotype : the gene as the unit of selection , 1982 .

[10] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.

[11] David E. Goldberg,et al. Genetic Algorithms with Sharing for Multimodalfunction Optimization , 1987, ICGA.

[12] D. R. McGregor,et al. Designing application-specific neural networks using the structured genetic algorithm , 1992, [Proceedings] COGANN-92: International Workshop on Combinations of Genetic Algorithms and Neural Networks.

[13] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[14] Xin Yao,et al. A review of evolutionary artificial neural networks , 1993, Int. J. Intell. Syst..

[15] Frédéric Gruau,et al. Genetic Synthesis of Modular Neural Networks , 1993, ICGA.

[16] Frédéric Gruau,et al. Automatic Definition of Modular Neural Networks , 1994, Adapt. Behav..

[17] Jan Paredis,et al. Coevolutionary computation , 1995 .

[18] Richard K. Belew,et al. Methods for Competitive Co-Evolution: Finding Opponents Worth Beating , 1995, ICGA.

[19] Jan Paredis,et al. Coevolutionary Computation , 1995, Artificial Life.

[20] Risto Miikkulainen,et al. Evolving Obstacle Avoidance Behavior in a Robot Arm , 1996 .

[21] Samir W. Mahfoud. Niching methods for genetic algorithms , 1996 .

[22] Jordan B. Pollack,et al. Coevolution of a Backgammon Player , 1996 .

[23] Francesco Mondada,et al. Evolution of homing navigation in a real mobile robot , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[24] Francesco Mondada,et al. Evolution of Plastic Neurocontrollers for Situated Agents , 1996 .

[25] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[26] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[27] R. French. Catastrophic forgetting in connectionist networks , 1999, Trends in Cognitive Sciences.

[28] Ning Qian,et al. On the momentum term in gradient descent learning algorithms , 1999, Neural Networks.

[29] Lothar Thiele,et al. Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..

[30] Jordan B. Pollack,et al. Automatic design and manufacture of robotic lifeforms , 2000, Nature.

[31] Masahiro Fujita,et al. Evolving robust gaits with AIBO , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[32] Dario Floreano,et al. Evolutionary robots with on-line self-organization and behavioral fitness , 2000, Neural Networks.

[33] Timothy B. Stockwell,et al. The Sequence of the Human Genome , 2001, Science.

[34] R. Pfeifer,et al. Repeated structure and dissociation of genotypic and phenotypic complexity in artificial ontogeny , 2001 .

[35] Jordan B. Pollack,et al. Creating High-Level Components with a Generative Representation for Body-Brain Evolution , 2002, Artificial Life.

[36] Kalyanmoy Deb,et al. A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[37] Risto Miikkulainen,et al. Evolving Neural Networks through Augmenting Topologies , 2002, Evolutionary Computation.

[38] Fumio Hara,et al. Morpho-functional Machines: The New Species , 2012, Springer Japan.

[39] Risto Miikkulainen,et al. A Taxonomy for Artificial Embryogeny , 2003, Artificial Life.

[40] Rich Caruana,et al. Multitask Learning , 1997, Machine Learning.

[41] Riccardo Poli,et al. Evolving the Topology and the Weights of Neural Networks Using a Dual Representation , 2004, Applied Intelligence.

[42] Risto Miikkulainen,et al. Real-time neuroevolution in the NERO video game , 2005, IEEE Transactions on Evolutionary Computation.

[43] G. Striedter. Principles of brain evolution. , 2005 .

[44] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[45] Nicholas J. Radcliffe,et al. Genetic set recombination and its application to neural network topology optimisation , 1993, Neural Computing & Applications.

[46] Rolf Pfeifer,et al. How the body shapes the way we think - a new view on intelligence , 2006 .

[47] Mitchell A. Potter,et al. EVOLVING NEURAL NETWORKS WITH COLLABORATIVE SPECIES , 2006 .

[48] Dario Floreano,et al. Analog Genetic Encoding for the Evolution of Circuits and Networks , 2007, IEEE Transactions on Evolutionary Computation.

[49] Kenneth O. Stanley,et al. Compositional Pattern Producing Networks : A Novel Abstraction of Development , 2007 .

[50] Dario Floreano,et al. Neuroevolution: from architectures to learning , 2008, Evol. Intell..

[51] Dario Floreano,et al. Evolutionary Advantages of Neuromodulated Plasticity in Dynamic, Reward-based Scenarios , 2008, ALIFE.

[52] Tom Schaul,et al. Natural Evolution Strategies , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[53] S. Herculano‐Houzel. The Human Brain in Numbers: A Linearly Scaled-up Primate Brain , 2009, Front. Hum. Neurosci..

[54] Kenneth O. Stanley,et al. A Hypercube-Based Encoding for Evolving Large-Scale Neural Networks , 2009, Artificial Life.

[55] Kenneth O. Stanley. A Hypercube-Based Indirect Encoding for Evolving Large-Scale Neural Networks , 2009 .

[56] V. Ramakrishnan,et al. Measurement of the top-quark mass with dilepton events selected using neuroevolution at CDF. , 2008, Physical review letters.

[57] Kenneth O. Stanley,et al. Autonomous Evolution of Topographic Regularities in Artificial Neural Networks , 2010, Neural Computation.

[58] Jürgen Schmidhuber,et al. Evolving neural networks in compressed weight space , 2010, GECCO '10.

[59] Sebastian Risi,et al. Indirectly Encoding Neural Plasticity as a Pattern of Local Rules , 2010, SAB.

[60] Kenneth O. Stanley,et al. Abandoning Objectives: Evolution Through the Search for Novelty Alone , 2011, Evolutionary Computation.

[61] Kenneth O. Stanley,et al. Evolving a diversity of virtual creatures through novelty search and local competition , 2011, GECCO '11.

[62] Kenneth O. Stanley,et al. Picbreeder: A Case Study in Collaborative Evolutionary Exploration of Design Space , 2011, Evolutionary Computation.

[63] Julian Togelius,et al. Search-Based Procedural Content Generation: A Taxonomy and Survey , 2011, IEEE Transactions on Computational Intelligence and AI in Games.

[64] Hod Lipson,et al. Evolving three-dimensional objects with a generative encoding inspired by developmental biology , 2011, ECAL.

[65] Kenneth O. Stanley,et al. On the Performance of Indirect Encoding Across the Continuum of Regularity , 2011, IEEE Transactions on Evolutionary Computation.

[66] Gregory Gutin,et al. A New Approach to Population Sizing for Memetic Algorithms: A Case Study for the Multidimensional Assignment Problem , 2010, Evolutionary Computation.

[67] Sarah Benson-Amram,et al. Innovative problem solving by wild spotted hyenas , 2012, Proceedings of the Royal Society B: Biological Sciences.

[68] Tara N. Sainath,et al. FUNDAMENTAL TECHNOLOGIES IN MODERN SPEECH RECOGNITION Digital Object Identifier 10.1109/MSP.2012.2205597 , 2012 .

[69] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[70] Joshua Evan Auerbach,et al. On the relationship between environmental and morphological complexity in evolved robots , 2012, GECCO '12.

[71] Stéphane Doncieux,et al. Encouraging Behavioral Diversity in Evolutionary Robotics: An Empirical Study , 2012, Evolutionary Computation.

[72] Sebastian Risi,et al. An Enhanced Hypercube-Based Encoding for Evolving the Placement, Density, and Connectivity of Neurons , 2012, Artificial Life.

[73] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[74] Sebastian Risi,et al. A unified approach to evolving plasticity and neural geometry , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[75] Gul Muhammad Khan,et al. Fast learning neural networks using Cartesian genetic programming , 2013, Neurocomputing.

[76] Evolutionary robotics , 2013, CACM.

[77] Jean-Baptiste Mouret,et al. On the Relationships between Generative Encodings, Regularity, and Learning Abilities when Evolving Plastic Artificial Neural Networks , 2013, PloS one.

[78] Hod Lipson,et al. Unshackling evolution: evolving soft robots with multiple materials and a powerful generative encoding , 2013, GECCO '13.

[79] Hod Lipson,et al. The evolutionary origins of modularity , 2012, Proceedings of the Royal Society B: Biological Sciences.

[80] Julian Francis Miller,et al. NeuroEvolution: Evolving Heterogeneous Artificial Neural Networks , 2014, Evolutionary Intelligence.

[81] Jean-Baptiste Mouret,et al. Evolving neural networks that are both modular and regular: HyperNEAT plus the connection cost technique , 2014, GECCO.

[82] Risto Miikkulainen,et al. A Neuroevolution Approach to General Atari Game Playing , 2014, IEEE Transactions on Computational Intelligence and AI in Games.

[83] Jürgen Schmidhuber,et al. Deep learning in neural networks: An overview , 2014, Neural Networks.

[84] Antoine Cully,et al. Robots that can adapt like animals , 2014, Nature.

[85] Jean-Baptiste Mouret,et al. Illuminating search spaces by mapping elites , 2015, ArXiv.

[86] Jason Yosinski,et al. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[87] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.

[88] Ruslan Salakhutdinov,et al. Path-SGD: Path-Normalized Optimization in Deep Neural Networks , 2015, NIPS.

[89] Jean-Baptiste Mouret,et al. Neural Modularity Helps Organisms Evolve to Learn New Skills without Forgetting Old Skills , 2015, PLoS Comput. Biol..

[90] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[91] Jordan B. Pollack,et al. Gene Regulatory Network Evolution Through Augmenting Topologies , 2015, IEEE Transactions on Evolutionary Computation.

[92] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[93] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[94] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.

[95] Risto Miikkulainen,et al. Distributed Age-Layered Novelty Search , 2016 .

[96] Daniele Gravina,et al. Surprise Search: Beyond Objectives and Novelty , 2016, GECCO.

[97] A Nguyen,et al. Understanding Innovation Engines: Automated Creativity and Improved Stochastic Optimization via Deep Learning , 2016, Evolutionary Computation.

[98] Jeff Clune,et al. Evolvability Search: Directly Selecting for Evolvability in order to Study and Produce It , 2016, GECCO.

[99] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[100] Julian Francis Miller,et al. Recurrent Cartesian Genetic Programming of Artificial Neural Networks , 2016, Genetic Programming and Evolvable Machines.

[101] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[102] Jean-Baptiste Mouret,et al. Does Aligning Phenotypic and Genotypic Modularity Improve the Evolution of Neural Networks? , 2016, GECCO.

[103] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.

[104] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.

[105] David Pfau,et al. Convolution by Evolution: Differentiable Pattern Producing Networks , 2016, GECCO.

[106] Thomas Miconi,et al. Learning to learn with backpropagation of Hebbian plasticity , 2016, 1609.02228.

[107] Kenneth O. Stanley,et al. Quality Diversity: A New Frontier for Evolutionary Computation , 2016, Front. Robot. AI.

[108] Jürgen Schmidhuber,et al. A Wavelet-based Encoding for Neuroevolution , 2016, GECCO.

[109] Jian Peng,et al. Genetic Policy Optimization , 2017, ICLR 2018.

[110] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.

[111] Chrisantha Fernando,et al. PathNet: Evolution Channels Gradient Descent in Super Neural Networks , 2017, ArXiv.

[112] Xi Chen,et al. Evolution Strategies as a Scalable Alternative to Reinforcement Learning , 2017, ArXiv.

[113] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[114] Quoc V. Le,et al. HyperNetworks , 2016, ICLR.

[115] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[116] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.

[117] Jeff Clune,et al. Diffusion-based neuromodulation can eliminate catastrophic forgetting in simple neural networks , 2017, PloS one.

[118] Jürgen Schmidhuber,et al. LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[119] Surya Ganguli,et al. Continual Learning Through Synaptic Intelligence , 2017, ICML.

[120] Max Jaderberg,et al. Population Based Training of Neural Networks , 2017, ArXiv.

[121] Kenneth O. Stanley,et al. Minimal criterion coevolution: a new approach to open-ended search , 2017, GECCO.

[122] Sergey Ioffe,et al. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[123] Kenneth O. Stanley,et al. Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning , 2017, ArXiv.

[124] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.

[125] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[126] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[127] Elliot Meyerson,et al. Discovering evolutionary stepping stones through behavior domination , 2017, GECCO.

[128] Quoc V. Le,et al. Large-Scale Evolution of Image Classifiers , 2017, ICML.

[129] Kenneth O. Stanley,et al. Safe mutations for deep and recurrent neural networks through output gradients , 2017, GECCO.

[130] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.

[131] Marcin Andrychowicz,et al. Parameter Space Noise for Exploration , 2017, ICLR.

[132] David Budden,et al. Distributed Prioritized Experience Replay , 2018, ICLR.

[133] Kenneth O. Stanley,et al. Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents , 2017, NeurIPS.

[134] Kenneth O. Stanley,et al. Differentiable plasticity: training plastic neural networks with backpropagation , 2018, ICML.

[135] Risto Miikkulainen,et al. From Nodes to Networks: Evolving Recurrent Neural Networks , 2018, ArXiv.

[136] Shane Legg,et al. Noisy Networks for Exploration , 2017, ICLR.

[137] Jason Yosinski,et al. An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution , 2018, NeurIPS.

[138] Elliot Meyerson,et al. Pseudo-task Augmentation: From Deep Multitask Learning to Intratask Sharing - and Back , 2018, ICML.

[139] Nicola Pezzotti,et al. Differentiable Image Parameterizations , 2018, Distill.

[140] Jeff Clune,et al. Deep Curiosity Search: Intra-Life Exploration Improves Performance on Challenging Deep Reinforcement Learning Problems , 2018, ArXiv.

[141] J. Clune,et al. The Surprising Creativity of Digital Evolution , 2018, ALIFE.

[142] Joel Z. Leibo,et al. Human-level performance in first-person multiplayer games with population-based deep reinforcement learning , 2018, ArXiv.

[143] Benjamin Recht,et al. Simple random search provides a competitive approach to reinforcement learning , 2018, ArXiv.

[144] Sebastian Risi,et al. Born to Learn: the Inspiration, Progress, and Future of Evolved Plastic Artificial Neural Networks , 2017, Neural Networks.

[145] Risto Miikkulainen,et al. The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities , 2018, Artificial Life.

[146] Elliot Meyerson,et al. Evolutionary architecture search for deep multitask networks , 2018, GECCO.

[147] Jakub W. Pachocki,et al. Emergent Complexity via Multi-Agent Competition , 2017, ICLR.

[148] Pieter Abbeel,et al. Evolved Policy Gradients , 2018, NeurIPS.

[149] Vijay Vasudevan,et al. Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[150] Kenneth O. Stanley,et al. The Emergence of Canalization and Evolvability in an Open-Ended, Interactive Evolutionary System , 2017, Artificial Life.

[151] Chris Dyer,et al. On the State of the Art of Evaluation in Neural Language Models , 2017, ICLR.

[152] Quoc V. Le,et al. AutoAugment: Learning Augmentation Policies from Data , 2018, ArXiv.

[153] Frank Hutter,et al. Neural Architecture Search: A Survey , 2018, J. Mach. Learn. Res..

[154] Quoc V. Le,et al. AutoAugment: Learning Augmentation Strategies From Data , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[155] Alok Aggarwal,et al. Regularized Evolution for Image Classifier Architecture Search , 2018, AAAI.

[156] Sergey Levine,et al. Diversity is All You Need: Learning Skills without a Reward Function , 2018, ICLR.

[157] Xin Yao,et al. Evolutionary Generative Adversarial Networks , 2018, IEEE Transactions on Evolutionary Computation.

[158] Elliot Meyerson,et al. Evolving Deep Neural Networks , 2017, Artificial Intelligence in the Age of Neural Networks and Brain Computing.

[159] A. E. Eiben,et al. Evolving embodied intelligence from materials to machines , 2019, Nat. Mach. Intell..