Evolutionary training and abstraction yields algorithmic generalization of neural computers

A key feature of intelligent behaviour is the ability to learn abstract strategies that scale and transfer to unfamiliar problems. An abstract strategy solves every sample from a problem class regardless of its representation or complexity, much like algorithms in computer science. Neural networks are powerful models for processing sensory data, discovering hidden patterns, and learning complex functions, but they struggle to learn such iterative, sequential, or hierarchical algorithmic strategies. Extending neural networks with external memories has improved their ability to learn such strategies, but they remain sensitive to data variations, struggle to learn scalable and transferable solutions, and require large amounts of training data. We present the Neural Harvard Computer (NHC), a memory-augmented network architecture that employs abstraction by decoupling algorithmic operations from data manipulations, realized by splitting the information flow across separate modules. This abstraction mechanism, combined with evolutionary training, enables the learning of robust and scalable algorithmic solutions. On a diverse set of 11 algorithms with varying complexities, we show that the NHC reliably learns algorithmic solutions with strong generalization and abstraction: it generalizes and scales perfectly to arbitrary task configurations and complexities far beyond those seen during training, independent of the data representation and the task domain.
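The paper's architecture is not reproduced here; the following is a minimal, self-contained sketch of the decoupling idea only. All names (`DataMemory`, `Controller`, the two-signal bus, the toy reversal task) are illustrative assumptions, not the NHC's actual modules. The point is the split the abstract describes: the controller consumes and emits only abstract signals while a separate data module handles the raw values, and a plain evolution-strategies loop trains the controller's parameters without gradients.

```python
import numpy as np

class DataMemory:
    """Data module: stores raw values. The controller never sees them,
    only abstract signals about the memory's state."""
    def __init__(self):
        self.slots = []

    def execute(self, op, value=None):
        if op == 0 and value is not None:  # WRITE: push incoming value
            self.slots.append(value)
        elif op == 1 and self.slots:       # READ: pop most recent value
            return self.slots.pop()
        return None                        # otherwise: no-op

class Controller:
    """Algorithmic module: a tiny recurrent policy mapping abstract
    signals to abstract operation ids; it never touches the data."""
    def __init__(self, n_signals=2, n_ops=3, hidden=16, params=None):
        self.shapes = [(hidden, n_signals + hidden), (n_ops, hidden)]
        self.n = sum(r * c for r, c in self.shapes)
        flat = params if params is not None else np.zeros(self.n)
        k = self.shapes[0][0] * self.shapes[0][1]
        self.W_in = flat[:k].reshape(self.shapes[0])
        self.W_out = flat[k:].reshape(self.shapes[1])
        self.h = np.zeros(hidden)

    def step(self, signals):
        self.h = np.tanh(self.W_in @ np.concatenate([signals, self.h]))
        return int(np.argmax(self.W_out @ self.h))

def evaluate(params, seqs):
    """Fitness on a toy sequence-reversal task. The controller only sees
    two binary signals (input left? memory non-empty?), so a solution is
    independent of what the data values actually are."""
    score = 0.0
    for seq in seqs:
        ctrl, mem, out, t = Controller(params=params), DataMemory(), [], 0
        for _ in range(2 * len(seq)):
            signals = np.array([float(t < len(seq)),
                                float(len(mem.slots) > 0)])
            op = ctrl.step(signals)
            if op == 0 and t < len(seq):       # route next input to memory
                mem.execute(0, seq[t])
                t += 1
            elif op == 1:                      # route memory item to output
                item = mem.execute(1)
                if item is not None:
                    out.append(item)
        score += sum(a == b for a, b in zip(out, seq[::-1])) / len(seq)
    return score / len(seqs)

def train(seqs, iters=300, pop=64, sigma=0.1, lr=0.05, seed=0):
    """Plain evolution-strategies loop: perturb the parameters, score
    each perturbation, and follow the fitness-weighted average of the
    perturbations. No gradients through the modules are needed."""
    rng = np.random.default_rng(seed)
    theta = rng.normal(0.0, 0.1, Controller().n)
    for _ in range(iters):
        eps = rng.normal(0.0, 1.0, (pop, theta.size))
        f = np.array([evaluate(theta + sigma * e, seqs) for e in eps])
        f = (f - f.mean()) / (f.std() + 1e-8)   # normalize fitness
        theta = theta + lr / (pop * sigma) * eps.T @ f
    return theta

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    train_seqs = [list(rng.integers(0, 10, int(n)))
                  for n in rng.integers(2, 6, 8)]
    theta = train(train_seqs)
    # Test on sequences far longer than any seen in training.
    test_seqs = [list(rng.integers(0, 10, 25)) for _ in range(4)]
    print("held-out fitness:", evaluate(theta, test_seqs))
```

Because the controller never observes the data itself, a policy that solves the toy task for digits solves it unchanged for any other data type, which illustrates the representation independence the abstract claims.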
