论文信息 - Learning in fractured problems with constructive neural network algorithms

Learning in fractured problems with constructive neural network algorithms

Evolution of neural networks, or neuroevolution, has been a successful approach to many low-level control problems such as pole balancing, vehicle control, and collision warning. However, certain types of problems — such as those involving strategic decision-making — have remained difficult to solve. This dissertation proposes the hypothesis that such problems are difficult because they are fractured: The correct action varies discontinuously as the agent moves from state to state. To evaluate this hypothesis, a method for measuring fracture using the concept of function variation of optimal policies is proposed. This metric is used to evaluate a popular neuroevolution algorithm, NEAT, empirically on a set of fractured problems. The results show that (1) NEAT does not usually perform well on such problems, and (2) the reason is that NEAT does not usually generate local decision regions, which would be useful in constructing a fractured decision boundary. To address this issue, two neuroevolution algorithms that model local decision regions are proposed: RBF-NEAT, which biases structural search by adding basis-function nodes, and Cascade-NEAT, which constrains structural search by constructing cascaded topologies. These algorithms are compared to NEAT on a set of fractured problems, demonstrating that this approach can improve performance significantly. A meta-level algorithm, SNAP-NEAT, is then developed to combine the strengths of NEAT, RBF-NEAT, and Cascade-NEAT. An evaluation in a set of benchmark problems shows that it is possible to achieve good performance even when it is not known a priori whether a problem is fractured or not. A final empirical comparison of these methods demonstrates that they can scale up to real-world tasks like keepaway and half-field soccer. These results shed new light on why constructive neuroevolution algorithms have difficulty in certain domains and illustrate how bias and constraint can be used to improve performance. Thus, this dissertation shows how neuroevolution can be scaled up from learning low-level control to learning strategic decision-making problems.

Risto Miikkulainen | Nate Kohl | Nate Kohl | R. Miikkulainen

[1] Risto Miikkulainen,et al. Efficient evolution of neural networks through complexification , 2004 .

[2] Daniele Loiacono,et al. XCS with computed prediction for the learning of Boolean functions , 2005, 2005 IEEE Congress on Evolutionary Computation.

[3] Peter J. Angeline,et al. An evolutionary algorithm that constructs recurrent neural networks , 1994, IEEE Trans. Neural Networks.

[4] Zhenyu Yang,et al. Genetic and Evolutionary Computation Conference (GECCO-2008) , 2008, GECCO 2008.

[5] Jimmy Secretan,et al. Picbreeder: evolving pictures collaboratively online , 2008, CHI.

[6] Kenneth O. Stanley,et al. Generative encoding for multiagent learning , 2008, GECCO '08.

[7] Héctor Pomares,et al. Parallel Multi-objective Memetic RBFNNs Design and Feature Selection for Function Approximation Problems , 2007, IWANN.

[8] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[9] David E. Goldberg,et al. Probability Matching, the Magnitude of Reinforcement, and Classifier System Bidding , 1990, Machine Learning.

[10] Michèle Sebag,et al. Adaptive operator selection with dynamic multi-armed bandits , 2008, GECCO '08.

[11] Ali M. S. Zalzala,et al. Evolving hybrid RBF-MLP networks using combined genetic/unsupervised/supervised learning , 1998 .

[12] Ming Li,et al. An Introduction to Kolmogorov Complexity and Its Applications , 2019, Texts in Computer Science.

[13] Bruce A. Whitehead,et al. Cooperative-competitive genetic evolution of radial basis function centers and widths for time series prediction , 1996, IEEE Trans. Neural Networks.

[14] Bernhard E. Boser,et al. A training algorithm for optimal margin classifiers , 1992, COLT '92.

[15] Haralambos Sarimveis,et al. A new algorithm for developing dynamic radial basis function neural network models based on genetic algorithms , 2002, Comput. Chem. Eng..

[16] Christian Igel,et al. Neuroevolution for reinforcement learning using evolution strategies , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[17] Александр Сергеевич Леонов,et al. Замечания о полной вариации функций нескольких переменных и многомерном аналоге принципа выбора Хелли@@@On the total variation for functions of several variables and a multidimensional analog of Helly's selection principle , 1998 .

[18] Xin Yao,et al. Evolving artificial neural networks , 1999, Proc. IEEE.

[19] Jan M. Maciejowski,et al. Model discrimination using an algorithmic information criterion , 1979, Autom..

[20] P. Angeline. Evolving basis functions with dynamic receptive fields , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[21] David McLean,et al. On global-local artificial neural networks for function approximation , 2006, IEEE Trans. Neural Networks.

[22] Risto Miikkulainen,et al. Competitive Coevolution through Evolutionary Complexification , 2011, J. Artif. Intell. Res..

[23] John Moody,et al. Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.

[24] L. Darrell Whitley,et al. Genetic Reinforcement Learning for Neurocontrol Problems , 2004, Machine Learning.

[25] Benjamin Kuipers,et al. Qualitative Modeling and Heterogeneous Control of Global System Behavior , 2002, HSCC.

[26] Jason Weston,et al. Large-scale kernel machines , 2007 .

[27] S. Bochner,et al. Lectures on Fourier integrals : with an author's supplement on monotonic functions, Stieltjes integrals, and harmonic analysis , 1959 .

[28] Risto Miikkulainen,et al. Evolving Keepaway Soccer Players through Task Decomposition , 2003, GECCO.

[29] Narasimhan Sundararajan,et al. An efficient sequential learning algorithm for growing and pruning RBF (GAP-RBF) networks , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[30] Franz Oppacher,et al. Combining Competitive And Cooperative Coevolution For Training Cascade Neural Networks , 2002, GECCO.

[31] S. Lawrence,et al. Function Approximation with Neural Networks and Local Methods: Bias, Variance and Smoothness , 1996 .

[32] Richard S. Sutton,et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding , 1995, NIPS.

[33] Peter Ross,et al. Adapting Operator Settings in Genetic Algorithms , 1998, Evolutionary Computation.

[34] Zhiye Zhao,et al. Design of structural modular neural networks with genetic algorithm , 2003 .

[35] Stephen R. Marsland,et al. A self-organising network that grows when required , 2002, Neural Networks.

[36] Guido Bugmann,et al. Normalized Gaussian Radial Basis Function networks , 1998, Neurocomputing.

[37] Peter Stone,et al. Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..

[38] Risto Miikkulainen,et al. Transfer of Neuroevolved Controllers in Unstable Domains , 2004, GECCO.

[39] H. Barbosa. On Adaptive Operator Probabilities in Real Coded Genetic Algorithms , 2000 .

[40] Risto Miikkulainen,et al. Coevolving Strategies for General Game Playing , 2007, 2007 IEEE Symposium on Computational Intelligence and Games.

[41] Héctor Pomares,et al. Multiobjective evolutionary optimization of the size, shape, and position parameters of radial basis function networks for function approximation , 2003, IEEE Trans. Neural Networks.

[42] Christian Lebiere,et al. The Cascade-Correlation Learning Architecture , 1989, NIPS.

[43] Risto Miikkulainen,et al. Active Guidance for a Finless Rocket Using Neuroevolution , 2003, GECCO.

[44] Larry Bull,et al. Accuracy-based Neuro And Neuro-fuzzy Classifier Systems , 2002, GECCO.

[45] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[46] Tin Kam Ho,et al. Complexity Measures of Supervised Classification Problems , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[47] Risto Miikkulainen,et al. Evolving Neural Networks through Augmenting Topologies , 2002, Evolutionary Computation.

[48] Julian Togelius,et al. Point-to-Point Car Racing: an Initial Study of Evolution Versus Temporal Difference Learning , 2007, 2007 IEEE Symposium on Computational Intelligence and Games.

[49] Jooyoung Park,et al. Universal Approximation Using Radial-Basis-Function Networks , 1991, Neural Computation.

[50] Martin V. Butz,et al. Context-dependent predictions and cognitive arm control with XCSF , 2008, GECCO '08.

[51] Risto Miikkulainen,et al. Evolving a real-world vehicle warning system , 2006, GECCO.

[52] A. P. Wieland,et al. Evolving neural network controllers for unstable systems , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[53] Risto Miikkulainen,et al. Cultural enhancement of neuroevolution , 2002 .

[54] Stewart W. Wilson. Classifier Conditions Using Gene Expression Programming , 2008, IWLCS.

[55] Risto Miikkulainen,et al. Coevolution of Role-Based Cooperation in Multiagent Systems , 2009, IEEE Transactions on Autonomous Mental Development.

[56] Lin Guo,et al. Combining genetic optimisation with hybrid learning algorithm for radial basis function neural networks , 2003 .

[57] Kenneth O. Stanley. A Hypercube-Based Indirect Encoding for Evolving Large-Scale Neural Networks , 2009 .

[58] Kenneth A. De Jong,et al. Cooperative Coevolution: An Architecture for Evolving Coadapted Subcomponents , 2000, Evolutionary Computation.

[59] Risto Miikkulainen,et al. Solving Non-Markovian Control Tasks with Neuro-Evolution , 1999, IJCAI.

[60] Risto Miikkulainen,et al. Efficient Reinforcement Learning through Symbiotic Evolution , 1996, Machine Learning.

[61] Jun Li,et al. Q-RAN: A Constructive Reinforcement Learning Approach for Robot Behavior Learning , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[62] Justinian Rosca,et al. Hierarchical learning with procedural abstraction mechanisms , 1997 .

[63] Risto Miikkulainen,et al. Neuroevolution of an automobile crash warning system , 2005, GECCO '05.

[64] Didier Guériot,et al. RBF neural network, basis functions and genetic algorithm , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[65] Daniele Loiacono,et al. Classifier prediction based on tile coding , 2006, GECCO '06.

[66] Risto Miikkulainen,et al. Robust non-linear control through neuroevolution , 2003 .

[67] Dirk Thierens,et al. An Adaptive Pursuit Strategy for Allocating Operator Probabilities , 2005, BNAIC.

[68] A. Kolmogorov. Three approaches to the quantitative definition of information , 1968 .

[69] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[70] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[71] Gregory J. Chaitin,et al. A recent technical report , 1974, SIGA.

[72] Stewart W. Wilson. Classifiers that approximate functions , 2002, Natural Computing.

[73] Stephen A. Billings,et al. Radial basis function network configuration using genetic algorithms , 1995, Neural Networks.

[74] E. Kamke. Das Lebesgue-Stieltjes-Integral , 1958 .

[75] Yoshua Bengio,et al. Scaling learning algorithms towards AI , 2007 .

[76] Matthew Taylor and Shimon Whiteson and Peter Stone,et al. Comparing Evolutionary and Temporal Difference Methods for Reinforcement Learning , 2006 .

[77] Risto Miikkulainen,et al. Real-time neuroevolution in the NERO video game , 2005, IEEE Transactions on Evolutionary Computation.

[78] Jorma Rissanen,et al. The Minimum Description Length Principle in Coding and Modeling , 1998, IEEE Trans. Inf. Theory.

[79] Kenneth O. Stanley,et al. A Hypercube-Based Encoding for Evolving Large-Scale Neural Networks , 2009, Artificial Life.

[80] D. R. McGregor,et al. Designing application-specific neural networks using the structured genetic algorithm , 1992, [Proceedings] COGANN-92: International Workshop on Combinations of Genetic Algorithms and Neural Networks.

[81] Héctor Pomares,et al. Improving the Performance of Multi-objective Genetic Algorithm for Function Approximation Through Parallel Islands Specialisation , 2006, Australian Conference on Artificial Intelligence.

[82] Kenneth O. Stanley,et al. A novel generative encoding for exploiting neural network sensor and output geometry , 2007, GECCO '07.

[83] Martin V. Butz. Kernel-based, ellipsoidal conditions in the real-valued XCS classifier system , 2005, GECCO '05.

[84] David E. Goldberg,et al. Genetic Algorithms with Sharing for Multimodalfunction Optimization , 1987, ICGA.

[85] Zuhair Bandar,et al. Neural network architectures and overtopping predictions , 2005 .

[86] Hans-Martin Gutmann,et al. A Radial Basis Function Method for Global Optimization , 2001, J. Glob. Optim..

[87] Risto Miikkulainen,et al. Efficient Non-linear Control Through Neuroevolution , 2006, ECML.

[88] Peter Stone,et al. Keepaway Soccer: From Machine Learning Testbed to Benchmark , 2005, RoboCup.

[89] Nicholas J. Radcliffe,et al. Genetic set recombination and its application to neural network topology optimisation , 1993, Neural Computing & Applications.

[90] Larry D. Pyeatt,et al. A comparison between cellular encoding and direct encoding for genetic neural networks , 1996 .

[91] Risto Miikkulainen,et al. Evolving a Roving Eye for Go , 2004, GECCO.

[92] Minoru Asada,et al. Non-Physical Intervention in Robot Learning Based on LfE Method , 1995 .

[93] Charles W. Anderson,et al. Comparison of CMACs and radial basis functions for local function approximators in reinforcement learning , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[94] Vladimir Vapnik,et al. Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .

[95] Bryant A. Julstrom,et al. What Have You Done for Me Lately? Adapting Operator Probabilities in a Steady-State Genetic Algorithm , 1995, ICGA.

[96] Y Lu,et al. A Sequential Learning Scheme for Function Approximation Using Minimal Radial Basis Function Neural Networks , 1997, Neural Computation.

[97] Joydeep Ghosh,et al. An overview of radial basis function networks , 2001 .

[98] David B. Fogel,et al. Evolving Neural Control Systems , 1995, IEEE Expert.

[99] John C. Platt. A Resource-Allocating Network for Function Interpolation , 1991, Neural Computation.

[100] Jordan B. Pollack,et al. Creating High-Level Components with a Generative Representation for Body-Brain Evolution , 2002, Artificial Life.

[101] Risto Miikkulainen,et al. Automatic feature selection in neuroevolution , 2005, GECCO '05.

[102] Larry Bull,et al. Self-adaptive constructivism in Neural XCS and XCSF , 2008, GECCO '08.

[103] Lawrence Davis,et al. Adapting Operator Probabilities in Genetic Algorithms , 1989, ICGA.

[104] Li Jun,et al. Q-Learning with a growing RBF network for behavior learning in mobile robotics , 2005 .

[105] Leslie Pack Kaelbling,et al. Learning in embedded systems , 1993 .

[106] Peter Stone,et al. Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study , 2006, RoboCup.

[107] Todd Peterson,et al. An RBF network alternative for a hybrid architecture , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).