Improving generalisation of genetic programming for high-dimensional symbolic regression with feature selection

Feature selection is a desired process when learning from high-dimensional data. However, it is seldom considered in Genetic Programming (GP) for high-dimensional symbolic regression. This work aims to develop a new method, Genetic Programming with Feature Selection (GPWFS), to improve the generalisation ability of GP for symbolic regression. GPWFS is a two-stage method. The main task of the first stage is to select important/informative features from fittest individuals, and the second stage uses a set of selected features, which is a subset of original features, for regression. To investigate the learning/optimisation performance and generalisation capability of GPWFS, a set of experiments using standard GP as a baseline for comparison have been conducted on six real-world high-dimensional symbolic regression datasets. The experimental results show that GPWFS can have better performance both on the training sets and the test sets on most cases. Further analysis on the solution size, the number of distinguished features and total number of used features in the evolved models shows that using GPWFS can induce more compact models with better interpretability and lower computational costs than standard GP.

[1]  Leonardo Vanneschi,et al.  Genetic programming for computational pharmacokinetics in drug discovery and development , 2007, Genetic Programming and Evolvable Machines.

[2]  Urvesh Bhowan,et al.  Genetic Programming for Feature Selection and Question-Answer Ranking in IBM Watson , 2015, EuroGP.

[3]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[4]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[5]  Marc Parizeau,et al.  Genetic Programming, Validation Sets, and Parsimony Pressure , 2006, EuroGP.

[6]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[7]  Dana H. Ballard,et al.  Rooted-tree schemata in genetic programming , 1999 .

[8]  Leonardo Vanneschi,et al.  Open issues in genetic programming , 2010, Genetic Programming and Evolvable Machines.

[9]  Mario Graff,et al.  Genetic programming as a feature selection algorithm , 2014, 2014 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC).

[10]  Si Wu,et al.  Improving support vector machine classifiers by modifying kernel functions , 1999, Neural Networks.

[11]  Carlos M. Fonseca,et al.  On the Generalization Ability of Geometric Semantic Genetic Programming , 2015, EuroGP.

[12]  Michael O'Neill,et al.  Improving the Generalisation Ability of Genetic Programming with Semantic Similarity based Crossover , 2010, EuroGP.

[13]  Xiaodong Li,et al.  Multi-objective techniques in genetic programming for evolving classifiers , 2005, 2005 IEEE Congress on Evolutionary Computation.

[14]  Grant Dick,et al.  A Re-Examination of the Use of Genetic Programming on the Oral Bioavailability Problem , 2015, GECCO.

[15]  Meland,et al.  THE USE OF MOLECULAR PROFILING TO PREDICT SURVIVAL AFTER CHEMOTHERAPY FOR DIFFUSE LARGE-B-CELL LYMPHOMA , 2002 .

[16]  Leonardo Vanneschi,et al.  Measuring bloat, overfitting and functional complexity in genetic programming , 2010, GECCO '10.

[17]  Conor Ryan,et al.  On size, complexity and generalisation error in GP , 2014, GECCO.

[18]  Edmund K. Burke,et al.  On improving genetic programming for symbolic regression , 2005, 2005 IEEE Congress on Evolutionary Computation.

[19]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[20]  Mohammad Mehdi Ebadzadeh,et al.  Improving GP generalization: a variance-based layered learning approach , 2014, Genetic Programming and Evolvable Machines.

[21]  Kumar Chellapilla,et al.  Data mining using genetic programming: the implications of parsimony on generalization error , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[22]  Krzysztof Krawiec,et al.  Multiple regression genetic programming , 2014, GECCO.

[23]  Alok Baveja,et al.  Computing , Artificial Intelligence and Information Technology A data-driven software tool for enabling cooperative information sharing among police departments , 2002 .

[24]  Conor Ryan,et al.  A Simple Approach to Lifetime Learning in Genetic Programming-Based Symbolic Regression , 2014, Evolutionary Computation.

[25]  R. Bellman Dynamic programming. , 1957, Science.

[26]  John R. Koza,et al.  GENETIC PROGRAMMING FOR ECONOMIC MODELING , 1994 .

[27]  Samaneh Sadat Mousavi Astarabadi,et al.  Avoiding Overfitting in Symbolic Regression Using the First Order Derivative of GP Trees , 2015, GECCO.

[28]  Ivo Gonçalves,et al.  Balancing Learning and Overfitting in Genetic Programming with Interleaved Sampling of Training Data , 2013, EuroGP.

[29]  Mengjie Zhang,et al.  Pareto front feature selection: using genetic programming to explore feature space , 2009, GECCO.

[30]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[31]  Leonardo Vanneschi,et al.  A Quantitative Study of Learning and Generalization in Genetic Programming , 2011, EuroGP.

[32]  Leonardo Vanneschi,et al.  Geometric Semantic Genetic Programming for Real Life Applications , 2013, GPTP.

[33]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[34]  Marcos André Gonçalves,et al.  Aggressive and effective feature selection using genetic programming , 2012, 2012 IEEE Congress on Evolutionary Computation.

[35]  Leonardo Vanneschi,et al.  A New Implementation of Geometric Semantic GP and Its Application to Problems in Pharmacokinetics , 2013, EuroGP.

[36]  Maria Semenkina,et al.  Empirical study of self-configuring genetic programming algorithm performance and behaviour , 2015 .

[37]  Peter Nordin,et al.  The Effect of Extensive Use of the Mutation Operator on Generalization in Genetic Programming Using Sparse Data Sets , 1996, PPSN.

[38]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[39]  Ibrahim Kushchu,et al.  Genetic programming and evolutionary generalization , 2002, IEEE Trans. Evol. Comput..

[40]  Conor Ryan,et al.  On Improving Generalisation in Genetic Programming , 2009, EuroGP.