An evolutionary multi-objective approach for prototype generation

k-NN is one of the most popular and effective models for pattern classification. However, it has two main drawbacks that hinder the application of this method for large data sets: (1) the whole training set has to be stored in memory, and (2) for classifying a test pattern it has to be compared to all other training instances. In order to overcome these shortcomings, prototype generation (PG) methods aim to reduce the size of the training set while maintaining or increasing the classification performance of k-NN. Accordingly, most PG methods aim to generate instances that try to maximize classification performance. Nevertheless, in most cases, the reduction objective is only implicitly optimized. This paper introduces EMOPG, a novel approach to PG based on multi-objective optimization that explicitly optimizes both objectives: accuracy and reduction. Under EMOPG, prototypes are initialized with a subset of training instances selected through a tournament, according to a weighting term. A multi-objective evolutionary algorithm, PAES (Pareto Archived Evolution Strategy), is implemented to adjust the position of the initial prototypes. The optimization process aims to simultaneously maximize the classification performance of prototypes while reducing the number of instances with respect to the training set. A strategy for selecting a single solution from the set of non-dominated solutions is proposed. We evaluate the performance of EMOPG using a suite of benchmark data sets and compare the performance of our proposal with respect to the one obtained by alternative techniques. Experimental results show that our proposed method offers a better trade-off between accuracy and reduction than other methods.

[1]  Utpal Garain,et al.  Prototype reduction using an artificial immune model , 2008, Pattern Analysis and Applications.

[2]  David W. Corne,et al.  Approximating the Nondominated Front Using the Pareto Archived Evolution Strategy , 2000, Evolutionary Computation.

[3]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[4]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[5]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[6]  Francisco Herrera,et al.  A Taxonomy and Experimental Study on Prototype Generation for Nearest Neighbor Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[7]  Xinjie Yu Introduction to evolutionary algorithms , 2010, The 40th International Conference on Computers & Indutrial Engineering.

[8]  Fernando Fernández,et al.  Evolutionary Design of Nearest Prototype Classifiers , 2004, J. Heuristics.

[9]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[10]  DebK.,et al.  A fast and elitist multiobjective genetic algorithm , 2002 .

[11]  Tony R. Martinez,et al.  Reduction Techniques for Instance-Based Learning Algorithms , 2000, Machine Learning.

[12]  Marco Laumanns,et al.  SPEA2: Improving the Strength Pareto Evolutionary Algorithm For Multiobjective Optimization , 2002 .

[13]  Loris Nanni,et al.  Particle swarm optimization for prototype reduction , 2009, Neurocomputing.

[14]  Inés María Galván,et al.  AMPSO: A New Particle Swarm Method for Nearest Neighborhood Classification , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[15]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[16]  Francisco Herrera,et al.  Differential evolution for optimizing the positioning of prototypes in nearest neighbor classification , 2011, Pattern Recognit..

[17]  Francisco Herrera,et al.  Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Hugo Jair Escalante,et al.  Genetic Programming of Prototypes for Pattern Classification , 2013, IbPRIA.

[19]  J. Periaux,et al.  Evolutionary Methods for Design, Optimization and Control with Applications to Industrial Problems , 2001 .