Differential evolution based on reinforcement learning with fitness ranking for solving multimodal multiobjective problems

Abstract In multimodal multiobjective optimization problems (MMOOPs), there is more than one Pareto-optimal Set (PS) in the decision space corresponding to the same Pareto Front(PF). How to dynamically adjust the evolution direction of the population adaptively is a key problem, to ensure approaching the PF in the global sense with good convergence while finding out more PSs. In this paper, a novel Differential Evolution algorithm based on Reinforcement Learning with Fitness Ranking (DE-RLFR) is proposed. The DE-RLFR is based on the Q-learning framework, and each individual in the population is considered an agent. The fitness ranking values of each agent are used to encode hierarchical state variables. Three typical DE mutation operations are employed as optional actions for the agent. Based on the analysis of the distribution characteristics of the population in objective space, decision space and fitness-ranking space, we design a reward function of the 〉state, action〈 pairs to guide the population to move to the PF asymptotically. According to its reinforcement learning experience represented by the corresponding Q table value, each agent could adaptively select a mutation strategy to generate offspring individuals. The evaluation results on eleven MMOOP test functions show that DE-RLFR could quickly and effectively find multiple PSs in the decision space, and approach PF in the global sense.

[1]  Jing J. Liang,et al.  A Multiobjective Particle Swarm Optimizer Using Ring Topology for Solving Multimodal Multiobjective Problems , 2018, IEEE Transactions on Evolutionary Computation.

[2]  Jing J. Liang,et al.  A Self-organizing Multi-objective Particle Swarm Optimization Algorithm for Multimodal Multi-objective Problems , 2018, ICSI.

[3]  Chee Peng Lim,et al.  A new Reinforcement Learning-based Memetic Particle Swarm Optimizer , 2016, Appl. Soft Comput..

[4]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[5]  Kalyanmoy Deb,et al.  Multi-objective Genetic Algorithms: Problem Difficulties and Construction of Test Problems , 1999, Evolutionary Computation.

[6]  Eckart Zitzler,et al.  Improving hypervolume-based multiobjective evolutionary algorithms by using objective reduction methods , 2007, 2007 IEEE Congress on Evolutionary Computation.

[7]  Madalina M. Drugan,et al.  Reinforcement learning versus evolutionary computation: A survey on hybrid algorithms , 2019, Swarm Evol. Comput..

[8]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[9]  Hisao Ishibuchi,et al.  A many-objective test problem for visually examining diversity maintenance behavior in a decision space , 2011, GECCO '11.

[10]  Kalyanmoy Deb,et al.  Omni-optimizer: A Procedure for Single and Multi-objective Optimization , 2005, EMO.

[11]  Daniel J. Lizotte,et al.  Multi-Objective Markov Decision Processes for Data-Driven Decision Support , 2016, J. Mach. Learn. Res..

[12]  Jing J. Liang,et al.  A novel scalable test problem suite for multimodal multiobjective optimization , 2019, Swarm Evol. Comput..

[13]  Ponnuthurai N. Suganthan,et al.  Recent advances in differential evolution - An updated survey , 2016, Swarm Evol. Comput..

[14]  Mariette Awad,et al.  Decision Making in Multiagent Systems: A Survey , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[15]  Kalyanmoy Deb,et al.  Data mining methods for knowledge discovery in multi-objective optimization: Part A - Survey , 2017, Expert Syst. Appl..

[16]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[17]  Jing J. Liang,et al.  Multimodal multi-objective optimization: A preliminary study , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[18]  Jian Cheng,et al.  Robust Dynamic Multi-Objective Vehicle Routing Optimization Method , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[19]  Pratyusha Rakshit,et al.  Realization of an Adaptive Memetic Algorithm Using Differential Evolution and Q-Learning: A Case Study in Multirobot Path Planning , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[20]  Xin-She Yang,et al.  Bio-inspired computation: Where we stand and what's next , 2019, Swarm Evol. Comput..

[21]  Hisao Ishibuchi,et al.  A Decomposition-Based Evolutionary Algorithm for Multi-modal Multi-objective Optimization , 2018, PPSN.

[22]  Günter Rudolph,et al.  Capabilities of EMOA to Detect and Preserve Equivalent Pareto Subsets , 2007, EMO.

[23]  Hao Wang,et al.  Towards Analyzing Multimodality of Continuous Multiobjective Landscapes , 2016, PPSN.

[24]  Kalyanmoy Deb,et al.  An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints , 2014, IEEE Transactions on Evolutionary Computation.

[25]  Jing J. Liang,et al.  Multimodal multiobjective optimization with differential evolution , 2019, Swarm Evol. Comput..

[26]  Mu-Chun Su,et al.  A Q-learning-based swarm optimization algorithm for economic dispatch problem , 2015, Neural Computing and Applications.

[27]  Eckart Zitzler,et al.  Indicator-Based Selection in Multiobjective Search , 2004, PPSN.