Paper information - Particle swarm optimization approaches to coevolve strategies for the iterated prisoner's dilemma

Particle swarm optimization approaches to coevolve strategies for the iterated prisoner's dilemma

This paper presents and investigates the application of coevolutionary training techniques based on particle swarm optimization (PSO) to evolve playing strategies for the nonzero-sum problem of the iterated prisoner's dilemma (IPD). Three different coevolutionary PSO techniques are used, differing in the way that IPD strategies are represented: a neural network (NN) approach in which the NN is used to predict the next action; a binary PSO approach in which the particle represents a complete playing strategy; and a novel approach that exploits the symmetrical structure of man-made strategies. The last technique uses a PSO algorithm as a function approximator to evolve a function that characterizes the dynamics of the IPD. These PSO approaches are compared experimentally with one another and with popular man-made strategies, and their performance is evaluated in both clean and noisy environments. Results indicate that NNs cooperate well, but may develop weak strategies that can cause catastrophic collapses. The binary PSO technique does not have the same deficiency, instead resulting in an overall state of equilibrium in which some strategies are allowed to exploit the population, but never dominate. The symmetry approach is not as successful as the binary PSO approach in maintaining cooperation in both noisy and noiseless environments, exhibiting selfish behavior against the benchmark strategies and depriving them of almost any payoff. Overall, the PSO techniques are successful at generating a variety of strategies for use in the IPD, duplicating and improving on existing evolutionary IPD population observations.
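To make the setting concrete, the following is a minimal Python sketch of an IPD match between two bit-string strategies, with optional noise. It is an illustration only and not the paper's implementation. The payoff values (5, 3, 1, 0), the 5-bit memory-one encoding, the noise model (each move flipped independently), and the names `PAYOFF`, `next_move`, `play_ipd`, and `sample_bits` are all assumptions, since the abstract does not specify them. `sample_bits` shows the usual sigmoid sampling rule of a discrete binary PSO, by which a particle's velocity vector could be turned into such a bit string.

```python
import math
import random

# Commonly used IPD payoffs (an assumption; the abstract does not state them).
# Keys are (my_move, opponent_move); 1 = cooperate, 0 = defect.
PAYOFF = {(1, 1): 3, (1, 0): 0, (0, 1): 5, (0, 0): 1}


def next_move(bits, history):
    """Return the next move of a 5-bit memory-one strategy (hypothetical encoding).

    bits[0] is the opening move; bits[1 + 2*mine + theirs] is the reply to
    the previous round's (mine, theirs) pair of moves.
    """
    if not history:
        return bits[0]
    mine, theirs = history[-1]
    return bits[1 + 2 * mine + theirs]


def play_ipd(bits_a, bits_b, rounds=100, noise=0.0, rng=None):
    """Play an iterated game and return the total payoff of each player."""
    rng = rng or random.Random(0)
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        a = next_move(bits_a, hist_a)
        b = next_move(bits_b, hist_b)
        # Noise model (assumed): each move is flipped with probability `noise`.
        if rng.random() < noise:
            a = 1 - a
        if rng.random() < noise:
            b = 1 - b
        score_a += PAYOFF[(a, b)]
        score_b += PAYOFF[(b, a)]
        hist_a.append((a, b))
        hist_b.append((b, a))
    return score_a, score_b


def sample_bits(velocity, rng):
    """Binary-PSO style sampling: bit i is 1 with probability sigmoid(v_i)."""
    return [1 if rng.random() < 1.0 / (1.0 + math.exp(-v)) else 0
            for v in velocity]


# Two man-made benchmark strategies written in the assumed encoding:
# [opening, reply to (D,D), reply to (D,C), reply to (C,D), reply to (C,C)]
TIT_FOR_TAT = [1, 0, 1, 0, 1]
ALWAYS_DEFECT = [0, 0, 0, 0, 0]

if __name__ == "__main__":
    print(play_ipd(TIT_FOR_TAT, TIT_FOR_TAT))
    print(play_ipd(TIT_FOR_TAT, ALWAYS_DEFECT))
    print(play_ipd(TIT_FOR_TAT, TIT_FOR_TAT, noise=0.05))
```

In a coevolutionary setup, a particle's fitness would typically be its total payoff from `play_ipd` against other members of the population, so no fixed opponent or external teacher is needed. The bit-string length grows with the amount of history a strategy is allowed to remember; five bits is simply the smallest case that can express tit-for-tat.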

Andries Petrus Engelbrecht | Nelis Franken
