Cloud Driven Design of a Distributed Genetic Programming Platform

We describe how we design FlexGP, a distributed genetic programming (GP) system to efficiently run on the cloud. The system has a decentralized, fault-tolerant, cascading startup where nodes start to compute while more nodes are launched. It has a peer-to-peer neighbor discovery protocol which constructs a robust communication network across the nodes. Concurrent with neighbor discovery, each node launches a GP run differing in parameterization and training data from its neighbors. This factoring of parameters across learners produces many diverse models for use in ensemble learning.

[1]  Xavier Llorà,et al.  Scaling Genetic Algorithms Using MapReduce , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[2]  Juan Julián Merelo Guervós,et al.  A Peer-to-Peer Approach to Genetic Programming , 2011, EuroGP.

[3]  Jimmy J. Lin,et al.  Scaling Populations of a Genetic Algorithm for Job Shop Scheduling Problems Using MapReduce , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[4]  Sean Owen,et al.  Mahout in Action , 2011 .

[5]  Kalyan Veeramachaneni,et al.  Flex-GP: Genetic Programming on the Cloud , 2012, EvoApplications.

[6]  Ke Wang,et al.  Parallel learning to rank for information retrieval , 2011, SIGIR.

[7]  Xavier Llorà,et al.  Scaling eCGA model building via data-intensive computing , 2010, IEEE Congress on Evolutionary Computation.

[8]  Una-May O'Reilly,et al.  A Library to Run Evolutionary Algorithms in the Cloud Using MapReduce , 2012, EvoApplications.

[9]  Márk Jelasity,et al.  T-Man: Gossip-based fast overlay topology construction , 2009, Comput. Networks.

[10]  Giandomenico Spezzano,et al.  A Jxta Based Asynchronous Peer-to-Peer Implementation of Genetic Programming , 2006, J. Softw..

[11]  Juan Julián Merelo Guervós,et al.  EvAg: a scalable peer-to-peer evolutionary algorithm , 2010, Genetic Programming and Evolvable Machines.

[12]  Joseph M. Hellerstein,et al.  Distributed GraphLab: A Framework for Machine Learning in the Cloud , 2012, Proc. VLDB Endow..