Cooperative Co-Evolutionary Module Identification With Application to Cancer Disease Module Discovery

Module identification or community detection in complex networks has become increasingly important in many scientific fields because it provides insight into the relationship and interaction between network function and topology. In recent years, module identification algorithms based on stochastic optimization algorithms such as evolutionary algorithms have been demonstrated to be superior to other algorithms on small- to medium-scale networks. However, the scalability and resolution limit (RL) problems of these module identification algorithms have not been fully addressed, which impeded their application to real-world networks. This paper proposes a novel module identification algorithm called cooperative co-evolutionary module identification to address these two problems. The proposed algorithm employs a cooperative co-evolutionary framework to handle large-scale networks. We also incorporate a recursive partitioning scheme into the algorithm to effectively address the RL problem. The performance of our algorithm is evaluated on 12 benchmark complex networks. As a medical application, we apply our algorithm to identify disease modules that differentiate low- and high-grade glioma tumors to gain insights into the molecular mechanisms that underpin the progression of glioma. Experimental results show that the proposed algorithm has a very competitive performance compared with other state-of-the-art module identification algorithms.

[1]  Ville Tirronen,et al.  Recent advances in differential evolution: a survey and experimental analysis , 2010, Artificial Intelligence Review.

[2]  Robert Clarke,et al.  Gene Module Identification from Microarray Data Using Nonnegative Independent Component Analysis , 2008, Gene regulation and systems biology.

[3]  Weidong Tian,et al.  An iterative network partition algorithm for accurate identification of dense network modules , 2011, Nucleic acids research.

[4]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[5]  Haluk Bingol,et al.  Community Detection in Complex Networks Using Genetic Algorithms , 2006, 0711.0491.

[6]  Alexander Bailey,et al.  Genetic Programming for the Automatic Inference of Graph Models for Complex Networks , 2014, IEEE Transactions on Evolutionary Computation.

[7]  Yonghong Xiao,et al.  Pattern of retinoblastoma pathway inactivation dictates response to CDK4/6 inhibition in GBM , 2010, Proceedings of the National Academy of Sciences.

[8]  Xiaohua Hu,et al.  Dynamic identifying protein functional modules based on adaptive density modularity in protein-protein interaction networks , 2015, BMC Bioinformatics.

[9]  Iñaki Inza,et al.  Dealing with the evaluation of supervised classification algorithms , 2015, Artificial Intelligence Review.

[10]  Efrén Mezura-Montes,et al.  Differential evolution in constrained numerical optimization: An empirical study , 2010, Inf. Sci..

[11]  Maoguo Gong,et al.  Memetic algorithm for community detection in networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Yong Wang,et al.  Combining Multiobjective Optimization With Differential Evolution to Solve Constrained Optimization Problems , 2012, IEEE Transactions on Evolutionary Computation.

[14]  Chris Hankin,et al.  Fast multi-scale detection of overlapping communities using local criteria , 2014, Computing.

[15]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[16]  M. Tadesse,et al.  Pathway and Network Approaches for Identification of Cancer Signature Markers from Omics Data , 2015, Journal of Cancer.

[17]  R. Storn,et al.  Differential evolution a simple and efficient adaptive scheme for global optimization over continu , 1997 .

[18]  Robert A Lustig,et al.  Multicentric Glioblastoma Multiforme in a Patient with BRCA‐1 Invasive Breast Cancer , 2006, The breast journal.

[19]  H Aburatani,et al.  The critical role of cyclin D2 in cell cycle progression and tumorigenicity of glioblastoma stem cells , 2013, Oncogene.

[20]  Marissa Friedman,et al.  Glioblastoma: Molecular Pathways, Stem Cells and Therapeutic Targets , 2015, Cancers.

[21]  Andrew M. Tyrrell,et al.  Evolving Classifiers to Recognize the Movement Characteristics of Parkinson's Disease Patients , 2014, IEEE Transactions on Evolutionary Computation.

[22]  N. Smorodinsky,et al.  The breast cancer-associated MUC1 gene generates both a receptor and its cognate binding protein. , 1999, Cancer research.

[23]  Mark Hoogendoorn,et al.  Parameter Control in Evolutionary Algorithms: Trends and Challenges , 2015, IEEE Transactions on Evolutionary Computation.

[24]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[25]  Kalyanmoy Deb,et al.  An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints , 2014, IEEE Transactions on Evolutionary Computation.

[26]  P. Kleihues,et al.  Population-based studies on incidence, survival rates, and genetic alterations in astrocytic and oligodendroglial gliomas. , 2005, Journal of neuropathology and experimental neurology.

[27]  Xiaodong Li,et al.  Cooperative Coevolution With Route Distance Grouping for Large-Scale Capacitated Arc Routing Problems , 2014, IEEE Transactions on Evolutionary Computation.

[28]  Kevin E. Bassler,et al.  Improved community structure detection using a modified fine-tuning strategy , 2009, ArXiv.

[29]  Yuren Zhou,et al.  Performance Analysis of Evolutionary Algorithms for the Minimum Label Spanning Tree Problem , 2014, IEEE Transactions on Evolutionary Computation.

[30]  Yong Wang,et al.  An improved (μ + λ)-constrained differential evolution for constrained optimization , 2013, Inf. Sci..

[31]  Chia-Hsuan Yeh,et al.  Social Networks and Asset Price Dynamics , 2015, IEEE Transactions on Evolutionary Computation.

[32]  Juergen Thiele,et al.  The 2008 World Health Organization classification system for myeloproliferative neoplasms , 2009, Cancer.

[33]  Xiaodong Li,et al.  Cooperative Co-Evolution With Differential Grouping for Large Scale Optimization , 2014, IEEE Transactions on Evolutionary Computation.

[34]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Shan He,et al.  Disease module identification from an integrated transcriptomic and interactomic network using evolutionary community extraction , 2013 .

[36]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[37]  Judith A. Blake,et al.  Ten Quick Tips for Using the Gene Ontology , 2013, PLoS Comput. Biol..

[38]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[39]  Alain Hertz,et al.  A framework for the description of evolutionary algorithms , 2000, Eur. J. Oper. Res..

[40]  Qingfu Zhang,et al.  An Evolutionary Many-Objective Optimization Algorithm Based on Dominance and Decomposition , 2015, IEEE Transactions on Evolutionary Computation.

[41]  Tapabrata Ray,et al.  Differential Evolution With Dynamic Parameters Selection for Optimization Problems , 2014, IEEE Transactions on Evolutionary Computation.

[42]  Francisco Herrera,et al.  A study on the use of non-parametric tests for analyzing the evolutionary algorithms’ behaviour: a case study on the CEC’2005 Special Session on Real Parameter Optimization , 2009, J. Heuristics.

[43]  Vladimir Batagelj,et al.  Some analyses of Erdős collaboration graph , 2000, Soc. Networks.

[44]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[45]  Amedeo Caflisch,et al.  Efficient Modularity Optimization: Multi-Step Greedy Algorithm and Vertex Mover Refinement , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Bin Liu,et al.  Michigan Molecular Interactions (MiMI): putting the jigsaw puzzle together , 2006, Nucleic Acids Res..

[47]  Saraswati Sukumar,et al.  The Hox genes and their roles in oncogenesis , 2010, Nature Reviews Cancer.

[48]  Yong Wang,et al.  Community Detection in Social and Biological Networks Using Differential Evolution , 2012, LION.

[49]  Hanno Steen,et al.  Development of human protein reference database as an initial platform for approaching systems biology in humans. , 2003, Genome research.

[50]  T. Murata,et al.  Advanced modularity-specialized label propagation algorithm for detecting communities in networks , 2009, 0910.1154.

[51]  Lazaros G. Papageorgiou,et al.  Module detection in complex networks using integer optimisation , 2010, Algorithms for Molecular Biology.

[52]  Thomas Lengauer,et al.  A new measure for functional similarity of gene products based on Gene Ontology , 2006, BMC Bioinformatics.

[53]  John Scott Social Network Analysis , 1988 .

[54]  Qiang Huang,et al.  Community Detection Using Cooperative Co-evolutionary Differential Evolution , 2012, PPSN.

[55]  Maoguo Gong,et al.  Complex Network Clustering by Multiobjective Discrete Particle Swarm Optimization Based on Decomposition , 2014, IEEE Transactions on Evolutionary Computation.

[56]  Javier Béjar,et al.  Clustering algorithm for determining community structure in large networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[58]  M. Newman,et al.  Identifying the role that animals play in their social networks , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[59]  R. Fisher Statistical methods for research workers , 1927, Protoplasma.

[60]  Yong Wang,et al.  Locating Multiple Optimal Solutions of Nonlinear Equation Systems Based on Multiobjective Optimization , 2015, IEEE Transactions on Evolutionary Computation.

[61]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[62]  Andreas Noack,et al.  Multilevel local search algorithms for modularity clustering , 2011, JEAL.

[63]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[65]  S. Burma,et al.  Epidermal growth factor receptor in glioma: signal transduction, neuropathology, imaging, and radioresistance. , 2010, Neoplasia.

[66]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[67]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[68]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[69]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[70]  Aidong Zhang,et al.  A “Seed-Refine” Algorithm for Detecting Protein Complexes From Protein Interaction Data , 2007, IEEE Transactions on NanoBioscience.

[71]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[72]  Xin Yao,et al.  Large scale evolutionary optimization using cooperative coevolution , 2008, Inf. Sci..

[73]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[74]  Yanhui Hu,et al.  Integrating protein-protein interaction networks with phenotypes reveals signs of interactions , 2013, Nature Methods.

[75]  Weixiong Zhang,et al.  Identifying network communities with a high resolution. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[76]  A. Arenas,et al.  Macro- and micro-structure of trust networks , 2002, cond-mat/0206240.

[77]  Yijie Wang,et al.  Functional module identification in protein interaction networks by interaction patterns , 2014, Bioinform..

[78]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[79]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[80]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[81]  Anthony Brabazon,et al.  Comparing methods for module identification in grammatical evolution , 2012, GECCO '12.

[82]  Xin Yao,et al.  DiME: A Scalable Disease Module Identification Algorithm with Application to Glioma Progression , 2014, PloS one.