Determining modular organization of protein interaction networks by maximizing modularity density

Background: With the ever-increasing amount of available data on biological networks, modeling and understanding the structure of these large networks is an important problem with profound biological implications. Cellular functions and biochemical events are carried out in a coordinated way by groups of proteins that interact with each other in biological modules. Identifying such modules in protein interaction networks is essential for understanding the structure and function of these fundamental cellular networks. Developing an effective computational method to uncover biological modules is therefore both challenging and indispensable.

Results: The purpose of this study is to introduce a new quantitative measure, modularity density, into the field of biomolecular networks and to develop new algorithms for detecting functional modules in protein-protein interaction (PPI) networks. Specifically, we adopt simulated annealing (SA) to maximize the modularity density and evaluate its efficiency on simulated networks. To address the computational complexity of the SA procedure, we devise a spectral method for optimizing the index and apply it to a yeast PPI network.

Conclusions: Our analysis of the modules detected by the present method suggests that most of them are biologically significant in the context of protein complexes. Comparison with MCL and with modularity-based methods shows the efficiency of our method.
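As a rough illustration of the approach described in the Results, the sketch below scores a partition by modularity density and searches for a better one with simulated annealing. The abstract does not state the formula, so the code assumes the commonly used definition D = sum over communities of (L_in - L_out) / |V_c|, where L_in counts each internal edge twice and L_out counts edges leaving the community. The function names, the single-node move set, the geometric cooling schedule, and the toy graph are all hypothetical choices made for this example, not the authors' implementation.

```python
import math
import random


def modularity_density(adj, communities):
    """Sum of (internal degree - external degree) / size over communities.

    Assumed definition; internal edges are counted once per endpoint.
    """
    total = 0.0
    for members in communities:
        if not members:
            continue  # empty groups contribute nothing
        inside = sum(1 for u in members for v in adj[u] if v in members)
        outside = sum(1 for u in members for v in adj[u] if v not in members)
        total += (inside - outside) / len(members)
    return total


def anneal(adj, k, steps=2000, t0=1.0, cooling=0.995, seed=0):
    """Simulated annealing over assignments of nodes to at most k groups."""
    rng = random.Random(seed)
    nodes = sorted(adj)
    label = {u: rng.randrange(k) for u in nodes}

    def groups(lab):
        return [{u for u in nodes if lab[u] == c} for c in range(k)]

    current = modularity_density(adj, groups(label))
    best, best_label = current, dict(label)
    t = t0
    for _ in range(steps):
        u = rng.choice(nodes)
        old = label[u]
        label[u] = rng.randrange(k)  # propose moving one node
        cand = modularity_density(adj, groups(label))
        # Metropolis rule: always accept improvements, sometimes accept losses.
        if cand >= current or rng.random() < math.exp((cand - current) / t):
            current = cand
            if current > best:
                best, best_label = current, dict(label)
        else:
            label[u] = old  # reject the move
        t *= cooling
    return best, groups(best_label)


# Toy network: two triangles joined by the single edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
two_triangles = [{0, 1, 2}, {3, 4, 5}]
score = modularity_density(adj, two_triangles)
best, found = anneal(adj, k=2)
```

On the toy graph, splitting at the bridge gives each triangle (6 - 1) / 3, so the partition scores 10/3, against 14/6 for a single community. Recomputing the score from scratch at every step is deliberately naive and costs time proportional to the number of edges per move, which hints at why a faster spectral alternative is attractive for a full PPI network.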
