DiME: A Scalable Disease Module Identification Algorithm with Application to Glioma Progression

Disease module is a group of molecular components that interact intensively in the disease specific biological network. Since the connectivity and activity of disease modules may shed light on the molecular mechanisms of pathogenesis and disease progression, their identification becomes one of the most important challenges in network medicine, an emerging paradigm to study complex human disease. This paper proposes a novel algorithm, DiME (Disease Module Extraction), to identify putative disease modules from biological networks. We have developed novel heuristics to optimise Community Extraction, a module criterion originally proposed for social network analysis, to extract topological core modules from biological networks as putative disease modules. In addition, we have incorporated a statistical significance measure, B-score, to evaluate the quality of extracted modules. As an application to complex diseases, we have employed DiME to investigate the molecular mechanisms that underpin the progression of glioma, the most common type of brain tumour. We have built low (grade II) - and high (GBM) - grade glioma co-expression networks from three independent datasets and then applied DiME to extract potential disease modules from both networks for comparison. Examination of the interconnectivity of the identified modules have revealed changes in topology and module activity (expression) between low- and high- grade tumours, which are characteristic of the major shifts in the constitution and physiology of tumour cells during glioma progression. Our results suggest that transcription factors E2F4, AR and ETS1 are potential key regulators in tumour progression. Our DiME compiled software, R/C++ source code, sample data and a tutorial are available at http://www.cs.bham.ac.uk/~szh/DiME.

[1]  Chi V. Dang,et al.  c-Myc Target Genes Involved in Cell Growth, Apoptosis, and Metabolism , 1999, Molecular and Cellular Biology.

[2]  C. Dang,et al.  Function of the c‐Myc oncoprotein , 1992, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[3]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  D. Davies,et al.  Squamous cell cancers contain a side population of stem-like cells that are made chemosensitive by ABC transporter blockade , 2008, British Journal of Cancer.

[5]  T. Nakayama,et al.  Expression of the ets-1 Proto-Oncogene in Human Colorectal Carcinoma , 2001, Modern Pathology.

[6]  Thomas C Chen,et al.  TGF-B2 and soluble p55 TNFR modulate VCAM-1 expression in glioma cells and brain derived endothelial cells , 1997, Journal of Neuroimmunology.

[7]  James Bailey,et al.  Information theoretic measures for clusterings comparison: is a correction for chance necessary? , 2009, ICML '09.

[8]  Aidong Zhang,et al.  A “Seed-Refine” Algorithm for Detecting Protein Complexes From Protein Interaction Data , 2007, IEEE Transactions on NanoBioscience.

[9]  Mahlon D. Johnson,et al.  Transcriptional differences between normal and glioma-derived glial progenitor cells identify a core set of dysregulated genes. , 2013, Cell reports.

[10]  J. Uhm IDH1 mutation is sufficient to establish the glioma hypermethylator phenotype , 2012 .

[11]  Zahra Amirghofran,et al.  Androgen receptor expression in relation to apoptosis and the expression of cell cycle related proteins in prostate cancer , 2008, Pathology & Oncology Research.

[12]  D. Beer,et al.  Decreased Selenium-Binding Protein 1 in Esophageal Adenocarcinoma Results from Posttranscriptional and Epigenetic Regulation and Affects Chemosensitivity , 2010, Clinical Cancer Research.

[13]  E. Levina,et al.  Community extraction for social networks , 2010, Proceedings of the National Academy of Sciences.

[14]  R. Badge,et al.  Sox8 gene expression identifies immature glial cells in developing cerebellum and cerebellar tumours. , 2001, Brain research. Molecular brain research.

[15]  Ruedi Aebersold,et al.  Yeast endosulfines control entry into quiescence and chronological life span by inhibiting protein phosphatase 2A. , 2013, Cell reports.

[16]  S. Srivastava,et al.  TMPRSS2-ERG fusion, a common genomic alteration in prostate cancer activates C-MYC and abrogates prostate epithelial differentiation , 2008, Oncogene.

[17]  Carlos Prieto,et al.  Human Gene Coexpression Landscape: Confident Network Derived from Tissue Transcriptomic Profiles , 2008, PloS one.

[18]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[19]  Hong Yi,et al.  The Function and Significance of SELENBP1 Downregulation in Human Bronchial Epithelial Carcinogenic Process , 2013, PloS one.

[20]  Pedro Martínez,et al.  Identification of survival‐related genes of the phosphatidylinositol 3′‐kinase signaling pathway in glioblastoma multiforme , 2008, Cancer.

[21]  Gerald C. Chu,et al.  P53 and Pten control neural and glioma stem/progenitor cell renewal and differentiation , 2008, Nature.

[22]  B. Erovic,et al.  The effect of nimesulide, a selective cyclooxygenase‐2 inhibitor, on Ets‐1 and Ets‐2 expression in head and neck cancer cell lines , 2005, Head & neck.

[23]  M. Acencio,et al.  HTRIdb: an open-access database for experimentally verified human transcriptional regulation interactions , 2012, BMC Genomics.

[24]  Jeffrey Q. Jiang,et al.  Towards Prediction and Prioritization of disease genes by the modularity of human phenome-genome assembled network , 2010, J. Integr. Bioinform..

[25]  T. Terasaki,et al.  Correlation of induction of ATP binding cassette transporter A5 (ABCA5) and ABCB1 mRNAs with differentiation state of human colon tumor. , 2007, Biological & pharmaceutical bulletin.

[26]  Shan He,et al.  Disease module identification from an integrated transcriptomic and interactomic network using evolutionary community extraction , 2013 .

[27]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Peter M Black,et al.  Survival rates and patterns of care for patients diagnosed with supratentorial low‐grade gliomas , 2006, Cancer.

[29]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[30]  Desok Kim,et al.  Androgen receptor gene amplification and protein expression in recurrent prostate cancer. , 2003, The Journal of urology.

[31]  Santo Fortunato,et al.  Finding Statistically Significant Communities in Networks , 2010, PloS one.

[32]  Shibo Jiang,et al.  Extracellular matrix protein betaig-h3/TGFBI promotes metastasis of colon cancer by enhancing cell extravasation. , 2008, Genes & development.

[33]  R. McLendon,et al.  Alterations of the TP53 gene in human gliomas. , 1994, Cancer research.

[34]  Weixiong Zhang,et al.  A general co-expression network-based approach to gene expression analysis: comparison and applications , 2010, BMC Systems Biology.

[35]  T. Shi,et al.  Human SBK1 is dysregulated in multiple cancers and promotes survival of ovary cancer SK-OV-3 cells , 2010, Molecular Biology Reports.

[36]  S. Oñate,et al.  Interleukin-4 enhances prostate-specific antigen expression by activation of the androgen receptor and Akt pathway , 2003, Oncogene.

[37]  S. Shibata,et al.  Expression of the Ets-1 proto-oncogene correlates with malignant potential in human astrocytic tumors. , 1999, Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc.

[38]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[39]  M. Lesniak,et al.  CD4+CD25+FoxP3+ T-cell infiltration and heme oxygenase-1 expression correlate with tumor grade in human gliomas , 2007, Journal of Neuro-Oncology.

[40]  P. Keely,et al.  R-Ras controls membrane protrusion and cell migration through the spatial regulation of Rac and Rho. , 2004, Molecular biology of the cell.

[41]  Sangsoo Kim,et al.  Gene expression Differential coexpression analysis using microarray data and its application to human cancer , 2005 .

[42]  Fred Glover,et al.  Tabu Search - Part II , 1989, INFORMS J. Comput..

[43]  Kurt Hornik,et al.  A CLUE for CLUster Ensembles , 2005 .

[44]  A. Arenas,et al.  Macro- and micro-structure of trust networks , 2002, cond-mat/0206240.

[45]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[46]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[47]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[48]  L. Chin,et al.  Malignant astrocytic glioma: genetics, biology, and paths to treatment. , 2007, Genes & development.

[49]  Vladimir Batagelj,et al.  Some analyses of Erdős collaboration graph , 2000, Soc. Networks.

[50]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[51]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[52]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[53]  Fred W. Glover,et al.  Tabu Search - Part I , 1989, INFORMS J. Comput..

[54]  J. Fletcher,et al.  ABC transporters in cancer: more than just drug efflux pumps , 2010, Nature Reviews Cancer.

[55]  M. Zaaroor,et al.  Targeted therapy for high-grade glioma with the TGF-β2 inhibitor trabedersen: results of a randomized and controlled phase IIb study , 2010, Neuro-oncology.

[56]  Santo Fortunato,et al.  Limits of modularity maximization in community detection , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  B. O'neill,et al.  Glioblastoma survival in the United States before and during the temozolomide era , 2012, Journal of Neuro-Oncology.

[58]  F. Radicchi,et al.  Statistical significance of communities in networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[59]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[60]  R. Berkowitz,et al.  Selenium binding protein 1 in ovarian cancer , 2006, International journal of cancer.

[61]  Subha Madhavan,et al.  Rembrandt: Helping Personalized Medicine Become a Reality through Integrative Translational Research , 2009, Molecular Cancer Research.

[62]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.

[63]  K. Kang,et al.  C-myc amplification altered the gene expression of ABC- and SLC-transporters in human breast epithelial cells. , 2009, Molecular pharmaceutics.

[64]  B. Efron Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods , 1981 .