Multiomics modeling of the immunome, transcriptome, microbiome, proteome and metabolome adaptations during human pregnancy

Motivation: Multiple biological clocks govern a healthy pregnancy. These biological mechanisms produce immunologic, metabolomic, proteomic, genomic and microbiomic adaptations during the course of pregnancy. Modeling the chronology of these adaptations during full‐term pregnancy provides the frameworks for future studies examining deviations implicated in pregnancy‐related pathologies including preterm birth and preeclampsia. Results: We performed a multiomics analysis of 51 samples from 17 pregnant women, delivering at term. The datasets included measurements from the immunome, transcriptome, microbiome, proteome and metabolome of samples obtained simultaneously from the same patients. Multivariate predictive modeling using the Elastic Net (EN) algorithm was used to measure the ability of each dataset to predict gestational age. Using stacked generalization, these datasets were combined into a single model. This model not only significantly increased predictive power by combining all datasets, but also revealed novel interactions between different biological modalities. Future work includes expansion of the cohort to preterm‐enriched populations and in vivo analysis of immune‐modulating interventions based on the mechanisms identified. Availability and implementation: Datasets and scripts for reproduction of results are available through: https://nalab.stanford.edu/multiomics‐pregnancy/. Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Larry Gold,et al.  Nucleic Acid Ligands With Protein-like Side Chains: Modified Aptamers and Their Use as Diagnostic and Therapeutic Agents , 2014, Molecular therapy. Nucleic acids.

[2]  B. Groner,et al.  Prolactin induces phosphorylation of Tyr694 of Stat5 (MGF), a prerequisite for DNA binding and induction of transcription. , 1994, The EMBO journal.

[3]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[4]  Matteo Bottai,et al.  Cross-Country Individual Participant Analysis of 4.1 Million Singleton Births in 5 Countries with Very High Human Development Index Confirms Known Associations but Provides No Biologic Explanation for 2/3 of All Preterm Births , 2016, PloS one.

[5]  Bing Ren,et al.  Genome-wide mapping of allele-specific protein-DNA interactions in human cells , 2008, Nature Methods.

[6]  O. J. Dunn Multiple Comparisons among Means , 1961 .

[7]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[8]  D. Pe’er,et al.  An Integrated Approach to Uncover Drivers of Cancer , 2010, Cell.

[9]  Kuo-Chen Chou,et al.  Using stacked generalization to predict membrane protein types based on pseudo-amino acid composition. , 2006, Journal of theoretical biology.

[10]  Holger H. Hoos,et al.  Analysing differences between algorithm configurations through ablation , 2016, J. Heuristics.

[11]  Chris Sander,et al.  Time to Recurrence and Survival in Serous Ovarian Tumors Predicted from Integrated Genomic Profiles , 2011, PloS one.

[12]  M. Farrar,et al.  Interleukin-2 and STAT5 in regulatory T cell development and function , 2013, JAK-STAT.

[13]  Pelin Yilmaz,et al.  The SILVA ribosomal RNA gene database project: improved data processing and web-based tools , 2012, Nucleic Acids Res..

[14]  Karen Sachs,et al.  Multiplexed mass cytometry profiling of cellular states perturbed by small-molecule regulators , 2012, Nature Biotechnology.

[15]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[16]  Andrija Bosnjak,et al.  Pre-term delivery and periodontal disease: a case-control study from Croatia. , 2006, Journal of clinical periodontology.

[17]  R. Sun,et al.  Expression of prolactin receptor and response to prolactin stimulation of human NK cell lines , 2004, Cell Research.

[18]  Jamie Perin,et al.  Global, regional, and national causes of child mortality: an updated systematic analysis for 2010 with time trends since 2000 , 2012, The Lancet.

[19]  Y. Chien,et al.  γδ T cells: first line of defense and beyond. , 2014, Annual review of immunology.

[20]  David Barber,et al.  Bayesian Classification With Gaussian Processes , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  A. G. Betz,et al.  Regulatory T cells mediate maternal tolerance to the fetus , 2004, Nature Immunology.

[22]  Guangtao Ge,et al.  Classification of premalignant pancreatic cancer mass-spectrometry data using decision tree ensembles , 2008, BMC Bioinformatics.

[23]  S L Delp,et al.  Physical activity is associated with changes in knee cartilage microstructure. , 2018, Osteoarthritis and cartilage.

[24]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[25]  Lihua Jiang,et al.  Optimized Analytical Procedures for the Untargeted Metabolomic Profiling of Human Urine and Plasma by Combining Hydrophilic Interaction (HILIC) and Reverse-Phase Liquid Chromatography (RPLC)–Mass Spectrometry* , 2015, Molecular & Cellular Proteomics.

[26]  Tracy R. Keeney,et al.  Aptamer-based multiplexed proteomic technology for biomarker discovery , 2010, Nature Precedings.

[27]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[28]  P. Waldmann,et al.  Evaluation of the lasso and the elastic net in genome-wide association studies , 2013, Front. Genet..

[29]  Nima Aghaeepour,et al.  Residential agricultural pesticide exposures and risks of preeclampsia , 2018, Environmental research.

[30]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[31]  Marylyn D. Ritchie,et al.  ATHENA: the analysis tool for heritable and environmental network associations , 2014, Bioinform..

[32]  Christine L. Sun,et al.  Temporal and spatial variation of the human microbiota during pregnancy , 2015, Proceedings of the National Academy of Sciences.

[33]  H. Senn,et al.  Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. , 2006, Analytical chemistry.

[34]  Leo Breiman,et al.  Stacked regressions , 2004, Machine Learning.

[35]  Steven P. Lund,et al.  A Bayesian Integrative Genomic Model for Pathway Analysis of Complex Traits , 2012, Genetic epidemiology.

[36]  G. Shaw,et al.  Transdisciplinary translational science and the case of preterm birth , 2012, Journal of Perinatology.

[37]  Robert Tibshirani,et al.  Hierarchical Clustering With Prototypes via Minimax Linkage , 2011, Journal of the American Statistical Association.

[38]  P. Arck,et al.  Pregnancy Around the Clock. , 2018, Trends in molecular medicine.

[39]  J. Tiedje,et al.  Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy , 2007, Applied and Environmental Microbiology.

[40]  Wanjun Chen,et al.  The mucosal immune system in the oral cavity—an orchestra of T cell diversity , 2014, International Journal of Oral Science.

[41]  Rachel B. Brem,et al.  Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks , 2008, Nature Genetics.

[42]  Ronald J Wong,et al.  Simultaneously Monitoring Immune Response and Microbial Infections during Pregnancy through Plasma cfRNA Sequencing. , 2017, Clinical chemistry.

[43]  D. Relman,et al.  An ecological and evolutionary perspective on human–microbe mutualism and disease , 2007, Nature.

[44]  R. Tibshirani,et al.  An immune clock of human pregnancy , 2017, Science Immunology.

[45]  T. Inder,et al.  Neonatal MRI to predict neurodevelopmental outcomes in preterm infants. , 2006, The New England journal of medicine.

[46]  J. Konkel,et al.  Tissue-Specific Immunity at the Oral Mucosal Barrier. , 2017, Trends in immunology.

[47]  A. Califano,et al.  Dialogue on Reverse‐Engineering Assessment and Methods , 2007, Annals of the New York Academy of Sciences.

[48]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[49]  Amanda J. C. Sharkey,et al.  On Combining Artificial Neural Nets , 1996, Connect. Sci..

[50]  G. Nolan,et al.  A benchmark for evaluation of algorithms for identification of cellular correlates of clinical outcomes , 2016, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[51]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[52]  Roger M Arce,et al.  Periodontal disease severity is related to high levels of C-reactive protein in pre-eclampsia , 2007, Journal of hypertension.

[53]  Adrian Erlebacher,et al.  Immunology of the maternal-fetal interface. , 2013, Annual review of immunology.

[54]  René Druckmann,et al.  Progesterone and the immunology of pregnancy , 2005, The Journal of Steroid Biochemistry and Molecular Biology.

[55]  Concha Bielza,et al.  Machine Learning in Bioinformatics , 2008, Encyclopedia of Database Systems.

[56]  Albert Y. Zomaya,et al.  A Review of Ensemble Methods in Bioinformatics , 2010, Current Bioinformatics.

[57]  Deborah Levine,et al.  Fetal Imaging , 2014, Journal of ultrasound in medicine : official journal of the American Institute of Ultrasound in Medicine.

[58]  Tomasz Burzykowski,et al.  Linear Mixed Effects Model , 2021, Encyclopedia of Gerontology and Population Aging.

[59]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[60]  Eddy J. Bautista,et al.  Integrative Personal Omics Profiles during Periods of Weight Gain and Loss. , 2018, Cell systems.

[61]  Holger H. Hoos,et al.  Ensemble-based prediction of RNA secondary structures , 2013, BMC Bioinformatics.

[62]  M. Olinto,et al.  Periodontal disease and perinatal outcomes: a case-control study. , 2007, Journal of clinical periodontology.

[63]  Robert Tibshirani,et al.  A proteomic clock of human pregnancy , 2017, American journal of obstetrics and gynecology.

[64]  Christine L. Sun,et al.  Replication and refinement of a vaginal microbial signature of preterm birth in two racially distinct cohorts of US women , 2017, Proceedings of the National Academy of Sciences.

[65]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[66]  RuiSun,et al.  Expression of prolactin receptor and response to prolactin stimulation of human NK cell lines , 2004 .

[67]  G. Hajishengallis,et al.  Periodontitis: from microbial immune subversion to systemic inflammation , 2014, Nature Reviews Immunology.

[68]  François Goffinet,et al.  Maternal periodontitis and causes of preterm birth : The case-control , 2010 .

[69]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[70]  Ju Han Kim,et al.  Synergistic effect of different levels of genomic data for cancer clinical outcome prediction , 2012, J. Biomed. Informatics.

[71]  Nima Aghaeepour,et al.  Implementing Mass Cytometry at the Bedside to Study the Immunological Basis of Human Diseases: Distinctive Immune Features in Patients with a History of Term or Preterm Birth , 2015, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[72]  Amy P. Murtha,et al.  Maternal Periodontal Disease Is Associated With an Increased Risk for Preeclampsia , 2003, Obstetrics and gynecology.

[73]  M. Ritchie,et al.  Methods of integrating data to uncover genotype–phenotype interactions , 2015, Nature Reviews Genetics.

[74]  Kim-Anh Lê Cao,et al.  mixOmics: An R package for ‘omics feature selection and multiple data integration , 2017, bioRxiv.

[75]  Kim-Anh Lê Cao,et al.  DIABLO - an integrative, multi-omics, multivariate method for multi-group classification , 2017 .

[76]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[77]  Rachel B. Brem,et al.  Stitching together Multiple Data Dimensions Reveals Interacting Metabolomic and Transcriptomic Networks That Modulate Cell Regulation , 2012, PLoS biology.

[78]  Peter Bühlmann,et al.  MissForest - non-parametric missing value imputation for mixed-type data , 2011, Bioinform..

[79]  Sam Mesiano,et al.  Novel concepts on pregnancy clocks and alarms: redundancy and synergy in human parturition. , 2016, Human reproduction update.

[80]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[81]  Jeffrey C Murray,et al.  A solution pathway for preterm birth: accelerating a priority research agenda , 2013, The Lancet. Global health.

[82]  Sarah A. Teichmann,et al.  Single-Cell RNA Sequencing Reveals T Helper Cells Synthesizing Steroids De Novo to Contribute to Immune Homeostasis , 2014, Cell reports.

[83]  Gary M Shaw,et al.  What factors are related to recurrent preterm birth among underweight women?* , 2018, The journal of maternal-fetal & neonatal medicine : the official journal of the European Association of Perinatal Medicine, the Federation of Asia and Oceania Perinatal Societies, the International Society of Perinatal Obstetricians.

[84]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[85]  Henk Ritzema,et al.  Frequency and regression analysis. , 1994 .

[86]  Hongfei Lin,et al.  Extracting Drug-Drug Interaction from the Biomedical Literature Using a Stacked Generalization-Based Approach , 2013, PloS one.

[87]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[88]  Anthony A Kossiakoff,et al.  Crystal structure and site 1 binding energetics of human placental lactogen. , 2006, Journal of molecular biology.

[89]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[90]  Atul J Butte,et al.  Risky Business: Meeting the Structural Needs of Transdisciplinary Science. , 2017, The Journal of pediatrics.