Inferring reaction network structure from single-cell, multiplex data, using toric systems theory

The goal of many single-cell studies on eukaryotic cells is to gain insight into the biochemical reactions that control cell fate and state. In this paper we introduce the concept of effective stoichiometric space (ESS) to guide the reconstruction of biochemical networks from multiplexed, fixed time-point, single-cell data. In contrast to methods based solely on statistical models of data, the ESS method leverages the power of the geometric theory of toric varieties to begin unraveling the structure of chemical reaction networks (CRN). This application of toric theory enables a data-driven mapping of covariance relationships in single cell measurements into stoichiometric information, one in which each cell subpopulation has its associated ESS interpreted in terms of CRN theory. In the development of ESS we reframe certain aspects of the theory of CRN to better match data analysis. As an application of our approach we process cytomery- and image-based single-cell datasets and identify differences in cells treated with kinase inhibitors. Our approach is directly applicable to data acquired using readily accessible experimental methods such as Fluorescence Activated Cell Sorting (FACS) and multiplex immunofluorescence. Author summary We introduce a new notion, which we call the effective stoichiometric space (ESS), that elucidates network structure from the covariances of single-cell multiplexed data. The ESS approach differs from methods that are based on purely statistical models of data: it allows a completely new and data-driven translation of the theory of toric varieties in geometry and specifically their role in chemical reaction networks (CRN). In the process, we reframe certain aspects of the theory of CRN. As illustrations of our approach, we find stoichiometry in different single-cell datasets, and pinpoint dose-dependence of network perturbations in drug-treated cells.

[1]  James B. Rawlings,et al.  The QSSA in Chemical Kinetics: As Taught and as Practiced , 2014 .

[2]  Allon M. Klein,et al.  Single-cell mapping of gene expression landscapes and lineage in the zebrafish embryo , 2018, Science.

[3]  J. W. Silverstein,et al.  Eigenvalues of large sample covariance matrices of spiked population models , 2004, math/0408165.

[4]  M. Elowitz,et al.  Challenges and emerging directions in single-cell analysis , 2017, bioRxiv.

[5]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[6]  Facundo Mémoli,et al.  Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition , 2007, PBG@Eurographics.

[7]  Sean C. Bendall,et al.  From single cells to deep phenotypes in cancer , 2012, Nature Biotechnology.

[8]  Kenneth L. Ho,et al.  Numerical algebraic geometry for model selection and its application to the life sciences , 2015, Journal of The Royal Society Interface.

[9]  M. Selbach,et al.  Global quantification of mammalian gene expression control , 2011, Nature.

[10]  R. Jackson,et al.  General mass action kinetics , 1972 .

[11]  Jaejik Kim,et al.  Statistical Model for Biochemical Network Inference , 2013, Commun. Stat. Simul. Comput..

[12]  Martin Feinberg,et al.  Foundations of Chemical Reaction Network Theory , 2019, Applied Mathematical Sciences.

[13]  B. Becher,et al.  The end of omics? High dimensional single cell analysis in precision medicine , 2019, European journal of immunology.

[14]  Juan Carlos Fernández,et al.  Multiobjective evolutionary algorithms to identify highly autocorrelated areas: the case of spatial distribution in financially compromised farms , 2014, Ann. Oper. Res..

[15]  Alicia Dickenstein,et al.  Multistationarity in Structured Reaction Networks , 2018, Bulletin of mathematical biology.

[16]  Gheorghe Craciun,et al.  Toric Differential Inclusions and a Proof of the Global Attractor Conjecture , 2015, 1501.02860.

[17]  Badal Joshi,et al.  A survey of methods for deciding whether a reaction network is multistationary , 2014, 1412.5257.

[18]  N. Friedman,et al.  Stochastic protein expression in individual cells at the single molecule level , 2006, Nature.

[19]  Alicia Dickenstein,et al.  Chemical Reaction Systems with Toric Steady States , 2011, Bulletin of mathematical biology.

[20]  R. Goody,et al.  The original Michaelis constant: translation of the 1913 Michaelis-Menten paper. , 2011, Biochemistry.

[21]  M. Elowitz,et al.  Regulatory activity revealed by dynamic correlations in gene expression noise , 2008, Nature Genetics.

[22]  D. Fingar,et al.  Regulation and function of ribosomal protein S6 kinase (S6K) within mTOR signalling networks. , 2012, The Biochemical journal.

[23]  Sandro Santagata,et al.  Highly multiplexed immunofluorescence imaging of human tissues and tumors using t-CyCIF and conventional optical microscopes , 2018 .

[24]  Sandro Santagata,et al.  Highly multiplexed immunofluorescence imaging of human tissues and tumors using t-CyCIF and conventional optical microscopes , 2018, eLife.

[25]  Tal Nawy,et al.  Single-cell sequencing , 2013, Nature Methods.

[26]  Dariusz Plewczynski,et al.  A combined systems and structural modeling approach repositions antibiotics for Mycoplasma genitalium , 2015, Comput. Biol. Chem..

[27]  Rong Fan,et al.  Protein signaling networks from single cell fluctuations and information theory profiling. , 2011, Biophysical journal.

[28]  Martin Helmer,et al.  Complexity of model testing for dynamical systems with toric steady states , 2019, Adv. Appl. Math..

[29]  William W. Chen,et al.  Classic and contemporary approaches to modeling biochemical reactions. , 2010, Genes & development.

[30]  Long Cai,et al.  Single cell systems biology by super-resolution imaging and combinatorial labeling , 2012, Nature Methods.

[31]  M. Feinberg Chemical reaction network structure and the stability of complex isothermal reactors—II. Multiple steady states for networks of deficiency one , 1988 .

[32]  Ferdinand Verhulst,et al.  Singular perturbation methods for slow–fast dynamics , 2007 .

[33]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[34]  Judit Zsuga,et al.  The Hill equation and the origin of quantitative pharmacology , 2012 .

[35]  A. Goriely,et al.  Component retention in principal component analysis with application to cDNA microarray data , 2007, Biology Direct.

[36]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[37]  Jeremy Gunawardena,et al.  The rational parameterization theorem for multisite post-translational modification systems. , 2009, Journal of theoretical biology.

[38]  J. Buhmann,et al.  Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry , 2014, Nature Methods.

[39]  Jonathan R. Karr,et al.  A Whole-Cell Computational Model Predicts Phenotype from Genotype , 2012, Cell.

[40]  Garry P. Nolan,et al.  Simultaneous measurement of multiple active kinase states using polychromatic flow cytometry , 2002, Nature Biotechnology.

[41]  Martin Feinberg,et al.  Stability and instability in isothermal CFSTRs with complex chemistry: Some recent results , 2013 .

[42]  Shi-Yong Sun,et al.  mTOR kinase inhibitors as potential cancer therapeutic drugs. , 2013, Cancer letters.

[43]  A. Regev,et al.  Spatial reconstruction of single-cell gene expression data , 2015 .

[44]  Min Hao,et al.  Mammalian Target of Rapamycin (mTOR) Regulates Transforming Growth Factor-β1 (TGF-β1)-Induced Epithelial-Mesenchymal Transition via Decreased Pyruvate Kinase M2 (PKM2) Expression in Cervical Cancer Cells , 2017, Medical science monitor : international medical journal of experimental and clinical research.

[45]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[46]  Marc Massot,et al.  Entropic structure of multicomponent reactive flows with partial equilibrium reduced chemistry , 2004 .

[47]  Mercedes Pérez Millán,et al.  MAPK's networks and their capacity for multistationarity due to toric steady states. , 2014, Mathematical biosciences.

[48]  Li Qiu,et al.  Unitarily Invariant Metrics on the Grassmann Space , 2005, SIAM J. Matrix Anal. Appl..

[49]  L. Ferré Selection of components in principal component analysis: a comparison of methods , 1995 .

[50]  Alicia Dickenstein,et al.  Toric dynamical systems , 2007, J. Symb. Comput..

[51]  Bernd Sturmfels,et al.  Algebraic Systems Biology: A Case Study for the Wnt Pathway , 2015, Bulletin of Mathematical Biology.

[52]  P. Waage,et al.  Studies concerning affinity , 1986 .

[53]  Peter K. Sorger,et al.  Highly multiplexed imaging of single cells using a high-throughput cyclic immunofluorescence method , 2015, Nature Communications.

[54]  Gene H. Golub,et al.  Numerical methods for computing angles between linear subspaces , 1971, Milestones in Matrix Computation.

[55]  Alicia Dickenstein,et al.  The Structure of MESSI Biological Systems , 2016, SIAM J. Appl. Dyn. Syst..