This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Moment-based analysis of biochemical networks in a heterogeneous population of communicating cells

David T. Gonzales Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany Center for Systems Biology Dresden, 01307 Dresden, Germany T-Y Dora Tang Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany Christoph Zechner Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany Center for Systems Biology Dresden, 01307 Dresden, Germany Correspondence to: zechner@mpi-cbg.de

Cells can utilize chemical communication to exchange information and coordinate their behavior in the presence of noise. Communication can reduce noise to shape a collective response, or amplify noise to generate distinct phenotypic subpopulations. Here we discuss a moment-based approach to study how cell-cell communication affects noise in biochemical networks that arises from both intrinsic and extrinsic sources. We derive a system of approximate differential equations that captures lower-order moments of a population of cells, which communicate by secreting and sensing a diffusing molecule. Since the number of obtained equations grows combinatorially with number of considered cells, we employ a previously proposed model reduction technique, which exploits symmetries in the underlying moment dynamics. Importantly, the number of equations obtained in this way is independent of the number of considered cells such that the method scales to arbitrary population sizes. Based on this approach, we study how cell-cell communication affects population variability in several biochemical networks. Moreover, we analyze the accuracy and computational efficiency of the moment-based approximation by comparing it with moments obtained from stochastic simulations.

1  Introduction

In recent years, significant progress has been made in developing computational models and algorithms to study stochastic biochemical networks inside living cells. Most commonly, these models are based on the chemical master equation (CME), whose solution provides a time-dependent probability distribution over molecular concentrations [1]. CME-based models can account for the discrete and random nature of biochemical reactions (intrinsic noise) as well as population heterogeneity stemming from differences in each cell’s microenvironment (extrinsic variability) [2]. The computational analysis of the CME is challenging but several efficient numerical techniques have been proposed in the past, including stochastic simulation algorithms [3], moment-based methods [4, 5] and combinations thereof [6, 7, 8].

However, the majority of existing approaches to study noise in cell populations rely on the assumption that individual cells act independently of each other. More concretely, each cell’s dynamics is considered to be an independent and identically distributed realization of the same stochastic process. Evidently, this assumption is violated in systems where cells communicate with one another to coordinate their behavior [9, 10]. Typical examples include quorum-sensing systems in bacterial colonies [11], or paracrine communication in higher organisms [12]. Understanding the interplay between cell-cell communication and the stochastic behavior of individual cells is an important challenge and demands for suitable mathematical approaches. However, extending standard techniques to account for cell-cell communication leads to computational difficulties, because the dimensionality of the resulting models increases with the number of cells in a population.

Recently, first attempts have been made to develop more tractable stochastic models of systems of communicating cells. In [9], for instance, the authors use a moment-based method to study how neighbour-neighbour-coupling affects concentration fluctuations in a tissue. A related approach has been proposed in [13] to study community effects in cells that interact with each other by secreting and sensing certain signalling molecules. In particular, the authors show how the dimensionality of the considered model can be dramatically reduced by exploiting certain symmetries in the governing equations. However, both approaches account exclusively for intrinsic noise, whereas extrinsic sources have not been considered. In [14], the authors study how chemical communication via a quorum sensing molecule affects intrinsic and extrinsic noise in cell communities. To obtain tractable simulations, they used a tailored approach that combines stochastic simulations with a quasi-steady state approximation to eliminate fast variables from the model.

In this article, we discuss a moment-based approach to study noise in communicating cells that arises from both intrinsic and extrinsic sources. In particular, we focus on a secrete-and-sense model [10, 13], where individual cells can secrete a signalling molecule to the extracellular environment, which in turn can be sensed by other cells in the population. We first derive a system of differential equations that captures lower order moments of the population. Since the number of obtained equations grows combinatorially with number of considered cells, we perform a symmetry-based model reduction based on the work of Batmanov et al. (2012) [13]. The dimensionality of the reduced model is independent of the population size, which makes it computationally very efficient. We employ this approach to study how noise is affected by cell-to-cell communication in a few biochemical systems.

The rest of the paper is structured as follows. A general model of biochemical networks in chemically interacting cell populations is presented in Section 2. In Section 3, we provide a stochastic description of this model based on the CME and derive a general equation for its moment dynamics. In Section LABEL:reduction, we show how the number of moment equations can be reduced by exploiting symmetries of the model. Finally, we apply our approach to analyze several biochemical networks in Section 5.

2  Reaction networks in a population of chemically interacting cells

We consider a population of NN genetically identical cells that communicate with each other through a diffusing signalling molecule (Fig. 1). Each individual cell ii is associated with an identical set of SS chemical species Xi,1,,Xi,S\mathrm{X}_{i,1},\ldots,\mathrm{X}_{i,S} that interact with one another via MM biochemical reaction channels. Without loss of generality, we consider the first S1S-1 species to be confined to the intracellular environment of cell ii. The SSth species corresponds to the signalling molecule, which can shuttle between the intra- and extracellular environment through transport reactions. For simplicity, we consider the external environment to be well-mixed, such that the import of signalling molecules into any cell ii does not depend on the spatial configuration of the system. In total, the system can be described by a reaction network

k=1Sαj,kXi,kk=1Sβj,kXi,kXi,SXE,\begin{split}\sum_{k=1}^{S}\alpha_{j,k}\mathrm{X}_{i,k}&\xrightarrow{}\sum_{k=1}^{S}\beta_{j,k}\mathrm{X}_{i,k}\\ \mathrm{X}_{i,S}&\xleftrightharpoons{}\mathrm{X}_{E},\end{split} (1)

for i=1,,Ni=1,\ldots,N and j=1,,M2j=1,\ldots,M-2. In (1), αj,k\alpha_{j,k} and βj,k\beta_{j,k} correspond to the reactant and product coefficients of the respective reaction. The species XE\mathrm{X}_{E} denotes the signalling molecule in the external environment.

Refer to caption
Figure 1: Secrete-and-sense model of a cell population. Each cell’s dynamics is described by the same reaction network, whereas one of the reactants serves as a signalling molecule. The latter can diffuse to the homogeneous extracellular environment from where it can be sense by any of the cells.

In total, the network comprises NS+1NS+1 chemical species and NMNM chemical reactions. We define by Xi(t)=(Xi,1(t),,Xi,S(t))X_{i}(t)=(X_{i,1}(t),\ldots,X_{i,S}(t)) the state of cell ii, which collects the copy numbers of all species associated with this cell at time tt. The state of the overall system is then given by X(t)=(X1(t),,XN(t),XE(t))X(t)=(X_{1}(t),\ldots,X_{N}(t),X_{E}(t)), with XE(t)X_{E}(t) as the number of signalling molecules in the external environment. Moreover, we denote by νi,kNS+1\nu_{i,k}\in\mathbb{Z}^{NS+1} the stoichiometric change vector associated with the kkth reaction channel of cell ii consistent with reaction network (1).

3  Moment dynamics of heterogeneous cell communities

If both the intra- and extracellular environments are well-mixed, we can describe the state X(t)X(t) as a continuous-time Markov chain, whose state probability distribution P(x,t)=P(X(t)=x)P(x,t)=P(X(t)=x) admits a master equation of the form

dP(x,t)dt=i=1Nj=1M[ai,j(xνi,j,ci,j)P(xνi,j,t)ai,j(x,ci,j)P(x,t)].\begin{split}\frac{\mathrm{d}P(x,t)}{\mathrm{d}t}&=\sum_{i=1}^{N}\sum_{j=1}^{M}[a_{i,j}(x-\nu_{i,j},c_{i,j})P(x-\nu_{i,j},t)\\ &\qquad\qquad\qquad\qquad\quad-a_{i,j}(x,c_{i,j})P(x,t)].\end{split} (2)

In (2), the function ai,ja_{i,j} is the reaction propensity associated with reaction jj in cell ii. Throughout this article, we consider the propensities to obey the law of mass action such that ai,j(x,ci,j)=ci,jgj(xi,xE)a_{i,j}(x,c_{i,j})=c_{i,j}g_{j}(x_{i},x_{E}) with xix_{i} as the part of the state vector associated with cell ii, xEx_{E} as the abundance of the signalling molecule in the environment, ci,j+c_{i,j}\in\mathbb{R}^{+} as a stochastic rate constant, and gjg_{j} as a polynomial. Note that while gjg_{j} is identical for all cells, we allow the individual rate constants ci,jc_{i,j} to vary across the population. This provides a means to account for extrinsic sources of cell-to-cell variability [5]. We consider the reaction rate constants for each cell ii to be independent random vectors Ci=(Ci,1,,Ci,M)C_{i}=(C_{i,1},\ldots,C_{i,M}) drawn from a common probability distribution Cipc()C_{i}\sim p_{c}(\cdot) for i=1,,Ni=1,\ldots,N. Note that deterministic (non-varying) reaction rates can be accounted for by letting pcp_{c} be a Dirac measure with respect to this parameter. With C=(C1,,CN)C=(C_{1},\ldots,C_{N}), we can formulate a master equation for the conditional distribution P(x,tc)=P(X(t)=xC=c)P(x,t\mid c)=P(X(t)=x\mid C=c), i.e.,

dP(x,tc)dt=i=1Nj=1M[ai,j(xνi,j,ci,j)P(xνi,j,tc)ai,j(x,ci,j)P(x,tc)].\begin{split}\frac{\mathrm{d}P(x,t\mid c)}{\mathrm{d}t}&=\sum_{i=1}^{N}\sum_{j=1}^{M}[a_{i,j}(x-\nu_{i,j},c_{i,j})P(x-\nu_{i,j},t\mid c)\\ &\qquad\qquad\qquad-a_{i,j}(x,c_{i,j})P(x,t\mid c)].\end{split} (3)

3.1 Moment dynamics

To analyze the cell community model, we resort to a moment-based approach, which provides a lower-dimensional description of the population and its heterogeneity. More precisely, we seek for the population moments

ϕ(X,C)=ϕ(X,C)C=x𝒳ϕ(x,C)P(x,tC),\begin{split}\langle\phi(X,C)\rangle&=\big{\langle}\langle\phi(X,C)\mid C\rangle\big{\rangle}\\ &=\Bigg{\langle}\sum_{x\in\mathcal{X}}\phi(x,C)P(x,t\mid C)\Bigg{\rangle},\end{split} (4)

with ϕ:(x,c)\phi:(x,c)\rightarrow\mathbb{R} as a monomial in (x,c)(x,c) and 𝒳\mathcal{X} as the domain of X(t)X(t). Note that we omit the dependency of the moments on time for the sake of a compact notation. In order to derive a differential equation for the time evolution of ϕ(X,C)\langle\phi(X,C)\rangle, we calculate the derivative of (4) and insert the r.h.s. of (3)

dϕ(X,C)dt=x𝒳ϕ(x,C)i=1Nj=1M[ai,j(xνi,j,Ci,j)P(xνi,j,tC)ai,j(x,Ci,j)P(x,tC)].\small\begin{split}&\frac{\mathrm{d}\langle\phi(X,C)\rangle}{\mathrm{d}t}=\Bigg{\langle}\sum_{x\in\mathcal{X}}\phi(x,C)\sum_{i=1}^{N}\sum_{j=1}^{M}[a_{i,j}(x-\nu_{i,j},C_{i,j})\\ &\qquad P(x-\nu_{i,j},t\mid C)-a_{i,j}(x,C_{i,j})P(x,t\mid C)]\Bigg{\rangle}.\end{split} (5)

Using a change of variable, it is straightforward to show that (5) simplifies to

dϕ(X,C)dt=i=1Nj=1Mϕ(X+νi,j,C)ai,j(X,Ci,j)Cϕ(X,C)ai,j(X,Ci,j)C,\small\begin{split}&\frac{\mathrm{d}\langle\phi(X,C)\rangle}{\mathrm{d}t}=\Bigg{\langle}\sum_{i=1}^{N}\sum_{j=1}^{M}\Big{\langle}\phi(X+\nu_{i,j},C)a_{i,j}(X,C_{i,j})\mid C\Big{\rangle}\\ &\qquad\qquad\qquad-\Big{\langle}\phi(X,C)a_{i,j}(X,C_{i,j})\mid C\Big{\rangle}\Bigg{\rangle},\end{split} (6)

where the inner brackets denote expectations conditionally on the random parameters CC. Now, using double expectations, we obtain

dϕ(X,C)dt=i=1Nj=1Mϕ(X+νi,j,C)ai,j(X,Ci,j)ϕ(X,C)ai,j(X,Ci,j).\small\begin{split}\frac{\mathrm{d}\langle\phi(X,C)\rangle}{\mathrm{d}t}&=\sum_{i=1}^{N}\sum_{j=1}^{M}\Big{\langle}\phi(X+\nu_{i,j},C)a_{i,j}(X,C_{i,j})\Big{\rangle}\\ &\qquad\qquad\quad-\Big{\langle}\phi(X,C)a_{i,j}(X,C_{i,j})\Big{\rangle}.\end{split} (7)

Eq. (7) describes the time evolution of moments and cross moments for a heterogeneous population of secrete-and-sense cells. It can thus be seen as an extension of the moment equations derived in [5] to account for cell-to-cell communication. Indeed, if we set the transport rates to zero, all cells in the population become independent of each other such that the two approaches yield equivalent solutions.

Note that depending on the details of the system, the moment equations from (7) may not be closed. This is the case in the presence of second-order reactions, or even first-order reactions if their rate constants are randomly distributed across the population. Moment-closure approximation (MA) techniques provide a popular means to address this problem by imposing certain assumptions on the underlying state probability distribution [4, 5, 15]. More precisely, this allows us to replace the higher order moments that appear on the r.h.s. of (7) by functions of the lower order moments. These functions are referred to as closure functions and their particular form depends on the distributional assumption we make. Popular choices include the normal [15] and lognormal [4] closure functions and we will adopt those in the present study. To check the accuracy of our MAs in this study, we compare them with moments calculated using the Stochastic Simulation Algorithm (SSA) [3]. Throughout this article, we consider moments of up to second order and replace all third-order moments that appear on the r.h.s. of (7) using MA functions provided in Table 1.

Table 1: Normal & lognormal closure functions of order three.
MA X1X2X3\langle X_{1}X_{2}X_{3}\rangle
Normal X1X2X3+X2X1X3\langle X_{1}\rangle\langle X_{2}X_{3}\rangle+\langle X_{2}\rangle\langle X_{1}X_{3}\rangle
+X3X1X22X1X2X3+\langle X_{3}\rangle\langle X_{1}X_{2}\rangle-2\langle X_{1}\rangle\langle X_{2}\rangle\langle X_{3}\rangle
Lognormal X1X2X2X3X1X3X1X2X3\dfrac{\langle X_{1}X_{2}\rangle\langle X_{2}X_{3}\rangle\langle X_{1}X_{3}\rangle}{\langle X_{1}\rangle\langle X_{2}\rangle\langle X_{3}\rangle}

4  Symmetry-based model reduction

While eq. (7) provides a more tractable description of the cell community than (2), its dimensionality still scales combinatorially with the number of considered cells NN. In case we consider all first and second order moments, the number of equations is given by

Keq=2(NS+1)+(NS+NM+12)(NM2),K_{eq}=2(NS+1)+\binom{NS+NM^{\prime}+1}{2}-\binom{NM^{\prime}}{2}, (8)

where MM^{\prime} is the number of rate constants among the MM reactions that vary from cell to cell. The first term in (8) accounts for 1st and 2nd order moments of the chemical species, and the remaining terms account for all cross-moments of species and rate constants that change over time.

The number of first and second order moment equations scales quadratically with NN, which limits the above approach to relatively small population sizes. However, the moment system can be reduced substantially by taking into account the symmetries of the considered model as has been proposed in [13]. More precisely, if we consider all initial cell states Xi(0)X_{i}(0) to be identically distributed, the moment dynamics of each cell will be equivalent and indistinguishable for all times t>0t>0 such that Xi,k=Xj,k\langle X_{i,k}\rangle=\langle X_{j,k}\rangle, Xi,k2=Xj,k2\langle X_{i,k}^{2}\rangle=\langle X_{j,k}^{2}\rangle, Xi,kXj,l=Xm,kXn,l\langle X_{i,k}X_{j,l}\rangle=\langle X_{m,k}X_{n,l}\rangle, and Xi,kXE=Xj,kXE\langle X_{i,k}X_{E}\rangle=\langle X_{j,k}X_{E}\rangle for any iji\neq j, mnm\neq n, kk and ll. Consequently, we can obtain a reduced model by considering only the moments up to the second order and cross-moments of the states of any two reference cells Xi(t)X_{i}(t) and Xj(t)X_{j}(t) as well as the amount of signalling molecules in the external environment XE(t)X_{E}(t). The resulting set of equations can be reduced further by eliminating all moments associated with one of the two cells jj (e.g., Xj,k\langle X_{j,k}\rangle or Xj,kXE\langle X_{j,k}X_{E}\rangle) since those are identical to the corresponding moments of cell ii. In total, the required number of equations is then given by

K^eq=2(S+1)+(S+1+M2)+(S+M2)2(M2)+S,\begin{split}\hat{K}_{eq}=2(S+1)+&\binom{S+1+M^{\prime}}{2}\\ \qquad\qquad\qquad\qquad&+\binom{S+M^{\prime}}{2}-2\binom{M^{\prime}}{2}+S,\end{split} (9)

which is thus independent of the population size.

4.1 Illustrative example

Consider the toy model

Xi,1Ci,1Xi,2Xi,2c3c2XE,\begin{gathered}\begin{aligned} \mathrm{X}_{i,1}\xrightarrow{C_{i,1}}\mathrm{X}_{i,2}\\ \end{aligned}\qquad\quad\begin{aligned} \mathrm{X}_{i,2}\xleftrightharpoons[c_{3}]{c_{2}}\mathrm{X}_{E},\end{aligned}\end{gathered} (10)

with i=1,,Ni=1,\ldots,N. We consider the rate Ci,1C_{i,1} to be randomly distributed across the population, while c2c_{2} and c3c_{3} are identical in all cells.

To demonstrate how the original system can be reduced based on symmetries, we distinguish between two different cases. The first case concerns equations that are the same in the original and reduced model. These are equations of the moments that do not involve the signalling molecule in the external environment such as Xi,1\langle X_{i,1}\rangle, Xi,2\langle X_{i,2}\rangle, or Xi,1Xi,2\langle X_{i,1}X_{i,2}\rangle. For example, in both the original and reduced models, the expectation of species Xi,2X_{i,2} satisfies

dXi,2dt=Ci,1Xi,1+c2XEc3Xi,2.\frac{\mathrm{d}\langle X_{i,2}\rangle}{\mathrm{d}t}=\langle C_{i,1}X_{i,1}\rangle+c_{2}\langle X_{E}\rangle-c_{3}\langle X_{i,2}\rangle. (11)

The dynamics of moments and cross-moments involving the external signalling molecule depend on all cells in the population due to the transport reactions. In the case of Xi,2XE\langle X_{i,2}X_{E}\rangle, the original equation is:

dXi,2XEdt=Ci,1Xi,1XE+c2XE2c2XE+c3Xi,22c3Xi,2XEc3Xi,2+kic3Xk,2Xi,2c2NXi,2XE.\begin{split}\frac{\mathrm{d}\langle X_{i,2}X_{E}\rangle}{\mathrm{d}t}&=\langle C_{i,1}X_{i,1}X_{E}\rangle+c_{2}\langle X_{E}^{2}\rangle-c_{2}\langle X_{E}\rangle\\ &\hskip-10.00002pt+c_{3}\langle X_{i,2}^{2}\rangle-c_{3}\langle X_{i,2}X_{E}\rangle-c_{3}\langle X_{i,2}\rangle\\ &\hskip-10.00002pt+\sum_{k\neq i}c_{3}\langle X_{k,2}X_{i,2}\rangle-c_{2}N\langle X_{i,2}X_{E}\rangle.\end{split} (12)

The terms in the sum capture the dependencies between two cells kk and ii. Since all terms in the sum are identical due to the symmetry of the population, they can be replaced by the contribution of any cell jij\neq i multiplied by (N1)(N-1),

dXi,2XEdt=Ci,1Xi,1XE+c2XE2c2XE+c3Xi,22c3Xi,2XEc3Xi,2+c3(N1)Xj,2Xi,2c2NXi,2XE.\begin{split}\frac{\mathrm{d}\langle X_{i,2}X_{E}\rangle}{\mathrm{d}t}&=\langle C_{i,1}X_{i,1}X_{E}\rangle+c_{2}\langle X_{E}^{2}\rangle-c_{2}\langle X_{E}\rangle\\ &\hskip-10.00002pt+c_{3}\langle X_{i,2}^{2}\rangle-c_{3}\langle X_{i,2}X_{E}\rangle-c_{3}\langle X_{i,2}\rangle\\ &\hskip-10.00002pt+c_{3}(N-1)\langle X_{j,2}X_{i,2}\rangle-c_{2}N\langle X_{i,2}X_{E}\rangle.\end{split} (13)

Therefore, if we perform analogous manipulations for all other moments and cross-moments involving the signalling molecule (e.g., XE\langle X_{E}\rangle, Xi,1XE\langle X_{i,1}X_{E}\rangle) and eliminate all remaining redundancies between cell ii and jj, we arrive at a system of 1717 coupled differential equations, independent of the population size NN. For further information on the symmetry-based model reduction approach, the reader may refer to [13].

5  Case studies

In this section, we use the described moment-based approach to study how cell-cell communication affects noise in different biochemical networks. Python scripts used for this study are available in github.com/zechnerlab/CommunityMoments. Moment equations were solved using a SciPy numerical solver (solve_ivp). SSA sample paths from Gillespie’s Direct Method were simulated using Tellurium [16] and used to test the accuracy of the moment-based approach. For the sake of a compact notation, molecular species are assigned different letters and reaction rate parameters are assigned letters with superscripts.

5.1 Birth-death process

As a first example, we study a birth-death process in a heterogeneous population of interacting cells, i.e.,

CibPiCidPictctQ,\begin{gathered}\begin{aligned} \emptyset\xrightarrow{C^{b}_{i}}\mathrm{P}_{i}\xrightarrow{C^{d}_{i}}\emptyset\\ \end{aligned}\qquad\qquad\begin{aligned} \mathrm{P}_{i}\xleftrightharpoons[c^{t}]{c^{t}}\mathrm{Q},\end{aligned}\end{gathered} (14)

for i=1,,Ni=1,\ldots,N. Here, the birth and death reaction rate constants are independent random variables as indicated by capital letters CibC^{b}_{i} and CidC^{d}_{i}, while the transport rate ctc^{t} is fixed. For this reaction network, a symmetry-reduced model of up to the second order can be described using K^eq=12\hat{K}_{eq}=12 differential equations regardless of the population size.

The goal of this case study is to study how cell-to-cell communication affects the variability in the abundance of Pi\mathrm{P}_{i}. To quantify variability, we use two metrics: the first one is the coefficient of variation (CV) defined as

CV[Pi]=Pi2Pi2Pi2,\mathrm{CV}[P_{i}]=\sqrt{\frac{\langle P_{i}^{2}\rangle-\langle P_{i}\rangle^{2}}{\langle P_{i}\rangle^{2}}}, (15)

for any i=1,,Ni=1,\ldots,N. The CV captures the expected variation in protein abundance inside single cells across different populations. Due to the symmetry, we have CV[Pi]=CV[Pk]\mathrm{CV}[P_{i}]=\mathrm{CV}[P_{k}] for any ii and kk. Furthermore, we define the pair variation (PV)

PV[Pi,Pk]=(PiPk)2PiPk,\mathrm{PV}[P_{i},P_{k}]=\sqrt{\frac{\langle(P_{i}-P_{k})^{2}\rangle}{\langle P_{i}\rangle\langle P_{k}\rangle}}, (16)

which captures the expected variation between two different cells ii and kk within the same population. Note that in the absence of cell-cell communication, (15) and (16) are identical up to a scaling factor of 2\sqrt{2}. For a communicating population, however, this is not the case due to correlations between cells in the population.

Refer to caption
Figure 2: Moment dynamics of the birth-death system generated by the symmetry-reduced model (Normal MA). First (A) and second order (B) moment dynamics of species PiP_{i} with a population size of N=10N=10 and ct=[0,0.1,0.1]c^{t}=[0,0.1,0.1]. CV[PiP_{i}] and PV[Pi,PjP_{i},P_{j}] decrease as transport rate increases for a fixed population size N=10N=10 (C and D). Increasing the population size N=[5,10,50]N=[5,10,50] for a fixed ct=0.1c^{t}=0.1 decreases the CV[PiP_{i}], but not PV[Pi,PjP_{i},P_{j}] (E and F). Other parameters and initial conditions were set to Cib=1\langle C^{b}_{i}\rangle=1, Var[Cib]=0.01\mathrm{Var}[C^{b}_{i}]=0.01, Cid=0.01\langle C^{d}_{i}\rangle=0.01, Var[Cid]=1e6\mathrm{Var}[C^{d}_{i}]=1e-6, Pi(0)=20\langle P_{i}(0)\rangle=20, Var[Pi(0)]=25\mathrm{Var}[P_{i}(0)]=25, Q(0)=0\langle Q(0)\rangle=0, and Var[Q(0)]=0\mathrm{Var}[{Q}(0)]=0. Shaded areas are bootstrapped 95% confidence intervals (CI) from 10001000 SSA realizations.

In Figs. 2A and B, we compare the first and second order moments of species PiP_{i} for the symmetry-reduced model of size N=10N=10 with stochastic simulations and found a good agreement between the two approaches. In Figs. 2C-F, we show the dependency of CV[PiP_{i}] and PV[PiP_{i},PjP_{j}] as a function of the transport rate as well as the population size. The coefficient of variation CV[PiP_{i}] decreases with increasing transport rates and to some extent also with the population size. While the pair variation PV[PiP_{i},PjP_{j}] shows a similar decrease with increasing transport rates, it seems to be independent of the population size.

We also show how extrinsic variability changes the steady-state variability of Pi\mathrm{P}_{i} by increasing the CV of the birth rate parameter CibC^{b}_{i}, while keeping all other parameters constant. Both CV[PiP_{i}] and PV[PiP_{i},PjP_{j}] increase with increasing extrinsic variability, whereas large transport rates can attenuate this effect due to spatial averaging of PiP_{i} levels (Fig. 3).

Refer to caption
Figure 3: Steady-state CV[PiP_{i}] (A) and PV[Pi,PjP_{i},P_{j}] (B) for varying transport rates (ctc^{t}) and population heterogeneity (CV[Cib][C^{b}_{i}]). Other parameters and initial conditions were set to Cib=1\langle C^{b}_{i}\rangle=1, cd=0.01c^{d}=0.01, Pi(0)=20\langle P_{i}(0)\rangle=20, Var[Pi(0)]=25\mathrm{Var}[P_{i}(0)]=25, Q(0)=0\langle Q(0)\rangle=0, Var[Q(0)]=0\mathrm{Var}[{Q}(0)]=0, and N=10N=10. Steady-state moments were determined from the reduced model with normal closure at t=1000t=1000. Error bars are bootstrapped 95% CIs from 10001000 SSA realizations.

5.2 Autocatalytic circuit

Next, we focus on an autocatalytic system defined by

CibAiCidAicaAi+AiAictctB.\begin{gathered}\begin{aligned} \emptyset\xrightarrow{C^{b}_{i}}\mathrm{A}_{i}\xrightarrow{C^{d}_{i}}\emptyset\\ \mathrm{A}_{i}\xrightarrow{c^{a}}\mathrm{A}_{i}+\mathrm{A}_{i}\end{aligned}\qquad\qquad\begin{aligned} \mathrm{A}_{i}\xleftrightharpoons[c^{t}]{c^{t}}\mathrm{B}.\end{aligned}\end{gathered} (17)

Here we consider CibC^{b}_{i} and CidC^{d}_{i} to be randomly distributed across the population, while the autocatalytic rate cac^{a} and transport rate ctc^{t} are fixed. In this example, the total number of reduced equations is K^eq=12\hat{K}_{eq}=12. To obtain a closed set of moments, we applied the lognormal closure and checked its accuracy by comparing it to Monte Carlo estimates of the moments calculated over 10001000 SSA realizations (Fig. 4). We generally found a good agreement between the MA and the SSA. Similar to the birth-death systems, we observe that both CV and PV of species Ai\mathrm{A}_{i} decrease with increasing transport rates. Increasing the population size does not affect the PV, but decreases its CV.

Refer to caption
Figure 4: Moment dynamics of the autocatalytic system generated by the symmetry-reduced model (Lognormal MA). First (A) and second order (B) moment dynamics of species AiA_{i} for a population size of N=10N=10 and ct=[0,0.01,0.1]c^{t}=[0,0.01,0.1]. CV[AiA_{i}] and PV[Ai,AjA_{i},A_{j}] decrease as transport rate increases for a fixed population size N=10N=10 (C and D). Increasing the population size from N=[5,10,50]N=[5,10,50] for a fixed ct=0.1c^{t}=0.1 decreases CV[AiA_{i}] (E), while PV[Ai,AjA_{i},A_{j}] remains the same (F). Other parameters and initial conditions were set to Cib=1\langle C^{b}_{i}\rangle=1, Var[Cib]=0.01\mathrm{Var}[C^{b}_{i}]=0.01, ca=0.08c^{a}=0.08, Cid=0.1\langle C^{d}_{i}\rangle=0.1, Var[Cid]=1e4\mathrm{Var}[C^{d}_{i}]=1e-4, Ai(0)=20\langle A_{i}(0)\rangle=20, Var[Ai(0)]=25\mathrm{Var}[{A_{i}}(0)]=25, B(0)=0\langle B(0)\rangle=0, and Var[B(0)]=0\mathrm{Var}[{B}(0)]=0. Shaded areas are bootstrapped 95% CIs from 10001000 SSA realizations.

To study the relationship between the CV and PV, we plotted CV[AiA_{i}] against PV[AiA_{i},AjA_{j}] in Figure 5. As expected, CV and PV are related by a factor of 2\sqrt{2} when ct=0c^{t}=0. Slight deviations from this scaling are possible due to the approximations involved in the derivation of the moment equations. In the presence of communication, PV[AiA_{i},AjA_{j}] drops below the 2\sqrt{2} scaling law, indicating that the variability between cells in the same population is smaller than the variability of cells across different populations.

5.3 Genetic feedback circuit

Lastly, we tested the moment-based method using a larger system. In particular, we focus on a genetic feedback circuit given by

DiCimDi+MiMiCipMi+PiDi+PicdcaDPiDPicfDPi+MiMidmPidpPictctQ,\begin{gathered}\begin{aligned} \mathrm{D}_{i}\xrightarrow{C^{m}_{i}}\mathrm{D}_{i}+\mathrm{M}_{i}\\ \mathrm{M}_{i}\xrightarrow{C^{p}_{i}}\mathrm{M}_{i}+\mathrm{P}_{i}\\ \mathrm{D}_{i}+\mathrm{P}_{i}\xleftrightharpoons[c^{d}]{c^{a}}\mathrm{DP}_{i}\\ \mathrm{DP}_{i}\xrightarrow{c^{f}}\mathrm{DP}_{i}+\mathrm{M}_{i}\end{aligned}\qquad\quad\begin{aligned} \mathrm{M}_{i}\xrightarrow{d^{m}}\emptyset\\ \mathrm{P}_{i}\xrightarrow{d^{p}}\emptyset\\ \mathrm{P}_{i}\xleftrightharpoons[c^{t}]{c^{t}}\mathrm{Q},\end{aligned}\end{gathered} (18)

for i=1,,Ni=1,\ldots,N. Here, we consider the reaction rate constants associated with transcription CimC^{m}_{i} and translation CipC^{p}_{i} to be randomly distributed across the population, while all other rate parameters are fixed. Transcriptional feedback of the gene circuit is mediated by the protein product (P\mathrm{P}), which binds DNA (D\mathrm{D}) to form a complex DP\mathrm{DP}. In the bound state, the gene can be transcribed with rate constant cfc^{f}. Depending on the ratio of the bound and unbound transcription rate, the feedback mechanism can either enhance (cf>Cimc^{f}>\langle C^{m}_{i}\rangle) or inhibit (cf<Cimc^{f}<\langle C^{m}_{i}\rangle) gene expression. We consider a population of N=10N=10 cells and applied a lognormal closure to solve for the reduced moment dynamics. The total number of reduced equations for this system is K^eq=48\hat{K}_{eq}=48. Fig. 6 shows that the time evolution of the mRNA moments obtained from the symmetry-reduced model and SSA are very similar to each other.

Refer to caption
Figure 5: CV[AiA_{i}] vs. PV[AiA_{i},AjA_{j}] of the autocatalytic network at different transport rates ct=[0,0.01,0.1]c^{t}=[0,0.01,0.1]. CV[AiA_{i}] and PV[AiA_{i},AjA_{j}] are related by a factor of 2\sqrt{2} (diagonal) at ct=0c^{t}=0, while deviations from this scaling occur when ct>0c^{t}>0. Values of CV[AiA_{i}] and PV[AiA_{i},AjA_{j}] were taken from the time-courses corresponding to the case N=10N=10 in Fig. 4.

To analyze the computational efficiency of the moment-based approach, we recorded run times for the positive feedback gene circuit for different population sizes NN and compared it against the SSA (Table 2). As expected, the run time for the moment-based model does not increase with NN while the run time associated with the SSA increases by orders of magnitude.

Table 2: Run times (s) of the reduced moment-based approach and SSA for the positive feedback gene circuit*.
NN Reduced moments SSA (1 sample path)
10 0.068±0.0050.068\pm 0.005 0.142±0.0040.142\pm 0.004
100 0.065±0.0040.065\pm 0.004 12.300±0.12312.300\pm 0.123
1000 0.064±0.0050.064\pm 0.005 2569.20±52.932569.20\pm 52.93

*Mean and standard deviation from 10 replicates.

6  Conclusions

We presented and analyzed an efficient moment-based approach to study noise in heterogeneous populations of communicating cells. By exploiting certain symmetries of the underlying moment dynamics [13], the number of differential equations needed to describe the population could be strongly reduced. Indeed, the dimensionality of the reduced model is independent of the number of considered cells, which enables the analysis of regulatory processes in large populations. In principle, the method applies to arbitrary intracellular reaction networks, although its accuracy relies on the availability of suitable moment closure functions. In the present work, we tested the approach using three different case studies, where it accurately captured first and second order moments using normal and lognormal closure functions. Our analyses show that moment-based approximations provide an effective way to study how biochemical noise and population heterogeneity affect the collective dynamics of communicating cells.

Refer to caption
Figure 6: Moment dynamics of the genetic feedback circuit generated by the symmetry-reduced model (Lognormal MA). First (A) and second order (B) moment dynamics of species MiM_{i} for populations of N=10N=10 cells with negative feedback (cif=0.1c^{f}_{i}=0.1) and positive feedback (cif=1c^{f}_{i}=1) and their corresponding time evolution of CV[MiM_{i}] (C) and PV[Mi,MjM_{i},M_{j}] (D). Reaction rate parameters and initial conditions were set to Di(0)=30\langle D_{i}(0)\rangle=30, Var[Di(0)]=25\mathrm{Var}[{D_{i}}(0)]=25, Mi(0)=Pi(0)=DPi(0)=5\langle M_{i}(0)\rangle=\langle P_{i}(0)\rangle=\langle DP_{i}(0)\rangle=5, Var[Mi(0)]=Var[Pi(0)]=Var[DPi(0)]=1\mathrm{Var}[{M_{i}}(0)]=\mathrm{Var}[{P_{i}}(0)]=\mathrm{Var}[{DP_{i}}(0)]=1, Q(0)=5\langle Q(0)\rangle=5, Var[Q(0)]=0\mathrm{Var}[{Q}(0)]=0, Cim=0.5\langle C^{m}_{i}\rangle=0.5, Var[Cim]=0.01\mathrm{Var}[{C^{m}_{i}}]=0.01, Cip=0.05\langle C^{p}_{i}\rangle=0.05, Var[Cip]=2.5e5\mathrm{Var}[C^{p}_{i}]=2.5e-5, ca=0.01c^{a}=0.01, cd=0.01c^{d}=0.01, dm=dp=0.2d^{m}=d_{p}=0.2, and ct=0.8c^{t}=0.8. Shaded areas are bootstrapped 95% CIs from 10001000 SSA realizations.

References

  • [1] DT Gillespie. A rigorous derivation of the chemical master equation. Physica A: Statistical Mechanics and its Applications, 188(1-3):404–425, 1992.
  • [2] MB Elowitz, AJ Levine, ED Sigga, and PS Swain. Stochastic gene expression in a single cell. Science, 297(5584):1183–1186, 2002.
  • [3] DT Gillespie. Exact stochastic simulation of coupled chemical reactions. Journal of Physical Chemistry, 81(25):2340–2361, 1977.
  • [4] A Singh and JP Hespanha. Lognormal moment closures for biochemical reactions. Proceedings of the 45th IEEE Conference on Decision and Control, pages 2063–2068, 2006.
  • [5] C Zechner, J Ruess, P Krenn, S Pelet, M Peter, J Lygeros, and H Koeppl. Moment-based inference predicts bimodality in transient gene expression. PNAS, 109(21):8340–8345, 2012.
  • [6] H Salis and Y Kaznessis. Accurate hybrid stochastic simulation of a system of coupled chemical or biochemical reactions. The Journal of Chemical Physics, 122(5):054103, 2005.
  • [7] L Duso and C Zechner. Selected-node stochastic simulation algorithm. The Journal of Chemical Physics, 148(16):164108, 2005.
  • [8] A Ganguly, D Altintan, and H Koeppl. Jump-diffusion approximation of stochastic reaction dynamics: error bounds and algorithms. Multiscale Modeling & Simulation, 13(4):1390–1419, 2015.
  • [9] S Smith and R Grima. Single-cell variability in multicellular organisms. Nature Communications, 9(1):345, 2018.
  • [10] H Youk and WA Lim. Secreting and sensing the same molecule allows cells to achieve versatile social behaviors. Science, 343(1242782), 2014.
  • [11] AM Stevens, KM Dolan, and EP Greenberg. Synergistic binding of the Vibrio fischeri LuxR transcriptional activator domain and rna polymerase to the lux promoter region. PNAS, 91(26):12619–12623, 1994.
  • [12] LN Handly, A Pilko, and R Wollman. Paracrine communication maximizes cellular response fidelity in wound signaling. eLIFE, 4(e09652), 2015.
  • [13] K Batmanov, C Kuttler, F Lemaire, C Lhoussaine, and C Versari. Symmetry-based model reduction for approximate stochastic analysis. in Computational Methods in Systems Biology, London: Springer, pages 49–68, 2012.
  • [14] Y Boada, A Vignoni, and J Picó. Engineered control of genetic variability reveals interplay among quorum sensing, feedback regulation, and biochemical noise. ACS Synthetic Biology, 6(10):1903–1912, 2017.
  • [15] D Schnoerr, G Sanguinetti, and R Grima. Comparison of different moment-closure approximations for stochastic chemical kinetics. The Journal of Chemical Physics, 143(185101), 2015.
  • [16] K Choi, JK Medley, M Konig, K Stocking, L Smith, S Gu, and HM Sauro. Tellurium: An extensible Python-based modeling environment for systems and synthetic biology. Biosystems, 171:74–79, 2018.