This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Model of Sequential Branching in Hierarchical Cell Fate Determination

David V. Foster111Institute for Biocomplexity and Informatics, University of Calgary, Calgary, Alberta T2N 1N4 dfoster@ucalgary.ca Jacob G. Foster222Complexity Science Group, Department of Physics and Astronomy, University of Calgary, Calgary, Alberta T2N 1N4 Sui Huang333Institute for Biocomplexity and Informatics, University of Calgary, Calgary, Alberta T2N 1N4 Stuart A. Kauffman444Institute for Biocomplexity and Informatics, University of Calgary, Calgary, Alberta T2N 1N4
Abstract

Multi-potent stem or progenitor cells undergo a sequential series of binary fate decisions, which ultimately generate the diversity of differentiated cells. Efforts to understand cell fate control have focused on simple gene regulatory circuits that predict the presence of multiple stable states, bifurcations and switch-like transitions. However, existing gene network models do not explain more complex properties of cell fate dynamics such as the hierarchical branching of developmental paths. Here, we construct a generic minimal model of the genetic regulatory network controlling cell fate determination, which exhibits five elementary characteristics of cell differentiation: stability, directionality, branching, exclusivity, and promiscuous expression. We argue that a modular architecture comprising repeated network elements reproduces these features of differentiation by sequentially repressing selected modules and hence restricting the dynamics to lower dimensional subspaces of the high-dimensional state space. We implement our model both with ordinary differential equations (ODEs), to explore the role of bifurcations in producing the one-way character of differentiation, and with stochastic differential equations (SDEs), to demonstrate the effect of noise on the system. We further argue that binary cell fate decisions are prevalent in cell differentiation due to general features of the underlying dynamical system. This minimal model makes testable predictions about the structural basis for directional, discrete and diversifying cell phenotype development and thus can guide the evaluation of real gene regulatory networks that govern differentiation.

keywords:
cell differentiation , genetic regulatory network , dynamics , modularity
journal: Journal of Theoretical Biology

1 Introduction

The genetic regulatory network (GRN) plays an essential role in controlling cell differentiation and maintaining the stability of cell types. Each cell type has its own stable gene expression profile (the set of genes in its genome which are actively expressed) [24]. Thus, understanding cell differentiation requires a model of the behavior of the many interacting genes that establish the expression profile associated with each cell type.

Gene regulatory interactions are often described as a network, with each node representing a gene and links between two genes indicating a regulatory interaction. The large-scale characteristics of genetic regulatory networks have been studied in some detail [2, 8], but only in terms of network structure: the degree sequence [27], the prevalence of network motifs [28, 32], and so forth. In terms of GRN dynamics, most efforts have been “bottom-up” — devoted to simulating specific regulatory interactions or analyzing models of small genetic circuits [1, 4] and therefore focused on specific subsections of the network. Our goal is to work in a “top-down” fashion to construct a minimal dynamical model which captures many important features of cell differentiation. Most significantly, we capture the sequential branching differentiation of cells from the multipotent progenitor cell to more lineage-restricted cells and finally to terminal cells.

To construct our model, we first note five elementary characteristics of cell differentiation which we wish to capture: stability, directionality, branching, exclusivity, and promiscuous expression. Of course, this is not meant to be an all-encompassing or fully detailed description of cell differentiation. However, these simplified characteristics prove rich enough to create an interesting model. We review here our definition of each characteristic.

Stability : Cell types should be stable states of the dynamics. A cell which is in specific cell type should remain in that cell type for a “long time” (relative to the time scale of gene activation, transcription and translation), unless perturbed by an external signal.

Directionality: Progenitor cells should reliably differentiate into a variety of more specialized cell types [18] and should only very rarely regain pluripotency by de-differentiating [22].

Branching: Differentiation should occur through discrete lineage specifications, in which a multipotent cell makes a binary fate decision and commits to one of two different lineages.

Exclusivity: Once cells in our model have committed to a specific lineage, they should lose the ability to produce cell types from the other lineage. Each differentiation event both selects a specific lineage and forecloses the possibility of the other.

Promiscuous expression: Multipotent progenitor cells should simultaneously express genes specific to all of their prospective lineages. For example, a hematopoietic stem cell expresses genes found in all of the lineages into which it can ultimately differentiate [6].

2 Model specification

Refer to caption
Figure 1: Transition of a module in our network from the multipotent cell type to one of the progenitor cell types. Modules consist of two genes, each of which activates its own transcription and inhibits the transcription of the other gene. In the top panels, both genes in module A are active at a moderate level, represented by the gray coloration. This intermediate steady state is poised between the two subordinate lineages. Modules B and C are also both expressing their genes at a moderate level, as a result of the influence of module A. In the lower panels, the cell has differentiated from the multipotent cell type to progenitor type 1. Module A has switched from the intermediate stable state to the stable state in which gene x1x_{1} is highly active (as represented by the black coloration), whereas gene x2x_{2} has become inactive (as represented by the white coloration). As a result, module C has become completely inactive due to the signals from module A, whereas module B remains in the intermediate steady state, and is thus prepared to make further cell fate decisions.

To create a model which exhibits these characteristics, we take our inspiration from the reported network properties of real GRNs. Biological networks exhibit modularity, i.e. they can be divided into largely independent subnetworks, called modules [16, 21, 29]. The nodes in a given module interact strongly with other nodes in the module, but only weakly with nodes in other modules. In our model, the network is composed of many modules, each responsible for a specific cell fate decision. These modules influence each other in a hierarchically organized fashion, as shown in Fig. 1, which ensures that differentiation events occur in a branching, sequential fashion. The dynamical state associated with each cell type is a result of the dynamics of the associated module. Arrows pointing from one module to another indicate influence of the former on the latter. Influence between modules acts as follows: a parameter in the regulated module depends on the expression level of a gene in the regulating module. In the top pairs of panels the system is in the “Multi” (multipotent) state, and in the bottom pair, the system is in one of the two available progenitor cell types. On the dynamical side, the module A undergoes a change in its state (the specifics of which we will discuss shortly). In making the change, it exerts an influence on the two modules downstream, in this case turning off module C and allowing module B to enter a state which corresponds to “Pro 1” (Progenitor 1). Notice that like module A, modules B and C are also linked to down-stream modules on which they exert influence, and that module A is itself influenced by a module “higher up” on the differentiation cascade. This allows the process to proceed in series through as many levels as are required. Our model suggests an important role for modularity: the topology of the regulatory network sequentially restricts the trajectory of the dynamical system to ever smaller subspaces of the state space. These restrictions ensure that a given cell type can only take the appropriate pathways of subsequent differentiation. The dynamics of each module of the network imposes the branching structure on cell differentiations.

3 Dynamics of a single module

Previous work has studied the behavior of GRNs using a variety of different dynamical system models: Boolean networks [13, 23, 25], differential equations [4, 14], and chemical master equations [7, 30]. Although each model has its merits, for our current purposes differential equations provide the best combination of rich behavior and analytical tractability.

3.1 Deterministic model

To model a set of NN interacting genes with differential equations, we associate the expression level (number of proteins produced by the gene which are present in the cell) of each gene with a variable xix_{i}. The state of the system is given by the vector 𝐱=[x1xN]{\bf x}=[x_{1}\ldots x_{N}]. If we plot the expression level of each gene as one axis in a Cartesian coordinate system, the state space of the dynamics—the space of all possible states 𝐱{\bf x}—is {+}N\{\mathbb{R}^{+}\}^{N}, where the restriction to the non-negative reals +\mathbb{R}^{+} follows from the fact that negative gene expression levels are unphysical. Since genes influence the transcription rates of other genes, the rate of transcription (or equally the instantaneous rate of change of the expression level) of each gene, xi˙\dot{x_{i}}, is given by:

x˙i=Fi(𝐱)\dot{x}_{i}=F_{i}({\bf x}) (1)

For a given state, the instantaneous rate of change of the expression level of each gene defines a vector 𝐱˙=[x˙1x˙N]{\bf\dot{x}}=[\dot{x}_{1}\ldots\dot{x}_{N}], which points in the direction of the future time evolution of the state under the dynamics. These vectors trace out the trajectory of the system, i.e. the time series of states the system will visit.

A fixed point is any point in state space in which the state of the system is “trapped” under the deterministic dynamics (ignoring noise). Formally, this occurs for any point 𝐱{\bf x}^{\star} such that x˙i=Fi(𝐱)=0\dot{x}_{i}^{\star}=F_{i}({\bf x}^{\star})=0 for all ii. The fixed point is stable to small perturbations if the eigenvalues of the Jacobian at 𝐱{\bf x}^{\star} are all negative. In our model, each cell type is represented by a stable fixed point of the dynamics. Our goal is to construct the simplest plausible set of genetic regulatory interactions for which the state of the system moves, either by random fluctuations or external signals, through a sequence of branching, exclusive steps from the fixed point which represents a multipotent cell type, to states which represent a series of intermediary cell types, and finally to a terminal cell type.

Huang et al. [19] provide experimental evidence that the genetic circuit controlling hematopoietic (blood cell) stem cells consists of a pair of mutually repressing genes, each of which also acts as a self-activator. Such a feedback loop is reported to be an over-represented motif in real genetic networks [27]. We have chosen the Huang switch for our cell-fate controlling module, as shown in Fig. 1. The differential equations describing the Huang switch are:

x1˙=a1x1nx1n+θ1n+b1ψ1nx2n+ψ1nk1x1x2˙=a2x2nx2n+θ2n+b2ψ2nx1n+ψ2nk2x2\begin{array}[]{l}\displaystyle\dot{x_{1}}=\frac{a_{1}x_{1}^{n}}{x_{1}^{n}+\theta_{1}^{n}}+\frac{b_{1}\psi_{1}^{n}}{x_{2}^{n}+\psi_{1}^{n}}-k_{1}x_{1}\\ \\ \displaystyle\dot{x_{2}}=\frac{a_{2}x_{2}^{n}}{x_{2}^{n}+\theta_{2}^{n}}+\frac{b_{2}\psi_{2}^{n}}{x_{1}^{n}+\psi_{2}^{n}}-k_{2}x_{2}\\ \end{array} (2)

These equations model the dynamics of self-activation and mutual inhibition with Hill functions. The parameters a1,a2a_{1},a_{2} give the maximum transcription rate due to self-activation, and θ1,θ2\theta_{1},\theta_{2} control the concentration of protein x1x_{1} or x2x_{2} at which the inflection point in the Hill function occurs. Parameters b1,b2b_{1},b_{2} give the basal rate of transcription, which is suppressed by a downward sloping Hill function with ψ1,ψ2\psi_{1},\psi_{2} controlling the concentration of protein at which its inflection point occurs. The cooperativity of the interactions is given by nn, which governs the steepness of the Hill function. The rate of protein degradation is controlled by k1,k2k_{1},k_{2}.

To simplify our analysis we impose certain symmetries on the equations, and select specific values for certain parameters to produce the desired behavior. We make the maximum contributions of the basal expression rate and the self-activation the same (ai=bia_{i}=b_{i}) and thus have a single parameter which controls the overall maximum transcription rate of the gene. For simplicity, we also set the inflection points of the two Hill functions (for the self-activation and inhibition) to the same value (θi=ψi=θ\theta_{i}=\psi_{i}=\theta). Finally, we choose to make all parameter values aa, θ\theta, and kk identical between the two equations (so k1=k2k_{1}=k_{2}, etc.), thus making the equations symmetric between x1x_{1} and x2x_{2}. We can write the new set of parameter values as:

a1,a2,b1,b2=aθ1,θ2,ψ1,ψ2=θ=1n=4k1,k2=k\begin{array}[]{l}\displaystyle a_{1},a_{2},b_{1},b_{2}=a\\ \displaystyle\theta_{1},\theta_{2},\psi_{1},\psi_{2}=\theta=1\\ \displaystyle n=4\\ \displaystyle k_{1},k_{2}=k\\ \end{array} (3)

where we have chosen to set the inflection point θ=1\theta=1 and the cooperativity n=4n=4. These parameter choices are not unique, and do not require careful tuning: as long as n4n\geq 4 a wide range of values of θ\theta, aa, and kk produce the desired tri-stable behavior. This leaves us with the new simplified equations:

x1˙=a(x14x14+1+1x24+1)kx1x2˙=a(x24x24+1+1x14+1)kx2\begin{array}[]{l}\displaystyle\dot{x_{1}}=a\left(\frac{x_{1}^{4}}{x_{1}^{4}+1}+\frac{1}{x_{2}^{4}+1}\right)-kx_{1}\\ \\ \displaystyle\dot{x_{2}}=a\left(\frac{x_{2}^{4}}{x_{2}^{4}+1}+\frac{1}{x_{1}^{4}+1}\right)-kx_{2}\\ \end{array} (4)

These equations have only two free parameters: aa and kk. These are the parameters which are influenced by other modules, following the schematic shown in Fig 1, and as detailed below. In order to better understand the effect of these parameters, we explore the bifurcation that occurs as they are changed. A bifurcation is a dynamical event in which a small change of a parameter leads to a qualitative change in the dynamics of the system. Bifurcations can destabilize fixed points, or cause points which were previously unstable to become stable. Thus bifurcations that change the stability of a fixed point are the dynamical system analogues of cell fate decisions in differentiating cells, where we represent cell types as fixed points for simplicity. For example, a parameter change may destabilize the fixed point representing a multipotent cell type, thus driving the state to a different fixed point, which represents the new progenitor cell type. Note that while a fixed point may be stable under the deterministic dynamics of the system, in the presence of noise even a stable fixed point may not trap the system indefinitely; see the next subsection.

Refer to caption
Figure 2: Each panel shows the state space of the two gene toggle switch for the parameter values chosen (a=2a=2 and k=1.0,1.166,1.333,1.5k=1.0,1.166,1.333,1.5). For k=1.0k=1.0 there are three stable fixed points in the dynamics (shown as black dots) and two unstable fixed points (shown as gray diamonds). The arrows give the trajectory of the system at each point in phase space, with longer arrows indicating faster movement and shorter arrows slower movement. As kk changes from 11 to 1.51.5 the module undergoes a subcritical pitchfork bifurcation [34] (shown in more detail in Fig. 3). The two unstable fixed points converge on the interior stable fixed point, eventually destabilizing it when k1.3k\approx 1.3 and leaving only the two exterior stable fixed points.

For a=2a=2 and k=1k=1, the two variable system is “tri-stable”, meaning that there are three stable fixed points in the dynamcs. This state space is shown in the top left panel of Fig 2. The three stable fixed points are given by black dots: there is an intermediate fixed point, characterized by moderate amounts of x1x_{1} and x2x_{2}, and two exterior fixed points, characterized by large amounts of one variable, and small amounts of the other. The gray diamonds represent the two unstable fixed points. Leaving the parameter a=2a=2 fixed and tuning kk from 11 to 1.51.5 drives a bifurcation from tri-stablity to bi-stabliltiy, as shown in the subsequent three panels of Fig. 2. The intermediate fixed point is destabilized, leaving only the two exterior stable fixed points. The bifurcation also occurs if the overall transcription rate, aa, is increased while kk is kept constant—only the ratio of these parameters has a real effect on the qualitative dynamics. Their absolute values merely set the time scale.

Refer to caption
Figure 3: The x1x_{1} position of the fixed points as the parameter kk changes from 1.01.0 to 1.51.5. The black circles represent stable fixed points while the gray diamonds represent unstable fixed points. As the module undergoes a subcritical pitchfork bifurcation, a cell at the interior stable fixed point moves along the trajectory shown by the arrow a. When k1.3k\approx 1.3 the fixed point is destabilized and the cell is pushed by any random fluctuations from the now unstable fixed point to one of the exterior fixed points following either trajectory b or the dotted trajectory. Once the cell has been drawn to one of the exterior fixed points, if the module reverses the bifurcation and once again becomes tri-stable, the cell will remain at its new stable fixed point, following trajectory c. So once the state of the system has undergone the bifurcation, it cannot easily be driven back to its starting position, thus making the differentiation event modeled by this bifurcation one-way.

In their analysis of the tri-stable switch, Huang et al. [19] argue that the “interior” stable fixed point, characterized by intermediate expression levels of both genes, represents the hematopoietic stem cell, and the two “exterior” stable fixed points represent the two committed lineages. They provide evidence that the differentiation from hematopoiesis to the erythroid or myelomonocytic lineages is caused by the destabilization of the intermediate fixed point via a subcritical pitchfork bifurcation. This in turn forces the cell to shift to one of the two other fixed points. In Fig. 3 we show the bifurcation diagram corresponding to this event. The black dots indicate the stable fixed points, and the gray, the unstable fixed points. The xx-axis shows the value of the bifurcation parameter, kk. When k=1.0k=1.0 the circuit is tri-stable: there are three stable fixed points. When k1.3k\approx 1.3, the interior stable fixed point is destabilized by the approach of the two unstable fixed points. Thus, the state of the system will eventually migrate to one of the two remaining stable fixed points under perturbation. However, once the system sits at an exterior fixed points, it will not return to the interior fixed point if we reverse the process and decrease the bifurcation parameter—the exterior fixed points remain stable to small perturbations throughout. Thus, to reverse the cell differentiation, not only would the interior fixed point need to be re-stabilized; the state of the system would also need to be driven from the exterior fixed point. This ensures that cell differentiation is directional: a simple, transient, one parameter perturbation (increase in kk) can drive the system toward specificity, but there is no similarly simple way to de-differentiate back to multipotency.

We chose the tri-stable switch proposed by Huang et al for its simplicity and empirical support. Several other models have been proposed for the genetic control circuit in hematopoiesis [3, 5, 31]. Each of these represents a reasonable model for a genetic switch, and would most likely function as desired if used as modules in the hierarchical, modular network proposed here. This underscores a key strength of the top-down approach: important properties of the dynamics are driven by the hierarchical architecture and may be insensitive to many of the specifics of each module.

3.2 Stochastic model

Noise is ubiquitous in biological systems and important for many functions [12]. The GRN as a whole, but especially the genes responsible for the process of development (and cell differentiation), must be organized for reliable performance. Thus we require that our model behaves robustly in the presence of noise. Feedback loops [10] and modularity [35] have been shown to contribute to robustness, and because our model makes use of these design characteristics, we find that robustness is easily attained. We simulate noise in the dynamics using a Langevin equation:

dxi=Fi(𝐱)dt+gdWidx_{i}=F_{i}({\bf x})\ dt+g\ dW_{i} (5)

Here, Fi(𝐱)F_{i}({\bf x}) is the same function as in the deterministic dynamics. In the second term, WiW_{i} is a Wiener process (an independent white noise process). This process introduces additive noise with a magnitude given by the parameter gg and with no dependence on the state 𝐱{\bf x}. Such noise was proposed by Kepler and Elston [26] for models of genetic regulatory interactions. To simulate the forward evolution of the system for a length of time Δt\Delta{t}, we must integrate the Wiener process over the interval which gives us Δtξi\sqrt{\Delta{t}}\ \xi_{i}, where ξi\xi_{i} is a Gaussian random variable with zero mean and unit variance. Intuitively, the magnitude of the integral grows like Δt\sqrt{\Delta{t}} because it behaves like a one dimensional random walk: for a 11-d random walk, the expected displacement after nn steps is proportional to n\sqrt{n}. Thus we can compute the new state of the system after a time step Δt\Delta{t} using the Forward Euler integration scheme:

xi(t+Δt)=xi(t)+ΔtFi(𝐱)+Δtgξix_{i}(t+\Delta{t})=x_{i}(t)+\Delta{t}\ F_{i}({\bf x})+\sqrt{\Delta{t}}\ g\ \xi_{i} (6)

Although the Forward Euler scheme is only first order accurate, the equations are quite simple and simulating them is very fast. Thus, we can easily choose Δt\Delta{t} to be small enough to ensure accuracy. Naturally, higher order schemes could be applied as needed. Note that since the variables in our system describe the concentration of protein products, we require that no variable drop below zero: a negative concentration is non-physical.

Refer to caption
Figure 4: The behavior of the module in the presence of noise as it approaches the bifurcation. Each panel shows the final positions of 500 simulations (“cells”) starting from the interior stable fixed point simulated for 4040 units of tt (in Δt=.001\Delta{t}=.001 time steps). The noise magnitude is g=.1g=.1 and the overall activity a=2.0a=2.0 for various values of kk. Black circles show the positions of the stable fixed points and gray squares show the end points of each simulation. As kk increases, the variance of expression levels of x1x_{1} and x2x_{2} increases near the interior stable fixed point. For k=1.0,1.1k=1.0,1.1 none of the simulated cells “escape” from this fixed point. For k=1.2k=1.2 approximately 50 cells (or 10%10\%) escape, divided evenly between the two exterior fixed points. When k=1.3k=1.3 approximately 90%90\% escape. In similar simulations at the same noise strength, g=0.1g=0.1, none of the “cells” starting near one of the exterior fixed points transitioned to the interior fixed point for any parameter values tested.

In the presence of noise, we find that the transition between stability and instability for the interior stable fixed point is no longer abrupt. As the module approaches the bifurcation point, the restoring force which pushes the state of the system back towards this interior fixed point weakens. We simulated 500 cells starting at each of the three stable fixed points for values of k=1.0,1.1,1.2,1.3k=1.0,1.1,1.2,1.3 with g=0.1g=0.1 and Δt=0.001\Delta{t}=0.001 for 40,00040,000 time steps (4040 units of tt, a fairly lengthy period in simulation time relative to the timescale of transcription and translation). The distribution of the final states for cells starting at the interior stable fixed point spreads as kk increases (shown in Fig. 4), i.e. as the value of kk increases, the variance of the gene expression increases. Our model hence makes a sharp prediction that may soon be testable in vivo, using flow cytometry to measure the approximate expression level of a gene for each cell in a population of cells approaching differentiation. If our model is correct, the expression level of significant genes should become more varied across the population as the cells approach a cell fate decision. Thus, observing the variance of genes as cells differentiate would provide a clearer picture of which genes are controlling cell fate decisions: those with higher variance.

Before the actual bifurcation point is reached, we see non-trivial numbers of cells have already moved to one of the exterior, ‘differentiated’ fixed points. This indicates that for values of k1.2k\gtrsim 1.2 the interior stable fixed point becomes metastable (weakly stable). Thus the state of the system will not remain near this fixed point indefinitely in the presence of noise. As we discuss later, progenitor cell types may be metastable fixed points in which the state of the GRN rests for a short time before continuing its differentiation. This would explain the difficulty of culturing and maintaining such cell types in vitro.

In closing this section, we note that our model provides a potential dynamical explanation of the observed prevalence of binary (as opposed to trinary, etc.) branches in cell differentiation. The subcritical pitchfork bifurcation found in our modules arises from the topology of the regulatory interactions and is present for many choices of parameter values without careful tuning. This bifurcation leads naturally to binary cell differentiations. In general, only one of the eigenvalues of the interior fixed point becomes positive at the bifurcation. This implies that flows along the other eigenvectors near the interior fixed point still push the state of the system toward that fixed point. This mechanism concentrates flows along the unstable eigenvector. In Fig. 4 the distribution of final states does not spread evenly through the state space; it spreads preferentially along trajectories following the two sides of the unstable manifold of the interior fixed point. Near the fixed point, the unstable manifold is tangent to the unstable eigenvector; farther from the fixed point, the unstable manifold need not be a straight line. These trajectories, in turn, lead to one of the two exterior fixed points. Trinary branchings, since they cannot depend on such simple properties of the state space, are comparatively much more fragile and require correspondingly careful selection of parameters. We have found in our model that binary branching occurs ‘for free’ from a robustly occurring pitchfork bifurcation, with the simulated cells flowing away from their starting state along one of the two ends of the single, destabilized eigenvector.

4 Sequential Differentiation

Refer to caption
Figure 5: The modular network and consequences of a single cell fate decision, in detail. The kk and aa parameters in all three modules are functionally dependent on one of the genes in the module directly above them. In the case of module A, we denote the stimulus as a generic external signal ΔA\Delta_{A} to reflect the fact that it could come from another gene, or some external differentiation signal. In the top panel, the state of the system is poised in the intermediate, meta-stable state of module A (kA=1.25k_{A}=1.25). Modules B and C are both in the intermediate stable state, expressing x3x_{3}, x4x_{4}, x5x_{5}, and x6x_{6} at a moderate level. In the lower panel, module A has undergone differentiation, driven by noise, switching to one of the two exterior stable states, x1x_{1} high, x2x_{2} low. This in turn has caused module B to become meta-stable by increasing the value of kBk_{B} from 1.01.0 to 1.251.25. Module C is inactivated by the change in parameter aCa_{C} from 2.02.0 to 0.20.2, essentially silencing the genes in the module.

Now we are ready to build the full hierarchical model. In Fig. 5, we show a differentiation event in more detail: three modules are shown, A,B,A,B, and CC. Module AA influences BB and CC. In the top panel, module AA is in the intermediate fixed point, but this fixed point is only meta-stable. This meta-stability is caused by the influence of the signal ΔA\Delta_{A}. If module AA is part of a larger cascade, then the signal will originate from a gene in the module above AA in the hierarchy; if AA is at the top of the hierarchy then the signal may originate from outside the cell. Module AA expresses moderate levels of x1x_{1} and x2x_{2}, which in turn influence the parameters kB,aBk_{B},a_{B} and kC,aCk_{C},a_{C} in modules BB and CC, through the functional relationships:

kB=G(x1),kC=G(x2)aB=H(x1),aC=H(x2)\begin{array}[]{l}\displaystyle k_{B}=G(x_{1}),k_{C}=G(x_{2})\\ \displaystyle a_{B}=H(x_{1}),a_{C}=H(x_{2})\\ \end{array} (7)

For simplicity, the same functions GG and HH are used throughout the network. In Fig. 6 we show how the piecewise linear functions GG and HH respond to changes in the variable on which they depend. The specific functional form chosen is not important; sequential differentiation will occur as long as the functions satisfy two basic requirements. First, if the value of xx increases substantially, then the interior fixed point in the influenced module should be destabilized, preparing it to undergo differentiation. Thus, the function G(x)G(x), which controls kk, should increase slightly in value in response to higher values of xx. This higher value of G(x)G(x) will drive the bifurcation above. Second, if the value of xx decreases dramatically, indicating that the gene xx has been shut off, then the transcription rate, aa, of the downstream module, which is under the influence of xx, should be dramatically decreased. To achieve this, for small values of xx, H(x)H(x) should be small, effectively silencing the genes in the influenced module.

Refer to caption
Figure 6: The piecewise linear response functions GG and HH. In the top panel we show the function GG, which controls the decay rate kk, and in the bottom panel we show the function HH, which controls the transcription rate aa. Specifically, G=1.0G=1.0 for x2.7x\leq 2.7 and G=1.25G=1.25 for x3.0x\geq 3.0, interpolating between these values for 2.7x3.02.7\leq x\leq 3.0. Similarly H=0.2H=0.2 for x1.0x\leq 1.0 and H=2.0H=2.0 for x1.5x\geq 1.5, interpolating between these values for 1.0x1.51.0\leq x\leq 1.5.

Returning to Fig. 5, moderate levels of x1x_{1} and x2x_{2} result in parameters in modules BB and CC which maintain stable, moderate expression levels of their genes x3x_{3}, x4x_{4}, x5x_{5} and x6x_{6}. Note that any modules downstream of BB and CC would also express their genes at an intermediate level. Since module AA is in a meta-stable state, the state of the system is driven by noise after some time to one of the two exterior fixed points; in the Figure, x1x_{1} high, x2x_{2} low. The result is shown in the lower panel: module CC has been silenced by the decrease in expression of x2x_{2} (thus also shutting off all the genes in modules downstream from it, under the control of x5x_{5} or x6x_{6}). Module BB, under the influence of increased x1x_{1}, has become meta-stable, and is thus waiting to drive a differentiation event as the system “chooses” between expressing x3x_{3} and x4x_{4}.

The proposed links integrating the modules correspond to the simplest possible regulatory connections that generate the observed branching behavior. In most models, regulatory control acts on the process of transcription. However, the decay rate kk can also act as a control parameter, if one protein promotes the degradation of another protein. In the preceding discussion, we chose to use both kk and aa as control parameters to simplify the model. Since the qualitative behavior of the modules depends only on the relative values of the decay kk and the activation aa, the desired behavior can be accomplished using only the activation rate as a parameter. This requires that the activation rate in a regulated module respond non-monotonically to the relevant gene in the controlling module: for increased levels of the relevant gene, the activation rate decreases slightly (which has the same effect as increasing kk), whereas for decreased levels of the relevant gene, the activation rate drops dramatically, as it does for H(x)H(x), to silence the module. Precisely what regulatory connections appear in real networks remains to be seen; however, modules of mutually regulating transcription factors clearly are linked in real GRNs.

Thus our hierarchical network model allows a single, multipotent cell type to access a multitude of distinct, terminal states in a biologically sensible way. Cinquin and Demongeot [5] present an alternative mathematical model: a single high-dimensional genetic switch with more than two possible outcomes. Their model produces promiscuous expression of antagonistic factors (which represents the undifferentiated state), whereupon a symmetry breaking bifurcation results in increased expression of one of the genes in the circuit, and decreased expression of all the others. While Cinquin and Demongeot argue that such multi-switches are necessary to capture particular biologically observed differentiations, as a generic mechanism for generating a diversity of terminal cell types, the modular, hierarchical approach taken in our model is more robust: it is not reliant on the highly symmetric, all-to-all coupling present in the Cinquin model. Our model should work with a diversity of different modules, so long as they conform to certain, loose criteria. Although we imposed an x1x_{1}, x2x_{2} symmetry on the Huang circuits used in this paper, we did so only to reduce the number of parameters and simplify the discussion: the Huang switch maintains tristability, and all its other important properties, when the parameters are asymmetric [19].

There has been much interesting mathematical work on symmetry breaking in biological systems [9, 11, 33]. “Coupled cell systems” [9] present a potentially interesting mathematical toolkit for analyzing the dynamics of modular networks, such as the network proposed in this paper: dynamical components called “cells” interact with one another according to a given network of connections. Studying the hierarchical network architecture proposed in this paper using the tools of coupled cell systems may provide useful dynamical insights, and is a fruitful avenue for future work.

5 Whole network results

Refer to caption
Figure 7: A simulation of three interacting modules with noise g=0.1g=0.1. At t=5t=5 the interior fixed point of module AA is changed from stable to meta-stable by an external stimulus. For the next 10\approx 10 units of time tt the top module remains meta-stable, then finally falls into the x1x_{1} high exterior fixed point, as shown in the top panel. This silences module CC (and thus genes x5x_{5} and x6x_{6}, which are rendered inactive) as shown in the bottom panel. Finally, in the middle panel we see module BB, which became meta-stable after module AA made its cell fate decision and entered the exterior fixed point. Module BB remains meta-stable for 25\approx 25 units of tt until it transitions to an exterior fixed point, completing the differentiation of the cell through this segment of the network.

Fig. 7 shows the result of a simple three module simulation, with the noise parameter g=0.1g=0.1. The differentiation tree functions properly in the presence of noise: at the outset, all genes are expressed at a moderate level. At time t=5t=5, a signal is introduced which destabilizes the top module. After some time, the state of the the top module is decided in favor of x1x_{1}. As x2x_{2} decreases, we see that genes x5x_{5} and x6x_{6} are also silenced. The genes x3x_{3} and x4x_{4} continue to be expressed at moderate levels for some time, until through random fluctuations, the system exits the meta-stable state, resulting in increased x4x_{4} expression and decreased x3x_{3} expression. The individual cell fate decisions are strongly stochastic in the model as presented. Once the interior fixed point is destabilized, random fluctuations determine into which exterior fixed point the state of the system falls. This is consistent with the stochastic component of cell fate decisions observed in previous work [15, 20]. As noted in Huang et al. [19], the perturbation from the interior fixed point can be done asymmetrically, resulting in “intrinsic” as opposed to stochastic cell fate decisions.

At the outset, when any outcome was possible and the cell multipotent, all of the genes are expressed. As the cell differentiates and loses its pluripotency, the genes which correspond to the lineages which are no longer accessible to the cell are silenced. This sequence of silencing events provides a clear explanation of the exclusivity of cell differentiation, as well as a mechanism for ensuring an orderly progression of cell types. Consider a differentiation tree with nn levels and assume the tree is complete and symmetric, so that n1n-1 binary decisions occur. Fig. 1 is an example of such a tree for n=2n=2. A general tree will contain (2n1)(2^{n}-1) modules and N=2(2n1)N=2(2^{n}-1) genes. At the beginning of the differentiation process all NN genes will be expressed promiscuously at intermediate levels. When the module at the top of the tree transitions to one of its two possible exterior fixed points, the cascade of silencing turns off an entire (n1)(n-1)-level sub-tree. Thus the effective dimension of the state space is reduced from NN to (N2(2n11)1)(N-2(2^{n-1}-1)-1). The silenced genes are fixed near 0 and subsequent time evolution of the system can only take place in the remaining state space, which is associated exclusively with the available lineages. Exclusivity and orderly cell type progression follow immediately from this successive restriction of the dynamics to smaller and smaller subspaces. This picture also yields an experimental prediction. If the dynamical mechanism proposed here is correct, then differentiation from multipotent to more specific cell types should be characterized by a sequence of gene silencing events, with many more genes silenced (on average) early in the differentiation process and fewer genes silenced as the terminal type is achieved.

As seen in the simulation described above, individual cells can remain in the meta-stable state for a long time before continuing their differentiation. We simulated 20002000 iterations of the model, and measured the length of time the system spent in meta-stable states. We found an exponential distribution of waiting times, with some of our simulated cells spending as many as 290290 units of tt in the meta-stable state. For the purposes of comparison, we note that once the state escapes the region of state space close to the meta-stable fixed point and the transition to an exterior fixed points begins, the transition often occurs in less than 55 units of tt. So while individual cells cannot be maintained in these meta-stable states indefinitely, such states can be quite long-lasting. We have not considered the proliferation of cells (i.e. division maintaining cell type), as described by Hoffmann et al [17]. The proliferation rate is a function of the state of the GRN. Thus if the proliferation rate is high in the meta-stable states of intermediate gene expression, a population of cells in the meta-stable state could be maintained indefinitely. The number of progenitor cells found in vivo is the result of a balance between: cells differentiating into the meta-stable progenitor state from more multipotent cell types; progenitor cells proliferating; and progenitor cells differentiating out of the progenitor cell type to a more lineage-committed state through the stochastic dynamics described above. Thus the meta-stability of the dynamical fixed point may provide an explanation for the limited capacity for self-renewal of some progenitor cell types in vitro.

6 Conclusion

Our model reproduces all of the desired behaviors of cell differentiation by using established network properties of real genetic networks. Cell differentiation occurs between dynamically distinct, stable cell types. Cell fate decisions are directional: small parameter changes can easily drive differentiation down the tree, but not back up again. Differentiation occurs in a branching, exclusive fashion, due to the properties of the bifurcation and the progressive restriction of available state space. And finally, a cell type higher in the differentiation tree exhibits promiscuous expression of genes specific to all its accessible lineages, by construction. It is easy to add additional control mechanisms to this model; for example, we could require external signals, in concert with the influence of “higher” modules, to drive each bifurcation. Our goal, however, is merely to outline a minimal dynamical system capturing certain generic features of cell differentiation.

In addition, our model suggests a new role for modularity and an explanation for the prevalence of binary cell fate decisions. Modules allow for the easy implementation of repeated behaviors, in this case facilitating the repeated branchings observed in cell fate decisions. The regulatory structure of the hierarchical, modular network topology serves to channel trajectories in the state space, by progressively restricting them to lower dimensional subspaces. This guides the state of the system through an orderly progression of fixed points, representing the various cell types. Further, binary cell fate decisions emerge effortlessly and robustly from the topology of our network modules due to the behavior of flows near the interior fixed point of intermediate gene expression. The dynamical simplicity and robustness of binary cell fate decisions could explain their prevalence in cell differentiation. Our model thus illustrates that many of the important properties of cell differentiation can be captured quite naturally by a system of loosely coupled genetic modules.

Building progressively more accurate whole-system models that recapitulate key functionalities provides an important complement to the traditional “bottom up” approach, which seeks maximal integration of all available details. A “top down” approach, focused on less detailed minimal models, allows us to focus on understanding the influence of the topology of the genetic regulatory network on the resulting dynamics. The details of our individual modules and their interconnections are of course oversimplified and incomplete. However, emerging evidence from the topology of real genetic networks suggests that the overall modular architecture is likely to be correct. We hope that future experimental results will show to what degree the genetic regulatory network in general, and the cell differentiation network in particular, can be treated as a collection of loosely coupled modules. Our model makes two clear predictions regarding the cell differentiation network. On the one hand, the variance in expression level of genes involved with the differentiation process should increase across a population of cells as the population approaches differentiation. On the other, the progressive restriction of available state space should be reflected in a sequence of gene silencing events as cells proceed from more multipotent, promiscuously expressing types down a particular lineage, with fewer genes (on average) going silent as the terminal cell type is approached. As large-scale models improve through interaction with experimental evidence, we believe that the top down and bottom up approaches can eventually “meet in the middle”, fitting carefully specified subsections of the genetic network into a larger network framework.

References

  • Ackers [1982] Ackers, G.K., e. a. (1982) , Proc. Natl. Acad. Sci. 79, 1129
  • Alon [2009] Alon, U. (2009) , Science 301, 1866
  • Bokes [2009] Bokes, P., e. a. (2009) , Math. Med. & Biol. 26, 117
  • Chickarmane [2006] Chickarmane, V., e. a. (2006) , PLoS Comput. Biol. 2(9)(e123)
  • Cinquin and Demongeot [2005] Cinquin, O. and Demongeot, J. (2005) , J. Theor. Biol. 233, 391
  • Cross and Enver [1997] Cross, M. and Enver, T. (1997) , Current Opinion in Genetics & Development 7, 609
  • Danos [2008] Danos, V., e. a. (2008) , In Proceedings of the First International Workshop, Formal Methods in Systems Biology, FMSB ’2008. (J. Fisher ed.), Vol. 5054 of Lecture Notes in BioInformatics, pp. 103–122, Cambridge, UK: Springer, Berlin, Germany
  • Davidson [2002] Davidson, E., e. a. (2002) , Science 295, 1669
  • Dias and Lamb [2006] Dias, A. and Lamb, J. (2006) , Physica D 233, 93
  • Eldar [2002] Eldar, A., e. a. (2002) , Nature 419, 304
  • Elmhirst [2004] Elmhirst, T. (2004) , International Journal of Bifurcation and Chaos 14(3), 1017
  • Elowitz [2002] Elowitz, M.B., e. a. (2002) , Science 297, 1183
  • Espinosa-Soto [2004] Espinosa-Soto, C., e. a. (2004) , The Plant Cell 16, 2923
  • Gardner [2000] Gardner, T.S., e. a. (2000) , Nature 403, 339
  • Graf [2002] Graf, T. (2002) , Blood 99, 3089
  • Hartwell [1999] Hartwell, L.H., e. a. (1999) , Nature 402, C47
  • Hoffmann M. [2008] Hoffmann M., e. a. (2008) , PLoS One 3(8)(e2922)
  • Huang [2005] Huang, S., e. a. (2005) , Phys. Rev. Lett. 94, 128701
  • Huang [2007] Huang, S., e. a. (2007) , Developmental Biology 305, 695
  • Hume [2000] Hume, D. (2000) , Blood 96, 2323
  • Ihmels [2002] Ihmels, J., e. a. (2002) , Nature Genetics 31, 370
  • Kal and Spradling [2004] Kal, T. and Spradling, A. (2004) , Nature 428, 564
  • Kauffman [2003] Kauffman, S.A., e. a. (2003) , Proc. Natl. Acad. Sci. 100, 14796
  • Kauffman [1969] Kauffman, S. (1969) , J. Theor. Biol. 22, 437
  • Kauffman [2004] Kauffman, S. (2004) , J. Theor. Biol. 230, 581
  • Kepler and T. [2001] Kepler, T. and T., E. (2001) , Biophys. J. 81, 3116
  • Lee [2002] Lee, T.I., e. a. (2002) , Science 298, 799
  • Milo [2002] Milo, R., e. a. (2002) , Science 298, 824
  • Ravasz [2002] Ravasz, E., e. a. (2002) , Science 297, 1551
  • Ribeiro [2006] Ribeiro, A., e. a. (2006) , Journal of Computational Biology 13(9), 1630
  • Roeder and Glauche [2006] Roeder, I. and Glauche, I. (2006) , J. Theor. Biol. 241, 852
  • Shen-Orr [2002] Shen-Orr, S., e. a. (2002) , Nature Genetics 31, 64
  • Stewart [2003] Stewart, I., e. a. (2003) , Bifurcation, Symmetry and Patterns, pp. 3–54, Chicago, IL: Springer
  • Strogatz [1994] Strogatz, S. (1994) , Nonlinear Dynamics and Chaos, pp. 55–60, Cambridge, MA.: Perseus Publishing
  • Variano and McCoy [2004] Variano, E. and McCoy, J. (2004) , Phys. Rev. Lett. 92, 188701