Beyond the adiabatic limit in systems with fast environments:
a $\tau$ -leaping algorithm

Ernesto Berríos-Caro ernesto.berrios@postgrad.manchester.ac.uk Theoretical Physics, Department of Physics and Astronomy, School of Natural Sciences, Faculty of Science and Engineering, The University of Manchester, Manchester M13 9PL, United Kingdom Tobias Galla tobias.galla@ifisc.uib-csic.es Theoretical Physics, Department of Physics and Astronomy, School of Natural Sciences, Faculty of Science and Engineering, The University of Manchester, Manchester M13 9PL, United Kingdom Instituto de Física Interdisciplinar y Sistemas Complejos, IFISC (CSIC-UIB), Campus Universitat Illes Balears, E-07122 Palma de Mallorca, Spain

Abstract

We propose a $\tau$ -leaping simulation algorithm for stochastic systems subject to fast environmental changes. Similar to conventional $\tau$ -leaping the algorithm proceeds in discrete time steps, but as a principal addition it captures environmental noise beyond the adiabatic limit. The key idea is to treat the input rates for the $\tau$ -leaping as (clipped) Gaussian random variables with first and second moments constructed from the environmental process. In this way, each step of the algorithm retains environmental stochasticity to sub-leading order in the time scale separation between system and environment. We test the algorithm on several toy examples with discrete and continuous environmental states, and find good performance in the regime of fast environmental dynamics. At the same time, the algorithm requires significantly less computing time than full simulations of the combined system and environment. In this context we also discuss several methods for the simulation of stochastic population dynamics in time-varying environments with continuous states.

I Introduction

The modelling of dynamical systems in biology and other disciplines necessarily requires simplifying assumptions and a level of coarse graining. If all processes we know about are included, then the model becomes so complicated that it cannot be simulated or analysed. Even if simulation or analysis is possible further study of such a model will rarely be enlightening. Excessive detail makes hard to identify the key mechanisms at work and to understand what model components are responsible for these mechanisms. At the same time, some element of realism must be maintained. The model must not be so stylised to miss the key ingredients and behaviour it is meant to capture. The principal challenge, therefore, is to find the right level of detail, given the intended purpose.

The choice between stochastic and deterministic modelling approaches is one aspect of this discussion. If more detailed stochastic models mark one end of the spectrum, then many traditional models in mathematical biology or chemistry sit at the opposite end. These models are often built on a small number of ordinary or partial differential equations (e.g. Murray (2002, 2003)). This deterministic approach is valid if one can assume that the same initial conditions will always lead to the same outcome. For many applications involving very large systems this is a perfectly sensible approach.

However, it is now also universally recognised that stochasticity in the time-evolution of many systems is key in shaping the outcome, see e.g. Goel and Richter-Dyn (2004); Ewens (2004); Traulsen and Hauert (2010). Consequently a number of analytical and computational methods has been developed for the study of stochastic systems. One focus is on systems with discrete interacting individuals. What these individuals represent depends on the context, they could be members of different species in population dynamics, individual animals or humans in models of an epidemic, or molecules in chemical reaction systems Goel and Richter-Dyn (2004); Castellano et al. (2009); Keeling and Rohani (2008); Kampen (2007).

One particular point of interest within this class of individual-based systems are models operating in a time-dependent environment. This environment is not part of the system proper, but its state has an effect on what happens in the system. In a model of a population of bacteria for example, the reproduction or death rates could depend on external conditions such as the availability of nutrients or the presence of toxins Acar et al. (2008); Patra and Klumpp (2015). In population dynamics, the carrying capacity could vary in time Wienand et al. (2017, 2018); Taitelbaum et al. (2020), and in epidemics the infection rate is subject to seasonal changes Black and McKane (2010). The focus of our paper is on such individual-based models in time-varying external environments.

Analytical approaches to stochastic systems with discrete individuals usually start from the chemical master equation. In limited cases direct solution is possible, for example using generating functions. However, this is the exception, and a number of approximation schemes have consequently been developed. These include Kramers–Moyal and system-size expansions, leading to Fokker–Planck equations and descriptions in terms of stochastic differential equations Kampen (2007); Gardiner (2004). These schemes sacrifice the granularity of a discrete-agent system, and instead describe the dynamics in terms of continuous densities. This approach can be successful in particular for large populations. Any particular event then only results in a small change in the composition of the population relative to its size. Individual-based approaches and descriptions based on deterministic differential equations been extended to models of population dynamics in switching environments, for a selection of work see Kussell and Leibler (2005); Kepler and Elston (2001); Thattai and Van Oudenaarden (2004); Swain et al. (2002); Assaf et al. (2013a); Duncan et al. (2015); Assaf et al. (2013b); Ashcroft et al. (2014); Wienand et al. (2017); West et al. (2018); Assaf et al. (2008); Hufton et al. (2019a).

There are however situations in which one would rather avoid giving up the discrete nature of the population. For example, granularity is crucial for extinction processes (the number of individuals of the species about to go extinct is small by definition). In other situations the population may not be large enough to warrant a description in terms of continuous densities. For example, copy numbers in genetic circuits can be of the order of tens to hundreds (see e.g. Eldar and Elowitz (2010)), and it is difficult to justify a continuum limit. It then becomes necessary to carry out numerical simulations of the discrete individual-based process. The method of choice is the Gillespie algorithm (Gillespie, 1976, 1977), generating a statistically accurate ensemble of sample paths of the continuous-time dynamics.

In most applications the rate of events scales with the size of the population so that each individual experiences an ${\cal O}(1)$ number of reactions per unit time. The Gillespie method then runs into difficulties when the population is large, and with it the number of reactions per unit time. The computational cost of generating sample paths to up the desirable end point can then become very high. Similarly, a time scale separation between the dynamics in the population and the environment may make simulations challenging. If the environment is very fast compared to the population, a significant number of environmental events needs to be executed between events in the population. This aggravates the above limitations for large populations, and simulations can become problematic even for intermediate population sizes. One possible approach to this consists of assuming that the environment is ‘infinitely’ fast compared to the population. This is known as quasi steady state approximation Bowen et al. (1963); Segel and Slemrod (1989), or the ‘adiabatic limit’ Lin and Buchler (2018a); Hufton et al. (2019a). For related work see also Newby and Bressloff (2010); Ashcroft et al. (2014); Bressloff (2016, 2017a, 2017b). If this limit is taken then the environmental dynamics can be ‘averaged out’, and effective reaction rates can be used for the population. While computationally convenient, this approach discards any stochasticity from the environmental process. This sets another limitation, in particular when it is not valid to assume that the environment is infinitely fast compared to the population.

The objective of this work is therefore to design and test an algorithm for systems with fast environmental dynamics, but which also captures some elements of the environmental noise. We call this discrete-time algorithm $\tau$ FE – $\tau$ -leaping for fast environments. It is built on the ideas of the conventional $\tau$ -leaping algorithm (Gillespie, 2001), but with modifications such as to preserve elements of the stochasticity of the environmental process. To do this, we assume that the environment is fast compared to the population, but not infinitely fast. More precisely, in each step of the algorithm we take into account sub-leading contributions in the time-scale separation.

The key new element of our algorithm is how we deal with the environment. We do not take the adiabatic limit, instead we treat the reaction rates in the population as random variables during each step. The rates are drawn from a distribution at the beginning of each step, and then remain fixed during the time step. The distribution of rates can change from one step to the next, and is constructed to reflect statistical features of the original environmental dynamics.

Each step of the $\tau$ FE algorithm consists of two parts: First a realisation of reaction rates is drawn from the appropriate distribution. Then a conventional $\tau$ -leaping step is carried out with these rates. The core of our paper consists of the construction of the ‘appropriate distribution’ for the reaction rates. These ideas were introduced in a previous work Hufton et al. (2019b) for a simple case of a two-species birth-death process in an environment which can take two discrete states. In the present paper we develop this further. We develop and test a more general algorithm for environments with more than two discrete states. As we will show, the algorithm can also be extended to continuous environmental dynamics.

The remainder of the paper is organised as follows. In Sec. II we describe the general setup of the type of system we simulate. We also outline the general principles of the $\tau$ FE algorithm. In Sec. III we then make the necessary preparations for the introduction of the algorithm. In particular, we compute the statistics of reaction rates which are fed into the conventional $\tau$ -leaping step. We then describe the algorithm in detail. In Sec. IV, we test the $\tau$ FE algorithm in different models with discrete environmental states. In Sec. V, we then describe how the $\tau$ FE algorithm can be used when the environment takes continuous states. Specifically, we consider an Ornstein-Uhlenbeck process. In this context we also describe how known algorithms can be adapted to simulate continuous environments. Finally, we provide a discussion of our results and overall conclusions in Sec. VII.

II Model setup and general principles of the algorithm

II.1 Model definitions and notation

We look at systems composed of discrete individuals. We will refer to this synonymously as the ‘system proper’, or ‘the population’. Each of the individuals is of one of $S$ species (or types), labelled $i=1,\dots,S$ . We write $n_{i}$ for the number of individuals of species $i$ in the population, and $\mathbf{n}=(n_{1},\dots,n_{S})$ . The system evolves in an external environment, whose state we write as $\sigma$ . These states are time dependent, and can either take discrete values or be continuous.

The dynamics in the population proceeds through reactions $r=1,\dots,R$ . Each of these reactions converts a number of individuals from one type into another. Time in the model is continuous, and we assume that the dynamics is Markovian. We then write $R_{r,\sigma}(\mathbf{n})$ for the rate of reaction $r$ if the environment is in state $\sigma$ and the population in state $\mathbf{n}$ . The stoichiometric coefficient $\nu_{r,i}$ indicates how the number of individuals of type $i$ changes when a reaction of type $r$ occurs. Each $\nu_{r,i}$ is an integer, which can be negative, zero, or positive. We write ${\mbox{\boldmath$\nu$}}_{r}=(\nu_{r,1},\dots,\nu_{r,S})$ . The rates $R_{r,\sigma}(\mathbf{n})$ and the stoichiometric coefficients fully specify the dynamics of the population when the environment is in state $\sigma$ .

The state of the environment undergoes a Markovian stochastic process, governed by a master equation if states are discrete or by a stochastic differential equation in the case of continuous environmental states. These dynamics can depend on the state of the population $\mathbf{n}$ . If the environmental states are discrete we write $q_{\sigma\to\sigma^{\prime}}(\tau)$ for the probability of finding the environment in state $\sigma^{\prime}$ at a particular point in time, given that $\tau$ units of time earlier it was in state $\sigma$ . If the environment is continuos then $q_{\sigma\to\sigma^{\prime}}(\tau)$ is a probability density for $\sigma^{\prime}$ (at given $\sigma$ ). We call $q_{\sigma\to\sigma^{\prime}}(\tau)$ the transition kernel of the environmental process. We write ${\mbox{\boldmath$\rho$}}^{*}$ for the stationary distribution of the environmental dynamics. For discrete environmental states the entries $\rho^{*}_{\sigma}$ denote the probability of finding the system in state $\sigma$ in the stationary state. For continuous environments $\rho^{*}_{\sigma}$ is the stationary probability density for $\sigma$ .

II.2 General principles of the $\tau$ -leaping algorithm for systems in fast environments

A conventional reaction system (without external environment) is governed by a chemical master equation of the form

	$\displaystyle\frac{\mathrm{d}}{\mathrm{d}t}P(\mathbf{n},t)=$
	$\displaystyle\sum_{r}\big{[}R_{r}(\mathbf{n}-{\mbox{\boldmath$\nu$}}_{r})P(\mathbf{n}-{\mbox{\boldmath$\nu$}}_{r},t)-R_{r}(\mathbf{n})P(\mathbf{n},t\big{)}].$		(1)

The notation is as in Sec. II.1, the only difference is that there is no subscript $\sigma$ , as there is no environment. Sample paths entail events (reactions) which can occur at any point in continuous time, separated by exponentially distributed random waiting times. In each such event the state of the system $\mathbf{n}$ changes, and accordingly the reaction rates $R_{r}(\mathbf{n})$ can also change. Sample paths can be generated for example using the celebrated Gillespie algorithm (Gillespie, 1976, 1977).

The $\tau$ -leaping algorithm for such conventional reaction systems is built around the idea of keeping reaction rates constant over finite time steps of length $\tau$ Gillespie (2001). That is to say, if the state of the population is $\mathbf{n}$ at time $t$ , then the assumption is made that this state $\mathbf{n}$ and the rates $R_{r}(\mathbf{n})$ do not change until the end of the time step. The algorithm does not account for potential changes of the rates as individual reactions occur, and instead directly ‘leaps’ to time $t+\tau$ . This is justified provided the so-called ‘leap condition’ is fulfilled Gillespie (2001): broadly speaking the time step $\tau$ must be sufficiently small so that the state $\mathbf{n}$ in the continuous-time system does not change significantly in a time interval of length $\tau$ .

Making the approximation of constant $\mathbf{n}$ in the time interval from $t$ to $t+\tau$ , the number of reactions of type $r$ that fire in this interval follows a Poissonian distribution with parameter $\tau R_{r}(\mathbf{n})$ . Accordingly, realisations of Poissonian random variables $m_{1},\dots,m_{R}$ are drawn, and the corresponding numbers of each reactions are executed simultaneously. This generates a new state $\mathbf{n}^{\prime}$ at time $t+\tau$ , with entries $n_{i}^{\prime}=n_{i}+\sum_{r=1}^{R}m_{r}\nu_{i,r}$ . The process then repeats with updated rates $R_{r}(\mathbf{n}^{\prime})$ .

The idea of the $\tau$ -leaping algorithm we introduce for systems in external environments is similar. As in the conventional algorithm we discretise time, and keep the composition of the population $\mathbf{n}$ fixed during each iteration. It is only updated at the end of each step. From now on we use $\Delta t$ for the duration of a step instead of $\tau$ .

The difference to the conventional case is the external environment. If the environmental state space is discrete then switches of the environment can in principle be simulated in continuous time along with the other reactions (using Gillespie algorithm (Gillespie, 1976, 1977)). They can also be dealt with by means of the conventional $\tau$ -leaping algorithm, again along with the other reactions. These are natural simulation approaches when the environment operates on a similar time scale as the reactions in the population. Not much can then be gained by distinguishing between environmental processes and the dynamics in the system proper.

If the environment is infinitely fast compared to the reactions in the population, then the environment reaches stationarity on very short time scales. One can average over environmental states, see for example Bowen et al. (1963); Segel and Slemrod (1989); Newby and Bressloff (2010); Bressloff and Newby (2014); Hufton et al. (2019b, a). If the environment is discrete, for example, we can use average rates

R_{r}^{*}(\mathbf{n})\equiv\sum_{\sigma}\rho^{*}_{\sigma}R_{r,\sigma}(\mathbf{n}).

(2)

In the case of continuous environments the sum is to be replaced with an integral. These rates are functions of $\mathbf{n}$ only, the environmental process has been averaged out. Noise from the environmental process plays no role in the dynamics if these average rates are used. This corresponds to making a quasi-stationary state approximation for the fast-moving environment Bowen et al. (1963); Segel and Slemrod (1989).

The aim of this paper is to go beyond this adiabatic limit, and to construct a $\tau$ -leaping algorithm which captures some elements of extrinsic noise. We focus on the limit of a fast, but not infinitely fast environmental dynamics.

Broadly speaking the $\tau$ FE algorithm is constructed around the idea of treating the reaction rates $R_{r}(\mathbf{n})$ as stochastic variables in each discrete time step. These random variables represent the rates one obtains when averaging the environmental process over the time step $\Delta t$ . Assuming that the rate of change of the environment is finite these average rates will remain stochastic. In the limit of infinitely fast environments the deterministic limit in Eq. (2) is recovered, and there is no stochasticity from the environment.

To construct the random reaction rates for each step, we make an approximation: we use a Gaussian distribution for the rates, with means as in Eq. (2) and with variances and correlations derived from the original combined process of the population and environment. We describe this in detail in the next section.

III Construction of the $\tau$ FE algorithm for systems with discrete environments

III.1 Preliminary analysis of the environmental process

Here we assume the environmental states are discrete, $\sigma\in\{1,\dots,M\}$ . The dynamics of the environment is governed by the rates $\lambda A_{\sigma\to\sigma^{\prime}}(\mathbf{n})$ for transitions from $\sigma$ to $\sigma^{\prime}$ . The factor $\lambda$ is introduced to control the time-scale separation between reactions in the population and the switching of the environment. We use the notation $\mathbf{A}(\mathbf{n})$ for the $M\times M$ matrix with elements $A_{\sigma\to\sigma^{\prime}}(\mathbf{n})$ . We also set $A_{\sigma\to\sigma}(\mathbf{n})=-\sum_{\sigma^{\prime}\neq\sigma}A_{\sigma\to\sigma^{\prime}}(\mathbf{n})$ . The combined dynamics of population and environment are then described by the master equation

	$\displaystyle\frac{\mathrm{d}}{\mathrm{d}t}P(\mathbf{n},\sigma,t)=$
	$\displaystyle\sum_{r}\big{[}R_{r,\sigma}(\mathbf{n}-{\mbox{\boldmath$\nu$}}_{r})P(\mathbf{n}-{\mbox{\boldmath$\nu$}}_{r},\sigma,t)-R_{r,\sigma}(\mathbf{n})P(\mathbf{n},\sigma,t)\big{]}$
	$\displaystyle+\lambda\sum_{\sigma^{\prime}}\big{[}A_{\sigma^{\prime}\to\sigma}(\mathbf{n})P(\mathbf{n},\sigma^{\prime},t)-A_{\sigma\to\sigma^{\prime}}(\mathbf{n})P(\mathbf{n},\sigma,t)\big{]}.$		(3)

The rates $\lambda A_{\sigma\to\sigma^{\prime}}(\mathbf{n})$ can depend on the state of the population, $\mathbf{n}$ . This means that $\mathbf{n}$ and $\sigma$ do not necessarily evolve in time independently. However, as mentioned above the state $\mathbf{n}$ of the system is kept constant during each $\tau$ -leaping step. This in turn means that the transition rates for the environment also remain constant during each step.

We now focus on one such time step, starting at time $t$ and ending at $t+\Delta t$ . We assume that $\mathbf{n}$ remains constant during this time interval. For the remainder of Sec. III.1 we suppress the potential dependence of $\mathbf{A}$ on $\mathbf{n}$ , although it is always implied. We write $\rho_{\sigma}(t^{\prime})$ for the probability that the environment is in state $\sigma$ at time $t\in[t,t+\Delta t]$ . We then have the master equation

\frac{\mathrm{d}{\mbox{\boldmath$\rho$}}}{\mathrm{d}t^{\prime}}=\lambda\mathbf{A}{\mbox{\boldmath$\rho$}}

(4)

for the environmental dynamics. The stationary distribution ${\mbox{\boldmath$\rho$}}^{*}$ for the environment is the solution of $\mathbf{A}{\mbox{\boldmath$\rho$}}^{*}=0$ . If $\mathbf{A}$ depends on $\mathbf{n}$ , then $\rho^{*}$ will also be a function of $\mathbf{n}$ . Assuming that the environmental process is irreducible this stationary distribution is unique for any one $\mathbf{n}$ .

The stochastic matrix $\mathbf{A}$ has one zero eigenvalue, which we write as $\mu_{1}=0$ . The remaining eigenvalues are denoted by $\mu_{2},\dots,\mu_{M}$ . We then have $\mu_{2},\dots,\mu_{M}<0$ . The corresponding (right) eigenvalues are written as $\mathbf{v}_{1}={\mbox{\boldmath$\rho$}}^{*}$ (the eigenvector corresponding to eigenvalue $0$ ), and $\mathbf{v}_{2},\dots,\mathbf{v}_{M}$ respectively for the remaining eigenvectors. These are all understood to be column vectors of length $M$ .

We note that the general solution of Eq. (4) can be written in the form

{\mbox{\boldmath$\rho$}}(t^{\prime})={\mbox{\boldmath$\rho$}}^{*}+\sum_{\ell=2}^{M}c_{\ell}e^{\lambda\mu_{\ell}(t^{\prime}-t)}\mathbf{v}_{\ell},

(5)

with coefficients $c_{\ell}$ determined by the initial condition at the beginning of the time step $t^{\prime}=t$ . More precisely these coefficients can be obtained from the linear system

\sum_{\ell=2}^{M}c_{\ell}\mathbf{v}_{\ell}={\mbox{\boldmath$\rho$}}(t)-{\mbox{\boldmath$\rho$}}^{*}.

(6)

We remark that there are $M-1$ coefficients, $c_{\ell}$ ( $\ell=2,\dots,M$ ). The system in Eq. (6) technically consists of $M$ equations, but these are not independent due to normalisation of the probabilities on the right-hand side.

Calculating the probability $q_{\sigma\to\sigma^{\prime}}(\Delta t)$ to find the environment in state $\sigma^{\prime}$ at the end of the time step if it was in $\sigma$ at the beginning of the step is now mainly a matter of computing the coefficients $c_{\ell}$ . We write $c_{\ell,\sigma}$ for the value the coefficient $c_{\ell}$ takes when $\rho_{\sigma^{\prime}}(t)=\delta_{\sigma^{\prime},\sigma}$ (i.e., when the system starts in state $\sigma$ at the beginning of the step).

We then have

q_{\sigma\to\sigma^{\prime}}(\Delta t)=\rho^{*}_{\sigma^{\prime}}+\sum_{\ell=2}^{M}c_{\ell,\sigma}e^{\lambda\mu_{\ell}\Delta t}v_{\ell,\sigma^{\prime}},

(7)

where $v_{\ell,\sigma^{\prime}}$ is the $\sigma^{\prime}$ -entry of the eigenvector $\mathbf{v}_{\ell}$ of $\mathbf{A}$ .

If the matrix $\mathbf{A}$ depends on the population state $\mathbf{n}$ , the parameters $\mu_{\ell},v_{\ell,\sigma^{\prime}},$ and $c_{\ell,\sigma}$ can also depend on $\mathbf{n}$ . For simplicity of notation we have not included this potential dependence in the above equations.

III.2 Time-averaged reaction rates as random variables

The $\tau$ -leaping algorithm proceeds in discrete time intervals of length $\Delta t$ . We continue to focus on one such interval $[t,t+\Delta t]$ . The state of the population at the beginning of the step is $\mathbf{n}$ and we assume that this state does not change until the end of the interval. We do however take into account the fact that the state of environment $\sigma$ can undergo changes in the interval from $t$ to $t+\Delta t$ . As a consequence, $R_{r,\sigma}(\mathbf{n})$ (at fixed $\mathbf{n})$ is also a function of time.

We then introduce the time-averaged quantities

\overline{R_{r}}(\mathbf{n})\equiv\frac{1}{\Delta t}\int_{t}^{t+\Delta t}\mathrm{d}t~{}R_{r,\sigma(t)}(\mathbf{n}),

(8)

noting that the time average is over the duration of the time step only as opposed to a long-term asymptotic time average. Given that the time step $\Delta t$ is finite ( $\Delta t<\infty$ ) and assuming that the environment fluctuates with finite rates ( $\lambda<\infty$ ), the quantity $\overline{R_{r}}(\mathbf{n})$ is a stochastic variable as it depends on the realisation of the environmental process. In one given time interval, the rates $\overline{R_{r}}(\mathbf{n})$ for different $r$ will be correlated as they all derive from the same path of the environment. As we will show below, the fluctuations of the random variables $\overline{R_{r}}(\mathbf{n})$ in any one time step are inversely proportional to $\lambda\Delta t$ to leading order. In the limit $\lambda\Delta t\to\infty$ the $\overline{R_{r}}(\mathbf{n})$ become deterministic.

We assume that the distribution for $\sigma$ at the beginning of the time step is the stationary distribution ${\mbox{\boldmath$\rho$}}^{*}$ . This is the case for example, if then environmental state is drawn from the stationary distribution at the beginning of the simulation. The distribution for $\sigma(t^{\prime})$ is then also the stationary distribution at each time $t^{\prime}\in[t,t+\Delta t]$ . Writing $\left\langle{\dots}\right\rangle$ for the average over realisations of the environmental process, we then have

\left\langle{\overline{R_{r}}(\mathbf{n})}\right\rangle=R_{r}^{*}(\mathbf{n}),

(9)

with $R_{r}^{*}(\mathbf{n})$ as in Eq. (2).

However, $\sigma(t^{\prime})$ ( $t^{\prime}\in[t,t+\Delta t]$ ) will generally be correlated with $\sigma(t)$ . Neglecting these means to operate in the adiabatic limit. We would like to retain some of these correlations. In order to compute second moments $\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle$ we first use Eq. (8). This leads to averages of the type $\left\langle{R_{r,\sigma(t_{1})}(\mathbf{n})R_{s,\sigma(t_{2})}(\mathbf{n})}\right\rangle$ , where $t_{1}$ and $t_{2}$ are two times in the interval from $t$ to $t+\Delta t$ . The second moments can then be expressed in terms of $q_{\sigma\to\sigma^{\prime}}(\cdot)$ as follows

	$\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle=$
	$\displaystyle\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{t}^{t+\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{t_{1}+\Delta t}\mathrm{d}t_{2}\Big{\{}\rho^{*}_{\sigma}q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})$
	$\displaystyle\quad\quad\times\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}\Big{\}}.$		(10)

Further details are given in Appendix A. Using Eq. (7) we find

	$\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle-R_{r}^{}(\mathbf{n})R_{s}^{}(\mathbf{n})=$
	$\displaystyle\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\sum_{\ell=2}^{M}\bigg{\{}\rho^{*}_{\sigma}c_{\ell,\sigma}v_{\ell,\sigma^{\prime}}$
	$\displaystyle\times\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}$
	$\displaystyle~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}\times\int_{t}^{t+\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{t_{1}+\Delta t}\mathrm{d}t_{2}~{}e^{\lambda\mu_{\ell}(t_{2}-t_{1})}\bigg{\}}.$		(11)

For fixed $\ell\in\{2,\dots,M\}$ the integral in the last expression evaluates to

	$\displaystyle\int_{t}^{t+\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{t_{1}+\Delta t}\mathrm{d}t_{2}~{}e^{\lambda\mu_{\ell}(t_{2}-t_{1})}=$
	$\displaystyle-\dfrac{\Delta t}{\lambda\mu_{\ell}}+\frac{1}{(\lambda\mu_{\ell})^{2}}\left[e^{\lambda\mu_{\ell}\Delta t}-1\right].$		(12)

For $\lambda\Delta t\gg 1$ the first term dominates after inserting into Eq. (11), as also observed in Hufton et al. (2016). We are then left with

	$\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle-R_{r}^{}(\mathbf{n})R_{s}^{}(\mathbf{n})=$
	$\displaystyle-\frac{1}{\lambda\Delta t}\sum_{\sigma\sigma^{\prime}}\sum_{\ell=2}^{M}\bigg{\{}\frac{1}{\mu_{\ell}}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})$
	$\displaystyle~{}~{}~{}~{}\times\big{[}\rho^{}_{\sigma}c_{\ell,\sigma}v_{\ell,\sigma^{\prime}}+\rho^{}_{\sigma^{\prime}}c_{\ell,\sigma^{\prime}}v_{\ell,\sigma}\big{]}\bigg{\}}.$		(13)

The main challenge in implementing the $\tau$ FE algorithm is then to find the average rates from Eq. (9) for all $r$ , and the second moments from Eq. (13) for any pair $r,s$ of reactions affected by the environment.

III.3 Description of the algorithm

Without loss of generality, we assume that only the rates for the reactions $r=1,\dots,L$ ( $L\leq R$ ) depend on the environmental state $\sigma$ .

The $\tau$ FE algorithm with time step $\Delta t$ proceeds as follows:

1.

Initiate the population in state $\mathbf{n}(0)$ . Set time to $t=0$ .
2.

Compute $R^{*}_{r}(\mathbf{n})$ for $r=1,\dots,L$ using Eq. (2), and the covariances $\Xi_{rs}(\mathbf{n})\equiv\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle-R_{r}^{*}(\mathbf{n})R_{s}^{*}(\mathbf{n})$ using Eq. (13) for every pair $r,s\in\{1,\dots,L\}$ .
3.

(i) First consider the reactions with rates dependent on the environment: Draw correlated Gaussian random numbers $\ell_{1},\dots,\ell_{L}$ such that $\left\langle{\ell_{r}}\right\rangle=R^{*}_{r}(\mathbf{n})$ , and $\left\langle{\ell_{r}\ell_{s}}\right\rangle-\left\langle{\ell_{r}}\right\rangle\left\langle{\ell_{s}}\right\rangle=\Xi_{rs}(\mathbf{n})$ . If $\ell_{r}<0$ for any $r\in\{1,\dots,L\}$ set $\ell_{r}=0$ .

(ii) For the remaining reactions $r\in\{L+1,\dots,R\}$ set $\ell_{r}=R_{r}(\mathbf{n})$ . These are the reactions with rates independent of the environment.
5.

Draw independent Poissonians random numbers $m_{r}$ , $r=1,\dots,R$ , each with parameter $\ell_{r}\Delta t$ .
6.

Update the state of the population, $\mathbf{n}(t+\Delta t)=\mathbf{n}(t)+\sum_{r}m_{r}{\mbox{\boldmath$\nu$}}_{r}$ .
7.

Increment time by $\Delta t$ and go to 2.

We note that the mean of the $\ell_{r}$ in step 3(i) is of order $(\lambda\Delta t)^{0}$ , and their variance of order $(\lambda\Delta t)^{-1}$ . Truncation of the $\ell_{r}$ will therefore only be required very rarely when $\lambda\Delta t\gg 1$ .

Evaluating the expressions in Eqs. (2) and (13) in step 2 requires eigenvalues $\mu_{\ell}$ of the transition matrix $\mathbf{A}(\mathbf{n})$ for the environment, the eigenvectors, $\mathbf{v}_{\ell}$ (including the stationary distribution $\mathbf{v}_{1}={\mbox{\boldmath$\rho$}}^{*}$ ), and the coefficients $c_{\ell,\sigma}$ for all $\sigma$ . If the environmental process is independent of the state of the population (the $A_{\sigma\to\sigma^{\prime}}$ are not functions of $\mathbf{n}$ ), then these quantities do not depend on $\mathbf{n}$ , and only need to be calculated once at the beginning.

In Sec. IV we first test the $\tau$ FE algorithm on different models with discrete environments. However that the algorithm can also be extended to the case of environmental dynamics with continuous states. This will be discussed in Sec. V.

IV Application of the $\tau$ FE algorithm to models with discrete environmental states

We now consider three examples of systems with discrete environmental states.

The first example (Sec. IV.1) is a genetic circuit. The role of the environment is here played by a process of binding and unbinding to promoters of the genes described by the model. Gene regulatory systems can exhibit time scale separation as discussed for example in Gunawardena (2014); Buchler et al. (2003); Lin and Buchler (2018b). Mathematically the model describes a population with two types of individuals and an environment with two states (bound/unbound). The environmental dynamics depends on the state of the population.

The second example (Sec. IV.2) is a toy model with two species in the population and three environmental states. The environmental process in this example is independent of the state of the population.

Sec. IV.3 finally focuses on a bimodal genetic switch with two species in the population, and an environmental process with three states, and with rates which depend on the state of the population.

IV.1 Genetic circuit: two system-independent environments, two species

This system models the dynamics of two genes, which produce two different regulatory proteins: X (a transcription factor) and Y (an inhibitor that titrates $X$ into an inactive complex). Specifically, we use the activator-titration circuit described in Lin and Buchler (2018a). The reactions are as follows:

	$\displaystyle\emptyset\xrightarrow{\Omega\beta_{X}}X,\quad\emptyset\xrightarrow{\Omega\beta_{Y,\sigma}}Y,\quad(\sigma=0,1)$
	$\displaystyle X\xrightarrow{\delta_{X}}\emptyset,\quad Y\xrightarrow{\delta_{Y}}\emptyset,$
	$\displaystyle X+Y\xrightarrow{\alpha/\Omega}\emptyset,$
	$\displaystyle E_{0}\xrightarrow{\lambda n_{X}\kappa_{Y}/\Omega}E_{1},\quad\text{and,}\quad E_{1}\xrightarrow{\lambda\theta_{Y}}E_{0},$		(14)

where the $E_{\sigma}$ denote states of the environment $(\sigma=0,1)$ . These two environmental states represent situations in which a transcription factor $X$ is bound to the promoter of gene $Y$ (state $E_{1}$ ), or no transcription factor is bound ( $E_{0}$ ), respectively. The first two reactions in Eq. (14) describe the production of the two proteins ( $X$ and $Y$ ). The production rates are $\beta_{X}$ and $\beta_{Y,\sigma}$ . The former is independent of the environmental state, the latter explicitly depends on $\sigma$ (i.e., on the presence or absence of a bound transcription factor). The reactions in the second line of Eq. (14) describe degradation of $X$ and $Y$ , and the reaction in the third line captures titration. The binding and unbinding processes of the transcription factor are described by the reactions in the last line. The parameter $\Omega$ in the reaction rates determines the typical number of particles in the system, for further details see Lin and Buchler (2018a). We write $n_{X}$ for the number of $X$ -particles in the system, and similarly $n_{Y}$ is the number of $Y$ -particles. One finds $n_{X},n_{Y}={\cal O}(\Omega)$ in the stationary state.

Refer to caption — Figure 1: Simulation output for the model of the genetic-circuit in Eq. (14). Panel (a) shows a sample path obtained from Gillespie simulations of the full model. Panel (b) is a sample path from the $\tau$ FE algorithm [ $\lambda=10^{3}$ in panels (a) and (b)]. Panels (c) and (d) show the stationary distributions of $n_{X}+n_{Y}$ and $n_{X}-n_{Y}$ , respectively, for $\lambda=10^{3}$ , while (e) and (f) are for $\lambda=10^{4}$ . In each panel (c)–(f) we report the Jensen-Shannon divergence (JSD) between the distributions obtained using the two different simulation methods. Remaining parameters: $\Omega=10^{3},\beta_{X}=2,\beta_{Y,0}=0,\beta_{Y,1}=10,\delta_{X}=\delta_{Y}=1,\kappa_{Y}=1,\theta_{Y}=0.5,\text{and }\alpha=10$ . For the $\tau$ FE we have used a time step $\Delta t=0.1$ .

Mathematically, the model consists of two species in the population (with numbers of particles $n_{X},n_{Y}$ ), and two environmental states, $\sigma=0,1$ . We therefore have $S=2,M=2$ . Eqs. (9) and (13) can be evaluated explicitly for this case, see also Hufton et al. (2019b). The only process affected by the state of the environment is the production of $Y$ , with rate $\beta_{Y,\sigma}$ . This rate becomes a (clipped) Gaussian random variable in the $\tau$ FE algorithm, with first moment

\left\langle{\overline{\beta}_{Y}}\right\rangle=\beta_{Y}^{*}=\dfrac{\theta_{Y}\beta_{Y,0}+n_{X}\kappa_{Y}\beta_{Y,1}/\Omega}{\theta_{Y}+n_{X}\kappa_{Y}/\Omega},

(15)

and with variance

	$\displaystyle\sigma_{\beta_{Y}\beta_{Y}}^{2}$	$\displaystyle\equiv\langle\bar{\beta}_{Y}^{2}\rangle-{\beta_{Y}^{*}}^{2}$
		$\displaystyle=\dfrac{2n_{X}\kappa_{Y}\theta_{Y}/\Omega}{\lambda\Delta t(n_{X}\kappa_{Y}/\Omega+\theta_{Y})}\left(\beta_{Y,0}-\beta_{Y,1}\right)^{2}.$		(16)

Further details of the derivation can be found in Appendix B.

Simulation results for this model are shown in Fig. 1. In panels (a) and (b) we illustrate typical sample paths obtained from Gillespie simulations of the full model (population and environment), and from the $\tau$ FE algorithm, respectively. We also show the stationary distributions for the quantities $n_{X}+n_{Y}$ and $n_{X}-n_{Y}$ as obtained from both simulation algorithms. The distributions in panels (c) and (d) are for $\lambda=10^{3}$ (i.e., moderately fast environmental dynamics), there are then remaining discrepancies between the $\tau$ FE algorithm and simulations of the full model. In panels (e) and (f) the time-scale separation is larger ( $\lambda=10^{4}$ ). The agreement improves as indicated by the Jensen-Shannon divergence (JSD) Fuglede and Topsoe (2004); Lin (1991) given in the figure.

We note at this point that the average CPU time to run a sample path up to time $t=10^{3}$ with parameters as in Fig. 1 (e) and (f) is $2.94$ seconds for the Gillespie algorithm, and $0.03$ seconds for the $\tau$ FE algorithm (with a time step $\Delta t=0.1$ ). These average simulation times are obtained from ten runs. They indicate that the $\tau$ FE algorithm can significantly increase efficiency while producing results of the quality shown in Fig. 1. We stress that our primary interest is the relative comparison of computing times, and not on absolute simulation times ¹¹1For completeness, we add that simulations were performed on a MacBook Pro (Mid 2014), with processor 2.6 GHz Dual-Core Inter Core i5, and memory 8 GB 1600 MHz DDR3..

IV.2 Birth-death process: three environments, two species

Next, we consider a two-species birth-death process subject to an external environment which can be in one of three different states. This is a toy model chosen for illustration and does not represent any specific natural system. However, it captures elements of models of population dynamics.

The species in the population are labeled $A$ and $B$ , and the environmental states $\sigma=0,1,2$ . Particles of type $A$ are produced with rate $\Omega\alpha_{\sigma}$ , and particles of type $B$ with rate $\Omega\beta_{\sigma}$ . The subscript $\sigma$ indicates explicit dependence on the state of the environment. Particles are removed with constant per capita rates $d_{A}$ and $d_{B}$ respectively. The parameter $\Omega$ again sets the typical size of the population. We write $n_{A}$ and $n_{B}$ for the number of individuals of either species. The environmental states cycle stochastically through the sequence $\sigma=0,1,2,0,\dots$ , with rate constants $\lambda k_{1}$ , $\lambda k_{2}$ and $\lambda k_{0}$ for the three transitions. Mathematically, the reactions in this model are

	$\displaystyle\emptyset\xrightarrow{\Omega\alpha_{\sigma}}A,\quad\emptyset\xrightarrow{\Omega\beta_{\sigma}}B,\quad(\sigma=0,1,2)$
	$\displaystyle A\xrightarrow{d_{A}}\emptyset,\quad B\xrightarrow{d_{B}}\emptyset,$
	$\displaystyle E_{0}\xrightarrow{\lambda k_{1}}E_{1},\quad E_{1}\xrightarrow{\lambda k_{2}}E_{2},\quad E_{2}\xrightarrow{\lambda k_{0}}E_{0},$		(17)

where as before $E_{\sigma}$ denotes the environment. The rates $k_{0},k_{1},$ and $k_{2}$ are constant parameters, independent of the population state.

Details of the calculation of the average rates and their second moments can be found in Appendix C. The average production rates for the two types of particles are

	$\displaystyle\alpha^{*}$	$\displaystyle=$	$\displaystyle\dfrac{k_{0}\alpha_{0}+k_{1}\alpha_{1}+k_{2}\alpha_{2}}{k_{0}+k_{1}+k_{2}},$
	$\displaystyle\beta^{*}$	$\displaystyle=$	$\displaystyle\dfrac{k_{0}\beta_{0}+k_{1}\beta_{1}+k_{2}\beta_{2}}{k_{0}+k_{1}+k_{2}},$		(18)

while the covariance $\sigma_{\alpha\beta}=\sigma_{\beta\alpha}\equiv\langle\bar{\alpha}\bar{\beta}\rangle-\alpha^{*}\beta^{*}$ takes the form

$\displaystyle\sigma_{\alpha\beta}$	$\displaystyle=\dfrac{\theta^{2}}{\lambda\Delta t}\Big{\{}(\alpha_{0}-\alpha_{1})(\beta_{0}-\beta_{1})\left(3k_{0}^{2}-k_{0,1}k_{0,2}\right)$
	$\displaystyle\quad+(\alpha_{1}-\alpha_{2})(\beta_{1}-\beta_{2})\left(3k_{1}^{2}-k_{1,0}k_{1,2}\right)$
	$\displaystyle\quad+(\alpha_{0}-\alpha_{2})(\beta_{0}-\beta_{2})\left(3k_{2}^{2}-k_{2,0}k_{2,1}\right)\Big{\}},$	(19)

with

\theta^{2}\equiv\dfrac{k_{0}k_{1}k_{2}}{(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})^{3}},

(20)

and $k_{\sigma,\sigma^{\prime}}=k_{\sigma}-k_{\sigma^{\prime}}$ , for $\sigma,\sigma^{\prime}\in\{0,1,2\}$ . The variance $\sigma_{\alpha\alpha}\equiv\langle\bar{\alpha}\bar{\alpha}\rangle-(\alpha^{*})^{2}$ , is obtained by replacing all instances of $\beta_{\sigma}$ on the right-hand side of Eq. (19) with $\alpha_{\sigma}$ . The analog $\sigma_{\beta\beta}$ is obtained similarly by replacing $\alpha_{\sigma}$ with $\beta_{\sigma}$ .

In Fig. 2 we report results from numerical simulations for this model, both from conventional Gillespie algorithm of the full systems of environment and population, and using the $\tau$ FE algorithm. Panels (a)–(f) show the stationary distributions of $n_{A}+n_{B}$ and $n_{A}-n_{B}$ . As seen from the data for example in panel (a) the $\tau$ FE algorithm displays deviations from Gillespie simulations of the full model when the environmental process is not sufficiently fast. We quantify these deviations again through the Jensen–Shannon divergence between the two distributions. The deviations reduce as the time scale separation $\lambda$ is increased, i.e., when the environmental process becomes faster relative to the dynamics within the population.

In order to examine if the $\tau$ FE algorithm accurately reproduces dynamical features (i.e., properties of the system beyond the stationary distribution), we show spectral densities of the time series for $n_{A}$ and $n_{B}$ in Fig. 2(g) and (h). The spectral densities are defined as

	$\displaystyle S_{AA}(\omega)=\langle\|\hat{n}_{A}(\omega)\|^{2}\rangle,$
	$\displaystyle S_{AB}(\omega)=\langle\hat{n}_{A}^{\dagger}(\omega)\hat{n}_{B}(\omega)\rangle,$		(21)

where $\hat{n}_{A}(\omega)$ and $\hat{n}_{B}(\omega)$ are the Fourier transforms of $n_{A}(t)$ and $n_{B}(t)$ , respectively. The dagger denotes complex conjugation. The data from the $\tau$ FE algorithm (open symbols) in Fig. 2(g) and (h) compares well with spectra obtained from direct Gillespie simulations of the full model (solid lines). This shows the $\tau$ FE method indeed captures the dynamics of $n_{A}$ and $n_{B}$ . We also provide a comparison against the spectral densities obtained from conventional $\tau-$ leaping simulations in the adiabatic limit, i.e., simulations with constant rates $\alpha^{*}$ and $\beta^{*}$ for the production events [Eq. (18)]. These are shown as full markers in Fig. 2(g) and (h). One then finds more substantial systematic deviations. This is because environmental fluctuations are discarded in the adiabatic limit. The $\tau$ FE algorithm on the other hand captures the stochasticity of the environment to sub-leading order in $\lambda^{-1}$ in each iteration step.

IV.3 Bimodal genetic switch: three system-state dependent environments, two species

We now consider a model studied in Lin et al. (2018); Hufton et al. (2019a), describing a single gene $G$ with a promoter site which can bind to a total of up to $N$ molecules of protein. The number of protein molecules bound, $\sigma$ , plays the role of the environment in this setting. The rate for transitions from $\sigma$ to $\sigma+1$ depends on the number of protein molecules. The reactions in this model can be summarised as follows,

$\displaystyle G_{\sigma}+P\xrightleftharpoons[\lambda k_{-}]{\lambda k_{+}/\Omega}G_{\sigma+1},$	$\displaystyle\quad\text{for}\quad\sigma<N,$
$\displaystyle G_{\sigma}\xrightarrow{\Omega b_{\sigma}}G_{\sigma}+M,$
$\displaystyle M\xrightarrow{d}\emptyset,\quad M\xrightarrow{\beta}M+P,$	$\displaystyle\quad P\xrightarrow{\delta}\emptyset,$	(22)

where $M$ and $P$ refer to molecules of mRNA and protein, respectively. The production rate $b_{\sigma}$ for mRNA depends on the number of protein molecules bound to the promoter. We refer to Lin et al. (2018); Hufton et al. (2019a) for further details. In the following we write $n_{M}$ and $n_{P}$ for the numbers of particles of either type. One interesting feature of this model is that the distribution of the protein and mRNA populations can become bimodal, as illustrated in Fig. 3. This leads to bistability, with trajectories transitioning between the two modes of the joint distribution of $n_{P}$ and $n_{M}$ . Hence, the model describes a genetic switch.

In this model only the production rate of mRNA molecules is affected by the state of the environment. The average mRNA-production rate is found as

b^{*}=\dfrac{k_{-}^{2}b_{0}+k_{-}\tilde{k}_{+}b_{1}+\tilde{k}_{+}^{2}b_{2}}{k_{-}^{2}+k_{-}\tilde{k}_{+}+\tilde{k}_{+}^{2}},

(23)

with $\tilde{k}_{+}=k_{+}n_{P}/\Omega$ . The second moment of the production rate takes the form

$\displaystyle\sigma_{bb}^{2}$	$\displaystyle\equiv\langle\bar{b}^{2}\rangle-{b^{*}}^{2}$
	$\displaystyle=\dfrac{\theta^{2}}{\lambda\Delta t}\Big{\{}(b_{0}-b_{1})^{2}k_{-}\left(k_{-}^{2}+\tilde{k}_{+}k_{-}-\tilde{k}_{+}^{2}\right)$
	$\displaystyle\quad+(b_{0}-b_{2})^{2}2k_{-}\tilde{k}_{+}\left(k_{-}+\tilde{k}_{+}\right)$
	$\displaystyle\quad+(b_{1}-b_{2})^{2}\tilde{k}_{+}\left(\tilde{k}_{+}^{2}+\tilde{k}_{+}k_{-}-k_{-}^{2}\right)\Big{\}},$	(24)

with

\theta^{2}=\dfrac{2k_{-}\tilde{k}_{+}}{\left(k_{-}^{2}+k_{-}\tilde{k}_{+}+\tilde{k}_{+}^{2}\right)^{3}}.

(25)

Details of the calculation leading to Eqs. (23) and (24) can be found in Appendix D.

Figure 3 shows the stationary joint distribution of the number of mRNA and protein molecules for different values of the time-scale separation parameter $\lambda$ . The figure shows data from Gillespie simulations of the full model [panels (a)–(c)], and data from the $\tau$ FE algorithm [panels (d)–(f)]. The $\tau$ FE algorithm captures the distribution profile with two local maxima. For low values of $\lambda$ (i.e., a relatively slow environmental process) the distribution obtained from $\tau$ FE tends to be wider than those from the Gillespie algorithm. The agreement improves for faster environments, as indicated again by the Jensen–Shannon distances in Fig. 3.

In Fig. 4, we show the distribution and means of the sojourn times $t_{\ell}$ and $t_{\rm h}$ near the lower and higher modes of the stationary distribution. More precisely this is the time between entering and leaving a designated region around each of the modes. The lower maximum of the stationary distribution is sharper than the upper maximum (Fig. 3). Accordingly, we have chosen a smaller region at the lower mode than at the upper mode. For the lower mode, we use the region $0\leq n_{M}\leq 20$ , $0\leq n_{P}\leq 1100$ which encloses the mode at $(n_{M},n_{P})=(10,500)$ . For the higher mode we use the region $20\leq n_{M}\leq 80$ , $1100\leq n_{P}\leq 2700$ enclosing the mode at $(n_{M},n_{P})=(30,1800)$ .

The data shown in the figure is constructed from one long sample path (run until $t=10^{6}$ ), recording the points in time at which the system enters or leaves either region. Gillespie simulations operate in continuous time and the $\tau$ FE algorithm in discrete time. In order to remove any artefacts resulting from this difference, the same time resolution ( $0.05$ ) is used in both algorithms for the measurement of arrival and departure times. Because the lower mode is sharper than the upper maximum and because the sizes of the two detection regions are different the sojourn time $t_{\ell}$ at the lower mode is found to be smaller than that at the higher mode, $t_{\rm h}$ .

The distributions of sojourn times in Figs. 4 (a) and (b) indicate that the $\tau$ FE algorithm captures this dynamic quantity, provided the environmental process is sufficiently fast. This is confirmed in panels (c) and (d), where we show the mean sojourn times as a function of the relative speed $\lambda$ of the environment compared to the population dynamics. As seen in both panels, the $\tau$ FE algorithm generates accurate measurements of the mean sojourn times $\left\langle{t_{\ell}}\right\rangle$ and $\left\langle{t_{\rm h}}\right\rangle$ in the limit $\lambda\gg 1$ .

At the same time, stochastic effects due to the random environmental process are captured for large but finite $\lambda$ . This can be seen in Fig. 4 (d): the mean sojourn time $\left\langle{t_{\rm h}}\right\rangle$ drops significantly as the environmental process becomes slower, and hence additional noise is injected into the population (there is no environmental noise in the adiabatic limit). While there are quantitative differences compared to exact simulations, the $\tau$ FE algorithm captures this reduction of $\left\langle{t_{h}}\right\rangle$ . Panel (c) reveals that there are also limitations to the precision of the $\tau$ FE algorithm. The mean sojourn time $\left\langle{t_{\ell}}\right\rangle$ near the lower mode is affected much less by a reduction of the time-scale separation parameter $\lambda$ than the mean sojourn time at the upper mode. This indicates that the escape from this region is driven mostly by intrinsic noise rather than by environmental stochasticity. While the data from the two algorithms remains within approximately $10\%$ for sufficiently fast environmental dynamics ( $\lambda\gtrsim 10^{4}$ ) the $\tau$ FE algorithm is unable to capture the small rise of $\left\langle{t_{\ell}}\right\rangle$ observed in Gillespie simulations for intermediate values of $\lambda$ .

$\lambda$	Gillespie	$\tau$ FE
1250	1.35	0.04
2500	1.89	0.08
5000	2.62	0.17
10000	4.30	0.31
20000	7.77	0.57

Table 1: Mean computation time (in seconds) required to simulate one sample path up to

t=10^{3}

of the bimodal genetic-switch system defined in Eq. (22). Measurements are from ten independent sample runs, using Gillespie simulations of the full model, and the

\tau

FE algorithm respectively. Parameters are as in Fig. 3. For the

\tau

FE algorithm we set

\Delta t=100/\lambda

In Table 1 we compare the the computing time required for both the Gillespie algorithm and the $\tau$ FE method for different values of $\lambda$ . The data in the table is the CPU time required to generate one sample path up to time $t=10^{3}$ , averaged over ten runs. The model parameters are as in Figs. 3 and 4.

The full model comprises the reactions in the population and the environmental switching. The rates for the former reactions are independent of $\lambda$ , the rates for the latter scale linearly in $\lambda$ . Accordingly, one expects the computing time for Gillespie simulations of the full model to be linear in $\lambda$ , with a non-zero intercept. The data in the table is consistent with this. We note that Gillespie algorithm does not require any time discretisation.

The running time for the $\tau$ FE algorithm depends on the choice of the time step. The time step in turn affects the accuracy of the outcome. If $\Delta t$ is large, then $\tau$ FE simulations are fast, but the approximation to the continuous-time full model becomes less good. On the other hand the time step must not be too small, as the construction of the algorithm requires sufficient averaging of the environmental process in each step [Eqs. (11)–(13)]. The time step for the $\tau$ FE algorithm in Figs. 3 and 4, and in Table 1 is chosen inversely proportional to $\lambda$ . This is to ensure that each time step captures a sufficient number of switches of the environmental state. Accordingly, we expect the computing time for the $\tau$ FE algorithm to scale linearly in $\lambda$ , with no intercept. Again, the running times we measured in our simulations are consistent with this expectation. Overall, Table 1 shows that the $\tau$ FE algorithm is able to generate data of the accuracy as in Figs. 3 and 4 while reducing the computing effort approximately ten fold compared to full Gillespie simulations.

V Numerical simulation of continuous-environmental systems

V.1 Setup

We turn now to systems which are subject to an environment with continuous states. Specifically, we follow Assaf et al. (2013a) and assume that the environmental state $\sigma$ follows an Ornstein–Uhlenbeck process (see also Roberts et al. (2015); Assaf et al. (2013b)),

\frac{\mathrm{d}\sigma}{\mathrm{d}t}=\lambda(m-\sigma)+\sqrt{2\lambda v^{2}}\,\eta(t),

(26)

where $\eta(t)$ is Gaussian white noise of unit amplitude, in particular $\left\langle{\eta(t)\eta(t^{\prime})}\right\rangle=\delta(t-t^{\prime})$ . The parameter $m$ is the average value of $\sigma$ in the long run, whilst $v$ controls the magnitude of noise. As before, the parameter $\lambda>0$ indicates how quickly the environment changes relative to the dynamics in the population; $\lambda$ is the equivalent of $1/\tau_{c}$ in the notation of Assaf et al. (2013a).

The probability distribution of finding the environment in state $\sigma$ at time $t$ , given that was in state $\sigma^{\prime}$ at time $t^{\prime}$ , can be obtained from the Fokker–Planck equation for the Ornstein–Uhlenbeck process, and is given by (see e.g. Klebaner (2005); Risken (1996))

	$\displaystyle q_{\sigma^{\prime}\to\sigma}(t-t^{\prime})=\sqrt{\dfrac{1}{2\pi v^{2}(1-e^{-2\lambda(t-t^{\prime})})}}$
	$\displaystyle\times\exp\left[-\frac{\left(\sigma-\sigma^{\prime}e^{-\lambda(t-t^{\prime})}-m\left(1-e^{-\lambda(t-t^{\prime})}\right)\right)^{2}}{2v^{2}\left(1-e^{-2\lambda(t-t^{\prime})}\right)}\right].$		(27)

For $t\to\infty$ (and $t^{\prime}$ fixed) this quantity tends to the stationary distribution

\rho^{*}_{\sigma}=\sqrt{\dfrac{1}{2\pi v^{2}}}\exp\left[-\dfrac{\left(\sigma-m\right)^{2}}{2v^{2}}\right].

(28)

We note that it is not a requirement for the $\tau$ FE algorithm that the environment follows an Ornstein–Uhlenbeck process. However, both functions $q_{\sigma^{\prime}\to\sigma}(t-t^{\prime})$ and $\rho^{*}_{\sigma}$ are required, as discussed in more detail below.

We proceed to describe how the $\tau$ FE algorithm can be implemented for models with continuous environments (Sec. V.2).

In the case of discrete environments, continuous-time sample paths of the full model can be generated using the conventional Gillespie algorithm. This is an exact procedure: the ensemble of these sample paths faithfully describes the statistics of the full model. In Sec. IV we have used this as a benchmark to test the $\tau$ FE algorithm. We are not aware of any analogous exact simulation method for models of discrete populations in a stochastic environment with continuous states. In order to test the $\tau$ FE algorithm we therefore compare outcomes against those from approximation methods to generate paths of the combined set of the population and the environment. Several such methods exist, we describe these in Sec. V.3. The tests of the $\tau$ FE algorithm against the baseline of these methods are described in Sec. VI.

V.2 Implementation of the $\tau$ FE algorithm for continuous environments

We proceed similar to discrete case in Sec. III, replacing the sums over $\sigma$ in Eqs. (2) and (10) with integrals. We then have

R_{r}^{*}(\mathbf{n})=\int_{-\infty}^{\infty}\mathrm{d}\sigma\rho^{*}_{\sigma}R_{r,\sigma}(\mathbf{n}),

(29)

and the relation for the second moments turns into

	$\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle=$
	$\displaystyle\frac{1}{\Delta t^{2}}\int_{-\infty}^{\infty}\mathrm{d}\sigma\int_{-\infty}^{\infty}\mathrm{d}\sigma^{\prime}\int_{t}^{t+\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{t+\Delta t}\mathrm{d}t_{2}$
	$\displaystyle\times\Big{\{}\rho^{*}_{\sigma}q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})$
	$\displaystyle\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}\Big{\}}.$		(30)

Depending on the form of the stationary distribution $\rho^{*}_{\sigma}$ , the kernel $q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})$ and the rates $R_{r,\sigma}(\mathbf{n})$ the integrals in Eqs. (29) and (30) can be carried out, and closed-form analytical expressions can be obtained. In Sec. VI we explore a number of different examples, further scenarios are also discussed Appendix F. Once the average rates and the second moments are calculated, the $\tau$ FE algorithm is implemented as described in Sec. III.3.

V.3 Conventional simulation approaches for discrete populations in continuous environments

In this section we summarise ‘conventional’ approaches to simulating discrete Markovian systems subject to environmental dynamics with continuous states. By ‘conventional’ we mean methods which produce explicit (approximate) sample paths of the environmental process. This is in contrast to the $\tau$ FE algorithm, which generates paths only of the system proper.

V.3.1 Gillespie algorithm with discretised environmental states (GADE)

This approach is based on a discretisation of the space of environmental states, time remains continuous. Once such a discretisation for the environmental states is carried out, the combined states of the population and environment are also discrete. Simulations can be carried out using the conventional Gillespie method. We will refer to this method as GADE (Gillespie approach with discretised environment).

The key step in this approach is to find an appropriate dynamics in the space of discretised environmental states. We describe this in the context of the Ornstein–Uhlenbeck process in Eq. (26). We discretise the environmental state into integer multiples of $\Delta\sigma$ , i.e., the environment takes states $\dots,-2\Delta\sigma,-\Delta\sigma,0,\Delta\sigma,2\Delta\sigma,\dots$ . Transitions from one state $k\Delta\sigma$ can only occur to states $(k\pm 1)\Delta\sigma$ . The transition rates are constructed such that this discrete process recovers the continuous Ornstein–Uhlenbeck dynamics in the limit $\Delta\sigma\to 0$ . The details of the construction are described in Appendix E, we here only report the main outcome. Specifically, the rates to transition from state $k\Delta\sigma$ to $(k\pm 1)\Delta\sigma$ can be chosen as

T_{k}^{\pm}=\frac{\lambda}{2\Delta\sigma}\left[\pm(m-k\Delta\sigma)+\frac{2v^{2}}{\Delta\sigma}\right].

(31)

This process can then be simulated using the standard Gillespie algorithm, along with the events in the population. We note that the rates $T_{k}^{\pm}$ need to be non-negative, i.e., we require $|m-k\Delta\sigma|<2v^{2}/\Delta\sigma$ , for all $k$ . In practice, this can be achieved by truncating the set of possible states $k\Delta\sigma$ . More precisely, we disallow transitions out of the region $\{k:|m-k\Delta\sigma|\leq K\}$ , with a given cutoff $K$ . Provided that $K$ is sufficiently large truncations will only be required rarely. Once a cutoff $K$ is chosen we must require $\Delta\sigma\leq 2v^{2}/K$ to guarantee non-negativity of the $T_{k}^{\pm}$ . The variance of the Ornstein–Uhlenbeck process for $\sigma$ is given by $v^{2}$ in the long run [Eq. (28)], so $K\propto v$ is a sensible choice. This results in maximum value for $\Delta\sigma$ which is also proportional to $v$ .

V.3.2 Discrete-time simulation with explicit environmental dynamics (DEED)

Approximate sample paths of the combined system of population and environment can also be generated in a discrete-time simulation. We refer to this as DEED (discrete-time simulation with explicit environmental dynamics). The time step $\Delta t$ needs to be sufficiently small to capture the details of the environmental process with characteristic time scale $\tau_{c}=\lambda^{-1}$ . We therefore require $\Delta t\lesssim\lambda^{-1}$ . One possible implementation is as follows:

1.

Suppose we have arrived at time $t$ , and the state of the population is $\mathbf{n}(t)$ and that of the environment $\sigma(t)$ . Obtain $\sigma(t+\Delta t)$ from Eq. (26) using the Euler-Maruyama method Maruyama (1955).
2.

Use $\sigma(t)$ and $\mathbf{n}(t)$ to calculate the rates $p_{r}(t)=\Delta t\times R_{r,\sigma(t)}[\mathbf{n}(t)]$ for $r=1,\dots,R$ .
3.

Provided $\Delta t$ is small enough, the $p_{r}(t)$ are all less than one. To lowest order in $\Delta t$ they are the probabilities that a reaction of type $r$ occurs in the next $\Delta t$ . For each $r=1,\dots,R$ implement one reaction of this type with probability $p_{r}(t)$ . With probability $1-p_{r}(t)$ no reaction of type $r$ occurs. Executing all reactions that fire, one obtains $\mathbf{n}(t+\Delta t)$ .
4.

Increment time by $\Delta t$ , and go to step 1.

Step 3 disregards the possibility that a particular reaction fires multiple times during one time step. This is a valid approximation, provided that the $p_{r}(t)=\Delta t\times R_{r,\sigma(t)}$ are much smaller than one. As an alternative step 3 could be replaced by a conventional $\tau$ -leaping step. The number of reactions of type $r$ that fire is then a Poissonian random variable with parameter $p_{r}(t)$ .

V.3.3 Thinning algorithm by Lewis

A population subject to a dynamic external environment with continuous state space can also be simulated using the so-called thinning algorithm by Lewis Lewis and Shedler (1979). This algorithm generates a statistically faithful ensemble of sample paths for Markovian systems with discrete states and transition rates with explicit external time dependence.

In the context of our model the population is such a system. If the environmental dynamics is independent of the population then realisations $\sigma(t)$ for the environment can be generated in advance independently from the population. For instance, sample solutions of the Ornstein-Uhlenbeck process in Eq. (26) could be generated. Each such realisation $\sigma(t)$ then determines a realisation of time-dependent rates $R_{r}(\mathbf{n},t)\equiv R_{r,\sigma(t)}(\mathbf{n})$ for the population. The Lewis algorithm can then be used to produce sample paths for the population dynamics.

In practice, numerical approximation schemes are required to generate realisations for the environment. For example, Eq. (26) can be solved numerically using the Euler–Maruyama method, with time step $\Delta t$ . As discussed above this time step needs to be sufficiently small ( $\Delta t\lesssim\lambda^{-1}$ ) to resolve the short-time features of the environmental process. The Lewis algorithm then uses this as an input and generates sample paths for the population in continuous time.

VI Application of the $\tau$ FE to continuous-environmental models

In this section we test the $\tau$ FE algorithm on a number of different examples of models with continuous environmental states. Simulation outcomes are compared against those from the algorithms described in Sec. V.3.

VI.1 Toy model: Population dynamics with production and removal rates proportional to $\sigma^{2}$

We first consider a production-removal process for a single species. The environmental state $\sigma(t)$ follows the Ornstein–Uhlenbeck process in Eq. (26). The corresponding transition kernel $q_{\sigma\to\sigma^{\prime}}(\tau)$ is given in Eq. (27), and the stationary distribution $\rho_{\sigma}^{*}$ in Eq. (28). The production rate in the population is assumed to be $R_{b,\sigma}=\beta\sigma^{2}$ , and the removal rate $R_{d,\sigma}=\delta\sigma^{2}$ . These are not chosen with any particular natural system in mind, instead this example serves as an illustration (see also Appendix F for similar calculations for two related examples).

From (29) we obtain

	$\displaystyle R_{b}^{*}$	$\displaystyle=$	$\displaystyle\beta(m^{2}+v^{2}),$
	$\displaystyle R_{d}^{*}$	$\displaystyle=$	$\displaystyle\delta(m^{2}+v^{2}).$		(32)

The second moments of the rates $\overline{R}_{b}(n)$ and $\overline{R}_{d}(n)$ can be calculated from Eq. (30). We find

	$\displaystyle\left\langle{\overline{R}_{b}(n)\overline{R}_{d}(n)}\right\rangle-R_{b}^{}(n)R_{d}^{}(n)=$
	$\displaystyle\frac{\beta\delta v^{2}e^{-2\lambda\Delta t}}{\lambda^{2}\Delta t^{2}}\Big{[}8m^{2}e^{\lambda\Delta t}+$
	$\displaystyle e^{2\lambda\Delta t}\left(8m^{2}(\lambda\Delta t-1)+v^{2}(2\lambda\Delta t-1)\right)+v^{2}\Big{]}.$		(33)

for the covariance. The expressions for the variances are similar, with suitable replacements $\beta\delta\to\beta^{2}$ and $\beta\delta\to\delta^{2}$ in the prefactor in Eq. (33). This covariance matrix and the means in Eq. (32) are then used in the $\tau$ FE algorithm.

$\lambda^{-1}$	GADE	DEED	$\tau$ FE
$1\times 10^{-2}$	28.47	3.84	0.79 $\times 10^{-2}$
$5\times 10^{-3}$	53.20	7.88	0.16 $\times 10^{-1}$
$1\times 10^{-3}$	288.30	40.91	0.08
$5\times 10^{-4}$	576.69	82.78	0.15
$1\times 10^{-4}$	3022.47	397.63	0.79

Table 2: Mean computing time (in seconds) required for one simulation run of the model described in Sec. VI.1 until

t=10^{3}

. Data is from ten independent runs, parameters are as in Figure 5, i.e.,

\beta=1.1,\delta=1.0,m=1,

and

v^{2}=5\times 10^{-4}

. For GADE we set

\Delta\sigma=10^{-3}

; for the DEED approach we set

\lambda\Delta t=1/100

; for the

\tau

FE algorithm we set

\lambda\Delta t=10

Figure 5 shows simulation results from the $\tau$ FE algorithm, as well as from the GADE and DEED schemes (Secs. V.3.1 and V.3.2 respectively). Panel (a) shows that all simulation methods result in linear growth (parameters are such that $\beta>\delta$ , i.e., the growth rate is always larger than the death rate). Panel (b) confirms that GADE and DEED both generate the correct statistics for the stationary distribution of the environmental process [the solid line is the Gaussian distribution in Eq. (28)]. In panel (c) we focus on a fixed time $t=10$ , and show that all three simulation methods results in very similar distributions for the number of individuals in the population $n$ at that time. Panel (d) finally shows a dynamic quantity, the Fourier spectrum $S(\omega)$ of the time series $n(t)$ , or equivalently the Fourier transform of the correlation function of $n$ . Again, all three simulation methods produce very similar results.

In Table 2 we compare the average computing time required by the different algorithms to generate a trajectory up to time $t=10^{3}$ . We show data for varying values of the typical time scale $\lambda^{-1}$ of the environmental process. GADE does not require any discretisation of time. For the DEED approach we use $\Delta t=1/(100\lambda)$ . For the $\tau$ FE method we choose $\Delta t=10/\lambda$ . This is in-line with the requirements $\Delta t\lesssim\lambda^{-1}$ for DEED, and $\Delta t\gtrsim\lambda^{-1}$ for $\tau$ FE. The choice of time steps will be discussed in further detail below.

The data in the table indicates that the simulation time scales approximately linearly with $\lambda$ for all three algorithms tested, provided $\lambda$ is sufficiently large. This is to be expected: The rates for the environmental events in the GADE simulations (Sec. V.3.1) scale as $\lambda$ , and therefore dominate the events in the population for $\lambda\gg 1$ . Each typical Gillespie step then advances time by an amount proportional to $\lambda^{-1}$ , and ${\cal O}(\lambda)$ such steps are required to reach the designated end time. A similar argument applies to the DEED algorithm (Sec. V.3.2) and for the $\tau$ FE algorithm: For both of these we use time steps $\Delta t\propto\lambda^{-1}$ , so again the number of iteration steps required scales as $\lambda$ .

The key message from Table 2 is that, for the choice of time steps made in the table, the computing time required by the $\tau$ FE algorithm is substantially lower than that for the other two simulation methods. Given the linear dependence on $\lambda$ , this increase in efficiency can be extrapolated to environments operating on time scales faster than the smallest time scale shown in the table (i.e., to the range $\lambda>10^{4}$ ). We note that, due to the smaller time step, DEED produces a finer resolution of sample paths in time than $\tau$ FE. When we make our comparison we have average macroscopic quantities in mind (such as those in Fig. 5), and not necessarily the generation of individual paths with the highest possible resolution in time.

We now briefly discuss the choice of time steps for the $\tau$ FE method and for DEED. In principle, we could have increased or decreased the step for either method. This would then reduce or increase the computing time required to reach the designated end point. It might also affect the accuracy of the outcome. Our choice of $\Delta t=10/\lambda$ for $\tau$ FE is motivated by the good agreement with GADE in Fig. 5, noting that GADE does not require any discretisation of time. Similarly, for the example discussed below in Sec. VI.2 good agreement with analytical predictions is found for this choice, see the regime of small $\lambda^{-1}$ in Fig. 6. Our conclusion is therefore that the $\tau$ FE algorithm is able to produce results of the accuracy as in Fig. 5 with computing times as reported in Table 2.

The DEED algorithm requires $\Delta t\lesssim\lambda^{-1}$ to be able to resolve the environmental dynamics. Our choice $\Delta t=1/(100\lambda)$ in Table 2 is well below this requirement, and the algorithm can in principle be speed up by choosing a larger time step. If we were to exhaust the limit and used $\Delta t=\lambda^{-1}$ for DEED then this would reduce the computing time by about a factor of one hundred in Table 2. For for $\lambda^{-1}=10^{-3}$ this would mean a reduction from approximately $40$ seconds to $0.4$ seconds per sample path. Using this larger time step also results in noticeable deviations in measurements of the quantities in Fig. 5 from continuous-time GADE simulations. But even if we accept this and use the hundred fold larger time step for DEED the $\tau$ FE algorithm would remain approximately five times faster, requiring $0.08$ seconds per sample path at $\lambda^{-1}=10^{-3}$ , see Table 2.

We have also conducted tests with Lewis’ thinning algorithm. To do this we have first generated sample paths of the Ornstein–Uhlenbeck process for the environment [Eq. (26)] using an Euler–Maruyama scheme. This is then fed into the Lewis’ algorithm for systems with time dependent rates. Given that the typical time scale of the environment is $\lambda^{-1}$ , the largest sensible time step for the Euler–Maruyama scheme is $\Delta t=\lambda^{-1}$ , similar to DEED. This choice minimises the computing time for the Lewis’ approach. We therefore use this time step to compare the efficiency of the Lewis’ approach with that of $\tau$ FE. We find that the thinning algorithm is considerably slower than the $\tau$ FE approach. For $\lambda^{-1}=10^{-3}$ , for example, we obtain a simulation time of approximately $13$ seconds per run up to $t=10^{3}$ compared to $0.08$ seconds for $\tau$ FE (see Table 2).

VI.2 Genetic switch with Hill-like regulatory function

As a final example we consider a model of protein production subject to a continuous environment discussed in Assaf et al. (2013a). The model entails positive feedback, in that the presence of protein has the potential to increase production of protein. There is one single species in the model (protein), we write the number of protein molecules as $n$ . We also define $x=n/\Omega$ , where $\Omega$ is again a model parameter setting the typical size of the system. The production rate of protein is given by

f(x,\sigma)=\alpha_{0}+(1-\alpha_{0}+\sigma)\Theta(x-x_{0}),

(34)

where $0<\alpha_{0}<1$ and $x_{0}>0$ are constants, and where $\Theta(x)$ is the Heaviside function. Protein molecules also decay with unit rate. In the absence of environmental influence ( $\sigma\equiv 0$ ), the production rate is thus unity when $x>x_{0}$ , and $\alpha_{0}<1$ when $x<x_{0}$ . For $\sigma\equiv 0$ the mean re-scaled number of protein follows the rate equation

\dot{\bar{x}}=f(\bar{x})-\bar{x},

(35)

where time is measured in units of generations. We choose $\alpha_{0}<x_{0}<1$ . Eq. (35) has three fixed points $x_{1}^{*}<x_{2}^{*}<x_{3}^{*}$ , where $x_{1}^{*}=\alpha_{0}$ and $x_{3}^{*}=1$ are attractors, and $x_{2}^{*}=x_{0}$ is a repeller. Similar to Assaf et al. (2013a), we refer to $x_{1}^{*}$ and $x_{3}^{*}$ as the ‘low’ and ‘high’ states, respectively.

The environmental process $\sigma(t)$ modulates the production rate when $x>x_{0}$ . As in Assaf et al. (2013a) we asssume that $\sigma$ follows an Ornstein-Uhlenbeck process of the form given in Eq. (26). The noisy system has the potential to switch between the ‘high’ and ‘low’ states. To test the performance of the $\tau$ FE algorithm, we focus on the mean switching time (MST) to transit from the high state to the low state. This time is studied and calculated in Assaf et al. (2013a), we denote it by $\left\langle{\tau_{\text{high}\rightarrow\text{low}}}\right\rangle$ . In simulations we start the system in the high state, and measure the first time the system reaches the low state.

Only the production of protein is affected by the state $\sigma$ of the environment, we write $R_{{\rm prod},\sigma}(\mathbf{n})=f(x,\sigma)$ , with $f$ as in Eq. (34). Inserting this in Eqs. (29) and (30), and after straightforward calculations, we obtain

R_{\rm prod}^{*}(n)=\alpha_{0}+(1-\alpha_{0})\Theta(n/\Omega-x_{0}),

(36)

and the second moment

	$\displaystyle\left\langle{(\overline{R}_{\rm prod}(n))^{2}}\right\rangle-[R_{\rm prod}^{*}(n)]^{2}=$
	$\displaystyle\dfrac{2v^{2}}{\lambda^{2}\Delta t^{2}}\left[\lambda\Delta t+(e^{-\lambda\Delta t}-1)\right]\Theta(n/\Omega-x_{0}).$		(37)

In Fig. 6 we show the MST measured in simulations using the different approaches described in in Sec. V. Assaf et al. Assaf et al. (2013a) report non-monotonous behaviour of the MST as a function of $\tau_{c}=\lambda^{-1}$ . As seen in Fig. 6 the $\tau$ FE algorithm reproduces this behaviour. For fast environmental dynamics (low $\lambda^{-1}$ ) the MST obtained from the $\tau$ FE algorithm is in good agreement with measurements obtained from the other simulation methods, and with the analytical approximations from Assaf et al. (2013a). The agreement extends over several decades of values of $\tau_{c}=\lambda^{-1}$ .

At the same time we observe that the $\tau$ FE algorithm requires significantly less computing time than the GADE or DEED approaches. For $\lambda^{-1}=10^{-1}$ for example, we measured an average computing time of $2\times 10^{-3}$ seconds to generate one run of the system up to time $10^{3}$ with the $\tau$ FE algorithm ( $\Delta t=10/\lambda$ ). GADE required $0.674$ seconds, and DEED $4.2$ seconds (for a time step $\Delta t=10^{-4}$ ).

We note that we have implemented DEED as described in Sec. V.3.2. In particular at most one reaction of each type can fire in each time step (step 3 of the algorithm). This requires a sufficiently small time step $\Delta t$ to ensure $p_{r}(t)<1$ for all $r$ . This is achieved by our choice $\Delta t=10^{-4}$ . Alternatively step 3 of the DEED algorithm could be replaced by a (conventional) $\tau$ -leaping step. Larger choices of the time step $\Delta t$ are then possible, up to the limit of $\Delta t\approx\lambda^{-1}$ to ensure that the environmental dynamics are captured appropriately. Focusing on $\lambda^{-1}=10^{-1}$ we expect that increasing the time step by a factor of a thousand (from $10^{-4}$ to $10^{-1}$ ) would reduce the simulation time by at most a factor of a thousand for a $\tau$ -leaping version of DEED. This would result in a computing time of approximately $4\times 10^{-3}$ for one simulation run up to $t=10^{3}$ instead of the $4.2$ seconds reported for DEED in the previous paragraph. This is comparable with the CPU time required by the $\tau$ FE algorithm ( $2\times 10^{-3}$ seconds), but would resolve environmental fluctuations with lower accuracy. For example one observes systematic deviations for the stationary distribution of the environment in Fig. 5(b).

VII Discussion and conclusions

In summary, we have presented $\tau$ FE, a variant of the $\tau$ -leaping stochastic simulation algorithm for systems subject to fast environmental dynamics. Just like conventional $\tau$ -leaping the algorithm operates in discrete time. The rates of the reactions in the system proper are treated as constant during each time step, and the numbers of different reactions firing in one step have Poissonian statistics.

The key difference compared to conventional $\tau$ -leaping is the external environment. In the full continuous-time model reaction rates which depend on the environmental state fluctuate in time even when the state of the population does not change. An adiabatic approximation would consist of assuming an infinitely fast environment and of replacing the reaction rates by their means with respect to the stationary distribution of the environmental process. This is justified if the relaxation time scale of the environmental process is infinitely shorter than the time step of the simulation.

The $\tau$ FE algorithm goes beyond this approximation, and is based on time averages of reaction rates over the finite time step. For finite speeds of the environment these average rates are random variables. If the environmental dynamics is fast we can make a Gaussian approximation. The rates feeding into the $\tau$ -leaping step are clipped Gaussian random numbers designed to retain the first and second moments of the actual environmental dynamics. It is important to note that this not the same as drawing an environmental state $\sigma$ from the stationary distribution $\rho^{*}_{\sigma}$ , and then using the rates $R_{r,\sigma}(\mathbf{n})$ for the next $\tau$ -leaping step. Instead, the covariance matrix of the rates $\overline{R}_{r}(\mathbf{n})$ in Eq. (8) is calculated as described in Eqs. (10) for discrete environments, and in Eq. (30) for continuous environmental states.

The choice of time step for the $\tau$ FE algorithm requires careful consideration. On the one hand the time step must be long enough to justify the averaging procedure over the environmental dynamics and the Gaussian assumption for the reaction rates in the $\tau$ -leaping step. Broadly speaking $\lambda\Delta t$ must be sufficiently large ( $\lambda\Delta t\gg 1$ ). At the same time the so-called leap condition for the $\tau$ -leaping part of the algorithm must be fulfilled (Gillespie, 2001). This means that the state of the system must not change significantly in each iteration step, as a constant state $\mathbf{n}$ of the population is an assumption made in setting up the $\tau$ -leaping. Mathematically, this means that the change of the number of particles in the system in a time step must be much smaller than the typical number of particles in the system. Assuming that the stoichiometric coefficients do not scale with the system size $\Omega$ this means that $\Delta t\times R_{r,\sigma}(\mathbf{n})$ must be much smaller than $\Omega$ . Noting that $R_{r,\sigma}(\mathbf{n})$ is of order $\Omega$ in many applications we thus require that $\Delta t$ is much smaller than one. For $\lambda\gg 1$ and $\Delta t$ proportional to $\lambda^{-1}$ this condition is often relatively easy to meet in practice.

We have tested the $\tau$ FE algorithm on a number of systems with discrete and continuous environments. This includes examples of systems which can be addressed analytically and models motivated by applications in biology. Our tests focus on stationary distributions, but also dynamic features such as Fourier spectra of fluctuations or first-passage time distributions. In all cases we have tested the $\tau$ FE method produces good agreement with results from conventional simulation methods in the regime of fast environmental dynamics. This is the regime for which $\tau$ FE is designed. Naturally, quantitative deviations are found when the time scales of the environmental dynamics and system proper are insufficiently separated.

We stress that $\tau$ FE goes beyond simulations in the adiabatic limit, and is able to capture the dependence of macroscopic observables on the time scale separation, provided this dependence is sufficiently strong [see e.g. Figs. 4(d) and 6)]. At the same time our analysis also reveals limitations of the algorithm. If the dependence of observables on the time scale separation is weak such as in Fig. 4(c), then $\tau$ FE may not be able to fully resolve these dependencies. When the environment is fast the quantitative agreement with simulations of the full system is however still within approximately $10\%$ in the example in Fig. 4(c).

The computing time required for the $\tau$ FE algorithm to generate sample paths up to a designated end time is proportional to the inverse time step. The time step on the other hand is typically a multiple of the characteristic time scale $\lambda^{-1}$ of the environmental dynamics. This means that the computational effort scales approximately linearly in the time scale separation $\lambda$ . In all cases we have tested we found that $\tau$ FE is considerably more efficient for the measurement of macroscopic quantities than alternative simulation algorithms.

In summary, we think the $\tau$ FE algorithm has passed the initial selection of tests presented in this paper. It provides an promising approach to probing the regime of fast environmental dynamics, and captures effects induced by extrinsic noise beyond the adiabatic limit. The algorithm is particularly valuable for systems in which the regime of intermediate time scale separation can be accessed with conventional simulation methods. The accuracy of the $\tau$ FE algorithm can then be assessed in this regime (an example can be found in Fig. 6). If the comparison is favourable, then it is justified to use $\tau$ FE in the regime of increasing time scale separation.

Acknowledgements

We would like to thank Yen Ting Lin (Los Alamos) for useful discussions and feedback on earlier versions of the manuscript. EBC acknowledges a President’s Doctoral Scholarship (The University of Manchester). TG acknowledges funding from the Spanish Ministry of Science, Innovation and Universities, the Agency AEI and FEDER (EU) under the grant PACSS (RTI2018-093732-B-C22), and the Maria de Maeztu program for Units of Excellence in R&D (MDM-2017-0711).

References

Murray (2002) J. D. Murray, Mathematical Biology I. An Introduction, 3rd ed., Interdisciplinary Applied Mathematics, Vol. 17 (Springer, New York, 2002).
Murray (2003) J. D. Murray, Mathematical Biology II: Spatial Models and Biomedical Applications, Interdisciplinary Applied Mathematics, Vol. 18 (Springer New York, 2003).
Goel and Richter-Dyn (2004) N. S. Goel and N. Richter-Dyn, Stochastic Models in Biology (The Blackburn Press, 2004).
Ewens (2004) W. Ewens, Mathematical Population Genetics 1 (Springer-Verlag, New York, 2004).
Traulsen and Hauert (2010) A. Traulsen and C. Hauert, “Stochastic evolutionary game dynamics,” in Reviews of Nonlinear Dynamics and Complexity (Wiley-VCH Verlag GmbH and Co. KGaA, 2010) pp. 25–61.
Castellano et al. (2009) C. Castellano, S. Fortunato, and V. Loreto, Reviews of Modern Physics 81, 591 (2009).
Keeling and Rohani (2008) M. J. Keeling and P. Rohani, Modeling Infectious Diseases in Humans and Animals (Princeton University Press, 2008).
Kampen (2007) N. V. Kampen, Stochastic processes in physics and chemistry (North Holland, 2007).
Acar et al. (2008) M. Acar, J. T. Mettetal, and A. Van Oudenaarden, Nature Genetics 40, 471 (2008).
Patra and Klumpp (2015) P. Patra and S. Klumpp, Physical Biology 12, 046004 (2015).
Wienand et al. (2017) K. Wienand, E. Frey, and M. Mobilia, Physical Review Letters 119, 158301 (2017).
Wienand et al. (2018) K. Wienand, E. Frey, and M. Mobilia, Journal of The Royal Society Interface 15, 20180343 (2018).
Taitelbaum et al. (2020) A. Taitelbaum, R. West, M. Assaf, and M. Mobilia, Physical Review Letters 125, 048105 (2020).
Black and McKane (2010) A. J. Black and A. J. McKane, Journal of Theoretical Biology 267, 85 (2010).
Gardiner (2004) C. W. Gardiner, Handbook of stochastic methods for physics, chemistry and the natural sciences, 3rd ed., Springer Series in Synergetics, Vol. 13 (Springer-Verlag, Berlin, 2004).
Kussell and Leibler (2005) E. Kussell and S. Leibler, Science 309, 2075 (2005).
Kepler and Elston (2001) T. B. Kepler and T. C. Elston, Biophysical Journal 81, 3116 (2001).
Thattai and Van Oudenaarden (2004) M. Thattai and A. Van Oudenaarden, Genetics 167, 523 (2004).
Swain et al. (2002) P. S. Swain, M. B. Elowitz, and E. D. Siggia, Proceedings of the National Academy of Sciences (USA) 99, 12795 (2002).
Assaf et al. (2013a) M. Assaf, E. Roberts, Z. Luthey-Schulten, and N. Goldenfeld, Physical Review Letters 111, 058102 (2013a).
Duncan et al. (2015) A. Duncan, S. Liao, T. Vejchodskỳ, R. Erban, and R. Grima, Physical Review E 91, 042111 (2015).
Assaf et al. (2013b) M. Assaf, M. Mobilia, and E. Roberts, Physical Review Letters 111, 238101 (2013b).
Ashcroft et al. (2014) P. Ashcroft, P. M. Altrock, and T. Galla, Journal Royal Society Interface 11, 20140663 (2014).
West et al. (2018) R. West, M. Mobilia, and A. M. Rucklidge, Physical Review E 97, 022406 (2018).
Assaf et al. (2008) M. Assaf, A. Kamenev, and B. Meerson, Physical Review E 78, 041123 (2008).
Hufton et al. (2019a) P. G. Hufton, Y. T. Lin, and T. Galla, Physical Review E 99, 032122 (2019a).
Eldar and Elowitz (2010) A. Eldar and M. Elowitz, Nature 467, 167 (2010).
Gillespie (1976) D. T. Gillespie, Journal of Computational Physics 22, 403 (1976).
Gillespie (1977) D. T. Gillespie, The Journal of Physical Chemistry 81, 2340 (1977).
Bowen et al. (1963) J. Bowen, A. Acrivos, and A. Oppenheim, Chemical Engineering Science 18, 177 (1963).
Segel and Slemrod (1989) L. A. Segel and M. Slemrod, SIAM Review 31, 446 (1989).
Lin and Buchler (2018a) Y. T. Lin and N. E. Buchler, Journal of The Royal Society Interface 15, 20170804 (2018a).
Newby and Bressloff (2010) J. M. Newby and P. C. Bressloff, Bulletin of Mathematical Biology 72, 1840 (2010).
Bressloff (2016) P. C. Bressloff, Physical Review E 94, 042129 (2016).
Bressloff (2017a) P. C. Bressloff, Physical Review E 95, 012124 (2017a).
Bressloff (2017b) P. C. Bressloff, Physical Review E 95, 012138 (2017b).
Gillespie (2001) D. T. Gillespie, The Journal of Chemical Physics 115, 1716 (2001).
Hufton et al. (2019b) P. G. Hufton, Y. T. Lin, and T. Galla, Physical Review E 99, 032121 (2019b).
Bressloff and Newby (2014) P. C. Bressloff and J. M. Newby, Physical Review E 89, 042701 (2014).
Hufton et al. (2016) P. G. Hufton, Y. T. Lin, T. Galla, and A. J. McKane, Physical Review E 93, 052119 (2016).
Gunawardena (2014) J. Gunawardena, The FEBS Journal 281, 473 (2014).
Buchler et al. (2003) N. E. Buchler, U. Gerland, and T. Hwa, Proceedings of the National Academy of Sciences (USA) 100, 5136 (2003).
Lin and Buchler (2018b) Y. T. Lin and N. E. Buchler, Journal Royal Society Interface 15 (2018b).
Fuglede and Topsoe (2004) B. Fuglede and F. Topsoe, in International Symposium on Information Theory, 2004. Proceedings. (2004) p. 31.
Lin (1991) J. Lin, IEEE Transactions on Information Theory 37, 145 (1991).
Note (1) For completeness, we add that simulations were performed on a MacBook Pro (Mid 2014), with processor 2.6 GHz Dual-Core Inter Core i5, and memory 8 GB 1600 MHz DDR3.
Lin et al. (2018) Y. T. Lin, P. G. Hufton, E. J. Lee, and D. A. Potoyan, PLOS Computational Biology 14, 1 (2018).
Roberts et al. (2015) E. Roberts, S. Be’er, C. Bohrer, R. Sharma, and M. Assaf, Physical Review E 92, 062717 (2015).
Klebaner (2005) F. C. Klebaner, Introduction to stochastic calculus with applications (World Scientific Publishing Company, Singapore, 2005).
Risken (1996) H. Risken, in The Fokker-Planck Equation (Springer, 1996) pp. 63–95.
Maruyama (1955) G. Maruyama, Rendiconti del Circolo Matematico di Palermo 4, 48 (1955).
Lewis and Shedler (1979) P. W. Lewis and G. S. Shedler, Naval Research Logistics Quarterly 26, 403 (1979).

Appendix A Second moments of rates

In this Appendix we calculate the second moments of the quantities $\overline{R}_{r}(\mathbf{n})$ ( $r=1,\dots,R$ ) defined in Eq. (8). Without loss of generality we assume that the time interval in question starts at $t=0$ , the end point is then $\Delta t$ . Assuming the space of environmental states is discrete, we have

$\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle$	$\displaystyle=$	$\displaystyle\frac{1}{\Delta t^{2}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{0}^{\Delta t}\mathrm{d}t_{2}\left\langle{R_{r,\sigma(t_{1})}(\mathbf{n})R_{s,\sigma(t_{2})}(\mathbf{n})}\right\rangle$	(38)
	$\displaystyle=$	$\displaystyle\frac{1}{\Delta t^{2}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}\left\langle{R_{r,\sigma(t_{1})}(\mathbf{n})R_{s,\sigma(t_{2})}(\mathbf{n})}\right\rangle+\frac{1}{\Delta t^{2}}\int_{0}^{\Delta t}\mathrm{d}t_{2}\int_{t_{2}}^{\Delta t}\mathrm{d}t_{1}\left\langle{R_{r,\sigma(t_{1})}(\mathbf{n})R_{s,\sigma(t_{2})}(\mathbf{n})}\right\rangle$
	$\displaystyle=$	$\displaystyle\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}\rho^{*}_{\sigma}q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})$
		$\displaystyle+\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{0}^{\Delta t}\mathrm{d}t_{2}\int_{t_{2}}^{\Delta t}\mathrm{d}t_{1}~{}\rho^{*}_{\sigma^{\prime}}q_{\sigma^{\prime}\to\sigma}(t_{1}-t_{2})R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})$
	$\displaystyle=$	$\displaystyle\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}\rho^{*}_{\sigma}q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})$
		$\displaystyle+\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}\rho^{*}_{\sigma}q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n}).$

In the first step we have applied the definition of the over-bar average [Eq. (8)]. In the third step we have carried out the average over realisations of the environmental process. In the last step we have renamed $t_{1}\leftrightarrow t_{2}$ and $\sigma\leftrightarrow\sigma^{\prime}$ in the second term. Therefore

\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle

\displaystyle=

\displaystyle\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}\rho^{*}_{\sigma}q_{\sigma\to\sigma^{\prime}}(t_{2}-t_{1})\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}.

(39)

Up to a shift of the start point of the time step, this is identical to Eq. (10).

As explained in Section V.2, the sums over $\sigma$ become integrals when the environment takes continuous states. We then find Eq. (30).

When the environmental space is discrete, we can use Eq. (7) and find

$\displaystyle\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle$	$\displaystyle=\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}\rho^{}_{\sigma}\rho^{}_{\sigma^{\prime}}\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}$
	$\displaystyle+\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\sum_{\ell=2}^{M}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}\rho^{*}_{\sigma}c_{\ell,\sigma}v_{\ell,\sigma^{\prime}}e^{-\lambda\mu_{\ell}(t_{2}-t_{1})}\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}$
	$\displaystyle=R_{r,\rm avg}(\mathbf{n})R_{s,\rm avg}(\mathbf{n})$
	$\displaystyle+\frac{1}{\Delta t^{2}}\sum_{\sigma\sigma^{\prime}}\sum_{\ell=2}^{M}\rho^{*}_{\sigma}c_{\ell,\sigma}v_{\ell,\sigma^{\prime}}\big{[}R_{r,\sigma}(\mathbf{n})R_{s,\sigma^{\prime}}(\mathbf{n})+R_{r,\sigma^{\prime}}(\mathbf{n})R_{s,\sigma}(\mathbf{n})\big{]}\int_{0}^{\Delta t}\mathrm{d}t_{1}\int_{t_{1}}^{\Delta t}\mathrm{d}t_{2}~{}e^{\lambda\mu_{\ell}(t_{2}-t_{1})}.$	(40)

Appendix B Further details for systems with two species and two environmental states

The case of two species and two environmental states ( $S=2,M=2$ ) was studied in Hufton et al. (2019b), and a simple version of the $\tau$ FE algorithm was presented for this restricted case. We assume $\sigma$ switches from state $0$ to state $1$ with rate $\lambda k_{1}$ , and from $1$ to $0$ with rate $\lambda k_{0}$ . The environmental transition matrix then becomes

\mathbf{A}=\left(\begin{array}[]{cc}-k_{1}&k_{0}\\ k_{1}&-k_{0}\end{array}\right),

(41)

whose eigenvalues are $\mu_{1}=0$ and $\mu_{2}=-(k_{0}+k_{1})$ . The respective eigenvectors take the form

\mathbf{v}_{1}={\mbox{\boldmath$\rho$}}^{*}=\frac{1}{k_{0}+k_{1}}\left(\begin{array}[]{c}k_{0}\\ k_{1}\end{array}\right)\quad\text{and}\quad\mathbf{v}_{2}=\left(\begin{array}[]{c}1\\ -1\end{array}\right),

(42)

where ${\mbox{\boldmath$\rho$}}^{*}$ has been normalised to represent the stationary distribution for $\sigma$ . The coefficients $c_{2,0}$ and $c_{2,1}$ are obtained from Eq. (6), for the initial conditions ${\mbox{\boldmath$\rho$}}(0)=(1,0)$ and ${\mbox{\boldmath$\rho$}}(0)=(0,1)$ . We find

c_{2,0}=\frac{k_{1}}{k_{0}+k_{1}}\quad\text{and}\quad c_{2,1}=\frac{-k_{0}}{k_{0}+k_{1}}.

(43)

Putting all together in Eq. (13), and after straightforward calculations we arrive at

\displaystyle\Xi_{rs}\equiv\left\langle{\overline{R}_{r}(\mathbf{n})\overline{R}_{s}(\mathbf{n})}\right\rangle-R_{r}^{*}(\mathbf{n})R_{s}^{*}(\mathbf{n})=\frac{\theta^{2}}{\lambda\Delta t}\left[R_{r,1}(\mathbf{n})-R_{r,0}(\mathbf{n})\right]\left[R_{s,1}(\mathbf{n})-R_{s,0}(\mathbf{n})\right],

(44)

where $\theta^{2}=2k_{0}k_{1}/(k_{0}+k_{1})^{3}$ . The indices $r$ and $s$ stand for reactions affected by the environment. As explained in Section III.3, to simulate the $\tau$ FE algorithm we need to draw correlated Gaussian random numbers $\overline{R}_{r}$ with means

R^{*}_{r}(\mathbf{n})=\dfrac{k_{0}R_{r,0}+k_{1}R_{r,0}}{k_{0}+k_{1}},

(45)

for $r=1,2$ , and covariance matrix

\mathbf{\Sigma}=\left(\begin{array}[]{cc}\Xi_{11}&\Xi_{12}\\ \Xi_{21}&\Xi_{22}\end{array}\right).

(46)

One way to do this is by drawing independent Gaussian random numbers $z_{1}$ and $z_{2}$ with mean zero and unit variance, and then to set

\left(\begin{array}[]{c}\overline{R}_{1}(\mathbf{n})\\ \overline{R}_{2}(\mathbf{n})\end{array}\right)=\left(\begin{array}[]{c}R_{1}^{*}(\mathbf{n})\\ R_{2}^{*}(\mathbf{n})\end{array}\right)+\mathbf{C}\left(\begin{array}[]{c}z_{1}\\ z_{2}\end{array}\right),

(47)

with a matrix $\mathbf{C}$ that fulfils $\mathbf{C}\mathbf{C}^{T}=\mathbf{\Sigma}$ , where $T$ denotes the transpose. This matrix is not unique. We use

\mathbf{C}=\dfrac{\mathbf{\Sigma}}{\sqrt{\theta^{2}/(\lambda\Delta t)\left\{\left[R_{1,1}(\mathbf{n})-R_{1,0}(\mathbf{n})\right]^{2}+\left[R_{2,1}(\mathbf{n})-R_{2,0}(\mathbf{n})\right]^{2}\right\}}}.

(48)

Appendix C Birth-death process with two species and three environmental states

In the example in Sec. IV.2 we have the following transition matrix for the environmental process

\mathbf{A}=\left(\begin{array}[]{ccc}-k_{1}&0&k_{0}\\ k_{1}&-k_{2}&0\\ 0&k_{2}&-k_{0}\end{array}\right).

(49)

The eigenvalues of this matrix are

\mu_{1}=0,\quad\mu_{2}=-\frac{1}{2}\left(k_{0}+k_{1}+k_{2}+\Gamma\right),\quad\text{and},\quad\mu_{3}=-\frac{1}{2}\left(k_{0}+k_{1}+k_{2}-\Gamma\right),

(50)

with $\Gamma=\sqrt{k_{0}^{2}+k_{1}^{2}+k_{2}^{2}-2(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})}$ . The associated eigenvectors take the form

\mathbf{v}_{1}={\mbox{\boldmath$\rho$}}^{*}=\frac{1}{k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0}}\left(\begin{array}[]{c}k_{2}k_{0}\\ k_{0}k_{1}\\ k_{1}k_{2}\end{array}\right),

(51)

and

\mathbf{v}_{2}=\left(\begin{array}[]{c}(-k_{0}+k_{1}-k_{2}+\Gamma)/(2k_{2})\\ (k_{0}-k_{1}-k_{2}-\Gamma)/(2k_{2})\\ 1\end{array}\right),\quad\quad\mathbf{v}_{3}=\left(\begin{array}[]{c}(-k_{0}+k_{1}-k_{2}-\Gamma)/(2k_{2})\\ (k_{0}-k_{1}-k_{2}+\Gamma)/(2k_{2})\\ 1\end{array}\right).

(52)

Using Eq. (6) and three sets of initial conditions (each concentrated on one environmental state) we find

c_{2,0}=\dfrac{k_{1}k_{2}\left(k_{0}+k_{1}+k_{2}-\Gamma\right)}{2\Gamma(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})},\quad c_{3,0}=-\dfrac{k_{1}k_{2}\left(k_{0}+k_{1}+k_{2}+\Gamma\right)}{2\Gamma(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})},

(53)

as well as

c_{2,1}=-\dfrac{k_{2}\left(k_{0}(k_{1}+2k_{2})+k_{1}\left(-k_{1}+k_{2}+\Gamma\right)\right)}{2\Gamma(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})},\quad c_{3,1}=\dfrac{k_{2}\left(k_{0}(k_{1}+2k_{2})+k_{1}\left(-k_{1}+k_{2}-\Gamma\right)\right)}{2\Gamma(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})},

(54)

and finally

c_{2,2}=\dfrac{k_{0}\left(-k_{1}^{2}-k_{2}^{2}+k_{0}(k_{1}+k_{2})+k_{1}\Gamma+k_{2}\Gamma\right)}{2\Gamma(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})},\quad c_{3,2}=\dfrac{k_{0}\left(k_{1}^{2}+k_{2}^{2}-k_{0}(k_{1}+k_{2})+k_{1}\Gamma+k_{2}\Gamma\right)}{2\Gamma(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})}.

(55)

Putting all together in Eqs. (2) and (13) and after further tedious but straightforward calculations, we arrive at the expressions in Eqs. (18) and (19).

In order to draw the correlated Gaussian random numbers $\bar{\alpha}$ and $\bar{\beta}$ required for the $\tau$ -leaping step, we proceed as in Appendix B. We construct the covariance matrix $\mathbf{\Sigma}$ [Eq. (46)] and then find a matrix $\mathbf{C}$ such that $\mathbf{C}\mathbf{C}^{T}=\mathbf{\Sigma}$ . We then draw independent Gaussian random numbers $z_{1}$ and $z_{2}$ with mean zero and unit variance, and use an expresion analogous to that in Eq. (47) to obtain $\bar{\alpha}$ and $\bar{\beta}$ . The matrix $\mathbf{C}$ we use is

\mathbf{C}=A\left(\begin{array}[]{cc}\dfrac{\sigma_{\alpha\alpha}+B}{\sigma_{\alpha\beta}}&1\\ 1&\dfrac{\sigma_{\beta\beta}+B}{\sigma_{\alpha\beta}}\end{array}\right),

(56)

with $\sigma_{\alpha\alpha}$ and $\sigma_{\alpha\beta}$ as given in Eq. (19), and

A=\dfrac{\sigma_{\alpha\beta}}{\sqrt{\sigma_{\alpha\alpha}+\sigma_{\beta\beta}+B}},

(57)

and

\displaystyle B=\dfrac{\sqrt{3}k_{0}k_{1}k_{2}}{\lambda\Delta t(k_{0}k_{1}+k_{1}k_{2}+k_{2}k_{0})^{2}}\times\lvert\alpha_{0}(\beta_{2}-\beta_{1})+\alpha_{1}(\beta_{0}-\beta_{2})+\alpha_{2}(\beta_{1}-\beta_{0})\rvert.

(58)

Appendix D Bimodal genetic switch

For the model in Sec. IV.3 the rates of the environmental transitions depend on the number of proteins $n_{P}$ in the population. We assume that $n_{P}$ remains constant during each $\tau$ -leaping step. The environmental transition matrix then becomes

\mathbf{A}=\left(\begin{array}[]{ccc}-\tilde{k}_{+}&k_{-}&0\\ \tilde{k}_{+}&-\tilde{k}_{+}-k_{-}&k_{-}\\ 0&\tilde{k}_{+}&-k_{-}\end{array}\right),

(59)

with $\tilde{k}_{+}=k_{+}n_{P}/\Omega$ . The eigenvalues of this matrix are

\displaystyle\mu_{1}=0,\quad\mu_{2}=-k_{-}-\tilde{k}_{+}-\sqrt{k_{-}\tilde{k}_{+}},\quad\mu_{3}=-k_{-}-\tilde{k}_{+}+\sqrt{k_{-}\tilde{k}_{+}},

(60)

while the associated eigenvectors take the form

\mathbf{v}_{1}={\mbox{\boldmath$\rho$}}^{*}=\frac{1}{k_{-}^{2}+k_{-}\tilde{k}_{+}+\tilde{k}_{+}^{2}}\left(\begin{array}[]{c}k_{-}^{2}\\ k_{-}\tilde{k}_{+}\\ \tilde{k}_{+}^{2}\end{array}\right),

\displaystyle\mathbf{v}_{2}=\left(\begin{array}[]{c}\sqrt{k_{-}/\tilde{k}_{+}}\\ \left(-\sqrt{k_{-}}-\sqrt{\tilde{k}_{+}}\right)/\sqrt{\tilde{k}_{+}}\\ 1\end{array}\right),\quad\text{and,}\quad\mathbf{v}_{3}=\left(\begin{array}[]{c}-\sqrt{k_{-}/\tilde{k}_{+}}\\ \left(\sqrt{k_{-}}-\sqrt{\tilde{k}_{+}}\right)/\sqrt{\tilde{k}_{+}}\\ 1\end{array}\right).

(67)

Applying Eq. (6) for different initial conditions as above, we obtain

c_{2,0}=\dfrac{\tilde{k}_{+}^{3/2}}{2\sqrt{k_{-}}\left(k_{-}+\sqrt{k_{-}\tilde{k}_{+}}+\tilde{k}_{+}\right)},\quad c_{3,0}=-\dfrac{\tilde{k}_{+}^{3/2}}{2\sqrt{k_{-}}\left(k_{-}-\sqrt{k_{-}\tilde{k}_{+}}+\tilde{k}_{+}\right)},

(68)

as well as

c_{2,1}=-\dfrac{\tilde{k}_{+}+\sqrt{k_{-}\tilde{k}_{+}}}{2\left(k_{-}+\sqrt{k_{-}\tilde{k}_{+}}+\tilde{k}_{+}\right)},\quad c_{3,1}=-\dfrac{\tilde{k}_{+}-\sqrt{k_{-}\tilde{k}_{+}}}{2\left(k_{-}-\sqrt{k_{-}\tilde{k}_{+}}+\tilde{k}_{+}\right)},

(69)

and finally

c_{2,2}=-\dfrac{k_{-}}{2\left(k_{-}+\sqrt{k_{-}\tilde{k}_{+}}+\tilde{k}_{+}\right)},\quad c_{3,2}=-\dfrac{k_{-}}{2\left(k_{-}-\sqrt{k_{-}\tilde{k}_{+}}+\tilde{k}_{+}\right)}.

(70)

Putting this together in Eqs. (2) and (13) and after straightforward calculations, we arrive at the expressions in Eqs. (23) and (24).

Since only one reaction is affected by the environmental state, it is only necessary to drawn one Gaussian random number with mean $b^{*}$ and variance $\sigma_{bb}$ in each step of the $\tau$ FE algorithm, with $b^{*}$ and $\sigma_{bb}$ given in Eqs. (23) and (24) respectively.

Appendix E Gillespie algorithm with discretised environmental dynamics (GADE)

In this Appendix we briefly describe the constructions of the rates given in Eq. (31). They define a continuous-time dynamics on a discrete state space approximating the Ornstein–Uhlenbeck process in Eq. (26).

Matching the first moments of movements. We first look at the mean drift of $\sigma$ , i.e., the mean change of $\sigma$ per unit time. Suppose the environment is in a given state $\sigma$ . The mean drift in the Ornstein–Uhlenbeck process [Eq. (26)] is then $\lambda(m-\sigma)$ .

Suppose now the above discrete- $\sigma$ process is in state $\sigma=k\Delta\sigma$ . Then $\sigma$ increases to $\sigma+\Delta\sigma$ with rate $T_{k}^{+}$ and decreases to $\sigma-\Delta\sigma$ with rate $T_{k}^{-}$ . The expected change (per unit time) is therefore $\Delta\sigma\times(T_{k}^{+}-T_{k}^{-})$ .

We conclude that we need to impose

\Delta\sigma\times(T_{k}^{+}-T_{k}^{-})=\lambda(m-k\Delta\sigma).

(71)

Matching the variance of movements. Next we look at the variance of movements of $\sigma$ . For the Ornstein–Uhlenbeck process in Eq. (26) the second moment of movements (per unit time) is given by $2\lambda v^{2}$ . In the discrete- $\sigma$ process, the second moment of movements is $(\Delta\sigma)^{2}\times(T_{k}^{+}+T_{k}^{-})$ . To match the Ornstein–Uhlenbeck process, we then need to impose

(\Delta\sigma)^{2}\times(T_{k}^{+}+T_{k}^{-})=2\lambda v^{2}.

(72)

Overall solution. Simultaneously solving Eqs. (71) and (72) for $T_{k}^{+}$ and $T_{k}^{-}$ we arrive at Eq. (31).

Appendix F Additional examples of production-removal processes in continuous environments

In this Appendix we include results for the variances and covariances $\left\langle{\overline{R}_{r}(n)\overline{R}_{s}(n)}\right\rangle-R_{r}^{*}(\mathbf{n})R_{s}^{*}(n)$ for two further exemplar systems in which the environment follows the Ornstein–Uhlenbeck process in Eq. (26). We set $m=0$ for both examples. Both systems describe production and removal dynamics of a single species. In the first example, production and removal rates are proportional to $\sigma$ when $\sigma>0$ and zero otherwise. In the second example the rates are each proportional to $|\sigma|$ . These examples are not used in the main paper, we report them here for completeness, as they may prove useful for future applications of the $\tau$ FE algorithm.

F.1 Rates $R_{r,\sigma}(n)=\alpha_{r}\sigma\Theta(\sigma)$

We look at the example $R_{r,\sigma}(n)=\alpha_{r}\sigma\Theta(\sigma)$ , where $\Theta(\sigma)$ is the Heaviside function, $\Theta(\sigma)=1$ for $\sigma>0$ and $\Theta(\sigma)=0$ otherwise. For $m=0$ , we find

R_{r}^{*}(n)=\alpha_{r}\dfrac{v}{\sqrt{\pi}},

(73)

and

	$\displaystyle\left\langle{\overline{R}_{r}(n)\overline{R}_{s}(n)}\right\rangle-R_{r}^{}(n)R_{s}^{}(n)=$
	$\displaystyle\quad\frac{\alpha_{r}\alpha_{s}v^{2}}{24\pi\Delta t^{2}}\Bigg{\{}\dfrac{1}{\lambda^{2}}\left[24\pi\left(\lambda\Delta t-1\right)-\pi^{2}+12\log^{2}(2)\right]+$
	$\displaystyle\quad\dfrac{4e^{-\lambda\Delta t}}{\lambda^{2}}\left[3\left(6\sqrt{e^{2\lambda\Delta t}-1}+\pi\right)-4e^{\lambda\Delta t}\log\left(\sqrt{e^{2\lambda\Delta t}-1}+e^{\lambda\Delta t}\right)+6\tan^{-1}\left(\frac{1}{\sqrt{e^{2\lambda\Delta t}-1}}\right)\right]$
	$\displaystyle\quad-\dfrac{32}{\lambda^{2}}\text{Re}\left(i\sin^{-1}\left(e^{\lambda\Delta t}\right)\right)-6\bigg{[}\dfrac{1}{\lambda^{2}}\log^{2}\left(\sqrt{1-e^{-2\lambda\Delta t}}+1\right)+\dfrac{4\Delta t}{\lambda}\log\left(\sqrt{1-e^{-2\lambda\Delta t}}+1\right)$
	$\displaystyle\quad-\dfrac{4\Delta t\log(2)}{\lambda}-\dfrac{\log(4)}{\lambda^{2}}\log\left(\sqrt{1-e^{-2\lambda\Delta t}}+1\right)-\dfrac{4\Delta t}{\lambda}\tanh^{-1}\left(e^{-\lambda\Delta t}\sqrt{e^{2\lambda\Delta t}-1}\right)$
	$\displaystyle\quad-\dfrac{2}{\lambda^{2}}\text{Li}_{2}\left(\frac{1}{2}\left(1-\sqrt{1-e^{-2\lambda\Delta t}}\right)\right)+\dfrac{\log^{2}(2)}{\lambda^{2}}+2\Delta t^{2}\bigg{]}-24\Bigg{\}},$		(74)

where Re( $\cdot$ ) denotes the real part, and $\text{Li}_{2}(\cdot)$ is the polylogarithm of order 2.

F.2 Rates $R_{r,\sigma}(n)=\alpha_{r}|\sigma|$

For this case (and setting again $m=0$ ), we find

R_{r}^{*}(n)=\alpha_{r}\dfrac{2v}{\sqrt{\pi}},

(75)

and

	$\displaystyle\left\langle{\overline{R}_{r}(n)\overline{R}_{s}(n)}\right\rangle-R_{r}^{}(\mathbf{n})R_{s}^{}(n)=$
	$\displaystyle\quad\frac{\alpha_{r}\alpha_{s}v^{2}}{6\pi\Delta t^{2}}\Bigg{\{}\dfrac{12\Delta t}{\lambda}\left[-2\log\left(\sqrt{1-e^{-2\lambda\Delta t}}+1\right)+2\tanh^{-1}\left(\sqrt{1-e^{-2\lambda\Delta t}}\right)+\pi+\log(4)\right]$
	$\displaystyle\quad+\dfrac{1}{\lambda^{2}}\Bigg{[}72e^{-\lambda\Delta t}\sqrt{e^{2\lambda\Delta t}-1}-6\log^{2}\left(\frac{1}{2}\left(\sqrt{1-e^{-2\lambda\Delta t}}+1\right)\right)+24e^{-\lambda\Delta t}\tan^{-1}\left(\frac{1}{\sqrt{e^{2\lambda\Delta t}-1}}\right)$
	$\displaystyle\quad-48\tanh^{-1}\left(e^{-\lambda\Delta t}\sqrt{e^{2\lambda\Delta t}-1}\right)+12\text{Li}_{2}\left(\frac{1}{2}\left(1-\sqrt{1-e^{-2\lambda\Delta t}}\right)\right)-\pi^{2}-12\pi+12\log^{2}(2)\Bigg{]}$
	$\displaystyle\quad-12\left(\Delta t^{2}+2\right)\Bigg{\}}.$		(76)

Beyond the adiabatic limit in systems with fast environments: a τ\tau-leaping algorithm