The origin of the Poisson distribution in stochastic dynamics of gene expression

Julian Lee jul@ssu.ac.kr Department of Bioinformatics and Life Science, Soongsil University, Seoul, Korea
Abstract

The Poisson distribution is the probability distribution of the number of independent events in a given period of time. Although the Poisson distribution appears ubiquitously in the stochastic dynamics of gene expression, both as a time-dependent and as a stationary distribution, the underlying independent events that give rise to such distributions have not been clear, especially in the presence of the degradation of gene products, which is not a Poisson process. I show that the variable that follows the Poisson distribution is, in fact, the number of independent events in which biomolecules are created that are destined to survive until the end of a given time duration. This new viewpoint allows us to derive time-dependent Poisson distributions as solutions of master equations for a general class of protein production and degradation dynamics, including models with time-dependent rates and a non-Markovian model with delayed degradation. I then derive analytic forms of general time-dependent probability distributions by combining the Poisson distribution with the binomial or the multinomial distribution.

Gene Regulatory Network, Stochastic fluctuation, Poisson distribution

I Introduction

The gene regulatory network (GRN) controls life processes by producing and degrading various kinds of proteins that perform important biological functions. It is a well-known fact that the time evolution of mRNA and/or protein numbers in such a network is stochastic [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24]. Even when various external conditions such as cellular environments are identical, there is always intrinsic noise due to remaining uncontrolled factors that influence the GRN of interest, making its dynamics stochastic. It has been suggested that biological organisms may have evolved to take advantage of such fluctuations [13].

In simple theoretical models of stochastic GRN dynamics, the Poisson distribution,

P_{\rm Poisson}(n;\mu)=e^{-\mu}\frac{\mu^{n}}{n!}, \qquad (1)

often appears as a probability distribution for the number of mRNA or protein molecules, both as a time-dependent and as a stationary distribution [1, 3, 15, 16, 21, 23, 24]. In fact, the Poisson distribution arises from a Poisson process, where the probability of an event occurring during a short time interval $[t,t+dt]$ is independent of events in other time regions [25, 26, 27]. Now, consider a simple transcription process where a gene is always active, and an mRNA molecule $X$ is transcribed with a rate $\alpha$,

\varnothing\xrightarrow{\alpha}X. \qquad (2)

It is clear that the creation events are indeed independent events, forming a Poisson process. Therefore, if we start from zero molecules at $t=0$, then the number $n$ of mRNA molecules at any later time $t$ is exactly the same as the number of creation events up to that time point, and consequently, the Poisson distribution is actually the distribution of the number of creation events during the time interval $[0,t]$. However, consider a model where a degradation

X\xrightarrow{\beta}\varnothing \qquad (3)

is also included. Now, the degradation is not a Poisson process, although it is sometimes erroneously written as such in the literature. Since a molecule that has already been degraded cannot be degraded again, the probability of a degradation event happening during the short time period $[t,t+dt]$ definitely depends on how many degradation events happened at times earlier than $t$. However, Poisson distributions still appear ubiquitously in this class of models as the distributions of the number of mRNA or protein molecules [1, 3, 15, 16, 21, 23, 24], making their origin mysterious. Therefore, I address the following question in this paper: Given that the Poisson distribution appears so ubiquitously in the stochastic dynamics of GRN as the distribution of the molecule number of mRNA or protein, is this molecule number equal to the number of certain independent events that happened during a given time interval? As I will show, the answer to this question is affirmative. In fact, it is the number of events where mRNA molecules are created, which are destined to survive until the end of a given interval. The answer is very simple once stated, almost on the verge of being trivial. The most probable reason that it has not been discussed in the literature is that it may have been counter-intuitive to consider the birth of a particle with a given fate. However, it is important to note that although the fate of a molecule is not determined at the time of its creation, the probability of its given fate at the end of the time interval, death or survival, is already determined at the time of its creation, and that is all that matters in defining a Poisson process.

Using this viewpoint of Poisson processes, I can not only rederive the Poisson distribution for models of gene expression with time-dependent rates and a non-Markovian model with time delay, but also derive analytic forms of general time-dependent distributions by combining the Poisson distribution with other distributions. In fact, time-dependent distributions for arbitrary initial distributions are obtained for the Markovian model by combining the Poisson distribution with the binomial distribution, and distributions for a very general class of initial distributions are obtained for a non-Markovian model by combining the Poisson distribution with the multinomial distribution.

The remainder of the paper is organized in order of increasing complexity. In section II, I will briefly review Poisson processes, where I will emphasize the importance of inhomogeneous Poisson processes, consisting of independent events that do not necessarily follow identical distributions. In section III, I will consider molecule creations without degradations, a textbook example of a Poisson process. In section IV, I will describe molecule degradation, which is not a Poisson process. I show that a general time-dependent probability distribution can be expressed as a superposition of binomial distributions. In section V, I will describe molecule creations with degradations and the underlying Poisson process. I show that any time-dependent distribution can be expressed in terms of the Poisson and the binomial distributions. The result will be generalized to a non-Markovian model with delayed degradation in section VI, where I will not only rederive the Poisson distribution of an earlier work with less effort, but also derive a general time-dependent distribution by combining the Poisson distribution with the multinomial distribution. Finally, discussions are given in section VII.

II The Poisson distribution and Poisson Processes

The Poisson distribution in Eq.(1) is specified by a single parameter $\mu$, the expected number of independent events in a given time duration. If we partition the time interval $[0,t]$ into $N$ sub-intervals of small size $\Delta t\equiv t/N$, then the probability of an event happening more than once in each sub-interval is $O\left((\Delta t)^{2}\right)$, and the probability of a single occurrence of the event takes the form $p=\lambda\Delta t+O\left((\Delta t)^{2}\right)$ when such a probability is time-independent. Therefore, the total number of events in $[0,t]$ approximately follows the binomial distribution,

P_{\rm binom}(n;\{N,p\})=\frac{N!}{n!(N-n)!}p^{n}(1-p)^{N-n}, \qquad (4)

the distribution of the number of successes in $N$ independent and identical trials, with success probability $p$ at each trial. The Poisson distribution is recovered by taking the limit $N\to\infty$ with $\mu(t)=Np=\lambda t$ fixed (Appendix A):

P_{\rm Poisson}(n;\mu(t))=\frac{e^{-\mu(t)}}{n!}\mu(t)^{n}, \qquad (5)

where $\mu(t)$ is the expected number of events in the time interval $[0,t]$.

(Footnote 1: The range of $n$ in the probability distribution $P(n,t)$ will be taken to be all integers without restriction in this work, unless specified otherwise. This is acceptable as long as we set $P(n,t)=0$ for illegitimate values of $n$. All the explicit forms of probability distributions considered here, such as the binomial, multinomial, and Poisson distributions, contain factorials of negative integers in the denominators whenever the value of the molecule number is out of its legitimate range, and vanish there due to the fact that $(j!)^{-1}=[\Gamma(j+1)]^{-1}=0$ whenever $j$ is a negative integer. Taking the range of the molecule number to be all integers is especially convenient for summations, where the summation index can be shifted freely, which I will do throughout the text without further elaboration.)
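The binomial-to-Poisson limit above can be illustrated numerically; a minimal sketch, where the value $\mu=3$ and the grid of $N$ values are arbitrary illustrative choices:

```python
import math

# Compare Binomial(N, p) with p = mu/N against Poisson(mu) for growing N;
# by Le Cam's inequality the total-variation distance is at most mu^2/N.
mu = 3.0  # illustrative choice

def binom_pmf(n, N, p):
    return math.comb(N, n) * p**n * (1 - p)**(N - n)

def poisson_pmf(n, m):
    return math.exp(-m) * m**n / math.factorial(n)

errs = {N: max(abs(binom_pmf(n, N, mu / N) - poisson_pmf(n, mu))
               for n in range(11))
        for N in (10, 100, 10000)}
```

The maximum pointwise discrepancy `errs[N]` shrinks roughly as $1/N$, as the limit suggests.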

An important point to note here is that the condition of identical trials is not actually required to obtain the Poisson distribution. We only have to require independent trials [25, 26]. In this case, the binomial distribution in Eq.(4) is generalized to

\tilde{P}_{\rm binom}(n;\{p_{1},\cdots,p_{N}\})=\sum_{i_{1}<i_{2}<\cdots<i_{n}}p_{i_{1}}p_{i_{2}}\cdots p_{i_{n}}\prod_{k\notin\{i_{1},\cdots,i_{n}\}}\left(1-p_{k}\right), \qquad (6)

where the probability of success at the $i$-th trial is $p_{i}$, and the summation is over all distinct sets of $n$ indices $\{i_{1}<i_{2}<\cdots<i_{n}\}\subset\{1,2,\cdots,N\}$. Again, the distribution in Eq.(6) becomes the Poisson distribution in the limit $\Delta t\to 0$, now with a time-dependent function $\lambda(t)$, so that $\mu(t)=\int_{0}^{t}\lambda(t^{\prime})dt^{\prime}$ [26] (Appendix A). This is the process where the probability of an event happening in an infinitesimal time interval $[t,t+dt]$ is $\lambda(t)dt$, called an inhomogeneous Poisson process, to distinguish it from the case with constant $\lambda$, called a homogeneous Poisson process.
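An inhomogeneous Poisson process can be simulated by thinning a homogeneous one; the sketch below (the rate $\lambda(t)=2e^{-t}$, the horizon, and the sample size are arbitrary illustrative choices) checks the Poisson signature that the event count over $[0,t]$ has mean and variance both equal to $\mu(t)=\int_0^t\lambda(t^{\prime})dt^{\prime}$:

```python
import math
import random

# Simulate an inhomogeneous Poisson process on [0, t_end] by thinning a
# homogeneous process of rate lam_max >= lam(t): candidate events arrive
# at rate lam_max and are kept with probability lam(t)/lam_max.
def inhom_poisson_count(lam, lam_max, t_end, rng):
    t, count = 0.0, 0
    while True:
        t += rng.expovariate(lam_max)   # next candidate event time
        if t > t_end:
            return count
        if rng.random() < lam(t) / lam_max:
            count += 1

rng = random.Random(1)
lam = lambda t: 2.0 * math.exp(-t)      # illustrative decaying rate
mu = 2.0 * (1.0 - math.exp(-3.0))       # mu(3) = integral of lam over [0, 3]
counts = [inhom_poisson_count(lam, 2.0, 3.0, rng) for _ in range(20000)]
mean = sum(counts) / len(counts)
var = sum((c - mean) ** 2 for c in counts) / len(counts)
```

For a Poisson-distributed count, `mean` and `var` should both be close to `mu` up to sampling error.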

III Molecule creation without degradation

This is a simple birth process

\varnothing\xrightarrow{\alpha(t)}X, \qquad (7)

where $X$ describes an mRNA or protein molecule, and the creation rate $\alpha(t)$ is time-dependent in general. (Footnote 2: All the time-dependent rates in this work are rates with predetermined time dependences, and the corresponding models should not be confused with those where the rates themselves are stochastic variables [28, 21, 23, 24].) Since molecule creations at distinct time points are independent of each other, these creation events form a Poisson process. If the number of molecules is zero at $t=0$, then the molecule number $n$ at a later time $t>0$ is equal to the number of creation events in the time interval $[0,t]$. Therefore, the probability distribution of $n$ is given by the Poisson distribution, $P(n,t)=P_{\rm Poisson}(n;\mu(t))$, with $\mu(t)=\int^{t}_{0}\alpha(t^{\prime})dt^{\prime}$.

Even when the number of molecules takes a non-zero value $n_{0}$ at $t=0$, the number of molecules $n$ at a later time $t>0$ is simply $n_{0}+n^{\prime}$, where $n^{\prime}$ is the number of creation events in the time interval $[0,t]$. Therefore, $n^{\prime}=n-n_{0}$ still follows the Poisson distribution, leading to a shifted Poisson distribution for $n$,

P(n,t)=\frac{e^{-\mu(t)}}{(n-n_{0})!}\mu(t)^{n-n_{0}}. \qquad (8)

The dynamics of the probability distribution is Markovian, described by the master equation

\frac{\partial P(n,t)}{\partial t}=\alpha(t)\left[P(n-1,t)-P(n,t)\right]. \qquad (9)

It is straightforward to check that Eq.(8) is a solution of the master equation (9) by direct substitution. By the linearity of the master equation (9), an analytic form of the general solution with an arbitrary initial distribution $P(n,0)=v(n)$ can be constructed by superposing the expressions in Eq.(8),

P(n,t)=\sum_{n_{0}}\frac{e^{-\mu(t)}}{(n-n_{0})!}\mu(t)^{n-n_{0}}v(n_{0}). \qquad (10)

From Eq.(10), we see that the master equation (9) admits stationary solutions

P_{\rm st}(n)=\sum_{n_{0}}\frac{e^{-\mu(\infty)}}{(n-n_{0})!}\mu(\infty)^{n-n_{0}}v(n_{0}) \qquad (11)

that depend on initial distributions, if and only if $\mu(\infty)=\int_{0}^{\infty}\alpha(s)ds$ exists. In particular, for a constant rate $\alpha$, there is no stationary solution because $\mu(t)=\alpha t$ increases indefinitely with time, and $P(n,t)$ converges to zero pointwise for every $n$. In this case, all the states in the Markov chain are transient states [29].
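The shifted Poisson solution of Eq.(8) can be checked against a direct numerical integration of the master equation (9); a minimal sketch, where the rate $\alpha(t)=1.5(1+\sin t)$, the initial count $n_{0}=2$, and the step size are arbitrary illustrative choices:

```python
import math

# Forward-Euler integration of the master equation (9) with a time-dependent
# rate, compared against the shifted Poisson solution of Eq. (8).
alpha = lambda t: 1.5 * (1.0 + math.sin(t))   # illustrative rate
n0, nmax, dt, t_end = 2, 60, 1e-4, 2.0
P = [0.0] * (nmax + 1)
P[n0] = 1.0                                   # start with exactly n0 molecules
for k in range(int(round(t_end / dt))):
    a = alpha(k * dt)
    P = [P[n] + dt * a * ((P[n - 1] if n > 0 else 0.0) - P[n])
         for n in range(nmax + 1)]
mu = 1.5 * (t_end + 1.0 - math.cos(t_end))    # integral of alpha over [0, t_end]
exact = [math.exp(-mu) * mu ** (n - n0) / math.factorial(n - n0)
         if n >= n0 else 0.0 for n in range(nmax + 1)]
err = max(abs(p - q) for p, q in zip(P, exact))
```

The residual `err` is limited only by the Euler discretization and the truncation at `nmax`.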

IV Model only with degradation

This is a simple death process

X\xrightarrow{\beta(t)}\varnothing. \qquad (12)

In contrast to the creation process in the previous section, this process is not a Poisson process, because any molecule that has been degraded cannot be degraded again. Therefore, the degradation events are not independent, and the probability of a degradation event depends on the number of molecules available. The master equation describing the dynamics of this model is

\frac{\partial P(n,t)}{\partial t}=\beta(t)\left[(n+1)P(n+1,t)-nP(n,t)\right]. \qquad (13)

In order to get a non-zero number of molecules, the initial number $n_{0}$ of molecules must be non-zero. The distribution of the number of molecules at later times can be obtained by using the fact that the survival of a given molecule up to a later time $t$ is independent of what happens to the other molecules; the survival probability $p_{\rm surv}(t,0)$ is given by

p_{\rm surv}(t,0)=\exp\left(-\int_{0}^{t}\beta(s)ds\right). \qquad (14)

Therefore, the number $n$ of surviving molecules at time $t$ follows the binomial distribution

P(n,t)=P_{\rm binom}(n;\{n_{0},p_{\rm surv}(t,0)\})=\frac{n_{0}!}{n!(n_{0}-n)!}p_{\rm surv}(t,0)^{n}(1-p_{\rm surv}(t,0))^{n_{0}-n}. \qquad (15)

It can be checked by direct substitution that the expression in Eq.(15) is a solution of the master equation (13). Again, the general solution with an arbitrary initial distribution $P(n,0)=v(n)$ can be constructed by the superposition

P(n,t)=\sum_{n_{0}}\frac{n_{0}!}{n!(n_{0}-n)!}p_{\rm surv}(t,0)^{n}(1-p_{\rm surv}(t,0))^{n_{0}-n}v(n_{0}). \qquad (16)
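The binomial law of Eq.(15) can likewise be checked against a direct numerical integration of the master equation (13); a minimal sketch, where the rate $\beta(t)=0.5(1+t)$, the initial count $n_{0}=6$, and the step size are arbitrary illustrative choices:

```python
import math

# Forward-Euler integration of the death-only master equation (13), starting
# from exactly n0 molecules, compared against the binomial law of Eq. (15).
beta = lambda t: 0.5 * (1.0 + t)              # illustrative decay rate
n0, dt, T = 6, 5e-5, 2.0
P = [0.0] * (n0 + 1)
P[n0] = 1.0
for k in range(int(round(T / dt))):
    b = beta(k * dt)
    P = [P[n] + dt * b * (((n + 1) * P[n + 1] if n < n0 else 0.0) - n * P[n])
         for n in range(n0 + 1)]
p_surv = math.exp(-0.5 * (T + T ** 2 / 2))    # Eq. (14) for this beta(t)
exact = [math.comb(n0, n) * p_surv ** n * (1 - p_surv) ** (n0 - n)
         for n in range(n0 + 1)]
err = max(abs(p - q) for p, q in zip(P, exact))
```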

A Poisson distribution appears in the special case where the initial distribution is Poissonian:

v(n)=\frac{e^{-\mu_{0}}\mu_{0}^{n}}{n!}, \qquad (17)

in which case we obtain the Poisson distribution $P(n,t)=P_{\rm Poisson}(n;\mu(t))$ with

\mu(t)=\mu_{0}p_{\rm surv}(t,0), \qquad (18)

due to the fact that the superposition of binomial distributions weighted by a Poisson distribution is again a Poisson distribution [26](Appendix B). In the next section, I will show that this distribution can be interpreted as a result of a Poisson process.
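The fact that a Poisson-weighted superposition of binomial distributions is again Poisson can be verified numerically; a minimal sketch, where $\mu_{0}=4$ and $p=0.3$ are arbitrary illustrative choices:

```python
import math

# Check numerically that a Poisson(mu0) mixture of Binomial(n0, p) laws is
# again Poisson with mean mu0 * p, as in Eqs. (17)-(18).
mu0, p, nmax = 4.0, 0.3, 80

def poisson_pmf(n, m):
    return math.exp(-m) * m ** n / math.factorial(n)

def binom_pmf(n, n0, p):
    if n < 0 or n > n0:
        return 0.0
    return math.comb(n0, n) * p ** n * (1 - p) ** (n0 - n)

mixture = [sum(poisson_pmf(n0, mu0) * binom_pmf(n, n0, p)
               for n0 in range(nmax + 1)) for n in range(nmax + 1)]
exact = [poisson_pmf(n, mu0 * p) for n in range(nmax + 1)]
err = max(abs(a - b) for a, b in zip(mixture, exact))
```

The truncation at `nmax` is far in the Poisson tail, so the agreement is at machine precision.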

When the integral $\int_{0}^{t}\beta(s)ds$ diverges as $t\to\infty$, as in the case of a constant $\beta$, we get $\lim_{t\to\infty}p_{\rm surv}(t,0)=0$. In this case, we see from Eq.(16) that the probability distribution approaches the stationary distribution

\lim_{t\to\infty}P(n,t)=P_{\rm st}(n)=\delta_{n,0}, \qquad (19)

regardless of the initial distribution.

V Model with both creation and degradation

This model is described by

\varnothing\xrightarrow{\alpha(t)}X,\quad X\xrightarrow{\beta(t)}\varnothing, \qquad (20)

which belongs to a class of models called birth-death models [25, 26, 27]. Let us first consider the case where the number of $X$ molecules is initially zero. Molecules are created in the time interval $[0,t]$, but only a fraction of them survive at $t$; I will call these the surviving molecules. When a molecule is created in the short time interval $[t^{\prime},t^{\prime}+dt^{\prime}]$ with $t^{\prime}<t$, its fate at $t$ is undetermined, but the probability of its survival at time $t$, $p_{\rm surv}(t,t^{\prime})$, is already determined to be

p_{\rm surv}(t,t^{\prime})=\exp\left(-\int_{t^{\prime}}^{t}\beta(s)ds\right), \qquad (21)

and consequently, the probability that a surviving molecule is created in $[t^{\prime},t^{\prime}+dt^{\prime}]$ is given by $\alpha(t^{\prime})p_{\rm surv}(t,t^{\prime})dt^{\prime}$, independent of events that happen in other regions of time. Therefore, the number of molecules at a later time $t$ is equal to the number of these independent events in $[0,t]$, following the Poisson distribution $P_{\rm Poisson}(n;\mu(t))$ with

\mu(t)=\int_{0}^{t}\alpha(s)p_{\rm surv}(t,s)ds=\int_{0}^{t}\alpha(s)\exp\left(-\int_{s}^{t}\beta(u)du\right)ds. \qquad (22)

The master equation for this model is

\frac{\partial P(n,t)}{\partial t}=\alpha(t)\left[P(n-1,t)-P(n,t)\right]+\beta(t)\left[(n+1)P(n+1,t)-nP(n,t)\right], \qquad (23)

and it is straightforward to check by direct substitution that $P_{\rm Poisson}(n;\mu(t))$ with $\mu(t)$ given by Eq.(22) is a solution of this equation.

We noted in the previous section that the distribution of the molecule number in the degradation-only model is Poissonian for $t>0$ if the initial distribution at $t=0$ is also Poissonian. In fact, this distribution can be considered a special case of the Poisson distribution in the model with a time-dependent creation rate $\alpha(t)$, obtained by shifting the origin of time: we start from zero molecules at some time $t_{0}<0$ so that we obtain a Poisson distribution for $P(n,0)$, and we then require that $\alpha(t)=0$ for $t>0$ so that the model reduces to the degradation-only model for $t>0$.

The probability distribution converges to a stationary Poisson distribution if and only if the expected number of surviving molecules at $t\to\infty$, given by the integral

\mu(\infty)=\int_{0}^{\infty}\alpha(s)\exp\left(-\int_{s}^{\infty}\beta(u)du\right)ds, \qquad (24)

is finite. For example, if the rates $\alpha$ and $\beta$ are constants, we get

\mu(t)=\int_{0}^{t}\alpha e^{-(t-s)\beta}ds=\frac{\alpha}{\beta}(1-e^{-\beta t}), \qquad (25)

and

\mu(\infty)=\frac{\alpha}{\beta}. \qquad (26)
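The constant-rate result in Eq.(25) can be checked against a stochastic simulation of the reactions in Eq.(20); a minimal sketch using the standard Gillespie algorithm, where the rate values, horizon, and sample size are arbitrary illustrative choices:

```python
import math
import random

# Gillespie simulation of the constant-rate birth-death model of Eq. (20),
# starting from zero molecules; the count at time T should be Poisson with
# mean alpha/beta * (1 - exp(-beta*T)), Eq. (25).
alpha, beta, T = 5.0, 1.0, 2.0        # illustrative parameter values
rng = random.Random(7)

def sample(T):
    t, n = 0.0, 0
    while True:
        rate = alpha + beta * n        # total event rate at count n
        t += rng.expovariate(rate)     # waiting time to the next event
        if t > T:
            return n
        n += 1 if rng.random() < alpha / rate else -1

mu = alpha / beta * (1.0 - math.exp(-beta * T))
samples = [sample(T) for _ in range(20000)]
mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)
```

Both the sample mean and the sample variance should agree with `mu`, the Poisson signature.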

Now consider a more general situation where the initial number of molecules is $n_{0}$. The number of molecules at a later time $t$ is the sum of the number $n_{1}$ of surviving molecules among the initial $n_{0}$ particles, which follows the binomial distribution in Eq.(15), and the number $n_{2}$ of surviving molecules created during the interval $[0,t]$, which follows the Poisson distribution. Therefore, the probability distribution for the molecule number $n$ is

P(n,t) = \sum_{n_{1}+n_{2}=n}P_{\rm binom}(n_{1};\{n_{0},p_{\rm surv}(t,0)\})P_{\rm Poisson}(n_{2};\mu(t))
       = \sum_{n_{1}}\frac{n_{0}!}{n_{1}!(n_{0}-n_{1})!}\frac{e^{-\mu(t)}\mu(t)^{n-n_{1}}}{(n-n_{1})!}p_{\rm surv}(t,0)^{n_{1}}(1-p_{\rm surv}(t,0))^{n_{0}-n_{1}}, \qquad (27)

where $p_{\rm surv}(t,0)$ and $\mu(t)$ are given by Eq.(14) and Eq.(22), respectively. It is straightforward to check by direct substitution that the expression in Eq.(27) is a solution of the master equation (23) (Appendix C). By the linearity of the master equation, the solution for an arbitrary initial distribution $P(n,0)=v(n)$ can be constructed by superposing the distributions in Eq.(27),

P(n,t)=\sum_{n_{0},n_{1}}\frac{v(n_{0})\,n_{0}!\,e^{-\mu(t)}\mu(t)^{n-n_{1}}}{n_{1}!(n_{0}-n_{1})!(n-n_{1})!}p_{\rm surv}(t,0)^{n_{1}}(1-p_{\rm surv}(t,0))^{n_{0}-n_{1}}. \qquad (28)
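The binomial-Poisson convolution of Eq.(27) can be checked against a direct numerical integration of the master equation (23); a minimal sketch with constant rates, where the parameter values and step size are arbitrary illustrative choices:

```python
import math

# Forward-Euler integration of the master equation (23) with constant rates
# and a definite initial count n0, compared against the binomial-Poisson
# convolution of Eq. (27).
alpha, beta, n0, nmax, dt, T = 3.0, 1.0, 4, 40, 1e-4, 1.5
P = [0.0] * (nmax + 1)
P[n0] = 1.0
for _ in range(int(round(T / dt))):
    P = [P[n]
         + dt * alpha * ((P[n - 1] if n > 0 else 0.0) - P[n])
         + dt * beta * (((n + 1) * P[n + 1] if n < nmax else 0.0) - n * P[n])
         for n in range(nmax + 1)]
p_surv = math.exp(-beta * T)                       # Eq. (14), constant beta
mu = alpha / beta * (1.0 - math.exp(-beta * T))    # Eq. (25)

def exact(n):                                      # Eq. (27)
    return sum(math.comb(n0, n1) * p_surv ** n1 * (1 - p_surv) ** (n0 - n1)
               * math.exp(-mu) * mu ** (n - n1) / math.factorial(n - n1)
               for n1 in range(0, min(n, n0) + 1))

err = max(abs(P[n] - exact(n)) for n in range(nmax + 1))
```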

VI Model with time-delayed degradation

Due to the complex mechanism of protein degradation, there can be a time delay in protein degradation. A model with time-delayed degradation attempts to capture such behavior: once a molecule enters the degradation process, it gets degraded after a fixed time delay $\tau$ [15]. A more general model, where the molecule is allowed to get degraded before $\tau$, was also constructed [16], and I will examine it in this section in detail. The model is

\varnothing\xrightarrow{\alpha}X_{A},\quad X_{A}\xrightarrow{\gamma}\varnothing,\quad X_{A}\xrightarrow{\beta}X_{I},\quad X_{I}\xRightarrow[\tau]{}\varnothing,\quad X_{I}\xrightarrow{\zeta}\varnothing. \qquad (29)

Here, $X_{I}$ is an inactive molecule that has entered the degradation process. An active molecule is denoted by $X_{A}$; it can undergo instantaneous degradation with rate $\gamma$, or enter the delayed degradation process with rate $\beta$, at which point it becomes inactive and gets degraded after time $\tau$ with certainty, unless it undergoes instantaneous degradation with rate $\zeta$ before the delayed degradation is complete. Although it is straightforward to allow for time-dependent rates, I will keep them time-independent in order to compare the results with those of ref. [16], as well as for notational simplicity.

This model is non-Markovian, and the master equation for the probability distribution $P(n_{A},n_{I},t)$ takes the form

\frac{dP(n_{A},n_{I},t)}{dt} = (E_{A}^{-1}-1)\alpha P(n_{A},n_{I},t)+(E_{A}-1)\gamma n_{A}P(n_{A},n_{I},t)
 + (E_{A}E_{I}^{-1}-1)\beta n_{A}P(n_{A},n_{I},t)+(E_{I}-1)\zeta n_{I}P(n_{A},n_{I},t)
 + \sum_{n^{\prime}_{A}}P^{*}(n_{A},n_{I}-1,\tau|n^{\prime}_{A}-1)\beta n^{\prime}_{A}P(n^{\prime}_{A},t-\tau)e^{-\zeta\tau}, \qquad (30)

where $E^{k}_{A,I}f(n_{A,I})\equiv f(n_{A,I}+k)$, $P(n_{A},t)\equiv\sum_{n_{I}}P(n_{A},n_{I},t)$ is the marginal probability for $n_{A}$, and $P^{*}(n_{A},n_{I},t|n^{\prime}_{A}-1)$ is the probability under the initial condition of $n^{\prime}_{A}-1$ active molecules and no inactive molecules, obtained by neglecting the time-delayed degradation [16]:

\frac{dP^{*}(n_{A},n_{I},t|n^{\prime}_{A}-1)}{dt} = (E_{A}^{-1}-1)\alpha P^{*}(n_{A},n_{I},t|n^{\prime}_{A}-1)+(E_{A}-1)\gamma n_{A}P^{*}(n_{A},n_{I},t|n^{\prime}_{A}-1)
 + (E_{A}E_{I}^{-1}-1)\beta n_{A}P^{*}(n_{A},n_{I},t|n^{\prime}_{A}-1)
 + (E_{I}-1)\zeta n_{I}P^{*}(n_{A},n_{I},t|n^{\prime}_{A}-1). \qquad (31)

Eq. (30) along with Eq. (31) admits the Poisson distribution as a time-dependent solution if the particle number is initially zero or the initial distribution is Poissonian [16]. However, it is easy to derive the Poisson distribution without going through the complicated process of solving Eqs.(30) and (31), by considering the underlying Poisson process.

Note that one of three things happens in a given short time interval $[t^{\prime},t^{\prime}+dt^{\prime}]$: a molecule that will survive at $t$ as an active one is created, one that will survive as an inactive one is created, or neither happens. (Footnote 3: This should not be confused with the birth of an inactive molecule. Every molecule is born active, and only an active molecule can turn into an inactive one. Here I am considering the birth of an active molecule that will eventually survive as an inactive molecule at the later time $t$.) These events are independent of creation events in other regions of time. Also, although the creation of a surviving active molecule and that of a surviving inactive molecule are exclusive events, they can be treated as independent events because they are rare: even if we assume they are independent, the probability that more than one molecule is created in $[t^{\prime},t^{\prime}+dt^{\prime}]$ is negligible. Therefore, exclusive and independent events are indistinguishable if they are rare (Appendix D), and the probability distribution for the numbers $n_{A}$, $n_{I}$ of $X_{A}$, $X_{I}$ molecules is given by the product of Poisson distributions,

P(n_{A},n_{I},t)=\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}}{n_{A}!n_{I}!}\mu_{A}(t)^{n_{A}}\mu_{I}(t)^{n_{I}}. \qquad (32)

It only remains to compute $\mu_{A}(t)$ and $\mu_{I}(t)$.

Suppose that a molecule is created at time $t^{\prime}$. In order for the molecule to remain active at a later time $t$, (i) it should not undergo instantaneous degradation ($X_{A}\xrightarrow{\gamma}\varnothing$), and (ii) it should not turn into an inactive molecule ($X_{A}\xrightarrow{\beta}X_{I}$), during the intervening time. Therefore, the conditional probability $p_{A}(t-t^{\prime})$ that this molecule will remain active at $t$ is given by the exponential distribution in $t-t^{\prime}$,

p_{A}(t-t^{\prime})=e^{-a(t-t^{\prime})}, \qquad (33)

where $a\equiv\beta+\gamma$, and we get

\mu_{A}(t)=\int_{0}^{t}\alpha\,p_{A}(t-t^{\prime})dt^{\prime}=\frac{\alpha}{a}\left(1-e^{-at}\right). \qquad (34)

For a molecule created at time $t^{\prime}$ to survive at a later time $t$ as an inactive molecule, (i) it must turn into an inactive one within a short time interval $[t_{I},t_{I}+dt_{I}]$ such that $\max(t^{\prime},t-\tau)<t_{I}<t$, (ii) it should not undergo instantaneous degradation ($X_{A}\xrightarrow{\gamma}\varnothing$) between $t^{\prime}$ and $t_{I}$, and (iii) it should not undergo instantaneous degradation ($X_{I}\xrightarrow{\zeta}\varnothing$) between $t_{I}$ and $t$. For a given value of $t_{I}$, this conditional probability $\rho_{I}(t,t_{I}/t^{\prime})dt_{I}$ is given by

\rho_{I}(t,t_{I}/t^{\prime})dt_{I}=e^{-a(t_{I}-t^{\prime})}e^{-\zeta(t-t_{I})}\beta\,dt_{I}. \qquad (35)

Integrating over $t_{I}$, we get the conditional probability $p_{I}(t-t^{\prime})$ that a molecule created at time $t^{\prime}$ will survive as an inactive one at $t$:

p_{I}(t-t^{\prime}) = \int_{\max(t^{\prime},t-\tau)}^{t}\rho_{I}(t,t_{I}/t^{\prime})dt_{I}=\frac{\beta e^{at^{\prime}-\zeta t}}{\zeta-a}\left(e^{(\zeta-a)t}-e^{(\zeta-a)\max(t^{\prime},t-\tau)}\right)
 = \left\{\begin{array}{ll}\beta(\zeta-a)^{-1}\left(e^{a(t^{\prime}-t)}-e^{\zeta(t^{\prime}-t)}\right)&\quad(0\leq t-t^{\prime}\leq\tau),\\ \beta(\zeta-a)^{-1}e^{a(t^{\prime}-t)}\left(1-e^{(a-\zeta)\tau}\right)&\quad(\tau\leq t-t^{\prime}<t).\end{array}\right. \qquad (38)

The expected number of surviving inactive molecules, $\mu_{I}(t)$, is then obtained by multiplying $p_{I}(t-t^{\prime})$ by the creation rate $\alpha$ and integrating over $t^{\prime}$:

\mu_{I}(t)=\int_{0}^{t}\alpha\,p_{I}(t-t^{\prime})dt^{\prime}=\left\{\begin{array}{ll}\frac{\alpha\beta}{\zeta-a}\left[\frac{1}{a}(1-e^{-at})-\frac{1}{\zeta}(1-e^{-\zeta t})\right]&\quad(0\leq t<\tau),\\ \frac{\alpha\beta}{a}\left[\frac{1-e^{-\zeta\tau}}{\zeta}+\frac{1-e^{(a-\zeta)\tau}}{a-\zeta}e^{-at}\right]&\quad(t\geq\tau).\end{array}\right. \qquad (41)

Eq.(32) along with Eq.(34) and Eq.(41) completely specifies the time-dependent Poisson distribution when we start from zero molecules at $t=0$, which agrees with the result in ref. [16]. The total number of molecules, $n=n_{A}+n_{I}$, also follows a Poisson distribution with the expectation value $\mu(t)=\mu_{A}(t)+\mu_{I}(t)$, due to the fact that the sum of two variables that follow Poisson distributions also follows a Poisson distribution [27]. The Poisson process underlying the Poisson distribution for $n=n_{A}+n_{I}$ is the creation of particles destined to survive until time $t$, regardless of whether they survive as active or inactive.
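The closed form of $\mu_{I}(t)$ in Eq.(41) can be checked against direct numerical integration of $\alpha\,p_{I}(t-t^{\prime})$ from Eq.(38), including continuity across $t=\tau$; a minimal sketch, where the rate values are arbitrary illustrative choices:

```python
import math

# Check the closed form of mu_I(t) in Eq. (41) against midpoint-rule
# integration of alpha * p_I(t - t') over t' in [0, t], Eq. (38), and check
# continuity of the two branches at t = tau.
alpha, beta, gamma, zeta, tau = 2.0, 1.0, 0.5, 0.7, 1.5   # illustrative values
a = beta + gamma

def p_I(s):                                    # Eq. (38), with s = t - t'
    if s <= tau:
        return beta / (zeta - a) * (math.exp(-a * s) - math.exp(-zeta * s))
    return beta / (zeta - a) * math.exp(-a * s) * (1.0 - math.exp((a - zeta) * tau))

def mu_I_closed(t):                            # Eq. (41)
    if t < tau:
        return alpha * beta / (zeta - a) * ((1 - math.exp(-a * t)) / a
                                            - (1 - math.exp(-zeta * t)) / zeta)
    return alpha * beta / a * ((1 - math.exp(-zeta * tau)) / zeta
                               + (1 - math.exp((a - zeta) * tau)) / (a - zeta)
                               * math.exp(-a * t))

def mu_I_numeric(t, m=20000):                  # midpoint rule for the integral
    h = t / m
    return h * sum(alpha * p_I(t - (k + 0.5) * h) for k in range(m))

max_err = max(abs(mu_I_closed(t) - mu_I_numeric(t)) for t in (0.8, 1.5, 3.0))
jump = abs(mu_I_closed(tau - 1e-9) - mu_I_closed(tau + 1e-9))
```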

Finally, as in the case of the Markovian model in the previous section, we can derive general time-dependent distributions by combining the Poisson distribution with the multinomial distribution. Note, however, that a completely arbitrary initial distribution at $t=0$ cannot be allowed: if a non-zero initial value of $n_{I}$ were allowed, the evolution of the system for $t>0$ would be ambiguous, because the final degradation of the initial inactive molecules depends on the exact time points at $t<0$ at which they became inactivated. Therefore, I will only consider the case where only active molecules are present at $t=0$, whose number is $n_{0}$. Given an active molecule at $t=0$, the probability that it will survive as an active molecule at a later time $t$, and the probability that it will survive as an inactive molecule, are given by Eqs.(33) and (38), respectively, with $t^{\prime}=0$:

p_{A}(t) = e^{-at},
p_{I}(t) = \left\{\begin{array}{ll}\beta(\zeta-a)^{-1}\left(e^{-at}-e^{-\zeta t}\right)&\quad(t<\tau),\\ \beta(\zeta-a)^{-1}e^{-at}\left(1-e^{(a-\zeta)\tau}\right)&\quad(t\geq\tau).\end{array}\right. \qquad (44)

Therefore, the probability that there are $n^{\prime}_{A}$ surviving active molecules and $n^{\prime}_{I}$ surviving inactive molecules among the initial $n_{0}$ active molecules, at time $t$, is given by the multinomial distribution,

P_{\rm mult}(n^{\prime}_{A},n^{\prime}_{I};\{n_{0},p_{A}(t),p_{I}(t)\})\equiv\frac{n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}}(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}. \qquad (45)

The numbers of active and inactive molecules at $t$ are decomposed as $n_{A}=n^{\prime}_{A}+n^{\prime\prime}_{A}$ and $n_{I}=n^{\prime}_{I}+n^{\prime\prime}_{I}$, where $n^{\prime\prime}_{A}$ and $n^{\prime\prime}_{I}$ are the numbers of the surviving active and inactive molecules created in the time interval $[0,t]$. Therefore, the probability distribution for $n_{A}$ and $n_{I}$ at $t$ is given by the combination of the multinomial and the Poisson distributions:

P(n_{A},n_{I},t) = \sum_{n^{\prime}_{A}+n^{\prime\prime}_{A}=n_{A}}\sum_{n^{\prime}_{I}+n^{\prime\prime}_{I}=n_{I}}P_{\rm mult}(n^{\prime}_{A},n^{\prime}_{I};\{n_{0},p_{A}(t),p_{I}(t)\})P_{\rm Poisson}(n^{\prime\prime}_{A};\mu_{A}(t))P_{\rm Poisson}(n^{\prime\prime}_{I};\mu_{I}(t))
 = \sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}\frac{n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}}(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}\mu_{I}(t)^{n_{I}-n^{\prime}_{I}}}{(n_{A}-n^{\prime}_{A})!(n_{I}-n^{\prime}_{I})!}. \qquad (46)

We can check that the distribution in Eq.(46) is indeed a solution of the master equation Eq.(30) by direct substitution (Appendix E). Again, by the linearity of the master equation, the time-dependent probability distribution for an arbitrary initial distribution of active molecules,

P(nA0,nI0,0)=v(nA0)δnI0,0,P(n^{0}_{A},n^{0}_{I},0)=v(n^{0}_{A})\delta_{n^{0}_{I},0}, (47)

is obtained by the superposition of Eq.(46) weighted by v(n0)v(n_{0}),

P(nA,nI,t)\displaystyle P(n_{A},n_{I},t) =\displaystyle= n0nAnIv(n0)n0!nA!nI!(n0nAnI)!pA(t)nApI(t)nI\displaystyle\sum_{n_{0}}\sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}v(n_{0})\frac{n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}} (48)
×(1pA(t)pI(t))n0nAnI\displaystyle\times(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}
×eμA(t)μI(t)μA(t)nAnAμI(t)nInI(nAnA)!(nInI)!.\displaystyle\times\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}\mu_{I}(t)^{n_{I}-n^{\prime}_{I}}}{(n_{A}-n^{\prime}_{A})!(n_{I}-n^{\prime}_{I})!}.
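As a numerical illustration (not part of the derivation above), the distribution of Eq.(46) can be evaluated directly and checked against its normalization and first moments, which by construction are n0pA(t)+μA(t) and n0pI(t)+μI(t). The following Python sketch does this for one set of illustrative parameter values; the helper name p_joint and all numbers are chosen for the example only.

```python
from math import exp, factorial

def p_joint(nA, nI, n0, pA, pI, muA, muI):
    """Joint distribution of Eq.(46): multinomial survival of the n0 initial
    active molecules, convolved with independent Poisson counts for the
    molecules created after t = 0."""
    total = 0.0
    for a in range(min(nA, n0) + 1):          # a = n'_A, surviving initial molecules, still active
        for i in range(min(nI, n0 - a) + 1):  # i = n'_I, surviving initial molecules, now inactive
            mult = (factorial(n0) / (factorial(a) * factorial(i) * factorial(n0 - a - i))
                    * pA**a * pI**i * (1.0 - pA - pI)**(n0 - a - i))
            pois = (exp(-muA - muI) * muA**(nA - a) * muI**(nI - i)
                    / (factorial(nA - a) * factorial(nI - i)))
            total += mult * pois
    return total

# illustrative parameter values (not taken from the model equations)
n0, pA, pI, muA, muI = 5, 0.3, 0.2, 1.5, 0.8
N = 60  # truncation of the sums; the neglected tail mass is negligible here
Z = sum(p_joint(a, i, n0, pA, pI, muA, muI) for a in range(N) for i in range(N))
mA = sum(a * p_joint(a, i, n0, pA, pI, muA, muI) for a in range(N) for i in range(N))
mI = sum(i * p_joint(a, i, n0, pA, pI, muA, muI) for a in range(N) for i in range(N))
```

The normalization Z and the means mA and mI agree with 1, n0pA+μA, and n0pI+μI to numerical precision, as expected from the decomposition nA=n′A+n′′A, nI=n′I+n′′I.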

VII Discussions

Poisson distributions appear ubiquitously in stochastic dynamics of gene expression, and the Poisson noise is considered to be the most basic type of noise when analyzing various components of stochastic fluctuations. However, when gene products are allowed to be degraded, it has not been clear whether the molecule number following a Poisson distribution is equal to the number of certain independent events in time, and if so, what the corresponding events are. In this work, I answered this question by showing that the number of molecules distributed according to the Poisson distribution is equal to the number of creation events for molecules that are destined to survive until the end of a given time period, which are indeed independent events in time that form an inhomogeneous Poisson process. Using this viewpoint, I derived the Poisson distribution not only for the Markovian model with time-dependent rates, but also for the model with time-delayed degradation, without performing the difficult task of solving the non-Markovian master equation. Furthermore, I obtained the time-dependent distribution for an arbitrary initial distribution in the case of the Markovian model, by combining the Poisson distribution with the binomial distribution, and the time-dependent distribution for an arbitrary initial distribution of active molecules in the case of the non-Markovian model, by combining the Poisson distribution with the multinomial distribution.

Of course, the molecule number follows the Poisson distribution only under the simplifying assumption of independent creation events. In the case of proteins, the number of protein molecules follows the Poisson distribution only if we approximate transcription and translation as a one-step process. In reality, an mRNA molecule has a non-zero lifetime, during which proteins are translated at a certain rate, leading to bursty translation [8, 9, 10]. The creations of protein molecules do not form a Poisson process in such a model, since the translation rate depends on the mRNA concentration. The Poissonian description also breaks down in most models with stochastic rates [28, 21, 23, 24]. In this class of models, the creation rate itself is a random variable distributed according to a probability distribution. A model with a stochastic creation rate can be used as an approximate model to describe the regulation of expression by transcription factor binding [28, 23], or to emulate the extrinsic noise due to heterogeneous cellular environments [23]. When the creation rate is a stochastic variable, the creations of the molecules form a Poisson process only if the creation rate has no temporal correlations, which does not hold for most non-trivial models. Despite these limitations, the general analytic solutions found in this work may be of value as a basis for perturbative treatments that yield more realistic descriptions of the gene regulatory network. For example, one may numerically solve the master equation for the stochastic rate, and then use the analytic solution for a given realization of the creation rate, so that the expectation values of various physical quantities are expressed as weighted averages over numerical distributions of the stochastic rates.
Similar analyses may be done for other sophisticated models of gene regulatory network by combining the analytic solutions found in the current work with other analytic solutions and/or numerical computations.
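The averaging procedure over realizations of a stochastic creation rate can be sketched in a minimal form. The example below assumes, purely for illustration, a static creation rate drawn from a gamma distribution (emulating extrinsic noise); for each realization the analytic solution is a Poisson distribution with mean μ(t)=(α/β)(1−e^{−βt}), and the observed distribution is the weighted average over samples. The distribution choice and all numerical values are hypothetical.

```python
import math
import random

random.seed(0)

def poisson_mean(alpha, beta, t):
    # Poisson mean for a constant creation rate alpha and degradation rate
    # beta, starting from zero molecules: mu(t) = (alpha/beta)(1 - e^{-beta t})
    return alpha / beta * (1.0 - math.exp(-beta * t))

def mixed_pmf(n, beta, t, alpha_samples):
    # Weighted average of the analytic Poisson solution over sampled
    # realizations of the (static) stochastic creation rate.
    total = 0.0
    for alpha in alpha_samples:
        mu = poisson_mean(alpha, beta, t)
        total += math.exp(-mu) * mu**n / math.factorial(n)
    return total / len(alpha_samples)

# hypothetical gamma-distributed static rate, emulating extrinsic noise
alphas = [random.gammavariate(4.0, 0.5) for _ in range(20000)]
pmf = [mixed_pmf(n, beta=1.0, t=2.0, alpha_samples=alphas) for n in range(40)]
mean = sum(n * p for n, p in enumerate(pmf))
```

The resulting mixture is no longer Poissonian (its variance exceeds its mean), illustrating how rate heterogeneity breaks the Poisson statistics while each conditional distribution remains exactly Poisson.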

Acknowledgements.
This work was supported by the National Research Foundation of Korea, funded by the Ministry of Science and ICT (NRF-2020R1A2C1005956).

References

  • Thattai and van Oudenaarden [2001] M. Thattai and A. van Oudenaarden, Intrinsic noise in gene regulatory networks, Proc. Natl. Acad. Sci. 98, 8614 (2001).
  • Elowitz et al. [2002] M. B. Elowitz, A. J. Levine, E. D. Siggia, and P. S. Swain, Stochastic gene expression in a single cell, Science 297, 1183 (2002).
  • Swain et al. [2002] P. S. Swain, M. B. Elowitz, and E. D. Siggia, Intrinsic and extrinsic contributions to stochasticity in gene expression, Proc. Natl. Acad. Sci. 99, 12795 (2002).
  • Paulsson [2004] J. Paulsson, Summing up the noise in gene networks, Nature (London) 427, 415 (2004).
  • Raser and O’Shea [2004] J. M. Raser and E. K. O’Shea, Control of stochasticity in eukaryotic gene expression, Science 304, 1811 (2004).
  • Golding et al. [2005] I. Golding, J. Paulsson, S. M. Zawilski, and E. C. Cox, Real-time kinetics of gene activity in individual bacteria, Cell 123, 1025 (2005).
  • Newman et al. [2006] J. R. S. Newman, S. Ghaemmaghami, J. Ihmels, D. K. Breslow, M. Noble, J. L. DeRisi, and J. S. Weissman, Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noise, Nature (London) 441, 840 (2006).
  • Cai et al. [2006] L. Cai, N. Friedman, and X. S. Xie, Stochastic protein expression in individual cells at the single molecule level, Nature (London) 440, 358 (2006).
  • Friedman et al. [2006] N. Friedman, L. Cai, and X. S. Xie, Linking stochastic dynamics to population distribution: An analytical framework of gene expression, Phys. Rev. Lett. 97, 168302 (2006).
  • Yu et al. [2006] J. Yu, J. Xiao, X. Ren, K. Lao, and X. S. Xie, Probing gene expression in live cells, one protein molecule at a time, Science 311, 1600 (2006).
  • Lipshtat et al. [2006] A. Lipshtat, A. Loinger, N. Q. Balaban, and O. Biham, Genetic toggle switch without cooperative binding, Phys. Rev. Lett. 96, 188101 (2006).
  • Raj and van Oudenaarden [2008] A. Raj and A. van Oudenaarden, Nature, nurture, or chance: Stochastic gene expression and its consequences, Cell 135, 216 (2008).
  • Eldar and Elowitz [2010] A. Eldar and M. B. Elowitz, Functional roles for noise in genetic circuits, Nature 467, 167 (2010).
  • Taniguchi et al. [2010] Y. Taniguchi, P. J. Choi, G.-W. Li, H. Chen, M. Babu, J. Hearn, A. Emili, and X. S. Xie, Quantifying E. coli proteome and transcriptome with single-molecule sensitivity in single cells, Science 329, 533 (2010).
  • Miȩkisz et al. [2011] J. Miȩkisz, J. Poleszczuk, M. Bodnar, and U. Foryś, Stochastic models of gene expression with delayed degradation, Bull. Math. Biol. 73, 2231 (2011).
  • Lafuerza and Toral [2011] L. F. Lafuerza and R. Toral, Exact solution of a stochastic protein dynamics model with delayed degradation, Phys. Rev. E 84, 051121 (2011).
  • Neuert et al. [2013] G. Neuert, B. Munsky, R. Z. Tan, L. Teytelman, M. Khammash, and A. van Oudenaarden, Systematic identification of signal-activated stochastic gene regulation, Science 339, 584 (2013).
  • Sanchez and Golding [2013] A. Sanchez and I. Golding, Genetic determinants and cellular constraints in noisy gene expression, Science 342, 1188 (2013).
  • Kumar et al. [2014] N. Kumar, T. Platini, and R. V. Kulkarni, Exact distributions for stochastic gene expression models with bursting and feedback, Phys. Rev. Lett. 113, 268105 (2014).
  • Ge et al. [2015] H. Ge, H. Qian, and X. S. Xie, Stochastic phenotype transition of a single cell in an intermediate region of gene state switching, Phys. Rev. Lett. 114, 078101 (2015).
  • Lim et al. [2015] Y. R. Lim, J.-H. Kim, S. J. Park, G.-S. Yang, S. Song, S.-K. Chang, N. K. Lee, and J. Sung, Quantitative understanding of probabilistic behavior of living cells operated by vibrant intracellular networks, Phys. Rev. X 5, 031014 (2015).
  • Lee and Lee [2018] J. Lee and J. Lee, Quantitative analysis of a transient dynamics of a gene regulatory network, Phys. Rev. E 98, 062404 (2018).
  • Ham et al. [2020] L. Ham, R. D. Brackston, and M. P. Stumpf, Extrinsic noise and heavy-tailed laws in gene expression, Phys. Rev. Lett. 124, 108101 (2020).
  • Lin and Amir [2021] J. Lin and A. Amir, Disentangling intrinsic and extrinsic gene expression noise in growing cells, Phys. Rev. Lett. 126, 078101 (2021).
  • Van Kampen [2007] N. G. Van Kampen, Stochastic Processes in Physics and Chemistry (Elsevier, Amsterdam, 2007).
  • Ross [1996] S. M. Ross, Stochastic Processes (Wiley, Hoboken, 1996).
  • Stirzaker [2005] D. Stirzaker, Stochastic Processes & Models (Oxford, New York, 2005).
  • Peccoud and Ycart [1995] J. Peccoud and B. Ycart, Markovian modeling of gene-product synthesis, Theor. Popul. Biol. 48, 222 (1995).
  • Isaacson and Madsen [1976] D. L. Isaacson and R. W. Madsen, Markov Chains, Theory and Applications (Wiley, New York, 1976).

Appendix A The Poisson distribution as the limit of the distribution for independent trials

The number of successes in NN independent trials, with the success probability at the ii-th trial being pip_{i}, follows a probability distribution that generalizes the binomial distribution,

P~binom(n;{p1,,pN})={i1<i2<in}pi1pi2pink{i1,,in}(1pk)\tilde{P}_{\rm\ binom}(n;\{p_{1},\cdots,p_{N}\})=\sum_{\{i_{1}<i_{2}\cdots<i_{n}\}}p_{i_{1}}p_{i_{2}}\cdots p_{i_{n}}\prod_{k\notin\{i_{1},\cdots,i_{n}\}}\left(1-p_{k}\right) (49)

I want to show that the Poisson distribution can be obtained from this expression in the limit of NN\to\infty with μ=j=1Npj\mu=\sum_{j=1}^{N}p_{j} fixed. One can derive the desired result using the generating function F(z)nP(n)znF(z)\equiv\sum_{n}P(n)z^{n}. On one hand, the generating function F~(z;{p1,,pN})binom\tilde{F}(z;\{p_{1},\cdots,p_{N}\})_{\rm binom} for the distribution P~binom(n;{p1,,pN})\tilde{P}_{\rm\ binom}(n;\{p_{1},\cdots,p_{N}\}) is

F~(z;{p1,,pN})binom\displaystyle\tilde{F}(z;\{p_{1},\cdots,p_{N}\})_{\rm binom} \displaystyle\equiv znP~binom(n;{p1,,pN})\displaystyle\sum z^{n}\tilde{P}_{\rm\ binom}(n;\{p_{1},\cdots,p_{N}\}) (50)
=\displaystyle= n=0Nzn{i1<i2<in}pi1pi2pink{i1,,in}(1pk)\displaystyle\sum_{n=0}^{N}z^{n}\sum_{\{i_{1}<i_{2}\cdots<i_{n}\}}p_{i_{1}}p_{i_{2}}\cdots p_{i_{n}}\prod_{k\notin\{i_{1},\cdots,i_{n}\}}\left(1-p_{k}\right)
=\displaystyle= n=0N{i1<i2<in}(zpi1)(zpi2)(zpin)k{i1,,in}(1pk)\displaystyle\sum_{n=0}^{N}\sum_{\{i_{1}<i_{2}\cdots<i_{n}\}}(zp_{i_{1}})(zp_{i_{2}})\cdots(zp_{i_{n}})\prod_{k\notin\{i_{1},\cdots,i_{n}\}}\left(1-p_{k}\right)
=\displaystyle= j=1N[zpj+(1pj)]\displaystyle\prod_{j=1}^{N}\left[zp_{j}+(1-p_{j})\right]

On the other hand, the generating function FPoisson(z)F_{\rm Poisson}(z) for the Poisson distribution is

FPoisson(z;μ)=nznμneμn!=eμ(z1).F_{\rm Poisson}(z;\mu)=\sum_{n}\frac{z^{n}\mu^{n}e^{-\mu}}{n!}=e^{\mu(z-1)}. (51)

Now, Eq.(50) can be rewritten as

F~(z;{p1,,pN})binom\displaystyle\tilde{F}(z;\{p_{1},\cdots,p_{N}\})_{\rm binom} =\displaystyle= j=1N[1+(z1)pj]=j=1N[e(z1)pj+O(pj2)]\displaystyle\prod_{j=1}^{N}\left[1+(z-1)p_{j}\right]=\prod_{j=1}^{N}\left[e^{(z-1)p_{j}}+O(p_{j}^{2})\right] (52)
=\displaystyle= eμ(z1)+O(1/N).\displaystyle e^{\mu(z-1)}+O(1/N).

Therefore,

limNF~(z;{p1,,pN})binom=FPoisson(z;μ),\lim_{N\to\infty}\tilde{F}(z;\{p_{1},\cdots,p_{N}\})_{\rm binom}=F_{\rm Poisson}(z;\mu), (53)

from which we deduce

limNP~binom(n;{p1,,pN})=PPoisson(n;μ).\lim_{N\to\infty}\tilde{P}_{\rm\ binom}(n;\{p_{1},\cdots,p_{N}\})=P_{\rm\ Poisson}(n;\mu). (54)

Since the binomial distribution is a special case of P~binom(n;{p1,,pN})\tilde{P}_{\rm\ binom}(n;\{p_{1},\cdots,p_{N}\}) with pj=pp_{j}=p for all jj, it is easy to see that the Poisson distribution is obtained from the binomial distribution by taking the limit of NN\to\infty with μ=Np\mu=Np fixed.
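The convergence in Eq.(54) can also be observed numerically. The sketch below builds the exact distribution of Eq.(49) by sequential convolution over trials, with deliberately unequal success probabilities (chosen here, for illustration, proportional to the trial index with fixed sum μ), and compares it pointwise to the Poisson distribution.

```python
from math import exp, factorial

def indep_trials_pmf(ps):
    # Exact distribution of the number of successes in independent but
    # non-identical trials (Eq.(49)), built by sequential convolution.
    dist = [1.0]
    for p in ps:
        new = [0.0] * (len(dist) + 1)
        for n, prob in enumerate(dist):
            new[n] += prob * (1.0 - p)   # trial fails
            new[n + 1] += prob * p       # trial succeeds
        dist = new
    return dist

mu, N = 2.0, 2000
# unequal success probabilities with fixed sum mu (here p_j proportional to j)
s = N * (N + 1) / 2
ps = [mu * j / s for j in range(1, N + 1)]
pmf = indep_trials_pmf(ps)
poisson = [exp(-mu) * mu**n / factorial(n) for n in range(15)]
max_err = max(abs(pmf[n] - poisson[n]) for n in range(15))
```

For N=2000 the pointwise deviation from the Poisson distribution is already of order the bound ∑pj², consistent with the O(1/N) correction in Eq.(52).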

Appendix B The superposition of binomial distribution weighted by a Poisson distribution is again a Poisson distribution

Consider a binomial distribution for n0n_{0} independent and identical trials with success probability pp at each trial, and suppose that n0n_{0} is itself stochastic, distributed according to a Poisson distribution with expectation value μ\mu. Then the number nn of successes again follows a Poisson distribution, with the expected number of events being μp\mu p [26]:

n0PPoisson(n0;μ)Pbinom(n;{n0,p})=PPoisson(n;μp).\sum_{n_{0}}P_{\rm Poisson}(n_{0};\mu)P_{\rm binom}(n;\{n_{0},p\})=P_{\rm Poisson}(n;\mu p). (55)

The Poisson distribution on the left-hand side of Eq.(55) can be considered to come from a Poisson process where an event happens within a short time interval [t,t+dt][t,t+dt] with probability λ(t)dt\lambda(t)dt, so that μ=0tλ(t)𝑑t\mu=\int_{0}^{t}\lambda(t^{\prime})dt^{\prime}. Now, whenever such an event happens, we also toss a coin with head probability pp, and count the event only when the coin produces a head. It is intuitively clear that the number of counted events in the time interval [0,t][0,t] follows the Poisson distribution with expected number μp\mu p, as given on the right-hand side of Eq.(55). Before formally proving Eq.(55), I first prove the discrete version obtained by replacing the Poisson distributions in Eq.(55) by binomial distributions,

n0Pbinom(n0;{N,q})Pbinom(n;{n0,p})=Pbinom(n;{N,qp}),\sum_{n_{0}}P_{\rm binom}(n_{0};\{N,q\})P_{\rm binom}(n;\{n_{0},p\})=P_{\rm binom}(n;\{N,qp\}), (56)

which may be easier to grasp intuitively. This expression arises in the situation where we toss two coins NN times, with the head probabilities of the two coins at each trial being pp and qq, respectively. A success at a given trial is defined as the event that both coins produce heads. Then the number of successes follows the binomial distribution with success probability pqpq at each trial, which is the right-hand side of Eq.(56). The left-hand side is its decomposition using a conditional probability. We first compute the probability Pbinom(n0;{N,q})P_{\rm binom}(n_{0};\{N,q\}) that the first coin produces heads n0n_{0} times. We then compute the probability Pbinom(n;{n0,p})P_{\rm binom}(n;\{n_{0},p\}) that among the n0n_{0} trials where the first coin produces a head, nn of them also have the second coin producing a head. It is intuitively clear that the summation on the left-hand side is equal to the right-hand side, but one can also show explicitly that

n0Pbinom(n0;{N,q})Pbinom(n;{n0,p})\displaystyle\sum_{n_{0}}P_{\rm binom}(n_{0};\{N,q\})P_{\rm binom}(n;\{n_{0},p\}) (57)
=\displaystyle= n0N!n0!(Nn0)!qn0(1q)Nn0×n0!n!(n0n)!pn(1p)n0n\displaystyle\sum_{n_{0}}\frac{N!}{n_{0}!(N-n_{0})!}q^{n_{0}}(1-q)^{N-n_{0}}\times\frac{n_{0}!}{n!(n_{0}-n)!}p^{n}(1-p)^{n_{0}-n}
=\displaystyle= N!(Nn)!n!(pq)nn0(Nn)!(Nn0)!(n0n)!qn0n(1p)n0n(1q)Nn0\displaystyle\frac{N!}{(N-n)!n!}(pq)^{n}\sum_{n_{0}}\frac{(N-n)!}{(N-n_{0})!(n_{0}-n)!}q^{n_{0}-n}(1-p)^{n_{0}-n}(1-q)^{N-n_{0}}
=\displaystyle= N!(Nn)!n!(pq)nj(Nn)!(Nnj)!j![q(1p)]j(1q)Nnj\displaystyle\frac{N!}{(N-n)!n!}(pq)^{n}\sum_{j}\frac{(N-n)!}{(N-n-j)!j!}[q(1-p)]^{j}(1-q)^{N-n-j}
=\displaystyle= N!(Nn)!n!(pq)n[q(1p)+1q]Nn=N!(Nn)!n!(pq)n[1pq]Nn\displaystyle\frac{N!}{(N-n)!n!}(pq)^{n}\left[q(1-p)+1-q\right]^{N-n}=\frac{N!}{(N-n)!n!}(pq)^{n}\left[1-pq\right]^{N-n}
=\displaystyle= Pbinom(n;{N,qp}),\displaystyle P_{\rm binom}(n;\{N,qp\}),

proving Eq.(56). It is straightforward to extend the proof to the inhomogeneous case where pp and qq are different for each trial, and Eq.(55) is then obtained in the limit of NN\to\infty with μ=i=1Nqi\mu=\sum_{i=1}^{N}q_{i} fixed. However, one can also prove Eq.(55) directly:

n0PPoisson(n0;μ)Pbinom(n;{n0,p})\displaystyle\sum_{n_{0}}P_{\rm Poisson}(n_{0};\mu)P_{\rm binom}(n;\{n_{0},p\}) (58)
=\displaystyle= n0eμμn0n0!×n0!n!(n0n)!pn(1p)n0n=eμn!(μp)nn0μn0n(n0n)!(1p)n0n\displaystyle\sum_{n_{0}}\frac{e^{-\mu}\mu^{n_{0}}}{n_{0}!}\times\frac{n_{0}!}{n!(n_{0}-n)!}p^{n}(1-p)^{n_{0}-n}=\frac{e^{-\mu}}{n!}(\mu p)^{n}\sum_{n_{0}}\frac{\mu^{n_{0}-n}}{(n_{0}-n)!}(1-p)^{n_{0}-n}
=\displaystyle= eμn!(μp)nj[μ(1p)]jj!=eμn!(μp)neμ(1p)=eμpn!(μp)n=PPoisson(n;μp).\displaystyle\frac{e^{-\mu}}{n!}(\mu p)^{n}\sum_{j}\frac{[\mu(1-p)]^{j}}{j!}=\frac{e^{-\mu}}{n!}(\mu p)^{n}e^{\mu(1-p)}=\frac{e^{-\mu p}}{n!}(\mu p)^{n}=P_{\rm Poisson}(n;\mu p).
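Both thinning identities, Eq.(56) and Eq.(55), are easy to verify numerically. The sketch below evaluates the left- and right-hand sides for illustrative parameter values; the truncation K of the sum over n0 in Eq.(55) is only a numerical convenience, since the Poisson tail beyond it is negligible.

```python
from math import comb, exp, factorial

def binom_pmf(n, N, p):
    return comb(N, n) * p**n * (1.0 - p)**(N - n)

def poisson_pmf(n, mu):
    return exp(-mu) * mu**n / factorial(n)

N, p, q, mu = 30, 0.4, 0.25, 3.0

# Eq.(56): a binomial thinned by a binomial is binomial with success prob q*p
lhs56 = [sum(binom_pmf(n0, N, q) * binom_pmf(n, n0, p) for n0 in range(n, N + 1))
         for n in range(N + 1)]
rhs56 = [binom_pmf(n, N, q * p) for n in range(N + 1)]

# Eq.(55): a Poisson thinned by a binomial is Poisson with mean mu*p
K = 100  # truncation of the sum over n0; the Poisson tail beyond K is negligible
lhs55 = [sum(poisson_pmf(n0, mu) * binom_pmf(n, n0, p) for n0 in range(n, K))
         for n in range(40)]
rhs55 = [poisson_pmf(n, mu * p) for n in range(40)]
```

Both comparisons agree to machine precision, matching the algebraic proofs of Eqs.(57) and (58).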

Appendix C The time-dependent distribution of the Markovian model with non-zero initial number of molecules (Eq.(27)) is the solution of the master equation (23).

To show that the distribution given by Eq.(27) is the solution of the master equation, it is convenient to use the generating function F(z,t)nP(n,t)znF(z,t)\equiv\sum_{n}P(n,t)z^{n}. Then the master equation (23) turns into

tF(z,t)=(z1)(α(t)β(t)z)F(z,t).\partial_{t}F(z,t)=(z-1)(\alpha(t)-\beta(t)\partial_{z})F(z,t). (59)

The generating function for the distribution in Eq.(27) is

F(z,t)\displaystyle F(z,t) =\displaystyle= nznn1n0!n1!(n0n1)!eμ(t)μ(t)nn1(nn1)!psurv(t,0)n1(1psurv(t,0))n0n1\displaystyle\sum_{n}z^{n}\sum_{n_{1}}\frac{n_{0}!}{n_{1}!(n_{0}-n_{1})!}\frac{e^{-\mu(t)}\mu(t)^{n-n_{1}}}{(n-n_{1})!}p_{\rm surv}(t,0)^{n_{1}}(1-p_{\rm surv}(t,0))^{n_{0}-n_{1}} (60)
=\displaystyle= n1zn1n0!n1!(n0n1)!psurv(t,0)n1(1psurv(t,0))n0n1×n2eμ(t)μ(t)n2zn2(n2)!\displaystyle\sum_{n_{1}}z^{n_{1}}\frac{n_{0}!}{n_{1}!(n_{0}-n_{1})!}p_{\rm surv}(t,0)^{n_{1}}(1-p_{\rm surv}(t,0))^{n_{0}-n_{1}}\times\sum_{n_{2}}\frac{e^{-\mu(t)}\mu(t)^{n_{2}}z^{n_{2}}}{(n_{2})!}
=\displaystyle= [1+psurv(t,0)(z1)]n0eμ(t)(z1)\displaystyle\left[1+p_{\rm surv}(t,0)(z-1)\right]^{n_{0}}e^{\mu(t)(z-1)}

On one hand, by substituting F(z,t)F(z,t) into the left-hand side of Eq.(59), we get

tF(z,t)\partial_{t}F(z,t) =\displaystyle= n0(z1)[1+psurv(t,0)(z1)]n01p˙surv(t,0)eμ(t)(z1)\displaystyle n_{0}(z-1)\left[1+p_{\rm surv}(t,0)(z-1)\right]^{n_{0}-1}\dot{p}_{\rm surv}(t,0)e^{\mu(t)(z-1)} (61)
+\displaystyle+ (z1)[1+psurv(t,0)(z1)]n0eμ(t)(z1)μ˙(t).\displaystyle(z-1)\left[1+p_{\rm surv}(t,0)(z-1)\right]^{n_{0}}e^{\mu(t)(z-1)}\dot{\mu}(t).

On the other hand, by substituting F(z,t)F(z,t) into the right-hand side of Eq.(59), we get

(z1)(α(t)β(t)z)F(z,t)\displaystyle(z-1)(\alpha(t)-\beta(t)\partial_{z})F(z,t) =\displaystyle= α(t)(z1)[1+psurv(t,0)(z1)]n0eμ(t)(z1)\displaystyle\alpha(t)(z-1)\left[1+p_{\rm surv}(t,0)(z-1)\right]^{n_{0}}e^{\mu(t)(z-1)} (62)
\displaystyle- n0β(t)(z1)[1+psurv(t,0)(z1)]n01psurv(t,0)eμ(t)(z1)\displaystyle n_{0}\beta(t)(z-1)\left[1+p_{\rm surv}(t,0)(z-1)\right]^{n_{0}-1}p_{\rm surv}(t,0)e^{\mu(t)(z-1)}
\displaystyle- β(t)(z1)μ(t)[1+psurv(t,0)(z1)]n0eμ(t)(z1)\displaystyle\beta(t)(z-1)\mu(t)\left[1+p_{\rm surv}(t,0)(z-1)\right]^{n_{0}}e^{\mu(t)(z-1)}

Since

p˙surv(t,0)\displaystyle\dot{p}_{\rm surv}(t,0) =\displaystyle= β(t)psurv(t,0),\displaystyle-\beta(t)p_{\rm surv}(t,0),
μ˙(t)\displaystyle\dot{\mu}(t) =\displaystyle= α(t)β(t)μ(t),\displaystyle\alpha(t)-\beta(t)\mu(t), (63)

which can be easily checked by taking the time derivatives of psurv(t,0)p_{\rm surv}(t,0) and μ(t)\mu(t) in Eqs.(21) and (22), we see that the expressions in Eq.(61) and Eq.(62) are equal. Therefore, the distribution in Eq.(27) is the solution of the master equation (23).
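The same check can be reproduced numerically in the number representation. For constant rates, Eqs.(21) and (22) give psurv(t,0)=e^{−βt} and μ(t)=(α/β)(1−e^{−βt}), which satisfy Eq.(63). The sketch below compares a central finite-difference estimate of ∂tP(n,t) for the distribution of Eq.(27) with the birth-death master equation implied by the generating-function equation (59); the parameter values are illustrative.

```python
from math import comb, exp, factorial

alpha, beta, n0 = 2.0, 0.7, 4   # illustrative constant rates and initial number

def p_surv(t):
    return exp(-beta * t)                         # Eq.(21) with constant beta

def mu(t):
    return alpha / beta * (1.0 - exp(-beta * t))  # Eq.(22) with constant rates

def P(n, t):
    # Eq.(27): binomial survival of the n0 initial molecules, convolved with
    # a Poisson count of the molecules created in (0, t]
    s, m = p_surv(t), mu(t)
    return sum(comb(n0, k) * s**k * (1.0 - s)**(n0 - k)
               * exp(-m) * m**(n - k) / factorial(n - k)
               for k in range(min(n, n0) + 1))

def master_rhs(n, t):
    # birth-death master equation implied by Eq.(59):
    # dP(n)/dt = alpha [P(n-1) - P(n)] + beta [(n+1) P(n+1) - n P(n)]
    gain = (alpha * P(n - 1, t) if n > 0 else 0.0) + beta * (n + 1) * P(n + 1, t)
    return gain - (alpha + beta * n) * P(n, t)

t, h = 1.3, 1e-5
max_err = max(abs((P(n, t + h) - P(n, t - h)) / (2.0 * h) - master_rhs(n, t))
              for n in range(12))
```

The finite-difference time derivative agrees with the master-equation right-hand side for every n tested, as the generating-function argument guarantees.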

Appendix D The Poisson distribution is the limit of the distribution for independent trials, when there are multiple alternatives

The binomial distribution arises in independent and identical trials when there are only two alternatives at each trial. When there are mm alternatives, the numbers of outcomes (n1,nm1)(n_{1},\cdots n_{m-1}) in NN total trials follow the multinomial distribution

Pmult(n1,,nm1;{N,p1,pm1})=N!n1!n2!nm!p1n1p2n2pmnmP_{\rm mult}(n_{1},\cdots,n_{m-1};\{N,p_{1},\cdots p_{m-1}\})=\frac{N!}{n_{1}!n_{2}!\cdots n_{m}!}p_{1}^{n_{1}}p_{2}^{n_{2}}\cdots p_{m}^{n_{m}} (64)

if the probability of the kk-th alternative happening at each trial is pkp_{k}, where nm=Ni=1m1nin_{m}=N-\sum_{i=1}^{m-1}n_{i} and pm=1i=1m1pip_{m}=1-\sum_{i=1}^{m-1}p_{i}. The corresponding generating function is:

Fmult(z1,z2zm1)\displaystyle F_{\rm mult}(z_{1},z_{2}\cdots z_{m-1}) \displaystyle\equiv n1,nm1Pmult(n1,,nm1)z1n1zm1nm1\displaystyle\sum_{n_{1},\cdots n_{m-1}}P_{\rm mult}\left(n_{1},\cdots,n_{m-1}\right)z_{1}^{n_{1}}\cdots z_{m-1}^{n_{m-1}} (65)
=\displaystyle= n1,nm1N!n1!n2!nm!p1n1p2n2pmnmz1n1zm1nm1\displaystyle\sum_{n_{1},\cdots n_{m-1}}\frac{N!}{n_{1}!n_{2}!\cdots n_{m}!}p_{1}^{n_{1}}p_{2}^{n_{2}}\cdots p_{m}^{n_{m}}z_{1}^{n_{1}}\cdots z_{m-1}^{n_{m-1}}
=\displaystyle= [pm+z1p1++zm1pm1]N\displaystyle\left[p_{m}+z_{1}p_{1}+\cdots+z_{m-1}p_{m-1}\right]^{N}

It is straightforward to write down the generating function for the inhomogeneous counterpart,

F~mult(z1,z2zm1)\displaystyle\tilde{F}_{\rm mult}(z_{1},z_{2}\cdots z_{m-1}) =\displaystyle= j=1N[pm(j)+z1p1(j)++zm1pm1(j)]\displaystyle\prod_{j=1}^{N}\left[p_{m}^{(j)}+z_{1}p_{1}^{(j)}+\cdots+z_{m-1}p_{m-1}^{(j)}\right] (66)
=\displaystyle= j=1N[1+(z11)p1(j)++(zm11)pm1(j)],\displaystyle\prod_{j=1}^{N}\left[1+(z_{1}-1)p_{1}^{(j)}+\cdots+(z_{m-1}-1)p_{m-1}^{(j)}\right],

where pk(j)p_{k}^{(j)} denotes the probability of the occurrence of kk-th alternative happening at jj-th trial, with pm(j)1k=1m1pk(j)p_{m}^{(j)}\equiv 1-\sum_{k=1}^{m-1}p_{k}^{(j)}.

We now take the limit NN\to\infty, with μk=jpk(j)\mu_{k}=\sum_{j}p^{(j)}_{k} fixed for 1km11\leq k\leq m-1. We get

F~mult(z1,z2zm1)\displaystyle\tilde{F}_{\rm mult}(z_{1},z_{2}\cdots z_{m-1}) =\displaystyle= j=1N[exp((z11)p1(j)++(zm11)pm1(j))+O(1/N2)]\displaystyle\prod_{j=1}^{N}\left[\exp\left((z_{1}-1)p_{1}^{(j)}+\cdots+(z_{m-1}-1)p_{m-1}^{(j)}\right)+O(1/N^{2})\right] (67)
=\displaystyle= exp((z11)j=1Np1(j)++(zm11)j=1Npm1(j))+O(1/N)\displaystyle\exp\left((z_{1}-1)\sum_{j=1}^{N}p_{1}^{(j)}+\cdots+(z_{m-1}-1)\sum_{j=1}^{N}p_{m-1}^{(j)}\right)+O(1/N)
→exp((z11)μ1++(zm11)μm1),\displaystyle\to\ \exp\left((z_{1}-1)\mu_{1}+\cdots+(z_{m-1}-1)\mu_{m-1}\right),

which is nothing but the generating function for m1m-1 independent Poisson distributions, with expected number of occurrences of kk-th alternative being μk\mu_{k}.

Note that for finite NN, the events k=1,m1k=1,\cdots m-1 are exclusive and therefore not independent. However, even if we assume they are independent, the probability of more than one such event occurring in the same trial becomes negligible in the limit NN\to\infty, because they are rare events with probabilities of O(1/N)O(1/N). Therefore, the exclusive and the independent events become indistinguishable. Let us illustrate this point with the homogeneous case with m=3m=3. The corresponding multinomial distribution describes the case where we toss a three-faced coin, with the two heads denoted as AA and BB. There are three possible outcomes at each trial: the head AA with probability pp, the head BB with probability qq, or the tail with probability 1pq1-p-q. The probability distribution is given by the multinomial distribution,

P(nA,nB)=N!nA!nB!(NnAnB)!pnAqnB(1pq)NnAnB,P(n_{A},n_{B})=\frac{N!}{n_{A}!n_{B}!(N-n_{A}-n_{B})!}p^{n_{A}}q^{n_{B}}(1-p-q)^{N-n_{A}-n_{B}}, (68)

which was already shown above to approach the Poisson distribution

P(nA,nB)=eμAμBnA!nB!μAnAμBnB.P(n_{A},n_{B})=\frac{e^{-\mu_{A}-\mu_{B}}}{n_{A}!n_{B}!}\mu_{A}^{n_{A}}\mu_{B}^{n_{B}}. (69)

in the limit of NN\to\infty with μA=Np\mu_{A}=Np and μB=Nq\mu_{B}=Nq fixed. Now compare this with the case where we toss two independent two-sided coins denoted as AA and BB at each trial, which produce heads with probabilities pp and qq, respectively. In contrast to the previous model, outcomes of head AA and head BB are independent, and the simultaneous heads of AA and BB are now allowed so that there are four outcomes at each trial. The probability distribution for nAn_{A} and nBn_{B} is now the product of binomial distributions,

P(nA,nB)=N!nA!(NnA)!pnA(1p)NnAN!nB!(NnB)!qnB(1q)NnB.P(n_{A},n_{B})=\frac{N!}{n_{A}!(N-n_{A})!}p^{n_{A}}(1-p)^{N-n_{A}}\frac{N!}{n_{B}!(N-n_{B})!}q^{n_{B}}(1-q)^{N-n_{B}}. (70)

Since each binomial distribution approaches a Poisson distribution, we again get Eq.(69) in the limit of NN\to\infty with μA\mu_{A} and μB\mu_{B} fixed. In this limit, the probability pqpq of both coins producing heads, being O(1/N2)O(1/N^{2}), becomes negligible. Therefore, independent events happening in a short time interval of size O(N1)O(N^{-1}) become effectively exclusive, and vice versa, in the limit of NN\to\infty. This is the reason why the multinomial distribution, or its inhomogeneous counterpart, factorizes into independent Poisson distributions in this limit.
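The factorization of the multinomial distribution Eq.(68) into the product of Poisson distributions Eq.(69) can be seen numerically. The sketch below compares the two pointwise for illustrative values of μA, μB, and a large N.

```python
from math import exp, factorial

def multinomial_pmf(nA, nB, N, p, q):
    # Eq.(68): N tosses of a three-faced coin, heads A and B with probs p and q
    if nA + nB > N:
        return 0.0
    coeff = factorial(N) // (factorial(nA) * factorial(nB) * factorial(N - nA - nB))
    return coeff * p**nA * q**nB * (1.0 - p - q)**(N - nA - nB)

def poisson_product_pmf(nA, nB, muA, muB):
    # Eq.(69): product of two independent Poisson distributions
    return exp(-muA - muB) * muA**nA * muB**nB / (factorial(nA) * factorial(nB))

muA, muB, N = 1.2, 0.7, 4000   # illustrative values
p, q = muA / N, muB / N
max_err = max(abs(multinomial_pmf(a, b, N, p, q) - poisson_product_pmf(a, b, muA, muB))
              for a in range(10) for b in range(10))
```

Already at N=4000 the two distributions agree to a few parts in 10⁴, reflecting the O(1/N) corrections of Eq.(67).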

Appendix E The time-dependent distribution of the non-Markovian model with non-zero initial number of molecules (Eq.(46)) is the solution of the master equation (30).

To show that the distribution given by Eq.(46) is the solution of the master equation, it is convenient to use the generating function G(z,w,t)nAnIP(nA,nI,t)znAwnIG(z,w,t)\equiv\sum_{n_{A}}\sum_{n_{I}}P(n_{A},n_{I},t)z^{n_{A}}w^{n_{I}}. Then the master equation (30) turns into [16]

tG(z,w,t)\displaystyle\partial_{t}G(z,w,t) =\displaystyle= [γ(1z)+β(wz)]zG+ζ(1w)wG+α(z1)G(z,w,t)\displaystyle\left[\gamma(1-z)+\beta(w-z)\right]\partial_{z}G+\zeta(1-w)\partial_{w}G+\alpha(z-1)G(z,w,t) (71)
+\displaystyle+ βeζτ(1w)nA[1+Φ(z,w,τ)]nA1nAP(nA,tτ)\displaystyle\beta e^{-\zeta\tau}(1-w)\sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}-1}n_{A}P(n_{A},t-\tau)
×\displaystyle\times exp[α0τ𝑑tΦ(z,w,t)]θ(tτ),\displaystyle\exp\left[\alpha\int_{0}^{\tau}dt^{\prime}\Phi(z,w,t^{\prime})\right]\theta(t-\tau),

where

Φ(z,w,t)(z1)pA(t)+(w1)pI(t)\Phi(z,w,t)\equiv(z-1)p_{A}(t)+(w-1)p_{I}(t) (72)

with pA(t)p_{A}(t) and pI(t)p_{I}(t) given in Eq.(44). The last term in Eq.(71) involving Φ(z,w,t)\Phi(z,w,t) describes the process where the active molecule that entered the degradation process at tτt-\tau gets degraded. Therefore, for the initial condition where nI=0n_{I}=0 at t=0t=0, this term is absent for t<τt<\tau, implemented here by the step function θ(tτ)\theta(t-\tau).

From Eq.(46), we get the marginal probability distribution:

P(nA,t)\displaystyle P(n_{A},t) =\displaystyle= nInAnIn0!nA!nI!(n0nAnI)!pA(t)nApI(t)nI\displaystyle\sum_{n_{I}}\sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}\frac{n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}} (73)
×(1pA(t)pI(t))n0nAnI\displaystyle\times(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}
×eμA(t)μI(t)μA(t)nAnAμI(t)nInI(nAnA)!(nInI)!\displaystyle\times\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}\mu_{I}(t)^{n_{I}-n^{\prime}_{I}}}{(n_{A}-n^{\prime}_{A})!(n_{I}-n^{\prime}_{I})!}
=\displaystyle= nAnIn0!nA!nI!(n0nAnI)!pA(t)nApI(t)nI\displaystyle\sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}\frac{n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}}
×(1pA(t)pI(t))n0nAnI\displaystyle\times(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}
×eμA(t)μI(t)μA(t)nAnA(nAnA)!nIμI(t)nInI!\displaystyle\times\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}}{(n_{A}-n^{\prime}_{A})!}\sum_{n_{I}}\frac{\mu_{I}(t)^{n_{I}}}{n_{I}!}
=\displaystyle= nAnIn0!nA!nI!(n0nAnI)!pA(t)nApI(t)nI\displaystyle\sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}\frac{n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}}
×(1pA(t)pI(t))n0nAnIeμA(t)μA(t)nAnA(nAnA)!\displaystyle\times(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}\frac{e^{-\mu_{A}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}}{(n_{A}-n^{\prime}_{A})!}
=\displaystyle= nAn0!nA!(n0nA)!pA(t)nA(1pA(t))n0nAeμA(t)μA(t)nAnA(nAnA)!\displaystyle\sum_{n^{\prime}_{A}}\frac{n_{0}!}{n^{\prime}_{A}!(n_{0}-n^{\prime}_{A})!}p_{A}(t)^{n^{\prime}_{A}}(1-p_{A}(t))^{n_{0}-n^{\prime}_{A}}\frac{e^{-\mu_{A}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}}{(n_{A}-n^{\prime}_{A})!}
=\displaystyle= nAPbinom(nA;{n0,pA(t)})PPoisson(nAnA;μA(t)),\displaystyle\sum_{n^{\prime}_{A}}P_{\rm binom}(n^{\prime}_{A};\{n_{0},p_{A}(t)\})P_{\rm Poisson}(n_{A}-n^{\prime}_{A};\mu_{A}(t)),

and the generating function for t0t\geq 0:

G(z,w,t)\displaystyle G(z,w,t) =\displaystyle= nAnInAnIznAwnIn0!nA!nI!(n0nAnI)!pA(t)nApI(t)nI\displaystyle\sum_{n_{A}}\sum_{n_{I}}\sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}\frac{z^{n_{A}}w^{n_{I}}n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}} (74)
×(1pA(t)pI(t))n0nAnI\displaystyle\times(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}
×eμA(t)μI(t)μA(t)nAnAμI(t)nInI(nAnA)!(nInI)!\displaystyle\times\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}\mu_{A}(t)^{n_{A}-n^{\prime}_{A}}\mu_{I}(t)^{n_{I}-n^{\prime}_{I}}}{(n_{A}-n^{\prime}_{A})!(n_{I}-n^{\prime}_{I})!}
=\displaystyle= nAnIznAwnIn0!nA!nI!(n0nAnI)!pA(t)nApI(t)nI\displaystyle\sum_{n^{\prime}_{A}}\sum_{n^{\prime}_{I}}\frac{z^{n^{\prime}_{A}}w^{n^{\prime}_{I}}n_{0}!}{n^{\prime}_{A}!n^{\prime}_{I}!(n_{0}-n^{\prime}_{A}-n^{\prime}_{I})!}p_{A}(t)^{n^{\prime}_{A}}p_{I}(t)^{n^{\prime}_{I}}
×(1pA(t)pI(t))n0nAnI\displaystyle\times(1-p_{A}(t)-p_{I}(t))^{n_{0}-n^{\prime}_{A}-n^{\prime}_{I}}
×nA′′nI′′znA′′wnI′′eμA(t)μI(t)μA(t)nA′′μI(t)nI′′(nA′′)!(nI′′)!\displaystyle\times\sum_{n^{\prime\prime}_{A}}\sum_{n^{\prime\prime}_{I}}z^{n^{\prime\prime}_{A}}w^{n^{\prime\prime}_{I}}\frac{e^{-\mu_{A}(t)-\mu_{I}(t)}\mu_{A}(t)^{n^{\prime\prime}_{A}}\mu_{I}(t)^{n^{\prime\prime}_{I}}}{(n^{\prime\prime}_{A})!(n^{\prime\prime}_{I})!}
=\displaystyle= [1+pA(t)(z1)+pI(t)(w1)]n0eμA(t)(z1)+μI(t)(w1)\displaystyle\left[1+p_{A}(t)(z-1)+p_{I}(t)(w-1)\right]^{n_{0}}e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
=\displaystyle= [1+Φ(z,w,t)]n0eμA(t)(z1)+μI(t)(w1).\displaystyle\left[1+\Phi(z,w,t)\right]^{n_{0}}e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}.

First, we compute the summation in the last term at the right-hand side of Eq.(71) by substituting the expressions for P(nA,t)P(n_{A},t) and G(z,w,t)G(z,w,t):

\sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}-1}n_{A}P(n_{A},t-\tau) (75)
= \sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}-1}n_{A}\sum_{n^{\prime}_{A}}P_{\rm binom}(n^{\prime}_{A};\{n_{0},p_{A}(t-\tau)\})P_{\rm Poisson}(n_{A}-n^{\prime}_{A};\mu_{A}(t-\tau))
= \sum_{n^{\prime}_{A}}\sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}-1}n_{A}P_{\rm binom}(n^{\prime}_{A};\{n_{0},p_{A}(t-\tau)\})P_{\rm Poisson}(n_{A}-n^{\prime}_{A};\mu_{A}(t-\tau))
= \sum_{n^{\prime}_{A}}\sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}+n^{\prime}_{A}-1}(n_{A}+n^{\prime}_{A})P_{\rm binom}(n^{\prime}_{A};\{n_{0},p_{A}(t-\tau)\})P_{\rm Poisson}(n_{A};\mu_{A}(t-\tau))
= \sum_{n^{\prime}_{A}}\left[1+\Phi(z,w,\tau)\right]^{n^{\prime}_{A}}\frac{n_{0}!\,p_{A}(t-\tau)^{n^{\prime}_{A}}}{n^{\prime}_{A}!(n_{0}-n^{\prime}_{A})!}(1-p_{A}(t-\tau))^{n_{0}-n^{\prime}_{A}}
\times \sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}-1}e^{-\mu_{A}(t-\tau)}\frac{\mu_{A}(t-\tau)^{n_{A}}}{(n_{A}-1)!}
+ \sum_{n^{\prime}_{A}}\left[1+\Phi(z,w,\tau)\right]^{n^{\prime}_{A}-1}\frac{n_{0}!\,p_{A}(t-\tau)^{n^{\prime}_{A}}}{(n^{\prime}_{A}-1)!(n_{0}-n^{\prime}_{A})!}(1-p_{A}(t-\tau))^{n_{0}-n^{\prime}_{A}}
\times \sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}}e^{-\mu_{A}(t-\tau)}\frac{\mu_{A}(t-\tau)^{n_{A}}}{n_{A}!}
= \left[1+\Phi(z,w,\tau)p_{A}(t-\tau)\right]^{n_{0}}\mu_{A}(t-\tau)\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)\right]
+ n_{0}p_{A}(t-\tau)\left[1+\Phi(z,w,\tau)p_{A}(t-\tau)\right]^{n_{0}-1}\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)\right]
= \left[1+\Phi(z,w,t)\right]^{n_{0}}\mu_{A}(t-\tau)\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)\right]
+ n_{0}p_{A}(t-\tau)\left[1+\Phi(z,w,t)\right]^{n_{0}-1}\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)\right]

where, in order to get the last line, I used the fact that

p_{A}(\tau)p_{A}(t-\tau)=p_{A}(t),\quad p_{I}(\tau)p_{A}(t-\tau)=p_{I}(t) (76)

so that

\Phi(z,w,\tau)p_{A}(t-\tau) = (z-1)p_{A}(\tau)p_{A}(t-\tau)+(w-1)p_{I}(\tau)p_{A}(t-\tau) (77)
= (z-1)p_{A}(t)+(w-1)p_{I}(t)=\Phi(z,w,t).
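As a concrete check of Eqs.(76) and (77), the survival probabilities can be evaluated numerically. The sketch below assumes explicit forms consistent with the delayed-degradation model, but not quoted from the text: p_A(t) = e^{-at} with a = \gamma + \beta, and p_I(t) given by the convolution of conversion at rate \beta with \zeta-degradation, truncated at the deterministic delay \tau.

```python
# Check of Eqs.(76)-(77): the survival probabilities compose multiplicatively.
# Assumed forms (consistent with the model above, not quoted from the text):
# p_A(t) = exp(-a t) with a = gamma + beta, and
# p_I(t) = beta * int_{max(0, t - tau)}^{t} exp(-a s) exp(-zeta (t - s)) ds,
# i.e., the A molecule converts at time s and the I copy survives until t
# without having reached its deterministic removal time s + tau.
import math

gamma, beta, zeta, tau = 0.3, 0.7, 0.5, 1.0
a = gamma + beta  # total rate at which an A molecule disappears

def p_A(t):
    return math.exp(-a * t)

def p_I(t):
    s0 = max(0.0, t - tau)
    # closed form of the convolution integral (valid for zeta != a)
    return beta * (math.exp(-a * t) - math.exp((zeta - a) * s0 - zeta * t)) / (zeta - a)

t = 2.5  # any t >= tau
assert abs(p_A(tau) * p_A(t - tau) - p_A(t)) < 1e-12  # first identity of Eq.(76)
assert abs(p_I(tau) * p_A(t - tau) - p_I(t)) < 1e-12  # second identity of Eq.(76)

# Eq.(77) follows linearly; check it for hypothetical z, w:
z, w = 1.3, 0.8
Phi = lambda s: (z - 1) * p_A(s) + (w - 1) * p_I(s)
assert abs(Phi(tau) * p_A(t - tau) - Phi(t)) < 1e-12
```

Note that the second identity of Eq.(76) holds for t \geq \tau, which is the regime enforced by the step function \theta(t-\tau) in the master equation.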

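The closed form obtained in the final two lines of Eq.(75) can also be verified by direct summation, treating \Phi(z,w,\tau), p_A(t-\tau), and \mu_A(t-\tau) as plain numbers; the values below are hypothetical test values.

```python
# Direct numerical check of the summation identity Eq.(75): the binomial
# distribution convolved with a Poisson distribution, weighted by
# n_A * (1 + Phi)^(n_A - 1), sums to the closed form on the last two lines.
# Phi, p, and mu stand for Phi(z,w,tau), p_A(t-tau), and mu_A(t-tau).
import math

n0, p, mu, Phi = 3, 0.4, 1.3, 0.25  # hypothetical test values

def P(n):
    # P(n_A, t - tau): binomial(n0, p) convolved with Poisson(mu)
    return sum(math.comb(n0, k) * p**k * (1 - p)**(n0 - k)
               * math.exp(-mu) * mu**(n - k) / math.factorial(n - k)
               for k in range(0, min(n, n0) + 1))

lhs = sum(n * (1 + Phi)**(n - 1) * P(n) for n in range(1, 61))  # tail is negligible
rhs = ((1 + Phi * p)**n0 * mu * math.exp(mu * Phi)
       + n0 * p * (1 + Phi * p)**(n0 - 1) * math.exp(mu * Phi))
assert abs(lhs - rhs) < 1e-9
```

Equivalently, the identity states that the weighted sum equals g'(1+\Phi), where g is the probability generating function of the binomial-Poisson convolution.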
Therefore, the right-hand side of Eq.(30) becomes

\left[\gamma(1-z)+\beta(w-z)\right]\partial_{z}G+\zeta(1-w)\partial_{w}G+\alpha(z-1)G(z,w,t) (78)
+ \beta e^{-\zeta\tau}(1-w)\sum_{n_{A}}\left[1+\Phi(z,w,\tau)\right]^{n_{A}-1}n_{A}P(n_{A},t-\tau)\exp\left[\alpha\int_{0}^{\tau}dt^{\prime}\,\Phi(z,w,t^{\prime})\right]\theta(t-\tau)
= n_{0}\left[1+\Phi(z,w,t)\right]^{n_{0}-1}\left[\left(-a(z-1)+\beta(w-1)\right)p_{A}(t)-\zeta(w-1)p_{I}(t)\right]e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
+ \left[1+\Phi(z,w,t)\right]^{n_{0}}\left[\left(-a(z-1)+\beta(w-1)\right)\mu_{A}(t)-\zeta(w-1)\mu_{I}(t)+\alpha(z-1)\right]e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
- \beta e^{-\zeta\tau}(w-1)\mu_{A}(t-\tau)\left[1+\Phi(z,w,t)\right]^{n_{0}}\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)+\alpha\int_{0}^{\tau}dt^{\prime}\,\Phi(z,w,t^{\prime})\right]\theta(t-\tau)
- \beta e^{-\zeta\tau}(w-1)n_{0}p_{A}(t-\tau)\left[1+\Phi(z,w,t)\right]^{n_{0}-1}\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)+\alpha\int_{0}^{\tau}dt^{\prime}\,\Phi(z,w,t^{\prime})\right]\theta(t-\tau)
= n_{0}\left[1+\Phi(z,w,t)\right]^{n_{0}-1}\left[\left(-a(z-1)+\beta(w-1)\right)p_{A}(t)-\zeta(w-1)p_{I}(t)\right]e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
+ \left[1+\Phi(z,w,t)\right]^{n_{0}}\left[\left(-a(z-1)+\beta(w-1)\right)\mu_{A}(t)-\zeta(w-1)\mu_{I}(t)+\alpha(z-1)\right]e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
- \beta e^{-\zeta\tau}(w-1)\mu_{A}(t-\tau)\left[1+\Phi(z,w,t)\right]^{n_{0}}e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}\theta(t-\tau)
- \beta e^{-\zeta\tau}(w-1)n_{0}p_{A}(t-\tau)\left[1+\Phi(z,w,t)\right]^{n_{0}-1}e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}\theta(t-\tau)

where in order to get the last line, I used the fact that

\mu_{A}(t-\tau)p_{A}(\tau)=\mu_{A}(t)-\mu_{A}(\tau),\quad \mu_{A}(t-\tau)p_{I}(\tau)=\mu_{I}(t)-\mu_{I}(\tau), (79)

and

\alpha\int_{0}^{\tau}\Phi(z,w,t^{\prime})dt^{\prime}=(z-1)\mu_{A}(\tau)+(w-1)\mu_{I}(\tau), (80)

so that

\exp\left[\mu_{A}(t-\tau)\Phi(z,w,\tau)+\alpha\int_{0}^{\tau}dt^{\prime}\,\Phi(z,w,t^{\prime})\right]=e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}. (81)
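The identities in Eq.(79) can be checked numerically as well. The sketch below assumes forms consistent with the model but not quoted from the text: p_A(t) = e^{-at} with a = \gamma + \beta, p_I(t) the convolution of conversion at rate \beta with \zeta-degradation truncated at the delay \tau, and \mu_A(t) = \alpha\int_0^t p_A(s)ds, \mu_I(t) = \alpha\int_0^t p_I(s)ds (no initial copies of the newly produced molecules, as in Eq.(80)); \mu_I is evaluated by trapezoidal quadrature.

```python
# Numerical check of Eq.(79). Assumed forms (not quoted from the text):
# p_A(t) = exp(-a t) with a = gamma + beta, p_I(t) the conversion/degradation
# convolution truncated at the delay tau, and
# mu_A(t) = alpha * int_0^t p_A,  mu_I(t) = alpha * int_0^t p_I.
import math

alpha, gamma, beta, zeta, tau = 2.0, 0.3, 0.7, 0.5, 1.0
a = gamma + beta

def p_A(t):
    return math.exp(-a * t)

def p_I(t):
    s0 = max(0.0, t - tau)
    return beta * (math.exp(-a * t) - math.exp((zeta - a) * s0 - zeta * t)) / (zeta - a)

def mu_A(t):
    return alpha * (1.0 - math.exp(-a * t)) / a

def mu_I(t, n=20000):
    # trapezoidal quadrature of alpha * int_0^t p_I(s) ds
    h = t / n
    return alpha * h * (0.5 * p_I(0.0) + sum(p_I(k * h) for k in range(1, n))
                        + 0.5 * p_I(t))

t = 2.5
assert abs(mu_A(t - tau) * p_A(tau) - (mu_A(t) - mu_A(tau))) < 1e-12
assert abs(mu_A(t - tau) * p_I(tau) - (mu_I(t) - mu_I(tau))) < 1e-6
```

The second identity is the semigroup property of Eq.(76) integrated over the production time, which is why it follows from the same assumed forms.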

Next, substituting the expression for G(z,w,t) in Eq.(74) into the left-hand side of Eq.(71), we get

\partial_{t}G(z,w,t) = n_{0}\left[1+\Phi(z,w,t)\right]^{n_{0}-1} (82)
\times \left[(z-1)\dot{p}_{A}(t)+(w-1)\dot{p}_{I}(t)\right]e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
+ \left[1+\Phi(z,w,t)\right]^{n_{0}}e^{\mu_{A}(t)(z-1)+\mu_{I}(t)(w-1)}
\times \left[(z-1)\dot{\mu}_{A}(t)+(w-1)\dot{\mu}_{I}(t)\right].

Since

\dot{p}_{A}(t) = -a\,p_{A}(t),
\dot{p}_{I}(t) = \beta p_{A}(t)-\zeta p_{I}(t)-\beta e^{-\zeta\tau}p_{A}(t-\tau)\theta(t-\tau),
\dot{\mu}_{A}(t) = \alpha-a\,\mu_{A}(t),
\dot{\mu}_{I}(t) = -\zeta\mu_{I}(t)+\beta\left[\mu_{A}(t)-e^{-\zeta\tau}\mu_{A}(t-\tau)\theta(t-\tau)\right], (83)

we see that the expressions in Eq.(78) and Eq.(82) are equal. Therefore, the distribution in Eq.(46) is the solution of the master equation (30).
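As an independent consistency check, the first two differential equations in Eq.(83) can be verified by finite differences against explicit survival probabilities. As before, the forms p_A(t) = e^{-at} with a = \gamma + \beta and p_I(t) the conversion/degradation convolution truncated at the delay \tau are assumptions consistent with the model, not quoted from the text.

```python
# Finite-difference check of the equations for p_A and p_I in Eq.(83).
# Assumed forms (consistent with the model, not quoted from the text):
# p_A(t) = exp(-a t) with a = gamma + beta, and
# p_I(t) = beta * int_{max(0, t - tau)}^{t} exp(-a s) exp(-zeta (t - s)) ds.
import math

gamma, beta, zeta, tau = 0.3, 0.7, 0.5, 1.0
a = gamma + beta

def p_A(t):
    return math.exp(-a * t)

def p_I(t):
    s0 = max(0.0, t - tau)
    return beta * (math.exp(-a * t) - math.exp((zeta - a) * s0 - zeta * t)) / (zeta - a)

def ddt(f, t, h=1e-6):
    # central finite difference; keep t away from the kink at t = tau
    return (f(t + h) - f(t - h)) / (2 * h)

for t in (0.4, 2.5):  # one point before and one after the delay tau
    delayed = beta * math.exp(-zeta * tau) * p_A(t - tau) if t > tau else 0.0
    assert abs(ddt(p_A, t) + a * p_A(t)) < 1e-6
    assert abs(ddt(p_I, t) - (beta * p_A(t) - zeta * p_I(t) - delayed)) < 1e-6
```

The delayed term switches on only for t > \tau, reproducing the step function \theta(t-\tau) in Eq.(83).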