This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Mean extinction times in cyclic coevolutionary rock-paper-scissors dynamics

Markus Schütt and Jens Christian Claussen111Corresponding author.
Institut für Neuro- und Bioinformatik
Universität zu Lübeck, D-23562 Lübeck, Germany
(February 24, 2010)
Abstract

Dynamical mechanisms that can stabilize the coexistence or diversity in biology are generally of fundamental interest. In contrast to many two-strategy evolutionary games, games with three strategies and cyclic dominance like the rock-paper-scissors game (RPS) stabilize coexistence and thus preserve biodiversity in this system. In the limit of infinite populations, resembling the traditional picture of evolutionary game theory, replicator equations predict the existence of a fixed point in the interior of the phase space. But in finite populations, strategy frequencies will run out of the fixed point because of stochastic fluctuations, and strategies can even go extinct. For three different processes and for zero-sum and non-zero-sum RPS as well, we present results of extensive simulations for the mean extinction time (MET), depending on the number of agents NN, and we introduce two analytical approaches for the derivation of the MET.

1 Introduction

There are several examples of cyclic dominance in biology: Males of the californian lizards Uta stansburiana are known to inherit three different mating strategies which cyclically dominate each other [22, 24]. Another example is given by three strains of Escherichia coli [14], in tropical marine ecosystems [4] or in high arctic vertebrates [10]. Cyclic dominance is also important in some theoretical models like the susceptible-infected-recovered-susceptible (SIRS) model in epidemiology [2], in cyclic extensions of the Lotka-Volterra model [31, 32] or the Public Goods Game [13, 11]. Here, cyclic dominance is a way to preserve the coexistence of strategies (what is often called ‘biodiversity’ in a biological context) in the models.

A straightforward model system for cyclic dominance is the rock-paper-scissors game (RPS) well known from schoolyards. It contains the three strategies ‘rock’, ‘paper’, and ‘scissors’ with a simple domination rule: Paper wraps rock, scissors cut paper, rock crushes scissors. The payoff a dominating strategy gets from the dominated one is set to 11 for normalization, the payoff for a tie is 0, and a dominated strategy gets a payoff s-s, where we assume s<0-s<0. Hence the payoff matrix of this game reads

P=(0s110ss10).P=\begin{pmatrix}0&-s&1\\ 1&0&-s\\ -s&1&0\end{pmatrix}. (1)

For the standard choice s=1s=1 we have a zero-sum RPS, for all other values the game is non zero-sum. One can simply intuit the impact of ss [5]: For huge ss it is important for a player to avoid loosing, so it is successful playing the same strategy as the majority; hence an equilibrium that includes all three strategies is unstable. On the other hand, for small ss it is more important to win occasionally so that the mixed equilibrium becomes stable. Experiments indicate s<1s<1 for the lizard example [36] and s>1s>1 for E. coli [14].

For a long time, the resulting dynamics have been described by a meanfield approximation in the limit of infinite populations. The traditional way to describe such evolutionary games was the standard replicator equation, an equation of motion for the density of an arbitrary strategy α\alpha [12],

x˙α=ψxα(πα(x)π(x)),\dot{x}_{\alpha}=\psi x_{\alpha}(\pi^{\alpha}(\vec{x})-\langle\pi(\vec{x})\rangle), (2)

where ψ\psi is a constant prefactor which can be absorbed in the time scale by a constant rescaling of time. πα(x)=x1Pα,1+x2Pα,2+x3Pα,3\pi^{\alpha}(\vec{x})=x_{1}P_{\alpha,1}+x_{2}P_{\alpha,2}+x_{3}P_{\alpha,3} and π(x)\langle\pi(\vec{x})\rangle are the payoff of strategy α\alpha and the average payoff of the whole population, respectively. The adjusted replicator equation [23]

x˙α=xαΓ+π(x)(πα(x)π(x)),\dot{x}_{\alpha}=\frac{x_{\alpha}}{\Gamma+\langle\pi(\vec{x})\rangle}(\pi^{\alpha}(\vec{x})-\langle\pi(\vec{x})\rangle), (3)

has been used less frequently. The prefactor 1/(Γ+π(x))1/(\Gamma+\langle\pi(\vec{x})\rangle) can be absorbed in the time scale by a dynamical rescaling of time for symmetric conflicts like the RPS, preserving stability properties. For asymmetric conflicts, like Dawkins Battle of the Sexes [7], this is not possible, as the average payoffs in each population do not coincide; hence the adjusted replicator equation can lead to qualitatively modified dynamics [26, 6]. The derivation of the replicator equations and Fokker-Planck-equations (comprising the first-order corrections) for the intrinsic noise are commonly taken from the master equation of the underlying discrete stochastic process [27, 28, 26, 31, 29].

Both replicator equations predict the existence of a fixed point at (xR,xP,xS)=(13,13,13)\left(x_{R},x_{P},x_{S}\right)=\left(\frac{1}{3},\frac{1}{3},\frac{1}{3}\right). The FP is neutrally stable for s=1s=1, asymptotically stable for s<1s<1, and unstable for s>1s>1. In the case of finite populations of size NN, the population can drive out of the fixed point because of stochastic fluctuations, and after some time one of the strategies will go extinct. Once this has happened, a second strategy will die out soon because of the lack of cyclic dominance, and the third strategy will therefore survive forever. Several efforts have been made to overbear this shortcoming especially of the zero-sum RPS, for example spatial discretization of populations (for a review see [25]), mobility [21], the introduction of intelligent update rules (best response [3]), the introduction of the parameter ss as mentioned above [5], or the computation of a critical system size above which coexistence of strategies is likely [26], but it is still an open question how long it takes on average until the first strategy has gone extinct. In this paper we investigate the scaling of the mean time to the extinction of the first strategy (mean extinction time, MET), depending on the system size NN and the parameter ss, for well mixed populations and three evolutionary processes (with two of them having the standard and accordingly the adjusted replicator equation as limits NN\to\infty), and present two analytical approaches which give theoretical insight in the scaling of the MET.

2 Evolutionary processes

Contrary to replicator equations describing the dynamics of relative abundance densities, real populations are finite (and discrete), appropriate modeling thus bases on discrete stochastic processes (of birth and death). In the classical Moran process [16] an individual aa is chosen with probability proportional to its fitness. It reproduces, and the offspring replaces another randomly chosen individual bb. In the frequency-dependent Moran process [18] which extends the classical Moran process [16] by considering coevolution, the fitness is not static but depends on the frequencies of the strategies. For better comparison with the processes mentioned beneath, in each time step we choose an individual aa at random which reproduces with probability

ϕMα(a)(nα,nβ,nγ)=121ω+ωπα(a)1ω+ωπ,\phi_{M}^{\alpha(a)}(n_{\alpha},n_{\beta},n_{\gamma})=\frac{1}{2}\frac{1-\omega+\omega\pi_{\alpha(a)}}{1-\omega+\omega\left\langle\pi\right\rangle}, (4)

and the offspring replaces another randomly chosen individual bb. Greek lettes α,β,γ\alpha,\beta,\gamma can assume each of the strategies R,S,PR,S,P, respectively. Here, ω\omega is the selection strength, the nin_{i} are the number of agents playing strategy ii, πα=(n1Pα,1+n2Pα,2+n3Pα,3)/(N1)\pi_{\alpha}=(n_{1}P_{\alpha,1}+n_{2}P_{\alpha,2}+n_{3}P_{\alpha,3})/(N-1) the payoff for an individual of strategy α\alpha, and π\langle\pi\rangle the average payoff of the whole population. In the limit NN\to\infty, the Moran process leads to the so-called adjusted replicator equation [26]. Note that a factor 12\frac{1}{2} is introduced here for a better comparability with the two processes mentioned beneath.

The Moran process is a well-established stochastic process for evolutionary birth-death (for growing population sizes, see e.g. [33]) dynamics with overlapping generations and therefore (in its frequency-dependent extension) serves as a standard model of evolutionary game theory. However, the Moran process requires perfect global information about the whole population, an assumption that can be unrealistic and undesirable. Because of that, two local processes have been mentioned. Again we choose an individual aa at random for reproduction. Another randomly chosen individual bb changes its strategy to the strategy of aa with probability

ϕluβ(b)α(a)(nα,nβ,nγ)=12+ω(πα(a)πβ(b))2Δπmax\phi_{lu}^{\beta(b)\rightarrow\alpha(a)}(n_{\alpha},n_{\beta},n_{\gamma})=\frac{1}{2}+\frac{\omega(\pi_{\alpha(a)}-\pi_{\beta(b)})}{2\Delta\pi_{max}} (5)

for the local update [26] and

ϕFβ(b)α(a)(nα,nβ,nγ)=11+eω(πα(a)πβ(b))\phi_{F}^{\beta(b)\rightarrow\alpha(a)}(n_{\alpha},n_{\beta},n_{\gamma})=\frac{1}{1+e^{-\omega(\pi_{\alpha(a)}-\pi_{\beta(b)})}} (6)

for the Fermi process [34], respectively, where Δπmax\Delta\pi_{max} is the maximum of the possible payoff difference. In the limit NN\to\infty, these processes lead to the standard replicator equation (local update) [26], and a nonlinear replicator equation (Fermi process) [30] with similar properties, respectively. The common approach for the derivation of the equations of motion and the first-order corrections for demographic stochasticity are to derive a Fokker-Planck equation from the master equation for the respective stochastic process [27, 28, 26, 6, 29].

Here we focus on the mean extinction time for the Moran and Local Update processes. For each process, the probability of increasing the population strength of the strategy α\alpha by 11 in a single time step and decreasing the population strength of strategy β\beta by 11, is given by

Tprocess(nα,nβ,nγnα+1,nβ1,nγ)=nαNnβNϕprocessβα(nα,nβ,nγ).T_{process}(n_{\alpha},n_{\beta},n_{\gamma}\rightarrow n_{\alpha}+1,n_{\beta}-1,n_{\gamma})=\frac{n_{\alpha}}{N}\frac{n_{\beta}}{N}\phi_{process}^{\beta\rightarrow\alpha}(n_{\alpha},n_{\beta},n_{\gamma}). (7)

This quantity is known as hopping rate. Note that the sum over all hopping rates is 1\neq 1 because reactions are possible that do not lead to changes in strategy frequencies. The time scale is chosen so that one reaction takes place every unit time step, including empty steps with no strategy change.

3 Mean extinction times

Refer to caption
Refer to caption
Refer to caption
Figure 1: Semi-logarithmic plots of the simulation data for the METs for the local update process (a), the Moran process (b) and the Fermi process (c). The METs were divided by N2N^{2}. Each point is an average over the extinction times of 1000 simulations. Red squares: s=1.0s=1.0, ω=0.50\omega=0.50, black rhombi: s=1.0s=1.0, ω=0.05\omega=0.05, orange upwards triangles: s=1.2s=1.2, ω=0.50\omega=0.50, yellow downwards triangles: s=1.2,ω=0.05s=1.2,\omega=0.05, green open circles: s=0.8s=0.8, ω=0.05\omega=0.05, blue filled circles: s=0.8s=0.8, ω=0.50\omega=0.50. For the Moran process ω=0.45\omega=0.45 has been used instead of ω=0.50\omega=0.50 to keep the transition probabilities in the interval [0,1]\left[0,1\right].

Compared to the time scale a mutant occurs – re-introducing a strategy – the time scale characterizing the survival properties of a genetic strain is defined by the mean extinction time. For two strategies, and fixed NN, one has a onedimensional Markov process for which a closed expression allows to proceed analytically as has been demonstrated by Antal and Scheuring [1]. For higher-dimensional cases unfortunately exact general solutions of the mean extinction time (or a mean-first-passage time [19]) problems are not known; in this case the situation is furthermore hampered by the location-dependent dynamics within a simplex boundary. So in most cases one has to rely on systematic numerical investigations. We have carried out extensive simulations to quantify the mean times until one of the three strategies has gone extinct. We have analyzed the mean extinction times for all three described processes depending on the number of agents NN and the parameters ss and ω\omega. In general, we observe the following properties (see Fig. 1): For all three processes the MET becomes independent of the selection strength ω\omega for s=1.0s=1.0. Further on, the MET for s=1.0s=1.0 is identical for all three processes and proportional to N2N^{2},

text(s=1)(0.54±0.02)N2.\langle t_{ext}\rangle(s=1)\approx(0.54\pm 0.02)N^{2}. (8)

As expected, for s<1s<1 the MET is larger and for s>1s>1 smaller than for s=1s=1, but for weak selection the difference is small. In a logarithmic plot of the MET divided by N2N^{2} one can easily see that text(s=1.0)N2\langle t_{ext}(s=1.0)\rangle\propto N^{2}, but also text(s<1)N2ef(s,ω)N\langle t_{ext}(s<1)\rangle\propto N^{2}e^{f(s,\omega)N} with f(s,ω)>0f(s,\omega)>0 and independent of NN. For the Moran process the stabilizing effect for s<1s<1 is larger than for the other processes.

On the first view, for s>1s>1 the MET seems to be text(s>1)N2ef(s,ω)N\langle t_{ext}(s>1)\rangle\propto N^{2}e^{-f(s,\omega)N}. But such a proportionality would imply a disappearance of the MET for large NN. This is unrealistic because even the shortest way would require N/3N/3 steps. For that a dependence text(s>1)=c1N2+c2N2ef(s,ω)N\langle t_{ext}(s>1)\rangle=c_{1}N^{2}+c_{2}N^{2}e^{-f(s,\omega)N} is more likely (with f(s,ω)f(s,\omega) having the same properties as for the s<1s<1 case), as it is predicted by one of our analytical approaches.

In summary, it can be said that the MET switches from polynomial to exponential scaling at s=1s=1. This agrees with the drift reversal picture from [26, 5]. So the (global) Moran process and the (local) local update and Fermi process behave similar, but the impact of the parameters ss and ω\omega is much stronger for the Moran process.

4 Analytical approaches

An exact analytical solution for two- or higher dimensional Markov chains is impossible, and a direct computation of the MET of such a three-strategy evolutionary game is not feasible. Although some efforts have been made, for example the work of Reichenbach et al. [20], wich have analyzed mean extinction properties in an urn model of a three-strategy game, employing the usual approach of applying van Kampen’s linear noise approximation and deriving a Fokker-Planck equation [35, 27, 28, 26].

Adapting this approach to the zero-sum RPS for the local update and the Moran process, we find a predicted MET proportional to N2N^{2} and independent of ω\omega, which is in accordance with the simulation data. Unfornately this approach has some shortcomings: Although the overall scaling is correct, the numerical value (prefactor) of the predicted MET is too great by a factor of 5\approx 5, and it is not possible to use this approach for the non zero-sum RPS or in the Fermi process. For that we have developed two analytical approaches which do better. We will present both approaches for the local update. Following the same schemes for the other two processes is, although possible, a bit more sophisticated because not all integrals can be done analytically here in general. As we have seen from the numerical investigations, the s1s\neq{}1 payoffs override differences between the underlying processes, so that it is warranted to concentrate on one analytically more feasible process in the remainder.

4.1 First approach - Expected changes in the distance to the fixed point

To compute – with help of approximations – the MET for the general case, we need an appropriate coordinate system. For the standard replicator equation and s=1s=1,

H=xRxPxSH=-x_{R}x_{P}x_{S} (9)

is a constant of motion which assumes the value H=1/27H=-1/27 in the fixed point and H=0H=0 on the edge of the phase space, the simplex S3S_{3}. Here, xRx_{R}, xPx_{P} and xSx_{S} are the frequencies of the strategies RR, PP and SS, respectively. For s<1s<1, HH is a Lyapunov function of the replicator equations with H˙<0\dot{H}<0, and so the inner fixed point ist asymptotically stable [5]. For s>1s>1 the fixed point is unstable, and the attractor of the system moves to the edge of the simplex. Via a transformation of the fixed point to the origin of the phase space, and by inserting xR+xP+xS=1x_{R}+x_{P}+x_{S}=1, which guarantees the conservation of the total density, we find for HH:

H=(x+1/3)(y+1/3)(1/3xy)H=-(x+1/3)(y+1/3)(1/3-x-y) (10)

For large absolute values of HH, the curves with H=const.H=const. resemble slightly deformed circles, but for H0H\rightarrow 0 they approach the triangle form of the simplex. In the following we will use HH as an observable to measure the effective distance to the fixed point. But obviously we need a second coordinate to describe a point in the phase space, so we will use a system with the variables xx and HH by eliminating the yy-coordinate. This gives us two solutions for y(x,H)y(x,H), an upper branch above the fixed point, and a corresponding lower branch below. We need both because it is not possible to describe all points of the phase space (x,y)(x,y) by only one solution (see figure 2). So we have

yu=3x9x24+108H+12x+324Hx27x254x3+81x46(1+3x)y_{u}=\frac{-3x-9x^{2}-\sqrt{4+108H+12x+324Hx-27x^{2}-54x^{3}+81x^{4}}}{6(1+3x)} (11)

and

yd=3x9x2+4+108H+12x+324Hx27x254x3+81x46(1+3x).y_{d}=\frac{-3x-9x^{2}+\sqrt{4+108H+12x+324Hx-27x^{2}-54x^{3}+81x^{4}}}{6(1+3x)}. (12)

Here the indices ‘uu’ and ‘dd’ stand for ‘up’ and ‘down’, respectively, because by re-inserting of xx and HH in the first (second) equation one will get only values of yy that lie above (under) a separator. In two points, xminx_{min} and xmaxx_{max}, both solutions are identical so that the curve H=const.H=const. is closed. We can compute these from the condition yu=ydy_{u}=y_{d} and solving for xx,

Refer to caption
Refer to caption
Figure 2: (a) A plot of curves for which H=const.H=const. holds, for different values of HH. Different colours on each curve mark the different solution branches yuy_{u} und ydy_{d}. The black line connects the values xminx_{min} and xmaxx_{max}, respectively. (b) Plots of the simulation data for the MET from the local update (symbols) for different pairs of ss and ω\omega depending on NN and the corresponding results from the analytical approach presented in section 4.1 (lines, with a not filled version of the corresponding symbol at the end). Red circles: s=1.0s=1.0, ω=0.5\omega=0.5, orange squares: s=1.2s=1.2, ω=0.5\omega=0.5, yellow triangles: s=1.2s=1.2, ω=0.05\omega=0.05, green rhombi: s=0.8s=0.8, ω=0.05\omega=0.05.
xmin\displaystyle x_{min} =\displaystyle= 13(cos(13arg(54H+63H(27H+1)1))\displaystyle\frac{1}{3}\left(-\cos\left(\frac{1}{3}\arg\left(-54H+6\sqrt{3}\sqrt{H(27H+1)}-1\right)\right)\right.
3sin(13arg(54H+63H(27H+1)1))+1)\displaystyle\left.-\sqrt{3}\sin\left(\frac{1}{3}\arg\left(-54H+6\sqrt{3}\sqrt{H(27H+1)}-1\right)\right)+1\right)

and

xmax\displaystyle x_{max} =\displaystyle= 13(cos(13arg(54H+63H(27H+1)1))\displaystyle\frac{1}{3}\left(-\cos\left(\frac{1}{3}\arg\left(-54H+6\sqrt{3}\sqrt{H(27H+1)}-1\right)\right)\right.
+3sin(13arg(54H+63H(27H+1)1))+1)\displaystyle\left.+\sqrt{3}\sin\left(\frac{1}{3}\arg\left(-54H+6\sqrt{3}\sqrt{H(27H+1)}-1\right)\right)+1\right)

and a third (formal) solution which here is of no use because for typical values of HH it produces values outside the simplex. arg(z)\arg(z) is the argument of the complex number zz.

Now we compute the expectation value of the change in HH for a single time step for the local update process. For time tt, the system may be in an arbitrary allowed point (x,H)(x,H). For that we will need the hopping rates in the (x,H)(x,H) coordinates. For example, for the change δ1=1N,δ2=1N\delta_{1}=\frac{1}{N},\delta_{2}=-\frac{1}{N} the hopping rate reads after inserting y=yuy=y_{u},

Tu(1N,1N)=(9x2+3x+f+2)216(3x+1)(27ψx2+9(ψ2)x+(2s+1)fψ6)T_{u}\left(\frac{1}{N},-\frac{1}{N}\right)=-\frac{\left(-9x^{2}+3x+f+2\right)}{216(3x+1)}\left(27\psi x^{2}+9(\psi-2)x+(2s+1)f\psi-6\right) (15)

with

f=(3x+1)((3x+1)(23x)2+108H).f=\sqrt{(3x+1)\left((3x+1)(2-3x)^{2}+108H\right)}. (16)

Note that we will use all variables as if they were continous, although in fact they are not, and we use T(δ1,δ2)T(\delta_{1},\delta_{2}) as an abbreviation for T(x1,x2,x3x1+δ1,x2+δ2,x3+δ3)T(x_{1},x_{2},x_{3}\rightarrow x_{1}+\delta_{1},x_{2}+\delta_{2},x_{3}+\delta_{3}) due to the conservation of total density (where δ1\delta_{1} und δ2\delta_{2} can take all values of {1/N,0,1/N}\{-1/N,0,1/N\} as long as they are not identical).

Refer to caption
Refer to caption
Figure 3: (a) A plot of the expectation values of the changes in HH, δH(H)\delta H(H), multiplied with N2N^{2}, for the local update process. The dashed lines show this quantity for N=798N=798, the continous lines for N=396N=396. red : s=1.0s=1.0, ω=0.5\omega=0.5, orange: s=1.2s=1.2, ω=0.5\omega=0.5, yellow: s=1.2s=1.2, ω=0.05\omega=0.05, green: s=0.8,ω=0.05s=0.8,\omega=0.05, blue: s=0.8s=0.8, ω=0.5\omega=0.5. (b) Additional illustration of the meaning of δH(H)\delta H(H) (s=0.8s=0.8, ω=0.50\omega=0.50). If the observable hh has a certain value HtH_{t} at a time tt, HH is expected to make a step in the direction of the arrow belonging to this specific value of HH. The lengths of the arrows are over-emphasized by a factor of 4000040000.

Further on we will need the change in HH that emerges from a change of the (x,y)(x,y)-values in the (x,H)(x,H)-coordinates as well. To derive this, we write H(x+δ1,y+δ2)H(x+\delta_{1},y+\delta_{2}) of the changed values of xx and yy, subduct the initial value H(x,y)H(x,y), and transform the result into the (x,H)(x,H)-coordinates. For y=yuy=y_{u} and the same change as in the example above we get

Δu(1N,1N)=(27Nx29(N+2)x+Ng6)36(3xN+N)2(9x23x+f2)\Delta_{u}\left(\frac{1}{N},-\frac{1}{N}\right)=\frac{\left(-27Nx^{2}-9(N+2)x+Ng-6\right)}{36(3xN+N)^{2}}\left(9x^{2}-3x+f-2\right) (17)

with

g=(3x+1)(27x327x2+108H+4)g=\sqrt{(3x+1)\left(27x^{3}-27x^{2}+108H+4\right)} (18)

By multiplying all hopping rates T(δ1,δ2)T\left(\delta_{1},\delta_{2}\right) with the associated changes in HH, Δ(δ1,δ2)\Delta\left(\delta_{1},\delta_{2}\right), and summing over all terms, we get a relatively simple equation for the expectation value of the change in HH in a single time step,

δH(x,H)=δ1,δ2T(δ1,δ2)Δ(δ1,δ2)\displaystyle\delta H(x,H)=\sum_{\delta_{1},\delta_{2}}T\left(\delta_{1},\delta_{2}\right)\Delta\left(\delta_{1},\delta_{2}\right)
=H(9+27x+(1+27H)N(1+s)ωΔπmax+27N(1+s)x3ωΔπmax)3N2(1+3x),\displaystyle=-\frac{H(9+27x+(1+27H)N(-1+s)\frac{\omega}{\Delta\pi_{max}}+27N(-1+s)x^{3}\frac{\omega}{\Delta\pi_{max}})}{3N^{2}(1+3x)},

with δHu(x,H)=δHd(x,H)=:δH(x,H)\delta H_{u}\left(x,H\right)=\delta H_{d}\left(x,H\right)=:\delta H\left(x,H\right). Averaging over xx by integrating over xx from xminx_{min} to xmaxx_{max} and normalizing by the length of whole interval we derive

δH(H)\displaystyle\delta H(H) =\displaystyle= (6a2+(6b3)a+6b23b+2)N(s1)ψ+186N2\displaystyle\frac{\left(6a^{2}+(6b-3)a+6b^{2}-3b+2\right)N(s-1)\psi+18}{6N^{2}} (20)
3H(s1)ψlog(3b+13a+1)(ab)N\displaystyle-\frac{3H(s-1)\psi\log\left(\frac{3b+1}{3a+1}\right)}{(a-b)N}

with a=xmina=x_{min}, b=xmaxb=x_{max}. The simplest way to approximate the mean extinction time for a system starting in the fixed point of the replicator equations is then to define a map

Hi+1=Hi+δH(Hi)H_{i+1}=H_{i}+\delta H(H_{i}) (21)

and to iterate it until Ht0H_{\langle t\rangle}\approx 0 is reached. The MET is then simply the number of necessary steps.

As one can see in the right hand side of Figure 2, this result is in good agreement with the simulation data, for the dependence of ss and ω\omega, as well as for the numerical values. For s=1s=1 the MET is predicted to be proportional to N2N^{2} and independent of ω\omega because δH(H)\delta H(H) is independent of ω\omega for s=1s=1. It is in good agreement with the simulations that the MET for s1s\neq 1 and weak selection differs only slightly from the MET for s=1s=1, while for strong selection and s>1s>1 it is clearly smaller than for s=1s=1.

Unfornately, for s<1s<1 this approach works only for small selection and not to great NN, because for s<1s<1 non-positive expectation values of the changes in HH are possible, so if we start the mentioned iterated map in the fixed point and repeat the iteration again and again, we would arrive at a value of HH where the expected change is 0 at some time, and the predicted MET would be infinite. The part of the interval where the expectation value is negative is bigger for larger ω\omega than for smaller ω\omega, and it grows with increasing NN, so in this cases this approach is not sufficient.

4.2 Second approach - Fokker-Planck equation

For the local update process, a Fokker-Planck equation (FPE) for the time evolution of the probability density is given by (summation over double indices)

tP(x,t)=i[αi(x)P(x,t)]+12ij[Bij(x)P(x,t)],\partial_{t}P(\vec{x},t)=-\partial_{i}[\alpha_{i}(\vec{x})P(\vec{x},t)]+\frac{1}{2}\partial_{ij}[B_{ij}(\vec{x})P(\vec{x},t)], (22)

where the coefficients are

αi=δxδxiT(xx+δx)\alpha_{i}=\sum_{\delta\vec{x}}\delta x_{i}T(\vec{x}\rightarrow\vec{x}+\delta\vec{x}) (23)

and

Bij(x)=δxδxiδxjT(xx+δx),B_{ij}(\vec{x})=\sum_{\delta\vec{x}}\delta x_{i}\delta x_{j}T(\vec{x}\rightarrow\vec{x}+\delta\vec{x}), (24)

which gives the following results

αR\displaystyle\alpha_{R} =\displaystyle= (2xP+xR)(1+3xR)ψ3N\displaystyle-\frac{(2x_{P}+x_{R})(1+3x_{R})\psi}{3N} (25)
αP\displaystyle\alpha_{P} =\displaystyle= (1+3xP)(xP+2xR)ψ3N\displaystyle\frac{(1+3x_{P})(x_{P}+2x_{R})\psi}{3N} (26)
BRR\displaystyle B_{RR} =\displaystyle= 2+3xR9xR29N2\displaystyle\frac{2+3x_{R}-9x_{R}^{2}}{9N^{2}} (27)
BRP\displaystyle B_{RP} =\displaystyle= BPR=(1+3xP)(1+3xR)N2\displaystyle B_{PR}=-\frac{(1+3x_{P})(1+3x_{R})}{N^{2}} (28)
BPP\displaystyle B_{PP} =\displaystyle= 2+3xP9xP29N2.\displaystyle\frac{2+3x_{P}-9x_{P}^{2}}{9N^{2}}. (29)

Transforming this into polar coordinates xR=rcosϕx_{R}=r\cos\phi, xP=rsinϕx_{P}=r\sin\phi, we find for the coefficients of the FPE

αR\displaystyle\alpha_{R} =rψ(3rcos(ϕ)+1)(2cos(ϕ)+2(s+1)sin(ϕ)+r(s1)(sin(2ϕ)+2))6N\displaystyle=-\frac{r\psi(3r\cos(\phi)+1)(2\cos(\phi)+2(s+1)\sin(\phi)+r(s-1)(\sin(2\phi)+2))}{6N} (30)
αP\displaystyle\alpha_{P} =rψ(3rsin(ϕ)+1)(2(s+1)cos(ϕ)2ssin(ϕ)+r(s1)(sin(2ϕ)+2))6N\displaystyle=-\frac{r\psi(3r\sin(\phi)+1)(-2(s+1)\cos(\phi)-2s\sin(\phi)+r(s-1)(\sin(2\phi)+2))}{6N} (31)
BRR\displaystyle B_{RR} =9r2cos2(ϕ)+3rcos(ϕ)+29N2\displaystyle=\frac{-9r^{2}\cos^{2}(\phi)+3r\cos(\phi)+2}{9N^{2}} (32)
BRP=BPR\displaystyle B_{RP}=B_{PR} =(3rcos(ϕ)+1)(3rsin(ϕ)+1)9N2\displaystyle=-\frac{(3r\cos(\phi)+1)(3r\sin(\phi)+1)}{9N^{2}} (33)
BPP\displaystyle B_{PP} =9r2sin2(ϕ)+3rsin(ϕ)+29N2,\displaystyle=\frac{-9r^{2}\sin^{2}(\phi)+3r\sin(\phi)+2}{9N^{2}}, (34)

where ψ=ω/Δπmax\psi=\omega/\Delta\pi_{max}, and hence for the FPE itself

P(0,0,1)\displaystyle P^{(0,0,1)} =6N(s1)ψsin(2ϕ)r2+N(12r21)(s1)ψ93N2P(0,0,0)\displaystyle=\frac{6N(s-1)\psi\sin(2\phi)r^{2}+N\left(12r^{2}-1\right)(s-1)\psi-9}{3N^{2}}P^{(0,0,0)} (35)
+(rψ(3(s+2)cos(ϕ)+(7s+2)cos(3ϕ)2(4s+(2s+7)cos(2ϕ)+5)sin(ϕ))12N\displaystyle+\left(\frac{r\psi(-3(s+2)\cos(\phi)+(7s+2)\cos(3\phi)-2(4s+(2s+7)\cos(2\phi)+5)\sin(\phi))}{12N}\right.
(s+1)ψ(sin(2ϕ)+2)6N+cos(ϕ)+3cos(3ϕ)sin(ϕ)+3sin(3ϕ)12N2r+cos(2ϕ)9N2r2)P(0,1,0)\displaystyle\left.-\frac{(s+1)\psi(\sin(2\phi)+2)}{6N}+\frac{\cos(\phi)+3\cos(3\phi)-\sin(\phi)+3\sin(3\phi)}{12N^{2}r}+\frac{\cos(2\phi)}{9N^{2}r^{2}}\right)P^{(0,1,0)}
+(r2ψ((2s+1)cos(ϕ)+(2s+7)cos(3ϕ)(s+2)sin(ϕ)+(7s+2)sin(3ϕ))12N\displaystyle+\left(\frac{r^{2}\psi((2s+1)\cos(\phi)+(2s+7)\cos(3\phi)-(s+2)\sin(\phi)+(7s+2)\sin(3\phi))}{12N}\right.
+r3(s1)ψ(cos(ϕ)sin(ϕ)+1)N+r(N(s1)ψ+N(s+1)cos(2ϕ)ψ18)6N2\displaystyle\left.+\frac{r^{3}(s-1)\psi(\cos(\phi)\sin(\phi)+1)}{N}+\frac{r(-N(s-1)\psi+N(s+1)\cos(2\phi)\psi-18)}{6N^{2}}\right.
+cos(ϕ)cos(3ϕ)+sin(ϕ)+sin(3ϕ)8N2+cos(ϕ)sin(ϕ)+19N2r)P(1,0,0)\displaystyle\left.+\frac{\cos(\phi)-\cos(3\phi)+\sin(\phi)+\sin(3\phi)}{8N^{2}}+\frac{\cos(\phi)\sin(\phi)+1}{9N^{2}r}\right)P^{(1,0,0)}
(cos(ϕ)sin(ϕ))(9cos(ϕ)sin(ϕ)r+3r+cos(ϕ)+sin(ϕ))9N2rP(1,1,0)\displaystyle-\frac{(\cos(\phi)-\sin(\phi))(9\cos(\phi)\sin(\phi)r+3r+\cos(\phi)+\sin(\phi))}{9N^{2}r}P^{(1,1,0)}
+9rcos(ϕ)9rcos(3ϕ)+9rsin(ϕ)+4sin(2ϕ)+9rsin(3ϕ)+872N2r2P(0,2,0)\displaystyle+\frac{9r\cos(\phi)-9r\cos(3\phi)+9r\sin(\phi)+4\sin(2\phi)+9r\sin(3\phi)+8}{72N^{2}r^{2}}P^{(0,2,0)}
+36r2+3cos(ϕ)r+9cos(3ϕ)r+3sin(ϕ)r9sin(3ϕ)r4sin(2ϕ)+872N2P(2,0,0),\displaystyle+\frac{-36r^{2}+3\cos(\phi)r+9\cos(3\phi)r+3\sin(\phi)r-9\sin(3\phi)r-4\sin(2\phi)+8}{72N^{2}}P^{(2,0,0)},

with P(λ,μ,ν):=λrλμϕμνtνP(r,ϕ,t)P^{(\lambda,\mu,\nu)}:=\frac{\partial^{\lambda}}{\partial r^{\lambda}}\frac{\partial^{\mu}}{\partial\phi^{\mu}}\frac{\partial^{\nu}}{\partial t^{\nu}}P(r,\phi,t).

If we now assume the probability density to be independent of the angle ϕ\phi (which is a good approximation at least for small rr), all terms which contain derivatives with respect to ϕ\phi will drop out. By averaging over ϕ\phi afterwards by integrating from 0 to 2π2\pi over ϕ\phi and dividing the result by 2π2\pi we get a probability density which is only independent of rr and tt. Hence the FPE reads

tP(r,t)=r(N(12r21)(s1)ψ9)3N2r2P(r,t)\displaystyle\partial_{t}P(r,t)=\frac{r\left(N\left(12r^{2}-1\right)(s-1)\psi-9\right)}{3N^{2}\sqrt{r^{2}}}P(r,t) (37)
+3(N(6r21)(s1)ψ18)r2+218N2r2rP(r,t)\displaystyle+\frac{3\left(N\left(6r^{2}-1\right)(s-1)\psi-18\right)r^{2}+2}{18N^{2}\sqrt{r^{2}}}\partial_{r}P(r,t)
+r(29r2)18N2r2r2P(r,t)\displaystyle+\frac{r\left(2-9r^{2}\right)}{18N^{2}\sqrt{r^{2}}}\partial_{r}^{2}P(r,t)
=:a(r)P(r,t)v(r)rP(r,t)+D(r)r2P(r,t).\displaystyle=:a(r)P(r,t)-v(r)\partial_{r}P(r,t)+D(r)\partial_{r}^{2}P(r,t).

Let us now approximate the diffusion by its value at r=0r=0, this gives us

D=19N2.D=\frac{1}{9N^{2}}. (38)

rr can theoretically vary between 0 and 2/32/3 for some values of ϕ\phi, but most extinction events will happen around those point of the edge of the simplex that are closest to the inner fixed point. These points have the distance r13r\gtrsim\frac{1}{3} from the inner FP. Because of that we search for the solution of the one-dimensional random walk descripted by eq. 37 in the interval [0,L]=[0,13][0,L]=[0,\frac{1}{3}]. Next, let us approximate the convection velocity v(r)v(r) by its value for mediate rr, e.g. r=16=L2r=\frac{1}{6}=\frac{L}{2}, and we get

v={0.00462963Nψ0.166667N2,s=0.816N2,s=1.00.00462963Nψ0.166667N2,s=1.2v=\left\{\begin{array}[]{cc}\frac{-0.00462963N\psi-0.166667}{N^{2}},&s=0.8\\ &\\ -\frac{1}{6N^{2}},&s=1.0\\ &\\ \frac{0.00462963N\psi-0.166667}{N^{2}},&s=1.2\end{array}\right. (39)
Refer to caption
Refer to caption
Figure 4: Left: A plot of the convection velocity v(r)v(r) for the local update with N=3960N=3960. As one can see, v(r)v(r) is 0\approx 0 for s=1s=1. This corresponds to the neutral stability of the fixed point in the limit NN\rightarrow\infty for s=1s=1. For s<1s<1 v(r)v(r) is negative, corresponding to the asymptotical stable FP, and for s>1s>1 v(r)v(r) is positive corresponding to the unstable FP. For r0r\rightarrow 0 v(r)v(r) diverges for all parameter values to -\infty, which results from the simplifications in combination with the existence of a FP at r=0r=0. Right: Semilogarithmic plot of simulation data of the MET in the local update process, divided by N2N^{2} (symbols) for several pairs of ss and ω\omega, and the corresponding values predicted by the approach in section 4.2 (lines, with a not filled version of the corresponding symbol at the end). Red circles: s=1.0s=1.0, ω=0.5\omega=0.5, orange squares: s=1.2s=1.2, ω=0.5\omega=0.5, yellow rhombi: s=1.2s=1.2, ω=0.05\omega=0.05, green upward triangles: s=0.8s=0.8, ω=0.05\omega=0.05, blue downward triangles: s=0.8s=0.8, ω=0.5\omega=0.5

(We cannot approximate it by its value at r=0r=0 as we have done for the diffusion because v(0)v(0) is -\infty corresponding to the FP.) For a last simplification, we neglect the term that includes the probability density itself. Hence, the FPE reads

tP(r,t)=vrP(r,t)+Dr2P(r,t).\partial_{t}P(r,t)=-v\partial_{r}P(r,t)+D\partial_{r}^{2}P(r,t). (40)

The computation of the MET for such an equation with one reflecting border at r=0r=0 and one absorbing border at r=Lr=L is a standard problem, for example, see [19]. One can find the MET by introducing a time integrated probability density,

P0(r)=0P(r,t)𝑑t.P_{0}(r)=\int_{0}^{\infty}P(r,t)dt. (41)

With this, the time-integrated FPE reads:

vP0(x)r=D2P0(x)x2v\frac{\partial P_{0}(x)}{\partial r}=D\frac{\partial^{2}P_{0}(x)}{\partial x^{2}} (42)

with the boundary conditions P0(r)=0P_{0}(r)=0 at r=Lr=L and J(r)=0j(r,t)𝑑t=0vP(r,t)DrP(r,t)dt=1J(r)=\int_{0}^{\infty}j(r,t)dt=\int_{0}^{\infty}vP(r,t)-D\partial_{r}P(r,t)dt=1 at r=0r=0. There, J(r)J(r) is the time-integrated probability current. The constraint J(0)=1J(0)=1 corresponds to the injection of an unit probability current at t=0t=0, this means a start of the system in the origin. The solution of the differential equation is given by

P0(r)=1v(1ev(xL)/D)P_{0}(r)=\frac{1}{v}\left(1-e^{v(x-L)/D}\right) (43)

Then the MET is simply

t=0LP0(r)𝑑r=LvDv2(1evL/D),\langle t\rangle=\int_{0}^{L}P_{0}(r)dr=\frac{L}{v}-\frac{D}{v^{2}}(1-e^{-vL/D}), (44)

what leads to

t={N2(0.00154321Nψ+0.183191e0.0138889Nψ0.166667)(0.00462963Nψ+0.166667)2,s=0.80.594885N2,s=0e0.0138889NψN2(e0.0138889Nψ(0.00154321Nψ0.166667)+0.183191)(0.1666670.00462963Nψ)2,s=1.2\langle t\rangle=\left\{\begin{array}[]{cc}\frac{N^{2}\left(-0.00154321N\psi+0.183191e^{0.0138889N\psi}-0.166667\right)}{(0.00462963N\psi+0.166667)^{2}},&s=0.8\\ &\\ 0.594885N^{2},&s=0\\ &\\ \frac{e^{-0.0138889N\psi}N^{2}\left(e^{0.0138889N\psi}(0.00154321N\psi-0.166667)+0.183191\right)}{(0.166667-0.00462963N\psi)^{2}},&s=1.2\end{array}\right. (45)

or in general

t=72e572N(s1)ψ12N2(e572N(s1)ψ+12(5N(s1)ψ108)+72e)(365N(s1)ψ)2.\displaystyle\langle t\rangle=\frac{72e^{-\frac{5}{72}N(s-1)\psi-\frac{1}{2}}N^{2}\left(e^{\frac{5}{72}N(s-1)\psi+\frac{1}{2}}(5N(s-1)\psi-108)+72e\right)}{(36-5N(s-1)\psi)^{2}}. (46)

The proportional dependence of the MET on NN and the other parameters is very good, as one can see in Fig. 4. This approach predicts the MET to be proportional to N2N^{2} and independent of the selection strength ω\omega for s=1s=1, while for s<1s<1 it forecasts the MET to be substantially proportional to N2eκ1ψNN^{2}e^{\kappa_{1}\psi N}, and for s>1s>1 in good approximation proportional to c1N2+c2eκ2ψNc_{1}N^{2}+c_{2}e^{\kappa_{2}\psi N}.

5 Summary

Evolutionary games with three cyclically dominating strategies can stabilize the coexistence of strategies superior compared to games with only two strategies which include a dominating strategy. In finite populations, it is still likely that one of these strategies goes extinct, though on average it takes a time which is considerably longer than in two strategy games. We have analyzed the mean times to the extinction of one of the three strategies numerically in the RPS-game for three different evolutionary processes, the local update, the frequency-dependent Moran process and the Fermi process. These mean extinction times (MET) have the same fundamental dependencies of the number of agents NN, the parameter ss and the selection strength ω\omega.

For the zero-sum RPS game (s=1s=1), the MET is proportional to N2N^{2} and independent of ω\omega for all three processes; even the proportionality constants are the same. For s<1s<1, we find a stabilization of biodiversity, as the MET grows exponentially with NN. In contrast to that, for s>1s>1, there is a destabilization of biodiversity, and the MET now grows smaller than in the zero-sum RPS (where it grows N2\propto N^{2}), namely, a part of the proportionality factor of the zero-sum RPS now decays exponentially with NN. In both non zero-sum cases, the coefficient that is multiplied with the NN in the exponent, grows with both ω\omega and (s1)(s-1).

Solving such two-dimensional Markov chains is not possible, but we have developed two analytical approaches for the approximation of the dependencies of the MET of NN and the parameters ss and ω\omega. The first approach can be used for s1s\geq 1 only, but serves for all three processes. In contrast, the second approach gives a good approximation for all parameter values, but it can be used completely analytical only for the local update process, as for the other two processes not all integrals can be done analytical.

In summary, the fixation time – as a long-time property of the process – shows a crossover from exponential to polynomial scaling with the population size, being fully consistent with the critical population size deived from the “drift reversal” picture which is based on the short-time dependence of the average drift in phase space - repelling versus attracting towards the fixed point. Thus both approaches to describe stochastic stability in finite populations here lead to the same conclusions: coexistence is preserved (stabilized) for a positive-sum cyclic game, whereas negative-sum games as well as small populations destabilize coexistence.

References

  • [1] Antal, T., and Scheuring, I., Fixation of Strategies for an Evolutionary Game in Finite Populations, Bulletin of Mathematical Biology 68,1923-1944 (2006)
  • [2] Axelrod R., and May, R.M., Infectious Diseases of Humans, (Oxford University Press, Oxford, 1991)
  • [3] Blume, L.E., The Statistical Mechanics of Best-Response Strategy Revision, Games Econ. Behav. 11, 111-145, (1995)
  • [4] Buss, L.W., Competitive intransitivity and size-frequency distributions of interacting populations, Proceedings of the National Academy of Science, USA, 77, 5355 (1980)
  • [5] Claussen, J.C., and Traulsen, A., Cyclic dominance and biodiversity in well mixed populations, Phys. Rev. Letters 100, 058104, (2008)
  • [6] Claussen, J.C., Drift reversal in asymmetric coevolutionary conflicts: Influence of microscopic processes and population size, The European Physical Journal B 60, 391-399(2007)
  • [7] Dawkins, R., The Selfish Gene, Oxford University Press, (New York, 1976)
  • [8] Durrett R., and Levin, S., Allelopathy in spatially distributed populations, Journal of Theoretical Biology 185, 165-174 (1997)
  • [9] Durrett, R., and Levin, S., Spatial aspects of interspecific competition, Theoretical Population Biology 53, 30-43, (1998)
  • [10] Gilg, O., Hanski, I., and Sittler, B., Cyclic dynamics in a simple vertebrate predator-prey community, Science 302, 866 (2003)
  • [11] Hauert, C., de Monte, S., Hofbauer, J., and Sigmund, K., Volunteering as red queen mechanism for cooperation in public goods games, Science 296, 1129 (2002)
  • [12] Hofbauer J., and Sigmund, K., Evolution and the Theory of Games, Cambridge University Press, Cambridge, England, (1998)
  • [13] Kagel J.H., and Roth, A.E., (eds.), The Handbook of Experimental Economics, Princeton University Press, Princeton, (1995)
  • [14] Kerr, B., Riley, M.A., Feldman M.W., and Bohannan, B.J.M., Local dispersal promotes biodiversity in a real-life game of rock-paper-scissors, Nature 418, 171-174 (2002)
  • [15] Kirkup B.C., and Riley, M.A., Antibiotic-mediated antagonism leads to a bacterial game of rock-paper-scissors in vivo, Nature 428, 412-414 (2004)
  • [16] Moran, P.A.P., The statistical processes of evolutionary theory, Clarendon Press (1962)
  • [17] Nowak, M.A., Evolutionary Dynamics, Harvard (2007)
  • [18] Nowak, M.A., Sasaki, A., Taylor, C., and Fudenberg, D., Emergence of cooperation and evolutionary stability in finite populations, Nature (London) 428, 646 (2004)
  • [19] Redner, S., A Guide to First-Passage Processes, Springer, (Berlin, 1983)
  • [20] Reichenbach, T., Mobilia, M., and Frey, E., Coexistence versus extinction in the stochastic cyclic Lotka-Volterra model, Phys Rev E 74, 051907 (2006)
  • [21] Reichenbach, T., Mobilia, M., and Frey, E., Mobility promotes and jeopardizes biodiversity in rock-paper-scissors games, Nature 448, 1046-49 (2007)
  • [22] Sinervo, B, and Lively, C., The rock-paper-scissors game and the evolution of alternative male strategies, Nature 380, 240, (1996)
  • [23] Smith, J.M., Evolution and the Theory of Games, (Cambridge University Press, Cambridge, England, 1982)
  • [24] Smith, J.M., The games lizards play, Nature 380, 198 (1996)
  • [25] Szabó, G., and Fáth, G., Evolutionary games on graphs, Physics Reports, 446, 97-216 (2007)
  • [26] Traulsen, A., Claussen, J.C., and Hauert, C., Coevolutionary dynamics: from finite to infinite populations, Physical Review Letters 95, 238701 (2005)
  • [27] D. Helbing, Interrelations between stochastic equations for systems with pair interactions, Physica A 181, 29 (1992)
  • [28] D. Helbing, Stochastic and Boltzmann-like models for behavioral changes, and their relation to game theory, Physica A 193, 241 (1993)
  • [29] Chalub F.A., Souza M.O., From discrete to continuous evolution models: A unifying approach to drift-diffusion and replicator dynamics, Theor. Pop. Biol 76, 268 (2009)
  • [30] Traulsen, A., Pacheco, J. M., and Nowak, M.A., Pairwise comparison and selection temperature in evolutionary game dynamics, J. Theor. Biol. 246, 522 (2007)
  • [31] McKane, A. J., and Newman, T. J., Predator-Prey Cycles from Resonant Amplification of Demographic Stochasticity, Phys. Rev. Lett. 94, 218102 (2005)
  • [32] Butler, T., and Goldenfeld, N., Robust ecological pattern formation induced by demographic noise, Phys. Rev. E 80, 030902 (2009)
  • [33] Poncela, J., Gómez-Gardeñes, J., Traulsen, A., and Moreno, Y., Evolutionary game dynamics in a growing structured population, New J. Phys. 11 083031 (2009)
  • [34] Traulsen, A. Nowak, M.A., and Pacheco, J.M., Stochastic dynamics of invasion and fixation, Physical Review E 74, 011909 (2006)
  • [35] van Kampen, N., Stochastic Processes in Physics and Chemistry, (North Holland, Amsterdam, 1981)
  • [36] Zamudio, K.R., and Sinervo, B., Polygyny, mate-guarding, and posthumous fertilization as alternative male mating strategies, Proceedings of the National Academy of Science, USA, 97, 14427 (2000)