This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The pace of evolution across fitness valleys

Chaitanya S. Gokhale 1, Yoh Iwasa 2, Martin A. Nowak 3, and Arne Traulsen 1 1 Max-Planck-Institute for Evolutionary Biology, August-Thienemann-Str. 2, 24306 Plön, Germany, 2 Department of Biology, Faculty of Sciences, Kyushu University, Fukuoka 812-8581, Japan, 3 Program for Evolutionary Dynamics, Department of Mathematics, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge MA 02138, USA
Abstract

How fast does a population evolve from one fitness peak to another? We study the dynamics of evolving, asexually reproducing populations in which a certain number of mutations jointly confer a fitness advantage. We consider the time until a population has evolved from one fitness peak to another one with a higher fitness. The order of mutations can either be fixed or random. If the order of mutations is fixed, then the population follows a metaphorical ridge, a single path. If the order of mutations is arbitrary, then there are many ways to evolve to the higher fitness state. We address the time required for fixation in such scenarios and study how it is affected by the order of mutations, the population size, the fitness values and the mutation rate.

I Introduction

Evolutionary dynamics is based on natural selection, mutation and genetic drift nowak:2006bo . It can be illustrated as the dynamics of a population in an abstract, typically high-dimensional fitness landscape. Since individuals with higher fitness produce more offspring, the average density of individuals is highest close to the fitness maxima. Many such features as the stationary population density in the fitness landscape or the mutation rate under which a population can still be concentrated around a fitness maximum have been addressed eigen:1977aa ; eigen:1989aa ; wilke:2005aa ; nowak:1992aa . An important question is how a population evolves towards a fitness peak via several intermediate states. If the intermediate states have the same fitness as the initial state, then evolution to higher fitness states is neutral at first and thus poses no significant problems nimwegen:2000 . If the intermediate states have lower fitness than the initial state, then a fitness valley has to be overcome and it is more difficult to reach the fitness peak weinreich:2006aa ; poelwijk:2007aa . In this case, population stuck on a local peak cannot escape by natural selection alone, because there is no evolutionary trajectory with successively advantageous mutations. Instead, neutral genetic drift becomes important.

Here, we consider the dynamics of these systems from a different perspective. We address the average time a population needs to transfer from one peak to another one. For small mutation rates and finite populations, we calculate this average time analytically. When mutation rates are high, we can describe the system by a set of differential equations and obtain the relevant times from a numerical integration of the differential equations. In this framework, the relevant question is how fast a population evolves traulsen:2007ee .

In particular, we can address the question whether a population evolves faster from one peak to another via dd mutations if

  • (i)

    mutations have to occur in a certain order, i.e. only a single evolutionary trajectory is available or

  • (ii)

    the order of the mutations does not matter, i.e. there are d!d! evolutionary paths.

In the simplest case the intermediate fitness values are identical in both the cases and equal to that of the initial state. Thus the only difference remaining is the number of available paths. When the order of mutations is not fixed then multiple paths are available and the evolutionary dynamics will be faster when compared to a single path. We can then ask the question: Does a population evolve faster on a narrow ridge or a broad valley? This implies that we move away from the simplest case mentioned above and decrease the fitness in the intermediate states of the multiple paths compared to the fitness in the intermediate states of the single path. We show how the pace of evolution depends on the depth of the valley, the number of intermediate states and the size of the population.

In general, evolutionary dynamics depends crucially on the size of the population. In a small population a single mutation will typically reach fixation or extinction before another mutation can arise. The population moves as a whole step by step on the fitness landscape. For large populations, even for small mutation rates usually multiple types arise at the same time. This results in a non-zero population density in many states at the same time. For intermediate mutation rates, the population can either move stepwise across the fitness landscape or move several stpdf without getting concentrated in one of the intermediate states. This phenomenon has been termed stochastic tunneling iwasa:2004aa . If the mutation rates are too small, tunneling does not occur because it is unlikely that a second mutation arises before the first one has reached fixation or has gone extinct. If the mutation rates are high, tunneling occurs trivially, because the system can be approximated by differential equations for the densities in the different states. These different scenarios including the limiting cases of stepwise evolution (typical for small populations) and continuous evolution (typical for large populations) can also be observed when the population size is kept constant, but the mutation rates are increased. For computer simulations increasing the mutation rate is more convenient than simulating huge populations for moderate mutation rates.

One important example for an evolutionary process in which the timescales are of crucial importance is the somatic evolution of cancer frank:2007aa . Cancer progression has been investigated mathematically since the 1950s fisher:1959 ; nordling:1953 ; armitage:1954 . Of special interest are the tumor suppressor genes knudson:1971aa ; michor:2004aa . In a normal cell, there are two alleles of the tumor suppressor gene. The mutation in the first allele is neutral if the second wild-type allele can sufficiently perform the function. Inactivation of both the alleles confers a selective advantage to the cell and can lead to cancer progression. This is an example in which the order of mutations does not matter. Many cancers also require certain particular mutations that initiate cancer growth and pave the way for the accumulation of further mutations vogelstein:2004aa . Recently, it has been shown that after cancer initiation, a large number of different mutations may be involved in cancer progression sjoeblom:2006sj ; wood:2007ri ; jones:2008ty ; jones:2008ec . So far, it is unclear if the mutations have to occur in a specific order or if there is more variation in the order beerenwinkel:2007aa ; gerstung:2008aa .

For simplicity, we consider only very simple fitness landscapes here in which the fitness in all the intermediate states is identical. In natural systems, these fitness values will differ and also the mutation rate may not be constant. In addition, sometimes the order of mutations will matter and sometimes, it will not. Thus, sometimes a particular mutation will be a prerequisite to obtain a new function, but sometimes new mutations do not require any prerequisites. For example, this is the case in the evolution of resistance to β\beta lactam antibiotics studied by weinreich:2005aa and weinreich:2006aa . However, here we focus on a very simple model to highlight the general aspects of the dynamics by analytical and numerical considerations.

This paper is organized as follows. We begin with the description of the two ways to order the mutations, the single path and the hypercube. We then derive analytical approximations of the fixation times for small mutation rates and discuss the effect of the different parameters on the fixation times. Next, we address the dynamics for intermediate and high mutation rates. Finally, we explore biological examples which can be modeled using this approach.

II Model

To model evolutionary dynamics in a haploid population of size NN, we use the Moran process nowak:2006bo ; moran:1962ef . In each time step, one individual is selected at random, but proportional to fitness. It produces one offspring, which replaces a randomly chosen individual. In one generation, each individual reproduces on average once. During reproduction, mutations occur with probability μ\mu. We are interested in the time it takes until dd mutations reach fixation in the population, starting from a homogeneous population in the initial state without any mutants. Moreover, we aim to explore the dynamical features of this process. We restrict ourselves to two different cases that allow the derivation of some analytical results.

II.1 Single path

If the mutations can occur only in a particular order, we have a single evolutionary path, see Fig. 1 for an illustration. Individuals in the initial state have fitness r0=1r_{0}=1 and individuals in the final state have fitness rd>1r_{d}>1. It is instructive to characterize an individual by a string of dd sites, which can either be wild-type or mutated. If the order of mutations is fixed, then a particular mutation requires another particular mutation as a prerequisite. For simplicity, we assume that all the d1d-1 intermediate states have the same fitness rj=s<rdr_{j}=s<r_{d} (j=1,,d1j=1,\ldots,d-1). For s<1s<1, the joint effect of the set of mutations make up for the loss of fitness caused by the individually deleterious mutations. This can be considered as a very special case of epistasis weinreich:2005ab .

II.2 Hypercube

If the order of mutations does not matter, evolutionary dynamics takes place on a hypercube in dd dimensions cf. Fig. 1. Thus, there are 2d2^{d} different types of individuals. In the initial state, we have dd possible mutations. In the next step, d1d-1 mutations are available. Consequently, we have d!d! possible paths to fixation. Again, we assume r0=1r_{0}=1 and rd>1r_{d}>1. Further, all individuals with some, but not all mutations have fitness s<rds<r_{d}.

Refer to caption
Figure 1: The order of mutations determines the geometry for evolutionary dynamics, shown here for d=3d=3 sites (e.g. genes, nucleotide sites etc.). If mutations can only occur in a particular order, only a single path is available (left). If the order of mutations is arbitrary, evolutionary dynamics occurs on a hypercube (right). The initial states have fitness 11 and the final states fitness r1r\geqslant 1. All intermediate states are assumed to have the same fitness s<rs<r. States are labeled by bit-strings, 0 is an wild-type site and 11 is a mutated site.

If the mutation probability is small, we do not need to make specific assumptions on the mutation process. But when the mutation probability increases, we can no longer be certain that only a single mutation occurs during reproduction. For simplicity, we do not consider the possibility of backward mutations. Although back mutations are often relevant, especially to escape from evolutionary dead ends dePristo:2007pi , it is not straightforward to define the speed of evolution in a system with backward mutations. This is due to the fact that for sufficiently high mutation rates, fixation in the final state might never occur. Other definitions of the end state of the system become arbitrary to a certain extend. The probability umm+ku_{m\to m+k} that the offspring of an individual with mm mutations has m+km+k mutations (mm+kdm\leq m+k\leq d) is

umm+k=(dmk)μk(1μ)dmk.\displaystyle u_{m\to m+k}=\binom{d-m}{k}\mu^{k}(1-\mu)^{d-m-k}. (1)

This equation is valid for the hypercube, where the order of mutations does not matter. Here, (dmk)\binom{d-m}{k} is the number of different types of mutants with kk additional mutations, μk\mu^{k} is the probability that mutations occur at kk sites and (1μ)dmk(1-\mu)^{d-m-k} is the probability that no mutation occurs at the remaining dmkd-m-k sites. For the single path, there is only one possibility to arrange the m+km+k mutations. Thus, for k>0k>0, umm+ku_{m\to m+k} is identical to Eq. (1), except that the binomial factor has to be dropped. The probability ummu_{m\to m} that no mutation occurs follows from normalization, umm=1k=1dmumm+ku_{m\to m}=1-\sum_{k=1}^{d-m}u_{m\to m+k}. Our analytical calculations for small mutation rates as well as the considerations for high mutation rates are independent of the precise form of the mutation rates. However, we need to specify the form of the mutation probabilities to perform our numerical simulations for intermediate and high mutation rates.

III Small mutation rates

III.1 The pace of evolution for small mutation rates

For small mutation probabilities, double mutations can be neglected. Since mutations occur rarely, we can calculate the average time until dd mutations are fixed in the population analytically. Let us first address the evolutionary dynamics when mutants with fitness rmr_{m} are already present in a resident population of fitness rwr_{w}, but no new mutations occur. This scenario is relevant when mutation rates are sufficiently small. The probability to increase the number of mutants from jj to j+1j+1 is

Tj+=rmjrmj+rw(Nj)NjN.\displaystyle T^{+}_{j}=\frac{r_{m}j}{r_{m}j+r_{w}(N-j)}\frac{N-j}{N}. (2)

Similarly, the number of mutants decreases from jj to j1j-1 with probability

Tj=rw(Nj)rmj+rw(Nj)jN.\displaystyle T^{-}_{j}=\frac{r_{w}(N-j)}{r_{m}j+r_{w}(N-j)}\frac{j}{N}. (3)

The probability that kk mutants take over the entire population is given by nowak:2006bo ; karlin:1975xg ; ewens:2004qe ; crow:1970ck

ϕk(rmrw)=1+i=1k1j=1iTjTj+1+i=1N1j=1iTjTj+=1(rwrm)k1(rwrm)N.\displaystyle\phi_{k}\left(\frac{r_{m}}{r_{w}}\right)=\frac{1+\sum_{i=1}^{k-1}{\prod_{j=1}^{i}{\frac{T^{-}_{j}}{T^{+}_{j}}}}}{1+\sum_{i=1}^{N-1}{\prod_{j=1}^{i}{\frac{T^{-}_{j}}{T^{+}_{j}}}}}=\frac{1-\left(\frac{r_{w}}{r_{m}}\right)^{k}}{1-\left(\frac{r_{w}}{r_{m}}\right)^{N}}. (4)

If a mutant reaches fixation, the average number of generations this process takes is given by goel:1974aa ; antal:2006aa

τfix(rmrw)=1Nk=1N1l=1kϕlTl+m=l+1kTmTm+.\displaystyle\tau_{\rm fix}\left(\frac{r_{m}}{r_{w}}\right)=\frac{1}{N}\sum_{k=1}^{N-1}\sum_{l=1}^{k}\frac{\phi_{l}}{T_{l}^{+}}\prod_{m=l+1}^{k}\frac{T^{-}_{m}}{T^{+}_{m}}. (5)

For a neutral process with rm=rwr_{m}=r_{w}, this reduces to τfix=N1\tau_{\rm fix}=N-1. For sufficiently large NN, this is the maximum conditional fixation time of a mutant. Even for disadvantageous mutants (rm<rwr_{m}<r_{w}) the conditional fixation time is smaller than N1N-1 antal:2006aa . Since there are μN\mu N mutations per generation, the time between two mutations is 1μN\frac{1}{\mu N}. Thus, for μN2\mu\ll N^{-2} a mutant reaches fixation before the next one arises and mutations will not occur when a mutant is already present. Thus the population evolves by a process where the mutations occur one after the other, which has been termed periodic selection atwood:1951aa and theoretically described as the strong-selection weak-mutation regime gillespie:1983aa ; gillespie:2004bo .

The total time τ\tau until a mutation reaches fixation in a population is the sum of the waiting time until a successful mutant occurs and the fixation time of the mutant τ=τwait+τfix\tau=\tau_{\rm wait}+\tau_{\rm fix}. The waiting time is the inverse of the mutation rate divided by the probability that a particular mutant is successful,

τwait(rmrw)=1μN1ϕ1(rmrw).\displaystyle\tau_{\rm wait}\left(\frac{r_{m}}{r_{w}}\right)=\frac{1}{\mu N}\frac{1}{\phi_{1}\left(\frac{r_{m}}{r_{w}}\right)}. (6)

For μ0\mu\to 0, we have τwait\tau_{\rm wait}\to\infty, but τfix\tau_{\rm fix} remains approximately constant. Thus, ττwait\tau\approx\tau_{\rm wait} for small mutation rates. In principle, we could calculate τfix\tau_{\rm fix} in the presence of mutations. But since our approximation is only valid for small mutation rates, this will be a minor correction.

For μN2\mu\ll N^{-2}, the population is homogeneous most of the time. Only occasionally, a mutant arises and reaches fixation or goes to extinction. The total time until dd mutations are fixed in the population is the sum of the waiting times for the successful mutants plus the time of the dd fixation events. For a single path with initial fitness 11, intermediate fitness ss and final fitness rr, we find for the total time τS\tau^{S}

τS\displaystyle\tau^{S} =τwait(s)+(d2)τwait(1)+τwait(r/s)\displaystyle=\tau_{\rm wait}\left(s\right)+(d-2)\tau_{\rm wait}\left(1\right)+\tau_{\rm wait}\left(r/s\right) (7)
+τfix(s)+(d2)τfix(1)+τfix(r/s).\displaystyle+\tau_{\rm fix}\left(s\right)+(d-2)\tau_{\rm fix}\left(1\right)+\tau_{\rm fix}\left(r/s\right).

For small μ\mu, we have τfixτwait\tau_{fix}\ll\tau_{wait} and hence the total time can be approximated by

τS\displaystyle\tau^{S} =1μ[1Nϕ1(s)+d2+1Nϕ1(r/s)]\displaystyle=\frac{1}{\mu}\left[\frac{1}{N\phi_{1}(s)}+d-2+\frac{1}{N\phi_{1}(r/s)}\right] (8)

Consider now a “fitness valley”, in which the intermediate states have fitness s<1s<1, but the final state has fitness r>1r>1. To move from the fitness peak in the initial state to the fitness state in the final state, first a disadvantageous mutation has to be fixed in the population. Since ϕ1(s<1)1N\phi_{1}(s<1)\ll\frac{1}{N}, the waiting time of such an event is very long. The waiting time for the neutral mutations, τwait(1)=1μ\tau_{\rm wait}\left(1\right)=\frac{1}{\mu} and the waiting time for a successful mutation, τwait(r/s)\tau_{\rm wait}\left(r/s\right) are significantly shorter. Thus, τS\tau^{S} is dominated by 1μNϕ1(s)\frac{1}{\mu N\phi_{1}(s)} for s<1s<1 and sufficiently high NN in a fitness valley. Fig. 2 shows a good agreement between exact numerical simulations and our analytical approximation for small mutation rates Eq. (8).

If the order of mutations is arbitrary, evolutionary dynamics occurs on a hypercube. In this case, the whole process will be faster, as we have d!d! possible paths instead of a single one. Now, the waiting times in the different states depend on the number of mutations that are still available. For the total time τH\tau^{H}, we obtain,

τH\displaystyle\tau^{H} =1dτwait(s)+k=1d21dkτwait(1)+τwait(rs)\displaystyle=\frac{1}{d}\tau_{\rm wait}\left(s\right)+\sum_{k=1}^{d-2}\frac{1}{d-k}\tau_{\rm wait}\left(1\right)+\tau_{\rm wait}\left(\frac{r}{s}\right) (9)
+τfix(s)+(d2)τfix(1)+τfix(rs)\displaystyle+\tau_{\rm fix}\left(s\right)+(d-2)\tau_{\rm fix}\left(1\right)+\tau_{\rm fix}\left(\frac{r}{s}\right)

Note that the time of the fixations alone is identical for the hypercube and the single path. Neglecting these fixation times for small μ\mu ( as τfixτwait\tau_{fix}\ll\tau_{wait}) yields

τH\displaystyle\tau^{H} =1μ[1dNϕ1(s)+k=1d21dk+1Nϕ1(rs)].\displaystyle=\frac{1}{\mu}\left[\frac{1}{dN\phi_{1}(s)}+\sum_{k=1}^{d-2}\frac{1}{d-k}+\frac{1}{N\phi_{1}(\frac{r}{s})}\right]. (10)

Since 1dNϕ1(s)<1Nϕ1(s)\frac{1}{dN\phi_{1}(s)}<\frac{1}{N\phi_{1}(s)} and k=1d21dk<k=1d21=d2\sum_{k=1}^{d-2}\frac{1}{d-k}<\sum_{k=1}^{d-2}1=d-2 it is obvious that τH<τS\tau^{H}<\tau^{S}, i.e. evolutionary dynamics is faster if the order of mutations is arbitrary. For fitness valleys with s<1s<1 and a large population size, τH\tau^{H} is dominated by 1dμNϕ1(s)\frac{1}{d\mu N\phi_{1}(s)}. As dd more mutations are available, this is always faster than the corresponding equation for a single path, see Fig. 2.

Refer to caption
Figure 2: Fixation time for a single path (squares) and a hypercube (circles) with small mutation rates (μN2\mu\ll N^{-2}) for different intermediate fitness values. Evolutionary dynamics is always faster in the hypercube. Solid lines show the analytical approximation for small mutation rates, Eqs. (8) and (10). Numerical simulations shown by symbols agree well with the analytical approximation (population size N=100N=100, mutation rate μ=105\mu=10^{-5}, d=5d=5, r=1.1r=1.1, simulations averaged over a 1000 realizations).

III.2 Thresholds of the waiting times

Next, we derive expressions for some interesting thresholds of the waiting times in the limit of small mutation rates. Since evolutionary dynamics is always faster if many paths are available, we now compare a fitness valley in which many paths are available to a single path in which the order of mutations is important, but fitness does not decrease in the course of evolution. The basic question we address here is, whether it is faster to cross a broad valley or a narrow ridge in fitness space. In other words, we compare τS(s=1)\tau^{S}(s=1) to τH(s<1)\tau^{H}(s<1). Since we consider only small mutation rates μ\mu, we neglect the fixation times τfix\tau_{\rm fix} here, although they will not be identical in the two scenarios. For s=1s=1, the single path is neutral. We decrease ss in the hypercube until we have identical waiting times. This yields an implicit expression for ss,

d1+1N11rN11r=1dN11sN11s+k=1d21dk+1N1(sr)N1sr\displaystyle d-1+\frac{1}{N}\frac{1-\frac{1}{r^{N}}}{1-\frac{1}{r}}=\frac{1}{dN}\frac{1-\frac{1}{s^{N}}}{1-\frac{1}{s}}+\sum_{k=1}^{d-2}{\frac{1}{d-k}}+\frac{1}{N}\frac{1-(\frac{s}{r})^{N}}{1-\frac{s}{r}} (11)

From this equation, we can numerically determine ss for any given NN. For large NN, Eq. (11) simplifies to

d(d1)k=1d2ddk=1N11sN11seN(1s)1N(1s),d(d-1)-\sum_{k=1}^{d-2}{\frac{d}{d-k}}=\frac{1}{N}\frac{1-\frac{1}{s^{N}}}{1-\frac{1}{s}}\approx\frac{e^{N(1-s)-1}}{N(1-s)}, (12)

where we used (1x/N)Nex(1-x/N)^{-N}\to e^{x} for large NN. Thus, the quantity N(1s)N(1-s) becomes constant for large NN, see Fig. 3. Thus, we can now say how broad and deep a fitness valley has to be to lead to the same cumulative waiting time as a single neutral path.

Refer to caption
Figure 3: The figure shows the threshold values for which evolution on a hypercube with fitness s<1s<1 in the intermediate states proceeds as fast as on a single neutral path with s=1s=1. Full lines show N(1s)N(1-s) based on the numerical solution of Eq. 11. Above the lines, evolution proceeds faster on the hypercube. Below them, the neutral single path is faster. For NN\to\infty, the lines converge to a constant, see Eq. (12). The symbols show the results from numerical simulations for N=5N=5 and d=10d=10 (triangles), N=20N=20 and d=5d=5 (circles) as well as N=80N=80 and d=2d=2 (squares). Symbols are open (red) when the single path is faster and filled (blue) when the hypercube is faster (μ=105\mu=10^{-5}, r=1.1r=1.1, simulations averaged over a 1000 realizations).

Next, we address the effect of the intermediate fitness ss, which has an important influence on the cumulative waiting time τ\tau. Fig. 2 shows how the waiting time decreases with increasing fitness in the intermediate states ss. If ss comes very close to the fitness in the final state, the waiting time increases. This increase is seen both in the single path and the hypercube. An increase in the intermediate state fitness will not always lead to a reduction in waiting times. Instead, the fixation times reach a minimum when the fitness growth is constant between any two consecutive states weinreich:2005tx ; traulsen:2007ee . For the hypercube, the fastest trajectory will be steeper than on a single path: at first, many mutations are available and a big fitness increase is not necessary. Later, fewer mutations are available and thus, the fitness should increase faster. The precise form of the trajectory will in this case depend on the number of mutations dd and the population size NN. We note that a similar reasoning can be applied to construct a fitness landscape that allows to cross a fitness valley fastest: the fastest trajectory has the same form regardless if a fitness peak is approached (r>1r>1) or a fitness minimum is approached (r<1r<1). Thus, the fastest way to cross a fitness valley is to descend to the minimum with exponentially decreasing fitness and to increase from the minimum again with exponentially increasing fitness.

Now, we turn to the effect of the intermediate fitness ss on the individual waiting times. Eqs. (8) and (10) both consist of three terms each. The first term denotes the time required to leave the initial state. The second term is the time spent in moving through all the intermediate states. This second term is independent of ss, because the transitions are neutral. The last term denotes the time required to reach the ultimate state from the penultimate state. For small values of ss, the probability to fixate the disadvantageous mutation is very small. Thus, the total time is dominated by the first term. When ss is increased to a threshold value s1s_{1}, then the time for leaving the first state is identical to the waiting time in the intermediate states. For the hypercube, s1s_{1} is given by 1dτwait(s1)=k=1d21dkτwait(1)\frac{1}{d}\tau_{\rm wait}\left(s_{1}\right)=\sum_{k=1}^{d-2}{\frac{1}{d-k}}\tau_{\rm wait}\left(1\right), which reduces to

11s1N11s1=dNk=1d21dk.\displaystyle\frac{1-\frac{1}{s_{1}^{N}}}{1-\frac{1}{s_{1}}}=dN\sum_{k=1}^{d-2}{\frac{1}{d-k}}. (13)

This equation can be solved numerically for specific values of NN and dd. For the single path, the right hand side of this equation has to be replaced by N(d2)N(d-2). For s>s1s>s_{1}, the time to cross the intermediate states is larger than the waiting time in the first state. On the hypercube, we can define a second threshold for which the waiting time in the first state is the same as the time required to reach the final state from the penultimate state. This arises because the effective mutation rate in state 0 is dd times larger than the effective mutation rate in state d1d-1. The threshold s2s_{2} is given by 1dτwait(s2)=τwait(rs2)\frac{1}{d}\tau_{\rm wait}\left(s_{2}\right)=\tau_{\rm wait}\left(\frac{r}{s_{2}}\right) or

1d11s2N11s21=1(s2r)N1s2r\displaystyle\frac{1}{d}\frac{1-\frac{1}{s_{2}^{N}}}{1-\frac{1}{s_{2}^{1}}}=\frac{1-\left(\frac{s_{2}}{r}\right)^{N}}{1-\frac{s_{2}}{r}} (14)

Again, s2s_{2} has to be determined numerically. For a single path, the factor d1d^{-1} in Eq. (14) has to be dropped. Thus, the threshold s2s_{2} occurs for s>1s>1 and is simply given by s2=rs_{2}=\sqrt{r}.

The fixation time is also strongly influenced by the number of mutations dd. A larger dd increases the length of the path and usually also the fixation times. For the single path, this increase results only from the increase in the time required to cross the intermediate states, because the time for leaving the initial state and the time to reach the final state from the penultimate state are independent of dd. The time required to reach the ultimate state from the penultimate state is also independent of dd for the hypercube, but the time required to leave the initial state decreases with increasing dd. This is because as dd increases, there are more states available in the first error class and thus the effective rate of mutation out of the initial state increases. As for the single path, the time to cross the intermediate states increases with dd in the hypercube. For the hypercube, this interplay can lead to a non-monotonic dependence of the fixation time on dd. For example, for N=100N=100 and s=0.95s=0.95, the fixation time τH\tau^{H} decreases with dd for d<31d<31, but it increases with dd for d>31d>31. In contrast, the fixation time always increases monotonically with dd for the single path.

Increasing the fitness of the final state rr increases the advantage of the final state over the intermediate states. This will result in the decrease in the time required for the population to make the last move. Increasing rr has no effect on the time required to cross the intermediate states or the time required to move away from the initial state. As a result, those two times remain constant even as rr increases, both in the single path and the hypercube.

Refer to caption
Figure 4: The probability of tunneling across the hypercube (circles, blue) is larger than in the single path (squares, red) due to the higher effective mutation rate. The tunneling across the valley denoted by the filled symbols (s=0.9s=0.9) is always larger than the probability of tunneling across a flat fitness landscape denoted by open symbols (s=1.0s=1.0). This arises from the fact that the stepwise accumulation of mutations would involve the fixation of a disadvantageous mutation in the first step. All symbols show the probabilities that the population tunnels at least across one state for the single path or at least one error class for the hypercube (N=100N=100, d=5d=5, r=1.1r=1.1, averaged over a 1000 realizations).

IV Intermediate mutation rates

The analytical approach is only valid as long as the mutation rate is small, μN2\mu\ll N^{-2}. For higher mutation rates, the population does not have to consist of at most two different types at any time. Instead, dd mutations can be fixed in the population without sequentially fixing one after the other. This process has been termed stochastic tunneling and is of great importance in the context of cancer initiation iwasa:2004aa ; michor:2004aa ; nowak:2004bb ; beerenwinkel:2007aa . Tunneling across fitness valleys is more likely than tunneling across a flat fitness landscape (see Fig. 4). Even for d=2d=2, the evolutionary dynamics is characterized by a doubly stochastic process, which makes analytical approaches tedious iwasa:2004aa . As discussed above, for μN21\mu N^{2}\ll 1 the population usually contains at most two different types. In this case, the probability of stochastic tunneling will be very small. On the other hand, for μN>1\mu N>1, at least one mutant is produced per generation. Thus, the probability of stochastic tunneling approaches 11. For N2<μ<N1N^{-2}<\mu<N^{-1}, the mutations are sometimes fixed sequentially and sometimes via stochastic tunneling. Fig. 5 shows how the tunneling probability increases from 0 to 11 in this interval.

Refer to caption
Figure 5: The probability of tunneling across a neutral hypercube (circles) is always higher than the probability of tunneling across a neutral single path (squares). Here, the probability that the system tunnels across at least one state or one error class is shown for d=5d=5 and N=1000N=1000. As expected, for Nμ>1N\mu>1 the probability of tunneling approaches 11. In contrast to our conservative estimate that tunneling can be neglected only as long as the mutation rate is below N2N^{-2}, even for mutation rates as large as 100N2100N^{-2} the probability of tunneling remains close to zero (s=1.0s=1.0, r=1.1r=1.1, averaged over a 1000 realizations).

For intermediate mutation rates, it is likely that the population contains more than two different types. The types with beneficial mutations will compete for fixation. This process is termed clonal interference crow:1970ck ; fisher:1930fi ; muller:1932aa ; gerrish:1998aa ; park:2007aa . Clonal interference has been considered to slow down adaptation, but recently it has been shown that it can have a positive influence on a rugged fitness landscape gerrish:1998aa ; wilke:2004aa ; jain:2007aa .

The states in a single path can be characterized by the number of mutations. In the hypercube, the states are characterized by the types of mutations that have already occurred. Thus, there are many different types that have undergone a specific number of mutations. However, all types that have already accumulated kk mutations can be pooled into the error class kk. The number of different types in the error class kk is given by (dk)=d!k!(dk)!\binom{d}{k}=\frac{d!}{k!(d-k)!}.

In a single path, a population is said to tunnel across a state if it passes through a state without ever reaching fixation in that state. Analogously, in a hypercube a population said to tunnel across an error class if it passes through that error class without ever reaching fixation in it. Within the error classes, tunneling can occur across individual states, but also across several states at once. This means that the whole population passes only across that particular state and not across any other, without ever reaching fixation in that particular state. Tunneling across an error class can also occur in a second way: the population can use all of the available states in the error class, but the total number of individuals in the error class never reaches NN. Thus, the probability of tunneling via the individual states is always lower than the probability of tunneling across the error classes. Fig. 6 shows the relation between the different probabilities of tunneling in the hypercube with respect to the rate of mutation μ\mu for the special case d=2d=2.

Refer to caption
Figure 6: For d=2d=2, the probability of tunneling across the intermediate state is slightly higher in the single path (squares) than in the hypercube (open circles), shown for N=100N=100 here. This is because the effective mutation rate into the intermediate state is twice as big in the hypercube, leading to a higher probability of fixation. Filled circles show the probability to tunnel across individual states of the hypercube. For μN>1\mu N>1, the system always tunnels. In the hypercube, both states are used for this. As expected, we need μN2\mu\ll N^{-2} for the tunneling probability to vanish (s=1.0s=1.0, r=1.1r=1.1, averaged over a 1000 realizations).

Due to higher effective rates of mutation, the probability of tunneling across a hypercube is expected to be greater than or equal to the probability of tunneling across a single path. However, numerical simulations reveal that for d=2d=2 the probability of tunneling in a single path is higher than in the hypercube. This is a special case: for d=2d=2 in a hypercube, the number of states into which the initial state can mutate into is 22. The effective rate of mutations is thus twice as much as in the single path. The number of states which can be mutated into next is one, both in the single path and the hypercube. Thus the rate at which the individuals are pushed into the first state is higher in hypercube than in the single path while the rate of individuals being pushed out is the same. Thus there is a higher probability of reaching fixation in the first error class in a hypercube (see Fig. 6). We only observe this effect for d=2d=2, for d>2d>2, the probability of tunneling is higher in a hypercube than in a single path, as expected (see Fig.  4).

V High mutation rates

For μN>1\mu N>1, the stochastic features of the dynamics become less important. In this case, the system can be described by a set of d+1d+1 deterministic differential equations for the fraction xk(t)x_{k}(t) of the population that has kk mutations jain:2007aa . Obviously, we have k=0dxk(t)=1\sum_{k=0}^{d}x_{k}(t)=1. Transitions out of state 0 occur with probability T0=(1x0ϕu00)x0T_{0\to}=(1-\frac{x_{0}}{\phi}u_{0\to 0})x_{0}, where ϕ=x0+(1x0xd)s+xdr\phi=x_{0}+(1-x_{0}-x_{d})\,s+x_{d}\,r is the average fitness of the population. This includes all the reproductive events except for the one where a type 0 is produced. Transitions into state 0 occur with probability T0=x0ϕu00(1x0)T_{\to 0}=\frac{x_{0}}{\phi}u_{0\to 0}(1-x_{0}). Thus, the fraction of individuals in the initial state follows the differential equation

x˙0(t)=1N[x0ϕu00(1x0)(1x0ϕu00)x0].\displaystyle\dot{x}_{0}(t)=\frac{1}{N}\left[\frac{x_{0}}{\phi}u_{0\to 0}(1-x_{0})-(1-\frac{x_{0}}{\phi}u_{0\to 0})x_{0}\right]. (15)

The probability that an offspring is of type kk is given by λk=j=0kxjrjϕujk\lambda_{k}=\sum_{j=0}^{k}\frac{x_{j}r_{j}}{\phi}u_{j\to k}. The difference between the hypercube and the single path only occurs in the quantity ujku_{j\to k}, which is given above for both cases. The sum in λk\lambda_{k} is over all individuals with kk or less mutations and rjr_{j} is the fitness of individuals with jj mutations. This leads to the differential equation for the fraction of individuals with kk mutations,

x˙k(t)=1N[λk(1xk)(1λk)xk],\displaystyle\dot{x}_{k}(t)=\frac{1}{N}\left[\lambda_{k}(1-x_{k})-(1-\lambda_{k})x_{k}\right], (16)

where k=0,,dk=0,\ldots,d. Of course, the special case k=0k=0 recovers Eq. (15). This set of d+1d+1 differential equations describes how the system moves from state k=0k=0 to state k=dk=d. In general, only a numerical solution of this system of equations is feasible. While this allows us to infer details of the dynamics, our main interest is the time required for fixation of dd number of mutations. Thus, we solve the differential equation numerically using a standard Runge-Kutta algorithm press:2007aa . To find an equivalent to the fixation time in a stochastic simulation, we average between fixation (xd=1x_{d}=1) and the time when there are on average less than 11 individuals outside the final state (xd=11Nx_{d}=1-\frac{1}{N}). Thus, the fixation time is the time when the solution of the differential equation crosses xd=112Nx_{d}=1-\frac{1}{2N}.

Refer to caption
Figure 7: The fixation times decrease with increasing mutation rate. Fixation always occurs faster on the hypercube (circles) than in the single path (squares). For small mutation rates, mutations fixate sequentially and the fixation time can be well approximated by Eqs. (8) and (10). Here, the fixation times decrease as μ1\mu^{-1}. For high mutation rates, the system can be approximated by a set of deterministic differential equations and the simulation results for the fixation times can be approximated based on the numerical solution of Eq. (16). In this case, fixation times decrease in general slower than μ1\mu^{-1} with increasing mutation rate (population size N=1000N=1000, d=5d=5, s=1s=1, r=1.1r=1.1, averages over 1000 realizations).

Fig. 7 shows an overview of the fixation times, covering the full range of mutation rates. For small mutation rates, we have sequential fixation of mutations and the time can be well approximated by Eqs. (8) and (10). For high mutation rates, the numerical solution of Eq. (16) leads to a good approximation for the fixation times.

VI Discussion

We have determined the average time during which a population moves from a certain initial state to a final state of higher fitness. The initial and the final states are separated by a fixed number of mutations dd. The mutations jointly confer a fitness advantage to the final mutant which can be represented by a peak in the fitness landscape. If the intermediate mutations need to occur in a specific order for the evolution of the final mutant then it corresponds to the single path. Otherwise, evolution occurs on a hypercube and there are d!d! ways of reaching the final state.

We have explored the simplest system in which the fitness in all intermediate states is the same. As expected, the fixation times on a hypercube are shorter than on a single path, due to the presence of multiple paths available in a hypercube. This observation leads to the question: for which parameters does the hypercube show shorter fixation times than the single path, even with an added disadvantage? The fitness in the intermediate states was then set to lower values than the ones in the single path. Up to a certain threshold value of the fitness of the intermediate states, the hypercube shows shorter fixation times than in the single path. The value of the threshold depends on the population size, total number of required mutations and the fitness in the final state.

The fixation times for large populations largely depend on the fitness function and are qualitatively independent of the order of mutations. Let us first focus on a flat landscape: when the intermediate states have a fitness equal to the fitness of the initial wild-type, then for small mutation rates large populations have shorter fixation times than small populations. This is because the neutral rate of evolution does not depend on the population size. But the waiting time for fixation of the last mutation becomes shorter with larger population size. For intermediate mutation rates, tunneling starts earlier in larger populations. This leads to a marked decrease in the fixation time with larger population size. For high mutation rates, the time to fixation is no longer dominated by the time for the first mutant to reach the final state, but by the time until all individuals are in that state. Due to this, for high mutation rates the time required for fixation can be shorter in smaller population as compared to larger populations. Next, we focus on fitness valley: If the fitness landscape consists of a valley with reduced fitness of the intermediate states, small populations have an advantage for small mutation rates, as they can easily leave the initial state and enter the valley. But for high mutation rates, large populations reach fixation faster, because they can explore states within and beyond the fitness valley more easily.

Our numerical simulations reveal that tunneling can be neglected even when the mutation rate exceeds N2N^{-2}, at least by one order of magnitude. Thus, Eqs. (8) and (10) provide good estimates for the fixation times even in relatively large populations. Concrete values for fixation times are collected in Table 1. They reveal that even in long-term studies of experimental evolution, it is difficult to observe the consecutive fixation of neutral mutants cooper:2003aa . Consecutive fixation of advantageous mutants, however, is significantly faster. For example, while Table 1 reveals a fixation time of 1011\sim 10^{11} generations on a single path for d=10d=10, s=1s=1 and N=106N=10^{6}, an optimal choice of the intermediate fitness values traulsen:2007ee would lead to a fixation time of 107\sim 10^{7} generations.

NN d=3d=3 d=10d=10
Single Path Hypercube Single Path Hypercube
10210^{2} 2.109992.10999 0.9433250.943325 9.109999.10999 2.038962.03896
10410^{4} 2.00112.0011 0.8344330.834433 9.00119.0011 1.930071.93007
10610^{6} 2.000012.00001 0.8333440.833344 9.000019.00001 1.928981.92898
Table 1: The time required for fixation of dd mutations in units of 101010^{10} generations for a mutation rate of μ=1010\mu=10^{-10} based on Eqs. (8) and (10). The intermediate mutations are neutral, s=1s=1. For small mutation rates, the fixation times scale linearly with μ1\mu^{-1}. For NN\to\infty, the fixation time on the single path approaches μ1(d1)\mu^{-1}(d-1) and the fixation time on the hypercube approaches μ1k=0d2(dk)1\mu^{-1}\sum_{k=0}^{d-2}(d-k)^{-1}. However, the mutation rates have to decrease with increasing NN to make the approximation for the fixation times valid. (initial fitness 1.01.0 and final fitness r=1.1r=1.1).

While we have focused on the simplest possible system which allows analytical approximations, experimental studies reveal of course a much higher complexity. weinreich:2006aa studied experimentally the point mutations in the β\beta-lactamase gene of bacteria. β\beta lactam antibiotics are commonly used, but the bacteria can develop resistance to the drugs. Five point mutations in a particular allele of the β\beta-lactamase gene increases the resistance of the bacteria to cefotaxime by a factor of 100,000\sim 100,000. Theoretically the mutations leading from the wild-type allele to the resistant allele can occur in 5!=1205!=120 ways. These can be represented by a hypercube of d=5d=5. But in only 1818 of the 120120 trajectories, the intermediate mutations are either neutral with respect to the initial state or beneficial. Weinreich and colleagues have shown that these have the highest probability of realization. For all beneficial intermediates the fastest way to reach the final state would be when the relative fitness increase between any two consecutive mutations is constant traulsen:2007ee , but usually in nature several different mutations are available and the population first evolves to states that provide the highest selective advantage.

In another experimental study the sequence space of the 5s rRNA of a marine bacterium, Vibrio proteolyticus was explored lee:1997 . The sequences from Vibrio proteolyticus and Vibrio alginolyticus differ in only four positions. All the possible intermediates were constructed by the authors and the fitness of each was calculated chao:1985aa . Two of the valid intermediates have a fitness lower than the initial wild-type. We have shown how such fitness valleys can be crossed by exploring the phenomenon of tunneling or multiple mutations (for high mutation rates). Thus, the population does not need to move in a Wrightian fashion (the whole population moving as a whole across the valley).

The theory discussed herein deals with basic evolutionary concepts which are important to the kind of biological examples described above. More complex properties of the experimental studies like more general cases of epistasis and compensatory mutations can easily be incorporated, but there is a huge number of possibilities. Even if we are only interested in the ordering of fitness values, we can have up to 2d!2^{d}! distinct epistatic patterns. Thus, one should rather focus on concrete systems instead. For example, one could simulate the dynamics in a system with experimentally derived fitness values and mutation rates. Not all the paths of a hypercube might be accessible for selection, but still some of them might prove to be significant depending upon the particular values of the parameters, such as fitness values and population size. Our goal here was to characterize the simplest features of the dynamics of a population crossing a fitness valley. This approach can be helpful when more realistic scenarios are addressed.

VII Acknowledgment

We thank Andrew Murray for initiating this project. C.S.G. and A.T. are grateful for financial support from the Emmy-Noether program of the DFG. M.A.N. is supported by the John Templeton Foundation, the NSF/NIH joint program in mathematical biology (NIH Grant R01GM078986), the Bill and Melinda Gates Foundation (Grand Challenges Grant 37874), and J. Epstein.

References

  • (1) Nowak MA (2006) Evolutionary Dynamics. Harvard University Press, Cambridge, MA.
  • (2) Eigen M, Schuster P (1977) The hypercycle. A principle of natural self-organization. Part A: Emergence of the hypercycle. Die Naturwiss 64:541–565.
  • (3) Eigen M, McCaskill J, Schuster P (1989) The molecular quasi-species. Adv Chem Phys 75:149–263.
  • (4) Wilke C (2005) Quasispecies theory in the context of population genetics. BMC Evol. Biol. 5:44.
  • (5) Nowak MA (1992) What is a quasispecies? Trends in Ecology and Evolution 7:118–121.
  • (6) van Nimwegen E, Crutchfield J (2000) Metastable evolutionary dynamics: crossing fitness barriers or escaping via neutral paths? Bull Math Biol 62:799–848.
  • (7) Weinreich D, Delaney N, DePristo M, Hartl D (2006) Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312:111–114.
  • (8) Poelwijk FJ, Kiviet DJ, Weinreich DM, Tans SJ (2007) Empirical fitness landscales reveal accessible evolutionary paths. Nature 445:383–386.
  • (9) Traulsen A, Iwasa Y, Nowak MA (2007) The fastest evolutionary trajectory. J. Theor. Biol. 249:617–623.
  • (10) Iwasa Y, Michor F, Nowak MA (2004) Stochastic tunnels in evolutionary dynamics. Genetics 166:1571–1579.
  • (11) Frank SA (2007) Evolutionary Dynamics of Cancer. Princeton Univ. Press.
  • (12) Fisher J (1959) Multiple-mutation theory of carcinogenesis. Nature 181:651–652.
  • (13) Nordling C (1953) A new theory on cancer inducing mechanism. Br J Cancer 7:68–72.
  • (14) Armitage P, Doll R (1954) The age distribution of cancer and a multi-stage theory of carcinogenesis. Br J Cancer 8:1–12.
  • (15) Knudson AG (1971) Mutation and cancer: Statistical study of retinoblastoma. Proc. Natl. Acad. Sci. U.S.A. 68:820–823.
  • (16) Michor F, Iwasa Y, Nowak MA (2004) Dynamics of cancer progression. Nat. Rev. Cancer 4:197–205.
  • (17) Vogelstein B, Kinzler K (2004) Cancer genes and the pathways they control. Nat. Med. 10:789–799.
  • (18) Sjöblom T, Jones S, Wood L, Parsons D, Lin J, et al. (2006) The consensus coding sequences of human breast and colorectal cancers. Science 314:268–274.
  • (19) Wood LD, Parsons DW, Jones S, Lin J, Sjöblom T, et al. (2007) The genomic landscapes of human breast and colorectal cancers. Science 318:1108–1113.
  • (20) Jones S, Chen WD, Parmigiani G, Diehl F, Beerenwinkel N, et al. (2008) Comparative lesion sequencing provides insights into tumor evolution. Proc. Natl. Acad. Sci. U.S.A. 105:4283–4288.
  • (21) Jones S, Zhang X, Parsons DW, Lin JCH, Leary RJ, et al. (2008) Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science 321:1801–1806.
  • (22) Beerenwinkel N, Antal T, Dingli D, Traulsen A, Kinzler KW, et al. (2007) Genetic progression and the waiting time to cancer. PLoS Comput. Biol. 3:e225.
  • (23) Gerstung M, Beerenwinkel N (2008) Waiting time models of cancer progression. Mathematical Population Studies, to appear. arXiv:0807.3638
  • (24) Weinreich D (2005) The rank ordering of genotypic fitness values predicts genetic constraint on natural selection on landscapes lacking sign epistasis. Genetics 171:1397–1405.
  • (25) Moran PAP (1962) The Statistical Processes of Evolutionary Theory. Oxford: Clarendon Press, Oxford.
  • (26) Weinreich DM, Watson R, Chao L (2005) Perspective: sign epistasis and genetic constraint on evolutionary trajectories. Evolution 56:1165–1174.
  • (27) DePristo MA, Hartl DL, Weinreich DM (2007) Mutational reversions during adaptive protein evolution. Mol. Biol. Evol. 24:1608–1610.
  • (28) Karlin S, Taylor HMA (1975) A First Course in Stochastic Processes. London: Academic, 2nd edition edition.
  • (29) Ewens WJ (2004) Mathematical Population Genetics. Springer, New York.
  • (30) Crow JF, Kimura M (1970) An Introduction to Population GeneticsTheory. Harper and Row, New York, NY.
  • (31) Goel N, Richter-Dyn N (1974) Stochastic Models in Biology. Academic Press, New York.
  • (32) Antal T, Scheuring I (2006) Fixation of strategies for an evolutionary game in finite populations. Bull Math Biol 68:1923–1944.
  • (33) Atwood KC, Schneider KL, Ryan FJ (1951) Periodic selection in escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 37:146–155.
  • (34) Gillespie JH (1983) Some properties of finite populations experiencing strong selection and weak mutation. Am Nat. 121:691–708.
  • (35) Gillespie JH (2004 (2nd edition)) Population Genetics : A Concise Guide. The Johns Hopkins University Press.
  • (36) Weinreich DM, Chao L (2005) Rapid evolutionary escape by large populations from local fitness peaks is likely in nature. Evolution 59:1175–1182.
  • (37) Nowak MA, Michor F, Komarova NL, Iwasa Y (2004) Evolutionary dynamics of tumor suppressor gene inactivation. Proc. Natl. Acad. Sci. U.S.A. 101:10635–10638.
  • (38) Fisher RA (1930) The Genetical Theory of Natural Selection. Clarendon Press, Oxford.
  • (39) Muller HJ (1932) Some genetic aspects of sex. Am Nat 66:118–138.
  • (40) Gerrish PJ, Lenski RE (1998) The fate of competing beneficial mutations in a asexual population. Genetica 102/103:127–144.
  • (41) Park SC, Krug J (2007) Clonal interference in large populations. Proc. Natl. Acad. Sci. U.S.A. 104:18135–18140.
  • (42) Wilke C (2004) The speed of adaptation in large asexual populations. Genetics 167:2045–2053.
  • (43) Jain K, Krug J (2007) Deterministic and stochastic regimes of asexual evolution on rugged fitness landscapes. Genetics 175:1275–1288.
  • (44) Press WH, Teukolsky SA, Vetterling WT, Flannery BP (2007) Numerical Recipes 3rd Edition: The Art of Scientific Computing. New York: Cambridge University Press.
  • (45) Cooper T, Rozen D, Lenski RE (2003) Parallel changes in gene expression after 20,000 generations of evolution in escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 100:1072–1077.
  • (46) Lee TH, DSouza LM, Fox GE (1997) Equally parsimonious pathways through an RNA sequence space are not equally likely. J. Mol. Evol. 45:278–284.
  • (47) Chao L, McBroom SM (1985) Evolution of transposable elements: an IS10 insertion increases fitness in Escherichia coli. Molecular Biology and Evolution 2:359–369.