This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Behavioral Mistakes Support Cooperation
in an N-Person Repeated Public Goods Game

Jung-Kyoo Choi1 and Jun Sok Huhh2

1 School of Economics and Trade, Kyungpook National University, Daegu 41566,. Korea
2 NCSoft, Gyeonggi-do, 13494, Korea
jkchoi@knu.ac.kr
Abstract

This study investigates the effect of behavioral mistakes on the evolutionary stability of the cooperative equilibrium in a repeated public goods game. Many studies show that behavioral mistakes have detrimental effects on cooperation because they reduce the expected length of mutual cooperation by triggering the conditional retaliation of the cooperators. However, this study shows that behavioral mistakes could have positive effects. Conditional cooperative strategies are either neutrally stable or are unstable in a mistake-free environment, but we show that behavioral mistakes can make all of the conditional cooperative strategies evolutionarily stable. We show that behavioral mistakes stabilize the cooperative equilibrium based on the most intolerant cooperative strategy by eliminating the behavioral indistinguishability between conditional cooperators in the cooperative equilibrium. We also show that mistakes make the tolerant conditional cooperative strategies evolutionarily stable by preventing the defectors from accumulating the free-rider’s advantages. Lastly, we show that the behavioral mistakes could serve as a criterion for the equilibrium selection among cooperative equilibria.


Keywords: Evolutionary stability of cooperative strategy; Repeated public goods game; Behavioral mistakes; Equilibrium selection
JEL Classification to be added

1 Introduction

Previous research has shown that in the context of a social dilemma, repeated interactions induce agents to consider the possibility of retaliation and thus to support a cooperative equilibrium (Fudenberg & Maskin, 1986; Fundenberg & Maskin, 1990; Axelrod & Hamilton, 1981; Taylor, 1987). This result is known as the folk theorem in economics or the reciprocity hypothesis in evolutionary biology. However, there are some criticisms of the repeated game approach. One of the issues is that repetition supports too many Nash equilibria, which raises the question of equilibrium selection (Fundenberg & Maskin, 1990; Binmore & Samuelson, 1992; Axelrod, 1997; Choi, 2007; Sethi & Somanathan, 2003). In this regard, many efforts have been made to find sharper criteria for selecting efficient equilibria (Binmore & Samuelson, 1992; Fundenberg & Maskin, 1990; Sethi & Somanathan, 2003).

Setting aside the problems of multiple equilibria and equilibrium selection, other problems remain, particularly from an evolutionary perspective. Previous studies show that cooperators are unlikely to evolve when they are rare in a population, especially when the group size is large (Gintis, 2006; Boyd & Richerson, 1988), and that even the efficient equilibria that are supported by game repetition do not satisfy dynamic stability (Young & Foster, 1991; Gintis, 2006; Boyd & Lorberbaum, 1987; Farrell & Ware, 1989; Yao, 1996).

Among these two issues of inaccessibility and instability, the latter issue is the primary issue that we address in this paper. We will reconfirm the conclusion from previous studies that cooperation cannot be sustained in the long run, and we will investigate the role of behavioral mistakes in stabilizing the cooperative equilibrium. In this paper, we analyze a model of a repeated public goods game in an error-prone environment where the individuals make unintended behavioral mistakes. The starting point of this paper is closely related to several previous studies. Joshi (1987) and Boyd & Richerson (1988) studied the effectiveness of repetition in a social dilemma situation where more than two agents are involved. Both studies assumed a repeated nn-person public goods game structure in which conditional retaliation is triggered whenever the number of cooperators is less than the level of tolerance displayed by each individual. These studies introduced conditional cooperative strategies that differ in the number of defectors that are tolerated before retaliation is triggered. In this setting they showed that the only conditional cooperative strategy that is stable is the one that does not tolerate any defections, and the other conditional cooperative strategies that tolerate some defections are not stable at all.

In this regard, many studies note that the conditional cooperative strategy that does not tolerate any defections is only neutrally stable in that it can remove defectors from its population when the game is repeated with sufficiently high probability, but it allows other cooperative strategies with higher tolerance levels to remain in its population. It is because, in the absence of defectors in the population, all of the conditional cooperative strategies, regardless of their tolerance levels, are behaviorally indistinguishable and, as a result, receive the same payoffs. Mutant cooperative strategies with higher tolerance levels enter the population and can cause a random drift as long as there are no defectors in the population. This process will ultimately result in a population state having too many tolerant conditional cooperators, which makes the population vulnerable to the invasion of defectors into the population. This is drift problem well known in evolutionary biology. In other words, a population that is composed of individuals using the conditional cooperative strategy with the least tolerance is subject to drift by which the dynamic path is ultimately led to a situation where the population is occupied only by defectors. Cooperation cannot be sustained in the long run and this dynamic instability is due to the fact that none of the cooperative strategies are evolutionarily stable (Samuelson, 2002; Choi, 2007).

Here, we introduce a crucial question that remains unaddressed. Are the previous conclusions modified if the individual agents are allowed to make mistakes? There have been some studies that examine the effect of mistakes on the cooperative equilibrium in a repeated game. Most of these studies, however, only partially address the effect of behavioral errors, primarily in the context of their detrimental effects on cooperation (Boyd, 1989; Bendor & Mookherjee, 1987).

In this paper, we will reframe the model in an error-prone environment and show how introducing behavioral mistakes significantly modifies the conclusions that are drawn from an error-free assumption. As many studies have shown, mistakes have a detrimental effect on cooperation because they trigger retaliation and reduce the expected duration of the game. However, we will show that, in the presence of behavioral mistakes, the conditional cooperative strategy can become evolutionarily stable. In Section 2, we set up a model of a public goods game and introduce the dynamic stability issue of cooperative equilibrium. We identify the dynamic property of cooperative equilibrium supported by repetition and show why neutral stability does not guarantee the sustainability of cooperation in the long run. In Section 3, we present a model of a public goods game where the cooperators make mistakes and show that the cooperators’ mistakes make all of the conditional cooperative strategies evolutionarily stable if the game is repeated with a sufficiently high probability. We also show that the rate of mistakes serves as an equilibrium selection criterion. Lastly, in Section 4 we show that the conclusions drawn in the previous sections still hold in the more general cases.

2 An nn-Person Public Goods Game without Errors

Suppose that groups of nn individuals are randomly drawn from the population to interact repeatedly in an nn-person public goods game111The model of this section is based on Boyd & Richerson (1988).. Each individual’s payoff depends on his or her action and the actions of the nβˆ’1n-1 other individuals in the group. Let jj be the number of cooperators among the other members in the group. Define F​(C|j)F(C|j) and F​(D|j)F(D|j) as the payoff of the one-shot game for cooperation and defection, respectively, when there are jj cooperators among the nβˆ’1n-1 other individuals. If the player is a cooperator, then the number of cooperators in the group becomes j+1j+1; otherwise, the number of cooperators remains jj. Therefore, we have

F​(C|j)=bβ‹…(j+1)nβˆ’cF​(D|j)=bβ‹…jn.\displaystyle\begin{aligned} F(C|j)&=\frac{b\cdot(j+1)}{n}-c\\ F(D|j)&=\frac{b\cdot j}{n}.\end{aligned} (1)

Assume that this public goods game is repeated in each group with probability Ξ΄\delta. For simplicity, assume that individuals do not discount the future. Furthermore, consider a situation with incomplete information: players know how many cooperators in their group were in the previous stage, but they do not know exactly who defected.

Assume that the individuals only remember the outcome of the last stage. The individuals now decide whether to cooperate in this stage conditional on how many cooperators were in the previous stage. Define individual ii’s pure strategy set, Si={Tk|0≀k≀n}S_{i}=\{T_{k}|0\leq k\leq n\} for an integer kk. The TkT_{k} strategy is similar to a tit-for-tat strategy in a two-person prisoner’s dilemma and can be expressed as follows: cooperate if kk or more of the other nβˆ’1n-1 individuals cooperate in the group during the previous stage and defect otherwise. We will call a player who uses the TkT_{k} strategy a kk-cooperator. Thus, for example, a person following the Tnβˆ’1T_{n-1} strategy (i.e., an [nβˆ’1][n-1]-cooperator) will cooperate only if every other individual cooperates, a person with the T0T_{0} strategy will cooperate unconditionally, and a person with the TnT_{n} strategy is an unconditional defector because it is impossible to have nn cooperators among nβˆ’1n-1 the others. The subscript kk (if k<nk<n), therefore, refers to a player’s degree of willingness to retaliate. We call Tnβˆ’1T_{n-1} the hardest (or the least tolerant) strategy because the [nβˆ’1][n-1]-cooperator does not tolerate any defectors in their group.

Consider a situation where every conditional cooperator has the same kk, and a small number of TnT_{n} players (unconditional defectors) appear in this population. Let the share of mutants in the post-entry population be μ\mu, where μ∈(0,1)\mu\in(0,1).

When the game is repeated with the probability Ξ΄\delta, the payoff for TkT_{k} and TnT_{n} from the repeated game is as follows:

V​(Tk|j)={11βˆ’Ξ΄β€‹F​(C|j)ifΒ j>k11βˆ’Ξ΄β€‹F​(C|j)ifΒ j=kF​(C|j)+Ξ΄1βˆ’Ξ΄β€‹F​(D|0)ifΒ j<kV​(Tn|j)={11βˆ’Ξ΄β€‹F​(D|j)ifΒ j>kF​(D|j)+Ξ΄1βˆ’Ξ΄β€‹F​(D|0)ifΒ j=kF​(D|j)+Ξ΄1βˆ’Ξ΄β€‹F​(D|0)ifΒ j<k.\displaystyle\begin{aligned} V(T_{k}|j)&=\begin{cases}\frac{1}{1-\delta}F(C|j)&\text{if $j>k$}\\ \frac{1}{1-\delta}F(C|j)&\text{if $j=k$}\\ F(C|j)+\frac{\delta}{1-\delta}F(D|0)&\text{if $j<k$}\end{cases}\\ V(T_{n}|j)&=\begin{cases}\frac{1}{1-\delta}F(D|j)&\text{if $j>k$}\\ F(D|j)+\frac{\delta}{1-\delta}F(D|0)&\text{if $j=k$}\\ F(D|j)+\frac{\delta}{1-\delta}F(D|0)&\text{if $j<k$}.\end{cases}\end{aligned} (2)

After playing a repeated public goods game, the players have the chance to update their strategy. Let pkp_{k} and pnp_{n} be the population frequency of the kk-cooperators and the unconditional defectors, respectively. We assume that the strategy-updating occurs in the following way: Whether the unconditional defectors can increase their frequency in a population where pk=1βˆ’ΞΌp_{k}=1-\mu and pn=ΞΌp_{n}=\mu depends on whether the expected payoff for the TnT_{n} strategy is greater than that for the TkT_{k} strategy. When pk=1βˆ’ΞΌp_{k}=1-\mu, the expected payoffs for the TkT_{k} strategy and the TnT_{n} strategy are

W​(Tk|pk=1βˆ’ΞΌ)=βˆ‘j=0kβˆ’1V​(Tk|j)​m​(j,pk)+V​(Tk|k)​m​(k,pk)+βˆ‘j=k+1nβˆ’1V​(Tk|j)​m​(j,pk)W​(Tn|pk=1βˆ’ΞΌ)=βˆ‘j=0kβˆ’1V​(Tn|j)​m​(j,pk)+V​(Tn|k)​m​(k,pk)+βˆ‘j=k+1nβˆ’1V​(Tn|j)​m​(j,pk),\displaystyle\begin{aligned} W(T_{k}|p_{k}=1-\mu)&=\sum_{j=0}^{k-1}V(T_{k}|j)m(j,p_{k})+V(T_{k}|k)m(k,p_{k})+\sum_{j=k+1}^{n-1}V(T_{k}|j)m(j,p_{k})\\ W(T_{n}|p_{k}=1-\mu)&=\sum_{j=0}^{k-1}V(T_{n}|j)m(j,p_{k})+V(T_{n}|k)m(k,p_{k})+\sum_{j=k+1}^{n-1}V(T_{n}|j)m(j,p_{k}),\end{aligned} (3)

where m​(j,pk)m(j,p_{k}) is the probability that an individual finds him/herself in a group in which there are jj other individuals following the TkT_{k} strategy, which can be written as

m​(j,pk)=(nβˆ’1j)​pkj​(1βˆ’pk)nβˆ’1βˆ’j.\displaystyle m(j,p_{k})=\binom{n-1}{j}p_{k}^{j}(1-p_{k})^{n-1-j}.

In general, when a mutant strategy Tkβ€²T_{k^{\prime}} appears with a share of ΞΌ\mu in a population that is homogeneously composed of individuals following the TkT_{k} strategy (where kβ€²β‰ kk^{\prime}\neq k), the TkT_{k} strategy is neutrally stable if and only if there exists a range of μ∈[0,ΞΌΒ―]\mu\in[0,\bar{\mu}] that satisfies W​(Tk|pk=1βˆ’ΞΌ)β‰₯W​(Tkβ€²|pk=1βˆ’ΞΌ)W(T_{k}|p_{k}=1-\mu)\geq W(T_{k^{\prime}}|p_{k}=1-\mu) and that is evolutionarily stable when the condition is satisfied with the strict inequality. This stability condition is equivalent to W​(Tk|pk=1)β‰₯W​(Tkβ€²|pk=1)W(T_{k}|p_{k}=1)\geq W(T_{k^{\prime}}|p_{k}=1), and W​(Tk|pkβ€²=1)>W​(Tkβ€²|pkβ€²=1)W(T_{k}|p_{k^{\prime}}=1)>W(T_{k^{\prime}}|p_{k^{\prime}}=1) if W​(Tk|pk=1)=W​(Tkβ€²|pk=1)W(T_{k}|p_{k}=1)=W(T_{k^{\prime}}|p_{k}=1) (Weibull, 1997).

Proposition 1.

In an n-person public goods game where the game is repeated with probability Ξ΄\delta, the Tnβˆ’1T_{n-1} strategy is neutrally stable.

Proof.

For the Tnβˆ’1T_{n-1} strategy to be neutrally stable, there should exist a ΞΌΒ―\bar{\mu} such that for μ∈[0,ΞΌΒ―]\mu\in[0,\bar{\mu}],

W​(Tnβˆ’1|pnβˆ’1=1βˆ’ΞΌ)β‰₯W​(Tkβ€²|pnβˆ’1=1βˆ’ΞΌ),βˆ€kβ€²β‰ nβˆ’1\displaystyle W(T_{n-1}|p_{n-1}=1-\mu)\geq W(T_{k^{\prime}}|p_{n-1}=1-\mu),\quad\forall k^{\prime}\neq n-1 (4)

is satisfied.

(i) For kβ€²<nβˆ’1k^{\prime}<n-1. Suppose that a mutant with Tkβ€²T_{k^{\prime}} (where kβ€²<nβˆ’1k^{\prime}<n-1) appears in a population where pnβˆ’1=1p_{n-1}=1. In the absence of a TnT_{n} strategy, all of the conditional cooperative strategies (i.e., all of the Tkβ€²T_{k^{\prime}} strategies where kβ€²<nβˆ’1k^{\prime}<n-1 and the Tnβˆ’1T_{n-1} strategy) are behaviorally indistinguishable among themselves and receive the same payoffs. In other words, any TkT_{k} strategy players (if k<nk<n begin the game by playing CC and continue playing CC until the game ends because there would be no defection in the group in the absence of TnT_{n}. Therefore, we have W​(Tnβˆ’1|pnβˆ’1=1βˆ’ΞΌ)=W​(Tkβ€²|pnβˆ’1=1βˆ’ΞΌ)=(bβˆ’c)/(1βˆ’Ξ΄)W(T_{n-1}|p_{n-1}=1-\mu)=W(T_{k^{\prime}}|p_{n-1}=1-\mu)=(b-c)/(1-\delta) for all kβ€²<nβˆ’1k^{\prime}<n-1 and all ΞΌ\mu.

(ii) For kβ€²=nk^{\prime}=n. Suppose that a mutant having TnT_{n} appears in a population that is homogeneously composed of the Tnβˆ’1T_{n-1} strategy players. Where pnβˆ’1=1p_{n-1}=1 we have m​(nβˆ’1,pnβˆ’1)=m​(nβˆ’1,1)m(n-1,p_{n-1})=m(n-1,1). Therefore, the payoff to the Tnβˆ’1T_{n-1} is W​(Tnβˆ’1|pnβˆ’1=1)=11βˆ’Ξ΄β€‹F​(C|nβˆ’1)=11βˆ’Ξ΄β€‹(bβˆ’c)W(T_{n-1}|p_{n-1}=1)=\frac{1}{1-\delta}F(C|n-1)=\frac{1}{1-\delta}(b-c), and the payoff to the TnT_{n} is W​(Tn|pnβˆ’1=1)=F​(D|nβˆ’1)+Ξ΄1βˆ’Ξ΄β€‹F​(D|0)=b​(nβˆ’1)nW(T_{n}|p_{n-1}=1)=F(D|n-1)+\frac{\delta}{1-\delta}F(D|0)=\frac{b(n-1)}{n}. Now we obtain the condition, 11βˆ’Ξ΄β€‹(bβˆ’c)>b​(nβˆ’1)n\frac{1}{1-\delta}(b-c)>\frac{b(n-1)}{n}, which can be rearranged to Ξ΄>cβˆ’(b/n)bβˆ’(b/n)\delta>\frac{c-(b/n)}{b-(b/n)}. Because c<bc<b, there exists a Ξ΄<1\delta<1 that satisfies the condition. Therefore, when the game is repeated with a sufficiently high Ξ΄\delta, we have W​(Tnβˆ’1|pnβˆ’1=1)>W​(Tn|pnβˆ’1=1)W(T_{n-1}|p_{n-1}=1)>W(T_{n}|p_{n-1}=1). ∎

Applying the same logic to the stability of the other conditional cooperative strategies, we can easily show that the other softer TkT_{k} strategies (where k<nβˆ’1k<n-1) are not stable. The TkT_{k} strategies with k<nβˆ’1k<n-1 tolerate the defector’s free riding behavior as long as the number of defectors in the group does not exceed nβˆ’kβˆ’1n-k-1. As a result, these strategies allow the defectors to invade the TkT_{k} population.

Corollary 1.

In an nn-person public goods game where the game is repeated with probability Ξ΄\delta, the TkT_{k} strategies where k<nβˆ’1k<n-1, are not stable.

Proof.

Suppose that a mutant, TnT_{n}, appears in a population that is entirely composed of TkT_{k} strategy players (k<nβˆ’1k<n-1), i.e., pk=1p_{k}=1. The expected payoff for the TkT_{k} strategy when pkp_{k} is equal to 1 is W​(Tk|pk=1)=(bβˆ’c)/(1βˆ’Ξ΄)W(T_{k}|p_{k}=1)=(b-c)/(1-\delta) and the expected payoff to the mutant TnT_{n} strategy when pkp_{k} is equal to 1 is W​(Tk|pk=1)=b​(nβˆ’1)n/(1βˆ’Ξ΄)W(T_{k}|p_{k}=1)=\frac{b(n-1)}{n}/(1-\delta). Because b​(nβˆ’1)n>bβˆ’c\frac{b(n-1)}{n}>b-c by the assumption of the payoff structure, we have W​(Tk|pk=1βˆ’ΞΌ)<W​(Tn|pk=1βˆ’ΞΌ)W(T_{k}|p_{k}=1-\mu)<W(T_{n}|p_{k}=1-\mu) for a sufficiently small ΞΌ\mu and for all k<nβˆ’1k<n-1. ∎

That the Tnβˆ’1T_{n-1} strategy is only neutrally stable and not evolutionarily stable raises interesting issues regarding the dynamic property of the equilibrium. Consider a population homogeneously that is composed of individuals using Tnβˆ’1T_{n-1}, the least tolerant cooperatrive strategy. Because the Tnβˆ’1T_{n-1} individuals do not allow any free riders in their group, the defection strategy cannot invade this population if the probability of game repetition is sufficiently high. The defectors will be retaliated against immediately and eliminated by the numerically predominant Tnβˆ’1T_{n-1} individuals. However, other Tk<nβˆ’1T_{k<n-1} strategies can invade the Tnβˆ’1T_{n-1} population because all of the conditional cooperators are behaviorally indistinguishable and receive the same payoffs when the defectors do not exist. And this indistinguishability allows the Tk<nβˆ’1T_{k<n-1} strategies to remain in the Tnβˆ’1T_{n-1} population. Once a sufficient number of Tk<nβˆ’1T_{k<n-1} strategies accumulate in the population, the defectors can invade and gain benefits from free riding because the Tk<nβˆ’1T_{k<n-1} strategies tolerate some defections. Therefore, due to this behavioral indistinguishability between Tnβˆ’1T_{n-1} and Tk<nβˆ’1T_{k<n-1} in the absence of a defection strategy, the equilibrium state where only the Tnβˆ’1T_{n-1} strategy is present will not persist over long periods.

In the following section, we introduce behavioral mistakes and show how these mistakes change the previous conclusions obtained from an error-free environment.

3 An nn-Person Public Goods Game and Behavioral Errors

3.1 Introducing Mistakes

Let us assume that only conditional cooperators make mistakes (to make a contribution, a cooperator needs to take an action, whereas a defector simply does nothing). For simplicity, a player who made a mistake does not know that it is he/she that made the mistake. Cooperators only know the total number of cooperations in their group, and they determine whether they would cooperate or defect in the next round depending on this number.

Because some cooperators defect by mistake, the payoff to individuals now depends on the number of errors made by the cooperators as well as the number of cooperators in a group. Let jj be the number of cooperators among nβˆ’1n-1 other members in a group. Let Οˆβ€‹(j,q,Ξ΅)\psi(j,q,\varepsilon) be the probability that qq individuals among the jj other cooperators in a group make mistakes when the probability of making a mistake is Ξ΅\varepsilon.

Οˆβ€‹(j,q,Ξ΅)=(jq)​Ρq​(1βˆ’Ξ΅)jβˆ’q​forΒ Ξ΅>0.\displaystyle\psi(j,q,\varepsilon)=\binom{j}{q}\varepsilon^{q}(1-\varepsilon)^{j-q}~\text{for $\varepsilon>0$}.

The payoffs from the one-shot game, given that jj among nβˆ’1n-1 other individuals choose cooperation, now become

FΡ​(C|j)\displaystyle F^{\varepsilon}(C|j) =(1βˆ’Ξ΅)(βˆ‘q=0jψ(j,q,Ξ΅)(bΓ—jβˆ’q+1nβˆ’c))+Ξ΅(βˆ‘q=0jψ(j,q,Ξ΅)(bΓ—jβˆ’qn)\displaystyle=(1-\varepsilon)\left(\sum_{q=0}^{j}\psi(j,q,\varepsilon)(b\times{j-q+1\over n}-c)\right)+\varepsilon\left(\sum_{q=0}^{j}\psi(j,q,\varepsilon)(b\times{j-q\over n}\right)
FΡ​(D|j)\displaystyle F^{\varepsilon}(D|j) =βˆ‘q=0jΟˆβ€‹(j,q,Ξ΅)​(bΓ—jβˆ’qn).\displaystyle=\sum_{q=0}^{j}\psi(j,q,\varepsilon)\left(b\times{j-q\over n}\right).

Suppose that the game is repeated with probability Ξ΄\delta. Let VΡ​(Tk|j)V^{\varepsilon}(T_{k}|j) be the payoff for the TkT_{k} strategy when jj among the nβˆ’1n-1 other members in his or her group are the conditional cooperators and the conditional cooperators play defection by mistake with probability Ξ΅\varepsilon. Then, the behavioral mistakes affect VΡ​(Tk|j)V^{\varepsilon}(T_{k}|j) via whether the conditional cooperators maintain their cooperation in the following stages as well as via a one shot payoff, FΡ​(C|j)F^{\varepsilon}(C|j) and FΡ​(D|j)F^{\varepsilon}(D|j).

3.2 The stability of the Tnβˆ’1T_{n-1} strategy

We showed in Proposition 1 that the Tnβˆ’1T_{n-1} strategy is neutrally stable (not evolutionarily stable). Now we will show how behavioral mistakes affect the stability of the Tnβˆ’1T_{n-1} strategy. Consider a population that is homogeneously composed of individuals using the Tnβˆ’1T_{n-1} strategy. A player with the Tnβˆ’1T_{n-1} strategy receives FΡ​(C|nβˆ’1)F^{\varepsilon}(C|n-1) from the first round and maintains cooperation in the next round only if there is no defection in the current round. Therefore, the mutual cooperation continues only if there are neither defectors’ defecting nor cooperators’ mistakenly defecting. In this environment, the payoff for the Tnβˆ’1T_{n-1} strategy from the repeated game is

VΡ​(Tnβˆ’1|nβˆ’1)=FΡ​(C|nβˆ’1)+(1βˆ’Ξ΅)n​δ​V​(Tnβˆ’1|nβˆ’1,Ξ΅).\displaystyle V^{\varepsilon}(T_{n-1}|n-1)=F^{\varepsilon}(C|n-1)+(1-\varepsilon)^{n}\delta V(T_{n-1}|n-1,\varepsilon). (5)

Rearranging the above condition gives

VΡ​(Tnβˆ’1|nβˆ’1)=FΡ​(C|nβˆ’1)1βˆ’(1βˆ’Ξ΅)n​δ,\displaystyle V^{\varepsilon}(T_{n-1}|n-1)={F^{\varepsilon}(C|n-1)\over 1-(1-\varepsilon)^{n}\delta}, (6)

where (1βˆ’Ξ΅)n(1-\varepsilon)^{n} is the probability that no conditional cooperators make mistakes, that is, the probability that the periods of mutual cooperation continue without triggering retaliation by any mistakes. Suppose that there appears a mutant with the TnT_{n} strategy in this population. This strategy will immediately trigger the Tnβˆ’1T_{n-1} players’ retaliation. Therefore, the payoff for the TnT_{n} strategy is

VΡ​(Tn|nβˆ’1)=FΡ​(D|nβˆ’1)+Ξ΄β€‹βˆ‘q=0nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)Γ—0.\displaystyle V^{\varepsilon}(T_{n}|n-1)=F^{\varepsilon}(D|n-1)+\delta\sum_{q=0}^{n-1}\psi(n-1,q,\varepsilon)\times 0. (7)

Suppose that there appears a mutant with the TkT_{k} strategy (k<nβˆ’1k<n-1) in the Tnβˆ’1T_{n-1} population. This kk-cooperator plays CC in the first round and keeps cooperating as long as the number of defections that are mistakenly played by the conditional cooperators is less than or equal to nβˆ’kβˆ’1n-k-1. The payoff for the TkT_{k} strategy from the repeated game in a population where all of the other players are Tnβˆ’1T_{n-1} becomes

VΡ​(Tk|nβˆ’1)=FΡ​(C|nβˆ’1,Ξ΅)\displaystyle V^{\varepsilon}(T_{k}|n-1)=F^{\varepsilon}(C|n-1,\varepsilon) +\displaystyle+ (1βˆ’Ξ΅)n​δ​VΡ​(Tk|nβˆ’1)\displaystyle(1-\varepsilon)^{n}\delta V^{\varepsilon}(T_{k}|n-1) (8)
+\displaystyle+ Ξ΄β€‹βˆ‘q=1nβˆ’kβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)​FΡ​(C|0),\displaystyle\delta\sum_{q=1}^{n-k-1}\psi(n-1,q,\varepsilon)F^{\varepsilon}(C|0),

where k<nβˆ’1k<n-1. The last term on the right-hand side expresses the payoff for the TkT_{k} strategy in the next round when Tnβˆ’1T_{n-1} players have already withdrawn their cooperation and Tk<nβˆ’1T_{k<n-1} players continue cooperating. This situation occurs when the number of mistakes in one stage is greater than 1 and less than or equal to nβˆ’kβˆ’1n-k-1, and kk cooperators will also withdraw cooperation thereafter. The rearrangement gives

VΡ​(Tk|nβˆ’1)\displaystyle V^{\varepsilon}(T_{k}|n-1) =\displaystyle= FΡ​(C|nβˆ’1,Ξ΅)+βˆ‘q=1nβˆ’kβˆ’1Οˆβ€‹(n,q,Ξ΅)​FΡ​(C|0)1βˆ’(1βˆ’Ξ΅)n​δ\displaystyle{F^{\varepsilon}(C|n-1,\varepsilon)+\sum_{q=1}^{n-k-1}\psi(n,q,\varepsilon)F^{\varepsilon}(C|0)\over 1-(1-\varepsilon)^{n}\delta}
=\displaystyle= FΡ​(C|nβˆ’1,Ξ΅)+βˆ‘q=1nβˆ’kβˆ’1Οˆβ€‹(n,q,Ξ΅)​δ​(1βˆ’Ξ΅)​(bnβˆ’c)1βˆ’(1βˆ’Ξ΅)n​δ,\displaystyle{F^{\varepsilon}(C|n-1,\varepsilon)+\sum_{q=1}^{n-k-1}\psi(n,q,\varepsilon)\delta(1-\varepsilon)({b\over n}-c)\over 1-(1-\varepsilon)^{n}\delta},

where k<nβˆ’1k<n-1. Here, the behavioral mistakes have two effects on the evolution of the conditional cooperative strategies. First, the mistakes reduce the period of mutual cooperation and have a detrimental effect on the evolution of cooperation. Suppose that a group is entirely composed of Tnβˆ’1T_{n-1} players. Without mistakes, all of the members continue cooperating until the game ends, and the expected length of the period of mutual cooperation is 11βˆ’Ξ΄\frac{1}{1-\delta}. When the players make mistakes with probability Ξ΅\varepsilon, even one mistake will trigger retaliation from the other Tnβˆ’1T_{n-1} players, and the mutual cooperation ends. Therefore, the period of mutual cooperation is reduced to 11βˆ’(1βˆ’Ξ΅)n​δ\frac{1}{1-(1-\varepsilon)^{n}\delta} and the value of V​(Tnβˆ’1|nβˆ’1,Ξ΅)V(T_{n-1}|n-1,\varepsilon) in (6) becomes smaller.

Second, in an error-prone environment, each conditional cooperator reacts differently to the others’ mistakes according to his/her tolerance level. For example, Tnβˆ’1T_{n-1} players immediately begin retaliation by withdrawing cooperation, Tnβˆ’2T_{n-2} players tolerate one mistake, and Tnβˆ’hT_{n-h} players tolerate hβˆ’1h-1 mistakes. Note that the behavioral indistinguishability between the TkT_{k} strategies (where k<nk<n) disappears in the absence of TnT_{n}, and the payoffs for the TkT_{k} strategies are no longer the same, even in a situation where the TnT_{n} strategy does not exist. As mentioned in the previous section, the dynamic instability occurs because the Tnβˆ’1T_{n-1} strategy is only neutrally stable, not evolutionarily stable. In other words, the dynamic problem occurs because both Tnβˆ’1T_{n-1} and Tk<nβˆ’1T_{k<n-1} receive the same payoff in the absence of a defection strategy in an error-free environment. As soon as the possibility of making errors is introduced, the problem of indistinguishability between all of the conditional cooperative strategies in the absence of a defect strategy can be solved because each conditional cooperative strategy reacts differently to the error that the other players make.

The following proposition shows that mistakes can eliminate the indistinguishability between the Tk<nT_{k<n} strategies so that the state where all of the individuals are Tnβˆ’1T_{n-1} players is evolutionarily stable.

Proposition 2.

In an n-person public goods game where conditional cooperators have a TkT_{k} strategy (where k∈{0,β‹―,nβˆ’1}k\in\{0,\cdots,n-1\}), the Tnβˆ’1T_{n-1} strategy is evolutionarily stable when the probability of game repetition is sufficiently close to 1 and the probability of making mistakes is positive but sufficiently small.

Proof.

The condition for the Tnβˆ’1T_{n-1} strategy being evolutionarily stable with respect to some other strategy is

W​(Tnβˆ’1|pnβˆ’1=1)>W​(Tk<nβˆ’1|pnβˆ’1=1)andW​(Tnβˆ’1|pnβˆ’1=1)>W​(Tn|pnβˆ’1=1).\displaystyle\begin{array}[]{ccc}W(T_{n-1}|p_{n-1}=1)&>&W(T_{k<n-1}|p_{n-1}=1)\\ &\textrm{and}&\\ W(T_{n-1}|p_{n-1}=1)&>&W(T_{n}|p_{n-1}=1).\end{array} (13)

In the proof of Proposition 1, we have already shown that the first condition in Eq (13) is not satisfied with a strict inequality when the players do not make any mistakes. That is, when the probability of making mistakes is zero, we have W​(Tnβˆ’1|pnβˆ’1=1)=W​(Tk<nβˆ’1|pnβˆ’1=1)W(T_{n-1}|p_{n-1}=1)=W(T_{k<n-1}|p_{n-1}=1). However, if the probability of making mistakes is positive, then we have

W​(Tnβˆ’1|pnβˆ’1=1)=VΡ​(Tnβˆ’1|nβˆ’1)​m​(nβˆ’1,1)=FΡ​(C|nβˆ’1)1βˆ’(1βˆ’Ξ΅)n​δ.\displaystyle W(T_{n-1}|p_{n-1}=1)=V^{\varepsilon}(T_{n-1}|n-1)m(n-1,1)=\frac{F^{\varepsilon}(C|n-1)}{1-(1-\varepsilon)^{n}\delta}~. (14)

(i) Suppose that a mutant with the Tkβ€²T_{k^{\prime}} (where kβ€²<nβˆ’1k^{\prime}<n-1 ) appears in a population where pnβˆ’1=1p_{n-1}=1. With the possibility of mistakes, the expected payoff to the Tkβ€²T_{k^{\prime}} when pnβˆ’1=1p_{n-1}=1 is W​(Tkβ€²|pnβˆ’1=1)=VΡ​(Tkβ€²|nβˆ’1)​m​(nβˆ’1,1)W(T_{k^{\prime}}|p_{n-1}=1)=V^{\varepsilon}(T_{k^{\prime}}|n-1)m(n-1,1). According to Eq (8), the above equation can be expressed as

W​(Tkβ€²|pnβˆ’1=1)\displaystyle W(T_{k^{\prime}}|p_{n-1}=1) =\displaystyle= VΡ​(Tkβ€²|nβˆ’1)​m​(nβˆ’1,1)\displaystyle V^{\varepsilon}(T_{k^{\prime}}|n-1)m(n-1,1) (15)
=\displaystyle= FΡ​(C|nβˆ’1)+βˆ‘q=1nβˆ’kβ€²βˆ’1Οˆβ€‹(n,q,Ξ΅)​δ​(1βˆ’Ξ΅)​(bnβˆ’c)1βˆ’(1βˆ’Ξ΅)n​δ.\displaystyle\frac{F^{\varepsilon}(C|n-1)+\sum_{q=1}^{n-k^{\prime}-1}\psi(n,q,\varepsilon)\delta(1-\varepsilon)(\frac{b}{n}-c)}{1-(1-\varepsilon)^{n}\delta}.

Because 0<bn<c0<\frac{b}{n}<c by the assumption, comparing Eq (14) and Eq (15) gives

W​(Tnβˆ’1|pnβˆ’1=1)>W​(Tkβ€²|pnβˆ’1=1)for any ​kβ€²<nβˆ’1,\displaystyle W(T_{n-1}|p_{n-1}=1)>W(T_{k^{\prime}}|p_{n-1}=1)\qquad\textrm{for any }k^{\prime}<n-1, (16)

when the probability of making a mistake is positive.

(ii) Suppose that a mutant having the TnT_{n} strategy appears in a population where pnβˆ’1=1p_{n-1}=1. When Ξ΅>0\varepsilon>0, the payoff to TnT_{n} from the repeated public goods game in this situation is

W​(Tn|pnβˆ’1=1)\displaystyle W(T_{n}|p_{n-1}=1) =\displaystyle= VΡ​(Tn|nβˆ’1)​m​(nβˆ’1,1)\displaystyle V^{\varepsilon}(T_{n}|n-1)m(n-1,1) (17)
=\displaystyle= FΡ​(D|nβˆ’1).\displaystyle F^{\varepsilon}(D|n-1).

Therefore, for W​(Tnβˆ’1|pnβˆ’1=1)>W​(Tn|pnβˆ’1=1)W(T_{n-1}|p_{n-1}=1)>W(T_{n}|p_{n-1}=1), we should have

VΡ​(Tnβˆ’1|nβˆ’1)=FΡ​(C|nβˆ’1)1βˆ’(1βˆ’Ξ΅)n​δ>FΡ​(D|nβˆ’1)=VΡ​(Tn|nβˆ’1).\displaystyle V^{\varepsilon}(T_{n-1}|n-1)={F^{\varepsilon}(C|n-1)\over 1-(1-\varepsilon)^{n}\delta}>F^{\varepsilon}(D|n-1)=V^{\varepsilon}(T_{n}|n-1). (18)

The above can be satisfied if

(1βˆ’Ξ΅)n​δ>1βˆ’FΡ​(C|nβˆ’1,Ξ΅)FΡ​(D|nβˆ’1,Ξ΅).\displaystyle(1-\varepsilon)^{n}\delta>1-{F^{\varepsilon}(C|n-1,\varepsilon)\over F^{\varepsilon}(D|n-1,\varepsilon)}. (19)

We can easily show that FΡ​(C|nβˆ’1,Ξ΅)FΡ​(D|nβˆ’1,Ξ΅)=nnβˆ’1​(1βˆ’cb)\frac{F^{\varepsilon}(C|n-1,\varepsilon)}{F^{\varepsilon}(D|n-1,\varepsilon)}=\frac{n}{n-1}(1-\frac{c}{b}) and that this is always less than 1 by the assumption bnβˆ’c<0\frac{b}{n}-c<0. That is, the right-hand side of the condition is always less than 1. Therefore, with a sufficiently large Ξ΄\delta and sufficiently a small Ξ΅\varepsilon, the above condition is satisfied. ∎

As long as there is a probability of making mistakes, the indistinguishability between the hardest cooperators and the other softer cooperators disappears, which makes the Tnβˆ’1T_{n-1} equilibrium state evolutionarily stable under certain parameter values. The TkT_{k} strategies (k<nβˆ’1k<n-1) are now behaviorally distinguishable from the Tnβˆ’1T_{n-1} strategy because the conditional cooperative strategies are, depending on the level of kk, different in their responding to the mistakes, even though there are no defectors in the population. In other words, in an environment where the players make mistakes, the TkT_{k} strategies cannot invade a population that is homogeneously composed of individuals using the Tnβˆ’1T_{n-1} strategy.

3.3 The stability of the TkT_{k} strategy (k<nβˆ’1k<n-1).

Mistakes also affect the evolutionary stability of the softer strategies. Note that the TkT_{k} strategies when k<nβˆ’1k<n-1 are not stable at all in an error-free environment because these strategies allow the universal defection strategy (i.e., the TnT_{n} strategy) to enjoy benefits from free riding on the cooperation of these strategies (see Corollary 1).

To see the effect of mistakes on the stability of the TkT_{k} strategy, consider a population that entirely composed of players who have a TkT_{k} strategy with the same hardness level kk. A player with the TkT_{k} strategy receives FΡ​(C|nβˆ’1)F^{\varepsilon}(C|n-1) from the first round and maintains cooperation in the next round as long as the number of defections is less than nβˆ’kβˆ’1n-k-1. Therefore, the mutual cooperation continues when

  • β€’

    the number of mistakes made by the nβˆ’1n-1 other conditional cooperators is less than or equal to nβˆ’kβˆ’2n-k-2 (in this case, the maintenance of mutual cooperation does not depend on whether the focal individual makes a mistake or not), or

  • β€’

    the number of mistakes made by the nβˆ’1n-1 other conditional cooperators is exactly equal to nβˆ’kβˆ’1n-k-1, and the focal player does not make a mistake.

The first case occurs with probability βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon) and the second case with probability Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)Γ—(1βˆ’Ξ΅)\psi(n-1,n-k-1,\varepsilon)\times(1-\varepsilon). By the same token, the mutual cooperation breaks when

  • β€’

    the number of mistakes made by the nβˆ’1n-1 other conditional cooperators is exactly equal to nβˆ’kβˆ’1n-k-1, and the focal player makes a mistake, or

  • β€’

    the number of mistakes made by the nβˆ’1n-1 other conditional cooperators is greater than nβˆ’kβˆ’1n-k-1.

Each case occurs with probability Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)Γ—Ξ΅\psi(n-1,n-k-1,\varepsilon)\times\varepsilon and βˆ‘q=nβˆ’knβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)\sum_{q=n-k}^{n-1}\psi(n-1,q,\varepsilon), respectively. Summing up all of the possible cases, the expected payoff for the TkT_{k} strategy when the population is entirely composed of individuals following the TkT_{k} strategy (k<nβˆ’1k<n-1) is written in the following way.

VΡ​(Tk|nβˆ’1)=FΡ​(C|nβˆ’1)+δ​{βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)​VΡ​(Tk|nβˆ’1)+Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)​[Ρ×0+(1βˆ’Ξ΅)Γ—VΡ​(Tk|nβˆ’1)]+βˆ‘q=nβˆ’knβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)Γ—0,\displaystyle\begin{aligned} V^{\varepsilon}(T_{k}|n-1)=&F^{\varepsilon}(C|n-1)+\\ &\delta\begin{cases}\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)V^{\varepsilon}(T_{k}|n-1)+~\\ \psi(n-1,n-k-1,\varepsilon)[\varepsilon\times 0+(1-\varepsilon)\times V^{\varepsilon}(T_{k}|n-1)]+~\\ \sum_{q=n-k}^{n-1}\psi(n-1,q,\varepsilon)\times 0,\end{cases}\end{aligned} (20)

if k<nβˆ’1k<n-1.

Suppose that there appears one defector (i.e., a player with strategy TnT_{n}) in the homogenous population of a TkT_{k} strategy where every player has the same tolerance level k<nβˆ’1k<n-1. When the game is repeated with probability Ξ΄\delta , the payoff to the TnT_{n} player from the repeated game is calculated by

VΡ​(Tn|nβˆ’1)\displaystyle V^{\varepsilon}(T_{n}|n-1) =\displaystyle= FΡ​(D|nβˆ’1)\displaystyle F^{\varepsilon}(D|n-1)
+Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)​VΡ​(Tn|nβˆ’1)⏟(i)+Ξ΄β€‹βˆ‘q=nβˆ’kβˆ’1nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)Γ—0⏟(i​i).\displaystyle+\underbrace{\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)V^{\varepsilon}(T_{n}|n-1)}_{(i)}+\underbrace{\delta\sum_{q=n-k-1}^{n-1}\psi(n-1,q,\varepsilon)\times 0}_{(ii)}.

As long as the number of defections, either due to mistakes by the TkT_{k} players or the defection of the TnT_{n} player, is less than nβˆ’kβˆ’1n-k-1, TkT_{k} players will tolerate defections and maintain cooperation. In this case, the TnT_{n} player keeps enjoying a free rider’s advantage from the repeated interaction (see (i) in Eq (3.3)). When the number of defections exceeds nβˆ’kβˆ’1n-k-1, all of the TkT_{k} players trigger retaliation, which gives every member a 0 payoff in the following rounds (see (ii) in Eq (3.3)).

It will be shown that it is crucial to have VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0 to prove the evolutionarily stability of the TkT_{k} strategy. Rearranging Eq (20) and Eq (3.3) gives

VΡ​(Tk|nβˆ’1)=FΡ​(C|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)\displaystyle V^{\varepsilon}(T_{k}|n-1)=\frac{F^{\varepsilon}(C|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)-\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)} (22)

and

VΡ​(Tn|nβˆ’1)=FΡ​(D|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅),\displaystyle V^{\varepsilon}(T_{n}|n-1)=\frac{F^{\varepsilon}(D|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}, (23)

respectively. The condition that the payoff for the TkT_{k} strategy is greater than the payoff for the TnT_{n} strategy is

VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)\displaystyle V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)
=F​(C|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)βˆ’F​(D|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)\displaystyle=\frac{F(C|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)-\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}-\frac{F(D|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}
=F​(D|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)Γ—[Δ​(Ξ΅;k)βˆ’(1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1))],\displaystyle=\frac{F(D|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)-\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}\times\left[\Delta(\varepsilon;k)-\left(1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}\right)\right],

where Δ​(Ξ΅;k)=δ​(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)\Delta(\varepsilon;k)=\frac{\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}. Note that the sign of VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1) depends on the sign of Δ​(Ξ΅;k)βˆ’(1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1))\Delta(\varepsilon;k)-(1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}). Because it holds that 0<1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1)<10<1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}<1, and FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1)\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)} is fixed by the payoff structure of the public goods game, it is crucial to have a sufficiently large Δ​(Ξ΅;k)\Delta(\varepsilon;k).

A TkT_{k} strategy (where k∈{1,β‹―,nβˆ’2}k\in\{1,\cdots,n-2\}) is evolutionarily stable if

W​(Tk|pk=1)>W​(Tkβ€²|pk=1)βˆ€kβ€²β‰ k.\displaystyle W(T_{k}|p_{k}=1)>W(T_{k^{\prime}}|p_{k}=1)\quad\forall k^{\prime}\neq k. (25)

In the following proposition, we will show that all of the conditional cooperative strategies are evolutionarily stable when Ξ΅\varepsilon is positive and within a proper open interval and when Ξ΄\delta is sufficiently large. We will show that W​(Tk|pk=1)>W​(Tkβ€²|pk=1)W(T_{k}|p_{k}=1)>W(T_{k^{\prime}}|p_{k}=1) for k∈{1,…,nβˆ’2}k\in\{1,\ldots,n-2\} in all of the following three cases where (1) a mutant is a conditional cooperator who has a softer strategy (i.e., kβ€²<kk^{\prime}<k), (2) a mutant is a conditional cooperator who has a harder strategy (i.e., k<kβ€²<nk<k^{\prime}<n), or (3) a mutant is a defector (i.e., kβ€²=nk^{\prime}=n).

Proposition 3.

In an n-person public goods game where the conditional cooperators have a TkT_{k} strategy (k∈{1,2,β‹―,nβˆ’1})(k\in\{1,2,\cdots,n-1\}), the TkT_{k} strategies where 0<k<nβˆ’10<k<n-1 are evolutionarily stable when the probability of game repetition is sufficiently close to 1 and the probability of making mistakes is positive and sufficiently small.

Proof.

See Appendix A. ∎


TkT_{k} strategies with (0<k<nβˆ’10<k<n-1) are not stable at all in an error-free environment but become evolutionarily stable when mistakes are introduced, i.e., they do not allow the invasion of other strategies into their population because the mistakes made by the conditional cooperators produce unintended defections among themselves and leave little room for the defector’s free riding. The proof of the above proposition consists of three steps. First, when conditional cooperators make mistakes the expected payoff for the TkT_{k} strategy in the repeated game is greater than the expected payoff to the mutant TnT_{n} strategy (see Appendix A, Lemma 1). Behavioral mistakes, if they occur with a sufficiently low probability, no longer allow the defectors to invade the population that homogeneously consists of individuals using the TkT_{k} strategy (where 0<k<nβˆ’10<k<n-1). Behavioral mistakes produce unintended defections among the cooperators (i.e., even without a defector’s invasion), so that softer conditional cooperators are less likely to tolerate a defector’s invasion. Second, if the expected payoff for the TkT_{k} strategy in the repeated game is greater than that for the mutant TnT_{n} strategy, then it is also true that the expected payoff for the TkT_{k} strategy in the repeated game is greater than the mutant Tkβ€²T_{k^{\prime}} strategy’s payoff if kβ€²k^{\prime} is greater than kk, i.e., if the mutants are harder conditional cooperators (see Appendix A, Lemma 2). Third, behavioral mistakes also make the expected payoff to the TkT_{k} strategy in the repeated game when one of these strategies invades TkT_{k} population greater than the other conditional cooperative strategies that havekβ€²k^{\prime} lower than kk (see Appendix A, Lemma 3). Error makes those strategies distinguishable and makes the payoff for the TkT_{k} strategy higher (same logic as proof for Proposition 2).

In an error-free environment, no conditional cooperative strategies are evolutionarily stable. The Tnβˆ’1T_{n-1} strategy is neutrally stable and is subject to the drift ultimately destroying the cooperative equilibrium, and none of the Tk<nβˆ’1T_{k<n-1} strategies are stable at all. Proposition 2 and Proposition 3 say that behavioral errors, if they occur with sufficiently low probability, can make all conditional cooperative strategies evolutionarily stable.

3.4 Error and selection among evolutionarily stable strategies

In the above section, we showed that conditional cooperative strategies, TkT_{k} for k∈{1,β‹―,nβˆ’1}k\in\{1,\cdots,n-1\}, are evolutionarily stable when Ξ΅\varepsilon is positive but sufficiently low. Furthermore, each TkT_{k} strategy has its own critical value of Ξ΅k\varepsilon_{k} such that the strategy is evolutionary stable in the range of Ρ∈(0,Ξ΅k)\varepsilon\in(0,\varepsilon_{k}). If the critical value of each TkT_{k} strategy differs according to the hardness level kk, the magnitude of the probability that a behavioral error occurs could serve as a criterion for equilibrium selection.

In this section, we will show that the range of Ξ΅\varepsilon supporting the evolutionary stability of the TkT_{k} strategy varies according to the tolerance of the conditional cooperative strategies. Let Ξ΅k\varepsilon_{k} be the supremum of Ξ΅\varepsilon that supports the evolutionary stability of a TkT_{k} strategy (where k∈{1,β‹―,nβˆ’1}k\in\{1,\cdots,n-1\}). We will show that Ξ΅k\varepsilon_{k} increases as kk decreases, that is, 0<Ξ΅nβˆ’1<Ξ΅nβˆ’2<β‹―<Ξ΅10<\varepsilon_{n-1}<\varepsilon_{n-2}<\cdots<\varepsilon_{1}. For example, if the probability of making an error is sufficiently low so that 0<Ξ΅<Ξ΅nβˆ’10<\varepsilon<\varepsilon_{n-1}, then all of the conditional cooperative strategies are evolutionarily stable. If the error rate is in the range of Ξ΅nβˆ’1≀Ρ<Ξ΅nβˆ’2\varepsilon_{n-1}\leq\varepsilon<\varepsilon_{n-2}, then all of the conditional cooperative strategies except for Tnβˆ’1T_{n-1} are evolutionarily stable, and so on. Lastly, if the error rate is higher than or equal to Ξ΅1\varepsilon_{1}, no conditional cooperative strategies are evolutionarily stable, in which case only the universal defection strategy, TnT_{n}, is evolutionarily stable. In other words, all conditional cooperative strategies are evolutionarily stable when 0<Ξ΅<Ξ΅nβˆ’10<\varepsilon<\varepsilon_{n-1} and fewer of the conditional cooperative strategies remain evolutionarily stable as Ξ΅\varepsilon increases.

Proposition 4.

Let Ξ΅k\varepsilon_{k} be the supremum of Ξ΅\varepsilon that supports the evolutionary stability of the TkT_{k} strategy (where k∈{1,β‹―,nβˆ’1}k\in\{1,\cdots,n-1\}). Then, Ξ΅k\varepsilon_{k} increases as kk decreases, that is, 0<Ξ΅nβˆ’1<Ξ΅nβˆ’2<β‹―<Ξ΅10<\varepsilon_{n-1}<\varepsilon_{n-2}<\cdots<\varepsilon_{1}.

Proof.

See Appendix B. ∎

In other words, as Ξ΅\varepsilon increases, the TkT_{k} strategies with a lower kk (i.e., with a higher tolerance level) remain evolutionarily stable. To understand the role of behavioral mistakes in the equilibrium selection in the above proposition, we need to examine the two effects that the mistakes produce. First, the mistakes lower the probability that the mutually cooperative phase continues because they trigger retaliation towards cooperators’ unintended defections. Secondly, they also reduce the possibility that the defectors enjoy benefits from free riding on tolerant cooperators. If the incumbent conditional cooperators have a lower kk, the first effect is less detrimental because the cooperative equilibrium based on the lower kk is less vulnerable to a breakdown when a mistake occurs with higher probability.

4 The General Case Where Ξ΄\delta is Sufficiently High But Not at the Limit of 11

One should note that only the supremum of Ξ΅\varepsilon matters in Proposition 4. We proved the proposition at the Ξ΄=1\delta=1 limit, in which case the infimum becomes meaningless as long as the rate of mistakes is positive. At the Ξ΄=1\delta=1 limit, error will at some point terminate the game and enable the TkT_{k} strategy with a low kk to block the defectors from endlessly accumulating a free rider’s benefit. That is, at the Ξ΄=1\delta=1 limit, error appears to work as long as Ξ΅\varepsilon is positive (See Figure 4 (a)).

Refer to caption

(a) Ξ΄=1\delta=1

Refer to caption
(b) Ξ΄=0.9\delta=0.9

Refer to caption
(c) Ξ΄=0.8\delta=0.8

FigureΒ 1.Β : Δ​(Ξ΅;k)\Delta(\varepsilon;k) with Ξ΄\delta. (n=10n=10)

However, when Ξ΄\delta is large but not at the limit of 1, the TkT_{k} strategy (0<k<nβˆ’10<k<n-1) needs a sufficiently high rate of Ξ΅\varepsilon to block the defectors’ free riding. The role of error in supporting the evolutionary stability of cooperative strategies depends on whether it produces enough defections among the cooperators before the defectors’ invasion. Note that the number of mistakes to prevent defectors from invading depends on the level of tolerance. As kk becomes lower, more unintended defections among the cooperators are needed to terminate the game repetition (i.e., to block a defector from gaining a free riding benefit) in case one defector appears in a group. In other words, there should be an infimum of Ξ΅\varepsilon to make the TkT_{k} strategy evolutionarily stable. In this section, we show that the conclusions derived in the former section are still valid with some modification, especially for the infimum in the general case where Ξ΄\delta is sufficiently high but not at the limit of 1.

Here, we analyze a more general case where δ\delta is less than 1. At the δ=1\delta=1 limit, we showed that each TkT_{k} strategy, where k<nk<n, has the supremum, Ρk\varepsilon_{k}, such that the TkT_{k} strategy is evolutionary stable for Ρ∈(0,Ρk)\varepsilon\in(0,\varepsilon_{k}) and that Ρk\varepsilon_{k} is increasing as kk decreases.

In the following propositions, we will now show that each conditional cooperative TkT_{k} strategy has its own band of error rate (Ρ¯k,Ρ¯k(\underline{\varepsilon}_{k},\overline{\varepsilon}_{k}) that makes the TkT_{k} strategy evolutionarily stable. In other words, there appears an infimum of Ρ\varepsilon that supports the evolutionary stability of each TkT_{k} strategy and the infimum moves toward zero as δ\delta approaches 1. Now we can provide a generalized versions of Proposition 2, Proposition 3 and Proposition 4.

Proposition 5.

In an n-person public goods game where the conditional cooperators have a TkT_{k} strategy (k∈{1,2,β‹―,nβˆ’1})(k\in\{1,2,\cdots,n-1\}), all of the TkT_{k} strategies are evolutionarily stable when the probability of game repetition is sufficiently high and the probability of making mistakes is in the range of (Ρ¯k,Ρ¯k\underline{\varepsilon}_{k},\overline{\varepsilon}_{k}).

Proof.

See Appendix C. ∎

Proposition 6.

In an n-person public goods game where conditional cooperators have a TkT_{k} strategy (k∈{1,2,β‹―,nβˆ’1})(k\in\{1,2,\cdots,n-1\}), both Ρ¯k\underline{\varepsilon}_{k} and Ρ¯k\overline{\varepsilon}_{k} increase as kk decreases and Ρ¯nβˆ’1=0\underline{\varepsilon}_{n-1}=0, that is, 0=Ρ¯nβˆ’1<Ρ¯nβˆ’2<β‹―<Ρ¯10=\underline{\varepsilon}_{n-1}<\underline{\varepsilon}_{n-2}<\cdots<\underline{\varepsilon}_{1} and 0<Ρ¯nβˆ’1<Ρ¯nβˆ’2<β‹―<Ρ¯1<10<\overline{\varepsilon}_{n-1}<\overline{\varepsilon}_{n-2}<\cdots<\overline{\varepsilon}_{1}<1, when the probability of game repetition is sufficiently high.

Proof.

See Appendix D. ∎

Figure 4 makes our argument clear. All of the above propositions show that the stability condition critically depends on the sign of Δ​(Ξ΅;k)βˆ’(1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1))\Delta(\varepsilon;k)-(1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}). This means that Δ​(Ξ΅;k)\Delta(\varepsilon;k) should be sufficiently large to offset 1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1)1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}. The three panels in Figure 4 show Δ​(Ξ΅;k)\Delta(\varepsilon;k) for three different values of Ξ΄\delta. One can easily check the following. First, the Δ​(Ξ΅;k)\Delta(\varepsilon;k) value becomes larger as Ξ΄\delta increases, and our previous propositions hold if Ξ΄\delta approaches 1. Second, when Ξ΄\delta is less than one, the probability of mistakes should be sufficiently large (i.e., the infimum is important as well as its supremum to support cooperation) and the infimum becomes zero when Ξ΄\delta approaches 1.

5 Conclusion

This study examined the effect of behavioral mistakes on the dynamic stability of the cooperative equilibrium in a repeated public goods game. This study shows that while a behavioral mistake has detrimental effects on cooperation because it reduces the length of the period of mutual cooperation by triggering the conditional cooperators’ retaliation, a behavioral mistake also has a positive effect by making the conditional cooperative strategies evolutionarily stable. This paper shows that the behavioral mistakes stabilize the cooperative equilibrium based on the hardest cooperative strategy by eliminating the behavioral indistinguishability between the conditional cooperative strategies at the cooperative equilibrium. This paper shows that the mistakes also stabilize the cooperative equilibrium based on a softer cooperative strategy by producing unintended defection among the cooperators and making the softer conditional cooperators less tolerant of a defector’s invasion. Finally, the paper shows that the error rate, Ξ΅\varepsilon, could serve as an equilibrium selection criterion because each equilibrium that is based on a different level of tolerance is supported by different ranges for the error rate.

Appendix A The Proof of Proposition 3

Consider a population that is entirely composed of conditional cooperators who have the same hardness (where k<nβˆ’1k<n-1). The payoff to the TkT_{k} strategy when pk=1p_{k}=1 is

W​(Tk|pk=1)\displaystyle W(T_{k}|p_{k}=1) =\displaystyle= VΡ​(Tk|nβˆ’1)​m​(nβˆ’1,pk=1)\displaystyle V^{\varepsilon}(T_{k}|n-1)m(n-1,p_{k}=1)
=\displaystyle= FΡ​(C|nβˆ’1)\displaystyle F^{\varepsilon}(C|n-1)
+δ​[βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)+(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)]​VΡ​(Tk|nβˆ’1),\displaystyle+\delta\left[\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)+(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)\right]V^{\varepsilon}(T_{k}|n-1),

which gives

W​(Tk|pk=1)\displaystyle W(T_{k}|p_{k}=1) =\displaystyle= VΡ​(Tk|nβˆ’1)\displaystyle V^{\varepsilon}(T_{k}|n-1)
=\displaystyle= FΡ​(C|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅),\displaystyle\frac{F^{\varepsilon}(C|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)-\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)},

because m​(nβˆ’1,pk=1)=1m(n-1,p_{k}=1)=1. The payoff to the mutant TnT_{n} strategy is

W​(Tn|pk=1)\displaystyle W(T_{n}|p_{k}=1) =\displaystyle= VΡ​(Tn|nβˆ’1)​m​(nβˆ’1,pk=1)\displaystyle V^{\varepsilon}(T_{n}|n-1)m(n-1,p_{k}=1) (28)
=\displaystyle= FΡ​(D|nβˆ’1)+Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)​VΡ​(Tn|nβˆ’1),\displaystyle F^{\varepsilon}(D|n-1)+\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)V^{\varepsilon}(T_{n}|n-1),

which gives

W​(Tn|pk=1)=VΡ​(Tn|nβˆ’1)=FΡ​(D|nβˆ’1)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅).\displaystyle W(T_{n}|p_{k}=1)=V^{\varepsilon}(T_{n}|n-1)=\frac{F^{\varepsilon}(D|n-1)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}. (29)

Now, the following three lemmas show that any TkT_{k} (for 0<k<nβˆ’1)0<k<n-1) is evolutionarily stable for a sufficiently large Ξ΄\delta close to 11 and a sufficiently small Ρ∈(0,1)\varepsilon\in(0,1). The proof is presented at the end.

Lemma 1.

For a sufficiently large Ξ΄\delta close to 11, there exists a unique Ξ΅k\varepsilon_{k} such that VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0 for Ρ∈(0,Ξ΅k)\varepsilon\in(0,\varepsilon_{k}).

Proof.

The sign of VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1) is determined by Δ​(Ξ΅;k)βˆ’(1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1))\Delta(\varepsilon;k)-(1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}) where Δ​(Ξ΅;k)=δ​(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)\Delta(\varepsilon;k)=\frac{\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}. When Ξ΄\delta is close to 1, we have

limΞ΄β†’1Δ​(Ξ΅;k)=\displaystyle\lim_{\delta\rightarrow 1}\Delta(\varepsilon;k)= (1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)1βˆ’βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)\displaystyle\frac{(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}{1-\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}
=\displaystyle= (1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)βˆ‘q=nβˆ’kβˆ’1nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)\displaystyle\frac{(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}{\sum_{q=n-k-1}^{n-1}\psi(n-1,q,\varepsilon)}
=\displaystyle= (1βˆ’Ξ΅)​(nβˆ’1nβˆ’kβˆ’1)βˆ‘q=0k(nβˆ’1nβˆ’kβˆ’1+q)​(Ξ΅1βˆ’Ξ΅)q\displaystyle\frac{(1-\varepsilon)\binom{n-1}{n-k-1}}{\sum_{q=0}^{k}\binom{n-1}{n-k-1+q}\left(\frac{\varepsilon}{1-\varepsilon}\right)^{q}} (30)

and

limΞ΅β†’0limΞ΄β†’1Δ​(Ξ΅;k)=1\displaystyle\lim_{\varepsilon\rightarrow 0}\lim_{\delta\rightarrow 1}\Delta(\varepsilon;k)=1
limΞ΅β†’1limΞ΄β†’1Δ​(Ξ΅;k)=0.\displaystyle\lim_{\varepsilon\rightarrow 1}\lim_{\delta\rightarrow 1}\Delta(\varepsilon;k)=0.

Because 0<1βˆ’FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1)<10<1-\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}<1,

{VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0ifΒ Ξ΅β†’0VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)<0ifΒ Ξ΅β†’1.\displaystyle\begin{cases}V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0&\text{if $\varepsilon\to 0$}\\ V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)<0&\text{if $\varepsilon\to 1$}.\end{cases}

Because VΡ​(β‹…)V^{\varepsilon}(\cdot) is continuous for Ρ∈(0,1)\varepsilon\in(0,1), then there exists at least one Ξ΅\varepsilon that makes VΡ​(Tk|nβˆ’1)=VΡ​(Tn|nβˆ’1)V^{\varepsilon}(T_{k}|n-1)=V^{\varepsilon}(T_{n}|n-1). Now, to prove the uniqueness of this Ξ΅\varepsilon, Ξ΅k\varepsilon_{k}, we take first derivative of limΞ΄β†’1Δ​(Ξ΅,k)\lim_{\delta\rightarrow 1}\Delta(\varepsilon,k):

d​limΞ΄β†’1Ξ”d​Ρ=\displaystyle\frac{\mathrm{d}\lim_{\delta\rightarrow 1}\Delta}{\mathrm{d}\varepsilon}=
βˆ’(nβˆ’1nβˆ’kβˆ’1)β€‹βˆ‘q=0k(nβˆ’1nβˆ’kβˆ’1+q)​(Ξ΅1βˆ’Ξ΅)qβˆ’(1βˆ’Ξ΅)​(nβˆ’1nβˆ’kβˆ’1)β€‹βˆ‘q=0kq​(nβˆ’1nβˆ’kβˆ’1+q)​(1(1βˆ’Ξ΅)2)qβˆ’1(βˆ‘q=0k(nβˆ’1nβˆ’kβˆ’1+q)​(Ξ΅1βˆ’Ξ΅)q)2\displaystyle\frac{-\binom{n-1}{n-k-1}\sum_{q=0}^{k}\binom{n-1}{n-k-1+q}\left(\frac{\varepsilon}{1-\varepsilon}\right)^{q}-(1-\varepsilon)\binom{n-1}{n-k-1}\sum_{q=0}^{k}q\binom{n-1}{n-k-1+q}\left(\frac{1}{(1-\varepsilon)^{2}}\right)^{q-1}}{\left(\sum_{q=0}^{k}\binom{n-1}{n-k-1+q}\left(\frac{\varepsilon}{1-\varepsilon}\right)^{q}\right)^{2}}

which is always negative for Ρ∈(0,1)\varepsilon\in(0,1). ∎

Lemma 2.

Suppose that a mutant Tkβ€²T_{k^{\prime}} (for kβ€²βˆˆ{k+1,…,nβˆ’1}k^{\prime}\in\{k+1,\ldots,n-1\} ) appears in a homogeneous population of TkT_{k}. VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tkβ€²|nβˆ’1)V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{{k^{\prime}}}|n-1) has the same sign as VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1).

Proof.

The payoff for strategy TaT_{a} from the repeated game is given by

VΡ​(Tkβ€²|nβˆ’1)=\displaystyle V^{\varepsilon}(T_{k^{\prime}}|n-1)= FΡ​(C|nβˆ’1)+δ​{βˆ‘q=0nβˆ’kβ€²βˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)β‹…VΡ​(Tkβ€²|nβˆ’1)+βˆ‘q=nβˆ’kβ€²nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)β‹…VΡ​(Tn|nβˆ’1)+βˆ‘q=nβˆ’kβˆ’1nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)β‹…0.\displaystyle F^{\varepsilon}(C|n-1)+\delta\begin{cases}\sum_{q=0}^{n-k^{\prime}-1}\psi(n-1,q,\varepsilon)\cdot V^{\varepsilon}(T_{k^{\prime}}|n-1)+~\\ \sum_{q=n-k^{\prime}}^{n-k-2}\psi(n-1,q,\varepsilon)\cdot V^{\varepsilon}(T_{n}|n-1)+~\\ \sum_{q=n-k-1}^{n-1}\psi(n-1,q,\varepsilon)\cdot 0.\end{cases}

To check the stability, it is noted that

V​(Tk|nβˆ’1)βˆ’V​(Tkβ€²|nβˆ’1)=\displaystyle V(T_{k}|n-1)-V(T_{k^{\prime}}|n-1)=
(Ξ΄β€‹βˆ‘q=nβˆ’kβ€²nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)+δ​(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβ€²βˆ’1,Ξ΅))​F​(D|nβˆ’1)(1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅))​(1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβ€²βˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅))Γ—\displaystyle\frac{(\delta\sum_{q=n-k^{\prime}}^{n-k-2}\psi(n-1,q,\varepsilon)+\delta(1-\varepsilon)\psi(n-1,n-k^{\prime}-1,\varepsilon))F(D|n-1)}{(1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)-\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon))(1-\delta\sum_{q=0}^{n-k^{\prime}-1}\psi(n-1,q,\varepsilon))}\times
[FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1)βˆ’Ξ΄β€‹βˆ‘q=nβˆ’kβ€²nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)​{1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)}(1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅))​{Ξ΄β€‹βˆ‘q=nβˆ’kβ€²nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)+δ​(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)}⏟(βˆ—βˆ—)],\displaystyle\left[\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}-\underbrace{\frac{\delta\sum_{q=n-k^{\prime}}^{n-k-2}\psi(n-1,q,\varepsilon)\{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)-\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)\}}{(1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon))\{\delta\sum_{q=n-k^{\prime}}^{n-k-2}\psi(n-1,q,\varepsilon)+\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)\}}}_{(**)}\right],

and

(βˆ—βˆ—)≀[1βˆ’Ξ΄β€‹(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)].\displaystyle(**)\leq\left[1-\dfrac{\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}\right].

The RHS of the last inequality is the exactly the same condition for VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0, which completes the proof. ∎

Lemma 3.

When a mutant Tkβ€²T_{k^{\prime}} (for kβ€²<kk^{\prime}<k) appears in the homogeneous population of TkT_{k}, VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tkβ€²|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{k^{\prime}}|n-1)>0.

Proof.

It is easy to see that the TkT_{k} is strictly dominant against a single mutant Tkβ€²T_{k^{\prime}} for all parameters because bnβˆ’c<0\frac{b}{n}-c<0 (See also the proof of Proposition 2). ∎

The proof for Proposition 3 is as follows:

Proof.

According to Lemma 1, there exists an Ξ΅k\varepsilon_{k} such that VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0 if Ξ΅<Ξ΅k\varepsilon<\varepsilon_{k} and VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)≀0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)\leq 0 otherwise. According to Lemma 2, VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tkβ€²|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{k^{\prime}}|n-1)>0 for k<kβ€²<nk<k^{\prime}<n if Ξ΅\varepsilon satisfies VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)β‰₯0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)\geq 0. Lastly, according to Lemma 3, VΡ​(Tk|nβˆ’1)V^{\varepsilon}(T_{k}|n-1) is always greater than VΡ​(Tkβ€²|nβˆ’1)V^{\varepsilon}(T_{k^{\prime}}|n-1) if kβ€²<kk^{\prime}<k. These three lemmas lead to the conclusion that TkT_{k} is evolutionarily stable if Ρ∈(0,Ξ΅k)\varepsilon\in(0,\varepsilon_{k}). ∎

Appendix B The Proof of Proposition 4

Proof.

According to Lemma 1, because FΡ​(C|nβˆ’1)FΡ​(D|nβˆ’1)βˆ’1\frac{F^{\varepsilon}(C|n-1)}{F^{\varepsilon}(D|n-1)}-1 is fixed, Δ​(Ξ΅;k)\Delta(\varepsilon;k) determines VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1). Thus, when Δ​(Ξ΅;k)>Δ​(Ξ΅;k+1)\Delta(\varepsilon;k)>\Delta(\varepsilon;k+1) holds for Ρ∈(0,1)\varepsilon\in(0,1), the proof is completed.

Δ​(Ξ΅;k+1)βˆ’\displaystyle\Delta(\varepsilon;k+1)- Δ​(Ξ΅;k)=\displaystyle\Delta(\varepsilon;k)=
(1βˆ’Ξ΅)​[Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’2,Ξ΅)1βˆ’βˆ‘q=0nβˆ’kβˆ’3Οˆβ€‹(nβˆ’1,q,Ξ΅)βˆ’Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)1βˆ’βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)]⏟(βˆ—βˆ—βˆ—)\displaystyle(1-\varepsilon)\underbrace{\left[\frac{\psi(n-1,n-k-2,\varepsilon)}{1-\sum_{q=0}^{n-k-3}\psi(n-1,q,\varepsilon)}-\frac{\psi(n-1,n-k-1,\varepsilon)}{1-\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}\right]}_{(***)}

(βˆ—βˆ—βˆ—)(***) can be rearranged to

11+(nβˆ’1nβˆ’kβˆ’1)(nβˆ’1nβˆ’kβˆ’2)+β‹―+(nβˆ’1nβˆ’1)(nβˆ’1nβˆ’kβˆ’2)βˆ’11+(nβˆ’1nβˆ’k)(nβˆ’1nβˆ’kβˆ’1)+β‹―+(nβˆ’1nβˆ’1)(nβˆ’1nβˆ’kβˆ’1).\displaystyle\frac{1}{1+\frac{\binom{n-1}{n-k-1}}{\binom{n-1}{n-k-2}}+\cdots+\frac{\binom{n-1}{n-1}}{\binom{n-1}{n-k-2}}}-\frac{1}{1+\frac{\binom{n-1}{n-k}}{\binom{n-1}{n-k-1}}+\cdots+\frac{\binom{n-1}{n-1}}{\binom{n-1}{n-k-1}}}.

It is easy to check that the denominator of the first part is always larger than that of the second, which confirms that Δ​(Ξ΅;k)>Δ​(Ξ΅;k+1)\Delta(\varepsilon;k)>\Delta(\varepsilon;k+1). In other words, the TnT_{n} strategy can always invade a population that is entirely composed of individuals using Tk+1T_{k+1} strategy if Ξ΅=Ξ΅k\varepsilon=\varepsilon_{k}. Following the previous lemmas, there should be an Ξ΅k+1\varepsilon_{k+1} such that the Tk+1T_{k+1} strategy is evolutionarily stable for Ρ∈(0,Ξ΅k+1)\varepsilon\in(0,\varepsilon_{k+1}). VΡ​(Tk+1|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k+1}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0 for Ρ∈(0,Ξ΅k+1)\varepsilon\in(0,\varepsilon_{k+1}) and VΡ​(Tk+1|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)<0V^{\varepsilon}(T_{k+1}|n-1)-V^{\varepsilon}(T_{n}|n-1)<0 for Ξ΅=Ξ΅k\varepsilon=\varepsilon_{k}, which implies that Ξ΅k+1<Ξ΅k\varepsilon_{k+1}<\varepsilon_{k} for k∈{1,…,nβˆ’2}k\in\{1,\ldots,n-2\}. ∎

Appendix C The Proof of Proposition 5

In the proof of Proposition 3 (Appendix A), only Lemma 1 needs the necessary condition that Ξ΄\delta is at the limit of 11. We will show that VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0 for Ρ∈(Ξ΅kΒ―,Ξ΅kΒ―)\varepsilon\in(\underline{\varepsilon_{k}},\overline{\varepsilon_{k}}) as long as Ξ΄\delta is sufficiently high such that F​(C|nβˆ’1)F​(D|nβˆ’1)βˆ’1+Δ​(Ξ΅;k)>0\frac{F(C|n-1)}{F(D|n-1)}-1+\Delta(\varepsilon;k)>0 but is not close to 11. We will provide three lemmas and present the proof of the proposition at the end.

First of all, Δ​(Ξ΅;k)\Delta(\varepsilon;k) is rewritten as

Δ​(Ξ΅;k)=δ​(1βˆ’Ξ΅)β€‹Οˆβ€‹(nβˆ’1,nβˆ’kβˆ’1,Ξ΅)1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)=Ξ΄D1​(Ξ΅;k)+D2​(Ξ΅;k),\displaystyle\Delta(\varepsilon;k)=\frac{\delta(1-\varepsilon)\psi(n-1,n-k-1,\varepsilon)}{1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)}=\frac{\delta}{D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k)},

where

D1​(Ξ΅;k)=1βˆ’Ξ΄(nβˆ’1nβˆ’kβˆ’1)​(Ξ΅)βˆ’n+k+1​(1βˆ’Ξ΅)βˆ’kβˆ’1,D2​(Ξ΅;k)=Ξ΄β€‹βˆ‘q=0k(nβˆ’1nβˆ’kβˆ’1+q)(nβˆ’1nβˆ’kβˆ’1)​(Ξ΅)q​(1βˆ’Ξ΅)βˆ’qβˆ’1.\displaystyle D_{1}(\varepsilon;k)=\frac{1-\delta}{\binom{n-1}{n-k-1}}(\varepsilon)^{-n+k+1}(1-\varepsilon)^{-k-1},~D_{2}(\varepsilon;k)=\delta\sum_{q=0}^{k}\frac{\binom{n-1}{n-k-1+q}}{\binom{n-1}{n-k-1}}(\varepsilon)^{q}(1-\varepsilon)^{-q-1}.

D1D_{1} and D2D_{2} are obtained by

1βˆ’Ξ΄β€‹βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)\displaystyle 1-\delta\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon) =(1βˆ’βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅))+βˆ‘q=0nβˆ’kβˆ’2Οˆβ€‹(nβˆ’1,q,Ξ΅)​(1βˆ’Ξ΄)\displaystyle=\left(1-\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)\right)+\sum_{q=0}^{n-k-2}\psi(n-1,q,\varepsilon)(1-\delta)
=βˆ‘q=nβˆ’kβˆ’1nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅)+(1βˆ’βˆ‘q=nβˆ’kβˆ’1nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅))​(1βˆ’Ξ΄)\displaystyle=\sum_{q=n-k-1}^{n-1}\psi(n-1,q,\varepsilon)+\left(1-\sum_{q=n-k-1}^{n-1}\psi(n-1,q,\varepsilon)\right)(1-\delta)
=(1βˆ’Ξ΄)+Ξ΄β€‹βˆ‘q=nβˆ’kβˆ’1nβˆ’1Οˆβ€‹(nβˆ’1,q,Ξ΅).\displaystyle=(1-\delta)+\delta\sum_{q=n-k-1}^{n-1}\psi(n-1,q,\varepsilon).

Now, our discussion is based on D1​(Ξ΅;β‹…)+D2​(Ξ΅;β‹…)D_{1}(\varepsilon;\cdot)+D_{2}(\varepsilon;\cdot) instead of Δ​(Ξ΅;β‹…)\Delta(\varepsilon;\cdot); because D1,D2>0D_{1},D_{2}>0, this can be one-to-one mapped inversely onto Ξ”\Delta.

Lemma 4.

D1​(Ξ΅;k)+D2​(Ξ΅;k)D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k) is strictly convex for Ρ∈(0,1)\varepsilon\in(0,1).

Proof.

It is to be shown that D1′′​(Ξ΅,β‹…)>0D_{1}^{{}^{\prime\prime}}(\varepsilon,\cdot)>0 and D2′′​(Ξ΅,β‹…)>0D_{2}^{{}^{\prime\prime}}(\varepsilon,\cdot)>0 for Ρ∈(0,1)\varepsilon\in(0,1). A direct calculation shows that

βˆ‚2[(Ξ΅)βˆ’n+k+1​(1βˆ’Ξ΅)βˆ’kβˆ’1]βˆ‚Ξ΅2=\displaystyle\frac{\partial^{2}[(\varepsilon)^{-n+k+1}(1-\varepsilon)^{-k-1}]}{\partial\varepsilon^{2}}=
[n​(1+n)​Ρ2βˆ’2​(n+1)​(nβˆ’kβˆ’1)​Ρ+(nβˆ’k)​(nβˆ’kβˆ’1)]​(1βˆ’Ξ΅)βˆ’kβˆ’3​Ρkβˆ’nβˆ’1>0\displaystyle~~~~~~~~\left[n(1+n)\varepsilon^{2}-2(n+1)(n-k-1)\varepsilon+(n-k)(n-k-1)\right](1-\varepsilon)^{-k-3}\varepsilon^{k-n-1}>0
βˆ‚2[(Ξ΅)q​(1βˆ’Ξ΅)βˆ’qβˆ’1]βˆ‚Ξ΅2=[2​Ρ2+4​Ρ​q+q​(qβˆ’1)]​(Ξ΅)qβˆ’2​(1βˆ’Ξ΅)βˆ’qβˆ’2>0.\displaystyle\frac{\partial^{2}[(\varepsilon)^{q}(1-\varepsilon)^{-q-1}]}{\partial\varepsilon^{2}}=[2\varepsilon^{2}+4\varepsilon q+q(q-1)](\varepsilon)^{q-2}(1-\varepsilon)^{-q-2}>0.

for Ρ∈(0,1)\varepsilon\in(0,1). ∎

Lemma 5.

For k∈{1,2,β‹―,nβˆ’2}k\in\{1,2,\cdots,n-2\}, there exists at least one Ξ΅kβˆ—βˆˆ(0,1)\varepsilon_{k}^{*}\in(0,1) such that D1′​(Ξ΅kβˆ—,k)+D2′​(Ξ΅kβˆ—,k)=0D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*},k)+D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*},k)=0. For k=nβˆ’1k=n-1, no such Ξ΅kβˆ—\varepsilon_{k}^{*} exists.

Proof.

First of all, it is given that D1′​(Ξ΅,nβˆ’1)+D2′​(Ξ΅,nβˆ’1)=1δ​(1βˆ’Ξ΅)nD_{1}^{{}^{\prime}}(\varepsilon,n-1)+D_{2}^{{}^{\prime}}(\varepsilon,n-1)=\frac{1}{\delta(1-\varepsilon)^{n}}, which shows that no Ξ΅kβˆ—\varepsilon_{k}^{*} exists. A direct calculation shows that

βˆ‚[(Ξ΅)βˆ’n+k+1​(1βˆ’Ξ΅)βˆ’kβˆ’1]βˆ‚Ξ΅\displaystyle\frac{\partial[(\varepsilon)^{-n+k+1}(1-\varepsilon)^{-k-1}]}{\partial\varepsilon} =[k+1βˆ’(1βˆ’Ξ΅)​n]​(Ξ΅)kβˆ’n​(1βˆ’Ξ΅)βˆ’2βˆ’k\displaystyle=[k+1-(1-\varepsilon)n](\varepsilon)^{k-n}(1-\varepsilon)^{-2-k} (31)
βˆ‚[(Ξ΅)q​(1βˆ’Ξ΅)βˆ’qβˆ’1]βˆ‚Ξ΅\displaystyle\frac{\partial[(\varepsilon)^{q}(1-\varepsilon)^{-q-1}]}{\partial\varepsilon} =q​(Ξ΅)qβˆ’1​(1βˆ’Ξ΅)βˆ’qβˆ’1+(q+1)​(Ξ΅)q​(1βˆ’Ξ΅)βˆ’qβˆ’2.\displaystyle=q(\varepsilon)^{q-1}(1-\varepsilon)^{-q-1}+(q+1)(\varepsilon)^{q}(1-\varepsilon)^{-q-2}. (32)

For a sufficiently small Ξ΅\varepsilon, (31) is negative, and its absolute value can be arbitrarily larger, but (32) is positive, and its value can be arbitrarily smaller. Then, we have D1′​(Ξ΅kβˆ—,k)+D2′​(Ξ΅kβˆ—,k)<0D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*},k)+D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*},k)<0 for a sufficiently small Ξ΅\varepsilon. For a sufficiently large Ξ΅\varepsilon, it is easy to check that D1′​(Ξ΅kβˆ—,k)+D2′​(Ξ΅kβˆ—,k)>0D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*},k)+D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*},k)>0. As D1′​(β‹…)D_{1}^{\prime}(\cdot) and D2′​(β‹…)D_{2}^{\prime}(\cdot) are continuous, there exists at least one Ξ΅kβˆ—βˆˆ(0,1)\varepsilon_{k}^{*}\in(0,1) that makes D1′​(Ξ΅kβˆ—,k)+D2′​(Ξ΅kβˆ—,k)=0D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*},k)+D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*},k)=0. ∎

Lemma 6.

For a given k∈{1,2,β‹―,nβˆ’2}k\in\{1,2,\cdots,n-2\}, D1​(Ξ΅,k)+D2​(Ξ΅,k)D_{1}(\varepsilon,k)+D_{2}(\varepsilon,k) has a unique minimum over Ρ∈(0,1)\varepsilon\in(0,1). The infimum of D1​(Ξ΅,nβˆ’1)+D2​(Ξ΅,nβˆ’1)D_{1}(\varepsilon,n-1)+D_{2}(\varepsilon,n-1) is obtained at Ξ΅=0\varepsilon=0.

Proof.

It is trivial for the case of k=nβˆ’1k=n-1. For 0<k<nβˆ’10<k<n-1, the previous two lemmas are the conditions that D1​(Ξ΅,k)+D2​(Ξ΅,k)D_{1}(\varepsilon,k)+D_{2}(\varepsilon,k) has a unique minimum somewhere at Ρ∈(0,1)\varepsilon\in(0,1). ∎

The proof for Proposition 5 is as follows:

Proof.

At first, it is easy to check that k=nβˆ’1k=n-1 makes Ρ¯nβˆ’1=0\underline{\varepsilon}_{n-1}=0, and the proof can be provided by the same method used in Proposition 3. Lemma 6 implies that Δ​(Ξ΅,k)\Delta(\varepsilon,k) should have a single maximum over Ρ∈(0,1)\varepsilon\in(0,1) for 0<k<nβˆ’10<k<n-1. When the proper Ξ΄\delta is given, there exists an Ρ∈(Ρ¯k,Ρ¯k)\varepsilon\in(\underline{\varepsilon}_{k},\overline{\varepsilon}_{k}) that makes VΡ​(Tk|nβˆ’1)βˆ’VΡ​(Tn|nβˆ’1)>0V^{\varepsilon}(T_{k}|n-1)-V^{\varepsilon}(T_{n}|n-1)>0. ∎

Appendix D The Proof of Proposition 6

For the proof, we need a characterization of the shapes of D1​(Ξ΅;k)+D2​(Ξ΅;k)D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k) over Ρ∈(0,1)\varepsilon\in(0,1) for the different kk. For this characterization, the following three lemmas are provided, and the proof is presented at the end.

Lemma 7.

For any k∈{1,2,β‹―,nβˆ’2}k\in\{1,2,\cdots,n-2\}, D1′​(Ξ΅;k+1)+D2′​(Ξ΅;k+1)>D1′​(Ξ΅;k)+D2′​(Ξ΅;k)D_{1}^{{}^{\prime}}(\varepsilon;k+1)+D_{2}^{{}^{\prime}}(\varepsilon;k+1)>D_{1}^{{}^{\prime}}(\varepsilon;k)+D_{2}^{{}^{\prime}}(\varepsilon;k) over Ρ∈(0,1)\varepsilon\in(0,1).

Proof.

A direct calculation shows that

D1′​(Ξ΅;k+1)βˆ’D1′​(Ξ΅;k)=\displaystyle D_{1}^{{}^{\prime}{}}(\varepsilon;k+1)-D_{1}^{{}^{\prime}{}}(\varepsilon;k)=
(1βˆ’Ξ΄)​(1βˆ’Ξ΅)βˆ’kβˆ’3​Ρkβˆ’n​[Ρ​(nβˆ’1βˆ’k+nβˆ’1)​(k+n​(Ξ΅βˆ’1)+2)βˆ’(1βˆ’Ξ΅)​(nβˆ’1βˆ’k+nβˆ’2)​(k+n​(Ξ΅βˆ’1)+1)](nβˆ’1βˆ’k+nβˆ’2)​(nβˆ’1βˆ’k+nβˆ’1)>0,\displaystyle~~\frac{(1-\delta)(1-\varepsilon)^{-k-3}\varepsilon^{k-n}[\varepsilon\binom{n-1}{-k+n-1}(k+n(\varepsilon-1)+2)-(1-\varepsilon)\binom{n-1}{-k+n-2}(k+n(\varepsilon-1)+1)]}{\binom{n-1}{-k+n-2}\binom{n-1}{-k+n-1}}>0,
D2′​(Ξ΅;k+1)βˆ’D2′​(Ξ΅;k)=\displaystyle D_{2}^{{}^{\prime}{}}(\varepsilon;k+1)-D_{2}^{{}^{\prime}{}}(\varepsilon;k)=
Ξ΄β€‹βˆ‘q=0k[(n​q(k+1)​(nβˆ’kβˆ’1)​(kβˆ’q+1)​(n+qβˆ’kβˆ’1))​{q​(Ξ΅)qβˆ’1​(1βˆ’Ξ΅)βˆ’qβˆ’1+(q+1)​(Ξ΅)q​(1βˆ’Ξ΅)βˆ’qβˆ’2}]\displaystyle~~\delta\sum_{q=0}^{k}\left[\left(\frac{nq}{(k+1)(n-k-1)(k-q+1)(n+q-k-1)}\right)\left\{q(\varepsilon)^{q-1}(1-\varepsilon)^{-q-1}+(q+1)(\varepsilon)^{q}(1-\varepsilon)^{-q-2}\right\}\right]
+Ξ΄(nβˆ’1nβˆ’kβˆ’2)​[q​Ρk​(1βˆ’Ξ΅)βˆ’kβˆ’2+(q+1)​Ρk+1​(1βˆ’Ξ΅)βˆ’kβˆ’3]>0\displaystyle~~+\frac{\delta}{\binom{n-1}{n-k-2}}[q\varepsilon^{k}(1-\varepsilon)^{-k-2}+(q+1)\varepsilon^{k+1}(1-\varepsilon)^{-k-3}]>0

∎

Let us define Ξ΅kβˆ—=arg min​{D1​(Ξ΅;k)+D2​(Ξ΅;k):Ρ∈(0,1)}\varepsilon^{*}_{k}=\text{arg min}\{D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k):\varepsilon\in(0,1)\} for k∈{1,2,β‹―,nβˆ’1}k\in\{1,2,\cdots,n-1\}.

Lemma 8.

For any 0<k<nβˆ’10<k<n-1, D1​(Ξ΅kβˆ—;k+1)+D2​(Ξ΅kβˆ—;k+1)=D1​(Ξ΅kβˆ—;k)+D2​(Ξ΅kβˆ—;k)D_{1}(\varepsilon_{k}^{*};k+1)+D_{2}(\varepsilon_{k}^{*};k+1)=D_{1}(\varepsilon_{k}^{*};k)+D_{2}(\varepsilon_{k}^{*};k) holds.

Proof.

Ξ΅kβˆ—\varepsilon^{*}_{k} satisfies D1′​(Ξ΅kβˆ—;k)+D2′​(Ξ΅kβˆ—;k)=0D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*};k)+D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*};k)=0. We should have [D1​(Ξ΅kβˆ—;k)βˆ’D1​(Ξ΅kβˆ—;k+1)]+[D2​(Ξ΅kβˆ—;k)βˆ’D2​(Ξ΅kβˆ—;k+1)]=0[D_{1}(\varepsilon_{k}^{*};k)-D_{1}(\varepsilon_{k}^{*};k+1)]+[D_{2}(\varepsilon_{k}^{*};k)-D_{2}(\varepsilon_{k}^{*};k+1)]=0 for the proof. A direct calculation along with D1′​(Ξ΅kβˆ—;k)+D2′​(Ξ΅kβˆ—;k)=0D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*};k)+D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*};k)=0 shows that βˆ’Ξ΅kβˆ—nβˆ’kβˆ’1​D1′​(Ξ΅kβˆ—,k)=[D1​(Ξ΅kβˆ—;k)βˆ’D1​(Ξ΅kβˆ—;k+1)]=Ξ΅kβˆ—nβˆ’kβˆ’1​D2′​(Ξ΅kβˆ—,k)-\frac{\varepsilon_{k}^{*}}{n-k-1}D_{1}^{{}^{\prime}}(\varepsilon_{k}^{*},k)=[D_{1}(\varepsilon_{k}^{*};k)-D_{1}(\varepsilon_{k}^{*};k+1)]=\frac{\varepsilon_{k}^{*}}{n-k-1}D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*},k). Some calculation shows that Ξ΅kβˆ—nβˆ’kβˆ’1​D2′​(Ξ΅kβˆ—,k)+[D2​(Ξ΅kβˆ—;k)βˆ’D2​(Ξ΅kβˆ—;k+1)]=0\frac{\varepsilon_{k}^{*}}{n-k-1}D_{2}^{{}^{\prime}}(\varepsilon_{k}^{*},k)+[D_{2}(\varepsilon_{k}^{*};k)-D_{2}(\varepsilon_{k}^{*};k+1)]=0. ∎

Lemma 9.

For 0<k<nβˆ’10<k<n-1,

{D1​(Ξ΅;k)+D2​(Ξ΅;k)>D1​(Ξ΅;k+1)+D2​(Ξ΅;k+1)for Ρ∈(0,Ξ΅kβˆ—)D1​(Ξ΅;k)+D2​(Ξ΅;k)<D1​(Ξ΅;k+1)+D2​(Ξ΅;k+1)for Ρ∈(Ξ΅kβˆ—,1).\displaystyle\begin{cases}D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k)>D_{1}(\varepsilon;k+1)+D_{2}(\varepsilon;k+1)&\text{for $\varepsilon\in(0,\varepsilon_{k}^{*})$}\\ D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k)<D_{1}(\varepsilon;k+1)+D_{2}(\varepsilon;k+1)&\text{for $\varepsilon\in(\varepsilon_{k}^{*},1)$}.\end{cases}
Proof.

Let us define that

d​(Ξ΅)=[D1​(Ξ΅;k)+D2​(Ξ΅;k)]βˆ’[D1​(Ξ΅;k+1)+D2​(Ξ΅;k+1)].\displaystyle d(\varepsilon)=\left[D_{1}(\varepsilon;k)+D_{2}(\varepsilon;k)\right]-\left[D_{1}(\varepsilon;k+1)+D_{2}(\varepsilon;k+1)\right].

It is noted that d′​(Ξ΅)<0d^{\prime}(\varepsilon)<0 by Lemma 7, and d​(Ξ΅kβˆ—)=0d(\varepsilon_{k}^{*})=0 by Lemma 8. As two curves cannot be tangent, d​(Ξ΅)d(\varepsilon) should change its sign around Ξ΅kβˆ—\varepsilon_{k}^{*}. We prove that d​(Ξ΅)d(\varepsilon) should change its sign only once around Ξ΅kβˆ—\varepsilon_{k}^{*}, from positive to negative. i) For a sufficiently small value of Ξ΅\varepsilon, D1​(Ξ΅;k)βˆ’D1​(Ξ΅;k+1)D_{1}(\varepsilon;k)-D_{1}(\varepsilon;k+1) can become arbitrarily larger while D2​(Ξ΅;k+1)βˆ’D2​(Ξ΅;k)D_{2}(\varepsilon;k+1)-D_{2}(\varepsilon;k) can become arbitrarily smaller. Hence, there exists an Ξ΅^k\hat{\varepsilon}_{k} such that d​(Ξ΅)>0d(\varepsilon)>0 is ensured for Ρ∈(0,Ξ΅^k)\varepsilon\in(0,\hat{\varepsilon}_{k}). ii) Let us assume that d​(Ξ΅)d(\varepsilon) would change its sign around Ξ΅^k<Ξ΅kβˆ—\hat{\varepsilon}_{k}<\varepsilon_{k}^{*} from positive to negative. As d​(Ξ΅^k)=0d(\hat{\varepsilon}_{k})=0 and d′​(Ξ΅)<0d^{\prime}(\varepsilon)<0, d​(Ξ΅)<0d(\varepsilon)<0 holds for Ρ∈(Ξ΅^k,1)\varepsilon\in(\hat{\varepsilon}_{k},1). Thus, d​(Ξ΅kβˆ—)=0d(\varepsilon_{k}^{*})=0 cannot be satisfied without violating the strict convexity of D1+D2D_{1}+D_{2}. iii) Once d​(Ξ΅kβˆ—)=0d(\varepsilon_{k}^{*})=0 is realized, by the same logic, d​(Ξ΅)<0d(\varepsilon)<0 holds for Ρ∈(Ξ΅kβˆ—,1)\varepsilon\in(\varepsilon_{k}^{*},1). ∎

The proof for Proposition 6 is as follows:

Proof.

According to Lemma 9, for 0<Ξ΅<Ξ΅kβˆ—0<\varepsilon<\varepsilon^{*}_{k}, Δ​(Ξ΅;k)<Δ​(Ξ΅;k+1)\Delta(\varepsilon;k)<\Delta(\varepsilon;k+1) holds, and Δ​(Ξ΅;k)>Δ​(Ξ΅;k+1)\Delta(\varepsilon;k)>\Delta(\varepsilon;k+1) holds for Ξ΅kβˆ—<Ξ΅<1\varepsilon^{*}_{k}<\varepsilon<1. Also the lemma implies that the single-maximum curves of Δ​(Ξ΅;k)\Delta(\varepsilon;k) and Δ​(Ξ΅;k+1)\Delta(\varepsilon;k+1) should be configured such that the only intersection between the two is realized at the descending part of Δ​(Ξ΅;k+1)\Delta(\varepsilon;k+1). When a sufficiently high Ξ΄\delta is given, the proposition is satisfied. ∎

References

  • Axelrod (1997) Axelrod, R. (1997). The Complexity of Cooperation: Agent-Based Models of Competition and Collaboration. Princeton University Press, 1st printing ed.
  • Axelrod & Hamilton (1981) Axelrod, R., & Hamilton, W.Β D. (1981). The evolution of cooperation. Science, 211(4489), 1390–1396.
  • Bendor & Mookherjee (1987) Bendor, J., & Mookherjee, D. (1987). Institutional structure and the logic of ongoing collective action. The American Political Science Review, 81(1), 129–154.
  • Binmore & Samuelson (1992) Binmore, K., & Samuelson, L. (1992). Evolutionary stability in repeated games played by finite automata. Journal of Economic Theory, 57(2), 278–305.
  • Boyd (1989) Boyd, R. (1989). Mistakes allow evolutionary stability in the repeated prisoner’s dilemma game. Journal of Theoretical Biology, 136(1), 47–56.
  • Boyd & Lorberbaum (1987) Boyd, R., & Lorberbaum, J.Β P. (1987). No pure strategy is evolutionarily stable in the repeated prisoner’s dilemma game. Nature, 327(6117), 58–59.
  • Boyd & Richerson (1988) Boyd, R., & Richerson, P. (1988). The evolution of reciprocity in sizable groups. Journal of Theoretical Biology, 132(3), 337–356.
  • Choi (2007) Choi, J.-K. (2007). Trembles may support cooperation in a repeated prisoner’s dilemma game. Journal of Economic Behavior & Organization, 63(3), 384–393.
  • Farrell & Ware (1989) Farrell, J., & Ware, R. (1989). Evolutionary stability in the repeated prisoner’s dilemma. Theoretical Population Biology, 36(2), 161–166.
  • Fudenberg & Maskin (1986) Fudenberg, D., & Maskin, E. (1986). The folk theorem in repeated games with discounting or with incomplete information. Econometrica, 54(3), 533–554.
  • Fundenberg & Maskin (1990) Fundenberg, D., & Maskin, E. (1990). Evolution and cooperation in noisy repeated games. The American Economic Review, 80(2), 274–279.
  • Gintis (2006) Gintis, H. (2006). Behavioral ethics meets natural justice. Politics, Philosophy & Economics, 5(1), 5–32.
  • Joshi (1987) Joshi, N.Β V. (1987). Evolution of cooperation by reciprocation within structured demes. Journal of Genetics, 66(1), 69–84.
  • Samuelson (2002) Samuelson, L. (2002). Evolution and game theory. The Journal of Economic Perspectives, 16(2), 47–66.
  • Sethi & Somanathan (2003) Sethi, R., & Somanathan, E. (2003). Understanding reciprocity. Journal of Economic Behavior & Organization, 50(1), 1–27.
  • Taylor (1987) Taylor, M. (1987). The possibility of cooperation. Cambridge University Press, revised ed.
  • Weibull (1997) Weibull, J.Β W. (1997). Evolutionary Game Theory. The MIT Press.
  • Yao (1996) Yao, X. (1996). Evolutionary stability in the n-person iterated prisoner’s dilemma. Bio Systems, 37(3), 189–197.
  • Young & Foster (1991) Young, P.Β H., & Foster, D. (1991). Cooperation in the long-run. Games and Economic Behavior, 3(1), 145–156.