This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Conditional investment strategy in evolutionary trust games with repeated group interactions

Linjie Liu Xiaojie Chen xiaojiechen@uestc.edu.cn College of Science, Northwest A & F University, Yangling 712100, China School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu 611731, China
Abstract

It has a long tradition to study trust behavior among humans or artificial agents by investigating the trust game. Although previous studies based on evolutionary game theory have revealed that trust and trustworthiness can be promoted if network structure or reputation is considered, they often assume that interactions among agents are one-shot and investors do not consider the investment environment before making decisions, which collide with many realistic situations. In this paper, we introduce the conditional investment strategy into the repeated NN-player trust game, in which conditional investors decide to invest or not depending on their assessment of the trustworthiness level of the group. By using the approach of the Markov decision process, we study the evolutionary dynamics of trust in repeated group interactions with the conditional investment strategy. We find that conditional investors can form an effective alliance with trustworthy trustees, hence they can sweep out untrustworthy trustees. Moreover, we verify that such alliance can exist in a wide range of model parameters. These results may explain why trusting in others and reciprocating them with trustworthy actions can be sustained in game interactions among intelligent agents.

keywords:
Evolutionary game theory , the NN-player trust game , trustworthiness , conditional investment strategy , Markov decision process
journal: Information Sciences

1 Introduction

Trust is a value-laden concept in Distributed Artificial Intelligence (DAI), and it is also fundamental for cooperative behavior and other forms of prosocial behavior [17, 27, 32, 42, 45]. However, a basic fact accepted by the public is that trust is often accompanied by risks. As Deutsch defined, entering a trust relationship is to choose an ambiguous path that can lead to a beneficial event or a harmful event depending on the behavior of the other person [13]. In other words, there are risks in the interactions with potentially untrustworthy agents, while the interactions with trusted agents will lead to benefits [39]. Thus understanding how trust and trustworthiness evolve in Multi-agent Systems (MAS) and exploring the conditions in which they will emerge have always been great challenges [19].

Evolutionary game theory provides a theoretical framework to study the above problems [15, 25, 29, 31, 41, 48], and the trust game (TG) has been widely used to study trust and trustworthiness as a typical paradigm [1, 22]. The classical TG model involves interactions between an investor and a trustee where the investor first decides whether he/she is willing to invest his/her funds to the trustee, and then the trustee decides how much to give back to the investor (or not at all) [4, 23]. Previous theoretical work has proved that the subgame perfect equilibrium of TG is that the investor invests zero and the trustee returns zero [18]. The above two-player TG model perfectly characterizes pairwise interactions between an investor and a trustee and has recently been extended to group interactions of multiple players [1, 8, 22, 24]. A representative NN-player trust game (NTG) framework was developed by Abbass et al. [1], and then introduced to a population of agents playing TG concurrently in a well-mixed environment. Along this line, many researchers have considered network structure [8, 9, 24], punishment strategy [16], asymmetric demographic parameters [26], and reputation [22], into the NTG model to explore the evolution of trust.

Although there were attempts for enhancing the level of trust from different aspects, we notice that these studies are usually carried out in the framework of one-shot interaction, to our knowledge [8, 9, 16, 24, 26]. Indeed, real interactions are repeated rather than the one-shot interaction assumed above [11, 10, 30, 33, 28]. For example, we are likely to interact frequently with friends, co-workers, and economic partners in our daily life. Second, agents’ behaviors may be affected by noise (such as imitation error and behavioral mutation) in the process of social learning [37, 38, 47], which is usually ignored by previous theoretical models. Furthermore, in repeated interactions, investors have the opportunity to adjust their actions according to the investment environment they encountered [30, 34, 28]. Furthermore, it is very common for investors to choose conditional investment behavior in their daily economic activities when interacting with other agents [2, 5]. Concretely, the decision to invest or not will depend on one’s own assessment of the abundance of trustworthy trustees in the group. However, it is still unclear what the effects of conditional investment strategy on the evolution of trust in the repeated NTG are.

In order to answer the above question, we construct an evolutionary game model based on the NTG where the interactions among agents are repeated (see Fig.1). In our model, we consider three strategists: investor, untrustworthy trustee, and trustworthy trustee. Different from previous works [22, 24], we assume investors will invest deterministically in the first round of the game, but in the subsequent game rounds their decision to invest or not depends on the number of the trustworthy trustee in the group. Using the approach of the Markov decision process (MDP), we study the stochastic evolutionary dynamics in finite populations. We find that the introduction of the conditional investment strategy can lead to the emergence of high trust and high trustworthiness in repeated group interactions. Furthermore, we verify that such evolutionary outcomes are robust against model parameters.

2 Related Works

The formalization of the NTG that we follow here was proposed in Ref. [1], where Abbass et al. investigated the evolutionary dynamics of trust in an infinite well-mixed population. By analyzing the replicator dynamics, they found that both the whole society and all agents can obtain the maximal wealth when the initial population contains no untrustworthy agents. While the initial population consists of one single untrustworthy agent, untrustworthy agents would spread rapidly to the whole population. Furthermore, they found that the system will eventually converge to a stable state, in which untrustworthy agents will not completely occupy the whole population, because a fraction of the population would always remain trustworthy even if there are few or no investors.

Nevertheless, the analysis of replicator dynamics in the previous literature requires that the network structure is a complete graph, and thus this approach cannot be used to analyze the evolutionary dynamics when agents interact with local neighborhoods in social networks. Subsequently, Chica et al. studied the evolution of trust in different social networks including regular lattices, scale-free, and random networks [8]. By performing simulations, they found that trust can be promoted when agents interact on a social network even if there are untrustworthy agents in the population initially. In addition, the level of trust is influenced by network structure, the temptation to defect, and the initial number of untrustworthy trustees in the population. Along this line, the effects of different evolutionary update rules on promoting the evolution of trust were investigated [9], and simulation results showed that updating rules play an important role in promoting trust and improving global net wealth.

However, some other studies have found that social networks cannot produce a high level of trust [22, 24]. Recently, Kumar et al. investigated the trust game where the trustor’s investment and the trustee’s return of the investment are two important parameters on different social networks [24]. By performing Monte Carlo simulations, they found that the network structure has little effect on the evolution of trust and trustworthiness. In particular, trust cannot evolve in well-mixed populations, lattices, random, or scale-free networks. Considering that in the real society, the investment behavior of investors is often related to the reputation of the trustees, and agents with good reputation are more likely to attract investors’ investment, Hu et al. studied networked NTG with an adaptive reputation based on the third-party moral assessment system [22]. They showed that the frequency of untrustworthy trustees will decrease when rational investors can get the reputation score of the trustee.

The literature mentioned above does not consider the case in which agents’ decision-making may be affected by random factors, including imitation error and behavioral mutation. Considering this, Lim explored the evolutionary dynamics of trust in finite populations and found that the combination of strong selection in the population of investors and weak selection in the population of trustees can promote the emergence of high trustworthiness and high trust [26]. It is worth noting that they analyzed the stationary distribution of the system in the assumption of weak-mutation limit. The above assumption has been justified in population genetics [12, 14], but this approximate does not seem to be suitable for modeling social learning [47].

Different from previous works both in infinite populations [1] and structure populations [8], we study the evolutionary dynamics of trust in a finite population where imitation error and behavioral mutation are both exist. Besides, we release the assumptions that the mutation rate is significantly small [26] and the interactions between agents are one-shot [1, 8, 24, 26], and investigate the stochastic dynamics of trust in finite populations with repeated group interactions when mutation rates are arbitrarily large. Accordingly, we are committed to solving the following questions: can potentially richer evolutionary dynamics be produced? Can a high level of trust and trustworthiness be reached in the NTG with repeated group interactions? To answer these questions, we establish an NTG model where the conditional investment strategy is involved in repeated group interactions. Concretely, conditional investors decide to invest only when the number of trustworthy trustees in the game group reaches their expected threshold. We study the evolutionary dynamics of strategies in a finite population by using the approach of MDP. Our concrete theoretical model and methods are presented in the following section.

Refer to caption
Figure 1: Illustration of two different interactions scenarios in repeated NN-player trust games where the repeated probability is set as ww. NN agents are randomly sampled to form a group for playing the repeated trust game where a conditional investor will act as an unconditional investor in the first round, and then can agree or refuse to invest depending on whether the number of trustworthy agents in the group exceeds the expected threshold in subsequent game rounds.

3 Theoretical Model and Methods

3.1 Repeated NTG

Let us assume a finite well-mixed population with ZZ agents who play a repeated NTG [1, 8]. At every time step, NN agents are chosen and offered the opportunity to participate in the repeated NTG. As considered in previous work [28], the trust game will be repeated with probability ww with 0<w<10<w<1 (also called the discount factor in previous work [21]), resulting in an average number of r=1/wr=1/w rounds. In this game, each agent needs to make two decisions in advance. First, agent needs to decide whether to act as a trustee or an investor. The second is to decide whether to be trustworthy or not. Here, we consider three baseline strategists in this NTG:

  • 1.

    Conditional Investor (CICI): Pays an observation cost σ\sigma before the game to gather information including the number of trustworthy agents and untrustworthy agents in the group, and always chooses to invest tvtv in the first round. Subsequently, CICI agents invest only when the number of trustworthy agents in the group is not less than their expected threshold MM.

  • 2.

    Trustworthy Agent (TT): Returns the received fund obtained in each round multiplied by RTR_{T} to the investors.

  • 3.

    Untrustworthy Agent (UU): Returns nothing to the investors in all game rounds.

We shall first notice that, by adopting CICI, an agent will always invest in the first round and subsequently either act as an investor or as an outsider, depending on whether the number of trustworthy trustees in the group has reached the expected level MM (see Fig.1). If the expected level is reached, CICI agents are willing to pay tvtv to trustees, and each trustee receives the same amount of benefit NCItvNT+NU\frac{N_{CI}tv}{N_{T}+N_{U}}, where NCI,NTN_{CI},N_{T}, and NUN_{U} respectively denote the number of CI,TCI,T, and UU agents in the group. Then each TT agent returns RTtvNT+NU\frac{R_{T}tv}{N_{T}+N_{U}} to every CICI agent and reserves RTNCItvNT+NU\frac{R_{T}N_{CI}tv}{N_{T}+N_{U}} for himself/herself. While each UU agent returns nothing to CICI agents, but keeps RUNCItvNT+NU\frac{R_{U}N_{CI}tv}{N_{T}+N_{U}} for himself/herself. The parameters RTR_{T} and RUR_{U} respectively denote the multiply factor of TT agents and UU agents. If the number of TT agents in the group does not reach the expected level, every agent in the group can obtain nothing from the subsequent game rounds. Thus the payoffs of CI,TCI,T, and UU agents obtained from the game are depicted by

ΠCI\displaystyle\Pi_{CI} ={RTNTtvNNCI1tv+(RTNTtvNNCI1tv)(r1)Θ(NTM)σ,ifNCIN1;0,otherwise.\displaystyle=\left\{\begin{aligned} &\frac{R_{T}N_{T}tv}{N-N_{CI}-1}-tv+(\frac{R_{T}N_{T}tv}{N-N_{CI}-1}-tv)(r-1)\Theta(N_{T}-M)-\sigma,\quad\text{if}\ N_{CI}\neq N-1\ ;\\ &0,\quad\text{otherwise.}\end{aligned}\right. (1)
ΠT\displaystyle\Pi_{T} ={RTNCItvNNCI+RTNCItvNNCI(r1)Θ(NT+1M),ifNCIN;0,otherwise.\displaystyle=\left\{\begin{aligned} &\frac{R_{T}N_{CI}tv}{N-N_{CI}}+\frac{R_{T}N_{CI}tv}{N-N_{CI}}(r-1)\Theta(N_{T}+1-M),\quad\text{if}\ N_{CI}\neq N\ ;\\ &0,\quad\text{otherwise.}\end{aligned}\right. (2)
ΠU\displaystyle\Pi_{U} ={RUNCItvNNCI+RUNCItvNNCI(r1)Θ(NTM),ifNCIN;0,otherwise,\displaystyle=\left\{\begin{aligned} &\frac{R_{U}N_{CI}tv}{N-N_{CI}}+\frac{R_{U}N_{CI}tv}{N-N_{CI}}(r-1)\Theta(N_{T}-M),\quad\text{if}\ N_{CI}\neq N\ ;\\ &0,\quad\text{otherwise,}\end{aligned}\right. (3)

where Θ(k)\Theta(k) is the Heaviside function (that is, Θ(k)=1\Theta(k)=1 whenever k0k\geq 0, being zero otherwise). The threshold values MM range from 1 to N1N-1. If M<1M<1 all CICI agents will choose to invest unconditionally and M>N1M>N-1 means always choosing not to invest fund.

In a finite population with ZZ agents, NN agents are randomly sampled to form a game group and accumulate their payoffs by interacting with other group agents according with the payoff functions of NTG defined above. Concretely, when there are iCIi_{CI} agents choosing to adopt the CICI strategy, iTi_{T} agents choosing to adopt the TT strategy, iU=ZiCIiTi_{U}=Z-i_{CI}-i_{T} agents choosing to adopt the UU strategy in a finite population, the probability of finding jCIj_{CI} CICI agents, jTj_{T} TT agents, and jUj_{U} UU agents in a sample game group can be depicted by the multiple hypergeometric distribution, given as

H(jCI,jT,N,iCI,iT,Z)=(iCIjCI)(iTjT)(ZiCIiTNjCIjT)(ZN),\displaystyle H(j_{CI},j_{T},N,i_{CI},i_{T},Z)=\frac{\tbinom{i_{CI}}{j_{CI}}\tbinom{i_{T}}{j_{T}}\tbinom{Z-i_{CI}-i_{T}}{N-j_{CI}-j_{T}}}{\tbinom{Z}{N}},

which describes the configuration of the population for a given time. Accordingly, for a given configuration i=(iCI,iT)\textbf{i}=({i_{CI},i_{T}}), the average payoffs of CI,TCI,T, and UU agents can be respectively computed as

fCI\displaystyle f_{CI} =\displaystyle= jCI=0N1jT=0N1jCI(iCI1jCI)(iTjT)(ZiCIiTN1jCIjT)(Z1N1)\displaystyle\sum\limits_{j_{CI}=0}^{N-1}\sum\limits_{j_{T}=0}^{N-1-j_{CI}}\frac{\tbinom{i_{CI}-1}{j_{CI}}\tbinom{i_{T}}{j_{T}}\tbinom{Z-i_{CI}-i_{T}}{N-1-j_{CI}-j_{T}}}{\tbinom{Z-1}{N-1}}
×\displaystyle\times ΠCI(jCI+1,jT),\displaystyle\Pi_{CI}(j_{CI}+1,j_{T}),
fT\displaystyle f_{T} =\displaystyle= jCI=0N1jT=0N1jCI(iCIjCI)(iT1jT)(ZiCIiTN1jCIjT)(Z1N1)\displaystyle\sum\limits_{j_{CI}=0}^{N-1}\sum\limits_{j_{T}=0}^{N-1-j_{CI}}\frac{\tbinom{i_{CI}}{j_{CI}}\tbinom{i_{T}-1}{j_{T}}\tbinom{Z-i_{CI}-i_{T}}{N-1-j_{CI}-j_{T}}}{\tbinom{Z-1}{N-1}}
×\displaystyle\times ΠT(jCI,jT+1),\displaystyle\Pi_{T}(j_{CI},j_{T}+1),
fU\displaystyle f_{U} =\displaystyle= jCI=0N1jT=0N1jCI(iCIjCI)(iTjT)(ZiCIiT1N1jCIjT)(Z1N1)\displaystyle\sum\limits_{j_{CI}=0}^{N-1}\sum\limits_{j_{T}=0}^{N-1-j_{CI}}\frac{\tbinom{i_{CI}}{j_{CI}}\tbinom{i_{T}}{j_{T}}\tbinom{Z-i_{CI}-i_{T}-1}{N-1-j_{CI}-j_{T}}}{\tbinom{Z-1}{N-1}}
×\displaystyle\times ΠU(jCI,jT),\displaystyle\Pi_{U}(j_{CI},j_{T}),

where ΠCI,ΠT,\Pi_{CI},\Pi_{T}, and ΠU\Pi_{U} are respectively shown in Eqs. (LABEL:eq1)-(LABEL:eq3).

3.2 Social Learning

The average payoffs of agents obtained above stand for their social success, then we can analyze the evolutionary dynamics of strategy adopting by using social learning [35, 36, 37], which means that the most successful strategy will more often tend to be adopted by other agents. Generally, the above social learning can be characterized by the pairwise comparison rule. Concretely, the probability that an agent adopting strategy AA with the payoff fAf_{A} imitates another agent’s strategy BB with payoff fBf_{B} is given by the Fermi function [40]

P(fBfA)=11+exp(β(fBfA)),\displaystyle P(f_{B}-f_{A})=\frac{1}{1+\exp(-\beta(f_{B}-f_{A}))}, (4)

where β\beta characterizes the intensity of selection, controlling how the imitation process depends on the difference between the payoffs of two agents. For β\beta\rightarrow\infty (strong imitation), any difference in the payoffs will produce a sizeable effect on imitation probability. For β0\beta\rightarrow 0 (weak selection), the strategy is imitated randomly. In between these extremes, the difference of the payoffs and stochastic effects associated with errors can both impact imitation probability.

Furthermore, we introduce the exploration term: with probability 1μ1-\mu, an agent with strategy AA adopts another agent’s strategy BB according to the social learning rule described above, and with probability μ\mu, he/she randomly selects a strategy different from the current one from the strategy space. Thus, the probability that an agent with strategy AA adopts strategy BB according to the mutation-selection process can be written as

TAB\displaystyle T_{A\rightarrow B} =\displaystyle= (1μ)[iAZiBZ111+exp(β(fBfA))]+μiA2Z.\displaystyle(1-\mu)[\frac{i_{A}}{Z}\frac{i_{B}}{Z-1}\frac{1}{1+\exp({-\beta(f_{B}-f_{A})})}]+\mu\frac{i_{A}}{2Z}. (5)

The existence of behavioral mutation makes the population never fixate in any of the three possible monomorphic configurations. This fact renders the stationary distribution and the gradient of selection as the appropriate quantities to analyze the behavior of the population.

3.3 Markov Decision Process

We know that the update process only relies on the current state i(t)\textbf{i}(t) of the system, so that i(t)={iCI,iT}\textbf{i}(t)=\{i_{CI},i_{T}\} has Markov property. Thus the evolutionary dynamics of CI, T, and U can be described by the Markov chain in two-dimensional space. Then the evolutionary dynamics of the system can be analyzed by investigating the probability distribution function pi(t)p_{\textbf{i}}(t) providing information about the pervasiveness of each configuration at time t, and it satisfies the following discrete time Master Equation [46]

pi(t+τ)pi(t)=i{Tiipi(t)Tiipi(t)},\displaystyle p_{\textbf{i}}(t+\tau)-p_{\textbf{i}}(t)=\sum_{\textbf{i}^{{}^{\prime}}}\left\{T_{\textbf{i}\textbf{i}^{{}^{\prime}}}p_{\textbf{i}^{{}^{\prime}}}(t)-T_{\textbf{i}^{{}^{\prime}}\textbf{i}}p_{\textbf{i}}(t)\right\},

where TiiT_{\textbf{i}\textbf{i}^{{}^{\prime}}} and TiiT_{\textbf{i}^{{}^{\prime}}\textbf{i}} denote the transition probabilities between configurations i\textbf{i}^{\prime} and i. Technically, we can obtain the so-called stationary distribution p¯i\bar{p}_{\textbf{i}}, by searching the eigenvector associated with the eigenvalue 1 of the transition matrix Λ=[Tij]T\Lambda=[T_{\textbf{ij}}]^{T} with dimension (Z+1)(Z+2)2×(Z+1)(Z+2)2\frac{(Z+1)(Z+2)}{2}\times\frac{(Z+1)(Z+2)}{2}. The transition probability between two adjacent states can be computed as

Ti(iCI,iT)i(iCI+1,iT)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}^{{}^{\prime}}(i_{CI}+1,i_{T})} =\displaystyle= TUCI,\displaystyle T_{U\rightarrow CI},
Ti(iCI,iT)i(iCI1,iT)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}^{{}^{\prime}}(i_{CI}-1,i_{T})} =\displaystyle= TCIU,\displaystyle T_{CI\rightarrow U},
Ti(iCI,iT)i(iCI,iT+1)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}^{{}^{\prime}}(i_{CI},i_{T}+1)} =\displaystyle= TUT,\displaystyle T_{U\rightarrow T},
Ti(iCI,iT)i(iCI,iT1)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}^{{}^{\prime}}(i_{CI},i_{T}-1)} =\displaystyle= TTU,\displaystyle T_{T\rightarrow U},
Ti(iCI,iT)i(iCI1,iT+1)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}^{{}^{\prime}}(i_{CI}-1,i_{T}+1)} =\displaystyle= TCIT,\displaystyle T_{CI\rightarrow T},
Ti(iCI,iT)i(iCI+1,iT1)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}^{{}^{\prime}}(i_{CI}+1,i_{T}-1)} =\displaystyle= TTCI.\displaystyle T_{T\rightarrow CI}.

The transition probability between two nonadjacent states i and v is Tiv=0T_{\textbf{i}\textbf{v}}=0. Thus the probability of the system staying in the current state is

Ti(iCI,iT)i(iCI,iT)\displaystyle T_{\textbf{i}(i_{CI},i_{T})\rightarrow\textbf{i}(i_{CI},i_{T})} =\displaystyle= 1ii(Tii+Tii).\displaystyle 1-\sum_{\textbf{i}\neq\textbf{i}^{{}^{\prime}}}(T_{\textbf{i}\textbf{i}^{{}^{\prime}}}+T_{\textbf{i}^{{}^{\prime}}\textbf{i}}).

In one discrete time step, the probability of the system transferring from one state to another can be calculated by equation (5).

In addition to the analysis of the stationary distribution of the system, another important quantity for studying the evolutionary dynamics in finite populations is the gradient of selection, which indicates the most likely evolutionary path when the system leaves the current configuration [47]. The gradient of selection is described as

i\displaystyle\nabla_{\textbf{i}} =(TiCI+TiCI)uCI+(TiT+TiT)uT,\displaystyle=(T_{\textbf{i}}^{CI+}-T_{\textbf{i}}^{CI-})\textbf{u}_{\textbf{CI}}+(T_{\textbf{i}}^{T+}-T_{\textbf{i}}^{T-})\textbf{u}_{\textbf{T}}, (6)

where uCI\textbf{u}_{\textbf{CI}} and uT\textbf{u}_{\textbf{T}} are a set of standard orthogonal bases, and we set uCI=(1,0)T\textbf{u}_{\textbf{CI}}=(1,0)^{T} and uT=(0,1)T\textbf{u}_{\textbf{T}}=(0,1)^{T} in this work. TiCI+(TiCI)T_{\textbf{i}}^{CI+}(T_{\textbf{i}}^{CI-}) and TiT+(TiT)T_{\textbf{i}}^{T+}(T_{\textbf{i}}^{T-}) respectively denote the probabilities that the numbers of CICI agents and TT agents increase (decrease) one, which read

TiCI+\displaystyle T_{\textbf{i}}^{CI+} =\displaystyle= TUCI+TTCI,\displaystyle T_{U\rightarrow CI}+T_{T\rightarrow CI},
TiCI\displaystyle T_{\textbf{i}}^{CI-} =\displaystyle= TCIU+TCIT,\displaystyle T_{CI\rightarrow U}+T_{CI\rightarrow T},
TiT+\displaystyle T_{\textbf{i}}^{T+} =\displaystyle= TUT+TCIT,\displaystyle T_{U\rightarrow T}+T_{CI\rightarrow T},
TiT\displaystyle T_{\textbf{i}}^{T-} =\displaystyle= TTU+TTCI.\displaystyle T_{T\rightarrow U}+T_{T\rightarrow CI}.

Furthermore, we provide an important index to describe the average level of each strategy. Concretely, the average levels of CI,T,CI,T, and UU strategies, averaging over all possible states i, weighted with the corresponding stationary distribution pi¯\bar{p_{\textbf{i}}}, are computed as

ρ¯CI\displaystyle\bar{\rho}_{CI} =\displaystyle= iiiCIpi¯Z,\displaystyle\sum_{\textbf{i}}\frac{\textbf{i}_{i_{CI}}\bar{p_{\textbf{i}}}}{Z},
ρ¯T\displaystyle\bar{\rho}_{T} =\displaystyle= iiiTpi¯Z,\displaystyle\sum_{\textbf{i}}\frac{\textbf{i}_{i_{T}}\bar{p_{\textbf{i}}}}{Z},
ρ¯U\displaystyle\bar{\rho}_{U} =\displaystyle= iiiUpi¯Z,\displaystyle\sum_{\textbf{i}}\frac{\textbf{i}_{i_{U}}\bar{p_{\textbf{i}}}}{Z},

where iiCI,iiT\textbf{i}_{i_{CI}},\textbf{i}_{i_{T}}, and iiU\textbf{i}_{i_{U}} denote the number of CI,TCI,T, and UU agents in the configuration i, respectively.

Subsequently, by using fS(i)f_{S}(\textbf{i}) combined with the stationary distribution p¯i\bar{p}_{\textbf{i}}, we can calculate the average payoff of one agent with SS strategy as

f¯S=ipi¯fS(i),\displaystyle\bar{f}_{S}=\sum_{\textbf{i}}\bar{p_{\textbf{i}}}f_{S}(\textbf{i}),

where S=CI,T,S=CI,T, or UU.

In the following, we investigate the gradient of selection and the stationary distribution to study the evolutionary dynamics of CICI, TT, and UU strategies in finite well-mixed populations.

Refer to caption
Figure 2: Evolutionary dynamics of Conditional Investor (CICI), Trustworthy agents (TT), and Untrustworthy agents (UU) for three different threshold values MM in finite populations. Arrows in the simplex S3S_{3} represent the gradient of selection, which shows the most likely evolutionary paths after the system leaving the current state, calculated from equation (6). The colorbar describes the values of stationary distribution at each configuration. The darker the color, the longer the population spends in these states. Parameters are Z=100,N=4,tv=1,RU=8,RT=6,μ=1/Z,w=0.8,σ=0.1,Z=100,N=4,tv=1,R_{U}=8,R_{T}=6,\mu=1/Z,w=0.8,\sigma=0.1, and β=5\beta=5. Here, M=0M=0 in panel (a), M=2M=2 in panel (b), and M=4M=4 in panel (c).
Refer to caption
Figure 3: Average frequency of strategies CICI (blue cylinder), TT (red cylinder), and UU (purple cylinder) for different investment threshold MM. The intermediate investment threshold is conducive to the formation of CICI and TT alliance, and then effectively improves the level of trust. Parameters are Z=100,N=4,tv=1,RU=8,RT=6,w=0.8,μ=1/Z,σ=0.1,Z=100,N=4,tv=1,R_{U}=8,R_{T}=6,w=0.8,\mu=1/Z,\sigma=0.1, and β=5\beta=5.
Refer to caption
Figure 4: Average payoffs of CICI (blue cylinder), TT (red cylinder), and UU (purple cylinder) agents for different investment threshold values MM. The advantage of the alliance formed by CICI and TT agents can be boosted by the intermediate investment threshold MM. Parameters are Z=100,N=4,tv=1,RU=8,RT=6,μ=1/Z,w=0.8,σ=0.1,Z=100,N=4,tv=1,R_{U}=8,R_{T}=6,\mu=1/Z,w=0.8,\sigma=0.1, and β=5\beta=5.

4 Results

Refer to caption
Figure 5: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the multiply factor of UU agents RUR_{U}. Panel (a) shows the average levels of three strategies as a function of RUR_{U}. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different RUR_{U} values. Parameter values are Z=100,N=4,M=2,tv=1,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=1,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (a); Z=100,N=4,M=2,tv=1,RU=6.66,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=1,R_{U}=6.66,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (b); Z=100,N=4,M=2,tv=1,RU=7.98,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=1,R_{U}=7.98,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (c); Z=100,N=4,M=2,tv=1,RU=9.96,w=0.8,σ=0.1,RT=6,μ=1/Z,β=5Z=100,N=4,M=2,tv=1,R_{U}=9.96,w=0.8,\sigma=0.1,R_{T}=6,\mu=1/Z,\beta=5 in panel (d).
Refer to caption
Figure 6: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the investment cost tvtv. Panel (a) shows the average levels of three strategies as a function of tvtv. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different tvtv values. Parameter values are Z=100,N=4,M=2,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (a); Z=100,N=4,M=2,tv=3,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=3,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (b); Z=100,N=4,M=2,tv=7,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=7,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (c); Z=100,N=4,M=2,tv=11,RU=8,w=0.8,σ=0.1,RT=6,μ=1/Z,β=5Z=100,N=4,M=2,tv=11,R_{U}=8,w=0.8,\sigma=0.1,R_{T}=6,\mu=1/Z,\beta=5 in panel (d).

We first present the results of evolutionary dynamics in finite populations with the conditional investment strategy for different values of MM. In Fig. 2, we investigate the stationary distribution and the gradient of selection to study what roles conditional investment plays in the evolution of trust. Clearly, if the investment tendency is strong enough (M=0M=0), CICI agents will become unconditional investors. We can notice that the population will spend a significant time on configurations near the TUTU-edge (TT means trustworthy agents and UU means untrustworthy agents) of the triangle simplex, as shown in Fig. 2(a). Besides, most of the arrows flow to the coexistence states of TT and UU where UU is highly prevalent. The weakening of investment tendency will lead the population to spending most of the time in states with a high prevalence of TT and CICI (see Fig. 2(b)). If MM further increases to M=NM=N, CICI agents invest in the first round but refuse to invest during the remaining rounds. In this case, each agent only retains the benefits of the first round of the game, and no one can obtain non-zero payoffs from the remaining rounds. The evolutionary outcomes in finite populations are similar to the result reported in Fig. 2(a), that is, the system spends most of the time in states where TT and UU agents coexist (see Fig. 2(c)).

Furthermore, we investigate how the investment threshold influences the stationary frequency of strategies CI,T,CI,T, and UU, as shown in Fig. 3. We find that when the threshold MM is zero, UU can be more prevalent than CICI and TT. With the increase of threshold, CICI and TT agents can form a strong alliance, preventing the invasion of UU agents. If MM further increases to NN, CICI agents only invest in the first round, and thus all agents can only obtain benefit from the first round. In this case, the advantage of UU agents is greater than that of TT and CICI agents.

To probe deeper into the underlying mechanisms responsible for such a significant improvement of trust stemming from the investment threshold, we show in Fig. 4 how the overall average payoffs of CI,TCI,T, and UU agents change with MM. It can be seen that the average payoffs of all agents increase first and then decrease. Particularly, when M=0M=0 or M=NM=N, the average payoffs of CICI agents are negative (see the inset of Fig. 4). In general, an intermediate investment threshold can ensure a more evident advantage of alliance formed by CICI and TT agents over UU agents, even if this advantage can shrink with the increase of MM.

Refer to caption
Figure 7: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the observation cost σ\sigma. Panel (a) shows the average levels of three strategies as a function of σ\sigma. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different σ\sigma values. Parameter values are Z=100,N=4,M=2,RU=8,w=0.8,RT=6,μ=1/Z,β=5,tv=1Z=100,N=4,M=2,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,tv=1 in panel (a); Z=100,N=4,M=2,tv=3,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=1Z=100,N=4,M=2,tv=3,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=1 in panel (b); Z=100,N=4,M=2,tv=1,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=3Z=100,N=4,M=2,tv=1,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=3 in panel (c); Z=100,N=4,M=2,tv=1,RU=8,w=0.8,σ=5,RT=6,μ=1/Z,β=5Z=100,N=4,M=2,tv=1,R_{U}=8,w=0.8,\sigma=5,R_{T}=6,\mu=1/Z,\beta=5 in panel (d).
Refer to caption
Figure 8: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the discount factor ww. Panel (a) shows the average levels of three strategies as a function of ww. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different ww values. Parameter values are Z=100,N=4,M=2,RU=8,σ=0.1,RT=6,μ=1/Z,β=5,tv=1Z=100,N=4,M=2,R_{U}=8,\sigma=0.1,R_{T}=6,\mu=1/Z,\beta=5,tv=1 in panel (a); Z=100,N=4,M=2,tv=3,RU=8,w=2/3,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=3,R_{U}=8,w=2/3,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (b); Z=100,N=4,M=2,tv=1,RU=8,w=4/5,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=4,M=2,tv=1,R_{U}=8,w=4/5,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (c); Z=100,N=4,M=2,tv=1,RU=8,w=6/7,σ=0.1,RT=6,μ=1/Z,β=5Z=100,N=4,M=2,tv=1,R_{U}=8,w=6/7,\sigma=0.1,R_{T}=6,\mu=1/Z,\beta=5 in panel (d).

In order to illustrate the robustness of all results obtained in this paper, we first present how evolutionary dynamics of CI,TCI,T, and UU change with the multiply factor of UU agents, RUR_{U} (see Fig. 5). Concretely, we show how the average levels of three strategies vary with RUR_{U} in Fig. 5(a), and we find the average levels of CI,TCI,T, and UU remain almost constant with the increase of RUR_{U}. In addition, we show the evolutionary outcomes of the gradient of selection and the stationary distribution, which allow visualization of the dynamics, for three different values RUR_{U} in Fig. 5(b)-(d). It shows that the population spends a significant time near the CICI-TT edge of the simplex S3S_{3}, and most of the arrows in the simplex flow to the intermediate region of CICI-TT edge, indicating that conditional investors can form an alliance with trustworthy trustees. It is worth pointing out that we set RU=6.66R_{U}=6.66, 7.98, and 9.969.96, which can respectively correspond to the mild, moderate, and harsh interaction environment in the trust game (see Ref. [8]). Different from previous findings in [8], our results show that trust can be maintained no matter whether the environment of the trust game is mild or harsh.

Refer to caption
Figure 9: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the intensity of selection β\beta. Panel (a) shows the average levels of three strategies as a function of β\beta. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different β\beta values. Parameter values are Z=100,N=4,M=2,RU=8,σ=0.1,RT=6,w=0.8,μ=1/Z,tv=1Z=100,N=4,M=2,R_{U}=8,\sigma=0.1,R_{T}=6,w=0.8,\mu=1/Z,tv=1 in panel (a); Z=100,N=4,M=2,tv=3,RU=8,w=0.8,RT=6,μ=1/Z,β=2,σ=0.1Z=100,N=4,M=2,tv=3,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=2,\sigma=0.1 in panel (b); Z=100,N=4,M=2,tv=1,RU=8,w=0.8,RT=6,μ=1/Z,β=6,σ=0.1Z=100,N=4,M=2,tv=1,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=6,\sigma=0.1 in panel (c); Z=100,N=4,M=2,tv=1,RU=8,w=0.8,σ=0.1,RT=6,μ=1/Z,β=10Z=100,N=4,M=2,tv=1,R_{U}=8,w=0.8,\sigma=0.1,R_{T}=6,\mu=1/Z,\beta=10 in panel (d).
Refer to caption
Figure 10: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the mutation rate μ\mu. Panel (a) shows the average levels of three strategies as a function of μ\mu. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different μ\mu values. Parameter values are Z=100,N=4,M=2,RU=8,σ=0.1,RT=6,w=0.8,β=5,tv=1Z=100,N=4,M=2,R_{U}=8,\sigma=0.1,R_{T}=6,w=0.8,\beta=5,tv=1 in panel (a); Z=100,N=4,M=2,tv=3,RU=8,w=0.8,RT=6,μ=105,β=5,σ=0.1Z=100,N=4,M=2,tv=3,R_{U}=8,w=0.8,R_{T}=6,\mu=10^{-5},\beta=5,\sigma=0.1 in panel (b); Z=100,N=4,M=2,tv=1,RU=8,w=0.8,RT=6,μ=104,β=5,σ=0.1Z=100,N=4,M=2,tv=1,R_{U}=8,w=0.8,R_{T}=6,\mu=10^{-4},\beta=5,\sigma=0.1 in panel (c); Z=100,N=4,M=2,tv=1,RU=8,w=0.8,σ=0.1,RT=6,μ=103,β=5Z=100,N=4,M=2,tv=1,R_{U}=8,w=0.8,\sigma=0.1,R_{T}=6,\mu=10^{-3},\beta=5 in panel (d).
Refer to caption
Figure 11: Robustness of the evolutionary outcomes of CI,TCI,T, and UU to the changes of the group size NN. Panel (a) shows the average levels of three strategies as a function of NN. Panels (b)-(d) show the stationary distribution and the gradient of selection for three different NN values. Parameter values are Z=100,M/N=0.5,RU=8,σ=0.1,RT=6,w=0.8,β=5,μ=1/Z,tv=1Z=100,M/N=0.5,R_{U}=8,\sigma=0.1,R_{T}=6,w=0.8,\beta=5,\mu=1/Z,tv=1 in panel (a); Z=100,N=6,M=3,tv=3,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=6,M=3,tv=3,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (b); Z=100,N=8,M=4,tv=1,RU=8,w=0.8,RT=6,μ=1/Z,β=5,σ=0.1Z=100,N=8,M=4,tv=1,R_{U}=8,w=0.8,R_{T}=6,\mu=1/Z,\beta=5,\sigma=0.1 in panel (c); Z=100,N=10,M=5,tv=1,RU=8,w=0.8,σ=0.1,RT=6,μ=1/Z,β=5Z=100,N=10,M=5,tv=1,R_{U}=8,w=0.8,\sigma=0.1,R_{T}=6,\mu=1/Z,\beta=5 in panel (d).

In what follows, we investigate the impact of the investment cost of investors on the evolutionary dynamics (see Fig. 6). We find that with the increase of tvtv value, the average levels of three strategies are almost unchanged (see Fig. 6(a)). In addition, we find that changing the tvtv values does not influence the gradient of selection and the stationary distribution of the system (see Fig. 6(b)-(d)).

In Fig. 7, we further present how observation cost affects the evolutionary dynamics of CICI, TT, and UU strategies. We find that the increase of observation cost will not change the evolutionary advantage of CICI and TT alliance over UU agents. Concretely, the average level of CICI decreases with the observation cost, the average level of TT increases, but the average level of UU remains almost unchanged (see Fig. 7 (a)). Furthermore, we find that the entire population will spend most time near configurations in which CICI and TT agents coexist. In addition, with the increase of observation cost, the background shadow area is gradually moving towards the vertex TT (see Fig. 7(b)-(d)).

In Fig. 8, we investigate the role of discount factors ww in the evolutionary dynamics of CICI, TT, and UU strategies. Our results show that the average levels of these three strategies remain almost unchanged with the increase of ww (see Fig. 8 (a)). Besides, for three different ww values, we find that our results are robust, that is, the population will spend most of the time in the intermediate region of CITCI-T edge, and most arrows flow to the coexistence states of CICI and TT (see Fig. 8(b)-(d)).

Furthermore, the effects of the mutation rate μ\mu and the intensity of selection β\beta on the evolutionary dynamics are investigated (Figs. 9 and 10). When β\beta is small (β<103\beta<10^{-3}) or when μ\mu is significantly large (close to 1), the evolutionary process is mainly affected by imitation error or behavioral mutation, which leads to the average levels of all strategies close to 1/31/3 (see Figs. 9(a) and 10(a)). As β\beta increases or μ\mu decreases, a strategy’s performance becomes increasingly important for the strategy’s survival, eventually favoring the emerging alliance of the CICI and TT strategies, which crowds out the most UU agents (see Fig. 9 (b)-(d) and Fig. 10 (b)-(d)).

In what follows, we investigate how evolutionary dynamics of CI,TCI,T, and UU strategies change with the group size NN (Fig. 11). Here we set M/N=0.5M/N=0.5, and we find that with the increase of NN, the average levels of CI,TCI,T, and UU remain almost unchanged (see Fig. 11(a)). In particular, the average level of TT is the highest, the second largest frequency is formed by CICI, while UU makes up the smallest fraction. In addition, the dynamic visualization results show that the population spends most time in the intermediate region of CITCI-T edge, and most arrows point to the states where CICI and TT agents coexist when NN changes appropriately (see Fig. 11(b)-(d)).

At the end of this section, we would like to point out that the work done here can have wider implications on other areas such as cognitive science, behavioral science, artificial intelligence, economics, and management science. Particularly, in the management system, developing and maintaining trust can promote the emergence of social exchange and economic transactions, which is significantly important for the effectiveness of management and organization. By investigating the evolutionary dynamics of trust in repeated group interactions, we show that the introduction of the conditional investment strategy can lead to the emergence of high trust and high trustworthiness in repeated group interactions, which can explain why trusting in others and reciprocating them with trustworthy actions can be sustained in social and economic interactions.

5 Conclusions and Discussion

In this work, we study the evolutionary dynamics of the conditional investment strategy in the NTG in which agents engage in repeated group interactions. Different from the setting of NTG considered in previous works [1, 22], we consider that investors can adjust their investment decisions according to the investment environment in repeated group interactions framework. Concretely, we use the tolerance threshold as an important parameter to describe the investment tendency of investors. We find that an intermediate threshold can lead to the evolution of high trust and high trustworthiness, while too low threshold level or too high threshold level cannot promote the evolution of trust. In summary, as the answers to the initially proposed questions, we can conclude that the introduction of the conditional investment strategy provides an avenue for trust to thrive in repeated group interactions.

As we have emphasized above, our model setup is well aligned with reality in which investors generally make decisions based on the current investment environment rather than investing blindly. A good example where our model could apply is the trust relationship between consumers and suppliers [20]. Consumers will refuse to consume if they find more suppliers selling inferior products during daily shopping on Amazon. Therefore, the collection of information is particularly important. The key assumption of conditional investment is that investors pay a permanent observation cost σ\sigma to collect information about the trustworthiness of the trustee, and then make decisions in the subsequent possible group interactions based on what they observe. We have shown that increasing σ\sigma leads to the decrease of the frequency of conditional investors and the increase of the frequency of trustworthy agents, while has little effect on the frequency of untrustworthy agents. Therefore, the result that the alliance formed by CICI and TT agents can effectively resist the invasion of UU agents is robust to the observation cost.

Lastly, it is worth emphasizing that in addition to the observation cost, our results are more robust to the change of other model parameters. Previous work on networked NTG has revealed that the level of trust is correlated with how “difficult” the game is [8]. In our model, even though the social dilemma is extremely difficult (i.e. the ratio of temptation to defect RURT>1.66\frac{R_{U}}{R_{T}}>1.66), a high level of trust can always be achieved (see Fig. 5(a)). Besides, for the sake of mathematical convenience, analysis of evolutionary dynamics of NTG has been mostly dealt with either in the limit of rare mutations [26] or in the limit of weak selection [44]. Here our approach can be applied to arbitrary mutation rate and arbitrary intensity of selection values. We find that appropriate changes in mutation rate and intensity of selection will not affect our main results (see Fig. 9 and Fig. 10).

Future work could explore the evolutionary dynamics of the repeated NTG in structured populations where the interactions among agents are typically not random but rather limited to a subset of the population [6, 43, 49]. Indeed, the network where agents interact with others has different structures, and the MDP approach adopted in our work is suitable for complete graph networks and can also be extended to other network structures [7]. Besides, we can use theoretical approximation and agent-based simulations to study the evolutionary dynamics of trust on any population structure [3].

CRediT authorship contribution statement

Linjie Liu: Conceptualization, Methodology, Writing - original draft. Xiaojie Chen: Conceptualization, Formal analysis, Writing - review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no competing financial interests.

Acknowledgment

This work was supported by the National Natural Science Foundation of China (Grants Nos. 61976048 and 62036002) and the Fundamental Research Funds of the Central Universities of China. L.L. acknowledges the support from Special Project of Scientific and Technological Innovation (Grant No. 2452022107).

References

  • [1] H. Abbass, G. Greenwood, E. Petraki, NN -player trust game and its replicator dynamics, IEEE Trans. Evol. Comput. 20(3) (2016) 470-474.
  • [2] A. Ahmed, Conditional reciprocity in the investment game, Soc. Sci. J. 48(2) (2011) 404-408.
  • [3] B. Allen, G. Lippner, Y. T. Chen, B. Fotouhi, N. Momeni, S. T. Yau, M. A. Nowak, Evolutionary dynamics on any population structure, Nature 544(7649) (2017) 227-230.
  • [4] J. Berg, J. Dickhaut, K. McCabe, Trust, reciprocity, and social history, Games Econ. Behav. 10(1) (1995) 122-142.
  • [5] S. R. Bond, M. Söderbom, Conditional investment–cash flow sensitivities and financing constraints, J. Eur. Econ. Assoc. 11(1) (2013) 112-136.
  • [6] X. Chen, F. Fu, L. Wang, Interaction stochasticity supports cooperation in spatial prisoner’s dilemma, Phys. Rev. E 78(5) (2008) 051120.
  • [7] Y. Chen, K. R. Liu, Indirect reciprocity game modelling for cooperation stimulation in cognitive networks, IEEE Trans. Commun. 59(1) (2011) 159-168.
  • [8] M. Chica, R. Chiong, M. Kirley, H. Ishibuchi, A networked NN-player trust game and its evolutionary dynamics, IEEE Trans. Evol. Comput. 22(6) (2018) 866-878.
  • [9] M. Chica, R. Chiong, J. Ramasco, and H. Abbass, Effects of update rules on networked n-player trust game dynamics, Commun. Nonlinear Sci. Numer. Simul. 79(2019) 104870.
  • [10] R. Chiong, M. Kirley, Effects of iterated interactions in multiplayer spatial evolutionary games, IEEE Trans. Evol. Comput. 16(4) (2012) 537-555.
  • [11] S. Y. Chong, X. Yao, Behavioral diversity, choices and noise in the iterated prisoner’s dilemma, IEEE Trans. Evol. Comput. 9(6) (2005) 540-551.
  • [12] J. F. Crow, An Introduction to Population Genetics Theory, (Harper and Row New York 1970).
  • [13] M. Deutsch, Cooperation and trust: Some theoretical notes, Nebr. Sym. Motiv. (1962) 275–320.
  • [14] W. J. Ewens, Mathematical Population Genetics, (Springer New York 2004).
  • [15] L. Fang, G. Shi, L. Wang, Y. Li, S. Xu, Y. Guo, Incentive mechanism for cooperative authentication: An evolutionary game approach. Inf. Sci. 527 (2020) 369-381
  • [16] X. Fang, X. Chen, Evolutionary dynamics of trust in the N-player trust game with individual reward and punishment, Eur. Phys. J. B 94 (2021) 176. .
  • [17] P. García-Victoria, M. Cavaliere, M. A. Gutiérrez-Naranjo, M. Cárdenas-Montes, Evolutionary game theory in a cell: A membrane computing approach. Inf. Sci. 589 (2022) 580-594.
  • [18] W. Güth, P. Ockenfels, M. Wendel, Cooperation based on trust. An experimental investigation, J. Econ. Psychol. 18(1) (1997) 15-43.
  • [19] T. A. Han, C. Perret, S. T. Powers, When to (or not to) trust intelligent machines: Insights from an evolutionary game theory analysis of trust in repeated games, Cogn. Syst. Res. 68 (2021) 111-124.
  • [20] F. Hawlitschek, T. Teubner, C. Weinhardt, Trust in the sharing economy, Die Unternehmung 70(1) (2016) 26-44.
  • [21] C. Hilbe, S̆. S̆imsa, K. Chatterjee, M. A. Nowak, Evolution of cooperation in stochastic games, Nature 559(7713) (2018) 246-249.
  • [22] Z. Hu, X. Li, J. Wang, C. Xia, Z. Wang, M. Perc, Adaptive reputation promotes trust in social networks, IEEE Trans. Net. Sci. Eng. 40(1) (2021) 2327-4697.
  • [23] B. King-Casas, D. Tomlin, C. Anen, C. F. Camerer, S. R. Quartz, P. R. Montague, Getting to know you: reputation and trust in a two-person economic exchange, Science 308(5718) (2005) 78–83.
  • [24] A. Kumar, V. Capraro, and M. Perc, The evolution of trust and trustworthiness, J. R. Soc. Interface 17(169) (2020) 20200491.
  • [25] H. Liang, Y. Cui, X. Ren, X. Wang, Almost sure exponential stability of two-strategy evolutionary games with multiplicative noise. Inf. Sci. 579 (2021) 888-903.
  • [26] I. S. Lim, Stochastic evolutionary dynamics of trust games with asymmetric parameters, Phys. Rev. E 102(6) (2020) 062419.
  • [27] L. Liu, X. Chen, Evolutionary game dynamics in multiagent systems with prosocial and antisocial exclusion strategies, Knowl-Based Syst. 188 (2020) 104835.
  • [28] L. Liu, Z. Xiao, X. Chen, A. Szolnoki, Early exclusion leads to cyclical cooperation in repeated group interactions. J. R. Soc. Interface 19(188) (2022) 20210755.
  • [29] G. Martínez-Cánovas, E. Del Val, V. Botti, P. Hernández, M. Rebollo, A formal model based on game theory for the analysis of cooperation in distributed service discovery. Inf. Sci. 326 (2016) 59-70.
  • [30] L. A. Martinez-Vaquero, F. C. Santos, V. Trianni, Signalling boosts the evolution of cooperation in repeated group interactions, J. R. Soc. Interface 17(172) (2020) 20200635.
  • [31] M. Perc, J. Gómez-Gardenes, A. Szolnoki, L. M. Floría, Y. Moreno, Evolutionary dynamics of group interactions on structured populations: a review, J. R. Soc. Interface, 10(80) (2013) 20120997.
  • [32] M. Perc, J. J. Jordan, D. G. Rand, Z. Wang, S. Boccaletti, and A. Szolnoki, The evolution of trust and cooperation between strangers: A computational model, Phys. Rep. 687 (2017) 1-51.
  • [33] J. Quan, Y. Zhou, X. Ma, X. Wang, J. B. Yang, Integrating emotion-imitating into strategy learning improves cooperation in social dilemmas with extortion, Knowl-Based Syst. 233 (2021) 107550.
  • [34] J. Quan, X. Li, X. Wang, The evolution of cooperation in spatial public goods game with conditional peer exclusion, Chaos 29(10) (2019) 103137.
  • [35] J. Quan, W. Liu, Y. Chu, X. Wang, Stochastic evolutionary voluntary public goods game with punishment in a Quasi-birth-and-death process. Sci. Rep. 7 (2017) 16110.
  • [36] J. Quan, J. Zheng, X. Wang, X. Yang, Benefits of asynchronous exclusion for the evolution of cooperation in stochastic evolutionary optional public goods games. Sci. Rep. 9 (2019) 8208.
  • [37] K. Sigmund, D. H. Silva, A. Traulsen, C. Hauert, Social learning promotes institutions for governing the commons, Nature 466(7308) (2010) 861-863.
  • [38] W. Sun, L. Liu, X. Chen, A. Szolnoki, V. V. Vasconcelos, Combination of institutional incentives for cooperative governance of risky commons, iScience 24(8) (2021) 102844.
  • [39] R. L. Swinth, The establishment of the trust relationship, J. Conflict Resolut 11(3) (1967) 335–344.
  • [40] G. Szabó, C. Tőke, Evolutionary prisoner’s dilemma game on a square lattice, Phys. Rev. E 58(1) (1998) 69.
  • [41] A. Szolnoki, M. Perc, Correlation of positive and negative reciprocity fails to confer an evolutionary advantage: Phase transitions to elementary strategies, Phys. Rev. X, 3(4) (2013) 041021.
  • [42] A. Szolnoki, M. Mobilia, L. L. Jiang, B. Szczesny, A. M. Rucklidge, M. Perc, Cyclic dominance in evolutionary games: a review, J. R. Soc. Interface, 11(100) (2014) 20140735.
  • [43] A. Szolnoki, M. Perc, Conformity enhances network reciprocity in evolutionary social dilemmas, J. R. Soc. Interface 12(103) (2015) 20141299.
  • [44] C. E. Tarnita, Fairness and trust in structured populations, Games 6(3) (2015) 214-230.
  • [45] R. Urena, G. Kou, Y. Dong, F. Chiclana, E. Herrera-Viedma, A review on trust propagation and opinion dynamics in social networks and group decision making frameworks. Inf. Sci. 478 (2019) 461-475.
  • [46] K. Van, G. Nicolaas, Stochastic Processes in Physics and Chemistry, (Elsevier 1992).
  • [47] V. V. Vasconcelos, F. C. Santos, J. M. Pacheco, A bottom-up institutional approach to cooperative governance of risky commons, Nat. Clim. Change 3(9) (2013) 797–801.
  • [48] C. Zhang, S. Liu, Z. Wang, F. J. Weissing, J. Zhang, The “self-bad, partner-worse” strategy inhibits cooperation in networked populations, Inf. Sci. 585 (2022) 58-69.
  • [49] J. Zhang, J. Lou, J. Qiu, J. Lu, Dynamics and convergence of hyper-networked evolutionary games with time delay in strategies, Inf. Sci. 563 (2021) 166-182.