This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Mean-Field Game Analysis of SIR Model with Social Distancing

Samuel Cho Program for Quantitative and Computational Biology, Princeton University
Abstract

The current COVID-19 pandemic has proven that proper control and prevention of infectious disease require creating and enforcing the appropriate public policies. One critical policy imposed by the policymakers is encouraging the population to practice social distancing (i.e. controlling the contact rate among the population). Here we pose a mean-field game model of individuals each choosing a dynamic strategy of making contacts, given the trade-off of gaining utility but also risking infection from additional contacts. We compute and compare the mean-field equilibrium (MFE) strategy, which assumes each individual acting selfishly to maximize its own utility, to the socially optimal strategy, which maximizes the total utility of the population. We prove that the optimal decision of the infected is always to make more contacts than the level at which it would be socially optimal, which reinforces the important role of public policy to reduce contacts of the infected (e.g. quarantining, sick paid leave). Additionally, we include cost to incentivize people to change strategies, when computing the socially optimal strategies. We find that with this cost, policies reducing contacts of the infected should be further enforced after the peak of the epidemic has passed. Lastly, we compute the price of anarchy (PoA) of this system, to understand the conditions under which large discrepancies between the MFE and socially optimal strategies arise, which is when intervening public policy would be most effective.

1 Introduction

The current COVID-19 pandemic has been evidence that posing a public policy to fight such a crisis is an extremely difficult and interdisciplinary task. A major component of such policies is urging people to practice social distancing. By reducing interpersonal contacts, the spread of infection can be slowed down. However, it is uncertain how to incentivize people to practice social distancing, when there are clearly numerous benefits to making contacts, such as working for income, general desire for freedom and social relationships.

Given this trade-off between the additional utility from making more contacts, and the additional chance of infection from those contacts, we will compute and compare 1) the selfish strategies, in which individuals make contacts to optimize only their own utilities and 2) socially optimal strategies, in which individuals make contacts which optimize the utility of the total population.

The approach of focusing on the economic causes and epidemiological consequences is referred to as economic epidemiology[1]. Recent work has focused on treating economic factors behind contact and mixing decisions as part of the disease transmission mechanism[2, 3, 4, 5, 6]. More specifically, some previous work in this field has explicitly included contact behaviors as control variables into the classical SIR model and have shown that different dynamics can emerge with adaptive behavior[7, 8, 9]. Disease dynamics has also been studied with game theory, focusing mostly on steady-state problems or those related to vaccination[10, 11, 12]. Optimal control theory has also been used to study policy interventions on infectious disease dynamics[13, 14]. Our work here builds on these previous models to pose a mean-field game problem of social distancing, explicitly modeling the feedback between the individuals and the population structure. The individual, which is susceptible (S), infected (I), or recovered (R), each chooses a dynamic contact strategy to maximize its accumulated utility over the time period.

Mean-field games is a recently developed field of mathematics, studying dynamic game-theoretic problems of infinite number of players[15, 16]. Our model is a relatively simple formulation of the mean-field games, in which we have a deterministic game with 3 discrete states (S,I,R) in continuous time. Depending on the number of contacts each individual makes, the dynamics of the infectious population follow the SIR model, and the anticipation of this population structure influences the computation of an individual’s optimal strategy. We take mean-field assumptions such that 1) the number of individuals in the population is large (N)(N\to\infty), 2) the individuals are homogeneous within each compartment and the population is well-mixed, and 3) individuals engage in symmetric interactions.

2 Model

2.1 SIR dynamics

The focus of this work is on understanding the effect of social distancing behavior and the corresponding utility trade-offs on the epidemiological dynamics, so we will keep the epidemiology as simple as possible as a baseline model. We will devote a section to including an exposed class (SEIR), but leave further expansion on the epidemiology as future work.

We consider an SIR model without births or deaths. The recovered (R) is an absorbing state, where individuals gain permanent immunity from the disease. For now, we do not consider exposed period or asymptomatic or presymptomatic transmissions.

Without birth or death, the population is fixed, and we let xzx_{z} be the fraction of the population in compartment z{S,I,R}z\in\{S,I,R\}. Then the dynamics of the SIR system is

[left=\empheqlbrace]x˙S\displaystyle[left=\empheqlbrace]\dot{x}_{S} =C()βxSxI\displaystyle=-C(\cdot)\beta x_{S}x_{I}
x˙I\displaystyle\dot{x}_{I} =C()βxSxIμxI\displaystyle=C(\cdot)\beta x_{S}x_{I}-\mu x_{I}
x˙R\displaystyle\dot{x}_{R} =μxI\displaystyle=\mu x_{I} (1)

β\beta is the likelihood that infection happens given a contact between S and I, and μ\mu is the constant rate of recovery from infection. C()C(\cdot) is the rate that an S individual and an I individual make contact, which is dependent on whether individuals choose to socially distance or not. Let us write the number of contacts made by a z{S,I,R}z\in\{S,I,R\} individual at time tt as cz(t)c_{z}(t).

It is evident that C()C(\cdot) should increase as cS(t)c_{S}(t) and cI(t)c_{I}(t) increase, but there are two intuitive ways of writing an expression for C()C(\cdot), analogous to frequency-dependent or density-dependent disease transmission.

i) Frequency-dependent
Looking only at the transmission term (suppressing notation for dependence on tt for brevity),

x˙S=βcSxScIxIcSxS+cIxI+cRxR\dot{x}_{S}=-\beta\frac{c_{S}x_{S}c_{I}x_{I}}{c_{S}x_{S}+c_{I}x_{I}+c_{R}x_{R}} (2)

where a contact by S has probability cIxIcSxS+cIxI+cRxR\frac{c_{I}x_{I}}{c_{S}x_{S}+c_{I}x_{I}+c_{R}x_{R}} of being with I. Therefore, the transmission term depends on the relative frequency between the contact strategies.

ii) Density-dependent
Similarly the transmission term is

x˙S=βcSxScIxI\dot{x}_{S}=-\beta^{\prime}c_{S}x_{S}c_{I}x_{I} (3)

Here, the transmission term depends on not only the relative frequency, but also the absolute density of the contact strategies. If the whole population is doubling or halving contacts, the transmission should increase or decrease accordingly, and this density-dependent formulation is consistent with such intuitions. For the rest of this paper, we will take the density-dependent formulation. (Note that β\beta^{\prime} term here is technically different from the β\beta term above, but for simplicity, any β\beta referred to for the rest of this paper is β\beta^{\prime} of the density-dependent formulation.)

2.2 Utility of contacts

cz(t)c_{z}(t) is the contact strategy which is explicitly chosen by an individual. Large cz(t)c_{z}(t) means going out to work and socializing, while smaller cz(t)c_{z}(t) means refraining from those activities (i.e. social distancing). uz(cz)u_{z}(c_{z}) denotes the utility received by a zz individual from making czc_{z} contacts, which is a combination of economic gains and personal well-being. The realistic conditions that we impose are the following: Starting from zero contacts, increasing the number of contacts results in increased utility, which is the baseline universal need for social interactions, as well as monetary gain from economic activities. We assume that the marginal increase in utility decreases with more contacts, until eventually, more contacts become detrimental.

Therefore, the appropriate functional form of uz(cz)u_{z}(c_{z}) must be a concave function with an interior maximum. One appropriate function form with interpretations of the parameters is uz(cz)=(bzczcz2)γazu_{z}(c_{z})=(b_{z}c_{z}-c_{z}^{2})^{\gamma}-a_{z} where cz[0,bz]c_{z}\in[0,b_{z}][8]. bzb_{z} is a parameter of how the disease impacts the marginal economic productivity of the individual. Large bzb_{z} means that the individual has the choice to make more contacts, as well as receive higher marginal utility from making contacts. aza_{z} is a parameter of the baseline cost of being infected, such as an individual’s general propensity to be healthy. These two parameters decompose the effect of the disease into economic cost and health cost. Some assumptions on these parameters are that bS=bR>bIb_{S}=b_{R}>b_{I} and aS=aR<aIa_{S}=a_{R}<a_{I}, since the infected become impaired in economic productivity, as well as suffer the health cost compared to the S and R. γ\gamma changes the concave shape of the function, and γ(0,1]\gamma\in(0,1] ensures that uz(cz)u_{z}(c_{z}) is concave everywhere in the domain.

From this utility function, we see that each individual has some optimal level of social contacts. The utility of a zz individual is maximized at bz2\frac{b_{z}}{2}, which is each individual’s optimal contact strategy in the absence of adaptive behavior in response to the risk of infectious disease.

Refer to caption
Figure 1: uz(cz)u_{z}(c_{z}) is shown where z{S,R}z\in\{S,R\} are healthy (green) and z{I}z\in\{I\} is infected (red) and the parameters are bS=bR=10b_{S}=b_{R}=10, bI=6b_{I}=6, aS=aR=0a_{S}=a_{R}=0, aI=1a_{I}=1, and γ=0.25\gamma=0.25. A healthy individual gains utility from making more contacts, but eventually does not want to make more, when cS=5c_{S}=5. The infected suffers the baseline cost, but also gains some utility from making contacts, although at a lower rate compared to the healthy.

2.3 Value function

Over time period, t[0,T]t\in[0,T], the total utility of an individual is the sum of utility gained at each time point. For example, a recovered individual with continuous contact decision c(t)c(t) receives total utility 0TuR(c(t))𝑑t\int_{0}^{T}u_{R}(c(t))dt. Typically, a discounting term, δ\delta which discounts the future utility compared to the present utility, is included, but we set δ=1\delta=1 for simplicity. The general results here do not change with δ<1\delta<1.

Let us define VR(t)V_{R}(t) to be the total future utility expected by the R individual at time tt until TT. From this formulation, it follows that terminal condition is VR(T)=0V_{R}(T)=0, and

VR(t)=tTuR(c(t))𝑑t=uR(c(t))dt+t+dtTuR(c(t))𝑑t=uR(c(t))dt+VR(t+dt)V_{R}(t)=\int_{t}^{T}u_{R}(c(t))dt=u_{R}(c(t))dt+\int_{t+dt}^{T}u_{R}(c(t))dt=u_{R}(c(t))dt+V_{R}(t+dt) (4)

Similarly for S and I individuals, we define VS(t)V_{S}(t) and VI(t)V_{I}(t), which depend on the rate at which the individuals move between the SIR states. (Fig. 2)

Refer to caption
Figure 2: Between time tt and t+dtt+dt, S, I, and R individuals move between the states at these rates. The S individual becomes infected at a rate dependent on contact rates. The I individual recovers at a constant rate of μdt\mu dt. The recovered individual remains in the state.

From these transition rates, we can write the Bellman equations, which give the value functions for individuals in S, I, and R states, respectively, as:

VS(t)\displaystyle V_{S}(t) =maxcS{tt+dtuS(cS)𝑑t+(1C()βxIdt)VS(t+dt)+C()βxIdtVI(t+dt)}\displaystyle=\max_{c_{S}}\Big{\{}\int_{t}^{t+dt}u_{S}(c_{S})dt+(1-C(\cdot)\beta x_{I}dt)V_{S}(t+dt)+C(\cdot)\beta x_{I}dtV_{I}(t+dt)\Big{\}}
VI(t)\displaystyle V_{I}(t) =maxcI{tt+dtuI(cI)𝑑t+(1νdt)VI(t+dt)+νdtVR(t+dt)}\displaystyle=\max_{c_{I}}\Big{\{}\int_{t}^{t+dt}u_{I}(c_{I})dt+(1-\nu dt)V_{I}(t+dt)+\nu dtV_{R}(t+dt)\Big{\}}
VR(t)\displaystyle V_{R}(t) =maxcR{tt+dtuR(cR)𝑑t+VR(t+dt)}\displaystyle=\max_{c_{R}}\Big{\{}\int_{t}^{t+dt}u_{R}(c_{R})dt+V_{R}(t+dt)\Big{\}} (5)

with terminal conditions, VS(T)=VI(T)=VR(T)=0V_{S}(T)=V_{I}(T)=V_{R}(T)=0.

2.4 Mean Field Equilibrium solution

The SIR dynamics and the Bellman equations are coupled by the contact strategies, cS,cI,cRc_{S},c_{I},c_{R}, and the population, xS,xI,xRx_{S},x_{I},x_{R}. The solution to this problem is the mean-field equilibrium (MFE), which is the fixed point (cSeq,cIeq,cReq,xS,xI,xR)(c_{S}^{eq},c_{I}^{eq},c_{R}^{eq},x_{S},x_{I},x_{R}) such that 1) the strategies cSeq(t)c_{S}^{eq}(t), cIeq(t)c_{I}^{eq}(t), cReq(t)c_{R}^{eq}(t) are the optimal solutions in equations 38 given xS,xI,x_{S},x_{I}, and xRx_{R} and 2) xSx_{S}, xIx_{I}, and xRx_{R} are solutions to the system of ODEs in equation 35 given the optimal strategies.

2.5 Socially Optimal solution

Additionally, we can characterize the socially optimal solution. Similar to the second-best equilibrium, discussed by Lipsey and Lancaster[17][18], if the infected individuals do not pursue the most utility-maximizing decisions, there is another equilibrium, in which the population can attain higher utility on average. If a central planner chooses the contact strategies of the population to maximize the utility of the entire population, the resulting socially optimal solution solves the dynamic optimization problem,

cSopt,cIopt,cRopt=argmax0TxS(t)uS(cS(t))+xI(t)uI(cI(t))+xR(t)uR(cR(t))dt.c_{S}^{opt},c_{I}^{opt},c_{R}^{opt}=\arg\max\int_{0}^{T}x_{S}(t)u_{S}(c_{S}(t))+x_{I}(t)u_{I}(c_{I}(t))+x_{R}(t)u_{R}(c_{R}(t))dt. (6)

We can also define modified version of the socially optimal problem, in which there is cost to move away from the MFE solution. Given the MFE solution, (cSeq,cIeq,cReq)(c_{S}^{eq},c_{I}^{eq},c_{R}^{eq}), the modified problem is

cSopt,cIopt,cRopt=argmax0T\displaystyle c_{S}^{opt},c_{I}^{opt},c_{R}^{opt}=\arg\max\int_{0}^{T} [xS(t)uS(cS(t))+xI(t)uI(cI(t))+xR(t)uR(cR(t))\displaystyle\Big{[}x_{S}(t)u_{S}(c_{S}(t))+x_{I}(t)u_{I}(c_{I}(t))+x_{R}(t)u_{R}(c_{R}(t))
12kzxz(czczeq)2]dt.\displaystyle-\frac{1}{2}k\sum_{z}x_{z}(c_{z}-c_{z}^{eq})^{2}\Big{]}dt. (7)

3 Results

3.1 Mean Field Equilibrium solution

The MFE solution is characterized through Proposition 1, 2, 3.

Proposition 1.

The optimal strategy for an R individual is cReq=0.5bRc_{R}^{eq}=0.5b_{R} and the corresponding optimal value function is VR(t)=uRmaxt+uRmaxTV_{R}(t)=-u_{R}^{max}t+u_{R}^{max}T.

Proof.

We substitute the Taylor expansion VR(t+dt)=VR(t)+V˙R(t)dtV_{R}(t+dt)=V_{R}(t)+\dot{V}_{R}(t)dt, and take only the first order dtdt terms.

VR(t)\displaystyle V_{R}(t) =maxcR{tt+dtuR(cR)𝑑t+VR(t)+V˙R(t)dt}\displaystyle=\max_{c_{R}}\Big{\{}\int_{t}^{t+dt}u_{R}(c_{R})dt+V_{R}(t)+\dot{V}_{R}(t)dt\Big{\}} (8)
V˙R(t)\displaystyle\Longrightarrow-\dot{V}_{R}(t) =maxcR{uR(cR)}\displaystyle=\max_{c_{R}}\Big{\{}u_{R}(c_{R})\Big{\}} (9)

uRu_{R} attains its maximum, uRmaxu_{R}^{max}, when cReq=0.5bRc_{R}^{eq}=0.5b_{R}, so we have

V˙R(t)=uRmaxVR(t)=uRmaxt+C\dot{V}_{R}(t)=-u_{R}^{max}\Longrightarrow V_{R}(t)=-u_{R}^{max}t+C (10)

Substituting the terminal condition, VR(T)=0V_{R}(T)=0, we have VR(t)=uRmaxt+uRmaxTV_{R}(t)=-u_{R}^{max}t+u_{R}^{max}T. ∎

In this model, the recovered individuals gain total immunity, so they always choose optimal contact rates.

Proposition 2.

The optimal strategy for an individual in the infected state is cIeq=0.5bIc_{I}^{eq}=0.5b_{I} and the corresponding optimal value function is VI(t)=VR(T)uRmaxuImaxμ(1eμ(tT))V_{I}(t)=V_{R}(T)-\frac{u_{R}^{max}-u_{I}^{max}}{\mu}(1-e^{\mu(t-T)})

Proof.

Substituting the first order Taylor expansion and taking only the first order terms,

VI(t)=maxcI{tt+dtuI(cI)𝑑t+(1μdt)(VI(t)+V˙I(t)dt)+μdt(VR(t)+V˙R(t)dt)}\displaystyle V_{I}(t)=\max_{c_{I}}\Big{\{}\int_{t}^{t+dt}u_{I}(c_{I})dt+(1-\mu dt)(V_{I}(t)+\dot{V}_{I}(t)dt)+\mu dt(V_{R}(t)+\dot{V}_{R}(t)dt)\Big{\}} (11)
\displaystyle\Longrightarrow VI(t)=maxcI{tt+dtuI(cI)𝑑t+VI(t)+V˙I(t)dtμVI(t)dt+μVR(t)dt}\displaystyle V_{I}(t)=\max_{c_{I}}\Big{\{}\int_{t}^{t+dt}u_{I}(c_{I})dt+V_{I}(t)+\dot{V}_{I}(t)dt-\mu V_{I}(t)dt+\mu V_{R}(t)dt\Big{\}} (12)
\displaystyle\Longrightarrow V˙I(t)=maxcI{uI(cI)+μ(VR(t)VI(t))}\displaystyle-\dot{V}_{I}(t)=\max_{c_{I}}\Big{\{}u_{I}(c_{I})+\mu(V_{R}(t)-V_{I}(t))\Big{\}} (13)

Because uI(cI)u_{I}(c_{I}) is the only term dependent on cIc_{I}, the value function attains its maximum, uImaxu_{I}^{max}, when cIeq=0.5bIc_{I}^{eq}=0.5b_{I}.

V˙I(t)=uImax+μ(VR(t)VI(t))-\dot{V}_{I}(t)=u_{I}^{max}+\mu\Big{(}V_{R}(t)-V_{I}(t)\Big{)} (14)

VR(t)V_{R}(t) is known from Proposition 1, and this is a first-order linear ordinary differential equation that can be explicitly solved with integrating factor. Using the terminal condition, VI(T)=0V_{I}(T)=0, we can find

VI(t)=VR(t)uRmaxuImaxμ(1eμ(tT))V_{I}(t)=V_{R}(t)-\frac{u_{R}^{max}-u_{I}^{max}}{\mu}(1-e^{\mu(t-T)}) (15)

The I individuals, similar to the R individuals, do not change from the optimal contact rate. From the above formulation, we see that VI(t)V_{I}(t) is bounded by VR(t)V_{R}(t), and VR(t)VI(t)V_{R}(t)-V_{I}(t) increases as μ\mu decreases, since it implies that I individuals spend longer time as an infected before recovering.

Proposition 3.

The optimal strategy for a susceptible individual is cSeq(t)<0.5bSc_{S}^{eq}(t)<0.5b_{S} during time 0t<T0\leq t<T.

Proof.

Substituting the Taylor expansion and taking only the first order terms,

VS(t)=maxcStt+dtuS(cS)𝑑t+(1cScIβxIdt)VS(t+dt)+cScIβxIdtVI(t+dt)\displaystyle V_{S}(t)=\max_{c_{S}}\int_{t}^{t+dt}u_{S}(c_{S})dt+\Big{(}1-c_{S}c_{I}\beta x_{I}dt\Big{)}V_{S}(t+dt)+c_{S}c_{I}\beta x_{I}dtV_{I}(t+dt) (16)
\displaystyle\Longrightarrow VS(t)=maxcStt+dtuS(cS)𝑑t+VS(t)+V˙S(t)dtcScIβxIdt(VS(t)VI(t))\displaystyle V_{S}(t)=\max_{c_{S}}\int_{t}^{t+dt}u_{S}(c_{S})dt+V_{S}(t)+\dot{V}_{S}(t)dt-c_{S}c_{I}\beta x_{I}dt(V_{S}(t)-V_{I}(t)) (17)
\displaystyle\Longrightarrow V˙S(t)=maxcSuS(cS)cScIβxI(VS(t)VI(t))\displaystyle-\dot{V}_{S}(t)=\max_{c_{S}}u_{S}(c_{S})-c_{S}c_{I}\beta x_{I}(V_{S}(t)-V_{I}(t)) (18)

The objective function is concave, so we set the derivative equal to 0 to find cSc_{S}^{*}.

duSdcS|cS=cSeq=0.5bIβxI(VS(t)VI(t))\displaystyle\frac{du_{S}}{dc_{S}}\Big{\rvert}_{c_{S}=c_{S}^{eq}}=0.5b_{I}\beta x_{I}(V_{S}(t)-V_{I}(t)) (19)

uSu_{S} is concave, so cSeqc_{S}^{eq} can be uniquely found. Also, we have VS(t)>VI(t)V_{S}(t)>V_{I}(t) for all t<Tt<T. This is because starting from the terminal condition VS(T)=VI(T)=0V_{S}(T)=V_{I}(T)=0, if at any time tt we are sufficiently close to VS=VIV_{S}=V_{I}, V˙S(t)=uSmax<uImax=V˙I(t)\dot{V}_{S}(t)=-u_{S}^{max}<-u_{I}^{max}=\dot{V}_{I}(t). Therefore, 0.5bIβxI(VSVI)>00.5b_{I}\beta x_{I}(V_{S}-V_{I})>0 for t<Tt<T, and cSeq<0.5bSc_{S}^{eq}<0.5b_{S}. ∎

Note that cSeqc_{S}^{eq} is smaller for bigger values of bIβxI(VS(t)VI(t))b_{I}\beta x_{I}(V_{S}(t)-V_{I}(t)), which means that the susceptible individuals should decrease contact if 1) infected population gets large, 2) the disease spreads easily, 3) cost of being infected is large, or 4) if the disease minimally affects the ability of the infected.

Numerical solution

We use discrete time steps Δt\Delta t to find the numerical solutions to the equations. With some initial c0=(cS0,cI0,cR0)c^{0}=(c_{S}^{0},c_{I}^{0},c_{R}^{0}), we compute x0=(xS0,xI0,xR0)x^{0}=(x_{S}^{0},x_{I}^{0},x_{R}^{0}) using the ODE forward equations. Then, c1c^{1} can be computed via backward induction with the given x0x^{0}. We continue this until we find ckc^{k} and xkx^{k} such that ck=ck1c^{k}=c^{k-1} and xk=xk1x^{k}=x^{k-1}, which is the MFE solution.

Refer to caption
Figure 3: The top figure shows xzx_{z} for z{S,I,R}z\in\{S,I,R\}, the SIR dynamics for MFE solution (solid lines) and the classical case without adaptive behaviors (dashed lines). Bottom figure shows czeq(t)c_{z}^{eq}(t) for z{S,I,R}z\in\{S,I,R\}.

Figure 3 shows the MFE numerical solution with disease parameters β=0.03\beta=0.03 and μ=0.1\mu=0.1, and utility parameters bS=bR=10b_{S}=b_{R}=10, bI=6b_{I}=6, aS=aR=0a_{S}=a_{R}=0, aI=4a_{I}=4, and γ=0.25\gamma=0.25. As we showed in Proposition 1 and 2, the recovered and infected individuals have no incentive to lower their contact rates, and so they continue with optimal level of contact rates. The susceptible individuals lower their contact rate to balance their immediate utilities and their expected cost of possibly getting infected. In result, the spread of infection is mitigated by the behavioral changes by the susceptible population, compared to the classical case in which no adaptive behavior is considered. The infection curve has lower peak due to the behavioral changes, but also a longer tail, because there are more remaining susceptible individuals to get the disease after the peak.

Refer to caption Refer to caption
(a) Cumulative curves (0.01β0.05)(0.01\leq\beta\leq 0.05) (b) Cumulative curves (0.02μ0.1)(0.02\leq\mu\leq 0.1)
Refer to caption Refer to caption
(c) Cumulative curves (2bI10)(2\leq b_{I}\leq 10) (d) Cumulative curves (2aI10)(2\leq a_{I}\leq 10)
Figure 4: The cumulative epidemic size is shown for the MFE solution (red) and the classical SIR (black) for the range of given parameter. For each (a)-(d), five cumulative curves are shown corresponding to five values of the given parameter in the range, where the more transparent lines are smaller parameter values.

Figure 4 shows the cumulative epidemic curve of the MFE solution and the classical SIR for different parameter values of β,μ,bI,\beta,\mu,b_{I}, and aIa_{I}. For any set of parameter values, the MFE solution, because of the adaptive behaviors, results in smaller final size as well as more gradual spread of the epidemic.

If the disease is more infectious (large β\beta), the MFE solution shows faster spread as well as larger final size, although mitigated compared to what the classical SIR model would predict (Figure 4a). If the disease has longer recovery time (small μ\mu), the classical SIR model predicts faster spread. However, the MFE solution predicts a different result. For example, when μ=0.02\mu=0.02, the classical case predicts that because the average infectious period is long, the number of infected population rises faster. However, the behavioral decision of individuals are considered in the MFE solution, and so a more gradual epidemic is predicted, since it is optimal to suppress contacts to avoid getting infected in the first place (Fig. 4b).

bIb_{I} and aIa_{I} are new parameters introduced to consider the effect of behavioral change. bIb_{I} denotes the economic utility, depending on social activity, and aIa_{I} denotes the health utility of the individual. If the disease does not affect the productivity of the infected individual (high bIb_{I}), the disease will spread as if its transmission rate β\beta is higher (Fig. 4c). In the case of COVID-19, the symptoms of many have been mild to moderate, which might have contributed to the perceived higher transmission rate.

The epidemic spread as predicted by the classical SIR model would not change with changes in aIa_{I}, but the MFE solution shows smaller epidemic and flatter growth rate for high aIa_{I}, as individuals choose to make less contacts to avoid the high cost of becoming infected (Fig. 4d).

3.2 Socially optimal solution

The MFE solution, discussed above, considers the case in which individuals maximize their own utility. For the socially optimal solution, we pose a centralized control problem, in which we find (cSopt,cIopt,cRopt)(c_{S}^{opt},c_{I}^{opt},c_{R}^{opt}) which maximizes the average utility of the entire population. Therefore we solve

cSopt(t),cIopt(t),cRopt(t)\displaystyle c_{S}^{opt}(t),c_{I}^{opt}(t),c_{R}^{opt}(t) =argmax0TxS(t)uS(cS(t))+xI(t)uI(cI(t))+xR(t)uR(cR(t))dt\displaystyle=\arg\max\int_{0}^{T}x_{S}(t)u_{S}(c_{S}(t))+x_{I}(t)u_{I}(c_{I}(t))+x_{R}(t)u_{R}(c_{R}(t))dt (20)
subject to: {x˙S=cScIβxSxIx˙I=cScIβxSxIνxIx˙R=νxI\displaystyle\begin{cases}\dot{x}_{S}=-c_{S}c_{I}\beta x_{S}x_{I}\\ \dot{x}_{I}=c_{S}c_{I}\beta x_{S}x_{I}-\nu x_{I}\\ \dot{x}_{R}=\nu x_{I}\\ \end{cases} (21)

In order to solve the optimal control problem, we use Pontryagin’s maximum principle, which gives the necessary conditions for the optimal controls, given the evolving dynamics of the system.

Theorem 3.1 (Pontryagin’s Maximum Principle).

Let 𝐱=[xS,xI,xR]T\bm{x}=[x_{S},x_{I},x_{R}]^{T} and 𝐜=[cS,cI,cR]T\bm{c}=[c_{S},c_{I},c_{R}]^{T}. For the given deterministic dynamics, 𝐱˙=f(𝐱,𝐜)\dot{\bm{x}}=f(\bm{x},\bm{c}), the Hamiltonian is defined as

H(𝒙,𝒄,𝝀,t):=L(𝒙,𝒄)+𝝀Tf(𝒙,𝒄)H(\bm{x},\bm{c},\bm{\lambda},t):=L(\bm{x},\bm{c})+\bm{\lambda}^{T}f(\bm{x},\bm{c})

where 𝛌(t)\bm{\lambda}(t) is the costate trajectory. If 𝐱(t)\bm{x}(t), 𝐜opt(t)\bm{c}^{opt}(t) is the optimal trajectory in 0tT0\leq t\leq T from 𝐱(0)\bm{x}(0), then 𝛌(t)\bm{\lambda}(t) satisfies

𝝀˙=H𝒙(𝒙opt,𝒄opt,𝝀,t)=L𝒙(𝒙opt,𝒄opt)+𝝀Tf𝒙(𝒙opt,𝒄opt)-\dot{\bm{\lambda}}=H_{\bm{x}}(\bm{x}^{opt},\bm{c}^{opt},\bm{\lambda},t)=L_{\bm{x}}(\bm{x}^{opt},\bm{c}^{opt})+\bm{\lambda}^{T}f_{\bm{x}}(\bm{x}^{opt},\bm{c}^{opt})

and 𝐜opt\bm{c}^{opt} is the solution to the optimization problem,

𝒄opt=argmax𝒄H(𝒙opt,𝒄,𝝀)\bm{c}^{opt}=\operatorname*{argmax}_{\bm{c}}H(\bm{x}^{opt},\bm{c},\bm{\lambda})
Proposition 4.

The socially optimal contact rate of the infected, cIoptc_{I}^{opt} must always be less than the MFE contact rate, cIeqc_{I}^{eq} during time 0t<T0\leq t<T.

Proof.

We can apply the Pontryagin’s maximum principle, which gives the necessary condition for optimality. If cSopt,cIopt,cRoptc_{S}^{opt},c_{I}^{opt},c_{R}^{opt} are optimal solutions, then there exist Lagrangian multipliers, λS(t),λI(t),λR(t)\lambda_{S}(t),\lambda_{I}(t),\lambda_{R}(t), such that λS(T)=0,λI(T)=0,λR(T)=0\lambda_{S}(T)=0,\lambda_{I}(T)=0,\lambda_{R}(T)=0, and for t<Tt<T, they satisfy:

λ˙S\displaystyle-\dot{\lambda}_{S} =cSoptcIoptβxI(λIλS)+uS(cSopt)\displaystyle=c_{S}^{opt}c_{I}^{opt}\beta x_{I}(\lambda_{I}-\lambda_{S})+u_{S}(c_{S}^{opt}) (22)
λ˙I\displaystyle-\dot{\lambda}_{I} =cSoptcIoptβxS(λIλS)+μ(λRλI)+uI(cIopt)\displaystyle=c_{S}^{opt}c_{I}^{opt}\beta x_{S}(\lambda_{I}-\lambda_{S})+\mu(\lambda_{R}-\lambda_{I})+u_{I}(c_{I}^{opt}) (23)
λ˙R\displaystyle-\dot{\lambda}_{R} =uR(cRopt)\displaystyle=u_{R}(c_{R}^{opt}) (24)
cSopt,cIopt,cRopt\displaystyle c_{S}^{opt},c_{I}^{opt},c_{R}^{opt} =argmaxcScIβxSxI(λIλS)+μxI(λRλI)+zxzuz(cz)\displaystyle=\arg\max c_{S}c_{I}\beta x_{S}x_{I}(\lambda_{I}-\lambda_{S})+\mu x_{I}(\lambda_{R}-\lambda_{I})+\sum_{z}x_{z}u_{z}(c_{z}) (25)

Note the similarities between our expressions for λS,λI,λR\lambda_{S},\lambda_{I},\lambda_{R} and VS,VI,VRV_{S},V_{I},V_{R} from the MFE, with the additional term in (23) as the only difference. Intuitively, this additional term represents the infected individual caring about the consequences of its contact strategy on the population, which is not considered by the selfish I individual in its MFE objective function. It can be interpreted as the I individual’s internalized negative externalities (i.e. thinking about our actions and their impact on others). First, we see that the cRc_{R} term in the objective function can be separated, and so we can find the maximizer, cRopt=0.5bR=cReqc_{R}^{opt}=0.5b_{R}=c_{R}^{eq}, which gives λR(t)=VR(t)\lambda_{R}(t)=V_{R}(t) for all tt. Therefore, the optimization problem for cSc_{S}^{*} and cIc_{I}^{*} is

cSopt,cIopt=argmaxxSuS(cS)+xIuI(cI)cScIβxSxI(λSλI)c_{S}^{opt},c_{I}^{opt}=\arg\max x_{S}u_{S}(c_{S})+x_{I}u_{I}(c_{I})-c_{S}c_{I}\beta x_{S}x_{I}(\lambda_{S}-\lambda_{I}) (26)

Clearly, βxSxI>0\beta x_{S}x_{I}>0 for all tt, assuming nontrivial initial conditions, xS(0)>0x_{S}(0)>0 and xI(0)>0x_{I}(0)>0. Also, λSλI>0\lambda_{S}-\lambda_{I}>0 for t<Tt<T. Starting from the given terminal conditions λS(T)=λI(T)=0\lambda_{S}(T)=\lambda_{I}(T)=0, if at any point we get sufficiently close to λS=λI\lambda_{S}=\lambda_{I}, we see that λ˙S<λI˙\dot{\lambda}_{S}<\dot{\lambda_{I}}, so λS\lambda_{S} will be strictly larger than λI\lambda_{I} in 0t<T0\leq t<T. Therefore, βxSxI(λSλI)>0\beta x_{S}x_{I}(\lambda_{S}-\lambda_{I})>0 for all 0t<T0\leq t<T.

With the utility function uz(cz)=(bzczcz2)γazu_{z}(c_{z})=(b_{z}c_{z}-c_{z}^{2})^{\gamma}-a_{z}, we will prove that cIopt<0.5bIc_{I}^{opt}<0.5b_{I} for all tt by dividing the problem into two cases: i) γ<1\gamma<1 and ii) γ=1\gamma=1.

i) γ<1\gamma<1
From (26), cIopt=0.5bIc_{I}^{opt}=0.5b_{I} only if cSoptβxSxI(λSλI)=0c_{S}^{opt}\beta x_{S}x_{I}(\lambda_{S}-\lambda_{I})=0, which is only true if cSopt=0c_{S}^{opt}=0. However, we see that for some given cIoptc_{I}^{opt}, cSoptc_{S}^{opt} can be computed to be

duSdcS|cS=cSoptcIoptβxI(λSλI)=0\displaystyle\frac{du_{S}}{dc_{S}}\Big{\rvert}_{c_{S}=c_{S}^{opt}}-c_{I}^{opt}\beta x_{I}(\lambda_{S}-\lambda_{I})=0 (27)
\displaystyle\Longrightarrow γ(bS2cSopt)(bScSoptcSopt2)1γ=cIoptβxI(λSλI)\displaystyle\frac{\gamma(b_{S}-2c_{S}^{opt})}{(b_{S}c_{S}^{opt}-c_{S}^{opt2})^{1-\gamma}}=c_{I}^{opt}\beta x_{I}(\lambda_{S}-\lambda_{I}) (28)

Since the left side monotonically decreases from \infty to 0 in the domain [0,0.5bI][0,0.5b_{I}], cSoptc_{S}^{opt} can be uniquely found and since the right side is bounded, cSopt>0c_{S}^{opt}>0. Therefore, cIopt<0.5bIc_{I}^{opt}<0.5b_{I} must be true.

ii) γ=1\gamma=1
Plugging in γ=1\gamma=1 gives the optimization problem as

cSopt,cIopt=argmaxxS(bScScS2)+xI(bIcIcI2)cScIβxSxI(λSλI)c_{S}^{opt},c_{I}^{opt}=\arg\max x_{S}(b_{S}c_{S}-c_{S}^{2})+x_{I}(b_{I}c_{I}-c_{I}^{2})-c_{S}c_{I}\beta x_{S}x_{I}(\lambda_{S}-\lambda_{I}) (29)

Assume for contradiction that cIopt=0.5bIc_{I}^{opt}=0.5b_{I} at some t<Tt<T. This is only possible if cSopt=0c_{S}^{opt}=0, which implies that bScIoptβxI(λSλI)b_{S}\leq c_{I}^{opt}\beta x_{I}(\lambda_{S}-\lambda_{I}) holds true. Since the optimal contact rates at time tt is given as cSopt(t)=0c_{S}^{opt}(t)=0 and cIopt(t)=0.5bIc_{I}^{opt}(t)=0.5b_{I}, we can plug these in to get

λ˙S(t)\displaystyle\dot{\lambda}_{S}(t) =0\displaystyle=0 (30)
λ˙I(t)\displaystyle\dot{\lambda}_{I}(t) =μ(λR(t)λI(t))uImax\displaystyle=-\mu(\lambda_{R}(t)-\lambda_{I}(t))-u_{I}^{max} (31)

With the known dynamics of λS,λI,\lambda_{S},\lambda_{I}, and xIx_{I}, we can use the first order Taylor expansion to find

xI(t+dt)\displaystyle x_{I}(t+dt) (λS(t+dt)λI(t+dt))=(xIμxIdt)(λSλI+μ(λRλI)dt+uImaxdt)\displaystyle\Big{(}\lambda_{S}(t+dt)-\lambda_{I}(t+dt)\Big{)}=(x_{I}-\mu x_{I}dt)\Big{(}\lambda_{S}-\lambda_{I}+\mu(\lambda_{R}-\lambda_{I})dt+u_{I}^{max}dt\Big{)} (32)
=xI(λSλI+μ(λRλI)dt+uImaxdtμ(λSλI)dt)>xI(λSλI)\displaystyle=x_{I}\Big{(}\lambda_{S}-\lambda_{I}+\mu(\lambda_{R}-\lambda_{I})dt+u_{I}^{max}dt-\mu(\lambda_{S}-\lambda_{I})dt\Big{)}>x_{I}(\lambda_{S}-\lambda_{I}) (33)

where the last inequality is due to λR\lambda_{R} being the upper bound of λS\lambda_{S}. We see that cIoptβxI(λSλI)c_{I}^{opt}\beta x_{I}(\lambda_{S}-\lambda_{I}) is monotonically increasing in time, and therefore, if (cSopt,cIopt)=(0,0.5bI)(c_{S}^{opt},c_{I}^{opt})=(0,0.5b_{I}) is the optimal solution at time tt, it is also the solution at time t+dtt+dt. This means that λSλI\lambda_{S}-\lambda_{I} must be increasing in time, but then it cannot satisfy the transversality condition, λS(T)=λI(T)=0\lambda_{S}(T)=\lambda_{I}(T)=0. Therefore, by contradiction, cI<0.5bIc_{I}^{*}<0.5b_{I} for all 0t<T0\leq t<T. ∎

This proposition implies that the selfish behavior of the infected is never optimal for the population, and so the appropriate policy in all cases is to decrease the contacts of the infected below the level that they want to selfishly make.

Numerical Solution

For the same set of parameters as in Figure 3, we can compute the socially optimal solution.

Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 5: (a) shows the same MFE solution as in Figure 3. (b) shows the socially optimal solution, which maximizes the total utility of the population. (c) and (d) show the MFE solution and socially optimal solution for a different initial population, (xS(0),xI(0))=(0.3,0.7)x_{S}(0),x_{I}(0))=(0.3,0.7).

The optimal solution as shown in Figure 5b is to completely suppress the contacts of the I, so that no additional infection can take place, while the susceptible population resumes normal activities. In fact, this is most often the socially optimal strategy. An exception is if a large part of the population was already infected initially (Figure 5c,d). Here, 70 percent of the population is initially infected, and so the optimal contact strategy is to completely isolate the susceptible until the number of infected decreases to some level, at which, we go back to complete quarantining of the I and the resuming of normal activities by the S. This type of policy is the complete shutdown implemented to various degrees in the United States, in which it is better to completely insulate the susceptibles until the number of infected population decrease back to a manageable level.

3.3 Cost of central planning

A key piece missing in our characterization of the socially optimal solution is the various costs to controlling the contact rate of the population. The assumption in Section 3.2 that the central planner can freely choose contact rates is far from realistic. For example, in the optimal solutions of Figure 5b, complete suppression of contact rates of the infected must be done through quarantining or providing proper incentives to keep infected people from making social contacts, which all cost resources. Additionally, people do not generally like their choices to be decided by an authority, so this cost also includes the ”loss in freedom,” which the central planner of wants to minimize. Therefore, we add to the problem in 3.2, a cost for deviating away from the population’s MFE.

cSopt(t),cIopt(t),cRopt(t)=argmax𝒄0T\displaystyle c_{S}^{opt}(t),c_{I}^{opt}(t),c_{R}^{opt}(t)=\operatorname*{argmax}_{\bm{c}}\int_{0}^{T} [xS(t)uS(cS(t))+xI(t)uI(cI(t))+xR(t)uR(cR(t))\displaystyle\Big{[}x_{S}(t)u_{S}(c_{S}(t))+x_{I}(t)u_{I}(c_{I}(t))+x_{R}(t)u_{R}(c_{R}(t))
12kxz(czczeq)2]dt\displaystyle-\frac{1}{2}k\sum x_{z}(c_{z}-c_{z}^{eq})^{2}\Big{]}dt (34)

We add a new parameter, k0k\geq 0, which balances the competing objectives of maximizing total utility and minimizing the deviations from selfish strategies, similar to a regularization parameter in machine learning. When k=0k=0, the solution to the objective function is the socially optimal solution, and when kk is large, the solution is the MFE solution. In the intermediate values of kk, we can find the solution which balances the trade-off.

Refer to caption
Refer to caption
Refer to caption
Figure 6: With initial condition xI(0)=0.3x_{I}(0)=0.3, we compute the socially optimal contact strategies and corresponding population dynamics for k=0.4,0.7k=0.4,0.7, and 11. As kk is increased, it becomes more expensive to shift the infected strategies from the selfish strategy, and so the socially optimal contacts of the infected is larger, as the cost of lowering it begins to outweigh the utility benefits.

As we see in Figure 6, for different values of kk, we compute the socially optimal contact rates of the population. Another interpretation is that given limited resources to control contact rates, we find how it should be distributed during t[0,T]t\in[0,T]. We see a common result among the range of values for kk, which is that when susceptibles have low contact rates, it is less important to keep the contact rates of the infected as low. Instead, it is more optimal use of resources to make sure the contact rates of the infected after the peak of the epidemic is lowered because this is when the susceptibles start ramping up to normal activities, thus posing higher risk of a second epidemic. Another reason is that because the number of infected is smaller at this time and the imposed cost is fixed per capita, the same amount of resources is more efficiently used by controlling the smaller population of infected rather than the larger infected population during the peak.

3.4 Application to SEIR Model

Here we show the numerical solutions to the MFE problem when exposed class (E) is added to the model. First, we consider the case in which the exposed individuals are not yet infectious, but incubating the disease. Second, we consider the case in which the exposed individuals are presymptomatic, meaning that they are infectious before they show symptoms, although at a lower rate compared to the infectious individuals.

The incubation period is a random variable which has exponential distribution with parameter ν\nu (The average incubation period is ν1\nu^{-1}). Another variable which is introduced is βE\beta_{E}, the presymptomatic transmission rate of individuals in the exposed class. If there is no presymptomatic transmission, βE=0\beta_{E}=0. The SEIR model is

[left=\empheqlbrace]x˙S\displaystyle[left=\empheqlbrace]\dot{x}_{S} =C()βxSxIC()βExSxE\displaystyle=-C(\cdot)\beta x_{S}x_{I}-C(\cdot)\beta_{E}x_{S}x_{E}
x˙E\displaystyle\dot{x}_{E} =C()βxSxI+C()βExSxEνxE\displaystyle=C(\cdot)\beta x_{S}x_{I}+C(\cdot)\beta_{E}x_{S}x_{E}-\nu x_{E}
x˙I\displaystyle\dot{x}_{I} =νxEμxI\displaystyle=\nu x_{E}-\mu x_{I}
x˙R\displaystyle\dot{x}_{R} =μxI\displaystyle=\mu x_{I} (35)

Because the susceptible and the exposed classes do not have visible symptoms, an individual cannot differentiate between being in the S or E class. Therefore, we define the asymptomatic class, which includes both the susceptible class and the exposed class. The value function of this class will be taken as the weighted average between the S and E classes,

VA=xSVS+xEVExS+xEV_{A}=\frac{x_{S}V_{S}+x_{E}V_{E}}{x_{S}+x_{E}} (36)

where the corresponding value functions are given as

VS(t)\displaystyle V_{S}(t) =maxcS{tt+dtuS(cS)dt+(1C()βxIdtC()βExEdt)VS(t+dt)\displaystyle=\max_{c_{S}}\Big{\{}\int_{t}^{t+dt}u_{S}(c_{S})dt+\Big{(}1-C(\cdot)\beta x_{I}dt-C(\cdot)\beta_{E}x_{E}dt\Big{)}V_{S}(t+dt) (37)
+(C()βxIdt+C()βExEdt)VE(t+dt)}\displaystyle\hskip 71.13188pt+\Big{(}C(\cdot)\beta x_{I}dt+C(\cdot)\beta_{E}x_{E}dt\Big{)}V_{E}(t+dt)\Big{\}}
VE(t)\displaystyle V_{E}(t) =maxcE{tt+dtuS(cE)𝑑t+(1νdt)VE(t+dt)+νdtVI(t+dt)}\displaystyle=\max_{c_{E}}\Big{\{}\int_{t}^{t+dt}u_{S}(c_{E})dt+(1-\nu dt)V_{E}(t+dt)+\nu dtV_{I}(t+dt)\Big{\}}
VI(t)\displaystyle V_{I}(t) =maxcI{tt+dtuI(cI)𝑑t+(1μdt)VI(t+dt)+μdtVR(t+dt)}\displaystyle=\max_{c_{I}}\Big{\{}\int_{t}^{t+dt}u_{I}(c_{I})dt+(1-\mu dt)V_{I}(t+dt)+\mu dtV_{R}(t+dt)\Big{\}}
VR(t)\displaystyle V_{R}(t) =maxcR{tt+dtuR(cR)𝑑t+VR(t+dt)}\displaystyle=\max_{c_{R}}\Big{\{}\int_{t}^{t+dt}u_{R}(c_{R})dt+V_{R}(t+dt)\Big{\}} (38)

Therefore the optimal contact rates chosen by the asymptomatic class is

VA(t)=maxcA{xSLS(cA)+xELE(cA)xS+xE}V_{A}(t)=\max_{c_{A}}\Big{\{}\frac{x_{S}L_{S}(c_{A})+x_{E}L_{E}(c_{A})}{x_{S}+x_{E}}\Big{\}} (39)

where LS()L_{S}(\cdot) and LE()L_{E}(\cdot) are the objective functions for the SS and EE classes in equations 38.

Refer to caption Refer to caption
(a) ν1[2,15]\nu^{-1}\in[2,15] (b) βE[β16,β2]\beta_{E}\in[\frac{\beta}{16},\frac{\beta}{2}]
Figure 7: (a) The average incubation time period, ν1\nu^{-1} is varied from 2 to 15 days, and the corresponding infection curve and the optimal contact decisions of the susceptibles are shown from lighter to darker curves. (b) The presymptomatic transmission rate as a fraction of β\beta, the regular symptomatic transmission rate is varied from 116\frac{1}{16} to 12\frac{1}{2} and the corresponding infection curve and the optimal contact decisions of the susceptibles are shown from lighter to darker curves.

With the parameters from Figure 3, we include an exposed class and vary the incubation parameter ν\nu, where ν1\nu^{-1} is the average incubation period (Figure 7a) or the presymptomatic transmission rate βE\beta_{E} (Figure 7b.

As seen in Figure 7a, if there is an epidemic of an infectious disease with longer incubation period, it becomes optimal for susceptible individuals to not engage in social distancing. This is because larger incubation period suggests higher uncertainty of whether one was already exposed or not. The infection curve also reaches a higher final size because of more contact behavior, although at a slower rate because of the incubation period.

Additionally, if the infectious disease causes presymptomatic transmission, the resulting infection curve and optimal contact decisions of the susceptibles can be seen in Figure 7b. If βE\beta_{E}, the presymptomatic transmission rate, is large, it quickly promotes more behavior because individuals of the exposed class unknowingly make social contacts, and quickly infect many susceptibles.

Here, we presented preliminary results of this model when an exposed class is included. Further studies are needed such as computing the socially optimal solution with and without cost, as we did for the SIR model.

3.5 Price of Anarchy

We compute the price of anarchy (PoA), which is a measure of how much the system degrades due to the selfish strategies of individuals in each S, I, R compartment. In the context of mean-field games, it is the ratio of the total utility of the population adopting MFE strategies to the total utility of the population adopting socially optimal strategies[19]. The PoA is given by

PoA=Vopt(0)xS(0)VS(0)+xI(0)VI(0)+xR(0)VR(0)\text{PoA}=\frac{V_{opt}(0)}{x_{S}(0)V_{S}(0)+x_{I}(0)V_{I}(0)+x_{R}(0)V_{R}(0)} (40)

where VoptV_{opt} is the maximum of the objective function in (20).

Refer to caption Refer to caption
(a) PoA of β[0.005,0.05]\beta\in[0.005,0.05] (b) PoA of μ[0.015,0.15]\mu\in[0.015,0.15]
Refer to caption Refer to caption
(c) PoA of bI[1,8]b_{I}\in[1,8] (d) PoA of aI[1,5]a_{I}\in[1,5]
Figure 8: We vary the model parameters and compute the price of anarchy (left) and the total population utility of the socially optimal strategy (right, dashed) and the MFE strategy (right, solid). The parameter space where PoA is high is where intervening public policy would be most needed, since this is the case in which selfish strategies are most degrading the total utility.

The PoA, computed for ranges of β,μ,bI,\beta,\mu,b_{I}, and aIa_{I}, is shown in Fig. 8. While one parameter is changed, the others were kept fixed with the parameter values from Fig. 3.

As β\beta is made larger, PoA increases because each selfish contact behavior of the infected individual becomes magnified by the high transmission rate per contact (Fig. 8a). We see a similarly increasing trend as μ\mu gets smaller, because it increases the time spent infected (Fig. 8b). However, an increase in β\beta results in decreasing marginal gain in PoA while a decrease in μ\mu results in increasing marginal gain in PoA. Even for unreasonably large β\beta, the overall utility suffers by around 11%, while small μ\mu can cause a decline of 45%. The PoA is affected more by changes in μ\mu, the recovery rate of the disease.

Fig. 8c shows that an intermediate value of bIb_{I} results in the largest PoA. This non-monotonic relationship is because of the trade-offs of large bIb_{I}. On one hand, large bIb_{I} means that the disease does not affect the day-to-day productivity of the infected as severely, and so the infected are not as penalized. On the other hand, this also means that the infected are able to be more active and make more contacts, which infects more susceptibles. These two opposing effects are balanced near bI=5b_{I}=5, where the PoA is at its maximum. When aIa_{I} is large (Fig. 8d), the baseline cost of getting infected is larger, which results in less utility at each time point.

4 Discussion

4.1 Possible additions

The public policy response to COVID-19 is an extremely complex problem with many factors which this paper has not covered. This model and analysis are, in many ways, the simplest baseline case from which we can make more realistic to fit a particular disease. First, we can pose the problem with different compartmental models. For example, if we take the SIS model, we would see different optimal strategies since the infected also face the burden of social distancing. In the case of COVID-19, it will be most useful to include a compartment with asymptomatic transmission, which behaves like S from the central planner’s perspective. The socially optimal strategies depended on being able to distinguish between the S and the I, when it is not always the case. The lack of available testing of COVID-19, for example, provides the uncertainty within the population as well as from the central planner’s perspective. Second, we can add heterogeneity to the population by including additional state variables such as age, socioeconomic status, or level of prosociality. By explicitly adding the different subpopulations, we can understand the game theoretic dilemma at play between the old vs. the young, the financially stable vs. the unstable, or the prosocial vs. antisocial. Then, more specific policies may be proposed that target the contact strategies of a particular group.

4.2 Conclusions from this model

While our analysis does not include many important factors, we can still make some general conclusions.

Selfish strategies still ”flatten the curve.”

By including adaptive behavior of individuals, our model predicts epidemic curves with flatter growth rate, compared to the classical counterparts. In our simple model, the curve is flattened naturally because of susceptibles who weigh the trade-off between current utility of making contacts and the future cost of getting infected. The recovered and the infected do not have any trade-offs to decrease contacts. Even if only the susceptibles are practicing social distancing, it still decreases the number of contacts of the system, and so the infected population reaches a smaller peak (Fig. 3, 4). It should be emphasized that the curve is flattened because individuals anticipate future growth in infections and decrease their contacts to avoid being exposed to the infected individuals. If the possible outbreak is flat-out denied by the media, then individuals will not adapt their behaviors, causing an unmitigated large peak in the infected population. Therefore, it is the responsibility of the policymakers to clearly communicate the existence and extent of the spreading disease.

Selfish strategy of the infected is never socially optimal.

We prove that the socially optimal strategy of the infected, cIoptc_{I}^{opt}, is always less than cIeqc_{I}^{eq}. Therefore, policies in response to the epidemic should decrease the contact rates of the infected. We see examples of policies with this aim such as quarantining the infected or granting paid sick leave to individuals who tested positive. Both policies respectively decrease the contact rates of the infected directly or indirectly by decreasing bIb_{I}, reducing the potential gain in utility of making more contacts. Because reducing the contacts of the infected is so important, policymakers might consider even more aggressive policies.

It is important to control the infected contact rates, following the peak of the epidemic.

If cost is imposed to the central planner in changing the contact rates of the individuals, we find the new socially optimal contact rates, depending on kk, which is the per capita unit cost of changing contact rates. For k=0k=0 and k1k\gg 1 respectively, we find the cost-imposed socially optimal solution to be the previously computed 𝒄opt\bm{c}^{opt} and 𝒄eq\bm{c}^{eq}. For kk values in between (Fig. 8), we commonly see that when it is too costly to decrease the infected contact rate for the whole time period, it is most beneficial to at least focus on decreasing after the epidemic has subsided. An assumption here is that cost of central planning is constant in time, when it may not be in real world situations. When outbreak is at its peak, more public attention is on the disease, and it may be easier to implement social distancing or secure funding for quarantining. However, when the disease has subsided, it might be harder to convince the public to behave differently.

This result reinforces the need for formal social distancing policy which goes beyond the peak of the epidemic. When the disease is prevalent, social distancing can be naturally favored due to individual optimization, but to sustain it for longer requires centralized public policy to prevent second peaks. This general result is in agreement with other studies of COVID-19 policies which mention the likely possibility of second peaks[20, 21, 22]. Additional work, using more realistic central planning cost depending on time and population structure, will help us better understand how such long-term social distancing policies should be implemented.

Disease with exposed period is unfavorable for promoting social distancing.

As the preliminary results show in Section 3.4, if an epidemic of an infectious disease with an incubation period breaks out in a population, it will be harder to maintain social distancing behaviors in the population. In the case of COVID-19, which is believed to have an incubation period of up to two weeks before symptoms, each asymptomatic individual will be more uncertain concerning its infection status, and thus make it harder to prolong social distancing. Additionally, it is believed that those exposed to COVID-19 can transmit the disease before showing symptoms, which will further make it hard to maintain social distancing, as seen in Figure 7b.

Policies are most needed for diseases with low μ\mu, high aIa_{I}, high β\beta and intermediate bIb_{I}.

By computing the price of anarchy, we can measure the effect of different parameters on how much the system is degraded by the selfish behaviors. Diseases with low μ\mu, high aIa_{I}, high β\beta, intermediate bIb_{I}, in this order, seem to most affect the population such that their selfish behaviors will degrade the system more compared to central intervention.

An interesting future work will be to put different diseases on the spectrum of these 4 variables, depending on its epidemiological characteristics as well as its economic and health effects on the infected. Then, we can roughly categorize diseases which need to be centrally intervened in the case of an outbreak.

References

  • [1] Charles Perrings, Carlos Castillo-Chavez, Gerardo Chowell, Peter Daszak, Eli P Fenichel, David Finnoff, Richard D Horan, A Marm Kilpatrick, Ann P Kinzig, Nicolai V Kuminoff, et al. Merging economics and epidemiology to improve the prediction and management of infectious disease. EcoHealth, 11(4):464–475, 2014.
  • [2] Mark Gersovitz and Jeffrey S Hammer. Infectious diseases, public policy, and the marriage of economics and epidemiology. The world bank research observer, 18(2):129–157, 2003.
  • [3] Mark Gersovitz and Jeffrey S Hammer. The economical control of infectious diseases. The Economic Journal, 114(492):1–27, 2004.
  • [4] Scott Barrett and Michael Hoel. Optimal disease eradication. Environment and Development Economics, pages 627–652, 2007.
  • [5] Sebastian Funk, Erez Gilad, Chris Watkins, and Vincent AA Jansen. The spread of awareness and its impact on epidemic outbreaks. Proceedings of the National Academy of Sciences, 106(16):6872–6877, 2009.
  • [6] Sebastian Funk, Marcel Salathé, and Vincent AA Jansen. Modelling the influence of human behaviour on the spread of infectious diseases: a review. Journal of the Royal Society Interface, 7(50):1247–1256, 2010.
  • [7] Timothy C Reluga. Game theory of social distancing in response to an epidemic. PLoS computational biology, 6(5), 2010.
  • [8] Eli P Fenichel, Carlos Castillo-Chavez, M Graziano Ceddia, Gerardo Chowell, Paula A Gonzalez Parra, Graham J Hickling, Garth Holloway, Richard Horan, Benjamin Morin, Charles Perrings, et al. Adaptive human behavior in epidemiological models. Proceedings of the National Academy of Sciences, 108(15):6306–6311, 2011.
  • [9] Benjamin R Morin, Eli P Fenichel, and CARLOS CASTILLO-CHAVEZ. Sir dynamics with economically driven contact rates. Natural resource modeling, 26(4):505–525, 2013.
  • [10] Peter J Francis. Optimal tax/subsidy combinations for the flu season. Journal of Economic Dynamics and Control, 28(10):2037–2054, 2004.
  • [11] Timothy C Reluga. An sis epidemiology game with two subpopulations. Journal of Biological Dynamics, 3(5):515–531, 2009.
  • [12] Josu Doncel, Nicolas Gast, and Bruno Gaujal. A mean-field game analysis of sir dynamics with vaccination. 2017.
  • [13] Suzanne Lenhart and John T Workman. Optimal control applied to biological models. CRC press, 2007.
  • [14] Rachael Miller Neilan and Suzanne Lenhart. An introduction to optimal control with an application in disease modeling. In Modeling Paradigms and Analysis of Disease Trasmission Models, pages 67–82, 2010.
  • [15] Jean-Michel Lasry and Pierre-Louis Lions. Mean field games. Japanese journal of mathematics, 2(1):229–260, 2007.
  • [16] Minyi Huang, Roland P Malhamé, Peter E Caines, et al. Large population stochastic dynamic games: closed-loop mckean-vlasov systems and the nash certainty equivalence principle. Communications in Information & Systems, 6(3):221–252, 2006.
  • [17] Richard G Lipsey and Kelvin Lancaster. The general theory of second best. The review of economic studies, 24(1):11–32, 1956.
  • [18] Tetsushi Ohdaira and Takao Terano. Cooperation in the prisoner’s dilemma game based on the second-best decision. Journal of Artificial Societies and Social Simulation, 12(4):7, 2009.
  • [19] René Carmona, Christy V Graves, and Zongjun Tan. Price of anarchy for mean field games. ESAIM: Proceedings and Surveys, 65:349–383, 2019.
  • [20] Francesco Di Lauro, István Z Kiss, and Joel Miller. The timing of one-shot interventions for epidemic control. medRxiv, 2020.
  • [21] Dylan Morris, Joshua B Plotkin, Fernando W Rossine, and Simon Levin. Optimal, near-optimal, and robust epidemic control. OSF Preprints. April, 4, 2020.
  • [22] Neil Ferguson, Daniel Laydon, Gemma Nedjati Gilani, Natsuko Imai, Kylie Ainslie, Marc Baguelin, Sangeeta Bhatia, Adhiratha Boonyasiri, ZULMA Cucunuba Perez, Gina Cuomo-Dannenburg, et al. Report 9: Impact of non-pharmaceutical interventions (npis) to reduce covid19 mortality and healthcare demand. 2020.