This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

From individual-based epidemic models to McKendrick-von Foerster PDEs: A guide to modeling and inferring COVID-19 dynamics

Félix Foutel-Rodier111First author. Département de Mathématiques, Université du Québec à Montréal,
Montréal, QC, Canada
François Blanquart SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Infection, Antimicrobials, Modeling, Evolution UMR 1137,
Université de Paris, INSERM, Paris, France
Philibert Courau SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Peter Czuppon SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
Jean-Jil Duchamps Laboratoire de mathématiques de Besançon UMR 6623,
Université Bourgogne Franche-Comté, CNRS, F-25000 Besançon, France
Jasmine Gamblin SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Élise Kerdoncuff SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Institut de Systématique, Biodiversité, Évolution UMR 7205,
Muséum National d’Histoire Naturelle, CNRS, Paris, France
Rob Kulathinal SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Department of Biology, Temple University, Philadelphia, PA, USA
Léo Régnier SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Laura Vuduc SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Amaury Lambert222Co-last authors. SMILE Group, Center for Interdisciplinary Research in Biology UMR 7241,
Collège de France, CNRS, INSERM U 1050,
PSL Research University, Paris, France
Institut de Biologie de l’ENS, École Normale Supérieure,
CNRS UMR 8197 INSERM U 1024, Paris, Franc
Emmanuel Schertzer Faculty of Mathematics, University of Vienna,
Oskar-Morgenstern-Platz 1, 1090 Wien, Austria
Abstract

We present a unifying, tractable approach for studying the spread of viruses causing complex diseases requiring to be modeled using a large number of types (e.g., infective stage, clinical state, risk factor class). We show that recording each infected individual’s infection age, i.e., the time elapsed since infection, has three benefits.

First, regardless of the number of types, the age distribution of the population can be described by means of a first-order, one-dimensional partial differential equation (PDE) known as the McKendrick-von Foerster equation. The frequency of type ii is simply obtained by integrating the probability of being in state ii at a given age against the age distribution. This representation induces a simple methodology based on the additional assumption of Poisson sampling to infer and forecast the epidemic. We illustrate this technique using French data from the COVID-19 epidemic.

Second, our approach generalizes and simplifies standard compartmental models using high-dimensional systems of ordinary differential equations (ODEs) to account for disease complexity. We show that such models can always be rewritten in our framework, thus, providing a low-dimensional yet equivalent representation of these complex models.

Third, beyond the simplicity of the approach, we show that our population model naturally appears as a universal scaling limit of a large class of fully stochastic individual-based epidemic models, where the initial condition of the PDE emerges as the limiting age structure of an exponentially growing population starting from a single individual.

1 Introduction

1.1 Challenges posed by complex diseases such as COVID-19

The transmission of pathogens between species is a global concern [38, 11]. As such zoonotic episodes are expected to become increasingly common in humans, it is critical to develop analytic tools that can quickly transform epidemiological observations into informed public policy in order to mitigate and control epidemics.

A novel coronavirus, SARS-CoV-2, has recently crossed the species barrier into humans and, within months, has rapidly spread to all corners of our planet [60]. The sheer scale of this pandemic has overburdened our medical infrastructure, caused fatalities estimated well into millions, and shut down entire economies. Remarkably, the rapid spread of COVID-19 and its consequences can be attributed to the unique life cycle of a 30,000 base pair single-stranded virus. SARS-CoV-2 is an airborne pathogen transmitted by both symptomatic and asymptomatic carriers in close proximity to non-infected individuals. Milder COVID-19 symptoms include a dry cough, fever, and/or shortness of breath while more serious cases include respiratory failure and possible death. With millions of infections and hundreds of thousands of documented deaths and recoveries, the COVID-19 pandemic is providing a wealth of independent estimates of important clinical characteristics that can help predict health outcomes specific for a country or region.

It quickly became understood that accurate descriptions of the life cycle of this disease needed to distinguish between several stages of the disease, referred to as compartments, depending on whether an infected individual is infectious or not, symptomatic or not, hospitalized, etc. However it remains unclear to what extent making precise predictions of the dynamics of such a complex disease requires to have a precise knowledge of clinical features such as incubation period, generation time, and duration times between infection, symptom establishment, hospitalization, recovery and death, to know how these durations correlate and what are the exact probabilities of transition between stages.

In this work, we consider a fully stochastic, generic epidemiological model with an arbitrary number of compartments, that encompasses life cycles of most complex diseases and that of COVID-19 in particular. We show how structuring the infected population by its infection age, i.e., time elapsed since infection, allows us to decouple dependencies between stages and to time. More specifically, when the population size is large enough, the joint evolution of all compartment sizes can be described by means of a linear, first-order partial differential equation (PDE) known as the McKendrick-von Foerster equation describing the number n(t,a)n(t,a) of infecteds of (infection) age aa at time tt. The boundary condition at age 0 is driven by the infection rate from infecteds of age aa, averaged over all possible courses of infection, and the number of individuals of age aa in compartment ii at time tt is obtained by thinning n(t,a)n(t,a) by a factor p(a,i)p(a,i) which is the probability of being in compartment ii conditional on having age aa, averaged over all possible courses of infection.

In the case of COVID-19, we display a simple procedure to infer these parameters, some from the biological literature and most from time series of numbers of severe cases, hospitalized cases, discharged patients and deaths that can be applied easily to any regional or national dataset. We also allow for time inhomogeneity in the infection rate to account for temporary mitigation measures such as lockdowns or social distancing. We apply this procedure to French COVID-19 data from March to May 2020 and estimate various parameters of interest including the reproduction number in different phases of the epidemic (before, during, and after lockdown) and biological parameter values that we compare to empirical estimates.

1.2 Decorated age-structured epidemic models

The large population size limit of our stochastic model is a PDE “decorated” with compartments. This point of view extends the usual sets of ODEs used in epidemiology, and allows us to represent in the same framework a large class of deterministic epidemic models. Before describing the stochastic model underlying such decorated PDEs, let us illustrate this notion by recalling the well-known derivation of the classic SIR set of ODEs from an age-structured model.

Consider the solution (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) to the following partial differential equation:

tn+an\displaystyle\partial_{t}n+\partial_{a}n =0\displaystyle=0 (1)
t0,n(t,0)\displaystyle\forall t\geq 0,\;n(t,0) =S(t)0n(t,a)τ(a)da\displaystyle=S(t)\int_{0}^{\infty}n(t,a)\tau(a)\mathrm{d}a
a0,n(0,a)\displaystyle\forall a\geq 0,\;n(0,a) =x0g(a)\displaystyle=x_{0}g(a)
t0,S(t)\displaystyle\forall t\geq 0,\;S(t) =10n(t,a)da.\displaystyle=1-\int_{0}^{\infty}n(t,a)\mathrm{d}a.

where 0x010\leq x_{0}\leq 1, and g,τ0g,\tau\geq 0 fulfill

0g(a)da=1,0τ(a)da<.\int_{0}^{\infty}g(a)\mathrm{d}a=1,\quad\int_{0}^{\infty}\tau(a)\mathrm{d}a<\infty.

Equation (1) was first proposed to describe the dynamics of an epidemic where infected individuals are structured by their age of infection, and is known as the Kermack-McKendrick model [36]. Note that in the original work of [36] the model is formulated as the solution to a convolution equation rather than as a PDE, but that the two formulations are equivalent, see Section 2.2. In this context, the age of an individual refers to the time elapsed since its infection, and not to its actual age. Then, n(t,a)n(t,a) is the density at time tt of all individuals with age (of infection) aa, and S(t)S(t) the density of individuals still susceptible to the disease. The differential term describes the aging process: the age of an individual increases linearly with time at rate one. The interpretation of the age boundary condition of (1) is that individuals with age aa infect susceptible individuals at a rate τ(a)\tau(a) that only depends on their age.

Remark 1.

It is important to note that n(t,a)n(t,a) counts all individuals that have been infected at time tat-a, and not only infective individuals with age aa. Thus, Eq. (1) lacks the usual recovery term. Moreover, τ(a)\tau(a) is not the average rate at which infective individuals with age aa yield infections, but the average infection rate of any individual with age aa. (The former is obtained from the latter by discounting all individuals with age aa that are not infectious anymore.)

In order to recover the SIR model, suppose that τ\tau is given by

a0,τ(a)=βeγa,\forall a\geq 0,\quad\tau(a)=\beta e^{-\gamma a},

for some γ,β>0\gamma,\beta>0. Further define

t0,I(t)=0n(t,a)eγada,R(t)=0n(t,a)(1eγa)da,\forall t\geq 0,\quad I(t)=\int_{0}^{\infty}n(t,a)e^{-\gamma a}\mathrm{d}a,\quad R(t)=\int_{0}^{\infty}n(t,a)(1-e^{-\gamma a})\mathrm{d}a,

Then a simple calculation shows that (S,I,R)(S,I,R) solves the following well-known system of ODEs:

S˙\displaystyle\dot{S} =βIS\displaystyle=-\beta IS (2)
I˙\displaystyle\dot{I} =βISγI\displaystyle=\beta IS-\gamma I
R˙\displaystyle\dot{R} =γI.\displaystyle=-\gamma I.

The previous expressions have an interesting probabilistic interpretation. Consider a Markov process (X(a);a0)(X(a);\,a\geq 0) with two states II and RR. Suppose that it starts from II and jumps to RR at rate γ\gamma. The process (X(a);a0)(X(a);\,a\geq 0) can be interpreted as describing the sequence of states (infective then recovered) visited by a typical individual in the microscopic model underlying (2). Then, clearly

p(a,I)(X(a)=I)=eγa,p(a,R)(X(a)=R)=1eγa,p(a,I)\coloneqq\mathbb{P}(X(a)=I)=e^{-\gamma a},\quad p(a,R)\coloneqq\mathbb{P}(X(a)=R)=1-e^{-\gamma a},

so that

I(t)=0n(t,a)p(a,I)da,R(t)=0n(t,a)p(a,R)da.I(t)=\int_{0}^{\infty}n(t,a)p(a,I)\mathrm{d}a,\quad R(t)=\int_{0}^{\infty}n(t,a)p(a,R)\mathrm{d}a. (3)

Furthermore, suppose that a typical infected individual yields new infections at constant rate β\beta while it is in state II. Then, the mean number of new infections occurring in the time interval [a,a+da][a,a+\mathrm{d}a] is

βeγada=τ(a)da.\beta e^{-\gamma a}\mathrm{d}a=\tau(a)\mathrm{d}a.

The picture that emerges from this simple calculation is that, instead of keeping track of the number of individuals in each compartment, one can consider the age structure of the population, given by Eq. (1). The dynamics of the age structure is uniquely prescribed by the average number τ(a)\tau(a) of infections that an individual yields at age aa. The individual counts in each compartment can then be recovered by integrating against the age structure the one-dimensional marginals of a process (X(a);a0)(X(a);\,a\geq 0) that describes the sequence of compartments visited by a typical individual in the population. We say that the PDE is “decorated” with compartments, as the process (X(a);a0)(X(a);\,a\geq 0) is used to recover the counts in the compartments, and only influences the dynamics of the infection which is described by the sole Eq. (1) through the average infection rate τ(a)\tau(a).

This viewpoint has several advantages compared to the usual ODE setting of (2). First, we can make sense of (3) for any process (X(a);a0)(X(a);\,a\geq 0). If this process is not Markovian, the number of individuals in each compartment no longer solves a system of ODEs similar to (2). Hence, this approach allows to go beyond the usual ODE framework. This generalization is of great modeling interest since a hypothesis underlying sets of ODEs is that the sojourn times in each compartment and the time between successive infections are exponentially distributed. In particular they cannot account for sojourn time distributions that are peaked around a value, which have been reported for instance for COVID-19 [41, 58].

Second, regardless of the number of compartments, the age structure of the population is described by the same one-dimensional PDE. This is particularly valuable when models have a large number of compartments, as in the context of COVID-19 [50, 19, 12, 17], as it avoids the use of high-dimensional systems of ODEs that are cumbersome to study mathematically. However, this requires to work with the PDE (1) rather than with ODEs, resulting in a mild computational cost.

Third, Eq. (1) only involves the mean number of infections τ(a)\tau(a) induced by individuals at age aa averaged over all compartments. In particular, it is unnecessary to assess to which compartments individuals belong when they yield new infections. This is in contrast with the usual ODE framework, where for each compartment an infection rate needs to be prescribed. As we will see, τ(a)\tau(a) relates to well-known epidemiological quantities that can be assessed directly from the data.

The main contribution of our work is to show that such decorated age-structured models arise naturally as the law of large numbers limit of a wide class of general stochastic epidemic models that we now introduce.

1.3 Generic stochastic model assumptions

We consider a population model in which individuals are either susceptible, if they have not yet met the disease, or infected. Our definition of infected is broader than usual: an individual is infected if it has been infected in the past. In particular, individuals that have recovered or died from the disease are still infected, even if they are not infective anymore. At any point in time, an infected individual is in one of several states, that will also be referred to as compartments, types, classes, or stages. The set of all such states is denoted by 𝒮\mathcal{S} and is assumed to be finite. Depending on the disease complexity, the number of stages can vary. In the SARS-CoV-2 example, typical stages are asymptomatic, mild case, severe case, hospitalized, intensive care unit, recovered, and dead (see Figure 5).

We assume that upon infection a susceptible individual immediately changes state and never becomes susceptible anymore (ruling out multiple infections, in particular), and that it will eventually end in one of two states: recovered or dead. The sequence of states visited by an individual xx is then encoded by a stochastic process Xx(Xx(a);a0)X_{x}\coloneqq(X_{x}(a);\,a\geq 0) valued in 𝒮\mathcal{S}, where the random variable Xx(a)X_{x}(a) is the state of xx at age of infection aa. We call (Xx(a);a0)(X_{x}(a);\,a\geq 0) the life-cycle process.

Each individual is further endowed with a random point process 𝒫x\mathcal{P}_{x} on [0,)[0,\infty), called the infection point process. Each atom of 𝒫x\mathcal{P}_{x} gives the age at which xx makes an infectious contact with another individual in the population, and we assume that all atoms are distinct so that secondary infections cannot occur simultaneously. (By an infectious contact, we mean a contact that would lead to a new infection if the target individual is susceptible to the disease. The pair (𝒫x,Xx)(\mathcal{P}_{x},X_{x}) characterizes the course of the infection of individual xx. We assume that these pairs, for all individuals xx, are i.i.d. copies of the same pair (𝒫,X)(\mathcal{P},X) that describes the infection of a typical individual in the population.

In order to make the mathematical treatment of the model easier, we make the simplifying assumption that the number of susceptible individuals is large compared to the number of infected individuals, so that each infectious contact leads to a new infection. (We neglect the saturation in the number of infections due to the finiteness of the population.) The epidemic is then described by a branching process known as a Crump-Mode-Jagers process [31, 54]: an individual xx infected at time σx\sigma_{x} will produce NxN_{x} new secondary infections at times σx+A1,,σx+ANx\sigma_{x}+A_{1},\dots,\sigma_{x}+A_{N_{x}}, where (A1,,ANx)(A_{1},\dots,A_{N_{x}}) are the atoms of 𝒫x\mathcal{P}_{x}, that is ,

𝒫x=i=1NxδAi.\mathcal{P}_{x}=\sum_{i=1}^{N_{x}}\delta_{A_{i}}.

(This branching hypothesis is relaxed in a recent work by some of the authors [18].)

Lastly, we superimpose time heterogeneity to this process by means of a contact rate (c(t);t0)(c(t);\,t\geq 0) valued in [0,1][0,1] thinning the infection process. More precisely, if tt is a potential time of infection for individual xx, we ignore the infection with probability 1c(t)1-c(t). This contact rate can model the effect of vaccination, or density-dependence (i.e., relaxing the branching assumption due to an excess of removed or of deceased individuals), or of governmental mitigation measures (i.e., social distancing, lockdown).

The infection process is more formally constructed in Section 2.1.

Remark 2.

As already discussed in the previous section, in the SIR example 𝒮={I,R}\mathcal{S}=\{I,R\}, and (X(a);a0)(X(a);\,a\geq 0) is a Markov process started from II that jumps to RR at rate γ\gamma. Moreover, the infection point process is given by

𝒫=AiPδAi𝟙{X(Ai)=I}\mathcal{P}=\sum_{A_{i}\in P}\delta_{A_{i}}\mathbbm{1}_{\{X(A_{i})=I\}}

where PP is a homogeneous Poisson point process on [0,)[0,\infty) with intensity β\beta.

Remark 3.

We emphasize that the pairs (𝒫x,Xx)(\mathcal{P}_{x},X_{x}) are assumed to be independent, but not the variables 𝒫x\mathcal{P}_{x} and XxX_{x}. In the simple SIR example they are not independent since there can be no atoms of 𝒫x\mathcal{P}_{x} after the recovery time. In the same spirit, one could assume that the infection potential of a given individual is reduced once in the hospital and that individuals with many atoms in their infection process 𝒫x\mathcal{P}_{x} (high infectiosity) are identified and isolated.

1.4 Statement of the main results

The stochastic epidemic models we consider here are fairly general and can exhibit quite complex dependencies (i) between states and time, due to the lack of any Markov-type assumption, (ii) between states, due to possibly hidden structuring variables impacting the life cycle, (iii) between state and infection rate, and (iv) between past and future infection events. The main result of this work is that despite this apparent complexity, most of this complexity vanishes when the size of the population is large. More specifically, we show that in the limit of large populations (obtained by starting from a large initial population or as a consequence of natural exponential growth), the population of infected individuals structured by age (of the infection) can be described by means of a one-dimensional PDE, and that the counts in each compartment are recovered by decorating this PDE with the life-cycle process. The limiting expression only depends on:

  1. 1.

    The average infection rate

    τ(da)𝔼(𝒫(da)),\tau(\mathrm{d}a)\coloneqq\mathbb{E}(\mathcal{P}(\mathrm{d}a)),

    formally defined as the intensity measure of 𝒫\mathcal{P}. We make the simplifying assumption that τ\tau has a density w.r.t. the Lebesgue measure and, with a slight abuse of notation, we still denote it by τ(a)\tau(a).

  2. 2.

    The one-dimensional marginals of the life-cycle process

    p(a,i)(X(a)=i).p(a,i)\coloneqq\mathbb{P}(X(a)=i).

We prove two main theorems that are two laws of large numbers for the age and compartment structure in the population: one started from a large number of individuals, and the other from a single individual.

Large initial population.

Let us start the population with a large number NN of infected individuals at time t=0t=0, with i.i.d. initial infection ages with law gg. (See Section 2.1 for a formal definition of the initial age.) Define the empirical measure of ages and compartments at time tt as

μtN(da×{i})σx<tδ(tσx,Xx(tσx))(da×{i}),\displaystyle\mu^{N}_{t}(\mathrm{d}a\times\{i\})\coloneqq\sum_{\sigma_{x}<t}\delta_{(t-\sigma_{x},X_{x}(t-\sigma_{x}))}(\mathrm{d}a\times\{i\}), (4)

where σx\sigma_{x} denotes the infection time of xx, and the sum is taken over all individuals xx infected before time tt. (This includes the initial individuals infected before time 0.) The measure μtN\mu_{t}^{N} is a random point measure that encodes the ages and compartments of all individuals that have been infected before time tt.

Let us also introduce niN(t)n^{N}_{i}(t), the number of individuals in compartment ii at time tt, defined as

niN(t)σx<t𝟙{Xx(tσx)=i}=μtN([0,)×{i}).n^{N}_{i}(t)\coloneqq\sum_{\sigma_{x}<t}\mathbbm{1}_{\{X_{x}(t-\sigma_{x})=i\}}=\mu_{t}^{N}\big{(}[0,\infty)\times\{i\}\big{)}.
Theorem 1 (NN individuals).

Start the population with NN individuals with i.i.d. initial ages distributed according to gg. Then, for any t>0t>0, the following convergence holds in the weak topology

1NμtN(da×{i})Nn(t,a)p(a,i)daa.s.\frac{1}{N}\mu^{N}_{t}(\mathrm{d}a\times\{i\})\underset{N\to\infty}{\longrightarrow}n(t,a)p(a,i)\mathrm{d}a\quad\text{a.s.}

where (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) is the solution to

tn+an\displaystyle\partial_{t}n+\partial_{a}n =0\displaystyle=0 (5)
t0,n(t,0)\displaystyle\forall t\geq 0,\;n(t,0) =c(t)0n(t,a)τ(a)da\displaystyle=c(t)\int_{0}^{\infty}n(t,a)\tau(a)\mathrm{d}a
a0,n(0,a)\displaystyle\forall a\geq 0,\;n(0,a) =g(a).\displaystyle=g(a).

As a consequence, for any t>0t>0,

1NniN(t)N0n(t,a)p(a,i)daa.s.\frac{1}{N}n^{N}_{i}(t)\underset{N\to\infty}{\longrightarrow}\int_{0}^{\infty}n(t,a)p(a,i)\mathrm{d}a\quad\text{a.s.} (6)

The limiting age structure of the population is thus described by Eq. (5), which is a linear version of Eq. (1), known as the McKendrick-von Foerster equation. Note that it also has an additional c(t)c(t) term accounting for the reduced contact rate, resulting in a time heterogeneity. As in Section 1.2, the number of individuals in each compartment is recovered by decorating the PDE with the one-dimensional marginals of the life-cycle process.

After lockdown onset.

Our second result displays a similar, but more subtle, convergence in the case when the process is supercritical, where natural growth leads by itself to large population sizes. We say that the process is supercritical if

0τ(a)da>1,\int_{0}^{\infty}\tau(a)\mathrm{d}a>1,

in which case there exists α>0\alpha>0 such that

0eαaτ(a)da=1.\int_{0}^{\infty}e^{-\alpha a}\tau(a)\mathrm{d}a=1.

(The parameter α\alpha is the Malthusian parameter of the CMJ process when c1c\equiv 1.) Let Z(t)Z(t) denote the total population size at time tt and assume that Z(0)=1Z(0)=1, i.e., we start from a single individual. Suppose that (tK;K0)(t_{K};\,K\geq 0) is a sequence of stopping times (with respect to the natural filtration of the process, see Section 3) such that tKt_{K}\to\infty on the non-extinction event. By a slight abuse of notation, denote by μtK\mu^{K}_{t} the empirical measure of ages and types as in (4), but under the assumption that the contact rate at time tt is equal to c(ttK)c(t-t_{K}) where cc is equal to 1 for negative arguments. We are motivated by modeling a situation where the infection is separated into two distinct phases:

  1. 1.

    The epidemic develops until a certain random time tKt_{K}. For instance, tKt_{K} could be the time at which the number of recorded deaths exceeds a large threshold KK. We assume no suppression before tKt_{K}.

  2. 2.

    We let the contact rate vary after time tKt_{K} according to the function (c(ttK);t0)(c(t-t_{K});\,t\geq 0), e.g., due to mitigation measures and/or behavioral changes (i.e., lockdown phase).

In this setting, we can derive the following version of the law of large numbers for ages and compartments.

Theorem 2 (One individual).

Suppose that the process is supercritical and that the population is started from one individual. Conditional on non-extinction:

  1. 1.

    There exists a r.v. WW_{\infty} such that W>0W_{\infty}>0 a.s. and

    σx<tKeαtKKWin probability.\sum_{\sigma_{x}<t_{K}}e^{-\alpha t_{K}}\underset{K\to\infty}{\longrightarrow}W_{\infty}\quad\text{in probability.}
  2. 2.

    For any t>0t>0, we have

    eαtKμtK+tK(da×{i})KWn(t,a)p(a,i)dae^{-\alpha t_{K}}\mu^{K}_{t_{K}+t}(\mathrm{d}a\times\{i\})\underset{K\to\infty}{\longrightarrow}W_{\infty}n(t,a)p(a,i)\mathrm{d}a

    in probability for the weak topology, where (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) is the solution to (5) with initial condition g(a)=αeαag(a)=\alpha e^{-\alpha a}.

This result states that, when the large population size is obtained by natural population growth, the population has an exponentially distributed initial age profile with a random size WW_{\infty} determined by the early infection events. Moreover, the parameter of the exponential distribution corresponds to the exponential growth rate of the epidemic prior to the enforcement of control measures. This result can prove useful in applications, as the exponential growth rate can be readily estimated from incidence data, whereas the age structure of the population can hardly be directly assessed. It is a quite generic phenomenon that the macroscopic behavior of population models started from a few individuals is described by a deterministic system, with a random initial condition resulting from the stochasticity of the initial population growth [4, 2].

Summary.

The macroscopic behavior of the epidemic is characterized by the sole intensity measure τ\tau and dictates an explicit age structure of the population. The class structure is deduced by integrating the life-cycle process against the limiting age profile. This suggests an alternative point of view on epidemic models, as age-structured models decorated with classes.

In order to validate our approach, we use those deterministic approximations to infer epidemiological parameters (reproduction number before and during lockdown) from recent empirical observations, and show that our findings are in accordance with the current literature.

1.5 Inference on the French COVID-19 epidemic

We have illustrated the practical interest of our approach by carrying out parameter inference on data from the early French COVID-19 epidemic. We focus on two important inference aspects of this epidemic: providing estimates for key epidemiological quantities, such as the reproduction number that allows to assess the impact of control measures; and predicting the number of individuals in ICU and hospital to monitor the pressure on the healthcare system.

In our framework, the first task only requires a simple and parsimonious model that can be adjusted on incidence data, whereas fitting the number of individuals in ICU requires a more complex model that better accounts for the population heterogeneity.

The early COVID-19 epidemic in France.

After a rapid increase in the number of detected cases and deaths, the French government issued a first nation-wide lockdown from March 17 2020 to May 11 2020. From March 18 2020, it has provided publicly available daily reports of the number of ICU and hospital admissions, hospital deaths, as well as the number of occupied ICU and hospital beds, and discharged individuals. The daily number of detected cases was also reported, but was considered as unreliable during this period due to the high variation in the number of tests performed. No additional control measure was enforced during this period.

Estimating epidemiological parameters from incidence data.

In order to estimate the impact of lockdown we consider a parsimonious model that requires to estimate few parameters. It is illustrated in Figure 5, and we refer to it as the admission model. Upon infection, individuals either develop a mild form of COVID-19 from which they will recover, or a more severe form that will eventually lead to a hospital admission. Then, hospitalized individuals either recover and are discharged after some amount of time, or are moved to ICU. Finally, individuals in ICU either die or recover. A more detailed description of the model and its parameters is given in Section 4.3.

Refer to caption
Figure 1: Best fit of the admission model. Solid lines correspond to the number of hospital admissions, ICU admissions and deaths predicted by the admission model. The dots are the corresponding observed values. The dispersion of the observations is mainly due to unreported data during the weekends, that are only reported at the start of the following week.

We have fitted this model to the following three time series: daily number of admissions in hospital and ICU, and daily deaths. Fitting such “incidence” time series only requires to estimate the entrance time in each compartment, and not the corresponding sojourn times. The best fitting model is represented in Figure 1, and the inference procedure is described in Section 4. We see that our simple model reproduces quite well the shapes of the three incidence time series. The estimated reproduction number after lockdown is Rpost=0.745R_{\mathrm{post}}=0.745, and that before lockdown is Rpre=3.25R_{\mathrm{pre}}=3.25. Thus we estimate that the lockdown yielded a reduction of the reproduction number by a factor 4.364.36. Moreover, the estimated number of infections having occurred before March 18 2020 is W=9.85×105W=9.85\times 10^{5}. All these estimates are in line with that of other studies on the same dataset [50, 53].

Fitting prevalence data.

Our second objective is to fit three additional time series: the number of occupied hospital and ICU beds, and the number of discharged individuals. Using the simple admission model only yields a poor fit of these new times series, see Figure 7. We have identified two main causes for this discrepancy. First, we have made the simplifying assumption that individuals are always admitted to ICU prior to death. However, it has been reported that a large fraction of deaths do not involve a preliminary ICU admission [40], and our assumption leads to a fraction of deaths among ICU patients much higher than that previously reported. Second, there are many heterogeneities in the population, such as the (actual) age, that are known to play an important role in the severity of the symptoms of COVID-19 and that are not accounted for.

Thus, we used a more detailed model to reproduce all six time series, which is illustrated in Figure 6 and referred to as the occupancy model. Again, a detailed description of the model and its parameters is available in Section 4.4. The main two differences with the admission model are that a fraction of individuals die shortly after hospital admission, and that we distinguish between individuals who recover fast after their admission and individuals who recover slowly. The best-fitting model is displayed in Figure 2. Again, it reproduces quite well the shapes of all the time-series. Under the occupancy model, the estimated reproduction number after lockdown is Rpost=0.734R_{\mathrm{post}}=0.734 and the estimated number of infections before March 17 2020 is W=9.52×105W=9.52\times 10^{5}. These estimates are close to those obtained under the admission model, indicating that the predictions made by the simple admission model are quite robust to the addition of model details.

Refer to caption
Figure 2: Best fit of the admission model. The solid lines correspond to the number of deaths, discharges, occupied ICU and hospital beds and ICU and hospital admissions predicted by the occupancy model. The dots are the corresponding observed values.

Overall, our inference work suggests that a simple model can be used to determine “global” epidemiological parameters, such as the reproduction number and total number of infections, whereas obtaining a prediction for the number of individuals in hospital or ICU requires to use a more detailed model that accounts for population heterogeneity. Moreover, it demonstrates that decorated age-structured models can be readily used to carry out parameter inference in the context of COVID-19, even when the underlying compartment structure is quite complex.

Section 4 contains a detailed description of the inference procedure, as well as a comparison of the various estimates that we obtain with estimates currently available in the literature.

1.6 Connection with ODE models

Section 1.2 shows that the SIR model can be seen as a decorated age-structured model, with well-chosen Markov life-cycle process and Poisson infection point process. This section extends this representation to a broader class of ODE models. We will be interested in solutions to the following set of ODEs:

i𝒮,n˙i(t)=S(t)j𝒮βjinj(t)+j𝒮qjinj(t)S(t)=1j𝒮nj(t).\displaystyle\begin{split}\forall i\in\mathcal{S},\quad\dot{n}_{i}(t)&=S(t)\sum_{j\in\mathcal{S}}\beta_{ji}n_{j}(t)+\sum_{j\in\mathcal{S}}q_{ji}n_{j}(t)\\ S(t)&=1-\sum_{j\in\mathcal{S}}n_{j}(t).\end{split} (7)

The parameter βij0\beta_{ij}\geq 0 gives the rate of new infections from individuals in compartment ii such that the newly infected individual starts in compartment jj. The matrix TT with entries (βij)(\beta_{ij}) is referred to as the transmission matrix. For iji\neq j, qij0q_{ij}\geq 0 corresponds to transition rate from compartment ii to compartment jj. The transition matrix with entries (qij)(q_{ij}) is denoted by QQ, and we further impose that

i𝒮,qii=jiqij.\forall i\in\mathcal{S},\quad q_{ii}=-\sum_{j\neq i}q_{ij}.

This class of ODE models encompasses many common epidemic models, including the SIR and SEIR models, as well as all models described in [8, Chapter 4] for instance.

Proposition 3.
  1. 1.

    Suppose that (X(a);a0)(X(a);\,a\geq 0) is a Markov process with jump matrix Q=(qij;i,j𝒮)Q=(q_{ij};\,i,j\in\mathcal{S}), and that conditional on the life-cycle process, 𝒫\mathcal{P} is a Poisson point process with rate λi\lambda_{i} when X(a)=iX(a)=i. Then, if (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) is a solution to (1),

    n~i(t)=0n(t,a)p(a,i)da\widetilde{n}_{i}(t)=\int_{0}^{\infty}n(t,a)p(a,i)\mathrm{d}a (8)

    solves (7) with βij=λip(0,j)\beta_{ij}=\lambda_{i}p(0,j).

  2. 2.

    The solution of (7) with transmission and transition matrices TT and QQ can be written as (8) if and only if rank(T)=1\operatorname{rank}(T)=1.

Proof.

By differentiating both sides of (8) w.r.t. time, we obtain

ddtn~i(t)\displaystyle\frac{\mathrm{d}}{\mathrm{d}t}\widetilde{n}_{i}(t) =0tn(t,a)p(a,i)da=0an(t,a)p(a,i)da\displaystyle=\int_{0}^{\infty}\partial_{t}n(t,a)p(a,i)\mathrm{d}a=-\int_{0}^{\infty}\partial_{a}n(t,a)p(a,i)\mathrm{d}a
=n(t,0)p(0,i)+0n(t,a)ap(a,i)da.\displaystyle=n(t,0)p(0,i)+\int_{0}^{\infty}n(t,a)\partial_{a}p(a,i)\mathrm{d}a.

By conditioning on the process XX and using standard properties of Poisson point processes, we can compute the intensity measure of 𝒫\mathcal{P} as

τ,f=𝔼[0λX(a)f(a)da]=0f(a)i𝒮p(a,i)λida,\langle\tau,f\rangle=\mathbb{E}\Big{[}\int_{0}^{\infty}\lambda_{X(a)}f(a)\mathrm{d}a\Big{]}=\int_{0}^{\infty}f(a)\sum_{i\in\mathcal{S}}p(a,i)\lambda_{i}\mathrm{d}a,

so that

a0,τ(a)=i𝒮p(a,i)λi.\forall a\geq 0,\quad\tau(a)=\sum_{i\in\mathcal{S}}p(a,i)\lambda_{i}.

By using the boundary condition of (1) and the fact that (X(a);a0)(X(a);\,a\geq 0) is a Markov process with generator QQ we get

ddtn~i(t)\displaystyle\frac{\mathrm{d}}{\mathrm{d}t}\widetilde{n}_{i}(t) =p(0,i)S(t)0n(t,a)j𝒮λjp(a,j)da+0n(t,a)j𝒮qjip(a,j)da\displaystyle=p(0,i)S(t)\int_{0}^{\infty}n(t,a)\sum_{j\in\mathcal{S}}\lambda_{j}p(a,j)\mathrm{d}a+\int_{0}^{\infty}n(t,a)\sum_{j\in\mathcal{S}}q_{ji}p(a,j)\mathrm{d}a
=S(t)j𝒮p(0,i)λjn~j(t)+j𝒮qjin~j(t)\displaystyle=S(t)\sum_{j\in\mathcal{S}}p(0,i)\lambda_{j}\widetilde{n}_{j}(t)+\sum_{j\in\mathcal{S}}q_{ji}\widetilde{n}_{j}(t)

from which the first item follows.

It is clear that if βij=λip(0,j)\beta_{ij}=\lambda_{i}p(0,j) then rank(T)=1\operatorname{rank}(T)=1. Fix some TT and QQ. If rank(T)=1\operatorname{rank}(T)=1, then it can be decomposed as

i,j𝒮,βij=λipj\forall i,j\in\mathcal{S},\quad\beta_{ij}=\lambda_{i}p_{j}

where (pj;j𝒮)(p_{j};\,j\in\mathcal{S}) is a probability vector and λi0\lambda_{i}\geq 0. This decomposition can for instance be recovered from

λi=j𝒮βij,pj=βijλi.\lambda_{i}=\sum_{j\in\mathcal{S}}\beta_{ij},\quad p_{j}=\frac{\beta_{ij}}{\lambda_{i}}.

Consider a Markov process (X(a);a0)(X(a);\,a\geq 0) with transition matrix QQ and X(0)X(0) distributed as (pj;j𝒮)(p_{j};\,j\in\mathcal{S}). If 𝒫\mathcal{P} is such that infections occur at rate λi\lambda_{i} when X(a)=iX(a)=i, the first item proves that (7) can be written as (8). ∎

The previous result provides a simple criterion for a system of ODEs to be represented as a decorated age-structured PDE. This criterion has already been proposed previously and is referred to as the “separable mixing” assumption [15, 14]. A direct consequence of this result is that not all ODE models can be represented using a single decorated age-structured PDE. However many models fulfill the requirement that rank(T)=1\operatorname{rank}(T)=1, including models with a single infectious state, those where new infected individuals always start in the same state, and all classical models exposed in [8, Chapter 4]. In that sense, our framework greatly extends the usual systems of ODEs widely used in epidemic modeling.

An important situation where rank(T)>1\operatorname{rank}(T)>1 is that of heterogeneous contact rates in the population. For instance, the contact rate could depend heavily on the (actual) age group to which individuals belong, and more contacts are made within the same age class than between classes. A second example is that of spatial heterogeneity, where contacts are more likely to occur between spatially close individuals. It remains possible to derive a representation of (7) in the case rank(T)>1\operatorname{rank}(T)>1 using an age-structured model, but this would require to use a multi-dimensional version of (1), which is a straightforward extension of this model.

1.7 Relation with previous works and outline

Deterministic epidemic models where the infectivity depends on the individuals’ age of infection were first introduced in [36], and their mathematical properties have been further studied thoroughly [48, 13, 55], see [30] for a recent account. However, these models have received surprisingly little attention in applications compared to their ODE counterpart, which have been widely used for instance in the context of COVID-19 [49, 50, 19, 12, 17]. In this direction, let us mention [24] which makes use of an epidemic model with memory and [28] were a set of transport PDEs similar to (1) is used. Note that, in contrast with our approach, the PDE in the latter work has two dimensions and is structured according to the time since the entrance in the compartments rather than by infection age. We hope that our work illustrates well the practical potential of such general models. The relation between age-structured and ODE epidemic models exposed in Section 1.2 is known since their very introduction [36], and has been acknowledged multiple times since then [43, 16, 7, 30].

In the most general formulation of a CMJ process, individuals can carry a trait valued in an abstract measure space that encodes all the information about their infection [31, 54]. We have restricted this information to the sequence of compartments visited by each individual, but we could have included some additional details, such as the evolution of the viral load for instance, which could be modeled as a continuous trait following a diffusion. Our result would carry over to the general setting, with a modified limiting equation in Theorem 1 which has already been proposed in [43, Eq. (2.5)] and [44, Chapter IV, Section 1.3]. (Note that none of these works is concerned with the underlying stochastic model.) However, we believe that our current formalism, where the state of an infected individual can be described by a discrete set of compartments, is flexible enough for applications, while being easier to grasp than the general case.

Non-Markov epidemic models have already been investigated, see e.g. [52, 46, 3, 5]. In particular our work shares similarities with a recent series of work in this direction [46, 23, 47]. Let us briefly show how to translate the model in [23] in our framework. Let λ=(λ(a);a0)\lambda=(\lambda(a);\,a\geq 0) be some random function giving the infectiousness of a typical individual in the population. Conditional on λ\lambda, let 𝒫\mathcal{P} be a time-inhomogeneous Poisson point process with intensity λ\lambda, define

ζ=inf{a:λ(a)>0},η+ζ=sup{a:λ(a)>0}\zeta=\inf\{a:\lambda(a)>0\},\qquad\eta+\zeta=\sup\{a:\lambda(a)>0\}

and set

a0,X(a)={Eif a[0,ζ)Iif a[ζ,ζ+η)Rif a[ζ+η,).\forall a\geq 0,\quad X(a)=\begin{cases}E&\text{if $a\in[0,\zeta)$}\\ I&\text{if $a\in[\zeta,\zeta+\eta)$}\\ R&\text{if $a\in[\zeta+\eta,\infty)$}.\end{cases}

Then, up to the choice of the initial condition and the fact that we make a branching assumption, the model considered in [23] coincides with the model considered here for this specific choice of the pair (𝒫,X)(\mathcal{P},X). (It coincides exactly with the extension of our model considered in [18].)

The main result in [23] shows that the fraction of individuals in each state (SS, EE, II, and RR) converges to the solution of a set of Volterra equations, see their Theorem 2.1. A straightforward computation shows that the latter equations are equivalent to the non-linear version our decorated McKendrick-von Foerster PDE obtained by replacing (5) by (1), with the specific choice of (𝒫,X)(\mathcal{P},X) described above.

The idea of representing a general branching population by its age structure has a rich history in probability theory [31, 20, 35, 33, 32, 29, 57, 22] and the connection with the McKendrick-von Foerster PDE has been acknowledged several times [20, 29]. In the latter two works, the authors allow for birth and death rates that may depend not only on abundances of each type, but also on the whole age structure of the population. This impressive level of generalization comes at the cost of assuming that the process describing the evolution of the empirical measure on ages and types is Markovian. In particular, birth and death rates are not allowed to depend on past individual birth events. The Markov property then allows the use of a generator for the empirical measure and with some extra finite second moment assumptions on the intensity measure, this approach allows the authors to obtain a law of large numbers and a central limit theorem.

Even if the current work is not as mathematically challenging as that alluded to above, we believe that our point of view does deserve to be highlighted in the current sanitary crisis since it provides both a general modeling framework and an efficient inference methodology. Furthermore, since we ignore finite population effects, our proofs are quite elementary compared to [20, 29] and should be accessible to a much wider audience interested in such a modeling approach. Finally, as far as we can tell, the duality result exposed in Section 2.4 is new and can presumably be extended to more general branching processes where birth and death rates are allowed to be frequency-dependent. In [18], some of the authors of the present work show that this duality result has a natural counterpart in a model with a finite but large population.

Outline.

The remainder of the paper is organized as follows. Section 2 is devoted to the study of Eq. (5). After providing a formal construction of the branching process that we consider in Section 2.1, the definition of a weak solution to (5) is given in Section 2.2. Then, we derive two probabilistic representations of this solution: we show in Section 2.3 that it corresponds to the first moment of the branching process that we are studying, when viewed as a random measure on the ages of infection; Section 2.4 provides a construction of the weak solution using a dual genealogical process. The two laws of large numbers are proved in Section 3. Finally, Section 4 describes the inference procedure, and compares the estimates that we obtain to known estimates from the literature.

2 Two Feynman-Kac formulæ

2.1 Assumptions and notation

CMJ branching process with suppression.

Recall that the infection process is modeled by a Crump-Mode-Jagers (CMJ) branching process [31, 45] with no death, starting from one individual called the progenitor (or root of the tree). It can be briefly constructed as follows.

Using the Ulam-Harris labeling, the population can be indexed by

𝒰{}n1n.\mathscr{U}\coloneqq\{\emptyset\}\cup\bigcup_{n\geq 1}^{\infty}\mathbb{N}^{n}.

The set 𝒰\mathscr{U} encodes a tree where xi(x,i)xi\coloneqq(x,i) is the ii-th child of xx. Each individual x𝒰x\in\mathscr{U} is characterized by a pair (𝒫x,Xx)(\mathcal{P}_{x},X_{x}) embodying respectively the processes of secondary infection events from xx and of types carried by xx. Each pair (𝒫x,Xx)(\mathcal{P}_{x},X_{x}) is an i.i.d. copy of the pair (𝒫,X)(\mathcal{P},X) with law \mathscr{L}, except when xx is the root, where it is distributed as (𝒫~,X~)(\tilde{\cal P},\tilde{X}) with law ~\tilde{\mathscr{L}} (more on that below).

An infection time σx\sigma_{x} can be assigned to all individuals inductively as follows, with the convention that σx=\sigma_{x}=\infty for individuals that are not infected. Suppose that σ\sigma_{\emptyset} is known (see below). Then, if σx<\sigma_{x}<\infty has been defined, let A1,,ANxA_{1},\dots,A_{N_{x}} denote the atoms of 𝒫x\mathcal{P}_{x} in increasing order. That is,

𝒫x=i=1NxδAi\mathcal{P}_{x}=\sum_{i=1}^{N_{x}}\delta_{A_{i}}

with A1<<ANxA_{1}<\dots<A_{N_{x}}. Set σxi=\sigma_{xi}=\infty for i>Nxi>N_{x}, and, independently for each iNxi\leq N_{x}, set

σxi={σx+Ai with probability c(σx+Ai) with probability 1c(σx+Ai),\sigma_{xi}=\begin{cases}\sigma_{x}+A_{i}&\text{ with probability $c(\sigma_{x}+A_{i})$}\\ \infty&\text{ with probability $1-c(\sigma_{x}+A_{i})$},\end{cases}

where we recall that (c(t);t0)(c(t);\,t\geq 0) is the contact rate. This amounts to trimming the tree by pruning the subtree stemming from xx with probability 1c(σx)1-c(\sigma_{x}), see Figure 3.

Refer to caption
Figure 3: The initial individual (𝒫~,X~)(\tilde{\cal P},\tilde{X}) is represented by a black segment. In Section 2.1, we assume that at time t=0t=0, the age of the initial individual (length of the blue segment) is distributed according to a probability density gg. If a branching event is observed at time tt (see e.g., black dots), the infection occurs with probability c(t)c(t). In the CMJ, this amounts to prune the corresponding subtree with probability c(t)c(t) (dotted red tree).

Initial shifted law.

In order to connect the distribution of the CMJ to the McKendrick-von Foerster equation, we allow the progenitor to have an initial age with an arbitrary distribution. Let AA be a r.v. distributed according to some density gg. Define the infection time of \emptyset as σ=A\sigma_{\emptyset}=-A. The secondary infections induced by the progenitor occur at some times σ+A~1\sigma_{\emptyset}+\widetilde{A}_{1}, …, σ+A~N~\sigma_{\emptyset}+\widetilde{A}_{\widetilde{N}}, where (A~1,,A~N~)(\widetilde{A}_{1},\dots,\widetilde{A}_{\widetilde{N}}) are the atoms of a point process 𝒫~\widetilde{\mathcal{P}} defined as

𝒫~=Ai𝒫𝟙{Ai>A}δAi,\widetilde{\mathcal{P}}=\sum_{A_{i}\in\mathcal{P}}\mathbbm{1}_{\{A_{i}>A\}}\delta_{A_{i}},

where the pair (𝒫,X)(\mathcal{P},X) has law \mathscr{L}. The point process 𝒫~\widetilde{\mathcal{P}} is obtained from 𝒫\mathcal{P} by erasing all atoms that would lead to an infection before t=0t=0. Define X~=X\widetilde{X}=X_{\emptyset}, and let ~\tilde{\mathscr{L}} be the distribution of (𝒫~,X~)(\widetilde{\mathcal{P}},\widetilde{X}). We refer to ~\tilde{\mathscr{L}} as the initial shifted law. The infection times (σx;x𝒰{})(\sigma_{x};\,x\in\mathscr{U}\setminus\{\emptyset\}) are then defined recursively as above, from i.i.d. pairs (𝒫x,Xx;x𝒰{})(\mathcal{P}_{x},X_{x};\,x\in\mathscr{U}\setminus\{\emptyset\}) with the original law \mathscr{L}.

Assumptions.

The following assumptions will be made implicitly in the remainder of our work. For simplicity, we assume that the contact rate (c(t);t0)(c(t);\,t\geq 0) is a piecewise continuous function, and that for any a0a\geq 0, the process (X(a);a0)(X(a);\,a\geq 0) is a.s. continuous at aa.

Recall that the intensity measure of the point process 𝒫\mathcal{P} is denoted by τ\tau, and implicitly defined as

τ,f=𝔼[𝒫,f]\langle\tau,f\rangle=\mathbb{E}\big{[}\langle\mathcal{P},f\rangle\big{]}

for any test function ff, where we used the notation μ,f=fdμ\langle\mu,f\rangle=\int f\mathrm{d}\mu. We assume that τ\tau has a density w.r.t. the Lebesgue measure that we still denote by (τ(a);a0)(\tau(a);\,a\geq 0), and assume that

R00τ(a)da<.R_{0}\coloneqq\int_{0}^{\infty}\tau(a)\mathrm{d}a<\infty.

We also assume that there exists a unique parameter α\alpha\in\mathbb{R}, the so-called Malthusian parameter of the (untrimmed) CMJ process, such that

0exp(αa)τ(a)da=1.\int_{0}^{\infty}\exp(-\alpha a)\tau(a)\mathrm{d}a=1. (9)

The parameter α\alpha can be either positive (supercritical case) or negative (subcritical case).

2.2 McKendrick-von Foerster PDE: Weak solutions

This section provides the existence and uniqueness of weak solutions to Eq. (5). Even if similar results are well-known for the time-homogeneous McKendrick-von Foerster equation [30, Chapter 1], we derive them briefly here for the sake of completeness.

In order to motivate our definition of weak solutions, we start by giving a well-known formal resolution of the PDE using the method of characteristics. Fix a>0a>0. Let

A(t)=atA(t)=a-t

Then

ddsn(ts,A(s))=tn(ts,A(s))an(ts,A(s))=0,\displaystyle\frac{\mathrm{d}}{\mathrm{d}s}n(t-s,A(s))=-\partial_{t}n(t-s,A(s))-\partial_{a}n(t-s,A(s))=0,

so that sn(ts,as)s\mapsto n(t-s,a-s) is conserved along the characteristics, i.e.,

s<a,n(t,a)=n(ts,as).\forall s<a,\ \ n(t,a)=n(t-s,a-s).

It follows that

n(t,a)={g(at)when a>tb(ta)when at\displaystyle n(t,a)=\begin{cases}g(a-t)&\text{when $a>t$}\\ b(t-a)&\text{when $a\leq t$}\end{cases} (10)

where

b(t)=c(t)0n(t,a)τ(a)dab(t)=c(t)\int_{0}^{\infty}n(t,a)\tau(a)\mathrm{d}a

is the number of new infections at time tt. We now determine the function bb. Injecting the previous expression into the “age” boundary condition of the PDE, we obtain a convolution equation for bb: for every t>0t>0

b(t)=c(t)0tb(ta)τ(a)da+c(t)tg(at)τ(a)da.b(t)=c(t)\int_{0}^{t}b(t-a)\tau(a)\mathrm{d}a+c(t)\int_{t}^{\infty}g(a-t)\tau(a)\mathrm{d}a. (11)

Recall that α\alpha denotes the Malthusian parameter defined in (9).

Lemma 4.

There exists a unique solution bb to (11) which is locally integrable. Moreover, for any δ0\delta\geq 0 such that δ>α\delta>\alpha we have b1,δb\in\mathscr{L}^{1,\delta}, where 1,δ\mathscr{L}^{1,\delta} denotes the set of all functions f:+f\colon\mathbb{R}_{+}\to\mathbb{R} such that fL1,δ0eδt|f(t)|dt<\lVert f\rVert_{L^{1,\delta}}\coloneqq\int_{0}^{\infty}e^{-\delta t}\lvert f(t)\rvert\mathrm{d}t<\infty.

Proof.

Fix δ>α\delta>\alpha and let L1,δL^{1,\delta} denote the quotient space of 1,δ\mathscr{L}^{1,\delta} by the relation δ\sim_{\delta}, where fδgf\sim_{\delta}g if fgL1,δ=0\lVert f-g\rVert_{L^{1,\delta}}=0. Then define the linear operator Φ:L1,δL1,δ\Phi\colon L^{1,\delta}\to L^{1,\delta} by

Φf:tc(t)0tf(tu)τ(u)du.\Phi f\colon t\mapsto c(t)\int_{0}^{t}f(t-u)\tau(u)\mathrm{d}u.

Then we have

ΦfL1,δ\displaystyle\lVert\Phi f\rVert_{L^{1,\delta}} =0eδtΦf(t)dt=0eδtc(t)0tf(tu)τ(u)dudt\displaystyle=\int_{0}^{\infty}e^{-\delta t}\Phi f(t)\mathrm{d}t=\int_{0}^{\infty}e^{-\delta t}c(t)\int_{0}^{t}f(t-u)\tau(u)\mathrm{d}u\mathrm{d}t
=0eδuf(u)uτ(tu)eδ(tu)c(t)dtdu.\displaystyle=\int_{0}^{\infty}e^{-\delta u}f(u)\int_{u}^{\infty}\tau(t-u)e^{-\delta(t-u)}c(t)\mathrm{d}t\mathrm{d}u.

Now using that

uτ(tu)eδ(tu)c(t)dt0τ(t)eδtdt<1\int_{u}^{\infty}\tau(t-u)e^{-\delta(t-u)}c(t)\mathrm{d}t\leq\int_{0}^{\infty}\tau(t)e^{-\delta t}\mathrm{d}t<1

we obtain that Φ<1\lVert\Phi\rVert<1. Define

ΨIdΦ.\Psi\coloneqq\operatorname{Id}-\Phi.

Then Ψ\Psi is invertible with inverse k0Φk\sum_{k\geq 0}\Phi^{k}. Note that equation (11) can be written as

Ψ(b)=F,\Psi(b)=F,

where

F:tc(t)tτ(a)g(at)da.F\colon t\mapsto c(t)\int_{t}^{\infty}\tau(a)g(a-t)\mathrm{d}a.

Noting that FL1,δF\in L^{1,\delta} as

0eδtF(t)dt0tτ(a)g(at)dadt<\int_{0}^{\infty}e^{-\delta t}F(t)\mathrm{d}t\leq\int_{0}^{\infty}\int_{t}^{\infty}\tau(a)g(a-t)\mathrm{d}a\mathrm{d}t<\infty

proves existence and uniqueness of the solution bb to (11) in L1,δL^{1,\delta}. Now for any two functions b1b_{1} and b2b_{2} such that b1δb2b_{1}\sim_{\delta}b_{2} and b1b_{1} and b2b_{2} both satisfy (11), we have b1=b2b_{1}=b_{2} (i.e., there is a single element in the equivalence class of bb verifying (11) for all tt). This shows uniqueness of the solution bb to (11) in 1,δ\mathscr{L}^{1,\delta}.

Since all elements of 1,δ\mathscr{L}^{1,\delta} are locally integrable, this also shows the existence of a locally integrable solution to (11). Its uniqueness can be proved following the exact same reasoning as previously, replacing integrations on [0,)[0,\infty) by integration on compact intervals. ∎

Definition 5.

We say that (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) is the weak solution to the McKendrick-von Foerster PDE with initial condition gg if it satisfies the relation (10) where (b(t);t0)(b(t);\,t\geq 0) is the unique locally integrable solution to (11) displayed in the previous lemma.

2.3 A forward Feynman-Kac formula

Consider a CMJ with initial shifted law and define

Z(t)x𝟙{σx(0,t]},B(t)𝔼(Z(t))Z(t)\coloneqq\sum_{x}\mathbbm{1}_{\{\sigma_{x}\in(0,t]\}},\quad B(t)\coloneqq\mathbb{E}\big{(}Z(t)\big{)}

where Z(t)Z(t) is interpreted as the number of infections between 0 and tt. Recall that R0=0τ(u)du<R_{0}=\int_{0}^{\infty}\tau(u)\mathrm{d}u<\infty guarantees that B(t)<B(t)<\infty for all t0t\geq 0. Finally, BB is non-decreasing and we denote by dB\mathrm{d}B the Stieljes measure associated to BB.

Lemma 6.

There exists a locally integrable function (b(t);t0)(b(t);\,t\geq 0) such that

dB(t)=b(t)dt.\mathrm{d}B(t)=b(t)\mathrm{d}t.

Further, bb coincides with the unique locally integrable solution of the convolution equation (11).

Proof.

The fact that dB\mathrm{d}B has a density easily follows from the fact that τ\tau has a density. The fact that B(t)<B(t)<\infty ensures that bb is locally integrable.

Define 𝒫¯x\bar{\mathcal{P}}_{x} the infection measure obtained from 𝒫x\mathcal{P}_{x} after random thinning by the function (c(t);t0)(c(t);\,t\geq 0). Namely, conditional on σx\sigma_{x} and the atoms A1<A2<A_{1}<A_{2}<\cdots of 𝒫x\mathcal{P}_{x}, we remove independently each of the atoms with respective probabilities 1c(σx+A1),1c(σx+A2),1-c(\sigma_{x}+A_{1}),1-c(\sigma_{x}+A_{2}),\dots, whereas the other atoms remain unchanged.

Fix t>0t>0. Let knk\leq n\in{\mathbb{N}}. Define 𝕋k,n(𝒫x){\mathbb{T}}^{k,n}(\mathcal{P}_{x}) as the measure obtained from 𝒫x\mathcal{P}_{x} as follows. Conditional on the atoms A1<A2<A_{1}<A_{2}<\cdots of 𝒫x\mathcal{P}_{x}, we remove independently each of the atoms with respective probabilities

1maxz(tkn,tk+1n]c(z+A1),1maxz(tkn,tk+1n]c(z+A2),1-\max_{z\in(t\frac{k}{n},t\frac{k+1}{n}]}c(z+A_{1}),1-\max_{z\in(t\frac{k}{n},t\frac{k+1}{n}]}c(z+A_{2}),\cdots

and leave other atoms unchanged. Note that the thinning procedure is now independent of the starting time σx\sigma_{x}. Further, if σx(tkn,tk+1n]\sigma_{x}\in(t\frac{k}{n},t\frac{k+1}{n}], the point measure 𝕋k,n(𝒫x){\mathbb{T}}^{k,n}(\mathcal{P}_{x}) dominates 𝒫¯x\bar{\mathcal{P}}_{x}.

We decompose the births on (0,t](0,t] into two parts: individuals stemming from the root \emptyset and a second part from subsequent births. Using the fact that for every individual xx, the (un-suppressed) random measure 𝒫x\mathcal{P}_{x} is independent of its birth time σx\sigma_{x} (see second equality below), and setting M(t)0t0g(a)τ(a+u)c(u)daduM(t)\coloneqq\int_{0}^{t}\int_{0}^{\infty}g(a)\tau(a+u)c(u)\mathrm{d}a\mathrm{d}u, we get

B(t)\displaystyle B(t) =k=0n1x𝔼(𝟙(σx(tkn,tk+1n])0tσx𝒫¯x(da))+M(t)\displaystyle=\sum_{k=0}^{n-1}\sum_{x\neq\emptyset}\mathbb{E}\bigg{(}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}t\frac{k}{n},t\frac{k+1}{n}\Big{]}\Big{)}\int_{0}^{t-\sigma_{x}}\bar{\cal P}_{x}(\mathrm{d}a)\bigg{)}+M(t)
k=0n1x𝔼(𝟙(σx(tkn,tk+1n])0ttkn𝕋k,n(𝒫x)(da))+M(t)\displaystyle\leq\sum_{k=0}^{n-1}\sum_{x\neq\emptyset}\mathbb{E}\bigg{(}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}t\frac{k}{n},t\frac{k+1}{n}\Big{]}\Big{)}\int_{0}^{t-t\frac{k}{n}}\mathbb{T}^{k,n}(\mathcal{P}_{x})(\mathrm{d}a)\bigg{)}+M(t)
=k=0n1x𝔼(𝟙(σx(tkn,tk+1n]))𝔼(0ttkn𝕋k,n(𝒫)(da))+M(t)\displaystyle=\sum_{k=0}^{n-1}\sum_{x\neq\emptyset}\mathbb{E}\bigg{(}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}t\frac{k}{n},t\frac{k+1}{n}\Big{]}\Big{)}\bigg{)}\mathbb{E}\bigg{(}\int_{0}^{t-t\frac{k}{n}}\mathbb{T}^{k,n}(\mathcal{P})(\mathrm{d}a)\bigg{)}+M(t)
=k=0n1𝔼(x𝟙(σx(tkn,tk+1n]))𝔼(0ttkn𝕋k,n(𝒫)(da))+M(t)\displaystyle=\sum_{k=0}^{n-1}\mathbb{E}\bigg{(}\sum_{x\neq\emptyset}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}t\frac{k}{n},t\frac{k+1}{n}\Big{]}\Big{)}\bigg{)}\mathbb{E}\bigg{(}\int_{0}^{t-t\frac{k}{n}}\mathbb{T}^{k,n}(\mathcal{P})(\mathrm{d}a)\bigg{)}+M(t)
=k=0n1(B(tk+1n)B(tkn))𝔼(0ttkn𝕋k,n(𝒫)(da))+M(t)\displaystyle=\sum_{k=0}^{n-1}\bigg{(}B\Big{(}t\frac{k+1}{n}\Big{)}-B\Big{(}t\frac{k}{n}\Big{)}\bigg{)}\mathbb{E}\bigg{(}\int_{0}^{t-t\frac{k}{n}}\mathbb{T}^{k,n}(\mathcal{P})(\mathrm{d}a)\bigg{)}+M(t)
=k=0n1(B(tk+1n)B(tkn))0ttknck,n(u)τ(u)du+M(t).\displaystyle=\sum_{k=0}^{n-1}\bigg{(}B\Big{(}t\frac{k+1}{n}\Big{)}-B\Big{(}t\frac{k}{n}\Big{)}\bigg{)}\int_{0}^{t-t\frac{k}{n}}c^{k,n}(u)\tau(u)\mathrm{d}u+M(t).

with ck,n(y)=maxv(tkn,tk+1n]c(y+v)c^{k,n}(y)=\max_{v\in(t\frac{k}{n},t\frac{k+1}{n}]}c(y+v). In particular, if tk/nxtk/n\to x, and x+yx+y is a continuity point of cc, we have ck,n(y)c(x+y)c^{k,n}(y)\to c(x+y). We will pass to the limit nn\to\infty in the latter inequality. Recall that cc is bounded (and valued in [0,1][0,1]) and right-continuous. The first term on the RHS can be written under the form

k=0n1(B(tk+1n)B(tkn))0ttknck,n(u)τ(u)du=0tf(n)(y)dB(y),\sum_{k=0}^{n-1}\bigg{(}B\Big{(}t\frac{k+1}{n}\Big{)}-B\Big{(}t\frac{k}{n}\Big{)}\bigg{)}\int_{0}^{t-t\frac{k}{n}}c^{k,n}(u)\tau(u)\mathrm{d}u=\int_{0}^{t}f^{(n)}(y)\mathrm{d}B(y),

where

f(n)(y)=0t[y]nτ(u)supv([y]n,[y]n+tn]c(v+u)duand[y]n=tnny/t.f^{(n)}(y)=\int_{0}^{t-[y]_{n}}\tau(u)\sup_{v\in([y]_{n},\;[y]_{n}+\frac{t}{n}]}c(v+u)\mathrm{d}u\quad\text{and}\quad[y]_{n}=\frac{t}{n}\lfloor ny/t\rfloor.

We will now apply twice the Bounded Convergence Theorem. On the one hand, for a fixed value of yy, as nn\to\infty

𝟙[0,t[y]n](u)τ(u)supv([y]n,[y]n+tn]c(v+u)𝟙[0,ty](u)τ(u)c(y+u)Lebesgue a.e.\mathbbm{1}_{[0,t-[y]_{n}]}(u)\tau(u)\sup_{v\in([y]_{n},[y]_{n}+\frac{t}{n}]}c(v+u)\longrightarrow\mathbbm{1}_{[0,t-y]}(u)\tau(u)c(y+u)\quad\text{Lebesgue a.e.}

Further, the latter term (i.e., the integrand in the integral defining f(n)f^{(n)}) is uniformly bounded by τ\tau and 0τ(u)du<\int_{0}^{\infty}\tau(u)\mathrm{d}u<\infty. A first application of the Bounded Convergence Theorem implies that for every yy, as nn\to\infty

f(n)(y)0tyc(y+u)τ(u)du.f^{(n)}(y)\to\int_{0}^{t-y}c(y+u)\tau(u)\mathrm{d}u.

On the other hand, the uniform bound, f(n)(y)R0=0τ(u)duf^{(n)}(y)\leq R_{0}=\int_{0}^{\infty}\tau(u)\mathrm{d}u for all y,ny,n, allows us to again apply the Bounded Convergence Theorem, so we get

B(t)0tb(y)0tyc(y+u)τ(u)dudy+0t0g(a)τ(a+u)c(u)dadu.\displaystyle B(t)\leq\int_{0}^{t}b(y)\int_{0}^{t-y}c(y+u)\tau(u)\mathrm{d}u\mathrm{d}y+\int_{0}^{t}\int_{0}^{\infty}g(a)\tau(a+u)c(u)\mathrm{d}a\mathrm{d}u.

By replacing the max\max by a min\min and using a similar argument, one can establish the same lower bound and strengthen the latter inequality into an equality. A simple change of variable v=u+yv=u+y and interchanging the order of integration yields

B(t)=0tc(v)0vτ(vy)b(y)dydv+0t0g(a)τ(a+u)c(u)dadu.B(t)=\int_{0}^{t}c(v)\int_{0}^{v}\tau(v-y)b(y)\mathrm{d}y\mathrm{d}v+\int_{0}^{t}\int_{0}^{\infty}g(a)\tau(a+u)c(u)\mathrm{d}a\mathrm{d}u.

Finally, differentiating with respect to tt yields the desired result. ∎

Corollary 7 (Forward Feynman-Kac formula).

For every t0t\geq 0, define

μ¯t(da×{i})n(t,a)×(X(a)=i)da,\bar{\mu}_{t}(\mathrm{d}a\times\{i\})\coloneqq n(t,a)\times\mathbb{P}(X(a)=i)\mathrm{d}a,

where nn is the unique weak solution to the McKendrick-von Foerster PDE with initial condition gg. Then

μ¯t(da×{i})=𝔼(x𝟙{σx<t}δ(tσx,Xx(tσx))(da×{i}))\bar{\mu}_{t}(\mathrm{d}a\times\{i\})=\mathbb{E}\Big{(}\sum_{x}\mathbbm{1}_{\{\sigma_{x}<t\}}\delta_{(t-\sigma_{x},X_{x}(t-\sigma_{x}))}(\mathrm{d}a\times\{i\})\Big{)} (12)

where the expected value is taken with respect to a CMJ process starting with one individual with infection and life-process distributed according to the shifted law ~g\tilde{\mathscr{L}}_{g}.

Proof.

Define

μ¯t(da×{i})𝔼(x𝟙{σx<t}δ(tσx,Xx(tσx))(da×{i}))\bar{\mu}^{\prime}_{t}(\mathrm{d}a\times\{i\})\coloneqq\mathbb{E}\Big{(}\sum_{x}\mathbbm{1}_{\{\sigma_{x}<t\}}\delta_{\left(t-\sigma_{x},X_{x}(t-\sigma_{x})\right)}(\mathrm{d}a\times\{i\})\Big{)}

We need to check that μ¯t=μ¯t\bar{\mu}^{\prime}_{t}=\bar{\mu}_{t} on the space of finite measures. Let FF be a non-negative, bounded, continuous function on +×𝒮\mathbb{R}_{+}\times\mathcal{S} and hh a non-negative, continuous function with compact support in +\mathbb{R}_{+}. As in the previous lemma, we have

0h(t)F(a,i)μ¯t(da,di)dt\displaystyle\int_{0}^{\infty}h(t)\int F(a,i)\bar{\mu}^{\prime}_{t}(\mathrm{d}a,\mathrm{d}i)\mathrm{d}t =x𝔼(0h(t)F(tσx,Xx(tσx))𝟙{σx<t}dt)\displaystyle=\sum_{x\neq\emptyset}\mathbb{E}\Big{(}\int_{0}^{\infty}h(t)F\big{(}t-\sigma_{x},X_{x}(t-\sigma_{x})\big{)}\mathbbm{1}_{\{\sigma_{x}<t\}}\mathrm{d}t\Big{)}
+00h(t)𝔼(F(t+a,X(t+a))g(a)dadt.\displaystyle+\int_{0}^{\infty}\int_{0}^{\infty}h(t)\mathbb{E}\big{(}F(t+a,X(t+a)\big{)}g(a)\mathrm{d}a\mathrm{d}t.

Let (I)(I) be the first term on the RHS. For every nn\in{\mathbb{N}}^{*}

(I)\displaystyle(I) =k0x𝔼(σxh(t)F(tσx,Xx(tσx))𝟙(σx(kn,k+1n])dt)\displaystyle=\sum_{k\geq 0}\sum_{x\neq\emptyset}\mathbb{E}\bigg{(}\int_{\sigma_{x}}^{\infty}h(t)F\big{(}t-\sigma_{x},X_{x}(t-\sigma_{x})\big{)}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}\frac{k}{n},\frac{k+1}{n}\Big{]}\Big{)}\mathrm{d}t\bigg{)}
=k0x𝔼(0h(t+σx)F(t,Xx(t))𝟙(σx(kn,k+1n])dt)\displaystyle=\sum_{k\geq 0}\sum_{x\neq\emptyset}\mathbb{E}\bigg{(}\int_{0}^{\infty}h(t+\sigma_{x})F\big{(}t,X_{x}(t)\big{)}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}\frac{k}{n},\frac{k+1}{n}\Big{]}\Big{)}\mathrm{d}t\bigg{)}
k0x𝔼(0maxu(kn,k+1n]h(t+u)F(t,Xx(t))𝟙(σx(kn,k+1n])dt)\displaystyle\leq\sum_{k\geq 0}\sum_{x\neq\emptyset}\mathbb{E}\bigg{(}\int_{0}^{\infty}\max_{u\in(\frac{k}{n},\frac{k+1}{n}]}h(t+u)F\big{(}t,X_{x}(t)\big{)}\mathbbm{1}\Big{(}\sigma_{x}\in\Big{(}\frac{k}{n},\frac{k+1}{n}\Big{]}\Big{)}\mathrm{d}t\bigg{)}
=k0x0maxu(kn,k+1n]h(t+u)𝔼(F(t,X(t)))(σx(kn,k+1n])dt\displaystyle=\sum_{k\geq 0}\sum_{x\neq\emptyset}\int_{0}^{\infty}\max_{u\in(\frac{k}{n},\frac{k+1}{n}]}h(t+u)\mathbb{E}\bigg{(}F\big{(}t,X(t)\big{)}\bigg{)}\mathbb{P}\bigg{(}\sigma_{x}\in\Big{(}\frac{k}{n},\frac{k+1}{n}\Big{]}\bigg{)}\mathrm{d}t
=k00maxu(kn,k+1n]h(t+u)𝔼(F(t,X(t)))(B(k+1n)B(kn))dt.\displaystyle=\sum_{k\geq 0}\int_{0}^{\infty}\max_{u\in(\frac{k}{n},\frac{k+1}{n}]}h(t+u)\mathbb{E}\bigg{(}F\big{(}t,X(t)\big{)}\bigg{)}\bigg{(}B\Big{(}\frac{k+1}{n}\Big{)}-B\Big{(}\frac{k}{n}\Big{)}\bigg{)}\mathrm{d}t.

By reasoning along the same lines as in Lemma 6 (i.e., applying the Bounded Convergence Theorem several times), one can show that the RHS converges to

00h(t+y)𝔼(F(t,X(t)))b(y)dtdy\int_{0}^{\infty}\int_{0}^{\infty}h(t+y)\mathbb{E}\Big{(}F\big{(}t,X(t)\big{)}\Big{)}b(y)\mathrm{d}t\mathrm{d}y

as nn\to\infty and thus

0h(t)F(a,i)μ¯t(da,di)dt\displaystyle\int_{0}^{\infty}h(t)\int F(a,i)\bar{\mu}^{\prime}_{t}(\mathrm{d}a,\mathrm{d}i)\mathrm{d}t 00h(t+y)𝔼(F(t,X(t)))b(y)dtdy\displaystyle\leq\int_{0}^{\infty}\int_{0}^{\infty}h(t+y)\mathbb{E}\Big{(}F\big{(}t,X(t)\big{)}\Big{)}b(y)\mathrm{d}t\mathrm{d}y
+0h(t)0𝔼(F(t+a,X(t+a)))g(a)dadt.\displaystyle\qquad+\int_{0}^{\infty}h(t)\int_{0}^{\infty}\mathbb{E}\Big{(}F\big{(}t+a,X(t+a)\big{)}\Big{)}g(a)\mathrm{d}a\mathrm{d}t.

By a similar argument, the inequality can be strengthened into an equality. After some simple changes of variables we get

0h(t)F(a,i)μ¯s(da,di)dt\displaystyle\int_{0}^{\infty}h(t)\int F(a,i)\bar{\mu}^{\prime}_{s}(\mathrm{d}a,\mathrm{d}i)\mathrm{d}t =0h(t)0t𝔼(F(a,X(a)))b(ta)dadt\displaystyle=\int_{0}^{\infty}h(t)\int_{0}^{t}\mathbb{E}\Big{(}F\big{(}a,X(a)\big{)}\Big{)}b(t-a)\mathrm{d}a\mathrm{d}t
+0h(t)0𝔼(F(t+a,X(t+a)))g(a)dadt.\displaystyle\qquad+\int_{0}^{\infty}h(t)\int_{0}^{\infty}\mathbb{E}\Big{(}F\big{(}t+a,X(t+a)\big{)}\Big{)}g(a)\mathrm{d}a\mathrm{d}t.

Moreover we have

0h(t)F(a,i)μ¯t(da,di)dt\displaystyle\int_{0}^{\infty}h(t)\int F(a,i)\bar{\mu}_{t}(\mathrm{d}a,\mathrm{d}i)\mathrm{d}t =i𝒮0h(t)0F(a,i)n(t,a)p(a,i)dadt\displaystyle=\sum_{i\in\mathcal{S}}\int_{0}^{\infty}h(t)\int_{0}^{\infty}F(a,i)n(t,a)p(a,i)\mathrm{d}a\mathrm{d}t
=i𝒮[0h(t)0tF(a,i)b(ta)p(a,i)dadt+0h(t)tF(a,i)g(at)p(a,i)dadt]\displaystyle=\begin{multlined}\sum_{i\in\mathcal{S}}\bigg{[}\int_{0}^{\infty}h(t)\int_{0}^{t}F(a,i)b(t-a)p(a,i)\mathrm{d}a\mathrm{d}t\\ \qquad\qquad+\int_{0}^{\infty}h(t)\int_{t}^{\infty}F(a,i)g(a-t)p(a,i)\mathrm{d}a\mathrm{d}t\bigg{]}\end{multlined}\sum_{i\in\mathcal{S}}\bigg{[}\int_{0}^{\infty}h(t)\int_{0}^{t}F(a,i)b(t-a)p(a,i)\mathrm{d}a\mathrm{d}t\\ \qquad\qquad+\int_{0}^{\infty}h(t)\int_{t}^{\infty}F(a,i)g(a-t)p(a,i)\mathrm{d}a\mathrm{d}t\bigg{]}

so that

0h(t)F(a,i)μ¯t(da,di)dt=0h(t)F(a,i)μ¯t(da,di)dt.\int_{0}^{\infty}h(t)\int F(a,i)\bar{\mu}_{t}(\mathrm{d}a,\mathrm{d}i)\mathrm{d}t=\int_{0}^{\infty}h(t)\int F(a,i)\bar{\mu}^{\prime}_{t}(\mathrm{d}a,\mathrm{d}i)\mathrm{d}t.

It is easy to check that the two functions tμ¯t,Ft\mapsto\langle\bar{\mu}_{t},F\rangle and tμ¯t,Ft\mapsto\langle\bar{\mu}^{\prime}_{t},F\rangle are both continuous. As a consequence, we have μ¯t,F=μ¯t,F\langle\bar{\mu}_{t},F\rangle=\langle\bar{\mu}^{\prime}_{t},F\rangle for every test function FF, concluding the proof. ∎

2.4 Dual CMJ process and backward Feynman-Kac formula

We end this section by making a connection between a dual process – interpreted as an ancestral process – and the (PDE) method of characteristics. In addition, this approach provides a probabilistic proof of uniqueness for the PDE.

Let \mathcal{M} be any random point measure with intensity measure τ(du)\tau(\mathrm{d}u). Fix a,T>0a,T>0. We now construct a dual process using the measure \mathcal{M}, which can be seen as a generalized Bellman-Harris branching process (individuals have a finite lifetimes, births only occur upon death). Let us first describe the process with no suppression (i.e., c1c\equiv 1).

  • Start with a single particle at time t=0t=0. Assume that the residual lifetime of this original particle is aa, so that this particle dies out at time aa.

  • As in a Bellman-Harris process, the number of offspring of an individual and their lifetime durations are independent of the parent’s characteristics.

  • Upon death, each individual xx is endowed with an independent copy x\mathcal{M}_{x} of \mathcal{M}: the number of offspring of xx is given by the number of atoms of x\mathcal{M}_{x} and their lifetime durations are given by the positions of the atoms in x\mathcal{M}_{x}.

The dual process with suppression c1c\not\equiv 1 can be coupled with the case c1c\equiv 1. Given a realization of the process, if a branching occurs at time tt, the children are killed independently with probability c(Tt)c(T-t). (Note that as in the original CMJ process, suppression translates into trimming the dual tree.)

Remark 4.

We note that there are as many dual processes as there are point processes with intensity measure τ\tau. Here are a few natural choices:

  1. 1.

    Take =𝒫\mathcal{M}=\mathcal{P}.

  2. 2.

    Let \mathcal{M} be a Poisson Point Process with intensity measure τ(du)\tau(\mathrm{d}u). In this particular case, the dual process is a Bellman-Harris branching process (i.e., the offspring lifetime durations are independent conditional on offspring number). We note that τ(du)\tau(\mathrm{d}u) appears naturally when considering the ancestral spine of a critical CMJ, see e.g. [51]. The measure τ\tau can be obtained by size-biasing 𝒫\mathcal{P} (i.e. biasing by the total mass of 𝒫\mathcal{P}) and then recording the age of the individual at a uniformly chosen birth event.

Let (Yt;tT)(Y_{t};\,t\leq T) be the stochastic process valued in n+n\cup_{n\in{\mathbb{N}}}{\mathbb{R}}_{+}^{n} recording the residual life-times at time tt listed in increasing order, i.e. if Yt=(Yt(1),,Yt(n))Y_{t}=(Y_{t}^{(1)},\cdots,Y_{t}^{(n)}) there are nn particles alive at time tt and Yt(k)Y_{t}^{(k)} is the residual life-time of the kk-th individual with Yt(1)<<Yt(n)Y_{t}^{(1)}<\cdots<Y_{t}^{(n)}. (We assumed that τ\tau has a density so that the residual lifetimes are distinct a.s.) In particular, the particle labelled 11 at any given time tt will be the first to expire, and at death time t+Yt(1)t+Y_{t}^{(1)} a random number of children is produced. We let dim(Yt)\dim(Y_{t}) denote the number of particules alive at time tt, i.e., the dimension of the vector YtY_{t}.

Proposition 8 (Backward Feynman-Kac formula).

For any probability density gg, we have

n(T,a)=𝔼^a(idim(YT)g(YT(i)))n(T,a)=\widehat{\mathbb{E}}_{a}\Bigg{(}\sum_{i\leq\dim(Y_{T})}g(Y_{T}^{(i)})\Bigg{)} (13)

where (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) is the unique solution to the McKendrick-von Foerster equation started from gg, and 𝔼^a\widehat{\mathbb{E}}_{a} is the distribution of the process (Yt;tT)(Y_{t};\,t\leq T) starting with an individual with residual lifetime aa.

Proof.

Let t1<<tk<t_{1}<\cdots<t_{k}<\cdots be the successive branching times of the dual branching process. Since τ\tau has a density, there is a single branching particle at the successive branching times t1,t_{1},\dots Define the process

Zsidim(Ys)n(Ts,Ys(i))Z_{s}\coloneqq\sum_{i\leq\dim(Y_{s})}n(T-s,Y_{s}^{(i)})

See also Figure 4 for a pictorial representation of the process. It is plain from the definition that nn is preserved along the characteristics of the PDE, i.e., that for every xx the function sn(Ts,xs)s\to n(T-s,x-s) remains constant on [0,x)[0,x). As a consequence, (Zs;s0)(Z_{s};\,s\geq 0) remains constant on every interval [tn,tn+1)[t_{n},t_{n+1}), with the convention t0=0t_{0}=0. Define znZtnz_{n}\coloneqq Z_{t_{n}} the value of the process (Zt;t0)(Z_{t};\,t\geq 0) at the nn-th branching time. Let (n;n)(\mathcal{F}_{n};\,n\in{\mathbb{N}}) be the filtration induced by the process (zn;n)(z_{n};\,n\in{\mathbb{N}}). By definition of the dual process, we have

Ytn(i)=Ytn1(i)(tntn1)=Ytn1(i)Ytn1(1).Y_{t_{n}-}^{(i)}=Y_{t_{n-1}}^{(i)}-(t_{n}-t_{n-1})=Y_{t_{n-1}}^{(i)}-Y_{t_{n-1}}^{(1)}.

For every n>1n>1

𝔼^a(zn|n1)\displaystyle\widehat{\mathbb{E}}_{a}\left(z_{n}\ |\ \mathcal{F}_{n-1}\right) =2idim(zn1)n(Ttn,Ytn(i))+c(Ttn)𝔼[,n(Ttn,)]\displaystyle=\sum_{2\leq i\leq\dim(z_{n-1})}n(T-t_{n},Y_{t_{n}-}^{(i)})+c(T-t_{n})\mathbb{E}\big{[}\langle\mathcal{M},n(T-t_{n},\cdot)\rangle\big{]}
=2idim(zn1)n(Ttn,Ytn(i))+c(Ttn)0n(Ttn,a)τ(da)\displaystyle=\sum_{2\leq i\leq\dim(z_{n-1})}n(T-t_{n},Y_{t_{n}-}^{(i)})+c(T-t_{n})\int_{0}^{\infty}n(T-t_{n},a)\tau(\mathrm{d}a)
=2idim(zn1)n(Ttn,Ytn(i))+n(Ttn,0)\displaystyle=\sum_{2\leq i\leq\dim(z_{n-1})}n(T-t_{n},Y_{t_{n}-}^{(i)})+n(T-t_{n},0)
=zn1,\displaystyle=z_{n-1},

where the third equality follows from the age boundary of the McKendrick-von Foerster equation, and the last equality from the identity n(ts,as)=n(t,a)n(t-s,a-s)=n(t,a). As already mentioned, the process (Zs;s0)(Z_{s};\,s\geq 0) is constant between two branching times. As a consequence, (Zs;s0)(Z_{s};\,s\geq 0) is a martingale (w.r.t. its natural fltration) so for every s0s\geq 0,

n(T,a)=𝔼^a(idim(Ys)n(Ts,Ys(i))).n(T,a)=\widehat{\mathbb{E}}_{a}\bigg{(}\sum_{i\leq\dim(Y_{s})}n(T-s,Y_{s}^{(i)})\bigg{)}.

Relation (13) follows by taking s=Ts=T in the latter expression. ∎

Refer to caption
Figure 4: Graphical representation of the process (Zs;sT)(Z_{s};\,s\leq T), where s=Tts=T-t. We start with a single individual with residual lifetime aa. In this picture, time flows downwards for the branching process. The residual lifetime of the initial individual decreases linearly at speed one until reaching 0 (this corresponds to time Tt1T-t_{1} in our representation). At this time, the particle dies and produces 2 red particules. Residual lifetimes travel along the characteristics of the McKendrick-von Foerster PDE until reaching the spatial boundary condition {a=0}\{a=0\} where a new branching occurs.

3 Proofs of the main results

In this section, we provide the proofs of the two laws of large numbers stated in Section 1.4. The proof of Theorem 1 is a direct consequence of the results of the previous section.

Proof of Theorem 1.

Recall the definition of the empirical measure μtN\mu^{N}_{t}. It can be written as

μtN(da×{i})=1Nk=1Nμt1,(k)(da×{i})\mu^{N}_{t}(\mathrm{d}a\times\{i\})=\frac{1}{N}\sum_{k=1}^{N}\mu^{1,(k)}_{t}(\mathrm{d}a\times\{i\}) (14)

where (μt1,(k);k1)(\mu^{1,(k)}_{t};\,k\geq 1) are independent copies of μt1\mu^{1}_{t}. Let ff be an arbitrary continuous and bounded function on +×𝒮\mathbb{R}_{+}\times\mathcal{S}. The law of large numbers combined with Corollary 7 implies that

1NμtN,fμ¯t,fa.s.\big{\langle}\frac{1}{N}\mu^{N}_{t},f\big{\rangle}\longrightarrow\langle\bar{\mu}_{t},f\rangle\quad\text{a.s.}

which ends the proof. ∎

We now turn our attention to the proof of Theorem 2, the law of large number started from a single individual. Let us briefly recall the setting of this result.

We consider a sequence (tK;K0)(t_{K};\,K\geq 0) of random times with tKt_{K}\to\infty a.s. on the non-extinction event. We assume that the process starts from a single individual infected at time t=0t=0 and that the contact rate cKc^{K} of the CMJ depends on KK in the following way: cK(t)C(ttK)c^{K}(t)\coloneqq C(t-t_{K}), where (C(t);t)(C(t);\,t\in\mathbb{R}) is a piecewise continuous function in [0,1][0,1] such that C(t)=1C(t)=1 for all t<0t<0.

Proof of Theorem 2.

The result will follow by viewing the population at time tK+tt_{K}+t as an adequate random characteristic of the population at time tKt_{K}. Let us recall some basic facts about random characteristics of CMJ processes in our context. We refer to [35, 54] for a more thorough account on this notion.

We consider a plain CMJ with no contact rate or initial shifted law. Every individual is characterized by an independent pair of random variables (𝒫x,Xx)(\mathcal{P}_{x},X_{x}). A random characteristic is a real-valued stochastic process (χ(a);a0)(\chi(a);\,a\geq 0) that can be constructed from the collection (𝒫x,Xx;x𝒰)(\mathcal{P}_{x},X_{x};\,x\in\mathscr{U}). (More formally it is a càdlàg process measurable w.r.t. the σ\sigma-field induced by these variables.) By convention, it is extended to a process defined on the whole real line by setting χ(a)0\chi(a)\coloneqq 0 for a<0a<0.

For an individual x𝒰x\in\mathscr{U} let us write χx\chi_{x} for the random characteristic constructed from the collection (𝒫xy,Xxy;y𝒰)(\mathcal{P}_{xy},X_{xy};\,y\in\mathscr{U}). It is the characteristic constructed from the tree rooted at xx of all descendants of xx. The branching process counted by the random characteristic χ\chi is then defined as

Zχ(t)=x𝒰χx(tσx).Z^{\chi}(t)=\sum_{x\in\mathscr{U}}\chi_{x}(t-\sigma_{x}).

We now recall one of the main results of Jagers and Nerman [34], namely Theorem 5.8 (see also Theorem 4, Appendix A in [54]). Recall that α\alpha denotes the Malthusian parameter defined in (9). On top of all the assumptions above, we make the two following extra assumptions

  1. (a)

    The characteristic fulfills

    n0supnun+1eαu𝔼(χ(u))<.\sum_{n\geq 0}\sup_{n\leq u\leq n+1}e^{-\alpha u}\mathbb{E}(\chi(u))<\infty.
  2. (b)

    The map a𝔼(χ(a))a\mapsto\mathbb{E}(\chi(a)) is continuous a.e. with respect to the Lebesgue measure.

Then there exists a positive r.v. WW_{\infty} (independent of the choice of χ\chi) such that conditional on non-extinction

Zχ(t)exp(αt)W0αeαa𝔼(χ(a))dain probability as t.Z^{\chi}(t)\exp(-\alpha t)\to W_{\infty}\int_{0}^{\infty}\alpha e^{-\alpha a}\mathbb{E}(\chi(a))\mathrm{d}a\quad\text{in probability as $t\to\infty$.}

To illustrate the method, we recall that if we take χ(a)=𝟙+(a)\chi(a)=\mathbbm{1}_{{\mathbb{R}}_{+}}(a) then Zχ(t)Z^{\chi}(t) coincides with Z(t)Z(t), the total number of births before time tt. For this particular choice of (deterministic) characteristic, the two properties above are immediately satisfied (recall that α>0\alpha>0), so that conditional on non-extinction

x𝟙{σx<t}exp(αt)Win probability.\sum_{x}\mathbbm{1}_{\{\sigma_{x}<t\}}\exp(-\alpha t)\to W_{\infty}\quad\text{in probability}.

This convergence ensures that the first item of our theorem is satisfied.

To prove the second item, let us set

𝒫=i=1NδAi,\mathcal{P}_{\emptyset}=\sum_{i=1}^{N_{\emptyset}}\delta_{A_{i}},

for the atoms of the infection point process of the ancestor 𝒫\mathcal{P}_{\emptyset}. Further denote by μt(i)\mu^{(i)}_{t} the empirical measure of ages and compartments at time tt of the progeny of the ii-th child of the ancestor, thinned by the contact rate (c(t);t0)(c(t);\,t\geq 0), i.e.,

t0,μt(i)=x𝒰𝟙{σ~ix<t}δ(tσ~ix,Xix(tσ~ix)),\forall t\geq 0,\quad\mu^{(i)}_{t}=\sum_{x\in\mathscr{U}}\mathbbm{1}_{\{\widetilde{\sigma}_{ix}<t\}}\delta_{(t-\widetilde{\sigma}_{ix},X_{ix}(t-\widetilde{\sigma}_{ix}))},

where σ~x\widetilde{\sigma}_{x} refers to the infection time of xx once the tree has been thinned. (That is, σ~x=σx\widetilde{\sigma}_{x}=\sigma_{x} if xx remains infected after the thinning, or σ~=\widetilde{\sigma}=\infty otherwise.) Our characteristic of interest can now be defined as

χ(t,f)(a)=f(a+t,X(a+t))+i=1N𝟙{Ai[a,a+t]}μa+t(i),f.\chi^{(t,f)}(a)=f(a+t,X_{\emptyset}(a+t))+\sum_{i=1}^{N_{\emptyset}}\mathbbm{1}_{\{A_{i}\in[a,a+t]\}}\langle\mu^{(i)}_{a+t},f\rangle.

for a fixed time t0t\geq 0 and a fixed bounded continuous function ff. On the one hand, it should be clear that

Zχ(t,f)(tK)=(d)μtK+tK,f.Z^{\chi^{(t,f)}}(t_{K})\overset{\mathrm{(d)}}{=}\langle\mu^{K}_{t_{K}+t},f\rangle. (15)

To see this, note that the process Zχ(t,f)Z^{\chi^{(t,f)}} is obtained from a plain CMJ with no thinning, so that only infections after time tKt_{K} are removed due to the contact rate.

On the other hand, χ(t,f)(a)\chi^{(t,f)}(a) can be obtained by starting from an initial individual with age a-a, removing all atoms from its infection point process before time 0, and integrating the empirical measure of the resulting CMJ at time tt against ff. This is the description of the CMJ with initial shifted law ~\tilde{\mathscr{L}} conditional on A=aA=a, so that

𝔼(0g(a)χ(t,f)(a)da)=μ¯t,f,\mathbb{E}\Big{(}\int_{0}^{\infty}g(a)\chi^{(t,f)}(a)\mathrm{d}a\Big{)}=\langle\bar{\mu}_{t},f\rangle, (16)

where in μ¯t\bar{\mu}_{t} the age of the initial ancestor is distributed according to gg.

Therefore, up to checking (a) and (b), Theorem 5.8 in [35] shows that, as KK\to\infty,

μtK+tK,fW𝔼(0αeαaχ(t,f)(a)da)\langle\mu^{K}_{t_{K}+t},f\rangle\longrightarrow W_{\infty}\mathbb{E}\Big{(}\int_{0}^{\infty}\alpha e^{-\alpha a}\chi^{(t,f)}(a)\mathrm{d}a\Big{)}

in probability, which in combination with (16) proves the result.

All what remains to be shown is that (a) and (b) are fulfilled. That a𝔼(χ(t,f)(a))a\mapsto\mathbb{E}(\chi^{(t,f)}(a)) is continuous a.e. follows directly from the fact that τ\tau has a density. Condition (b) is a consequence of the following stochastic domination

χ(t,f)(a)f(1+i=1NZ(i)(t))\chi^{(t,f)}(a)\leq\lVert f\rVert_{\infty}\Big{(}1+\sum_{i=1}^{N_{\emptyset}}Z^{(i)}(t)\Big{)}

where (Z(i)(t);t0)(Z^{(i)}(t);\,t\geq 0) are i.i.d. copies of the CMJ without thinning, independent of 𝒫\mathcal{P}_{\emptyset}. ∎

4 Inference procedure

In this section, we illustrate how to use our framework to make inferences from macroscopic observables of the epidemic, e.g., incidence of positively tested patients, hospital or ICU (intensive care unit) admissions, deaths, etc. We show how to use those observables to extract the underlying age structure of the population, estimate model parameters, and forecast the future of the epidemic.

We focused on a longitudinal case study in France. From March 18 2020, the French government has provided daily reports of the numbers of ICU and hospital admissions, of deaths, of discharged patients, and of occupied ICU and hospital beds. Moreover, several theoretical studies have already been conducted on the same dataset. This allowed us to fix the values of some crucial biological parameters that had already been estimated and to carry out a comparison with our method. We want to emphasize that the aim of this section is to provide a mathematical framework in which convergence results can be rigorously proved while remaining flexible enough for other applications. Our goal is not to provide new estimates of epidemiological parameters for France, as many robust estimates are already available. For instance we do not provide confidence intervals for our estimates, and neither do we conduct a sensibility analysis.

The remainder of the section is laid out as follows. In Section 4.1 we identify the mathematical quantities that impact the dynamics of the epidemic for large population sizes, and show how to turn them into a likelihood. Section 4.2 then presents the choice of distribution we made for these quantities and the parameters that need to be estimated. Finally, estimation of these parameters from the French incidence data is performed in Section 4.3 and Section 4.4. We start by fitting a simple model in Section 4.3 and then show how this model can be made more complex to account for more complex data in Section 4.4.

4.1 Deriving the likelihood

Under the assumptions of Theorem 2, the number of individuals in a given state ii at time tt converges to

ni(t)=0n(t,a)p(a,i)da,n_{i}(t)=\int_{0}^{\infty}n(t,a)p(a,i)\mathrm{d}a, (17)

where (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) is the solution to (5). The required assumptions are in essence that the epidemic has been ongoing for a long enough time at the lockdown onset for the infected population to be large, which we assume to hold true for France as the number of cases on March 16 2020 was on the order of thousands of individuals.

Therefore, we take (17) as the predicted number of individuals in state ii in our model. In order to turn (17) into a likelihood, we assume that the observed number of individuals in state ii at time tt is distributed according to some discrete law centered on the predicted value, which we take to be a Poisson distribution. Then, the likelihood for the whole time period is obtained by assuming that the observations are independent across states and time. The explicit expression for the likelihood is provided in Section A.3.

Remark 5.

The assumption that observations are independent is obviously not met. For instance, the number of occupied hospital beds is cumulative, so that any error is propagated from one day to the other. Moreover, there is a clear weekly effect in data that is not accounted for here. As deriving robust estimates is not the main purpose of this work, we prefer to keep this independence assumption that leads to simple expressions for the likelihood, while being aware of its limitation. This assumption could be relaxed by modeling explicitly the observation process and its potential errors.

Remark 6.

We have decided to use an expression for the likelihood similar to that in [50, 53]. Such an expression only poorly accounts for the deviation of the stochastic model from its deterministic limit. Better accounting for this effect would require to use the likelihood of the stochastic model, or a Gaussian approximation of it obtained by deriving a functional central limit theorem. In our context the covariance structure of the population could be obtained by adapting the expressions in [34, Section 3] to incorporate the contact rate (c(t);t0)(c(t);\,t\geq 0). However the resulting expressions would be quite cumbersome and computationally costly so that we prefer to use our simpler Poisson likelihood.

Under our assumptions, the likelihood only depends on (17), which is in turn determined by four quantities that need to be parametrized:

  1. 1.

    The intensity measure of the infection point process (τ(a);a0)(\tau(a);\,a\geq 0).

  2. 2.

    The initial number of infected individuals and their age profile.

  3. 3.

    The contact rate after lockdown (c(t);t0)(c(t);\,t\geq 0).

  4. 4.

    The one-dimensional marginals of the life-cycle process (p(a,i);a0)(p(a,i);\,a\geq 0) for i𝒮i\in\mathcal{S}.

4.2 Parametrization of the model

Average infection measure.

Recall the definition of τ\tau and R0R_{0} from Section 2.1 and further define

a0,τ^(a)=τ(a)R0.\forall a\geq 0,\quad\hat{\tau}(a)=\frac{\tau(a)}{R_{0}}.

The total mass of τ\tau, R0R_{0}, corresponds to the mean number of secondary infections induced by a single infected individual if c1c\equiv 1. Thus R0R_{0} is the basic reproduction number at the start of the epidemic, when no control measure is enforced. In order to distinguish it from the reproduction number during lockdown, it will be denoted by RpreR_{\mathrm{pre}}. We leave it as a parameter to infer.

The function (τ^(a);a0)(\hat{\tau}(a);\,a\geq 0) is the density of a probability measure known as the generation time distribution [59, 9]. This distribution has been estimated shortly after the epidemic onset by several studies [21, 27, 10]. We use the estimation of [21], and assume that τ^\hat{\tau} is a Weibull distribution, that is

a0,τ^(a)=kλ(aλ)k1e(a/λ)k,\forall a\geq 0,\quad\hat{\tau}(a)=\frac{k}{\lambda}\Big{(}\frac{a}{\lambda}\Big{)}^{k-1}e^{-(a/\lambda)^{k}}, (18)

where the values of the shape parameter kk and scale parameter λ\lambda are recalled in Table 1.

Initial condition.

According to Theorem 2, the initial age structure of the population is

a0,n(0,a)=Wαeαa,\forall a\geq 0,\quad n(0,a)=W\alpha e^{-\alpha a},

where α\alpha is the Malthusian parameter of the epidemic prior to implementation of control measures, and WW is the number of infected individuals at t=0t=0, that is, at the lockdown onset. The parameter α\alpha corresponds to the exponential growth rate of any observable of the epidemic during this period. We chose to estimate it from the cumulative number of deaths, which appeared to be more reliable than the number of detected cases as the number of tests conducted in the early phase of the epidemic in France varied greatly. It was estimated using a linear regression on the logarithm of the number of deaths from March 7 to March 20 2020, and the corresponding basic reproduction number before lockdown, RpreR_{\mathrm{pre}}, was computed using the Euler-Lotka equation (9) assuming that the generation time distribution is given by (18). Both estimates are shown in Table 1.

Notation Description Value Source
α\alpha Pre-lockdown exponential growth rate 0.315 E
RpreR_{\mathrm{pre}} Basic reproduction number before lockdown 3.25 E
kk Shape parameter of the generation time 2.83 [21]
λ\lambda Scale parameter of the generation time 5.67 [21]
Table 1: Parameter values common to both models. In the “Source” column, “E” indicates that the parameter has been estimated in the present work.

Contact rate.

The contact rate (c(t);t0)(c(t);\,t\geq 0) accounts for the temporal variations in transmissions after the lockdown onset. As we focus on the period from March to May 2020 where no additional control measure has been enforced in France, we will assume that (c(t);t0)(c(t);\,t\geq 0) is constant and denote by c0c_{0} its value, that is, cc0c\equiv c_{0}. The reproduction number after the lockdown is denoted by Rpostc0RpreR_{\mathrm{post}}\coloneqq c_{0}R_{\mathrm{pre}}.

Life-cycle.

The last quantities that need to be defined are the one-dimensional marginals of the life-cycle process (X(a);a0)(X(a);\,a\geq 0). These could be directly estimated from hospital patient pathways as in [41, 58, 40]. However, when such data is not available they need to be estimated from individual counts in each compartment. In this case, we propose the following parametrization of the process (X(a);a0)(X(a);\,a\geq 0) based on Gamma-distributed sojourn times.

Let us denote by (Xn;n0)(X_{n};\,n\geq 0) the sequence of states visited by (X(a);a0)(X(a);\,a\geq 0). We assume that (Xn;n0)(X_{n};\,n\geq 0) is a Markov chain on 𝒮\mathcal{S}, and that it ends either in a “dead” or “recovered” state, that are assumed to be absorbing.

For i𝒮i\in\mathcal{S}, the sojourn time in ii is supposed to be Gamma-distributed with mean mim_{i} and variance mi/γm_{i}/\gamma, for some global dispersion parameter γ\gamma shared across all states. More precisely, let (Dn;n0)(D_{n};\,n\geq 0) denote the sequence of sojourn times of (X(a);a0)(X(a);\,a\geq 0), that is, DnD_{n} is the sojourn time in state XnX_{n}. We assume that conditional on (Xn;n0)(X_{n};\,n\geq 0), the variables (Dn;n0)(D_{n};\,n\geq 0) are independent. Moreover, if Xn=inX_{n}=i_{n}, then DnD_{n} follows a Gamma distribution with mean minm_{i_{n}} and variance min/γm_{i_{n}}/\gamma, that is,

DnγγminΓ(γmin)uγmin1eγudu.D_{n}\sim\frac{\gamma^{\gamma m_{i_{n}}}}{\Gamma(\gamma m_{i_{n}})}u^{\gamma m_{i_{n}}-1}e^{-\gamma u}\mathrm{d}u.

Thus, the one-dimensional marginals are parametrized by the transitions of a Markov chain (Xn;n0)(X_{n};\,n\geq 0) on 𝒮\mathcal{S}, as well as by one parameter mim_{i} for each i𝒮i\in\mathcal{S}, and a global parameter γ\gamma. Under this parametrization the one-dimensional marginals can be efficiently computed, while only requiring one parameter for the sojourn time in each state. Two concrete examples of Markov chains (Xn;n0)(X_{n};\,n\geq 0) are discussed in the next sections.

4.3 Inference with the admission model

The first model that we consider, the admission model, is a parsimonious model designed to obtain estimates of the reproduction number during lockdown, and of the number of infections in France in early March. It is illustrated in Figure 5. We fit it to the three “incidence” time series: the daily number of admissions in hospital and ICU, and the daily number of deaths.

Description of the model.

Upon infection, with probability 1phosp1-p_{\mathrm{hosp}}, an individual develops a mild form of COVID-19 and is placed in state II, which encompasses all cases that do not require a hospitalization. With probability phospp_{\mathrm{hosp}} the individual has a severe infection and is placed in state CC. Individuals in state CC are eventually hospitalized and moved to state HH. Then, with probability pICUp_{\mathrm{ICU}} individuals in state HH are admitted in ICU and moved to state UU. Otherwise they eventually recover and are discharged. Finally, individuals in state UU die with probability pdeathp_{\mathrm{death}}, or recover with probability 1pdeath1-p_{\mathrm{death}}. In this model, only individuals in ICU may die.

Refer to caption
Figure 5: Illustration of the admission model.

As we are fitting the number of individuals that enter a state, and not the number of individuals that are currently in that state, we only need to track the times THT_{H}, TUT_{U}, and TDT_{D} elapsed between infection and hospital admission, ICU admission, and death, respectively.

Inference results.

Estimations of phospp_{\mathrm{hosp}}, pICUp_{\mathrm{ICU}} and of the death probability conditional on hospitalization (equal in our setting to pICU×pdeathp_{\mathrm{ICU}}\times p_{\mathrm{death}}) in France have already been conducted in [50]. We used these estimates and considered the values of phospp_{\mathrm{hosp}}, pICUp_{\mathrm{ICU}}, and pdeathp_{\mathrm{death}} as fixed. All other parameters were estimated using a maximum likelihood procedure described in Section A.3. The parameter estimations are provided in Table 2, and the corresponding predicted values for the time series under consideration are displayed in Figure 1. Overall, our simple model seems to match the observed data. Note however that the model overestimates the number of ICU admissions in the second part of the lockdown. This is likely due to a temporal reduction in the ICU admission probability which has been reported in [50].

Our estimation of the basic reproduction number during the lockdown period is Rpost=0.745R_{\mathrm{post}}=0.745. This suggests that lockdown has reduced the basic reproduction number by a factor c0=0.23c_{0}=0.23 compared to the beginning of the epidemic. Moreover, we estimated that W=9.85×105W=9.85\times 10^{5} infections have occurred in France before March 17. Both these values are in line with previous estimates for France [53, 50].

We did not impose that TH<TUT_{H}<T_{U} in the inference procedure. Interestingly we found that the data are best explained by assuming that the mean of THT_{H} is 14.414.4 days, whereas the mean of TUT_{U} is 11.411.4 days. This indicates that the delay between infection and hospital admission is shorter for individuals that end up in ICU, compared to the average time between infection and hospitalization. Therefore it would be more appropriate to allow individuals to have an admission to hospital delay that is different depending on whether they will end up in ICU or not, modeling the fact that they have a more severe form of the disease. We estimated the mean of TDT_{D}, the time between infection and death, to be 18.618.6 days. This estimate is lower than but consistent with previous estimates based on the study of individual-case data [60, 41, 58].

Notation Description Value Source
RpostR_{\mathrm{post}} Reproduction number during lockdown 0.7450.745 E
WW Total number of infections before March 17 2020 9.85×1059.85\times 10^{5} E
phospp_{\mathrm{hosp}} Probability of being hospitalized 0.0360.036 [50]
pICUp_{\mathrm{ICU}} Probability of entering ICU conditional on being at the hospital 0.190.19 [50]
pICUpdeathp_{\mathrm{ICU}}\cdot p_{\mathrm{death}} Death probability conditional on being hospitalized 0.1810.181 [50]
THT_{H} Delay between infection and hospital admission 14.414.4 days E
TUT_{U} Delay between infection and ICU admission 11.411.4 days E
TDT_{D} Delay between infection and death 18.618.6 days E
γ\gamma Scale parameter common to all Gamma distributions 0.463 E
Table 2: Inferred parameter set for the admission model. The values indicated for the durations correspond to the means of the Gamma distributions. In the “Source” column, “E” indicates that the parameter has been estimated in the current work.

4.4 Inference with the occupancy model

We now consider a model aimed at providing predictions for the number of hospitalized individuals and ICU patients. The model is fitted to the three “incidence” time-series, and to three additional “prevalence” time-series: the number of occupied hospital and ICU beds, and the number of discharged hospital patients.

A first attempt to fit the prevalence curves could be to keep the admission model of Figure 5 and to estimate the time between hospital admission and discharge using the observed number of occupied ICU, hospital beds, and discharged patients. However this only yields a poor fit of the data (see Section A.4). We identified two main reasons for this discrepancy. First, we assumed that all individuals are admitted to ICU prior to death. Using the probability estimated in [50] then yields that the probability of dying conditional on being in ICU is 0.9530.953. This value is unrealistically high, and we need to assume that a fraction of hospital deaths occur without going through the ICU. Second, under the admission model, the delay between hospital admission and discharge is almost unimodal. However, the observed number of occupied hospital beds rises fast but falls slowly. Such a shape cannot be easily accounted for by a unimodal distribution for the time spent in hospital.

Description of the model.

Taking into account the previous two points required us to make the model more complex. The resulting model, referred to as the occupancy model, is illustrated in Figure 6. We now consider that upon infection, individuals go to one of three states depending on the severity of their infection:

  • The state CuC_{u} which gathers critical infections that lead to death or ICU admission. The probability of having a critical infection is denoted by pcritp_{\mathrm{crit}}.

  • The state ChC_{h} which corresponds to severe infections that require a hospitalization but are not critical. Such infections occur with probability psevp_{\mathrm{sev}}.

  • The II state which consists of all mild infections that do not lead to a hospital admission, and occur with probability 1pcritpsev1-p_{\mathrm{crit}}-p_{\mathrm{sev}}.

Individuals in state ChC_{h} are admitted to hospital after a duration DChD_{C_{h}}. Then, with probability pshortp_{\mathrm{short}} they are discharged after a duration DshortD_{\mathrm{short}}, while with probability 1pshort1-p_{\mathrm{short}} they are discharged after a duration DlongD_{\mathrm{long}}.

Critically infected individuals are admitted to hospital after a duration DCuD_{C_{u}}. Upon arrival at hospital, they die immediately with probability dhospd_{\mathrm{hosp}}, or go to ICU after a duration DHuD_{H_{u}}. Individuals in ICU die with probability dICUd_{\mathrm{ICU}} after a delay DDD_{D}. Otherwise they are discharged after a stay of length DUD_{U}.

Refer to caption
Figure 6: Illustration of the occupancy model

Inference results.

In our model, the probability of hospital admission is pcrit+psevp_{\mathrm{crit}}+p_{\mathrm{sev}}, the probability of ICU admission is pcrit(1dhosp)p_{\mathrm{crit}}(1-d_{\mathrm{hosp}}) and that of death is pcrit(dhosp+(1dhosp)dICU)p_{\mathrm{crit}}(d_{\mathrm{hosp}}+(1-d_{\mathrm{hosp}})d_{\mathrm{ICU}}). We have fixed these three values to those estimated in [50], and we only had one remaining parameter out of 4 (pcritp_{\mathrm{crit}}, psevp_{\mathrm{sev}}, dshortd_{\mathrm{short}}, dICUd_{\mathrm{ICU}}) to estimate from the data. We have fixed the time DUD_{U} to 1.51.5 days as estimated in [50]. All other parameters were estimated using a maximum likelihood method described in Section A.3. The estimated parameter set is shown in Table 3, while Figure 2 shows the best-fitting model.

The estimated parameters provide a good fit of the six observed time series. Again, the model has a tendency to overestimate the ICU admissions in the second part of the lockdown, which has the same interpretation as for the admission model.

Under the occupancy model, we estimated that Rpost=0.734R_{\mathrm{post}}=0.734, and W=9.52×105W=9.52\times 10^{5}. These estimates are extremely close to those made with the admission model. The estimated mean time between infection and death or hospital, ICU admission are respectively 19.519.5 days, 13.713.7 days and 12.512.5 days. Again we see that these estimates in the more complex model are consistent with those of the simple model. The mean recovery time from hospital is 19.419.4 days for severe infections, and 28.228.2 days for critical infections. This yields an overall mean recovery time of 20.020.0 days. Finally, we estimated that the death probability conditional on being in ICU is 0.7090.709. This yields that in our model a fraction 0.2560.256 of all deaths occur shortly after hospital admission. This result is consistent with [50] that estimated that a fraction 0.150.15 of all deaths occurred within the first day after hospital admission. However, it has been reported in [25] that the death probability of ICU patients is 0.230.23. Our estimated value is thus unrealistically high. This indicates that there is a fraction of hospital deaths that occur without any ICU admission, and not quickly after hospital admission, that our model is not accounting for.

Notation Description Value Source
RpostR_{\mathrm{post}} Reproduction number during lockdown 0.7340.734 E
WW Total number of infections before March 17 2020 9.52×1059.52\times 10^{5} E
pcrit+psevp_{\mathrm{crit}}+p_{\mathrm{sev}} Probability of being hospitalized 0.0360.036 [50]
pcrit(1dhosp)pcrit+psev\frac{p_{\mathrm{crit}}(1-d_{\mathrm{hosp}})}{p_{\mathrm{crit}}+p_{\mathrm{sev}}} Probability of entering ICU conditional on being at the hospital 0.190.19 [50]
dhosp+(1dhosp)dICU1+psev/pcrit\frac{d_{\mathrm{hosp}}+(1-d_{\mathrm{hosp}})d_{\mathrm{ICU}}}{1+p_{\mathrm{sev}}/p_{\mathrm{crit}}} Death probability conditional on being hospitalized 0.1810.181 [50]
dICUd_{\mathrm{ICU}} Probability of death conditional on being in ICU 0.7090.709 E
pshortp_{\mathrm{short}} Probability of a short stay at hospital 0.7010.701 E
DChD_{C_{h}} Delay between severe infection and hospital admission 14.514.5 days E
DshortD_{\mathrm{short}} Delay between hospital admission and quick discharge 7.367.36 days E
DlongD_{\mathrm{long}} Delay between hospital admission and slow discharge 47.547.5 days E
DCuD_{C_{u}} Delay between critical infection and hospital admission 11.011.0 days E
DHD_{H} Delay between hospital admission and ICU admission 1.51.5 days [50]
DUD_{U} Delay between ICU admission and discharge 28.228.2 days E
DDD_{D} Delay between ICU admission and death 9.909.90 days E
γ\gamma Scale parameter common to all Gamma distributions 0.3160.316 E
Table 3: Inferred parameter set for the occupancy model. The values indicated for the durations correspond to the means of the Gamma distributions. In the “Source” column, “E” indicates that the parameter has been estimated in the current work.

Our estimates, though they are not the key message of the present paper, can nevertheless draw attention to potential heterogeneities in the infected population. We estimated that the mean time between infection and ICU admission is shorter than that between infection and hospital admission. This suggests that the time between infection and severe symptom onset is shorter for critical infection, that lead to ICU admission, than for milder ones. Moreover, fitting the prevalence time series required to divide the hospital and death compartments in two subcompartments, indicating that the data are not well explained by a simple homogeneous model, as seen in Figure 7. Such heterogeneity could originate from underlying structuring variables, such as comorbidity or (actual) age, that we are not accounting for. Many estimates of clinical features, such as the incubation period, are obtained from a pooled dataset that does not take heterogeneity in the population into account [1, 41, 39, 56, 6, 42, 17]. When estimating the total number of infected individuals using only a fraction of the detected cases, e.g., using the hospital admissions or deaths, it is interesting to keep in mind that the time periods estimated from pooled studies could be inaccurate for the fraction of infected individuals under consideration.

Acknowledgements

The authors thank the Center for Interdisciplinary Research in Biology (CIRB) for funding and the taskforce MODCOV19 of the INSMI (CNRS) for their technical/scientific support during the epidemic. P.C. has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement PolyPath 844369.

References

  • [1] Jantien A. Backer, Don Klinkenberg, and Jacco Wallinga. Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance, 25(5), 2020.
  • [2] J. Baker, P. Chigansky, K. Hamza, and F. C. Klebaner. Persistence of small noise and random initial conditions. Advances in Applied Probability, 50(A):67–81, 2018.
  • [3] Frank Ball. A unified approach to the distribution of total size and total area under the trajectory of infectives in epidemic models. Advances in Applied Probability, 18:289–310, 1986.
  • [4] A. D. Barbour, P. Chigansky, and F. C. Klebaner. On the emergence of random initial conditions in fluid limits. Journal of Applied Probability, 53:1193–1205, 2016.
  • [5] Andrew D Barbour. The duration of the closed stochastic epidemic. Biometrika, 62:477–482, 1975.
  • [6] Qifang Bi, Yongsheng Wu, Shujiang Mei, Chenfei Ye, Xuan Zou, Zhen Zhang, Xiaojian Liu, Lan Wei, Shaun A. Truelove, Tong Zhang, Wei Gao, Cong Cheng, Xiujuan Tang, Xiaoliang Wu, Yu Wu, Binbin Sun, Suli Huang, Yu Sun, Juncen Zhang, Ting Ma, Justin Lessler, and Tiejian Feng. Epidemiology and transmission of covid-19 in 391 cases and 1286 of their close contacts in shenzhen, china: a retrospective cohort study. The Lancet Infectious Diseases, 20:911–919, 2020.
  • [7] Fred Brauer. The Kermack–McKendrick epidemic model revisited. Mathematical Biosciences, 198:119–131, 2005.
  • [8] Fred Brauer, Carlos Castillo-Chavez, and Zhilan Feng. Mathematical Models in Epidemiology. Texts in Applied Mathematics. Springer-Verlag New York, 2019.
  • [9] Tom Britton and Gianpaolo Scalia Tomba. Estimation in emerging epidemics: biases and remedies. Journal of The Royal Society Interface, 16:20180670, 2019.
  • [10] Danilo Cereda, Mattia Manica, Marcello Tirani, Francesca Rovida, Vittorio Demicheli, Marco Ajelli, Piero Poletti, Filippo Trentini, Giorgio Guzzetta, Valentina Marziano, Raffaella Piccarreta, Antonio Barone, Michele Magoni, Silvia Deandrea, Giulio Diurno, Massimo Lombardo, Marino Faccini, Angelo Pan, Raffaele Bruno, Elena Pariani, Giacomo Grasselli, Alessandra Piatti, Maria Gramegna, Fausto Baldanti, Alessia Melegaro, and Stefano Merler. The early phase of the COVID-19 epidemic in Lombardy, Italy. Epidemics, 37:100528, 2021.
  • [11] Dorothy H. Crawford. Deadly companions: how microbes shaped our history. Oxford University Press, second edition, 2018.
  • [12] Laura Di Domenico, Giulia Pullano, Chiara E. Sabbatini, Pierre-Yves Boëlle, and Vittoria Colizza. Impact of lockdown on COVID-19 epidemic in île-de-france and possible exit strategies. BMC Medicine, 18:240, 2020.
  • [13] Odo Diekmann. Limiting behaviour in an epidemic model. Nonlinear Analysis: Theory, Methods & Applications, 1:459–470, 1977.
  • [14] Odo Diekmann, Hans Heesterbeek, and Tom Britton. Mathematical Tools for Understanding Infectious Disease Dynamics. Princeton Series in Theoretical and Computational Biology. Princeton University Press, 2013.
  • [15] Odo Diekmann, J. A. P. Heesterbeek, and J. A. J. Metz. On the definition and the computation of the basic reproduction ratio R0R_{0} in models for infectious diseases in heterogeneous populations. Journal of Mathematical Biology, 28:365–382, 1990.
  • [16] Odo Diekmann, J.A.J. Metz, and J.A.P. Heesterbeek. The legacy of Kermack and McKendrick. In D. Mollison, editor, Epidemic Models: Their Structure and Relation to Data. Cambridge University Press, Cambridge, 1995.
  • [17] R. Djidjou-Demasse, Y. Michalakis, M. Choisy, M. T. Sofonea, and S. Alizon. Optimal covid-19 epidemic control until vaccine deployment. medRxiv, 2020.
  • [18] Jean-Jil Duchamps, Félix Foutel-Rodier, and Emmanuel Schertzer. General epidemiological models: Law of large numbers and contact tracing. 2021.
  • [19] Theodoros Evgeniou, Mathilde Fekom, Anton Ovchinnikov, Raphael Porcher, Camille Pouchol, and Nicolas Vayatis. Epidemic models for personalised COVID-19 isolation and exit policies using clinical risk predictions. Production and Operations Management, Special Issue on Managing Pandemics: A POM Perspective, 2021.
  • [20] Jie Yen Fan, Kais Hamza, Peter Jagers, and Fima Klebaner. Convergence of the age structure of general schemes of population processes. Bernoulli, 26:893–926, 2020.
  • [21] Luca Ferretti, Chris Wymant, Michelle Kendall, Lele Zhao, Anel Nurtay, Lucie Abeler-Dörner, Michael Parker, David Bonsall, and Christophe Fraser. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science, 368(6491), 2020.
  • [22] Régis Ferrière and Viet Chi Tran. Stochastic and deterministic models for age-structured populations with genetically variable traits. ESAIM: Proc., 27:289–310, 2009.
  • [23] Raphaël Forien, Guodong Pang, and Étienne Pardoux. Epidemic models with varying infectivity. SIAM Journal on Applied Mathematics, 81:1893–1930, 2021.
  • [24] Raphaël Forien, Guodong Pang, and Étienne Pardoux. Estimating the state of the COVID-19 epidemic in France using a model with memory. Royal Society Open Science, 8:202327, 2021.
  • [25] Santé Publique France. COVID-19 : point épidémiologique du 4 juin 2020, 2020.
  • [26] Santé Publique France. Données hospitalières relatives à l’épidémie de COVID-19, 2020.
  • [27] Tapiwa Ganyani, Cécile Kremer, Dongxuan Chen, Andrea Torneri, Christel Faes, Jacco Wallinga, and Niel Hens. Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Eurosurveillance, 25, 2020.
  • [28] Stéphane Gaubert, Marianne Akian, Xavier Allamigeon, Marin Boyet, Baptiste Colin, Théotime Grohens, Laurent Massoulié, David P. Parsons, Frédéric Adnet, Érick Chanzy, Laurent Goix, Frédéric Lapostolle, Éric Lecarpentier, Christophe Leroy, Thomas Loeb, Jean-Sébastien Marx, Caroline Télion, Laurent Tréluyer, and Pierre Carli. Understanding and monitoring the evolution of the Covid-19 epidemic from medical emergency calls: the example of the Paris area. Comptes Rendus. Mathématique, 358:843–875, 2020.
  • [29] Kais Hamza, Peter Jagers, and Fima C Klebaner. The age structure of population-dependent general branching processes in environments with a high carrying capacity. Proceedings of the Steklov Institute of Mathematics, 282:90–105, 2013.
  • [30] Hisashi Inaba. Age-Structured Population Dynamics in Demography and Epidemiology. Springer Singapore, Singapore, 2017.
  • [31] Peter Jagers. Branching processes with biological applications. Wiley, 1975.
  • [32] Peter Jagers and Fima C Klebaner. Population-size-dependent and age-dependent branching processes. Stochastic Processes and their Applications, 87(2):235–254, 2000.
  • [33] Peter Jagers and Fima C Klebaner. Population-size-dependent, age-structured branching processes linger around their carrying capacity. Journal of Applied Probability, 48:249–260, 2011.
  • [34] Peter Jagers and Olle Nerman. The growth and composition of branching populations. Advances in Applied Probability, 16:221–259, 1984.
  • [35] Peter Jagers and Olle Nerman. Limit theorems for sums determined by branching and other exponentially growing processes. Stochastic Processes and their Applications, 17:47–71, 1984.
  • [36] William O. Kermack and Anderson G. McKendrick. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character, 115(772):700–721, 1927.
  • [37] Mi-Young Kim. Long-time stability of numerical solutions to Gurtin–MacCamy equations by method of characteristics. Applied Mathematics and Computation, 176:552–562, 2006.
  • [38] H. Krauss. Zoonoses : infectious diseases transmissible from animals to humans. ASM Press, 2003.
  • [39] Stephen A Lauer, Kyra H Grantz, Qifang Bi, Forrest K Jones, Qulu Zheng, Hannah R Meredith, Andrew S Azman, Nicholas G Reich, and Justin Lessler. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Annals of internal medicine, 172:577–582, 2020.
  • [40] Noémie Lefrancq, Juliette Paireau, Nathanaël Hozé, Noémie Courtejoie, Yazdan Yazdanpanah, Lila Bouadma, Pierre-Yves Boëlle, Fanny Chereau, Henrik Salje, and Simon Cauchemez. Evolution of outcomes for patients hospitalised during the first 9 months of the SARS-CoV-2 pandemic in France: A retrospective national surveillance data analysis. The Lancet Regional Health - Europe, 5:100087, 2021.
  • [41] Natalie M. Linton, Tetsuro Kobayashi, Yichi Yang, Katsuma Hayashi, Andrei R. Akhmetzhanov, Sung-mok Jung, Baoyin Yuan, Ryo Kinoshita, and Hiroshi Nishiura. Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: A statistical analysis of publicly available case data. Journal of Clinical Medicine, page 538, 2020.
  • [42] Clément Massonnaud, Jonathan Roux, and Pascal Crépey. COVID-19: Forecasting short term hospital needs in France. medRxiv, 2020.
  • [43] J. A. J. Metz. The epidemic in a closed population with all susceptibles equally vulnerable; some results for large susceptible populations and small initial infections. Acta Biotheoretica, 27:75–123, 1978.
  • [44] J. A. J. Metz and Odo Diekmann. The Dynamics of Physiologically Structured Populations. Lecture Notes in Biomathematics. Springer, Berlin, Heidelberg, 1986.
  • [45] Olle Nerman. On the convergence of supercritical general (C-M-J) branching processes. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, 57:365–395, 1981.
  • [46] Guodong Pang and Etienne Pardoux. Functional limit theorems for non-Markovian epidemic models. 2020.
  • [47] Guodong Pang and Etienne Pardoux. Functional law of large numbers and pdes for epidemic models with infection-age dependent infectivity. 2021.
  • [48] Joannes Reddingius. Notes on the mathematical theory of epidemics. Acta Biotheoretica, 20:125–157, 1971.
  • [49] Lionel Roques, Etienne K Klein, Julien Papaix, Antoine Sar, and Samuel Soubeyrand. Using early data to estimate the actual infection fatality ratio from COVID-19 in France. Biology, 9:97, 2020.
  • [50] Henrik Salje, Cécile Tran Kiem, Noémie Lefrancq, Noémie Courtejoie, Paolo Bosetti, Juliette Paireau, Alessio Andronico, Nathanaël Hozé, Jehanne Richet, Claire-Lise Dubost, Yann Le Strat, Justin Lessler, Daniel Levy-Bruhl, Arnaud Fontanet, Lulla Opatowski, Pierre-Yves Boelle, and Simon Cauchemez. Estimating the burden of SARS-CoV-2 in France. Science, 369(6500):208–211, 2020.
  • [51] Emmanuel Schertzer and Florian Simatos. Height and contour processes of Crump-Mode-Jagers forests (I): general distribution and scaling limits in the case of short edges. Electronic Journal of Probability, 23:43pp., 2018.
  • [52] Thomas Sellke. On the asymptotic distribution of the size of a stochastic epidemic. Journal of Applied Probability, 20:390–394, 1983.
  • [53] Mircea T. Sofonea, Bastien Reyné, Baptiste Elie, Ramsès Djidjou-Demasse, Christian Selinger, Yannis Michalakis, and Samuel Alizon. Memory is key in capturing covid-19 epidemiological dynamics. Epidemics, 35:100459, 2021.
  • [54] Ziad Taïb. Branching Processes and Neutral Evolution, volume 93 of Lecture Notes in Biomathematics. Springer Berlin Heidelberg, Berlin, Heidelberg, 1992.
  • [55] Horst R. Thieme. Renewal theorems for some mathematical models in epidemiology. Journal of Integral Equations, 8(3):185–216, 1985.
  • [56] Lauren C Tindale, Jessica E Stockdale, Michelle Coombe, Emma S Garlock, Wing Yin Venus Lau, Manu Saraswat, Louxin Zhang, Dongxuan Chen, Jacco Wallinga, and Caroline Colijn. Evidence for transmission of covid-19 prior to symptom onset. eLife, 9:e57149, 2020.
  • [57] Viet Chi Tran. Large population limit and time behaviour of a stochastic particle model describing an age-structured population. ESAIM: Probability and Statistics, 12:345–386, 2008.
  • [58] Robert Verity, Lucy C. Okell, Ilaria Dorigatti, Peter Winskill, Charles Whittaker, Natsuko Imai, Gina Cuomo-Dannenburg, Hayley Thompson, Patrick G. T. Walker, Han Fu, Amy Dighe, Jamie T. Griffin, Marc Baguelin, Sangeeta Bhatia, Adhiratha Boonyasiri, Anne Cori, Zulma Cucunubá, Rich FitzJohn, Katy Gaythorpe, Will Green, Arran Hamlet, Wes Hinsley, Daniel Laydon, Gemma Nedjati-Gilani, Steven Riley, Sabine van Elsland, Erik Volz, Haowei Wang, Yuanrong Wang, Xiaoyue Xi, Christl A. Donnelly, Azra C. Ghani, and Neil M. Ferguson. Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet Infectious Diseases, 20:669–677, 2020.
  • [59] Jacco Wallinga and Marc Lipsitch. How generation intervals shape the relationship between growth rates and reproductive numbers. Proceedings of the Royal Society B: Biological Sciences, 274:599–604, 2007.
  • [60] Joseph T. Wu, Kathy Leung, Mary Bushman, Nishant Kishore, Rene Niehus, Pablo M. de Salazar, Benjamin J. Cowling, Marc Lipsitch, and Gabriel M. Leung. Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China. Nature Medicine, 26:506–510, 2020.

Appendix A Appendix

A.1 Numerical simulation of the PDE

Computing the likelihood of our model requires to obtain an expression for the solution (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) of the PDE (5). This equation was solved numerically using a backward difference scheme based on the method of characteristics [37].

For h>0h>0, we approximate the value of (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) on the lattice {(ih,kh);iT,kA}\{(ih,kh);\,i\leq T^{*},k\leq A^{*}\} by the array (u(k,i);k,i)(u(k,i);\,k,i) defined as follows:

kT1,iA1,u(k+1,i+1)=u(k,i)\displaystyle\forall k\leq T^{*}-1,\;i\leq A^{*}-1,\quad u(k+1,i+1)=u(k,i)
iA,u(0,i)=x0g(ih)\displaystyle\forall i\leq A^{*},\quad u(0,i)=x_{0}g(ih)
kT1,u(k+1,0)=hi=1Aτ(ih)u(k+1,i).\displaystyle\forall k\leq T^{*}-1,\quad u(k+1,0)=h\sum_{i=1}^{A^{*}}\tau(ih)u(k+1,i).

Note that individuals with age larger than hAhA^{*} are discarded. This maximal age was chosen so that individuals with age greater than hAhA^{*} have negligible infection rate, and belong to the dead or recovered compartment with high probability.

A.2 Expected number of entrances in a state

Theorem 2 provides an expression for the number of individuals ni(t)n_{i}(t) in compartment ii at time tt. As we want to fit “incidence” data, that is, the number of individuals that enter a given compartment, we need to derive an expression for this quantity. To this aim, for two compartments i,j𝒮i,j\in\mathcal{S}, let us write iji\preceq j if an individual must visit ii before it gets to jj. For instance, in the admission model we have CHIDC\preceq H\preceq I\preceq D. Then, the number of entrances ei(t,t+s)e_{i}(t,t+s) in ii between tt and t+st+s is given by

ei(t,t+s)=ni(t+s)ni(t)+jijinj(t+s)nj(t).\displaystyle e_{i}(t,t+s)=n_{i}(t+s)-n_{i}(t)+\sum_{\begin{subarray}{c}j\succeq i\\ j\neq i\end{subarray}}n_{j}(t+s)-n_{j}(t). (19)

The last term in the previous sum corresponds to the number of individuals who leave compartment ii during [t,t+s][t,t+s]. Expression (19) can be readily used to derive the expected number of entrance in ii from (n(t,a);t,a0)(n(t,a);\,t,a\geq 0) and (p(a,j);a0)(p(a,j);\,a\geq 0).

It is interesting to note that (19) only depends on the distribution of the entrance time TiT_{i} of (X(a);a0)(X(a);\,a\geq 0) in ii, defined as:

Tiinf{a0:X(a)=i}T_{i}\coloneqq\inf\{a\geq 0:X(a)=i\}

with the convention inf=\inf\emptyset=\infty. To see this, one can write

ei(t,t+s)\displaystyle e_{i}(t,t+s) =0(n(t+s,a)n(t,a))(jipj(a))da\displaystyle=\int_{0}^{\infty}\big{(}n(t+s,a)-n(t,a)\big{)}\Big{(}\sum_{j\succeq i}p_{j}(a)\Big{)}\mathrm{d}a
=0(n(t+s,a)n(t,a))(Tia)da.\displaystyle=\int_{0}^{\infty}\big{(}n(t+s,a)-n(t,a)\big{)}\mathbb{P}(T_{i}\leq a)\mathrm{d}a.

From an inference perspective, this is quite convenient since computing ei(t,t+s)e_{i}(t,t+s) only requires to infer the distribution of TiT_{i}.

A.3 Likelihood computation

The daily incidence and prevalence data for France between March 18 and May 11 were taken from [26]. The days during this time period are indexed by {1,2,,tmax}\{1,2,\dots,t_{\max}\}, where day 11 is March 18 and day tmaxt_{\max} is May 11.

For i{H,U,D}i\in\{H,U,D\}, we denote by eiobs(t)e^{\mathrm{obs}}_{i}(t) the reported number of admissions to hospital, ICU, or the number of deaths on day tt, respectively. Moreover, for i{H,U,R}i\in\{H,U,R\}, we denote by niobs(t)n_{i}^{\mathrm{obs}}(t) the reported number of occupied beds in hospital, ICU, or the number of discharged patients on day tt, respectively. Let us further denote by

π(k;λ)=eλλkk!\pi(k;\lambda)=e^{-\lambda}\frac{\lambda^{k}}{k!}

the probability mass function of a Poisson distribution with parameter λ\lambda.

Then, the likelihood of a parameter set θ\theta under the admission model is given by

Lad(θ)=i{H,U,D}k=1tmaxπ(eiobs(tk);ei(tk1,tk)).L_{\mathrm{ad}}(\theta)=\prod_{i\in\{H,U,D\}}\prod_{k=1}^{t_{\max}}\pi\big{(}e^{\mathrm{obs}}_{i}(t_{k});e_{i}(t_{k-1},t_{k})\big{)}.

The expected number of entrances in state ii, ei(tk1,tk)e_{i}(t_{k-1},t_{k}), is computed using (19) with the one-dimensional marginals of the admission model and the numerical approximation of (5) described in Section A.1.

For the occupancy model, the likelihood of the parameter set θ\theta is given by

Loc(θ)=i{H,U,D}k=1tmaxπ(eiobs(tk);ei(tk1,tk))×i{H,U,R}k=1tmaxπ(niobs(tk);ni(tk)).L_{\mathrm{oc}}(\theta)=\prod_{i\in\{H,U,D\}}\prod_{k=1}^{t_{\max}}\pi\big{(}e^{\mathrm{obs}}_{i}(t_{k});e_{i}(t_{k-1},t_{k})\big{)}\times\prod_{i\in\{H,U,R\}}\prod_{k=1}^{t_{\max}}\pi\big{(}n^{\mathrm{obs}}_{i}(t_{k});n_{i}(t_{k})\big{)}. (20)

Again, the value of ei(tk1,tk)e_{i}(t_{k-1},t_{k}) is computed using (19) and that of ni(tk)n_{i}(t_{k}) using (17), but using the one-dimensional marginals of the occupancy model. Note that under this more complex model, there are two pathways to hospital (critical and severe infection), two pathways to death (with or without hospital admission), and three pathways to hospital discharge (fast discharge, slow discharge, or discharge of ICU patients). The predicted values for the number of individuals in each of these compartments is obtained by summing over all pathways leading to the corresponding state.

For both models, we looked for the parameter set θ\theta maximizing the likelihood. It was obtained using the minimize function of the Python scipy.optimize module, using a Nelder-Mead algorithm. We selected as initial point of the optimization algorithm a set of parameters that were close to the existing estimates in the literature, or which seemed realistic if such estimates did not exist.

A.4 Best fitting prevalence curves under admission model

Recall the admission model from Section 4.3. By adding two parameters to the model, one for the mean time between hospital admission and discharge, the other for the mean time between ICU admission and discharge, we can derive an expression for the likelihood of the prevalence and incidence time series under the admission model, similar to (20). The best-fitting values for these two parameters were obtained by maximizing the likelihood with all other parameters values fixed to those estimated in Table 2. The corresponding model is displayed in Figure 7.

Refer to caption
Figure 7: Best fit of the admission model for prevalence data.