This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A time-modulated Hawkes process to model the spread of COVID-19 and the impact of countermeasures111The simulation code and data used in this work are available under GPL v3 on GitHub: https://github.com/michelegaretto/covid.git

Michele Garetto michele.garetto@unito.it Emilio Leonardi emilio.leonardi@polito.it Giovanni Luca Torrisi giovanniluca.torrisi@cnr.it Università degli Studi di Torino, C.so Svizzera 185, Torino, Italy Politecnico di Torino, C.so Duca degli Abruzzi 24, Torino, Italy IAC-CNR, Via dei Taurini 19, Roma, Italy
Abstract

Motivated by the recent outbreak of coronavirus (COVID-19), we propose a stochastic model of epidemic temporal growth and mitigation based on a time-modulated Hawkes process. The model is sufficiently rich to incorporate specific characteristics of the novel coronavirus, to capture the impact of undetected, asymptomatic and super-diffusive individuals, and especially to take into account time-varying counter-measures and detection efforts. Yet, it is simple enough to allow scalable and efficient computation of the temporal evolution of the epidemic, and exploration of what-if scenarios. Compared to traditional compartmental models, our approach allows a more faithful description of virus specific features, such as distributions for the time spent in stages, which is crucial when the time-scale of control (e.g., mobility restrictions) is comparable to the lifetime of a single infection. We apply the model to the first and second wave of COVID-19 in Italy, shedding light onto several effects related to mobility restrictions introduced by the government, and to the effectiveness of contact tracing and mass testing performed by the national health service.

keywords:
COVID-19 , Compartmental models, Branching process , Hawkes process
journal: Annual Reviews in Control

1 Introduction and related work

Models of epidemic propagation are especially useful when they provide key insights while retaining simplicity and generality. For example, in mathematical epidemiology the SIR model reveals in very simple terms the fundamental role of the basic reproduction number (R0R_{0}) which governs the macroscopic, long-term evolution of the outbreak in a homogeneous population. The vast majority of models developed for the novel SARS-CoV-2 are extensions to the classic SIR, along the standard approach of introducing additional compartments to describe different phases of the infection, the presence of asymptomatic, symptomatic or pauci-symptomatic individuals, the set of quarantined, hospitalized people, and so on. Such models lead to a system of coupled ODE’s with fixed or time-varying coefficients to be estimated from traces. A very incomplete list of modeling efforts pursued in this direction, early applied to COVID-19, includes the SEIR models in [1, 2, 3, 4, 5], the SIRD models in [6, 7], the SEPIA model in [8], the SIDHARTE model in [9].

In this paper we develop a slightly more sophisticated model that allows a more accurate representation of native characteristics of a specific virus, such as actual distributions of the duration of incubation, pre-symptomatic and symptomatic phases, for various categories of infected. This level of detail is important when the intensity of applied countermeasures varies significantly over time-scales comparable to that of an individual infection, and we believe it is essential to address fundamental questions such as: i) when and to what extent can we expect to see the effect of specific mobility restrictions introduced by a national government at a given point in time? ii) what is the impact of hard vs partial lockdowns enforced for given numbers of days? when can restrictions be safely released to restart economic and social activities while still keeping the epidemic under control?

Specifically, we propose and analyse a novel, modulated version of the marked Hawkes process, a self-exciting stochastic point process with roots in geophysics and finance [10]. In a nutshell, in the standard marked Hawkes process each event ii with mark mim_{i}, occurring at time tit_{i}, generates new events with stochastic intensity ν(tti,mi)\nu(t-t_{i},m_{i}), where ν(,mi)\nu(\cdot,m_{i}) is a generic kernel. The process unfolds through successive generations of events, starting from so-called immigrants (generation zero). In our model events represent individual infections, and a modulating function μ(t)\mu(t), which scales the overall intensity of the process at real time tt, allows us to take into account the impact of time-varying mobility restrictions and social distance limitations. In addition, we model the transition of infected, undetected individuals to a quarantined state at inhomogeneous rate ρ(t)\rho(t), to describe the time-varying effectiveness of contact tracing and mass testing.

We mention that branching processes of various kind, including Hawkes processes, have been proposed in various biological contexts [11, 12, 13, 14]. In [15], a probabilistic extension of (deterministic) discrete-time SEIR models, based on multi-type branching processes, has been recently applied to COVID-19 to capture the impact of detailed distributions of the time spent in different phases, together with mobility restrictions and contact tracing. In parallel to us, authors of [16] have proposed a Hawkes process with spatio-temporal “covariates” for modeling COVID-19 in the US, together with an EM algorithm for parameter inference. The work in [16] is somehow orthogonal to us, since they focus on spatial and demographic features, aiming at predicting the trend of confirmed cases and deaths in each county. In contrast, we introduce marks and stages to natively model the course of an infection for different categories of individuals, with the fundamental distinction between real and detected cases. Moreover, we take into account the impact of time-varying detection efforts and contact tracing. At last, we obtain analytical expressions for the first two moments of the number of individuals who have been infected within a given time.

We demonstrate the applicability of our model to the novel COVID-19 pandemic by considering real traces related to the first and second wave of coronavirus in Italy. Our fitting exercise, though largely preliminary and based on incomplete information, suggests that our approach has good potential and can be effectively used both for planning counter-measures and to provide an a-posteriori explanation of observed epidemiological curves.

The paper is organized as follows. In Section 2 we provide the mathematical formulation of the proposed modulated Hawkes process to describe the temporal evolution of the epidemic. In Section 3 we motivate our approach by comparing it to the standard SIR model. Some mathematical results related to the moment generating function of our process are presented in Section 4. In Section 5 we describe our COVID-19 model based on the proposed approach. We separately fit our model to the first and second wave of COVID-19 in Italy in Sections 6 and 7, respectively, offering hopefully interesting insights about what happened in this country (and similarly in other European countries) during the recent pandemic. We conclude in Section 8.

2 Mathematical formulation of modulated Hawkes process

We first briefly recall the classic Hawkes process restricted to the temporal dimension (the spatio-temporal formulation is similar, but we focus in this work on the purely temporal version). Events of the process (or points) occur at times T={Ti}i1T=\{T_{i}\}_{i\geq 1}, which are \mathbb{R}-valued random variables. A subset II of these points, called immigrants, are produced by an inhomogeneous Poisson process of given intensity σ(t)\sigma(t). Each immigrant, independently of others, is the originator of a progeny (or cluster) of other points, dispersed in the future through a self-similar branching structure: a first generation of points is produced with intensity ν(tTj)\nu(t-T_{j}), where TjT_{j} is the occurrence time of immigrant jj, and ν:++\nu:\mathbb{R}^{+}\rightarrow\mathbb{R}^{+} is a kernel function. Each point of the first generation, in turn, generates new offsprings in a similar fashion, creating the second generation of points, and so on.

The above process can be easily extended to account for different types of points with type-specific kernel functions. Types are denoted by marks M={Mi}i1M=\{M_{i}\}_{i\geq 1}, which are assumed to be i.i.d. random variables with values on an arbitrary measurable space (M,)(\mathrm{M},\mathcal{M}), with a probability distribution \mathbb{Q}. Let N(t,m)N(t,m) be the counting process associated to the marked points N={(Ti,Mi)}i1N=\{(T_{i},M_{i})\}_{i\geq 1}. The (conditional) stochastic intensity λ(t)\lambda(t) of the overall process is then given by:

λ(t)=σ(t)+0tν(ts,m)N(ds,dm)=σ(t)+Tk(0,t)ν(tTk,Mk)\displaystyle\lambda(t)=\sigma(t)+\int_{0}^{t}\nu(t-s,m)N({\rm\,d}s,{\rm\,d}m)=\sigma(t)+\sum_{{T_{k}}\cap(0,t)}\nu(t-T_{k},M_{k})

where ν:+×M+\nu:\mathbb{R}^{+}\times\mathrm{M}\rightarrow\mathbb{R}^{+} is a type-dependent kernel function. In the following we assume ν¯(t):=𝔼[ν(t,M1)]\overline{\nu}(t):=\mathbb{E}[\nu(t,M_{1})] to be summable,. i.e.

R0=0ν¯(t)dt<R_{0}=\int_{0}^{\infty}\overline{\nu}(t)\mathrm{d}t<\infty (1)

Note that R0R_{0} is the average number of offsprings generated by each point, which is usually referred to as basic reproduction number in epidemiology. The process is called subcritical if R0<1R_{0}<1, supercritical if R0>1R_{0}>1.

Our main modification to the above classic marked Hawkes process NN is to modulate the instantaneous generation rate of offsprings N=NIN^{\prime}=N\setminus I by a positive, bounded function μ(t):++\mu(t):\mathbb{R}^{+}\rightarrow\mathbb{R}^{+}, representing the impact of mobility restriction countermeasures. By so doing, we obtain the modified stochastic intensity of the process:

λ(t)\displaystyle\lambda(t) =σ(t)+μ(t)[0tν(ts,m)N(ds,dm)]\displaystyle=\sigma(t)+\mu(t)\left[\int_{0}^{t}\nu(t-s,m)N({\rm\,d}s,{\rm\,d}m)\right]
=σ(t)+μ(t)[Tk(0,t)ν(tTk,Mk)]\displaystyle=\sigma(t)+\mu(t)\left[\sum_{{T_{k}}\cap(0,t)}\nu(t-T_{k},M_{k})\right] (2)

Note that, when μ(t)=μ\mu(t)=\mu is constant, we re-obtain a classic Hawkes process with modified kernel μν()\mu\,\nu(). In general, the obtained process is no longer self-similar. In particular, the average number of offsprings generated by a point becomes a function of time:

R(t)=0μ(t+τ)ν¯(τ)dτR(t)=\int_{0}^{\infty}\mu(t+\tau)\overline{\nu}(\tau){\rm\,d}\tau (3)

which provides the infamous real-time reproduction number usually referred to on the media as ‘Rt index’.

We emphasize that the process can initially start in the supercritical regime, and then it can become subcritical for effect of a decreasing function μ()\mu(), a case of special interest in our application to waves of COVID-19. We mention that the great bulk of literature related to the Hawkes process and its applications to geophysics and finance focuses on the subcritical regime, whereas our formulation applies also to the supercritical regime, which is more germane to epidemics.

The conditional intensity (2) can be easily de-conditioned with respect to NN, obtaining the ‘average’ stochastic intensity λ¯(t):=𝔼[λ(t)]\overline{\lambda}(t):=\mathbb{E}[\lambda(t)]:

λ¯(t)\displaystyle\overline{\lambda}(t) =σ(t)+μ(t)𝔼[(0,t)×Mν(ts,m)N(ds,dm)]\displaystyle=\sigma(t)+\mu(t)\mathbb{E}\left[\int_{(0,t)\times\mathrm{M}}\nu(t-s,m)N(\mathrm{d}s,\mathrm{d}m)\right]
=σ(t)+μ(t)𝔼[(0,t)×Mν(ts,m)λ(s)ds(dm)]\displaystyle=\sigma(t)+\mu(t)\mathbb{E}\left[\int_{(0,t)\times\mathrm{M}}\nu(t-s,m)\lambda(s)\,\mathrm{d}s\mathbb{Q}(\mathrm{d}m)\right]
=σ(t)+μ(t)𝔼[0tν¯(ts)λ(s)ds]\displaystyle=\sigma(t)+\mu(t)\mathbb{E}\left[\int_{0}^{t}\overline{\nu}(t-s)\lambda(s)\mathrm{d}s\right]
=σ(t)+μ(t)0tν¯(ts)λ¯(s)ds.\displaystyle=\sigma(t)+\mu(t)\int_{0}^{t}\overline{\nu}(t-s)\overline{\lambda}(s)\mathrm{d}s. (4)

where we recall that ν¯(t)=𝔼[ν(t,M1)]\overline{\nu}(t)=\mathbb{E}[\nu(t,M_{1})].

We observe that (2) is a linear Volterra equation of the second kind, which can be efficiently solved numerically.222For example, the standard trapezoidal rule allows obtaining a discretized version of λ¯(t)\overline{\lambda}(t) by a matrix inversion. In the special case of constant modulating function, (2) reduces to a convolution equation which can be analyzed and solved by means of Laplace transform techniques (see A).

At last we introduce the total number of points up to time tt, N(t)N(t) (regardless of their associated marks), and its average:

N¯(t)=0tλ¯(τ)dτ\overline{N}(t)=\int_{0}^{t}\overline{\lambda}(\tau){\rm\,d}\tau (5)

3 Comparison with SIR model

When the kernel function has an exponential shape, i.e., ν(t)=Keγt\nu(t)=Ke^{-\gamma t}, it is possible to establish a simple connection between the Hawkes process and the classic SIR model [17]. Specifically, consider a stochastic SIR model with an infinite population of susceptible individuals, where each infected generates new infections at rate β\beta, and recovers at rate γ\gamma (i.e., after an exponentially distributed amount of time of mean 1/γ1/\gamma). Then the average intensity of the process generated by this stochastic SIR, averaging out the times at which nodes recover, has exactly the (conditional) intensity of a Hawkes process starting with the same number of initially infected nodes, no further immigrants, and ν(t)=βeγt\nu(t)=\beta e^{-\gamma t} [17]. This can be intuitively understood by considering that in SIR the average, effective rate with which an infected individual generates new infections after time tt since it became infected equals β\beta times the probability that the node has not yet recovered, which is eγte^{-\gamma t}.

We remark that in [17] authors push this equivalence a bit further, by showing that a SIR model with finite population N0N_{0} is equivalent to a modulated Hawkes process similar to (2), where μ(t)=1N(t)/N0\mu(t)=1-N(t)/N_{0}. In our model and its application to COVID-19, we do not consider the impact of finite population size, and assume an infinite population of susceptible individuals333This assumption is largely acceptable at the beginning of an epidemic.. Therefore, we do not need the ‘correction factor’ 1N(t)/N01-N(t)/N_{0}, and instead use the modulating function μ(t)\mu(t) to model the (process-independent) effect of mobility restrictions. A similar effect due to mobility restrictions can be incorporated into the SIR model, by applying factor μ(t)\mu(t) to the infection rate β\beta.

Given the above connection between a (possibly modulated) Hawkes process and the traditional SIR, one might ask what is the benefit of our approach with respect to SIR-like models. In contrast to SIR, the Hawkes process allows us to choose an arbitrary kernel ν(t)\nu(t), not necessarily exponential. With this freedom, can we observe dynamics significantly different from those that can be obtained by a properly chosen exponential shape? We answer this question in the positive with the help of an illustrative scenario.

Consider an epidemic starting at time zero with I0=1000I_{0}=1000 infected individuals.444Note that in our model the number of immigrants has a Poisson distribution with mean 0σ(t)dt\int_{0}^{\infty}\sigma(t){\rm\,d}t. However to reduce the variance we have preferred to start simulations with the same deterministic number of infected. Further, note that analytical results for the mean trajectory of infected nodes are not affected by this choice. We fix to g=10g=10 (days) the average generation time, which is the mean temporal separation between a new infection belonging to generation ii and its parent in generation i1i-1. Note that the above constraint implies that 0tν(t)dt=g\int_{0}^{\infty}t\,\nu(t){\rm\,d}t=g.

We also normalize 0ν(t)dt=1\int_{0}^{\infty}\nu(t){\rm\,d}t=1, since we can use μ(t)\mu(t) to scale the infection rate, in addition to considering time-varying effects due to mobility restriction. Note that in SIR we can only satisfy the above constraints by choosing the exponential kernel ν(t)=1getg\nu(t)=\frac{1}{g}e^{-\frac{t}{g}}, t>0t>0. Instead, in our Hawkes model we have much more freedom.

Refer to caption
Figure 1: Evolution of mean number of infected for different kernel shapes, in the case of constant μ=1.2\mu=1.2 (supercritical) or constant μ=0.9\mu=0.9 (subcritical), with I0=1000I_{0}=1000, g=10g=10.

First, we consider a scenario in which μ(t)\mu(t) is constant, equal to either 1.2 (supercritical case) or 0.9 (subcritical case). In Fig. 1 we show the temporal evolution of the mean number of infected N¯(t)\overline{N}(t), for three kernel shapes that allows for an explicit solution of (2),(5) using Laplace transform (A). The delta shape corresponds to the kernel function ν¯(t)=δ(tg)\overline{\nu}(t)=\delta(t-g), where δ()\delta(\cdot) is Dirac’s delta function, for which

N¯delta(t)=I0μt/g+11μ1\overline{N}_{\rm delta}(t)=I_{0}\frac{\mu^{\lfloor t/g\rfloor+1}-1}{\mu-1} (6)

The exp shape corresponds to the exponential kernel ν¯(t)=1getg\overline{\nu}(t)=\frac{1}{g}e^{-\frac{t}{g}} (SIR model), for which

N¯exp(t)=I0[1+μμ1(et(μ1)/g1)]\overline{N}_{\rm exp}(t)=I_{0}\left[1+\frac{\mu}{\mu-1}\left(e^{t(\mu-1)/g}-1\right)\right] (7)

The hyper shape corresponds to the hyper-exponential kernel:
ν¯(t)=p1α1eα1t+p2α2eα2t\overline{\nu}(t)=p_{1}\alpha_{1}e^{-\alpha_{1}t}+p_{2}\alpha_{2}e^{-\alpha_{2}t}, where 0p110\leq p_{1}\leq 1, p2=1p1p_{2}=1-p_{1}, p1/α1+p2/α2=gp_{1}/\alpha_{1}+p_{2}/\alpha_{2}=g, which also permits obtaining an explicit, though lengthy expression of N¯(t)\overline{N}(t) that we omit here. Specifically, the ‘hyper’ curve on Fig. 1 corresponds to the case 1/α1=31/\alpha_{1}=3, 1/α2=301/\alpha_{2}=30.

We observe that, in the subcritical case (μ=0.9\mu=0.9), the mean number of infected saturates to the same value, irrespective of the kernel shape; this can be explained by the fact that the final size of the epidemics is described by the same branching process for all kernel shapes (i.e., a branching process in which the offspring distribution is Poisson with mean 0.9). In the supercritical case (μ=1.2\mu=1.2), instead, the mean number of infected grows exponentially as Θ(eηt)\Theta(e^{\eta t}) (as tt grows large), where, interestingly, η>1\eta>1 depends on the particular shape (notice the log yy axes on Fig. 1). In particular, the hyper-exponential kernel can produce arbitrarily large η\eta (A). We conclude that, even when we fix the average generation time gg, different kernels can produce largely different (in order sense) evolutions of N¯(t)\overline{N}(t). Note that, by introducing compartments, SIR-like models can match higher-order moments of the generation time, but our results suggest that N¯(t)\overline{N}(t) depends on all moments of it, i.e., on the precise shape of the kernel. Moreover, recall that some shapes are difficult to approximate by a phase-type approximation (e.g., the rectangular shape, or more in general, kernels with finite support).

The strong impact of the specific kernel shape becomes even more evident when we consider a time-varying μ(t)\mu(t), as in our modulated Hawkes process. As an example, consider the COVID-inspired scenario in which the modulating function μ(t)\mu(t) corresponds to the black curve in Fig. 2: during the first 30 days, μ(t)\mu(t) decreases linearly from 3 to 0.3; it stays constant at 0.3 for the next 60 days; and it goes back linearly to 3 during the next 30 days, after which it stays constant at 33.

Refer to caption
Figure 2: Evolution of mean number of infected (left y axes) in the COVID-like scenario, for different kernel shapes. Shaded areas denote 95%95\%-level confidence intervals obtained by 100 simulation runs. The right y axes refers to modulating function μ(t)\mu(t) (black curve).

In Fig. 2 we show the mean number of infected estimated by simulation (averaging 100 independent runs), for the three kernel shapes exp, delta, and uniform, where uniform corresponds to the kernel ν¯(t)=12g\overline{\nu}(t)=\frac{1}{2g}, t(0,2g)t\in(0,2g), while the shapes exp, delta have been already introduced above. Shaded areas around each curve denote 95%95\%-level confidence intervals.

We observe huge discrepancies among the trajectories of N¯(t)\overline{N}(t) obtained under the three kernels. In particular, after the first ‘wave’ of 30 days, the exp kernel produces about four times more infections than those produced by the delta kernel. This can be explained by the fact that μ(t)\mu(t) varies significantly on a time window (30 days) comparable to the average generation time (10 days).

It is also interesting to compare what happens on the second wave starting at day 90, after all 3 curves have settled down to an almost constant value: now discrepancies are even more dramatic: although we observe a very fast resurgence of the epidemic in all cases, this happens with significant delays from one curve to another. This is due to the fact that the number of individuals who are still infectious after the subcritical period (when μ<1\mu<1) is largely different, especially considering that both the uniform and delta kernels have finite support (20 and 10 days, respectively) much smaller than the duration of the subcritical period, whereas the exp shape has infinite support: this implies that under the uniform and delta kernels the virus survives the subcritical period only through chains of infections belonging to successive generations, whereas under the exp shape in principle the epidemic can restart just thanks to the original immigrants at time 0, who are still weakly infectious when we re-enter the supercritical regime (around day 100). Actually, in our experiment, under the delta kernel the epidemic died out in 68 out of 100 simulation runs, which explains the large confidence intervals obtained in this case at end of the observation window. Under the uniform kernel, the epidemic died only in 3 out of 100 runs, while it always survived under the exp kernel.

We conclude that, even while fixing the mean generation time, the precise shape of the kernel function can play an important role in predicting the process dynamics and the impact of countermeasures, especially when the time-scale of control is comparable to the time-scale of an individual contagion. One might obtain a good fitting with measured data also by using an exponential shape, and a properly chosen modulating function μ(t)\mu(t), but this is undesirable, since the required μ(t)\mu(t) would no longer reflect the actual evolution of mobility and interpersonal contact restrictions. In the case of COVID-19, several researchers have actually attempted to incorporate into analytical models detailed information about people mobility, using for example data provided by cellular network operators or smartphone apps [18, 19].

4 Moment generating function

Our modulated Hawkes process is stochastic in nature, hence it is important to characterize how realizations of the process are concentrated around the mean trajectory derived in Sec. 2. This characterization is instrumental, for example, in designing simulation campaigns with proper number of runs. Moreover, note that it is entirely possible that the epidemic gets extinct at its early stages, or in between two successive waves, as we have seen in the scenario in Fig. 2. Actually, an epidemic could die out even when starting in the supercritical regime (consider the case of a single immigrant at time tt, who does not generate any offspring with probability eR(t)e^{-R(t)}), something that is not captured by deterministic mean-field approaches. Therefore, it is interesting to understand the variability of the process at any time.

Under mild assumptions an expression for the moment generating function of the number of points in [0,t)[0,t) can be given, and an iterative procedure can be applied to compute the moment of any order nn in terms of moments of smaller order, though with increasing combinatorial complexity.

In this section we limit ourselves to reporting the main results on this moments’ characterization, without their mathematical proofs, to keep the paper focused on the application to COVID-19 and its control measures.

Hereon, we shall assume

K1:=sup(t,m)(0,)×Mν(t,m)(0,),K2:=supt(0,)μ(t)(0,).K_{1}:=\sup_{(t,m)\in(0,\infty)\times\mathrm{M}}\nu(t,m)\in(0,\infty),\quad K_{2}:=\sup_{t\in(0,\infty)}\mu(t)\in(0,\infty). (8)

Also, we fix a horizon τ(0,)\tau\in(0,\infty), and, for t(0,τ)t\in(0,\tau), we denote with St,τS_{t,\tau} the number of points, up to time τ\tau, in the cluster generated by an immigrant at tt (including the immigrant). We denote with |||\cdot| the modulus of a complex number.

We are interested in N(τ)N(\tau), i.e., the total number of points generated up to time τ\tau, irrespective of their mark.

Theorem 4.1 (Moment generating function of N(τ)N(\tau))

Assume (1) and (8). Then there exists θc>0\theta_{c}>0 such that, for any zΘcz\in\Theta_{c},

Θc:={z:Rez<θc},\Theta_{c}:=\{z\in\mathbb{C}:\,\,\mathrm{Re}z<\theta_{c}\},

we have

𝔼[ezN(τ)]=exp(0τ(G(t,z)1)σ(t)dt)\mathbb{E}[\mathrm{e}^{zN(\tau)}]=\exp\left(\int_{0}^{\tau}(G(t,z)-1)\,\sigma(t)\mathrm{d}t\right) (9)

and

supzΘc|𝔼[ezN(τ)]|<.\sup_{z\in\Theta_{c}}|\mathbb{E}[\mathrm{e}^{zN(\tau)}]|<\infty. (10)

Here

G(t,z):=𝔼[ezSt,τ],G(t,z):=\mathbb{E}[\mathrm{e}^{zS_{t,\tau}}], (11)

is the solution of the following functional equation.

G(t,z)=ez𝔼[e0τ(G(v,z)1)μ(v)ν(vt,M1)dv],(t,z)(0,τ)×Θc.G(t,z)=\mathrm{e}^{z}\mathbb{E}\left[\mathrm{e}^{\int_{0}^{\tau}(G(v,z)-1)\mu(v)\nu(v-t,M_{1})\,\mathrm{d}v}\right],\quad(t,z)\in(0,\tau)\times\Theta_{c}. (12)

From the moment generating function we can obtain an expression for the first and second moment of N(τ)N(\tau) (the first moment can also be obtained directly from the average stochastic intensity, as we have done in Sec. 2.).

Theorem 4.2 (The first two moments of N(τ)N(\tau))

Assume (1) and (8). Then

𝔼[N(τ)]=N¯(τ)=0τλ¯(t)dt=0τE1(t)σ(t)dt\displaystyle\mathbb{E}[N(\tau)]=\overline{N}(\tau)={\int_{0}^{\tau}\overline{\lambda}(t)\,\mathrm{d}t}={\int_{0}^{\tau}E_{1}(t)\sigma(t)\,\mathrm{d}t} (13)

and

𝔼[N(τ)2]=0τE2(t)σ(t)dt+(0τE1(t)σ(t)dt)2,\displaystyle\mathbb{E}[N(\tau)^{2}]={\int_{0}^{\tau}E_{2}(t)\sigma(t)\,\mathrm{d}t}+{\left(\int_{0}^{\tau}E_{1}(t)\sigma(t)\,\mathrm{d}t\right)^{2}}, (14)

with

E2(t)\displaystyle E_{2}(t) =G′′(t,0)=𝔼[St,τ2]=1+tτE2(v)μ(v)ν¯(vt)dv+\displaystyle=G^{\prime\prime}(t,0)=\mathbb{E}[S^{2}_{t,\tau}]=1+\int_{t}^{\tau}E_{2}(v)\mu(v)\overline{\nu}(v-t)\,\mathrm{d}v\,+
+2tτE1(v)μ(v)ν¯(vt)dv+(tτE1(v)μ(v)ν¯(vt)dv)2\displaystyle+{2}\int_{t}^{\tau}E_{1}(v)\mu(v)\overline{\nu}(v-t)\,\mathrm{d}v+\left(\int_{t}^{\tau}E_{1}(v)\mu(v)\overline{\nu}(v-t)\mathrm{d}v\right)^{2} (15)
E1(t)=G(t,0)=𝔼[St,τ]=1+tτλ(t)¯(v)dvE_{1}(t)=G^{\prime}(t,0)=\mathbb{E}[S_{t,\tau}]=1+\int_{t}^{\tau}\overline{\lambda^{(t)}}(v)\,\mathrm{d}v (16)

and

λ(t)¯(v)=μ(v)ν¯(vt)+μ(v)tvν¯(vu)λ(t)¯(u)du.\overline{\lambda^{(t)}}(v)=\mu(v)\overline{\nu}(v-t)+\mu(v)\int_{t}^{v}\overline{\nu}(v-u)\overline{\lambda^{(t)}}(u)\,\mathrm{d}u. (17)

Note that (4.2) and (17) are second-type inhomogeneous Volterra equations. In the particular case in which μ()\mu() is constant, solutions for (4.2) and (17) can be found by applying a standard Laplace transform methodology.

5 COVID-19 Model

Now we describe how we applied the modulated Hawkes process introduced before to model the propagation dynamics of COVID-19. The proposed model could actually be used to represent the dynamics of other similar viruses as well.

First, we take advantage of the fact that we do not need to consider a unique kernel function for all infected. Indeed, the presence of marks allows us to introduce different classes of infectious individuals with specific kernel functions. Specifically, we have considered three classes of infectious: symptomatic, asymptomatic and superspreader, denoted by symbols {s,a,h}\{s,a,h\}. We assume that, when a person gets infected, it is assigned a random class C{s,a,h}C\in\{s,a,h\} with probabilities ps,pa,php_{s},p_{a},p_{h}, respectively, ps+pa+ph=1p_{s}+p_{a}+p_{h}=1, 0ps10\leq p_{s}\leq 1, 0pa10\leq p_{a}\leq 1, 0pu10\leq p_{u}\leq 1.

As the name suggests, symptomatic people are those who will develop evident symptoms of infection, and we assume that because of that they will be effectively quarantined at home or hospitalized at the onset of symptoms. On the contrary, the asymptomatic mark is given to individuals who will not develop strong enough symptoms to be quarantined. Therefore, they will be able to infect other people for the entire duration of the disease, though at low infection rate (unless they get scrutinized by mass testing, as explained later). At last, superspreaders are individuals who exert a high infection rate but do not get quarantined due to several possible reasons (unless they get scrutinized by mass testing). This class also includes people with mild symptoms, who become highly contagious because of their mobility pattern (e.g., participation to ‘superspreading events’). Though the above classification of infectious individuals is a simplified one, a properly chosen mix of the three considered classes can represent a wide range of different scenarios.

Irrespective of their class, we will assume that all infectious people go through the following sequence of stages: first, there is a random incubation time, denoted by r.v. II, with given cdf FI()F_{I}(). During this time we assume the all infected exert a low infection rate λlow\lambda_{\rm low}. Then, there is a crucial pre-symptomatic period, during which infected in classes {s,h}\{s,h\} already exert a high infection rate λhigh\lambda_{\rm high}, while infected in class aa still exert low infection rate λlow\lambda_{\rm low}. For simplicity, we have assumed the presymptomatic phase to have constant duration ww.

The following evolution of the infection rate of an individual depends on the class: since we assume that symptomatic people get effectively quarantined, they no longer infect other individuals after the onset of symptoms. We model this fact introducing a quarantined class of people (denoted by qq), and deterministically moving all infected in class ss to class qq after time I+wI+w.

People in classes {a,h}\{a,h\} continue to be infectious during a disease period of random duration DD, with given cdf FD()F_{D}(). The difference between these two classes is that infectious in class aa (hh) exert, during the disease period, infection rate λlow\lambda_{\rm low} (λhigh\lambda_{\rm high}), respectively. At last, we assume that people in classes {a,h}\{a,h\} enter a residual period of random duration EE, with cdf FE()F_{E}(), during which all of them exert infection rate λlow\lambda_{\rm low}. We introduced this additional phase because some people who recovered from COVID-19 where found to be still contagious several days (even weeks) after the end of the disease period. Note that durations I,D,EI,D,E are assumed to be independent, and that the complete mark associated to an infected is the tuple M=(C,I,D,E)M=(C,I,D,E).

Refer to caption
Figure 3: Kernel functions for classes s,a,hs,a,h (from top to bottom), for given durations I,D,EI,D,E of stages.

An illustration of the three class-dependent kernels ν(t,s),ν(t,a),ν(t,h)\nu(t,s),\nu(t,a),\nu(t,h), conditioned on the durations (I,D,E)(I,D,E), is reported in Fig. 3.

Remark 5.1

We emphasize that our model of COVID-19 is targeted at predicting the process of new infections, rather than the current number of people hosting the virus in various conditions. In particular, we do not explicitly describe the dynamics of symptomatic but quarantined people and their exit process (i.e., recovery or, in the worst case, death), since such dynamics have no effect on the spreading process (under the assumption of perfect quarantine). However, if desired, one could describe through appropriate probabilities and time distributions how quarantined people split between those isolated at home and those who get hospitalized, the fraction going to intensive care, and those who unfortunately die.

Parameter symbol COVID-19 fitted value
incubation period I tri([2,12], mean 6)
pre-symptoms period w 2
disease period D unif([2,12])
residual period E exp(10)
low infection rate λlow\lambda_{\rm low} 0.05
high infection rate λhigh\lambda_{\rm high} 1
symptomatic probability psp_{s} 0.06
asymptomatic probability pap_{a} 0.91
super-spreader probability php_{h} 0.03
Table 1: Virus-specific parameters.

The parameters introduced so far, summarized in Table 1, are related to specific characteristics of the virus. We now model properties of the specific environment where the virus spreads, taking into account the impact of countermeasures. First, we need to specify the immigration process σ(t)\sigma(t). To keep the model as simple as possible, we have assumed that the system starts at a given time with I0I_{0} new infections, i.e., immigrants arrive as a single burst concentrated at one specific instant.

The impact of countermeasures is taken into account in the model in two different ways. First, modulating function μ(t)\mu(t) can be used to model the instantaneous reduction of the infection rate at time tt due to the current mobility restrictions, and, more in general, changes in the environment which affect the ability of the virus to propagate in the susceptible population (such as seasonal effects). Typically, μ(t)\mu(t) is a decreasing function in the initial part of the epidemics, in response to new regulations introduced by the government, while it goes up again when mobility restrictions are progressively released.

Second, we assume that any infected not already found to be positive is tested (e.g., by means of massive swab campaigns) at individual, instantaneous rate ρ(t)\rho(t), which reflects both the amount of resources employed by the health service to discover the infected population, and the effectiveness of tracking. Recall that, when an infected is found positive, we assume it transits to the quarantined state qq and stops infecting other people.

Refer to caption
Figure 4: State transitions of an infected individual.

Fig. 4 shows the resulting transitions that can occur for the different classes. Note that class qq collects all cases known to the health authorities at a given time tt, and thus coincides with the set of detected cases. Infected in classes s,a,hs,a,h are still unknown to the health service (but they are detectable). When they stop to be positive, people in classes a,ha,h transit to a class uu (undetected), which collects all infected who remain unknown to the health service.

Environment-related parameters are summarized in Table 2.

Parameter symbol fitted value for first wave in Italy
initial day Δt-\Delta t -20
initial number of infected I0I_{0} 370
modulating function μ(t)\mu(t) Ta=3T_{a}=-3, μa=3.71\mu_{a}=3.71
Tb=30T_{b}=30, μb=0.31\mu_{b}=0.31, (see Fig. 5)
detection rate ρ(t)\rho(t) ρ(t)=0.000115t\rho(t)=0.000115\,t,  (see Fig. 5)
Table 2: Environment-specific parameters.
Remark 5.2

We emphasize that the model described so far does not explicitly model spatial effects and sub-populations. As such, it is more suitable to describe a homogeneous scenario, where its environmental parameters (including modulating functions μ(t)\mu(t) and detection rate ρ(t)\rho(t)) can be reasonably assumed to apply to all individuals of the population irrespective of their spatial location. The natural candidate of such scenario is a single nation or just a sub-region of it. By so doing, we can assume that common national regulations are applied, as well as common health-care practices. However, our approach based on Hawkes processes could be extended to incorporate spatial effects, by considering spatio-temporal kernel functions ν(𝐱,t)\nu({\bm{x}},t), where 𝐱{\bm{x}} is a spatial vector originated at the point at which a new infection occurs. Another possibility would be to consider a multivariate temporal Hawkes process, where each component represents a homogeneous region, and the different regional processes can interact with each other.

5.1 Computation of the real-time reproduction number R(t)R(t)

From our model, it is possible to compute in a native way the average number of infections caused by an individual who gets infected at time tt, i.e., the real-time reproduction number R(t)R(t). To compute R(t)R(t), we condition on the duration xx of the incubation time, on the duration yy of the disease time, on the duration zz of the residual time, and on the class assigned to the node getting infected at time tt (note that they are all independent of each other):

R(t)=xyz(λlow0xμ(t+τ)u(t,τ)dτ+paλlowxx+w+y+zμ(t+τ)u(t,τ)dτ+(ps+ph)λhighxx+wμ(t+τ)u(t,τ)dτ+phλhighx+wx+w+yμ(t+τ)u(t,τ)dτ+phλlowx+w+yx+w+y+zμ(t+τ)u(t,τ)dτ)dFE(z)dFD(y)dFI(x)R(t)=\int_{x}\int_{y}\int_{z}\Bigg{(}\lambda_{\rm low}\int_{0}^{x}\mu(t+\tau)u(t,\tau){\rm\,d}\tau+p_{a}\lambda_{\rm low}\int_{x}^{x+w+y+z}\mu(t+\tau)u(t,\tau){\rm\,d}\tau+\\ (p_{s}+p_{h})\lambda_{\rm high}\int_{x}^{x+w}\mu(t+\tau)u(t,\tau){\rm\,d}\tau+p_{h}\lambda_{\rm high}\int_{x+w}^{x+w+y}\mu(t+\tau)u(t,\tau){\rm\,d}\tau+\\ p_{h}\lambda_{\rm low}\int_{x+w+y}^{x+w+y+z}\mu(t+\tau)u(t,\tau){\rm\,d}\tau\Bigg{)}{\rm\,d}F_{E}(z){\rm\,d}F_{D}(y){\rm\,d}F_{I}(x) (18)

where

u(t,τ)=ett+τρ(s)dsu(t,\tau)=e^{-\int_{t}^{t+\tau}\rho(s){\rm\,d}s}

is the probability that a node which gets infected at time tt is still undetected at time t+τt+\tau.

6 Model fitting for the first wave of COVID-19 in Italy

We fit the model to real data related to the spread of COVID-19 in Italy, publicly available on GitHub [20]. Italy was the country where the epidemics first spread outside of China into Europe, causing about 34600 deaths at the end of June 2020 during the first wave.

Our main goal was to match the evolution of the number of detected cases, represented in the model by individuals in class qq. The actual count is provided by the Italian government on a daily basis since February 24th 2020. We take this date as our reference day zero. However, it is largely believed that the epidemics started well before the end of February. Indeed, it became soon clear that detected cases were just the top of a much bigger iceberg, as the prevalence of asymptomatic infection was initially largely unknown, which significantly complicated the first modeling efforts to forecast the epidemic evolution. During June-July 2020, a blood-test campaign (aimed at detecting IgG antibodies) was conducted on a representative population of 64660 people to understand the actual diffusion of the first wave of COVID-19 in Italy [21]. As a main result, it has been estimated that 1 482 000 people have been infected. This figure provides a fundamental hint to properly fit our model, and indeed while exploring the parameter space we decided to impose that the total number of predicted cases at the end of June (day 120) is roughly 1 500 000.

For the durations II,ww,DD,EE of the different stages of an infection, we have tried to follow estimates in the medical literature. In particular, the duration of the incubation period is believed to range between 2 and 12 days, with a sort of bell shape around about 5 days [22, 23, 24], which we have approximated, for simplicity, by a (asymmetric) triangular distribution with support [2,12][2,12] and mean 6. Moreover, we have fixed the duration of the pre-symptomatic phase to 2 days. For the duration of the disease period we have taken a uniform distribution on [2,12][2,12], while the residual time is modeled by an exponential distribution with mean 10 days [25].

The other virus-related parameters have been set as reported in the third column of Table 1. Note that, since we further apply the external modulating function μ(t)\mu(t), we could arbitrarily normalize to 1 the value of λhigh\lambda_{\rm high}, while we set λlow=0.05\lambda_{\rm low}=0.05.

For what concerns environment-related parameters (see Table 2), a first problem was to choose a proper initial day Δt-\Delta t (recall that day zero is the first day of the trace) at which to start the process with I0I_{0} initially infected individuals. While different pairs (Δt,I0)(\Delta t,I_{0}) are essentially equivalent, we decided not to start the process with too few cases and too much in advance with respect to day 0, to limit the variance of a single simulation run. We ended up setting Δt=20\Delta t=20, while I0=370I_{0}=370 was selected as explained later.

Refer to caption
Figure 5: Chosen profiles and parameters of functions μ(t)\mu(t) and ρ(t)\rho(t) for the first wave of COVID-19 in Italy.

Mobility restrictions in Italy were progressively enforced by national laws starting a few days before day 0, first limited to red zones in Lombardy, and soon extended to the entire country through a series of increasingly restrictive regulations, introduced over the next 30 days. Instead of trying the capture the step-wise nature of such restrictions, we assume μ(t)\mu(t) to take the simpler profile depicted in Figure 5, i.e., a high initial value μa\mu_{a} before time TaT_{a}, a low final value μb\mu_{b} after time TbT_{b}, and a linear segment connecting point (Ta,μa)(T_{a},\mu_{a}) to point (Tb,μb)(T_{b},\mu_{b}). We decided to set Ta=3T_{a}=-3 and Tb=30T_{b}=30 to reflect the time window in which mobility restriction were introduced.

In Fig. 5 we also show our choice for the profile of detection rate ρ(t)\rho(t), i.e., a linear increase starting from day 0, with coefficient α\alpha, ρ(t)=max{0,αt}\rho(t)=\max\{0,\alpha t\}. This profile is justified by the fact that the first wave caught Italy totally unprepared (ρ=0\rho=0 before day zero), while massive swabs were only gradually deployed over time after day zero.

The critical parameters I0,μa,μb,αI_{0},\mu_{a},\mu_{b},\alpha were fitted by a minimum mean square error (MMSE) estimation technique based on the curve of detected cases in the time window [0120][0-120] days, after all other parameters were manually selected as detailed above.

Refer to caption
Figure 6: Evolution of the number of detected cases according to model and real data. Cumulative number (left y axes) and its daily variation (right y axes).
Refer to caption
Figure 7: Evolution of the actual number of cases according to model only. Cumulative number (left y axes) and its daily variation (right y axes).

Fig. 6 shows the final outcome of our fitting, comparing the evolution of the number of detected cases according to model and real data. We show both the cumulative number (left yy axes) and its daily increment (right yy axes). Analytical predictions for the mean trajectory of detected cases were obtained by averaging 100 simulation runs. The shaded region around analytical curves (barely visible) shows 95%-level confidence intervals computed for each of the 120 days.

We emphasize that a similar good match could be possibly obtained by other sets of parameters. Our purpose here was not to compute the best possible fit in the entire parameter space (which would be nearly impossible), but to show that the model is rich enough to capture the behavior observed on the real trace after a reasonable choice of most of its parameters, driven by their physical meaning.

Fig. 7 shows instead the evolution of the real number of cases (both the cumulative number and its daily variation) according to the model only, in the absence of data. Note however that we have constrained ourselves to obtain a total number of about 1 500 000 cases on day 120, as suggested by the serological test [21].

Interestingly, by looking at the values of μa\mu_{a} and μb\mu_{b} computed by our MMSE, it appears that the national lockdown was able to reduce the spreading ability of the virus within the Italian population by a factor of about 12 (from 3.71 to 0.31). We will see later on in Fig. 12 that our fitting of μ(t)\mu(t) for the first wave is consistent with mobility trends estimated from the usage of the Apple maps application [19].

Refer to caption
Figure 8: Real-time reproduction number computed by the model, compared to its estimation based on the trace of detected case, according to the Wallinga-Teunis method (with 95% confidence interval denoted as shared region).

In Fig. 8 we show the real-time reproduction number computed by (18). We also applied the classic method of Wallinga-Teunis [26], implemented as in the R0 package [27], to estimate R(t)R(t) from the trace of detected cases. To apply their method, we used a generation time obtained from a Gamma distribution with mean 6.6 (shape 1.87, scale 0.28), which has been proposed for the first wave of COVID-19 in Italy [28]. Interestingly, though both estimates of R(t)R(t) exhibit qualitatively the same behavior, the model-based value of R(t)R(t), which considers also undetected cases, tends to be smaller.

6.1 What-if scenarios

Having fitted the model to the available trace, we proceed to exploit the model to examine interesting what-if scenarios. First, we investigate what would have happened (according to the fitted model) if lockdown restrictions were shifted in time by an amount of days δ\delta. This means that we keep all parameters the same, except that we translate horizontally in time the profile of μ(t)\mu(t) depicted in Fig. 5. In Figure 9 we show the total number of cases that would have occurred for values of δ{7,3,0,3,7}\delta\in\{-7,-3,0,3,7\}.

Refer to caption
Figure 9: What-if scenario: total number of cases if restrictions were shifted in time by δ\delta days.

We observe that a shift of just 3 days corresponds to a factor of roughly 2 in the number of cases. This translates, dramatically, into an equivalent impact on the number of deaths, if we assume that the mortality rate would have stayed the same555This is somehow optimistic, since mortality also depends on the saturation level of intensive therapy facilities. (i.e., 34600/15000002.3%34600/1500000\sim 2.3\%). In other words, a postponement (anticipation) of lockdown restrictions by just 3 days wold have caused twice (half) the number of deaths, which is a rather impressive result.

Refer to caption
Figure 10: What-if scenario: impact of detection rate ρ(t)=αt\rho(t)=\alpha t on the real and detected number of cases, varying α\alpha.

In Figure 10 we investigate instead the impact of detection rate ρ(t)\rho(t), by changing its slope α\alpha (recall Fig. 5). We report both the number of real cases and the number of detected cases predicted by the model, when all other fitted parameters are kept the same. We consider what would have occurred with α=0\alpha=0 (which means that infectious people are never tested), and by doubling the intensity of the detection rate (a tracking system twice more efficient). As expected, with α=0\alpha=0 only symptomatic cases (ps=6%p_{s}=6\%) are eventually detected. This time the effect on the final number of cases (or deaths) is not as dramatic as in the previous what-if scenario. This suggests that the impact of mass testing in Italy during the first wave was marginal, and doubling the efforts would not have produced significant changes in the final outcome.

7 Model fitting for the second wave of COVID-19 in Italy

The second wave of COVID-19 hit Italy in late summer 2020, as in many other European countries, mainly as en effect of relaxed mobility in July/August and possibly other seasonal effects. It is interesting to compare the second wave with the first wave, by looking at the daily increments of detected cases and deaths, see Fig. 11.

Refer to caption
Figure 11: Daily increment of the number of deaths (black curve, right yy-axes), daily increment of the number of detected (red curve, left yy axes), and estimated daily increment of real cases projected back from the curve of deaths (green curve, left yy-axes).

We observe that the number of daily deaths is similar between the two waves. The daily increment of detected cases, instead, is very different (around 5000 at the peak of the first wave, around 35000 at the peak of the second wave, a 7-fold increase). This can be explained by the much larger capacity of the health service to perform swabs and track down the infected, built on the experience gained from the first wave. It also suggests that, differently from the first wave, the impact of ρ(t)\rho(t) (the individual rate at which an infectious is detected) is expected to be much more important during the second wave than in the first wave, requiring a careful treatment of it in the model.

We have indeed tried to fit our model to the second wave in Italy, keeping all parameters related to the virus (previously fitted for the first wave) unchanged (Table 1), and adapting only environment-specific parameters (Table 2). A major difficulty that we had to face was the unknown (at the time of writing) actual diffusion of the virus in the population during the second wave. Recall that, for the first wave, we exploited a blood-test campaign to get a reference for the total number of real cases at the end of the first wave. For the second wave we employed instead a different approach based on a projection back in the past of the increment of deaths. A similar idea has been adopted in [29] to estimate the time-varying reproduction number in different European country at the onset of the first wave.

Specifically, we got from [30] the indication of the median (11 days) and IQR (6-18 days) of the amount of time from symptoms onset to death during Oct-Dec 2020, that we fitted by a Gamma distribution (shape 1.65, scale 8.45). By convolving such Gamma distribution with the distribution of incubation time, and the pre-symptoms period, we obtained a distribution of the total time from infection to death, that we used to estimate the time in the past at which each dead person was initially infected. By amplifying the number of infections leading to death by the inverse of the mortality rate we eventually obtained an estimate of the daily increment of real cases, as reported in Fig. 11 (green curve, left yy-axes).

We chose August 1st (day 160 on Fig. 11) as starting date of the new infection process producing the second wave in Italy. This time, instead of manually searching for suitable profiles of μ(t)\mu(t) and ρ(t)\rho(t) generating the expected curves of real and detected cases, we adopted a novel “reverse-engineering” approach: we took the daily increments of real and detected cases as input to the model, and computed the functions μ(t)\mu(t) and ρ(t)\rho(t) that would exactly produce in the model the given numbers of real and detected cases, on each day666Discretized values of functions μ(t)\mu(t) and ρ(t)\rho(t) are uniquely determined in the model, once we constrain ourselves to produce a given number of real and detected cases at each time instant..

Refer to caption
Figure 12: Modulating functions μ(t)\mu(t) for the first wave (black, fitted) and second wave (blue, reversed), and average mobility according to Apple maps application.

In Fig. 12 we show the obtained ‘reversed’ μ(t)\mu(t) (blue curve), starting from day 0 (August 1st) of the time reference adopted for the second wave. We also report the average mobility measured in Italy by the Apple maps application [19], where we have given equal weight (1/3) to driving, transit and walking mobility. We observe that the ‘reversed’ μ(t)\mu(t) qualitatively follows the same behavior of mobility measured on the maps application, with a gradual increase during the month of August followed by a gradual decrease as people (and the government) started to react to the incipient second wave.

For completeness, we have also reported on Fig. 12 the fitted μ(t)\mu(t) for the first wave (black curve). We observe that, during the first wave, the quick introduction of hard lockdown caused an abrupt decay of both measured mobility and fitted μ(t)\mu(t), characterized by a bigger reduction in a shorter time. The second wave, instead, was characterized by a smoother transition, due to the different choice of applying just partial lockdowns and progressive restrictions more diluted over time777The fact that, during the second wave, mobility values similar to those of the first wave do not translate into equally similar values of μ(t)\mu(t) can be attributed to increased awareness of people during the second wave..

In Fig. 13 we show instead the ‘reversed’ function ρ(t)\rho(t), focusing on the second wave. We also report on the same plot the daily increments of real and detected cases, which allow us to better understand the obtained profile of ρ(t)\rho(t), highlighting an interesting phenomenon occurred around day 30. Indeed, we observe that, during the first month, the epidemic was closely tracked by the national health system, but at some point, around day 30, the curve of detected cases stops increasing, and stays roughly constant during the entire second month, lagging more and more behind the otherwise exploding curve of real cases (time window 30-60). As consequence, function ρ(t)\rho(t), which describes the effectiveness of individual tracking, falls down to a minimum reached at around day 60. This behavior can be interpreted as an effect of the saturation of the capacity to perform swabs, resulting in a progressive collapse of the tracking system, as actually experienced by many people during those days.

Refer to caption
Figure 13: Reverse detection rate ρ(t)\rho(t) for the second wave (left y-axes), and daily increase of real and detected cases (right y-axes).

Our analysis shows that, in contrast to what might be believed by just looking at the curve of detected cases, September (days 30-60) was, perhaps, the most critical period for the outbreak of the second wave of COVID-19 in Italy. During this period, the detection capacity of the national health system was saturated, and could not keep the pace with the rapid growth of real cases, giving instead the illusion of maintaining the epidemic under control. Our reverse-engineering approach can thus shed some light on what actually happened at the onset of the second wave, and quantitatively assess the collapse of the tracking system.

8 Conclusions

We have proposed a time-modulated version of the Hawkes process to describe the temporal evolution of an epidemic within an infinite population of susceptible individuals. Our approach allows us to take into account precise distributions for the time spent in different stages of the infection, which is of paramount importance when the intensity of countermeasures (mobility restrictions, testing and tracing) varies significantly on time-scales comparable to that of an individual infection. We have applied the model to the spread of COVID-19 in Italy, either by a direct fit of its parameter (first wave), or by a novel reverse fit (second wave) which allows us, in retrospect, to understand from data the time-varying effectiveness of applied countermeasures. Future work will extend the model to overcome some of its current limitations, like the impact of spatial effects and other sources of heterogeneity in the population, such as age groups. We think the proposed approach is promising and could be usefully applied to explain the epidemic progress and forecast/assess the impact of control/mitigation measures.

Appendix A Explicit solution of λ¯(t)\overline{\lambda}(t) for constant μ(t)=μ\mu(t)=\mu

When μ()μ\mu(\cdot)\equiv\mu is constant, denoting with * the convolution product, we can rewrite equation (2) as

λ¯(t)=σ(t)+μ(ν¯λ¯)(t),t>0,\overline{\lambda}(t)=\sigma(t)+\mu\cdot(\overline{\nu}*\overline{\lambda})(t),\quad t>0,

which can be easily solved in the transformed domain. For a non-negative function f:++f:\mathbb{R}_{+}\to\mathbb{R}_{+}, we denote by

f^(s):=+estf(t)dt,s\widehat{f}(s):=\int_{\mathbb{R}_{+}}\mathrm{e}^{-st}f(t)\,\mathrm{d}t,\quad s\in\mathbb{C}

the Laplace transform of ff. Then we have:

λ¯^(s)=σ^(s)1μν¯^(s)for Re(s)>Re(zmax),\widehat{\overline{\lambda}}(s)=\frac{\widehat{\sigma}(s)}{1-\mu\widehat{\overline{\nu}}(s)}\qquad\text{for }\mathrm{Re}(s)>\mathrm{Re}(z_{\max}),

with zmaxz_{\max} equal to the zero of 1μν¯^(s)1-\mu\widehat{\overline{\nu}}(s) with largest real part.

In addition, formally, as long as μ<1\mu<1 and σ\sigma is bounded, we can write

λ¯(t)=σ(t)+σ(t)i=1μiν¯i(t)\overline{\lambda}(t)=\sigma(t)+\sigma(t)*\sum_{i=1}^{\infty}\mu^{i}\overline{\nu}^{*i}(t)

where ν¯i(t)\overline{\nu}^{*i}(t) is the ii-th fold convolution of v¯(t)\overline{v}(t).

An analytical expression of λ¯(t)\overline{\lambda}(t) can be obtained when σ(s)\sigma(s) and ν¯(s)\overline{\nu}(s) are both rational. Table 3 reports the dominant888 We recall that a non negative function f(t)f(t) is said to be Θ(1)\Theta(1), iff there exist two constants 0<cC<0<c\leq C<\infty such that cf(t)Cc\leq f(t)\leq C, for sufficiently large tt. term of λ¯(t)\overline{\lambda}(t) (and also of N¯(t)\overline{N}(t)) for the case in which σ(t)=βexp(βt)\sigma(t)=\beta\exp(-\beta t), and ν¯(t)\overline{\nu}(t) takes different shapes satisfying:

0ν¯(t)dt=1;0tν¯(t)dt=1α\int_{0}^{\infty}\overline{\nu}(t){\rm\,d}t=1\quad;\quad\int_{0}^{\infty}t\,\overline{\nu}(t){\rm\,d}t=\frac{1}{\alpha}
shape ν¯(t)\overline{\nu}(t) λ¯(t)\overline{\lambda}(t)
delta δ(t1α)\delta(t-\frac{1}{\alpha}) Θ(1)eαlog(μ)t\Theta(1)\,{\mathrm{e}}^{\alpha\log(\mu)t}
erl-22 (2α)2te2αt(2\alpha)^{2}t{\mathrm{e}}^{-2\alpha t} Θ(1)e2α(μ1)t\Theta(1)\,{\mathrm{e}}^{2\alpha(\sqrt{\mu}-1)t}
exp αeαt\alpha{\mathrm{e}}^{-\alpha t} Θ(1)eα(μ1)t\Theta(1)\,{\mathrm{e}}^{\alpha(\mu-1)t}
hyper1 2z2e2zz+1αt+2e2z+1αt(z+1)2\frac{{2z^{2}}{\mathrm{e}}^{-\frac{2z}{z+1}\alpha t}+2{\mathrm{e}}^{-\frac{2}{z+1}\alpha t}}{(z+1)^{2}} Θ(1)eαt[(μ1)(1+z2)2z+(μ2(1+z2)2+(z21)2(12μ)(1+z)2]\Theta(1)\,{\mathrm{e}}^{\alpha t\left[\frac{(\mu-1)(1+z^{2})-2z+\sqrt{(\mu^{2}(1+z^{2})^{2}+(z^{2}-1)^{2}(1-2\mu)}}{(1+z)^{2}}\right]}
hyper2 z3αezαt+αeαztz(z+1)\frac{z^{3}\alpha{\mathrm{e}}^{-z\alpha t}+\alpha e^{-\frac{\alpha}{z}t}}{z(z+1)} Θ(1)eαt(μ1)(1+z2)μz+μ2z2+(z1)2[μ2(z2+1)2μ(z2+z+1)+(z+1)2]2z\Theta(1)\,{\mathrm{e}}^{\alpha t\frac{(\mu-1)(1+z^{2})-\mu z+\sqrt{\mu^{2}z^{2}+(z-1)^{2}[\mu^{2}(z^{2}+1)-2\mu(z^{2}+z+1)+(z+1)^{2}]}}{2z}}
Table 3: Dominant term of λ¯(t)\overline{\lambda}(t) for different kernel shapes having the same average generation time g=1/αg=1/\alpha.

Besides the simple deterministic (delta) and exponential (exp) shapes, we consider the Erlang-2 (erl-22) and two variants of hyper-exponential, whose general form is ν¯(t)=p1α1eα1t+p2α2eα2t\overline{\nu}(t)=p_{1}\alpha_{1}e^{-\alpha_{1}t}+p_{2}\alpha_{2}e^{-\alpha_{2}t}. To reduce the degrees of freedom of the general hyper-exponential, we have assumed a particular relationship between p1/p2p_{1}/p_{2} and α1/α2\alpha_{1}/\alpha_{2}, which allows us to introduce a single parameter zz. Specifically, we have set either p1/p2=α1/α2=zp_{1}/p_{2}=\alpha_{1}/\alpha_{2}=z (denoted by hyper1) or p1/p2=α1/α2=zp_{1}/p_{2}=\sqrt{\alpha_{1}/\alpha_{2}}=z (denoted by hyper2). Note that in both cases the variance of the corresponding hyper-exponential distribution increases with z1z\geq 1, becoming arbitrarily large as zz\to\infty.

For all kernel shapes, and μ>1\mu>1, the growth rate of λ¯(t)\overline{\lambda}(t) is exponential, i.e., λ¯(t)=Θ(eηt)\overline{\lambda}(t)=\Theta(e^{\eta t}), and the same holds for N¯(t)=0tλ¯(τ)dτ\overline{N}(t)=\int_{0}^{t}\overline{\lambda}(\tau){\rm\,d}\tau. The rate η\eta of the exponential growth, however, depends on the specific shape.

In particular, for the two considered cases of hyper-exponential shape, η\eta is an increasing function of zz: while in the hyper1 case η\eta saturates to 2(μ1)α2(\mu-1)\alpha, as zz\rightarrow\infty, in the hyper2 case η\eta is even unbounded, as zz\rightarrow\infty.

We also notice that the rate of the exponential kernel is larger than the rate of the Erlang-2 kernel, which is in turn larger than the rate of the deterministic kernel.

These results confirm that, even under a constant modulating function μ(t)\mu(t), the epidemic growth rate depends on the specific shape of the kernel function ν¯(t)\overline{\nu}(t).

References