This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Fat Tails and Black Swans: Exact Results for Multiplicative Processes with Resets

D. H. Zanette zanette@cab.cnea.gov.ar Centro Atómico Bariloche and Instituto Balseiro, Comisión Nacional de Energía Atómica and Universidad Nacional de Cuyo, Consejo Nacional de Investigaciones Científicas y Técnicas, Av. E. Bustillo 9500, R8402AGP San Carlos de Bariloche, Río Negro, Argentina.    S. Manrubia smanrubia@cnb.csic.es Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Centro Nacional de Biotecnología (CSIC); c/ Darwin 3, 28049 Madrid, Spain.
Abstract

We consider a class of multiplicative processes which, added with stochastic reset events, give origin to stationary distributions with power-law tails —ubiquitous in the statistics of social, economic, and ecological systems. Our main goal is to provide a series of exact results on the dynamics and asymptotic behaviour of increasingly complex versions of a basic multiplicative process with resets, including discrete and continuous-time variants and several degrees of randomness in the parameters that control the process. In particular, we show how the power-law distributions are built up as time elapses, how their moments behave with time, and how their stationary profiles become quantitatively determined by those parameters. Our discussion emphasizes the connection with financial systems, but these stochastic processes are also expected to be fruitful in modeling a wide variety of social and biological phenomena.

preprint: AIP/123-QED

City sizes, word usage, surname abundance, personal income or stock market returns are examples of power-law distributed quantities. Such kind of distribution, ubiquitous in the natural and social sciences, holds “atypical” properties that have awoken the interest of researchers for over a century.Auerbach (1913); Estoup (1916); Yule (1924); Zipf (1949); Simon (1955) The dynamics of this kind of data exhibit extreme, catastrophic, “unexpected” black-swan-like events.Taleb (2007a) Distribution moments, such as the average or the variance, are highly volatile and poorly predict future properties of the process. In that context of uncertainty, knowledge of the dominant mechanisms underlying power-law distributions is relevant to directly compare the short-time properties of data series to actual asymptotic properties, and to eventually evaluate the reliability of forecast algorithms. Stochastic multiplicative processes (SMPs) with reset events, introduced two decades agoManrubia and Zanette (1999) as a generic mechanism to generate power laws, have multiple applications in a variety of situations.Evans and Majumdar (2011); Montero and Villarroel (2013) In this contribution, we derive several finite-time properties of SMPs with reset events with the aim of improving our understanding of the poor predictability of the dynamical process. The discussion of our results in a financial context clarifies the relationship between gain and risk in investing strategies, and provides clues to control the frequency and magnitude of extreme events.

I Introduction

“At first glance, the facts of human life do not seem to subordinate themselves, as do the phenomena of nature, to certain general laws. Statistics show, however, that this is just a gradual difference due to the more complicated nature of human relations, and that man only has to learn to read the statistics correctly in order to extract more general conclusions; and it is not uncommon to find interesting and strange laws.”

The quote above opens a report written in 1913 by Felix Auerbach,Auerbach (1913) where he discussed the skewed abundance distribution of cities of various sizes in Germany, in what is likely the first published document on the topic. Beyond the identification of an uncommon statistical pattern, Auerbach’s paper reveals a visionary intuition on the existence of general laws underlying the statistical properties of social phenomena. Three decades later, in his book “Human Behavior and The principle of least effort,”Zipf (1949) George Kingsley Zipf compiled multiple data with the aim of demonstrating regularities and subsequently discussing plausible, simple underlying principles. The distribution of city sizes and the frequency of words in written texts —an observation that goes back to 1916Estoup (1916)— are undoubtedly the two best-known examples, but Zipf also analyzed the length of intervals between repetitions in a Mozart concerto, the number of retail stores of like kind, the number of passengers travelling by airway, the personal income in different countries —first described by Vilfredo Pareto in 1896Pareto (1896)— and the number of composers of chamber music as a function of their year of birth, among many others. He ordered each set of events by decreasing size, S(k)S(k), assigning rank k=1k=1 to the largest event, k=2k=2 to the second largest, and so on. Though the relationship S(k)kαS(k)\propto k^{-\alpha}, with α1\alpha\approx 1, was pervasively found (and became known as Zipf’s law), there were also many cases of data characterized by different values of α\alpha, and even by other functional relations. Ever since, research on the mechanisms underlying emergent properties of collective human behaviour has been pursued with much effort and also remarkable success.Newman (2004)

Quantities whose frequency distributions are given by power-law functions have always awaken especial interest in the study of a variety of social systems, as it has the search for simple processes behind this kind of distribution.Mitzenmacher (2004); Zanette and Manrubia (2007) Pure stochastic multiplicative processes (SMPs) yield lognormal distributions for the relevant variables,Montroll and Shlesinger (1982) and can produce bona fide power laws when acting in conjunction with additional mechanisms. Actually, SMPs appear at the core of many successful explanatory models for power laws, such as Yule’s birth-death process to explain the distribution of taxa,Yule (1924) or Simon’s model to derive the abundance of words in written texts.Simon (1955) These early models rationalize power-law distributions in other systems as well, as in city sizesZanette (2008) or growth of business firms.Fu et al. (2005) Furthermore, suitable variations of Simon’s model reproduce the observed abundance of family names,Zanette and Manrubia (2001); Manrubia and Zanette (2002) while it yields distributions that fit remarkably well the actual frequency of words in written textsZanette and Montemurro (2005) and the usage of notes in musical compositions. Zanette (2006)

Stochastic processes with multiplicative noise plus reinjection are characterized by distributions with power-law tails.Takayasu, Sato, and Takayasu (1997); Sornette (1998a) This kind of mechanism has applications, for instance, in population dynamics and investment portfolio growth.Sornette (1998b) A variant of reinjection that sets a minimum value for the dynamical variable Levy and Solomon (1996) can be interpreted in an economic context as a subsidy that keeps individuals above a critical poverty line. SMPs with conservation of the total population, which is a generalization of Zeldovich’s intermittency model,Zel’dovich et al. (1987) has been used to explain Zipf’s law for cities.Zanette and Manrubia (1997) Introduction of reset events in SMPs renders power laws with exponents which depend on the reset probability and on the distribution of growth rates,Manrubia and Zanette (1999) a situation that can account for the power-law tail of the personal income distribution.Nirei and Souma (2004) Though reset events were introduced in the generic context of SMPs yielding power laws,Manrubia and Zanette (1999) this mechanism has been subsequently applied to a broad variety of problems.Evans and Majumdar (2011); Montero and Villarroel (2013); Montero, Masó-Puigdellosas, and Villarroel (2017); Evans, Majumdar, and Schehr (2019)

SMPs belong to a class of processes that can exhibit non-self-averaging effects. In physics, this property was first described for spin glasses, where the fact that different realizations of the process visit different areas of the phase space, even in the thermodynamic limit, yields different observable quantities (e.g., moments of the distribution of visited states) for each of the system replicas.Mézard, Parisi, and Virasoro (1987) Later, it was shown that non-self-averaging behavior was present in various systems,Derrida (1997); Derrida and Jung-Muller (1999) such as sums of power-law distributed random variables, branching processes, and one-dimensional random walks with return to the origin —a simple case of stochastic additive processes with resets.Meylahn, Sabhapandit, and Touchette (2015) A major consequence of the lack of self-averaging is that quantitative properties of the system estimated through averages over time or realizations can be highly unreliable and deeply differ from the actual asymptotic properties of the stochastic process.

Interestingly, the same phenomenology has been observed in economics and finance, where its relevance is difficult to overstate.Gabaix (2009) In these contexts, multiplicative processes are implicit in Gibrat’s law,Gibrat (1931) (or “law or proportionate effect”), which states that the rate of growth of a business firm is independent of its absolute size. Though, as a pure SMP, Gibrat’s law implies a lognormal distribution of the relevant variable, the ample evidence of power-law distributed quantities in economics and finance,Gabaix (2009) suggests that additional mechanisms must be at play. Indeed, since sustained exponential growth is unfeasible in reality, Gibrat’s law cannot hold at all times. Financial crises and market crashes act as “control mechanisms” in the form of catastrophic, punctuated interruptions of the idealized multiplicative growth. Actually, examples of such events abound: price indices suffered severe drops in the crises of 2008 and 2011Lagi et al. (2011) (see Fig. 1); in the 1980s, some banks lost more money than they ever made in their history;Caprio and Klingebiel (1999) financial bubbles more than often cause market crashes, like the well-known Wall Street crash of 1929 and many others. The abrupt loss of gains in a time much shorter than that required to accumulate them is commonplace in finance, and resets appear as a qualitatively suitable mechanism to mimic such dynamics.

Nassim Taleb’s Black Swan TheoryTaleb (2007b) explains the prominent role of such “unexpected” catastrophic events —as rare black swans among ordinary birds— not only in finance, but also in many other contexts, such as history, technology, and science. These hardly predictable, large-magnitude occurrences have vastly stronger effects than regular episodes, due to psychological biases and a generalized poor understanding of the role of probability in social phenomena, related to the subjective notions of luck and fate.Taleb (2007a) Such disproportionate consequences are a direct corollary of the unavoidable shortness of historical records, which tends to magnify the exceptionality of those events.

Refer to caption
Figure 1: Monthly price indices for three major commodities between years 2001 and 2012 (with 2002-2004 average =100=100). Sustained multiplicative growth might precede severe drops to low values. Among the factors put forward a posteriori to explain such events, one finds weather, increasing demand for meat, use of biofuels, and variations in currency exchange rates. However, on the basis of a deterministic model, it has been shown that the two sharp peaks around 2008 and 2011 are specifically due to investor speculation.Lagi et al. (2011) Public-domain data from the Food and Agriculture Organization (United Nations, www.fao.org).

In this contribution, we revisit SMPs with resets with the aim of improving our understanding of the dynamics of such processes. We derive a number of exact results that allow for precise comparison with quantities obtained from dynamical simulations and clarify the limitations of time-series or realization averages to estimate the moments of the stochastic process —and therefore to achieve statistical predictions of future events based on knowledge of the past. Sections II, III, and IV discuss three models of increasing complexity and present exact results for their dynamics. Their applicability in a financial context is further discussed in Section V, which closes this paper by highlighting other possible applications and future extensions of SMPs with resets.

II Uniform multiplication and reset

The simplest version of the class of multiplicative processes that we consider here is a Markov chain for a variable xtx_{t}, evolving in discrete time t=0,1,t=0,1,\dots, with xt>0x_{t}>0 for all tt and x0=1x_{0}=1. At each time step, xtx_{t} is either multiplied by a constant positive factor μ1\mu\neq 1 or it is reset to its initial value. The two instances occur with probabilities 1r1-r and rr, respectively (0<r<10<r<1). Namely,

xt+1={μxtwith probability 1r,1with probability r.x_{t+1}=\left\{\begin{array}[]{ll}\mu x_{t}&\mbox{with probability $1-r$},\\ 1&\mbox{with probability $r$}.\end{array}\right. (1)

Through the change of variables x=|lnx/lnμ|x^{\prime}=|\ln x/\ln\mu|, this stochastic process is equivalent to the so-called Sisyphus random walk, for which several of the results discussed in this section have been obtained in previous work.Montero and Villarroel (2016) For the sake of concreteness, we focus on the choice μ>1\mu>1, which implies that xt1x_{t}\geq 1 for all tt. Power-law tails in the probability distribution for large xx, in fact, develop for μ>1\mu>1 only. However, results can be straightforwardly extended to the case μ<1\mu<1, by exploiting the symmetry of the problem under the transformation μμ1\mu\to\mu^{-1}, xtxt1x_{t}\to x_{t}^{-1}. In this case, the probability exhibits power-law behavior for x0x\to 0.

Figure 2 shows the time dependence of xtx_{t} along two realizations of process (1), with μ=1.1\mu=1.1 and different values of the reset probability rr. The evolution consists of a succession of “bursts” of exponential growth induced by the multiplicative process, each of them terminated by a reset event. For r=0.08r=0.08, the less frequent reset events occasionally allow for rather high bursts, as compared with those obtained for r=0.12r=0.12 in the time span of the simulation.

Refer to caption
Figure 2: Two typical realizations of the stochastic process of Eq. (1), for μ=1.1\mu=1.1 and two values of the reset probability: r=0.08r=0.08 (dark curve) and r=0.12r=0.12 (light-shaded curve).

The stochastic process (1) can be readily dealt with, by noting that the variable xtx_{t} can only adopt the values 1,μ,μ2,1,\mu,\mu^{2},\dots or, generally, μm\mu^{m}, for m=0,1,2,m=0,1,2,\dots. From Eq. (1) it immediately follows that the probability pt(m)p_{t}^{(m)} that the variable equals μm\mu^{m} satisfies the Chapman-Kolmogorov equation

pt+1(m)=(1r)pt(m1)(1δm0)+rδm0,p_{t+1}^{(m)}=(1-r)p_{t}^{(m-1)}(1-\delta_{m0})+r\delta_{m0}, (2)

where δij\delta_{ij} is Kronecker’s delta. The solution reads

pt(m)={r(1r)mfor m<t,p0(mt)(1r)tfor mt,p_{t}^{(m)}=\left\{\begin{array}[]{ll}r(1-r)^{m}&\mbox{for $m<t$},\\ p_{0}^{(m-t)}(1-r)^{t}&\mbox{for $m\geq t$},\end{array}\right. (3)

in terms of a generic initial condition p0(m)p_{0}^{(m)}. For x0=1x_{0}=1, we have p0(m)=δm0p_{0}^{(m)}=\delta_{m0}, and

pt(m)={r(1r)mfor m<t,(1r)mfor m=t,0for m>t,p_{t}^{(m)}=\left\{\begin{array}[]{ll}r(1-r)^{m}&\mbox{for $m<t$},\\ (1-r)^{m}&\mbox{for $m=t$},\\ 0&\mbox{for $m>t$},\end{array}\right. (4)

cf. Eq. (7) in Ref. 39. In Eqs. (3) and (4), all the information about the initial condition is accounted for in the range mtm\geq t so that, as time elapses, it becomes progressively relegated to exponentially higher values of xx. The long-time stationary behavior of the probability pt(m)p_{t}^{(m)} is built up from small values of mm, given by the first line in both equations:

p(m)=r(1r)m,p_{\infty}^{(m)}=r(1-r)^{m}, (5)

cf. Eq. (11) in Ref. 39.

The moment xγt=mμγmpt(m)\langle x^{\gamma}\rangle_{t}=\sum_{m}\mu^{\gamma m}p_{t}^{(m)} can be exactly calculated for any order γ\gamma at all times. It reads

xγt=r1(1r)μγ+[xγ0r1(1r)μγ](1r)tμγt,\langle x^{\gamma}\rangle_{t}=\frac{r}{1-(1-r)\mu^{\gamma}}+\left[\langle x^{\gamma}\rangle_{0}-\frac{r}{1-(1-r)\mu^{\gamma}}\right](1-r)^{t}\mu^{\gamma t}, (6)

with xγ0=1\langle x^{\gamma}\rangle_{0}=1 for x0=1x_{0}=1. In contrast with the probability pt(m)p_{t}^{(m)} which, for asymptotically long times, always attains the stationary form given by Eq. (5), the moment xγt\langle x^{\gamma}\rangle_{t} converges to a finite value, namely

xγ=r1(1r)μγ,\langle x^{\gamma}\rangle_{\infty}=\frac{r}{1-(1-r)\mu^{\gamma}}, (7)

only if (1r)μγ<1(1-r)\mu^{\gamma}<1. The convergence is exponential in time, within a typical time scale |ln[(1r)μγ]|1|\ln[(1-r)\mu^{\gamma}]|^{-1}. On the contrary, if (1r)μγ1(1-r)\mu^{\gamma}\geq 1, xγt\langle x^{\gamma}\rangle_{t} diverges exponentially with time. This convergence or divergence of the moment xγt\langle x^{\gamma}\rangle_{t} depending on whether the order γ\gamma is respectively lower or larger than the critical value γc=|ln(1r)/lnμ|\gamma_{\rm c}=|\ln(1-r)/\ln\mu| is compatible with a probability distribution (for a continuous variable xx) decaying as the power law x1γcx^{-1-\gamma_{\rm c}}. In fact, starting from the stationary distribution of Eq. (5), extending the index m=lnx/lnμm=\ln x/\ln\mu to the real positive domain, and changing variables from mm to xx, we get the stationary (“fat-tailed”) power-law probability distribution

f(x)=rlnμx1|ln(1r)/lnμ|.f_{\infty}(x)=\frac{r}{\ln\mu}x^{-1-|\ln(1-r)/\ln\mu|}. (8)

II.1 Dissecting the process: properties of individual realizations

In certain applications, it is necessary to “dissect” each possible realization of the stochastic process, determining the probability of each different way in which the system can evolve. Specifically, for Eq. (1), we first ask what is the probability that, up to a given time τ\tau, the evolution consists of a succession of exactly kk bursts of multiplicative growth (k=1,,τ+1k=1,\dots,\tau+1, including one-step bursts between two contiguous reset events). Equivalently, we ask for the probability that exactly k1k-1 resets occur somewhere between times t=1t=1 and τ\tau, both inclusive. For k=1k=1, the probability that no reset has occurred is (1r)τ(1-r)^{\tau}. In turn, each one of the τ\tau options for one reset (k=2k=2) has probability (1r)τ1r(1-r)^{\tau-1}r. Generally, the probability that the evolution up to time τ\tau includes k1k-1 resets, i.e. kk bursts, is

ρτ(k)=(τk1)(1r)τk+1rk1,\rho_{\tau}^{(k)}=\left(\begin{array}[]{c}\tau\\ k-1\end{array}\right)(1-r)^{\tau-k+1}r^{k-1}, (9)

for 0kτ0\leq k\leq\tau.

Now, consider the set of realizations where, up to time τ\tau, the evolution consists of exactly kk bursts. How many of all these bursts have a duration of m+1m+1 steps (m=0,1,,τm=0,1,\dots,\tau), i.e. at how many times does xtx_{t} reach the value μm\mu^{m} just before the burst ends (either by a reset event or because time τ\tau has been reached)? This number can be calculated by considering the so-called compositions of τ\tau into kk parts,Heubach and Mansour (2009) and turns out to be

Kτ(k,m)={1for k=1 and m=τ,k(τm1k2)for 1<kτm+1 ,0otherwise.K_{\tau}^{(k,m)}=\left\{\begin{array}[]{ll}1&\mbox{for $k=1$ and $m=\tau$},\\ \\ k\left(\begin{array}[]{c}\tau-m-1\\ k-2\end{array}\right)&\mbox{for $1<k\leq\tau-m+1$ },\\ \\ 0&\mbox{otherwise}.\end{array}\right. (10)

Taking into account that, if xtx_{t} reaches the value μm\mu^{m} during a burst, all the lower values 1,μ,μ2,,μm11,\mu,\mu^{2},\dots,\mu^{m-1} have also been previously attained, the total number of times that xt=μmx_{t}=\mu^{m} has been attained at any moment tt up to time τ\tau, in all the realizations with exactly kk bursts, can be calculated from Kτ(k,m)K_{\tau}^{(k,m)}, yielding

Mτ(k,m)={k(τmk1)for 1kτm+1 ,0otherwise.M_{\tau}^{(k,m)}=\left\{\begin{array}[]{ll}k\left(\begin{array}[]{c}\tau-m\\ k-1\end{array}\right)&\mbox{for $1\leq k\leq\tau-m+1$ },\\ \\ 0&\mbox{otherwise}.\end{array}\right. (11)

Finally, if the system has evolved up to time τ\tau, the probability that the value of xtx_{t} is μm\mu^{m} at a randomly chosen time tτt\leq\tau reads

k=1τ+1ρτ(k)Mt(k,m)m=0τMt(k,m)=1+r(τm)1+τ(1r)m,\sum_{k=1}^{\tau+1}\rho_{\tau}^{(k)}\frac{M_{t}^{(k,m)}}{\sum_{m^{\prime}=0}^{\tau}M_{t}^{(k,m^{\prime})}}=\frac{1+r(\tau-m)}{1+\tau}(1-r)^{m}, (12)

which, up to the normalization factor (1+τ)1(1+\tau)^{-1}, corresponds to Eq. (39) in Ref. 39. Note carefully the difference between Eqs. (4) and (12). While the former gives the probability that a given value of xx is attained after tt evolution steps, the latter is the probability constructed by recording the frequency of all the values of xx along the whole evolution until time τ\tau. For t=τt=\tau, the two expressions coincide in the limit τ\tau\to\infty only; cf. Eq. (5).

II.2 Remarks on self-averaging and ergodicity

It is well known that systems involving stochastic variables with fat-tailed distributions exhibit peculiar statistical features, directly associated with the fact that some of the leading distribution moments —such as the mean value and/or the variance— are not finite.Montroll and Shlesinger (1982); Sornette (1998a) Although, for any finite time tt, the moments for the variable xtx_{t} in the stochastic process (1) are all finite, as given by Eq. (6), similar features manifest themselves as time elapses and a fat-tailed asymptotic distribution progressively builds up.

Lack of self-averaging —namely, the failure of a scaled sum of stochastic variables to converge to a well-defined value— is intuitive for distributions with divergent mean value. This phenomenon has been recognized as a typical feature of random multiplicative processes,Redner (1990) and has been widely discussed for a variety of systems of physical interest,Derrida (1997); Derrida and Jung-Muller (1999) including Markov processes with resets.Meylahn, Sabhapandit, and Touchette (2015) Self-averaging also fails when the mean value is finite but the variance diverges, a situation relevant to the theory of financial revenues and related problems in economics.Taleb (2007b, 2009) The fat-tailed asymptotic distribution of process (1) has finite mean value x\langle x\rangle and divergent variance σ2=x2x2\sigma^{2}=\langle x^{2}\rangle-\langle x\rangle^{2} for 1<γc=|ln(1r)/lnμ|<21<\gamma_{\rm c}=|\ln(1-r)/\ln\mu|<2. The upper panel of Fig. 3 shows xtx_{t}, up to t=104t=10^{4}, along a single realization of the process with μ=1.1\mu=1.1 and r=0.1r=0.1, for which γc1.1\gamma_{\rm c}\approx 1.1. The light-shaded curve is the prediction of Eq. (6) for the mean value (i.e., with γ=1\gamma=1), which approaches x=10\langle x\rangle_{\infty}=10 for long times. The large fluctuations of xtx_{t} around its expected average are apparent in the main plot and in the inset, which shows a detail for intermediate times.

Refer to caption
Figure 3: Upper panel: xtx_{t} as a function of time, along a single realization of the stochastic process (1) for μ=1.1\mu=1.1 and r=0.1r=0.1 (dark curve). The light-shaded curve stands for the analytical prediction for xt\langle x\rangle_{t}, Eq. (6). The inset shows a close-up for intermediate times. Middle panel: Cumulative average of xtx_{t} (dark curve; see main text) for the same realization as in the upper panel, and the corresponding analytical prediction, Eq. (13) (light-shaded curve). Lower panel: As in the middle panel, averaging over 10610^{6} realizations of the stochastic process. The inset shows the result for short times.

The middle panel of Fig. 3 shows the accumulated average of the stochastic variable, defined as x¯t=(t+1)1τ=0txτ\bar{x}_{t}=(t+1)^{-1}\sum_{\tau=0}^{t}x_{\tau}, for the same realization as in the upper panel. The light-shaded curve stands for the corresponding analytical value,

x¯t=\displaystyle{\langle\bar{x}\rangle}_{t}= r1(1r)μ\displaystyle\frac{r}{1-(1-r)\mu} (13)
+[1r1(1r)μ]1(1r)t+1μt+1(t+1)[1(1r)μ],\displaystyle+\left[1-\frac{r}{1-(1-r)\mu}\right]\frac{1-(1-r)^{t+1}\mu^{t+1}}{(t+1)[1-(1-r)\mu]},

derived from Eq. (6). Comparison of the two curves puts in evidence, in particular, the role of large values of xtx_{t} in building up the cumulative average. Their contribution, in fact, punctuates with abrupt upward jumps the otherwise decaying evolution of x¯t\bar{x}_{t}. It is clear, however, that —although the typical convergence time for the mean value, |ln[(1r)μ]|1100|\ln[(1-r)\mu]|^{-1}\approx 100, has long passed— x¯t\bar{x}_{t} is nowhere close to x\langle x\rangle_{\infty}. This non-self-averaging effect remains even if an additional average is performed over many realizations of the stochastic process —as implicit in Eqs. (6) and (13). In the lower panel of Fig. 3 we plot x¯t\bar{x}_{t} averaged over 10610^{6} realizations. While, as shown in the inset, the coincidence between numerical and analytical results is very good for small tt, a sustained sizable discrepancy —which decreases only very slowly as time elapses— persists for long times.

Somehow disappointingly, there is no formal theory to describe the behavior of the mean value of a stochastic variable —calculated as the average of successive random draws from a prescribed distribution— when its variance is infinite. Heuristic approaches based on the estimation of bootstrap distributions or the use of surrogate tail-trimmed variables, however, have been advanced in applications to finance.Brooks (2019) A different viewpoint is provided by the extreme-value theory, which focuses on the statistics of the rare events where the variable exhibits severe deviations from the mean value.Redner (1990); Novak (2011) In our case, they correspond to the high bursts in the upper panel of Fig. 3, which cause the sharp steps in the middle panel.

For the stochastic process (1) we can ask what is the mean waiting time wXw_{X} until xtx_{t} reaches, for the first time, a given (large) value X=μMX=\mu^{M}. This event happens at the MM-th step of a sufficiently long burst, assuming that all the preceding bursts have been shorter than MM steps. The corresponding waiting time is the total duration of all the preceding bursts plus MM additional steps. Its mean value turns out to be

wX=1r[1(1r)M1]1rXγc,w_{X}=\frac{1}{r}\left[\frac{1}{(1-r)^{M}}-1\right]\approx\frac{1}{r}X^{\gamma_{\rm c}}, (14)

where the approximation holds for large XX, i.e. for large MM; cf. Eq. (18) in Ref. 39.

On the other hand, the mean number dXd_{X} of random draws from the stationary distribution (5) until the value X=μMX=\mu^{M} is obtained, assuming that all the preceding draws have produced lower values, can be immediately found:

dX=r(1r)M=rXγc.d_{X}=\frac{r}{(1-r)^{M}}=rX^{\gamma_{\rm c}}. (15)

Note that the waiting time wXw_{X} is larger than dXd_{X} by a factor r2r^{-2} (=100=100 in the realizations of Fig. 3). This difference can be viewed as a form of “transient non-ergodicity” in our Markov chain. Along the stochastic process, in fact, observation of xt=Xx_{t}=X requires waiting the total duration of a succession of shorter bursts where xtx_{t} adopts one or more times each value lower than XX. Instead, when drawing the stochastic variable directly from the distribution, the occurrence of XX is independent of the preceding draws. Evaluation of the distribution from the time evolution of xtx_{t}, consequently, can be much slower than from random realizations of the stochastic variable.

III The non-uniform case

Although the stochastic process (1) already captures the key mechanisms that lead to the generation of a power-law distribution for the variable xtx_{t}, it is useful to introduce a couple of generalizations that relax some assumptions implicit in the formulation of the above multiplicative process but, at the same time, preserve most of its analytical tractability. In particular, we now admit that the multiplicative coefficient μ\mu and the value of xtx_{t} after each reset event vary randomly with time. Namely, we consider the Markov chainManrubia and Zanette (1999)

xt+1={μtxtwith probability 1r,stwith probability r,x_{t+1}=\left\{\begin{array}[]{ll}\mu_{t}x_{t}&\mbox{with probability $1-r$},\\ s_{t}&\mbox{with probability $r$},\end{array}\right. (16)

where, at each time step, μt>0\mu_{t}>0 and st>0s_{t}>0 are drawn from distributions P(μ)P(\mu) and F(s)F(s), respectively. In this variant, in contrast with Eq. (1), xtx_{t} generally adopts continuous positive values, and its probability is described by a distribution ft(x)f_{t}(x). The initial value x0x_{0} is chosen from a prescribed distribution f0(x)f_{0}(x). The previous case is reobtained taking delta-like profiles for P(μ)P(\mu), F(s)F(s), and f0(x)f_{0}(x).

In order to avoid hindering our presentation with the discussion of special pathological situations, we assume that the moments of the distributions P(μ)P(\mu) and F(s)F(s),

μγ=0μγP(μ)𝑑μ,sγ=0sγF(s)𝑑s,\langle\mu^{\gamma}\rangle=\int_{0}^{\infty}\mu^{\gamma}P(\mu)d\mu,\ \ \ \ \ \langle s^{\gamma}\rangle=\int_{0}^{\infty}s^{\gamma}F(s)ds, (17)

are finite for any order γ(,)\gamma\in(-\infty,\infty). This is the case if P(μ)P(\mu) and F(s)F(s) drop rapidly enough to zero for both μ,s0\mu,s\to 0 and μ,s\mu,s\to\infty. In typical applications, in particular, the reset values sts_{t} are restricted to some finite domain, so that the support of the distribution F(s)F(s) is a bounded interval, and the finiteness of sγ\langle s^{\gamma}\rangle is guaranteed for all γ\gamma. The results that we obtain below, however, do not always require that μγ\langle\mu^{\gamma}\rangle and sγ\langle s^{\gamma}\rangle are finite.

In order to obtain the solution of the stochastic process (16), it is convenient to consider the evolution of the logarithmic variable yt=lnxty_{t}=\ln x_{t}:

yt+1={yt+νywith probability 1r,utwith probability r,y_{t+1}=\left\{\begin{array}[]{ll}y_{t}+\nu_{y}&\mbox{with probability $1-r$},\\ u_{t}&\mbox{with probability $r$},\end{array}\right. (18)

where νt=lnμt\nu_{t}=\ln\mu_{t} and ut=lnstu_{t}=\ln s_{t}. The stochastic variables νt\nu_{t} and utu_{t} are respectively drawn from distributions Q(ν)Q(\nu) and G(u)G(u), with Q(ν)dν=P(μ)dμQ(\nu)d\nu=P(\mu)d\mu and G(u)du=F(s)dsG(u)du=F(s)ds. In other words, Q(ν)=P(expν)expνQ(\nu)=P(\exp\nu)\exp\nu and G(u)=F(expu)expuG(u)=F(\exp u)\exp u. The same transformation yields for the initial distribution g0(y)g_{0}(y) in terms of f0(x)f_{0}(x). The Markov chain for yty_{t}, Eq. (18), is an additive (generally, drift plus diffusion) stochastic process with resets.

The Chapman-Kolmogorov equation for the probability distribution of xtx_{t}, ft(x)f_{t}(x), is

ft+1(x)=(1r)0P(μ)μ1ft(μ1x)𝑑μ+rF(x).f_{t+1}(x)=(1-r)\int_{0}^{\infty}P(\mu)\mu^{-1}f_{t}\left(\mu^{-1}x\right)d\mu+rF(x). (19)

Correspondingly, the probability distribution for yty_{t}, gt(y)g_{t}(y), satisfies

gt+1(y)=(1r)Q(ν)gt(yν)𝑑ν+rG(y).g_{t+1}(y)=(1-r)\int_{-\infty}^{\infty}Q(\nu)g_{t}(y-\nu)d\nu+rG(y). (20)

In the Fourier representation, which we define as g^t(η)=gt(y)exp(2πiηy)𝑑y\hat{g}_{t}(\eta)=\int_{-\infty}^{\infty}g_{t}(y)\exp(-2\pi i\eta y)dy and analogous expressions for all the other functions of yy, the solution to Eq. (20) reads

g^t(η)\displaystyle\hat{g}_{t}(\eta) =rG^(η)1(1r)Q^(η)\displaystyle=\frac{r\hat{G}(\eta)}{1-(1-r)\hat{Q}(\eta)} (21)
+[g^0(η)rG^(η)1(1r)Q^(η)](1r)tQ^(η)t.\displaystyle+\left[\hat{g}_{0}(\eta)-\frac{r\hat{G}(\eta)}{1-(1-r)\hat{Q}(\eta)}\right](1-r)^{t}\hat{Q}(\eta)^{t}.

Since, up to a multiplicative constant in its variable, Q^(η)\hat{Q}(\eta) is the characteristic function of the variable ν\nu, it satisfies |Q^(η)|1|\hat{Q}(\eta)|\leq 1 for all η\eta. Therefore, g^t(η)\hat{g}_{t}(\eta) converges to the stationary solution

g^(η)=rG^(η)1(1r)Q^(η)\hat{g}_{\infty}(\eta)=\frac{r\hat{G}(\eta)}{1-(1-r)\hat{Q}(\eta)} (22)

for tt\to\infty.

Except for the term including the initial condition, the solution g^t(η)\hat{g}_{t}(\eta) in Eq. (21) is directly proportional to G^(η)\hat{G}(\eta), the Fourier transform of the distribution of the stochastic variable immediately after each reset event —see second line of Eq. (18). The same happens with the stationary distribution g^(η)\hat{g}_{\infty}(\eta) in Eq. (22). This proportionality reveals that gt(y)g_{t}(y) and, correspondingly, ft(x)f_{t}(x) are given —up to a term involving their initial values— by a linear superposition of contributions coming from each possible value of the variable after resets, which act as mutually independent “starts” for the ensuing multiplicative process. Upon Fourier antitransforming, in fact, both gt(y)g_{t}(y) and ft(x)f_{t}(x) would be respectively given by convolutions of G(u)G(u) and F(s)F(s) with distributions representing the contribution of each uu and ss. The effect of having admitted that after resets the variable can adopt different values is, therefore, rather straightforward.

By Fourier antitransforming Eq. (21), it is in principle possible to find the solution gt(y)g_{t}(y) to equation (20). In turn, using the identity gt(y)dy=ft(x)dxg_{t}(y)dy=f_{t}(x)dx, we would obtain the solution to equation (19), ft(x)=x1gt(lnx)f_{t}(x)=x^{-1}g_{t}(\ln x). For generic forms of Q(ν)Q(\nu) and G(u)G(u), however, this calculation can seldom be explicitly performed. On the other hand, it is straightforward to exactly find the moments of the distribution ft(x)f_{t}(x),

xγt=0xγft(x)𝑑x,\langle x^{\gamma}\rangle_{t}=\int_{0}^{\infty}x^{\gamma}f_{t}(x)dx, (23)

for any order γ\gamma, by noting that xγt=g^t(iγ/2π)\langle x^{\gamma}\rangle_{t}=\hat{g}_{t}(i\gamma/2\pi). Evaluating Eq. (21) at η=iγ/2π\eta=i\gamma/2\pi, in fact, we get

xγt\displaystyle\langle x^{\gamma}\rangle_{t} =rsγ1(1r)μγ\displaystyle=\frac{r\langle s^{\gamma}\rangle}{1-(1-r)\langle\mu^{\gamma}\rangle} (24)
+[xγ0rsγ1(1r)μγ](1r)tμγt,\displaystyle+\left[\langle x^{\gamma}\rangle_{0}-\frac{r\langle s^{\gamma}\rangle}{1-(1-r)\langle\mu^{\gamma}\rangle}\right](1-r)^{t}\langle\mu^{\gamma}\rangle^{t},

which generalizes Eq. (6). Note that, except for the term involving the initial condition, xγt\langle x^{\gamma}\rangle_{t} is proportional to the corresponding moment of the distribution of values after resets, F(s)F(s).

Much as for the stochastic process (1), while the distribution for xtx_{t} always attains a well-defined asymptotic form, given by Eq. (22) in the Fourier representation for the logarithmic variable yty_{t}, the moment xγt\langle x^{\gamma}\rangle_{t} converges to a finite value,

xγ=rsγ1(1r)μγ,\langle x^{\gamma}\rangle_{\infty}=\frac{r\langle s^{\gamma}\rangle}{1-(1-r)\langle\mu^{\gamma}\rangle}, (25)

only if (1r)μγ<1(1-r)\langle\mu^{\gamma}\rangle<1. It is interesting to note that —while, in the uniform case considered in Section II, convergence or divergence of xγt\langle x^{\gamma}\rangle_{t} depends on whether γ\gamma is respectively lower or larger than a critical value γc\gamma_{\rm c}— in the present case it can happen that the moment converges for γ\gamma inside a finite interval (γ,γ+\gamma_{-},\gamma_{+}), with γ<0<γ+\gamma_{-}<0<\gamma_{+}, and diverges elsewhere. This requires, in particular, that the distribution P(μ)P(\mu) allows for values of μ\mu at both sides of μ=1\mu=1. Take, for instance, the two-delta distribution P(μ)=aδ(μμ0)+(1a)δ(μμ01)P(\mu)=a\delta(\mu-\mu_{0})+(1-a)\delta(\mu-\mu_{0}^{-1}) with 0<a<10<a<1 and μ01\mu_{0}\neq 1. The interval of convergence for the order γ\gamma is limited by

γ±=1lnμ0ln1±14a(1a)(1r)22a(1r),\gamma_{\pm}=\frac{1}{\ln\mu_{0}}\ln\frac{1\pm\sqrt{1-4a(1-a)(1-r)^{2}}}{2a(1-r)}, (26)

where, without generality loss, we have assumed μ0>1\mu_{0}>1. Figure 4 shows the interval ends γ±\gamma_{\pm} as functions of aa for three values of rr.

Refer to caption
Figure 4: The ends γ±\gamma_{\pm} of the interval of convergence of the moment xγt\langle x^{\gamma}\rangle_{t} for the multiplicative process with resets (16), with μ\mu drawn from the distribution P(μ)=aδ(μμ0)+(1a)δ(μμ01)P(\mu)=a\delta(\mu-\mu_{0})+(1-a)\delta(\mu-\mu_{0}^{-1}) and μ0>1\mu_{0}>1. The values of γ±\gamma_{\pm} are scaled by lnμ0\ln\mu_{0}, and plotted as functions of aa for three values of the reset probability rr.

The long-time convergence or divergence of xγt\langle x^{\gamma}\rangle_{t} for γ\gamma respectively inside and outside the interval(γ,γ+)(\gamma_{-},\gamma_{+}) is compatible with a distribution f(x)f_{\infty}(x) which behaves as x1γ+x^{-1-\gamma_{+}} for xx\to\infty (in agreement with the results of Section II), and as x1γx^{-1-\gamma_{-}} for x0x\to 0. Finding its complete form, however, would require to antitransform Eq. (22).

IV State-dependent multiplication and reset frequency with continuous time

Coming back to the uniform case considered in Section II, a continuous-time description can be straightforwardly introduced by first assigning a duration Δt\Delta t to each evolution step. Writing the multiplicative coefficient in the first line of Eq. (1) as μ=1+λΔt\mu=1+\lambda\Delta t and taking the limit Δt0\Delta t\to 0, the purely multiplicative part of the stochastic process for x(t)x(t) becomes a linear equation, x˙=λx\dot{x}=\lambda x, whose solution grows or decays exponentially, depending on the sign of λ\lambda. This exponential evolution is punctuated by the reset events, which now occur with frequency (probability per unit time) qq, related to the reset probability rr of Eq. (1) through r=qΔtr=q\Delta t. As in Section II, we assume here that the stochastic variable is reset to x=1x=1 after each event.

The Chapman-Kolmogorov equation for the probability distribution f(x,t)f(x,t) reads now

tf+λx(xf)=qf(x,t)+qδ(x1)0f(x,t)𝑑x.\partial_{t}f+\lambda\partial_{x}(xf)=-qf(x,t)+q\delta(x-1)\int_{0}^{\infty}f(x^{\prime},t)dx^{\prime}. (27)

Its interpretation as a continuity equation is transparent. The left-hand side describes probability drift, driven by multiplication, towards larger or smaller values of xx, depending on λ\lambda being positive or negative, respectively. The loss and gain terms in the right-hand side, meanwhile, stand for probability sinks and sources associated with reset events. It can be readily seen that Eq. (27) preserves the distribution norm 0f(x,t)𝑑x=1\int_{0}^{\infty}f(x,t)dx=1 at all times, so that the integral in the last term is, in reality, a constant (see, however, the generalization in Eq. (30) below). Moreover, if λ>0\lambda>0 and the support of the initial condition f0(x)f_{0}(x) is included in the interval [1,)[1,\infty) or, conversely, if λ<0\lambda<0 and the support of f0(x)f_{0}(x) is included in (0,1](0,1], the distribution f(x,t)f(x,t) will only adopt non-zero values inside either interval at all times. To simplify the presentation, and in agreement with our discussion in Section II, we assume that λ>0\lambda>0 and restrict the analysis to the interval [1,)[1,\infty).

Under these conditions, Eq. (27) can be fully solved by treating the delta-like gain term in the right-hand side as a boundary condition at x=1x=1. In fact, assuming that the solution jumps from f(x,t)=0f(x,t)=0 for x<1x<1 to a finite value f(1+,t)f(1^{+},t) just above the boundary, the delta-like term must be balanced by the xx-derivative in the left-hand side, so that f(1+,t)=q/λf(1^{+},t)=q/\lambda. Taking this condition into account, the solution is

f(x,t)=\displaystyle f(x,t)= exp[(λ+q)t]f0[xexp(λt)]Θ[xexp(λt)1]\displaystyle\exp[-(\lambda+q)t]f_{0}[x\exp(-\lambda t)]\Theta[x\exp(-\lambda t)-1] (28)
+qλx1q/λΘ[1xexp(λt)],\displaystyle+\frac{q}{\lambda}x^{-1-q/\lambda}\Theta[1-x\exp(-\lambda t)],

where Θ(x)\Theta(x) is Heaviside’s step function. The distribution f(x,t)f(x,t) is neatly divided into two contributions, corresponding to the two terms in the right-hand side of Eq. (28). For x>exp(λt)x>\exp(\lambda t), in the first term, we have the contribution of the initial condition, which shifts towards increasingly larger values of xx and, at the same time, is exponentially damped and stretched. For x<exp(λt)x<\exp(\lambda t), in the second term, the asymptotic power-law distribution is established. For tt\to\infty, this second contribution spans the whole domain of the variable xx, yielding

f(x)=qλx1q/λ,f_{\infty}(x)=\frac{q}{\lambda}x^{-1-q/\lambda}, (29)

to be compared with Eq. (8). As an illustration, Fig. 5 shows, in log-log scale, four snapshots of f(x,t)f(x,t) as a function of xx, evolving from an initial condition f0(x)=exp(1x)f_{0}(x)=\exp(1-x), with x[1,)x\in[1,\infty) and λ=q=1\lambda=q=1. The exponential cutoff originating in f0(x)f_{0}(x) shifts to the right as time elapses, while the asymptotic distribution f(x)=x2f_{\infty}(x)=x^{-2} builds up from the left.

Refer to caption
Figure 5: The probability distribution f(x,t)f(x,t), Eq. (28), as a function of xx for t=0t=0, 22, 55 and \infty, starting from an initial condition f0(x)=exp(1x)f_{0}(x)=\exp(1-x), with λ=q=1\lambda=q=1. Dashed vertical lines stand at the boundary x=exp(λt)x=\exp(\lambda t), which separates the contributions of the initial condition and the asymptotic long-time distribution, f(x)=x2f_{\infty}(x)=x^{-2}.

Equation (27) can be immediately generalized to the case where both the coefficient λ\lambda and the reset frequency qq depend on the current value of the stochastic variable xx. This extension reads

tf+x[λ(x)f]=q(x)f(x,t)+δ(x1)Φ(t),\partial_{t}f+\partial_{x}[\lambda(x)f]=-q(x)f(x,t)+\delta(x-1)\Phi(t), (30)

with

Φ(t)=0q(x)f(x,t)𝑑x.\Phi(t)=\int_{0}^{\infty}q(x^{\prime})f(x^{\prime},t)dx^{\prime}. (31)

Note that the factor xx in the xx-derivative of Eq. (27) has been absorbed by λ(x)\lambda(x). As in the homogeneous case above, we focus on the problem restricted to the interval x[1,)x\in[1,\infty), with λ(x)>0\lambda(x)>0 for all xx. Treating the last term in the right-hand side of Eq. (30) as a boundary condition at x=1x=1, we find

f(1+,t)=Φ(t)λ(1).f(1^{+},t)=\frac{\Phi(t)}{\lambda(1)}. (32)

In practice, this condition requires to resort to a self-consistent calculation, where f(x,t)f(x,t) is first solved for arbitrary Φ(t)\Phi(t), in such a way that Eq. (32) is satisfied, and then Φ(t)\Phi(t) is found from Eq. (31). Whether this calculation can be explicitly performed depends on the functional form of λ(x)\lambda(x) and q(x)q(x).

On the other hand, assuming that f(x,t)f(x,t) tends to a well-defined limit for long times, the asymptotic distribution can be readily written as

f(x)=Φλ(x)exp[1xq(x)λ(x)𝑑x],f_{\infty}(x)=\frac{\Phi_{\infty}}{\lambda(x)}\exp\left[-\int_{1}^{x}\frac{q(x^{\prime})}{\lambda(x^{\prime})}dx^{\prime}\right], (33)

with Φ\Phi_{\infty} the asymptotic value of Φ(t)\Phi(t) for tt\to\infty. Equation (33) makes it clear that Φ\Phi_{\infty} is essentially fixed by the normalization of f(x)f_{\infty}(x). In fact, it can be easily verified that Eqs. (31) to (33) are mutually consistent if the distribution is normalized to unity. Such normalization, however, requires that the integral in the exponential of Eq. (33) diverges as xx\to\infty, namely,

1q(x)λ(x)𝑑x=.\int_{1}^{\infty}\frac{q(x^{\prime})}{\lambda(x^{\prime})}dx^{\prime}=\infty. (34)

This condition is verified when the reset frequency increases and/or the multiplicative coefficient decreases sufficiently fast as xx grows. It expresses the fact that there is no probability “leaking” towards large values of xx due to excessively weakened resets and/or strengthened multiplication.

From Eq. (33), moreover, it is apparent that admitting state-dependent multiplication and reset frequency can significantly widen the class of stationary distributions covered by the model, well beyond power-law decaying functions. As an illustration, consider the case of constant reset frequency qq, with a multiplicative coefficient with algebraic dependence on xx, namely, λ(x)=λ0x1α\lambda(x)=\lambda_{0}x^{1-\alpha}. Condition (34) is fulfilled if α0\alpha\geq 0. For α>0\alpha>0, the resulting distribution is

f(x)=qλ0x1αexp[q(xα1)λ0α],f_{\infty}(x)=\frac{q}{\lambda_{0}x^{1-\alpha}}\exp\left[-\frac{q(x^{\alpha}-1)}{\lambda_{0}\alpha}\right], (35)

which reduces to Eq. (29) for α0\alpha\to 0. This form of f(x)f_{\infty}(x), shown in Fig. 6 for λ0=q=1\lambda_{0}=q=1 and some values of the exponent α\alpha, is closely related to the Weibull and the stretched exponential distribution, which play a key role in the quantitative description of several socioeconomic systems. The present model, thus, provides a unified mechanism for the occurrence of a variety of distributions relevant to this kind of problems.

Refer to caption
Figure 6: The probability distribution of Eq. (35) for λ0=q=1\lambda_{0}=q=1 and four values of the exponent α\alpha. For α=0\alpha=0, it reduces to the power-law distribution f(x)=x2f_{\infty}(x)=x^{-2}.

V Discussion and conclusion

Stochastic multiplicative processes (SMPs) with reset events, in their several variants, are characterized by fat-tailed, often power-law, distributions of the relevant variable. These processes show punctuated behaviour, where periods of growth are terminated by sudden shifts to a previous state. High-order moments of the probability distribution diverge in most variants of the general model. These features are characteristic of, among others, financial processes, a context where higher-order moments are relevant. Kurtosis, for instance, is typically used as a measure of financial risk: divergence of the fourth moment (even with finite variance and skewness) yields unreliable predictions if evaluated using finite data sets.Taleb (2007b, 2009) An additional difficulty in the numerical estimation of moments —and, therefore, of risk— is due to the very slow convergence of estimated values to the exact values of the process, even if the associated moments are finite. This “slow law of large numbers” is caused by the large weight of rare events (black swans), which take a lot of data to show up, and prevent a proper estimation of the moments of such processes through the moments of a sample. Taleb (2009)

The interpretation of SMPs with resets in the context of finance is quite straightforward. Regardless of whether the model is implemented in discrete or continuous time, the relevant variable, either xtx_{t} or x(t)x(t), can be understood as the gain accumulated at a given time tt, while its dynamics are controlled by two parameters, one characterizing the momentary gain (μ\mu or λ\lambda), and another one quantifying the risk of the investment (rr or qq). The uniform model analyzed in Section II would therefore describe an ensemble of gamblers devoid of strategy and democratically dependent on luck. Still, they experience dissimilar fortunes and could be, a posteriori, evaluated as poor, mediocre, or excellent investors. It has been suggested that the search for causality in lucky realizations of random processes, together with the elimination of unlucky series from samples, might lead to highly misleading interpretations of the actual causal mechanisms of data, a phenomenon of particular relevance in finance known as “survivorship bias.” Taleb (2005)

The simple model with constant risk and momentary gain clarifies other aspects of the process as well. The exponent of the power law depends on both parameters: the higher the frequency and magnitude of resets, the smaller the exponent (in absolute value). That is, black swans will have larger impact if the momentary gain increases, and also if the risk is high, as it could have been guessed. Furthermore, the estimation of the waiting time to observe an event of size XX shows that the typical, additional waiting time until an event of, say, twice that size occurs is 2γc2^{\gamma_{c}} time steps longer. If γc\gamma_{c} is unknown, or if its estimation is affected by large errors, predictions of the expected time until the next black swan and of its magnitude can be highly inaccurate, at best.

The non-uniform model in Section III extends those results by considering a broad class of probability distributions for momentary gains and reset values, while the risk is kept fixed. Momentary gain is now time-dependent and can be larger or smaller than one (in the latter case, it becomes a momentary loss), this generalization leading to two-sided power laws and to moments converging within a finite interval of γ\gamma values. The qualitative properties of this model are fully comparable to those of the uniform case, though additional situations can be now embraced. The accumulated gain can increase or decrease in random amounts at each time step, and the distribution of reset values F(s)F(s) could take into account agents with different strategies regarding the fraction of their gains put at stake. Prudent investors might save part of their previous gains to avoid too severe drops. The details of this strategy directly affect the moments of the accumulated gain distribution, though the power-law function persists.

Finally, in Section IV, we have studied a variant where the momentary gain and risk can depend on the accumulated gain. In this case, the model has been formulated in continuous time for the sake of analytical tractability, but can be interpreted along the same lines as discrete-time models. This last case incorporates the possibility that agents, perhaps content with the gains accumulated so far, develop cautious strategies by investing in products with lower momentary gain but which are less risk-prone. Interestingly, this non-greedy strategy can transform the power-law distribution into different fat-tailed distributions, where the effects of black swans might be strongly suppressed.

A relevant situation that we have not explicitly explored in this work is the plausible relationship between risk and gain. In practice, it comes to reason that financial gain cannot be maximized while simultaneously minimizing risk. For example, if the momentary gain in the uniform model becomes a linear function of risk, μ=1+κr\mu=1+\kappa r (κ>0\kappa>0), then γc1/κ\gamma_{c}\approx 1/\kappa for small risk. Under the constraint of mutual dependency, strategies decrease the number of their degrees of freedom, and similar restrictions would hold if the relationship between gain and risk affects their statistical distribution (note that, implicitly, Eq. (34) establishes a form of weak constraint linking the two parameters). Still, the implementation of realistic constraints in scenarios where, furthermore, those distributions depend on the accumulated gain appears as an interesting avenue to explore. In a different context, it has been shown that a reset probability that depends on time can improve the efficiency of search processes. Kuśmierz and Toyoizumi (2019) Another extension with potential applicability is the consideration of trends in, e.g., the minimum reset value. A variety of situations might be subject to such trends, as Fig. 1 illustrates for food price indices. The trend might be positive (inflation), negative (deflation), or be itself subject to large variations, as it happens with hyperinflation followed by currency devaluation.

Finance offer multiple situations that can potentially be modelled through SMPs with resets and, occasionally, additional mechanisms tailored to specific scenarios. Possible applications are however broader and extend beyond the many cases already described in this work. Beare and Toda (2019) In population dynamics, also dominated by multiplicative (demographic) growth processes, the fast, local extinction of a population is often followed by “reinjection” in the form of a small number of migrating individuals. This situation could describe parasitic infection bursts in metapopulations Buendía, Muñoz, and Manrubia (2018) and explain the persistence of populations that would otherwise become extinct. Adler (1993) Resets could be also rephrased as any process that finishes the multiplicative growth, since the properties described do not depend on whether the reset is repeatedly applied to realizations that have a continuity in time or to many different realizations that are independently “born,” and then terminated at the time of resetting. An example can be found in models for cascade fracture with stopping events. Yamamoto and Yamazaki (2012) Applications of SMPs to such areas as material sciences and population dynamics would require their generalization to spatially extended systems, considering coupled stochastic processes occurring at neighbor sites.

Given the ubiquity of multiplicative processes possibly amended by a variety of mechanisms —resets, among many others— the profusion of power-law (or fat-tailed) distributed quantities found in natural and social sciences no longer comes as a surprising fact. Such distributions entail non-trivial dynamical properties, as those characterized in this work, that severely limit our ability to predict future outcomes of the process. Despite the advances of over a century of research on this topic, further applications and deeper analytical approaches are yet to come to improve our understanding of the mechanisms generating fat-tailed distributions and our control of black-swan-like events.

Acknowledgements

S. M. is supported by grant FIS2017-89773-P (MINECO/FEDER, E. U.).

References

  • Auerbach (1913) F. Auerbach, “Das Gesetz der Bevölkerungskonzentration,” in Petermanns Geographische Mitteilungen, Vol. 59, edited by P. Langhans (Justus Pertes, Gotha, 1913) pp. 74–76, available online at https://archive.org/details/Auerbach1913. Translation of first paragraph by S.M.
  • Estoup (1916) J. B. Estoup, Gammes sténographiques (Institut Sténographique de France, Paris, 1916).
  • Yule (1924) G. U. Yule, “A mathematical theory of evolution, based on the conclusions of Dr. J. C. Willis,” Proc. R. Soc. London B 213, 21–87 (1924), https://doi.org/10.1098/rstb.1925.0002.
  • Zipf (1949) G. K. Zipf, Human Behavior and the Principle of Least Effort (Addison-Wesley, Cambridge, Massachusetts, 1949).
  • Simon (1955) H. A. Simon, “On a class of skew distribution functions,” Biometrika 42, 425–440 (1955), https://doi.org/10.1093/biomet/42.3-4.425.
  • Taleb (2007a) N. N. Taleb, The Black Swan: The Impact of the Highly Improbable (Random House, New York, 2007).
  • Manrubia and Zanette (1999) S. C. Manrubia and D. H. Zanette, “Stochastic multiplicative processes with reset events,” Phys. Rev. E 59, 4945–4948 (1999), https://doi.org/10.1103/PhysRevE.59.4945.
  • Evans and Majumdar (2011) M. R. Evans and S. N. Majumdar, “Diffusion with stochastic resetting,” Phys. Rev. Lett. 106, 160601 (2011), https://doi.org/10.1103/PhysRevLett.106.160601.
  • Montero and Villarroel (2013) M. Montero and J. Villarroel, “Monotonic continuous-time random walks with drift and stochastic reset events,” Phys. Rev. E 87, 012116 (2013), https://doi.org/10.1103/PhysRevE.87.012116.
  • Pareto (1896) V. Pareto, Cours d’économie politique (Rouge, Lausanne, 1896).
  • Newman (2004) M. E. J. Newman, “Power laws, Pareto distributions and Zipf’s law,” Contemp. Phys. 46, 323–351 (2004), https://doi.org/10.1080/00107510500052444.
  • Mitzenmacher (2004) M. Mitzenmacher, “A brief history of generative models for power law and lognormal distributions,” Internet Math. 1, 226–251 (2004), https://doi.org/10.1080/15427951.2004.10129088.
  • Zanette and Manrubia (2007) D. H. Zanette and S. C. Manrubia, “Multiplicative processes in social systems,” in Complex Population Dynamics. Nonlinear Modeling in Ecology, Epidemiology and Genetics, Lecture Notes in Complex Systems, Vol. 7, edited by B. Blasius, J. Kurths,  and L. Stone (World Scientific, Singapore, 2007) pp. 129–158, https://doi.org/10.1142/6600.
  • Montroll and Shlesinger (1982) E. W. Montroll and M. F. Shlesinger, “On 1/f noise and other distributions with long tails,” Proc. Natl. Acad. Sci. USA 79, 3380–3383 (1982), https://doi.org/10.1073/pnas.79.10.3380.
  • Zanette (2008) D. H. Zanette, “Multiplicative processes and city sizes,” in The Dynamics of Complex Urban Systems. An Interdisciplinary Approach, edited by S. Albeverio, D. Andrey, P. Giordano,  and A. vancheri (Physica-Verlag, Heidelberg, 2008) pp. 457–472, https://doi.org/10.1007/978-3-7908-1937-3.
  • Fu et al. (2005) D. Fu, F. Pammolli, S. V. Buldyrev, M. Riccaboni, K. Matia, K. Yamasaki,  and H. E. Stanley, “The growth of business firms: Theoretical framework and empirical evidence,” Proc. Natl. Acad. Sci. USA 102, 18801–18806 (2005), https://doi.org/10.1073/pnas.0509543102.
  • Zanette and Manrubia (2001) D. H. Zanette and S. C. Manrubia, “Vertical transmission of culture and the distribution of family names,” Physica A 295, 1–8 (2001), https://doi.org/10.1016/S0378-4371(01)00046-2.
  • Manrubia and Zanette (2002) S. C. Manrubia and D. H. Zanette, “At the boundary between biological and cultural evolution: The origin of surname distributions,” Journal of Theoretical Biology 216, 461–477 (2002), https://doi.org/10.1006/jtbi.2002.3002.
  • Zanette and Montemurro (2005) D. H. Zanette and M. Montemurro, “Dynamics of text generation with realistic Zipf’s distribution,” J. Quant. Ling. 12, 29–40 (2005), https://doi.org/10.1080/09296170500055293.
  • Zanette (2006) D. H. Zanette, “Zipf’s law and the creation of musical context,” Musicae Scientiae 10, 3–18 (2006), https://doi.org/10.1177/102986490601000101.
  • Takayasu, Sato, and Takayasu (1997) H. Takayasu, A.-H. Sato,  and M. Takayasu, “Stable infinite variance fluctuations in randomly amplified Langevin systems,” Phys. Rev. Lett. 79, 966–969 (1997), https://doi.org/10.1103/PhysRevLett.79.966.
  • Sornette (1998a) D. Sornette, “Multiplicative processes and power laws,” Phys. Rev. E 57, 4811–4813 (1998a), https://doi.org/10.1103/PhysRevE.57.4811.
  • Sornette (1998b) D. Sornette, “Linear stochastic dynamics with nonlinear fractal properties,” Physica A 250, 295–314 (1998b), https://doi.org/10.1016/S0378-4371(97)00543-8.
  • Levy and Solomon (1996) M. Levy and S. Solomon, “Power laws are logarithmic Boltzmann laws,” Int. J. Mod. Phys. C 7, 595–601 (1996), https://doi.org/10.1142/S0129183196000491.
  • Zel’dovich et al. (1987) Y. B. Zel’dovich, S. A. Molchanov, A. A. Ruzmaikin,  and D. D. Sokolov, “Intermittency in random media,” Sov. Phys. Usp. 30, 353–369 (1987), https://doi.org/10.1070/PU1987v030n05ABEH002867.
  • Zanette and Manrubia (1997) D. H. Zanette and S. C. Manrubia, “Role of intermittency in urban development: A model of large-scale city formation,” Phys. Rev. Lett. 79, 523–526 (1997), https://doi.org/10.1103/PhysRevLett.79.523.
  • Nirei and Souma (2004) M. Nirei and W. Souma, “Income distribution and stochastic multiplicative process with reset event,” in The Complex Dynamics of Economic Interaction, edited by M. Gallegati, A. P. Kirman,  and M. Marsili (Springer, Berlin, 2004) pp. 161–168, https://doi.org/10.1007/978-3-642-17045-4.
  • Montero, Masó-Puigdellosas, and Villarroel (2017) M. Montero, A. Masó-Puigdellosas,  and J. Villarroel, “Continuous-time random walks with reset events,” Eur. Phys. J B 90, 176 (2017), https://doi.org/10.1140/epjb/e2017-80348-4.
  • Evans, Majumdar, and Schehr (2019) M. R. Evans, S. N. Majumdar,  and G. Schehr, “Stochastic resetting and applications,”  (2019), available online at https://arxiv.org/abs/1910.07993.
  • Mézard, Parisi, and Virasoro (1987) M. Mézard, G. Parisi,  and M. Virasoro, Spin Glass Theory and Beyond. An Introduction to the Replica Method and Its Applications (World Scientific, Singapore, 1987) https://doi.org/10.1142/0271.
  • Derrida (1997) B. Derrida, “From random walks to spin glasses,” Physica D 107, 186–198 (1997), https://doi.org/10.1016/S0167-2789(97)00086-9.
  • Derrida and Jung-Muller (1999) B. Derrida and B. Jung-Muller, “The genealogical tree of a chromosome,” J. Stat. Phys. 94, 277–298 (1999), https://doi.org/10.1023/A:1004579800589.
  • Meylahn, Sabhapandit, and Touchette (2015) J. M. Meylahn, S. Sabhapandit,  and H. Touchette, “Large deviations for Markov processes with resetting,” Phys. Rev. E 92, 062148 (2015), https://doi.org/10.1103/PhysRevE.92.062148.
  • Gabaix (2009) X. Gabaix, “Power laws in economics and finance,” Ann. Rev. Econ. 1, 255–293 (2009), https://doi.org/10.3386/w14299.
  • Gibrat (1931) R. Gibrat, Les inégalités économiques (Recueil Sirey, Paris, 1931).
  • Lagi et al. (2011) M. Lagi, Y. Bar-Yam, K. Z. Bertrand,  and Y. Bar-Yam, “The food crises: A quantitative model of food prices including speculators and ethanol conversion,” SSRN , 1932247 (2011), https://doi.org/10.2139/ssrn.1932247.
  • Caprio and Klingebiel (1999) G. Caprio and D. Klingebiel, Bank Insolvencies: Cross-Country Experience (The World Bank, Washington, DC, 1999) https://doi.org/10.1596/1813-9450-1620.
  • Taleb (2007b) N. N. Taleb, “Black swans and the domains of statistics,” Am. Stat. 61, 198–200 (2007b), https://doi.org/10.1198/000313007X219996.
  • Montero and Villarroel (2016) M. Montero and J. Villarroel, “Directed random walk with random restarts: The Sisyphus random walk,” Phys. Rev. E 94, 032132 (2016), https://doi.org/10.1103/PhysRevE.94.032132.
  • Heubach and Mansour (2009) S. Heubach and T. Mansour, Combinatorics of Compositions and Words. Discrete Mathematics and its Applications (CRC Press, Boca Raton, 2009).
  • Redner (1990) S. Redner, “Random multiplicative processes: An elementary tutorial,” Am. J. Phys. 58, 267–273 (1990), https://doi.org/10.1119/1.16497.
  • Taleb (2009) N. N. Taleb, “Finiteness of variance is irrelevant in the practice of quantitative finance,” Complexity 14, 66–76 (2009), https://doi.org/10.1002/cplx.20263.
  • Brooks (2019) C. Brooks, Introductory Econometrics for Finance (Cambridge University Press, Oxford, 2019) https://doi.org/10.1017/9781108524872.
  • Novak (2011) S. Y. Novak, Extreme Value Methods with Applications to Finance (Chapman & Hall/CRC Press, London, 2011).
  • Taleb (2005) N. N. Taleb, Fooled by Randomness. The Hidden Role of Chance in Life and in the Markets (Random House, New York, 2005).
  • Kuśmierz and Toyoizumi (2019) Ł. Kuśmierz and T. Toyoizumi, “Robust random search with scale-free stochastic resetting,” Phys. Rev. E 100, 032110 (2019), https://doi.org/10.1103/PhysRevE.100.032110.
  • Beare and Toda (2019) B. K. Beare and A. A. Toda, “Geometrically stopped Markovian random growth processes and Pareto tails,”  (2019), available online at https://arxiv.org/abs/1712.01431v2.
  • Buendía, Muñoz, and Manrubia (2018) V. Buendía, M. A. Muñoz,  and S. Manrubia, “Limited role of spatial self-structuring in emergent trade-offs during pathogen evolution,” Sci. Rep. 8, 12476 (2018), https://doi.org/10.1038/s41598-018-30945-1.
  • Adler (1993) F. R. Adler, “Migration alone can produce persistence of host-parasitoid models,” Am. Nat. 141, 642–650 (1993), https://doi.org/10.1086/285496.
  • Yamamoto and Yamazaki (2012) K. Yamamoto and Y. Yamazaki, “Power-law behavior in a cascade process with stopping events: A solvable model,” Phys. Rev. E 85, 011145 (2012), https://doi.org/10.1103/PhysRevE.85.011145.