This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Long versus Short Time Scales:
the Rough Dilemma and Beyond

Matthieu Garcin Léonard de Vinci Pôle Universitaire, Research Center, 92916 Paris la Défense. Email: matthieu.garcin@devinci.fr.    Martino Grasselli Department of Mathematics “Tullio Levi Civita”, University of Padova, via Trieste 63, 35121 Padova, Italy, and Léonard de Vinci Pôle Universitaire, Research Center, 92916 Paris la Défense. Email: grassell@math.unipd.it.
Acknowledgments: We thank Elisa Alòs, Fabienne Comte, Christa Cuchiero, Eva Flonner, Gilles Pagès, Andrea Pallavicini, and Mathieu Rosenbaum for useful comments on a preliminary version. We also thank the participants of the 2019 Quantitative Methods in Finance conference, Sydney for useful comments.
(2021)
Abstract

Using a large dataset on major FX rates, we test the robustness of the rough fractional volatility model over different time scales, by including smoothing and measurement errors into the analysis. Our findings lead to new stylized facts in the log-log plots of the second moments of realized variance increments against lag which exhibit some convexity in addition to the roughness and stationarity of the volatility. The very low perceived Hurst exponents at small scales is consistent with the rough framework, while the higher perceived Hurst exponents for larger scales leads to a nonlinear behavior of the log-log plot that has not been described by models introduced so far.

2010 Mathematics Subject Classification. 60F10, 91G99, 91B25.

Keywords: Fractional Brownian motion, rough volatility, realized variance, time series, intra-day data.

1 Introduction

It is well known that a constant volatility is not consistent with time series data nor implied volatility surfaces. Several stochastic volatility models have been introduced in the last decades in order to reproduce the stylized facts of time series observed for both historical and implied volatility, see e.g. Stein and Stein (1994), Heston (1993), Barndorff-Nielsen and Shephard (2001), Hagan et al. (2002), affine models like in Duffie et al. (2003), ARCH, GARCH, their non-parametric extensions Garcin and Goulet (2019), and many others. In a different (implied volatility) perspective, Dupire (1994) assumed that the volatility becomes a deterministic function of time and of the current state of the asset, thus leading to the local volatility approach, which provides a theoretically perfect reproduction of the implied volatility surface.

All these approaches adopt the classic Brownian framework for the noise of both the underlying and its volatility. Motivated by an apparent presence of long memory in the volatility process, see e.g. Lo (1991), some researchers (see e.g. Ding et al. (1993), Baillie (1996), Bollerslev and Mikkelsen (1996), Andersen and Bollerslev (1997), Breidt et al. (1998)…) modelled the log-volatility noise with a fractional Brownian motion, leading to the so called fractional stochastic volatility (FSV) model111A fractional Brownian motion (fBm) BHB_{H} of Hurst exponent H(0,1)H\in(0,1) is a Gaussian process with a non trivial covariance function, namely a non Markovian process that allows for long or short memory, according to resp. H>0.5H>0.5 or H<0.5H<0.5. The case H=0.5H=0.5 corresponds to the classic (Markovian) Brownian motion. .

Comte and Renault (1998) suggested a model where the driving fractional Brownian motion has Hurst parameter H>0.5H>0.5, in order to take into account for the stylized fact suggesting that the volatility is a long memory process. Their estimation procedure for the Hurst parameter was based on the method of Geweke and Porter-Hudak (1983), involving the slow decay of the autocorrelation function (which is supposed to be of power law with exponent less than one) and it reveals to be problematic, as the asymptotic behaviour of the covariance function cannot be directly computed without assuming a specific functional form.

Recently, a new paradigm for the volatility process has been introduced by Gatheral et al. (2018), who affirmed a universal phenomenon: volatility is rough and cannot be described by a SDE driven by a classic Brownian motion. In particular, they showed that the autocorrelation function of the volatility does not behave as a power law, at least at the time scales ranging from one day till 2 months considered in their observation, so that they disentangled the question about the long memory of the volatility from the asymptotic behaviour of the autocorrelation function. Then, Gatheral et al. (2018) introduced a model, where the logarithm of the volatility is driven by a fractional Brownian motion (fBm) with a Hurst exponent that is empirically found to be very low, thus leading to rough trajectories for the volatility. For this reason, the approach is referred to as the rough fractional stochastic volatility model (RFSV). Using absolute moments estimation on a wide range of scales (from 1 day to approximatively 50 days), the Hurst exponent is found to be close to 0.14 both for the log-volatility of S&P 500 and the NASDAQ, together with other major indexes. The series of volatilities used in the seminal paper Gatheral et al. (2018) come from the realized variance estimates from the Oxford-Man Institute of Quantitative Finance Realized Library 222http://realized.oxford-man.ox.ac.uk/data/download. The Oxford-Man Institute’s Realized Library contains a selection of daily non-parametric estimates of volatility of financial assets, including realized variance and realized kernel estimates., between January 3, 2000 and March 31, 2014. In their approach, the daily squared volatility is estimated using the quadratic variations of the log-prices at a five-minute frequency (about 96 observations per day, in total about 3500 days). Another fundamental result in Gatheral et al. (2018) states that the estimation of the Hurst exponent HH is robust across time, scales and markets (equity indexes and FX). Before this empirical evidence of low Hurst exponents, based on series of historical volatilities, Alòs et al. (2007) suggested that values of HH below 1/21/2 should reproduce another stylized fact regarding implied volatilities, namely the short-date skew.

One may wonder if the results of Gatheral et al. (2018) depend on the particular estimation procedure adopted. There are indeed different ways to compute the volatility proxies (see e.g. the Fourier-based estimators for the realized variance in Cuchiero and Teichmann (2015) or the Parkinson estimator Parkinson (1980) that one can adopt when the realized variance is not available), or methods that even circumvent the absolute moment estimation procedure, like the Whittle-type estimation methods used in Fukasawa et al. (2019) to find directly the Hurst exponent. However, it is possible to show that all these methods lead to qualitatively similar results. We will show this and investigate daily time series on equity indexes using the Parkinson estimator in a forthcoming paper.

We thus observe two branches of the literature of fractional volatility leading to opposite conclusions:

  • The traditional econometric approach, which studies the speed of the decay of the autocovariance function, concludes that there is long memory.

  • The rough volatility approach, in which the Hurst exponent is estimated from scaling properties of the series, uses thus estimators from literature of econophysics and of the statistics of stochastic processes. It concludes that there is no long memory.

In these two approaches, the tools are not the same, the range of scales analysed may differ, and the conclusions diverge.

A paper that somehow adopts both approaches is Bennedsen et al. (2016), where the authors present a two-factor stochastic volatility model which is rough at the short time scales, but it presents stationarity at longer time scales, according to the traditional approach that considers the autocovariance function. This mixing effect leads to an effective Hurst parameter varying on different observation time scales. In particular, the authors find a Hurst parameter for the S&P 500 index ranging from few cents up to 0.2, as the observation time scale grows from one minute to few hundreds minutes. It is worth noticing that Bennedsen et al. (2016) do consider just intra-day data: it would be interesting to see if their model is able to reproduce the stylized facts that we empirically observe also for longer time scales with daily data as we are going to describe.

Despite some technical issues arising from the fact that a volatility process driven by a fractional Brownian motion is not a semi-martingale (and the corresponding integrals require particular care), rough volatility models have been largely investigated in recent theoretical and empirical literature, to the point that today one can find more than one hundred papers on the subject333See e.g. the papers on the website https://sites.google.com/site/roughvol/home.. Surprisingly, the empirical investigation basically relies on the dataset from the Oxford-Man Institute’s Realized Library, which is indeed very useful, but far to be the most complete.

The paper has two main contributions: the first is a modification of Gatheral et al. (2018)’s method taking errors into account, and the second is to report some findings from FX data. As volatility is unobserved, we include the estimation error into the analysis. In fact, what we observe is only noisy volatility. We model two types of noise, coming from measurement and smoothing (arising from the fact that we replace the unobserved spot variance with the integrated variance estimator). We quantify the impact of all these noises in the estimation of the Hurst exponent and we filter out their effects. What remains after the filtering still deviates from the straight line behaviour predicted by the rough volatility model paradigm. In particular, we extend previous empirical studies to a wider range of time scales, beyond the ones investigated in Gatheral et al. (2018), in order to check whether data are consistent with the scaling properties predicted by fractional volatility models. The most striking difference with these models is a strong convexity of the log-log plot. In other words, the Hurst exponent perceived at small scales is lower than the Hurst exponent perceived at larger scales. We also include the mean reversion of the volatility in our investigation: when we consider large time scales, we expect to be able to observe the stationarity features that were precluded in the previous studies, since the time windows investigated so far were too small compared to the mean reversion frequency. In order to be able to observe such mean reversion effect, we obviously need long time series for the underlyings. Our results show that the presence of only one fBm is not enough in order to meet all the stylized facts, convexity and stationarity, empirically observed in our dataset. The main conclusion is that, by broadening the ranges of scales, we highlight new stylized facts about the scaling of volatility processes that cannot be reproduced by the usual rough volatility model driven by a fractional Brownian motion.

The paper is organised as follows: in Section 2, we review Gatheral et al. (2018) and other preceding studies on the estimation of the Hurst parameter, such as Fukasawa et al. (2019) that pointed out some source of spurious roughness. Then, we propose a modification that includes the measurement and smoothing errors, with some reasoning supported by a simulation study. We re-examin the analysis of Gatheral et al. (2018) by including errors in Section 3, where we propose some ways to filter out the errors. Finally, in Section 4 we perform an empirical analysis on FX for a large dataset and we provide evidence of new stylized facts that are not present in past literature. Section 5 concludes.

2 Estimation of the Hurst exponent

2.1 Absolute moments estimation of the Hurst exponent

Consider the dynamics of a price process SS that evolves according to the SDE

dSt=(.)dt+StσtdBt,t0,dS_{t}=(.)dt+S_{t}\sigma_{t}dB_{t},t\geq 0, (1)

where BB is a standard Brownian motion defined in a probability space that satisfies the usual technical conditions. The process SS is assumed to be a semimartingale in order to avoid arbitrage opportunities.

The volatility price process σ\sigma is not directly observable, one can only deduce indirectly its properties through some observable proxies like the realized variance process, defined as

σ^δ,t2=(t1)δutδ|ΔlogS¯u|2,\widehat{\sigma}^{2}_{\delta,t}=\sum_{(t-1)\delta\leq u\leq t\delta}|\Delta\log\overline{S}_{u}|^{2}, (2)

where S¯\overline{S} is a piecewise constant process which jumps at every sampling time of SS to the observed value of SS at the time. If there is no measurement error and the sampling frequency goes to infinity we have that

σ^δ,t2(t1)δtδσu2𝑑u,\widehat{\sigma}^{2}_{\delta,t}\rightarrow\int_{(t-1)\delta}^{t\delta}\sigma^{2}_{u}du,

in probability, which justifies the choice of (2) for a proxy of the realized variance process, as well as its square root for the daily spot volatility, in the case where δ\delta corresponds to the length of one day.

In Gatheral et al. (2018), the authors performed a linear regression in order to fit the empirical absolute moment of order kk of the log-volatility, defined as

1nt=1n|logσ^δ,t+τlogσ^δ,t|k.\frac{1}{n}\sum_{t=1}^{n}{|\log\widehat{\sigma}_{\delta,t+\tau}-\log\widehat{\sigma}_{\delta,t}|^{k}}. (3)

They found a good fit, for different values of kk, with the function

log1nt=1n|logσ^δ,t+τlogσ^δ,t|kkHlogτ+ηk,\log\frac{1}{n}\sum_{t=1}^{n}{|\log\widehat{\sigma}_{\delta,t+\tau}-\log\widehat{\sigma}_{\delta,t}|^{k}}\approx kH\log\tau+\eta_{k}, (4)

for a very small value of H0.1H\approx 0.1.

This special scaling property, together with some empirical stylized facts on the Gaussian nature of the log-variance (see e.g. Andersen et al. (2003)), induced Gatheral et al. (2018) to assume a particular SDE for the log-volatility of the form

dlogσt2=ηdBtH,d\log\sigma^{2}_{t}=\eta dB^{H}_{t}, (5)

where η\eta is a constant and BHB^{H} is a fractional Brownian motion with Hurst parameter HH.

In fact, a fractional Brownian motion (fBm) of Hurst exponent H(0,1)H\in(0,1) and scale parameter η2\eta^{2} has stationary increments that satisfy

𝔼[|BtHBtτH|k]=2k/2Γ(k+12)Γ(12)ηkτkH,τ,k0,\mathds{E}\left[|B^{H}_{t}-B^{H}_{t-\tau}|^{k}\right]=\frac{2^{k/2}\Gamma(\frac{k+1}{2})}{\Gamma(\frac{1}{2})}\eta^{k}\tau^{kH},\qquad\tau,k\geq 0, (6)

where Γ(.)\Gamma(.) denotes the Gamma function, see Kolmogorov (1940); Mandelbrot and van Ness (1968).

Turning things around, we define the absolute empirical moment of order kk of the increments of a process XX (playing the role of the log-volatility process), in a time interval [0,N][0,N] for a given scale τ\tau:444 In reality, we are using a version of Mk,τ,N(X)M_{k,\tau,N}(X) with overlapping increments. This allows us to slightly increase the convergence of this empirical absolute moment Lo and MacKinlay (1988).

Mk,τ,N(X)=1N/τi=1N/τ|XiτX(i1)τ|k.M_{k,\tau,N}(X)=\frac{1}{\lfloor N/\tau\rfloor}\sum_{i=1}^{\lfloor N/\tau\rfloor}{|X_{i\tau}-X_{(i-1)\tau}|^{k}}. (7)

Using equation (6), it follows that ln(Mk,τ,N(X))\ln(M_{k,\tau,N}(X)) is proportional to HH if XX is a fBm as increments are stationary. This is the basis for estimators of Hurst exponents, see e.g. Benassi et al. (1998); Garcin (2017). In particular, we can compute such empirical absolute moments for a great number of scales, and the estimator of HH is then 1/k1/k times the slope of the regression of ln(Mk,τ,N(X))\ln(M_{k,\tau,N}(X)) on ln(τ)\ln(\tau) Coeurjolly (2005). As a consequence, when plotting ln(Mk,τ,N(X))\ln(M_{k,\tau,N}(X)) as a function of ln(τ)\ln(\tau) (also called log-log plot), we should get a straight line if XX is a fBm.

When restricting the analysis to small scales (say up to 2.5 months), one obtains such a straight line for log-log plot of realized volatilities, as first underlined by Gatheral et al. (2018). We reproduce this empirical observation in Figure 1, with two stock indices and data from the Oxford-Man Institute’s Realized Library.

Refer to caption
Figure 1: Log-log plot, log(τ)log(M2,τ,1(X))\log(\tau)\mapsto\log(M_{2,\tau,1}(X)), where XX is the log of realized volatility of two stock indices, NASDAQ index (black) and S&P index (grey), between March 2000 and November 2018 sampled at a five-minute step. Time scales range between 1 day till 2.5 months.

Note that the estimate H0.1H\approx 0.1 is in contrast with past literature on fractional volatility like Comte and Renault (1998), where the long memory was associated to H>0.5H>0.5. In particular, H<0.5H<0.5 means that the volatility path is rougher than semimartingales, and is consistent to a power law for the term structure of implied volatility skew empirically observed in option markets, see Gatheral et al. (2018) and references therein.

2.2 Impact of the mean-reversion at large scales

An empirical stylized fact for the volatility is that it should be a stationary process, for both mathematical tractability and financial interpretability mostly at large times. In order to be consistent with a stationarity assumption, the rough volatility model in Gatheral et al. (2018) extends equation (5) to the case where the log-volatility follows a fractional Ornstein-Uhlenbeck process (fOU, see Cheridito et al. (2003)) with a very long reversion time scale, so that the effect of this mean reversion is invisible at the scales of the study (time scales considered are between one day and two months). However, when dealing with scales longer than two months, we should expect a deviation from the linear behaviour in the log-log plot for stationarized fBm processes (for example a fOU or the inverse Lamperti transform of a fBm, see e.g. Cheridito et al. (2003); Garcin (2019); Šapina et al. (2020)). We illustrate the phenomenon in the simple case where k=2k=2, namely the second absolute moment for the fBm, that can be rewritten as follows:

M2,τ(X)\displaystyle M_{2,\tau}(X) =𝔼[|XtXtτ|2]\displaystyle=\mathds{E}\left[|X_{t}-X_{t-\tau}|^{2}\right]
=𝔼[|Xt|2]+𝔼[|Xtτ|2]2Cov(Xt,Xtτ)\displaystyle=\mathds{E}\left[|X_{t}|^{2}\right]+\mathds{E}\left[|X_{t-\tau}|^{2}\right]-2Cov\left(X_{t},X_{t-\tau}\right)

Now, for large scales, the first two terms become similar and independent of τ\tau. If the third term, the covariance, is a decreasing function wrt τ\tau (decreasing to 0), then M2,τ(X)M_{2,\tau}(X) becomes an increasing function that flattens for large τ\tau. As a consequence, the function log(τ)log(M2,τ(X))\log(\tau)\rightarrow\log(M_{2,\tau}(X)) behaves as a straight line for small values of τ\tau, but as the time scale increases, the slope decreases gradually until reaching zero and we observe a concave behaviour for large scales. In conclusion, if we include the stationary volatility stylized fact into account, we should observe a potential decrease in the slope of the straight line in the log-log plot, or equivalently a smaller Hurst exponent for larger scales.

2.3 Spurious roughness and noisy volatility

It is well known that it is possible to simulate a spurious effect of roughness in the volatility paths just by playing with the drift of the volatility process, in particular by introducing a strong mean reversion effect, see for example Figure 2 in Fukasawa et al. (2019) or Rogers (2019). There is indeed another easy way to generate spurious roughness, namely by adding noise into the observations, as also pointed out by Fukasawa et al. (2019). In fact, what we measure is not volatility but just noisy volatility. In this subsection, we first provide a motivational example where we show that the presence of an additive noise may lead to a spurious roughness effect in a simulation study, where parameters are in line with empirical findings of the historical series that we shall consider in our empirical investigation. We then show how to quantify the impact of the noises in the estimation of the Hurst exponent in order to filter out the bias.

We assume that the log-price log(St)\log(S_{t}) follows an Ito process with stochastic variance σt2\sigma^{2}_{t}. Let σ^t\widehat{\sigma}_{t} be the estimated volatility of day tt, obtained using for example the realized volatility. These estimators provide us with a noisy version of the true and unobserved volatility, σt\sigma_{t}. We assume an additive model of measurement noise555Note that here we assume an additive model for the variance, not for the volatility. We can justify this choice by the additive nature of the variance, which makes it possible to apply the central limit theorem and to get an asymptotic distribution of the measurement error Rootzen (1980); Jacod and Protter (1998); Barndorff-Nielsen and Shephard (2002).:

σ^t2=σt2+εt,\widehat{\sigma}_{t}^{2}=\sigma_{t}^{2}+\varepsilon_{t}, (8)

where t{1,,N}t\in\{1,...,N\} and where the ε1,,εN\varepsilon_{1},...,\varepsilon_{N} are i.i.d. centered random variables.

Moreover, we consider the simple case where the variance of the log-prices follows a geometric Brownian motion:

σt2=σ02exp(βBt12β2t).\sigma_{t}^{2}=\sigma_{0}^{2}\exp\left(\beta B_{t}-\frac{1}{2}\beta^{2}t\right). (9)

Endowed with the estimation of the parameters σ0\sigma_{0} and β\beta for (9), we simulate the model (8) for different values of the standard deviation of the noise and it turns out that for certain values of the noise variance, a spurious roughness effect appears naturally, in line with Fukasawa et al. (2019): Figure 2 shows a typical spurious rough simulated path for a volatility satisfying (9).

Refer to caption
Figure 2: Simulation of the noisy volatility model (8), in which the standard deviation of the noise is 25%25\% times the variance σt2\sigma^{2}_{t} of the log-price (the statistical framework exposed in Section 2.4 justifies this parametric dependence to σt2\sigma^{2}_{t}). The variance process of the price follows a geometric Brownian motion with the parameters estimated as for EUR/USD, i.e. σ0=4.62×103\sigma_{0}=4.62\times 10^{-3} and β=3.8%\beta=3.8\%.

This result strongly depends on the input volatility of the volatility (and on the model too, which is here limited to the geometric Brownian motion for the variance process). In particular, we used the same vol of vol for all the simulations, which is the vol of vol relevant for EUR/USD. With higher vol of vol, the impact of the noise in the estimation of the Hurst exponent is decreased, as one can see in Figure 3.

Refer to caption
Figure 3: Estimated Hurst exponent at small scales for various volatilities of the volatility, obtained by simulation of the prices and estimation of the realized volatility. The variance process of the price follows a geometric Brownian motion with the volatility parameter estimated as for EUR/USD, i.e. σ0=4.62×103\sigma_{0}=4.62\times 10^{-3}, and β\beta ranging from 0.1%0.1\% till 20%20\%.

In conclusion, a spurious roughness effect may appear in the presence of a noise. It will be therefore important to measure the standard deviation of the noise in order to filter out the bias from the estimation of the Hurst exponent.

2.4 Theoretical noise of the volatility proxy

In this subsection we focus on the first source of bias in the estimates, namely the measurement error, coming from the fact that the volatility process is not observable.

We can calculate the realized variance at a on-day scale for a given day, using nn log-returns:

σ^t2(n)=i=0n1[log(Sti/nSt(i+1)/n)]2.\widehat{\sigma}^{2}_{t}(n)=\sum_{i=0}^{n-1}{\left[\log\left(\frac{S_{t-i/n}}{S_{t-(i+1)/n}}\right)\right]^{2}}.

The realized variance is an approximation of the integrated variance:

σ¯t2=t1tσu2𝑑u.\bar{\sigma}^{2}_{t}=\int_{t-1}^{t}{\sigma^{2}_{u}du}.

If the log-price follows an Ito process, from e.g. Rootzen (1980); Jacod and Protter (1998); Barndorff-Nielsen and Shephard (2002), we know the asymptotic distribution of the difference between the realized variance and the integrated variance, conditionally to the volatility process (σu)u[t1,t](\sigma_{u})_{u\in[t-1,t]}:

n(σ^t2(n)σ¯t2)n𝒩(0,2t1tσu4𝑑u).\sqrt{n}\left(\widehat{\sigma}^{2}_{t}(n)-\bar{\sigma}^{2}_{t}\right)\overset{n\rightarrow\infty}{\longrightarrow}\mathcal{N}\left(0,2\int_{t-1}^{t}{\sigma^{4}_{u}du}\right).

That is, asymptotically, σ^t2(n)σ¯t2\widehat{\sigma}^{2}_{t}(n)-\bar{\sigma}^{2}_{t} is a centered Gaussian variable of variance 2nt1tσu4𝑑u\frac{2}{n}\int_{t-1}^{t}{\sigma^{4}_{u}du}. By neglecting the variations of σt\sigma_{t} in a day, we approximate the variance of the εt\varepsilon_{t} in equation (8) by 2σt4/n2\sigma_{t}^{4}/n. For FX rates, sampled every minute, we have n=1440n=1440.

2.5 A second type of noise: the smoothing effect

Motivated by the inaccuracy of the Hurst estimation, we now focus on the noise that comes from the fact that we try to estimate the Hurst exponent of the volatility process by applying estimation methods for the integrated volatility. Therefore, when we compute the variance of an increment of the realized volatility in the estimation of the Hurst exponent, for a given scale τ\tau, we are in fact computing the integrated variance of the increment of the volatility process for a scale varying between τ\tau minus one day and τ\tau plus one day. This is referred to as to the smoothing error, namely the bias introduced by taking the integral instead of the spot (unobservable) argument, see also Appendix 3 in Gatheral et al. (2018), where methods and equations are similar to what we’re going to show in this subsection. In particular, in what follows, we show that this smoothing effect tends to overestimate the Hurst exponent. It may also explain why, in the Nasdaq log-log plots obtained by Gatheral et al. (2018), the slope of straight line is slightly higher for the lowest log-scales, what is illustrated in Figure 4.

Refer to caption
Figure 4: Theoretical log second moment of the increments of σ^t,N,d2\widehat{\sigma}^{2}_{t,N,d}, with d=1d=1 and N=100N=100, for an input Hurst exponent equal to 0.15. The grey dotted lines are the tangents for scales 1 and 2 (log-scales 0 and 0.7), and scales 5 and 10 (log-scales 1.6 and 2.3), leading respectively to estimated Hurst exponents of 0.42 and 0.24.
Proposition 2.1.

Let the variance process σt2\sigma_{t}^{2} follow a fBm of Hurst exponent HH and variance ξ2\xi^{2}. Let the realized variance for d1d\geq 1 days and N1N\geq 1 observations per day be defined by:

σ^t,N,d2=dNi=0N1σtid/N2.\widehat{\sigma}^{2}_{t,N,d}=\frac{d}{N}\sum_{i=0}^{N-1}{\sigma_{t-id/N}^{2}}.

Then, the variance of an increment of duration τd\tau d of σ^t,N,d2\widehat{\sigma}^{2}_{t,N,d} is:

𝔼[(σ^t,N,d2σ^tτd,N,d2)2]=ξ2d2N2i=0N1j=0N1(|τd+(ji)dN|2H|(ji)dN|2H).\mathds{E}\left[(\widehat{\sigma}^{2}_{t,N,d}-\widehat{\sigma}^{2}_{t-\tau d,N,d})^{2}\right]=\xi^{2}\frac{d^{2}}{N^{2}}\sum_{i=0}^{N-1}{\sum_{j=0}^{N-1}{\left(\left|\tau d+\frac{(j-i)d}{N}\right|^{2H}-\left|\frac{(j-i)d}{N}\right|^{2H}\right)}}.

For d=1d=1, we also have the asymptotic expression:

limN+𝔼[(σ^t,N,12σ^tτ,N,12)2]=ξ2τ2Hf(τ,H),\underset{N\rightarrow+\infty}{\lim}\mathds{E}\left[(\widehat{\sigma}^{2}_{t,N,1}-\widehat{\sigma}^{2}_{t-\tau,N,1})^{2}\right]=\xi^{2}\tau^{2H}f(\tau,H), (10)

where

f(τ,H)=τ2(2H+1)(2H+2)((1+1τ)2H+222(1τ)2H+2+(11τ)2H+2).f(\tau,H)=\frac{\tau^{2}}{(2H+1)(2H+2)}\left(\left(1+\frac{1}{\tau}\right)^{2H+2}-2-2\left(\frac{1}{\tau}\right)^{2H+2}+\left(1-\frac{1}{\tau}\right)^{2H+2}\right). (11)
Proof.

We write first the straightforward decomposition:

𝔼[(σ^t,N,d2σ^tτd,N,d2)2]=𝔼[σ^t,N,d4]+𝔼[σ^tτd,N,d4]2𝔼[σ^t,N,d2σ^tτd,N,d2]=d2N2i,j=0N1(𝔼[σtidN2σtjdN2]+𝔼[σtτdidN2σtτdjdN2]2𝔼[σtidN2σtτdjdN2]).\begin{array}[]{cl}&\mathds{E}\left[(\widehat{\sigma}^{2}_{t,N,d}-\widehat{\sigma}^{2}_{t-\tau d,N,d})^{2}\right]\\ =&\mathds{E}\left[\widehat{\sigma}_{t,N,d}^{4}\right]+\mathds{E}\left[\widehat{\sigma}_{t-\tau d,N,d}^{4}\right]-2\mathds{E}\left[\widehat{\sigma}^{2}_{t,N,d}\widehat{\sigma}^{2}_{t-\tau d,N,d}\right]\\ =&\frac{d^{2}}{N^{2}}\sum_{i,j=0}^{N-1}{\left(\mathds{E}\left[\sigma^{2}_{t-i\frac{d}{N}}\sigma^{2}_{t-j\frac{d}{N}}\right]+\mathds{E}\left[\sigma^{2}_{t-\tau d-i\frac{d}{N}}\sigma^{2}_{t-\tau d-j\frac{d}{N}}\right]-2\mathds{E}\left[\sigma^{2}_{t-i\frac{d}{N}}\sigma^{2}_{t-\tau d-j\frac{d}{N}}\right]\right)}.\end{array}

Then, using the fact that 𝔼[σti2σtj2]=ξ22(|ti|2H+|tj|2H|titj|2H)\mathds{E}\left[\sigma^{2}_{t_{i}}\sigma^{2}_{t_{j}}\right]=\frac{\xi^{2}}{2}\left(|t_{i}|^{2H}+|t_{j}|^{2H}-|t_{i}-t_{j}|^{2H}\right), according to the fBm assumption, we get:

𝔼[(σ^t,N,d2σ^tτd,N,d2)2]=ξ2d22N2i,j=0N1([|tidN|2H+|tjdN|2H|(ji)dN|2H]+[|tτdidN|2H+|tτdjdN|2H|(ji)dN|2H]2[|tidN|2H+|tτdjdN|2H|τd+(ji)dN|2H]).\begin{array}[]{ccl}\mathds{E}\left[(\widehat{\sigma}^{2}_{t,N,d}-\widehat{\sigma}^{2}_{t-\tau d,N,d})^{2}\right]&=&\xi^{2}\frac{d^{2}}{2N^{2}}\sum_{i,j=0}^{N-1}{\left(\left[\left|t-i\frac{d}{N}\right|^{2H}+\left|t-j\frac{d}{N}\right|^{2H}-\left|\frac{(j-i)d}{N}\right|^{2H}\right]\right.}\\ &&+\left[\left|t-\tau d-i\frac{d}{N}\right|^{2H}+\left|t-\tau d-j\frac{d}{N}\right|^{2H}-\left|\frac{(j-i)d}{N}\right|^{2H}\right]\\ &&\left.-2\left[\left|t-i\frac{d}{N}\right|^{2H}+\left|t-\tau d-j\frac{d}{N}\right|^{2H}-\left|\tau d+\frac{(j-i)d}{N}\right|^{2H}\right]\right).\end{array}

Noting that in the above equation i,j=0N1|tτdidN|2H=i,j=0N1|tτdjdN|2H\sum_{i,j=0}^{N-1}{\left|t-\tau d-i\frac{d}{N}\right|^{2H}}=\sum_{i,j=0}^{N-1}{\left|t-\tau d-j\frac{d}{N}\right|^{2H}} and that i,j=0N1|tidN|2H=i,j=0N1|tjdN|2H\sum_{i,j=0}^{N-1}{\left|t-i\frac{d}{N}\right|^{2H}}=\sum_{i,j=0}^{N-1}{\left|t-j\frac{d}{N}\right|^{2H}}, we finally obtain:

𝔼[(σ^t,N,d2σ^tτd,N,d2)2]=ξ2d2N2i=0N1j=0N1(|τd+(ji)dN|2H|(ji)dN|2H).\mathds{E}\left[(\widehat{\sigma}^{2}_{t,N,d}-\widehat{\sigma}^{2}_{t-\tau d,N,d})^{2}\right]=\xi^{2}\frac{d^{2}}{N^{2}}\sum_{i=0}^{N-1}{\sum_{j=0}^{N-1}{\left(\left|\tau d+\frac{(j-i)d}{N}\right|^{2H}-\left|\frac{(j-i)d}{N}\right|^{2H}\right)}}.

From this result, we conclude for the asymptotic case:

limN+𝔼[(σ^t,N,12σ^tτ,N,12)2]=ξ20101(|τ+vu|2H|vu|2H)𝑑u𝑑v=ξ2τ2Hf(τ,H).\begin{array}[]{ccl}\underset{N\rightarrow+\infty}{\lim}\mathds{E}\left[(\widehat{\sigma}^{2}_{t,N,1}-\widehat{\sigma}^{2}_{t-\tau,N,1})^{2}\right]&=&\xi^{2}\int_{0}^{1}{\int_{0}^{1}{\left(\left|\tau+v-u\right|^{2H}-\left|v-u\right|^{2H}\right)du}dv}\\ &=&\xi^{2}\tau^{2H}f(\tau,H).\end{array}

The asymptotic version of the realized variance σ^.\widehat{\sigma}_{.} is the integrated variance σ¯.\bar{\sigma}_{.}. In Proposition 2.1, we get an expression for the variance of its increments which is consistent with the one provided by Gatheral et al. (2018) and which is more concise than the equivalent expression obtained for the realized variance.

As one can see in Figure 5, though the bias is limited for higher values of HH, it is very significant for lower Hurst exponents.

Refer to caption
Figure 5: Theoretically estimated Hurst exponent for σ^t,N,d2\widehat{\sigma}^{2}_{t,N,d}, with d=1d=1 and N=100N=100, for various values of input Hurst exponents. The black line is the estimation on scales 1 and 2 (log-scales 0 and 0.7), the grey one on scales 5 and 10 (log-scales 1.6 and 2.3). The dotted line is the identity.
Remark 2.1.

There is also a third type of error, namely the microstructure noise, coming from decimalization, absence of distinction between bid and ask, price formation and other issues typicically faced in intraday data (see e.g. Robert and Rosenbaum (2011a, b) and references therein). This microstructure noise affects the price at each observation in a roughly similar manner as the measurement noise already analysed. It tends to overestimate the variance process. In order to limit the impact of the microstructure noise, one can consider higher time steps for estimating the integrated variance. In line with the empirical literature, a 5-minute or even a 1-minute frequency turns out to limit the impact of the microstructure noise in the FX market (Bubák et al., 2011; Lallouache and Abergel, 2013; Kuck and Maderitsch, 2019), so we will neglect this error in our empirical analysis.

3 Filtering the noises

In this section, we are only working with moments of order 2, with an increment duration τ\tau\in\mathbb{N} such that 1τN1\leq\tau\ll N:

M2,τ(X)=1(Nτ+1)i=τN|XiXiτ|2.M_{2,\tau}(X)=\frac{1}{(N-\tau+1)}\sum_{i=\tau}^{N}{|X_{i}-X_{i-\tau}|^{2}}. (12)

We want to filter the measurement noise and the smoothing error. More precisely, for the measurement noise we simply correct the bias in the estimation of all the absolute moments in the log-log plot for the variance, and for the smoothing error we correct the multiplicative error in the moment appearing in equation (10). To put it simply, we are looking for the moments of increments of σt2\sigma^{2}_{t}, but as we cannot observe σt2\sigma^{2}_{t} we work instead with σ¯t2\bar{\sigma}^{2}_{t}, thus introducing the smoothing error. In addition, σ¯t2\bar{\sigma}^{2}_{t} is approximated by σ^t2\widehat{\sigma}^{2}_{t}, whose difference with σ¯t2\bar{\sigma}^{2}_{t} is the measurement noise.

The way the noise is to be filtered strongly depends on the model one assumes for the volatility dynamic. In the next two subsections, we present two filtering methods based on two competing models. The first one is consistent with the RFSV approach. The second model uses the same underlying dynamic, a fBm, but applies it to the variance process instead of the volatility process. This approach is more consistent with dynamics inspired by the ARCH process, which are traditionally invoked in econometrics.

3.1 The log-volatility as a fBm

The first model we consider if the RFSV model Gatheral et al. (2018): σt=σexp(ξBtH)\sigma_{t}=\sigma\exp(\xi B^{H}_{t}), where BtHB^{H}_{t} is a fBm of Hurst exponent HH and σ,ξ>0\sigma,\xi>0. With this assumption, the theoretical log-log plot may differ from a straight line because of measurement noise and smoothing error. We present a method to filter these noises from the log-log plot. If the RFSV model depicted all the scaling features of the spot volatility series, the filtered plot should result in a straight line.

The log-log plot of the log-volatility should be based on M2,τ(log(σ))M_{2,\tau}(\log(\sigma)). However, we only observe M2,τ(log(σ^))M_{2,\tau}(\log(\widehat{\sigma})). Thanks to the successive approximations explained below, we get the following relation between M2,τ(log(σ))M_{2,\tau}(\log(\sigma)) and M2,τ(log(σ^))M_{2,\tau}(\log(\widehat{\sigma})):

M2,τ(log(σ.))=14M2,τ(log(σ.2))14σ04M2,τ(σ.2)14σ04f(τ,H)1M2,τ(σ¯.2)14f(τ,H)1M2,τ(log(σ¯.2))14f(τ,H)1[M2,τ(log(σ^.2))4n]=f(τ,H)1[M2,τ(log(σ^.))1n],\begin{array}[]{ccl}M_{2,\tau}(\log(\sigma_{.}))&=&\frac{1}{4}M_{2,\tau}(\log(\sigma^{2}_{.}))\\ &\approx&\frac{1}{4\sigma_{0}^{4}}M_{2,\tau}(\sigma^{2}_{.})\\ &\approx&\frac{1}{4\sigma_{0}^{4}}f(\tau,H)^{-1}M_{2,\tau}(\bar{\sigma}^{2}_{.})\\ &\approx&\frac{1}{4}f(\tau,H)^{-1}M_{2,\tau}(\log(\bar{\sigma}^{2}_{.}))\\ &\approx&\frac{1}{4}f(\tau,H)^{-1}\left[M_{2,\tau}(\log(\widehat{\sigma}^{2}_{.}))-\frac{4}{n}\right]\\ &=&f(\tau,H)^{-1}\left[M_{2,\tau}(\log(\widehat{\sigma}_{.}))-\frac{1}{n}\right],\end{array} (13)

where σ0\sigma_{0}\in\mathbb{R} and ff is defined in equation (11).

  • The approximation between the first and the second line is based on a first-order Taylor expansion of the logarithm, around an arbitrary value σ02\sigma_{0}^{2}:

    M2,τ(log(σ.2))=𝔼[(log(σ.+τ2)log(σ.2))2]𝔼[(log(σ02)+σ.+τ2σ02σ02log(σ02)σ.2σ02σ02)2]=1σ04𝔼[(σ.+τ2σ.2)2]=1σ04M2,τ(σ.2).\begin{array}[]{ccl}M_{2,\tau}(\log(\sigma^{2}_{.}))&=&\mathds{E}[(\log(\sigma^{2}_{.+\tau})-\log(\sigma^{2}_{.}))^{2}]\\ &\approx&\mathds{E}\left[\left(\log(\sigma_{0}^{2})+\frac{\sigma^{2}_{.+\tau}-\sigma_{0}^{2}}{\sigma_{0}^{2}}-\log(\sigma_{0}^{2})-\frac{\sigma^{2}_{.}-\sigma_{0}^{2}}{\sigma_{0}^{2}}\right)^{2}\right]\\ &=&\frac{1}{\sigma_{0}^{4}}\mathds{E}\left[\left(\sigma^{2}_{.+\tau}-\sigma^{2}_{.}\right)^{2}\right]\\ &=&\frac{1}{\sigma_{0}^{4}}M_{2,\tau}(\sigma^{2}_{.}).\end{array}

    In addition to the error left by the Taylor expansion, another source of error may appear in the equations above and in all this section regarding the difference between the empirical moments, M2,τM_{2,\tau}, and the theoretical ones. However, for long time series as ours, the difference is small with respect to other approximations and is equal to zero in average.

  • The approximation between the second and the third line is based on equation (10), that is M2,τ(σ¯.2)=M2,τ(σ.2)f(τ,H)M_{2,\tau}(\bar{\sigma}^{2}_{.})=M_{2,\tau}(\sigma^{2}_{.})f(\tau,H), and thus filters the smoothing error, assuming that σ2\sigma^{2} follows a fBm of Hurst exponent HH. In fact, the assumption that log(σ)\log(\sigma) follows a fBm, which is the assumption in line with the rough volatility model of Gatheral et al. (2018), leads to the same approximation thanks to the first-order Taylor expansion of the logarithm introduced above, as M2,τ(log(σ.))14σ04M2,τ(σ.2)M_{2,\tau}(\log(\sigma_{.}))\approx\frac{1}{4\sigma_{0}^{4}}M_{2,\tau}(\sigma^{2}_{.}), which can be seen by following the same argument as in Proposition B.1 of Fukasawa et al. (2019) who quantified the difference between the log of integrated variance and the integral of log variance.

  • The approximation between the third and the fourth line is based on the same kind of Taylor expansion than between the first and the second line.

  • The approximation between the fourth and the fifth line exploits the measurement noise between σ^2\widehat{\sigma}^{2} and σ¯2\bar{\sigma}^{2}. We have seen that if the log-price follows an Ito process, then σ^t2=σ¯t2+εt\widehat{\sigma}^{2}_{t}=\bar{\sigma}^{2}_{t}+\varepsilon_{t} where εt\varepsilon_{t} is asymptotically a centered Gaussian variable of variance 2nt1tσu4𝑑u2nσ¯t4\frac{2}{n}\int_{t-1}^{t}{\sigma_{u}^{4}du}\approx\frac{2}{n}\bar{\sigma}_{t}^{4}. Thus, as an approximation, we consider that σ^t2σ¯t2(1+αt)\widehat{\sigma}^{2}_{t}\approx\bar{\sigma}^{2}_{t}(1+\alpha_{t}), with αt𝒩(0,2/n)\alpha_{t}\sim\mathcal{N}(0,2/n). According to Theorem H.1 of Fukasawa et al. (2019), the αt\alpha_{t} are independent variables, also independent of the σ¯t2\bar{\sigma}^{2}_{t}, then:

    M2,τ(log(σ^.2))𝔼[(log(σ¯.+τ2)log(σ¯.2)+log(1+α.+τ)log(1+α.))2]=M2,τ(log(σ¯.2))+𝔼[(log(1+α.+τ)log(1+α.))2]=M2,τ(log(σ¯.2))+2𝔼[(log(1+α.))2]2(𝔼[log(1+α.)])2=M2,τ(log(σ¯.2))+2𝕍ar(log(1+α.))M2,τ(log(σ¯.2))+2𝕍ar(α.),\begin{array}[]{ccl}M_{2,\tau}(\log(\widehat{\sigma}^{2}_{.}))&\approx&\mathds{E}[(\log(\bar{\sigma}^{2}_{.+\tau})-\log(\bar{\sigma}^{2}_{.})+\log(1+\alpha_{.+\tau})-\log(1+\alpha_{.}))^{2}]\\ &=&M_{2,\tau}(\log(\bar{\sigma}^{2}_{.}))+\mathds{E}[(\log(1+\alpha_{.+\tau})-\log(1+\alpha_{.}))^{2}]\\ &=&M_{2,\tau}(\log(\bar{\sigma}^{2}_{.}))+2\mathds{E}[(\log(1+\alpha_{.}))^{2}]-2(\mathds{E}[\log(1+\alpha_{.})])^{2}\\ &=&M_{2,\tau}(\log(\bar{\sigma}^{2}_{.}))+2\mathbb{V}ar(\log(1+\alpha_{.}))\\ &\approx&M_{2,\tau}(\log(\bar{\sigma}^{2}_{.}))+2\mathbb{V}ar(\alpha_{.}),\end{array} (14)

    where we go from the first to the second line thanks to the fact that α.+τ\alpha_{.+\tau} and α.\alpha_{.} are identically distributed, then to the third line because they are iid and to the last line by a Taylor expansion for the second moment of the logarithm of a random variable.

In equation (13), we filter successively the measurement noise by translating each absolute moment by a value of 1/n1/n. Then, we filter the smoothing error by dividing the result by f(τ,H)f(\tau,H), with a properly chosen HH, consistent with the rough framework.

We show on a theoretical example, illustrated in Figure 6, that, despite all the approximations cited above, the filtering method we propose is fairly accurate.

Refer to caption
Figure 6: Average of log(τ)log(M2,τ(X))\log(\tau)\mapsto\log(M_{2,\tau}(X)), where XX is the log of the realized volatility (with n=36n=36 observations each day, thus corresponding to a 4040-minute time step; as a robustness check, we have led the same analysis for other values of nn and have obtained similar results) of simulated prices with a volatility following an RFSV model, with H=0.25H=0.25 and ξ=10%\xi=10\%. The black dots are the unfiltered observations and the straight line is the theoretical log-log plot for the RFSV model. The grey dots are the observations filtered by removing 1/n1/n from each moment (filtering of the measurement noise), the red ones by filtering both the measurement noise and the smoothing error by the means of equation (13).

3.2 The log-variance as a fBm

If, instead of the RFSV model assumed in the subsection above, the log-variance follows a fBm, the approximations are then transformed into the following, when using the realized variance proxy:

M2,τ(log(σ.2))1σ04M2,τ(σ.2)1σ04f(τ,H)1M2,τ(σ¯.2)f(τ,H)1M2,τ(log(σ¯.2))f(τ,H)1[M2,τ(log(σ^.2))4n].\begin{array}[]{ccl}M_{2,\tau}(\log(\sigma^{2}_{.}))&\approx&\frac{1}{\sigma_{0}^{4}}M_{2,\tau}(\sigma^{2}_{.})\\ &\approx&\frac{1}{\sigma_{0}^{4}}f(\tau,H)^{-1}M_{2,\tau}(\bar{\sigma}^{2}_{.})\\ &\approx&f(\tau,H)^{-1}M_{2,\tau}(\log(\bar{\sigma}^{2}_{.}))\\ &\approx&f(\tau,H)^{-1}\left[M_{2,\tau}(\log(\widehat{\sigma}^{2}_{.}))-\frac{4}{n}\right].\end{array}

The main difference with the model in which the log-volatility is an fBm is a factor 4 in the filter of the measurement noise.

4 Empirical results for exchange rates

In this section we implement the empirical absolute moment regressions for volatilities of the most liquid exchange rates. We base our analysis on a real data set from Interactive Brokers. It gathers high-frequency rates between the 18th December 2006 and the 19th June 2019, sampled at a one-minute step, for a total of more than 4.6 millions data for each of the following pairs: EUR/USD, EUR/GBP, EUR/JPY, EUR/CAD, EUR/AUD, GBP/USD, GBP/JPY, USD/JPY, AUD/USD, and AUD/JPY. From these rates, we estimate daily volatilities, using the 1,440 observations in each day. We thus have for each series 3,206 consecutive observations of daily volatility that will be estimated by the realized volatility, defined as the square root of the average quadratic one-minute log-variation. We display the results in Figure 7 and Table 1.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: Log-log plot (in black), log(τ)log(M2,τ,1(X))\log(\tau)\mapsto\log(M_{2,\tau,1}(X)), where XX is the log of realized volatility of various FX rates between December 2006 and June 2019 sampled at a one-minute step, for a total of 4.6 millions data for each of the following pairs: EUR/USD, EUR/GBP, EUR/JPY, EUR/CAD, EUR/AUD, GBP/USD, GBP/JPY, USD/JPY, AUD/USD, and AUD/JPY. Time scales range between 1 day till 3 years. The log-log plot in red is obtained by applying the denoising filter.

In Figure 7, we observe various regimes for the volatility process, depending on the time scale. In general, for log-scales between 0 and 3 (that is to say between 1 day and 3 weeks), a linear regression of small slope holds for empirical absolute moments. Between 3 and a higher abscissa which depends on the rate considered (so between 3 weeks and a time horizon \mathcal{H}, which is located between 4.5 months and 12 months), the slope steepens. Above this long time horizon \mathcal{H}, the slope flattens and, at scales which depend again on the sample, aberrant oscillations appear, due to the limited size of the sample. Therefore, we observe a threefold behaviour of the dynamics: for small time scales (1 day till 3 weeks) we get a small Hurst exponent, at higher scales (3 weeks till \mathcal{H}) we have a higher Hurst exponent, and for very large scales (above \mathcal{H}) we observe stationarity.

Scale Small scales Large scales
EUR/USD 0.075 0.244
EUR/GBP 0.086 0.217
EUR/JPY 0.117 0.190
EUR/CAD 0.068 0.184
EUR/AUD 0.106 0.192
GBP/USD 0.093 0.157
GBP/JPY 0.112 0.148
USD/JPY 0.098 0.144
AUD/USD 0.108 0.196
AUD/JPY 0.135 0.152
Table 1: Perceived Hurst exponent before denoising, estimated as half the slope of linear regressions for the plots presented in Figure 7 restricted a time horizon lower than \mathcal{H}. Small scales refer to the range from 1 day till 3 weeks, large scales refer to a range depending on the sample considered: from 2 months till 4.5 months for EUR/JPY, EUR/CAD, EUR/AUD, GBP/JPY, and AUD/JPY, from 2 till 5 months for AUD/USD, from 3 till 8 months for EUR/USD, from 5 till 9 months for USD/JPY, and from 5 till 12 months for EUR/GBP and GBP/USD.

Now we turn our attention to the measurement noise and the smoothing error. We apply the methodology described in the previous section and we filter these noises from the observations. We display the results for the EUR/USD time series in Figure 8, with the successive application of filters.

The filtering of the measurement noise steepens the log-log plot (grey curve). With a 5-minute time step in the computation of realized variance, this effect is slight. Like for the initial curve, we observe convexity when the measurement noise is filtered: higher estimated Hurst exponent for low-frequency increments than for high-frequency increments of volatility. The second filter is about the smoothing error. It accentuates the convexity (red curve). We conclude that the filtered log-log plot still shows convexity. The fBm assumption, for which we should have the same slope of the log-log plot for all scales, is not enough to explain the observations.

Refer to caption
Figure 8: Filtering noise for EUR/USD. Log-log plot for the the realized variance with 5-minute time step (in black). The measurement noise has been filtered by translating each absolute moment by a value of 1/n1/n as in Equation (13) (log-log plot in grey). Smoothing error has been additionally filtered by dividing the result by f(τ,H)f(\tau,H), with HH chosen to be in line with the estimates (the results are indeed robust wrt the choice of HH, consistent with the rough framework (log-log plot in red). Time scales range between 1 day till 3 years.

We display similar results for all the studied FX rates in Figure 7. We consider a 1-minute time step in the computation of the realized variance. For FX rates, this amounts to 1440 observations each day and the corresponding measurement noise is almost invisible. We therefore do not decompose the noise filtering in this figure and simply display the raw (black) and fully filtered (red) curves. Once again, we observe convexity in the raw and filtered log-log plots.

These results deserve some comments, since the filtering relies on some approximations:

  • Taylor expansions of the logarithm.

  • Identification of the empirical moments to the theoretical ones.

  • For the filtering of the smoothing error, we assume that the distortion of the log-log plot is the one consistent with a model in which the variance follows an fBm, rather than the rough volatility model that assumes instead that the log-volatility is an fBm. Nevertheless, the smoothing error tends to make the log-plot more concave because the increments of duration τ\tau of the realized variance mitigates increments of the spot variance of duration τ\tau with increments of shorter duration, for which the covariance is much stronger. However, we still observe that, even without this approximative filtering, the log-plot is convex in our empirical findings, see Figure 8. On top of that, we have conducted the same analysis as above on FX rates, by replacing proxies of volatility by proxies of variance. The filtered version is very similar, with still a clear convexity: the slope of the log-log plot increases with the time scale. We have not displayed the log-log plots of empirical variance processes in this paper since they are very close to the ones we obtained for the volatility. It supplements our findings of the previous subsection: neither the volatility nor the variance seems to follow a fBm, according to both raw and filtered log-log plots.

  • The filtering function ff for the smoothing error depends on an HH parameter. But the value of the true Hurst exponent is unknown and should be the output of the filtering, not the input. In Figures 7 and 8, we have input an HH equal to the large-scale perceived Hurst exponent before denoising666 We thus take into account the fact that the HH perceived at large scales is less affected by noise than the HH at small scales., as displayed in Table 1, for instance H=0.244H=0.244 for EUR/USD. But whatever the value of HH, the filtered log-log plot (red) shows a stronger convexity than the grey curve, which is also convex. The lower the input HH in the function ff, the stronger this convexity. If the true dynamic was an fBm of Hurst exponent HH, the filtered log-log plot, with HH as input of ff, should be a straight line of slope 2H2H. Since no HH results in such a straight line, the fBm cannot describe on its own the dynamic of log-volatilities.

Remark 4.1.

When the time step used in the realized variance increases, the measurement noise increases as well. As a consequence, the slope of the log-log plot decreases (because of the greater impact of the noise in the estimation of the Hurst exponent), as one can see in Figure 9, but empirically the global shape of the log-log plot remains unchanged, still showing convexity. In other words, the choice of the number of time steps in the computation of the realized variance has empirically a limited impact on the estimation of the Hurst exponent in our dataset.

Refer to caption
Figure 9: Estimated Hurst exponent at small scales (lower than 21 days, in black) and high scales (between 54 and 149 days, in grey) for various time steps in the realized variance, obtained on EUR/USD.

5 Conclusion

In this paper we performed a extensive empirical investigation on a large dataset of exchange rates. Our findings are twofold: first, we confirmed that the estimation of the Hurst exponent is indeed below 0.5 for time scales close to the ones considered in past literature, e.g. in Gatheral et al. (2018). However, when larger time scales are considered (which is possible thanks to the size of our dataset), we face a violation of the stationarity assumption of the increments of the fBm driving the volatility process, in contrast with the rough volatility paradigm. In fact, in the log-log plot we observe a convexity effect that cannot be explained even by taking into account the different types of noises in the estimation procedures. As a consequence, we consider this convexity effect as a new stylized fact of the market that any advanced volatility model should be able to reproduce.

References

  • Alòs et al. [2007] E. Alòs, J.A. León, and J. Vives. On the short-time behavior of the implied volatility for jump-diffusion models with stochastic volatility. Finance and stochastics, 11(4):571–589., 2007.
  • Andersen and Bollerslev [1997] T.G. Andersen and T. Bollerslev. Intraday periodicity and volatility persistence in financial markets. Journal of empirical finance, 4(2):115–158, 1997.
  • Andersen et al. [2003] T.G. Andersen, T. Bollerslev, F.X. Diebold, and P. Labys. The distribution of realized exchange rate volatility. Journal of the American Statistical Association, 96:42–55, 2003.
  • Baillie [1996] R.T. Baillie. Long memory processes and fractional integration in econometrics. Journal of econometrics, 73(1):5–59, 1996.
  • Barndorff-Nielsen and Shephard [2001] O.E. Barndorff-Nielsen and N. Shephard. Non-Gaussian Ornstein-Uhlenbeck-based models and some of their uses in financial economics. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2):167–241, 2001.
  • Barndorff-Nielsen and Shephard [2002] O.E. Barndorff-Nielsen and N. Shephard. Econometric analysis of realized volatility and its use in estimating stochastic volatility models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64(2):253–280, 2002.
  • Benassi et al. [1998] A. Benassi, S. Cohen, and J. Istas. Identifying the multifractional function of a Gaussian process. Statistics and probability letters, 39(4):337–345, 1998.
  • Bennedsen et al. [2016] M. Bennedsen, A. Lunde, and M.S. Pakkanen. Decoupling the short- and long-term behavior of stochastic volatility. arXiv preprint, 1610.00332, 2016.
  • Bollerslev and Mikkelsen [1996] T. Bollerslev and H.O. Mikkelsen. Modeling and pricing long memory in stock market volatility. Journal of econometrics, 73(1):151–184, 1996.
  • Breidt et al. [1998] F.J. Breidt, N. Crato, and P. De Lima. The detection and estimation of long memory in stochastic volatility. Journal of econometrics, 83(1-2):325–348, 1998.
  • Bubák et al. [2011] V. Bubák, E. Kočenda, and F. Žikeš. Volatility transmission in emerging European foreign exchange markets. Journal of banking & finance, 35(11):2829–2841, 2011.
  • Cheridito et al. [2003] P. Cheridito, H. Kawaguchi, and M. Maejima. Fractional Ornstein-Uhlenbeck processes. Electronic journal of probability, 8(3):1–14, 2003.
  • Coeurjolly [2005] J.-F. Coeurjolly. Identification of multifractional Brownian motion. Bernoulli, 11(6):987–1008, 2005.
  • Comte and Renault [1998] F. Comte and E. Renault. Long memory in continuous-time stochastic volatility models. Mathematical finance, 8(4):291–323, 1998.
  • Cuchiero and Teichmann [2015] C. Cuchiero and J. Teichmann. Fourier transform methods for pathwise covariance estimation in the presence of jumps. Stochastic processes and their applications, 125(1):116–160, 2015.
  • Ding et al. [1993] Z. Ding, C.W. Granger, and R.F. Engle. A long memory property of stock market returns and a new model. Journal of empirical finance, 1(1):83–106, 1993.
  • Duffie et al. [2003] D. Duffie, D. Filipovic, and W. Schachermayer. Affine processes and applications in finance. The Annals of Applied Probability, 13(3):984–1053, 2003.
  • Dupire [1994] B. Dupire. Pricing with a smile. Risk, 7:18–20, 1994.
  • Fukasawa et al. [2019] M. Fukasawa, T. Takabatake, and R. Westphal. Is volatility rough? arXiv preprint, 1905.04852, 2019.
  • Garcin [2017] M. Garcin. Estimation of time-dependent Hurst exponents with variational smoothing and application to forecasting foreign exchange rates. Physica A: statistical mechanics and its applications, 483:462–479, 2017.
  • Garcin [2019] M. Garcin. Hurst exponents and delampertized fractional Brownian motions. International journal of theoretical and applied finance, 22(5):1950024, 2019.
  • Garcin and Goulet [2019] M. Garcin and C. Goulet. Non-parametric news impact curve: a variational approach. Soft computing, 24(18):13797–13812, 2019.
  • Gatheral et al. [2018] J. Gatheral, T. Jaisson, and M. Rosenbaum. Volatility is rough. Quantitative finance, 18(6):933–949, 2018.
  • Geweke and Porter-Hudak [1983] J. Geweke and S. Porter-Hudak. The estimation and application of long memory time series models. Journal of time series analysis, 4(4):221–238, 1983.
  • Hagan et al. [2002] P.S. Hagan, D. Kumar, A. Lesniewski, and D.E. Woodward. Managing smile risk. Wilmott Magazine, 1:84–108, 2002.
  • Heston [1993] S.L. Heston. A closed-form solution for options with stochastic volatility with applications to bond and currency options. The Review of Financial Studies, 6(2):327–343, 1993.
  • Jacod and Protter [1998] J. Jacod and P. Protter. Asymptotic error distributions for the euler method for stochastic differential equations. The Annals of Probability, 26(1):267–307, 1998.
  • Kolmogorov [1940] A. Kolmogorov. The Wiener spiral and some other interesting curves in Hilbert space. Doklady akad. nauk SSSR, 26(2):115–118, 1940.
  • Kuck and Maderitsch [2019] K. Kuck and R. Maderitsch. Intra-day dynamics of exchange rates: New evidence from quantile regression. Quarterly review of economics and finance, 71:247–257, 2019.
  • Lallouache and Abergel [2013] M. Lallouache and F. Abergel. Empirical properties of the foreign exchange interdealer market. arXiv preprint, 1307.5440, 2013.
  • Lo [1991] A.W. Lo. Long-term memory in stock market prices. Econometrica, 59(5):1279–1313, 1991.
  • Lo and MacKinlay [1988] A.W. Lo and A.C. MacKinlay. Stock market prices do not follow random walks: Evidence from a simple specification test. Review of Financial Studies, 1(1):41–66, 1988.
  • Mandelbrot and van Ness [1968] B. Mandelbrot and J. van Ness. Fractional Brownian motions, fractional noises and applications. SIAM review, 10(4):422–437, 1968.
  • Parkinson [1980] M. Parkinson. The extreme value method for estimating the variance of the rate of return. Journal of business, 53(1):61–65, 1980.
  • Robert and Rosenbaum [2011a] C.Y. Robert and M. Rosenbaum. A new approach for the dynamics of ultra-high-frequency data: The model with uncertainty zones. Journal of Financial Econometrics, 9(2):344–366, 2011a.
  • Robert and Rosenbaum [2011b] C.Y. Robert and M. Rosenbaum. Volatility and covariation estimation when microstructure noise and trading times are endogenous. Mathematical Finance, 22(1):133–164, 2011b.
  • Rogers [2019] L.C.G. Rogers. Things we think we know. Preprint, available at https://www.skokholm.co.uk/lcgr/downloadable-papers/, 2019.
  • Rootzen [1980] H. Rootzen. Limit distributions for the error in approximations of stochastic integrals. The Annals of Probability, 8(2):241–251, 1980.
  • Stein and Stein [1994] E. Stein and J. Stein. Stock price distributions with stochastic volatility: An analytic approach. The Review of Financial Studies, 4(4):727–752, 1994.
  • Šapina et al. [2020] M. Šapina, M. Garcin, K. Kramarić, K. Milas, D. Brdarić, and M. Pirić. The Hurst exponent of heart rate variability in neonatal stress, based on a mean-reverting fractional Lévy stable motion. Fluctuation and noise letters, 19(3):2050026, 2020.