This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

thanks: ES was supported in part by Simons Grant 636383.thanks: JDM was supported in part by Simons Grant 601972.

Computing Lyapunov Exponents using Weighted Birkhoff Averages

E. Sander esander@gmu.edu Department of Mathematical Sciences, George Mason University, Fairfax, VA 22030, USA    J.D. Meiss jdm@colorado.edu Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
Abstract

The Lyapunov exponents of a dynamical system measure the average rate of exponential stretching along an orbit. Positive exponents are often taken as a defining characteristic of chaotic dynamics. However, the standard orthogonalization-based method for computing Lyapunov exponents converges slowly—if at all. Many alternatively techniques have been developed to distinguish between regular and chaotic orbits, though most do not compute the exponents. We compute the Lyapunov spectrum in three ways: the standard method, the weighted Birkhoff average (WBA), and the “mean exponential growth rate for nearby orbits” (MEGNO). The latter two improve convergence for nonchaotic orbits, but the WBA is fastest. However, for chaotic orbits the three methods convergence at similar, slow rates. Though the original MEGNO method does not compute Lyapunov exponents, we show how to reformulate it as a weighted average that does.

Lyapunov exponents, weighted Birkhoff averages, MEGNO, chaos
pacs:
05.45.-a, 02.70.-c

I Introduction

Lyapunov exponents are a fundamental gauge of chaotic behavior in dynamical systems. They measure the growth rate of the distance between a pair of (infinitesimally) close orbits, and a positive exponent is often taken as a primary indicator for “sensitive dependence on initial conditions,” one of the principal requirements for chaos. Formally, given a differentiable map f:MMf:M\to M on an nn-dimensional phase space MM, let x0Mx_{0}\in M denote an initial point and v0Tx0Mnv_{0}\in T_{x_{0}}M\cong{\mathbb{R}}^{n} a deviation vector. These evolve under the system

xt+1\displaystyle x_{t+1} =f(xt),\displaystyle=f(x_{t}), (1)
vt+1\displaystyle v_{t+1} =Df(xt)vt,\displaystyle=Df(x_{t})v_{t},

for tt\in{\mathbb{N}}. The Lyapunov exponent for (x0,v0)TM(x_{0},v_{0})\in TM is then the growth rate of the norm of vtv_{t}:

μT(x0,x0)=1Tln(vTv0),\displaystyle\mu_{T}(x_{0},x_{0})=\frac{1}{T}\ln\left(\frac{\|v_{T}\|}{\|v_{0}\|}\right), (2)
μ(x0,v0)=lim supTμT(x0,v0)\displaystyle\mu(x_{0},v_{0})=\limsup_{T\to\infty}\mu_{T}(x_{0},v_{0})

if this limit exists. Equivalently, the time TT exponent can also be written as the time average of the exponential stretching factors

μT(x0,v0)\displaystyle\mu_{T}(x_{0},v_{0}) =1Tt=0T1st,\displaystyle=\frac{1}{T}\sum_{t=0}^{T-1}s_{t}, (3)
st\displaystyle s_{t} ln(vt+1vt),\displaystyle\equiv\ln\left(\frac{\|v_{t+1}\|}{\|v_{t}\|}\right),

since the sum is telescoping. Note that (3) is the time average of the stretching function S:TMS:TM\to{\mathbb{R}} on the tangent bundle, defined by st=S(xt,vt)s_{t}=S(x_{t},v_{t}), so that

S(x,v)ln(Df(x)vv).S(x,v)\equiv\ln\left(\frac{\|Df(x)v\|}{\|v\|}\right). (4)

Convergence of μT\mu_{T} as TT\to\infty almost everywhere with respect to an invariant measure was proven in Oseledec’s multiplicative ergodic theorem under certain restrictions [1, 2, 3, 4].

A similar process can be used to compute the spectrum of exponents. A standard technique is repeated application of Gram-Schmidt orthogonalization [5, 6, 7, 8, 9]. Given an initial orthonormal basis Q0=(q0(1),q0(2),q0(n))Q_{0}=(q_{0}^{(1)},q_{0}^{(2)},\ldots q_{0}^{(n)}), one iterates and then orthogonalizes:

p(j)=Df(xt)qt(j)z(j)=p(j)k=1j1p(j),z(k)z(k)2z(k),j=1,n.\begin{array}[]{l}p^{(j)}=Df(x_{t})q^{(j)}_{t}\\ z^{(j)}=p^{(j)}-\sum_{k=1}^{j-1}\frac{\langle p^{(j)},z^{(k)}\rangle}{\|z^{(k)}\|^{2}}z^{(k)}\end{array},\quad\,j=1,\ldots n.

Normalization of the orthogonal basis Z=(z(1),,z(n))Z=(z^{(1)},\ldots,z^{(n)}) then gives the scaling factors and new orthonormal basis

rt+1(j)=z(j),qt+1(j)=z(j)rt+1(j).r^{(j)}_{t+1}=\|z^{(j)}\|,\quad q^{(j)}_{t+1}=\frac{z^{(j)}}{r^{(j)}_{t+1}}.

Iterating this process along an orbit {xt}\{x_{t}\} gives a sequence QtQ_{t} of orthogonal matrices and growth factors rt(j)r^{(j)}_{t}. The spectrum of Lyapunov exponents then becomes

μ(j)(x0)\displaystyle\mu^{(j)}(x_{0}) =\displaystyle= lim supTμT(j)(x0), where\displaystyle\limsup_{T\to\infty}\mu_{T}^{(j)}(x_{0}),\mbox{ where } (5)
μT(j)(x0)\displaystyle\mu^{(j)}_{T}(x_{0}) =\displaystyle= 1Tt=0T1ln(rt+1(j)),\displaystyle\frac{1}{T}\sum_{t=0}^{T-1}\ln(r^{(j)}_{t+1}),

when the limit exists. We define the stretching for the jthj^{th} exponent by

R(j)(xt,vt)\displaystyle R^{(j)}(x_{t},v_{t}) =\displaystyle= ln(rt(j)), and let \displaystyle\ln(r_{t}^{(j)}),\,\mbox{ and let }\, (6)
R\displaystyle R =\displaystyle= (R(1),,R(n)).\displaystyle(R^{(1)},\dots,R^{(n)})\,.

There has been a large amount of research on computing Lyapunov exponents, and more generally on methods for distinguishing chaotic orbits. Accurate computation of μ\mu is difficult—the convergence of the limit (2) is often no faster than ln(T)/T\ln(T)/T [10]. There have been many attempts to accelerate this convergence [11, 12, 13, 14]; however, such an acceleration seems to be difficult. Indeed, it often is difficult to even determine if μ0\mu\neq 0; for example, orbits that are very close to regular regions in a Hamiltonian system can be chaotic but have arbitrarily small maximal Lyapunov exponents [15]. Nevertheless, specialized methods may help in some cases, for example, dynamics conjugate to an incommensurate rotation [16, 17, 18] or random matrix products of shears [19].

Our goal in this paper is to investigate if the weighted Birkhoff average (WBA) [20, 21, 22] can help to accurately compute Lyapunov exponents. As we will see, for smooth maps and nonchaotic orbits, CC^{\infty}-smooth, weighted averages can compute (non-positive) Lyapunov exponents with super-polynomial convergence, meaning with convergence faster than 1/Tk1/T^{k} for all kk\in{\mathbb{N}}. On the other hand, just as for functions on phase space [21], we will see that a weighted average usually does not improve the rate of convergence of the μT(j)\mu^{(j)}_{T} when the orbit is chaotic.

Instead of accurately computing μ\mu, many methods, such as frequency analysis, fast Lyapunov indicators, 0-1 Test, SALI, GALI, MEGNO, REM, RLI, etc., have been developed with a weaker goal: that of distinguishing chaotic from regular motion, see e.g., [23, 24, 25, 26, 27, 28, 10, 29]. Many of these methods have been compared in [30]. However as we showed in [21, 22], the WBA for a function on phase space can efficiently distinguish between regular and chaotic orbits by its convergence rates. Consequently, if this is the only goal, it would be more efficient to compute the WBA for a function since in this case only iterates of the map ff and not those of its derivatives are needed. In [21] we compared the convergence of the WBA to that of conventional Lyapunov exponents as well as to the 0-1 test [25].

The paper proceeds as follows. Section II recalls the weighted Birkhoff average. In §II.2 we show that the “mean exponential growth of nearby orbits” (MEGNO) method [10] can be reformulated as a weighted average method, but not one that is CC^{\infty} smooth at the endpoints. In §III, we compare five weight functions for estimating the time average in (5). Several example maps that we think of as “typical” are discussed in §III.1. Finally, in §III.2 we discuss some outliers; maps which have unexpected speed-up or slow-down of convergence. These examples include maps with shear, that are noninvertible, and those with fixed Jacobian. Our conclusions appear in §IV.

II Weighted average methods

In this section, we review weighted average methods. Since (3) is a dynamical time average, its convergence is related to that implied by Birkhoff’s ergodic theorem, which states that time averages equal space averages for L1(M,)L^{1}(M,{\mathbb{R}}) functions on an ergodic invariant set, see e.g., [31]. Unfortunately, the convergence of such averages is typically slow, i.e., no faster than 1/T1/T [32]. A technique for accelerating converge of time averages is the method of the weighted Birkhoff average developed by [20]. In this work, an average like that in (3) for the stretching function (6) is replaced by

𝑊𝐵T(R)(x0,v0)=t=0T1wT(t)R(xt,vt).\mathit{WB}_{T}(R)(x_{0},v_{0})=\sum_{t=0}^{T-1}w_{T}(t)R(x_{t},v_{t}). (7)

Here wT:[0,T]w_{T}:[0,T]\to{\mathbb{R}} is a normalized weight, which we write in the form

wT(t)=1NTg(tT),NT=t=0T1g(tT),w_{T}(t)=\frac{1}{N_{T}}g\left(\tfrac{t}{T}\right),\quad N_{T}=\sum_{t=0}^{T-1}g\left(\tfrac{t}{T}\right), (8)

for an unnormalized weight function g:[0,1]+g:[0,1]\to{\mathbb{R}}^{+}.

In general the acceleration of the convergence of 𝑊𝐵T\mathit{WB}_{T} for functions on phase space relies on the fact that g(τ)g(\tau) is a bump function: it vanishes at 0 and 11 and is smooth on the closed interval [0,1][0,1]. In particular it has been proven that when the orbit lies on an invariant torus on which the dynamics is conjugate to a rigid rotation with incommensurate frequency, and the map and conjugacy are sufficiently smooth, then such a bump function improves the convergence of the Birkhoff average of a sufficiently smooth function on phase space [20, 33, 34]. In the best cases, the convergence is super-polynomial (faster than 1/Tk1/T^{k} for any kk\in{\mathbb{Z}}) or even exponential.

Our goal in the current paper is to investigate when this improvement extends to computations of the Lyapunov spectrum (5).

II.1 Bump Functions

In this paper we will test several weight functions gg (8) to see how they influence the convergence of the time average (7) for the Lyapunov spectrum.

The standard choice for gg is the CC^{\infty} bump function of [20] 111Other possibilities with varying “widths” were explored in [33]. For time averages of functions on phase space, (9) was found to have the best convergence properties.

gwba(τ)={e(τ(1τ))1τ(0,1)0τ=0,1.g_{\rm wba}(\tau)=\left\{\begin{array}[]{ll}e^{-\left({\tau(1-\tau)}\right)^{-1}}&\tau\in(0,1)\\ 0&\tau=0,1\end{array}\right.. (9)

We also consider the skewed bump function

gskew(τ)={e(τ2(1τ2))1τ(0,1)0τ=0,1.g_{\rm skew}(\tau)=\left\{\begin{array}[]{ll}e^{-\left({\tau^{2}(1-\tau^{2})}\right)^{-1}}&\tau\in(0,1)\\ 0&\tau=0,1\end{array}\right.. (10)

This is still CC^{\infty}, but is no longer symmetric about τ=12\tau=\tfrac{1}{2}—it has a maximum at τ=12\tau=\tfrac{1}{\sqrt{2}}. As a third example, we will use the function

gleft(τ)={e(τ(112τ))1τ(0,1)0τ=0,1,g_{\rm left}(\tau)=\left\{\begin{array}[]{ll}e^{-\left(\tau(1-\tfrac{1}{2}\tau)\right)^{-1}}&\tau\in(0,1)\\ 0&\tau=0,1\end{array}\right.,\quad (11)

which is CC^{\infty} and monotone increasing on [0,1)[0,1) but has a discontinuity at t=1t=1. This will test whether the computations of μT\mu_{T} are more sensitive to initial transient behavior in the stretching of the vector vtv_{t}. These functions are shown in Fig. 1.

Refer to caption

Figure 1: Normalized exponential bump (9), (2,0)(2,0) MEGNO (15), “skew” (10) and “left” (11) weight functions. The weights are normalized to have unit area, instead of using the sum as in (8).

II.2 MEGNO

In [35], a modification of the average (3) was introduced to give a chaos indicator that they entitle the “mean exponential growth rate for nearby orbits” (MEGNO). As reviewed in [10], a generalized MEGNO can be formulated, labeled by a pair integers (m,n)(m,n). It is obtained by first computing a weighted average of the stretching factor (4) for the first tt iterates: 222To be consistent with the definition of the stretch (4), and the concept of a bump function, we shift the indices by one step from [10, Eqs. 4.38-39].

Ym,n(t)=(m+1)tnj=0tjmS(xj,vj).Y_{m,n}(t)=(m+1)t^{n}\sum_{j=0}^{t}j^{m}S(x_{j},v_{j}).

The (m,n)(m,n) MEGNO is obtained as an additional time average of this quantity:

Y¯(m,n)(T)=1Tm+n+1t=0T1Ym,n(t).\bar{Y}_{(m,n)}(T)=\frac{1}{T^{m+n+1}}\sum_{t=0}^{T-1}Y_{m,n}(t).

Note that we can reorder this double sum to obtain an expression that closely resembles the weighted average (7):

Y¯(m,n)(T)=t=0T1WTmeg(t)S(xt,vt),\bar{Y}_{(m,n)}(T)=\sum_{t=0}^{T-1}W^{meg}_{T}(t)S(x_{t},v_{t}),

where the MEGNO “weight” is effectively

WTmeg(t)=m+1Tm+n+1tmj=tT1jn.W^{meg}_{T}(t)=\frac{m+1}{T^{m+n+1}}t^{m}\sum_{j=t}^{T-1}j^{n}\;. (12)

Though Y¯(m,n)\bar{Y}_{(m,n)} now has a form similar to (7), the weight function is not normalized: MEGNO does not attempt to compute an accurate value for μT\mu_{T}. We propose that a normalized weight function would be more appropriate, so we rescale (12) to define

wTmeg(t)=1NT(tT)m1Tj=tT1(jT)n,w_{T}^{meg}(t)=\frac{1}{N_{T}}\left(\frac{t}{T}\right)^{m}\frac{1}{T}\sum_{j=t}^{T-1}\left(\frac{j}{T}\right)^{n},\\ (13)

where NTN_{T} is the normalization constant as in (8). Then the MEGNO-weighted average for the Lyapunov exponent is defined using this weight in (7).

According to [10], the most useful cases of MEGNO correspond to (m,n)=(1,1)(m,n)=(1,-1) and (2,0)(2,0). We will compare our results only with the second case since it was shown to converge more rapidly. When n=0n=0 the sum in (13) is trivial. Normalizing then gives

wTmeg(t)=12t2(Tt)T2(T21).w^{meg}_{T}(t)=12\frac{t^{2}(T-t)}{T^{2}(T^{2}-1)}. (14)

Note that this function vanishes at t=0t=0 and TT: it is a bump function like those in §II.1. To compare more directly with these, we rescale time and set the interval to [0,1][0,1] to obtain

gmeg(τ)=τ2(1τ).g_{meg}(\tau)=\tau^{2}(1-\tau). (15)

and then wTmegw_{T}^{meg} is given by the normalization (8). This weight is C1C^{1} at the endpoint τ=0\tau=0, but only C0C^{0} at τ=1\tau=1. This function is the orange curve in Fig. 1.

III Numerical results

In this section, we compare the performance of the averages defined in §II by computing the Lyapunov spectrum for several example maps with regular and chaotic orbits. The first case, in §III.1, exemplifies what we believe is the “typical” behavior. We then describe in §III.2 cases where the behavior is atypical due to special properties of the dynamics.

We first compute Lyapunov exponents using the Gram-Schmidt method and standard, unweighted average (5). Then we compute the exponents using the four weighting functions: exponential (9), skew (10), left (11), and (2,0)(2,0)-MEGNO (15). In each case our goal is to understand how the averages converge as TT\to\infty. The errors at time TT could be computed if we knew the theoretical values of the exponents, μ(j)\mu^{(j)}; however, in most cases these are not known.

Instead, we estimate the exponents by using the values μT(j)\mu_{T^{*}}^{(j)} at a fixed, large TT^{*} to give an estimate of the “true” answer. In order to avoid bias, rather than choosing a fixed “truth”, each method produces its own estimate. We will say that a Lyapunov exponent converges as TkT^{-k} if

|𝑊𝐵T(R)μT|Tk for 1TT,|\mathit{WB}_{T}(R)-\mu_{T^{*}}|\sim T^{-k}\quad\mbox{ for }\quad 1\ll T\ll T^{*},

i.e., if a log-log plot of the error has slope k-k over some interval.

III.1 Typical Convergence

Recall that the weighted Birkhoff average 𝑊𝐵T(h)\mathit{WB}_{T}(h) for a function hC(M,)h\in C^{\infty}(M,{\mathbb{R}}) on phase space converges slowly when an orbit is chaotic but for “regular” orbits (those smoothly conjugate to incommensurate rotations) it converges at a rate rate determined by the smoothness of the weight function—for a CC^{\infty} weight, such as (9), this can be super-polynomial [20, 21, 22, 34]. By contrast, the standard unweighted average nominally convergences at best as T1T^{-1} [32].

Here we similarly observe that the Lyapunov spectrum also converges slowly whenever the orbit is chaotic, regardless of the method used. But for a regular orbit, the weighted averages do typically enhance the convergence of the Lyapunov spectrum.

As a first example, consider the three-dimensional “discrete Lorenz map” [36]

x\displaystyle x^{\prime} =y\displaystyle=y (16)
y\displaystyle y^{\prime} =0.85x+ν2y+yz\displaystyle=-0.85x+\nu_{2}y+yz
z\displaystyle z^{\prime} =0.95zy2.\displaystyle=0.95z-y^{2}\;.

Here we fix two of the parameters (ν1=0.85\nu_{1}=-0.85 and ν3=0.95\nu_{3}=0.95, in the notation of [36]) and allow only the parameter ν2\nu_{2} to vary. For (16), the determinant of the Jacobian is independent of the point: det(Df)=ν1ν3=.8075\det(Df)=\nu_{1}\nu_{3}=-.8075. This implies that the sum of the exponents should be

d=ln(|det(Df)|)0.2138122238853254.d^{*}=\ln(|\det(Df)|)\approx-0.2138122238853254.

However, in the computations below, we do not use this since we want to test convergence of the individual exponents (5).

Figure 2 shows the three Lyapunov exponents for this map as a function of ν2\nu_{2}, computed using the standard WBA weight (9) with the iterative Gram-Schmidt method (5). The orbit is arbitrarily chosen to start at (0,0.01,0.0001)(0,-0.01,0.0001), discarding the first 40004000 iterates to remove transients. The exponents are computed using (7) for the next T=2(10)4T=2(10)^{4} iterates. In all cases, |μ(1)+μ(2)+μ(3)d|<1014|\mu^{(1)}+\mu^{(2)}+\mu^{(3)}-d^{*}|<10^{-14}, consistent with the constant Jacobian determinant. This excellent convergence follows from the fact that, up to floating point error, ln(rt(1))+ln(rt(2))+ln(rt(3))=d\ln(r_{t}^{(1)})+\ln(r_{t}^{(2)})+\ln(r_{t}^{(3)})=d^{*} for each tt, so the sum of the averages is the average of the sum, with or without a weight function.

Refer to caption

Figure 2: The three Lyapunov exponents for the discrete Lorenz map for 10001000 values of ν2[1.83,1.95]\nu_{2}\in[1.83,1.95]. These were computed using WBA weight (9) with T=2(10)4T=2(10)^{4}. The dashed lines mark the values of ν2\nu_{2} used in Fig. 3.

As shown in [36], the fixed point (x,y,z)=(0,0,0)(x,y,z)=(0,0,0) of the map (16) is stable up to ν2=1.85\nu_{2}=1.85 where it undergoes a pitchfork bifurcation—at this point the largest exponent (blue in the figure) hits zero. The newly created pair of fixed points lose stability at ν21.8645\nu_{2}\approx 1.8645 in Neimark-Sacker bifurcations. The resulting pair of attracting circles or periodic orbits have basins of attraction limited by the stable and unstable manifolds of the origin, and there can be additional attractors. At ν2=1.87\nu_{2}=1.87 there is a chaotic, Lorenz-like attractor, as shown in Fig. 3(a). This attractor has a single positive Lyapunov exponent and a tangential exponent of zero. Using the exponential weight (9) with T=2(10)6T^{*}=2(10)^{6} gives the exponents

μ(0.0039858,0.0000000,0.2177981).\mu\simeq(0.0039858,0.0000000,-0.2177981).

As ν2\nu_{2} varies, the Lorenz-like attractor can collapse onto an attracting circle; for example this occurs at ν2=1.8785\nu_{2}=1.8785, see Fig. 3(c). For this attracting circle the maximal Lyapunov exponent is zero, and again using the exponential weight (9), we find (to higher accuracy)

μ(0.0000000000000000.0001609910512610.213651232834031),\mu\simeq\left(\begin{array}[]{c}0.000000000000000\\ -0.000160991051261\\ -0.213651232834031\end{array}\right),

for the same TT^{*}. These cases are marked with vertical dashed lines in Fig. 2.

The convergence of the largest exponent using the five weight functions of §II is shown in Fig. 3, panels (b) and (d), for ν2=1.87\nu_{2}=1.87 and 1.87851.8785, respectively. These plots show the errors for 100100 logarithmically spaced values of T[900,1.5(10)6]T\in[900,1.5(10)^{6}]. In each case the first 40004000 iterates of the initial condition are discarded to remove transients. The error is estimated by comparing to μT(1)\mu^{(1)}_{T^{*}} with T=2(10)6T^{*}=2(10)^{6}.

As seen in Fig. 3(b), all of the methods perform poorly for the chaotic attractor at ν2=1.87\nu_{2}=1.87: the convergence is at best like T1T^{-1}. Consistent with this, one might believe the results to 5 or 6 digits at T=106T=10^{6}. The convergence of the second and third Lyapunov exponents (not shown) is essentially indistinguishable from that shown in panel (b).

For the invariant circle at ν2=1.8785\nu_{2}=1.8785, the WBA and skew weights, which are CC^{\infty}, far outperform the other methods. The best convergence is for the skew weight; it essentially reaches machine precision by T=3(10)5T=3(10)^{5}. The MEGNO weight (15) also gives increased convergence at a rate nearing T2T^{-2}. The left and constant (regular) weights still converge as T1T^{-1}: there is no improvement since these weight functions are not continuous.

Refer to caption Refer to caption Refer to caption Refer to caption

Figure 3: Two attractors for the discrete Lorenz map (16) and corresponding errors for the largest Lyapunov exponent. Panels (a) and (b): Chaotic attractor with ν2=1.87\nu_{2}=1.87. The weighted averages show no improvement over the “regular” method. Panels (c) and (d): Invariant circle at ν2=1.8785\nu_{2}=1.8785. The smoothly weighted methods converge much more quickly. The curves in panels (b) and (d) correspond to the unweighted average (regular), exponential bump (WBA) (9), (2,0)(2,0)-MEGNO (15), left (11), and skew (10) weights as labeled.

The behavior seen in Fig. 3 appears to be typical: we have seen similar performance for convergence of the Lyapunov spectrum in many other simulations for chaotic and nonchaotic orbits for various maps including the discrete Lorenz map for other parameter values, the classic Hénon map, the Derived from Anosov (DA) map [37], and the 2D torus map studied in [20, Section 3.7.3]. Similar behavior is also seen for Poincaré maps of flows, such for a periodically forced, double-well Duffing oscillator [38].

III.2 Outliers

We now describe several cases where the convergence does not follow the pattern seen in §III.1.

Dynamics with Shear: Integrable symplectic maps have families of invariant tori whose rotation vectors generically vary across tori: they have shear. This results in the linear growth of the length of vectors transverse to the tori, and this gives the slow convergence

μTln(T)T\mu_{T}\sim\frac{\ln(T)}{T}

to zero for (3). It has long been known [39] that shear causes a similar slow convergence even when the map is not integrable, whenever the orbit lies on an invariant torus [28]. It appears to be better for the weighted cases, as we describe below and see in Fig. 4.

For example, consider the 3D volume-preserving map for (x,y,z)𝕋2×(x,y,z)\in{\mathbb{T}}^{2}\times{\mathbb{R}} [22]:

x\displaystyle x^{\prime} =x+z+12(51)mod1\displaystyle=x+z^{\prime}+\tfrac{1}{2}(\sqrt{5}-1)\mod 1 (17)
y\displaystyle y^{\prime} =y+2(z)2+0.4mod1\displaystyle=y+2(z^{\prime})^{2}+0.4\mod 1
z\displaystyle z^{\prime} =z0.02(sin(2πx)+sin(2πy)+sin(2π(xy)).\displaystyle=z-0.02\left(\sin(2\pi x)+\sin(2\pi y)+\sin(2\pi(x-y)\right)\;.

The initial point (x,y,z)=(0,0,0.05)(x,y,z)=(0,0,-0.05) appears to lie on an invariant two-torus that is a graph over (x,y)(x,y) on which the dynamics has the incommensurate rotation vector ω(0.544519,0.411571)\omega\simeq(0.544519,0.411571) [22]. All three of the Lyapunov exponents for this orbit are zero: the two tangential exponents vanish because the invariant set is a two-torus, and the transverse exponent is then zero because of volume-preservation. Nevertheless, since the rotation vector varies with zz, the map has shear and the length of a vector transverse to the torus will grow linearly. This should result in slow convergence of the exponents.

The convergence of a Birkhoff average of the function h=cos(2πx)h=\cos(2\pi x) and of the largest Lyapunov exponent are shown in Fig. 4 for the five weight functions of §II. The figure shows the averages for 100100 values of T[800,1.5(10)7]T\in[800,1.5(10)^{7}]. Since the map is volume preserving, no transient removal is needed. For the CC^{\infty} weight functions the convergence rate of the Birkhoff average is excellent, as we would expect for a quasiperiodic orbit [22]. The errors for the Lyapunov exponent in panel (b) were computed by comparing to μ(1)=0\mu^{(1)}=0. Note that this convergence is very slow—even though the orbit is nonchaotic. The other two Lyapunov exponents (not shown) also have the same convergence rates.

We observe similar behavior for other parameters and orbits of the map (17) as well as for nonchaotic orbits of the 2D Chirikov standard map.

Refer to caption Refer to caption

Figure 4: Convergence of weighted averages for an invariant torus of the map (17). Panel (a): time averages of cos(2πx)\cos(2\pi x). For the CC^{\infty} weights this average converges to machine precision by T3(10)5T\approx 3(10)^{5}. Panel (b): convergence of a Lyapunov exponent to zero. This appears to converge as lnT/T\ln T/T for the regular average and as 1/T1/T for the weighted averages. Note the vertical scales are different in the two panels.

Weak Chaos: A dynamical system has weak chaos when the Lyapunov exponents are not positive, but it still has sensitive dependence on initial conditions. Such dynamics can lead to strange nonchaotic attractors (SNA), where the orbit lies on a geometrically strange (fractal) set, [40, 41]. In this case we have previously observed that averages converge slowly for both Lyapunov exponents and functions on phase space, and adding a weight function does not improve convergence [33, 42].

Noninvertible maps: Noninvertibility makes the computation of Lyapunov exponents more delicate. Denote the set of points where the Jacobian of the map is singular by

J0={x:det(Df(x))=0}.J_{0}=\{x:\det(Df(x))=0\}.

If an invariant set intersects J0J_{0}, it may be nonsmooth and even self-intersecting, see [43, 44, 45] and references therein. If an ergodic component intersects J0J_{0}, even if it is smooth, some of the exponents will be undefined, since a zero determinant implies that rt(j)=0r_{t}^{(j)}=0 for some jj, so that lnrt(j)\ln r_{t}^{(j)} is infinite. Moreover, the average (5) will also be undefined for initial conditions on the dense, countably infinite set of preimages of J0J_{0}. Of course it is still possible for the Lyapunov exponents to exist almost everywhere. However, from the numerical standpoint, if an orbit nears the dense set of singular points, then at least one rt(j)0r_{t}^{(j)}\approx 0, and this will lead to significant floating point errors.

This phenomenon is shown in Fig. 5 for a so-called Tinkerbell map [7]

x\displaystyle x^{\prime} =0.33x0.6y+x2y2\displaystyle=0.33x-0.6y+x^{2}-y^{2} (18)
y\displaystyle y^{\prime} =2x+0.5y+2xy.\displaystyle=2x+0.5y+2xy\;.

In this case J0J_{0} is a circle centered at (0.2,0.65)(-0.2,-0.65) with radius 0.125\sqrt{0.125} that intersects the attractor, see Fig. 5(a). The Lyapunov exponents for this attractor are nonpositive, μ(0,0.14292)\mu\simeq(0,-0.14292), and as seen in Fig. 5(b), the convergence of μ(1)\mu^{(1)} for the CC^{\infty} bump functions is excellent: since J0J_{0} intersects the orbit transversally, rt(1)r_{t}^{(1)} is never near zero. However, rt(2)r_{t}^{(2)} does get arbitrarily close to zero infinitely often on the orbit. As seen in Fig. 5(c), this results in slow convergence of the second Lyapunov exponent for all methods. For this case, the attractor is smooth, and we observe that the convergence of 𝑊𝐵T(h)\mathit{WB}_{T}(h) using (9) for functions on phase space (such as h=cos(2πx)h=\cos(2\pi x) and the rotation number) is excellent (not shown). The implication is that for this case, the numerical difficulities are restricted to the smaller Lyapunov exponent.

Refer to caption Refer to caption

Figure 5: For the Tinkerbell map, the left panel depicts the attractor, the set of points with a singular Jacobian (J0J_{0}), and the first iterate of these points (J1J_{1}). The right panels show the errors for μT(1)\mu_{T}^{(1)} and μT(2)\mu_{T}^{(2)} as a function of TT (note the difference in vertical scales). The second exponent converges slowly for all methods.

Constant Jacobian: When the map ff has a constant, hyperbolic Jacobian, the convergence rates of 𝑊𝐵T(S)\mathit{WB}_{T}(S), (7) are enhanced for smooth weight functions. Indeed, when Df=ADf=A is constant, the method given in (5) is equivalent to a computing the singular values σj\sigma_{j} of AA via normalized simultaneous power iteration. As long as 0σj+1<σj<<σ10\leq\sigma_{j+1}<\sigma_{j}<\dots<\sigma_{1} (the strict inequality is the generic case), we have rt(j)σjr_{t}^{(j)}\to\sigma_{j} with error 𝒪(|σj+1/σj|t){\cal O}(|\sigma_{j+1}/\sigma_{j}|^{t}) [46]. As a result, the errors in the weighted average (5), occur only for the initial transients. The implication is that the left (11) and skew (10) weights perform better than the others, since they suppress the initial portion of the average. In contrast, when the orbit is chaotic, 𝑊𝐵T(h)\mathit{WB}_{T}(h) for functions hh converges slowly for any weight function.

We have verified this improved convergence of μT(j)\mu^{(j)}_{T} and slow convergence of other averages for examples including Arnold’s cat map and the skinny baker map [7], both of which are uniformly hyperbolic.

IV Conclusions

The computation of the Lyapunov exponent for orbits of a dynamical system can be formulated as a time average of the stretching function S(xt,vt)S(x_{t},v_{t}) along a trajectory. As noted in [20], these averages can be computed using bump functions, similar to those used to compute Birkhoff averages of functions on phase space. An advantage of this smoothing is that the results can be super-polynomially convergent when the trajectory is regular. We extended these ideas to compute the full Lyapunov spectrum (5) using weighted averages (7).

In §III we showed that a CC^{\infty} weight function typically gives super-convergence of the Lyapunov spectrum on nonchaotic orbits, just as it does for functions on phase space. There are exceptions, however, including invariant sets that are tori with transverse shear; these are common in the Hamiltonian or symplectic case. Moreover, having nonpositive Lyapunov spectrum is not sufficient for super-convergence, as the example of weak-chaos shows. In addition, attractors of non-invertible maps, even if they are non-chaotic, can have slow convergence of some exponents due to singularities. Finally, convergence of exponents can be enhanced by a smooth weight for the simplest of chaotic systems: those that are uniformly hyperbolic with constant Jacobian.

As we showed in §II, the MEGNO chaos detection method of [35, 10] can be reformulated as a weighted time average which gives the maximal Lyapunov exponent. The weight function in this case, however, is not CC^{\infty} and this results in slower convergence of the average. Since CC^{\infty} weight functions have essentially the same computational cost as less smooth functions, there seems to be no reason to use a less smooth weight.

The results in this paper still leave open the question: is is possible to devise a technique for efficiently and accurately computing the Lyapunov spectrum for a typical, chaotic invariant set?

References