This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\extraauthor

Dorian S. Abbot \extraaffilDepartment of the Geophysical Sciences, University of Chicago \extraauthorJonathan Weare \extraaffilCourant Institute of Mathematical Sciences, New York University

Path properties of atmospheric transitions: illustration with a low-order sudden stratospheric warming model

Abstract

Many rare weather events, including hurricanes, droughts, and floods, dramatically impact human life. To accurately forecast these events and characterize their climatology requires specialized mathematical techniques to fully leverage the limited data that are available. Here we describe transition path theory (TPT), a framework originally developed for molecular simulation, and argue that it is a useful paradigm for developing mechanistic understanding of rare climate events. TPT provides a method to calculate statistical properties of the paths into the event. As an initial demonstration of the utility of TPT, we analyze a low-order model of sudden stratospheric warming (SSW), a dramatic disturbance to the polar vortex which can induce extreme cold spells at the surface in the midlatitudes. SSW events pose a major challenge for seasonal weather prediction because of their rapid, complex onset and development. Climate models struggle to capture the long-term statistics of SSW, owing to their diversity and intermittent nature. We use a stochastically forced Holton-Mass-type model with two stable states, corresponding to radiative equilibrium and a vacillating SSW-like regime. In this stochastic bistable setting, from certain probabilistic forecasts TPT facilitates estimation of dominant transition pathways and return times of transitions. These “dynamical statistics” are obtained by solving partial differential equations in the model’s phase space. With future application to more complex models, TPT and its constituent quantities promise to improve the predictability of extreme weather events, through both generation and principled evaluation of forecasts.

1 Introduction and background

The polar winter stratosphere typically supports a strong, cyclonic polar vortex, maintained by the thermal wind relation and meridional temperature gradient. A sudden stratospheric warming (SSW) event is a large excursion from this normal state, which can take many different forms. In split-type SSWs, the vortex splits completely in two. In displacement-type SSWs the vortex displaces far away from the pole (these can be considered wavenumber-2 and wavenumber-1 disturbances, respectively). Both types are “major” warmings, as the mean zonal wind reverses. In a “minor” warming, the zonal wind slows down significantly without completely reversing (Butler et al., 2015).

SSW is a rare event occurring about twice every three years, depending on the definition used (Butler et al., 2015). Its effects can propagate downward into the troposphere, altering the tropospheric jetstream and inducing extreme midlatitude surface weather events, including cold spells and precipitation (Baldwin and Dunkerton, 2001; Thompson et al., 2002). Abrupt cold spells severely stress infrastructures, economies and human lives, and every bit of extra prediction lead time is helpful for adaptation. Unfortunately, numerical weather prediction struggles to forecast SSW at any lead time longer than about two weeks (Tripathi et al., 2016). Understanding SSW is therefore important for practical forecasting as well as science, but this task remains difficult. Several different geophysical fields are often used as indices of SSW onset. One simple indicator is zonal-mean zonal wind at 6060^{\circ}N, which defines thresholds for minor and major warming (Charlton and Polvani, 2007; Butler et al., 2015). Another common indicator is the 10hPa geopotential height field, which was used by Inatsu et al. (2015) to estimate a fluctuation-dissipation relation in its leading empirical orthogonal functions (EOFs). Many studies have examined SSW precursors and dominant pathways through simulation and observation. Limpasuvan et al. (2004), for instance, catalogued the various wavenumber forcings, heat fluxes and zonal wind anomalies that accompanied each stage of SSW events from reanalysis data. While planetary wave forcing from the troposphere is an accepted proximal cause of SSW, the polar vortex’s susceptibility to such forcing, or “preconditioning”, is a nontrivial and debated function of its geometry (Albers and Birner, 2014; Bancalá et al., 2012). Tropospheric blocking is also thought to be linked to SSW; Martius et al. (2009) and Bao et al. (2017) found blocking to precede many major SSW events of the past half century. The diversity and complex life cycle of SSWs makes it difficult to build a unified picture of their onset.

In this article, we are interested in developing a detailed understanding of transition events between two states, at least one of which is typically long-lived. Consider, for example, a particle with position x(t)x(t) moving in the double well potential energy landscape V(x)=x44x22V(x)=\frac{x^{4}}{4}-\frac{x^{2}}{2} (illustrated in Figure 1) and forced by stochastic white noise W˙\dot{W}: x˙=V(x)+σW˙\dot{x}=-V^{\prime}(x)+\sigma\dot{W}. If the system starts in the left well, it will tend to remain there a while, but occasionally the stochastic forcing will push it over the barrier into the right well. The natural predictor for this event is the committor: the probability of reaching the right well before the left well.

We denote this function by q(x)q(x), which solves the Kolmogorov backward equation (to be introduced later). For this simple system the equation takes a form which can be solved exactly:

{V(x)q(x)+σ22q′′(x)=0x(1,1)q(1)=0,q(1)=1\displaystyle\begin{cases}-V^{\prime}(x)q^{\prime}(x)+\frac{\sigma^{2}}{2}q^{\prime\prime}(x)=0&x\in(-1,1)\\ q(-1)=0,\ \ q(1)=1\\ \end{cases} (1)
q(x)=1xexp(2σ2V(x))𝑑x11exp(2σ2V(x))𝑑x\displaystyle\implies q(x)=\frac{\int_{-1}^{x}\exp\big{(}\frac{2}{\sigma^{2}}V(x^{\prime})\big{)}\,dx^{\prime}}{\int_{-1}^{1}\exp\big{(}\frac{2}{\sigma^{2}}V(x^{\prime})\big{)}\,dx^{\prime}} (2)

Note that the boundary conditions are implied by the probabilistic interpratation.

The committor is plotted in the right panel of Figure 1 for various noise levels. N.B., the potential landscape picture is not fully general, but is a useful mental model. The equations that determine q(x)q(x) will be presented in section 2.

In the case of SSW, the long-lived states are the steady and disturbed circulation regimes of the stratospheric polar vortex. Recent work by Yasuda et al. (2017) has studied SSW in an equilibrium statistical mechanics framework, with these two stable states as saddle points of energy functionals. TPT takes a complementary non-equilibrium view, describing the long-time (steady-state) statistics of trajectories between the two states. For example, TPT introduces a probability density of reactive trajectories (or “reactive density”) indicating the regions where trajectories tend to spend their time en route from A to B. The system is said to be reactive at a point in time if it has most recently visited AA and will next visit BB. The associated probability current of reactive trajectories (or “reactive current”) indicates the preferred direction and speed of transition paths. These detailed descriptors of the mechanism underlying a rare event can be expressed in terms of probabilistic forecasts like the committor q(x)q(x), the probability of entering state A before reaching state B from a given initial condition xx (not in either A or B). The committor is the ideal probabilistic forecast in the usual variance-minimizing sense of conditional expectations (Durrett, 2013). Any other predictor of a transition derived through experiments and observations, such as vortex preconditioning and forcing at different wavenumbers (Albers and Birner, 2014; Bancalá et al., 2012; Martius et al., 2009; Bao et al., 2017) necessarily corresponds to an approximation of the committor.

Ensemble simulation is a commonly used method to estimate the committor at a single given initial condition by measuring the fraction of ensemble trajectories that achieve the rare event. This is challenging because many simulations are needed to generate enough rare events for significant statistical power. Recent and ongoing work aims to channel this computing power more efficiently in weather simulation using importance sampling and large deviation theory (Hoffman et al., 2006; Weare, 2009; Vanden-Eijnden and Weare, 2013; Ragone et al., 2018; Dematteis et al., 2018; Plotkin et al., 2019; Webber et al., 2019). The committor, and other quantities of interest, are fundamentally averages over sample trajectories. We can also express them as solutions to a concrete set of partial differential equations (PDEs) using basic stochastic calculus. TPT provides a framework to exploit these quantities and enhance our understanding of rare events, both from simulation data and from the fundamental equations of motion.

Figure 1: The committor function for a double-well potential under the dynamics x˙=V(x)+σw˙\dot{x}=-V^{\prime}(x)+\sigma\dot{w}. Panel (a) shows the potential function V(x)V(x), and panel (b) shows the committor function. The committor has value zero on the left minimum, one on the right minimum, and one half at the top of the barrier. The stronger the stochastic forcing, the less the actual potential shape matters and the more gradual the committor’s slope. For small noise, the dynamics become more deterministic and the committor approaches a step function, since x(t)x(t) will directly approach whichever minimum is closer.
Refer to caption

TPT has been applied primarily to molecular dynamics simulation to determine reaction rates and pathways of complex conformational transitions (Vanden-Eijnden and E, 2010; Metzner et al., 2006; E and Vanden-Eijnden, 2006); however, the framework does not depend on the details of the underlying dynamical system. TPT deals particularly with stochastically forced systems, such as Brownian dynamics of particles. Stochastic forcing applies quite generally; while the climate system is deterministic in principle, nonlinear interactions between resolved and unresolved scales inevitably leads to resolution-dependent model errors that can be approximated as stochastic. Hasselmann (1976) originally formulated stochastic climate models to capture the influence of quickly evolving “weather” variables on the slowly evolving “climate” variables. Stochastic parameterization remains an active area of research. For example, Franzke and Majda (2006) had success in capturing energy fluxes of a 3-layer quasigeostrophic model by projecting onto ten EOF modes and treating the remainder as stochastic forcing. Kitsios and Frederiksen (2019) addressed the challenge of designing consistent numerical schemes for subgrid-scale parametrization. Deep convection in the atmosphere and turbulence in the ocean boundary layer are two examples of multiscale processes that are especially challenging to resolve.

The aim of this article is to introduce the key quantities and relations describing the path properties of rare atmospheric events. Computing those quantities for more complicated systems than the low order model studied here is a significant and, we argue, worthwhile challenge. We do not address that computational challenge here. Instead we note that development of approximation techniques for TPT and related quantities in high dimensional settings is an active area of research Thiede et al. (2019); Bowman et al. (2009); Chodera and Noe (2014). These methods incorporate both simulated and observational data and we outline them briefly in the conclusion.

The paper is organized as follows. Section 2 describes the dynamical model we use, building on work by Ruzmaikin et al. (2003) and Birner and Williams (2008). Section 3 describes the mathematical framework of TPT, with detailed, but informal, derivations mainly put in the supplement. Section 4 explains the methodology and results particular to this model.

2 Dynamical model

Holton and Mass (1976) studied “stratospheric vacillation cycles,” a certain kind of minor warming in which zonal wind oscillates on a roughly seasonal timescale. They posited a mechanism of wave-mean flow interaction, which continues to be an important modeling paradigm. The quasi-geostrophic equations are confined to a β\beta-plane channel from 6060^{\circ} to the north pole, and the streamfunction is perturbed from below by orographically induced planetary waves, specified through the lower boundary condition. The Holton-Mass model combines the zonal-mean flow equations,

u¯tf0v¯\displaystyle\frac{\partial{\overline{u}}}{\partial{t}}-f_{0}\overline{v} =0\displaystyle=0 (3)
f0u¯\displaystyle f_{0}\overline{u} =Φ¯y\displaystyle=-\frac{\partial{\overline{\Phi}}}{\partial{y}} (4)
t(Φ¯z)+N2w¯\displaystyle\frac{\partial{}}{\partial{t}}\bigg{(}\frac{\partial{\overline{\Phi}}}{\partial{z}}\bigg{)}+N^{2}\overline{w} =α(Φ¯zRT¯H)y(vΦz¯)\displaystyle=-\alpha\bigg{(}\frac{\partial{\overline{\Phi}}}{\partial{z}}-\frac{R\overline{T}^{*}}{H}\bigg{)}-\frac{\partial{}}{\partial{y}}(\overline{v^{\prime}\Phi_{z}^{\prime}}) (5)
v¯y+1ρsz(ρsw¯)\displaystyle\frac{\partial{\overline{v}}}{\partial{y}}+\frac{1}{\rho_{s}}\frac{\partial{}}{\partial{z}}(\rho_{s}\overline{w}) =0\displaystyle=0 (6)

and the linearized quasigeostrophic potential vorticity equation,

(t+u¯x)q\displaystyle\bigg{(}\frac{\partial{}}{\partial{t}}+\overline{u}\frac{\partial{}}{\partial{x}}\bigg{)}q^{\prime} +βeψx+f02ρsz(αρsN2ψz)=0\displaystyle+\beta_{e}\frac{\partial{\psi^{\prime}}}{\partial{x}}+\frac{f_{0}^{2}}{\rho_{s}}\frac{\partial{}}{\partial{z}}\bigg{(}\frac{\alpha\rho_{s}}{N^{2}}\frac{\partial{\psi^{\prime}}}{\partial{z}}\bigg{)}=0 (7)
where q\displaystyle\text{where }\ \ \ \ q^{\prime} :=2ψ+1ρsz(f02N2ρsψz)\displaystyle:=\nabla^{2}\psi^{\prime}+\frac{1}{\rho_{s}}\frac{\partial{}}{\partial{z}}\bigg{(}\frac{f_{0}^{2}}{N^{2}}\rho_{s}\frac{\partial{\psi^{\prime}}}{\partial{z}}\bigg{)} (8)
and βe\displaystyle\text{and }\ \ \ \beta_{e} =β2u¯y21ρsz(ρsf02N2u¯z).\displaystyle=\beta-\frac{\partial^{2}{\overline{u}}}{\partial{y}^{2}}-\frac{1}{\rho_{s}}\frac{\partial{}}{\partial{z}}\bigg{(}\rho_{s}\frac{f_{0}^{2}}{N^{2}}\frac{\partial{\overline{u}}}{\partial{z}}\bigg{)}. (9)

These equations represent (2) zonal and (3) meridional momentum balance, (4) conservation of energy, (5) conservation of mass, and (6) a combination of all these for the perturbations. Overbars and primes represent zonal averages and perturbations. Φ\Phi is the geopotential height; H=7kmH=7\,km is the atmospheric scale height; z=Hln(pp0)z=-H\ln(\frac{p}{p_{0}}) is log-pressure; ρs=ρ0ez/H\rho_{s}=\rho_{0}e^{-z/H} is a standard density profile; α=α(z)\alpha=\alpha(z) is an altitude-dependent damping coefficient; and T¯=T¯(y,z)\overline{T}^{*}=\overline{T}^{*}(y,z) is the radiative equilibrium temperature. Holton and Mass (1976) studied the solution consisting of single Fourier modes in the zonal and meridional directions:

u¯(y,z,t)\displaystyle\overline{u}(y,z,t) =U(z,t)siny\displaystyle=U(z,t)\sin\ell y (10)
ψ(x,y,t)\displaystyle\psi^{\prime}(x,y,t) =Re{Ψ(z,t)eikx}ez/2Hsiny\displaystyle=\text{Re}\{\Psi(z,t)e^{ikx}\}e^{z/2H}\sin\ell y (11)

where ψ\psi^{\prime} is the zonal perturbation of ψ=(g/f0)Φ\psi=(g/f_{0})\Phi. k=2/(acos60)k=2/(a\cos 60^{\circ}) and =3/a\ell=3/a are the zonal and meridional wavenenumbers, where aa is Earth’s radius. These wavenumbers are commonly observed in real SSWs and used in theoretical studies (Birner and Williams, 2008; Ruzmaikin et al., 2003; Yoden, 1987; Holton and Mass, 1976); a split-type SSW is an extreme wavenumber-two perturbation. The lower boundary condition at z=zBz=z_{B} (the tropopause) is

Ψ(zB,t)=gf0h(t)\displaystyle\Psi(z_{B},t)=\frac{g}{f_{0}}h(t) (12)

where hh is a topographically induced perturbation to geopotential height at the tropopause. Holton and Mass found that for a certain range of hh, this system has qualitatively different regimes: a steady eastward zonal flow close to radiative equilibrium, and a weaker zonal flow with quasi-periodic “vacillations” from eastward to westward, even under constant forcing. Each vacillation cycle consists of a sudden warming and cooling over the timescale of weeks. Although these individual cycles are interesting weather events unto themselves, in this paper we think of the vacillations as occurring within a general climate regime that is conducive to sudden warming, as opposed to the steady flow state, which is not. Transitions between these two regimes, which we focus on here, are more accurately described as climatological shifts than weather events. The study by Ruzmaikin et al. (2003) varies hh on an interannual timescale, with each single winter season occupying one of the two stable states and generating its daily weather accordingly. Hence, for this paper we will use the term “climate transitions.”

The original Holton-Mass model discretizes the above PDE with finite differences across 27 vertical levels, which is assumed to be close to a continuum limit. Following several studies at this resolution (Holton and Mass, 1976; Yoden, 1987; Christiansen, 2000), Ruzmaikin et al. (2003) did the most severe truncation possible, resolving only three vertical levels (including fixed boundaries) for easy analysis and exploration of parameter space. This reduces phase space to only three degrees of freedom: U(t)U(t), which modulates u¯\overline{u} as a sine jet; X(t)=Re{Ψ(t)}X(t)=\text{Re}\{\Psi(t)\}; and Y(t)=Im{Ψ(t)}Y(t)=\text{Im}\{\Psi(t)\}. XX and YY modulate the amplitude and phase of the perturbation streamfunction:

ψ(x,y,t)\displaystyle\psi^{\prime}(x,y,t) =(XcoskxYsinkx)ez/2Hsiny\displaystyle=(X\cos kx-Y\sin kx)e^{z/2H}\sin\ell y (13)

Carrying the Ansatz through the QG equations, Ruzmaikin et al. (2003) derived the following system:

X˙\displaystyle\dot{X} =1τ1XrY+sUYξh+δwh˙\displaystyle=-\frac{1}{\tau_{1}}X-rY+sUY-\xi h+\delta_{w}\dot{h} (14)
Y˙\displaystyle\dot{Y} =1τ1Y+rXsUX+ζhU\displaystyle=-\frac{1}{\tau_{1}}Y+rX-sUX+\zeta hU (15)
U˙\displaystyle\dot{U} =1τ2(UUR)ηhYδΛΛ˙\displaystyle=-\frac{1}{\tau_{2}}(U-U_{R})-\eta hY-\delta_{\Lambda}\dot{\Lambda} (16)

The primary control parameter, hh, represents topographic forcing and other sources of planetary waves, such as land-sea ice contrast. While Ruzmaikin et al. (2003) also varies Λ\Lambda, representing vertical wind shear, we will only vary hh and set Λ\Lambda constant. Time derivatives h˙\dot{h} and Λ˙\dot{\Lambda} are zeroed, removing transient forcings such as seasonality effects. Appendix A and Ruzmaikin et al. (2003) have a detailed list of parameters.

Remarkably, this hugely simplified model retains the qualitative structure of the Holton-Mass model as a bistable system for a certain range of hh between the critical values h120mh_{1}\approx 20\,m and h2160mh_{2}\approx 160\,m, as shown in the bifurcation diagram of Figure 2. Blue points represent the normal state of the vortex, in approximate thermal wind balance with the radiative equilibrium temperature field (henceforth called the “radiative solution”). Red points represent a disturbed vortex, with weaker zonal wind and vacillations. This climatological regime supports more SSW events, and is henceforth called the “vacillating solution.” We use the same blue-red color scheme consistently here to represent these two states. Transitions between them happen on interannual time scales, affecting each year’s likelihood of SSW events. The structure of transitions is illustrated in Figure 3: as hh increases slowly past the bifurcation threshold h2h_{2}, the system enters a series of rapid, large-amplitude oscillations that spiral into the weaker-circulation state.

Figure 2: Fixed points of Equations (14)-(16) in the state space (X,Y,U)(X,Y,U), where XX and YY represent the real and imaginary parts of the streamfunction and UU the mean zonal wind amplitude. Fixed points vary as a function of the topographic forcing parameter, hh. Panels (a), (b) and (c) show fixed points of XX, YY and UU respectively on the vertical axis, while hh varies across the horizontal axis. Circles and crosses denote linearly stable and unstable fixed points, respectively. The range of hh between 20m\sim 20\,m and 160m\sim 160\,m supports three fixed points, two stable and one unstable. In this range, the blue points correspond to the radiative solution, while the red points represent the vacillating regime. This corresponds to a winter climatology that is conducive to sudden stratospheric warming events.
Refer to caption
Figure 3: Trajectories in (X,Y,U)(X,Y,U) space, where XX and YY represent the real and imaginary parts of the streamfunction and UU the mean zonal wind amplitude. In this simulation, the topographic forcing hh increases linearly from 0m0\,m to 200m200\,m in 1300 days. (a) shows the fixed points, with colors blue, red and black for the radiative solution (AA), the vacillating solution (BB) and the unstable fixed point between them respectively. The trajectory of UU over time is superimposed in gray. (b) plots this same curve parametrically, in XUXU space. Before the bifurcation, the trajectories follow the existing fixed point; after the bifurcation, they spiral into the new fixed point through a series of “vacillations.”
Refer to caption

In Figures 2 and 3, transitions require crossing the bifurcation threshold h2h_{2}, where the radiative solution ceases to exist. Birner and Williams (2008) introduced additive white-noise forcing in the UU variable to model unresolved gravity waves and found that these perturbations were sufficient to excite the system out of its normal state and into a vacillating regime. In Figure 4 we illustrate stochastic trajectories of the system for three different (fixed) values of hh. (For numerical reasons we also add a small amount of independent white noise to XX and YY variables). Even when hh is far below h2h_{2}, transitions still occur, and in fact the preference for the vacillating solution branch increases quickly with hh.

Figure 4: Stochastic trajectories of the system for various fixed values of the parameters hh (topographic forcing) and σ3\sigma_{3} (amplitude of stochastic forcing). Panels (a), (b) and (c) show U(t)U(t) for three different forcing levels: h=25,35,45mh=25,35,45\,m with σ3=0.5m/s/day1/2\sigma_{3}=0.5\,m/s/day^{1/2}. In keeping with the bifurcation diagrams, the blue, black and red lines mark the radiative, unstable and vacillating solutions respectively. Note that their relative positions vary slightly with hh, as fixed points depend on parameters. As hh increases from left to right, the systems spends increasingly more of its time in the vacillating state. Panel (d) shows a parametric plot of the transitions through (X,U)(X,U) space, for h=35mh=35\,m (another view of panel (b)). The transition happens seven times, and hence panel (d) shows seven different transition paths superimposed on each other. Most of the transitions follow a similar characteristic curve through XUXU space, with a rapid decrease in UU followed by a decrease in XX.
Refer to caption

Birner and Williams (2008) used direct numerical simulation and the Fokker-Planck equation to calculate long-term occupation statistics, i.e., how much time on average was spent in each regime and the mean first passage time before a transition to the vacillating regime, all for a range of forcing and noise levels. Our approach differs in both target and methodology. We aim to characterize the transition process between the two states, to monitor its progress in real time, as well as to describe statistics of the transition over many realizations. Methodologically, transition path theory phrases these questions in terms of the generator of the stochastic process, a differential operator that encodes all information about the behavior of the process.

3 Path properties

TPT characterizes the steady state statistics of transitions between states. In this section, we introduce key quantities needed to introduce TPT as applied to the Ruzmaikin model to obtain a more complete picture than we get from the sample paths shown above. For mathematical details, see the supplement and background literature (E and Vanden-Eijnden, 2006; Metzner et al., 2006; Vanden-Eijnden and E, 2010).

a. Infinitesimal Generator

The noisy Ruzmaikin model can be expressed compactly as a stochastic differential equation (SDE)—specifically a diffusion process—in the variable Z=(X,Y,U)3Z=(X,Y,U)\in\mathbb{R}^{3} (or a more general phase space Ω\Omega) with a deterministic drift vector b(z)=(b1(z),b2(z),b3(z))b(z)=\big{(}b_{1}(z),b_{2}(z),b_{3}(z)\big{)} and a 3×33\times 3 diffusion matrix σ(z)\sigma(z).

dZt=b(Zt)dt+σ(Zt)dWt\displaystyle dZ_{t}=b(Z_{t})\,dt+\sigma(Z_{t})\,dW_{t} (17)

Here, WtW_{t} is a 3-vector of independent Brownian motions. We use the Ito convention for stochastic integration. While σ\sigma can in principle be any zz-dependent matrix, we make σ\sigma diagonal and constant: σ(z)=diag(σ1,σ2,σ3)\sigma(z)=\text{diag}(\sigma_{1},\sigma_{2},\sigma_{3}), creating independent additive noise in the XX, YY and UU variables. σ1\sigma_{1} and σ2\sigma_{2} have units of m2/s/day1/2m^{2}/s/day^{1/2}, while σ3\sigma_{3} has units of m/s/day1/2m/s/day^{1/2}. Associated with this equation is the infinitesimal generator \mathcal{L}, an operator describing the evolution of observable functions forward in time following a trajectory. Explicitly, if f()f(\cdot) is a smooth function of phase space variables, then

f(z)\displaystyle\mathcal{L}f(z) :=ddt𝔼[f(Zt)|Z0=z]|t=0\displaystyle:=\frac{d}{dt}\mathbb{E}[f(Z_{t})|Z_{0}=z]\Big{|}_{t=0} (18)

where 𝔼\mathbb{E} is an expectation over sample paths.

Ito’s lemma (the chain rule for diffusion SDEs) gives the Kolmogorov backward equation, which represents \mathcal{L} as a partial differential operator (for details see Pavliotis (2014)):

f(z)\displaystyle\mathcal{L}f(z) =ibi(z)f(z)zi+12i,j(σσ)ij2f(z)zizj\displaystyle=\sum_{i}b_{i}(z)\frac{\partial{f(z)}}{\partial{z_{i}}}+\frac{1}{2}\sum_{i,j}(\sigma\sigma^{\top})_{ij}\frac{\partial^{2}f(z)}{\partial z_{i}\partial z_{j}} (19)
=b(z)f(z)+Tr(12σσHf(z))\displaystyle=b(z)\cdot\nabla f(z)+\text{Tr}\Big{(}\frac{1}{2}\sigma\sigma^{\top}Hf(z)\Big{)} (20)

The diffusion matrix 12σσ\frac{1}{2}\sigma\sigma^{\top} is also called DD for convenience, which we will use interchangeably. HfHf denotes the Hessian matrix: [Hf]ij=2f/zizj[Hf]_{ij}=\partial^{2}f/\partial z_{i}\partial z_{j}. The generator provides path statistics as the solution to PDEs, as illustrated in the following subsections.

b. Equilibrium probability density

This stochastic process admits a time-dependent probability density, ρ(z,t)\rho(z,t), which can be derived from the generator. For example, if the system starts in a known position Z0=zZ_{0}=z, then ρ(z,0)=δ(zz)\rho(z^{\prime},0)=\delta(z-z^{\prime}). The density spreads out from this initial point over time according to the Fokker-Planck equation, which can be written in terms of the adjoint of the generator:

ρ(z,t)t=ρ(z,t)\displaystyle\frac{\partial{\rho(z,t)}}{\partial{t}}=\mathcal{L}^{*}\rho(z,t) (21)
=izi[bi(z)ρ(z,t)+jzj(ρ(z,t)Dij(z))]\displaystyle=\sum_{i}\frac{\partial{}}{\partial{z_{i}}}\Big{[}-b_{i}(z)\rho(z,t)+\sum_{j}\frac{\partial{}}{\partial{z_{j}}}\big{(}\rho(z,t)D_{ij}(z)\big{)}\Big{]} (22)
=[b(z)ρ(z,t)+(ρ(z,t)D(z))]\displaystyle=\nabla\cdot\big{[}-b(z)\rho(z,t)+\nabla\cdot\big{(}\rho(z,t)D(z)\big{)}\big{]} (23)

When DD is constant and diagonal, the last term simplifies to [(ρD)]=iDii2ρ/zi2\nabla\cdot\big{[}\nabla\cdot(\rho D)\big{]}=\sum_{i}D_{ii}\partial^{2}\rho/\partial z_{i}^{2}. In the case of pure Brownian motion, dZt=dWtdZ_{t}=dW_{t}, then b=0b=0 and D=12ID=\frac{1}{2}I, giving the heat equation tρ=122ρ\partial_{t}\rho=\frac{1}{2}\nabla^{2}\rho (2=i2zi2\nabla^{2}=\sum_{i}\frac{\partial^{2}}{\partial z_{i}^{2}}). Assuming that the process is ergodic, the density eventually forgets the initial condition and stabilizes into a long-term (or equilibrium, or stationary) probability density π(z)\pi(z), which solves π=0\mathcal{L}^{*}\pi=0. This can be approximated by either simulating the SDE for a very long time and binning data points, or directly solving the stationary PDE π=0\mathcal{L}^{*}\pi=0, subject to the normalization and positivity constraints Ωπ(z)𝑑z=1\int_{\Omega}\pi(z)\,dz=1 and π(z)0\pi(z)\geq 0.

c. Committor probability

The stationary density is an equilibrium quantity characterizing the long-term occupation statistics. But it is insufficient to describe the events of interest to us, which are transition paths: trajectory segments beginning inside the radiative state and ending inside the vacillating state. Specifically, we define the sets AA and BB as ellipsoids around these two fixed points, respectively. Their size is determined by contours of a local approximation to the stationary density π\pi; see supplement for details. We say that a snapshot ZtZ_{t} of the system is undergoing a transition (or reaction) at time tt if it is on the way from set AA to set BB. This involves information about both its future and its past, for which we introduce the forward and backward committor probabilities in this section.

The forward committor q+q^{+} (denoted qq when context is clear) describes the progress of a stochastic trajectory traveling from set AA to set BB, as follows:

q+(z)\displaystyle q^{+}(z) ={Zt next hits B before A|Z0=z}\displaystyle=\mathbb{P}\{Z_{t}\text{ next hits }B\text{ before }A|Z_{0}=z\} (24)
q+(zA)=0,q+(zB)=1\displaystyle q^{+}(z\in A)=0,\ \ \ \ \ q^{+}(z\in B)=1 (25)

The boundary conditions on AA and BB follow naturally from the probabilistic definition. If the system begins in set AA, by path continuity it will certainly next find itself in AA, with zero chance of hitting BB first. Starting in set BB the opposite is true. The committor therefore obeys the boundary value problem (see supplement for derivation)

{q+(z)=0z(AB)cq+(z)=0zAq+(z)=1zB\displaystyle\begin{cases}\mathcal{L}q^{+}(z)=0&z\in(A\cup B)^{c}\\ q^{+}(z)=0&z\in A\\ q^{+}(z)=1&z\in B\\ \end{cases} (26)

The equivalence of conditional expectations with respect to a Markov process like the committor and solutions to PDE involving the generator of the process are generally referred to as Feynman-Kac relations (Karatzas and Shreve, 1998) and are well studied. The PDE in (25) is most naturally posed on an infinite domain, but as a numerical approximation we solve it in a large rectangular domain and impose homogeneous Neumann conditions at the domain boundary. A limiting example is the noise-dominated case, where b(z)b(z) is negligible and D=ID=I. The Kolmogorov backward equation then becomes Laplace’s equation:

q+(z)=2q+(z)=0\displaystyle\mathcal{L}q^{+}(z)=\nabla^{2}q^{+}(z)=0 (27)

If posed on the interval [0,1][0,1], with A={0}A=\{0\} and B={1}B=\{1\}, the solution is q+(z)=zq^{+}(z)=z. The linear increase from set AA to set BB reflects the greater likelihood of entering BB when beginning closer to it. This limit is reflected in Figure 1, which shows the committor of the double-well potential approaching a straight line for large σ\sigma values.

Prediction is naturally much harder in high-dimensional systems such as stratospheric models. A number of physically interpretable fields, such as zonal wind and geopotential height anomalies, seem to have some predictive power for SSW, but prediction by any single such diagnostic is suboptimal. Insofar as they are successful, these variables approximate certain aspects of the committor. For example, the committor might increase monotonically with the quasi-biennial oscillation index. Furthermore, statistical correlations potentially obscure the conditional relationships needed. For example, Martius et al. (2009) and Bao et al. (2017) examined tropospheric precursors to SSW events in reanalysis records, finding that blocking events preceded most major SSWs, potentially by enhancing upward-propagating planetary waves. (We use “precursor” only to mean an event that sometimes happens before SSW.) Blocking influences SSW through height perturbations at the tropopause, which would enter the Ruzmaikin model as low-frequency variations in lower boundary forcing hh. Since we fix hh constant, the blocking precursor is outside our scope here. However, farther down the dynamic chain are other measurable precursors such vertical wave activity flux and meridional heat flux, which are also found to have predictive power (Sjoberg and Birner, 2012). However comprehensive the model, we would naturally expect the true committor probability to exhibit similar patterns to canonical precursors of that model such as blocking (for a troposphere-coupled model) and heat flux (for a stratosphere-only model). Yet, there is an important difference: while a precursor PP may appear with high probability given that a SSW is imminent, the committor specifies the probability of a SSW given an observed pattern. As acknowledged in Martius et al. (2009), many blocking events did not lead to SSW events, meaning that {SSW|blocking}{blocking|SSW}\mathbb{P}\{\text{SSW}|\text{blocking}\}\neq\mathbb{P}\{\text{blocking}|\text{SSW}\}. Such distinctions highlight the need for a precise mathematical formulation that provides and distinguishes both kinds of information.

While q+q^{+} describes the future of a transition, the backward committor qq^{-} describes its past. It is defined as

q(z)\displaystyle q^{-}(z) :={Zt last visited A rather than B|Z0=z}\displaystyle:=\mathbb{P}\{Z_{t}\text{ last visited }A\text{ rather than }B|Z_{0}=z\} (28)
q(A)=1,q(B)=0\displaystyle q^{-}(A)=1,\ \ \ \ \ q^{-}(B)=0 (29)

qq^{-} solves the time-reversed Kolmogorov backward equation ~q=0\widetilde{\mathcal{L}}q^{-}=0, where ~\widetilde{\mathcal{L}} is the time-reversed generator, which evolves observables backward in time. See supplement for a detailed description of ~\widetilde{\mathcal{L}} and its relationship the forward generator \mathcal{L}.

We now describe the fundamental statistics characterizing transition events as identified by TPT and explain how they can be expressed in terms of quantities such as q+q^{+}, qq^{-}, and π\pi. The probability density of reactive trajectories ρR(z)\rho_{R}(z), the probability of observing the system ZtZ_{t} at the location zz during a transition, is proportional (up to a normalization constant) to the product π(z)q(z)q+(z)\pi(z)q^{-}(z)q^{+}(z). This density is large in regions of phase space that are highly trafficked by reactive trajectories. This is how TPT gives information about precursors, indicating regions of phase space that are usually visited by the system over the course of a transition path.

The direction and intensity of this traffic is specified by the reactive current. To develop this concept, we start by introducing the probability current JJ, a vector field that satisfies a continuity equation with the time-dependent density ρ\rho:

ρt=ρ=J\displaystyle\frac{\partial{\rho}}{\partial{t}}=\mathcal{L}^{*}\rho=-\nabla\cdot J

If ρ\rho were the density and vv the velocity field of a fluid, JJ would be ρv\rho v. One can think of JJ as an instantaneous (in time and position) average over all possible system trajectories, though a precise mathematical description requires some care. In equilibrium, when ρ=π\rho=\pi is no longer changing, J=0\nabla\cdot J=0, or equivalently CJn𝑑S=0\oint_{C}J\cdot n\,dS=0 where CC is any closed surface.

The reactive current JABJ_{AB} is also an “average velocity”, but restricted to reactive paths. Unlike JJ, JABJ_{AB} is not divergence-free, with a source in AA and a sink in BB (where transition paths start and end). JABJ_{AB} is defined implicitly via surface integrals. If CC is any surface enclosing set AA but not set BB, with outward normal nn, then the flux CJABn𝑑S\oint_{C}J_{AB}\cdot n\,dS is the number of forward transitions per unit time, also called the transition rate RABR_{AB}; see Vanden-Eijnden and E (2010) for details. The supplement describes another expression in terms of the generator. The result is (Metzner et al., 2006)

J\displaystyle J =πb(πD)\displaystyle=\pi b-\nabla\cdot(\pi D) (30)
JAB\displaystyle J_{AB} =q+qJ+πD(qq+q+q)\displaystyle=q^{+}q^{-}J+\pi D(q^{-}\nabla q^{+}-q^{+}\nabla q^{-}) (31)

where again π\pi is the stationary density. This expression has intuitive ingredients. Multiplying JJ by q+qq^{+}q^{-} conditions the equilibrium probability current on the trajectory being reactive, meaning en route from AA to BB. The qq+q+qq^{-}\nabla q^{+}-q^{+}\nabla q^{-} reflects the fact that trajectories from AA to BB must ascend a gradient of q+q^{+}, going from q+=0q^{+}=0 to q+=1q^{+}=1, while descending a gradient of qq^{-}.

Just as JAB(z)J_{AB}(z) describes the average reactive velocity, a streamline ztz_{t} of JAB(z)J_{AB}(z) (solving z˙t=JAB(z)\dot{z}_{t}=J_{AB}(z), with z0=aAz_{0}=a\in A and zT=bBz_{T}=b\in B for some T>0T>0) is a kind of “average” transition path. Although the streamline will not be realized by any particular transition path, it will have common geometric features in phase space with many actual path samples. At low noise the reactive trajectories will cluster in a thin corridor about the streamline. The streamline is a more dynamical description of precursors: whereas regions of high reactive density are commonly observed states along reactive trajectories, streamlines of reactive current are commonly observed sequences of states along reactive trajectories. The study by Limpasuvan et al. (2004), for example, described a sequence of events in a prototypical SSW life cycle based on reanalysis including vortex preconditioning, wave forcing, and anomalous heat fluxes at various levels in the troposphere and stratosphere. The sequence described there likely corresponds to a streamline of the reactive trajectory.

The committor also quantifies the relative balance of time spent on the way to each set. If more probability mass lies in the region where q+>12q^{+}>\frac{1}{2}, set BB is globally more imminent, whereas more mass where q+<12q^{+}<\frac{1}{2} indicates set AA is. A single summary statistic of imminence is the average committor during a long trajectory, 𝔼[q+(Zt)]\mathbb{E}[q^{+}(Z_{t})], computed as a weighted average against the equilibrium density:

𝔼[q+(Zt)]=𝔼π[q+(Z)]=Ωq+(z)π(z)dz=:q+π\displaystyle\mathbb{E}[q^{+}(Z_{t})]=\mathbb{E}_{\pi}[q^{+}(Z)]=\int_{\Omega}q^{+}(z)\pi(z)\,dz=:\langle q^{+}\rangle_{\pi} (32)

An average below (above) 1/21/2 would indicate more time spent on the way to to AA (BB).

Another statistic, the forward transition rate, captures the frequency of transitions between AA and BB rather than the overall time spent in each. We earlier defined RABR_{AB} as the number of ABA\to B transitions per unit time. Since a BAB\to A transition must occur between every two ABA\to B transitions, RAB=RBA=:RR_{AB}=R_{BA}=:R. The inverse of the transition rate is the return time, a widely used metric for changing frequency of extreme events under climate change scenarios (Easterling et al., 2000). However, the forward and backward transitions may differ in important characteristics like speed. To capture this asymmetry, we need a dynamical analogue to the equilibrium statistic q+π\langle q^{+}\rangle_{\pi}. The typical quantity of choice is the rate constant kABk_{AB}, which is larger if ABA\to B transitions happen faster than BAB\to A transitions. We therefore normalize by the overall time spent having come from AA, which is qπ\langle q^{-}\rangle_{\pi}.

kAB=Rqπ=Rq(z)π(z)𝑑z\displaystyle k_{AB}=\frac{R}{\langle q^{-}\rangle_{\pi}}=\frac{R}{\int q^{-}(z)\pi(z)\,dz} (33)

This rate constant, defined in Vanden-Eijnden (2014), parallels the chemistry definition. If XAX_{A} and XBX_{B} are two chemical species, with [][\cdot] denoting concentration, the forward and backward rate constants kABk_{AB} and kBAk_{BA} are defined so that

R=[XA]kAB=[XB]kBA\displaystyle R=[X_{A}]k_{AB}=[X_{B}]k_{BA} (34)

In the language of transition path theory, [XA][X_{A}] is the long-term probability of the system existing most recently in state AA, which is qπ\langle q^{-}\rangle_{\pi}. Rates are also expressible in terms of expected passage times. Thinking of [XA][X_{A}] as the total probability of having last visited set AA, 1/kAB=[XA]/R1/k_{AB}=[X_{A}]/R estimates the total transition time between entering AA (having last visited BB) and next re-entering BB. It is these inverse quantities we display in the results section.

These quantities together make an informative description of the typical transition process from AA to BB. We now proceed to analyze the transition path properties of the Ruzmaikin stratospheric model.

4 Methodology

a. Spatial discretization

The quantities of interest described above (π\pi, q+q^{+}, qq^{-}, and JABJ_{AB}) emerge as solutions to PDEs involving the generator \mathcal{L}, which must be approximated by spatial discretization. In the supplement we describe a finite volume scheme to directly discretize the adjoint \mathcal{L}^{*} as a matrix, which we name LL^{*}, on a regular grid in dd dimensions. Here we use the same domain and noise levels as Birner and Williams (2008): 0.06X0.04-0.06\leq X\leq 0.04, 0.05Y0.05-0.05\leq Y\leq 0.05, 0U0.80\leq U\leq 0.8 in units non-dimensionalized in terms of the radius of Earth and the length of a day. We tile this with a grid of 40×40×8040\times 40\times 80 grid cells. We choose a noise constant σ3\sigma_{3} in the UU variable in the range 0.41.50.4-1.5. This is a similar range to observed atmospheric gravity wave momentum forcing (Birner and Williams, 2008). For numerical reasons, we also add small noise to the streamfunction variables XX and YY, in proportion to the domain size. Specifically, as UU spans a range of 0.80.8 and X,YX,Y span a smaller range of 0.10.1, we choose σ1\sigma_{1} and σ2\sigma_{2} to be σ3×(0.1/0.8)\sigma_{3}\times(0.1/0.8). This adjustment does change our results with respect to Birner and Williams (2008), causing more transitions in both directions at lower hh than if only the UU variable were perturbed. While gravity wave drag forces the zonal wind, eddy interactions and other sources of internal variability can perturb the streamfunction as well, and it is not uncommon to represent these effects stochastically (DelSole and Farrell, 1995). There are surely more accurate representations of noise, but this important issue is not our focus. We retain these perturbations for numerical convenience, but stress that the general principles of the TPT framework are independent of any specific form of stochasticity. In the forthcoming experiments, we will refer only to σ3\sigma_{3} with the understanding that σ1\sigma_{1} and σ2\sigma_{2} are adjusted proportionally. The discretization we use has strengths and limitations. Given the matrix LL^{*} on this grid, the discretized generator LL is just the transpose. To ensure certain properties of solutions, such as positivity of probabilities, LL should ideally retain characteristics of the infinitesimal generator of a discrete-space, continuous-time Markov process: rows that sum to zero, and nonnegative off-diagonal entries. Such a discretization is called “realizable” (Bou-Rabee and Vanden-Eijnden, 2015). One can check that our discretization always satisfies the former property, and so is realizable provided small enough grid spacing (δX,δY,δU)(\delta X,\delta Y,\delta U). In our current example, the spacing is not nearly small enough to guarantee this (matrix entries were just as often negative as positive), but results are still accurate, as verified by stochastic simulations to be described in the results section. While we could have used one-sided finite differences to enforce positivity, this would have degraded the overall numerical accuracy of the solutions. We opted instead to zero out negatives, which were always negligible in magnitude.

The discretized Kolmogorov backward equation is Lq+=0Lq^{+}=0, augmented with appropriate boundary conditions. The definition of AA and BB is a design choice that should satisfy three conditions: (1) they are disjoint, (2) AA contains the radiative fixed point and BB the fixed point of the vacillating regime, and (3) both sets are relatively stable in the chosen noise range. We choose AA and BB to be ellipses with orientations determined by the covariance of the equilibrium density of the linearized stochastic dynamics about their respective fixed points, as described in the supplement. The choice of the sizes of AA and BB is a subjective decision which alters the very definition of a reactive trajectory; hence, different sizes emphasize different features of the transition path ensemble, especially in oscillatory systems like this one. We made AA and BB large enough to enclose the many loops that often accompany the escape from AA and the descent into BB, so that we can focus on the relatively rare crossing of phase space. More sophisticated techniques exist for shrinking the two sets while erasing resulting loops (Lu and Vanden-Eijnden, 2014; Banisch and Vanden-Eijnden, 2016); for simplicity, we forgo these techniques for the current study.

Careful discretization is important for constructing the dominant pathways discussed above, i.e. the streamlines ztz_{t} satisfying z˙t=JAB\dot{z}_{t}=J_{AB}. Standard integration techniques such as Euler or Runge-Kutta will accumulate errors, not only from Taylor expansion but also from the discretized solution of q+q^{+}, qq^{-} and π\pi. These can be severe enough to prevent ztz_{t} from reaching set BB. To guarantee that full transitions are extracted, we instead solve shortest-path algorithms on the graph induced by the discretization, as described in Metzner et al. (2009). The supplement contains more details on this computation.

5 Results

We begin this section by describing the kinematic path characteristics of the process in its three-dimensional phase space, according to the quantities described above. Following this purely geometrical description, we will suggest some dynamical interpretations and compare with previous studies. Finally, we will map statistical features as functions of background parameters.

The Ruzmaikin model is attractive for demonstrating use of the tools introduced in Section 3 due to its low-dimensional state space, in which PDEs can be solved numerically using standard methods such as our finite volume scheme. We tested the committor’s accuracy empirically by randomly selecting 50 cells in our grid (this is 0.04%0.04\% of the grid) and evolving n=60n=60 stochastic trajectories forward in time from each, stopping when they reach either set AA or set BB. The fraction of trajectories starting from zz that first reach BB is taken as the empirical committor at point zz and is denoted q~+\widetilde{q}^{+}, which we compare with the predicted committor q+q^{+} from the finite volume scheme. Figure 5 clearly demonstrates the usefulness of the committor for probabilistic forecasting. The left column displays the committor calculated from finite volumes, averaged in the YY direction, for two different forcing levels hh. The right column shows a scatterplot of q~+\widetilde{q}^{+} vs. q+q^{+} at the 50 randomly selected grid cells. We expect the points to fall along the line q~+=q+\widetilde{q}^{+}=q^{+} with some spread proportional to the standard deviation of the binomial distribution, q+(1q+)/n\sqrt{q^{+}(1-q^{+})/n}. Approximating this as a Gaussian, we have plotted red curves enclosing the 95% confidence interval, which indeed contains approximately 95% of the data points. Statistical sampling error can explain the observed level of deviation, although discretization error (from solving the PDE and from time-stepping) may also contribute.

Figure 5: The committor gives the probabilistic forecast of the system, and this plot is an empirical demonstration of its predictive capacity. Here, noise is fixed at σ3=0.5m/s/day1/2\sigma_{3}=0.5\,m/s/day^{1/2}, while topographic forcing h=25mh=25\,m in (a), (b) and 30m30\,m in (c), (d). The left column shows the forward committor q+q^{+} solved by the finite volume method, averaged in the YY direction. The ellipses labeled AA and BB are projections of the actual sets onto the XUXU plane, where XX and UU are the real part of the streamfunction and the mean zonal wind amplitude. Committor values range from 0 (blue) to 1 (red), with the white contour showing the surface q+=12q^{+}=\frac{1}{2}. The right column compares the PDE solution of the committor with a Monte Carlo solution from running many trajectories. For 50 randomly chosen grid points (sampled uniformly across the committor range (0,1)(0,1)), we launched 60 independent stochastic trajectories and counted the fraction that reached set BB first. We call this the empirical committor, q~+\widetilde{q}^{+}. Plots (b) and (d) show q~+\widetilde{q}^{+} vs. q+q^{+} for these 50 random grid cells. The middle red line is the curve q~+=q+\widetilde{q}^{+}=q^{+}, and the envelope around it is the 95% confidence interval for sampling errors, based on a Gaussian approximation to the binomial distribution.
Refer to caption
Figure 6: The committor-1/2 surface is the set of all points in state space where the q+=12q^{+}=\frac{1}{2}, and sets AA and BB have equal probabilities of being visited next. Here the surface is rendered as a set of points and viewed from various vantage points in state space (the supplement shows a video with rotation). The topographic forcing is fixed to h=25mh=25\,m and the noise level to σ3=0.5m/s/day1/2\sigma_{3}=0.5\,m/s/day^{1/2}. The blue and red clusters mark sets AA and BB respectively, centered around the two stable fixed points. The gray points show the location of the surface q+=12q^{+}=\frac{1}{2}. The most striking feature is the “spiral staircase” structure in the low-UU region of phase space. For any given streamfunction phase, the likelihood of heading toward state AA or BB depends sensitively on UU, in an oscillatory manner. Even at very low values of UU, there are narrow channels which are likely to lead back to set AA rather than set BB. This accounts for the blue regions in the lower part of Figure 5. These disappear, however, at higher noise, when set BB overtakes the lower half of the picture.
Refer to caption

Figure 5 also shows how q+q^{+} responds to increasing hh, even far below the bifurcation threshold: the committor values throughout state space become rapidly skewed toward unity (meaning more red in the picture). This means that even slight perturbations can kick the system out of state AA toward state BB. Another indicator is the “isocommittor surface”, the set of points zz such that q+(z)=1/2q^{+}(z)=1/2; that is, the system has equal probability of next entering set AA or BB. In the left-hand column this is the set of gray points (averaging out the variable Y). For low forcing, this surface tightly encloses set BB, meaning the system must wander very close before a transition is imminent. For high forcing values, the isocommittor hugs set AA more closely, meaning that small perturbations from this normal state can easily push the system into dangerous territory. In Figure 6, the isocommittor is shown as a set of gray points in a 3D plot viewed from various vantage points. In the low-UU region, the isocommittor resembles a spiral staircase, reflecting the spiral-shaped stable manifold of the fixed point in set BB. Different initial positions with the same streamfunction phase, differing only slightly in the UU direction, can have drastically different final destinations. These spiral surfaces are responsible for the blue lobes in the lower part of Figure 5, but they disappear at higher noise.

Figures 7 and 8 display numerical solutions of the equilibrium density π\pi and reactive density ρRπqq+\rho_{R}\propto\pi q^{-}q^{+} for two forcing levels. While π\pi indicates where ZtZ_{t} tends to reside, ρR\rho_{R} indicates where ZtZ_{t} resides given a transition from AA to BB is underway. As hh increases, even far below the bifurcation threshold, π\pi responds strongly, shifting weight toward state BB. On the other hand, the reactive density displays similar characteristics for all hh values. In the XUXU plane, the two lobes of high reactive density surrounding AA indicate that zonal wind tends to remain strong for a while before dipping into the weaker regime. Viewing the same field in the XYXY plane (Figure 8) reveals a halo of intermediate density about set AA. While many different motions would be consistent with this pattern, the coming figures verify that the early stages of transition have circular loops in the XYXY plane, meaning zonal movement of the streamfunction’s peaks and troughs. The exact streamfunction phase corresponding to the (X,Y)(X,Y) position is calculated as follows. Recall the streamfunction is ψ=Re{Ψ(t)eikxcos(y)}\psi^{\prime}=\text{Re}\{\Psi(t)e^{ikx}\cos(\ell y)\}, where Ψ=X+iY\Psi=X+iY. In polar coordinates, Ψ=X2+Y2eiϕ\Psi=\sqrt{X^{2}+Y^{2}}e^{i\phi}, where ϕ=tan1(Y/X)\phi=\tan^{-1}(Y/X). The full streamfunction is

ψRe{Ψeikx}\displaystyle\psi^{\prime}\propto\text{Re}\{\Psi e^{ikx}\} =Re{X2+Y2ei(ϕ+kx)}\displaystyle=\text{Re}\{\sqrt{X^{2}+Y^{2}}e^{i(\phi+kx)}\}
=X2+Y2cos(ϕ+2λ)\displaystyle=\sqrt{X^{2}+Y^{2}}\cos(\phi+2\lambda)

where λ\lambda is longitude.

The angle from the origin in the XYXY plane indicates the zonal streamfunction phase, and circular motion indicates zonal movement. (This “looping” motion is indeed shared by the transition path samples shown in Figures 9 and 10, to be described later.)

The darkest (most-trafficked) region of this loop is the sector π4ϕπ2\frac{\pi}{4}\lesssim\phi\lesssim\frac{\pi}{2}. The relationship between (X,Y)(X,Y) and λ\lambda indicates ψ\psi^{\prime} is maximized at longitudes λ={ϕ2,πϕ2}\lambda=\{-\frac{\phi}{2},\pi-\frac{\phi}{2}\}. As the maximum reactive density occurs around ϕ=3π8\phi=\frac{3\pi}{8}, the streamfunction peaks are at {3π16,13π16}{326,146}\{-\frac{3\pi}{16},\frac{13\pi}{16}\}\approx\{326^{\circ},146^{\circ}\}. What is the significance of this phase relative to the lower boundary forcing? Recalling the forcing form Re{Ψ(zB,t)eikx}=hRe{ei2λ}cos(2λ)\text{Re}\{\Psi(z_{B},t)e^{ikx}\}=h\,\text{Re}\{e^{i2\lambda}\}\propto\cos(2\lambda), the bottom peaks are located at λ={0,π}\lambda=\{0,\pi\}. Hence, the bulk of the transition process happens when the perturbation streamfunction at the mid-stratosphere lags the lower boundary condition by 316±116\frac{3}{16}\pm\frac{1}{16} of a wavelength. Meanwhile, the XUXU plane reveals what happens to the zonal wind speed during the SSW transition. The high-reactive density region discussed above coincides with the crescent-shaped bridge of high density between the sets in Figure 7. This suggests that in an SSW, the zonal wind weakens while the streamfunction stays in that particular phase window.

The pictures of reactive density suggest that reactive trajectories tend to loop around set AA, physically meaning the streamfunction tends to travel in one direction before slowing down, but they technically convey no directional information to explicitly support this claim. For this, we turn to the reactive current. We computed the discrete-space effective current matrix fij+f_{ij}^{+}, directly from the finite volume discretization and numerical solutions of π\pi, q+q^{+} and qq^{-}. Physically, this matrix represents the flux of a vector field from grid cell ii to cell jj. From this we calculated the maximum-current paths as described in Metzner et al. (2009) and displayed the results in Figure 9 for a forcing level of h=30mh=30\,m (other levels are qualitatively similar). Both the XUXU and XYXY views are shown. Superimposed on these paths are seven actual reactive trajectories that occurred during a long stochastic simulation, to demonstrate features that are captured by the dominant pathways. The dominant path from AA to BB indeed contains a half-loop in the XYXY plane in the clockwise direction, which means an eastward phase velocity. With a smaller set AA, this dominant path would contain more of these loops. However, during the next transition stage, the streamfunction slows to a halt at the phase angle ϕ=π2\phi=\frac{\pi}{2}, doubles back and travels westward as zonal wind loses strength. The smear of high density in the neighborhood ϕ3π16\phi\sim\frac{3\pi}{16} therefore comprises not only a precipitous drop in zonal wind (which happens at the edge of that region) but also a backtrack, this time with weaker background zonal wind. This behavior is borne out by the trajectory samples, which vacillate in the upper-middle section of the XYXY plot. These paths are displayed as space-time diagrams of the streamfunction in Figure 10. In panel (a), the dominant path’s two loops correspond to two troughs moving east past a fixed longitude before the slowdown. The random streamfunction trajectories shown in (b)-(g) do not follow this representative history exactly, but they do combine elements of it: steady eastward wave propagation followed by slowdown and reversal. Each stage can have multiple false starts. Notably, the slowdown consistently happens at the same phase, with peaks at 120E\sim 120^{\circ}E and 300E300^{\circ}E, at roughly the same phase as found from the density plots. In fact, the figures show a brief slowdown every time the streamfunction passes this phase. This can be thought of as a representation of blocking events that often accompany sudden stratospheric warmings. The third transition path shown is an exception to the general pattern, making a final turn toward the east instead of to the west. This outlier of a reactive trajectory can also be seen in Figure 9, as the single green trajecory that decreases in XX before decreasing in UU instead of the other way around.

This kinematic sequence of events has a dynamical interpretation with precedent in prior literature. A critical ingredient of SSW is meridional eddy heat flux, which in this model takes the form vΦz¯hY\overline{v^{\prime}\Phi_{z}^{\prime}}\propto hY (see the supplement for a detailed derivation.) This term shows up explicitly as a negative forcing on UU in equation (15), showing that a reduced equator-to-pole temperature gradient in turn weakens the vortex via the thermal wind relation. The association of heat flux with SSW has been demonstrated in reanalysis (Sjoberg and Birner, 2012) and in detailed numerical simulations of internal stratospheric dynamics, even with time-independent lower boundary forcing (Scott and Polvani, 2006). This relationship favors the phase ϕ=π2\phi=\frac{\pi}{2} as the most susceptible state for SSW onset, which is exactly picked out by the dominant transition pathway in Figure 9.

However, immediately after the wind starts weakening at ϕ=π2\phi=\frac{\pi}{2}, where the streamfunction lines up with its lower boundary condition, the phase velocity reverses, giving rise to the westward lag of 3π16\frac{3\pi}{16} we observed in the reactive density. A similar phase lag has also been observed in more detailed numerical studies. For example, Scott and Polvani (2006) observed a lag of π2\frac{\pi}{2} across the whole stratosphere (π4\frac{\pi}{4} at the mid-level), quite similar to our result. They found that vortex breakup was preceded by a long, slow build-up phase in which the vortex became increasingly vertically coherent, only to be ripped apart by an upward- and west-propagating wave. In an experiment with slowly-increasing lower boundary forcing, Dunkerton et al. (1981) saw a phase lag across the whole stratosphere that increased from 100\sim 100^{\circ} to 180\sim 180^{\circ} (5050^{\circ} to 9090^{\circ} between the lower boundary and the mid-stratosphere) over the course of the warming event. They attribute this phase tilt to the zonal wind rapidly reversing and carrying the streamfunction along. The weakening zonal wind simply removes the Doppler shift from the Rossby wave dispersion relation, ω=Ukβk/(k2+2)\omega=Uk-\beta k/(k^{2}+\ell^{2}), allowing the waves to revert to their preferred westward phase velocity. This balance is also clear in equations (13-14): ignoring the damping and forcing terms, the dynamics read [X˙,Y˙]=[(sUr)Y,(sUr)X][\dot{X},\dot{Y}]=[(sU-r)Y,-(sU-r)X]. For weak UU, rotation in the XYXY plane is anti-clockwise and phase speed is westward.

Let us re-emphasize the probabilistic interpretation of reactive density. We have found that transitions from AA to BB are accompanied by anomalous increases in meridional heat flux. In other words, {High heat flux|SSW}\mathbb{P}\{\text{High heat flux$|$SSW}\} is high. As noted earlier, this does not imply that {SSW|high heat flux}\mathbb{P}\{\text{SSW$|$high heat flux}\} is also high. The identified streamfunction phase is not a “danger zone” in the sense that a trajectory entering this region is at higher risk of falling into state BB; the committor alone conveys that information. Rather, a trajectory is highly likely to pass through that region given that it is reactive. Notably, reactive trajectories are unlikely to take a straight-line path from AA to BB with UU, XX and YY changing linearly. This unrealistic path would represent a zonally stationary streamfunction growing steadily in magnitude, while zonal wind falls off gradually. At higher noise levels, however, the system would be increasingly dominated by pure Brownian motion, and such a path would become more plausible.

Figure 7: Equilibrium densities π(z)\pi(z) (left column) and reactive densities ρR(z)=π(z)q+(z)q(z)\rho_{R}(z)=\pi(z)q^{+}(z)q^{-}(z) (right column) for two different forcing levels, h=25mh=25\,m (top row) and h=35mh=35\,m (bottom row). π(z)\pi(z) is the long-term probability density of finding the system at point zz at any given time; ρR(z)\rho_{R}(z) is the same probability, but conditional on also being reactive at that time, meaning having last visited AA and next destined to visit BB. Dark color indicates higher density. These densities are summed in the YY direction to give a marginal density as a function of XX and UU. The red and blue ellipses show the projected boundaries of sets AA and BB, respectively. These reactive densities capture the patterns of transition path samples shown in Figure 4, but through continuous fields instead. At low forcing levels, most of the equilibrium density is concentrated around set AA, whereas higher forcing shifts some of the mass to set BB. Meanwhile, the characteristic curved shape of transition paths in phase space is borne out by the reactive densities, with a sickle-shaped high-density region bridging the gap between set AA and BB.
Refer to caption
Figure 8: Same as Figure 7, but projected onto the XYXY plane, which is the complex plane that characterizes the perturbation streamfunction. The center of set AA, approximately at X=Y=0X=Y=0, corresponds to a zonally symmetric streamfunction with no perturbation. Counterclockwise motion of trajectories around AA represents an eastward phase velocity of the streamfunction, which is the dominant modality in the radiative regime. The region of high density above set AA shows the phase in the streamfunction at which zonal wind is most likely to begin to weaken.
Refer to caption
Figure 9: Paths of maximal current superimposed on transition path samples. All four Figures shown the same path, but from different vantage points. While the reactive probability density (Figures 7 and 8) says where transition paths spend their time, the reactive current is a vector field of the transition paths’ local average directionality. The path shown in a color gradient from blue to red is a streamline of this vector field, representing an “average” transition path. The path is colored blue where the local committor is less than 0.5, and red otherwise. Note that the path can cross back and forth. The transition from red to blue, where the path first crosses the threshold q+=12q^{+}=\frac{1}{2} and enters the probabilistic BB basin, is marked by a sudden drop in the UU variable – a deceleration in zonal wind. At the same time, the path’s rotations about AA reverse direction, from clockwise to anti-clockwise, corresponding to a reversal in phase velocity of the streamfunction. This path accurately captures geometric tendencies of actual transition paths; five random samples of reactive trajectories are superimposed in green, the bulk of which cluster around the maximum current path. Panels (a) and (b) show cross-sections in XUXU and XYXY space respectively, while (c) shows a three-dimensional view. The parameters are h=30mh=30\,m and σ3=0.5m/s/day1/2\sigma_{3}=0.5\,m/s/day^{1/2}. Figure 10 shows the corresponding spacetime diagrams of the streamfunction.
Refer to caption
Figure 10: Streamfunctions over time corresponding to the trajectories shown in Figure 9. As ψ(XcoskxYsinkx)\psi^{\prime}\propto(X\cos kx-Y\sin kx), the XX and YY variables represent the phase of the streamfunction, whose movement we plot over time as a space-time diagram. Panel (a) shows the dominant transition path. The phase velocity is initially eastward, matching with the clockwise rotations in the XYXY plane as shown in Figure 9. The waves then slow down and reverse direction, matching with the anti-clockwise turn and zonal wind drop in Figure 9. The vertical axis plays the role of time, but the dominant path technically conveys only geometrical information. Hence, we measure it in discrete steps. Panels (b)-(d) show the streamfunctions over time corresponding to four of the green transition path samples in Figure 9, chosen randomly. Most exhibit the same slow-down and reversal behavior exemplified by the dominant path. The exception is sample (b), which turns to the east at the end of its path. This corresponds to the stray green trajectory visible in Figure 9(a), which enters set BB from above and in the clockwise direction. Samples 1, 2 and 4 undergo some winding before the slowdown, but do slow down every time they reach the same phase.
Refer to caption

We now turn to a quantitative comparison of committors and transition rates for different forcing and noise levels. These trends illustrate the effects of modeling choices and global change on the climatology of SSW. Planetary wave forcing, hh, varies across days and seasons as well as different planets. The strength of additive noise, σ\sigma, is a modeling choice intended to represent gravity wave drag. Different stochastic parametrizations will vary in their effective σ\sigma value, and it is important to understand the sensitivity of SSW to model choices (Sigmond and Scinocca, 2010). Furthermore, long-term climate change may cause both parameters to drift, altering the occurrence of SSW-induced severe weather events.

The measure of the relative “imminence” of a vacillating solution vs. a radiative solution, as described in the background section on committors, is the equilibrium density-weighted average committor, denoted q+π\langle q^{+}\rangle_{\pi}. Figure 11(a) shows this quantity for 25h4525\leq h\leq 45 (mm) and 0.4σ32.00.4\leq\sigma_{3}\leq 2.0 (m/s/day1/2)m/s/day^{1/2}). Two trends are clearly expected from the basic physics of the model. First, as seen in Figures 7 and 8, q+π\langle q^{+}\rangle_{\pi} should increase with hh. Second, in the limit of large noise and infinite domain size, the dominance of Brownian motion will smooth out the committor function and make q+π\langle q^{+}\rangle_{\pi} tend to an intermediate value between zero and one. On the other hand, as noise approaches zero, the dynamical system becomes increasingly deterministic, and the ultimate destination of a trajectory will depend entirely on which basin of attraction it starts in. The boundary, or separatrix, between these two basins is the stable manifold of the third (unstable) fixed point. In the case of a potential system, of the form x˙=V(x)\dot{x}=-\nabla V(x), this boundary would be a literal ridge of the function VV. Our system admits no such potential function, but this is a useful visual analogy. The committor function becomes a step function in the deterministic limit, with the discontinuity located exactly on this boundary. The addition of low noise moves the committor-12\frac{1}{2} surface away from the separatrix, possibly asymmetrically: one basin will shrink, becoming more precarious with respect to random perturbations, while the other will expand, becoming a stronger global attractor. Which basin will shrink is not evident a priori, so we compute the averaged committor, q+π\langle q^{+}\rangle_{\pi}, as a summary statistic which will increase when the basin of BB expands.

Figure 11(a) plots the trends in q+π\langle q^{+}\rangle_{\pi} as a function of hh (along the horizontal axis) and σ3\sigma_{3} (along the vertical axis). The two basic hypotheses are verified: q+π\langle q^{+}\rangle_{\pi} increases monotonically as hh increases, and q+π12\langle q^{+}\rangle_{\pi}\sim\frac{1}{2} as σ\sigma increases, no matter the value of hh. The less predictable behavior is in the range h=35mh=35\,m, 0.75σ31.00.75\leq\sigma_{3}\leq 1.0, where q+π\langle q^{+}\rangle_{\pi} displays non-monotonicity with respect to noise, at low noise levels. As σ3\sigma_{3} increases, the average committor increases from 0.4\sim 0.4 to 0.6\sim 0.6, and then decreases again. The four committor plots at the bottom of 11 illustrate the trend graphically. At low noise, the AA basin includes winding passageways leading from the small-UU region back to AA. Small additive noise closes them off, effectively expanding the BB basin. As noise increases and Brownian motion dominates the dynamics, committor values everywhere relax back to less-extreme values, reflecting the unbiased nature of Brownian motion.

Figure 11: Behavior of the committor as a function of forcing hh and noise σ3\sigma_{3}. (a) shows the average committor (weighted by equilibrium density): q+(x)π(x)𝑑x\int q^{+}(x)\pi(x)\,dx evaluated for a range of hh and σ\sigma values. Panels (b)-(e) show images of the committor for h=30h=30 and σ=0.5, 0.75, 1.0, 1.25\sigma=0.5,\ 0.75,\ 1.0,\ 1.25. The blue lobes in the lower part of the images, a shadow of the spiral structure from Figure 6, thin out and disappear with increasing forcing hh.
Refer to caption

Despite the coarse grid resolution, the first-order effect of noise is clear. At the low and high margins of hh, where falling into state AA and BB respectively is virtually certain, an increase in noise decreases this virtual certainty, and the trend continues at larger noise to attenuate q+π\langle q^{+}\rangle_{\pi} to its limit of 1/2. The middle hh range, however, behaves differently. Whereas h=35mh=35\,m appears to balance out the basin sizes at low noise, a slight noise increase tends to kick the system out of the AA basin and toward BB, more so than the other way around. At higher noise, the committor relaxes back to 1/2. Examining the committor fields, it’s clear that the isocommittor surface location does not move back and forth; rather, it moves toward AA, and then the rest of the field flattens out.

Figure 12 shows trends in the return times of SSW with varying hh and σ3\sigma_{3}. There are several different return times of interest. The first, shown in panel (a), is the total expected time between one transition event and the next, whose reciprocal is the rate RABR_{AB}, the number of transitions per unit time. The return time is a symmetric quantity between AA and BB, since every forward transition is accompanied by a backward one. Among the parameter combinations, (h,σ3)(35m,0.75m/s/day1/2)(h,\sigma_{3})\approx(35\,m,0.75\,m/s/day^{1/2}) is the one which minimizes return time, or equivalently maximizes the transitions per unit time. h=35mh=35\,m is a forcing level which approximately balances out the time spent between the two sets, making transitions relatively common. At lower noise, transitions are exceedingly rare, and at higher noise the two states cease to be long-lived. However, this symmetric quantity does not capture information about the relative speed of transition from AA to BB vs. from BB back to AA. Panel (b) shows a different passage time, which is the average time between the end of a backward (BA)(B\to A) transition and the end of the next forward (ABA\to B) transition, which we call TABT_{AB}. This is computed as the reciprocal of the rate constant kABk_{AB}, as described in the previous section. In other words, the stopwatch begins when the system returns to AA after having last visited BB, and ends when the system next hits BB. This metric is asymmetric: a smaller ABA\to B return time indicates that the forward transition is faster than the backward transition. Panel (b) shows the complementary BAB\to A return time, TBAT_{BA}. Unsurprisingly, an increase in hh causes a decrease in TABT_{AB} and an increase in TBAT_{BA} regardless of the noise level. The noise level has a less obvious effect. Whereas TBAT_{BA} decreases monotonically with increasing noise, regardless of hh, the forward time TABT_{AB} is minimized by a mid-range noise level of σ30.75m/s/day1/2\sigma_{3}\approx 0.75\,m/s/day^{1/2}. This is another reflection of the bias toward state BB that is effected by adding noise to a very low baseline.

Figure 12: Behavior of return times as a function of forcing hh and noise σ3\sigma_{3}. Panel (a) shows the average period between the start of one transition event and the start of the next. Red here means many transitions per unit time, both ABA\to B and BAB\to A. We call this the return time, and calculate it as the reciprocal of RR, the number of forward (or backward) transitions per unit time. There is clearly a parameter set: (h,σ3)(35m,0.75m/s/day1/2)(h,\sigma_{3})\approx(35\,m,0.75\,m/s/day^{1/2}), which optimizes the number of transitions per unit time. Below this noise level, internal variability is scarcely enough to jump between regions. Above this noise level, sets AA and BB are no longer metastable, and excursions are so wide and frequent that passing from set AA to set BB is a very spatially restricted event. Panels (b) and (c), below, distinguish forward and backward transition times. Panel (b) shows the expected passage time TABT_{AB}, the interval between the end of a BAB\to A transition and the end of the next ABA\to B transition. Panel (c) shows the analogous backward passage time, TBAT_{BA}. Note the scales are logarithmic, and here red simply means faster transitions, regardless of which direction is being considered.
Refer to caption

6 Conclusion

Transition path theory is a framework for describing rare transitions between states. We have described TPT along with a number its key ingredients like the forward and backward committor functions. While TPT has been applied primarily to molecular systems, we believe it offers valuable insight into climate and weather phenomena such as sudden stratospheric warming, primarily through committors, reactive densities, and reactive currents. Of interest apart from its role in TPT, the forward committor defines an optimal probabilistic forecast, borne out by direct numerical simulation experiments. The reactive densities and currents describe the geometric properties of dominant transition mechanisms at low noise. In applying TPT to a noisy, truncated Holton-Mass model, we find that transitions tend to begin with a drop in mean zonal wind and a reversal of the streamfunction’s phase velocity at a particular streamfunction phase. This is consistent with the significance of blocking precursors to SSW as found in Martius et al. (2009), insofar as this idealized model can represent them. We also find that noise has a non-monotonic effect on the overall preference for a vacillating state, measured by the average committor. At a forcing of h35mh\approx 35\,m, where the isocommittor surface (essentially the basin boundary of the deterministic dynamics) divides the space approximately in half, we find that raising the noise tilts the balance decisively toward the vacillating solution. Still larger noise evens the whole field out. The transition rate constant shows a similar dependence on hh and σ3\sigma_{3}.

In future work, we plan to scale these methods to more realistic and complex models as well as observational data, where predicting SSW remains an active area of research. In this high-dimensional setting, the generator may be unknown or computationally intractable. Any state space with more than 5\sim 5 degrees of freedom is beyond the reach of a finite-volume discretization, because the number of grid cells increases exponentially with dimension. However, there is a generic insight that physical models evolve on very low-dimensional manifolds within the full available state space. A growing body of research in molecular dynamics (Thiede et al., 2019), fluid dynamics (Giannakis et al., 2018; Froyland and Junge, 2018), climate dynamics (Giannakis and Majda, 2012; Sabeerali et al., 2017), and general multiscale systems (Harlim and Yang, 2018; Berry et al., 2015; Giannakis, 2015, 2019) exploits the intrinsic low-dimensionality to represent the infinitesimal generator more efficiently. While here we represented the generator as a finite-volume or finite-difference operator on a grid, one can also write it in a basis of globally coherent functions, such as Fourier modes, or more generally harmonic functions on a manifold. Given only data, without an explicit form of the dynamics, this manifold and the basis functions can be estimated from (for example) the diffusion maps algorithm, and the generator’s action on this basis can be approximated from short trajectories. These ideas can be applied to computing the dynamical statistics that have been the focus of this paper (Thiede et al., 2019). We hope that these techniques will enable more efficient observation strategies for targeted data assimilation procedures with the goal of tracking the progression of specific extreme events, including hurricanes and heat waves as well as sudden stratospheric warming.

\datastatement

Data from this study will be made available upon request.

Acknowledgements.
Justin Finkel was funded by the Department of Energy Computational Science Graduate Fellowship under grant DE-FG02-97ER25308. We acknowledge support from the National Science Foundation under NSF award number 1623064. Jonathan Weare was supported by the Advanced Scientific Computing Research Program within the DOE Office of Science through award DE-SC0020427. The reviewers of the paper provided invaluable insights on physical interpretation of the mathematical results. In particular, they helped us to recognize the connection between our computed maximum-flux reactive pathway and anomalous heat fluxes which weaken the zonal wind and reverse the streamfunction’s phase velocity. We thank Cristina Cadavid, Thomas Birner and Paul Williams for helpful clarifications about previous work. We thank Eric Vanden-Eijnden, Mary Silber, Robert Webber, Erik Thiede, Aaron Dinner, Tiffany Shaw, and Noboru Nakamura for useful discussions throughout this project. The University of Chicago’s Research Computing Center provided the computational resources to greatly expedite the calculations done here. Below are the numerical coefficients used in the reduced-order Ruzmaikin model, with very similar values to Ruzmaikin et al. (2003) and Birner and Williams (2008). The relationship with physical parameters is described in the appendix of Ruzmaikin et al. (2003). Note that our notation differs slightly: following Birner and Williams (2008), we write the topographic forcing in terms of hh rather than Ψ0=ghf0\Psi_{0}=\frac{gh}{f_{0}}, a difference that results in numerical factors of 1000\sim 1000 depending on the convention used.
τ1\displaystyle\tau_{1} =122.6\displaystyle=122.6 (35)
τ2\displaystyle\tau_{2} =30.4\displaystyle=30.4 (36)
r\displaystyle r =0.63\displaystyle=0.63 (37)
s\displaystyle s =1.96\displaystyle=1.96 (38)
ξ\displaystyle\xi =1.75\displaystyle=1.75 (39)
δw\displaystyle\delta_{w} =70.84\displaystyle=70.84 (40)
ζ\displaystyle\zeta =240.54\displaystyle=240.54 (41)
UR\displaystyle U_{R} =0.47\displaystyle=0.47 (42)
η\displaystyle\eta =9.13×104\displaystyle=9.13\times 10^{4} (43)
δΛ\displaystyle\delta_{\Lambda} =4.91×103\displaystyle=4.91\times 10^{-3} (44)
Λ˙\displaystyle\dot{\Lambda} =0\displaystyle=0 (45)

References

  • Albers and Birner (2014) Albers, J. R., and T. Birner, 2014: Vortex preconditioning due to planetary and gravity waves prior to sudden stratospheric warmings. Journal of the Atmospheric Sciences, 71 (11), 4028–4054, 10.1175/JAS-D-14-0026.1.
  • Baldwin and Dunkerton (2001) Baldwin, M. P., and T. J. Dunkerton, 2001: Stratospheric harbingers of anomalous weather regimes. Science, 96 (5542), 581–584, 10.1126/science.1063315.
  • Bancalá et al. (2012) Bancalá, S., K. Krüger, and M. Giorgetta, 2012: The preconditioning of major sudden stratospheric warmings. Journal of Geophysical Research: Atmospheres, 117 (D4), 10.1029/2011JD016769.
  • Banisch and Vanden-Eijnden (2016) Banisch, R., and E. Vanden-Eijnden, 2016: Direct generation of loop-erased transition paths in non-equilibrium reactions. Faraday Discussions, 195, 443–468, 10.1039/c6fd00149a.
  • Bao et al. (2017) Bao, M., X. Tan, D. L. Hartmann, and P. Ceppi, 2017: Classifying the tropospheric precursor patterns of sudden stratospheric warmings. Geophysical Research Letters, 44 (15), 8011–8016, 10.1002/2017GL074611.
  • Berry et al. (2015) Berry, T., D. Giannakis, and J. Harlim, 2015: Nonparametric forecasting of low-dimensional dynamical systems. Phys. Rev. E, 91, 032 915, 10.1103/PhysRevE.91.032915.
  • Birner and Williams (2008) Birner, T., and P. D. Williams, 2008: Sudden stratospheric warmings as noise-induced transitions. Journal of the Atmospheric Sciences, 65 (10), 3337–3343, 10.1175/2008JAS2770.1.
  • Bou-Rabee and Vanden-Eijnden (2015) Bou-Rabee, N., and E. Vanden-Eijnden, 2015: Continuous-time random walks for the numerical solution of stochastic differential equations. Memoirs of the American Mathematical Society.
  • Bowman et al. (2009) Bowman, G. R., K. A. Beauchamp, G. Boxer, and V. S. Pande, 2009: Progress and challenges in the automated construction of markov state models for full protein systems. The Journal of chemical physics, 131 (12), 124 101.
  • Butler et al. (2015) Butler, A. H., D. J. Seidel, S. C. Hardiman, N. Butchart, T. Birner, and A. Match, 2015: Defining sudden stratospheric warmings. Bulletin of the American Meteorological Society, 96 (11), 1913–1928, 10.1175/BAMS-D-13-00173.1.
  • Charlton and Polvani (2007) Charlton, A. J., and L. M. Polvani, 2007: A new look at stratospheric sudden warmings. part i: Climatology and modeling benchmarks. Journal of Climate, 20 (3), 449–469, 10.1175/JCLI3996.1.
  • Chodera and Noe (2014) Chodera, J. D., and F. Noe, 2014: Markov state models of biomolecular conformational dynamics. Current Opinion in Structural Biology, 25, 135–144, https://doi.org/10.1016/j.sbi.2014.04.002.
  • Christiansen (2000) Christiansen, B., 2000: Chaos, quasiperiodicity, and interannual variability: Studies of a stratospheric vacillation model. Journal of the Atmospheric Sciences, 57 (18), 3161–3173, 10.1175/1520-0469(2000)057¡3161:CQAIVS¿2.0.CO;2.
  • DelSole and Farrell (1995) DelSole, T., and B. F. Farrell, 1995: A stochastically excited linear system as a model for quasigeostrophic turbulence: Analytic results for one- and two-layer fluids. Journal of the Atmospheric Sciences, 52 (14), 2531–2547, 10.1175/1520-0469(1995)052¡2531:ASELSA¿2.0.CO;2.
  • Dematteis et al. (2018) Dematteis, G., T. Grafke, and E. Vanden-Eijnden, 2018: Rogue waves and large deviations in deep sea. Proceedings of the National Academy of Sciences, 115 (5), 855–860, 10.1073/pnas.1710670115.
  • Dunkerton et al. (1981) Dunkerton, T., C.-P. F. Hsu, and M. E. McIntyre, 1981: Some eulerian and lagrangian diagnostics for a model stratospheric warming. Journal of the Atmospheric Sciences, 38 (4), 819–844, 10.1175/1520-0469(1981)038¡0819:SEALDF¿2.0.CO;2.
  • Durrett (2013) Durrett, R., 2013: Probability: Theory and Examples. Cambridge University Press.
  • E and Vanden-Eijnden (2006) E, W., and E. Vanden-Eijnden, 2006: Towards a theory of transition paths. Journal of Statistical Physics, 123 (3), 503–523, 10.1007/s10955-005-9003-9.
  • Easterling et al. (2000) Easterling, D. R., G. A. Meehl, C. Parmesan, S. A. Changnon, T. R. Karl, and L. O. Mearns, 2000: Climate extremes: Observations, modeling, and impacts. Science, 289 (5487), 2068–2074, 10.1126/science.289.5487.2068.
  • Franzke and Majda (2006) Franzke, C., and A. J. Majda, 2006: Low-order stochastic mode reduction for a prototype atmospheric gcm. Journal of the Atmospheric Sciences, 63 (2), 457–479, 10.1175/JAS3633.1.
  • Froyland and Junge (2018) Froyland, G., and O. Junge, 2018: Robust fem-based extraction of finite-time coherent sets using scattered, sparse, and incomplete trajectories. SIAM Journal on Applied Dynamical Systems, 17 (2), 1891–1924, 10.1137/17M1129738.
  • Giannakis (2015) Giannakis, D., 2015: Dynamics-adapted cone kernels. SIAM Journal of Applied Dynamical Systems, 14, 556–608, 10.1137/140954544.
  • Giannakis (2019) Giannakis, D., 2019: Data-driven spectral decomposition and forecasting of ergodic dynamical systems. Applied and Computational Harmonic Analysis, 47 (2), 338 – 396, https://doi.org/10.1016/j.acha.2017.09.001.
  • Giannakis et al. (2018) Giannakis, D., A. Kolchinskaya, D. Krasnov, and J. Schumacher, 2018: Koopman analysis of the long-term evolution in a turbulent convection cell. Journal of Fluid Mechanics, 847, 735–767, 10.1017/jfm.2018.297.
  • Giannakis and Majda (2012) Giannakis, D., and A. J. Majda, 2012: Nonlinear laplacian spectral analysis for time series with intermittency and low-frequency variability. Proceedings of the National Academy of Sciences, 109 (7), 2222–2227, 10.1073/pnas.1118984109, https://www.pnas.org/content/109/7/2222.full.pdf.
  • Harlim and Yang (2018) Harlim, J., and H. Yang, 2018: Diffusion forecasting model with basis functions from qr-decomposition. Journal of Nonlinear Science, 28 (3), 847–872, 10.1007/s00332-017-9430-1.
  • Hasselmann (1976) Hasselmann, K., 1976: Stochastic climate models part i. theory. Tellus, 28 (6), 473–485, 10.3402/tellusa.v28i6.11316.
  • Hoffman et al. (2006) Hoffman, R. N., J. M. Henderson, S. M. Leidner, C. Grassotti, and T. Nehrkorn, 2006: The response of damaging winds of a simulated tropical cyclone to finite-amplitude perturbations of different variables. Journal of the Atmospheric Sciences, 63 (7), 1924–1937, 10.1175/JAS3720.1.
  • Holton and Mass (1976) Holton, J. R., and C. Mass, 1976: Stratospheric vacillation cycles. Journal of the Atmospheric Sciences, 33 (11), 2218–2225, 10.1175/1520-0469(1976)033¡2218:SVC¿2.0.CO;2.
  • Inatsu et al. (2015) Inatsu, M., N. Nakano, S. Kusuoka, and H. Mukougawa, 2015: Predictability of wintertime stratospheric circulation examined using a nonstationary fluctuation–dissipation relation. Journal of the Atmospheric Sciences, 72 (2), 774–786, 10.1175/JAS-D-14-0088.1.
  • Karatzas and Shreve (1998) Karatzas, I., and S. E. Shreve, 1998: Brownian Motion and Stochastic Calculus. Springer.
  • Kitsios and Frederiksen (2019) Kitsios, V., and J. S. Frederiksen, 2019: Subgrid parameterizations of the eddy–eddy, eddy–mean field, eddy–topographic, mean field–mean field, and mean field–topographic interactions in atmospheric models. Journal of the Atmospheric Sciences, 76 (2), 457–477, 10.1175/JAS-D-18-0255.1.
  • Limpasuvan et al. (2004) Limpasuvan, V., D. W. J. Thompson, and D. L. Hartmann, 2004: The life cycle of the northern hemisphere sudden stratospheric warmings. Journal of Climate, 17 (13), 2584–2596, 10.1175/1520-0442(2004)017¡2584:TLCOTN¿2.0.CO;2.
  • Lu and Vanden-Eijnden (2014) Lu, J., and E. Vanden-Eijnden, 2014: Exact dynamical coarse-graining without time-scale separation. The Journal of Chemical Physics, 141 (4), 044 109, 10.1063/1.4890367.
  • Martius et al. (2009) Martius, O., L. M. Polvani, and H. C. Davies, 2009: Blocking precursors to stratospheric sudden warming events. Geophysical Research Letters, 36 (14), 1–2, 10.1029/2009GL038776.
  • Metzner et al. (2006) Metzner, P., C. Schutte, and E. Vanden-Eijnden, 2006: Illustration of transition path theory on a collection of simple examples. The Journal of Chemical Physics, 125 (8), 1–2, 10.1063/1.2335447.
  • Metzner et al. (2009) Metzner, P., C. Schutte, and E. Vanden-Eijnden, 2009: Transition path theory for markov jump processes. Multiscale Modeling and Simulation, 7 (3), 1192–1219, 10.1137/070699500.
  • Pavliotis (2014) Pavliotis, G. A., 2014: Stochastic processes and applications. Springer.
  • Plotkin et al. (2019) Plotkin, D. A., R. J. Webber, M. E. O’Neill, J. Weare, and D. S. Abbot, 2019: Maximizing simulated tropical cyclone intensity with action minimization. Journal of Advances in Modeling Earth Systems, 11 (4), 863–891, 10.1029/2018MS001419.
  • Ragone et al. (2018) Ragone, F., J. Wouters, and F. Bouchet, 2018: Computation of extreme heat waves in climate models using a large deviation algorithm. Proceedings of the National Academy of Sciences, 115 (1), 24–29, 10.1073/pnas.1712645115, https://www.pnas.org/content/115/1/24.full.pdf.
  • Ruzmaikin et al. (2003) Ruzmaikin, A., J. Lawrence, and C. Cadavid, 2003: A simple model of stratospheric dynamics including solar variability. Journal of Climate, 16, 1593–1600, 10.1175/2007JCLI2119.1.
  • Sabeerali et al. (2017) Sabeerali, C. T., R. S. Ajayamohan, D. Giannakis, and A. J. Majda, 2017: Extraction and prediction of indices for monsoon intraseasonal oscillations: an approach based on nonlinear laplacian spectral analysis. Climate Dynamics, 49 (9), 3031–3050, 10.1007/s00382-016-3491-y.
  • Scott and Polvani (2006) Scott, R. K., and L. M. Polvani, 2006: Internal variability of the winter stratosphere. part i: Time-independent forcing. Journal of the Atmospheric Sciences, 63 (11), 2758–2776, 10.1175/JAS3797.1, URL https://doi.org/10.1175/JAS3797.1.
  • Sigmond and Scinocca (2010) Sigmond, M., and J. F. Scinocca, 2010: The influence of the basic state on the northern hemisphere circulation response to climate change. Journal of Climate, 23 (6), 1434–1446, 10.1175/2009JCLI3167.1.
  • Sjoberg and Birner (2012) Sjoberg, J. P., and T. Birner, 2012: Transient tropospheric forcing of sudden stratospheric warmings. Journal of the Atmospheric Sciences, 69 (11), 3420–3432, 10.1175/JAS-D-11-0195.1.
  • Thiede et al. (2019) Thiede, E., D. Giannakis, A. R. Dinner, and J. Weare, 2019: Approximation of dynamical quantities using trajectory data. arXiv:1810.01841 [physics.data-an], 1–24, 1810.01841.
  • Thompson et al. (2002) Thompson, D. W. J., M. P. Baldwin, and J. M. Wallace, 2002: Stratospheric connection to northern hemisphere wintertime weather: Implications for prediction. Journal of Climate, 15 (12), 1421–1428, 10.1175/1520-0442(2002)015¡1421:SCTNHW¿2.0.CO;2.
  • Tripathi et al. (2016) Tripathi, O. P., and Coauthors, 2016: Examining the predictability of the stratospheric sudden warming of january 2013 using multiple nwp systems. Monthly Weather Review, 144 (5), 1935–1960, 10.1175/MWR-D-15-0010.1.
  • Vanden-Eijnden (2014) Vanden-Eijnden, E., 2014: An introduction to Markov state models and their application to long timescale molecular simulation, ch. 7. Springer.
  • Vanden-Eijnden and E (2010) Vanden-Eijnden, E., and W. E, 2010: Transition-path theory and path-finding algorithms for the study of rare events. Annual Review of Physical Chemistry, 61 (1), 391–420, 10.1146/annurev.physchem.040808.090412.
  • Vanden-Eijnden and Weare (2013) Vanden-Eijnden, E., and J. Weare, 2013: Data assimilation in the low noise regime with application to the kuroshio. Monthly Weather Review, 141 (6), 1822–1841, 10.1175/MWR-D-12-00060.1.
  • Weare (2009) Weare, J., 2009: Particle filtering with path sampling and an application to a bimodal ocean current model. Journal of Computational Physics, 228 (12), 4312 – 4331, https://doi.org/10.1016/j.jcp.2009.02.033.
  • Webber et al. (2019) Webber, R. J., D. A. Plotkin, M. E. O’Neill, D. S. Abbot, and J. Weare, 2019: Practical rare event sampling for extreme mesoscale weather. Chaos, 29 (5), 053 109, 10.1063/1.5081461.
  • Yasuda et al. (2017) Yasuda, Y., F. Bouchet, and A. Venaille, 2017: A new interpretation of vortex-split sudden stratospheric warmings in terms of equilibrium statistical mechanics. Journal of the Atmospheric Sciences, 74 (12), 3915–3936, 10.1175/JAS-D-17-0045.1.
  • Yoden (1987) Yoden, S., 1987: Bifurcation properties of a stratospheric vacillation model. Journal of the Atmospheric Sciences, 44 (13), 1723–1733, 10.1175/1520-0469(1987)044¡1723:BPOASV¿2.0.CO;2.