This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Stability of the global attractor under Markov-Wasserstein noise

Martin Kell Max-Planck-Institute for Mathematics in the Sciences, Inselstr. 22-26, D-04103 Leipzig, Germany mkell@mis.mpg.de
(Date: March 17, 2011)
Abstract.

We develop a “weak Ważewski principle” for discrete and continuous time dynamical systems on metric spaces having a weaker topology to show that attractors can be continued in a weak sense. After showing that the Wasserstein space of a proper metric space is weakly proper we give a sufficient and necessary condition such that a continuous map (or semiflow) induces a continuous map (or semiflow) on the Wasserstein space. In particular, if these conditions hold then the global attractor, viewed as invariant measures, can be continued under Markov-type random perturbations which are sufficiently small w.r.t. the Wasserstein distance, e.g. any small bounded Markov-type noise and Gaussian noise with small variance will satisfy the assumption.

Key words and phrases:
global attractor, random perturbation, Wasserstein space
2000 Mathematics Subject Classification:
34D23, 37B35, 60B05, 60B10
The author would like to thank the IMPRS “Mathematics in the Sciences” for financial support and his advisor, Prof. Jürgen Jost, and the MPI MiS for providing an inspiring research atmosphere.

In this paper we are going to show that the invariant measures of a dynamical system having a global attractor (either discrete and continuous time) can be “continued” under small (not necessarily bounded) noise. Instead of just showing that there is a stationary measure of the perturbed system “weakly” close to the original ones (see e.g. [Kif88, 1.7]) we show that it is close w.r.t. the Wasserstein metric wpw_{p} (the order pp depends on regularity of the noise). Previous research mainly focused on Gaussian type noise, “absolute continuous” noise or assumed implicitly bounded noise, e.g. Kifer [Kif88, p.103] and L.-S. Young [You86] considered noise on a positive invariant (bounded) neighborhood UU of a local attractor which is zero on U\partial U and thus is bounded. If XX is compact than any noise will be bounded. Thus we are in particular interested in non-compact XX, although we restrict our attention to proper metric spaces which includes all locally compact geodesic spaces.

All our results apply equally to discrete and continuous time dynamical system. We will mainly focus on discrete time because there is a better intuition behind these. In the first section we extend Rybakowski’s continuation of a positive invariant isolating neighborhood to the discrete time setting which will be the key step to treat continuous and discrete time systems on equal footing. Using a kind of “weak Ważewski principle” we show that attractors can be continued in a weak sense without assuming admissibility of the perturbed system (theorem 5).

Then we introduce the Wasserstein space and show that the Wasserstein space of a proper metric space is weakly proper, i.e. closed δ\delta-neighborhoods of compact sets are weakly compact (theorem 6). We give a necessary and sufficient condition such that a dynamical system ff (resp. a semiflow π\pi) on a proper metric space makes the transfer map f:𝒫(X)𝒫(X)f_{*}:\mathcal{P}(X)\to\mathcal{P}(X) (resp. transfer semiflow π\pi_{*}), which is always (weakly) continuous, (strongly) continuous on any Wasserstein space (𝒫p(X),wp)(\mathcal{P}_{p}(X),w_{p}) of order 1p<1\leq p<\infty.

Finally, we look at Markov-type perturbations of a dynamical system ff (resp. semiflow π\pi) having a global attractor. These are perturbation of ff_{*} in the Wasserstein space. Under quite general assumptions on ff we can show using Conley theory that if the perturbation is sufficiently small then the perturbed system has an isolated (weak) attractor and there is at least one stationary measure (theorem 15). The perturbed attractor is strongly close to the unperturbed attractor and contains all invariant measures and its positive invariant isolating neighborhood is convex. If the noise is of order pp then the mass of the invariant measures decays at least as RpR^{-p} where RR is the distance from the global attractor of ff (resp. π\pi). Furthermore, a standard result from Conley theory shows that if the noise level converges to zero then the set of invariant measures converges in the Wasserstein space to the set of invariant measures of the deterministic system (as usual without further assumptions only upper semicontinuity holds).

Because of the special form of the Kantorovich-Rubinstein duality in 𝒫1(X)\mathcal{P}_{1}(X) we can show that the (local) weak attractor of the perturbed system P:𝒫1(X)𝒫1(X)P:\mathcal{P}_{1}(X)\to\mathcal{P}_{1}(X) is actually the global weak attractor of PP.

The framework of a metric space with a weaker topology used here is similar to the framework in [AGS08, section 2.1] used to construct general gradient flows.

Motivation

Consider a dynamical system f:XXf:X\to X on a proper metric space XX having a global attractor, i.e. a compact invariant set AA that attracts all its (bounded) neighborhoods. We will not consider the map ff itself, but the map f:𝒫1(X)𝒫1(X)f_{*}:\mathcal{P}_{1}(X)\to\mathcal{P}_{1}(X) defined via push-forward map on the space of probability measures. If ff is “nice” then ff_{*} is continuous and has the global attractor K=𝒫(A)K=\mathcal{P}(A), i.e. the probability measures supported on the global attractor of ff.

Markov-type noise can be considered as a perturbation F~\tilde{F} of ff_{*}, i.e. instead of δxδf(x)\delta_{x}\mapsto\delta_{f(x)} we have δxp(dy|x)\delta_{x}\mapsto p(dy|x) where p(dy|x)p(dy|x) and δf(x)\delta_{f(x)} uniformly close w.r.t. the Wasserstein distance w1w_{1} for all xx. This can be seen as a smearing of the image f(x)f(x) or some uncertainty about the actual image. For example, if ff is the time-11 map of a flow generated by the ODE x˙=g(x)\dot{x}=g(x) then F~\tilde{F} could be the time-one map of the flow of distributions of the SDE dx=g(x)dt+ϵdWtdx=g(x)dt+\epsilon dW_{t}, i.e. additive Gaussian noise with small variance.

If F~\tilde{F} and ff_{*} are sufficiently close then F~\tilde{F} has a (weak) global attractor (in 𝒫1(X)\mathcal{P}_{1}(X)) which is close to KK w.r.t. w1w_{1}. Hence stability of the global attractor holds in the Wasserstein space 𝒫1(X)\mathcal{P}_{1}(X).

The following example is inspired by Crauel, Flandoli - “Additive Noise Destroys a Pitchfork Bifurcation” [CF98] and could be decribed as “Additive Noise Destroys Attractors”. The noise will be worse than white noise used by Crauel and Flandoli, but can still be considered as small.

Example (Generic collapse under "small" noise).

(1) Suppose f:XXf:X\to X has a global attractor and at least one fixed point x0x_{0} (the argument works equally well with general attractors). Take any noise level ϵ>0\epsilon>0 and let Pϵ:𝒫(X)𝒫(X)P_{\epsilon}:\mathcal{P}(X)\to\mathcal{P}(X) be the Markov map induced by

x(1ϵ)δf(x)+ϵδx0.x\mapsto(1-\epsilon)\delta_{f(x)}+\epsilon\delta_{x_{0}}.

This map is (weakly) close to the unperturbed system f:𝒫(X)𝒫(X)f_{*}:\mathcal{P}(X)\to\mathcal{P}(X). Namely, if dLPd_{LP} is the Levy-Prokhorov distance (which metrizes 𝒫(X)\mathcal{P}(X)) then

supμP(X)dLP(Pϵ(μ),f(μ))ϵ.\sup_{\mu\in P(X)}d_{LP}(P_{\epsilon}(\mu),f_{*}(\mu))\leq\epsilon.

But PP has exactly one invariant measure, namely δx0\delta_{x_{0}}, and all others converge to this measure weakly.

(2) Now we want to show that this can also happen in any Wasserstein space 𝒫p(X)\mathcal{P}_{p}(X) for 1p<1\leq p<\infty (𝒫(X)\mathcal{P}_{\infty}(X) only allows bounded noise which when sufficiently small cannot destroy local attractors and thus a global attractors with at least two sinks never collapses, see [Kel11]). Suppose a:X[0,1]a:X\to[0,1] is a continuous function. Define qa(dy|):X𝒫p(X)q_{a}(dy|\cdot):X\to\mathcal{P}_{p}(X) by

xqa(dy|x)=(1a(x))δf(x)+a(x)δx0,x\mapsto q_{a}(dy|x)=(1-a(x))\delta_{f(x)}+a(x)\delta_{x_{0}},

which is obviously continuous. Thus

wp(qa(dy|x),δf(x))p=a(x)d(f(x),x0)p.w_{p}(q_{a}(dy|x),\delta_{f(x)})^{p}=a(x)d(f(x),x_{0})^{p}.

So if we define

a(x)=ϵp1+d(f(x),x0)pa(x)=\frac{\epsilon^{p}}{1+d(f(x),x_{0})^{p}}

then

wp(qa(dy|x),δf(x))p=ϵpd(f(x),x0)p1+d(f(x),x0)pϵp.w_{p}(q_{a}(dy|x),\delta_{f(x)})^{p}=\frac{\epsilon^{p}\cdot d(f(x),x_{0})^{p}}{1+d(f(x),x_{0})^{p}}\leq\epsilon^{p}.

So, in particular, the induced MW-map Qa:𝒫p(X)𝒫p(X)Q_{a}:\mathcal{P}_{p}(X)\to\mathcal{P}_{p}(X) of order pp relative to ff has noise level ϵ\epsilon. Furthermore, the only invariant measure of QaQ_{a} is δx0\delta_{x_{0}} and all other measures converge to it.

The example above should make clear that using arbitrary unbounded noise even when it is small can have strange effects on the global attractor. Although we have some “attracting” invariant measures of the perturbed system the attractor might look very different from the original one, in our case it might be just one fixed point and this one can even be the “most” unstable one of the original attractor. Therefore, stochastic stability of attractors under arbitrary “small” noise should not be referred to a single invariant measure but to all of them, even though we can speak of stochastic stability if the type of noise is more restricted, besides of being sufficiently “small”.

1. Discrete-time Conley theory for stable invariant sets

In this section we will use Conley theory, that is continuation methods from Conley index theory without using the topological (or (co)homological) Conley index. We will prove a continuation for a positive invariant neighborhood of a stable isolated invariant set of a time discrete dynamical system. The result will not require a compactness assumption (called admissibility) of the perturbed system and is a different type of continuation than [MR91]. Our proof will follow the proof of [Ryb87, Theorem 12.3] which is the continuation for semiflows. In particular, the results stated here and in the next sections also hold for semiflows if we assume that they do not explode on a given neighborhood.

In both cases the Ważewski principle for the index pair of the perturbed system does not apply. But we can use other assumptions to show that attractors continue, e.g. the map is weakly continuous and closed δ\delta-neighborhoods of compact set are weakly compact, which is the case for the Wasserstein space on proper metric spaces.

We will now give the definitions used in [MR91] and [Ryb87] to prove the existence of an index pair for certain isolated invariant sets. Our setting will be a complete separable metric space YY and a dynamical system, i.e. a continuous map f:YYf:Y\to Y. A full left solution of ff in NN is a sequence {xn}nN\{x_{-n}\}_{n\in\mathbb{N}}\subset N such that f(xn1)=xnf(x_{n-1})=x_{n} for n0n\leq 0. Define the following sets

A+(N)\displaystyle A^{+}(N) =\displaystyle= {xN|fk(x)Nfor all k0}\displaystyle\{x\in N\,|\,f^{k}(x)\in N\,\mbox{for all }k\geq 0\}
A(N)\displaystyle A^{-}(N) =\displaystyle= {xN| full left solution {xn}n in N through x0=x}\displaystyle\{x\in N\,|\,\exists\mbox{\,\ full left solution $\{x_{-n}\}_{n\in\mathbb{Z}}$\,\ in \,$N$\,\ through $x_{0}=x$}\}
A(N)\displaystyle A(N) =\displaystyle= A+(N)A(N).\displaystyle A^{+}(N)\cap A^{-}(N).

These are called the maximal positive invariant (resp. negative invariant, resp. invariant) set in NN. If NN is unbounded then A(N)A(N) usually denotes only the bounded invariant orbits instead of all of them. A set KK is called invariant if A(K)=KA(K)=K. If there is a closed neighborhood NN of an invariant set KK with A(N)=KA(N)=K then KK is called isolated with isolating neighborhood NN.

For l,ml,m\in\mathbb{N} and lkl\leq k define

f[l,m](x)={y|fk(x)=yfor some k[l,m]}f^{[l,m]}(x)=\{y\,|\,f^{k}(x)=y\,\mbox{for some }k\in\mathbb{N}\cap[l,m]\}

If fn:YYf_{n}:Y\to Y is a sequence of continuous maps such that fnff_{n}\to f, i.e. fn(xn)f(x)f_{n}(x_{n})\to f(x) whenever xnxx_{n}\to x as nn\to\infty, then we say that a closed bounded set NN is {fn}\{f_{n}\}-admissible if for any sequence {xn}n\{x_{n}\}_{n\in\mathbb{N}} with fn[0,mn](xn)Nf_{n}^{[0,m_{n}]}(x_{n})\subset N and mnm_{n}\to\infty the sequence of endpoints {fnmn(xn)}n\{f_{n}^{m_{n}}(x_{n})\}_{n\in\mathbb{N}} is precompact. In case this property holds for fnff_{n}\equiv f then we just say NN is ff-admissible.

Remark.

Later on, we deal with dynamical systems on the space of probability measures Y=𝒫(X)Y=\mathcal{P}(X) for some metric space XX. An invariant measure for that system is invariant w.r.t. the definition above. In particular, periodic measures will be called invariant. A fixed point for these systems will be called stationary measure.

In the following we will use several ideas from [MR91]: Let N,NN,N^{\prime} be two ff-admissible isolating neighborhood for some isolated invariant set KK with

NintNf1(intN).N\subset\operatorname{int}N^{\prime}\cap f^{-1}(\operatorname{int}N^{\prime}).

The authors in [MR91, 4.4] used a so called Lyapunov pair (ϕ,γ)(\phi,\gamma) which is continuous on a small neighborhood WNW\subset N of KK and has the following properties: Kγ1(0)K\subset\gamma^{-1}(0), ϕ\phi (resp. γ\gamma) is decreasing (resp. increasing) along orbits and ϕ(x)=0\phi(x)=0 with xWx\in W implies

xA(N)N.x\in A^{-}(N)\cup\partial N^{\prime}.

Because KK is compact we can choose d(W,N)>0d(W,\partial N^{\prime})>0 and assume xA(N)x\in A^{-}(N) whenever ϕ(x)=0\phi(x)=0. Furthermore, it is shown that if ϕ(xn)0\phi(x_{n})\to 0 then xnx_{n} admits a convergent subsequence.

Theorem 1.

Suppose NN is an isolating neighborhood for KK such that the assumptions above hold and

A(N)=A(N)=K.A^{-}(N)=A(N)=K\neq\varnothing.

Then there exists an admissible isolating neighborhood BNB\subset N which is positive invariant, i.e. no trajectories exit BB.

Remark.

This is the discrete time version of [Ryb87, I-5.5] using the theory of [MR91]. The proof is essentially copied from Rybakowski using the Lyapunov pair above.

Proof.

Define

P1ϵ\displaystyle P_{1}^{\epsilon} =\displaystyle= Ncl{xintN|ϕ(x)<ϵ}\displaystyle N\cap\operatorname{cl}\{x\in\operatorname{int}N^{\prime}\,|\,\phi(x)<\epsilon\}
P2ϵ\displaystyle P_{2}^{\epsilon} =\displaystyle= P1ϵ\{xintN|γ(x)<ϵ}.\displaystyle P_{1}^{\epsilon}\backslash\{x\in\operatorname{int}N^{\prime}\,|\,\gamma(x)<\epsilon\}.

it was shown [MR91, 4.4] that P1ϵWP_{1}^{\epsilon}\subset W is a neighborhood of KK for sufficiently small ϵ>0\epsilon>0 and whenever xPiϵx\in P_{i}^{\epsilon} and f(x)Nf(x)\in N then xPiϵx\in P_{i}^{\epsilon} and if xP1ϵx\in P_{1}^{\epsilon} and f(x)Nf(x)\notin N then xP2ϵx\in P_{2}^{\epsilon}, i.e. P2ϵP_{2}^{\epsilon} is the exit ramp for P1ϵP_{1}^{\epsilon}.

Now fix a sufficiently small ϵ>0\epsilon>0 and let 0<δϵ0<\delta\leq\epsilon then P1δP1ϵP_{1}^{\delta}\subset P_{1}^{\epsilon}. Define P~2δ:=P1δP2ϵ\tilde{P}_{2}^{\delta}:=P_{1}^{\delta}\cap P_{2}^{\epsilon} then because ϕ\phi is decreasing along orbits P~2δ\tilde{P}_{2}^{\delta} is still an exit ramp for P2δP_{2}^{\delta}.

If A(N)=A(N)=KA^{-}(N)=A(N)=K then we claim that there is a δ>0\delta>0 such that P~2δ=\tilde{P}_{2}^{\delta}=\varnothing which implies that P1δP_{1}^{\delta} is positive invariant. If this does not hold then there is a sequence xnP1δnP2ϵx_{n}\in P_{1}^{\delta_{n}}\cap P_{2}^{\epsilon} with δn0\delta_{n}\to 0. Thus ϕ(xn)0\phi(x_{n})\to 0 and γ(xn)ϵ\gamma(x_{n})\geq\epsilon which implies that there is a subsequence xnxNx_{n^{\prime}}\to x\in N such that ϕ(x)=0\phi(x)=0. Hence

xA(N)=Kx\in A^{-}(N)=K

and γ(x)=0\gamma(x)=0 by assumption. But γ\gamma is continuous and xnxx_{n^{\prime}}\to x implies ϵγ(xn)γ(x)=0\epsilon\leq\gamma(x_{n^{\prime}})\to\gamma(x)=0 which is a contradiction. This proofs our claim and thus the theorem. ∎

In the following we will assume that KK satisfies the assumption of the theorem and that B:=P1δ0B:=P_{1}^{\delta_{0}}, ϕ\phi and γ\gamma are given as in the proof. It is obvious that P1δP_{1}^{\delta} is positive invariant w.r.t. ff for any 0<δδ00<\delta\leq\delta_{0}. Furthermore, suppose fnff_{n}\to f.

Theorem 2.

Assume NN^{\prime} (see above) is {fnm}\{f_{n_{m}}\}-admissible for each subsequence of {fn}n\{f_{n}\}_{n\in\mathbb{N}}. Set U~=intB\tilde{U}=\operatorname{int}B and define

V(a)={xU~|ϕ(x)<a}.V(a)=\{x\in\tilde{U}\,|\,\phi(x)<a\}.

Then for some a0>0a_{0}>0, N:=clV(a0)U~N:=\operatorname{cl}V(a_{0})\subset\tilde{U}. Furthermore, for some sufficiently small ϵ0>0\epsilon_{0}>0 and all 0<ϵϵ00<\epsilon\leq\epsilon_{0} there is an n0=n0(ϵ)n_{0}=n_{0}(\epsilon) such that for all nn0n\geq n_{0} there is a positive fnf_{n}-invariant closed Nn(ϵ)N_{n}(\epsilon) and

KnV(ϵ)Nn(ϵ)N.K_{n}\subset V(\epsilon)\subset N_{n}(\epsilon)\subset N.
Remark.

The complete continuation theorem for index pairs does not hold for discrete time dynamical systems in general. A proof would require that there is a neighborhood such that the exit time is continuous in U~\tilde{U} , i.e. ωn+(xn)ω+(x0)\omega_{n}^{+}(x_{n})\to\omega^{+}(x_{0}) whenever xnx0x_{n}\to x_{0} in U~\tilde{U}, which holds for semiflows only for so called isolating blocks. These blocks do not necessarily exist for continuous maps.

Proof.

By [MR91, 3.9] there is an a0>0a_{0}>0 such that N=clV(a0)U~N=\operatorname{cl}V(a_{0})\subset\tilde{U}. And similar to [Ryb87, I-4.5] we can show that for 0<ϵa00<\epsilon\leq a_{0} and all nn0(ϵ)n\geq n_{0}(\epsilon)

KnV(ϵ).K_{n}\subset V(\epsilon).

Define

Nn(ϵ)=Ncl{y| xV(ϵ)m0 s.t. fn[0,m](x)U~and fnm(x)=y}.N_{n}(\epsilon)=N\cap\operatorname{cl}\{y\,|\,\mbox{ $\exists x\in V(\epsilon)$, $m\geq 0$\,\ s.t.\,\ $f_{n}^{[0,m]}(x)\subset\tilde{U}\,$and $f_{n}^{m}(x)=y$}\}.

Following the proof of [Ryb87, I-12.5] we can show that Nn(ϵ)N_{n}(\epsilon) satisfies the following properties for nn0(ϵ)n\geq n_{0}(\epsilon)

  • xNn(ϵ)x\in N_{n}(\epsilon) and fn(x)Nf_{n}(x)\in N implies fn(x)Nn(ϵ)f_{n}(x)\in N_{n}(\epsilon)

  • KnV(ϵ)Nn(ϵ)K_{n}\subset V(\epsilon)\subset N_{n}(\epsilon)

We claim that for small ϵ0>0\epsilon_{0}>0 whenever ϵϵ0\epsilon\leq\epsilon_{0} and nn0(ϵ)n\geq n_{0}(\epsilon) then Nn(ϵ)N_{n}(\epsilon) is positive invariant w.r.t. fnf_{n}. If this is not true then there is a sequence ϵm0\epsilon_{m}\to 0 and

ymNnm(ϵm)y_{m}\in N_{n_{m}}(\epsilon_{m})

with fnm(ym)Nf_{n_{m}}(y_{m})\notin N. By definition of Nnm(ϵm)N_{n_{m}}(\epsilon_{m}) there is a sequence y~mY\tilde{y}_{m}\in Y, xmV(ϵm)x_{m}\in V(\epsilon_{m}) and km0k_{m}\geq 0 such that d(ym,y~m)<2md(y_{m},\tilde{y}_{m})<2^{-m}, fnm[0,km](xm)U~f_{n_{m}}^{[0,k_{m}]}(x_{m})\subset\tilde{U} and y~m=fnmkm(xm)\tilde{y}_{m}=f_{n_{m}}^{k_{m}}(x_{m}). Because ϕ(xm)0\phi(x_{m})\to 0 and Af(B)=Af(B)A_{f}^{-}(B)=A_{f}(B) we can assume w.l.o.g. that xmx0Af(B)x_{m}\to x_{0}\in A_{f}(B). Admissibility and fnmff_{n_{m}}\to f imply the sequence {fnmkm(xm)}m\{f_{n_{m}}^{k_{m}}(x_{m})\}_{m\in\mathbb{N}} has a convergent subsequence and w.l.o.g. y~m=fnmkm(xm)y0Af(N)=Af(N)intN\tilde{y}_{m}=f_{n_{m}}^{k_{m}}(x_{m})\to y_{0}\in A_{f}^{-}(N^{\prime})=A_{f}(N^{\prime})\subset\operatorname{int}N and thus ymy0y_{m}\to y_{0}.

Since fnm(ym)NintNf1(intN)f_{n_{m}}(y_{m})\notin N\subset\operatorname{int}N^{\prime}\cap f^{-1}(\operatorname{int}N^{\prime}) and fnff_{n}\to f we have fnm(ym)f(y0)N\intNf_{n_{m}}(y_{m})\to f(y_{0})\in N^{\prime}\backslash\operatorname{int}N. But y0Af(N)y_{0}\in A_{f}(N^{\prime}) implies f(y0)Af(N)f(y_{0})\in A_{f}(N^{\prime}) which is a contradiction because Af(N)A_{f}(N^{\prime}) and N\intNN^{\prime}\backslash\operatorname{int}N are disjoint.∎

Corollary 3.

Under the assumption above for all nn0n\geq n_{0} we can find positive fnf_{n}-invariant Nn,NnN_{n},N_{n}^{{}^{\prime}} such that

NnUδ(K)NnBN_{n}\subset U_{\delta}(K)\subset N_{n}^{{}^{\prime}}\subset B

for some δ\delta-neighborhood of KK denoted by

Uδ(K)={xY|d(x,y)<δfor someyK}.U_{\delta}(K)=\{x\in Y\,|\,d(x,y)<\delta\,\mbox{for some}\>y\in K\}.

Furthermore, we have KintNnK\subset\operatorname{int}N_{n} and there is an ϵ>0\epsilon>0 such that

Uϵ(Kn)Nn.U_{\epsilon}(K_{n})\subset N_{n}^{{}^{\prime}}.
Proof.

Applying the previous theorem we get

KKnV(ϵ~)NnB.K\cup K_{n}\subset V(\tilde{\epsilon})\subset N_{n}^{{}^{\prime}}\subset B.

Recalling the definition of ϕ\phi it is obvious that because Af(B)=Af(B)=KA_{f}^{-}(B)=A_{f}(B)=K for small 0<ϵ<10<\epsilon^{\prime}<1 and xP1ϵx\in P_{1}^{\epsilon^{\prime}}

d(x,K)ϵ.d(x,K)\leq\epsilon^{\prime}.

Because KK is compact and V(ϵ~)V(\tilde{\epsilon}) a neighborhood of KK the δ\delta-neighborhood Uδ(K)U_{\delta}(K) of KK is contained in V(ϵ~)V(\tilde{\epsilon}) for δ\delta sufficiently small. Furthermore, we can find an ϵ>0\epsilon^{\prime}>0 with ϵ<δ\epsilon^{\prime}<\delta such that P1ϵUδ(K)V(ϵ~)P_{1}^{\epsilon^{\prime}}\subset U_{\delta}(K)\subset V(\tilde{\epsilon}). Applying the theorem again for P1ϵP_{1}^{\epsilon^{\prime}} instead of BB we get for nn0n\geq n_{0}

NnP1ϵUδ(K)NnBN_{n}\subset P_{1}^{\epsilon^{\prime}}\subset U_{\delta}(K)\subset N_{n}^{{}^{\prime}}\subset B

and KV(ϵ)NnK\subset V(\epsilon^{\prime})\subset N_{n}.

To show that Uϵ(Kn)NnU_{\epsilon}(K_{n})\subset N_{n} we need another positive fnf_{n}-invariant neighborhood Nn′′N_{n}^{{}^{\prime\prime}}. First note that there is a δ>0\delta^{\prime}>0 such that

d(x,K)δd(x,K)\geq\delta^{\prime}

for all xPϵx\in\partial P^{\epsilon^{\prime}}. So if we choose 0<2ϵ<δ0<2\epsilon<\delta then

Uϵ(P1ϵ)P1ϵ.U_{\epsilon}(P_{1}^{\epsilon})\subset P_{1}^{\epsilon^{\prime}}.

Applying the previous theorem again we get a positive fnf_{n}-invariant isolating Nn′′N_{n}^{{}^{\prime\prime}} of KnK_{n} inside of P1ϵP_{1}^{\epsilon}. Hence

Uϵ(Kn)Uϵ(Nn′′)Uϵ(P1ϵ)P1ϵNn.U_{\epsilon}(K_{n})\subset U_{\epsilon}(N_{n}^{{}^{\prime\prime}})\subset U_{\epsilon}(P_{1}^{\epsilon})\subset P_{1}^{\epsilon^{{}^{\prime}}}\subset N_{n}^{{}^{\prime}}.

Now we are able to continue the attractor. Instead of an admissibility assumption for the perturbed map we will use weak compactness of close δ\delta-neighborhoods of compact sets.

Definition 4 (weak attractor).

Suppose YY has a weaker (Hausdorff) topology (i.e. xnxx_{n}\to x strongly implies xnxx_{n}\rightharpoonup x weakly) and ff is continuous and weakly continuous. An isolated invariant set KK is called a weak attractor if it admits a positive ff-invariant isolating neighborhood NN such that ωweak(N)K\omega^{\tiny\mbox{weak}}(N)\subset K where ωweak(N)\omega^{\tiny\mbox{weak}}(N) is defined as

ωweak(N)={yY|xnN,mns.t. fmn(xn)y}.\omega^{\tiny\mbox{weak}}(N)=\{y\in Y\,|\,\exists x_{n}\in N,m_{n}\to\infty\,\mbox{s.t. }\,f^{m_{n}}(x_{n})\rightharpoonup y\}.
Remark.

(1) Our definition of weakness of an attractor is w.r.t. the weaker topology and is different from one defined in [Hur01]. Even our definition of a (strong) attractor is weaker than the one used there because we only require the existence of a positive invariant isolating neighborhood of the invariant set. But there might be a connection to Ochs’ weak random attractor [Och99].

(2) A Conley theory with weak-admissibility instead of admissibility might not make sense since the continuation proof requires continuity of the metric and usually the metric is only lower semicontinuous w.r.t. weak convergence.

(3) A weakly continuous function might not be continuous and vice versa (see counterexample in the proof of theorem 8)

Theorem 5.

Under the assumption of the previous theorem, suppose there is a weaker (Hausdorff) topology on YY and that (strongly) closed δ\delta-neighborhoods of compact sets are weakly (sequentially) compact, i.e. clUδ(C)\operatorname{cl}U_{\delta}(C) is weakly compact for compact CC. If fnf_{n} is weakly continuous then KnK_{n} is non-empty and a weakly compact weak attractor w.r.t. fnf_{n} for all nn0n\geq n_{0}. Furthermore, Uϵ(Kn)NnU_{\epsilon}(K_{n})\subset N_{n}^{{}^{\prime}} for some ϵ>0\epsilon>0 and positive fnf_{n}-invariant NnN_{n}^{{}^{\prime}}.

Proof.

Applying the previous corollary we get

NnUδ(K)NnBN_{n}\subset U_{\delta}(K)\subset N_{n}^{{}^{\prime}}\subset B

and

Uϵ(Kn)Nn.U_{\epsilon}(K_{n})\subset N_{n}^{{}^{\prime}}.

Because fnf_{n} is weakly continuous, NnN_{n} is positive fnf_{n}-invariant and clUδ(K)Nn\operatorname{cl}U_{\delta}(K)\subset N_{n}^{{}^{\prime}} is closed and thus weakly compact the set

ωnweak(x)={yY|fnnk(x)yfor somenk}\omega_{n}^{\tiny\mbox{weak}}(x)=\{y\in Y\,|\,f_{n}^{n_{k}}(x)\rightharpoonup y\,\mbox{for some}\,n_{k}\to\infty\}

for xNnx\in N_{n} is non-empty and weakly compact. This implies KnK_{n}\neq\varnothing and in particular

ωnweak(x)KnNn.\omega_{n}^{\tiny\mbox{weak}}(x)\subset K_{n}\subset N_{n}.

Similarly weak compactness of clUδ(K)\operatorname{cl}U_{\delta}(K) implies ωnweak(Nn)An(Nn)=KnNn\omega_{n}^{\tiny\mbox{weak}}(N_{n})\subset A_{n}(N_{n}^{{}^{\prime}})=K_{n}\subset N_{n} and thus KnK_{n} is a weak attractor. Obviously KnK_{n} is weakly closed and contained in the weakly compact set clUδ(K)\operatorname{cl}U_{\delta}(K) and is therefore weakly compact as well. ∎

2. Wasserstein spaces

Now we will introduce some notation and results for Wasserstein spaces of a metric space, general references are [AGS08] and [Vil09].

Let (X,d)(X,d) be a complete separable metric space, also called Polish space. We call it proper if every bounded closed set is compact. In particular, this implies that XX is locally compact. The metric of a non-compact proper metric space is necessarily unbounded.

The space of probability measures on the Borel σ\sigma-algebra of XX is denoted by 𝒫(X)\mathcal{P}(X). This space is given the weak topology, i.e. μnμ\mu_{n}\rightharpoonup\mu if f𝑑μnf𝑑μ\int fd\mu_{n}\to\int fd\mu for all bounded continuous functions ff . Let x0x_{0} be an arbitrary point of XX and define 𝒫p(X)\mathcal{P}_{p}(X), the Wasserstein space (of order pp), by

𝒫p(X)={μ𝒫(X)|d(x0,x)p𝑑μ(x)<}.\mathcal{P}_{p}(X)=\{\mu\in\mathcal{P}(X)\,|\,\int d(x_{0},x)^{p}d\mu(x)<\infty\}.

Furthermore, define for μ,ν𝒫p(X)\mu,\nu\in\mathcal{P}_{p}(X)

wp(μ,ν)=(infπΠ(μ,ν)d(x,y)p𝑑π(x,y))1pw_{p}(\mu,\nu)=\left(\inf_{\pi\in\Pi(\mu,\nu)}\int d(x,y)^{p}d\pi(x,y)\right)^{\frac{1}{p}}

where πΠ(μ,ν)𝒫(X×X)\pi\in\Pi(\mu,\nu)\subset\mathcal{P}(X\times X) with π(A×X)=μ\pi(A\times X)=\mu and π(X×A)=ν\pi(X\times A)=\nu for all Borel sets AA and BB. Then (𝒫p(X),wp)(\mathcal{P}_{p}(X),w_{p}) is a complete separable metric space. This topology is usually stronger than the induced subspace topology of 𝒫p(X)𝒫(X)\mathcal{P}_{p}(X)\subset\mathcal{P}(X).

If XX is compact so is 𝒫p(X)\mathcal{P}_{p}(X). And 𝒫p(X)\mathcal{P}_{p}(X) is local compact only if XX is compact. A counterexample for non-proper metric spaces is given in [AGS08, 7.1.9]. We will adjust their example to non-compact proper metric spaces by showing that the closed ϵ\epsilon-ball Bϵwp(δx0)B_{\epsilon}^{w_{p}}(\delta_{x_{0}}) around δx0\delta_{x_{0}} in 𝒫p(X)\mathcal{P}_{p}(X) cannot be compact for any ϵ>0\epsilon>0 and thus 𝒫p(X)\mathcal{P}_{p}(X) cannot be locally compact.

Example.

Assume XX is non-compact and proper and define

μn=mnδxn+(1mn)δx0\mu_{n}=m_{n}\delta_{x_{n}}+(1-m_{n})\delta_{x_{0}}

for some sequence {xn}xX\{x_{n}\}_{x\in\mathbb{N}}\subset X. Then

wp(μn,δx0)p=mnd(xn,x0)p.w_{p}(\mu_{n},\delta_{x_{0}})^{p}=m_{n}d(x_{n},x_{0})^{p}.

Suppose d(xn,x0)ϵ>0d(x_{n},x_{0})\geq\epsilon>0 and set mn=ϵpd(xn,x0)pm_{n}=\epsilon^{p}\cdot d(x_{n},x_{0})^{-p} then μnBϵwp(δx0)\mu_{n}\in\partial B_{\epsilon}^{w_{p}}(\delta_{x_{0}}). If {mn}n\{m_{n}\}_{n\in\mathbb{N}} stays bounded away from 0 then {d(xn,x0)}n\{d(x_{n},x_{0})\}_{n\in\mathbb{N}} is bounded and thus {xn}n\{x_{n}\}_{n\in\mathbb{N}} and {mn}n\{m_{n}\}_{n\in\mathbb{N}} have convergent subsequences xnxx_{n^{\prime}}\to x_{\infty} and mnm(0,1]m_{n^{\prime}}\to m\in(0,1] and μnmδx+(1m)δx0\mu_{n}\to m\delta_{x_{\infty}}+(1-m)\delta_{x_{0}} strongly in 𝒫p(X)\mathcal{P}_{p}(X). But if we assume d(xn,x0)d(x_{n},x_{0})\to\infty then mn0m_{n}\to 0 and thus μnδx0\mu_{n}\rightharpoonup\delta_{x_{0}} weakly. Because strong convergence requires that wp(μn,δx0)=ϵw_{p}(\mu_{n},\delta_{x_{0}})=\epsilon converges to 0 the sequence cannot converge strongly in 𝒫p(X)\mathcal{P}_{p}(X).

Even though Wasserstein spaces are in general not locally compact we can still show that the following holds for proper metric spaces. The result is probably known or at least implicitly used in case X=nX=\mathbb{R}^{n}. Because it will be our main reason why the “weak” Conley theory is applicable and because we couldn’t find any reference, we will prove it completely.

Theorem 6.

If (X,d)(X,d) is a proper metric space then all closed δ\delta-neighborhoods of compact sets in (𝒫1(X),w1)(\mathcal{P}_{1}(X),w_{1}) are weakly compact, where the weak topology of 𝒫1(X)\mathcal{P}_{1}(X) is the induced subspace topology 𝒫1(X)𝒫(X)\mathcal{P}_{1}(X)\subset\mathcal{P}(X). A space, e.g. 𝒫1(X)\mathcal{P}_{1}(X), having this property may be called weakly proper.

Corollary 7.

For 1p<q1\leq p<q closed δ\delta-neighborhoods of compact sets in (𝒫q(X),wq)(\mathcal{P}_{q}(X),w_{q}) are compact in (𝒫p(X),wp)(\mathcal{P}_{p}(X),w_{p}), i.e. 𝒫q(X)\mathcal{P}_{q}(X) is weakly proper w.r.t. the subspace topology induced by 𝒫q(X)𝒫p(X)\mathcal{P}_{q}(X)\subset\mathcal{P}_{p}(X).

Remark.

This is stronger then a compact embedding i:(𝒫q(X),wq)(𝒫p(X),wp)i:(\mathcal{P}_{q}(X),w_{q})\to(\mathcal{P}_{p}(X),w_{p}) because if μn𝒫q(X)\mu_{n}\in\mathcal{P}_{q}(X) is bounded then w.l.o.g. i(μn)μi(\mu_{n})\to\mu_{*} in 𝒫p(X)\mathcal{P}_{p}(X) and necessarily μ𝒫q(X)\mu_{*}\in\mathcal{P}_{q}(X), i.e. bounded sequences never “leave” the space.

Proof of theorem.

We will show that

Brw:={ν𝒫1(X)|w1(ν,δx0)r}B_{r}^{w}:=\{\nu\in\mathcal{P}_{1}(X)\,|\,w_{1}(\nu,\delta_{x_{0}})\leq r\}

is weakly compact for all r0r\geq 0. Since BrwB_{r}^{w} is closed and μnμ\mu_{n}\rightharpoonup\mu implies w1(μn,δx0)lim infnw1(μn,δx0)rw_{1}(\mu_{n},\delta_{x_{0}})\leq\liminf_{n\to\infty}w_{1}(\mu_{n},\delta_{x_{0}})\leq r we only need to show that BrwB_{r}^{w} is tight.

Tightness of a subset 𝒦𝒫1(X)\mathcal{K}\subset\mathcal{P}_{1}(X) means for all ϵ>0\epsilon>0 there is a compact KϵK_{\epsilon} such that for all μ𝒦\mu\in\mathcal{K}

μ(X\Kϵ)ϵ.\mu(X\backslash K_{\epsilon})\leq\epsilon.

For μBrw\mu\in B_{r}^{w} we have

Xd(x,x0)𝑑μ=w1(μ,δx0)r.\int_{X}d(x,x_{0})d\mu=w_{1}(\mu,\delta_{x_{0}})\leq r.

Now choose Kϵ=Brϵ(x0)K_{\epsilon}=B_{\frac{r}{\epsilon}}(x_{0}), the closed ball around x0x_{0} with radius rϵ\frac{r}{\epsilon}, which is compact because XX is proper. Then we have

μ(X\Kϵ)=X\Kϵ𝑑μ(x)ϵrX\Kϵd(x,x0)𝑑μ(x)ϵ.\mu(X\backslash K_{\epsilon})=\int_{X\backslash K_{\epsilon}}d\mu(x)\leq\frac{\epsilon}{r}\int_{X\backslash K_{\epsilon}}d(x,x_{0})d\mu(x)\leq\epsilon.

Thus the closed ball in 𝒫1(X)\mathcal{P}_{1}(X) around δx0\delta_{x_{0}} is weakly compact.

Let K𝒫1(X)K\subset\mathcal{P}_{1}(X) be a compact set , e.g. K={μ}K=\{\mu\}, then the closed RR-neighborhood around KK is defined as

NRw(K)={ν𝒫1(X)|w1(μ,ν)R for some μK}.N_{R}^{w}(K)=\{\nu\in\mathcal{P}_{1}(X)\,|\,w_{1}(\mu,\nu)\leq R\,\mbox{ for some }\mu\in K\}.

This set is closed and bounded and for some R~\tilde{R} we have

NRw(K)BR~w.N_{R}^{w}(K)\subset B_{\tilde{R}}^{w}.

Let νnNRw(K)\nu_{n}\in N_{R}^{w}(K) be an arbitrary sequence. Then there are μnK\mu_{n}\in K with w1(μn,νn)Rw_{1}(\mu_{n},\nu_{n})\leq R. Because BR~wB_{\tilde{R}}^{w} is weakly compact and KK is compact there are νBR~w\nu_{\infty}\in B_{\tilde{R}}^{w} and μK\mu_{\infty}\in K such that for some subsequence (also denoted by μn\mu_{n}, resp. νn\nu_{n})

μn\displaystyle\mu_{n} \displaystyle\to μ\displaystyle\mu_{\infty}
νn\displaystyle\nu_{n} \displaystyle\rightharpoonup ν.\displaystyle\nu_{\infty}.

Since w1(,)w_{1}(\cdot,\cdot) is weakly lower semicontinuity we have

w1(μ,ν)lim infnw1(μn,νn)R,w_{1}(\mu,\nu)\leq\liminf_{n\to\infty}w_{1}(\mu_{n},\nu_{n})\leq R,

i.e. νNRw(K)\nu\in N_{R}^{w}(K) which implies weak compactness. ∎

Proof of corollary.

We only show that Brwq(δx0)B_{r}^{w_{q}}(\delta_{x_{0}}) is weakly compact w.r.t. the induced subspace topology 𝒫q(X)𝒫p(X)\mathcal{P}_{q}(X)\subset\mathcal{P}_{p}(X) for 1p<q1\leq p<q. The rest will follow by the same arguments used above.

Assume {μn}nBrwq(δx0)\{\mu_{n}\}_{n\in\mathbb{N}}\subset B_{r}^{w_{q}}(\delta_{x_{0}}). Since wqw1w_{q}\leq w_{1} the previous theorem implies w.l.o.g. μnμ\mu_{n}\rightharpoonup\mu_{\infty} for some μ𝒫1(X)\mu_{\infty}\in\mathcal{P}_{1}(X). Because

wq(μ,δx0)lim infnwq(μn,δx0)rw_{q}(\mu_{\infty},\delta_{x_{0}})\leq\liminf_{n\to\infty}w_{q}(\mu_{n},\delta_{x_{0}})\leq r

we actually have μBrwq(δx0)𝒫q(X)\mu_{\infty}\in B_{r}^{w_{q}}(\delta_{x_{0}})\subset\mathcal{P}_{q}(X).

Because 1p<q1\leq p<q

X\BRd(x,x0)p𝑑μn(x)1RqpX\BRd(x,x0)q𝑑μn(x)rRqp.\int_{X\backslash B_{R}}d(x,x_{0})^{p}d\mu_{n}(x)\leq\frac{1}{R^{q-p}}\int_{X\backslash B_{R}}d(x,x_{0})^{q}d\mu_{n}(x)\leq\frac{r}{R^{q-p}}.

Hence

limRlim supnX\BRd(x,x0)p𝑑μn(x)limRrRqp=0.\lim_{R\to\infty}\limsup_{n\to\infty}\int_{X\backslash B_{R}}d(x,x_{0})^{p}d\mu_{n}(x)\leq\lim_{R\to\infty}\frac{r}{R^{q-p}}=0.

This and the μnμ\mu_{n}\rightharpoonup\mu_{\infty} weakly show that μnμ\mu_{n}\to\mu_{\infty} in 𝒫p(X)\mathcal{P}_{p}(X) (see [Vil09, 6.8]). ∎

In the following assume that XX is proper. Suppose now f:XXf:X\to X is a continuous map having a global (set) attractor, i.e. there is a compact ff-invariant AXA\subset X such that for all bounded sets BB

limnd~(fn(B),A)=0,\lim_{n\to\infty}\tilde{d}(f^{n}(B),A)=0,

where d~\tilde{d} is the semi-Hausdorff metric induced by dd such that d~(A,B)=0\tilde{d}(A,B)=0 iff AclBA\subset\operatorname{cl}B. The map ff induces a continuous map f:𝒫(X)𝒫(X)f_{*}:\mathcal{P}(X)\to\mathcal{P}(X) with f(μ)(B)=μ(f1(B))f_{*}(\mu)(B)=\mu(f^{-1}(B)) for all Borel set BB. Furthermore, under slightly stronger assumptions ff_{*} has a global attractor

𝒫(A)={μ𝒫(X)|μ(A)=1}.\mathcal{P}(A)=\{\mu\in\mathcal{P}(X)\,|\,\mu(A)=1\}.
Remark.

For compact XX the global attractor is always XX itself. In particular, since 𝒫p(X)=𝒫(X)\mathcal{P}_{p}(X)=\mathcal{P}(X) for 1p<1\leq p<\infty is compact the global attractor of 𝒫(X)\mathcal{P}(X) is the space itself and we don’t get new information. The whole theory is only interesting for non-compact proper metric spaces XX.

Since the Wasserstein space includes distance the behavior of ff at infinity becomes important.

Theorem 8.

The map ff_{*} induces a continuous map (𝒫p(X),wp)(𝒫p(X),wp)(\mathcal{P}_{p}(X),w_{p})\to(\mathcal{P}_{p}(X),w_{p}) (also denoted by ff_{*}) if and only if for some x0Xx_{0}\in X

supxXd(f(x),x0)1+d(x,x0)<.\sup_{x\in X}\frac{d(f(x),x_{0})}{1+d(x,x_{0})}<\infty.
Remark.

For a semiflow π\pi we need that supxXd(xπt,x0)/1+d(x,x0)=Mt<\sup_{x\in X}\nicefrac{{d(x\pi t,x_{0})}}{{1+d(x,x_{0})}}=M_{t}<\infty for t[0,T0]t\in[0,T_{0}]. Which means, in particular, that there has to be a global lower bound on the blow-up time and thus there cannot be blow-ups at all, i.e. π\pi has to be a global semiflow. Which implies that the induced semiflow π\pi_{*} on 𝒫p(X)\mathcal{P}_{p}(X) is also a global semiflow. A necessary requirement for the existence of a global attractor is an upper bound on MtM_{t} for all t0t\geq 0. The requirements in [AGS08, Chapter 8] are sometimes too strong. A sufficient condition is that a one-sided Lipschitz condition holds globally (e.g. v(x)=xx3v(x)=x-x^{3} is an unbounded vector field and only locally Lipschitz, but satisfies a one-sided Lipschitz condition).

Proof.

Suppose first that

M=supxXd(f(x),x0)1+d(x,x0)<.M=\sup_{x\in X}\frac{d(f(x),x_{0})}{1+d(x,x_{0})}<\infty.

We can assume w.l.o.g. M>0M>0, otherwise f|Xx0f|_{X}\equiv x_{0} and the result is obvious. For μ𝒫p(X)\mu\in\mathcal{P}_{p}(X) we have

d(x,x0)p𝑑fμ(x)\displaystyle\int d(x,x_{0})^{p}df_{*}\mu(x) =\displaystyle= d(f(x),x0)p𝑑μ(x)\displaystyle\int d(f(x),x_{0})^{p}d\mu(x)
\displaystyle\leq Mp(1+d(x,x0))p𝑑μ(x)<,\displaystyle M^{p}\int(1+d(x,x_{0}))^{p}d\mu(x)<\infty,

i.e. f(μ)𝒫p(X)f_{*}(\mu)\in\mathcal{P}_{p}(X). So we only need to show continuity.

Suppose μnμ\mu_{n}\to\mu in 𝒫p(X)\mathcal{P}_{p}(X) then fμnfμf_{*}\mu_{n}\rightharpoonup f_{*}\mu. Since d(f(),x0)pd(f(\cdot),x_{0})^{p} is continuous and grows at most like d(,x0)pd(\cdot,x_{0})^{p} it follows that

d(x,x0)p𝑑fμn(x)\displaystyle\int d(x,x_{0})^{p}df_{*}\mu_{n}(x) =\displaystyle= d(f(x),x0)p𝑑μn(x)\displaystyle\int d(f(x),x_{0})^{p}d\mu_{n}(x)
d(f(x),x0)p𝑑μ(x)=d(x,x0)p𝑑fμ(x).\displaystyle\longrightarrow\int d(f(x),x_{0})^{p}d\mu(x)=\int d(x,x_{0})^{p}df_{*}\mu(x).

Thus fμnfμf_{*}\mu_{n}\to f_{*}\mu (see [Vil09, 6.8]) which shows that f:(𝒫p(X),wp)(𝒫p(X),wp)f_{*}:(\mathcal{P}_{p}(X),w_{p})\to(\mathcal{P}_{p}(X),w_{p}) is (strongly) continuous.

It remains to show that ff_{*} is not continuous if there is a sequence {xn}nX\{x_{n}\}_{n\in\mathbb{N}}\subset X such that

dn=d(f(xn),x0)1+d(xn,x0).d_{n}=\frac{d(f(x_{n}),x_{0})}{1+d(x_{n},x_{0})}\to\infty.

Because XX is proper and ff continuous we must have d(xn,x0)d(x_{n},x_{0})\to\infty. For large nn we can assume 0<1dn<d(xn,x0)0<\frac{1}{d_{n}}<d(x_{n},x_{0}). Set cn=1dnc_{n}=\frac{1}{d_{n}} then

μn=cnp1d(xn,x0)pδxn+(1cnp1d(xn,x0)p)δx0𝒫1(X)\mu_{n}=c_{n}^{p}\frac{1}{d(x_{n},x_{0})^{p}}\delta_{x_{n}}+(1-c_{n}^{p}\frac{1}{d(x_{n},x_{0})^{p}})\delta_{x_{0}}\in\mathcal{P}_{1}(X)

and wp(μn,δx0)p=cnp0w_{p}(\mu_{n},\delta_{x_{0}})^{p}=c_{n}^{p}\to 0, i.e. μnδx0\mu_{n}\to\delta_{x_{0}} strongly in 𝒫p(X)\mathcal{P}_{p}(X). We have

fμn=cnp1d(xn,x0)pδf(xn)+(1cnp1d(xn,x0)p)δf(x0)f_{*}\mu_{n}=c_{n}^{p}\frac{1}{d(x_{n},x_{0})^{p}}\delta_{f(x_{n})}+(1-c_{n}^{p}\frac{1}{d(x_{n},x_{0})^{p}})\delta_{f(x_{0})}

and therefore

wp(fμn,fδx0)p=cnpd(f(xn),x0)pd(xn,x0)p=1+d(xn,x0)pd(xn,x0)p1w_{p}(f_{*}\mu_{n},f_{*}\delta_{x_{0}})^{p}=c_{n}^{p}\frac{d(f(x_{n}),x_{0})^{p}}{d(x_{n},x_{0})^{p}}=\frac{1+d(x_{n},x_{0})^{p}}{d(x_{n},x_{0})^{p}}\to 1

which implies that ff_{*} cannot be (strongly) continuous in δx0\delta_{x_{0}}. ∎

The following results will hold for any 𝒫p(X)\mathcal{P}_{p}(X). To simplify the notation and some of the proofs we will just state them for 𝒫1(X)\mathcal{P}_{1}(X). Furthermore, we assume from now on that f:𝒫1(X)𝒫1(X)f_{*}:\mathcal{P}_{1}(X)\to\mathcal{P}_{1}(X) is strongly continuous (for short just continuous) and whenever we speak about ff_{*} we mean the map f:𝒫1(X)𝒫1(X)f_{*}:\mathcal{P}_{1}(X)\to\mathcal{P}_{1}(X). Since f(𝒫1(X))𝒫1(X)f_{*}(\mathcal{P}_{1}(X))\subset\mathcal{P}_{1}(X) (for f:𝒫(X)𝒫(X)f_{*}:\mathcal{P}(X)\to\mathcal{P}(X)) this also implies that ff_{*} is weakly continuous in 𝒫1(X)\mathcal{P}_{1}(X). Similarly we could say that f:𝒫q(X)𝒫q(X)f_{*}:\mathcal{P}_{q}(X)\to\mathcal{P}_{q}(X) is continuous and “weakly” continuous in 𝒫q(X)\mathcal{P}_{q}(X) w.r.t. the induced subspace topology of 𝒫q(X)𝒫p(X)\mathcal{P}_{q}(X)\subset\mathcal{P}_{p}(X) for any 1p<q1\leq p<q.

Example.

Having a global attractor does not imply that ff_{*} is strongly continuous, even finite time compactness is not sufficient: Let XX be \mathbb{R} with the Euclidean metric |||\cdot|. Define f:f:\mathbb{R}\to\mathbb{R} by

f(x)={0x0x2x<0.f(x)=\begin{cases}0&x\geq 0\\ x^{2}&x<0.\end{cases}

Then ff is continuous and f20f^{2}\equiv 0 but for xn=nx_{n}=-n

d(f(xn),0)1+d(xn,0)=n21+n,\frac{d(f(x_{n}),0)}{1+d(x_{n},0)}=\frac{n^{2}}{1+n}\to\infty,

i.e. ff_{*} is not continuous on (𝒫p(X),wp)(\mathcal{P}_{p}(X),w_{p}).

If K𝒫1(X)K\subset\mathcal{P}_{1}(X) is invariant w.r.t. ff_{*} then it is invariant w.r.t. f:𝒫(X)𝒫(X)f_{*}:\mathcal{P}(X)\to\mathcal{P}(X). Which implies that all measures in KK are supported on the global attractor AA of ff. Since 𝒫(A)𝒫1(X)\mathcal{P}(A)\subset\mathcal{P}_{1}(X) the maximal invariant set of 𝒫1(X)\mathcal{P}_{1}(X) is 𝒫(A)=𝒫1(A)\mathcal{P}(A)=\mathcal{P}_{1}(A).

Suppose ff is finite time compact, i.e. there is an mm such that fm(X)BRf^{m}(X)\subset B_{R} for some compact set BRB_{R}. It should be obvious that this implies K=𝒫1(A)K=\mathcal{P}_{1}(A) is the global attractor of ff_{*}. Furthermore, we have the following:

Proposition 9.

Suppose for some m>0m>0,

fm(X)BR(x0).f^{m}(X)\subset B_{R}(x_{0}).

Then ff_{*} is finite time compact and thus any closed set Bw𝒫1(X)B^{w}\subset\mathcal{P}_{1}(X) is ff_{*}-admissible.

Proof.

Let {νn}n\{\nu_{n}\}_{n\in\mathbb{N}} be any sequence in fm(𝒫1(X))f_{*}^{m}(\mathcal{P}_{1}(X)). Then there is a sequence {μn}n\{\mu_{n}\}_{n\in\mathbb{N}} such that νn=fm(μn)\nu_{n}=f_{*}^{m}(\mu_{n}). fm(X)BR(x0)f^{m}(X)\subset B_{R}(x_{0}) implies suppνnBR(x0)\operatorname{supp}\nu_{n}\subset B_{R}(x_{0}). Thus νn\nu_{n} is tight and

d(x,x0)R+ϵd(x,x0)𝑑νn(x)=0,\int_{d(x,x_{0})\geq R+\epsilon}d(x,x_{0})d\nu_{n}(x)=0,

i.e. {νn}n\{\nu_{n}\}_{n\in\mathbb{N}} has uniformly integrable first moments. Which means that {νn}n\{\nu_{n}\}_{n\in\mathbb{N}} has a convergent subsequence. Therefore, fm(𝒫1(X))f_{*}^{m}(\mathcal{P}_{1}(X)) is compact which easily implies admissibility for any closed BwB^{w}.∎

Proposition 10.

Suppose there is an R0R_{0}, 0c<10\leq c<1 and m>0m>0 such that for all RR0R\geq R_{0}

fm(BR)BcR.f^{m}(B_{R})\subset B_{cR}.

Then any bounded closed set Bw𝒫1(X)B^{w}\subset\mathcal{P}_{1}(X) is ff_{*}-admissible.

Proof.

Let {μn}n\{\mu_{n}\}_{n\in\mathbb{N}} be a sequence in BwB^{w} such that f[0,mn](μn)Bwf_{*}^{[0,m_{n}]}(\mu_{n})\subset B^{w} for some mnm_{n}\to\infty. Because BwB^{w} is bounded we have

Xd(x,x0)𝑑μn(x)M.\int_{X}d(x,x_{0})d\mu_{n}(x)\leq M.

First assume mn=knmm_{n}=k_{n}\cdot m for an unbounded sequence knk_{n}\in\mathbb{N}. Then for RR0R\geq R_{0}

χX\BR(x)d(x,x0)𝑑fmnμn(x)\displaystyle\int\chi_{X\backslash B_{R}}(x)\cdot d(x,x_{0})df_{*}^{m_{n}}\mu_{n}(x) =\displaystyle= χX\BR(fmn(x))d(fmn(x),x0)𝑑μn(x)\displaystyle\int\chi_{X\backslash B_{R}}(f^{m_{n}}(x))\cdot d(f^{m_{n}}(x),x_{0})d\mu_{n}(x)
\displaystyle\leq χX\BcknR(x)cknd(x,x0)𝑑μn(x)\displaystyle\int\chi_{X\backslash B_{c^{-k_{n}}R}}(x)\cdot c^{k_{n}}d(x,x_{0})d\mu_{n}(x)
\displaystyle\leq cknM0,\displaystyle c^{k_{n}}M\to 0,

which shows that {fmn(μn)}n\{f_{*}^{m_{n}}(\mu_{n})\}_{n\in\mathbb{N}} has uniformly integrable first moments which implies that the sequence has a convergent subsequence.

If mn0(modm)m_{n}\not\equiv 0(\operatorname{mod}m) then for some 0ln<m0\leq l_{n}<m we have mnln0(modm)m_{n}-l_{n}\equiv 0(\operatorname{mod}m). Therefore, if we set νn=fln(μ)\nu_{n}=f_{*}^{l_{n}}(\mu) then the argument above applies to νn\nu_{n} and the sequence of endpoints (which is equal to {fmn(μn)}n\{f_{*}^{m_{n}}(\mu_{n})\}_{n\in\mathbb{N}}) has a convergent subsequence.∎

Remark.

Kifer used in [Kif88, Theorem 1.7] linear attraction instead of exponential. This might not be sufficient for admissibility. Nevertheless, later we will assume that a Markov-type perturbation of ff_{*} is small in the Wasserstein distance which is stronger than Kifer’s assumption and thus an invariant (probability) measure exists for the perturbation by the same theorem. But that theorem does not imply that the perturbed and unperturbed invariant measures are close w.r.t. the Wasserstein distance, the perturbed invariant measures might not even be in the Wasserstein space. So our result improves this sufficiently.

Before we show how to use Conley theory for small Markov-type noise applied to ff we give a sufficient condition such that bounded sets in 𝒫1(X)\mathcal{P}_{1}(X) are {Fn}\{F_{n}\}-admissible for FnfF_{n}\to f_{*}.

Proposition 11.

Let BwB^{w} be closed and bounded and UU be a δ\delta-neighborhood of BwB^{w} with ff_{*}-admissible closure. Suppose FnfF_{n}\to f_{*} uniformly on some UU, i.e.

supμUw1(f(μ),Fn(μ))=ϵn0.\sup_{\mu\in U}w_{1}(f_{*}(\mu),F_{n}(\mu))=\epsilon_{n}\to 0.

If ff_{*} is uniformly continuous in UU then BwB^{w} is {Fn}\{F_{n}\}-admissible.

Remark.

(1) The idea is to use the uniform convergence and uniform continuity to construct longer and longer orbits of ff_{*} close the the last part of the orbits of FnF_{n}, i.e. the orbit f[0,mnkn](yn)f^{[0,m_{n}-k_{n}]}(y_{n}) and Fn[kn,mn](xn)F_{n}^{[k_{n},m_{n}]}(x_{n}) should be closer and closer and mnknm_{n}-k_{n}\to\infty for yn=Fnkn(xn)y_{n}=F_{n}^{k_{n}}(x_{n}).

(2) Uniform continuity of ff_{*} and uniform convergence of FnfF_{n}\to f_{*} are the assumptions Benci [Ben91] used to prove his continuation theorem for the Conley index. Besides having continuous time dynamical systems he also needs invertibility.

Proof.

Let μnN\mu_{n}\in N and mnm_{n}\to\infty be sequences with Fn[0,mn](μn)F_{n}^{[0,m_{n}]}(\mu_{n}). Uniform convergence of FnfF_{n}\to f_{*} and uniform continuity of ff imply that for some ϵ(ϵn)0\epsilon(\epsilon_{n})\to 0 as ϵn0\epsilon_{n}\to 0

w1(Fn2(μ),f2(μ))\displaystyle w_{1}(F_{n}^{2}(\mu),f_{*}^{2}(\mu)) \displaystyle\leq w1(Fn2(μ),f(Fn(μ))+w1(f(Fn(μ)),f2(μ))\displaystyle w_{1}(F_{n}^{2}(\mu),f_{*}(F_{n}(\mu))+w_{1}(f_{*}(F_{n}(\mu)),f_{*}^{2}(\mu))
\displaystyle\leq ϵn+ϵ(ϵn)=:ϵn,20as n.\displaystyle\epsilon_{n}+\epsilon(\epsilon_{n})=:\epsilon_{n,2}\to 0\,\mbox{as \,}n\to\infty.

Similarly we can show that there are ϵn,k0\epsilon_{n,k}\to 0 as nn\to\infty such that

w1(Fnk(μ),fk(μ))ϵn,k0.w_{1}(F_{n}^{k}(\mu),f_{*}^{k}(\mu))\leq\epsilon_{n,k}\to 0.

Therefore, there is a sequence kn0k_{n}\geq 0 with mnknm_{n}-k_{n}\to\infty such that Fn[0,mn](μn)BwF_{n}^{[0,m_{n}]}(\mu_{n})\subset B^{w} implies that

f[0,mnkn](νn)U=Uδ(Bw)f_{*}^{[0,m_{n}-k_{n}]}(\nu_{n})\subset U=U_{\delta}(B^{w})

for νn=Fnkn(μn)\nu_{n}=F_{n}^{k_{n}}(\mu_{n}). Furthermore, we can choose knk_{n} such that

δn=maxk[0,mnkn]ϵn,k0\delta_{n}=\max_{k\in[0,m_{n}-k_{n}]}\epsilon_{n,k}\to 0

and therefore

w1(Fnmn(μn),fmnkn(νn))δn.w_{1}(F_{n}^{m_{n}}(\mu_{n}),f_{*}^{m_{n}-k_{n}}(\nu_{n}))\leq\delta_{n}.

Because the closure of UU is ff_{*}-admissible, the sequence of endpoints {fmnkn(νn)}nU\{f_{*}^{m_{n}-k_{n}}(\nu_{n})\}_{n\in\mathbb{N}}\subset U has a convergent subsequence which implies that {Fnmn(μn)}n\{F_{n}^{m_{n}}(\mu_{n})\}_{n\in\mathbb{N}} has a convergent subsequence, in particular the limit point is in BwB^{w}. ∎

This proposition applies in particular to Lipschitz continuous functions f:XXf:X\to X because the induce map f:𝒫1(X)𝒫1(X)f_{*}:\mathcal{P}_{1}(X)\to\mathcal{P}_{1}(X) is Lipschitz continuous as well.

E.g. suppose X=nX=\mathbb{R}^{n} and ff is the time hh map of a flow generated by an ODE x˙=g(x)\dot{x}=g(x) such that gg satisfies the one-sided Lipschitz condition for some MM\in\mathbb{R}

xy,g(x)g(y)Mxy2\langle x-y,g(x)-g(y)\rangle\leq M\|x-y\|^{2}

then ff is Lipschitz continuous with constant eMhe^{Mh}.

3. Markov-Wasserstein maps

Definition 12.

A Markov-Wasserstein map (MW-map) of order pp is a continuous map P:𝒫p(X)𝒫p(X)P:\mathcal{P}_{p}(X)\to\mathcal{P}_{p}(X) which is convex linear, i.e. for μ,ν𝒫p(X)\mu,\nu\in\mathcal{P}_{p}(X) and a[0,1]a\in[0,1]

P(aμ+(1a)ν)=aP(μ)+(1a)P(ν).P(a\mu+(1-a)\nu)=aP(\mu)+(1-a)P(\nu).

Suppose PP is induced by a kernel p(dy|x):X𝒫p(X)p(dy|x):X\to\mathcal{P}_{p}(X) (necessarily continuous), i.e.

P:dμ(y)p(dy|x)𝑑μ(x).P:d\mu(y)\mapsto\int p(dy|x)d\mu(x).

The map P=PfMP=P_{f}^{M} is called an MW-map of order pp relative to ff with noise level (at most) MM if

supxXwp(p(dy|x),δf(x))M.\sup_{x\in X}w_{p}(p(dy|x),\delta_{f(x)})\leq M.

This implies by [Vil09, 4.8]

wp(P(μ),f(μ))pwp(p(dy|x),δf(x))p𝑑μ(x)Mp.w_{p}(P(\mu),f_{*}(\mu))^{p}\leq\int w_{p}(p(dy|x),\delta_{f(x)})^{p}d\mu(x)\leq M^{p}.
Remark.

As in the sections before, the results also hold for semiflows and a suitable definition for MW-semiflows, i.e. a continuous semigroups (Pt)t0(P_{t})_{t\geq 0} on 𝒫p(X)\mathcal{P}_{p}(X). The noise level model can be stated similarly, but we only require that it is uniformly small for all “small” tt. We will focus here only on maps, resp. MW-maps, because the intuition behind these is easier, but all results also hold for MW-semiflows if the noise level is sufficiently small.

MW-maps (resp. MW-semiflows) appear naturally in the theory of Markov chains (resp. processes). The map p(dy|):X𝒫p(X)p(dy|\cdot):X\to\mathcal{P}_{p}(X) is the Markov transition probability function, whereas the map P:𝒫p(X)𝒫p(X)P:\mathcal{P}_{p}(X)\to\mathcal{P}_{p}(X) almost never has a name. If P=fP=f_{*} for some dynamical system f:XXf:X\to X then PP is sometimes called transfer map. The following will show that we only need continuity of p(dy|)p(dy|\cdot) to ensure that PP is continuous in 𝒫p(X)\mathcal{P}_{p}(X) (and thus for any 𝒫q(X)\mathcal{P}_{q}(X), 1q<p1\leq q<p and for 𝒫(X)\mathcal{P}(X)).

Theorem 13.

Let p(dy|):X𝒫p(X)p(dy|\cdot):X\to\mathcal{P}_{p}(X) be a Markov kernel, i.e. a measure-valued map, continuously depending on xx. If M=supxXwp(p(dy|x),δx)<M=\sup_{x\in X}w_{p}(p(dy|x),\delta_{x})<\infty then PP defined by

P:dμ(y)p(dy|x)𝑑μ(x)P:d\mu(y)\mapsto\int p(dy|x)d\mu(x)

is an MW-map of order pp relative to id:XX\operatorname{id}:X\to X with noise level MM.

Remark.

For MW-semiflows weak continuity of pt(dy|)p_{t}(dy|\cdot) corresponds to Feller continuity of the corresponding stochastic process. In fact, if the initial distribution of (Xtn)t0(X_{t}^{n})_{t\geq 0} is δxn\delta_{x_{n}}, i.e. X0n=xnX_{0}^{n}=x_{n}, then by our continuity requirement if xnx0x_{n}\to x_{0} then μtn=PtδxtPtδx0=μt0\mu_{t}^{n}=P_{t}\delta_{x_{t}}\to P_{t}\delta_{x_{0}}=\mu_{t}^{0} which implies that

u(xn)=𝔼g(Xtn)=g(x)𝑑μtn(x)g(x)𝑑μt0(x)=𝔼g(Xt0)=u(x0)u(x_{n})=\mathbb{E}g(X_{t}^{n})=\int g(x)d\mu_{t}^{n}(x)\to\int g(x)d\mu_{t}^{0}(x)=\mathbb{E}g(X_{t}^{0})=u(x_{0})

for all bounded continuous function g(x)g(x), i.e. the stochastic process generated by (Pt)t0(P_{t})_{t\geq 0} is Feller continuous. The continuity of the moments implies that, in addition, the moments are also continuous. This condition could be called pp-Feller continuous. This type of continuity does not need tt\in\mathbb{R} and thus applies equally to Markov chains, i.e. discrete time stochastic processes.

Proof.

Continuous dependency implies that P:𝒫p(X)𝒫(X)P:\mathcal{P}_{p}(X)\to\mathcal{P}(X) is continuous. So we only need to show that μnμ\mu_{n}\to\mu in 𝒫p(X)\mathcal{P}_{p}(X) implies that PμnPμP\mu_{n}\to P\mu in 𝒫p(X)\mathcal{P}_{p}(X)

Because wp(p(dy|x),δx)Mw_{p}(p(dy|x),\delta_{x})\leq M for some M<M<\infty we have

d(y,x0)pp(dy|x)\displaystyle\int d(y,x_{0})^{p}p(dy|x) =\displaystyle= wp(p(dy|x),δx0)p\displaystyle w_{p}(p(dy|x),\delta_{x_{0}})^{p}
\displaystyle\leq (wp(p(dy|x),δx)+wp(δx,δx0))p\displaystyle(w_{p}(p(dy|x),\delta_{x})+w_{p}(\delta_{x},\delta_{x_{0}}))^{p}
\displaystyle\leq 2p1(Mp+d(x,x0)p)\displaystyle 2^{p-1}(M^{p}+d(x,x_{0})^{p})

This implies that g(x)=d(y,x0)pp(dy|x)g(x)=\int d(y,x_{0})^{p}p(dy|x) grows at most like d(x,x0)pd(x,x_{0})^{p}. Furthermore, gg is continuous because xp(dy|x)x\mapsto p(dy|x) and μwp(μ,δx0)p\mu\mapsto w_{p}(\mu,\delta_{x_{0}})^{p} are. Thus by [Vil09, 6.8]

d(y,x0)p𝑑Pμn(x)=g(x)𝑑μn(x)g(x)𝑑μ(x)=d(y,x0)p𝑑Pμ(y),\int d(y,x_{0})^{p}dP\mu_{n}(x)=\int g(x)d\mu_{n}(x)\to\int g(x)d\mu(x)=\int d(y,x_{0})^{p}dP\mu(y),

i.e. PμnPμP\mu_{n}\to P\mu in 𝒫p(X)\mathcal{P}_{p}(X).∎

Example.

(1) Bounded noise can be modeled by Markov maps with

M=supxXd(x,suppp(dy|x))<.M=\sup_{x\in X}d(x,\operatorname{supp}p(dy|x))<\infty.

Then wp(p(dy|x),δx)Mw_{p}(p(dy|x),\delta_{x})\leq M and thus continuity of p(dy|):X𝒫p(X)p(dy|\cdot):X\to\mathcal{P}_{p}(X) for some pp implies that of P:𝒫p(X)𝒫p(X)P:\mathcal{P}_{p}(X)\to\mathcal{P}_{p}(X). This could also be used to model multi-valued perturbations, i.e. maps fn:X2Xf_{n}:X\to 2^{X} with supxXd(f(x),fn(x))M\sup_{x\in X}d(f(x),f_{n}(x))\leq M.

(2) Let X=nX=\mathbb{R}^{n} with its Euclidean distance. If ν\nu is the standard normal distribution then ν=ρ(x)dx\nu=\rho(x)dx, where dxdx is the Lebesgue measure on n\mathbb{R}^{n}. Any normal distribution with mean xx and variance σ2\sigma^{2} can be modeled as follows

μx,σ2=δxρσ\mu_{x,\sigma^{2}}=\delta_{x}*\rho_{\sigma}

where ρϵ(x)=ϵnρ(x/ϵ)\rho_{\epsilon}(x)=\epsilon^{-n}\rho(x/\epsilon). For σ=0\sigma=0 we set μx,0=δx\mu_{x,0}=\delta_{x}.

Since mp=|x|pρ(x)𝑑x<m_{p}=\int|x|^{p}\rho(x)dx<\infty for all pp this implies (see [AGS08, 7.1.10]) that

wp(δx,μx,σ2)σmp.w_{p}(\delta_{x},\mu_{x,\sigma^{2}})\leq\sigma m_{p}.

Thus Gaussian noise with uniformly small variance is uniformly small in all Wasserstein spaces (although the noise level diverges to \infty as pp\to\infty).

Corollary 14.

If f:XXf:X\to X induces a continuous self map on 𝒫p(X)\mathcal{P}_{p}(X) and p(dy|):X𝒫p(X)p(dy|\cdot):X\to\mathcal{P}_{p}(X) is continuous with

M=supxXwp(p(dy|x),δf(x))<M=\sup_{x\in X}w_{p}(p(dy|x),\delta_{f(x)})<\infty

then P:𝒫p(X)𝒫p(X)P:\mathcal{P}_{p}(X)\to\mathcal{P}_{p}(X) defined as above is an MW map of order pp relative to ff with noise level MM.

A random perturbation can now be modeled as a composition of ff followed by a smearing via p(dy|)p(dy|\cdot), i.e. ff_{*} followed by P=PidMP=P_{\operatorname{id}}^{M}. This corresponds to additive noise depending only on the image, whereas a general MW-map relative to ff might smear the image f(x)f(x) and f(y)f(y) for xyx\neq y differently even if f(x)=f(y)f(x)=f(y).

If for some sequence p~n(dy|)\tilde{p}_{n}(dy|\cdot) the noise level

supxXwp(p~n(dy|x),δf(x))=ϵn\sup_{x\in X}w_{p}(\tilde{p}_{n}(dy|x),\delta_{f(x)})=\epsilon_{n}

converges to zero then P~n\tilde{P}_{n} converges to ff_{*} uniformly on 𝒫p(X)\mathcal{P}_{p}(X), i.e.

supμ𝒫p(X)wp(P~n(μ),f(μ))0.\sup_{\mu\in\mathcal{P}_{p}(X)}w_{p}(\tilde{P}_{n}(\mu),f_{*}(\mu))\to 0.

An MW-chain relative to ff satisfies the Markov property, i.e. future behavior only depends on the current state. Furthermore, this models only time-independent random perturbations. Time-dependent perturbations can be modeled with the result of [Kel11]. There it is shown that a local attractor can be continued if the non-autonomous perturbations is uniformly small. Translated into this framework this means

supμ𝒫p(X),kwp(P(μ,k),f(μ))<ϵ\sup_{\mu\in\mathcal{P}_{p}(X),k\in\mathbb{Z}}w_{p}(P(\mu,k),f_{*}(\mu))<\epsilon

for the non-autonomous dynamical system (\approx inhomogeneous Markov map)

(μ,k)(P(μ,k),k+1).(\mu,k)\mapsto(P(\mu,k),k+1).

Instead of using the semi-admissibility argument to show that the invariant set KnK_{n} is non-empty we can use a weak compactness argument to get the same result.

Example.

Consider the ODE with x˙=xx3\dot{x}=x-x^{3}. This satisfies the one-side Lipschitz condition with M=1M=1 and generates a global semiflow that attracts in finite time. Thus the time-one map induces a Lipschitz continuous map ff_{*} on 𝒫p(X)\mathcal{P}_{p}(X) which attracts in finite time, too. Therefore, any bounded closed set in 𝒫p(X)\mathcal{P}_{p}(X) is {Fn}\{F_{n}\}-admissible for FnfF_{n}\to f_{*} uniformly. In particular, the MW-map of order pp for small noise level has an attractor close to the original w.r.t. the Wasserstein metric wpw_{p}.

Theorem 15.

Suppose f:XXf:X\to X induces a dynamical system ff_{*} on 𝒫p(X)\mathcal{P}_{p}(X) having a global attractor and that ff_{*} is uniformly continuous in a neighborhood of the global attractor. If PnfP_{n}\to f_{*} is a sequence of MW-maps of order pp relative to ff with noise level ϵn0\epsilon_{n}\to 0. Then for nn0n\geq n_{0} there is a positive PnP_{n}-invariant isolating neighborhood NnN_{n} such that Kn=APn(Nn)K_{n}=A_{P_{n}}(N_{n}) is non-empty and a weakly compact weak attractor which contains all bounded PnP_{n}-invariant measures, i.e. APn(Pp(X))=KnA_{P_{n}}(P_{p}(X))=K_{n}. Furthermore, there is at least one stationary measure in KnK_{n}.

Remark.

Suppose p>1p>1. Whenever KK is (strongly) compact in 𝒫p(X)\mathcal{P}_{p}(X) then clUδ(K)\operatorname{cl}U_{\delta}(K) is weakly compact w.r.t. the weaker subspace topology of 𝒫p(X)\mathcal{P}_{p}(X) induced by 𝒫p(X)𝒫q(X)\mathcal{P}_{p}(X)\subset\mathcal{P}_{q}(X) for any 1q<p1\leq q<p. Thus for all μNn\mu\in N_{n} there is a μKKn\mu_{K}\in K_{n}

Pn(μ)wqμK.P_{n}(\mu)\overset{w_{q}}{\longrightarrow}\mu_{K}.

This means that, although the pp-moment may not converge, any qq-moment converges for 1q<p1\leq q<p, but the convergence may get worse the closer qq comes to pp.

Proof.

Everything but the existence of a stationary measure and APn(𝒫p(X))=KnA_{P_{n}}(\mathcal{P}_{p}(X))=K_{n} follows from theorem 5. Since PnP_{n} is convex linear KnK_{n} must be convex. This implies that for any μKn\mu\in K_{n} the sequence

{1mk=0m1Pnk(μ)}m\left\{\frac{1}{m}\sum_{k=0}^{m-1}P_{n}^{k}(\mu)\right\}_{m\in\mathbb{N}}

is in KnK_{n} and thus weakly converging to some νKn\nu\in K_{n} and by the Krylov-Bogolyubov theorem it must be a fixed point of PnP_{n}, i.e. ν\nu is a stationary measure of PnP_{n}. Furthermore, if RR is the distance from the global attractor of ff then its mass must decay as RpR^{-p}.

The invariant measures must all be contained in the interior of NnN_{n}. Otherwise take μAPn(𝒫p(X))\Kn\mu\in A_{P_{n}}(\mathcal{P}_{p}(X))\backslash K_{n}. If μ\mu is PnP_{n}-stationary then the argument is as follows: For t[0,1]t\in[0,1] and some μ0Kn\mu_{0}\in K_{n} the graph of ttμ+(1t)μ0t\mapsto t\mu+(1-t)\mu_{0} is stationary and intersects Nn\partial N_{n}, which implies Kn=APn(Nn)K_{n}=A_{P_{n}}(N_{n}) intersects Nn\partial N_{n}. This contradicts the isolatedness of KnK_{n}.

For the general case assume σ:𝒫p(X)\sigma:\mathbb{Z}\to\mathcal{P}_{p}(X) is a bounded full solution through μ\mu, i.e. Pn(σ(k))=σ(k+1)P_{n}(\sigma(k))=\sigma(k+1) and σ(0)=μ\sigma(0)=\mu. Now define the function g:[0,1]g:\mathbb{Z}\to[0,1]

g:ksup{t[0,1]|sσ(k)+(1s)μ0Nnfor alls[0,t]}.g:k\mapsto\sup\{t\in[0,1]\,|\,s\sigma(k)+(1-s)\mu_{0}\in N_{n}^{{}^{\prime}}\,\mbox{for all}\,s\in[0,t]\}.

Because NnN_{n}^{{}^{\prime}} is a positive invariant neighborhood of KnK_{n}, μ0\mu_{0} stationary, PnP_{n} convex linear and {σ(k)}k\{\sigma(k)\}_{k\in\mathbb{Z}} bounded and not entirely in NnN_{n}^{{}^{\prime}} we have

0<δg(k)g(k+1).0<\delta\leq g(k)\leq g(k+1).

This implies that

T=infk0g(k)δ.T=\inf_{k\leq 0}g(k)\geq\delta.

Because σ(0)Nn\sigma(0)\notin N_{n}^{{}^{\prime}} we have T<1T<1 and thus by definition of gg

wp(Tσ(k)+(1T)μ0,Nn)0as kw_{p}(T\sigma(k)+(1-T)\mu_{0},\partial N_{n}^{{}^{\prime}})\to 0\quad\mbox{as\,}k\to-\infty

and thus there is a T1TT_{1}\leq T such that σ~(k)=T1σ(k)+(1T1)μ0\tilde{\sigma}(k)=T_{1}\sigma(k)+(1-T_{1})\mu_{0} is a full solution in NnN_{n}^{{}^{\prime}} with

wp(σ~(k1),Nn)ϵ2w_{p}(\tilde{\sigma}(k_{1}),\partial N_{n}^{{}^{\prime}})\leq\frac{\epsilon}{2}

for some k10k_{1}\leq 0. But σ~(k1)Kn\tilde{\sigma}(k_{1})\in K_{n} and Uϵ(Kn)NnU_{\epsilon}(K_{n})\subset N_{n}^{{}^{\prime}} which implies that

ϵwp(σ(k1),Nn)ϵ2.\epsilon\leq w_{p}(\sigma(k_{1}),\partial N_{n}^{{}^{\prime}})\leq\frac{\epsilon}{2}.

This is a contradiction and thus APn(𝒫p(X))\Kn=A_{P_{n}}(\mathcal{P}_{p}(X))\backslash K_{n}=\varnothing, i.e. KnK_{n} contains all bounded invariant measures in 𝒫p(X)\mathcal{P}_{p}(X).∎

Corollary 16.

The positive ff_{*}-invariant isolating neighborhood BB and the positive PnP_{n}-invariant isolating neighborhood NnN_{n} can be chosen convex, i.e. if μiB\mu_{i}\in B (resp. μiNn\mu_{i}\in N_{n}) for i=0,1i=0,1 then μtB\mu_{t}\in B (resp. μtNn\mu_{t}\in N_{n}) for μt=tμ0+(1t)μ1\mu_{t}=t\mu_{0}+(1-t)\mu_{1} and t[0,1]t\in[0,1].

Proof.

Let νi,μi𝒫p(X)\nu_{i},\mu_{i}\in\mathcal{P}_{p}(X) for i=0,1i=0,1 and define μt=tμ1+(1t)μ0\mu_{t}=t\mu_{1}+(1-t)\mu_{0} and νt=tν1+(1t)ν0\nu_{t}=t\nu_{1}+(1-t)\nu_{0}. Assume

wp(μi,νi)<ϵ.w_{p}(\mu_{i},\nu_{i})<\epsilon.

Then there are optimal transference plans πiΠ(μi,ν)\pi_{i}\in\Pi(\mu_{i},\nu) such that

d(x,y)p𝑑πi(x,y)<ϵp.\int d(x,y)^{p}d\pi_{i}(x,y)<\epsilon^{p}.

The plan πt=tπ1+(1t)π0\pi_{t}=t\pi_{1}+(1-t)\pi_{0} is a transference plan for the pair (μt,νt)(\mu_{t},\nu_{t}) and thus

wp(μt,νt)p\displaystyle w_{p}(\mu_{t},\nu_{t})^{p} \displaystyle\leq d(x,y)p𝑑πt(x,y)\displaystyle\int d(x,y)^{p}d\pi_{t}(x,y)
\displaystyle\leq td(x,y)p𝑑π1(x,y)+(1t)d(x,y)p𝑑π0(x,y)\displaystyle t\int d(x,y)^{p}d\pi_{1}(x,y)+(1-t)\int d(x,y)^{p}d\pi_{0}(x,y)
<\displaystyle< tϵp+(1t)ϵp=ϵp.\displaystyle t\epsilon^{p}+(1-t)\epsilon^{p}=\epsilon^{p}.

The construction of BB is done via a Lyapunov pair (ϕ,γ)(\phi,\gamma) essentially measuring a weighted distance of the forward orbit of a point, i.e.

FN:μmin{1,wp(μ,A(N)N)}F_{N^{\prime}}:\mu\mapsto\min\{1,w_{p}(\mu,A^{-}(N^{\prime})\cup\partial N^{\prime})\}

and

ϕ:μsup{(2n+1)FN(fn(x))/(n+1)|n,nωN(x)}\phi:\mu\mapsto\sup\{(2n+1)F_{N^{\prime}}(f_{*}^{n}(x))/(n+1)\,|\,n\in\mathbb{N},n\leq\omega_{N^{\prime}}(x)\}

Let P1ϵP_{1}^{\epsilon} be defined as in the proof of theorem 1. We can assume that wp(P1ϵ,N)>2ϵw_{p}(P_{1}^{\epsilon},\partial N^{\prime})>2\epsilon for some for sufficiently small ϵ>0\epsilon>0. Because A(N)=A(N)A^{-}(N^{\prime})=A(N^{\prime}) is convex, ϕ(μi)<ϵ\phi(\mu_{i})<\epsilon for i=0,1i=0,1 implies

ϕ(μt)<ϵ.\phi(\mu_{t})<\epsilon.

Hence P1ϵP_{1}^{\epsilon} is convex and we can choose B=P1δB=P_{1}^{\delta} for some small δ>0\delta>0. Similarly V(a)V(a) defined in theorem 2 is convex.

The set NnN_{n} was defined as

Nn(ϵ)=Ncl{y| xV(ϵ)m0 s.t. Pn[0,m](x)U~and Pnm(x)=y}N_{n}(\epsilon)=N\cap\operatorname{cl}\{y\,|\,\mbox{ $\exists x\in V(\epsilon)$, $m\geq 0$\,\ s.t.\,\ $P_{n}^{[0,m]}(x)\subset\tilde{U}\,$and $P_{n}^{m}(x)=y$}\}

where U~=intB\tilde{U}=\operatorname{int}B and N=clV(ϵ0)N=\operatorname{cl}V(\epsilon_{0}) are convex sets. Hence Nn(ϵ)N_{n}(\epsilon) is convex .∎

Corollary 17.

Under the assumption of the previous theorem if p=1p=1 then all orbits of PnP_{n} are bounded for nn0n\geq n_{0}, i.e. Pn[0,](μ)BRw(μ0)P_{n}^{[0,\infty]}(\mu)\subset B_{R}^{w}(\mu_{0}) for some R0R\geq 0 and some fixed μ0\mu_{0}. In particular, KnK_{n} is the global weak attractor of PnP_{n}.

Remark.

The idea of the proof is to control the distance of μ1\mu_{1} and μ0\mu_{0} by the distance of μt\mu_{t} and μ1\mu_{1} where μ0\mu_{0} will be some stationary measure and t(0,1]t\in(0,1] is sufficiently small.

Proof.

Using the Kantorovich-Rubinstein formula we have the following equality for μ,ν𝒫1(X)\mu,\nu\in\mathcal{P}_{1}(X)

w1(μ,ν)=infπΠ(μ,ν)d(x,y)𝑑π(x,y)=supϕLip1{ϕ𝑑μϕ𝑑ν},w_{1}(\mu,\nu)=\inf_{\pi\in\Pi(\mu,\nu)}\int d(x,y)d\pi(x,y)=\sup_{\|\phi\|_{\operatorname{Lip}}\leq 1}\left\{\int\phi d\mu-\int\phi d\nu\right\},

i.e. there is a sequence ϕk\phi_{k} with ϕkLip1\|\phi_{k}\|_{\operatorname{Lip}}\leq 1 such that ϕk𝑑μϕk𝑑νw1(μ,ν)\int\phi_{k}d\mu-\int\phi_{k}d\nu\nearrow w_{1}(\mu,\nu). Furthermore, there exist an optimal plan πΠ(μ,ν)\pi\in\Pi(\mu,\nu) such that the infimum is actually attained.

Choose μ0𝒫1(X)\mu_{0}\in\mathcal{P}_{1}(X) and define μt=tμ1+(1t)μ0\mu_{t}=t\mu_{1}+(1-t)\mu_{0} for t[0,1]t\in[0,1] and μ1𝒫1(X)\mu_{1}\in\mathcal{P}_{1}(X). We claim

w1(μt,μ0)=tw1(μt,μ0).w_{1}(\mu_{t},\mu_{0})=tw_{1}(\mu_{t},\mu_{0}).

Suppose πΠ(μ1,μ0)\pi\in\Pi(\mu_{1},\mu_{0}) is the optimal plan and ϕk\phi_{k} the sequence of Lipschitz maps as above. Then π~=tπ+(1t)(id,id)μ0\tilde{\pi}=t\pi+(1-t)(\operatorname{id},\operatorname{id})_{*}\mu_{0} is in Π(μt,μ0)\Pi(\mu_{t},\mu_{0}). Thus

w1(μt,μ0)d(x,y)𝑑π~(x,y)=tw1(μ1,μ0).w_{1}(\mu_{t},\mu_{0})\leq\int d(x,y)d\tilde{\pi}(x,y)=tw_{1}(\mu_{1},\mu_{0}).

Furthermore, we have

w1(μt,μ0)ϕk𝑑μtϕk𝑑μ0=t(ϕk𝑑μ1ϕk𝑑μ0).w_{1}(\mu_{t},\mu_{0})\geq\int\phi_{k}d\mu_{t}-\int\phi_{k}d\mu_{0}=t\left(\int\phi_{k}d\mu_{1}-\int\phi_{k}d\mu_{0}\right).

Because the left hand side converges monotonically to tw1(μt,μ0)tw_{1}(\mu_{t},\mu_{0}) we have proved our claim.

Now fix some stationary measure μ0Kn\mu_{0}\in K_{n}. Since NnN_{n} is a neighborhood of KnK_{n} there is a t(0,1]t\in(0,1] for all μ1X\mu_{1}\in X such that μt\mu_{t} as defined above is in NnN_{n}. Thus Pn[0,)(μt)P_{n}^{[0,\infty)}(\mu_{t}) is in NnN_{n} and bounded, i.e. w1(Pnk(μt),μ0)Rw_{1}(P_{n}^{k}(\mu_{t}),\mu_{0})\leq R for some RR. Because PnP_{n} is convex linear and μ0\mu_{0} stationary we have Pnk(μt)=tPnk(μ1)+(1t)μ0P_{n}^{k}(\mu_{t})=tP_{n}^{k}(\mu_{1})+(1-t)\mu_{0} and hence

w1(Pnk(μ1),μ0)\displaystyle w_{1}(P_{n}^{k}(\mu_{1}),\mu_{0}) =\displaystyle= 1tw1(Pnk(μt),μ0)Rt,\displaystyle\frac{1}{t}w_{1}(P_{n}^{k}(\mu_{t}),\mu_{0})\leq\frac{R}{t},

which implies that the orbit of μ1\mu_{1} is bounded. ∎

References

  • [AGS08] L. Ambrosio, N. Gigli, and G. Savaré, Gradient flows in metric spaces and in the space of probability measures, second ed., Lectures in Mathematics ETH Zürich, Birkhäuser Verlag, Basel, 2008.
  • [Ben91] V. Benci, A new approach to the Morse-Conley theory and some applications, Annali di Matematica Pura ed Applicata 158 (1991), no. 1, 231–305.
  • [CF98] H. Crauel and F. Flandoli, Additive Noise Destroys a Pitchfork Bifurcation, Journal of Dynamics and Differential Equations 10 (1998), no. 2, 259–274.
  • [Hur01] M. Hurley, Weak attractors from Lyapunov functions, Topology and its Applications 109 (2001), no. 2, 201–210.
  • [Kel11] M. Kell, Local attractor continuation of non-autonomously perturbed systems, preprint (2011).
  • [Kif88] Yu. Kifer, Random perturbations of dynamical systems, Birkhäuser, 1988.
  • [MR91] M. Mrozek and K.P. Rybakowski, A cohomological Conley index for maps on metric spaces, Journal of Differential Equations 90 (1991), no. 1, 143–171.
  • [Och99] G. Ochs, Weak Random Attractors, preprint (1999).
  • [Ryb87] K.P. Rybakowski, The Homotopy Index and Partial Differential Equations, Springer, Berlin, 1987.
  • [Vil09] C. Villani, Optimal transport: old and new, Springer Verlag, 2009.
  • [You86] L.-S. Young, Stochastic stability of hyperbolic attractors, Ergodic Theory and Dynamical Systems 6 (1986), no. 2, 311–319.