This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A variational principle
for hardening elastoplasticity

Ulisse Stefanelli IMATI - CNR, v. Ferrata 1, I-27100 Pavia, Italy. ulisse.stefanelli @ imati.cnr.it http://www.imati.cnr.it/ulisse/
Abstract.

We present a variational principle governing the quasistatic evolution of a linearized elastoplastic material. In case of linear hardening, the novel characterization allows to recover and partly extend some known results and proves itself to be especially well-suited for discussing general approximation and convergence issues. In particular, the variational principle is exploited in order to prove in a novel setting the convergence of time and space-time discretizations as well as to provide some possible a posteriori error control.

Key words and phrases:
Variational principle, Elastoplasticity, Approximation
2000 Mathematics Subject Classification:
35K55, 49S05, 74C05
This research was performed during a visit to the Seminar for Applied Mathematics at ETH Zürich and the Institute of Mathematics of the University of Zürich under the sponsorship of the STM CNR 2006 Program and the Swiss National Science Foundation. The support and hospitality of both institutions is gratefully acknowledged.

1. Introduction

The primal initial-boundary value problem of elastoplasticity consists in determining the generalized deformation state of a material subject to external mechanical actions. In particular, starting from some initial state and for a given load and traction, one shall determine the displacement uu of the body from the reference configuration, the inelastic (plastic) part pp of its strain, and, possibly, a vector of internal hardening variables ξ\xi. In the small deformation regime and within the frame of associative elastoplasticity, the problem is classically formulated in a variational form as that of finding the absolutely continuous trajectory t[0,T]y(t)Yt\in[0,T]\mapsto y(t)\in Y (YY is a Banach space) such that

ψ(y)+Aya.e. in(0,T),y(0)=y0,\partial\psi(y^{\prime})+Ay\ni\ell\quad\text{a.e. in}\ \ (0,T),\ \ y(0)=y_{0}, (1.1)

where y=(u,p,ξ)y=(u,p,\xi) stands for the vector of unknown fields, A:YYA:Y\to Y^{*} (dual) is linear, continuous, and symmetric, and ψ:Y[0,]\psi:Y\to[0,\infty] is the positively 11-homogeneous and convex dissipation potential (\partial is the classical subdifferential in the sense of Convex Analysis, see below). Moreover, :[0,T]Y\ell:[0,T]\to Y^{*} is a given and suitably smooth generalized load (possibly including surface tractions) and y0y_{0} represents the initial state. The reader is referred to Section 2 for some brief mechanical motivation as well as to the classical monographs by Duvaut & Lions [DL76], Han & Reddy [HR99], Lemaitre & Chaboche [LC90], and Simo & Hughes [SH98] for a comprehensive collection of results.

The aim of this paper is that of investigating a global-in-time variational formulation of problem (1.1). In particular, we shall introduce the functional :W1,1(0,T;Y)[0,]{\mathcal{F}}:W^{1,1}(0,T;Y)\to[0,\infty] on trajectories as

(y)=0T(ψ(y)+ψ(Ay)Ay,y),{\mathcal{F}}(y)=\int_{0}^{T}\big{(}\psi(y^{\prime})+\psi^{*}(\ell-Ay)-\left\langle\ell-Ay,y^{\prime}\right\rangle\big{)},

where ψ\psi^{*} stands for the conjugate ψ(w)=supvY(w,vψ(v))\psi^{*}(w)=\sup_{v\in Y}(\left\langle w,v\right\rangle-\psi(v)) of ψ\psi and ,\left\langle\cdot,\cdot\right\rangle denotes the duality pairing between YY^{*} and YY. The starting point of this analysis relies on the fact that solutions of (1.1) and minimizers of {\mathcal{F}} fulfilling the initial condition coincide, namely (see Theorem 3.1)

ysolves (1.1)iff(y)=min=0andy(0)=y0.\displaystyle y\ \ \text{solves \eqref{eq}}\quad\text{iff}\quad{\mathcal{F}}(y)=\min{\mathcal{F}}=0\ \ \text{and}\ \ y(0)=y_{0}. (1.2)

This variational characterization has a clear mechanical interpretation. Indeed, since ψ\psi is positively 11-homogeneous, its conjugate ψ\psi^{*} turns out to be the indicator function of the convex set ψ(0)\partial\psi(0). Hence, min\min{\mathcal{F}} is actually a constrained minimization problem and we have

(y)=0iff{Ayψ(0)a.e. in(0,T),φ(T,y(T))+0Tψ(y)=φ(0,y(0))0T,y,\displaystyle{\mathcal{F}}(y)=0\quad\text{iff}\quad\left\{\begin{array}[]{lll}\ell-Ay\in\partial\psi(0)\quad\text{a.e. in}\ \ (0,T),\\ \varphi(T,y(T))+\displaystyle\int_{0}^{T}\psi(y^{\prime})=\varphi(0,y(0))-\displaystyle\int_{0}^{T}\left\langle\ell^{\prime},y\right\rangle,\end{array}\right. (1.5)

where we have used the notation (t,y)φ(t,y)=12Ay,y(t),y(t,y)\mapsto\varphi(t,y)=\frac{1}{2}\left\langle Ay,y\right\rangle-\left\langle\ell(t),y\right\rangle. The first relation above expresses the so-called local stability [Mie05] of the trajectory whereas the second is nothing but the energy balance at time TT. More precisely, φ(t,y)\varphi(t,y) denotes the complementary energy at time tt for the state yy, 0Tψ(y)\int_{0}^{T}\psi(y^{\prime}) represents the dissipation of the system on [0,T][0,T], and 0T,y-\int_{0}^{T}\left\langle\ell^{\prime},y\right\rangle is the work of external actions on [0,T][0,T]. Hence, minimizing {\mathcal{F}} consists in selecting the (only) stable trajectory which conserves the energy. In this regards, the reader is referred to the pioneering papers by Moreau [Mor68, Mor70, Mor71].

The interest of variational characterization (1.2) of the differential problem (1.1) relies on the possibility of exploiting the general tools from the Calculus of Variations. Some care is however required. Indeed, although {\mathcal{F}} is convex and lower semicontinuous with respect to the weak topology of W1,1(0,T;Y)W^{1,1}(0,T;Y), the functional generally fails to be coercive. Moreover, one is not just asked to minimize {\mathcal{F}} but also to prove that the minimum is 0. This considerations suggest that the Direct Method is hardly applicable in order to get solutions to (1.1) via the characterization in (1.2).

The first issue of this paper is instead that of showing that the variational principle in (1.2) is particularly well-suited for discussing general approximation issues. Since solutions and minimizers coincide, a natural tool in order to frame an abstract approach to limiting procedures within (1.1) is that of considering the corresponding minimum problems via Γ\Gamma-convergence [GF75]. As the value of the functional is directly quantified to be 0 on the minimizers, what is actually needed here for passing to limits are so-called Γ\Gamma-lim inf\liminf inequalities only and the latter are generally easily available. We shall specifically focus on the case of linear hardening elastoplasticity and apply the above-mentioned perspective in order to recover in a unified and more transparent frame and partly generalize some convergence results for conformal finite elements (Thm. 5.3), time-discretizations (Thm. 6.5), and fully-discrete space-time approximations (Thm. 7.1). In particular, for time-discretization we develop a discrete version of the variational principle (1.2) in the same spirit of the theory of variational integrators [MW01] (see Subsection 6.1). This connection entails also some generalized view at the classical discrete time-schemes (see Subsection 6.5).

A second novel point of the present variational approach consists in the possibility of exploiting {\mathcal{F}} in order to estimate a posteriori some approximation error. By letting (y)=0{\mathcal{F}}(y)=0, we will check that (Cor. 4.5)

max[0,T]12A(yv),yv(v)vW1,1(0,T;Y),v(0)=y0.\max_{[0,T]}\frac{1}{2}\left\langle A(y-v),y-v\right\rangle\leq{\mathcal{F}}(v)\quad\forall v\in W^{1,1}(0,T;Y),\ \ v(0)=y_{0}.

If AA shows some coercivity (which is precisely the case of linearized hardening, see Subsection 2.4), and vv is the outcome of some approximation procedure, the estimate above may serve as the basis for some a posteriori estimation procedure, possibly headed to adaptivity (see Subsection 6.7). Let us stress that the latter and (1.5) entail that the distance of a (stable) trajectory from the solution to (1.1) can be uniformly estimated by means of its energy production along the path.

The variational characterization in (1.2) stems from a reinterpretation in the present elastoplastic frame of the celebrated principle by Brezis & Elekand [BE76a, BE76b] for gradient flows of convex functionals. Since its introduction, the latter principle has continuously attracted attention. In particular, it has been exploited in the direction of proving existence [Rio76a, Rio76b, Rio78, Rio79, Auc93, Rou00, GT04] (note that the above-mentioned obstructions to the application of the Direct Methods again appear) and the description of long-time dynamics [Lem96]. Moreover, the Brezis-Ekeland approach has been adapted to the case of second order [Mab01, Mab03] and doubly nonlinear equations [Ste06a] as well.

One has to mention that, of course, (1.2) is not the only possible global-in-time variational characterization of (1.1). Besides minimizing the L2L^{2} space-time norm of the residual (which might be little interesting since the order of the problem is doubled), one has at least to mention Visintin [Vis01], where generalized solutions are obtained as minimal elements of a certain partial-order relation on the trajectories, and the recent contribution by Mielke & Ortiz [MO07] where the functional

yeT/εφ(T,y(T))+0Tet/ε(ψ(y)+1εφ(t,y))y\mapsto e^{-T/\varepsilon}\varphi(T,y(T))+\int_{0}^{T}e^{-t/\varepsilon}\left(\psi(y^{\prime})+\displaystyle\frac{1}{\varepsilon}\varphi(t,y)\right) (1.6)

is minimized among trajectories with y(0)=y0y(0)=y_{0}. Under extra-smoothness conditions on ψ\psi (not fulfilled in the current frame), the Euler-Lagrange equations of the latter functional are

εD2ψ(y)y′′+Dψ(y)+Ay=\displaystyle-\varepsilon D^{2}\psi(y^{\prime})y^{\prime\prime}+D\psi(y^{\prime})+Ay=\ell
y(0)=y0,Dψ(y(T))+Ay(T)=(T).\displaystyle y(0)=y_{0},\qquad D\psi(y^{\prime}(T))+Ay(T)=\ell(T).

In particular, minimizing the functional in (1.6) consists in performing a suitable elliptic (in time) regularization of the problem. In the specific case of ψ\psi positively 11-homogeneous, the limit ε0\varepsilon\to 0 can be carried out and the minimizers of the functional in (1.6) are proved to converge to the solution of (1.1). The latter approach is quite different form that of (1.2). On the one hand, it is much more general as it naturally applies to the non-smooth case as well (no derivatives of ψ\psi and ϕ\phi are involved). On the other hand, it relies on an intermediate and somehow unphysical (not causal) ε\varepsilon-regularized problem and (besides existence) it seems not directly suited for recovering the full extent of our approximation results for the specific case of problem (1.1).

2. Mechanical model

Let us provide the reader with a brief introduction to the mechanical setting under consideration. Our aim is just that of recalling some essential features of the models and well as their variational formulation. In particular, we restrain from reporting here an extensive discussion on associative elastoplasticity as the latter can be easily recovered from the many contributions on the subject. The reader is particularly referred to the mentioned monographs for some comprehensive presentation.

2.1. Preliminaries

We will denote by sym3×3\mathbb{R}^{3\times 3}_{\,\text{\rm sym}} the space of symmetric 3×33\times 3 tensors endowed with the natural scalar product a:b:=tr(ab)=aijbija:b:=\text{tr}(ab)=a_{ij}b_{ij} (summation convention). The space sym3×3\mathbb{R}^{3\times 3}_{\,\text{\rm sym}} is orthogonally decomposed as sym3×3=dev3×3 12\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}=\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\oplus\mathbb{R}\,1_{2}, where  12\mathbb{R}\,1_{2} is the subspace spanned by the identity 2-tensor 121_{2} and dev3×3\mathbb{R}^{3\times 3}_{\,\text{\rm dev}} is the subspace of deviatoric symmetric 3×33\times 3 tensors. In particular, for all asym3×3a\in\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}, we have that a=adev+tr(a)12/3a=a_{\text{\rm dev}}+\text{tr}(a)1_{2}/3.

We shall assume the reference configuration Ω\Omega to be a non-empty, bounded, and connected open set in 3\mathbb{R}^{3} with Lipschitz continuous boundary. The space dimension 33 plays essentially no role throughout the analysis and we would be in the position of reformulating our results in d\mathbb{R}^{d} with no particular intricacy. Our unknown variables are the displacement of the body u3u\in\mathbb{R}^{3}, the plastic strain pdev3×3p\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}, and a vector of internal variables ξm(m)\xi\in\mathbb{R}^{m}\ (m\in\mathbb{N}) which will describe the hardening of the material. We will denote by ε(u)\varepsilon(u) the standard symmetric gradient.

2.2. Constitutive relation

Moving within the small-strain regime, we additively decompose the linearized deformation ε(u)\varepsilon(u) into the elastic strain ee and the inelastic (or plastic) strain pp as

ε(u)=e+p.\varepsilon(u)=e+p.

Let {\mathbb{C}} be the elasticity tensor. By regarding the latter as a symmetric positive definite linear map :sym3×3sym3×3{\mathbb{C}}:\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}\rightarrow\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}, we shall assume that the orthogonal subspaces dev3×3\mathbb{R}^{3\times 3}_{\,\text{\rm dev}} and  12\mathbb{R}\,1_{2} are invariant under {\mathbb{C}}. This amounts to say that indeed

a=devadev+κtr(a)12,{\mathbb{C}}a={\mathbb{C}}_{\text{\rm dev}}a_{\text{\rm dev}}+\kappa\,\text{tr}(a)1_{2},

for a given dev:dev3×3dev3×3{\mathbb{C}}_{\text{\rm dev}}:\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\rightarrow\mathbb{R}^{3\times 3}_{\,\text{\rm dev}} and a constant κ\kappa, and all asym3×3a\in\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}. The case of isotropic materials is given by dev=2G(141212/3){\mathbb{C}}_{\text{\rm dev}}=2G(1_{4}-1_{2}\otimes 1_{2}/3) and GG and κ\kappa are respectively the shear and the bulk moduli. The latter decomposition is not exploited in our analysis but it is clearly suggested by the mechanical application. Moreover, we shall introduce two linear symmetric positive semi-definite hardening moduli p:dev3×3dev3×3{\mathbb{H}}_{p}:\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\to\mathbb{R}^{3\times 3}_{\,\text{\rm dev}} and ξ:mm{\mathbb{H}}_{\xi}:\mathbb{R}^{m}\to\mathbb{R}^{m} (to be identified with a fourth order tensor and a matrix, respectively) and define the Helmholtz free energy W:sym3×3×dev3×3×m[0,)W:\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}\times\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m}\to[0,\infty) of the material as

W(ε(u),p,ξ):=12(ε(u)p):(ε(u)p)+12p:pp+12ξTξξ.W(\varepsilon(u),p,\xi):=\frac{1}{2}(\varepsilon(u)-p):{\mathbb{C}}(\varepsilon(u)-p)+\frac{1}{2}p:{\mathbb{H}}_{p}p+\frac{1}{2}\xi^{T}\cdot{\mathbb{H}}_{\xi}\xi.

The generalized stresses (σ,η)(\sigma,\eta) are conjugate to the above-defined generalized strains (e,ξ)(e,\xi) via the energy WW. In particular, the material is classically assumed to show elastic response,

σ=We=e=(ε(u)p).\sigma=\frac{\partial W}{\partial e}={\mathbb{C}}e={\mathbb{C}}(\varepsilon(u)-p). (2.1)

and the thermodynamic force η\eta driving the evolution of the internal variables ξ\xi is defined as

η=Wξ=ξξ.\eta=-\frac{\partial W}{\partial\xi}=-{\mathbb{H}}_{\xi}\xi. (2.2)

Moreover, moving within the frame of associative elastoplasticity, we assume the existence of a function R:dev3×3×m[0,]R:\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m}\to[0,\infty] convex, positively 11-homogeneous, and lower semicontinuous such that

R(p˙,ξ˙)(σppη).\partial R(\dot{p},\dot{\xi})\ni\binom{\sigma-{\mathbb{H}}_{p}p}{\eta}. (2.3)

In particular, RR is asked to be the support function of a convex set Cdev3×3×mC^{*}\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m}, i.e. R(p)=supqCq:pR(p)=\sup_{q\in C^{*}}q:p. We will indicate with RR^{*} its conjugate, namely the indicator function of CC^{*} given by R(q)=0R^{*}(q)=0 if qCq\in C^{*} and R(q)=R^{*}(q)=\infty otherwise. Moreover, we let CC be the domain of RR, namely C=D(R)={(p,ξ)dev3×3×m:R(p,ξ)<}C=D(R)=\{(p,\xi)\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m}\ :\ R(p,\xi)<\infty\}.

Finally, the above material relations (2.1)-(2.3) can be condensed as the following constitutive material law

R(p˙,ξ˙)+((+p)pξξ)(ε(u)0),\partial R(\dot{p},\dot{\xi})+\binom{({\mathbb{C}}+{\mathbb{H}}_{p})p}{{\mathbb{H}}_{\xi}\xi}\ni\binom{{\mathbb{C}}\varepsilon(u)}{0}, (2.4)

which in turn can be rephrased in the form of (1.1) by letting

y=(p,ξ),Y=dev3×3×m,ψ=R,\displaystyle y=(p,\xi),\ \ Y=\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m},\ \ \psi=R,
A(p,ξ)=((+p)p,ξξ),=(ε(u),0).\displaystyle A(p,\xi)=\big{(}({\mathbb{C}}+{\mathbb{H}}_{p})p,{\mathbb{H}}_{\xi}\xi\big{)},\ \ \ell=({\mathbb{C}}\varepsilon(u),0). (2.5)

Let us close this subsection by explicitly mentioning three classical linear hardening models [HR99, Ex. 4.8, p.88]

Linear kinetic hardening:

choose p=hp14{\mathbb{H}}_{p}=h_{p}1_{4} where hp>0h_{p}>0, and ξ=0{\mathbb{H}}_{\xi}=0. In this case the internal variable ξ\xi is not evolving and shall be removed from the set of unknowns.

Linear isotropic hardening:

choose p=0{\mathbb{H}}_{p}=0, m=1m=1, and ξ=hξ>0{\mathbb{H}}_{\xi}=h_{\xi}>0. Moreover, let D(R)={(p,ξ)dev3×3×:|p|ξ}D(R)=\{(p,\xi)\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}\ :\ |p|\leq\xi\}.

Linear combined kinetic-isotropic hardening:

let p=hp14{\mathbb{H}}_{p}=h_{p}1_{4}, m=1m=1, and ξ=hξ{\mathbb{H}}_{\xi}=h_{\xi} where hp,hξ>0h_{p},h_{\xi}>0. Moreover, let D(R)={(p,ξ)dev3×3×:|p|ξ}D(R)=\{(p,\xi)\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}\ :\ |p|\leq\xi\}.

It is beyond the purpose if this introduction to discuss and justify the above-mentioned material models. The reader shall check the cited references for comments on their relevance within applications and some mechanical motivation.

2.3. Variational formulation of the quasistatic evolution

Let us now move to the consideration of the full equilibrium problem. To this aim, we assume that the boundary Ω\partial\Omega is partitioned in two disjoint open sets Γtr\Gamma_{\text{\rm tr}} and ΓDir\Gamma_{\text{\rm Dir}} with Γtr=ΓDir\partial\Gamma_{\text{\rm tr}}=\partial\Gamma_{\text{\rm Dir}} (in Ω\partial\Omega). We ask ΓDir\Gamma_{\text{\rm Dir}} to be such that there exists a positive constant cKornc_{\text{\rm Korn}} depending on ΓDir\Gamma_{\text{\rm Dir}} and Ω\Omega such that the Korn inequality

cKornuH1(Ω;3)2uL2(ΓDir;3)2+ε(u)L2(Ω;sym3×3)2,c_{\text{\rm Korn}}\|u\|^{2}_{H^{1}(\Omega;\mathbb{R}^{3})}\leq\|u\|^{2}_{L^{2}(\Gamma_{\text{\rm Dir}};\mathbb{R}^{3})}+\|\varepsilon(u)\|^{2}_{L^{2}(\Omega;\mathbb{R}^{3\times 3}_{\,\text{\rm sym}})}, (2.6)

holds true for all uH1(Ω;3)u\in H^{1}(\Omega;\mathbb{R}^{3}). It would indeed suffice to impose ΓDir\Gamma_{\text{\rm Dir}} to have a positive surface measure (see, e.g., [DL76, Thm. 3.1, p. 110]).

For the sake of simplicity, we will prescribe homogeneous Dirichlet boundary conditions on ΓDir\Gamma_{\text{\rm Dir}} (our analysis extends with little notational intricacy to the case of non-homogeneous Dirichlet boundary conditions as well). On Γtr\Gamma_{\text{\rm tr}} some time-dependent traction will be prescribed instead.

As for the full quasistatic evolution of the material we shall couple the constitutive relation (2.4) with the equilibrium equation

divσ+f=0inΩ.\text{div}\,\sigma+f=0\quad\text{in}\ \ \Omega. (2.7)

Here, we assume to be given the body force f:[0,T]L2(Ω;3)f:[0,T]\to L^{2}(\Omega;\mathbb{R}^{3}) and a surface traction g:[0,T]L2(Γtr;3)g:[0,T]\to L^{2}(\Gamma_{\text{\rm tr}};\mathbb{R}^{3}).

Then, one can rephrase the problem into the form of (1.1) by choosing

y=(u,p,ξ),\displaystyle y=(u,p,\xi), (2.8)
Y={(u,p,ξ)H1(Ω;3)×L2(Ω;dev3×3)×L2(Ω;m)\displaystyle Y=\Big{\{}(u,p,\xi)\in H^{1}(\Omega;\mathbb{R}^{3})\times L^{2}(\Omega;\mathbb{R}^{3\times 3}_{\,\text{\rm dev}})\times L^{2}(\Omega;\mathbb{R}^{m})
such thatu=0onΓDir},\displaystyle\qquad\qquad\text{such that}\ \ u=0\ \ \text{on}\ \ \Gamma_{\text{\rm Dir}}\Big{\}}, (2.9)
A(u,p,ξ),(v,q,z)=Ω((ε(u)p):(ε(v)q)+p:pq+ξTξz)\displaystyle\left\langle A(u,p,\xi),(v,q,z)\right\rangle=\int_{\Omega}\Big{(}(\varepsilon(u)-p):{\mathbb{C}}(\varepsilon(v)-q)+p:{\mathbb{H}}_{p}q+\xi^{T}\cdot{\mathbb{H}}_{\xi}z\Big{)}
(v,q,z)Y,\displaystyle\qquad\qquad\forall(v,q,z)\in Y, (2.10)
ψ(u,p,ξ)=ΩR(p,ξ),\displaystyle\psi(u,p,\xi)=\int_{\Omega}R(p,\xi), (2.11)

and defining the total load :[0,T]Y\ell:[0,T]\to Y^{*} as

(t),(u,p,ξ)=Ωfu+Γtrgud2uH1(Ω;3),t[0,T],\left\langle\ell(t),(u,p,\xi)\right\rangle=\int_{\Omega}f\cdot u+\int_{\Gamma_{\text{\rm tr}}}g\cdot u\,\,{\rm d}{\mathcal{H}}^{2}\quad\forall u\in H^{1}(\Omega;\mathbb{R}^{3}),\ t\in[0,T],

where 2{\mathcal{H}}^{2} is the 2-dimensional Hausdorff measure.

2.4. The coercivity of AA

Let us close this introductory discussion by explicitly commenting on the coercivity of the bilinear form induced by AA. We shall recall some sufficient conditions on p{\mathbb{H}}_{p}, ξ{\mathbb{H}}_{\xi}, and RR in such a way that there exists a constant α>0\alpha>0 such that

Ay,yα|y|2yD(ψ)\left\langle Ay,y\right\rangle\geq\alpha|y|^{2}\quad\forall y\in D(\psi) (2.12)

where |||\cdot| is the norm in YY. This issue is fairly classical [HR99, Sec. 7.3, p. 167] and we discuss it here for the sake of completeness only.

Of course (2.12) holds (and even for all yYy\in Y) whenever p{\mathbb{H}}_{p} and ξ{\mathbb{H}}_{\xi} are positive definite (this is the case of the above-mentioned linear combined kinematic-isotropic hardening).

As we have already observed, in case ξ=0{\mathbb{H}}_{\xi}=0, the problem naturally reduces to the pair (u,p)(u,p) only. Up to this reduction, (2.12) holds (again for all yYy\in Y) when p{\mathbb{H}}_{p} is positive definite. This is exactly the case of linear kinematic hardening.

On the other hand, in case p=0{\mathbb{H}}_{p}=0, the plastic strain will still evolve and one has (2.12) if D(R)D(R) is bounded in the pp-direction for all ξ\xi, namely if [HR99, (7.51)]

D(R){(p,ξ)dev3×3×m:β|p|2ξTξξfor some constantβ>0},D(R)\subset\{(p,\xi)\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m}\ :\ \ \beta|p|^{2}\leq\xi^{T}\cdot{\mathbb{H}}_{\xi}\xi\ \ \text{for some constant}\ \ \beta>0\}, (2.13)

which is clearly the case for linear isotropic hardening.

Some generalization of the latter condition could in principle be considered for the case when p{\mathbb{H}}_{p} and ξ{\mathbb{H}}_{\xi} are only semi-definite. In particular, (2.12) holds if one assumes (2.13) and

ξ0andξTξξ=0R(p,ξ)=pdev3×3.\xi\not=0\ \ \text{and}\ \ \xi^{T}\cdot{\mathbb{H}}_{\xi}\xi=0\ \ \Rightarrow\ \ R(p,\xi)=\infty\ \ \forall p\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}.

Let us mention that the most critical case in the class of (2.4) is p=0{\mathbb{H}}_{p}=0, ξ=0{\mathbb{H}}_{\xi}=0 where actually no hardening takes place. This is the situation perfect plasticity for which the Sobolev space framework above is not appropriate and one would consider the space BD(Ω)BD(\Omega) of functions of bounded deformations instead [DDM06]. We shall make clear that, even if our variational characterization covers the case of perfect plasticity, the subsequent approximation results apply to the linear hardening situation only.

3. Characterization

3.1. General assumptions

Let us start by recalling notation and enlisting the basic assumptions for the following. First of all, we will ask that

Yis a separable and reflexive Banach space.Y\quad\text{is a separable and reflexive Banach space.} (3.1)

We will use the symbols |||\cdot| for the norm of YY and ,\left\langle\cdot,\cdot\right\rangle for the duality pairing between YY^{*} (dual) and YY. The norm in YY^{*} will be denoted by |||\cdot|_{*} instead.

We introduce the functional

ψ:Y[0,]proper, convex, lower semicontinuous,\displaystyle\psi:Y\to[0,\infty]\quad\text{proper, convex, lower semicontinuous,}
and positively 11-homogeneous. (3.2)

Equivalently, ψ\psi is required to be the support function of a convex and closed set CYC^{*}\subset Y^{*} containing 0, namely

ψ(y)=sup{y,y:yC}.\psi(y)=\sup\{\left\langle y^{*},y\right\rangle\ :\ y^{*}\in C^{*}\}.

We shall define C=D(ψ)C=D(\psi). Hence, the conjugate ψ:Y[0,]\psi^{*}:Y^{*}\to[0,\infty], which is classically defined as ψ(y)=supyY(y,yψ(y))\psi^{*}(y^{*})=\sup_{y\in Y}(\left\langle y^{*},y\right\rangle-\psi(y)), is the indicator function of the convex set CC^{*}, namely ψ(y)=0\psi^{*}(y^{*})=0 if yCy^{*}\in C^{*} and ψ(y)=\psi^{*}(y^{*})=\infty otherwise. Let us remark that ψ\psi fulfills the triangle inequality ψ(a)ψ(b)+ψ(c)\psi(a)\leq\psi(b)+\psi(c) whenever a=b+ca=b+c.

We shall use the symbol \partial in order to denote the usual subdifferential in the sense of Convex Analysis, namely

yψ(y)iffyD(ψ)andy,wyψ(w)ψ(y)wY.y^{*}\in\partial\psi(y)\quad\text{iff}\quad y\in D(\psi)\ \ \text{and}\ \ \left\langle y^{*},w-y\right\rangle\leq\psi(w)-\psi(y)\quad\forall w\in Y.

Similarly, we define

yψ(y)iffyD(ψ)andwy,yψ(w)ψ(y)wY\displaystyle y\in\partial\psi^{*}(y^{*})\quad\text{iff}\quad y^{*}\in D(\psi^{*})\ \ \text{and}\ \ \left\langle w^{*}-y^{*},y\right\rangle\leq\psi^{*}(w^{*})-\psi^{*}(y^{*})\quad\forall w^{*}\in Y^{*}
iffyCandwy,y0wC.\displaystyle\quad\text{iff}\quad y^{*}\in C^{*}\ \ \text{and}\ \ \left\langle w^{*}-y^{*},y\right\rangle\leq 0\quad\forall w^{*}\in C^{*}.

Finally, we recall Fenchel’s inequality

ψ(y)+ψ(y)y,yyY,yY,\psi(y)+\psi^{*}(y^{*})\geq\left\langle y^{*},y\right\rangle\quad\forall y\in Y,\ y^{*}\in Y^{*},

and remark that equality holds iff yψ(y)y^{*}\in\partial\psi(y) (or, equivalently, yψ(y)y\in\partial^{*}\psi^{*}(y^{*})).

As for the operator AA we require

A:YYlinear, continuous, and symmetric,A:Y\to Y^{*}\quad\text{linear, continuous, and symmetric,} (3.3)

and define the function

yϕ(y)=12Ay,y,y\to\phi(y)=\frac{1}{2}\left\langle Ay,y\right\rangle,

so that A=Dϕ\,A=D\phi. Moreover, we will ask ϕ\phi to be coercive on C=D(ψ)C=D(\psi), namely we assume that there exists a positive constant α\alpha such that

ϕ(y)α2|y|2yC.\phi(y)\geq\frac{\alpha}{2}|y|^{2}\quad\forall y\in C. (3.4)

As we have already commented in Subsection 2.4, the latter coercivity is fulfilled in the situation of elastoplastic evolution with linear kinematic, isotropic, or combined kinematic-isotropic hardening and will turn out to be sufficient for both the forthcoming characterization results.

On the other hand, the following uniqueness-type results will be checked under some stronger coercivity frame and we will ask for

ϕ(y)α2|y|2yCC.\phi(y)\geq\frac{\alpha}{2}|y|^{2}\quad\forall y\in C-C. (3.5)

Clearly, condition (3.5) is fulfilled when ϕ\phi happens to be coercive on the whole space YY. The latter applies in particular to the case of linear kinematic and combined kinematic-isotropic hardening elastoplasticity. In this case, ϕ\phi defines an equivalent (squared) norm in YY.

We shall make use of the following notation

χ(y)=ϕ(y)+|y|2yY\chi(y)=\phi(y)+|y|^{2}\quad\forall y\in Y

Indeed the latter choice is just motivated by simplicity and could be replaced as well by any other χ:Y[0,)\chi:Y\to[0,\infty) such that χ(y)=0\chi(y)=0 iff y=0y=0 and yχ(y)ϕ(y)y\mapsto\chi(y)-\phi(y) is lower semicontinuous.

Finally, we shall fix data such that

L(0,T;Y),y0C.\ell\in L^{\infty}(0,T;Y^{*}),\ y_{0}\in C. (3.6)

The restriction on the choice of the initial datum in CC is motivated by the coercivity assumption on ϕ\phi in (3.4). On the other hand, we shall explicitly mention that the usual choice for y0y_{0} in elastoplasticity is y0=0y_{0}=0.

In the forthcoming of the paper the above assumptions (3.1)-(3.4) and (3.6) will be tacitly assumed (unless explicitly stated). It should be however clear that the above choice is motivated by the sake of simplicity. Indeed, most of the following results still hold under suitably weaker assumptions, as we shall comment.

3.2. The functional

Let the Lagrangian L:(0,T)×Y×Y[0,]L:(0,T)\times Y\times Y\to[0,\infty] be defined as

L(t,y,p)=ψ(p)+ψ((t)Ay)(t)Ay,p\displaystyle L(t,y,p)=\psi(p)+\psi^{*}(\ell(t)-Ay)-\left\langle\ell(t)-Ay,p\right\rangle
for a.e.t(0,T),y,pY,\displaystyle\text{for a.e.}\ \ t\in(0,T),\ \forall y,\,p\in Y, (3.7)

and the functional F:W1,1(0,T;Y)[0,]\,F:W^{1,1}(0,T;Y)\to[0,\infty] as

F(y)=0TL(t,y(t),y(t))dt+χ(y(0)y0).F(y)=\int_{0}^{T}L(t,y(t),y^{\prime}(t))\,\,{\rm d}t+\chi(y(0)-y_{0}). (3.8)

Now, by simply using the chain rule, we obtain that

F(y)=0T(ψ(y)+ψ(Ay),y)+ϕ(y(T))ϕ(y(0))+χ(y(0)y0).F(y)=\int_{0}^{T}\Big{(}\psi(y^{\prime})+\psi^{*}(\ell-Ay)-\left\langle\ell,y^{\prime}\right\rangle\Big{)}+\phi(y(T))-\phi(y(0))+\chi(y(0)-y_{0}).

A first remark is that, by exploiting the particular form of χ\chi,

F(y)\displaystyle F(y) =\displaystyle= 0T(ψ(y)+ψ(Ay),y)\displaystyle\int_{0}^{T}\Big{(}\psi(y^{\prime})+\psi^{*}(\ell-Ay)-\left\langle\ell,y^{\prime}\right\rangle\Big{)} (3.9)
+\displaystyle+ ϕ(y(T))+ϕ(y0)Ay(0),y0+|y(0)y0|2.\displaystyle\phi(y(T))+\phi(y_{0})-\left\langle Ay(0),y_{0}\right\rangle+|y(0)-y_{0}|^{2}.

In particular, FF is clearly convex.

3.3. The characterization

Theorem 3.1 (Variational principle).

yy solves (1.1) iff F(y)=0=minFF(y)=0=\min F.

Proof.

Owing to Fenchel’s inequality we have that

L(t,y,p)=0iff(t)Ayψ(p)L(t,y,p)=0\quad\text{iff}\ \ \ell(t)-Ay\in\partial\psi(p)

and, clearly, χ(y(0)y0)=0\chi(y(0)-y_{0})=0 iff y(0)=y0y(0)=y_{0}. Hence, all solutions yy of (1.1) are such that F(y)=0F(y)=0 and vice-versa. ∎

Let us remark that the latter variational characterization result holds in much grater generality. The proof made no use of the separability and reflexivity of YY nor of the linearity of AA (besides its being single-valued and such that tAy(t)t\mapsto Ay(t) is measurable). Moreover, the positive 11-homogeneity of ψ\psi is unessential [Ste07]. In particular, the variational approach of Theorem 3.1 can be directly extended to a variety of different dissipative systems possibly including viscous evolution as well. We shall address this perspective in a forthcoming contribution.

We have already observed that FF is convex. Moreover, FF is lower semicontinuous with respect to the weak topology of W1,1(0,T;Y)W^{1,1}(0,T;Y) since all weakly convergent sequences in W1,1(0,T;Y)W^{1,1}(0,T;Y) are pointwise weakly convergent as well. Hence, one could be tempted to use the Direct Method in order to get the existence of minimizers, i.e. solutions to equation (1.1). As we commented in the Introduction, this seems to be no trivial task.

First of all, the functional FF need not be coercive with respect to the weak topology of W1,1(0,T;Y)W^{1,1}(0,T;Y). Indeed, the functional ψ\psi may degenerate and hence not control the norm of its argument. Moreover, even in the case when ψ\psi is non-degenerate, the homogeneity assumption just entails that the sublevels of FF are bounded in W1,1(0,T;Y)W^{1,1}(0,T;Y) and no weak compactness follows.

Secondly, even assuming coercivity in the weak topology of W1,1(0,T;Y)W^{1,1}(0,T;Y), one would still need to prove that the minimum 0 is attained. This very obstruction to the use of the Direct Method occurs for the Brezis & Ekeland principle for gradient flows [BE76b, Rem. 1] and for its doubly nonlinear generalization in [Ste06a] as well.

3.4. The variational principle for hardening elastoplasticity

By referring to the notations of Section 2, let us now present the actual form of the functional FF for the case of the constitutive relation for linearized elastoplastic materials with linear hardening (see (2.5)). In this case the functional reads

F(p,ξ)\displaystyle F(p,\xi) =\displaystyle= 0T(R(p˙,ξ˙)+R((ε(u)p)pp,ξξ))\displaystyle\displaystyle\int_{0}^{T}\Big{(}R(\dot{p},\dot{\xi})+R^{*}\big{(}{\mathbb{C}}(\varepsilon(u)-p)-{\mathbb{H}}_{p}p,-{\mathbb{H}}_{\xi}\xi\big{)}\Big{)}
\displaystyle- 0T(((ε(u)p)pp):p˙ξTξξ˙)\displaystyle\int_{0}^{T}\Big{(}\big{(}{\mathbb{C}}(\varepsilon(u)-p)-{\mathbb{H}}_{p}p\big{)}:\dot{p}-\xi^{T}\cdot{\mathbb{H}}_{\xi}\dot{\xi}\Big{)}
+\displaystyle+ 12(p(0)p0):(+p)(p(0)p0)+12(ξ(0)ξ0)Tξ(ξ(0)ξ0)\displaystyle\frac{1}{2}(p(0)-{p}_{0}):({\mathbb{C}}+{\mathbb{H}}_{p})(p(0)-{p}_{0})+\frac{1}{2}(\xi(0)-\xi_{0})^{T}\cdot{\mathbb{H}}_{\xi}(\xi(0)-\xi_{0})
+\displaystyle+ |(p(0),ξ(0))(p0,ξ0)|2,\displaystyle|({p}(0),\xi(0))-({p}_{0},\xi_{0})|^{2},

for some given initial datum (p0,ξ0)dev3×3×m({p}_{0},\xi_{0})\in\mathbb{R}^{3\times 3}_{\,\text{\rm dev}}\times\mathbb{R}^{m} and ε(u)L(0,T;sym3×3)\varepsilon(u)\in L^{\infty}(0,T;\mathbb{R}^{3\times 3}_{\,\text{\rm sym}}).

In the situation of the quasistatic evolution, for some given initial datum (u0,p0,ξ0)Y(u_{0},{p}_{0},\xi_{0})\in Y, a load fL(0,T;(L2(Ω;3)))f\in L^{\infty}(0,T;(L^{2}(\Omega;\mathbb{R}^{3}))), and a traction gL(0,T;(L2(Γtr;3)))g\in L^{\infty}(0,T;(L^{2}(\Gamma_{\text{\rm tr}};\mathbb{R}^{3}))), the functional reads (see (2.8)-(2.11))

F(u,p,ξ)\displaystyle F(u,p,\xi) =\displaystyle= 0TΩ(R(p˙,ξ˙)+R((ε(u)p)pp,ξξ))\displaystyle\displaystyle\int_{0}^{T}\int_{\Omega}\Big{(}R(\dot{p},\dot{\xi})+R^{*}\big{(}{\mathbb{C}}(\varepsilon(u)-p)-{\mathbb{H}}_{p}p,-{\mathbb{H}}_{\xi}\xi\big{)}\Big{)}
\displaystyle- 0TΩfu˙0TΓtrgu˙d2\displaystyle\displaystyle\int_{0}^{T}\int_{\Omega}f\cdot\dot{u}-\int_{0}^{T}\int_{\Gamma_{\text{\rm tr}}}g\cdot\dot{u}\,{\rm d}{\mathcal{H}}^{2}
+\displaystyle+ 0TΩ(ε(u)p):(ε(u˙)p˙)+p:pp˙+ξTξξ˙)\displaystyle\displaystyle\int_{0}^{T}\int_{\Omega}(\varepsilon(u)-p):{\mathbb{C}}(\varepsilon(\dot{u})-\dot{p})+p:{\mathbb{H}}_{p}\dot{p}+\xi^{T}\cdot{\mathbb{H}}_{\xi}\dot{\xi}\Big{)}
+\displaystyle+ 12Ω(ε(u(0)u0)(p(0)p0)):(ε(u(0)u0)(p(0)p0))\displaystyle\displaystyle\frac{1}{2}\int_{\Omega}(\varepsilon(u(0)-u_{0})-(p(0)-{p}_{0})):{\mathbb{C}}(\varepsilon(u(0)-u_{0})-(p(0)-{p}_{0}))
+\displaystyle+ 12Ω((p(0)p0):p(p(0)p0)+(ξ(0)ξ0)Tξ(ξ(0)ξ0))\displaystyle\displaystyle\frac{1}{2}\int_{\Omega}\Big{(}(p(0)-{p}_{0}):{\mathbb{H}}_{p}(p(0)-{p}_{0})+(\xi(0)-\xi_{0})^{T}\cdot{\mathbb{H}}_{\xi}(\xi(0)-\xi_{0})\Big{)}
+\displaystyle+ 12Ω|(u(0),p(0),ξ(0))(u0,p0,ξ0)|2,\displaystyle\displaystyle\frac{1}{2}\int_{\Omega}|(u(0),{p}(0),\xi(0))-(u_{0},{p}_{0},\xi_{0})|^{2},

for all points (u,p,ξ)Y(u,p,\xi)\in Y such that

Ω(ε(u)p):ε(v)=ΩfvΓtrgvd2\displaystyle\displaystyle\int_{\Omega}(\varepsilon(u)-p):{\mathbb{C}}\varepsilon(v)=\displaystyle\int_{\Omega}f\cdot v-\int_{\Gamma_{\text{\rm tr}}}g\cdot v\,{\rm d}{\mathcal{H}}^{2}
vH1(Ω;3)withv=0onΓDir,a.e. in(0,T),\displaystyle\forall v\in H^{1}(\Omega;\mathbb{R}^{3})\ \ \text{with}\ \ v=0\ \ \text{on}\ \ \Gamma_{\text{\rm Dir}},\ \ \text{a.e. in}\ \ (0,T),

and F(u,p,ξ)=F(u,p,\xi)=\infty otherwise.

4. Properties of the minimizers

For the sake of illustrating the variational principle of Theorem 3.1, we shall collect here some properties of the trajectories belonging to the domain of the functional FF and, in particular, of the minimizers.

4.1. Trajectories are in CC

Lemma 4.1.

Let F(y)<F(y)<\infty. Then y(t)Cy(t)\in C for all t[0,T]t\in[0,T].

Proof.

Since F(y)<F(y)<\infty we have that yCy^{\prime}\in C almost everywhere in (0,T)(0,T). Hence, for all t[0,T]t\in[0,T], we have that 0tyC\int_{0}^{t}y^{\prime}\in C by Jensen’s inequality. On the other hand y0Cy_{0}\in C and y(t)=y0+0tyy(t)=y_{0}+\int_{0}^{t}y^{\prime}. The assertion follows by recalling that CC is a cone. ∎

4.2. Stability at regular points of \ell

Assume :[0,T]Y\ell:[0,T]\to Y^{*} is given and let the set of stable states S(t)Y\,S(t)\subset Y\, for t[0,T]\,t\in[0,T]\, be defined as

S(t)={yY:ϕ(y)(t),yϕ(w)(t),w+ψ(wy)wY}.S(t)=\{y\in Y\ :\ \phi(y)-\left\langle\ell(t),y\right\rangle\leq\phi(w)-\left\langle\ell(t),w\right\rangle+\psi(w-y)\ \ \forall w\in Y\}.

One can easily prove that

yS(t)iff(t)Ayϕ(0)=C.y\in S(t)\quad\text{iff}\quad\ell(t)-Ay\in\partial\phi(0)=C^{*}.
Lemma 4.2 (Stability of the minimizers).

Let \ell be either left- or right-weakly continuous at some point t[0,T]t\in[0,T] and F(y)<F(y)<\infty. Then y(t)S(t)y(t)\in S(t).

Proof.

Since F(y)<F(y)<\infty, we have that (t)Ay(t)=C\,\ell(t)-Ay(t)\in=C^{*}\, for all t(0,T)Nt\in(0,T)\setminus N, where |N|=0|N|=0. Choose a sequence tk(0,T)N\,t_{k}\in(0,T)\setminus N such that tktt_{k}\to t (from the left or from the right) and (tk)(t)\ell(t_{k})\to\ell(t) weakly in YY^{*}. Hence, (tk)Ay(tk)(t)Ay(t)\ell(t_{k})-Ay(t_{k})\to\ell(t)-Ay(t) weakly in YY^{*} and (t)Ay(t)C\ell(t)-Ay(t)\in C^{*}. ∎

In particular, if \ell happens to be right-continuous at 0, the functional FF will not attain the minimum value 0 unless the initial datum y0y_{0} is stable, namely y0S(0)y_{0}\in S(0).

4.3. Equivalent formulations

Letting now W1,1(0,T;Y)\ell\in W^{1,1}(0,T;Y^{*}), problem (1.1) admits some alternative equivalent formulations [Mie05, Sec. 2.1]. We explicitly mention that yW1,1(0,T;Y)y\in W^{1,1}(0,T;Y) is said to be an energetic solution if it solves the energetic formulation [MT04] of (1.1), namely

y(t)S(t)t[0,T],\displaystyle y(t)\in S(t)\quad\forall t\in[0,T], (4.1)
ϕ(y(t))(t),y(t)+0tψ(y)=ϕ(y(0))(0),y(0)0t,y\displaystyle\phi(y(t))-\left\langle\ell(t),y(t)\right\rangle+\int_{0}^{t}\psi(y^{\prime})=\phi(y(0))-\left\langle\ell(0),y(0)\right\rangle-\int_{0}^{t}\left\langle\ell^{\prime},y\right\rangle
t[0,T],\displaystyle\qquad\quad\forall t\in[0,T], (4.2)
y(0)=y0.\displaystyle y(0)=y_{0}. (4.3)

Mielke & Theil [MT04] proved that the latter is equivalent to (1.1) and hence, owing to the characterization of Theorem 3.1, to F(y)=0=minFF(y)=0=\min F. For the aim of pointing out some features of our variational approach, we shall present here a direct proof of this fact.

Lemma 4.3 (Equivalence with the energetic formulation).

Let W1,1(0,T;Y)\ell\in W^{1,1}(0,T;Y^{*}). Then, F(y)=0=minFF(y)=0=\min F iff yy fulfills (4.1)-(4.3).

Proof.

Owing to Lemma 4.2, we readily have that the stability condition (4.1) holds iff ψ(Ay)=0\psi^{*}(\ell-Ay)=0 almost everywhere.

Let yy be such that F(y)=0F(y)=0. Then (4.1) and (4.3) hold and L(t,y(t),y(t))=0L(t,y(t),y^{\prime}(t))=0 for a.e. t(0,T)t\in(0,T). In particular, for all t[0,T]t\in[0,T],

0=0tL(s,y(s),y(s))ds=0t(ψ(y),y)+ϕ(y)|0t\displaystyle 0=\int_{0}^{t}L(s,y(s),y^{\prime}(s))\,\,{\rm d}s=\int_{0}^{t}\Big{(}\psi(y^{\prime})-\left\langle\ell,y^{\prime}\right\rangle\Big{)}+\phi(y)\Big{|}_{0}^{t}
=(ϕ(y),y)|0t+0tψ(y)+0t,y,\displaystyle=(\phi(y)-\left\langle\ell,y\right\rangle)\Big{|}_{0}^{t}+\int_{0}^{t}\psi(y^{\prime})+\int_{0}^{t}\left\langle\ell^{\prime},y\right\rangle,

so that the energy equality (4.2) holds for all t[0,T]t\in[0,T].

On the contrary, let yW1,1(0,T;Y)y\in W^{1,1}(0,T;Y) fulfill (4.1)-(4.3). Then χ(y(0)y0)=0\chi(y(0)-y_{0})=0 and ψ(Ay)=0\psi^{*}(\ell-Ay)=0 almost everywhere (see above). Hence F(y)=0F(y)=0 follows from the energy equality (4.2) at time TT and an integration by parts. ∎

Let us mention that the latter lemma proves in particular that the energy equality (4.2) could be equivalently enforced at the final time TT only. Moreover, it proves that, as already commented in the Introduction, all stable trajectories ty(t)t\mapsto y(t) (i.e. trajectories such that y(t)S(t)y(t)\in S(t) for all t[0,T]t\in[0,T]) are such that the following energy inequality holds

ϕ(y(t))(t),y(t)+0tψ(y)ϕ(y(0))(0),y(0)0t,yt[0,T].\phi(y(t))-\left\langle\ell(t),y(t)\right\rangle+\int_{0}^{t}\psi(y^{\prime})\geq\phi(y(0))-\left\langle\ell(0),y(0)\right\rangle-\int_{0}^{t}\left\langle\ell^{\prime},y\right\rangle\quad\forall t\in[0,T].

Hence, we have provided a proof to [Mie05, Prop. 5.7] (in a somehow simpler situation though).

Before closing this subsection, let us explicitly remark that the above inferred equivalence between formulations has been obtained for the absolutely continuous case W1,1(0,T;Y)\ell\in W^{1,1}(0,T;Y^{*}) only, whereas the characterization of Theorem 3.1 holds more generally for bounded \ell.

4.4. The functional controls the uniform distance: uniqueness

So far, we have simply reformulated known results in a variational fashion. Here, we present some novel results instead.

Lemma 4.4 (Uniform distance control via FF).

We have that

η(1η)maxt[0,T]ϕ(u(t)v(t))ηF(u)+(1η)F(v)\displaystyle\eta(1-\eta)\max_{t\in[0,T]}\phi\big{(}u(t)-v(t)\big{)}\leq\eta F(u)+(1-\eta)F(v)
u,vW1,1(0,T;Y),η[0,1].\displaystyle\forall u,\,v\in W^{1,1}(0,T;Y),\ \eta\in[0,1]. (4.4)
Proof.

The statement follows from the quadratic character of ϕ\phi. Fix t[0,T]t\in[0,T] and define Ft:W1,1(0,t;Y)[0,]F^{t}:W^{1,1}(0,t;Y)\to[0,\infty] as

Ft(y)\displaystyle F^{t}(y) =\displaystyle= 0tL(s,y(s),y(s))ds+χ(y(0)y0)\displaystyle\int_{0}^{t}L(s,y(s),y^{\prime}(s))\,\,{\rm d}s+\chi(y(0)-y_{0})
=\displaystyle= 0t(ψ(y)+ψ(Ay),y)\displaystyle\int_{0}^{t}\Big{(}\psi(y^{\prime})+\psi^{*}(\ell-Ay)-\left\langle\ell,y^{\prime}\right\rangle\Big{)}
+\displaystyle+ ϕ(y(t))+ϕ(y0)Ay(0),y0+|y(0)y0|2.\displaystyle\phi(y(t))+\phi(y_{0})-\left\langle Ay(0),y_{0}\right\rangle+|y(0)-y_{0}|^{2}.

Then, clearly yGt(y)=Ft(y)ϕ(y(t))y\mapsto G^{t}(y)=F^{t}(y)-\phi(y(t)) is convex. Hence, letting w=ηu+(1η)vw=\eta u+(1-\eta)v we have that

0Ft(w)\displaystyle 0\leq F^{t}(w)
η(Gt(u)+ϕ(u(t)))+(1η)(Gt(v)+ϕ(v(t)))η(1η)ϕ(u(t)v(t)),\displaystyle\leq\eta\big{(}G^{t}(u)+\phi(u(t))\big{)}+(1-\eta)\big{(}G^{t}(v)+\phi(v(t))\big{)}-\eta(1-\eta)\phi\big{(}u(t)-v(t)\big{)},

whence the assertion follows. ∎

The latter lemma exploits the quadratic character of ϕ\phi only. In particular, no coercivity for ϕ\phi is assumed. It should however be clear that its application (as the title of the lemma indeed suggests) will always be referred to the situation where the stronger (3.5) is required, namely when the left-hand side of (4.4) controls

η(1η)α2max[0,T]|uv|2.\eta(1-\eta)\frac{\alpha}{2}\max_{[0,T]}|u-v|^{2}.

We now present two immediate corollaries of Lemma 4.4.

Corollary 4.5 (Uniform distance from the minimizer).

Let F(y)=0F(y)=0. Then

maxt[0,T]ϕ(y(t)v(t))F(v).\displaystyle\max_{t\in[0,T]}\phi\big{(}y(t)-v(t)\big{)}\leq F(v).

This corollary encodes an interesting novel feature of our variational approach, for it provides possible a posteriori error estimator to be used within approximation procedures. It is interesting to remark that the uniform distance of any stable trajectory from the minimizer is controlled by means of its energy production along the path only. Again, although Corollary 4.5 holds under no coercivity assumptions of ϕ\phi, let us mention that its application will be restricted to the frame of (3.5). Finally, we have uniqueness of the minimizers of FF attaining the value 0.

Corollary 4.6 (Uniqueness).

Assume (3.5). Then, there exists at most one trajectory yy such that F(y)=0F(y)=0.

4.5. Lipschitz bound

Form this point on and throughout the remainder of the paper we shall tacitly assume

W1,(0,T;Y),y0S(0).\ell\in W^{1,\infty}(0,T;Y^{*}),\quad y_{0}\in S(0). (4.5)

As already commented after Lemma 4.2, the above restriction on the initial datum is mandatory whenever \ell admits a weak-right-limit in 0.

As for \ell, the extra Lipschitz continuity assumption is motivated by the rate-independence of the problem (every absolutely continuous datum can be time-rescaled to a Lipschitz continuous datum) and the following well-known result.

Lemma 4.7 (Lipschitz bound).

Assume (3.5) and let W1,(0,T;Y)\ell\in W^{1,\infty}(0,T;Y^{*}), and F(y)=0F(y)=0. Then

yL(0,T;Y)1αL(0,T;Y)a.e. in(0,T).\|y^{\prime}\|_{L^{\infty}(0,T;Y)}\leq\frac{1}{\alpha}\|\ell^{\prime}\|_{L^{\infty}(0,T;Y^{*})}\quad\text{a.e. in}\ \ (0,T). (4.6)

The proof of the lemma is exactly the classical one [MT04, Thm. 7.5], but formulated by means of our variational arguments. We provide it for the sake of completeness.

Proof.

Let 0s<tT0\leq s<t\leq T be fixed. Since L(y,y)=0L(y,y^{\prime})=0 almost everywhere we have that

st(ψ(y)+,y)+(ϕ(y),y)|st=0.\int_{s}^{t}\Big{(}\psi(y^{\prime})+\left\langle\ell^{\prime},y\right\rangle\Big{)}+(\phi(y)-\left\langle\ell,y\right\rangle)\Big{|}_{s}^{t}=0.

On the other hand, owing to the strong monotonicity of AA and the fact that y(s)S(s)y(s)\in S(s) (see Lemma 4.2) one obtains

ϕ(y(t)y(s))ϕ(y(t))(s),y(t)+ψ(y(t)y(s))ϕ(y(s))(s),y(s)\displaystyle\phi(y(t)-y(s))\leq\phi(y(t))-\left\langle\ell(s),y(t)\right\rangle+\psi(y(t)-y(s))-\phi(y(s))-\left\langle\ell(s),y(s)\right\rangle
=(ϕ(y),u)|st+st(r),y(t)dr+ψ(y(t)y(s)).\displaystyle=(\phi(y)-\left\langle\ell,u\right\rangle)\Big{|}_{s}^{t}+\int_{s}^{t}\left\langle\ell^{\prime}(r),y(t)\right\rangle\,{\rm d}r+\psi(y(t)-y(s)).

By taking the sum of these two relations and recalling that, by Jensen,

ψ(y(t)y(s))=ψ(sty)stψ(y)\psi(y(t)-y(s))=\psi\left(\int_{s}^{t}y^{\prime}\right)\leq\int_{s}^{t}\psi(y^{\prime})

we get that

ϕ(y(t)y(s))st(r),y(t)y(r)dr.\phi(y(t)-y(s))\leq\int_{s}^{t}\left\langle\ell^{\prime}(r),y(t)-y(r)\right\rangle\,{\rm d}r.

Finally, an application of some extended Gronwall lemma (see [Mie05, Thm. 3.4]) entails that

α2|y(t)y(s)|212L(0,T;Y)|y(t)y(s)|(ts)\frac{\alpha}{2}|y(t)-y(s)|^{2}\leq\frac{1}{2}\|\ell^{\prime}\|_{L^{\infty}(0,T;Y^{*})}|y(t)-y(s)|\,(t-s)

and the assertion follows.

5. Space and data approximation

We now apply the characterization results of Theorem 3.1 to the approximation of solutions of (1.1). As already commented in the Introduction, we shall proceed via Γ\Gamma-convergence [GF75]. The reader is referred to the monographs by Attouch [Att84] and Dal Maso [Dal93] for some comprehensive discussion on this topic. Indeed, since Theorem 3.1 directly quantifies the value of the minimum to be 0, what is actually needed for passing to limits are Γlim inf\Gamma-\liminf\, inequalities only. We shall illustrate this fact by discussing the simple case of data approximation first.

Lemma 5.1 (Data approximation).

Assume (3.5), let h\ell_{h}\to\ell strongly in L1(0,T;Y)L^{1}(0,T;Y^{*}) being uniformly Lipschitz continuous, and y0,hy0y_{0,h}\to y_{0}. Moreover, let Fh:W1,1(0,T;Y)[0,]F_{h}:W^{1,1}(0,T;Y)\to[0,\infty] be defined as

Fh(y)=0T(ψ(y)+ψ(hAy)hAy,y)+χ(y(0)y0,h),\displaystyle F_{h}(y)=\int_{0}^{T}\Big{(}\psi(y^{\prime})+\psi^{*}(\ell_{h}-Ay)-\left\langle\ell_{h}-Ay,y^{\prime}\right\rangle\Big{)}+\chi(y(0)-y_{0,h}),

and let Fh(yh)=0F_{h}(y_{h})=0. Then yhyy_{h}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) and F(y)=0F(y)=0.

Proof.

Owing to Lemma 4.7, we find a (not relabeled) subsequence yhy_{h} such that yhyy_{h}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y). Hence, we have by lower semicontinuity

0F(y)\displaystyle 0\leq F(y) \displaystyle\leq lim infh0(0T(ψ(yh)+ψ(hAyh)h,yh)\displaystyle\liminf_{h\to 0}\Bigg{(}\int_{0}^{T}\Big{(}\psi(y^{\prime}_{h})+\psi^{*}(\ell_{h}-Ay_{h})-\left\langle\ell_{h},y^{\prime}_{h}\right\rangle\Big{)}
+\displaystyle+ ϕ(yh(T))+ϕ(y0,n)Ayh(0),y0,h+|yh(0)y0,h|2)\displaystyle\phi(y_{h}(T))+\phi(y_{0,n})-\left\langle Ay_{h}(0),y_{0,h}\right\rangle+|y_{h}(0)-y_{0,h}|^{2}\Bigg{)}
=\displaystyle= lim infh0Fh(yh)=0.\displaystyle\liminf_{h\to 0}F_{h}(y_{h})=0.

Hence, F(y)=0F(y)=0, yy is unique, and the assertion follows from the fact that the whole sequence converges. ∎

5.1. Preliminaries on functional convergence

In order to move to more general approximation situations, we are forced to discuss a suitable functional convergence notion. We limit ourselves in introducing the relevant definitions, referring to the mentioned monographs for all the necessary details.

Recall that YY is a real reflexive Banach space. Letting fn,f:Y(,]f_{n},\,f:Y\to(-\infty,\infty] be convex, proper, and lower semicontinuous, we say that fnff_{n}\to f in the Mosco sense in YY [Att84, Mos69] iff, for all yYy\in Y,

f(y)lim infnfn(yn)ynyweakly inY,\displaystyle f(y)\leq\liminf\limits_{n\to\infty}f_{n}(y_{n})\quad\forall y_{n}\to y\ \ \text{weakly in}\ \ Y,
ynystrongly inYsuch thatf(y)=lim supnfn(yn).\displaystyle\exists y_{n}\to y\ \ \text{strongly in}\ \ Y\ \ \text{such that}\ \ f(y)=\limsup\limits_{n\to\infty}f_{n}(y_{n}).

In particular, fnff_{n}\to f in the Mosco sense iff fnff_{n}\to f in the sense of Γ\Gamma-convergence with respect to both the weak and the strong topology in YY.

We will consider the situation of approximating functionals ψh\psi_{h}. By [Att84, Thm. 3.18, p. 295], we have that ψhψ\psi_{h}\to\psi in the Mosco sense in YY iff ψhψ\psi_{h}^{*}\to\psi^{*} in the Mosco sense in YY^{*}. By assuming the functionals ψh\psi_{h} to be positively 11-homogeneous, it turns out that the Mosco convergence ψhψ\psi_{h}\to\psi in YY is equivalent to the Mosco convergence of sets ChC{C^{*}_{h}}\to{C^{*}} in YY^{*} which reads,

Cnynyweakly in YyC,\displaystyle C^{*}_{n}\ni y^{*}_{n}\to y^{*}\quad\text{weakly in $Y^{*}$}\ \ \Rightarrow\ \ y^{*}\in C^{*},
yC,ynCn:ynystrongly in Y.\displaystyle\forall y^{*}\in C^{*},\ \ \exists y^{*}_{n}\in C^{*}_{n}\ :\ y^{*}_{n}\to y^{*}\quad\text{strongly in $Y^{*}$}.

Finally, we repeatedly use a lemma from [Ste06a] which we report it here, for the sake of completeness.

Lemma 5.2 (Cor. 4.4, [Ste06a]).

Let p[1,]p\in[1,\infty] and fh,f:Y(,]f_{h},\,f:Y\to(-\infty,\infty] be convex, proper, and lower semicontinuous such that

f(y)inf{lim infh0fh(yh):yhyweakly inY}yY.f(y)\leq\inf\left\{\liminf\limits_{h\to 0}f_{h}(y_{h})\ :\ y_{h}\to y\ \ \text{weakly in}\ \ Y\right\}\quad\forall y\in Y.

Moreover, let yhyy_{h}\to y weakly in W1,p(0,T;Y)W^{1,p}(0,T;Y) (weakly star if p=p=\infty), Then, we have that

0Tf(y(t))dtlim infh00Tfh(yh(t))dt.\int_{0}^{T}f(y(t))\,\,{\rm d}t\leq\liminf\limits_{h\to 0}\int_{0}^{T}f_{h}(y_{h}(t))\,\,{\rm d}t.

5.2. Space approximations

We now move to the analysis of some space approximation situation, indeed specifically tailored for the case of conformal finite elements. Let us enlist here our assumptions for the sake of later referencing.

We assume to be given

YhYclosed subspaces such thath>0Yhis dense inY,\displaystyle Y_{h}\subset Y\ \ \text{closed subspaces such that}\ \ \bigcup_{h>0}Y_{h}\ \ \text{is dense in}\ \ Y, (5.1)
ϕh(y)=ϕ(y)ifyYhandϕh(y)=otherwise.\displaystyle\phi_{h}(y)=\phi(y)\ \ \text{if}\ \ y\in Y_{h}\ \ \text{and}\ \ \phi_{h}(y)=\infty\ \ \text{otherwise}. (5.2)
ψh:Y(,]convex, proper, and lower semicontinuous,\displaystyle\psi_{h}:Y\to(-\infty,\infty]\quad\text{convex, proper, and lower semicontinuous}, (5.3)
ψhpositively 1-homogeneous,\displaystyle\psi_{h}\quad\text{positively $1$-homogeneous}, (5.4)
ψhψin the Mosco sense in Y,\displaystyle\psi_{h}\to\psi\quad\text{in the Mosco sense in $Y$}, (5.5)
ϕ(y)α2|y|2yChChwhereCh=D(ψh),\displaystyle\phi(y)\geq\frac{\alpha}{2}|y|^{2}\quad\forall y\in C_{h}-C_{h}\quad\text{where}\ \ C_{h}=D(\psi_{h}), (5.6)
hpointwise strongly in Y,\displaystyle\ell_{h}\to\ell\quad\text{pointwise strongly in $Y^{*}$}, (5.7)
huniformly Lipschitz continuous,\displaystyle\ell_{h}\quad\text{uniformly Lipschitz continuous}, (5.8)
y0,hYh,y0,hy0.\displaystyle y_{0,h}\in Y_{h},\ \ y_{0,h}\to y_{0}. (5.9)

We shall mention that within the frame of conformal finite elements methods the subspaces YhY_{h} are obviously taken to be finite-dimensional and that the approximating functionals ϕh\phi_{h} and ψh\psi_{h} are usually the restrictions of the functionals ϕ\phi and ψ\psi on the subspace YhY_{h}. This is exactly our choice here for ϕh\phi_{h}. In particular, one shall observe that ϕhϕ\phi_{h}\to\phi in the Mosco sense in YY, D(ϕh)=YhD(\partial\phi_{h})=Y_{h}, and that

Ahy=ϕh(y)=ϕ(y)=Dϕ(y)=AyyYh.A_{h}y=\partial\phi_{h}(y)=\partial\phi(y)=D\phi(y)=Ay\quad\forall y\in Y_{h}. (5.10)

As for ψh\psi_{h} we are allowing some extra freedom (let us however remark that (5.6) follows from (3.5) as soon as ψh\psi_{h} is the restriction of ψ\psi to YhY_{h} since, in this case, Ch=CYhC_{h}=C\cap Y_{h}). On the other hand, we are asking ψh\psi_{h} to be positively 11-homogeneous, namely we are considering the case of some rate-independent approximation of (1.1) only. The reader is referred instead to Efendiev & Mielke [EM06], Efendiev, Mielke, Rossi, & Savaré [EMRS06], and Zanini [Zan07] for some results in the direction of rate-dependent approximation of rate-independent processes.

Finally, we shall (re)define the approximating functionals as Fh:W1,1(0,T;Y)[0,]F_{h}:W^{1,1}(0,T;Y)\to[0,\infty] as

Fh(y)=0T(ψh(y)+ψh(hAhy)hAhy,y)+χh(y(0)y0,h),\displaystyle F_{h}(y)=\int_{0}^{T}\Big{(}\psi_{h}(y^{\prime})+\psi^{*}_{h}(\ell_{h}-A_{h}y)-\left\langle\ell_{h}-A_{h}y,y^{\prime}\right\rangle\Big{)}+\chi_{h}(y(0)-y_{0,h}),

where Ah=ϕhA_{h}=\partial\phi_{h} and χh()=ϕh()+||2\chi_{h}(\cdot)=\phi_{h}(\cdot)+|\cdot|^{2}. We have the following.

Theorem 5.3 (Convergence of space approximations).

Assume (5.1)-(5.9) and let Fh(yh)=0F_{h}(y_{h})=0. Then yhyy_{h}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) and F(y)=0F(y)=0.

Proof.

By Lemma 4.7, we find a (not relabeled) subsequence ynyy_{n}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) and weakly pointwise. Since Fh(yh)=0F_{h}(y_{h})=0 we readily check that y(t)Yhy(t)\in Y_{h} for all t[0,T]t\in[0,T]. In particular, Ahyh=AyhA_{h}y_{h}=Ay_{h} for all t[0,T]t\in[0,T] owing to (5.10). Hence, by lower semicontinuity,

0F(y)\displaystyle 0\leq F(y) \displaystyle\leq lim infh0(0T(ψh(yh)+ψh(hAyh)h,yh)\displaystyle\liminf_{h\to 0}\Bigg{(}\int_{0}^{T}\Big{(}\psi_{h}(y^{\prime}_{h})+\psi^{*}_{h}(\ell_{h}-Ay_{h})-\left\langle\ell_{h},y^{\prime}_{h}\right\rangle\Big{)}
+\displaystyle+ ϕ(yh(T))+ϕ(y0,h)Ayh(0),y0,h+|yh(0)y0,h|2)\displaystyle\phi(y_{h}(T))+\phi(y_{0,h})-\left\langle Ay_{h}(0),y_{0,h}\right\rangle+|y_{h}(0)-y_{0,h}|^{2}\Bigg{)}
=\displaystyle= lim infh0(0T(ψh(yh)+ψh(hAhyh)h,yh)\displaystyle\liminf_{h\to 0}\Bigg{(}\int_{0}^{T}\Big{(}\psi_{h}(y^{\prime}_{h})+\psi^{*}_{h}(\ell_{h}-A_{h}y_{h})-\left\langle\ell_{h},y^{\prime}_{h}\right\rangle\Big{)}
+\displaystyle+ ϕh(yh(T))ϕh(yh(0)))\displaystyle\phi_{h}(y_{h}(T))-\phi_{h}(y_{h}(0))\Bigg{)}
=\displaystyle= lim infh0Fh(yh)=0,\displaystyle\liminf_{h\to 0}F_{h}(y_{h})=0,

Note that the integral terms containing ψ\psi and ψ\psi^{*} pass to the lim inf\liminf by means of Lemma 5.2. ∎

By inspecting the proof of Theorem 5.3 (which of course generalizes Lemma 5.1), one realizes that, whenever the weak-star precompactness in W1,(0,T;Y)W^{1,\infty}(0,T;Y) of the sequence yhy_{h} is assumed, the convergence statement holds more generally in the case Fh(yh)0F_{h}(y_{h})\to 0. Namely, by directly asking for the above-mentioned compactness, one could consider the convergence of some approximated solutions yhy_{h} such that, possibly, Fh(yh)>0F_{h}(y_{h})>0. We rephrase this fact in the following statement.

Lemma 5.4 (Γ\Gamma-liminf inequality for FhF_{h}).

Assume (5.1)-(5.3), (5.5)-(5.7), and (5.9). Then,

F(u)inf{lim infh0Fh(yh):yhyweakly star inW1,(0,T;Y)}.F(u)\leq\inf\left\{\liminf\limits_{h\to 0}F_{h}(y_{h})\ :\ y_{h}\to y\quad\text{weakly star in}\ \ W^{1,\infty}(0,T;Y)\right\}.

Note that the homogeneity of ψh\psi_{h}, the uniform convexity of ϕh\phi_{h}, and the Lipschitz continuity of h\ell_{h} play no role here.

Finally, again by looking carefully to the proof of Theorem 5.3 one could wonder if the requirement on the Mosco convergence of ψh\psi_{h} could be weakened. Indeed, what we are actually using is only that

ψΓlim infh0ψhandψΓlim infh0ψh\psi\leq\Gamma-\liminf\limits_{h\to 0}\psi_{h}\quad\text{and}\quad\psi^{*}\leq\Gamma-\liminf\limits_{h\to 0}\psi_{h}^{*} (5.11)

with respect to the weak topologies of YY and YY^{*}, respectively. On the other hand, in our specific situation, [Ste06a, Lemma 4.1] entails that (5.11) and the fact that ψhψ\psi_{h}\to\psi in the Mosco sense in YY are equivalent.

This observation motivates once again the belief that Mosco convergence is the right frame in order to pass to limits within rate-independent problems. For the sake of completeness, let us recall that a first result in the direction of the approximation of the play operator (YY Hilbert and AA coercive on YY) under the Hausdorff convergence of the characteristic sets Ch=D(ψh)C_{h}^{*}=D(\psi^{*}_{h}) is contained in [Kre96, Thm. 3.12, p. 34] whereas the extension of this result to the more general situation of Mosco converging sets as well as some application to parabolic PDEs with hysteresis is discussed in [Ste06b]. More recently, Mielke, Roubíček, & Stefanelli [MRS06] addressed in full generality the issue of Γ\Gamma-convergence and relaxation for the energetic solutions of rate-independent processes. An alternative convergence proof in the specific case of convex energies is obtained by means of the Brezis-Ekeland approach in [Ste06a].

6. Time-discretization

Assume now we are given the partitions Pn={0=tn0<tn1<<tnNn=T}P_{n}=\{0=t^{0}_{n}<t^{1}_{n}<\dots<t^{N_{n}}_{n}=T\} and denote by τni=tnitni1\tau^{i}_{n}=t^{i}_{n}-t^{i-1}_{n} the ii-th time-step and by τn=max1iNnτni\tau_{n}=\max_{1\leq i\leq N_{n}}\,\tau_{n}^{i} the diameter of the nn-th partition. No constraints are imposed on the possible choice of the time-steps throughout this analysis besides τn0\tau_{n}\to 0 as nn\to\infty. Moreover, let the parameter θ[1/2,1]\theta\in[1/2,1] be given.

In the following we will make an extensive use of the following notation: letting v=(v0,,vNn)v=(v^{0},\dots,v^{N_{n}}) be a vector, we will denote by v^n\widehat{v}_{n} and v¯n\overline{v}_{n} two functions of the time-interval [0,T][0,T] which interpolate the values of the vector vv piecewise linearly and backward constantly on the partition PnP_{n}, respectively. Namely

v^n(0)=v0,v^n(t)=γni(t)vi+(1γni(t))vi1,\displaystyle\widehat{v}_{n}(0)=v^{0},\ \ \widehat{v}_{n}(t)=\gamma^{i}_{n}(t)v^{i}+\big{(}1-\gamma^{i}_{n}(t)\big{)}v^{i-1},
v¯n(0)=v0,v¯n(t)=vi,fort(tni1,tni],i=1,,Nn\displaystyle\overline{v}_{n}(0)=v^{0},\ \ \overline{v}_{n}(t)=v^{i},\quad\text{for}\ \ t\in(t^{i-1}_{n},t^{i}_{n}],\ \ i=1,\dots,N_{n}

where

γni(t)=(ttni1)/τnifort(tni1,tni],i=1,,Nn.\gamma^{i}_{n}(t)=(t-t^{i-1}_{n})/\tau^{i}_{n}\quad\mbox{for}\ \ t\in(t^{i-1}_{n},t^{i}_{n}],\ \ i=1,\dots,N_{n}.

Moreover, we let δvi=(vivi1)/τni\delta v^{i}=(v^{i}-v^{i-1})/\tau^{i}_{n}\, for i=1,,Nn\,i=1,\dots,N_{n} (so that v^n=δvn¯\widehat{v}_{n}^{\prime}=\overline{\delta v_{n}}) and denote by vθv_{\theta} the vector with components vθi=θvi+(1θ)vi1v_{\theta}^{i}=\theta v^{i}+(1-\theta)v^{i-1}.

Recall that W1,(0,T;Y)\ell\in W^{1,\infty}(0,T;Y^{*}) and y0S(0)y_{0}\in S(0). We shall be concerned with the so-called θ\theta-scheme for problem (1.1):

ψ(yniyni1τni)+A(θyni+(1θ)yni1)(θtni+(1θ)tni1)\displaystyle\partial\psi\left(\frac{y^{i}_{n}-y^{i-1}_{n}}{\tau^{i}_{n}}\right)+A(\theta y^{i}_{n}+(1-\theta)y^{i-1}_{n})\ni\ell(\theta t^{i}_{n}+(1-\theta)t^{i-1}_{n})
fori=1,,Nn,\displaystyle\text{for}\ \ i=1,\dots,N_{n}, (6.1)
yn0=y0.\displaystyle y^{0}_{n}=y_{0}. (6.2)

One usually refers to the latter as backward or implicit Euler scheme for the choice θ=1\theta=1 and as Crank-Nicholson scheme for θ=1/2\theta=1/2.

Owing to the above-introduced notation, the latter scheme can be equivalently rewritten as

ψ(yniyni1)+Ayn,θi(tn,θi)fori=1,,Nn,yn0=y0.\displaystyle\partial\psi\left(y^{i}_{n}-y^{i-1}_{n}\right)+Ay^{i}_{n,\theta}\ni\ell(t^{i}_{n,\theta})\quad\text{for}\ \ i=1,\dots,N_{n},\quad y^{0}_{n}=y_{0}. (6.3)

Clearly, the θ\theta-scheme (6.3) is rate-independent. Namely, no time-step appears in (6.3) and the choice of the partition affects the solution via the values of the load \ell only. In this concern, our focus on variable time-steps partition could be simplified by considering proper rescaled loads \ell instead. We shall however keep up with it, especially in order to underline the possibility of adapting the partition according to some a posteriori analysis (see Subsection 6.7).

Before moving on, let us comment that, for all nn, the latter scheme as a unique solution. Indeed, given yni1Cy^{i-1}_{n}\in C, it suffices to (uniquely) solve iteratively the incremental problem

yni Arg min yY(θϕ(y)(tn,θi)(1θ)Ayni1,y+ψ(yyni1)).y^{i}_{n}\in\text{\,\rm Arg\,min\,}_{y\in Y}\Big{(}\theta\phi(y)-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n},y\right\rangle+\psi(y-y^{i-1}_{n})\Big{)}. (6.4)

Note that, since yni1Cy^{i-1}_{n}\in C, the functional under minimization turns out to be uniformly convex. Hence, by (3.4), the minimum problem has a unique solution. In particular, exactly as in Lemma 4.1 we have the following.

Lemma 6.1.

yniCy^{i}_{n}\in C for all i=0,1,,Nni=0,1,\dots,N_{n}.

A crucial observation is that, as in the continuous case, the discrete trajectories show some sort of stability as well.

Lemma 6.2 (Stability of the discrete trajectories).

We have that

yni Arg min yY(θϕ(y)(tn,θi)(1θ)Ayni1,y+ψ(yyni))\displaystyle y^{i}_{n}\in\text{\,\rm Arg\,min\,}_{y\in Y}\Big{(}\theta\phi(y)-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n},y\right\rangle+\psi(y-y^{i}_{n})\Big{)}
fori=1,,Nn.\displaystyle\text{for}\ \ i=1,\dots,N_{n}. (6.5)

In particular, if θ=1\theta=1, then yniS(tni)y^{i}_{n}\in S(t^{i}_{n}).

Proof.

From the incremental formulation (6.4) and the triangle inequality for ψ\psi we get that, for all yYy\in Y,

θϕ(yni)(tn,θi)(1θ)Ayni1,yni+ψ(yniyni1)\displaystyle\theta\phi(y^{i}_{n})-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n},y^{i}_{n}\right\rangle+\psi(y^{i}_{n}-y^{i-1}_{n})
θϕ(y)(tn,θi)(1θ)Ayni1,y+ψ(yyni1)\displaystyle\leq\theta\phi(y)-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n},y\right\rangle+\psi(y-y^{i-1}_{n})
θϕ(y)(tn,θi)(1θ)Ayni1,y+ψ(yyni)+ψ(yniyni1),\displaystyle\leq\theta\phi(y)-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n},y\right\rangle+\psi(y-y^{i}_{n})+\psi(y^{i}_{n}-y^{i-1}_{n}),

whence the assertion follows. ∎

Again as in the continuous case, we readily check that

relation (6.5) holds iff(tn,θi)Ayn,θiC.\text{relation \eqref{eq_stab} holds iff}\quad\ell(t^{i}_{n,\theta})-Ay^{i}_{n,\theta}\in C^{*}. (6.6)

6.1. The discrete variational principle

We shall now present a discrete version of the variational principle of Theorem 3.1.

We define Lnθ,i(y,z):Y×Y[0,]L^{\theta,i}_{n}(y,z):Y\times Y\to[0,\infty] as

Lnθ,i(y,z)=ψ(yzτni)+ψ((tn,θi)A(θy+(1θ)z))\displaystyle L^{\theta,i}_{n}(y,z)=\psi\left(\frac{y-z}{\tau^{i}_{n}}\right)+\psi^{*}\left(\ell(t^{i}_{n,\theta})-A(\theta y+(1-\theta)z)\right)
(tn,θi)A(θy+(1θ)z),yzτni,\displaystyle-\left\langle\ell(t^{i}_{n,\theta})-A(\theta y+(1-\theta)z),\frac{y-z}{\tau^{i}_{n}}\right\rangle,

and the functionals Fnθ:YNn+1[0,]F_{n}^{\theta}:Y^{N_{n}+1}\to[0,\infty] as

Fnθ(yn0,,ynNn)=i=1NnτniLnθ,i(yni,yni1)+χ(yn0y0).F_{n}^{\theta}(y^{0}_{n},\dots,y^{N_{n}}_{n})=\sum_{i=1}^{N_{n}}\tau^{i}_{n}L^{\theta,i}_{n}(y^{i}_{n},y^{i-1}_{n})+\chi(y^{0}_{n}-y_{0}).
Lemma 6.3 (Discrete variational principle).

(yn0,,ynNn)(y^{0}_{n},\dots,y^{N_{n}}_{n}) solves (6.3) iff Fnθ(yn0,,ynNn)=0=minFnθF_{n}^{\theta}(y^{0}_{n},\dots,y^{N_{n}}_{n})=0=\min F_{n}^{\theta}.

Proof.

Analogously to the continuous case, we have that, for all i=1,,Nni=1,\dots,N_{n},

ψ(δyni)+Ayn,θi(tn,θi)iffLnθ,i(yni,yni1)=0,\partial\psi\left(\delta y^{i}_{n}\right)+Ay^{i}_{n,\theta}\ni\ell(t^{i}_{n,\theta})\quad\text{iff}\quad L^{\theta,i}_{n}(y^{i}_{n},y^{i-1}_{n})=0,

and yn0=y0y^{0}_{n}=y_{0} iff χ(yn0y0)=0\chi(y^{0}_{n}-y_{0})=0. ∎

Let us observe that the functional FnθF_{n}^{\theta} is convex and lower-semicontinuous. Moreover, by the homogeneity of ψ\psi (see (3.2)), FnθF_{n}^{\theta} is actually independent of the time steps. In fact, we have

Fnθ(yn0,,ynNn)=i=1Nn(ψ(yniyni1)+ψ((tn,θi)Ayn,θi))\displaystyle F_{n}^{\theta}(y^{0}_{n},\dots,y^{N_{n}}_{n})=\sum_{i=1}^{N_{n}}\Bigg{(}\psi\left({y^{i}_{n}-y^{i-1}_{n}}\right)+\psi^{*}\left(\ell(t^{i}_{n,\theta})-Ay^{i}_{n,\theta})\right)
(tn,θi)Ayn,θi,yniyni1)+χ(yn0y0).\displaystyle-\left\langle\ell(t^{i}_{n,\theta})-Ay^{i}_{n,\theta},{y^{i}_{n}-y^{i-1}_{n}}\right\rangle\Bigg{)}+\chi(y^{0}_{n}-y_{0}).

The idea of dealing with time-discretizations via a discrete variational principle closely relates our analysis to the theory of so-called variationals integrators. The latter are numerical schemes stemming from the approximation of the action functional in Lagrangian Mechanics. By referring the reader to the monograph [HLW06] and the survey [MW01], we shall restrain here from giving a detailed presentation of the subject and limit ourselves to some (necessarily sketchy) considerations. Letting (t,y,p)[0,T]×m×m(t,y,p)(t,y,p)\in[0,T]\times\mathbb{R}^{m}\times\mathbb{R}^{m}\mapsto{\mathcal{L}}(t,y,p) denote the Lagrangian of a (finite dimensional, for simplicity) system, the Hamilton principle asserts that the actual trajectory ty(t)t\mapsto y(t) of the system minimizes the action functional

y0T(t,y(t),y(t))dty\mapsto\int_{0}^{T}{\mathcal{L}}(t,y(t),y^{\prime}(t))\,\,{\rm d}t

among all curves with prescribed endpoints, thus solving the Lagrange equations

yiddtpi=0for i=1,,m.\partial_{y_{i}}{\mathcal{L}}-\displaystyle\frac{\text{\rm d}}{\rm dt}\partial_{p_{i}}{\mathcal{L}}=0\quad\text{for $i=1,\dots,m$}. (6.7)

Hence, a natural idea is that of deriving numerical schemes for Lagrangian mechanics by applying some quadrature procedure to the action functional, i.e. discretizing Hamilton’s principle. The resulting discrete schemes show comparable performance with respect to other methods but generally enjoy some interesting extra (and often crucial) properties such as the conservation of suitable quantities [LMOW04]. Variational integrators have been intensively applied in finite-dimensional contexts and, more recently, to the situation of nonlinear wave equations [MPS98] and non-equilibrium elasticity [LMOW03].

The present analysis may bear some resemblance to the above-mentioned theory. Indeed, the formulation of the θ\theta-scheme in the case θ=1/2\theta=1/2 stems exactly from the midpoint quadrature of the functional FF as

tni1tniL(t,y^(t),y^(t))dt\displaystyle\int_{t^{i-1}_{n}}^{t^{i}_{n}}L(t,\widehat{y}(t),\widehat{y}^{\prime}(t))\,\,{\rm d}t
=\displaystyle= τniL(tn,1/2i,y^(tn,1/2i),y^(tn,1/2i))\displaystyle\tau^{i}_{n}L(t^{i}_{n,1/2},\widehat{y}(t^{i}_{n,1/2}),\widehat{y}^{\prime}(t^{i}_{n,1/2}))
=\displaystyle= ψ(yiyi1)+ψ((tn,1/2i)A(yi+yi12))\displaystyle\psi\left({y^{i}-y^{i-1}}\right)+\psi^{*}\left(\ell(t^{i}_{n,1/2})-A\left(\frac{y^{i}+y^{i-1}}{2}\right)\right)
\displaystyle- (tn,1/2i)A(yi+yi12),yniyni1,\displaystyle\left\langle\ell(t^{i}_{n,1/2})-A\left(\frac{y^{i}+y^{i-1}}{2}\right),{y^{i}_{n}-y^{i-1}_{n}}\right\rangle,

where y^\widehat{y} is taken to be piecewise affine on the partition PnP_{n}.

On the other hand, our focus here is quite different. First of all, we are not dealing with the Hamilton principle (endpoints are not fixed) as we are not aimed at solving the Euler-Lagrange equations for FF (i.e., solve (6.7)). Secondly, we are specifically interested at infinite-dimensional situations, namely PDEs. Finally, the only choice of θ\theta which is directly related with a quadrature of FF is θ=1/2\theta=1/2 and we are not considering higher order schemes.

Before closing this discussion, let us mention that some Γ\Gamma-convergence techniques have been recently exploited in the (finite-dimensional) frame of variational integrators by Müller & Ortiz [MO04] (see also [MM04]). In this same spirit, we are here providing Γ\Gamma-convergence results in infinite dimensions instead.

6.2. Stability of the θ\theta-scheme

It is known since Han & Reddy [HR95, HR99] that the choice θ<1/2\theta<1/2 in (6.3) leads to an unconditionally unstable scheme and that, on the contrary, for θ[1/2,1]\theta\in[1/2,1] the θ\theta-scheme is stable in H1(0,T;Y)H^{1}(0,T;Y) when YY is a Hilbert space and the partitions are chosen to be uniform.

Here we shall provide an alternative stability proof by taking into account the Banach frame.

Lemma 6.4 (Stability).

Assume (3.5) and let θ[1/2,1]\theta\in[1/2,1]. Then, the solution to the θ\theta-scheme (6.3) fulfills

y^n,θL(0,T;Y)1αL(0,T;Y)ifθ=1orθ=12.\|\widehat{y}_{n,\theta}^{\prime}\|_{L^{\infty}(0,T;Y)}\leq\frac{1}{\alpha}\|\ell^{\prime}\|_{L^{\infty}(0,T;Y^{*})}\quad\text{if}\ \ \theta=1\ \ \text{or}\ \ \theta=\frac{1}{2}. (6.8)

Moreover, for constant time-steps,

y^n,θL(0,T;Y)1α(2θ1)L(0,T;Y)if12<θ<1.\|\widehat{y}_{n,\theta}^{\prime}\|_{L^{\infty}(0,T;Y)}\leq\frac{1}{\alpha(2\theta-1)}\|\ell^{\prime}\|_{L^{\infty}(0,T;Y^{*})}\quad\text{if}\ \ \frac{1}{2}<\theta<1. (6.9)

Our argument coincides with that of [MT04, Thm. 4.4] in the case of Euler, i.e. θ=1\theta=1 and it is an extension of the latter for the case 1/2<θ<11/2<\theta<1. Here, we do not play with the variational inequality by choosing suitable tests but use the scalar relations Lnθ,i(yni,yni1)=0L^{\theta,i}_{n}(y^{i}_{n},y^{i-1}_{n})=0 instead (this makes however no substantial difference since the latter scalar relations are exactly the outcome of the test on the variational inequality in [MT04, Thm. 4.4]).

The stability proof for the Crank-Nicholson scheme θ=1/2\theta=1/2 is quite different from former arguments and stems as a direct outcome of our variational approach. Let us mention that, unlike the classical parabolic situation, here the Crank-Nicholson scheme is indeed unconditionally stable. The reason for this fact is the rate-independence of the problem, namely the degenerate character of the evolution (no relaxation time). In both cases θ=1\theta=1 and θ=1/2\theta=1/2, the stability constant 1/α1/\alpha is sharp (see Lemma 4.7).

We complement this analysis by providing the stability for the θ\theta-scheme for 1/2<θ<11/2<\theta<1 in the case of constant time-steps (likely with a non-optimal, although explicit, stability constant).

Proof.

Let us prove the stability of the Crank-Nicholson scheme θ=1/2\theta=1/2 first. For this aim, it suffices to recall that

0=Fn1/2(yn0,,ynNn)\displaystyle 0=F^{1/2}_{n}(y^{0}_{n},\dots,y^{N_{n}}_{n})
=0T(ψ(y^n)+ψ(^nAy^n)^nAy^n,y^n)+χ(y^n(0)y0).\displaystyle=\int_{0}^{T}\Big{(}\psi(\widehat{y}^{\prime}_{n})+\psi^{*}(\widehat{\ell}_{n}-A\widehat{y}_{n})-\left\langle\widehat{\ell}_{n}-A\widehat{y}_{n},\widehat{y}_{n}^{\prime}\right\rangle\Big{)}+\chi(\widehat{y}_{n}(0)-y_{0}).

Hence y^n\widehat{y}_{n} minimizes the functional FF where \ell is replaced by ^n\widehat{\ell}_{n}. The stability estimate follows from Lemma 4.7.

Let us now move to the case 1/2<θ11/2<\theta\leq 1. Relation (6.5) applied at level i1i-1 for some i=2,,Nni=2,\dots,N_{n} along with the choice y=yniy=y^{i}_{n} entails that

θϕ(yniyni1)+θϕ(yni1)(tn,θi1)(1θ)Ayni2,yni1\displaystyle\theta\phi(y^{i}_{n}-y^{i-1}_{n})+\theta\phi(y^{i-1}_{n})-\left\langle\ell(t^{i-1}_{n,\theta})-(1-\theta)Ay^{i-2}_{n},y^{i-1}_{n}\right\rangle
θϕ(yni)(tn,θi1)(1θ)Ayni2,yni+ψ(yniyni1),\displaystyle\leq\theta\phi(y^{i}_{n})-\left\langle\ell(t^{i-1}_{n,\theta})-(1-\theta)Ay^{i-2}_{n},y^{i}_{n}\right\rangle+\psi(y^{i}_{n}-y^{i-1}_{n}),

where the extra-term θϕ(yniyni1)\theta\phi(y^{i}_{n}-y^{i-1}_{n}) is obtained from the fact that ϕ\phi is quadratic. Hence, we have that

θϕ(yniyni1)+θϕ(yni1)θϕ(yni)\displaystyle\theta\phi(y^{i}_{n}-y^{i-1}_{n})+\theta\phi(y^{i-1}_{n})-\theta\phi(y^{i}_{n})
(tn,θi1),yni1yni+(1θ)A(yni2yni1),yniyni1+ψ(yniyni1)\displaystyle\leq\left\langle\ell(t^{i-1}_{n,\theta}),y^{i-1}_{n}-y^{i}_{n}\right\rangle+(1-\theta)\left\langle A(y^{i-2}_{n}-y^{i-1}_{n}),y^{i}_{n}-y^{i-1}_{n}\right\rangle+\psi(y^{i}_{n}-y^{i-1}_{n})
+(1θ)Ayni1,yniyni1\displaystyle+(1-\theta)\left\langle Ay^{i-1}_{n},y^{i}_{n}-y^{i-1}_{n}\right\rangle
=(tn,θi1),yni1yni+(1θ)A(yni2yni1),yniyni1+ψ(yniyni1)\displaystyle=\left\langle\ell(t^{i-1}_{n,\theta}),y^{i-1}_{n}-y^{i}_{n}\right\rangle+(1-\theta)\left\langle A(y^{i-2}_{n}-y^{i-1}_{n}),y^{i}_{n}-y^{i-1}_{n}\right\rangle+\psi(y^{i}_{n}-y^{i-1}_{n})
(1θ)(ϕ(yni1)+ϕ(yniyni1)ϕ(yni)),\displaystyle-(1-\theta)\Big{(}\phi(y^{i-1}_{n})+\phi(y^{i}_{n}-y^{i-1}_{n})-\phi(y^{i}_{n})\Big{)},

so that

ϕ(eni)+ϕ(yni1)ϕ(yni)(tn,θi1),eni+(θ1)Aeni1,eni+ψ(eni),\displaystyle\phi(e^{i}_{n})+\phi(y^{i-1}_{n})-\phi(y^{i}_{n})\leq-\left\langle\ell(t^{i-1}_{n,\theta}),e^{i}_{n}\right\rangle+(\theta-1)\left\langle Ae^{i-1}_{n},e^{i}_{n}\right\rangle+\psi(e^{i}_{n}), (6.10)

where we have used eni=yniyni1e^{i}_{n}=y^{i}_{n}-y^{i-1}_{n} in order to shorten notations.

Next, from Lnθ,i(yni,yni1)=0L^{\theta,i}_{n}(y^{i}_{n},y^{i-1}_{n})=0 for i=1,,Nni=1,\dots,N_{n}, we obtain that

0\displaystyle 0 =\displaystyle= ψ(eni)(tn,θi)Ayn,θi,eni\displaystyle\psi(e^{i}_{n})-\left\langle\ell(t^{i}_{n,\theta})-Ay^{i}_{n,\theta},e^{i}_{n}\right\rangle
=\displaystyle= ψ(eni)(tn,θi),eni+θ(ϕ(yni)+ϕ(eni)ϕ(yni1))\displaystyle\psi(e^{i}_{n})-\left\langle\ell(t^{i}_{n,\theta}),e^{i}_{n}\right\rangle+\theta\Big{(}\phi(y^{i}_{n})+\phi(e^{i}_{n})-\phi(y^{i-1}_{n})\Big{)}
\displaystyle- (1θ)(ϕ(yni1)+ϕ(eni)ϕ(yni)).\displaystyle(1-\theta)\Big{(}\phi(y^{i-1}_{n})+\phi(e^{i}_{n})-\phi(y^{i}_{n})\Big{)}.

In particular, we have checked that

ψ(eni)+ϕ(yni)ϕ(yni1)+(2θ1)ϕ(eni)=(tn,θi),eni.\psi(e^{i}_{n})+\phi(y^{i}_{n})-\phi(y^{i-1}_{n})+(2\theta-1)\phi(e^{i}_{n})=\left\langle\ell(t^{i}_{n,\theta}),e^{i}_{n}\right\rangle. (6.11)

We take the sum between the latter and (6.10) and get that

2θϕ(eni)(tn,θi)(tn,θi1),eni+(θ1)Aeni1,eni,2\theta\phi(e^{i}_{n})\leq\left\langle\ell(t^{i}_{n,\theta})-\ell(t^{i-1}_{n,\theta}),e^{i}_{n}\right\rangle+(\theta-1)\left\langle Ae^{i-1}_{n},e^{i}_{n}\right\rangle,

or, equivalently,

Aen,θi,eni(tn,θi)(tn,θi1),eni.\left\langle Ae^{i}_{n,\theta},e^{i}_{n}\right\rangle\leq\left\langle\ell(t^{i}_{n,\theta})-\ell(t^{i-1}_{n,\theta}),e^{i}_{n}\right\rangle. (6.12)

Now, if θ=1\theta=1, we conclude that

|eni||1α|(tn,θi)(tn,θi1)|,|e^{i}_{n}|\leq|\leq\frac{1}{\alpha}|\ell(t^{i}_{n,\theta})-\ell(t^{i-1}_{n,\theta})|_{*},

and the assertion follows.

In case 1/2θ<11/2\leq\theta<1 and for a constant time-step partition, one proceeds from (6.12) by computing

τn^nL(0,T;Y)2αϕ(eni)\displaystyle\tau_{n}\|\widehat{\ell}^{\prime}_{n}\|_{L^{\infty}(0,T;Y^{*})}\sqrt{\frac{2}{\alpha}}\sqrt{\phi(e^{i}_{n})}
(tn,θi)(tn,θi1),eni\displaystyle\geq\left\langle\ell(t^{i}_{n,\theta})-\ell(t^{i-1}_{n,\theta}),e^{i}_{n}\right\rangle
θAeni,eni+(1θ)Aeni1,eni\displaystyle\geq\theta\left\langle Ae^{i}_{n},e^{i}_{n}\right\rangle+(1-\theta)\left\langle Ae^{i-1}_{n},e^{i}_{n}\right\rangle
=(2θ1)Aeni,eni+(1θ)A(eni+eni1),eni\displaystyle=(2\theta-1)\left\langle Ae^{i}_{n},e^{i}_{n}\right\rangle+(1-\theta)\left\langle A(e^{i}_{n}+e^{i-1}_{n}),e^{i}_{n}\right\rangle
2(2θ1)ϕ(eni)+(1θ)(ϕ(eni)ϕ(eni1))\displaystyle\geq 2(2\theta-1)\phi(e^{i}_{n})+(1-\theta)\Big{(}\phi(e^{i}_{n})-\phi(e^{i-1}_{n})\Big{)}

Note that the coefficient (2θ1)(2\theta-1) is strictly positive as θ>1/2\theta>1/2. By using the fact that yn0=y0S(0)y^{0}_{n}=y_{0}\in S(0) (recall (4.5)), we readily check that

ϕ(en1)+ϕ(yn0)(0),yn0ϕ(yn1)(0),yn1+ψ(en1)\phi(e^{1}_{n})+\phi(y^{0}_{n})-\left\langle\ell(0),y^{0}_{n}\right\rangle\leq\phi(y^{1}_{n})-\left\langle\ell(0),y^{1}_{n}\right\rangle+\psi(e^{1}_{n})

and, by adding the latter to (6.11) for i=1i=1 we have

2θϕ(en1)(tn,θ1)(0),en1τn^nL(0,T;Y)2αϕ(en1).2\theta\phi(e^{1}_{n})\leq\left\langle\ell(t^{1}_{n,\theta})-\ell(0),e^{1}_{n}\right\rangle\leq\tau_{n}\|\widehat{\ell}^{\prime}_{n}\|_{L^{\infty}(0,T;Y^{*})}\sqrt{\frac{2}{\alpha}}\sqrt{\phi(e^{1}_{n})}. (6.13)

Let us define

ai2=ϕ(yniyi1τn)=ϕ(eni)/τn2,\displaystyle a_{i}^{2}=\phi\left(\frac{y^{i}_{n}-y^{i-1}}{\tau_{n}}\right)=\phi(e^{i}_{n})/\tau_{n}^{2},
C0=2(2θ1)1θ,C1=11θ2α^nL(0,T;Y),C2=C1C0,\displaystyle C_{0}=\frac{2(2\theta-1)}{1-\theta},\ \ C_{1}=\frac{1}{1-\theta}\sqrt{\frac{2}{\alpha}}\|\widehat{\ell}^{\prime}_{n}\|_{L^{\infty}(0,T;Y^{*})},\ \ C_{2}=\frac{C_{1}}{C_{0}},\ \

so that, owing to (6.2), (6.13), and using the fact that 2(2θ1)<2θ2(2\theta-1)<2\theta,

(C0+1)ai2ai12\displaystyle\big{(}C_{0}+1\big{)}a_{i}^{2}-a_{i-1}^{2} \displaystyle\leq C1aifori=2,,Nn,\displaystyle C_{1}a_{i}\quad\text{for}\ \ i=2,\dots,N_{n},
a1\displaystyle a_{1} \displaystyle\leq 12θ2α^nL(0,T;Y)12(2θ1)2α^nL(0,T;Y)\displaystyle\frac{1}{2\theta}\sqrt{\frac{2}{\alpha}}\|\widehat{\ell}^{\prime}_{n}\|_{L^{\infty}(0,T;Y^{*})}\leq\frac{1}{2(2\theta-1)}\sqrt{\frac{2}{\alpha}}\|\widehat{\ell}^{\prime}_{n}\|_{L^{\infty}(0,T;Y^{*})}
=\displaystyle= 11θ2α^nL(0,T;Y)1θ2(2θ1)\displaystyle\frac{1}{1-\theta}\sqrt{\frac{2}{\alpha}}\|\widehat{\ell}^{\prime}_{n}\|_{L^{\infty}(0,T;Y^{*})}\,\frac{1-\theta}{2(2\theta-1)}
=\displaystyle= C1C0=C2.\displaystyle\frac{C_{1}}{C_{0}}=C_{2}.

Now, since (C0+1)C22C1C2=C22(C_{0}+1)C_{2}^{2}-C_{1}C_{2}=C_{2}^{2}, we easily prove by induction that aiC2a_{i}\leq C_{2} and the assertion follows. ∎

6.3. Convergence

We shall prove the weak-star W1,(0,T;Y)W^{1,\infty}(0,T;Y^{*}) convergence for the θ\theta-method. This result has to be compared with that of Han & Reddy [HR00, Thm. 3.4] where the uniform convergence of the backward constant interpolations is obtained. Our result is weaker than that of [HR00, Thm. 3.4] since we are not providing strong convergence. On the other hand, we believe our half-page proof to be possibly more transparent than the long argument developed in [HR00]. Let us moreover mention that in the Hilbertian case and for AA coercive on YY, the strong convergence in W1,p(0,T;Y)W^{1,p}(0,T;Y) for all p<p<\infty of the Euler method θ=1\theta=1 has been proved in [Kre96, Prop. 3.9, p. 33].

Theorem 6.5 (Convergence for the θ\theta-method).

Assume (3.5) and let Fnθ(yn0,,ynNn)=0F_{n}^{\theta}(y^{0}_{n},\dots,y^{N_{n}}_{n})=0. Then, y^ny\widehat{y}_{n}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) where F(y)=0=minFF(y)=0=\min F.

Proof.

Owing to Lemma 6.4, we can extract a (not relabeled) subsequence such that y^ny\widehat{y}_{n}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y), hence weakly pointwise in YY. Moreover, we clearly have that both y¯n\overline{y}_{n} and y¯n,θ\overline{y}_{n,\theta} converge at the same limit weakly star in L(0,T;Y)L^{\infty}(0,T;Y). Finally, we directly check that ¯n,θ\overline{\ell}_{n,\theta}\to\ell strongly in L(0,T;Y)L^{\infty}(0,T;Y^{*}). By observing that, since θ1/2\theta\geq 1/2,

τniA(θyni+(1θ)yni1),δyni\displaystyle\tau^{i}_{n}\left\langle A(\theta y^{i}_{n}+(1-\theta)y^{i-1}_{n}),\delta y^{i}_{n}\right\rangle =\displaystyle= ϕ(yni)+(2θ1)ϕ(yniyni1)ϕ(yni1)\displaystyle\phi(y^{i}_{n})+(2\theta-1)\phi(y^{i}_{n}-y^{i-1}_{n})-\phi(y^{i-1}_{n})
\displaystyle\geq ϕ(yni)ϕ(yni1),\displaystyle\phi(y^{i}_{n})-\phi(y^{i-1}_{n}),

we compute that

0\displaystyle 0 =\displaystyle= Fnθ(yn0,,ynNn)\displaystyle F_{n}^{\theta}(y^{0}_{n},\dots,y^{N_{n}}_{n})
\displaystyle\geq 0T(ψ(y^n)+ψ(¯n,θAy¯n,θ)¯n,θ,y^n)\displaystyle\int_{0}^{T}\Big{(}\psi(\widehat{y}^{\prime}_{n})+\psi^{*}(\overline{\ell}_{n,\theta}-A\overline{y}_{n,\theta})-\left\langle\overline{\ell}_{n,\theta},\widehat{y}^{\prime}_{n}\right\rangle\Big{)}
+\displaystyle+ ϕ(y^n(T))ϕ(y^n(0))+χ(y^n(0)y0)\displaystyle\phi(\widehat{y}_{n}(T))-\phi(\widehat{y}_{n}(0))+\chi(\widehat{y}_{n}(0)-y_{0})
=\displaystyle= 0T(ψ(y^n)+ψ(¯n,θAy¯n,θ)¯n,θ,y^n)\displaystyle\int_{0}^{T}\Big{(}\psi(\widehat{y}^{\prime}_{n})+\psi^{*}(\overline{\ell}_{n,\theta}-A\overline{y}_{n,\theta})-\left\langle\overline{\ell}_{n,\theta},\widehat{y}^{\prime}_{n}\right\rangle\Big{)}
+\displaystyle+ ϕ(y^n(T))+ϕ(y0)Ay^n(0),y0+|y^n(0)y0|2.\displaystyle\phi(\widehat{y}_{n}(T))+\phi(y_{0})-\left\langle A\widehat{y}_{n}(0),y_{0}\right\rangle+|\widehat{y}_{n}(0)-y_{0}|^{2}.

Finally, it suffices to pass to the lim inf\liminf above as nn\to\infty and exploit lower semicontinuity and the stated convergences in order to obtain that F(y)0F(y)\leq 0. Hence, by Theorem 3.1 and Corollary 4.6, yy is the only solution to (1.1) and the whole sequence y^n\widehat{y}_{n} converges. ∎

We shall mention that the separability assumption for YY is not crucial and could be weakened. Indeed, in case YY is not separable, one simply has to pass to limits as y^ny\widehat{y}_{n}\to y weakly in W1,p(0,T;Y)W^{1,p}(0,T;Y) for some p[1,)p\in[1,\infty).

6.4. The functional controls the uniform distance

We shall reproduce at the discrete level the results of Subsection 4.6. We begin by showing how to possibly control the uniform distance of two vectors by means of the discrete functional FnθF_{n}^{\theta}.

Lemma 6.6 (Uniform distance control via FnθF_{n}^{\theta}).

Let the vectors u=(u0,,uNn)u=(u^{0},\dots,u^{N_{n}}), v=(v0,,vNn)YNn+1v=(v^{0},\dots,v^{N_{n}})\in Y^{N_{n}+1} be given. Then,

η(1η)max1iNnϕ(uivi)\displaystyle\eta(1-\eta)\max_{1\leq i\leq N_{n}}\phi(u^{i}-v^{i})
ηFnθ(u0,,uNn)+(1η)Fnθ(v0,,vNn)η[0,1].\displaystyle\leq\eta F_{n}^{\theta}(u^{0},\dots,u^{N_{n}})+(1-\eta)F_{n}^{\theta}(v^{0},\dots,v^{N_{n}})\quad\forall\eta\in[0,1].
Proof.

This proof follows the same lines of that of Corollary 4.4. Let 1iNn1\leq i\leq N_{n} be fixed and define Fnθ,i:Yi+1[0,]F^{\theta,i}_{n}:Y^{i+1}\to[0,\infty] as

Fnθ,i(y0,,yi)\displaystyle\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!F^{\theta,i}_{n}(y^{0},\dots,y^{i})
=\displaystyle= j=1iτnjLnθ,j(yj,yj1)+χ(y0y0)\displaystyle\sum_{j=1}^{i}\tau^{j}_{n}L^{\theta,j}_{n}(y^{j},y^{j-1})+\chi(y^{0}-y_{0})
=\displaystyle= j=1iψ(yjyj1)+ψ((tn,θj)Ayθj)(tn,θj),yjyj1\displaystyle\sum_{j=1}^{i}\psi(y^{j}-y^{j-1})+\psi^{*}(\ell(t^{j}_{n,\theta})-Ay^{j}_{\theta})-\left\langle\ell(t^{j}_{n,\theta}),y^{j}-y^{j-1}\right\rangle
+\displaystyle+ ϕ(yi)+(2θ1)j=1iϕ(yjyj1)+ϕ(y0)Ay0,y0+|y0y0|2.\displaystyle\phi(y^{i})+(2\theta-1)\sum_{j=1}^{i}\phi(y^{j}-y^{j-1})+\phi(y_{0})-\left\langle Ay^{0},y_{0}\right\rangle+|y^{0}-y_{0}|^{2}.

Then, clearly y=(y0,,yi)Gnθ,i(y)=Fnθ,i(y)ϕ(yi)y=(y^{0},\dots,y^{i})\mapsto G^{\theta,i}_{n}(y)=F^{\theta,i}_{n}(y)-\phi(y^{i}) is convex. Hence, letting w=ηu+(1η)vw=\eta u+(1-\eta)v for some η[0,1]\eta\in[0,1] we have that

0Fnθ,i(w)\displaystyle 0\leq F^{\theta,i}_{n}(w)
η(Gnθ,i(u)+ϕ(ui))+(1η)(Gnθ,i(v)+ϕ(vi))η(1η)ϕ(uivi),\displaystyle\leq\eta\big{(}G^{\theta,i}_{n}(u)+\phi(u^{i})\big{)}+(1-\eta)\big{(}G^{\theta,i}_{n}(v)+\phi(v^{i})\big{)}-\eta(1-\eta)\phi(u^{i}-v^{i}),

whence the assertion follows. ∎

Again, note that the latter lemma controls the uniform norm of the distance only if the stronger (3.5) is required. The following corollary of Lemma 6.6 will be the starting point for some possible a posteriori error control procedure (see Subsection 6.6).

Corollary 6.7 (Uniform distance from the minimizer).

Let Fnθ(y0,,yNn)=0F_{n}^{\theta}(y^{0},\dots,y^{N_{n}})=0. Then

max1iNnϕ(yivi)Fnθ(v0,,vNn)(v0,,vNn)YNn+1.\displaystyle\max_{1\leq i\leq N_{n}}\phi(y^{i}-v^{i})\leq F_{n}^{\theta}(v^{0},\dots,v^{N_{n}})\quad\forall(v^{0},\dots,v^{N_{n}})\in Y^{N_{n}+1}.

Moreover, we re-obtain a proof of the uniqueness of the solution of the θ\theta-method.

Corollary 6.8 (Uniqueness of the minimizer).

Assume (3.5). Then, there exists at most one y=(y0,,yNn)y=(y^{0},\dots,y^{N_{n}}) such that Fnθ(y)=0F_{n}^{\theta}(y)=0.

6.5. The generalized θ\theta-method

Although minimizers of FnθF^{\theta}_{n} and solutions of the θ\theta-scheme (6.3) coincide, minimizing sequences of FnθF^{\theta}_{n} need not solve (6.3). This extra freedom allows the minimization formulation to capture the convergence of some generalized θ\theta-method, where the relations in (6.3) are not solved exactly but rather approximated. Namely, we shall look for vectors un=(un0,,unNn)u_{n}=(u^{0}_{n},\dots,u^{N_{n}}_{n}) such that

Fnθ(un)0asnF^{\theta}_{n}(u_{n})\to 0\quad\text{as}\quad n\to\infty

instead of Fnθ(un)=0F^{\theta}_{n}(u_{n})=0 for all nn\in\mathbb{N}.

From the computational viewpoint, note that the θ\theta-scheme consists in solving NnN_{n} nonlinear equations in one unknown each while checking for stationarity for FnθF^{\theta}_{n} implies the solution of a tridiagonal system of Nn+1\,N_{n}+1\, nonlinear equations with (up to) three unknowns each. This entails in particular that minimizing FnθF^{\theta}_{n} instead of solving (6.3) could be of a scarce interest if one is merely concerned in reproducing the θ\theta-scheme with no error. On the other hand, the issue of solving up to some tolerance turns out to be particularly relevant whenever one is aimed at implementing an optimization procedure for the solution of (6.3). Indeed, one should be prepared to run the algorithm (some descent method, say) until some given tolerance is reached.

Our starting point for a possible convergence analysis of the generalized θ\theta-method is the following classical error control result.

Theorem 6.9 (Mielke & Theil [MT04]).

Assume (3.5). Then, y^ny\widehat{y}_{n}\to y uniformly and F(y)=0F(y)=0. In particular,

maxt[0,T]|(y^ny)(t)|Ceτn\max_{t\in[0,T]}|(\widehat{y}_{n}-y)(t)|\leq C_{e}\sqrt{\tau_{n}} (6.14)

where CeC_{e} depends only on data and is independent of nn.

More precisely, in [MT04] solely the case of the Euler scheme θ=1\theta=1 is discussed. However, an easy adaptation of the argument entails the result for θ[1/2,1)\theta\in[1/2,1) as well.

By explicitly comparing the minimizing sequence un=(un0,,unNn)u_{n}=(u^{0}_{n},\dots,u^{N_{n}}_{n}) with the corresponding solution (yn0,,ynNn)(y^{0}_{n},\dots,y^{N_{n}}_{n}) of the θ\theta-method, we have the following.

Theorem 6.10 (Convergence for the generalized θ\theta-method).

Assume (3.5) and let Fnθ(un0,,unNn)0F_{n}^{\theta}(u^{0}_{n},\dots,u^{N_{n}}_{n})\to 0. Then, u^ny\widehat{u}_{n}\to y uniformly, where F(y)=0F(y)=0. In particular,

maxt[0,T]|(u^ny)(t)|Ceτn+(2αFnθ(un0,,unNn))1/2.\max_{t\in[0,T]}|(\widehat{u}_{n}-y)(t)|\leq C_{e}\sqrt{\tau_{n}}+\left(\frac{2}{\alpha}F_{n}^{\theta}(u^{0}_{n},\dots,u^{N_{n}}_{n})\right)^{1/2}. (6.15)
Proof.

We have that

maxt[0,T]|(yu^n)(t)|\displaystyle\max_{t\in[0,T]}|(y-\widehat{u}_{n})(t)| \displaystyle\leq maxt[0,T]|(yy^n)(t)|+maxt[0,T]|(y^nu^n)(t)|\displaystyle\max_{t\in[0,T]}|(y-\widehat{y}_{n})(t)|+\max_{t\in[0,T]}|(\widehat{y}_{n}-\widehat{u}_{n})(t)|
\displaystyle\leq Ceτn+max1iNn|uniyni|\displaystyle C_{e}\sqrt{\tau_{n}}+\max_{1\leq i\leq N_{n}}|u^{i}_{n}-y^{i}_{n}|
\displaystyle\leq Ceτn+(2αmax1iNnϕ(uniyni))1/2\displaystyle C_{e}\sqrt{\tau_{n}}+\left(\frac{2}{\alpha}\max_{1\leq i\leq N_{n}}\phi(u^{i}_{n}-y^{i}_{n})\right)^{1/2}

and we conclude by applying Corollary 6.7. ∎

6.6. A posteriori error control

Let us now exploit both Corollary 4.5 and Theorem 6.10 in order to provide some possible a posteriori estimates of the approximation error by means of solutions unu_{n} of the generalized θ\theta-method described above.

Lemma 6.11 (A posteriori error control via FnθF_{n}^{\theta}).

Assume (3.5) and let Fnθ(un0,,unNn)τnsF_{n}^{\theta}(u^{0}_{n},\dots,u^{N_{n}}_{n})\sim\tau_{n}^{s} for some s>0s>0 and F(y)=0F(y)=0. Then,

maxt[0,T]|(u^ny)(t)|τnrwhere2r=max{1,s}.\max_{t\in[0,T]}|(\widehat{u}_{n}-y)(t)|\sim\tau_{n}^{r}\quad\text{where}\quad 2r=\max\{1,s\}.
Lemma 6.12 (A posteriori error control via FF).

Assume (3.5) and let F(u^n)τnsF(\widehat{u}_{n})\sim\tau_{n}^{s} for some s>0s>0 and F(y)=0F(y)=0. Then, maxt[0,T]|(u^ny)(t)|τns/2\max_{t\in[0,T]}|(\widehat{u}_{n}-y)(t)|\sim\tau_{n}^{s/2}.

We are also in the position of proving the weak-star convergence of the time derivatives of solutions unu_{n} of the generalized θ\theta-method by comparing them with the corresponding derivatives of the exact solution of the θ\theta-method.

Lemma 6.13 (Improved convergence for the generalized θ\theta-method).

Assume (3.5) and let Fnθ(un0,,unNn)τn2F^{\theta}_{n}(u_{n}^{0},\dots,u^{N_{n}}_{n})\sim\tau_{n}^{2}. Then, u^n\widehat{u}_{n} is equibounded in W1,(0,T;Y)W^{1,\infty}(0,T;Y). In particular, u^ny\widehat{u}_{n}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y).

Proof.

Let (yn0,,ynNn)(y^{0}_{n},\dots,y^{N_{n}}_{n}) be the solution of the θ\theta-scheme. By exploiting Lemma 6.7, we check that

|uniuni1|\displaystyle|u^{i}_{n}-u^{i-1}_{n}| \displaystyle\leq |uniyni|+|yniyni1|+|yni1uni1|\displaystyle\left|{u^{i}_{n}-y^{i}_{n}}\right|+\left|{y^{i}_{n}-y^{i-1}_{n}}\right|+\left|y^{i-1}_{n}-u^{i-1}_{n}\right|
\displaystyle\leq τniy^nL(0,T;Y)+2(2αFnθ(un0,,unNn))1/2.\displaystyle\tau^{i}_{n}\|\widehat{y}_{n}^{\prime}\|_{L^{\infty}(0,T;Y)}+2\left(\frac{2}{\alpha}F_{n}^{\theta}(u^{0}_{n},\dots,u^{N_{n}}_{n})\right)^{1/2}.

The uniform bound on u^nW1,(0,T;Y)\|\widehat{u}_{n}\|_{W^{1,\infty}(0,T;Y)} follows by dividing the latter by τni\tau^{i}_{n}, taking the maximum as 1iNn1\leq i\leq N_{n}, and recalling Lemma 6.4. ∎

6.7. Adaptivity

Assuming (3.5), the above introduced a posteriori error estimators can be exploited in order to develop an adaptive strategy. In particular, the error control in the uniform norm up to a given tolerance tol>0\text{\rm tol}>0

maxt[0,T]|(yy^n)(t)|tol\max_{t\in[0,T]}|(y-\widehat{y}_{n})(t)|\leq\text{\rm tol}

for some piecewise approximation y^n\widehat{y}_{n} with χ(y^n(0)y0)αtol2/4\chi(\widehat{y}_{n}(0)-y_{0})\leq\alpha\,\text{\rm tol}^{2}/4 can be inferred, for instance, by choosing time steps in such a way that

tni1tniL(t,y^n(t),y^n(t))αtol24Nn.\int_{t^{i-1}_{n}}^{t^{i}_{n}}L(t,\widehat{y}_{n}(t),\widehat{y}_{n}^{\prime}(t))\leq\frac{\alpha\,\text{\rm tol}^{2}}{4N_{n}}.

Namely, by uniformly distributing the error along the partition.

Alternatively, one could develop an adaptive strategy by considering just computed quantities at the discrete level by asking for

τniLnθ,i(yni,yni1)αtol232Nnforτntol216Ce2\tau^{i}_{n}L^{\theta,i}_{n}(y^{i}_{n},y^{i-1}_{n})\leq\frac{\alpha\,\text{\rm tol}^{2}}{32N_{n}}\quad\text{for}\quad\tau_{n}\leq\frac{\text{\rm tol}^{2}}{16C_{e}^{2}}

and exploiting Theorem 6.10.

7. Space-time approximations

Let us combine the results of the previous sections (and use the corresponding notation) in order to state and prove a result on the convergence of full space-time approximations. Our results have to be compared with the former convergence analysis by Han & Reddy [HR99]. Our approach leads to a convergence proof with respect to a weaker topology. However, it is on the one hand slightly more general (some assumptions on the spaces and the functionals, see (H1)-(H2) [HR99, p. 264], are not required) and on the other hand has a much simpler proof.

Theorem 7.1 (Convergence of space-time approximations).

Assume (5.1)-(5.9),θ[1/2,1]\theta\in[1/2,1], define Ln,hθ,i(y,z):Y×Y[0,]L^{\theta,i}_{n,h}(y,z):Y\times Y\to[0,\infty] as

Ln,hθ,i(y,z)=ψh(yzτni)+ψh(h(tn,θi)Ah(θy+(1θ)z))\displaystyle L^{\theta,i}_{n,h}(y,z)=\psi_{h}\left(\frac{y-z}{\tau^{i}_{n}}\right)+\psi^{*}_{h}\left(\ell_{h}(t^{i}_{n,\theta})-A_{h}(\theta y+(1-\theta)z)\right)
h(tn,θi)Ah(θy+(1θ)z),yzτni,\displaystyle-\left\langle\ell_{h}(t^{i}_{n,\theta})-A_{h}(\theta y+(1-\theta)z),\frac{y-z}{\tau^{i}_{n}}\right\rangle,

where Ah=ϕhA_{h}=\partial\phi_{h}, and let the functionals Fn,hθ:YNn+1[0,]F^{\theta}_{n,h}:Y^{N_{n}+1}\to[0,\infty] be defined as

Fn,hθ(y0,,yNn)=i=1NnτniLn,hθ,i(yi,yi1)+χh(y0y0),F^{\theta}_{n,h}(y^{0},\dots,y^{N_{n}})=\sum_{i=1}^{N_{n}}\tau^{i}_{n}L^{\theta,i}_{n,h}(y^{i},y^{i-1})+\chi_{h}(y^{0}-y_{0}),

where χh()=ϕh()+||2\chi_{h}(\cdot)=\phi_{h}(\cdot)+|\cdot|^{2} (note that D(Fn,hθ)YhNn+1D(F_{n,h}^{\theta})\subset Y^{N_{n}+1}_{h}). Finally, let Fn,h(yh0,,yhNn)=0F_{n,h}(y^{0}_{h},\dots,y^{N_{n}}_{h})=0. We have:

  • (a)

    y^n,hyh\widehat{y}_{n,h}\to y_{h} weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) as (n,h)(,h)(n,h)\to(\infty,h) and Fh(yh)=0F_{h}(y_{h})=0.

  • (b)

    y^n,hy^n\widehat{y}_{n,h}\to\widehat{y}_{n} weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) as (n,h)(n,0)(n,h)\to(n,0) and Fnθ(yn)=0F^{\theta}_{n}(y_{n})=0.

  • (c)

    y^ny\widehat{y}_{n}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) as (n,0)(,0)(n,0)\to(\infty,0) and F(y)=0F(y)=0.

  • (d)

    yhyy_{h}\to y weakly star W1,(0,T;Y)W^{1,\infty}(0,T;Y) as (,h)(,0)(\infty,h)\to(\infty,0) and F(y)=0F(y)=0.

  • (e)

    y^n,hy\widehat{y}_{n,h}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y) as (n,h)(,0)(n,h)\to(\infty,0) and F(y)=0F(y)=0.

The thesis of the Theorem is illustrated in Figure 1. In particular, we aim at showing that the space (or data) and time-limit can be taken in any order. Note that Limit (c) has been already checked in Theorem 6.5 and that the very same argument yields Limit (a) as well (recall that YhY_{h} is closed). Moreover, Limit (d) is discussed in Theorem 5.3. So what we are actually left to check are Limits (b) and (e) only.

\psfrag{r}{$n$}\psfrag{t}{$h$}\psfrag{0}{$(\infty,0)$}\psfrag{a}{$a$}\psfrag{b}{$b$}\psfrag{c}{$c$}\psfrag{d}{$d$}\psfrag{e}{$e$}\includegraphics[width=173.44534pt]{CR.eps}
Figure 1. Convergences for space-time approximations
Proof.

Limit (b). The assertion follows once we check that, for all i=1,,Nni=1,\dots,N_{n}, if yn,hi1yni1y^{i-1}_{n,h}\to y^{i-1}_{n} weakly in YY, one has the weak convergence yn,hiyniy^{i}_{n,h}\to y^{i}_{n} as well. Recall that

yn,hi Arg min yY(θϕh(y)h(tn,θi)(1θ)Ahyn,hi1,y+ψh(yyn,hi1))\displaystyle y^{i}_{n,h}\in\text{\,\rm Arg\,min\,}_{y\in Y}\Big{(}\theta\phi_{h}(y)-\left\langle\ell_{h}(t^{i}_{n,\theta})-(1-\theta)A_{h}y^{i-1}_{n,h},y\right\rangle+\psi_{h}(y-y^{i-1}_{n,h})\Big{)}
= Arg min yYh(θϕ(y)h(tn,θi)(1θ)Ayn,hi1,y+ψh(yyn,hi1)).\displaystyle=\text{\,\rm Arg\,min\,}_{y\in Y_{h}}\Big{(}\theta\phi(y)-\left\langle\ell_{h}(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n,h},y\right\rangle+\psi_{h}(y-y^{i-1}_{n,h})\Big{)}.

Hence, since we have (5.6), the sequence yn,hiy^{i}_{n,h} is weakly precompact and, up to the extraction of a (not relabeled) subsequence, yn,hiy~y^{i}_{n,h}\to\tilde{y} weakly in YY. Let us prove that y~\tilde{y} solves the incremental problem (6.4). Indeed, we have that

0\displaystyle 0 \displaystyle\leq Lnθ,i(y~,yni1)\displaystyle L^{\theta,i}_{n}(\tilde{y},y^{i-1}_{n})
\displaystyle\leq lim infh0(ψh(yn,hiyn,hi1)+ψh(h(tn,θi)Ayn,h,θi)\displaystyle\liminf\limits_{h\to 0}\Big{(}\psi_{h}(y^{i}_{n,h}-y^{i-1}_{n,h})+\psi^{*}_{h}(\ell_{h}(t^{i}_{n,\theta})-Ay^{i}_{n,h,\theta})
\displaystyle- h(tn,θi)Ayn,h,θi,yn,hiyn,hi1)\displaystyle\left\langle\ell_{h}(t^{i}_{n,\theta})-Ay^{i}_{n,h,\theta},y^{i}_{n,h}-y^{i-1}_{n,h}\right\rangle\Big{)}
=\displaystyle= lim infh0Ln,hθ,i(yn,hi,yn,hi1)=0\displaystyle\liminf\limits_{h\to 0}L^{\theta,i}_{n,h}(y^{i}_{n,h},y^{i-1}_{n,h})=0

where we have used the Mosco convergence in (5.5) and the pointwise convergence of h\ell_{h} (5.7). Since the only solution of (6.4) is yniy^{i}_{n}, we have that y~=yni\tilde{y}=y^{i}_{n} and the whole sequence converges.

Let us mention that, if the functionals ψh\psi_{h} are uniformly linearly bounded (which is quite common in practice), then one could prove the latter convergence to be actually strong: namely yn,hi1yni1y^{i-1}_{n,h}\to y^{i-1}_{n} strongly in YY implies the strong convergence yn,hiyniy^{i}_{n,h}\to y^{i}_{n}. Indeed, let whw_{h} and w~h\tilde{w}_{h} be such that whyn,hi0w_{h}-y^{i}_{n,h}\to 0 strongly in YY, ψh(whyn,hi)0\psi_{h}(w_{h}-y^{i}_{n,h})\to 0, w~hYh\tilde{w}_{h}\in Y_{h} and w~hwh0\tilde{w}_{h}-w_{h}\to 0 strongly in YY. Then

θϕ(yn,hi)(tn,θi)(1θ)Ahyn,hi1,yn,hi+ψh(yn,hiyn,hi1)\displaystyle\theta\phi(y^{i}_{n,h})-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)A_{h}y^{i-1}_{n,h},y^{i}_{n,h}\right\rangle+\psi_{h}(y^{i}_{n,h}-y^{i-1}_{n,h})
θϕh(w~h)(tn,θi)(1θ)Ayn,hi1,w~h+ψh(w~hwh)+ψh(whyn,hi).\displaystyle\leq\theta\phi_{h}(\tilde{w}_{h})-\left\langle\ell(t^{i}_{n,\theta})-(1-\theta)Ay^{i-1}_{n,h},\tilde{w}_{h}\right\rangle+\psi_{h}(\tilde{w}_{h}-w_{h})+\psi_{h}(w_{h}-y^{i}_{n,h}).

If ψh\psi_{h} are uniformly linearly bounded above, then ψh(w~hwh)0\psi_{h}(\tilde{w}_{h}-w_{h})\to 0 with h0h\to 0. Whence, by passing to the lim sup\limsup in the latter, we check that lim suph0ϕ(yn,hi)ϕ(yni)\limsup_{h\to 0}\phi(y^{i}_{n,h})\leq\phi(y^{i}_{n}), which, together with lower semicontinuity gives ϕ(yn,hi)ϕ(yni)\phi(y^{i}_{n,h})\to\phi(y^{i}_{n}) and the strong convergence follows from the reflexivity of YY.

Limit (e). Lemma 6.4, the uniform Lipschitz continuity of h\ell_{h} (5.8), and the initial datum convergence (5.9), entail that y^n,h\widehat{y}_{n,h} are uniformly Lipschitz continuous as well. Hence, by extracting a (not relabeled) subsequence, y^n,hy\widehat{y}_{n,h}\to y weakly star in W1,(0,T;Y)W^{1,\infty}(0,T;Y). In order to check that yy solves (1.1), let us remark that, being n,h,θi=h(tn,θi)\ell_{n,h,\theta}^{i}=\ell_{h}(t^{i}_{n,\theta}),

¯n,h,θstrongly inL1(0,T;Y)\overline{\ell}_{n,h,\theta}\to\ell\quad\text{strongly in}\ \ L^{1}(0,T;Y^{*})

and that, by [Ste06a, Cor. 4.4]

0Tψ(y)lim infh00Tψh(y^n,h)\displaystyle\int_{0}^{T}\psi(y^{\prime})\leq\liminf\limits_{h\to 0}\int_{0}^{T}\psi_{h}(\widehat{y}_{n,h}^{\prime})
0Tψ(Ay)lim infh00Tψh(¯n,h,θAy¯n,h,θ),\displaystyle\int_{0}^{T}\psi^{*}(\ell-Ay)\leq\liminf\limits_{h\to 0}\int_{0}^{T}\psi_{h}^{*}(\overline{\ell}_{n,h,\theta}-A\overline{y}_{n,h,\theta}),

and compute that

0\displaystyle 0 \displaystyle\leq F(y)\displaystyle F(y)
\displaystyle\leq lim infh0(0T(ψh(y^n,h)+ψh(¯n,h,θAy¯n,h,θ)¯n,h,θ,y^n,h)\displaystyle\liminf\limits_{h\to 0}\Bigg{(}\int_{0}^{T}\big{(}\psi_{h}(\widehat{y}_{n,h}^{\prime})+\psi_{h}^{*}(\overline{\ell}_{n,h,\theta}-A\overline{y}_{n,h,\theta})-\left\langle\overline{\ell}_{n,h,\theta},\widehat{y}_{n,h}^{\prime}\right\rangle\big{)}
+\displaystyle+ ϕ(y^n,h(T))+ϕ(y0,h)Ay^n,h(0),y0,h+|y^n,h(0)y0,h|2)\displaystyle\phi(\widehat{y}_{n,h}(T))+\phi(y_{0,h})-\left\langle A\widehat{y}_{n,h}(0),y_{0,h}\right\rangle+|\widehat{y}_{n,h}(0)-y_{0,h}|^{2}\Bigg{)}
\displaystyle\leq lim infh0Fn,hθ(yn,h0,,yn,hNn)=0,\displaystyle\liminf\limits_{h\to 0}F_{n,h}^{\theta}(y^{0}_{n,h},\dots,y^{N_{n}}_{n,h})=0,

and we have that F(y)=0F(y)=0. ∎

We shall conclude by briefly mention some further results which can be obtained by suitably adapting to the current fully-discretized situation the arguments developed above for time-discretizations. Firstly, in the same spirit of Lemma 6.6, one could consider the possibility of estimating the distance of a vector from the minimizer of Fn,hθF^{\theta}_{n,h} by means of the functional itself. Secondly, the use of Corollary 4.5 would entail the possibility of an a posteriori error control an some adaptive strategy along the lines of Subsection 6.7 could be considered. Finally, by relying on the known convergence estimates for full space-time discretized problems [HR99] one could obtain a convergence and an a posteriori error control result for some generalized space-time approximated problem where Fn,hθF^{\theta}_{n,h} are not exactly minimized and one considers minimizing sequences instead (see Subsection 6.5). We shall develop these considerations elsewhere.

References

  • [Att84] H. Attouch. Variational convergence for functions and operators. Pitman (Advanced Publishing Program), Boston, MA, 1984.
  • [Auc93] G. Auchmuty. Saddle-points and existence-uniqueness for evolution equations. Differential Integral Equations, 6(5):1161–1171, 1993.
  • [BE76a] H. Brezis and I. Ekeland. Un principe variationnel associé à certaines équations paraboliques. Le cas dépendant du temps. C. R. Acad. Sci. Paris Sér. A-B, 282(20):Ai, A1197–A1198, 1976.
  • [BE76b] H. Brezis and I. Ekeland. Un principe variationnel associé à certaines équations paraboliques. Le cas indépendant du temps. C. R. Acad. Sci. Paris Sér. A-B, 282(17):Aii, A971–A974, 1976.
  • [Dal93] G. Dal Maso. An introduction to Γ\Gamma-convergence. Progress in Nonlinear Differential Equations and their Applications, 8. Birkhäuser Boston Inc., Boston, MA, 1993.
  • [DDM06] G. Dal Maso, A. DeSimone, and M. G. Mora. Quasistatic evolution problems for linearly elastic-perfectly plastic materials. Arch. Ration. Mech. Anal., 180(2):237–291, 2006.
  • [DL76] G. Duvaut and J.-L. Lions. Inequalities in Mechanics and Physics. Springer, Berlin, 1976.
  • [EM06] M. A. Efendiev and A. Mielke. On the rate-independent limit of systems with dry friction and small viscosity. J. Convex Anal., 13(1):151–167, 2006.
  • [EMRS06] M. A. Efendiev, A. Mielke, R. Rossi, and G. Savaré. In preparation. 2006.
  • [GF75] E. De Giorgi and T. Franzoni. Su un tipo di convergenza variazionale. Atti Accad. Naz. Lincei Rend. Cl. Sci. Fis. Mat. Natur. (8), 58(6):842–850, 1975.
  • [GT04] N. Ghoussoub and L. Tzou. A variational principle for gradient flows. Math. Ann., 330(3):519–549, 2004.
  • [HLW06] E. Hairer, Ch. Lubich, and G. Wanner. Geometric numerical integration, volume 31 of Springer Series in Computational Mathematics. Springer-Verlag, Berlin, second edition, 2006. Structure-preserving algorithms for ordinary differential equations.
  • [HR95] W. Han and B. D. Reddy. Computational plasticity: the variational basis and numerical analysis. Comput. Mech. Adv., 2(4):283–400, 1995.
  • [HR99] W. Han and B.D. Reddy. Plasticity, Mathematical theory and numerical analysis. Springer-Verlag, New York, 1999.
  • [HR00] W. Han and B.D. Reddy. Convergence of approximations to the primal problem in plasticity under conditions of minimal regularity. Numer. Math., 87(2):283–315, 2000.
  • [Kre96] P. Krejčí. Hysteresis, Convexity and Dissipation in Hyperbolic Equations, volume 8 of GAKUTO Int. Series Math. Sci. Appl. Gakkotosho, Tokyo, 1996.
  • [LC90] J. Lemaitre and J.-L. Chaboche. Mechanics of solid materials. Cambridge University Press, 1990.
  • [Lem96] B. Lemaire. An asymptotical variational principle associated with the steepest descent method for a convex function. J. Convex Anal., 3(1):63–70, 1996.
  • [LMOW03] A. Lew, J. E. Marsden, M. Ortiz, and M. West. Asynchronous variational integrators. Arch. Ration. Mech. Anal., 167(2):85–146, 2003.
  • [LMOW04] A. Lew, J. E. Marsden, M. Ortiz, and M. West. Variational time integrators. Internat. J. Numer. Methods Engrg., 60(1):153–212, 2004.
  • [Mab01] M. Mabrouk. Un principe variationnel pour une équation non linéaire du second ordre en temps. C. R. Acad. Sci. Paris Sér. I Math., 332(4):381–386, 2001.
  • [Mab03] M. Mabrouk. A variational principle for a nonlinear differential equation of second order. Adv. in Appl. Math., 31(2):388–419, 2003.
  • [Mie05] A. Mielke. Evolution of rate-independent systems. In C. Dafermos and E. Feireisl, editors, Handbook of Differential Equations, evolutionary equations, volume 2, pages 461–559. Elsevier, 2005.
  • [MM04] F. Maggi and M. Morini. A Γ\Gamma-convergence result for variational integrators of Lagrangians with quadratic growth. ESAIM Control Optim. Calc. Var., 10(4):656–665 (electronic), 2004.
  • [MO04] S. Müller and M. Ortiz. On the Γ\Gamma-convergence of discrete dynamics and variational integrators. J. Nonlinear Sci., 14(3):279–296, 2004.
  • [MO07] A. Mielke and M. Ortiz. A class of minimum principles for characterizing the trajectories and the relaxation of dissipative systems. ESAIM Control Optim. Calc. Var., to appear, 2007. Preprint available at: http://www.wias-berlin.de/main/publications/wias-publ/index.cgi.en.
  • [Mor68] J.-J Moreau. La notion de sur-potentiel et les liaisons unilatérales en élastostatique. C. R. Acad. Sci. Paris Sér. A-B, 267:A954–A957, 1968.
  • [Mor70] J.-J. Moreau. Sur les lois de frottement, de viscosité et plasticité. C. R. Acad. Sci. Paris Sér. II Méc. Phys. Chim. Sci. Univers Sci. Terre, 271:608–611, 1970.
  • [Mor71] J.-J. Moreau. Sur l’évolution d’un système élasto-visco-plastique. C. R. Acad. Sci. Paris Sér. A-B, 273:A118–A121, 1971.
  • [Mos69] U. Mosco. Convergence of convex sets and of solutions of variational inequalities. Advances in Math., 3:510–585, 1969.
  • [MPS98] J. E. Marsden, G. W. Patrick, and S. Shkoller. Multisymplectic geometry, variational integrators, and nonlinear PDEs. Comm. Math. Phys., 199(2):351–395, 1998.
  • [MRS06] A. Mielke, T. Roubíček, and U. Stefanelli. Γ\Gamma-limits and relaxations for rate-independent evolutionary problems. Technical Report 1156, WIAS, 2006. Preprint available at: http://www.wias-berlin.de/main/publications/wias-publ/index.cgi.en.
  • [MT04] A. Mielke and F. Theil. On rate-independent hysteresis models. NoDEA, Nonlinear Diff. Equations Applications, 11:151–189, 2004.
  • [MW01] J. E. Marsden and M. West. Discrete mechanics and variational integrators. Acta Numer., 10:357–514, 2001.
  • [Rio76a] H. Rios. Étude de la question d’existence pour certains problèmes d’évolution par minimisation d’une fonctionnelle convexe. C. R. Acad. Sci. Paris Sér. A-B, 283(3):Ai, A83–A86, 1976.
  • [Rio76b] H. Rios. La question d’existence de solutions pour certaines équations à opérateurs monotones vue comme problème de minimum ou comme problème de point-selle. Travaux Sém. Anal. Convexe, 6(Exp. 13):16, 1976.
  • [Rio78] H. Rios. Étude de certains problèmes paraboliques: existence et approximation des solutions. Travaux Sém. Anal. Convexe, 8(1):Exp. No. 1, 96, 1978.
  • [Rio79] H. Rios. Une étude d’existence sur certains problèmes paraboliques. Ann. Fac. Sci. Toulouse Math. (5), 1(3):235–255, 1979.
  • [Rou00] T. Roubíček. Direct method for parabolic problems. Adv. Math. Sci. Appl., 10(1):57–65, 2000.
  • [SH98] J.C. Simo and T.J.R. Hughes. Computational inelasticity, volume 7 of Interdisciplinary Applied Mathematics. Springer-Verlag, New York, 1998.
  • [Ste06a] U. Stefanelli. The Brezis-Ekeland principle for doubly nonlinear equations. Preprint IMATI - CNR, 41-PV, 2006.
  • [Ste06b] U. Stefanelli. Some remarks on convergence and approximation for a class of hysteresis problems. Istit. Lombardo Accad. Sci. Lett. Rend. A, to appear, 2006. Preprint available at: http://www.imati.cnr.it/ulisse/pubbl.html.
  • [Ste07] U. Stefanelli. A variational principle in non-smooth mechanics. Oberwolfach Reports, 2007.
  • [Vis01] A. Visintin. A new approach to evolution. C. R. Acad. Sci. Paris Sér. I Math., 332(3):233–238, 2001.
  • [Zan07] C. Zanini. Singular perturbations of finite dimensional gradient flows. Discrete Contin. Dyn. Syst., to appear, 2007. Preprint available at: http://arxiv.org/abs/math/0607461.