An abstract inf-sup problem inspired by limit analysis in perfect plasticity and related applications

S. Sysala¹ (corresponding author, email: stanislav.sysala@ugn.cas.cz), J. Haslinger¹, B. D. Reddy², S. Repin³,⁴

¹Institute of Geonics of the Czech Academy of Sciences, Ostrava, Czech Republic
²University of Cape Town, South Africa
³V.A. Steklov Institute of Mathematics at St. Petersburg, Russia
⁴University of Jyväskylä, Finland

Abstract

This work is concerned with an abstract inf-sup problem generated by a bilinear Lagrangian and convex constraints. We study the conditions that guarantee no gap between the inf-sup and related sup-inf problems. The key assumption introduced in the paper generalizes the well-known Babuška-Brezzi condition. It is based on an inf-sup condition defined for convex cones in function spaces. We also apply a regularization method convenient for solving the inf-sup problem and derive a computable majorant of the critical (inf-sup) value, which can be used in a posteriori error analysis of numerical results. Results obtained for the abstract problem are applied to continuum mechanics. In particular, examples of limit load problems and similar ones arising in classical plasticity, gradient plasticity and delamination are introduced.

Keywords: convex optimization, duality, inf-sup conditions on cones, regularization, computable majorants, plasticity, delamination, limit analysis

1 Introduction

This paper is concerned with analysis of the abstract duality problem

\lambda^{*}:=\sup_{x\in P}\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\ a(x,y)\stackrel{{\scriptstyle?}}{{=}}\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\sup_{x\in P}\ a(x,y)=:\zeta^{*},

(1.1)

where $P\subset X$ is a closed, convex set with $0_{X}\in P$ , $X,Y$ are Banach spaces, $L$ is a non-trivial continuous linear functional on $Y$ , and $a\colon X\times Y\rightarrow\mathbb{R}$ is a bilinear form continuous with respect to both arguments. Henceforth the problem on the right-hand side of (1.1) is called primal, while the one on the left-hand side is called dual. It is easy to check that $0\leq\lambda^{*}\leq\zeta^{*}\leq+\infty$ . In general, necessary and sufficient conditions for $\lambda^{*}=\zeta^{*}$ are unknown (therefore, (1.1) uses the symbol $\stackrel{{\scriptstyle?}}{{=}}$ ). One of our main goals is to identify cases where (1.1) holds with equality.

Problem (1.1) and similar problems appear in various applications, from mechanics to economics [12, 9, 3]. In finite dimensions, minimax and maximin variants of these problems are known in game theory [19] and linear, cone or convex programming [11, 5, 21, 18].

In classical elastic-perfect plasticity, (1.1) is known as the limit analysis problem. In this case, $\lambda^{*}$ is the factor that determines the critical load $\lambda^{*}L$ ( $L$ is a linear functional associated with external loads), subject to the constraint set $P$ of plastically admissible stresses; see for example [17, 9, 32, 8, 10, 29, 30, 28, 16]. For the load $\lambda L$ with $\lambda>\zeta^{*}$ , no solution of the primal and dual problems exists; the body is unable to sustain the loading and collapses. Also, we note the similarity between (1.1) and the shakedown analysis problem (see [33] and the references therein).

Although the limit analysis problem has been studied for several decades, it is still unsolved in the general setting and presents a challenging problem from the theoretical and numerical points of view. There are several reasons that stimulate further analysis of the problem. First, we notice that the equality $\lambda^{*}=\zeta^{*}$ can be analyzed in a rather general framework introduced in [12] or by using particular results from [9, 32, 16]. However, these results do not cover all interesting cases. Second, additional and hidden constraints appear in the primal and dual problems (they follow from the inf- and sup-definitions). They often make the numerical analysis difficult. The third reason is related to the choice of the function spaces $X$ and $Y$ . This question becomes especially important if the primal problem is related to minimization of a functional with linear growth at infinity and a certain relaxation of the problem is needed to find a minimizer (see e.g. [17, 32, 29, 10]). Then we arrive, for example, at a formulation in which the $BD$- or $BV$-spaces of functions of bounded deformation and bounded variation, respectively, are appropriate for the problem setting [32]. Nevertheless, standard Sobolev spaces seem to be sufficient or even more appropriate for analysis of numerical errors [27, 28, 16]. Finally, reliable estimates of $\lambda^{*}$ and $\zeta^{*}$ are often required because they define safety factors of structures. Lower bounds of $\lambda^{*}$ and upper bounds of $\zeta^{*}$ can be found by analytical approaches for specific geometries [8] or, more generally, by finite element methods; see [30] and the references therein. Computable majorants of $\zeta^{*}$ can be found in recent papers [28, 16].

In order to investigate the abstract problem (1.1), we use the ideas applied in [14, 15, 28, 16] for analysis of limit load problems. This extension is not always straightforward and requires innovative techniques. In particular, we derive conditions for the equality $\lambda^{*}=\zeta^{*}$ to hold, the existence of a solution to the dual problem in (1.1), a regularization method for solving (1.1) with related convergence results, and a computable majorant of $\zeta^{*}$ , which can be used for a posteriori analysis of numerical results.

One of the key assumptions in the results presented is the so-called inf-sup condition on convex cones which was introduced in [16]. This condition generalizes the Babuška-Brezzi condition defined on function spaces [1, 4]. Conditions of this type are important for analysis of saddle point problems generated by various mixed finite element approximations [3].

Generalization and abstraction of results is a basic procedure that allows results and insights in a particular application to be applied to broad classes of problems. In our case, we show that the results presented here are useful in problems of gradient-enhanced plasticity and in delamination problems. We choose the strain gradient model studied in [25, 24, 6, 26] and use (1.1) for the description of a global yield surface and for limit load analysis. In related work, limit analysis has been considered for a model in which size-dependence is through the gradient of a scalar function of plastic strain, see [13, Section 7] or [23]. One can expect further applications of the problem (1.1), at least within nonlinear mechanics.

The rest of the paper is organized as follows. In Section 2, we introduce the primal and dual problems, discuss them in more detail, and present criteria ensuring their solvability and the principal duality relation $\lambda^{*}=\zeta^{*}$ . One of the criteria is based on the inf-sup condition on convex cones. The proof of this new result is carried out in Section 3 and its extensions are studied in Section 4. Section 5 is devoted to a regularization of the problem (1.1). The regularized problem provides a lower and sufficiently sharp bound of $\lambda^{*}$ , reduces the constraints in the dual problem, and thus it is convenient for numerical solution. In Section 6, a computable majorant of the quantity $\zeta^{*}$ is derived. Section 7 contains particular examples of the abstract problem (1.1), including classical and strain-gradient plasticity and a delamination problem.

2 The primal and dual problems and duality criteria

First, we recapitulate the basic assumptions used in the problem (1.1):

(A1)

$X,Y$ are two Banach spaces equipped with the norms $\|.\|_{X}$ and $\|.\|_{Y}$ , respectively. The corresponding dual spaces are denoted by $X^{*}$ and $Y^{*}$ ;
(A2)

$a\colon X\times Y\rightarrow\mathbb{R}$ is a continuous bilinear form;
(A3)

$L\colon Y\rightarrow\mathbb{R}$ is a non-trivial continuous linear functional (i.e., $L\neq 0$ in $Y^{*}$ );
(A4)

$P\subset X$ is a nonempty, closed and convex set with $0_{X}\in P$ .

The primal problem in (1.1) reads

\zeta^{*}=\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\sup_{x\in P}\ a(x,y)=\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\mathcal{J}(y),

(2.1)

where

\mathcal{J}\colon Y\rightarrow\mathbb{R}\cup\{+\infty\},\qquad\mathcal{J}(y):=\sup_{x\in P}\ a(x,y),\qquad y\in Y.

(2.2)

The functional $\mathcal{J}$ is convex, proper and 1-positively homogeneous. In addition, the effective domain $\mathrm{dom}\,\mathcal{J}$ is a convex cone; see Section 6 for more details. We shall assume that all cones considered in the text have a vertex at zero, so henceforth do not emphasize this property. We say that the problem (2.1) has a solution if the functional $\mathcal{J}$ has a minimizer in the feasible set $\mathrm{dom}\,\mathcal{J}\cap\{y\in Y\ |\;L(y)=1\}$ . Using the positive homogeneity of $\mathcal{J}$ , we obtain the following useful and equivalent definition of $\zeta^{*}$ :

\zeta^{*}=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;\mathcal{J}(y)-\lambda L(y)\geq 0\;\;\forall y\in Y\}.

(2.3)

To rewrite the dual problem in (1.1) we define the functional

\mathcal{I}(x):=\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\ a(x,y)=\left\{\begin{array}[]{cc}\lambda,&\exists\lambda\in\mathbb{R}:\;\;a(x,y)=\lambda L(y)\;\;\forall y\in Y,\\ -\infty,&\mbox{otherwise},\end{array}\right.\quad x\in X,

(2.4)

and the related set

\Lambda_{\lambda}:=\{x\in X\ |\;\;a(x,y)=\lambda L(y)\;\;\forall y\in Y\}.

(2.5)

Then, we have

\lambda^{*}=\sup_{x\in P}\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\ a(x,y)=\sup_{x\in P}\,\mathcal{I}(x)=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\}.

(2.6)

We shall say that the problem (2.6) has a solution if $\lambda^{*}<+\infty$ and there exists $\bar{x}\in P\cap\Lambda_{\lambda^{*}}$ .
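The definitions (2.1)–(2.6) can be illustrated on a small finite-dimensional instance. The following sketch is not taken from the paper: it assumes $X=Y=\mathbb{R}^{2}$ , $a(x,y)=x\cdot y$ , $L(y)=l\cdot y$ and the box $P=[-1,1]^{2}$ , for which $\mathcal{J}(y)=|y_{1}|+|y_{2}|$ and $\Lambda_{\lambda}=\{\lambda l\}$ . The function names and the grid parameters are hypothetical.

```python
# Toy instance (an assumption for illustration, not from the paper):
# X = Y = R^2, a(x, y) = x . y, L(y) = l . y, P = [-1, 1]^2.

def dual_value(l):
    """lambda* = sup{lam >= 0 : lam * l lies in the box P}, cf. (2.6).

    Here Lambda_lam = {lam * l}, so the supremum equals 1 / max_i |l_i|.
    """
    return 1.0 / max(abs(c) for c in l)


def primal_functional(y):
    """J(y) = sup_{x in P} x . y = |y_1| + |y_2| for the box P, cf. (2.2)."""
    return sum(abs(c) for c in y)


def primal_value(l, half_width=2.0, steps=4000):
    """Approximate zeta* = inf{J(y) : l . y = 1} by sampling the feasible line.

    The sampled minimum is an upper estimate of the true infimum.
    """
    norm2 = l[0] ** 2 + l[1] ** 2
    y0 = (l[0] / norm2, l[1] / norm2)  # particular point with l . y0 = 1
    d = (-l[1], l[0])                  # direction with l . d = 0
    best = float("inf")
    for k in range(steps + 1):
        s = -half_width + 2.0 * half_width * k / steps
        y = (y0[0] + s * d[0], y0[1] + s * d[1])
        best = min(best, primal_functional(y))
    return best


l = (2.0, 1.0)
lam_star = dual_value(l)     # dual side of (1.1)
zeta_star = primal_value(l)  # primal side of (1.1), grid approximation
```

In this bounded case the two values agree up to the grid resolution, as one would expect from a result of the type of Theorem 2.1; the sketch says nothing about unbounded $P$ .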

Now, we present three different results ensuring the equality $\lambda^{*}=\zeta^{*}$ and the existence of primal or dual solutions. The first result follows from [12, Proposition VI.2.3 and Remark VI.2.3].

Theorem 2.1.

Let (A1)–(A4) be satisfied and assume in addition that

(B)

$P$ is a bounded set in $X$ .

Then $\lambda^{*}=\zeta^{*}$ and the dual problem (2.6) has a solution.

Unfortunately, the set $P$ can be unbounded in plasticity and other applications. Therefore, we also need other criteria. The second result has been introduced in [9, Theorem 2.1] and also used in [10, Theorem 5.7]. It is convenient for use with non-reflexive spaces such as $L^{\infty}$ .

Theorem 2.2.

Let (A1)–(A4) be satisfied together with the following:

$(C1)$

$P$ has a non-empty interior in $X$ ;
$(C2)$

There exists $x_{0}\in X$ such that $a(x_{0},y)=L(y)$ for any $y\in Y$ ;

(C3)

For any $M\in X^{*}$ such that

\left\{\begin{array}[]{l}\inf_{x\in P}\limits M(x)>-\infty,\\[8.53581pt] a(x,y)=0\;\;\forall y\in Y\;\;\Longrightarrow M(x)=0,\end{array}\right.

there exists $y_{0}\in Y$ satisfying $a(x,y_{0})=M(x)$ for any $x\in X$ .

Then $\lambda^{*}=\zeta^{*}$ and the primal problem has a solution.

The third result is inspired by [16, Theorem 5.2]. It is convenient for analysis on reflexive Banach spaces. This result is new and will be proven in the next section.

Theorem 2.3.

Let (A1)–(A4) be satisfied and, in addition, assume the following:

$(D1)$

$X$ is a reflexive Banach space;
$(D2)$

$Y$ is a Hilbert space with a scalar product $(.,.)_{Y}$ and the induced norm $\|.\|_{Y}$ ;
$(D3)$

For any $x\in P$ there exist $x_{A}\in P_{A}$ and $x_{C}\in P_{C}$ such that $x=x_{A}+x_{C}$ , where $P_{A}\subset X$ is closed, convex and bounded and $P_{C}\subset P$ is a closed convex cone;

(D4)

\inf_{\begin{subarray}{c}x_{C}\in P_{C}\\ x_{C}\neq 0_{X}\end{subarray}}\ \sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\ \frac{a(x_{C},y)}{\|x_{C}\|_{X}\|y\|_{Y}}=c_{*}>0.

(2.7)

Then $\lambda^{*}=\zeta^{*}$ . Moreover, if $\lambda^{*}<+\infty$ then the dual problem (2.6) has a solution.

It is worth noting that for the validity of the theorem it suffices to assume that the set $P_{C}$ is only closed and convex in $X$ and satisfies $(D4)$ . On the other hand, we have

\frac{a(x_{C},y)}{\|x_{C}\|_{X}\|y\|_{Y}}=\frac{a(\alpha x_{C},y)}{\|\alpha x_{C}\|_{X}\|y\|_{Y}}\quad\forall\alpha>0.

This fact (independence of the scaling parameter) explains why we assume that $P_{C}$ is a convex cone. In addition, we shall see in Section 6 that the cones $P_{C}$ and $\mathrm{dom}\,\mathcal{J}$ are closely related.

We also note that any closed linear subspace of $X$ is a special case of the cone $P_{C}$ . Then, we arrive at the standard inf-sup condition on function spaces. This case will be considered in Theorem 4.2 and in Section 7.
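To make the role of the cone in (D4) concrete, the sketch below evaluates the inf-sup constant numerically in a two-dimensional setting. It assumes (this is not from the paper) $X=Y=\mathbb{R}^{2}$ with Euclidean norms and $a(x,y)=x^{T}By$ for a matrix $B$ , so that the inner supremum in (2.7) equals $|B^{T}x|$ ; the cone is taken to be a sector of directions, and the function name is hypothetical.

```python
import math


def inf_sup_on_sector(B, theta_lo, theta_hi, steps=2000):
    """Approximate c_* = inf over unit x in a sector of sup_y a(x, y) / |y|.

    Assumed setting: a(x, y) = x^T B y on R^2 x R^2 with Euclidean norms.
    For fixed x the inner supremum equals |B^T x|, so only x is sampled.
    """
    best = float("inf")
    for k in range(steps + 1):
        theta = theta_lo + (theta_hi - theta_lo) * k / steps
        x = (math.cos(theta), math.sin(theta))
        bt_x = (B[0][0] * x[0] + B[1][0] * x[1],
                B[0][1] * x[0] + B[1][1] * x[1])
        best = min(best, math.hypot(bt_x[0], bt_x[1]))
    return best


B = [[1.0, 0.0], [0.0, 0.0]]  # singular: a((0, 1), y) = 0 for all y

# Cone P_C = {x : x_1 >= |x_2|}, i.e. directions in [-pi/4, pi/4].
c_cone = inf_sup_on_sector(B, -math.pi / 4, math.pi / 4)

# Whole space (all directions): the standard inf-sup condition.
c_full = inf_sup_on_sector(B, 0.0, 2.0 * math.pi)
```

For this singular $B$ the constant over the whole space is numerically zero, while the constant restricted to the sector stays close to $\cos(\pi/4)$ . This suggests how the condition on cones can hold in situations where the condition on the full space fails.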

3 The proof of Theorem 2.3

Within this section we assume that the conditions (A1)–(A4), (D1)–(D4) are satisfied and also $\lambda^{*}<+\infty$ (notice that Theorem 2.3 holds trivially for $\lambda^{*}=+\infty$ ). To prove this theorem we define auxiliary functions $\varphi\colon\mathbb{R}_{+}\rightarrow\mathbb{R}_{+}$ and $\Phi_{\lambda}\colon X\rightarrow\mathbb{R}_{+}$ :

\varphi(\lambda):=\inf_{x\in P}\Phi_{\lambda}(x),\quad\Phi_{\lambda}(x):=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{a(x,y)-\lambda L(y)}{\|y\|_{Y}}.

(3.1)

Their basic properties are introduced in the following lemma.

Lemma 3.1.

The function $\Phi_{\lambda}$ is nonnegative, convex and Lipschitz continuous in $X$ for any $\lambda\in\mathbb{R}_{+}$ . The function $\varphi$ is nonnegative, nondecreasing, and Lipschitz continuous in $\mathbb{R}_{+}$ .

Proof.

It is straightforward to verify that $\Phi_{\lambda}$ and $\varphi$ are nonnegative and convex. Let $x_{1},x_{2}\in X$ . Then, using continuity of the bilinear form $a$ , we have

\Phi_{\lambda}(x_{1})=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{a(x_{2},y)-\lambda L(y)+a(x_{1}-x_{2},y)}{\|y\|_{Y}}\leq\Phi_{\lambda}(x_{2})+\|a\|\|x_{1}-x_{2}\|_{X},

where $\|a\|$ is the norm of $a$ . Similarly, $\Phi_{\lambda}(x_{2})\leq\Phi_{\lambda}(x_{1})+\|a\|\|x_{1}-x_{2}\|_{X}$ and so $|\Phi_{\lambda}(x_{1})-\Phi_{\lambda}(x_{2})|\leq\|a\|\|x_{1}-x_{2}\|_{X}$ proving the Lipschitz continuity of $\Phi_{\lambda}$ in $X$ .

Since $P$ is convex and $0_{X}\in P$ , we have $x/\alpha\in P$ for any $x\in P$ and $\alpha\geq 1$ . Hence,

\varphi(\alpha\lambda)=\alpha\inf_{x\in P}\Phi_{\lambda}(x/\alpha)\geq\alpha\inf_{x\in P}\Phi_{\lambda}(x)=\alpha\varphi(\lambda)\geq\varphi(\lambda)\quad\forall\alpha\geq 1,

i.e., $\varphi$ is nondecreasing in $\mathbb{R}_{+}$ . Let $\lambda,\bar{\lambda}\in\mathbb{R}_{+}$ , $\lambda<\bar{\lambda}$ . Then $\varphi(\lambda)\leq\varphi(\bar{\lambda})$ and

\varphi(\bar{\lambda})=\inf_{x\in P}\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{a(x,y)-\lambda L(y)-(\bar{\lambda}-\lambda)L(y)}{\|y\|_{Y}}\leq\varphi(\lambda)+(\bar{\lambda}-\lambda)\|L\|_{Y^{*}}.

Thus $\varphi$ is Lipschitz continuous in $\mathbb{R}_{+}$ with modulus $\|L\|_{Y^{*}}$ . ∎

The next lemma shows that the function $\varphi$ is closely related to the problems (2.6) and (2.1).

Lemma 3.2.

The function $\varphi$ defined in (3.1) satisfies the following relations:

\varphi(\lambda)=0\;\;\mbox{if}\;\;\lambda\leq\zeta^{*},\quad\varphi(\lambda)>0\;\;\mbox{if}\;\;\lambda>\zeta^{*},

(3.2)

and

\lambda^{*}=\zeta^{*}\quad\mbox{if and only if}\quad\varphi(\lambda)>0\quad\forall\lambda>\lambda^{*}.

(3.3)

Proof.

To prove (3.2) we use the Lagrangian

\mathcal{L}(x,y):=\frac{1}{2}\|y\|_{Y}^{2}+a(x,y)-\lambda L(y),\quad x\in P,\;y\in Y.

(3.4)

The mapping $y\mapsto\mathcal{L}(x,y)$ is coercive, convex, and continuous in $Y$ for any $x\in P$ while $x\mapsto\mathcal{L}(x,y)$ is linear for any $y\in Y$ and the set $P$ is closed and convex in $X$ . Therefore, by [12, Proposition VI 2.3], we know that

\min_{y\in Y}\sup_{x\in P}\mathcal{L}(x,y)=\sup_{x\in P}\inf_{y\in Y}\mathcal{L}(x,y).

(3.5)

For any given $x\in P$ , there exists a unique element $y_{x}\in Y$ such that

\mathcal{L}(x,y_{x})\leq\mathcal{L}(x,y)\quad\forall y\in Y,

or equivalently

(y_{x},y)_{Y}=\lambda L(y)-a(x,y)\quad\forall y\in Y.

(3.6)

Consequently,

\|y_{x}\|_{Y}=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{(y_{x},y)_{Y}}{\|y\|_{Y}}=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{-(y_{x},y)_{Y}}{\|y\|_{Y}}=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{a(x,y)-\lambda L(y)}{\|y\|_{Y}}=\Phi_{\lambda}(x)

(3.7)

and

\sup_{x\in P}\inf_{y\in Y}\mathcal{L}(x,y)=\sup_{x\in P}\Big{\{}-\frac{1}{2}\|y_{x}\|^{2}_{Y}\Big{\}}\stackrel{{\scriptstyle(3.7)}}{{=}}-\frac{1}{2}\inf_{x\in P}\Phi_{\lambda}^{2}(x)=-\frac{1}{2}\left(\inf_{x\in P}\Phi_{\lambda}(x)\right)^{2}=-\frac{1}{2}\varphi^{2}(\lambda).

(3.8)

From (3.5) and (3.8), we have:

-\frac{1}{2}\varphi^{2}(\lambda)=\min_{y\in Y}\sup_{x\in P}\mathcal{L}(x,y)=\min_{y\in Y}\left\{\frac{1}{2}\|y\|_{Y}^{2}+\mathcal{J}(y)-\lambda L(y)\right\}\quad\forall\lambda\in\mathbb{R}_{+},

where $\mathcal{J}$ is the primal functional defined by (2.2). Thus,

\varphi(\lambda)=\left(-2\min_{y\in Y}\left\{\frac{1}{2}\|y\|_{Y}^{2}+\mathcal{J}(y)-\lambda L(y)\right\}\right)^{1/2}.

(3.9)

From (2.3), one can see that $\mathcal{J}(y)-\lambda L(y)\geq 0$ for any $\lambda<\zeta^{*}$ and $y\in Y$ . Hence, $\varphi(\lambda)=0$ for any $\lambda<\zeta^{*}$ and $\varphi(\zeta^{*})=0$ using the continuity argument. On the other hand, if $\lambda>\zeta^{*}$ then there exists $\bar{y}\in Y$ such that $\mathcal{J}(\bar{y})-\lambda L(\bar{y})<0$ . Hence,

\frac{1}{2}\|\alpha\bar{y}\|_{Y}^{2}+\mathcal{J}(\alpha\bar{y})-\lambda L(\alpha\bar{y})=\alpha\left\{\frac{\alpha}{2}\|\bar{y}\|_{Y}^{2}+\mathcal{J}(\bar{y})-\lambda L(\bar{y})\right\}<0

for any $\alpha>0$ small enough. From this and (3.9), it follows that $\varphi(\lambda)>0$ for any $\lambda>\zeta^{*}$ . Therefore, (3.2) holds. It is easy to see that (3.3) follows from (3.2) and the inequality $\lambda^{*}\leq\zeta^{*}$ . ∎
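The statement (3.2) can be checked by hand on a toy instance. The sketch below assumes (this is not from the paper) $X=Y=\mathbb{R}^{2}$ with the Euclidean norm, $a(x,y)=x\cdot y$ , $L(y)=l\cdot y$ and $P=[-1,1]^{2}$ . Under these assumptions $\Phi_{\lambda}(x)=|x-\lambda l|$ , so $\varphi(\lambda)$ is the distance from $\lambda l$ to the box and $\zeta^{*}=\lambda^{*}=1/\max_{i}|l_{i}|$ . The function name is hypothetical.

```python
import math


def phi(lam, l):
    """phi(lam) = inf_{x in P} Phi_lam(x) for the assumed toy data.

    With a(x, y) = x . y and the Euclidean norm on Y, Phi_lam(x) = |x - lam*l|,
    so phi(lam) is the distance from lam*l to the box P = [-1, 1]^2.
    """
    target = [lam * c for c in l]
    # Componentwise clipping gives the minimiser x_lam of problem (3.10).
    x_lam = [min(1.0, max(-1.0, t)) for t in target]
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(target, x_lam)))


l = (2.0, 1.0)
zeta_star = 1.0 / max(abs(c) for c in l)  # equals lambda* for this toy

below = phi(0.25, l)           # lam < zeta*: lam*l is inside P
at_critical = phi(zeta_star, l)
above = phi(0.6, l)            # lam > zeta*: lam*l has left P
```

In this example $\varphi$ vanishes up to $\zeta^{*}$ and is positive afterwards, in line with (3.2), and its increments are bounded by $|l|\,|\bar{\lambda}-\lambda|$ as in Lemma 3.1.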

Next, consider the following problem: given $\lambda\geq 0$ ,

\mbox{{\it find }}x_{\lambda}\in P:\quad\Phi_{\lambda}(x_{\lambda})\leq\Phi_{\lambda}(x)\quad\forall x\in P.

(3.10)

Lemma 3.3.

Let (3.10) have a solution for any $\lambda\geq 0$ . Then $\lambda^{*}=\zeta^{*}$ . In addition, $P\cap\Lambda_{\lambda^{*}}\neq\emptyset$ ; that is, the dual problem (2.6) has a solution.

Proof.

Let $\lambda>\lambda^{*}$ be fixed but arbitrary and $x_{\lambda}\in P$ be the solution to (3.10). From (2.6) and the choice of $\lambda$ , it follows that $x_{\lambda}\not\in\Lambda_{\lambda}$ . Using the definition (2.5) of $\Lambda_{\lambda}$ , we see that there exists $\bar{y}\in Y$ such that $a(x_{\lambda},\bar{y})-\lambda L(\bar{y})>0$ . Hence, $\varphi(\lambda)=\Phi_{\lambda}(x_{\lambda})>0$ . By Lemma 3.2, we have $\lambda^{*}=\zeta^{*}$ . If $\lambda\leq\lambda^{*}$ then $\varphi(\lambda)=\Phi_{\lambda}(x_{\lambda})=0$ and so $x_{\lambda}\in P\cap\Lambda_{\lambda}$ , proving the existence of a solution to (2.6). ∎

Proof of Theorem 2.3. The proof is based on Lemma 3.3. We show that the problem (3.10) has a solution for any $\lambda\geq 0$ under the assumptions of Theorem 2.3. From Lemma 3.1, we know that the function $\Phi_{\lambda}$ is convex and Lipschitz continuous in $X$ for any $\lambda\in\mathbb{R}_{+}$ . Using the assumptions (D3) and (D4) we prove that $\Phi_{\lambda}$ is also coercive in $P$ for any $\lambda\geq 0$ . Indeed, for any $x\in P$ and $\lambda\in\mathbb{R}_{+}$ , we have:

\begin{aligned}\Phi_{\lambda}(x)&=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{a(x_{C},y)+a(x_{A},y)-\lambda L(y)}{\|y\|_{Y}}\\ &\geq c_{*}\|x_{C}\|_{X}-\|a\|\|x_{A}\|_{X}-\lambda\|L\|_{Y^{*}}\\ &\geq c_{*}\|x\|_{X}-(c_{*}+\|a\|)\|x_{A}\|_{X}-\lambda\|L\|_{Y^{*}}\\ &\geq c_{*}\|x\|_{X}-(c_{*}+\|a\|)\rho_{A}-\lambda\|L\|_{Y^{*}}\quad\forall x\in P,\end{aligned}

(3.11)

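where $c_{*}>0$ is the inf-sup constant from (2.7) and $\rho_{A}$ is a positive constant characterizing the boundedness of $P_{A}$ , i.e., $\|x_{A}\|_{X}\leq\rho_{A}$ for all $x_{A}\in P_{A}$ . Since $X$ is a reflexive Banach space, the properties of $\Phi_{\lambda}$ (convexity, continuity and coercivity on the closed convex set $P$ ) guarantee that (3.10) has a solution for any $\lambda\geq 0$ . ∎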
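The individual steps of the coercivity estimate in the proof above can be spelled out as follows. This is only an unpacking of that estimate under the decomposition $x=x_{A}+x_{C}$ from (D3), with $\rho_{A}$ read as a bound on $\|x_{A}\|_{X}$ over $P_{A}$ ; it adds no new assumption.

```latex
\begin{aligned}
&\text{(i) by (2.7) and continuity of $a$ and $L$:}\quad
\sup_{y\neq 0_{Y}}\frac{a(x_{C},y)+a(x_{A},y)-\lambda L(y)}{\|y\|_{Y}}
\geq c_{*}\|x_{C}\|_{X}-\|a\|\|x_{A}\|_{X}-\lambda\|L\|_{Y^{*}},\\
&\text{(ii) by the triangle inequality:}\quad
\|x_{C}\|_{X}=\|x-x_{A}\|_{X}\geq\|x\|_{X}-\|x_{A}\|_{X},\\
&\text{(iii) combining (i) and (ii):}\quad
c_{*}\|x_{C}\|_{X}-\|a\|\|x_{A}\|_{X}\geq c_{*}\|x\|_{X}-(c_{*}+\|a\|)\|x_{A}\|_{X},\\
&\text{(iv) by boundedness of $P_{A}$:}\quad
\|x_{A}\|_{X}\leq\rho_{A}.
\end{aligned}
```

Consequently $\Phi_{\lambda}(x)\rightarrow+\infty$ as $\|x\|_{X}\rightarrow+\infty$ with $x\in P$ , which is the coercivity used to obtain a minimizer.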

4 Generalizations of Theorem 2.3

Now we present three different generalizations (or extensions) of Theorem 2.3. Since their proofs are quite analogous, we only sketch them.

First, the assumption (D4) cannot hold if the subspace

H:=\{x_{0}\in X\ |\;\;a(x_{0},y)=0\quad\forall y\in Y\}

(4.1)

contains an element $x_{0}\in P_{C}$ such that $x_{0}\neq 0$ . In Section 7, we will show that this case may arise in some applications. To weaken the assumption (D4) we introduce the quotient space $X/H$ (with the norm $\|\cdot\|_{X/H}$ ) whose elements are the equivalence classes induced by the equivalence relation:

x_{1}\cong x_{2}\quad\mbox{if and only if}\quad x_{1}-x_{2}\in H,\quad x_{1},x_{2}\in X.

Let $P/H$ denote the set of equivalence classes generated by the set $P$ . It is easy to verify that $P/H$ is a closed, convex, and nonempty set in $X/H$ . Similarly, one can introduce the sets $P_{A}/H$ and $P_{C}/H$ , where $P_{A}$ and $P_{C}$ are defined in accordance with the assumption (D3). These sets have properties analogous to those of $P_{A}$ and $P_{C}$ . In particular, for any $x\in P/H$ there exist $x_{A}\in P_{A}/H$ and $x_{C}\in P_{C}/H$ such that $x=x_{A}+x_{C}$ . We note that if $X$ is a Hilbert space then $X/H$ can be identified with the orthogonal complement $H^{\perp}$ of $H$ in $X$ and $P/H$ with the projection of $P$ onto $H^{\perp}$ .

Theorem 4.1.

Let the assumptions (A1)–(A4) and (D1)–(D3) of Theorem 2.3 be satisfied, $H$ be defined by (4.1), and

\inf_{\begin{subarray}{c}x_{C}\in P_{C}/H\\ x_{C}\neq 0_{X}\end{subarray}}\ \sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\ \frac{a(x_{C},y)}{\|x_{C}\|_{X/H}\|y\|_{Y}}=c_{*}>0.

(4.2)

Then $\lambda^{*}=\zeta^{*}$ . If, in addition, $\lambda^{*}<+\infty$ , then the dual problem (2.6) has a solution.

Sketch of the proof: It suffices to show that (3.10) has a solution in $P$ for any $\lambda\geq 0$ under the assumptions of this theorem. Let $\lambda\geq 0$ be fixed. Using (4.1), we see that the function $\Phi_{\lambda}$ defined in (3.1) satisfies

\Phi_{\lambda}(x+x_{0})=\Phi_{\lambda}(x)\quad\forall x\in X,\;\forall x_{0}\in H.

(4.3)

Therefore, (3.10) has a solution in $P$ if and only if $\Phi_{\lambda}$ has a minimum in $P/H$ . From (4.2), one can prove the coercivity of $\Phi_{\lambda}$ in $P/H$ in the same way as in the proof of Theorem 2.3. Therefore, $\Phi_{\lambda}$ has a minimum in $P/H$ and thus the result of Theorem 4.1 holds. ∎

Second, it turns out that the assumption (D2) of Theorem 2.3 can be extended to some reflexive Banach spaces associated with a bounded Lipschitz domain $\Omega\subset\mathbb{R}^{d}$ , $d=2,3$ .

Theorem 4.2.

Let the assumptions (A1)–(A4) and (D1), (D3)–(D4) be satisfied and

(D2’)

$Y=W^{1,p}(\Omega,\mathbb{R}^{m})$ , equipped with the standard Sobolev norm

$\|y\|_{Y}=\left(\int_{\Omega}|\nabla y|^{p}+|y|^{p}\,dx\right)^{1/p}.$

Then $\lambda^{*}=\zeta^{*}$ . In addition, if $\lambda^{*}<+\infty$ then the dual problem (2.6) has a solution.

Sketch of the proof: It suffices to modify formulae (3.4)–(3.9). To this end, we set

\mathcal{L}(x,y):=\frac{1}{p}\|y\|_{Y}^{p}+a(x,y)-\lambda L(y),\quad x\in P,\;y\in Y.

(4.4)

Then (3.5) holds and there exists a unique element $y_{x}\in Y$ such that

\mathcal{L}(x,y_{x})\leq\mathcal{L}(x,y)\quad\forall y\in Y,

or equivalently

\int_{\Omega}|\nabla y_{x}|^{p-2}\nabla y_{x}:\nabla y+|y_{x}|^{p-2}y_{x}\cdot y\,dx=\lambda L(y)-a(x,y)\quad\forall y\in Y.

(4.5)

Consequently,

\displaystyle\|y_{x}\|_{Y}^{p-1}=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{\int_{\Omega}|\nabla y_{x}|^{p-2}\nabla y_{x}:\nabla y+|y_{x}|^{p-2}y_{x}\cdot y\,dx}{\|y\|_{Y}}=\sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\frac{a(x,y)-\lambda L(y)}{\|y\|_{Y}}=\Phi_{\lambda}(x)

(4.6)

and

\sup_{x\in P}\inf_{y\in Y}\mathcal{L}(x,y)=\sup_{x\in P}\Big{\{}-\frac{1}{q}\|y_{x}\|^{p}_{Y}\Big{\}}\stackrel{{\scriptstyle(4.6)}}{{=}}-\frac{1}{q}\inf_{x\in P}\Phi_{\lambda}^{q}(x)=-\frac{1}{q}\left(\inf_{x\in P}\Phi_{\lambda}(x)\right)^{q}=-\frac{1}{q}\varphi^{q}(\lambda),

(4.7)

where $1/q=1-1/p$ . The rest of the proof is analogous to that of Section 3. ∎

The third extension illustrates that Theorem 2.3 remains valid even if the space $Y$ is replaced by its conic subset.

Theorem 4.3.

Let the assumptions (A1)–(A4) and (D1)–(D3) of Theorem 2.3 be satisfied, $Y_{C}\subset Y$ be a closed convex cone, and

\inf_{\begin{subarray}{c}x_{C}\in P_{C}\\ x_{C}\neq 0_{X}\end{subarray}}\ \sup_{\begin{subarray}{c}y\in Y_{C}\\ y\neq 0_{Y}\end{subarray}}\ \frac{-a(x_{C},y)}{\|x_{C}\|_{X}\|y\|_{Y}}=c_{*}>0.

(4.8)

Then

\lambda^{*}:=\sup_{x\in P}\inf_{\begin{subarray}{c}y\in Y_{C}\\ L(y)=1\end{subarray}}\ a(x,y)=\inf_{\begin{subarray}{c}y\in Y_{C}\\ L(y)=1\end{subarray}}\sup_{x\in P}\ a(x,y)=:\zeta^{*}.

(4.9)

In addition, $\lambda^{*}=\max\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\}$ , where

\Lambda_{\lambda}=\{x\in X\ |\;a(x,y)\geq\lambda L(y)\;\forall y\in Y_{C}\}.

Sketch of the proof: The following two changes in the proof of Theorem 2.3 are necessary:

We define

\Phi_{\lambda}(x):=\sup_{\begin{subarray}{c}y\in Y_{C}\\ y\neq 0_{Y}\end{subarray}}\frac{-a(x,y)+\lambda L(y)}{\|y\|_{Y}}

and consider the minimization problem

\mbox{find }y_{x}\in Y_{C}:\quad\mathcal{L}(x,y_{x})\leq\mathcal{L}(x,y)\quad\forall y\in Y_{C}

with $\mathcal{L}$ defined by (3.4). Since $Y_{C}$ is a closed convex cone, the corresponding necessary and sufficient condition characterising $y_{x}$ reads

(y_{x},y)_{Y}\geq\lambda L(y)-a(x,y)\quad\forall y\in Y_{C},\quad(y_{x},y_{x})_{Y}=\lambda L(y_{x})-a(x,y_{x}).

We obtain

\|y_{x}\|_{Y}=\sup_{\begin{subarray}{c}y\in Y_{C}\\ y\neq 0_{Y}\end{subarray}}\frac{(y_{x},y)_{Y}}{\|y\|_{Y}}\geq\sup_{\begin{subarray}{c}y\in Y_{C}\\ y\neq 0_{Y}\end{subarray}}\frac{-a(x,y)+\lambda L(y)}{\|y\|_{Y}}\geq\frac{-a(x,y_{x})+\lambda L(y_{x})}{\|y_{x}\|_{Y}}=\|y_{x}\|_{Y},

so that $\|y_{x}\|_{Y}=\Phi_{\lambda}(x)$ .

To prove Lemma 3.3, we modify (2.6) as follows:

\lambda^{*}=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\},\quad\Lambda_{\lambda}:=\{x\in X\ |\;\;a(x,y)\geq\lambda L(y)\;\;\forall y\in Y_{C}\}.

Then the proof of Theorem 2.3 is applicable without any substantial changes.

5 Regularization method

Regularization methods are often used for solving nonsmooth, constrained, or ill-posed problems. As an example, we mention proximal point methods [22] which can be used for solving the problems (2.6) and (2.1).

Here we consider another regularization method which has been subsequently developed in [31, 7, 14, 15] and used also in [28, 16]. In these recent papers, this method has been called either the “indirect incremental method” or the “penalization method”. Below, we generalize results of [14, 15] and show that some of these can be established in a simpler way. Within this section it is assumed that the conditions (A1)–(A4) from Section 2 hold.

To regularize the functional $\mathcal{J}$ defined by (2.2) we introduce the functional

\mathcal{J}_{\alpha}\colon Y\rightarrow\mathbb{R},\qquad\mathcal{J}_{\alpha}(y):=\max_{x\in P}\ \left\{a(x,y)-\frac{1}{2\alpha}\|x\|_{X}^{2}\right\},

(5.1)

where $\alpha>0$ is a given parameter. It is easy to see that $\mathcal{J}_{\alpha}$ is convex and finite-valued in $Y$ (unlike the functional $\mathcal{J}$ ) and $\mathcal{J}_{\alpha_{1}}\leq\mathcal{J}_{\alpha_{2}}\leq\mathcal{J}$ for any $\alpha_{1},\alpha_{2}>0$ , $\alpha_{1}\leq\alpha_{2}$ .

Lemma 5.1.

Let $\mathcal{J}$ and $\mathcal{J}_{\alpha}$ be defined by (2.2) and (5.1). Then

\lim_{\alpha\rightarrow+\infty}\mathcal{J}_{\alpha}(y)=\mathcal{J}(y)\quad\forall y\in Y.

(5.2)

Proof.

Let $y\in Y$ be fixed. As mentioned above, the sequence $\{\mathcal{J}_{\alpha}(y)\}_{\alpha}$ is nondecreasing. Therefore, it has a limit which is less than or equal to $\mathcal{J}(y)$ . On the other hand,

\lim_{\alpha\rightarrow+\infty}\mathcal{J}_{\alpha}(y)\geq\lim_{\alpha\rightarrow+\infty}\left\{a(x,y)-\frac{1}{2\alpha}\|x\|_{X}^{2}\right\}=a(x,y)\quad\forall x\in P.

Thus (5.2) holds. ∎
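The monotone convergence in Lemma 5.1 can be observed on a one-dimensional example with an unbounded constraint set. The sketch below assumes (this is not from the paper) $X=Y=\mathbb{R}$ , $a(x,y)=xy$ and $P=(-\infty,1]$ , so that $\mathcal{J}(y)=y$ for $y\geq 0$ and $\mathcal{J}(y)=+\infty$ for $y<0$ . The function names are hypothetical.

```python
def J_alpha(y, alpha):
    """J_alpha(y) = max_{x <= 1} { x*y - x**2 / (2*alpha) }, cf. (5.1).

    Assumed toy data: X = Y = R, a(x, y) = x*y, P = (-inf, 1].
    The objective is concave in x, so the maximiser is the unconstrained
    maximiser alpha*y clipped to P.
    """
    x = min(alpha * y, 1.0)
    return x * y - x * x / (2.0 * alpha)


def J(y):
    """J(y) = sup_{x <= 1} x*y: equals y for y >= 0 and +inf for y < 0."""
    return y if y >= 0 else float("inf")


alphas = [1.0, 10.0, 100.0, 1000.0]
finite_case = [J_alpha(2.0, a) for a in alphas]     # increases towards J(2) = 2
infinite_case = [J_alpha(-1.0, a) for a in alphas]  # grows without bound
```

For $y<0$ the regularized values equal $\alpha y^{2}/2$ and are finite for every $\alpha$ , although the limit $\mathcal{J}(y)$ is $+\infty$ ; this is the sense in which $\mathcal{J}_{\alpha}$ is finite-valued unlike $\mathcal{J}$ .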

The regularization of the primal problem (2.1) with respect to the parameter $\alpha$ defines the function $\psi:\mathbb{R}_{+}\rightarrow\mathbb{R}_{+}$ :

\psi(\alpha):=\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\mathcal{J}_{\alpha}(y),\qquad\alpha>0.

(5.3)

In view of (5.1) and [12, Proposition VI 2.3], we have

\psi(\alpha)=\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\max_{x\in P}\ \left\{a(x,y)-\frac{1}{2\alpha}\|x\|_{X}^{2}\right\}=\max_{x\in P}\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\ \left\{a(x,y)-\frac{1}{2\alpha}\|x\|_{X}^{2}\right\}.

(5.4)

Thus, the main duality relation holds without any gap, unlike the original primal-dual problem (1.1). The properties of the function $\psi$ are set out in the following theorem.

Theorem 5.1.

The function $\psi$ is continuous, nondecreasing and

\lim_{\alpha\rightarrow+\infty}\psi(\alpha)=\lambda^{*}\leq\zeta^{*},

(5.5)

where $\lambda^{*}$ and $\zeta^{*}$ are defined by (2.6) and (2.1), respectively.

Proof.

From the properties of $\{\mathcal{J}_{\alpha}\}_{\alpha>0}$ , it is easy to see that $\psi$ is nondecreasing and thus it has a limit as $\alpha\rightarrow+\infty$ . Comparing (2.6) and (5.4)₃ we see that $\psi(\alpha)\leq\lambda^{*}$ . In addition, for any $x\in P$ we have

\lim_{\alpha\rightarrow+\infty}\psi(\alpha)\stackrel{{\scriptstyle(5.4)}}{{\geq}}\lim_{\alpha\rightarrow+\infty}\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\ \left\{a(x,y)-\frac{1}{2\alpha}\|x\|_{X}^{2}\right\}=\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\ a(x,y).

Making use of the definition of $\lambda^{*}$ , we arrive at (5.5).

Let $\beta>\alpha$ . Since $0_{X}\in P$ , we have $(\alpha/\beta)x\in P$ if $x\in P$ . Hence,

\psi(\alpha)\stackrel{{\scriptstyle(5.4)}}{{\geq}}\inf_{\begin{subarray}{c}y\in Y\\ L(y)=1\end{subarray}}\max_{x\in P}\ \left\{a((\alpha/\beta)x,y)-\frac{1}{2\alpha}\|(\alpha/\beta)x\|_{X}^{2}\right\}=\frac{\alpha}{\beta}\psi(\beta).

This relation and the monotonicity of $\psi$ imply

\frac{\alpha}{\beta}\psi(\beta)\leq\psi(\alpha)\leq\psi(\beta).

Hence,

\limsup_{\beta\searrow\alpha}\psi(\beta)=\limsup_{\beta\searrow\alpha}\frac{\alpha}{\beta}\psi(\beta)\leq\psi(\alpha)\leq\liminf_{\beta\searrow\alpha}\psi(\beta).

(5.6)

Let $\beta<\alpha$ . By interchanging $\alpha$ and $\beta$ in the two-sided inequality preceding (5.6), we obtain

\frac{\beta}{\alpha}\psi(\alpha)\leq\psi(\beta)\leq\psi(\alpha)\quad\mbox{or}\quad\psi(\beta)\leq\psi(\alpha)\leq\frac{\alpha}{\beta}\psi(\beta).

Hence,

\limsup_{\beta\nearrow\alpha}\psi(\beta)\leq\psi(\alpha)\leq\liminf_{\beta\nearrow\alpha}\frac{\alpha}{\beta}\psi(\beta)=\liminf_{\beta\nearrow\alpha}\psi(\beta).

(5.7)

From (5.6) and (5.7), we have

\limsup_{\beta\rightarrow\alpha}\psi(\beta)\leq\psi(\alpha)\leq\liminf_{\beta\rightarrow\alpha}\psi(\beta)\quad\mbox{or}\quad\lim_{\beta\rightarrow\alpha}\psi(\beta)=\psi(\alpha),

implying the continuity of $\psi$ . ∎

It is worth noting that for any value of $\alpha>0$ , the quantity $\psi(\alpha)$ is a lower bound of $\lambda^{*}$ and $\zeta^{*}$ . Upper bounds of $\lambda^{*}$ and $\zeta^{*}$ will be derived in the next section.
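The behaviour of $\psi$ described in Theorem 5.1 can be checked on a toy instance where both sides of (5.4) are computable. The sketch assumes (this is not from the paper) $X=Y=\mathbb{R}^{2}$ with Euclidean norms, $a(x,y)=x\cdot y$ , $L(y)=l\cdot y$ and $P=[-1,1]^{2}$ , so that $\lambda^{*}=\zeta^{*}=1/\max_{i}|l_{i}|$ . The function names and grid parameters are hypothetical.

```python
def huber(t, alpha):
    """max_{|x| <= 1} { x*t - x**2 / (2*alpha) }: one component of J_alpha."""
    x = min(1.0, max(-1.0, alpha * t))
    return x * t - x * x / (2.0 * alpha)


def psi_primal(alpha, l, half_width=2.0, steps=4000):
    """Approximate psi(alpha) = inf{J_alpha(y) : l . y = 1} on a line grid."""
    norm2 = l[0] ** 2 + l[1] ** 2
    y0 = (l[0] / norm2, l[1] / norm2)  # particular point with l . y0 = 1
    d = (-l[1], l[0])                  # direction with l . d = 0
    best = float("inf")
    for k in range(steps + 1):
        s = -half_width + 2.0 * half_width * k / steps
        val = huber(y0[0] + s * d[0], alpha) + huber(y0[1] + s * d[1], alpha)
        best = min(best, val)
    return best


def psi_dual(alpha, l):
    """psi(alpha) via the max-inf side of (5.4) for the assumed toy data.

    The inner infimum is finite only for x = lam*l; with x in P this leaves
    max over |lam| <= lam_max of lam - lam**2 * |l|^2 / (2*alpha), which is
    attained at a nonnegative lam.
    """
    norm2 = l[0] ** 2 + l[1] ** 2
    lam_max = 1.0 / max(abs(c) for c in l)
    lam = min(alpha / norm2, lam_max)
    return lam - lam * lam * norm2 / (2.0 * alpha)


l = (2.0, 1.0)
lam_star = 1.0 / max(abs(c) for c in l)
values = [psi_dual(a, l) for a in (1.0, 5.0, 50.0, 500.0)]
```

In this example the values increase with $\alpha$ , stay below $\lambda^{*}$ , and the grid approximation of the inf-max side matches the max-inf side up to the grid resolution.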

From the numerical point of view, it is useful if the functional $\mathcal{J}_{\alpha}$ is differentiable in the Gâteaux sense. Below we establish this property of the regularized functional.

Lemma 5.2.

Let $X$ be a Hilbert space with the scalar product $(.,.)_{X}$ and define

\Pi_{\alpha}\colon Y\rightarrow P,\qquad\Pi_{\alpha}y:=\mathrm{arg}\max_{x\in P}\ \left\{a(x,y)-\frac{1}{2\alpha}\|x\|_{X}^{2}\right\}.

(5.8)

Then $\Pi_{\alpha}$ is Lipschitz continuous in $Y$ and

\mathcal{J}^{\prime}_{\alpha}(y;z):=\lim_{t\rightarrow 0}\frac{1}{t}[\mathcal{J}_{\alpha}(y+tz)-\mathcal{J}_{\alpha}(y)]=a(\Pi_{\alpha}y,z)\quad\forall\alpha>0,\;\;\forall y,z\in Y.

(5.9)

Proof.

Since $X$ is a Hilbert space, it is easy to see that there exists a unique $\Pi_{\alpha}y$ solving (5.8) and satisfying the variational inequality

\frac{1}{\alpha}(\Pi_{\alpha}y,x-\Pi_{\alpha}y)_{X}\geq a(x-\Pi_{\alpha}y,y)\quad\forall x\in P,\;\;\forall y\in Y.

(5.10)

Hence, we derive the inequalities

\begin{aligned}\frac{1}{\alpha}(\Pi_{\alpha}y,\Pi_{\alpha}(y+tz)-\Pi_{\alpha}y)_{X}&\geq a(\Pi_{\alpha}(y+tz)-\Pi_{\alpha}y,y),\\ \frac{1}{\alpha}(\Pi_{\alpha}(y+tz),\Pi_{\alpha}y-\Pi_{\alpha}(y+tz))_{X}&\geq a(\Pi_{\alpha}y-\Pi_{\alpha}(y+tz),y+tz),\end{aligned}

which hold for any $y,z\in Y$ and any $t\in\mathbb{R}$ . By adding these inequalities, we obtain

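\frac{1}{\alpha}\|\Pi_{\alpha}(y+tz)-\Pi_{\alpha}y\|_{X}^{2}\leq ta(\Pi_{\alpha}(y+tz)-\Pi_{\alpha}y,z)\leq|t|\,\|a\|\|\Pi_{\alpha}(y+tz)-\Pi_{\alpha}y\|_{X}\|z\|_{Y}.

Thus $\|\Pi_{\alpha}(y+tz)-\Pi_{\alpha}y\|_{X}\leq\alpha|t|\,\|a\|\|z\|_{Y}$ , that is, $\Pi_{\alpha}$ is Lipschitz continuous in $Y$ with the constant $\alpha\|a\|$ .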

From (5.1) and (5.8) we have, for any $t\in\mathbb{R}$ and any $y,z\in Y$ ,

\mathcal{J}_{\alpha}(y)=a(\Pi_{\alpha}y,y)-\frac{1}{2\alpha}\|\Pi_{\alpha}y\|_{X}^{2}\geq a(\Pi_{\alpha}(y+tz),y)-\frac{1}{2\alpha}\|\Pi_{\alpha}(y+tz)\|_{X}^{2},

\mathcal{J}_{\alpha}(y+tz)=a(\Pi_{\alpha}(y+tz),y+tz)-\frac{1}{2\alpha}\|\Pi_{\alpha}(y+tz)\|_{X}^{2}\geq a(\Pi_{\alpha}y,y+tz)-\frac{1}{2\alpha}\|\Pi_{\alpha}y\|_{X}^{2}.

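Hence, for $t>0$ ,

a(\Pi_{\alpha}y,z)\leq\frac{1}{t}[\mathcal{J}_{\alpha}(y+tz)-\mathcal{J}_{\alpha}(y)]\leq a(\Pi_{\alpha}(y+tz),z),

and the inequalities are reversed for $t<0$ . Letting $t\rightarrow 0$ and using the Lipschitz continuity of $\Pi_{\alpha}$ , we obtain (5.9). ∎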
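The derivative formula (5.9) can be verified numerically in finite dimensions. The sketch below assumes a toy setting that is not part of the paper: $X=\mathbb{R}^{3}$ , $Y=\mathbb{R}^{2}$ with Euclidean norms, $a(x,y)=x^{\top}Ay$ for a hypothetical matrix `A`, and $P=[-1,1]^{3}$ . For this box, $\Pi_{\alpha}y$ is the componentwise clipping of $\alpha Ay$ . The helper names are illustrative only.

```python
def matvec(A, y):
    """Return A y for a matrix stored as a list of rows."""
    return [sum(a_ij * y_j for a_ij, y_j in zip(row, y)) for row in A]


def dot(u, v):
    """Euclidean scalar product of two lists."""
    return sum(ui * vi for ui, vi in zip(u, v))


def proj_alpha(A, y, alpha):
    """Pi_alpha y for P = [-1, 1]^n: clip alpha * A y componentwise."""
    return [max(-1.0, min(1.0, alpha * c)) for c in matvec(A, y)]


def J_alpha(A, y, alpha):
    """Regularized functional a(Pi y, y) - |Pi y|^2 / (2 alpha)."""
    x = proj_alpha(A, y, alpha)
    return dot(x, matvec(A, y)) - dot(x, x) / (2.0 * alpha)


def directional_derivative_fd(A, y, z, alpha, t=1e-6):
    """Central finite-difference approximation of J'_alpha(y; z)."""
    y_plus = [yi + t * zi for yi, zi in zip(y, z)]
    y_minus = [yi - t * zi for yi, zi in zip(y, z)]
    return (J_alpha(A, y_plus, alpha) - J_alpha(A, y_minus, alpha)) / (2.0 * t)


A = [[1.0, 2.0], [0.5, -1.0], [3.0, 0.0]]
y, z, alpha = [0.3, -0.2], [1.0, 0.5], 2.0

exact = dot(proj_alpha(A, y, alpha), matvec(A, z))   # a(Pi_alpha y, z)
approx = directional_derivative_fd(A, y, z, alpha)   # difference quotient
print(exact, approx)
```

Here the third component of $\alpha Ay$ is clipped, so the example exercises both the unconstrained and the constrained branch of $\Pi_{\alpha}$ .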

Using the differentiability of $\mathcal{J}_{\alpha}$ , one can rewrite the problem (5.3) as a system of nonlinear variational equations.

Theorem 5.2.

Let $X$ be a Hilbert space with the scalar product $(.,.)_{X}$ and let $y_{\alpha}$ be a minimizer in (5.3). Then there exists $\lambda_{\alpha}\in\mathbb{R}_{+}$ such that the pair $(y_{\alpha},\lambda_{\alpha})$ is a solution of the system:

\left.\begin{array}[]{c}a(\Pi_{\alpha}y_{\alpha},z)=\lambda_{\alpha}L(z)\quad\forall z\in Y,\\ L(y_{\alpha})=1.\end{array}\right\}

(5.11)

Conversely, if $(y_{\alpha},\lambda_{\alpha})$ is a solution to (5.11) then $y_{\alpha}$ solves (5.3).
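To illustrate the system (5.11), consider the following sketch. It assumes a toy problem that is not taken from the paper: $X=Y=\mathbb{R}^{2}$ , $a(x,y)=x\cdot y$ , $L(y)=y_{1}+y_{2}$ and $P=[-1,1]^{2}$ , for which $\lambda^{*}=1$ . By symmetry and convexity, $y_{\alpha}=(1/2,1/2)$ is a natural candidate; the code checks that it satisfies (5.11) and that no point on the line $L(y)=1$ gives a smaller value of $\mathcal{J}_{\alpha}$ . All function names are hypothetical.

```python
def proj_alpha(y, alpha):
    """Pi_alpha y for a(x, y) = x . y and P = [-1, 1]^2 (clipping of alpha * y)."""
    return [max(-1.0, min(1.0, alpha * yi)) for yi in y]


def J_alpha(y, alpha):
    """Regularized functional a(Pi y, y) - |Pi y|^2 / (2 alpha)."""
    x = proj_alpha(y, alpha)
    return sum(xi * yi for xi, yi in zip(x, y)) - sum(xi * xi for xi in x) / (2.0 * alpha)


def residual_and_multiplier(y, alpha):
    """Return (lambda, residual). The first equation in (5.11) reads
    Pi_alpha y = lambda * (1, 1) here; the residual measures its violation."""
    x = proj_alpha(y, alpha)
    lam = 0.5 * (x[0] + x[1])
    return lam, max(abs(x[0] - lam), abs(x[1] - lam))


def is_minimizer(alpha, n=200, width=2.0):
    """Scan the constraint line y = (0.5 + s, 0.5 - s), |s| <= width."""
    j0 = J_alpha([0.5, 0.5], alpha)
    for k in range(-n, n + 1):
        s = width * k / n
        if J_alpha([0.5 + s, 0.5 - s], alpha) < j0 - 1e-12:
            return False
    return True


for alpha in (0.5, 2.0, 8.0):
    lam, res = residual_and_multiplier([0.5, 0.5], alpha)
    print(alpha, J_alpha([0.5, 0.5], alpha), lam, res, is_minimizer(alpha))
```

In this toy case the computed values satisfy $\mathcal{J}_{\alpha}(y_{\alpha})\leq\lambda_{\alpha}\leq\lambda^{*}=1$ for the three tested values of $\alpha$ .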

Remark 5.1.

In [14], the function $\tilde{\psi}\colon\alpha\mapsto\lambda_{\alpha}$ was introduced and analysed for the case of Hencky plasticity. It is worth noticing that this function is well defined even if (5.3) does not have a minimizer in $Y$ . In addition, $\tilde{\psi}$ is continuous and nondecreasing, with $\psi(\alpha)\leq\tilde{\psi}(\alpha)\leq\lambda^{*}$ for any $\alpha>0$ , and $\tilde{\psi}(\alpha)\rightarrow\lambda^{*}$ as $\alpha\rightarrow+\infty$ . One can expect that these considerations from [14] can be extended to our abstract problem.

6 A computable majorant of $\zeta^{*}$

For classical limit analysis problems, computable majorants of $\zeta^{*}$ have been derived in [28, 16]. The aim of this section is to derive a more general majorant valid for the abstract problem (2.1). In our analysis, we shall use the assumptions (A1)–(A4) and (D1)–(D4) of Theorem 2.3. The following alternative to the assumption (D3) will also be considered:

(D3′)

$P=P_{A}+P_{C}=\{x\in X\ |\;x=x_{A}+x_{C},\;x_{A}\in P_{A},\;x_{C}\in P_{C}\}$ , where $P$ , $P_{A}$ and $P_{C}$ have the same properties as in (D3).

We note that (D3′) is more restrictive than (D3); it has been used in [16].

From the definition of $\zeta^{*}$ (see (2.6)), we have the following simple upper bound of $\zeta^{*}$ :

\zeta^{*}\leq\frac{\mathcal{J}(y)}{L(y)}\qquad\forall y\in Y,\;\;y\in\mathrm{dom}\,\mathcal{J},\;\;L(y)>0.

(6.1)

Unfortunately, if the set $P$ is unbounded then it is difficult or even impossible to find $y\in\mathrm{dom}\,\mathcal{J}$ in such a way that the bound (6.1) would be sufficiently sharp. The aim of this section is to derive an upper bound of $\zeta^{*}$ for a larger class of functions $y\in Y$ , not necessarily belonging to $\mathrm{dom}\,\mathcal{J}$ .

First, we need to characterize the set $\mathrm{dom}\,\mathcal{J}$ . For this purpose, we define the closed convex cone

\mathcal{K}:=\{y\in Y\ |\;\;a(x,y)\leq 0\;\;\forall x\in P_{C}\},

(6.2)

and the convex, finite-valued functional

\mathcal{J}_{A}\colon Y\rightarrow\mathbb{R},\quad\mathcal{J}_{A}(y):=\max_{x\in P_{A}}\,a(x,y),\quad y\in Y.

(6.3)

Lemma 6.1.

Let the assumptions (A1)–(A4) and (D1)–(D4) be satisfied. Then

\mathrm{dom}\,\mathcal{J}=\mathcal{K}\quad\mbox{and}\quad\mathcal{J}(y)\leq\mathcal{J}_{A}(y)\quad\forall y\in\mathcal{K}.

(6.4)

Moreover, if (D3′) holds then $\mathcal{J}(y)=\mathcal{J}_{A}(y)$ for any $y\in\mathcal{K}$ .

Proof.

Assume that $y\not\in\mathcal{K}$ . Then there exists $x_{C}\in P_{C}$ such that $a(x_{C},y)>0$ . From (D3), it follows that $\alpha x_{C}\in P$ for any $\alpha\geq 0$ . Hence,

\mathcal{J}(y)\geq\lim_{\alpha\rightarrow+\infty}a(\alpha x_{C},y)=\lim_{\alpha\rightarrow+\infty}\alpha a(x_{C},y)=+\infty.

Let $y\in\mathcal{K}$ . Then

\mathcal{J}(y)\leq\sup_{x_{A}\in P_{A}}a(x_{A},y)+\sup_{x_{C}\in P_{C}}a(x_{C},y)=\mathcal{J}_{A}(y)+0=\mathcal{J}_{A}(y)<+\infty.

(6.5)

If (D3′) holds then $P_{A}=P_{A}+\{0_{X}\}\subset P$ . Hence,

\mathcal{J}(y)\geq\sup_{x\in P_{A}}a(x,y)=\mathcal{J}_{A}(y).

(6.6)

From (6.5) and (6.6), it follows that $\mathcal{J}(y)=\mathcal{J}_{A}(y)$ for any $y\in\mathcal{K}$ . ∎

From the definition of $\mathcal{J}_{A}$ and the boundedness of $a$ and $P_{A}$ , we easily derive the useful estimates

|\mathcal{J}_{A}(y_{1})-\mathcal{J}_{A}(y_{2})|\leq\mathcal{J}_{A}(y_{1}-y_{2}),\quad\forall y_{1},y_{2}\in Y,

(6.7)

and

\mathcal{J}_{A}(y)\leq\varrho_{A}\|a\|\|y\|_{Y},\quad\forall y\in Y,\quad\varrho_{A}:=\max_{x\in P_{A}}\|x\|_{X}.

(6.8)

In order to estimate $\zeta^{*}$ using $y\not\in\mathcal{K}$ , it is important to measure the distance between $y$ and $\mathcal{K}$ . Define the quantity

\|\Pi_{C}\,y\|_{X}:=\left(\max_{x\in P_{C}}\{-\|x\|^{2}_{X}+2a(x,y)\}\right)^{1/2},\quad y\in Y.

(6.9)

Remark 6.1.

The notation $\|\Pi_{C}\,y\|_{X}$ including the norm in $X$ is justified if $X$ is a Hilbert space. Indeed, define the operator

\Pi_{C}\colon Y\rightarrow P_{C},\quad\Pi_{C}\,y:=\mathrm{arg}\max_{x\in P_{C}}\{-\|x\|^{2}_{X}+2a(x,y)\},\quad y\in Y.

(6.10)

From the cone property of $P_{C}$ , (6.10) is equivalent to

\|\Pi_{C}\,y\|_{X}^{2}=a(\Pi_{C}\,y,y)\quad\mbox{and}\quad(\Pi_{C}\,y,x)_{X}\geq a(x,y)\quad\forall x\in P_{C}.

Hence, we obtain (6.9).

It is also useful to note that if $y\in\mathcal{K}$ then $\|\Pi_{C}\,y\|_{X}=0$ . We have the following result.

Lemma 6.2.

Let the assumptions (A1)–(A4) and (D1)–(D4) be satisfied and let $c_{*}>0$ , $\mathcal{K}$ and $\|\Pi_{C}\,y\|_{X}$ be defined by (2.7), (6.2) and (6.9), respectively. Then

\min_{z\in\mathcal{K}}\|y-z\|\leq C_{*}\|\Pi_{C}\,y\|_{X},\quad\forall y\in Y,\quad C_{*}:=c_{*}^{-1}>0.

(6.11)

Proof.

Using (6.2), [12, Proposition VI 2.3] and the substitution $z\mapsto z+y$ , we successively derive

\begin{aligned}\min_{z\in\mathcal{K}}\|y-z\|^{2}&=\min_{z\in Y}\,\sup_{x\in P_{C}}\left\{\|y-z\|^{2}+2a(x,z)\right\}\\ &=\sup_{x\in P_{C}}\,\min_{z\in Y}\left\{\|y-z\|^{2}+2a(x,z)\right\}\\ &=\sup_{x\in P_{C}}\,\min_{z\in Y}\left\{\|z\|^{2}+2a(x,z)+2a(x,y)\right\}\quad\forall y\in Y.\end{aligned}

(6.12)

For any $x\in X$ , there exists a unique $z_{x}\in Y$ such that

(z_{x},z)_{Y}=-a(x,z)\quad\forall z\in Y.

Hence,

\|z_{x}\|_{Y}=\sup_{\begin{subarray}{c}z\in Y\\ z\neq 0_{Y}\end{subarray}}\frac{a(x,z)}{\|z\|_{Y}}\quad\mbox{and}\quad\min_{z\in Y}\left\{\|z\|_{Y}^{2}+2a(x,z)\right\}=-\|z_{x}\|_{Y}^{2}.

(6.13)

Inserting (6.13) into (6.12), we find that

\begin{aligned}\min_{z\in\mathcal{K}}\|y-z\|^{2}&=\sup_{x\in P_{C}}\,\min_{z\in Y}\left\{\|z\|^{2}+2a(x,z)+2a(x,y)\right\}\\ &=\sup_{x\in P_{C}}\,\left\{-\left(\sup_{\begin{subarray}{c}z\in Y\\ z\neq 0_{Y}\end{subarray}}\frac{a(x,z)}{\|z\|_{Y}}\right)^{2}+2a(x,y)\right\}\\ &\stackrel{(2.7)}{\leq}\sup_{x\in P_{C}}\,\left\{-c_{*}^{2}\|x\|_{X}^{2}+2a(x,y)\right\}\\ &=\max_{x\in P_{C}}\,\left\{-c_{*}^{2}\|x/c_{*}^{2}\|_{X}^{2}+2a(x/c_{*}^{2},y)\right\}\\ &=\frac{1}{c_{*}^{2}}\max_{x\in P_{C}}\,\left\{-\|x\|_{X}^{2}+2a(x,y)\right\}\stackrel{(6.9)}{=}C_{*}^{2}\|\Pi_{C}\,y\|_{X}^{2}\qquad\forall y\in Y,\end{aligned}

(6.14)

which gives the desired result. ∎

Using Lemmas 6.1 and 6.2, we derive the following upper bound of $\zeta^{*}$ .

Theorem 6.1.

Let the assumptions (A1)–(A4) and (D1)–(D4) be satisfied and $y\in Y$ be such that

L(y)>C_{*}\|\Pi_{C}\,y\|_{X}\|L\|_{Y^{*}}.

(6.15)

Then

\zeta^{*}\leq\frac{\mathcal{J}_{A}(y)+\varrho_{A}C_{*}\|a\|\|\Pi_{C}\,y\|_{X}}{L(y)-C_{*}\|\Pi_{C}\,y\|_{X}\|L\|_{Y^{*}}}.

(6.16)

Proof.

Let $y\in Y$ satisfy (6.15). By Lemma 6.2, there exists $z_{y}\in\mathcal{K}$ such that

\|y-z_{y}\|_{Y}\leq C_{*}\|\Pi_{C}\,y\|_{X}.

(6.17)

For any $\lambda>\frac{\mathcal{J}_{A}(y)+\varrho_{A}C_{*}\|a\|\|\Pi_{C}\,y\|_{X}}{L(y)-C_{*}\|\Pi_{C}\,y\|_{X}\|L\|_{Y^{*}}}$ , we have

\begin{aligned}\mathcal{J}(z_{y})-\lambda L(z_{y})&\stackrel{(6.4)}{\leq}\mathcal{J}_{A}(z_{y})-\lambda L(z_{y})\\ &=\mathcal{J}_{A}(y)-\lambda L(y)+[\mathcal{J}_{A}(z_{y})-\mathcal{J}_{A}(y)]+\lambda L(y-z_{y})\\ &\stackrel{(6.7),(6.8)}{\leq}\mathcal{J}_{A}(y)-\lambda L(y)+(\varrho_{A}\|a\|+\lambda\|L\|_{Y^{*}})\|y-z_{y}\|_{Y}\\ &\stackrel{(6.17)}{\leq}\mathcal{J}_{A}(y)-\lambda L(y)+C_{*}(\varrho_{A}\|a\|+\lambda\|L\|_{Y^{*}})\|\Pi_{C}\,y\|_{X}\\ &=\mathcal{J}_{A}(y)+\varrho_{A}C_{*}\|a\|\|\Pi_{C}\,y\|_{X}-\lambda\left[L(y)-C_{*}\|L\|_{Y^{*}}\|\Pi_{C}\,y\|_{X}\right]<0.\end{aligned}

Hence, $L(z_{y})>\mathcal{J}(z_{y})/\lambda\geq 0$ and

\zeta^{*}\stackrel{(6.1)}{\leq}\frac{\mathcal{J}(z_{y})}{L(z_{y})}<\lambda\qquad\forall\lambda>\frac{\mathcal{J}_{A}(y)+\varrho_{A}C_{*}\|a\|\|\Pi_{C}\,y\|_{X}}{L(y)-C_{*}\|\Pi_{C}\,y\|_{X}\|L\|_{Y^{*}}}.

This implies (6.16). ∎
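The majorant (6.16) can be evaluated explicitly in a small example. The sketch below assumes a toy problem that is not taken from the paper: $X=Y=\mathbb{R}^{2}$ with Euclidean norms, $a(x,y)=x\cdot y$ , $P_{A}=\{(s,0)\ |\;|s|\leq 1\}$ , $P_{C}=\{(0,t)\ |\;t\in\mathbb{R}\}$ and $L(y)=\ell_{1}y_{1}+\ell_{2}y_{2}$ with $\ell_{1}>0$ . Under these assumptions $\mathcal{K}=\{y\ |\;y_{2}=0\}$ , $\mathcal{J}_{A}(y)=|y_{1}|$ , $\|\Pi_{C}\,y\|_{X}=|y_{2}|$ , $\varrho_{A}=\|a\|=c_{*}=1$ and $\zeta^{*}=1/\ell_{1}$ . The function name `majorant` and its default arguments are hypothetical.

```python
import math


def majorant(y, ell, c_star=1.0, rho_A=1.0, norm_a=1.0):
    """Right-hand side of (6.16) for the toy problem.

    Returns None when the admissibility condition (6.15) is violated."""
    C_star = 1.0 / c_star
    J_A = abs(y[0])                      # max over |s| <= 1 of s * y_1
    pi_C = abs(y[1])                     # sqrt(max over t of -t^2 + 2 t y_2)
    L_y = ell[0] * y[0] + ell[1] * y[1]
    norm_L = math.hypot(ell[0], ell[1])  # dual norm of L in the Euclidean setting
    denom = L_y - C_star * pi_C * norm_L
    if denom <= 0.0:
        return None                      # (6.15) fails: y is too far from the cone K
    return (J_A + rho_A * C_star * norm_a * pi_C) / denom


ell = (1.0, 0.5)
zeta_star = 1.0 / ell[0]

for y in [(1.0, 0.0), (1.0, 0.01), (1.0, 0.1), (1.0, 5.0)]:
    print(y, majorant(y, ell))
```

For $y\in\mathcal{K}$ the sketch reproduces the bound (6.1), and the majorant deteriorates as $y$ moves away from $\mathcal{K}$ until (6.15) is violated.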

Remark 6.2.

If the assumption (D3′) holds and $y\in\mathcal{K}$ then $\mathcal{J}_{A}(y)=\mathcal{J}(y)$ , $\|\Pi_{C}\,y\|_{X}=0$ , and thus the bounds (6.1) and (6.16) coincide.

Remark 6.3.

If $y\in Y$ is sufficiently close to the cone $\mathcal{K}$ then the assumption (6.15) is satisfied. This can be achieved by a convenient numerical method, e.g., by the regularization method presented in the previous section.

Remark 6.4.

The bound (6.16) is computable if estimates of $\|L\|_{Y^{*}}$ , $\|\Pi_{C}\,y\|_{X}$ and $C_{*}$ are at our disposal. Computable bounds of $\|L\|_{Y^{*}}$ and $\|\Pi_{C}\,y\|_{X}$ are available in the literature on a posteriori error analysis. Computable bounds of $C_{*}=c_{*}^{-1}$ , that is, of the inf-sup constant $c_{*}$ , have appeared in the literature quite recently, see [16] and references therein.

Remark 6.5.

In [28], a computable majorant of the limit load was used in the Hencky plasticity problem to prove convergence of the standard finite element method and to detect locking effects that may arise when the simplest P1 elements are used.

7 Examples

In this section, we illustrate the abstract problem (1.1) with particular examples from nonlinear mechanics and discuss the validity of the assumptions (A1)–(A4), (B) and (D1)–(D4) presented in Section 2. In all examples we consider a bounded domain $\Omega\subset\mathbb{R}^{d}$ , $d=2,3$ , with Lipschitz continuous boundary $\partial\Omega$ . The outward unit normal to $\partial\Omega$ is denoted by $\nu$ . For the sake of simplicity, the abstract spaces $X$ and $Y$ will be represented by $L^{2}$ and $H^{1}$ spaces, respectively.

7.1 Limit analysis in classical perfect plasticity

Details of the mathematical theory of limit analysis in classical perfect plasticity may be found in [32] or [10]. For its engineering applications we refer, for example, to [8, 30]. The aim is to find the largest load factor at which plastic behaviour may be sustained, in the context of proportional loading. We briefly recapitulate results presented in [16, 28, 15, 14].

A body occupying the domain $\Omega$ is fixed on a part $\Gamma_{0}\subset\partial\Omega$ and surface forces $f\colon\Gamma_{f}\rightarrow\mathbb{R}^{d}$ act on the remaining part $\Gamma_{f}$ of $\partial\Omega$ . We assume that $\Gamma_{0}$ and $\Gamma_{f}$ have a positive surface measure. Let $F\colon\Omega\rightarrow\mathbb{R}^{d}$ denote the volume force. The external loads are parametrized by a scalar factor $\lambda\geq 0$ .

Next, we denote the space of symmetric matrices (second order tensors) by $\mathbb{R}^{d\times d}_{sym}$ . The Cauchy stress field $\sigma\colon\Omega\rightarrow\mathbb{R}^{d\times d}_{sym}$ satisfies the equilibrium equation and traction boundary condition


\mathrm{div}\,\sigma+\lambda F=0\quad\mbox{in}\;\Omega,

(7.1a)

\sigma\nu=\lambda f\quad\mbox{on}\;\Gamma_{f},

(7.1b)

and is plastically admissible in the sense that

\sigma\in B\;\;\mbox{in}\;\Omega,\quad B:=\{\tau\in\mathbb{R}^{d\times d}_{sym}\ |\;\;\varphi(\tau)\leq 0\}.

(7.2)

Here, $\varphi\colon\mathbb{R}^{d\times d}_{sym}\rightarrow\mathbb{R}$ , $\varphi(0)<0$ , is a convex function representing a yield criterion. For the sake of simplicity, we assume that $\varphi$ and thus $B$ are independent of the spatial variable.

The infinitesimal strain rate $\varepsilon\colon\Omega\rightarrow\mathbb{R}^{d\times d}_{sym}$ and the displacement rate $v\colon\Omega\rightarrow\mathbb{R}^{d}$ satisfy the relations

\varepsilon:=\varepsilon(v)=\frac{1}{2}[\nabla v+(\nabla v)^{\top}]\;\;\mbox{in}\;\Omega,\quad v=0\;\;\mbox{on}\;\Gamma_{0}.

(7.3)

The last ingredient of the perfectly plastic model is a plastic flow rule that relates $\sigma$ and $\varepsilon$ , and which is based on the set $B$ . This relation is represented by the principle of maximum plastic dissipation in quasistatic models or by a generalized projection of $\mathbb{R}^{d\times d}_{sym}$ onto $B$ in total strain models. We skip its definition, for the sake of brevity.

Formally, the limit load factor $\lambda^{*}$ is defined as the supremum over $\lambda\geq 0$ subject to (7.1a), (7.1b) and (7.2). To define $\lambda^{*}$ more precisely and in the form (2.6), it is necessary to introduce a convenient function space $X$ for stress fields. For this purpose define the Hilbert space

X:=L^{2}(\Omega;\mathbb{R}^{d\times d}_{sym})=\{\sigma\colon\Omega\rightarrow\mathbb{R}^{d\times d}_{sym}\ |\;\sigma_{ij}\in L^{2}(\Omega),\;\;i,j=1,2,\ldots,d\}

equipped with the scalar product and norm

(\sigma,\varepsilon)_{X}:=(\sigma,\varepsilon)_{2}=\int_{\Omega}\sigma:\varepsilon\,dx,\quad\|\sigma\|_{X}:=\|\sigma\|_{2}=\sqrt{(\sigma,\sigma)_{2}},

where $\sigma:\varepsilon=\sigma_{ij}\varepsilon_{ij}$ with the summation convention on repeated indices. The corresponding primal space $Y$ is chosen as follows:

Y:=\{v\in W^{1,2}(\Omega;\mathbb{R}^{d})\ |\;\;v=0\;\mbox{on }\Gamma_{0}\}.

It is also a Hilbert space representing rates of displacements with the following scalar product and norm:

(u,v)_{Y}:=(\nabla u,\nabla v)_{2},\quad\|v\|_{Y}:=\|\nabla v\|_{2}.

Using the spaces $X,Y$ and Green’s theorem, a weak formulation of (7.1a) and (7.1b) for fixed $\sigma$ reads as follows:

a(\sigma,v)=\lambda L(v)\quad\forall v\in Y,

(7.4)

where

a(\sigma,v):=\int_{\Omega}\,\sigma:\varepsilon(v)\,dx,\quad L(v):=\int_{\Omega}F\cdot v\,dx+\int_{\Gamma_{f}}f\cdot v\,ds,\quad v\in Y,

(7.5)

with $\sigma\in X$ , $F\in L^{2}(\Omega;\mathbb{R}^{d})$ and $f\in L^{2}(\Gamma_{f};\mathbb{R}^{d})$ . It is easy to see that $a$ is a continuous bilinear form in $X\times Y$ and $L\in Y^{*}$ . Using the notation from Section 1, one can write

\lambda^{*}=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\}=\sup_{\sigma\in P}\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\ a(\sigma,v),

where

P:=\{\sigma\in X\ |\;\;\sigma\in B\;\;\mbox{a.e. in }\Omega\},\quad\Lambda_{\lambda}:=\{\sigma\in X\ |\;\;a(\sigma,v)=\lambda L(v)\;\;\forall v\in Y\}.

(7.6)

The sets $P$ and $\Lambda_{\lambda}$ are closed, convex and non-empty in $X$ and represent plastically and statically admissible stresses, respectively.

We note that the set $P$ is defined in a pointwise sense. Consequently, the sets $P_{A}$ , $P_{C}$ and the functions $\mathcal{J}$ , $\mathcal{J}_{\alpha}$ , $\Pi_{\alpha}$ and $\Pi_{C}$ introduced in the previous sections may be also defined in a pointwise sense. To illustrate, we choose the von Mises yield criterion defined by

\varphi(\sigma):=|\sigma^{D}|-\gamma,\quad\gamma>0,\;\sigma^{D}=\sigma-\frac{1}{d}(\mathrm{tr}\,\sigma)I,\;|\sigma|:=\sqrt{\sigma_{ij}\sigma_{ij}},

(7.7)

where $I$ is the unit $d\times d$ matrix, $\mathrm{tr}\,\sigma$ denotes the trace of $\sigma$ , $\sigma^{D}$ is the deviatoric part of $\sigma$ and $\gamma>0$ is a given parameter representing the initial yield stress. From [32, 10, 16], it is known that $P$ can be decomposed according to $P=P_{A}+P_{C}$ , where

P_{A}=\{\tau\in X\ |\;\;|\tau|\leq\gamma\;\;\mbox{a.e. in }\Omega\},\quad P_{C}=\{\tau\in X\ |\;\;\exists q\in L^{2}(\Omega):\;\;\tau=qI\}.

Clearly, $P_{A}$ is bounded in $X$ and $P_{C}$ is a closed subspace of $X$ , that is, a convex cone. To prove (2.7), we use the well-known inf-sup condition for incompressible flow media with $c_{\Omega}>0$ :

\inf_{\begin{subarray}{c}x_{C}\in P_{C}\\ x_{C}\neq 0_{X}\end{subarray}}\ \sup_{\begin{subarray}{c}y\in Y\\ y\neq 0_{Y}\end{subarray}}\ \frac{a(x_{C},y)}{\|x_{C}\|_{X}\|y\|_{Y}}=\inf_{\begin{subarray}{c}\tau\in P_{C}\\ \tau\neq 0_{X}\end{subarray}}\ \sup_{\begin{subarray}{c}v\in Y\\ v\neq 0_{Y}\end{subarray}}\ \frac{\int_{\Omega}\,\tau:\varepsilon(v)\,dx}{\|\tau\|_{2}\|\nabla v\|_{2}}=\frac{1}{\sqrt{d}}\inf_{\begin{subarray}{c}q\in L^{2}(\Omega)\\ q\neq 0\end{subarray}}\ \sup_{\begin{subarray}{c}v\in Y\\ v\neq 0_{Y}\end{subarray}}\ \frac{\int_{\Omega}q\,\mathrm{div}\,v\,dx}{\|q\|_{2}\|\nabla v\|_{2}}\geq\frac{c_{\Omega}}{\sqrt{d}}.

(7.8)

Thus, the condition (2.7) holds with $c_{*}=c_{\Omega}/\sqrt{d}$ . Consequently, the assumptions (A1)–(A4), (D1)–(D4) from Section 2 are satisfied and from Theorem 2.3 it follows that

\lambda^{*}=\zeta^{*}=\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\sup_{\sigma\in P}\ a(\sigma,v)=\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\mathcal{J}(v).

Notice that if $\Gamma_{0}=\partial\Omega$ then it is necessary to use Theorem 4.1 with the weaker assumption (4.2) instead of (D4). In this case, we replace the space $L^{2}(\Omega)$ in (7.8) by $L^{2}_{0}(\Omega)=\{q\in L^{2}(\Omega)\ |\;\int_{\Omega}q\,dx=0\}$ , see [28, 16].

The primal functional $\mathcal{J}$ for the von Mises yield criterion is given by

\mathcal{J}(v)=\sup_{\sigma\in P}\ a(\sigma,v)=\left\{\begin{array}[]{ll}\displaystyle\int_{\Omega}\gamma|\varepsilon(v)|\,dx,&\mathrm{div}\,v=0\;\mbox{in }\Omega,\\ +\infty,&\mbox{otherwise},\end{array}\right.\qquad\forall v\in Y.

This functional may have no minimizers in $Y$ . To guarantee that the primal problem is solvable, it is necessary to use another choice of $X$ and $Y$ , as was done, for example, in [9, 32, 10]. In particular, the assumptions (C1)–(C3) of Theorem 2.2 were verified in [9, 10].

The functions $\mathcal{J}_{\alpha}$ , $\mathcal{J}_{A}$ and $\Pi_{C}$ for the von Mises yield criterion can be found in the following forms:

\mathcal{J}_{\alpha}(v):=\int_{\Omega}j_{\alpha}(\varepsilon(v))\,dx,\quad j_{\alpha}(\varepsilon)=\left\{\begin{array}[]{cl}\frac{1}{2}\alpha|\varepsilon|^{2},&\alpha|\varepsilon^{D}|\leq\gamma,\\[2.0pt] \frac{1}{2d}\alpha(\mathrm{tr}\,\varepsilon)^{2}+\gamma|\varepsilon^{D}|-\frac{\gamma^{2}}{2\alpha},&\alpha|\varepsilon^{D}|\geq\gamma,\end{array}\right.

\mathcal{J}_{A}(v)=\int_{\Omega}\gamma|\varepsilon(v)|\,dx,\quad\|\Pi_{C}\,v\|_{2}=d^{-1/2}\|\mathrm{div}\,v\|_{2}\quad\forall v\in Y.

Let us recall that they are important for the regularization method and the computable majorant presented in the previous sections. We refer to [14, 15, 28, 16] for more details.
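The closed form of $j_{\alpha}$ can be cross-checked pointwise. The following sketch evaluates the two-branch formula for the von Mises criterion and compares it with the value of $\tau:\varepsilon-\frac{1}{2\alpha}|\tau|^{2}$ at the candidate maximizer $\tau\in B$ , which is taken here to be $\alpha\varepsilon$ with its deviatoric part scaled back to the ball of radius $\gamma$ . This characterization of the maximizer, the helper names and the sample data are assumptions of the example and are not quoted from the paper.

```python
import math


def split(eps):
    """Return (trace, deviator) of a symmetric d x d matrix given as nested lists."""
    d = len(eps)
    tr = sum(eps[i][i] for i in range(d))
    dev = [[eps[i][j] - (tr / d if i == j else 0.0) for j in range(d)] for i in range(d)]
    return tr, dev


def frob(m):
    """Frobenius norm |m| = sqrt(m_ij m_ij)."""
    return math.sqrt(sum(v * v for row in m for v in row))


def j_alpha(eps, alpha, gamma):
    """Two-branch closed form of the regularized density for von Mises."""
    d = len(eps)
    tr, dev = split(eps)
    nd = frob(dev)
    if alpha * nd <= gamma:
        return 0.5 * alpha * frob(eps) ** 2
    return alpha * tr ** 2 / (2.0 * d) + gamma * nd - gamma ** 2 / (2.0 * alpha)


def j_alpha_from_maximizer(eps, alpha, gamma):
    """Evaluate tau : eps - |tau|^2 / (2 alpha) at the candidate maximizer tau."""
    d = len(eps)
    tr, dev = split(eps)
    nd = frob(dev)
    # Scale factor of the deviator: alpha if admissible, otherwise radial return.
    s = alpha if alpha * nd <= gamma else gamma / nd
    tau = [[s * dev[i][j] + (alpha * tr / d if i == j else 0.0) for j in range(d)]
           for i in range(d)]
    tau_eps = sum(tau[i][j] * eps[i][j] for i in range(d) for j in range(d))
    return tau_eps - frob(tau) ** 2 / (2.0 * alpha)


eps = [[0.3, 0.1], [0.1, -0.2]]
gamma = 1.0
for alpha in (1.0, 10.0):   # first value: quadratic branch, second: linear-growth branch
    print(alpha, j_alpha(eps, alpha, gamma), j_alpha_from_maximizer(eps, alpha, gamma))
```

Both branches agree at the threshold $\alpha|\varepsilon^{D}|=\gamma$ , where each equals $\frac{1}{2d}\alpha(\mathrm{tr}\,\varepsilon)^{2}+\frac{\gamma^{2}}{2\alpha}$ .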

Remark 7.1.

If we choose the Drucker-Prager or Mohr-Coulomb yield criterion in (7.2) instead of the von Mises criterion, it is again possible to find an appropriate split $P=P_{A}+P_{C}$ such that the assumptions (D3) and even (D3′) are satisfied. For these criteria, however, the cone $P_{C}$ is not a subspace of $X$ . Therefore, it is necessary to work with the inf-sup condition on convex cones, see [16].

7.2 Plastically admissible stresses in strain-gradient plasticity

In the next two subsections, we consider as further examples the models of strain-gradient plasticity presented in [25, 24, 6, 26]. First, following [26], we introduce a subproblem that enables us to decide whether a given stress tensor is plastically admissible or not. We note that this problem is simple in classical plasticity, where the yield criterion can be verified pointwise (see, for example, the definition of $P$ in (7.6)). However, plastic yield criteria in strain-gradient plasticity are non-local and their verification is highly non-trivial.

Besides the space $\mathbb{R}^{d\times d}_{sym}$ defined in Section 7.1, we also use the following spaces of second and third order tensors, respectively:

\mathbb{R}^{d\times d}_{sym,0}:=\{\pi\in\mathbb{R}^{d\times d}_{sym}\ |\;\mathrm{tr}\,\pi=0\},

\mathbb{R}^{d\times d\times d}_{sym,0}:=\{\Pi\in\mathbb{R}^{d\times d\times d}\ |\;\Pi_{ijk}=\Pi_{jik},\;i,j,k=1,2,\ldots,d,\;\Pi_{ppk}=0,\;k=1,2,\ldots,d\}.

Thus, the third order tensor $\Pi$ belongs to $\mathbb{R}^{d\times d\times d}_{sym,0}$ if it is symmetric and deviatoric with respect to the first two indices.

We assume that $\sigma\colon\Omega\rightarrow\mathbb{R}^{d\times d}_{sym}$ is a given stress field and $\sigma^{D}\colon\Omega\rightarrow\mathbb{R}^{d\times d}_{sym,0}$ denotes its deviatoric part. The theory of strain gradient plasticity makes use of second- and third-order tensors $\pi\colon\Omega\rightarrow\mathbb{R}^{d\times d}_{sym,0}$ and $\Pi\colon\Omega\rightarrow\mathbb{R}^{d\times d\times d}_{sym,0}$ that represent microstresses. We say that $\sigma$ is plastically admissible if there exists a pair $(\pi,\Pi)$ such that

\sigma^{D}=\pi-\mbox{div}\,\Pi\quad\mbox{in }\Omega,\quad\Pi\nu=0\;\;\mbox{on }\Gamma_{F},

(7.9)

\varphi_{\ell}(\pi,\Pi):=\sqrt{|\pi|^{2}+\ell^{-2}|\Pi|^{2}}-\gamma\leq 0\quad\mbox{in }\Omega,

(7.10)

where $\gamma>0$ is the yield stress, $\ell>0$ is the length parameter, $|\Pi|^{2}=\Pi\circ\Pi:=\Pi_{ijk}\Pi_{ijk}$ and $\Gamma_{F}\subset\partial\Omega$ . The part of $\partial\Omega$ complementary to $\Gamma_{F}$ in $\partial\Omega$ is denoted by $\Gamma_{H}$ .

We note that the yield criterion (7.10) can be viewed as an extension of the classical condition (7.7). Indeed, setting $\Pi=0$ we derive the sufficient condition $|\sigma^{D}|\leq\gamma$ for $\sigma$ to be plastically admissible. Unlike the classical case, the stress $\sigma$ can be plastically admissible even if $|\sigma^{D}|>\gamma$ .

If $\sigma$ is plastically admissible then $\lambda\sigma$ is also plastically admissible for any $\lambda\in[0,1]$ . This parametrization motivates us to introduce the following problem: find the maximal value $\lambda^{*}$ of $\lambda\geq 0$ for which $\lambda\sigma$ is plastically admissible in the sense of (7.9) and (7.10). Clearly, if $\lambda^{*}>1$ then $\sigma$ is admissible.

Let us define $\lambda^{*}$ more precisely, using the abstract problem (2.6). We assume that all components of $\sigma$ , $\pi$ and $\Pi$ belong to $L^{2}(\Omega)$ , that is, $\sigma\in L^{2}(\Omega;\mathbb{R}^{d\times d}_{sym})$ , $\pi\in L^{2}(\Omega;\mathbb{R}^{d\times d}_{sym,0})$ and $\Pi\in L^{2}(\Omega;\mathbb{R}^{d\times d\times d}_{sym,0})$ . The space $X$ is defined as the space of pairs $(\pi,\Pi)$ endowed with the scalar product

((\pi,\Pi),(\bar{\pi},\bar{\Pi}))_{X}:=\int_{\Omega}(\pi:\bar{\pi}+\Pi\circ\bar{\Pi})\,dx.

The primal space

Y:=\{q\in L^{2}(\Omega;\mathbb{R}^{d\times d}_{sym,0})\ |\;\;\nabla q\in L^{2}(\Omega;\mathbb{R}^{d\times d\times d}_{sym,0}),\;q=0\;\mbox{on }\Gamma_{H}\}

is the Hilbert space of admissible plastic strain rates with the scalar product

(q,\bar{q})_{Y}:=\int_{\Omega}(q:\bar{q}+\nabla q\circ\nabla\bar{q})\,dx.

Using the spaces $X$ and $Y$ , we introduce the following weak form of (7.9):

\int_{\Omega}[\pi:q+\Pi\circ\nabla q]\,dx=\int_{\Omega}\sigma^{D}:q\,dx\quad\forall\ q\in Y,

(7.11)

and define the bilinear form $a\colon X\times Y\rightarrow\mathbb{R}$ and the functional $L\in Y^{*}$ by

a((\pi,\Pi),q):=\int_{\Omega}[\pi:q+\Pi\circ\nabla q]\,dx,\quad L(q):=\int_{\Omega}\sigma^{D}:q\,dx.

Then the dual problem (2.6) reads

\lambda^{*}=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\}=\sup_{(\pi,\Pi)\in P}\inf_{\begin{subarray}{c}q\in Y\\ L(q)=1\end{subarray}}\ a((\pi,\Pi),q),

where

P:=\{(\pi,\Pi)\in X\ |\;\;\sqrt{|\pi|^{2}+\ell^{-2}|\Pi|^{2}}\leq\gamma\;\;\mbox{a.e. in }\Omega\},

\Lambda_{\lambda}:=\{(\pi,\Pi)\in X\ |\;\;a((\pi,\Pi),q)=\lambda L(q)\;\;\forall q\in Y\}.

From (7.10), it follows that $P$ is bounded in $X$ , i.e. the assumption (B) of Theorem 2.1 is satisfied. Thus we have

\lambda^{*}=\zeta^{*}=\inf_{\begin{subarray}{c}q\in Y\\ L(q)=1\end{subarray}}\sup_{(\pi,\Pi)\in P}\ a((\pi,\Pi),q)=\inf_{\begin{subarray}{c}q\in Y\\ L(q)=1\end{subarray}}\mathcal{J}(q).

In this case, the functional $\mathcal{J}$ can be found in the form

\mathcal{J}(q)=\int_{\Omega}\gamma\sqrt{|q|^{2}+\ell^{2}|\nabla q|^{2}}\,dx\quad\forall q\in Y.

Although $\mathcal{J}$ is finite-valued everywhere, it is not coercive in $Y$ . Therefore, a certain relaxation of the problem is necessary if we wish to properly define a minimizer of $\mathcal{J}$ and guarantee its existence. Such an analysis has not been done for this problem and we leave this as a topic for further investigation.
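The integrand of $\mathcal{J}$ is the pointwise support function of the admissible set defined by (7.10). The sketch below checks this numerically in a scalar setting, which is an assumption of the example: the tensors $q$ , $\nabla q$ , $\pi$ and $\Pi$ are replaced by real numbers `q`, `g`, `pi_` and `Pi_`, and the supremum of $\pi q+\Pi g$ over the ellipse $\pi^{2}+\ell^{-2}\Pi^{2}=\gamma^{2}$ is approximated by brute force. The function names and sample values are hypothetical.

```python
import math


def dissipation(q, g, gamma, ell):
    """gamma * sqrt(q^2 + ell^2 g^2): scalar version of the integrand of J."""
    return gamma * math.hypot(q, ell * g)


def sup_over_boundary(q, g, gamma, ell, n=3600):
    """Brute-force maximum of pi_*q + Pi_*g over the ellipse
    pi_^2 + (Pi_/ell)^2 = gamma^2, sampled at n equally spaced angles."""
    best = -math.inf
    for k in range(n):
        theta = 2.0 * math.pi * k / n
        pi_ = gamma * math.cos(theta)
        Pi_ = gamma * ell * math.sin(theta)
        best = max(best, pi_ * q + Pi_ * g)
    return best


q, g, gamma, ell = 0.7, -1.3, 2.0, 0.5
exact = dissipation(q, g, gamma, ell)
sampled = sup_over_boundary(q, g, gamma, ell)
print(exact, sampled)
```

Since a linear function attains its maximum over a convex set on the boundary, sampling the ellipse is sufficient; the sampled value stays below the closed form and approaches it as `n` grows.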

The primal and dual problems have been solved by regularization (penalization) methods in [26]. In particular, the regularized functional $\mathcal{J}_{\alpha}$ defined by (5.1) takes the form

\mathcal{J}_{\alpha}(q):=\int_{\Omega}D_{\alpha}(q,\nabla q)\,dx,\quad D_{\alpha}(q,\nabla q)=\left\{\begin{array}[]{cl}\frac{\alpha}{2}(|q|^{2}+\ell^{2}|\nabla q|^{2}),&\sqrt{|q|^{2}+\ell^{2}|\nabla q|^{2}}\leq\frac{1}{\alpha}\\[5.69054pt] \sqrt{|q|^{2}+\ell^{2}|\nabla q|^{2}}-\frac{1}{2\alpha},&\sqrt{|q|^{2}+\ell^{2}|\nabla q|^{2}}\geq\frac{1}{\alpha}.\end{array}\right.

Reliable lower and upper bounds of $\lambda^{*}$ have also been estimated in [26] using the regularization methods.

Remark 7.2.

Other choices of yield functions are possible in (7.10). For example, the following more general function has been considered in [26, 24]:

\varphi_{\ell,r}(\pi,\Pi):=\left\{\begin{array}[]{cc}\left[|\pi|^{r}+(\ell^{-1}|\Pi|)^{r}\right]^{1/r}-\gamma,&1\leq r<+\infty,\\ \max\{|\pi|,\ \ell^{-1}|\Pi|\}-\gamma,&r=+\infty.\end{array}\right.

(7.12)

The set $P$ corresponding to this function remains bounded and thus the equality $\lambda^{*}=\zeta^{*}$ holds. Denoting $r^{\prime}=(1-1/r)^{-1}$ we find the functional $\mathcal{J}$ in the following form:

\mathcal{J}(q)=\left\{\begin{array}[]{cc}\int_{\Omega}\gamma[|q|^{r^{\prime}}+(\ell|\nabla q|)^{r^{\prime}}]^{1/r^{\prime}}\,dx,&1\leq r^{\prime}<+\infty,\\[5.69054pt] \int_{\Omega}\gamma\max\{|q|,\ \ell|\nabla q|\}\,dx,&r^{\prime}=+\infty.\end{array}\right.

(7.13)

7.3 Limit (load) analysis in strain-gradient plasticity

Limit analysis in gradient-enhanced plasticity has been studied in [13, 23] for a model in which size-dependence enters through the gradient of a scalar function of the plastic strain. Here, we consider the model from [25, 24, 6, 26], where the gradient is applied to the entire plastic strain.

We use the same tensors $\sigma$ , $\pi$ , $\Pi$ and external forces $F$ and $f$ as in Sections 7.1 and 7.2. Let us note that the pair of boundaries $(\Gamma_{F},\Gamma_{H})$ defined in Section 7.2 may differ from $(\Gamma_{0},\Gamma_{f})$ introduced in Section 7.1. The limit analysis problem for the strain gradient plasticity reads: find the supremum $\lambda^{*}$ over all $\lambda\geq 0$ for which there exist $\sigma$ , $\pi$ , $\Pi$ such that

\mathrm{div}\,\sigma+\lambda F=0\;\;\mbox{in}\;\Omega,\quad\sigma\nu=\lambda f\;\;\mbox{on}\;\Gamma_{f},

(7.14)

\sigma^{D}=\pi-\mbox{div}\,\Pi\quad\mbox{in }\Omega,\quad\Pi\nu=0\;\;\mbox{on }\Gamma_{F},

(7.15)

\varphi_{\ell}(\pi,\Pi)=\sqrt{|\pi|^{2}+\ell^{-2}|\Pi|^{2}}\leq\gamma\quad\mbox{in }\Omega,\quad\gamma,\ell>0.

(7.16)

We see that (7.14) coincides with (7.1a) and (7.1b) from Section 7.1. However, plastic admissibility is now understood in the sense of Section 7.2 (see (7.15) and (7.16)) instead of the pointwise condition (7.2).

To rewrite this problem in the form (2.6) or (2.1), we split $\sigma$ as follows:

\sigma=pI+\sigma^{D}=pI+\pi-\mathrm{div}\,\Pi\quad\mbox{in }\Omega.

(7.17)

We denote by $X$ the $L^{2}$ -space of all admissible triples $(p,\pi,\Pi)$ . Using (7.17), the equations (7.14) and (7.15) can be rewritten in the following weak form:

a((p,\pi,\Pi),v)=\lambda L(v)\quad\forall v\in Y,

where

a((p,\pi,\Pi),v):=\int_{\Omega}[\,p\,\mathrm{div}\,v+\pi:\varepsilon(v)+\Pi\circ\nabla\varepsilon(v)]\,dx,

L(v):=\int_{\Omega}F\cdot v\,dx+\int_{\Gamma_{f}}f\cdot v\,ds,

and

Y:=\{v\in W^{2,2}(\Omega;\mathbb{R}^{d})\ |\;\;v=0\;\mbox{on }\Gamma_{0},\;\;\varepsilon(v)=0\;\mbox{on }\Gamma_{H}\}.

The space $Y$ is equipped with the standard norm denoted by $\|.\|_{Y}$ . The set $\Lambda_{\lambda}$ remains the same as in (2.5) and the set $P$ of plastically admissible stresses reads

P:=\{(p,\pi,\Pi)\in X\ |\;\;\sqrt{|\pi|^{2}+\ell^{-2}|\Pi|^{2}}\leq\gamma\;\;\mbox{a.e. in }\Omega\}.

Thus, we can define the limit analysis problem as follows:

\lambda^{*}=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\}=\sup_{(p,\pi,\Pi)\in P}\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\ a((p,\pi,\Pi),v).

For analysis of the primal problem (2.1), it is convenient to use the split $P=P_{A}+P_{C}$ , where

P_{A}:=\{(p,\pi,\Pi)\in X\ |\;\;p=0,\;\;\sqrt{|\pi|^{2}+\ell^{-2}|\Pi|^{2}}\leq\gamma\;\;\mbox{a.e. in }\Omega\},

P_{C}:=\{(p,\pi,\Pi)\in X\ |\;\;\pi=0,\;\Pi=0\}.

It is easy to check that $P_{A}$ is bounded in $X$ and $P_{C}$ is a closed linear subspace of $X$ . We have

\zeta^{*}=\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\ \sup_{(p,\pi,\Pi)\in P}a((p,\pi,\Pi),v)=\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\mathcal{J}(v),

where

\mathcal{J}(v)=\left\{\begin{array}[]{cc}\int_{\Omega}\gamma\sqrt{|\varepsilon(v)|^{2}+\ell^{2}|\nabla\varepsilon(v)|^{2}}\,dx,&\mbox{if}\;\;\mathrm{div}\,v=0\;\mbox{in }\Omega,\\ +\infty,&\mbox{otherwise}.\end{array}\right.

The inf-sup term in (2.7) becomes

\inf_{\begin{subarray}{c}(p,\pi,\Pi)\in P_{C}\\ (p,\pi,\Pi)\neq 0\end{subarray}}\ \sup_{\begin{subarray}{c}v\in Y\\ v\neq 0\end{subarray}}\ \frac{a((p,\pi,\Pi),v)}{\|(p,\pi,\Pi)\|_{X}\|v\|_{Y}}=\inf_{\begin{subarray}{c}p\in L^{2}(\Omega)\\ p\neq 0\end{subarray}}\ \sup_{\begin{subarray}{c}v\in Y\\ v\neq 0\end{subarray}}\ \frac{\int_{\Omega}p(\mathrm{div}\,v)\,dx}{\|p\|_{2}\|v\|_{Y}}.

(7.18)

For the equality $\lambda^{*}=\zeta^{*}$ to be satisfied it suffices to show that the right-hand side of (7.18) is positive on an appropriate factor space of $L^{2}(\Omega)$ . Such an analysis seems to be more involved and we leave this as a topic for further investigation.

Remark 7.3.

If we replace the yield function $\varphi_{\ell}$ in (7.16) with $\varphi_{\ell,r}$ defined by (7.12) then the set $P_{C}$ and the inf-sup expression (7.18) remain the same. For $\mathrm{div}\,v=0$ , the corresponding functional $\mathcal{J}(v)$ is given by (7.13) with $q$ replaced by $\varepsilon(v)$ ; otherwise $\mathcal{J}(v)=+\infty$ .

7.4 Limit analysis for a delamination problem

The last example is devoted to a model for delamination, inspired by [2]. Let $\Omega\subset\mathbb{R}^{2}$ denote the domain occupied by an elastic body, with boundary $\partial\Omega$ . The body is a laminated composite, comprising two distinct materials. The geometry is idealized with one material, referred to as the bulk, comprising the entire domain with the exception of a thin layer of the second material. This thin layer is treated as a line $\Gamma_{b}\subset\Omega$ , and separation or delamination may occur along this line.

We follow [2] and consider a problem with a symmetric geometry and loading, as shown in Figure 1(a). Zero displacements in the normal ( $x_{1}$ ) direction are prescribed along the boundary $\Gamma_{\ell}$ , while on $\Gamma_{f}$ a surface force $\lambda f$ is applied, where $\lambda\geq 0$ is a load factor. The remainder of the boundary $\Gamma_{t}$ is unconstrained and traction-free. The surface force as well as a body force $\lambda F$ act symmetrically along the $x_{1}$ axis so that $F(x_{1},x_{2})=F(x_{1},-x_{2})$ , the same applying to $f$ .

Figure 1: (a) Composite body showing domain and loading; (b) Upper half of symmetric body and loading

Given the symmetry of the problem we may confine attention to the upper half $\Omega^{+}$ of the domain, shown in Figure 1(b).

The boundary conditions set out above have to be augmented with a condition along $\Gamma_{b}$ . This takes the form of conditions on the traction vector $t=\sigma\nu$ : from symmetry the tangential component $\sigma\nu\cdot\tau:=\sigma_{12}$ must be zero. Here and henceforth subscripts $\nu$ and $\tau$ refer respectively to normal and tangential components. The condition in the normal direction is a constitutive relation that (in the original domain) gives the normal traction $\sigma\nu\cdot\nu:=\sigma_{22}$ as a function of the separation $[u_{2}]$ between the upper and lower surfaces along $\Gamma_{b}$ . Here $u_{2}$ is the displacement in the normal direction and $[u_{2}]=u^{+}_{2}-u^{-}_{2}$ denotes the jump in displacement at the interface. For the symmetrized problem one may replace the jump $[u_{2}]$ by $2u_{2}^{+}:=2u_{2}$ . This has to be supplemented by a non-interpenetration condition, which we do not impose for now, but return to later.

The boundary conditions on $\partial\Omega^{+}$ are then as follows:

\begin{array}[]{ll}u_{1}=0,\;\sigma_{12}=0&\quad\mbox{on}\ \Gamma_{\ell},\\ \sigma\nu=\lambda f&\quad\mbox{on}\ \Gamma_{f},\\ \sigma\nu=0&\quad\mbox{on}\ \Gamma_{t},\\ \sigma_{12}=0,\ \ \sigma_{22}(x_{1})\in H(u_{2}(x_{1}))&\quad\mbox{on}\ \Gamma_{b},\end{array}

(7.19)

where $H$ denotes a multivalued step function in $\mathbb{R}^{1}$ . Examples of $H$ can be found in [2]. For the purposes of this paper, we shall assume that the values of $H$ belong to the interval $[-\gamma,\gamma]$ where $\gamma>0$ is a prescribed threshold for delamination. Then $H$ can be either the projection of $\mathbb{R}^{1}$ onto $[-\gamma,\gamma]$ or the multifunction $H(x)=\gamma\,\mathrm{sign}\,x$ for $x\neq 0$ and $H(0)=[-\gamma,\gamma]$ .
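The two admissible choices of $H$ mentioned above can be illustrated with a minimal Python sketch. The function names `h_projection` and `h_sign` are hypothetical and introduced only for this illustration; the multivalued case is represented, as an assumption of the sketch, by a closed interval returned as a pair `(lo, hi)`.

```python
from typing import Tuple


def h_projection(x: float, gamma: float) -> float:
    """Single-valued choice: projection of x onto the interval [-gamma, gamma]."""
    if gamma <= 0.0:
        raise ValueError("gamma must be positive")
    return max(-gamma, min(gamma, x))


def h_sign(x: float, gamma: float) -> Tuple[float, float]:
    """Multivalued choice gamma*sign(x), returned as a closed interval (lo, hi).

    For x != 0 the interval degenerates to a single point; at x = 0 the
    whole interval [-gamma, gamma] is returned.
    """
    if gamma <= 0.0:
        raise ValueError("gamma must be positive")
    if x > 0.0:
        return (gamma, gamma)
    if x < 0.0:
        return (-gamma, -gamma)
    return (-gamma, gamma)
```

In both cases every returned value lies in $[-\gamma,\gamma]$, which is the only property of $H$ used in the limit analysis below.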

The bulk material is modelled as linear elastic, and the stress field satisfies the equilibrium equation in $\Omega^{+}$ :

\displaystyle\mathrm{div}\,\sigma+\lambda F=0.

(7.20)

The limit load for the problem can be defined formally as follows: find the supremum $\lambda^{*}\geq 0$ over all $\lambda\geq 0$ for which there exists a stress field $\sigma\colon\Omega^{+}\rightarrow\mathbb{R}^{2\times 2}_{sym}$ that satisfies (7.20) and

\begin{array}[]{ll}\sigma_{12}=0&\quad\mbox{on}\ \Gamma_{\ell},\\ \sigma\nu=\lambda f&\quad\mbox{on}\ \Gamma_{f},\\ \sigma\nu=0&\quad\mbox{on}\ \Gamma_{t},\\ \sigma_{12}=0,\ \ |\sigma_{22}|\leq\gamma&\quad\mbox{on}\ \Gamma_{b}.\end{array}

(7.21)

To rewrite this problem in the form (2.6) and (2.1), we introduce an auxiliary variable $\Xi\in L^{2}(\Gamma_{b})$ that coincides with $-\sigma_{22}$ on $\Gamma_{b}$ in a weak sense. Then the space $X=L^{2}(\Omega^{+},\mathbb{R}^{2\times 2}_{sym})\times L^{2}(\Gamma_{b})$ contains pairs $(\sigma,\Xi)$ and

Y:=\{v=(v_{1},v_{2})\in W^{1,2}(\Omega^{+},\mathbb{R}^{2})\ |\;\;v_{1}=0\;\mbox{on }\Gamma_{\ell}\}

consists of admissible displacement fields. Using the spaces $X$ and $Y$ one can rewrite the equations in (7.20)–(7.21) in the following weak form:

a((\sigma,\Xi),v)=\lambda L(v)\quad\forall v\in Y,

where

a((\sigma,\Xi),v)=\int_{\Omega^{+}}\sigma:\varepsilon(v)\,dx+\int_{\Gamma_{b}}\Xi v_{2}\,dx,\quad\varepsilon(v)=\frac{1}{2}[\nabla v+(\nabla v)^{\top}]

and

L(v)=\int_{\Omega^{+}}F\cdot v\,dx+\int_{\Gamma_{f}}f\cdot v\,ds,\quad v\in Y.

The set $P$ and its decomposition into $P_{A}$ and $P_{C}$ are defined as follows:

P:=\{(\sigma,\Xi)\in X\ |\;\;|\Xi|\leq\gamma\;\mbox{on }\Gamma_{b}\},\quad P_{C}:=\{(\sigma,\Xi)\in X\ |\;\;\Xi=0\;\mbox{on }\Gamma_{b}\},

P_{A}:=\{(\sigma,\Xi)\in X\ |\;\;\sigma=0\;\mbox{on }\Omega^{+},\;|\Xi|\leq\gamma\;\mbox{on }\Gamma_{b}\}.

We also define $\Lambda_{\lambda}:=\{(\sigma,\Xi)\in X\ |\ a((\sigma,\Xi),v)=\lambda L(v)\ \ \forall v\in Y\}$ . Then, the dual and primal problems read

\lambda^{*}=\sup\{\lambda\in\mathbb{R}_{+}\ |\;\;P\cap\Lambda_{\lambda}\neq\emptyset\}=\sup_{(\sigma,\Xi)\in P}\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\ a((\sigma,\Xi),v)

(7.22)

and

\zeta^{*}=\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\sup_{(\sigma,\Xi)\in P}a((\sigma,\Xi),v)=\inf_{\begin{subarray}{c}v\in Y\\ L(v)=1\end{subarray}}\mathcal{J}(v).

(7.23)

To show that $\lambda^{*}=\zeta^{*}$ we use Theorem 4.1. In particular, we have

H=\{(\sigma,\Xi)\in X\ |\;\;a((\sigma,\Xi),v)=0\;\;\forall v\in Y\}

and

P_{C}/H=\{(\sigma,\Xi)\in X\ |\;\;\exists v\in Y:\;\sigma=\varepsilon(v),\;\;\Xi=0\}.

The latter identity follows, for example, from [20]. Then the inf-sup condition (4.2) is a consequence of the Korn inequality [20].

In addition, if $\lambda^{*}=\zeta^{*}<+\infty$ then one can find analytical solutions $v^{*}\in Y$ and $(\sigma^{*},\Xi^{*})\in P\cap\Lambda_{\lambda^{*}}$ to (7.23) and (7.22), respectively. Indeed, from (2.2), (6.2) and Lemma 6.1, it follows that

\mathcal{J}(v)=\left\{\begin{array}[]{cl}\int_{\Gamma_{b}}\gamma|v_{2}|\,dx,&v\in\mathcal{K},\\ +\infty,&v\not\in\mathcal{K},\end{array}\right.,\quad\mathcal{K}=\{v\in Y\ |\;\;v=(0,q),\;q\in\mathbb{R}\},

that is, $\mathrm{dom}\,\mathcal{J}=\mathcal{K}$ . It is readily seen that the feasible set $\mathrm{dom}\,\mathcal{J}\cap\{v\in Y\ |\;L(v)=1\}$ in (7.23) is the singleton consisting of the function

v^{*}=(v_{1}^{*},v_{2}^{*}),\quad v_{1}^{*}=0,\quad v_{2}^{*}=\left(\int_{\Omega^{+}}F_{2}\,dx+\int_{\Gamma_{f}}f_{2}\,ds\right)^{-1},

provided that $\int_{\Omega^{+}}F_{2}\,dx+\int_{\Gamma_{f}}f_{2}\,ds\neq 0$ . If it is so then $v^{*}$ is also the unique solution to the primal problem (7.23) and

\lambda^{*}=\zeta^{*}=\gamma|\Gamma_{b}|\Big{|}\int_{\Omega^{+}}F_{2}\,dx+\int_{\Gamma_{f}}f_{2}\,ds\Big{|}^{-1}<+\infty.
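As a cross-check on the expression for $\mathcal{J}$ and the set $\mathcal{K}$ used above, the supremum over $P=P_{A}+P_{C}$ can also be evaluated directly. The following is only a sketch under stated assumptions: $\Omega^{+}$ is connected and $\Gamma_{\ell}$ is a segment of positive length on a line $x_{1}=\mathrm{const}$ (consistent with $x_{1}$ being its normal direction). The rigid-motion parameters $\alpha$, $\beta$, $\omega$ are symbols introduced here for illustration only.

```latex
% Step 1: split the supremum; sigma ranges over all of L^2, Xi over a ball.
\begin{aligned}
\sup_{(\sigma,\Xi)\in P} a((\sigma,\Xi),v)
  &= \sup_{\sigma\in L^{2}(\Omega^{+},\mathbb{R}^{2\times 2}_{sym})}
       \int_{\Omega^{+}}\sigma:\varepsilon(v)\,dx
   \;+\; \sup_{|\Xi|\leq\gamma}\int_{\Gamma_{b}}\Xi\, v_{2}\,ds \\
  &= \begin{cases}
       \gamma\int_{\Gamma_{b}}|v_{2}|\,ds, & \varepsilon(v)=0 \ \text{in } \Omega^{+},\\[2pt]
       +\infty, & \text{otherwise}
     \end{cases}
\end{aligned}
% (take sigma = t*eps(v), t -> +infinity, and Xi = gamma*sign(v_2)).

% Step 2: eps(v) = 0 characterizes rigid body motions in the plane.
\varepsilon(v)=0
  \iff v(x)=(\alpha-\omega x_{2},\ \beta+\omega x_{1}),
  \qquad \alpha,\beta,\omega\in\mathbb{R}.

% Step 3: v_1 = 0 on Gamma_ell, where x_2 varies, forces alpha = omega = 0.
v_{1}=0 \ \text{on } \Gamma_{\ell}
  \ \Longrightarrow\ \alpha=\omega=0
  \ \Longrightarrow\ v=(0,\beta)\in\mathcal{K}.
```

On $\mathcal{K}$ the component $v_{2}$ is constant, so $\mathcal{J}(v)=\gamma|\Gamma_{b}||v_{2}|$, which gives the closed-form value stated above once $L(v)=1$ is imposed.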

By analysis of the saddle-point problem related to (7.23) and (7.22), we find that the solution $(\sigma^{*},\Xi^{*})$ to the dual problem (7.22) satisfies $\Xi^{*}=\gamma\mathrm{sign}(v_{2}^{*})$ and

\int_{\Omega^{+}}\sigma^{*}:\varepsilon(v)\,dx=\lambda^{*}L(v)-\int_{\Gamma_{b}}\Xi^{*}v_{2}\,ds\quad\forall v\in Y.

(7.24)
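A quick consistency check of (7.24), given here only as a sketch, is to test it with $v=v^{*}$. Since $v^{*}=(0,v_{2}^{*})$ is constant, $\varepsilon(v^{*})=0$ and $L(v^{*})=1$, and the identity reduces to the limit value obtained earlier:

```latex
\begin{aligned}
0 &= \int_{\Omega^{+}}\sigma^{*}:\varepsilon(v^{*})\,dx
   = \lambda^{*}L(v^{*})-\int_{\Gamma_{b}}\Xi^{*}v_{2}^{*}\,ds \\
  &= \lambda^{*}-\gamma\,\mathrm{sign}(v_{2}^{*})\,v_{2}^{*}\,|\Gamma_{b}|
   = \lambda^{*}-\gamma\,|v_{2}^{*}|\,|\Gamma_{b}|,
\end{aligned}
\qquad\text{hence}\qquad
\lambda^{*}=\gamma|\Gamma_{b}|\,|v_{2}^{*}|
 =\gamma|\Gamma_{b}|\,\Big|\int_{\Omega^{+}}F_{2}\,dx+\int_{\Gamma_{f}}f_{2}\,ds\Big|^{-1}.
```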

The component $\sigma^{*}$ is not uniquely defined. One admissible choice satisfying (7.24) is the elastic stress of the form $\sigma^{*}=\mathbb{C}\varepsilon(u^{*})$ in $\Omega^{+}$ , where $u^{*}\in Y$ and $\mathbb{C}$ is the fourth-order elasticity tensor representing Hooke’s law. If $\int_{\Omega^{+}}F_{2}\,dx+\int_{\Gamma_{f}}f_{2}\,ds=0$ then $\lambda^{*}=\zeta^{*}=+\infty$ .
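The closed-form limit load derived in this subsection is easy to evaluate numerically. The sketch below is illustrative only: the function name `delamination_limit_load` and its parameter names are hypothetical, and the two load resultants $\int_{\Omega^{+}}F_{2}\,dx$ and $\int_{\Gamma_{f}}f_{2}\,ds$ are assumed to have been computed beforehand (for example by quadrature).

```python
import math
from typing import Optional, Tuple


def delamination_limit_load(
    gamma: float,
    interface_length: float,
    body_resultant: float,
    surface_resultant: float,
) -> Tuple[float, Optional[float]]:
    """Return (lambda_star, v2_star) for the symmetrized delamination example.

    gamma             -- delamination threshold, gamma > 0
    interface_length  -- |Gamma_b|, the length of the interface
    body_resultant    -- integral of F_2 over Omega^+ (assumed precomputed)
    surface_resultant -- integral of f_2 over Gamma_f (assumed precomputed)
    """
    if gamma <= 0.0 or interface_length < 0.0:
        raise ValueError("gamma must be positive and |Gamma_b| nonnegative")
    resultant = body_resultant + surface_resultant
    if resultant == 0.0:
        # No v in K satisfies L(v) = 1, so the limit value is +infinity.
        return math.inf, None
    v2_star = 1.0 / resultant  # constant vertical displacement with L(v*) = 1
    lambda_star = gamma * interface_length / abs(resultant)
    return lambda_star, v2_star


lam, v2 = delamination_limit_load(2.0, 3.0, 0.5, 1.0)
print(lam)  # 4.0
```

The returned pair satisfies $\lambda^{*}=\gamma|\Gamma_{b}||v_{2}^{*}|$, which is the value of $\mathcal{J}(v^{*})$.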

Remark 7.4.

If we consider the case in which the body is fixed on $\Gamma_{\ell}$ as in [2], then $\mathcal{K}=\{0_{Y}\}$ , which implies that $\lambda^{*}=\zeta^{*}=+\infty$ . Thus the related delamination problem may have a solution even if the composite is completely debonded.

Remark 7.5.

The complete formulation of the delamination problem requires also a condition of non-interpenetration (that is, a Signorini condition) along $\Gamma_{b}$ . For the symmetrized problem this amounts to defining the conic set $Y_{C}:=\{v\in Y\ |\;\;v_{2}\geq 0\;\mbox{on }\Gamma_{b}\}$ of admissible displacement fields, replacing the last of equations (7.21) with

\sigma_{21}=0,\;\;-\sigma_{22}\in[0,\gamma]\quad\mbox{on}\;\Gamma_{b},

and consequently, replacing $P$ with $P:=\{(\sigma,\Xi)\in X\ |\;\;\Xi\in[0,\gamma]\;\mbox{on }\Gamma_{b}\}$ . According to Theorem 4.3, we have the duality problem

\lambda^{*}=\sup_{x\in P}\inf_{\begin{subarray}{c}y\in Y_{C}\\ L(y)=1\end{subarray}}\ a(x,y)\stackrel{{\scriptstyle?}}{{=}}\inf_{\begin{subarray}{c}y\in Y_{C}\\ L(y)=1\end{subarray}}\sup_{x\in P}\ a(x,y)=\zeta^{*}.

By combining Theorems 4.1 and 4.3 it is possible to show that $\lambda^{*}=\zeta^{*}$ . In particular, if

\int_{\Omega^{+}}F_{2}\,dx+\int_{\Gamma_{f}}f_{2}\,ds>0

we obtain the same limit value and the same primal and dual solutions as for the duality problem without the non-interpenetration condition.

8 Conclusion

This work has been concerned with an inf-sup problem posed on abstract Banach spaces. The main feature of this convex and constrained problem has been the presence of a bilinear Lagrangian, which appears in applications leading to linear, cone or convex programming problems. Conditions for ensuring duality without any gap have been introduced. We have introduced and extended an innovative framework based on an inf-sup condition on convex cones generalizing the well-known Babuška-Brezzi conditions. We have also suggested a new regularization method and derived a computable majorant to the problem.

Applications of the abstract problem to various examples in mechanics have been presented. First, the problem of limit analysis in classical plasticity has been revisited in the context of the duality framework of this work. Then, we have shown that the abstract framework may be used in the case of two different subproblems related to strain-gradient plasticity, viz. the determination of plastically admissible stresses and the determination of limit loads, and for a delamination problem.

The techniques presented in this paper could be extended to more general duality problems where the Lagrangian contains, in addition to the bilinear form, linear forms with respect to primal or dual variables. Such an extension would be applicable to a wider range of problems in mechanics.

Acknowledgment: SS and JH acknowledge support for their work from the Czech Science Foundation (GAČR) through project No. 19-11441S. BDR acknowledges support for his work from the National Research Foundation, through the South African Chair in Computational Mechanics, SARChI Grant 47584.

References

[1] Babuška, I. (1971). Error-bounds for finite element method. Numerische Mathematik, 16(4), 322–333.
[2] Baniotopoulos, C.C., Haslinger, J., Morávková, Z. (2005). Mathematical modeling of delamination and nonmonotone friction problems by hemivariational inequalities. Applications of Mathematics, 50(1), 1–25.
[3] Boffi, D., Brezzi, F., Fortin, M. (2013). Mixed Finite Element Methods and Applications. Springer.
[4] Brezzi, F. (1974). On the existence, uniqueness and approximation of saddle-point problems arising from Lagrangian multipliers. Publications mathématiques et informatique de Rennes, (S4), 1–26.
[5] Boyd, S. and Vandenberghe, L. (2004). Convex Optimization. Cambridge University Press.
[6] Carstensen, C., Ebobisse, F., McBride, A.T., Reddy, B.D., Steinmann, P. (2017). Some properties of the dissipative model of strain-gradient plasticity. Phil. Mag. 97 (10), 693–717.
[7] Cermak, M., Haslinger, J., Kozubek, T., Sysala, S. (2015). Discretization and numerical realization of contact problems for elastic‐perfectly plastic bodies. PART II–numerical realization, limit analysis. ZAMM‐Journal of Applied Mathematics and Mechanics/Zeitschrift für Angewandte Mathematik und Mechanik, 95(12), 1348–1371.
[8] Chen, W. and Liu, X.L. (1990). Limit Analysis in Soil Mechanics. Elsevier.
[9] Christiansen, E. (1980). Limit analysis in plasticity as a mathematical programming problem. Calcolo 17, 41–65.
[10] Christiansen, E. (1996). Limit analysis of collapse states. In P. G. Ciarlet and J. L. Lions, editors, Handbook of Numerical Analysis, Vol IV, Part 2, North-Holland, 195–312.
[11] Dantzig, G.B. (1998). Linear programming and extensions (Vol. 48). Princeton University Press.
[12] Ekeland, I. and Temam, R. (1974). Analyse Convexe et Problèmes Variationnels. Dunod, Gauthier Villars, Paris.
[13] Fleck, N.A. and Willis, J.R. (2009). A mathematical basis for strain-gradient plasticity theory. Part II: tensorial plastic multiplier. J. Mech. Phys. Solids 57, 1045–1057.
[14] Haslinger, J., Repin, S., Sysala, S. (2016). A reliable incremental method of computing the limit load in deformation plasticity based on compliance: Continuous and discrete setting. Journal of Computational and Applied Mathematics 303, 156–170.
[15] Haslinger, J., Repin, S., Sysala, S. (2016). Guaranteed and computable bounds of the limit load for variational problems with linear growth energy functionals. Applications of Mathematics 61, 527–564.
[16] Haslinger, J., Repin, S., Sysala, S. (2019). Inf-sup conditions on convex cones and applications to limit load analysis. Mathematics and Mechanics of Solids 24, 3331–3353.
[17] Johnson, C. (1976). Existence theorems for plasticity problem. J. Math. Pures et Appl., 55, 79–84.
[18] Kanno, Y. (2011). Nonsmooth mechanics and convex optimization. CRC Press.
[19] Myerson, R.B. (2013). Game theory. Harvard University Press.
[20] Nečas, J. and Hlaváček, I. (2017). Mathematical theory of elastic and elasto-plastic bodies: an introduction, Elsevier.
[21] Nocedal, J., and Wright, S. (2006). Numerical optimization. Springer Science & Business Media.
[22] Parikh, N. and Boyd, S. (2014). Proximal Algorithms. Foundations and Trends in Optimization, 1(3), 127–239.
[23] Polizzotto, C. (2010). Strain gradient plasticity, strengthening effects and plastic limit analysis. International journal of solids and structures, 47(1), 100–112.
[24] Reddy, B.D. (2011). The role of dissipation and defect energy in variational formulations of problems in strain-gradient plasticity. Part 1: single-crystal plasticity. Cont. Mech. Thermodyn. 23, 551–572.
[25] Reddy, B.D., Ebobisse, F., McBride, A.T. (2008). Well-posedness of a model of strain gradient plasticity for plastically irrotational materials, Int. J. Plast. 24, 55–73.
[26] Reddy, B.D. and Sysala, S. (2020). Bounds on the elastic threshold for problems of dissipative strain-gradient plasticity. Journal of the Mechanics and Physics of Solids, 10.1016/j.jmps.2020.104089.
[27] Repin, S. (2010). Estimates of deviations from exact solutions of variational problems with linear growth functional. Journal of Mathematical Sciences 166, 75–85.
[28] Repin, S., Sysala, S., Haslinger, J. (2018). Computable majorants of the limit load in Hencky’s plasticity problems. Comp. & Math. with Appl. 75, 199–217.
[29] Repin, S. and Seregin, G. (1995). Existence of a weak solution of the minimax problem arising in Coulomb-Mohr plasticity, Nonlinear evolution equations, 189–220, Amer. Math. Soc. Transl. (2), 164, Amer. Math. Soc., Providence, RI.
[30] Sloan, S.W. (2013). Geotechnical stability analysis. Géotechnique 63, 531–572.
[31] Sysala, S., Haslinger, J., Hlaváček, I., Cermak, M. (2015). Discretization and numerical realization of contact problems for elastic‐perfectly plastic bodies. PART I–discretization, limit analysis. ZAMM‐Journal of Applied Mathematics and Mechanics/Zeitschrift für Angewandte Mathematik und Mechanik, 95(4), 333–353.
[32] Temam, R. (1985). Mathematical Problems in Plasticity. Gauthier-Villars, Paris.
[33] Zouain, N. (2018). Shakedown and safety assessment. In: Encyclopedia of Computational Mechanics Second Edition (E. Stein, R. de Borst and T.J.R. Hughes eds.), 1–48.

\begin{aligned}\min_{z\in\mathcal{K}}\|y-z\|^{2}&=\min_{z\in Y}\,\sup_{x\in P_{C}}\left\{\|y-z\|^{2}+2a(x,z)\right\}\\ &=\sup_{x\in P_{C}}\,\min_{z\in Y}\left\{\|y-z\|^{2}+2a(x,z)\right\}\\ &=\sup_{x\in P_{C}}\,\min_{z\in Y}\left\{\|z\|^{2}+2a(x,z)+2a(x,y)\right\}\quad\forall y\in Y.\end{aligned}

(6.12)

\begin{aligned}\min_{z\in\mathcal{K}}\|y-z\|^{2}&=\sup_{x\in P_{C}}\,\min_{z\in Y}\left\{\|z\|^{2}+2a(x,z)+2a(x,y)\right\}\\ &=\sup_{x\in P_{C}}\,\left\{-\left(\sup_{\begin{subarray}{c}z\in Y\\ z\neq 0_{Y}\end{subarray}}\frac{a(x,z)}{\|z\|_{Y}}\right)^{2}+2a(x,y)\right\}\\ &\stackrel{{\scriptstyle\eqref{inf-sup_abstract}}}{{\leq}}\sup_{x\in P_{C}}\,\left\{-c_{*}^{2}\|x\|_{X}^{2}+2a(x,y)\right\}\\ &=\max_{x\in P_{C}}\,\left\{-c_{*}^{2}\|x/c_{*}^{2}\|_{X}^{2}+2a(x/c_{*}^{2},y)\right\}\\ &=\frac{1}{c_{*}^{2}}\max_{x\in P_{C}}\,\left\{-\|x\|_{X}^{2}+2a(x,y)\right\}\stackrel{{\scriptstyle\eqref{Pi_C}}}{{=}}C_{*}^{2}\|\Pi_{C}\,y\|_{X}^{2}\qquad\forall y\in Y,\end{aligned}

(6.14)
