
Linear stability estimates for Serrin’s problem via a modified implicit function theorem

Alexandra Gilsbach, Department of Mathematics, School of Science, Tokyo Institute of Technology

Michiaki Onodera, Department of Mathematics, School of Science, Tokyo Institute of Technology

Abstract

We examine Serrin’s classical overdetermined problem under a perturbation of the Neumann boundary condition. The solution of the problem for a constant Neumann boundary condition exists provided that the underlying domain is a ball. The question arises whether for a perturbation of the constant there still are domains admitting solutions to the problem. Furthermore, one may ask whether a domain that admits a solution for the perturbed problem is unique up to translation and whether it is close to the ball. We develop a new implicit function theorem for a pair of Banach triplets that is applicable to nonlinear problems with loss of derivatives except at the point under consideration. Combined with a detailed analysis of the linearized operator, we prove the existence and local uniqueness of a domain admitting a solution to the perturbed overdetermined problem. Moreover, an optimal linear stability estimate for the shape of a domain is established.

Keywords: overdetermined problem, implicit function theorem, stability estimates

MSC2010: 35N25, 35B35, 47J07

1 Introduction

We study the shape of a bounded domain $\Omega\subset\mathbb{R}^{n}$ , $n\geq 2$ , in which a solution $u$ to the Dirichlet problem


(1.1a)		$\displaystyle\begin{aligned} -\Delta u&=1\quad\text{in}\ \Omega,\\ u&=0\quad\text{on}\ \Gamma=\partial\Omega\end{aligned}$
satisfies the overdetermined boundary condition
(1.1b)		$\displaystyle-\frac{\partial u}{\partial\nu}=f\quad\text{on}\ \Gamma,$

where $\nu$ is the outer unit normal vector to $\Gamma$ and $f$ is a prescribed positive function defined on $\mathbb{R}^{n}$ .

The overdetermined problem (1.1) arises in a shape optimization problem called the Saint-Venant problem, in which one maximizes the torsional rigidity

P(\Omega)=\sup_{u\in H_{0}^{1}(\Omega)\setminus\{0\}}\frac{\left(\int_{\Omega}u\,dx\right)^{2}}{\int_{\Omega}|\nabla u|^{2}\,dx}

of a bar with cross section $\Omega$ , among all sets $\Omega$ of equal weighted volume

V(\Omega)=\int_{\Omega}f^{2}\,dx.

The Euler-Lagrange equation, after multiplication by a normalizing constant, is precisely (1.1). In the case where $f$ is a constant, Pólya [17] proved that the maximizer $\Omega$ of $P$ must be a ball with the prescribed volume $V$ , using the symmetric rearrangement of a function. This applies to a more general situation, where $f$ is radially symmetric and non-decreasing in the radial direction.

In fact, the same symmetry result also holds for all critical points, namely, if $f$ is a constant, then (1.1) has a solution $u$ if and only if $\Omega$ is a ball. In particular, for the normalized constant $f=\tfrac{1}{n}$ , $\Omega$ is a ball of radius one and $u(x)=\tfrac{1}{2n}(1-|x-c|^{2})$ with $c$ being the center of the ball. This well-known symmetry result is due to Serrin [18]. The proof introduces the method of moving planes motivated by Alexandrov’s reflection principle [2] originally used to establish the soap bubble theorem. This symmetry result can be alternatively proven by an ingenious combination of the Rellich-Pohozaev integral identity and elementary inequalities (see Weinberger [19], and Brandolini, Nitsch, Salani, and Trombetti [5]), or by a continuous version of the Steiner symmetrization (see Brock and Henrot [7]).
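As a quick check of the normalized case, the explicit formula can be verified directly. The computation below is a sketch that uses only (1.1) with $f=\tfrac{1}{n}$ and the stated $u$; nothing else is assumed.

```latex
\begin{aligned}
u(x)&=\tfrac{1}{2n}\bigl(1-|x-c|^{2}\bigr),\qquad
\nabla u(x)=-\tfrac{1}{n}(x-c),\qquad
\Delta u=-\tfrac{1}{n}\operatorname{div}(x-c)=-1,\\
u&=0\quad\text{on}\ |x-c|=1,\\
-\frac{\partial u}{\partial\nu}&=-\nabla u\cdot(x-c)=\tfrac{1}{n}|x-c|^{2}=\tfrac{1}{n}
\quad\text{on}\ |x-c|=1,\ \text{where}\ \nu(x)=x-c .
\end{aligned}
```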

The objective of this paper is the stability of a domain $\Omega$ under a perturbation of the Neumann boundary condition (1.1b), which naturally arises if one considers the torsional rigidity in anisotropic media. Namely, setting $\Omega_{0}:=\mathbb{B}$ , the $n$ -dimensional unit ball centered at the origin, and

(1.2)

f(x)=\frac{1}{n}+g\left(\frac{x}{|x|}\right)\quad(x\in\mathbb{R}^{n}\setminus\{0\})

with a prescribed function $g$ defined on $\Gamma_{0}:=\mathbb{S}$ , where $\mathbb{S}=\partial\mathbb{B}$ , we prove the existence and local uniqueness of $\Omega$ admitting a solution $u$ to (1.1), and establish a quantitative estimate of the deviation of $\Omega$ from $\Omega_{0}$ in terms of the perturbation $g$ .

The domain deviation is measured by a function $\rho=\rho(\zeta)\in(-1,\infty)$ which defines the star-shaped bounded domain $\Omega_{\rho}$ enclosed by

(1.3)

\Gamma_{\rho}:=\left\{\zeta+\rho(\zeta)\zeta\mid\zeta\in\mathbb{S}\right\}.

A domain $\Omega$ admitting a solution to (1.1) will also be referred to as a solution of the problem.

In what follows, $h^{k+\alpha}(\overline{\Omega})$ denotes the little Hölder space defined as the closure of the Schwartz space $\mathcal{S}$ of rapidly decreasing functions in $C^{k+\alpha}(\overline{\Omega})$ , and similarly $h^{k+\alpha}(\Gamma)$ for a hypersurface $\Gamma$ (see Lunardi [13]).

In order to motivate our study, let us mention several related results concerning existence and stability of solutions to (1.1). The existence of $\Omega$ for non-constant $f$ is known (see Bianchini, Henrot and Salani [4]) in the case where $f$ is positively homogeneous, i.e.,

(1.4)

f(tx)=t^{\gamma}f(x)\quad(t>0,\ x\in\mathbb{R}^{n})

for $\gamma>0$ with $\gamma\neq 1$ , where $f$ is Hölder continuous on $\mathbb{R}^{n}\setminus\{0\}$ . This condition ensures the existence of a maximizer $\Omega$ of the Saint-Venant problem, and a solution $u$ to (1.1a) then satisfies $-\partial_{\nu}u=\lambda f$ on $\Gamma$ with a Lagrange multiplier $\lambda>0$ . The $\gamma$ -homogeneity of $f$ allows us to control $\lambda$ by considering $t\Omega:=\{tx\mid x\in\Omega\}$ : the rescaled function $u_{t}(y):=t^{2}u(y/t)$ solves (1.1a) in $t\Omega$ with $-\partial_{\nu}u_{t}=\lambda t^{1-\gamma}f$ on $\partial(t\Omega)$ , so $t=\lambda^{1/(\gamma-1)}$ gives a desired domain. However, the $0$ -homogeneous case (1.2) cannot be treated by this variational approach, since the dichotomy of a maximizing sequence cannot be excluded in the concentration-compactness alternative.

Most of the existing stability results in the literature for (1.1), fitted into our context by translation and dilation, take the form

(1.5)

\|\rho\|_{L^{\infty}(\mathbb{S})}\leq C\left[\frac{\partial u_{\Omega}}{\partial\nu}+R\right]_{X}^{\tau},

where $u_{\Omega}$ is a solution to (1.1a) in $\Omega=\Omega_{\rho}$ with $C^{2+\alpha}$ -boundary, $0<\tau\leq 1$ and $[\,\cdot\,]_{X}$ denotes a norm or seminorm which measures the deviation of $-\partial_{\nu}u_{\Omega}$ from a constant $R>0$ . Aftalion, Busca and Reichel [1] adopted a quantitative version of the method of moving planes and obtained a logarithmic version of (1.5) with $X=C^{1}(\Gamma)$ . The method was further developed by Ciraolo, Magnanini and Vespri [8], and they obtained (1.5) for some $0<\tau<1$ in terms of the Lipschitz seminorm of $X=\text{Lip}(\Gamma)$ . In fact, these results also hold for semilinear equations $-\Delta u=f(u)$ with $u>0$ . On the other hand, Brandolini, Nitsch, Salani and Trombetti [6] made use of integral identities and proved (1.5) with $X=L^{\infty}(\Gamma)$ for some $0<\tau<1$ . Moreover, they obtained an estimate of the volume of the symmetric difference of $\Omega$ and a union of balls by a weaker norm, i.e., $X=L^{1}(\Gamma)$ . Note that the problem (1.1) admits a domain $\Omega$ composed of a finite number of balls joined by tiny tentacles if we only control the extra boundary condition (1.1b) by the $L^{1}$ -norm. Following this approach, Feldman [9] obtained the sharp estimate

|\Omega\triangle\mathbb{B}|\leq C\left\|\frac{\partial u_{\Omega}}{\partial\nu}+R\right\|_{L^{2}(\Gamma)},

where $|\Omega\triangle\mathbb{B}|$ is the volume of the symmetric difference of $\Omega$ and $\mathbb{B}$ and can be regarded as $\|\rho\|_{L^{1}(\mathbb{S})}$ for star-shaped $\Omega=\Omega_{\rho}$ . The linear (i.e., $\tau=1$ ) stability estimate is also expected to hold in (1.5). Recently, Magnanini and Poggesi proved (1.5) with $X=L^{2}(\Gamma)$ and $\tau=1$ for $n=2$ , $\tau$ arbitrarily close to $1$ for $n=3$ , and $\tau=\frac{2}{n-1}$ for $n\geq 4$ [14].

In general, for overdetermined problems, the super-subsolution method based on the maximum principle provides an existence criterion. In our setting, a bounded domain $\Omega$ is called a supersolution to (1.1) if the unique solution $u=u_{\Omega}$ to (1.1a) satisfies

-\frac{\partial u_{\Omega}}{\partial\nu}\leq f\quad\text{on}\ \Gamma,

and a subsolution is defined analogously with the opposite inequality. The existence of a solution, i.e., a bounded domain $\Omega$ in which $u_{\Omega}$ satisfies (1.1b), is guaranteed provided there are a supersolution $\Omega_{\sup}$ and a subsolution $\Omega_{\text{sub}}$ satisfying $\Omega_{\text{sub}}\subset\Omega_{\sup}$ . Typically, balls $\mathbb{B}_{r}$ with large or small radii $r>0$ give super- or subsolutions. Indeed, for $\Omega=\mathbb{B}_{r}$ ,

u_{\mathbb{B}_{r}}(x)=\frac{r^{2}-|x|^{2}}{2n}

with $-\partial_{\nu}u_{\mathbb{B}_{r}}=\frac{r}{n}$ on $\partial\mathbb{B}_{r}$ solves (1.1), and we see that, in the $\gamma$ -homogeneous setting (1.4) with $\gamma>1$ , $\mathbb{B}_{r}$ with large (resp. small) $r>0$ is a supersolution (resp. subsolution); while for $0\leq\gamma<1$ , $\mathbb{B}_{r}$ with large (resp. small) $r>0$ is a subsolution (resp. supersolution). Hence these balls provide an appropriate pair of super- and subsolutions only if $\gamma>1$ .
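The claim about balls can be made explicit. The following sketch assumes the homogeneity (1.4) and writes $m:=\min_{\mathbb{S}}f>0$ and $M:=\max_{\mathbb{S}}f$ ; these two symbols, and the radii $r_{\text{sub}}$ and $r_{\sup}$ , are notation introduced here only for illustration.

```latex
\begin{aligned}
&f(r\zeta)=r^{\gamma}f(\zeta)\quad(\zeta\in\mathbb{S}),\qquad
-\partial_{\nu}u_{\mathbb{B}_{r}}=\tfrac{r}{n}\quad\text{on}\ \partial\mathbb{B}_{r},\\
&\mathbb{B}_{r}\ \text{is a supersolution}\iff\tfrac{r}{n}\leq r^{\gamma}f(\zeta)\ \text{for all}\ \zeta\in\mathbb{S}\iff r^{1-\gamma}\leq nm,\\
&\mathbb{B}_{r}\ \text{is a subsolution}\iff r^{1-\gamma}\geq nM,\\
&\gamma>1:\quad \text{subsolution for}\ r\leq r_{\text{sub}}:=(nM)^{-1/(\gamma-1)},\quad
\text{supersolution for}\ r\geq r_{\sup}:=(nm)^{-1/(\gamma-1)},\quad r_{\text{sub}}\leq r_{\sup},\\
&0\leq\gamma<1:\quad \text{supersolution for}\ r\leq(nm)^{1/(1-\gamma)},\quad
\text{subsolution for}\ r\geq(nM)^{1/(1-\gamma)} .
\end{aligned}
```

For $\gamma>1$ the inclusion $\mathbb{B}_{r_{\text{sub}}}\subset\mathbb{B}_{r_{\sup}}$ required by the criterion holds, whereas for $0\leq\gamma<1$ the subsolution balls are the large ones, so the ordering is reversed.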

We therefore take another approach in this paper based on an implicit function theorem, yielding linear stability estimates with Hölder norms on both sides of the estimate, as well as the existence and local uniqueness of $\Omega$ for a given perturbation $g$ in (1.2). We will need to exploit detailed properties of the linearized equation


(1.6a)		$\displaystyle\begin{aligned} -\Delta p&=0&&\text{in}\ \Omega_{\rho_{0}},\\ \left(H-\frac{1}{f}\right)p+\frac{\partial p}{\partial\nu}&=-\varphi&&\text{on}\ \Gamma_{\rho_{0}},\end{aligned}$
(1.6b)		$\displaystyle p=f\tilde{\rho}\quad\,\text{on}\ \Gamma_{\rho_{0}},$

where $H=H_{\Gamma_{\rho_{0}}}$ is the mean curvature of $\Gamma_{\rho_{0}}$ normalized such that $H=n-1$ for $\Omega=\mathbb{B}$ . The linearized equation (1.6) is derived by substituting a solution pair $(\Omega_{\rho_{0}+\varepsilon\tilde{\rho}},u_{\rho_{0}+\varepsilon\tilde{\rho}})$ with formal expansions

\displaystyle\begin{aligned} \Gamma_{\rho_{0}+\varepsilon\tilde{\rho}}&=\{\zeta+\left(\rho_{0}(\zeta)+\varepsilon\tilde{\rho}(\zeta)\right)\nu(\zeta)+o(\varepsilon)\mid\zeta\in\mathbb{S}\},\\ u_{\rho_{0}+\varepsilon\tilde{\rho}}&=u_{\rho_{0}}+\varepsilon p+o(\varepsilon)\end{aligned}

into (1.1) for a right hand side $f+\varepsilon\varphi$ , and equating functions of order $\varepsilon$ . Note that (1.6) is a decoupled system for $p$ and $\tilde{\rho}$ , and we may consider only (1.6a) for the solvability of (1.6). Then (1.6b) with known $p$ yields a solution $\tilde{\rho}$ .

Recall that the implicit function theorem states that the nonlinear equation $F(\rho,g)=0$ has for each $g$ close to $g_{0}$ a unique solution $\rho$ near $\rho_{0}$ with $F(\rho_{0},g_{0})=0$ , if

(i)

the mapping $F\colon X\times Y\to Z$ is $C^{1}$ in a neighbourhood of $(\rho_{0},g_{0})$ and if
(ii)

the partial derivative $\partial_{\rho}F(\rho_{0},g_{0})\in\mathscr{L}(X,Z)$ is bijective.

Here, $X$ , $Y$ and $Z$ are Banach spaces with $X\subset Z$ . In addition to the solution $\rho(g)$ being locally unique, the mapping $g\mapsto\rho(g)\in X$ is in $C^{1}$ . In the current setting, the Neumann boundary condition (1.1b) yields such a mapping $F$ , and the linearized equation $\partial_{\rho}F(\rho_{0},g_{0})[\tilde{\rho}]=\varphi$ is reflected by (1.6). However, the linearized equation (1.6) has a regularity defect called loss of derivatives, i.e. $\partial_{\rho}F(\rho_{0},g_{0})^{-1}\not\in\mathscr{L}(Z,X)$ : solutions $\tilde{\rho}$ to (1.6) are less regular than $\rho_{0}$ , and hence the typical iterative scheme in the classical implicit function theorem fails.
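For orientation, one common way to prove the classical theorem is a simplified Newton iteration with the derivative frozen at $(\rho_{0},g_{0})$ . The display below is a schematic sketch of that standard scheme; the map $T_{g}$ and the iteration index $k$ are notation introduced here.

```latex
\begin{aligned}
T_{g}(\rho)&:=\rho-\partial_{\rho}F(\rho_{0},g_{0})^{-1}F(\rho,g),\qquad \rho_{k+1}:=T_{g}(\rho_{k}),\\
T_{g}(\rho)=\rho&\iff F(\rho,g)=0,\\
\left\|T_{g}(\rho)-T_{g}(\rho')\right\|_{X}&\leq\left\|\partial_{\rho}F(\rho_{0},g_{0})^{-1}\right\|_{\mathscr{L}(Z,X)}
\left\|F(\rho,g)-F(\rho',g)-\partial_{\rho}F(\rho_{0},g_{0})[\rho-\rho']\right\|_{Z}.
\end{aligned}
```

Under (i) and (ii) the right-hand side is small relative to $\|\rho-\rho'\|_{X}$ near $(\rho_{0},g_{0})$ , so $T_{g}$ is a contraction. The estimate relies on $F$ being $C^{1}$ between the same pair of spaces in which the derivative is inverted, which is the property that a loss of derivatives destroys.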

One method to overcome this regularity issue is the Nash-Moser theorem, a generalization of the classical implicit function theorem introduced by Nash in [16] and generalized by Moser in [15]. There, the introduction of a smoothing operator combined with Newton’s method for improved convergence was shown to be a means to overcome the regularity deficit. For the Nash-Moser theorem to work,

(i)

regularity properties are required for $F\colon X_{i}\times Y\to Z_{i}$ , where $(X_{i},Z_{i})_{i}$ is a family of pairs of Banach spaces such that $X_{i}\subset X_{i-1}$ , $Z_{i}\subset Z_{i-1}$ . Furthermore,
(ii)

a (right) inverse of $\partial_{\rho}F(\rho,g)$ has to exist for $(\rho,g)$ in a neighbourhood of $(\rho_{0},g_{0})$ .

In this setting, the existence of $\rho(g)$ in $X_{0}$ is then guaranteed for every $g$ in a neighbourhood of $g_{0}$ . Note that there are various versions of the Nash-Moser theorem, also referred to as the Nash-Moser-Hörmander theorem. As an example, we refer to the work of Baldi and Haus [3] and the references therein.

Instead of applying the Nash-Moser theorem, we introduce a new modified version of the classical implicit function theorem, which has the constraint that a loss of derivatives may take place except at the point $(\rho_{0},g_{0})$ . We require for a pair of Banach triplets $X_{2}\subset X_{1}\subset X_{0}$ and $Z_{2}\subset Z_{1}\subset Z_{0}$ that

(i)

for $j=1,2$ , $F$ is continuous in a neighbourhood of $(\rho_{0},g_{0})$ from $X_{j-1}\times Y$ to $Z_{j-1}$ , and that it is in $C^{1}$ in a neighbourhood of $(\rho_{0},g_{0})$ from $X_{j}\times Y$ to $Z_{j-1}$ . For $(\rho,g)$ in a neighbourhood of $(\rho_{0},g_{0})\in X_{j}\times Y$ , we have $\partial_{\rho}F(\rho,g)\in\mathscr{L}(Z_{j-1},X_{j-1})$ . Further,
(ii)

$F\colon X_{j}\times Y\to Z_{j}$ is Fréchet-differentiable at $(\rho_{0},g_{0})$ for $j=1,2$ and $\partial_{\rho}F(\rho_{0},g_{0})\in\mathscr{L}(X_{j},Z_{j})$ is invertible for $j=0,1,2$ .

The first point reflects the loss of regularity; the second reflects that it does not occur at the point under consideration. Under these assumptions, we derive a modified implicit function theorem that yields local uniqueness of a solution $\rho(g)\in X_{1}$ for all $g$ in a neighbourhood of $g_{0}$ , and the mapping $g\mapsto\rho(g)\in X_{0}$ is in $C^{1}$ . Note that in the setting of (1.1) and (1.6), the loss of derivatives indeed does not occur for the solution of (1.1) with constant $f$ , as then $\Gamma$ and $u$ are smooth.

However, a second obstacle apart from the loss of derivatives arises. Due to the translational invariance of (1.1), the linearized equation (1.6a) for $g=0$ and $\Omega_{\rho_{0}}=\mathbb{B}$ is not solvable for arbitrary $\varphi\in h^{2+\alpha}(\mathbb{S})$ , and for $\varphi=0$ it has an $n$ -dimensional space of solutions. This implies that the partial derivative of $F$ at $(0,0)$ is not invertible, which is necessary also for the modified implicit function theorem. We will remove this degeneracy by imposing an additional condition

(1.7)

\int_{\Omega}x_{j}\,dx=0\quad(j=1,\ldots,n),

so that the barycenter of $\Omega$ is fixed to be the origin, and by decomposing the space $h^{l+\alpha}(\mathbb{S})$ into $h^{l+\alpha}(\mathbb{S})=X_{l}\oplus K$ , where

(1.8)

\displaystyle\begin{aligned} X_{l}&:=\left\{\rho\in h^{l+\alpha}(\mathbb{S})\mid\langle\rho,x_{j}\rangle_{L^{2}(\mathbb{S})}=0,\quad j=1,\ldots,n\right\},\\ K&:=\text{span}\left\{x_{1},\ldots,x_{n}\right\}.\end{aligned}

This allows for a decomposition of a domain perturbation $\rho\in h^{l+\alpha}(\mathbb{S})$ into $\rho_{1}\in X_{l}$ , $\rho_{2}\in K$ , as well as a decomposition of the function $g$ in (1.2) likewise, and we examine $F(\rho_{1}+\rho_{2},g_{1}+g_{2})=0$ .
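The degeneracy can be seen explicitly on the unit ball. The sketch below assumes $\Omega_{\rho_{0}}=\mathbb{B}$ , $f=\tfrac{1}{n}$ and $H=n-1$ in (1.6a), and expands $p$ in homogeneous harmonic polynomials $p_{k}$ of degree $k$ , with $\varphi_{k}$ denoting the degree- $k$ spherical harmonic component of $\varphi$ (notation introduced here). It is a formal computation that ignores convergence questions.

```latex
\begin{aligned}
&\Bigl(H-\tfrac{1}{f}\Bigr)p+\frac{\partial p}{\partial\nu}=\frac{\partial p}{\partial\nu}-p=-\varphi\quad\text{on}\ \mathbb{S},\\
&p=\sum_{k\geq 0}p_{k},\qquad
\frac{\partial p_{k}}{\partial\nu}=x\cdot\nabla p_{k}=k\,p_{k}\quad\text{on}\ \mathbb{S}\quad\text{(Euler's identity)},\\
&\sum_{k\geq 0}(k-1)\,p_{k}\big|_{\mathbb{S}}=-\sum_{k\geq 0}\varphi_{k}
\ \Longrightarrow\
p_{k}\big|_{\mathbb{S}}=-\frac{\varphi_{k}}{k-1}\quad(k\neq 1),\qquad \varphi_{1}=0 .
\end{aligned}
```

Thus the degree-one harmonics $x_{1},\ldots,x_{n}$ span the kernel, and solvability requires $\langle\varphi,x_{j}\rangle_{L^{2}(\mathbb{S})}=0$ for $j=1,\ldots,n$ , which is the orthogonality condition built into $X_{l}$ in (1.8).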

With these preparations, we present the main result of this work.

Theorem 1.1.

There exist neighbourhoods of zero

V\subset X_{2},\,U_{2}\subset h^{2+\alpha}(\mathbb{S})\times K\text{ and }U_{3}\subset h^{3+\alpha}(\mathbb{S})\times K,

such that for all $g_{1}\in V$ there are unique $(\rho,g_{2})=(\rho(g_{1}),g_{2}(g_{1}))\in U_{3}$ such that the following holds.

(i)

$\Omega_{\rho(g_{1})}$ defined by (1.3) admits a solution $u\in h^{3+\alpha}(\overline{\Omega_{\rho(g_{1})}})$ to (1.1) for (1.2) with $g=g_{1}+g_{2}(g_{1})$ , and satisfies (1.7).
(ii)

$\Omega_{\rho(g_{1})}$ is locally unique up to translations in the sense that there is $(\rho(g_{1}),g_{2}(g_{1}))\in U_{3}$ for $g_{1}\in V$ , and if $\Omega_{\rho}$ with $\rho\in U_{3}$ admits a solution $u$ to (1.1) for (1.2) with $g=g_{2}(g_{1})+g_{1}$ and satisfies (1.7), then $\rho=\rho(g_{1})$ .

(iii)

For the mapping $(\rho,g_{2})\colon V\to U_{3}$ , we have $(\rho,g_{2})\in C^{1}(V,U_{2})$ and the stability estimates

(1.9)

\displaystyle\begin{aligned} \left\|\rho(g_{1})\right\|_{h^{2+\alpha}(\mathbb{S})}&\leq C\left\|g_{1}\right\|_{h^{2+\alpha}(\mathbb{S})},\\ \left\|g_{2}(g_{1})\right\|_{h^{2+\alpha}(\mathbb{S})}&\leq C\left\|g_{1}\right\|_{h^{2+\alpha}(\mathbb{S})}\end{aligned}

hold.

While the existence of $\rho$ is guaranteed in $h^{3+\alpha}(\mathbb{S})$ , only the weaker norm, i.e. the $h^{2+\alpha}(\mathbb{S})$ norm, of $\rho$ is estimated in (1.9). This is due to the fact that the linear stability estimate requires $C^{1}$ -regularity of the mapping $g_{1}\mapsto\rho$ , and this regularity is expected only when the image space is $h^{2+\alpha}(\mathbb{S})$ due to the loss of derivatives.
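The way $C^{1}$ -regularity produces a linear estimate can be sketched as follows. The computation assumes, in addition to Theorem 1.1, that $V$ is convex (for instance a ball) and that the derivative $D\rho$ is bounded on $V$ in the operator norm, which can be arranged by shrinking $V$ . It also uses $\rho(0)=0$ , which follows from local uniqueness since $\Omega_{0}=\mathbb{B}$ solves the problem for $g=0$ and satisfies (1.7).

```latex
\begin{aligned}
\rho(g_{1})&=\rho(0)+\int_{0}^{1}D\rho(tg_{1})[g_{1}]\,dt=\int_{0}^{1}D\rho(tg_{1})[g_{1}]\,dt,\\
\left\|\rho(g_{1})\right\|_{h^{2+\alpha}(\mathbb{S})}
&\leq\sup_{g\in V}\left\|D\rho(g)\right\|_{\mathscr{L}(X_{2},h^{2+\alpha}(\mathbb{S}))}
\left\|g_{1}\right\|_{h^{2+\alpha}(\mathbb{S})} .
\end{aligned}
```

The same argument applies to $g_{2}(g_{1})$ .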

Remark 1.2.

The translational invariance of (1.1) is mirrored in Theorem 1.1 by the decomposition into the translational part and its orthogonal complement. In that regard, it also becomes clear why the setting of the little Hölder spaces $h^{l+\alpha}$ instead of the Hölder spaces $C^{l+\alpha}$ is necessary: the decomposition into subspaces is induced by the spherical harmonics on $\mathbb{S}$ , whose linear span is dense in $h^{l+\alpha}(\mathbb{S})$ , but not in $C^{l+\alpha}(\mathbb{S})$ . This will be further discussed in Section 3.

The paper is organised as follows. In Section 2, we introduce the perturbed problem as well as derive in detail the formulation via the linearized equation (1.6). We motivate the application of an implicit function theorem to a mapping $F$ that is derived from the Neumann boundary condition. This application is obstructed by the degeneracy of the derivative of $F$ as well as the loss of derivatives. The degeneracy of the derivative of $F$ stemming from the inherent symmetry of (1.1) will be addressed in Section 3. There, also the decomposition for the little Hölder spaces is motivated as well as the necessity of using the setting of the little Hölder spaces. In Section 4, we will revisit the implicit function theorem and establish a modified version fitting our setting. This is then applied to the perturbed problem in Section 5 to prove Theorem 1.1.

2 Preliminaries

We formally set up the perturbed problem defined in (1.1). We want to know whether for a perturbation $g$ there exists an open bounded domain $\Omega=\Omega(g)$ admitting a solution $u_{\Omega}$ to (1.1) with (1.2), i.e. $f=\frac{1}{n}+g$ .

We restrict attention to domains $\Omega$ that can be modelled as a deviation of $\mathbb{B}$ , the domain admitting a solution to (1.1) with $f=\frac{1}{n}$ . For this reference domain $\Omega_{0}=\mathbb{B}$ , with $\partial\Omega_{0}=\Gamma_{0}=\mathbb{S}$ , we define the perturbed domain $\Omega_{\rho}$ by its $h^{m+\alpha}$ -boundary $\Gamma_{\rho}=\partial\Omega_{\rho}$ in the following way. We set for $m\in\mathbb{N}$

U_{\gamma,m}:=\left\{v\in h^{m+\alpha}(\mathbb{S})\,\big{|}\,\left\|v\right\|_{h^{m+\alpha}(\mathbb{S})}<\gamma\right\},

with $\gamma\leq 1$ sufficiently small. Next, we define

\theta:\,\mathbb{S}\times(-1,\infty)\to\theta\left(\mathbb{S}\times(-1,\infty)\right),\quad\theta(\zeta,r):=\zeta+r\nu_{0}(\zeta)=\zeta+r\zeta.

In general, $\nu_{\rho}$ denotes the outer unit normal vector of $\Gamma_{\rho}$ ; for $\rho=0$ we have $\nu_{0}(x)=x$ . Then we set

\Gamma_{\rho}=\left\{\zeta+\rho(\zeta)\nu_{0}(\zeta)\in\mathbb{R}^{n}\,\big{|}\,\zeta\in\Gamma_{0}\right\}=\left\{\zeta+\rho(\zeta)\zeta\in\mathbb{R}^{n}\,\big{|}\,\zeta\in\mathbb{S}\right\},

for $\rho\in U_{\gamma,m}$ . The function $\rho$ models the velocity of the boundary, and will be used to measure how much $\Gamma_{\rho}$ deviates from $\Gamma_{0}$ . Using this, we define the diffeomorphism

\theta_{\rho}(x):=\begin{cases}x+\varphi\left(|x|-1\right)\rho\left(\frac{x}{|x|}\right)\frac{x}{|x|},\quad&\text{for }x\neq 0,\\ 0\quad&\text{for }x=0,\end{cases}

from $\Omega_{0}=\mathbb{B}$ to $\Omega_{\rho}$ , where $\varphi\colon\mathbb{R}\to\mathbb{R}$ is a smooth cut-off function with $0\leq\varphi(r)\leq 1$ , $\varphi(r)=1$ for $|r|\leq\frac{1}{4}$ and $\varphi(r)=0$ for $|r|\geq\frac{3}{4}$ , as well as $\left|\frac{d\varphi}{dr}(r)\right|\leq 4$ . The diffeomorphism $\theta_{\rho}$ induces pullback and pushforward operators

	$\displaystyle\theta_{\rho}^{*}u$	$\displaystyle:=u\circ\theta_{\rho},\quad$	$\displaystyle\theta_{\rho}^{*}\colon h^{k+\alpha}(\Gamma_{\rho})\to h^{k+\alpha}(\mathbb{S}),$
	$\displaystyle\theta_{*}^{\rho}v$	$\displaystyle:=v\circ\theta_{\rho}^{-1},\quad$	$\displaystyle\theta_{*}^{\rho}\colon h^{k+\alpha}(\mathbb{S})\to h^{k+\alpha}(\Gamma_{\rho}),$

with $k\in\mathbb{N}\cup\{0\}$ .
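A short computation shows why the cut-off $\varphi$ and the smallness of $\gamma$ make $\theta_{\rho}$ a diffeomorphism. The sketch writes $x=s\zeta$ with $s=|x|>0$ and $\zeta\in\mathbb{S}$ , introduces the radial profile $R_{\zeta}$ as notation, and assumes $\|\rho\|_{L^{\infty}(\mathbb{S})}<\tfrac{1}{4}$ , which is an illustrative smallness condition rather than necessarily the precise one used later.

```latex
\begin{aligned}
\theta_{\rho}(s\zeta)&=\bigl(s+\varphi(s-1)\rho(\zeta)\bigr)\zeta=:R_{\zeta}(s)\,\zeta,\\
R_{\zeta}'(s)&=1+\varphi'(s-1)\rho(\zeta)\geq 1-4\,|\rho(\zeta)|>0,\\
R_{\zeta}(s)&=s\quad\text{for}\ 0<s\leq\tfrac{1}{4},\qquad R_{\zeta}(1)=1+\rho(\zeta) .
\end{aligned}
```

Hence each ray from the origin is mapped monotonically onto itself, $\theta_{\rho}$ is the identity near the origin, and $\mathbb{S}$ is mapped onto $\Gamma_{\rho}$ .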

Our problem now becomes the following:

Problem 2.1.

For $g\in h^{1+\alpha}(\mathbb{S})$ , is there a $\rho=\rho(g)\in U_{\gamma,2}$ such that $\Omega_{\rho}$ as defined above admits a solution $u_{\rho}$ to (2.1)?


(2.1a)		$\displaystyle\begin{aligned} -\Delta u_{\rho}&=1\quad&\text{in }\Omega_{\rho},\\ u_{\rho}&=0\phantom{+g\left(\frac{x}{\|x\|}\right)}\quad&\text{on }\Gamma_{\rho},\end{aligned}$
(2.1b)		$\displaystyle-\frac{\partial u_{\rho}}{\partial\nu_{\rho}}(x)=\frac{1}{n}+g\left(\frac{x}{\|x\|}\right)\quad\text{on }\Gamma_{\rho}.$

By elliptic regularity theory, we have, for given $\rho\in U_{\gamma,2}$ , the existence and uniqueness of a solution $u_{\rho}\in h^{2+\alpha}(\overline{\Omega}_{\rho})$ when only considering (2.1a). Therefore, for the examination of this problem, it is sufficient to focus on the perturbation, i.e. (2.1b).

We define $F\in C(U_{\gamma,2}\times h^{1+\alpha}(\mathbb{S}),h^{1+\alpha}(\mathbb{S}))$ by

(2.2)

F(\rho,g):=\theta_{\rho}^{*}\left(\frac{\partial u_{\rho}}{\partial\nu_{\rho}}\right)+\frac{1}{n}+g,

where $u_{\rho}$ is the unique solution of (2.1a). Then $\Omega_{\rho}$ admits a solution to (2.1) for given $g\in h^{1+\alpha}(\mathbb{S})$ if and only if $F(\rho,g)=0$ .
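As a consistency check of (2.2), $F$ can be evaluated explicitly on concentric balls. The sketch takes $\rho\equiv c$ with a real constant $c$ , $|c|<\gamma$ , so that $\Omega_{\rho}=\mathbb{B}_{1+c}$ ; the constant $c$ is introduced here only for illustration.

```latex
\begin{aligned}
u_{\rho}(x)&=\frac{(1+c)^{2}-|x|^{2}}{2n},\qquad
\frac{\partial u_{\rho}}{\partial\nu_{\rho}}=-\frac{1+c}{n}\quad\text{on}\ \Gamma_{\rho}=\partial\mathbb{B}_{1+c},\\
F(c,g)&=-\frac{1+c}{n}+\frac{1}{n}+g=g-\frac{c}{n} .
\end{aligned}
```

In particular $F(0,0)=0$ , and the constant perturbation $g\equiv\tfrac{c}{n}$ is matched by $\rho\equiv c$ , a first instance of $\rho$ depending linearly on $g$ .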

This structure suggests using the implicit function theorem to arrive at solutions in a neighbourhood of $(0,0)$ . However, we encounter two obstacles. The first is that the derivative $\partial_{\rho}F(0,0)$ is not bijective due to the inherent translational invariance, an observation that will be treated in Section 3. The second is the loss of derivatives, a regularity issue of the $\rho$ -derivative of $F$ that will be discussed in the following.

In view of this, note that for $g\in h^{2+\alpha}(\mathbb{S})$ and for $m=2,3$ , we have

(2.3)

F\in C(U_{\gamma,m}\times h^{2+\alpha}(\mathbb{S}),h^{m-1+\alpha}(\mathbb{S})).

2.1 Derivative of $F$

We turn to the $\rho$ -differentiability of $F$ at a point $(\rho_{0},g)$ . Due to the loss of derivatives, we need to assume $\rho_{0}\in U_{\gamma,3}$ . We consider

F(\rho_{0}+\varepsilon\tilde{\rho},g)-F(\rho_{0},g)=A(\rho_{0},g)[\varepsilon\tilde{\rho}]+o(\varepsilon)

for $\tilde{\rho}\in U_{\gamma,3}$ and $\varepsilon\to 0$ . Since $u_{\rho}$ in $F(\rho,g)$ lives on $\overline{\Omega}_{\rho}$ , which varies with $\rho$ , we take the following approach.

Let $x\in\overline{\Omega}_{0}$ . We define the mapping $u(\rho,y):=u_{\rho}(y)$ for $y\in\overline{\Omega}_{\rho}$ and set $y=\theta_{\rho_{0}}(x)\in\overline{\Omega}_{\rho_{0}}$ , as well as $z=\theta_{\rho_{0}+\varepsilon\tilde{\rho}}(x)\in\overline{\Omega}_{\rho_{0}+\varepsilon\tilde{\rho}}$ . One may show that $u(\rho,\theta_{\rho}(x))$ is differentiable with respect to $\rho$ ; for the procedure see e.g. [12, Sect. 5.6]. Therefore, the following calculations are well-defined.

Using the Taylor expansion, we write

	$\displaystyle u_{\rho_{0}+\varepsilon\tilde{\rho}}(z)$	$\displaystyle=u(\rho_{0}+\varepsilon\tilde{\rho},\theta_{\rho_{0}+\varepsilon\tilde{\rho}}(x))$
		$\displaystyle=u(\rho_{0},y)+\underbrace{\partial_{\rho}u(\rho_{0},y)\tilde{\rho}\left(\frac{\theta_{\rho_{0}}^{-1}(y)}{\|\theta_{\rho_{0}}^{-1}(y)\|}\right)}_{=:p(y)}\varepsilon$
		$\displaystyle\quad+\partial_{y}u(\rho_{0},y)V_{\rho_{0}}(y)\nu_{0}\left(\frac{\theta_{\rho_{0}}^{-1}(y)}{\|\theta_{\rho_{0}}^{-1}(y)\|}\right)\tilde{\rho}\left(\frac{\theta_{\rho_{0}}^{-1}(y)}{\|\theta_{\rho_{0}}^{-1}(y)\|}\right)\varepsilon+o(\varepsilon)$

with $V_{\rho_{0}}(y):=\varphi\left(\left|\theta_{\rho_{0}}^{-1}(y)\right|-1\right)$ . The function $p$ is the so-called shape derivative of $u_{\rho_{0}}$ with respect to the domain variation from $\Omega_{\rho_{0}}$ to $\Omega_{\rho_{0}+\varepsilon\tilde{\rho}}$ .

Next, we reformulate problem (2.1) in terms of $\tilde{\rho}$ and $p$ . Using the representation as before and considering the Dirichlet problem (2.1a) for $u_{\rho_{0}+\varepsilon\tilde{\rho}}$ as well as for $u_{\rho_{0}}$ , and letting $\varepsilon\to 0$ , we get

(2.4)

\displaystyle\begin{aligned} \Delta_{y}p(y)&=0&\quad\text{in }\Omega_{\rho_{0}},\\ p(y)&=-\frac{\partial u(\rho_{0},y)}{\partial\nu_{\rho_{0}}}\left(\theta_{*}^{\rho_{0}}\tilde{\rho}\right)(y)\frac{1}{|\nabla N_{\rho_{0}}|}&\quad\text{on }\Gamma_{\rho_{0}}.\end{aligned}

Here, $N_{\rho}(x):=|x|-1-\rho(\frac{x}{|x|})$ is defined for $x\in\theta(\mathbb{S}\times(-1,\infty))$ and we note that $\Gamma_{\rho}$ is the zero-level set of $N_{\rho}$ . Therefore, for the outer unit normal vector field $\nu_{\rho_{0}}$ at $\Gamma_{\rho_{0}}$ , we have $\nu_{\rho_{0}}(x)=\frac{\nabla N_{\rho_{0}}(x)}{|\nabla N_{\rho_{0}}(x)|}$ and $\nabla N_{\rho_{0}}\left(\theta_{\rho_{0}}(\zeta)\right).\nabla N_{0}(\zeta)=1$ for $\zeta\in\mathbb{S}$ . We also note that

\theta_{*}^{\rho_{0}}(\nu_{0})=(\nu_{\rho_{0}}.\nu_{0})\nu_{\rho_{0}}+\tau_{\rho_{0}}=\frac{1}{|\nabla N_{\rho_{0}}|}\nu_{\rho_{0}}+\tau_{\rho_{0}},

where $\tau_{\rho_{0}}$ is a tangent vector field. Here, we used $(\nu_{\rho_{0}}.\nu_{0})=\frac{1}{\left|\nabla N_{\rho_{0}}\left(\theta_{\rho_{0}}(\zeta)\right)\right|}$ .

Remark 2.2.

The regularity of $u_{\rho_{0}}$ and $\nu_{\rho_{0}}$ , $\rho_{0}\in U_{\gamma,3}$ , imposes restrictions on the regularity of $p$ and in general, we can only expect $p\in h^{2+\alpha}(\Gamma_{\rho_{0}})$ . We have $\partial_{\nu_{\rho_{0}}}{u_{\rho_{0}}}\in h^{2+\alpha}(\Gamma_{\rho_{0}})$ and for the mean curvature $H_{\rho_{0}}\in h^{1+\alpha}(\Gamma_{\rho_{0}})$ , but in general, no more. If, however, we are in the setting for $\rho_{0}=0$ , then we have $\partial_{\nu_{0}}{u_{0}}\in C^{\infty}(\mathbb{S})$ , $-\partial_{\nu_{0}}{u_{0}}=\frac{1}{n}$ and $p=\frac{1}{n}\tilde{\rho}$ , thus $p\in h^{3+\alpha}(\mathbb{S})$ , the same regularity as $\tilde{\rho}$ .

Now we calculate the Fréchet-derivative of $F$ . With the notation as before, let $x\in\mathbb{S}$ , $y=\theta_{\rho_{0}}(x)\in\Gamma_{\rho_{0}}$ and $z=\theta_{\rho_{0}+\varepsilon\tilde{\rho}}(x)\in\Gamma_{\rho_{0}+\varepsilon\tilde{\rho}}$ . Note that in this case, $V_{\rho_{0}}(y)=1$ and $\frac{\theta_{\rho_{0}}^{-1}(y)}{|\theta_{\rho_{0}}^{-1}(y)|}=x$ .

First note that there exists a tangent vector $\tau$ at $y\in\Gamma_{\rho_{0}}$ such that

\nu_{\rho_{0}+\varepsilon\tilde{\rho}}(z)=\nu_{\rho_{0}}(y)+\varepsilon\tau(y)+o(\varepsilon).

Next, we calculate

	$\displaystyle\partial_{z_{i}}u_{\rho_{0}+\varepsilon\tilde{\rho}}(z)$	$\displaystyle=\partial_{z_{i}}u({\rho_{0}+\varepsilon\tilde{\rho}},\theta_{\rho_{0}+\varepsilon\tilde{\rho}}(x))$
		$\displaystyle=\frac{\partial}{\partial y_{i}}u(\rho_{0},y)+\varepsilon\frac{\partial}{\partial y_{i}}p(y)+\varepsilon\frac{\partial}{\partial y_{i}}\frac{\partial}{\partial y_{k}}\left(u(\rho_{0},y)\right)\theta_{*}^{\rho_{0}}\left(\nu_{0}\tilde{\rho}\right)(y)+o(\varepsilon).$

This implies

\displaystyle\begin{aligned} &\nabla_{z}u_{\rho_{0}+\varepsilon\tilde{\rho}}(z).\nu_{\rho_{0}+\varepsilon\tilde{\rho}}(z)=\partial_{z_{i}}u_{\rho_{0}+\varepsilon\tilde{\rho}}(z)\nu_{\rho_{0}+\varepsilon\tilde{\rho}}^{i}(z)\\ &=\left[\frac{\partial}{\partial y_{i}}u(\rho_{0},y)+\varepsilon\frac{\partial}{\partial y_{i}}p(y)+\varepsilon\frac{\partial}{\partial y_{i}}\frac{\partial}{\partial y_{k}}\left(u(\rho_{0},y)\right)\theta_{*}^{\rho_{0}}\left(\nu_{0}\tilde{\rho}\right)(y)\right]\left[\nu_{\rho_{0}}(y)+\varepsilon\tau(y)\right]^{i}+o(\varepsilon)\\ &=\frac{\partial}{\partial y_{i}}u(\rho_{0},y)\nu_{\rho_{0}}^{i}(y)+\varepsilon\frac{\partial}{\partial y_{i}}u(\rho_{0},y)\tau^{i}(y)+\varepsilon\frac{\partial}{\partial y_{i}}p(y)\nu_{\rho_{0}}^{i}(y)\\ &\quad+\varepsilon\frac{\partial}{\partial y_{i}}\frac{\partial}{\partial y_{k}}\left(u(\rho_{0},y)\right)\theta_{*}^{\rho_{0}}\left(\tilde{\rho}\right)(y)\left[\frac{1}{|\nabla N_{\rho_{0}}|}\nu_{\rho_{0}}+\tau_{\rho_{0}}\right]\nu_{\rho_{0}}^{i}(y)+o(\varepsilon)\\ &=\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)+\varepsilon\frac{\partial}{\partial\nu_{\rho_{0}}}p(y)+\varepsilon\frac{\partial^{2}}{\partial\nu_{\rho_{0}}^{2}}u(\rho_{0},y)\theta_{*}^{\rho_{0}}\left(\tilde{\rho}\right)(y)\frac{1}{|\nabla N_{\rho_{0}}|}\\ &\quad+\varepsilon\frac{\partial}{\partial\tau_{\rho_{0}}}\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)\theta_{*}^{\rho_{0}}\left(\tilde{\rho}\right)(y)+o(\varepsilon)\\ &=\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)+\varepsilon\frac{\partial}{\partial\nu_{\rho_{0}}}p(y)+\varepsilon\left(-1-H_{\rho_{0}}\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)\right)\theta_{*}^{\rho_{0}}\left(\tilde{\rho}\right)(y)\frac{1}{|\nabla N_{\rho_{0}}|}\\ &\quad+\varepsilon\frac{\partial}{\partial\tau_{\rho_{0}}}\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)\theta_{*}^{\rho_{0}}\left(\tilde{\rho}\right)(y)+o(\varepsilon),\end{aligned}

where the term $\varepsilon\frac{\partial}{\partial y_{i}}u(\rho_{0},y)\tau^{i}(y)$ vanishes because $u(\rho_{0},\cdot)=0$ on $\Gamma_{\rho_{0}}$ and $\tau$ is tangential, and where in the last step we used the identity $\Delta u_{\rho_{0}}=\Delta_{\Gamma_{\rho_{0}}}u_{\rho_{0}}+\partial^{2}_{\nu_{\rho_{0}}}u_{\rho_{0}}+H_{\rho_{0}}\partial_{\nu_{\rho_{0}}}u_{\rho_{0}},$ with $\Delta_{\Gamma_{\rho_{0}}}$ being the Laplace-Beltrami operator and $H_{\rho_{0}}$ the mean curvature on $\Gamma_{\rho_{0}}$ . Using the Dirichlet boundary condition for $p$ in (2.4), we arrive at

\displaystyle F(\rho_{0}+\varepsilon\tilde{\rho},g)

\displaystyle=F(\rho_{0},g)+\varepsilon\theta_{\rho_{0}}^{*}\left[\frac{\partial}{\partial\nu_{\rho_{0}}}p+H_{\rho_{0}}p-\frac{\theta_{*}^{\rho_{0}}\tilde{\rho}}{|\nabla N_{\rho_{0}}|}+\frac{\partial}{\partial\tau_{\rho_{0}}}\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)\theta_{*}^{\rho_{0}}\tilde{\rho}\right]+o(\varepsilon).
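The last two steps of this computation can be spelled out as follows. The display only combines the stated identity with $u_{\rho_{0}}=0$ on $\Gamma_{\rho_{0}}$ , $-\Delta u_{\rho_{0}}=1$ and the boundary condition for $p$ in (2.4); the final line is a sanity check on the unit ball, where $u_{0}(x)=\tfrac{1}{2n}(1-|x|^{2})$ and $H_{0}=n-1$ .

```latex
\begin{aligned}
\Delta_{\Gamma_{\rho_{0}}}u_{\rho_{0}}&=0\quad\text{on}\ \Gamma_{\rho_{0}}\quad\text{(since $u_{\rho_{0}}$ vanishes on $\Gamma_{\rho_{0}}$)},\\
\frac{\partial^{2}u_{\rho_{0}}}{\partial\nu_{\rho_{0}}^{2}}
&=\Delta u_{\rho_{0}}-H_{\rho_{0}}\frac{\partial u_{\rho_{0}}}{\partial\nu_{\rho_{0}}}
=-1-H_{\rho_{0}}\frac{\partial u_{\rho_{0}}}{\partial\nu_{\rho_{0}}},\\
\Bigl(-1-H_{\rho_{0}}\frac{\partial u_{\rho_{0}}}{\partial\nu_{\rho_{0}}}\Bigr)
\frac{\theta_{*}^{\rho_{0}}\tilde{\rho}}{|\nabla N_{\rho_{0}}|}
&=-\frac{\theta_{*}^{\rho_{0}}\tilde{\rho}}{|\nabla N_{\rho_{0}}|}+H_{\rho_{0}}\,p,\\
\rho_{0}=0:\qquad \frac{\partial^{2}u_{0}}{\partial\nu_{0}^{2}}&=-\frac{1}{n}=-1-(n-1)\Bigl(-\frac{1}{n}\Bigr).
\end{aligned}
```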

We see that the term

\theta_{\rho_{0}}^{*}\left[\frac{\partial}{\partial\nu_{\rho_{0}}}p+H_{\rho_{0}}p-\frac{\theta_{*}^{\rho_{0}}\tilde{\rho}}{|\nabla N_{\rho_{0}}|}+\frac{\partial}{\partial\tau_{\rho_{0}}}\frac{\partial}{\partial\nu_{\rho_{0}}}u(\rho_{0},y)\theta_{*}^{\rho_{0}}\tilde{\rho}\right]

is well-defined and lies in $h^{1+\alpha}(\mathbb{S})$ even when only assuming $\tilde{\rho}\in h^{2+\alpha}(\mathbb{S})$ . Note that to verify this, one also needs to take into account the impact of the regularity assumption of $\tilde{\rho}$ on the regularity of the solution $p$ of (2.4) as mentioned in Remark 2.5. This gives us the following lemma.

Lemma 2.3.

The mapping $F$ as defined in (2.2) satisfies

(2.5)

F\in C(U_{\gamma,2}\times h^{1+\alpha}(\mathbb{S}),h^{1+\alpha}(\mathbb{S}))\cap C^{1}(U_{\gamma,3}\times h^{1+\alpha}(\mathbb{S}),h^{1+\alpha}(\mathbb{S})).

Furthermore, we have the following.

1.

The $\rho$ -Fréchet-derivative of $F$ at $(\rho_{0},g)\in U_{\gamma,3}\times h^{1+\alpha}(\mathbb{S})$ is

\displaystyle\begin{aligned} &\partial_{\rho}F(\rho_{0},g)\left[\tilde{\rho}\right]\\ &=\theta_{\rho_{0}}^{*}\left[\frac{\partial p}{\partial\nu_{\rho_{0}}}+H_{\rho_{0}}p-\frac{\theta_{*}^{\rho_{0}}\tilde{\rho}}{|\nabla N_{\rho_{0}}|}+\frac{\partial}{\partial\tau_{\rho_{0}}}\left(\frac{\partial u_{\rho_{0}}}{\partial\nu_{\rho_{0}}}\right)\theta_{*}^{\rho_{0}}\tilde{\rho}\right]\in h^{1+\alpha}(\mathbb{S}),\end{aligned}

where $p$ is the unique solution to (2.4), i.e.

\displaystyle\begin{aligned} \Delta p&=0&\quad\text{in }\Omega_{\rho_{0}},\\ p&=-\frac{\partial u_{\rho_{0}}}{\partial\nu_{\rho_{0}}}\theta_{*}^{\rho_{0}}\tilde{\rho}\frac{1}{|\nabla N_{\rho_{0}}|}&\quad\text{on }\Gamma_{\rho_{0}}.\end{aligned}

2.

We have

	$\displaystyle\partial_{\rho}F(0,0)$	$\displaystyle\in\mathcal{L}(h^{3+\alpha}(\mathbb{S}),h^{2+\alpha}(\mathbb{S}))\text{ and }$
	$\displaystyle\partial_{\rho}F(\rho,g)$	$\displaystyle\in\mathcal{L}(h^{3+\alpha}(\mathbb{S}),h^{1+\alpha}(\mathbb{S})).$

3.

The linear operator $\partial_{\rho}F(\rho,g)$ has the extension

$\partial_{\rho}F(\rho,g)\in\mathcal{L}(h^{2+\alpha}(\mathbb{S}),h^{1+\alpha}(\mathbb{S})).$

In view of (2.3), one may verify that for $m=2,3$ and $g\in h^{2+\alpha}(\mathbb{S})$ , one has

F\in C(U_{\gamma,m}\times h^{2+\alpha}(\mathbb{S}),h^{m-1+\alpha}(\mathbb{S}))\cap C^{1}(U_{\gamma,m+1}\times h^{2+\alpha}(\mathbb{S}),h^{m-1+\alpha}(\mathbb{S})).

Remark 2.4.

We have the following characterisation of bijectivity of the $\rho$ -Fréchet-derivative of $F$ at a point $(\rho,g)$ with $F(\rho,g)=0$ , i.e. when $\Omega_{\rho}$ admits a solution to (2.1) for $g\in h^{1+\alpha}(\mathbb{S})$ :

The extended operator $\partial_{\rho}F(\rho,g)$ , with

\partial_{\rho}F\in C\left(U_{\gamma,3}\times{h^{1+\alpha}(\mathbb{S})},\mathcal{L}\left(h^{2+\alpha}(\mathbb{S}),h^{1+\alpha}(\mathbb{S})\right)\right),

has the bounded inverse

\partial_{\rho}F(\rho,g)^{-1}\in\mathcal{L}\left(h^{1+\alpha}(\mathbb{S}),h^{2+\alpha}(\mathbb{S})\right)

if and only if the boundary value problem

(2.6)

\displaystyle\begin{aligned} -\Delta p&=0&\text{ in }\Omega_{\rho}\\ \left(H_{\rho}-\frac{1}{\frac{1}{n}+\theta_{*}^{\rho}g}\right)p+\frac{\partial p}{\partial\nu_{\rho}}&=-\varphi&\text{ on }\Gamma_{\rho}\end{aligned}

is uniquely solvable for any $\varphi\in h^{1+\alpha}(\Gamma_{\rho})$ . Unique solvability of (2.6) is guaranteed provided that $\left(H_{\rho}-\frac{1}{\frac{1}{n}+\theta_{*}^{\rho}g}\right)>0,$ see [11, Thm. 6.31], in which case we would have $\left\|p\right\|_{h^{2+\alpha}(\overline{\Omega}_{\rho})}\leq C\left\|\varphi\right\|_{h^{1+\alpha}(\Gamma_{\rho})}$ . This sign condition does not hold in the current setting. Thus, we have to examine the bijectivity in a different manner.
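To see concretely why the sign condition fails, one can evaluate the coefficient in (2.6) at the unperturbed point $(\rho,g)=(0,0)$ . The following sketch assumes that $H_{\rho}$ denotes the sum of the principal curvatures (as the identity for $\Delta u_{\rho_{0}}$ used in Section 2 suggests), so that $H_{0}=n-1$ on $\mathbb{S}$ ; this normalisation is our reading and not stated explicitly in this part of the text.

```latex
% Coefficient of p in the boundary condition of (2.6), evaluated at (rho, g) = (0, 0).
% Assumption: H_0 = n - 1 on the unit sphere (sum of principal curvatures).
\begin{align*}
\left(H_{\rho}-\frac{1}{\frac{1}{n}+\theta_{*}^{\rho}g}\right)\bigg|_{(\rho,g)=(0,0)}
  &= (n-1)-n = -1 < 0 .
\end{align*}
```

Under this assumption the sufficient condition is already violated at the ball, which is consistent with the non-trivial kernel of $\partial_{\rho}F(0,0)$ computed in Section 3.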

3 Degeneracy of $\partial_{\rho}F$

3.1 Non-Bijectivity of the partial derivative of $F$

We examine the $\rho$ -derivative of $F$ at $(\rho,g)=(0,0)$ , as we merely require the existence of an inverse of $\partial_{\rho}F(0,0)$ to use the modified implicit function theorem, Theorem 4.2. We have

\partial_{\rho}F(0,0)[\tilde{\rho}]=-\frac{1}{n}\tilde{\rho}+\frac{1}{n}\mathscr{N}\tilde{\rho},\quad\text{for }\tilde{\rho}\in h^{3+\alpha}(\mathbb{S}).

Here, $\mathscr{N}$ denotes the Dirichlet-to-Neumann operator on the sphere $\mathbb{S}$ .

Definition 3.1.

Let $\varphi\in h^{l+\alpha}(\mathbb{S})$ , $l\geq 2$ arbitrary. The Dirichlet-to-Neumann operator on the sphere $\mathscr{N}\colon h^{l+\alpha}(\mathbb{S})\to h^{l-1+\alpha}(\mathbb{S})$ is defined as

\mathscr{N}\varphi:=\partial_{\nu}u,\text{ with }u\text{ the unique solution of }\begin{cases}-\Delta u=0&\text{in }\mathbb{B},\\ u=\varphi&\text{on }\mathbb{S}.\end{cases}

We see that

\mathscr{N}\in\mathscr{L}\left(h^{l+\alpha}(\mathbb{S}),h^{l-1+\alpha}(\mathbb{S})\right),\,l\in\mathbb{N}.

It suffices to test bijectivity of $\partial_{\rho}F(0,0)$ for $\tilde{\rho}\in H_{k}$ , $k\in\mathbb{N}\cup\{0\}$ , where

H_{k}=\text{span}\left\{h_{k,j}\,\big{|}\,j=1,\ldots,d_{k}^{n}\right\},\quad d_{k}^{n}<\infty,

is the space of harmonic homogeneous polynomials of degree $k$ , restricted to the unit sphere. Indeed, the $h_{k,j}$ , $k\in\mathbb{N}\cup\{0\},\,j=1,\ldots,d_{k}^{n}$ , form an orthonormal basis of $L^{2}(\mathbb{S})$ , see e.g. [10, Thm. 2.53]. One may show that

\mathscr{H}:=\text{span}\left\{h_{k,j}\,\big{|}\,k\in\mathbb{N}\cup\{0\},\,j=1,\ldots,d_{k}^{n}\right\}

is dense in $h^{l+\alpha}(\mathbb{S})$ , which is why it is sufficient to consider $\tilde{\rho}\in\mathscr{H}$ .

Therefore, let $\tilde{\rho}=h_{k,j}$ be a harmonic homogeneous polynomial on the unit sphere of degree $k\in\mathbb{N}\cup\{0\}$ , with $j\in\{1,\ldots,d_{k}^{n}\}$ . We get

\partial_{\rho}F(0,0)[h_{k,j}]=-\frac{1}{n}h_{k,j}+\frac{k}{n}h_{k,j}=\frac{k-1}{n}h_{k,j}\begin{cases}=0&\text{if }k=1,\\ \neq 0&\text{else}.\end{cases}

This shows that $\partial_{\rho}F(0,0)$ is not bijective and that its kernel is

(3.1)

\text{ker}\left(\partial_{\rho}F(0,0)\right)=\text{span}\left\{h_{1,j}\,\big{|}\,j=1,\ldots,n\right\},

with $\dim(\text{ker}(\partial_{\rho}F(0,0)))=n<\infty$ . Furthermore, we see that the range of $\partial_{\rho}F(0,0)$ is

\text{range}\left(\partial_{\rho}F(0,0)\right)=\overline{\text{span}\left\{h_{k,j}\,\big{|}\,k\in\mathbb{N}_{\geq 2}\cup\{0\},j=1,\ldots,d_{k}^{n}\right\}}^{\left\|\cdot\right\|_{h^{2+\alpha}(\mathbb{S})}}.
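The relation $\mathscr{N}h_{k,j}=k\,h_{k,j}$ behind the computation above follows from homogeneity. A short sketch, writing $\hat{h}_{k,j}$ (our notation, not the paper's) for the harmonic homogeneous polynomial of degree $k$ on $\mathbb{R}^{n}$ whose restriction to $\mathbb{S}$ is $h_{k,j}$ :

```latex
% \hat h_{k,j} is harmonic in the unit ball and equals h_{k,j} on the sphere,
% so it is the harmonic extension appearing in Definition 3.1.
\begin{align*}
\hat{h}_{k,j}(r\sigma) &= r^{k}h_{k,j}(\sigma),
  \qquad 0\le r\le 1,\ \sigma\in\mathbb{S},\\
\mathscr{N}h_{k,j}(\sigma)
  &= \partial_{r}\left(r^{k}h_{k,j}(\sigma)\right)\Big|_{r=1}
   = k\,h_{k,j}(\sigma),\\
\partial_{\rho}F(0,0)[h_{k,j}]
  &= \frac{1}{n}\left(\mathscr{N}-\mathrm{Id}\right)h_{k,j}
   = \frac{k-1}{n}\,h_{k,j}.
\end{align*}
```

The eigenvalue $\frac{k-1}{n}$ vanishes exactly for $k=1$ , which is the source of the kernel in (3.1).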

Notation 3.2.

In view of the calculations to come, we define

X_{l}=\overline{\text{span}\left\{h_{k,j}\,\big{|}\,k\in\mathbb{N}_{\geq 2}\cup\{0\},j=1,\ldots,d_{k}^{n}\right\}}^{\left\|\cdot\right\|_{h^{l+\alpha}(\mathbb{S})}},\text{ for }l\in\mathbb{N}.

Note that this is equivalent to defining

X_{l}:=\left\{\rho\in h^{l+\alpha}(\mathbb{S})\,\big{|}\,\langle{\rho},h_{1,j}\rangle_{L^{2}(\mathbb{S})}=0,\,j=1,\ldots,n\right\},

as in (1.8). To confirm this, also note that $h_{1,j}(x)=\omega_{n}^{-\frac{1}{2}}x_{j}$ , $j=1,\ldots,n$ . Since $X_{l}$ is a finite-codimensional subspace of $h^{l+\alpha}(\mathbb{S})$ , we have $X_{l}\oplus X_{l}^{\bot}=h^{l+\alpha}(\mathbb{S})$ , where $X_{l}^{\bot}$ denotes the orthogonal complement of $X_{l}$ . Because $\dim(X_{l}^{\bot})=n<\infty$ for all $l\in\mathbb{N}$ , we do not need to distinguish between those spaces depending on $l$ , as we have $X_{l}^{\bot}\cong\mathbb{R}^{n}$ . Therefore, we may define

K:=\text{span}\left\{h_{1,j}\,\big{|}\,j=1,\ldots,n\right\}

and get $X_{l}\oplus K=h^{l+\alpha}(\mathbb{S})$ . Finally, we denote

\mathcal{U}_{l}:=\left\{\rho\in U_{\gamma,l}\,\big{|}\,\langle{\rho},{h_{1,j}}\rangle_{L^{2}(\mathbb{S})}=0,\,j=1,\ldots,n\right\},

as well as $\mathcal{U}_{l}^{\bot}$ for the subset of $K$ such that $U_{\gamma,l}=\mathcal{U}_{l}\oplus\mathcal{U}_{l}^{\bot}$ .

Remark 3.3.

That the kernel of $\partial_{\rho}F(0,0)$ is non-trivial is not surprising; it is a direct consequence of the translational invariance of (2.1) with $g=0$ . In this case, the overdetermined problem is solvable in any translated ball $\mathbb{B}+c:=B_{1}(c)$ , with $c\in\mathbb{R}^{n}$ , with solution $u_{c}(x)=\frac{1}{2n}\left(1-|x-c|^{2}\right)$ .

To show the connection to the kernel of $\partial_{\rho}F(0,0)$ , we find $\rho\in U_{\gamma,3}$ such that $\Gamma_{\rho}=\partial\left(\mathbb{B}+c\right)$ . With the ansatz $x+\rho(x)x=y+c,\ x,y\in\mathbb{S},$ we arrive at $\rho(x)=\rho(c,x)=x\cdot c-1\pm\sqrt{1-|c|^{2}+(x\cdot c)^{2}}$ for any $|c|\leq 1$ , where we take the positive root so that $\rho(0,x)=0$ . For $c=te_{j}$ , we then arrive at

\frac{d}{dt}\rho(te_{j},x)\big{|}_{t=0}=\left[x_{j}+\frac{1}{2\sqrt{1-t^{2}+t^{2}x_{j}^{2}}}\left(-2t+2tx_{j}^{2}\right)\right]_{t=0}=x_{j}.

Now, because of the translational invariance of (2.1), we have $F(\rho(c,x),0)=F(0,0)=0$ , which implies

\partial_{\rho}F(0,0)\left[\frac{d}{dt}\left(\rho(te_{j},\cdot)\right)\big{|}_{t=0}\right]=0

and we arrive at $\partial_{\rho}F(0,0)[x_{j}]=0$ for $j\in\{1,\ldots,n\}$ . This coincides with (3.1).
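The explicit formula for $\rho(c,x)$ in Remark 3.3 comes from a quadratic equation. A sketch of the computation, where $s=1+\rho(x)$ is an auxiliary variable introduced here for illustration:

```latex
% The point s x = x + rho(x) x lies on the sphere of radius 1 around c, with |x| = 1.
\begin{align*}
|sx-c|^{2} &= 1,\\
s^{2}-2s\,(x\cdot c)+|c|^{2}-1 &= 0,\\
s &= x\cdot c\pm\sqrt{(x\cdot c)^{2}+1-|c|^{2}},\\
\rho(c,x) &= s-1 = x\cdot c-1+\sqrt{1-|c|^{2}+(x\cdot c)^{2}} .
\end{align*}
% The positive root is selected because rho(0,x) = 0 must hold for c = 0.
% For c = t e_j this reads rho(t e_j, x) = t x_j - 1 + \sqrt{1 - t^2 + t^2 x_j^2}.
```

Differentiating the last expression in $t$ at $t=0$ gives $x_{j}$ , in agreement with the derivative computed in the remark.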

3.2 Re-formulation to eliminate degeneracy

To eliminate the problem of non-bijectivity, i.e. the degeneracy of the problem (2.6), we need to eliminate the translation invariance in the original problem (2.1). Therefore, we replace $F$ as defined in (2.2) by a mapping

G\in C(\mathcal{U}_{2}\times\mathcal{U}_{2}^{\bot}\times X_{1}\times K,X_{1}\times\mathbb{R}^{n}\times K),

\displaystyle\begin{aligned} G&(\rho_{1},\rho_{2},g_{1},g_{2}):=\begin{pmatrix}G_{0}\\ G_{1}\\ \vdots\\ G_{n}\\ G_{n+1}\end{pmatrix}(\rho_{1},\rho_{2},g_{1},g_{2}):=\begin{pmatrix}P_{2}F(\rho_{1}+\rho_{2},g_{1}+g_{2})\\ \int_{\Omega_{\rho_{1}+\rho_{2}}}x_{1}\,\mathrm{d}x\\ \vdots\\ \int_{\Omega_{\rho_{1}+\rho_{2}}}x_{n}\,\mathrm{d}x\\ (\text{Id}-P_{2})F(\rho_{1}+\rho_{2},g_{1}+g_{2})\end{pmatrix}.\end{aligned}

$P_{l}\colon h^{l+\alpha}(\mathbb{S})\to X_{l}$ denotes the projection onto $X_{l}$ . If it is clear which $l\in\mathbb{N}$ is to be used, we write $P$ for $P_{l}$ .

Note that for $m=2,3$ , and $g_{1}\in X_{2}$ , $G$ is also well-defined and we have

\displaystyle G\in C\left(\mathcal{U}_{m}\times\mathcal{U}_{m}^{\bot}\times X_{2}\times K,X_{m-1}\times\mathbb{R}^{n}\times K\right).

By the condition $\int_{\Omega_{\rho}}x_{i}\,\mathrm{d}x=0$ for $i=1,\ldots,n$ , we ensure that the center of mass of $\Omega_{\rho}$ is at the origin and thus eliminate the possibility of translations. Admissible $\rho$ will therefore be in the set

\mathcal{M}=\left\{\rho\in U_{\gamma,3}\,\big{|}\,\int_{\Omega_{\rho}}x_{i}\,\mathrm{d}x=0,\,i=1,\ldots,n\right\}.

As a direct consequence, we have

Lemma 3.4.

$\Omega_{\rho}$ with barycenter zero admits a solution to (2.1) for given $g\in h^{1+\alpha}(\mathbb{S})$ if and only if $G(\rho_{1},\rho_{2},g_{1},g_{2})=0$ , with $\rho=\rho_{1}+\rho_{2}$ and $g=g_{1}+g_{2}$ .

3.3 Bijectivity of the partial derivative of $G$

The mapping $G$ has the following regularity properties.

Lemma 3.5.

$G$ is Fréchet-differentiable as a map from $\mathcal{U}_{3}\times\mathcal{U}_{3}^{\bot}\times K$ to $X_{1}\times\mathbb{R}^{n}\times K$ . We have

\partial_{\rho_{1},\rho_{2},g_{2}}G(\rho_{1},\rho_{2},g_{1},g_{2})\in\mathscr{L}(X_{3}\times K\times K,X_{1}\times\mathbb{R}^{n}\times K),

for $(\rho_{1}+\rho_{2},g_{1}+g_{2})\in U_{\gamma,3}\times h^{1+\alpha}(\mathbb{S})$ , which can be extended to

\partial_{\rho_{1},\rho_{2},g_{2}}G(\rho_{1},\rho_{2},g_{1},g_{2})\in\mathscr{L}(X_{2}\times K\times K,X_{1}\times\mathbb{R}^{n}\times K).

Furthermore,

	$\displaystyle G\in$	$\displaystyle C(\mathcal{U}_{2}\times\mathcal{U}_{2}^{\bot}\times X_{1}\times K,X_{1}\times\mathbb{R}^{n}\times K)$
		$\displaystyle\cap\ C^{1}(\mathcal{U}_{3}\times\mathcal{U}_{3}^{\bot}\times X_{1}\times K,X_{1}\times\mathbb{R}^{n}\times K).$

In view of the application of the modified implicit function theorem introduced in Section 4, we also need the following observation concerning the regularity of $G$ . For $m=2,3$ , we have

(3.2)		$\displaystyle G\in$	$\displaystyle C\left(\mathcal{U}_{m}\times\mathcal{U}_{m}^{\bot}\times X_{2}\times K,X_{m-1}\times\mathbb{R}^{n}\times K\right)$
			$\displaystyle\cap\ C^{1}\left(\mathcal{U}_{m+1}\times\mathcal{U}_{m+1}^{\bot}\times X_{2}\times K,X_{m-1}\times\mathbb{R}^{n}\times K\right),$

as well as for the extension of the partial derivative

(3.3)

\partial_{\rho_{1},\rho_{2},g_{2}}G(\rho_{1},\rho_{2},g_{1},g_{2})\in\mathscr{L}(X_{m}\times{K\times K},X_{m-1}\times\mathbb{R}^{n}\times K),

where $(\rho_{1},\rho_{2},g_{1},g_{2})\in X_{m+1}\times K\times X_{2}\times K$ .

Proof.

We have for $i=1,2$ and $\tilde{\rho}_{1}\in X_{3}\subset h^{3+\alpha}(\mathbb{S})$ , $\tilde{\rho}_{2}\in K\subset h^{3+\alpha}(\mathbb{S})$

	$\displaystyle\partial_{\rho_{i}}G_{0}(\rho_{1},\rho_{2},g_{1},g_{2})[\tilde{\rho}_{i}]$	$\displaystyle=\partial_{\rho_{i}}PF(\rho_{1}+\rho_{2},g_{1}+g_{2})[\tilde{\rho}_{i}]$	$\displaystyle\in X_{1},$
	$\displaystyle\partial_{\rho_{i}}G_{n+1}(\rho_{1},\rho_{2},g_{1},g_{2})[\tilde{\rho}_{i}]$	$\displaystyle=\partial_{\rho_{i}}(\text{Id}-P)F(\rho_{1}+\rho_{2},g_{1}+g_{2})[\tilde{\rho}_{i}]$	$\displaystyle\in K$

and further, with $\rho=\rho_{1}+\rho_{2}$ ,

\partial_{\rho_{i}}G_{j}(\rho_{1},\rho_{2},g_{1},g_{2})[\tilde{\rho}_{i}]=\int_{\Gamma_{\rho_{1}+\rho_{2}}}\frac{1}{|\nabla N_{\rho}|}\theta_{*}^{\rho}\tilde{\rho}_{i}\cdot\sigma_{j}\,\mathrm{d}\sigma,\quad\text{for }j=1,\ldots,n.

These expressions are still well-defined and of the same regularity for $\tilde{\rho}\in h^{2+\alpha}(\mathbb{S})$ , implying the existence of an extension of $\partial_{\rho_{1},\rho_{2}}G(\rho_{1},\rho_{2},g_{1},g_{2})$ onto $X_{2}\times K$ and thus

\partial_{\rho_{1},\rho_{2}}G(\rho_{1},\rho_{2},g_{1},g_{2})\in\mathscr{L}(X_{3}\times K,X_{1}\times\mathbb{R}^{n}\times K)\cap\mathscr{L}(X_{2}\times K,X_{1}\times\mathbb{R}^{n}\times K).

For the $g_{2}$ -partial derivative and $\tilde{g_{2}}\in K$ , we get

	$\displaystyle\partial_{g_{2}}G_{0}(\rho_{1},\rho_{2},g_{1},g_{2})[\tilde{g_{2}}]=\partial_{g_{2}}PF(\rho_{1}+\rho_{2},g_{1}+g_{2})[\tilde{g_{2}}],$
	$\displaystyle\partial_{g_{2}}G_{j}(\rho_{1},\rho_{2},g_{1},g_{2})[\tilde{g_{2}}]=0\quad\text{and}$
	$\displaystyle\partial_{g_{2}}G_{n+1}(\rho_{1},\rho_{2},g_{1},g_{2})[\tilde{g_{2}}]=\partial_{g_{2}}(\text{Id}-P)F(\rho_{1}+\rho_{2},g_{1}+g_{2})[\tilde{g_{2}}].$

This implies $\partial_{g_{2}}G(\rho_{1},\rho_{2},g_{1},g_{2})\in\mathscr{L}(K,X_{1}\times\mathbb{R}^{n}\times K)$ . ∎

At zero, the partial derivative $\partial_{\rho_{1},\rho_{2},g_{2}}G$ is bijective. We abbreviate $(0,0,0,0)$ by $(0)$ .

Lemma 3.6.

$G$ is Fréchet-differentiable at $(\rho_{1},\rho_{2},g_{1},g_{2})=(0)$ , and we have

\displaystyle\partial_{\rho_{1},\rho_{2},g_{2}}G(0)\in\mathscr{L}(X_{3}\times K\times K,X_{2}\times\mathbb{R}^{n}\times K)

with

(3.4)

\displaystyle\begin{aligned} \partial_{\rho_{1},\rho_{2},g_{2}}G(0)[\tilde{\rho}_{1},\tilde{\rho}_{2},\tilde{g}_{2}]&=\begin{pmatrix}\frac{1}{n}\left(\mathscr{N}-\text{Id}\right)\tilde{\rho}_{1}\\ \omega_{n}^{1/2}\tilde{\rho}_{2}\\ \tilde{g}_{2}\end{pmatrix}.\end{aligned}

Indeed, for arbitrary $m\in\mathbb{N}$ , we have

\partial_{\rho_{1},\rho_{2},g_{2}}G(0)\in\mathscr{L}(X_{m+1}\times K\times K,X_{m}\times\mathbb{R}^{n}\times K).

Further, $\partial_{\rho_{1},\rho_{2},g_{2}}G(0)$ is invertible with

\partial_{\rho_{1},\rho_{2},g_{2}}G(0)^{-1}\in\mathscr{L}(X_{m}\times\mathbb{R}^{n}\times K,X_{m+1}\times K\times K)

for $m\in\mathbb{N}$ , and

(3.5)

\partial_{\rho_{1},\rho_{2},g_{2}}G(0)^{-1}[\phi,\alpha_{1},\ldots,\alpha_{n},\psi]=\begin{pmatrix}n\left(\left(\mathscr{N}-1\right)\big{|}_{X_{2}}\right)^{-1}\phi\\ \omega_{n}^{-1/2}\sum_{j=1}^{n}\alpha_{j}h_{1,j}\\ \psi\end{pmatrix}\in X_{m+1}\times K\times K

where $\phi\in X_{m}$ , $\alpha_{j}\in\mathbb{R}$ for $j=1,\ldots,n$ and $\psi\in K$ .

Proof.

Let $m\in\mathbb{N}$ . Let $(\tilde{\rho}_{1},\tilde{\rho}_{2},\tilde{g}_{2})\in X_{m+1}\times K\times K$ and $j=1,\ldots,n$ . We calculate

	$\displaystyle\partial_{\rho_{1}}G_{0}(0)[\tilde{\rho}_{1}]=\partial_{\rho_{1}}PF(0)[\tilde{\rho}_{1}]=\frac{1}{n}\left(\mathscr{N}-\text{Id}\right)\tilde{\rho}_{1},$
	$\displaystyle\partial_{\rho_{2}}G_{0}(0)[\tilde{\rho}_{2}]=\partial_{\rho_{2}}PF(0)[\tilde{\rho}_{2}]=0,$
	$\displaystyle\partial_{\rho_{1}}G_{j}(0)[\tilde{\rho}_{1}]=\int_{\mathbb{S}}\tilde{\rho}_{1}\sigma_{j}\,\mathrm{d}\sigma=\omega_{n}^{1/2}\int_{\mathbb{S}}\tilde{\rho}_{1}h_{1,j}\,\mathrm{d}\sigma=0,$
	$\displaystyle\partial_{\rho_{2}}G_{j}(0)[\tilde{\rho}_{2}]=\int_{\mathbb{S}}\tilde{\rho}_{2}\sigma_{j}\,\mathrm{d}\sigma=\omega_{n}^{1/2}\int_{\mathbb{S}}\tilde{\rho}_{2}h_{1,j}\,\mathrm{d}\sigma=\omega_{n}^{1/2}\left(\tilde{\rho}_{2}\right)_{j},$
	$\displaystyle\partial_{\rho_{i}}G_{n+1}(0)[\tilde{\rho}_{i}]=\partial_{\rho_{i}}(\text{Id}-P)F(0)[\tilde{\rho}_{i}]=0,\quad\text{for }i=1,2.$

$\mathscr{N}$ again denotes the Dirichlet-to-Neumann operator. Recall that the linear operator $\left(\mathscr{N}-\text{Id}\right)$ is bijective as an operator in $\mathscr{L}(X_{m+1},X_{m})$ . For the $g_{2}$ -partial derivative, we find

	$\displaystyle\partial_{g_{2}}G_{0}(0)[\tilde{g_{2}}]=\partial_{g_{2}}PF(0)[\tilde{g_{2}}]=0,$
	$\displaystyle\partial_{g_{2}}G_{j}(0)[\tilde{g_{2}}]=0\quad\text{and}$
	$\displaystyle\partial_{g_{2}}G_{n+1}(0)[\tilde{g_{2}}]=\partial_{g_{2}}(\text{Id}-P)F(0)[\tilde{g_{2}}]=\tilde{g}_{2}.$

This implies (3.4). We directly arrive at (3.5) and also at the regularity properties of $\partial_{\rho_{1},\rho_{2},g_{2}}G(0)^{-1}$ . ∎
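The first component of (3.5) can be made explicit on spherical harmonics. The sketch below is formal: it assumes that the expansion of $\phi$ converges in a suitable sense, and the coefficients $\phi_{k,j}$ are notation introduced here rather than taken from the text.

```latex
% phi in X_m has no component in K = H_1, so the sum runs over k != 1.
\begin{align*}
\phi &= \sum_{\substack{k\in\mathbb{N}\cup\{0\}\\ k\neq 1}}\ \sum_{j=1}^{d_{k}^{n}}
        \phi_{k,j}\,h_{k,j},
  \qquad \phi_{k,j}=\langle\phi,h_{k,j}\rangle_{L^{2}(\mathbb{S})},\\
n\left(\mathscr{N}-\mathrm{Id}\right)^{-1}\phi
  &= \sum_{\substack{k\in\mathbb{N}\cup\{0\}\\ k\neq 1}}\ \sum_{j=1}^{d_{k}^{n}}
        \frac{n}{k-1}\,\phi_{k,j}\,h_{k,j}.
\end{align*}
% Each H_k with k != 1 is an eigenspace of (1/n)(N - Id) with eigenvalue (k-1)/n,
% so the inverse multiplies the k-th component by n/(k-1).
```

The multiplier $\frac{n}{k-1}$ equals $-n$ for $k=0$ , is at most $n$ for $k\geq 2$ , and decays like $1/k$ , which is consistent with the inverse mapping $X_{m}$ into $X_{m+1}$ .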

4 A modified implicit function theorem

To arrive at the existence, uniqueness and at a stability result for Problem 2.1, we introduce a modified version of the implicit function theorem, Theorem 4.2. Because of the regularity issues stated in Remark 2.5, we are not able to apply the classical implicit function theorem. In preparation, we need

Theorem 4.1.

Assume the following.

(I)

Let $\mathcal{X}_{0},\mathcal{X}_{1},\mathcal{Y},\mathcal{Z}_{0},\mathcal{Z}_{1}$ be Banach spaces with $\mathcal{X}_{1}\hookrightarrow\mathcal{X}_{0}$ and $\mathcal{Z}_{1}\hookrightarrow\mathcal{Z}_{0}$ . Let $D_{1}\subset D_{0}$ be open sets such that $(0,0)\in D_{j}\subset\mathcal{X}_{j}\times\mathcal{Y}$ for $j=0,1$ .
(II)

Let $F\in C^{1}(D_{1},\mathcal{Z}_{0})\cap C(D_{0},\mathcal{Z}_{0})$ with $F(0,0)=0$ and $\partial_{x}F\in C(D_{1},\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0}))$ , which is to be understood such that for $(x,y)\in D_{1}$ , the partial derivative $\partial_{x}F(x,y)\in\mathscr{L}(\mathcal{X}_{1},\mathcal{Z}_{0})$ can be extended to $\overline{\partial_{x}F}(x,y)\in\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})$ and $\overline{\partial_{x}F}\in C(D_{1},\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0}))$ .
(III)

We have $F\colon D_{1}\to\mathcal{Z}_{1}$ and $F$ is Fréchet-differentiable at $(0,0)$ , hence $\partial_{x}F(0,0)\in\mathscr{L}(\mathcal{X}_{1},\mathcal{Z}_{1})$ and $\partial_{y}F(0,0)\in\mathscr{L}(\mathcal{Y},\mathcal{Z}_{1})$ .
(IV)

The inverse $\partial_{x}F(0,0)^{-1}\in\mathscr{L}(\mathcal{Z}_{1},\mathcal{X}_{1})\cap\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})$ exists.

Then there exist neighbourhoods of zero $0\in U_{0}\subset\mathcal{X}_{0}$ , $0\in U_{1}\subset\mathcal{X}_{1}$ and $0\in V\subset\mathcal{Y}$ , as well as a function $u\colon V\to U_{0}$ such that

(i)

$F(u(y),y)=0$ for all $y\in V$ , $u(0)=0$ , and
(ii)

for $x_{1},x_{2}\in U_{1}$ , $y\in V$ such that $F(x_{i},y)=0$ for $i=1,2$ , we have $x_{1}=x_{2}$ .

Proof.

Let $\varepsilon,\delta>0$ – we will redefine both later – and define

	$\displaystyle U_{1}:=\left\{x\in\mathcal{X}_{1}\,\big{|}\,\left\|x\right\|_{\mathcal{X}_{1}}\leq\varepsilon\right\},$
	$\displaystyle U_{0}:=\left\{x\in\mathcal{X}_{0}\,\big{|}\,\left\|x\right\|_{\mathcal{X}_{0}}\leq C\varepsilon\right\},$
	$\displaystyle V:=\left\{y\in\mathcal{Y}\,\big{|}\,\left\|y\right\|_{\mathcal{Y}}<\delta\right\},$

with $C>0$ a constant satisfying $\left\|x\right\|_{\mathcal{X}_{0}}\leq C\left\|x\right\|_{\mathcal{X}_{1}}$ , thus $U_{1}\subset U_{0}$ .

Step 1: Show that for all $y\in V$ , the function

\Phi_{y}(x):=x-\partial_{x}F(0,0)^{-1}F(x,y)=\partial_{x}F(0,0)^{-1}\left(\partial_{x}F(0,0)x-F(x,y)\right)

is a contraction mapping from $\left(U_{1},\left\|\cdot\right\|_{\mathcal{X}_{0}}\right)$ to itself, provided that $\varepsilon,\delta>0$ are sufficiently small.

As the fundamental theorem of calculus holds on Banach spaces as well, we have for $F\in C^{1}(D_{1},\mathcal{Z}_{0})$ and for all $x_{1},x_{2}\in D_{1}$

(4.1)

F(x_{1},y)-F(x_{2},y)=\int_{0}^{1}\partial_{x}F(x_{2}+t(x_{1}-x_{2}),y)(x_{1}-x_{2})\,\mathrm{d}t.

Note that $\partial_{x}F(x_{2}+t(x_{1}-x_{2}),y)\in\mathscr{L}(\mathcal{X}_{1},\mathcal{Z}_{0})$ with extension in $\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})$ .

Now let $x_{1},x_{2}\in U_{1}$ , $y\in V$ . Then $(x_{j},y)\in D_{1}$ for $j=1,2$ , and we use (4.1) to arrive at

\Phi_{y}(x_{1})-\Phi_{y}(x_{2})=\partial_{x}F(0,0)^{-1}\left[\int_{0}^{1}\left(\partial_{x}F(0,0)-\partial_{x}F(x_{2}+t(x_{1}-x_{2}),y)\right)\,\mathrm{d}t(x_{1}-x_{2})\right].

By choosing $\varepsilon,\delta>0$ smaller, if necessary, we get

(4.2)

\displaystyle\begin{aligned} \left\|\Phi_{y}(x_{1})-\Phi_{y}(x_{2})\right\|_{\mathcal{X}_{0}}&\leq\left\|\partial_{x}F(0,0)^{-1}\right\|_{\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})}\left\|x_{1}-x_{2}\right\|_{\mathcal{X}_{0}}\\ &\quad\cdot\sup_{0\leq t\leq 1}\left\|\partial_{x}F(0,0)-\partial_{x}F(x_{2}+t(x_{1}-x_{2}),y)\right\|_{\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})}\\ &\leq\frac{1}{2}\left\|x_{1}-x_{2}\right\|_{\mathcal{X}_{0}},\end{aligned}

where the second inequality holds because of the condition $\partial_{x}F\in C(D_{1},\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0}))$ in assumption (II), which for sufficiently small $\varepsilon,\delta>0$ implies

\sup_{0\leq t\leq 1}\left\|\partial_{x}F(0,0)-\partial_{x}F(x_{2}+t(x_{1}-x_{2}),y)\right\|_{\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})}\ll 1.

Next, we show that $\Phi_{y}(x)\in U_{1}$ for $x\in U_{1}$ and $y\in V$ . To this end, we estimate $\left\|\Phi_{y}(x)\right\|_{\mathcal{X}_{1}}$ : choosing $\delta=\delta(\varepsilon)>0$ smaller, if necessary, we obtain

	$\displaystyle\left\|\Phi_{y}(x)\right\|_{\mathcal{X}_{1}}$	$\displaystyle\leq\left\|\partial_{x}F(0,0)^{-1}\right\|_{\mathscr{L}(\mathcal{Z}_{1},\mathcal{X}_{1})}\left\|\partial_{x}F(0,0)x-F(x,y)\right\|_{\mathcal{Z}_{1}}$
		$\displaystyle\leq\left\|\partial_{x}F(0,0)^{-1}\right\|_{\mathscr{L}(\mathcal{Z}_{1},\mathcal{X}_{1})}\left\|\partial_{y}F(0,0)\right\|_{\mathscr{L}(\mathcal{Y},\mathcal{Z}_{1})}\delta+o(\varepsilon+\delta(\varepsilon))$
		$\displaystyle\leq\varepsilon.$
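The second inequality above relies on the Fréchet-differentiability of $F\colon D_{1}\to\mathcal{Z}_{1}$ at $(0,0)$ from assumption (III). One possible way to make the choice of $\delta$ explicit is sketched below; the remainder $R$ and the constant $M$ are names introduced here for illustration only.

```latex
% Expansion at (0,0) in Z_1, using F(0,0) = 0; R is the Frechet remainder.
% M := \|\partial_x F(0,0)^{-1}\|_{\mathscr{L}(\mathcal{Z}_1,\mathcal{X}_1)}.
\begin{align*}
F(x,y) &= \partial_{x}F(0,0)x+\partial_{y}F(0,0)y+R(x,y),\\
\|R(x,y)\|_{\mathcal{Z}_{1}}
  &\le \frac{1}{4M}\left(\|x\|_{\mathcal{X}_{1}}+\|y\|_{\mathcal{Y}}\right)
  \quad\text{for } \|x\|_{\mathcal{X}_{1}}\le\varepsilon,\
  \|y\|_{\mathcal{Y}}<\delta\le\varepsilon,
\end{align*}
% which holds once \varepsilon is small, since R = o(\|x\| + \|y\|).
% If in addition
%   M \|\partial_y F(0,0)\|_{\mathscr{L}(\mathcal{Y},\mathcal{Z}_1)} \delta \le \varepsilon/4,
% then
\begin{align*}
\|\Phi_{y}(x)\|_{\mathcal{X}_{1}}
  &= \left\|\partial_{x}F(0,0)^{-1}\left(\partial_{y}F(0,0)y+R(x,y)\right)\right\|_{\mathcal{X}_{1}}\\
  &\le M\left(\|\partial_{y}F(0,0)\|_{\mathscr{L}(\mathcal{Y},\mathcal{Z}_{1})}\,\delta
        +\frac{\varepsilon+\delta}{4M}\right)
   \le \frac{\varepsilon}{4}+\frac{\varepsilon}{2}\le\varepsilon .
\end{align*}
```

This is only one admissible choice of constants; any $\delta(\varepsilon)$ that makes both terms small relative to $\varepsilon$ serves the same purpose.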

Step 2: Construct a mapping $u\colon V\to U_{0}$ .

Let $y\in V$ be arbitrary but fixed. The inductively defined sequence $\left(x_{j}\right)_{j\in\mathbb{N}_{0}}$ with $x_{0}:=0$ , $x_{j+1}:=\Phi_{y}(x_{j})\in U_{1}\subset U_{0}$ for $j\in\mathbb{N}_{0}$ is a Cauchy sequence with respect to $\left\|\cdot\right\|_{\mathcal{X}_{0}}$ by (4.2) and thus converges in $\left\|\cdot\right\|_{\mathcal{X}_{0}}$ to some $x_{\infty}\in U_{0}$ . Because $F\in C(D_{0},\mathcal{Z}_{0})$ , this implies

\left\|F(x_{\infty},y)\right\|_{\mathcal{Z}_{0}}=\lim_{j\to\infty}\left\|F(x_{j},y)\right\|_{\mathcal{Z}_{0}}\leq\lim_{j\to\infty}\left\|\partial_{x}F(0,0)\right\|_{\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})}\left\|x_{j}-x_{j+1}\right\|_{\mathcal{X}_{0}}=0,

where we used $F(x_{j},y)=\partial_{x}F(0,0)(x_{j}-x_{j+1})$ for $j\in\mathbb{N}_{0}$ by definition of $\Phi_{y}(x)$ . We set $u(y):=x_{\infty}\in U_{0}$ for $y\in V$ .
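The Cauchy property of the iterates in Step 2 is the standard geometric-series argument for contractions. A brief sketch, using only the contraction estimate (4.2) and $x_{0}=0$ :

```latex
% Successive differences shrink by a factor 1/2 in the X_0-norm (valid for j >= 1).
\begin{align*}
\|x_{j+1}-x_{j}\|_{\mathcal{X}_{0}}
  &= \|\Phi_{y}(x_{j})-\Phi_{y}(x_{j-1})\|_{\mathcal{X}_{0}}
   \le \tfrac{1}{2}\|x_{j}-x_{j-1}\|_{\mathcal{X}_{0}}
   \le\dots\le 2^{-j}\|x_{1}-x_{0}\|_{\mathcal{X}_{0}},\\
\|x_{j+m}-x_{j}\|_{\mathcal{X}_{0}}
  &\le \sum_{i=j}^{j+m-1}\|x_{i+1}-x_{i}\|_{\mathcal{X}_{0}}
   \le 2^{1-j}\|x_{1}\|_{\mathcal{X}_{0}}
   \longrightarrow 0 \quad (j\to\infty),
\end{align*}
% uniformly in m, so (x_j) is Cauchy in X_0.
```

Note that the convergence is only in $\mathcal{X}_{0}$ , although every iterate lies in $U_{1}\subset\mathcal{X}_{1}$ ; this is why the limit is a priori only known to lie in $U_{0}$ .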

Step 3: Show (ii) of the theorem.

If $x_{1},x_{2}\in U_{1}$ and $y\in V$ with $F(x_{j},y)=0$ for $j=1,2$ , then

\left\|x_{1}-x_{2}\right\|_{\mathcal{X}_{0}}=\left\|\Phi_{y}(x_{1})-\Phi_{y}(x_{2})\right\|_{\mathcal{X}_{0}}\overset{(4.2)}{\leq}\frac{1}{2}\left\|x_{1}-x_{2}\right\|_{\mathcal{X}_{0}},

and therefore $x_{1}=x_{2}$ . ∎

Theorem 4.2.

Assume the following.

(I)

Consider Banach spaces $\mathcal{X}_{2}\hookrightarrow\mathcal{X}_{1}\hookrightarrow\mathcal{X}_{0}$ , $\mathcal{Z}_{2}\hookrightarrow\mathcal{Z}_{1}\hookrightarrow\mathcal{Z}_{0}$ and $\mathcal{Y}$ . Let $D_{2}\subset D_{1}\subset D_{0}$ be open sets such that $(0,0)\in D_{j}\subset\mathcal{X}_{j}\times\mathcal{Y}$ for $j=0,1,2$ .
(II)

For $j=1,2$ , let $F\in C^{1}(D_{j},\mathcal{Z}_{j-1})\cap C(D_{j-1},\mathcal{Z}_{j-1})$ with $F(0,0)=0$ and further $\partial_{x}F\in C(D_{j},\mathscr{L}(\mathcal{X}_{j-1},\mathcal{Z}_{j-1}))$ . This is to be understood such that for $(x,y)\in D_{j}$ , the partial derivative $\partial_{x}F(x,y)\in\mathscr{L}(\mathcal{X}_{j},\mathcal{Z}_{j-1})$ can be extended to $\overline{\partial_{x}F}(x,y)\in\mathscr{L}(\mathcal{X}_{j-1},\mathcal{Z}_{j-1})$ and $\overline{\partial_{x}F}\in C(D_{j},\mathscr{L}(\mathcal{X}_{j-1},\mathcal{Z}_{j-1}))$ .
(III)

For $j=1,2$ , the mapping $F\colon D_{j}\to\mathcal{Z}_{j}$ is Fréchet-differentiable at $(0,0)$ .
(IV)

For $j=0,1,2$ , the inverse $\partial_{x}F(0,0)^{-1}\in\mathscr{L}(\mathcal{Z}_{j},\mathcal{X}_{j})$ exists.

Then there exist neighbourhoods of zero $0\in U_{0}\subset\mathcal{X}_{0}$ , $0\in U_{1}\subset\mathcal{X}_{1}$ and $0\in V\subset\mathcal{Y}$ such that there is a function $u\colon V\to U_{1}$ satisfying

(i)

$F(u(y),y)=0$ for all $y\in V$ , $u(0)=0$ ,
(ii)

if $x\in U_{1}$ , $y\in V$ such that $F(x,y)=0$ , then $x=u(y)$ , and
(iii)

we have $u\in C^{1}(V,\mathcal{X}_{0})$ , and

$u^{\prime}(y)=-\partial_{x}F(u(y),y)^{-1}\partial_{y}F(u(y),y),$

with $\partial_{x}F(u(y),y)^{-1}\in\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})$ and $\partial_{y}F(u(y),y)\in\mathscr{L}(\mathcal{Y},\mathcal{Z}_{0})$ .

Proof.

Step 1: Existence and uniqueness of $u$ .

Applying Theorem 4.1 twice, we arrive at the existence of neighbourhoods $0\in V\subset\mathcal{Y}$ , $0\in U_{j}\subset\mathcal{X}_{j}$ , $j=0,1$ , and at the existence of a mapping $u\colon V\to U_{1}$ such that

$\circ$

$F(u(y),y)=0$ for $y\in V$ , $u(0)=0$ , and
$\circ$

for $x_{1},x_{2}\in U_{1}$ , $y\in V$ with $F(x_{i},y)=0$ , $i=1,2$ , we have $x_{1}=x_{2}$ .

Thus, for $x\in U_{1}$ and $y\in V$ such that $F(x,y)=0$ , we have $x=u(y)$ . This shows (i) and (ii) of Theorem 4.2.

Step 2: Show Lipschitz-continuity of $u$ , i.e. $u\in C^{0,1}(V,U_{0})$ .

Consider $y_{1},y_{2}\in V$ . Then with $u$ as above, i.e. $u(y_{i})\in U_{1}$ for $i=1,2$ , we have

	$\displaystyle\left\|u(y_{1})-u(y_{2})\right\|_{\mathcal{X}_{0}}$	$\displaystyle=\left\|\Phi_{y_{1}}(u(y_{1}))-\Phi_{y_{2}}(u(y_{2}))\right\|_{\mathcal{X}_{0}}$
		$\displaystyle\leq\left\|\Phi_{y_{1}}(u(y_{1}))-\Phi_{y_{1}}(u(y_{2}))\right\|_{\mathcal{X}_{0}}+\left\|\Phi_{y_{1}}(u(y_{2}))-\Phi_{y_{2}}(u(y_{2}))\right\|_{\mathcal{X}_{0}}$
		$\displaystyle\overset{(4.2)}{\leq}\frac{1}{2}\left\|u(y_{1})-u(y_{2})\right\|_{\mathcal{X}_{0}}$
		$\displaystyle\quad+\left\|\partial_{x}F(0,0)^{-1}\right\|_{\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})}\left\|F(u(y_{2}),y_{1})-F(u(y_{2}),y_{2})\right\|_{\mathcal{Z}_{0}}.$

This implies

	$\displaystyle\left\|u(y_{1})-u(y_{2})\right\|_{\mathcal{X}_{0}}$	$\displaystyle\leq 2\left\|\partial_{x}F(0,0)^{-1}\right\|_{\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})}$
		$\displaystyle\quad\cdot\sup_{0\leq t\leq 1}\left\|\partial_{y}F(u(y_{2}),y_{2}+t(y_{1}-y_{2}))\right\|_{\mathscr{L}(\mathcal{Y},\mathcal{Z}_{0})}\left\|y_{1}-y_{2}\right\|_{\mathcal{Y}}$
		$\displaystyle\leq L\left\|y_{1}-y_{2}\right\|_{\mathcal{Y}},$

because the $\sup$ -term is uniformly bounded in $V$ for sufficiently small $\varepsilon,\delta>0$ defining the sets as in the proof of Theorem 4.1, since $\partial_{y}F$ is continuous around $(0,0)$ .

Step 3: Show $u\in C^{1}(V,\mathcal{X}_{0})$ .

We have for $y\in V$ and $h\in\mathcal{Y}$ such that $y+h\in V$

	$\displaystyle 0$	$\displaystyle=F(u(y+h),y+h)-F(u(y),y)$
		$\displaystyle=F(u(y+h),y+h)-F(u(y+h),y)+F(u(y+h),y)-F(u(y),y)$
		$\displaystyle=\partial_{y}F(u(y+h),y)[h]+o_{\mathcal{Z}_{0}}(\left\|h\right\|_{\mathcal{Y}})+F(u(y+h),y)-F(u(y),y)$
		$\displaystyle=\partial_{y}F(u(y),y)[h]+\left(\partial_{y}F(u(y+h),y)-\partial_{y}F(u(y),y)\right)[h]+o_{\mathcal{Z}_{0}}(\left\|h\right\|_{\mathcal{Y}})$
		$\displaystyle\quad+F(u(y+h),y)-F(u(y),y)$
		$\displaystyle=\partial_{y}F(u(y),y)[h]+o_{\mathcal{Z}_{0}}(\left\|h\right\|_{\mathcal{Y}})$
		$\displaystyle\quad+\int_{0}^{1}\partial_{x}F(u(y)+t(u(y+h)-u(y)),y)\,\mathrm{d}t[u(y+h)-u(y)],$

and further

	$\displaystyle\int_{0}^{1}\partial_{x}F(u(y)+t(u(y+h)-u(y)),y)\,\mathrm{d}t[u(y+h)-u(y)]$
	$\displaystyle=\partial_{x}F(u(y),y)[u(y+h)-u(y)]+o_{\mathcal{Z}_{0}}(\left\|h\right\|_{\mathcal{Y}}).$

Therefore,

u(y+h)-u(y)=-\partial_{x}F(u(y),y)^{-1}\partial_{y}F(u(y),y)[h]+o_{\mathcal{X}_{0}}(\left\|h\right\|_{\mathcal{Y}}),

yielding $u\in C^{1}(V,\mathcal{X}_{0})$ and (iii).

Note that $\partial_{x}F(u(y),y)\in\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})$ is invertible for $y\in V$ , since $\partial_{x}F(0,0)$ is invertible and $\left\|u(y)\right\|_{\mathcal{X}_{1}}+\left\|y\right\|_{\mathcal{Y}}\ll 1$ . ∎
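The invertibility of $\partial_{x}F(u(y),y)$ noted at the end of the proof can be justified by a Neumann series. The sketch below uses the abbreviations $A_{0}$ , $A$ and $T$ , which are introduced here for illustration, and assumes the same smallness of $\varepsilon,\delta$ that yields (4.2).

```latex
% A_0 := extension of \partial_x F(0,0),  A := extension of \partial_x F(u(y),y),
% both regarded as elements of \mathscr{L}(\mathcal{X}_0,\mathcal{Z}_0).
\begin{align*}
A &= A_{0}\left(\mathrm{Id}-T\right), \qquad T:=A_{0}^{-1}\left(A_{0}-A\right),\\
\|T\|_{\mathscr{L}(\mathcal{X}_{0},\mathcal{X}_{0})}
  &\le \|A_{0}^{-1}\|_{\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})}
       \|A_{0}-A\|_{\mathscr{L}(\mathcal{X}_{0},\mathcal{Z}_{0})}
   \le \tfrac{1}{2},\\
A^{-1} &= \left(\mathrm{Id}-T\right)^{-1}A_{0}^{-1}
        = \sum_{k=0}^{\infty}T^{k}A_{0}^{-1},
  \qquad
  \|A^{-1}\|_{\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})}
   \le 2\,\|A_{0}^{-1}\|_{\mathscr{L}(\mathcal{Z}_{0},\mathcal{X}_{0})}.
\end{align*}
```

In particular, the inverse is bounded uniformly for $y\in V$ , which is what the formula for $u^{\prime}(y)$ in (iii) requires.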

5 Proof of Theorem 1.1

With the tool of the modified implicit function theorem, Theorem 4.2, at hand, we are now able to prove Theorem 1.1, that is, the existence and uniqueness of admissible sets $\Omega_{\rho}$ with barycenter zero that solve the perturbed overdetermined problem (2.1), as well as a stability estimate.

Remark 5.1.

Considering the somewhat unintuitive partial derivative $\partial_{\rho_{1},\rho_{2},g_{2}}$ in Lemma 3.6 was necessary to arrive at bijectivity and to be able to apply Theorem 4.2. The partial derivative $\partial_{\rho}G(0)$ is not bijective.

In addition to that, keeping in mind the nature of the problem discussed in Section 3, the set $\Omega_{\rho}$ will only depend on the perturbations that do not induce a mere translation of the problem. $\rho$ depending on $g_{1}$ (instead of $g$ ) is a consequence of that setting.

Proof of Theorem 1.1.

We confirm the requirements for Theorem 4.2. For (I), we set

	$\displaystyle\mathcal{X}_{j}$	$\displaystyle=h^{j+2+\alpha}(\mathbb{S})\times X_{2}^{\bot}=h^{j+2+\alpha}(\mathbb{S})\times K\text{ and }$
	$\displaystyle\mathcal{Z}_{j}$	$\displaystyle=X_{j+1}\times\mathbb{R}^{n}\times X_{2}^{\bot}=X_{j+1}\times\mathbb{R}^{n}\times K$

for $j=0,1,2$ , $\mathcal{Y}=X_{2}$ , and $D_{j}$ accordingly. By Lemma 3.5, (3.2) and (3.3), (II) is satisfied. Lemma 3.6 implies (III) and (IV).

Thus, Theorem 4.2 implies the existence of a neighbourhood $0\in V\subset X_{2}$ as well as neighbourhoods $0\in U_{j}\subset h^{j+\alpha}(\mathbb{S})\times X_{2}^{\bot}=h^{j+\alpha}(\mathbb{S})\times K$ , $j=2,3$ , such that there is a function $(\rho,g_{2})\colon V\to U_{3}$ with $G(\rho(g_{1}),g_{1}+g_{2}(g_{1}))=0$ for all $g_{1}\in V$ and $(\rho,g_{2})(0)=0$ .

Furthermore, $(\rho,g_{2})$ is unique in $U_{3}$ and we have $(\rho,g_{2})\in C^{1}(V,U_{2})$ . Differentiating $G(\rho(g_{1}),g_{1}+g_{2}(g_{1}))=0$ with respect to $g_{1}$ and evaluating it at $g_{1}=0$ in direction $\tilde{g}_{1}$ , we get

	$\displaystyle 0$	$\displaystyle=D_{g_{1}}G\left(\rho_{1}(g_{1}),\rho_{2}(g_{1}),g_{1}+g_{2}(g_{1})\right)\big{|}_{(0)}[\tilde{g}_{1}]$
		$\displaystyle=D_{\rho_{1},\rho_{2},g_{2}}G(0)\left[\partial_{g_{1}}\rho_{1}(0)[\tilde{g}_{1}],\partial_{g_{1}}\rho_{2}(0)[\tilde{g}_{1}],\partial_{g_{1}}g_{2}(0)[\tilde{g}_{1}]\right]+\partial_{g_{1}}G(0)[\tilde{g}_{1}]$
		$\displaystyle=\begin{pmatrix}\partial_{\rho_{1}}F(0)[\partial_{g_{1}}\rho_{1}(0)[\tilde{g}_{1}]]\\ \int_{\mathbb{S}}\sigma_{1}\partial_{g_{1}}\rho_{2}(0)[\tilde{g}_{1}]\,\mathrm{d}\sigma\\ \vdots\\ \int_{\mathbb{S}}\sigma_{n}\partial_{g_{1}}\rho_{2}(0)[\tilde{g}_{1}]\,\mathrm{d}\sigma\\ \partial_{g_{1}}g_{2}(0)[\tilde{g}_{1}]\end{pmatrix}+\begin{pmatrix}\tilde{g}_{1}\\ 0\\ 0\end{pmatrix}.$

This yields

\displaystyle\begin{aligned} \partial_{g_{1}}\rho_{1}(0)[\tilde{g}_{1}]&=-\partial_{\rho_{1}}F(0)^{-1}[\tilde{g}_{1}],\\ \partial_{g_{1}}\rho_{2}(0)[\tilde{g}_{1}]&=0,\text{ and}\\ \partial_{g_{1}}g_{2}(0)[\tilde{g}_{1}]&=0,\end{aligned}

where the second equation results from $\partial_{g_{1}}\rho_{2}(0)[\tilde{g}_{1}]\in X_{2}^{\bot}=K$ , since an element of $K$ whose moments against $\sigma_{1},\ldots,\sigma_{n}$ all vanish is zero, and we arrive at the stability estimates in (1.9). ∎

References

Aftalion et al. [1999] Amandine Aftalion, Jérôme Busca, and Wolfgang Reichel. Approximate radial symmetry for overdetermined boundary value problems. Adv. Differential Equations, 4(6):907–932, 1999.
Alexandrov [1962] Aleksandr D. Alexandrov. A characteristic property of spheres. Ann. Mat. Pura Appl., 58(1):303–315, 1962.
Baldi and Haus [2017] Pietro Baldi and Emanuele Haus. A Nash–Moser–Hörmander implicit function theorem with applications to control and Cauchy problems for PDEs. J. Funct. Anal., 273(12):3875–3900, 2017.
Bianchini et al. [2014] Chiara Bianchini, Antoine Henrot, and Paolo Salani. An overdetermined problem with non-constant boundary condition. Interfaces and Free Bound., 16(2):215–242, 2014.
Brandolini et al. [2008a] Barbara Brandolini, Carlo Nitsch, Paolo Salani, and Cristina Trombetti. Serrin-type overdetermined problems: an alternative proof. Arch. Ration. Mech. Anal., 190(2):267–280, 2008a.
Brandolini et al. [2008b] Barbara Brandolini, Carlo Nitsch, Paolo Salani, and Cristina Trombetti. On the stability of the Serrin problem. J. Diff. Equations, 245(6):1566–1583, 2008b.
Brock and Henrot [2002] Friedemann Brock and Antoine Henrot. A symmetry result for an overdetermined elliptic problem using continuous rearrangement and domain derivative. Rend. Circ. Mat. Palermo, 51(3):375–390, 2002.
Ciraolo et al. [2016] Giulio Ciraolo, Rolando Magnanini, and Vincenzo Vespri. Hölder stability for Serrin’s overdetermined problem. Ann. Mat. Pura Appl., 195(4):1333–1345, 2016.
Feldman [2018] William M. Feldman. Stability of Serrin’s problem and dynamic stability of a model for contact angle motion. SIAM J. Math. Anal., 50(3):3303–3326, 2018.
Folland [1995] Gerald B. Folland. Introduction to partial differential equations. Princeton University Press, 1995.
Gilbarg and Trudinger [2001] David Gilbarg and Neil S. Trudinger. Elliptic partial differential equations of second order. Classics in Mathematics. Springer-Verlag, Berlin, 2001. Reprint of the 1998 edition.
Henrot and Pierre [2018] Antoine Henrot and Michel Pierre. Shape variation and optimization. European Mathematical Society, 2018.
Lunardi [2012] Alessandra Lunardi. Analytic semigroups and optimal regularity in parabolic problems. Springer Science & Business Media, 2012.
Magnanini and Poggesi [2020] Rolando Magnanini and Giorgio Poggesi. Nearly optimal stability for Serrin’s problem and the Soap Bubble theorem. Calc. Var. Partial Differential Equations, 59(1):35, 2020.
Moser [1961] Jürgen Moser. A new technique for the construction of solutions of nonlinear differential equations. Proc. Nat. Acad. Sci. USA, 47(11):1824, 1961.
Nash [1956] John Nash. The imbedding problem for Riemannian manifolds. Ann. of Math., 63:20–63, 1956.
Pólya [1948] George Pólya. Torsional rigidity, principal frequency, electrostatic capacity and symmetrization. Q. Appl. Math., 6(3):267–277, 1948.
Serrin [1971] James Serrin. A symmetry problem in potential theory. Arch. Ration. Mech. Anal., 43(4):304–318, 1971.
Weinberger [1971] Hans F. Weinberger. Remark on the preceding paper of Serrin. Arch. Ration. Mech. Anal., 43:319–320, 1971.

