Sparse Non-Negative Stencils for Anisotropic Diffusion

This work was partly supported by ANR grant MESANGE ANR-08-BLAN-0198.

Jérôme Fehrenbach (Institut de Mathématiques de Toulouse, Université Paul Sabatier, 31062 TOULOUSE CEDEX 9, France) and Jean-Marie Mirebeau (CNRS, Laboratory CEREMADE, UMR 7534, University Paris Dauphine, Place du Maréchal De Lattre De Tassigny, 75775 PARIS CEDEX 16, France)

Abstract

We introduce a new discretization scheme for Anisotropic Diffusion, AD-LBR, on two and three dimensional cartesian grids. The main features of this scheme are that it is non-negative and has sparse stencils, of cardinality bounded by $6$ in 2D, by $12$ in 3D, despite allowing diffusion tensors of arbitrary anisotropy. The radius of these stencils is not a priori bounded however, and can be quite large for pronounced anisotropies. Our scheme also has good spectral properties, which permits larger time steps and avoids e.g. chessboard artifacts.

AD-LBR relies on Lattice Basis Reduction, a tool from discrete mathematics which has recently shown its relevance for the discretization on grids of strongly anisotropic Partial Differential Equations [14]. We prove that AD-LBR is in 2D asymptotically equivalent to a finite element discretization on an anisotropic Delaunay triangulation, a procedure more involved and computationally expensive. Our scheme thus benefits from the theoretical guarantees of this procedure, for a fraction of its cost. Numerical experiments in 2D and 3D illustrate our results.

Keywords:

Anisotropic Diffusion, Non-Negative Numerical Scheme, Lattice Basis Reduction.

We consider throughout this paper a bounded smooth domain $\Omega\subset{\mathbb{R}}^{d}$ , where $d\in\{2,3\}$ denotes the dimension, equipped with a continuous diffusion tensor $\operatorname{\mathbf{D}}$ . We do not impose any bound on the diffusion tensor anisotropy, and we are in fact interested in pronounced, non axis-aligned anisotropies. Anisotropic diffusion is here understood in the sense of [25]: the diffusion tensor $\operatorname{\mathbf{D}}(z)$ , at a point $z\in\Omega$ , is a symmetric positive definite matrix whose eigenvalues may have different orders of magnitude. Our results are not relevant for isotropic diffusion with a variable scalar coefficient, as in the pioneering work of Perona and Malik [20].

We address the discretization of the following energy ${\mathcal{E}}$ , defined for $u\in H^{1}(\Omega)$ :

{\mathcal{E}}(u):=\int_{\Omega}\|\nabla u(z)\|^{2}_{\operatorname{\mathbf{D}}(z)}dz.

(1)

We denote $\|e\|_{M}:=\sqrt{\langle e,Me\rangle}$ , for any $e\in{\mathbb{R}}^{d}$ , and any $M$ in the set $S_{d}^{+}$ of symmetric positive definite $d\times d$ matrices. Gradient descent for the energy (1) has the form of a parabolic PDE:

\partial_{t}u=\operatorname{div}(\operatorname{\mathbf{D}}\nabla u).

(2)

This equation, Anisotropic Diffusion, and its variants are at the foundation of powerful image processing techniques. Some variants include curvature terms [19], or diffusion-reaction terms [5]. Time varying and solution dependent diffusion tensors can also be considered. A general exposition can be found in [25], where various choices for the definition of the diffusion tensor ${\bf D}$ from the image $u$ , adapted to various applications, are proposed and discussed.

Our contribution to the discretization of the energy (1) results in improved numerical solutions of (2), in terms of accuracy and stability, for a minor increase in complexity. This extends to applications, such as Coherence Enhancing Diffusion and Edge Preserving Diffusion [25], see the numerical experiments in §4, which involve solving (2) using a solution dependent diffusion tensor ${\bf D}={\bf D}(u)$ . For that purpose, one fixes a time step $\Delta T$ , and solves for each integer $n\geq 0$ the linear diffusion equation $\partial_{t}u=\operatorname{div}(\operatorname{\mathbf{D}}_{n}\nabla u)$ on the interval $[n\Delta T,(n+1)\Delta T]$ , with $\operatorname{\mathbf{D}}_{n}:=\operatorname{\mathbf{D}}(u(n\Delta T))$ . In these applications, the diffusion tensor ${\bf D}(u)$ is typically defined in terms of the structure tensor [25] of $u$ , in such a way that diffusion is pronounced within homogeneous image regions, and tangentially along image edges, but not across edges.

In two dimensions, AD-LBR strictly speaking is not the first non-negative scheme for anisotropic diffusion: the proof of Theorem 6 in [25] implicitly defines an alternative 6-point non-negative scheme. This alternative scheme does however lack many of the qualities of AD-LBR: it leads to axis aligned artifacts, spectral aberrations, stencils of larger radius, reduced numerical accuracy, and does not extend to 3D. A detailed description and comparison is presented in §4.1.

Consider a scale parameter $h>0$ , and a sampling $\Omega_{h}$ of the domain $\Omega$ on the cartesian grid ${\mathbb{Z}}^{d}$ , rescaled by $h$ : with obvious notations

\Omega_{h}:=\Omega\cap h{\mathbb{Z}}^{d}.

We introduce a novel discretization of the energy (1), referred to as AD-LBR (Anisotropic Diffusion using Lattice Basis Reduction). It is a sum of squared differences of a discrete map $u\in L^{2}(\Omega_{h})$

{\mathcal{E}}_{h}(u):=h^{d-2}\sum_{z\in\Omega_{h}}\sum_{e\in V(z)}\gamma_{z}(e)\,|u(z+he)-u(z)|^{2}

(3)

The stencils $V(z)\subset{\mathbb{Z}}^{d}$ , $z\in\Omega_{h}$ , are symmetric and have cardinality at most $6$ in 2D, $12$ in 3D. The coefficients $\gamma_{z}(e)\geq 0$ are non-negative. They are constructed using a classical tool from discrete mathematics, Lattice Basis Reduction, which allows one to cheaply build efficient stencils for grid discretizations of Partial Differential Equations (PDEs) involving strongly anisotropic diffusion tensors or Riemannian metrics. This approach has been applied to anisotropic static Hamilton-Jacobi PDEs in [14], resulting in a new numerical scheme: Fast Marching using Lattice Basis Reduction (FM-LBR). Substantial improvements were obtained in comparison with earlier methods, in terms of both accuracy and complexity.
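As an illustration of the sum (3), the following Python sketch evaluates ${\mathcal{E}}_{h}$ for $d=2$ on an $n\times n$ periodic grid with $h=1/n$ , so that the prefactor $h^{d-2}$ equals $1$ . The names `discrete_energy`, `identity_stencil` and the callable argument `stencil` are hypothetical and not part of the scheme as published; the example uses the standard $4$ -point stencil for $\operatorname{\mathbf{D}}=\operatorname{Id}$ only because it can be checked by hand.

```python
import math

def discrete_energy(u, n, stencil):
    """Evaluate the sum (3) for d = 2 on an n x n periodic grid (h = 1/n).

    u is a list of lists of grid values; stencil(i, j) is assumed to return
    a dict {offset: weight} describing V(z) and gamma_z at grid point (i, j).
    """
    total = 0.0
    for i in range(n):
        for j in range(n):
            for (a, b), w in stencil(i, j).items():
                # Periodic boundary conditions: indices wrap around.
                diff = u[(i + a) % n][(j + b) % n] - u[i][j]
                total += w * diff * diff
    return total  # h^(d-2) = 1 when d = 2

def identity_stencil(i, j):
    # Illustrative choice for D = Id: the weights satisfy sum gamma e e^T = Id.
    return {(1, 0): 0.5, (-1, 0): 0.5, (0, 1): 0.5, (0, -1): 0.5}

n = 64
u = [[math.sin(2 * math.pi * i / n) for j in range(n)] for i in range(n)]
energy = discrete_energy(u, n, identity_stencil)
```

For this choice $u(x,y)=\sin(2\pi x)$ the continuous energy (1) equals $2\pi^{2}$ , and the discrete value $2n^{2}\sin^{2}(\pi/n)$ approaches it as $n$ grows.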

The paper is organized as follows. We describe the stencils of the two dimensional AD-LBR in §1, and state our main 2D result: the asymptotic equivalence of AD-LBR with a finite element discretization on an Anisotropic Delaunay Triangulation. Section §2 provides additional details on the two dimensional stencils of AD-LBR, and describes the three dimensional ones. The more technical §3 details the proof of the 2D equivalence result stated in §1. Two and three dimensional numerical experiments are presented in §4, including qualitative and quantitative comparisons with five other numerical schemes.

1 Description of the scheme, and main results

Our numerical scheme, Anisotropic Diffusion using Lattice Basis Reduction (AD-LBR), involves the construction of stencils whose geometry is tailored after the local diffusion tensor. Its essential feature is non-negativity: the discrete energy ${\mathcal{E}}_{h}(u)$ is written as a sum (3) of squared differences of values of $u$ , with non-negative weights $\gamma_{z}(e)\geq 0$ . This discretization is consistent if for each $z\in\Omega_{h}$ , and any smooth $u$ ,

h^{d}\|\nabla u(z)\|_{\operatorname{\mathbf{D}}(z)}^{2}=h^{d-2}\sum_{e\in V(z)}\gamma_{z}(e)\,\langle\nabla u(z),he\rangle^{2}.

(4)

Indeed, the left hand side approximates the contribution of the “voxel” $z+[-h/2,h/2]^{d}$ to the integral (1), while the right hand side is obtained by inserting the first order approximation $u(z+he)\simeq u(z)+\langle\nabla u(z),he\rangle$ in (3). The identity (4) is in turn equivalent to

\operatorname{\mathbf{D}}(z)=\sum_{e\in V(z)}\gamma_{z}(e)\,ee^{\mathrm{T}}.

(5)

The next lemma shows how to obtain such a decomposition in 2D. We denote by $u^{\perp}:=(-b,a)$ the rotation of a vector $u=(a,b)\in{\mathbb{R}}^{2}$ by $\pi/2$ , in such a way that for all $v\in{\mathbb{R}}^{2}$ :

\langle u^{\perp},v\rangle=\det(u,v).

Lemma 1.

Let $e_{0},e_{1},e_{2}\in{\mathbb{R}}^{2}$ be such that $e_{0}+e_{1}+e_{2}=0$ , and $|\det(e_{1},e_{2})|=1$ . Then for any $D\in S_{2}^{+}$ , with the convention $e_{3+i}:=e_{i}$ :

D=-\sum_{0\leq i\leq 2}\langle e_{i+1}^{\perp},De_{i+2}^{\perp}\rangle\,e_{i}e_{i}^{\mathrm{T}}.

(6)

Proof.

Note that $1=|\det(e_{2},e_{0})|=|\det(e_{0},e_{1})|$ . Denoting by $D^{\prime}$ the right hand side of (6), we obtain

\begin{aligned}\langle e_{1}^{\perp},D^{\prime}e_{1}^{\perp}\rangle&=-\langle e_{0}^{\perp},De_{1}^{\perp}\rangle\langle e_{2},e_{1}^{\perp}\rangle^{2}-\langle e_{1}^{\perp},De_{2}^{\perp}\rangle\langle e_{0},e_{1}^{\perp}\rangle^{2}\\ &=-\langle e_{0}^{\perp}+e_{2}^{\perp},De_{1}^{\perp}\rangle=\langle e_{1}^{\perp},De_{1}^{\perp}\rangle.\end{aligned}

Thus $\|e_{1}^{\perp}\|_{D^{\prime}}=\|e_{1}^{\perp}\|_{D}$ . Likewise $\|e_{2}^{\perp}\|_{D^{\prime}}=\|e_{2}^{\perp}\|_{D}$ , and $\|e_{1}^{\perp}+e_{2}^{\perp}\|_{D^{\prime}}=\|e_{0}^{\perp}\|_{D^{\prime}}=\|e_{0}^{\perp}\|_{D}=\|e_{1}^{\perp}+e_{2}^{\perp}\|_{D}$ . Since $(e_{1}^{\perp},e_{2}^{\perp})$ is a basis of ${\mathbb{R}}^{2}$ , the result follows. ∎
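The identity (6) can be checked numerically. The short Python sketch below rebuilds a matrix $D$ from the right hand side of (6); the helper names and the sample values of $D$ and $(e_{0},e_{1},e_{2})$ are illustrative assumptions. Note that the triple used here is not obtuse, so some coefficients are negative: Lemma 1 only requires $e_{0}+e_{1}+e_{2}=0$ and $|\det(e_{1},e_{2})|=1$ .

```python
def perp(u):
    """Rotate u = (a, b) by pi/2, giving (-b, a)."""
    return (-u[1], u[0])

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def matvec(D, u):
    return (D[0][0] * u[0] + D[0][1] * u[1],
            D[1][0] * u[0] + D[1][1] * u[1])

def lemma1_rhs(D, e):
    """Right hand side of (6) for a triple e = (e0, e1, e2)."""
    out = [[0.0, 0.0], [0.0, 0.0]]
    for i in range(3):
        a, b = e[(i + 1) % 3], e[(i + 2) % 3]   # convention e_{3+i} = e_i
        w = -dot(perp(a), matvec(D, perp(b)))
        for r in range(2):
            for c in range(2):
                out[r][c] += w * e[i][r] * e[i][c]
    return out

D = [[3.0, 1.0], [1.0, 2.0]]          # sample symmetric positive definite matrix
e = ((-3, -2), (2, 1), (1, 1))        # e0 + e1 + e2 = 0 and det(e1, e2) = 1
D_rebuilt = lemma1_rhs(D, e)
```

Here the three coefficients are $-4$ , $7$ and $11$ , and `D_rebuilt` agrees with `D` up to rounding.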

Figure 1: Right: the stencils associated to three matrices $M$ of anisotropy ratios $\kappa(M)$ equal to $1.1$ , $3.5$ , $8$ respectively. The ellipses $\{z\in{\mathbb{R}}^{2};\,\|z\|_{M}=1\}$ are shown left; their principal axis is aligned with $(\cos(\pi/3),\sin(\pi/3))$ . More stencils are shown in [14].

The diffusion tensor $\operatorname{\mathbf{D}}$ is meant to measure gradients, as in (1). In order to measure angles between vectors, we introduce a Riemannian metric $\operatorname{\mathbf{M}}$ on the domain $\Omega$ , which is proportional to the inverse of $\operatorname{\mathbf{D}}$ (footnote: the Laplace Beltrami operator associated to $\operatorname{\mathbf{M}}$ does not coincide with $\operatorname{div}(\operatorname{\mathbf{D}}\nabla\cdot)$ , unless $\operatorname{\mathbf{D}}$ is identically of determinant $1$ ; this is not an issue for our application): for all $z\in\Omega$

\operatorname{\mathbf{M}}(z):=\operatorname{\mathbf{d}}(z)\operatorname{\mathbf{D}}(z)^{-1},\,\text{ where}\,\operatorname{\mathbf{d}}(z):=\det(\operatorname{\mathbf{D}}(z))^{\frac{1}{d}}.

(7)

The normalizing factor $\operatorname{\mathbf{d}}(z)$ was chosen so as to normalize the metric determinant: $\det(\operatorname{\mathbf{M}}(z))=1$ . This normalization reflects the fact that the construction of our stencil $V(z)$ depends on the preferred direction of diffusion, and on the amount of anisotropy, whereas the absolute strength of diffusion is irrelevant. In dimension $d=2$ , one easily checks that for any $z\in\Omega$ and any $e,f\in{\mathbb{R}}^{2}$ , one has

\langle e^{\perp},\operatorname{\mathbf{D}}(z)f^{\perp}\rangle=\operatorname{\mathbf{d}}(z)\langle e,\operatorname{\mathbf{M}}(z)f\rangle.

(8)
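The normalization (7) and the identity (8) are easy to confirm on an example. The following Python sketch is only a numerical sanity check in dimension $d=2$ ; the function name `metric_from_diffusion` and the sample values of $D$ , $e$ , $f$ are assumptions made for illustration.

```python
import math

def metric_from_diffusion(D):
    """Return (d, M) with d = det(D)^(1/2) and M = d * D^{-1}, as in (7) for d = 2."""
    det = D[0][0] * D[1][1] - D[0][1] * D[1][0]
    d = math.sqrt(det)
    inv = [[D[1][1] / det, -D[0][1] / det],
           [-D[1][0] / det, D[0][0] / det]]
    M = [[d * inv[r][c] for c in range(2)] for r in range(2)]
    return d, M

def perp(u):
    return (-u[1], u[0])

def bilinear(A, u, v):
    """Return <u, A v> for a 2x2 matrix A."""
    return (u[0] * (A[0][0] * v[0] + A[0][1] * v[1])
            + u[1] * (A[1][0] * v[0] + A[1][1] * v[1]))

D = [[3.0, 1.0], [1.0, 2.0]]
d, M = metric_from_diffusion(D)
det_M = M[0][0] * M[1][1] - M[0][1] * M[1][0]   # should equal 1

e, f = (2, -1), (1, 3)
lhs = bilinear(D, perp(e), perp(f))             # left hand side of (8)
rhs = d * bilinear(M, e, f)                     # right hand side of (8)
```

With these sample values both sides of (8) equal $-10$ , up to rounding.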

The AD-LBR is based on decompositions (5), given by the previous lemma, with a family of vectors $(e_{i})_{i=0}^{2}$ chosen so that the scalar products appearing in (6) are non-positive. The adequate concept is that of $M$ -obtuse superbase of ${\mathbb{Z}}^{d}$ [4].

Definition 1.

• A basis of ${\mathbb{Z}}^{d}$ is a family $(e_{i})_{i=1}^{d}$ of elements of ${\mathbb{Z}}^{d}$ such that $|\det(e_{1},\cdots,e_{d})|=1$ .

• A superbase of ${\mathbb{Z}}^{d}$ is a family $(e_{i})_{i=0}^{d}$ such that $e_{0}+\cdots+e_{d}=0$ , and $(e_{i})_{i=1}^{d}$ is a basis of ${\mathbb{Z}}^{d}$ .

Definition 2.

Let $M\in S_{d}^{+}$ . A family $(e_{i})_{i\in I}$ of vectors in ${\mathbb{R}}^{d}$ is said to be $M$ -obtuse if $\langle e_{i},Me_{j}\rangle\leq 0$ for all distinct $i,j\in I$ .

In dimension $d\leq 3$ , there exists for each $M\in S_{d}^{+}$ at least one $M$ -obtuse superbase of ${\mathbb{Z}}^{d}$ [4]. The practical construction of such superbases is discussed in §2, and based on lattice basis reduction algorithms described in [11, 23, 17] (hence the name of our numerical scheme). This construction has a logarithmic numerical cost ${\mathcal{O}}(\ln\kappa(M))$ in the anisotropy ratio of the matrix $M$ :

\kappa(M):=\max_{|u|=|v|=1}\frac{\|u\|_{M}}{\|v\|_{M}}=\sqrt{\|M\|\|M^{-1}\|}.

(9)
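For a $2\times 2$ symmetric positive definite matrix, the anisotropy ratio (9) has a closed form in terms of the trace and determinant. The Python sketch below is a minimal illustration; the helper `metric`, which builds a matrix with prescribed eigenvalues $\kappa$ and $1/\kappa$ , is a hypothetical convenience chosen to mimic the setting of Figure 1.

```python
import math

def anisotropy_ratio(M):
    """kappa(M) = sqrt(||M|| ||M^{-1}||) = sqrt(lambda_max / lambda_min), for 2x2 SPD M."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    gap = math.sqrt(max(tr * tr - 4.0 * det, 0.0))
    lam_max, lam_min = (tr + gap) / 2.0, (tr - gap) / 2.0
    return math.sqrt(lam_max / lam_min)

def metric(kappa, theta):
    """SPD matrix of determinant 1 with eigenvalue 1/kappa along (cos theta, sin theta)
    and eigenvalue kappa along the orthogonal direction."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c * c / kappa + kappa * s * s, (1.0 / kappa - kappa) * c * s],
            [(1.0 / kappa - kappa) * c * s, s * s / kappa + kappa * c * c]]

M = metric(8.0, math.pi / 3)
ratio = anisotropy_ratio(M)   # equals 8 up to rounding
```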

The AD-LBR energy ${\mathcal{E}}_{h}:L^{2}(\Omega_{h})\to{\mathbb{R}}_{+}$ , see (3), is in two dimensions written in terms of the following stencils and coefficients. Let $z\in\Omega$ , and let $e_{0},e_{1},e_{2}$ be an $\operatorname{\mathbf{M}}(z)$ -obtuse superbase of ${\mathbb{Z}}^{2}$ . We set

V(z):=\{e_{0},e_{1},e_{2},\,-e_{0},-e_{1},-e_{2}\},

(10)

and for $0\leq i\leq 2$ , with the convention $e_{i+3}:=e_{i}$ ,

\gamma_{z}(\pm e_{i}):=-\frac{1}{2}\langle e^{\perp}_{i+1},\operatorname{\mathbf{D}}(z)e^{\perp}_{i+2}\rangle.

(11)

Lemma 1 implies the announced decomposition (5), and the weights $\gamma_{z}$ are non-negative in view of (8). These weights $\gamma_{z}:{\mathbb{Z}}^{2}\to{\mathbb{R}}_{+}$ , extended by $0$ outside $V(z)$ , do not depend on the choice of $\operatorname{\mathbf{M}}(z)$ -obtuse superbase $(e_{0},e_{1},e_{2})$ , see Lemma 11. Stencils of the three dimensional AD-LBR are described in §2, and involve a construction of Selling [22]. (Footnote: the authors would like to thank Professor P. Q. Nguyen for pointing out this $12$ points 3D stencil, which is simpler and sparser than the $14$ points stencil proposed by the authors in an earlier version of the manuscript.) The above description of the stencils $V(z)$ is suitable for periodic, reflected, and Dirichlet boundary conditions (extending $u$ by zero outside $\Omega_{h}$ in the latter case). In the case of Neumann boundary conditions, a slight modification is in order:

V(z;\,h):=\{e\in V(z);\,z+he\in\Omega_{h}\}.

We have so far established three strong points of the AD-LBR:

Non-negativity. Off-diagonal coefficients of the symmetric semi-definite $N\times N$ matrix, $N=\#(\Omega_{h})$ , associated to the energy ${\mathcal{E}}_{h}$ are non-positive, while diagonal coefficients are positive.
Sparsity. Stencil cardinality is uniformly bounded, without restriction on the anisotropy ratio $\kappa(\operatorname{\mathbf{D}}(z))$ of the diffusion tensor.
Complexity. The construction of the stencil $V(z)$ , and of the associated coefficients $\gamma_{z}$ , has a logarithmic cost ${\mathcal{O}}(\ln\kappa(\operatorname{\mathbf{D}}(z)))$ in the anisotropy ratio of the diffusion tensor.

The next result, Theorem 1, restricted to the two dimensional case, establishes that AD-LBR is asymptotically equivalent to a more involved and computationally intensive procedure: a finite element discretization of the energy (1), on an Anisotropic Delaunay Triangulation (ADT, see [10] and below) of the domain $\Omega$ . Under the assumptions of Theorem 1, AD-LBR benefits from two additional guarantees, that we state informally and without proof.

No chessboard artifacts. Some numerical schemes for anisotropic diffusion suffer from chessboard artifacts, in the sense that periodic artifacts develop at the pixel level. Such artifacts cannot develop in finite element discretizations, since they would lead to high frequency oscillations of the finite element interpolant, and therefore to an increase of the energy (12). The asymptotic equivalence of the AD-LBR with a finite element discretization also rules out these defects.
Spectral correctness. The $n$ -th smallest eigenvalue $\lambda_{n}(h)$ of the symmetric matrix associated to $h^{-d}{\mathcal{E}}_{h}$ (3), converges as $h\to 0$ towards the $n$ -th smallest eigenvalue $\lambda_{n}$ of the continuous operator $-\operatorname{div}(\operatorname{\mathbf{D}}\nabla)$ , for any given integer $n\geq 0$ . See Figure 8 for an illustration. This follows from a similar property of the finite element energy ${\mathcal{E}}^{\prime}_{h}$ (12), and from the asymptotic equivalence (13).

Our convergence result, Theorem 1 below, is specialized to the case of a square periodic domain, which covers reflecting boundary conditions frequently used in image processing. Since the grid discretization must be compatible with the boundary conditions, any scale parameter $h$ appearing in the rest of the paper is assumed to be the inverse of a positive integer:

h\in\{1/n;\,n\geq 1\}.

Theorem 1.

Let $\Omega$ be the unit square $[0,1[^{2}$ , equipped with periodic boundary conditions. Let $\operatorname{\mathbf{D}}:\overline{\Omega}\to S_{2}^{+}$ be a (periodic) diffusion tensor with Lipschitz regularity, and let $\operatorname{\mathbf{M}}$ be the Riemannian metric defined by (7). When $h$ is sufficiently small, the periodic Riemannian domain $(\Omega,\operatorname{\mathbf{M}})$ admits an Anisotropic Delaunay Triangulation ${\mathcal{T}}_{h}$ , with collection of vertices $\Omega_{h}:=\Omega\cap h{\mathbb{Z}}^{2}$ . For $u\in L^{2}(\Omega_{h})$ , define

{\mathcal{E}}^{\prime}_{h}(u):=\int_{\Omega}\|\nabla(\operatorname{I}_{{\mathcal{T}}_{h}}u)(z)\|^{2}_{\operatorname{\mathbf{D}}(z)}dz,

(12)

where $\operatorname{I}_{\mathcal{T}}$ denotes the piecewise linear interpolation operator on a triangulation ${\mathcal{T}}$ . Then for some constant $c=c(\operatorname{\mathbf{D}})$ , independent of $u$ and $h$ ,

(1-ch){\mathcal{E}}_{h}(u)\leq{\mathcal{E}}^{\prime}_{h}(u)\leq(1+ch){\mathcal{E}}_{h}(u).

(13)

Let us mention that the finite element discretization on an ADT is a more general procedure than AD-LBR, since it does not require the domain $\Omega$ to be sampled on a grid. This flexibility can be used to locally increase the density of vertices, in places where the solution $u$ is expected to be less regular, or to insert vertices exactly on $\partial\Omega$ for a better discretization of boundary conditions. (Such refinements are however generally incompatible with image processing since the unknowns, the pixel values, lie by construction on a fixed and given cartesian grid.) Here, as is often the case, the performance of AD-LBR comes at the cost of its specialization.

The proof of Theorem 1 is postponed to §3, but for the sake of concreteness, we describe here the concept of Anisotropic Delaunay Triangulation (ADT) [10]. In the rest of this introduction, and in §3, we assume as in Theorem 1 that the diffusion tensor $\operatorname{\mathbf{D}}$ is defined on the square $[0,1]^{2}$ and satisfies periodic boundary conditions. We extend it, as well as the metric $\operatorname{\mathbf{M}}$ , to the whole plane ${\mathbb{R}}^{2}$ by periodicity.

We specialize the concept of ADT [10] to the domain ${\mathbb{R}}^{2}$ and the collection of vertices $h{\mathbb{Z}}^{2}$ . For that purpose, we introduce some notation. For all $p,q\in{\mathbb{R}}^{2}$ , we denote by $\delta_{p}(q)$ the distance from $p$ to $q$ , as measured by the metric at the point $p$ :

\delta_{p}(q):=\|q-p\|_{\operatorname{\mathbf{M}}(p)}.

(14)

We denote by $\Delta_{h}(q)$ the least distance from a point $q\in{\mathbb{R}}^{2}$ , to the grid $h{\mathbb{Z}}^{2}$ :

\Delta_{h}(q):=\min_{p\in h{\mathbb{Z}}^{2}}\delta_{p}(q).

(15)

We introduce the Voronoi cell $\operatorname{Vor}_{h}(p)$ of a grid point $p\in h{\mathbb{Z}}^{2}$ , which is the collection of points $q\in{\mathbb{R}}^{2}$ closer to $p$ than to any other grid point:

\operatorname{Vor}_{h}(p):=\{q\in{\mathbb{R}}^{2};\,\delta_{p}(q)=\Delta_{h}(q)\}.

(16)

The collection of Voronoi cells is referred to as the Voronoi diagram, see Figure 2. A Voronoi vertex is a point $q\in{\mathbb{R}}^{2}$ at which at least three distinct Voronoi regions intersect: $(\operatorname{Vor}_{h}(p_{i}))_{i=1}^{k}$ , $k\geq 3$ , $p_{i}\in h{\mathbb{Z}}^{2}$ . We attach to $q$ a dual Voronoi cell $T_{q}$ , defined as the convex hull of the points $(p_{i})_{i=1}^{k}$ , see Figure 3.

The geometric dual ${\mathcal{Q}}_{h}$ , of the Voronoi diagram, is defined as the collection of all dual Voronoi cells $T_{q}$ . Note that, generically on the metric $\operatorname{\mathbf{M}}$ , no more than three Voronoi regions can intersect at any point in ${\mathbb{R}}^{2}$ , thus the elements of ${\mathcal{Q}}_{h}$ are generically triangles. If $h$ is small enough, we show in §3 (using the Dual Triangulation Theorem in [10]) that $T_{q}$ is a strictly convex polygon, of vertices $(p_{i})_{i=1}^{k}$ with the above notations, and that ${\mathcal{Q}}_{h}$ is a polygonization (generically a triangulation) of ${\mathbb{R}}^{2}$ , with vertices $h{\mathbb{Z}}^{2}$ .

Since the metric $\operatorname{\mathbf{M}}$ and the vertices $h{\mathbb{Z}}^{2}$ are periodic (recall that $h=1/n$ for some integer $n\geq 1$ ), arbitrarily triangulating the elements of ${\mathcal{Q}}_{h}$ , respecting periodicity, yields a periodic triangulation ${\mathcal{T}}_{h}$ .

Definition 3 (ADT, Labelle and Shewchuk [10]).

The triangulation ${\mathcal{T}}_{h}$ obtained by the above construction is referred to as an ADT of the domain ${\mathbb{R}}^{2}$ , with collection of vertices $h{\mathbb{Z}}^{2}$ , and underlying Riemannian metric $\operatorname{\mathbf{M}}$ . Since ${\mathcal{T}}_{h}$ is ${\mathbb{Z}}^{2}$ -periodic, we also regard it as an ADT of the periodic unit square $\Omega$ .

We establish in §3.1 the existence of the ADT ${\mathcal{T}}_{h}$ . Incidentally, we show in Lemma 7 (iii) that the angles of the elements of ${\mathcal{T}}_{h}$ , measured with respect to the local metric $\operatorname{\mathbf{M}}$ , are asymptotically acute. This geometrical property (which holds thanks to our special choice of triangulation vertices, on a grid) is linked to the non-negativity of AD-LBR: indeed, it is known that finite element discretizations such as (12) yield non-negative numerical schemes, and the discrete maximum principle, if the mesh satisfies a non-obtuse angle condition, see Lemma 3.1 in [9].

Subsection §3.2 is devoted to the study of $M$ -obtuse superbases of ${\mathbb{Z}}^{2}$ , and their cousins $M$ -reduced bases of ${\mathbb{Z}}^{2}$ , on which the AD-LBR relies: we discuss their characterization, uniqueness and stability properties. We study in §3.3 the finite element stencils, defined for $p\in h{\mathbb{Z}}^{2}$ by

V_{h}(p):=\{e\in{\mathbb{Z}}^{2};[p,p+he]\text{ is an edge of }{\mathcal{T}}_{h}\},

(17)

see Figure 3 (right). We show that $V_{h}(p)$ coincides with the AD-LBR stencil $V(p)$ , unless the lattice ${\mathbb{Z}}^{2}$ admits a basis almost orthogonal with respect to the scalar product associated to $\operatorname{\mathbf{M}}(p)$ , see Lemma 13. This is tied to the fact that orthogonal grids admit several (usual) Delaunay triangulations. Overcoming this technical difficulty, we conclude the proof of Theorem 1.

Note that the construction of the ADT ${\mathcal{T}}_{h}$ is not easy to parallelize, in particular when anisotropy is pronounced, since the Voronoi regions of far away points interact. The construction of ${\mathcal{T}}_{h}$ also involves solving polynomial equations of degree four, because the boundaries of Voronoi regions are conics, and Voronoi vertices must be identified at their intersections. In contrast, the AD-LBR stencils are independent of each other, and the numerical cost of their construction only grows logarithmically with the metric anisotropy.

2 Construction of obtuse superbases, and three dimensional stencils

Algorithms for the construction of privileged bases of lattices, consisting of short and almost orthogonal vectors, have attracted an important research effort from the mathematical community, over a long period of time. The first such algorithm dates back to Lagrange [11], and is restricted to two dimensional lattices. Methods for high dimensional lattices, such as the LLL algorithm [12], are of key importance for integer programming and cryptography [18]. AD-LBR is based on the original algorithm of Lagrange [11], and on its recent extension to three dimensional lattices [23, 17]. These methods output a basis of ${\mathbb{Z}}^{d}$ reduced in the sense of Minkowski, which in dimension $d\leq 4$ is equivalent to the following definition.

We denote by $e_{1}{\mathbb{Z}}+\cdots+e_{k}{\mathbb{Z}}$ the sub-lattice of ${\mathbb{Z}}^{d}$ generated by vectors $e_{1},\cdots,e_{k}\in{\mathbb{Z}}^{d}$ . This sub-lattice equals $\{0\}$ by convention if $k=0$ .

Definition 4.

An $M$ -reduced basis of ${\mathbb{Z}}^{d}$ , where $d\leq 4$ and $M\in S_{d}^{+}$ , is a basis $(e_{1},\cdots,e_{d})$ of ${\mathbb{Z}}^{d}$ such that

\|e_{i}\|_{M}=\min\{\|e\|_{M};\,e\in{\mathbb{Z}}^{d}\setminus(e_{1}{\mathbb{Z}}+\cdots+e_{i-1}{\mathbb{Z}})\}.

(18)

For each $d\leq 4$ , and each $M\in S_{d}^{+}$ , there exists at least one $M$ -reduced basis [17]. In contrast, there exists $M\in S_{5}^{+}$ for which no basis of ${\mathbb{Z}}^{5}$ satisfies (18). The norms of the elements $(e_{i})_{i=1}^{d}$ of an $M$ -reduced basis,

\lambda_{i}(M):=\|e_{i}\|_{M},

(19)

are called the Minkowski minima, and are independent of the choice of $M$ -reduced basis. In particular, $e_{1}$ is the shortest vector of ${\mathbb{Z}}^{d}$ , with respect to the norm $\|\cdot\|_{M}$ , and $e_{2}$ is the shortest linearly independent vector.

Lemma 2.

For any $M\in S_{d}^{+}$ , $1\leq i\leq d$ ,

\|M^{-1}\|^{-\frac{1}{2}}\leq\lambda_{i}(M)\leq\|M\|^{\frac{1}{2}}.

Proof.

Note that $\|M^{-1}\|^{-\frac{1}{2}}\|e\|\leq\|e\|_{M}\leq\|M\|^{\frac{1}{2}}\|e\|$ , for any $e\in{\mathbb{R}}^{d}$ . In addition: (i) any $e\in{\mathbb{Z}}^{d}\setminus\{0\}$ satisfies $\|e\|\geq 1$ , and (ii) the set ${\mathbb{Z}}^{d}\setminus(e_{1}{\mathbb{Z}}+\cdots+e_{i-1}{\mathbb{Z}})$ appearing in (18) always contains at least one element $e$ of the canonical basis of ${\mathbb{R}}^{d}$ , so that $\|e\|\leq 1$ . The announced result easily follows. ∎

We emphasize that obtaining an $M$ -reduced basis, i.e. solving the minimization problems (18), is both simple and cheap numerically. In dimension $d=2$ , this is the object of Lagrange’s algorithm [11] (later rediscovered by Gauss and often erroneously called Gauss’s algorithm, see [17]): initialize $(e,f)$ as the canonical basis of ${\mathbb{Z}}^{2}$ , and

\text{\bf Do }(e,\ f):=(f,\ e-\mathrm{Round}(\langle e,Mf\rangle/\|f\|_{M}^{2})\,f),\quad\text{\bf while }\|e\|_{M}>\|f\|_{M}.

(20)

This algorithm can be regarded as a two dimensional geometrical generalization of greatest common divisor computation. It can be extended to higher dimension and, in dimension up to four, outputs an $M$ -reduced basis after at most ${\mathcal{O}}(\ln\kappa(M))$ iterations [17], each consisting of ${\mathcal{O}}(1)$ operations among reals.
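The iteration (20) translates almost literally into code. The Python sketch below is one possible transcription, not a reference implementation: the function names are hypothetical, norms are compared through their squares, and Python's built-in `round` resolves exact ties to the nearest even integer, which is an arbitrary but harmless choice here.

```python
import math

def dot_M(M, u, v):
    """Return <u, M v> for a 2x2 symmetric matrix M."""
    return (u[0] * (M[0][0] * v[0] + M[0][1] * v[1])
            + u[1] * (M[1][0] * v[0] + M[1][1] * v[1]))

def lagrange_reduced_basis(M):
    """Run the do-while loop (20), starting from the canonical basis of Z^2."""
    e, f = (1, 0), (0, 1)
    while True:
        k = round(dot_M(M, e, f) / dot_M(M, f, f))
        # Simultaneous update: the right hand side uses the old e and f.
        e, f = f, (e[0] - k * f[0], e[1] - k * f[1])
        if dot_M(M, e, e) <= dot_M(M, f, f):   # exit when ||e||_M <= ||f||_M
            return e, f

# Sample metric of determinant 1 and anisotropy ratio 8 (illustrative values).
kappa, theta = 8.0, math.pi / 3
c, s = math.cos(theta), math.sin(theta)
M = [[c * c / kappa + kappa * s * s, (1 / kappa - kappa) * c * s],
     [(1 / kappa - kappa) * c * s, s * s / kappa + kappa * c * c]]
e, f = lagrange_reduced_basis(M)
```

For this sample metric the loop stops after two iterations with $e=(1,2)$ and $f=(-1,-1)$ ; the vector $e$ is close to the direction $(\cos(\pi/3),\sin(\pi/3))$ along which the metric is smallest.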

The elements of an $M$ -reduced basis are heuristically never very far from being orthogonal, as illustrated by the following lemma.

Lemma 3.

Let $M\in S_{d}^{+}$ , $d\leq 4$ , and let $(e_{1},\cdots,e_{d})$ be an $M$ -reduced basis. Then for any distinct $i,j\in\{1,\cdots,d\}$ ,

2|\langle e_{i},Me_{j}\rangle|\leq\|e_{i}\|_{M}^{2}.

(21)

Proof.

Since $\|e_{k}\|_{M}$ is a non-decreasing function of $k\in\{1,\cdots,d\}$ , we may assume that $i<j$ . It follows from (18) that $\|e_{j}\|_{M}\leq\|e_{j}+e_{i}\|_{M}$ , and $\|e_{j}\|_{M}\leq\|e_{j}-e_{i}\|_{M}$ . Squaring these inequalities, and developing the scalar products, we obtain the announced result. ∎

Corollary 1.

Let $M\in S_{2}^{+}$ , and let $(e,f)$ be an $M$ -reduced basis such that $\langle e,Mf\rangle\leq 0$ . Then $(e,f,g)$ is an $M$ -obtuse superbase of ${\mathbb{Z}}^{2}$ , with $g:=-e-f$ . In addition

\langle e,Mg\rangle\leq-\|e\|_{M}^{2}/2,\quad\langle f,Mg\rangle\leq-\|f\|_{M}^{2}/2.

(22)

Proof.

The previous Lemma implies $\langle e,M(e+f)\rangle\geq\|e\|_{M}^{2}-|\langle e,Mf\rangle|\geq\frac{1}{2}\|e\|_{M}^{2}.$ Likewise $\langle f,M(e+f)\rangle\geq\frac{1}{2}\|f\|_{M}^{2}$ . The result follows. ∎

The practical construction of the two dimensional AD-LBR stencil at a point $z\in\Omega$ amounts to (i) compute an $\operatorname{\mathbf{M}}(z)$ -reduced basis $(e,f)$ using Lagrange’s algorithm (20), (ii) replace $f$ with $-f$ , if necessary, so that $\langle e,Mf\rangle\leq 0$ , and (iii) define the stencil $V(z)$ and the weights $\gamma_{z}$ in terms of the $M$ -obtuse superbase $(e,f,g)$ of ${\mathbb{Z}}^{2}$ , where $g=-e-f$ , as described in (10) and (11).
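Steps (i) to (iii) can be assembled into a short routine. The Python sketch below is a minimal illustration under stated assumptions: the function names are hypothetical, the tensor is passed as a nested list, and no attention is paid to degenerate floating point ties. It returns the six offsets of (10) with the weights (11).

```python
import math

def bil(A, u, v):
    """Return <u, A v> for a 2x2 matrix A."""
    return (u[0] * (A[0][0] * v[0] + A[0][1] * v[1])
            + u[1] * (A[1][0] * v[0] + A[1][1] * v[1]))

def perp(u):
    return (-u[1], u[0])

def adlbr_stencil_2d(D):
    """Return {offset: weight} for the 2D stencil (10) with weights (11)."""
    det = D[0][0] * D[1][1] - D[0][1] * D[1][0]
    d = math.sqrt(det)
    M = [[d * D[1][1] / det, -d * D[0][1] / det],    # metric (7): M = d D^{-1}
         [-d * D[1][0] / det, d * D[0][0] / det]]
    e, f = (1, 0), (0, 1)                            # (i) Lagrange's algorithm (20)
    while True:
        k = round(bil(M, e, f) / bil(M, f, f))
        e, f = f, (e[0] - k * f[0], e[1] - k * f[1])
        if bil(M, e, e) <= bil(M, f, f):
            break
    if bil(M, e, f) > 0:                             # (ii) enforce <e, M f> <= 0
        f = (-f[0], -f[1])
    sb = (e, f, (-e[0] - f[0], -e[1] - f[1]))        # (iii) obtuse superbase (e, f, g)
    weights = {}
    for i in range(3):
        a, b = sb[(i + 1) % 3], sb[(i + 2) % 3]
        w = -0.5 * bil(D, perp(a), perp(b))          # weight (11)
        weights[sb[i]] = w
        weights[(-sb[i][0], -sb[i][1])] = w
    return weights

# Sample tensor: eigenvalue 8 along (cos(pi/3), sin(pi/3)), eigenvalue 1/8 across.
c, s = math.cos(math.pi / 3), math.sin(math.pi / 3)
D = [[8 * c * c + s * s / 8, (8 - 1 / 8) * c * s],
     [(8 - 1 / 8) * c * s, 8 * s * s + c * c / 8]]
weights = adlbr_stencil_2d(D)
# Consistency check (5): sum over the stencil of gamma(e) e e^T should give D back.
D_rebuilt = [[sum(w * v[r] * v[q] for v, w in weights.items()) for q in range(2)]
             for r in range(2)]
```

For this sample tensor the superbase found is $(1,2),(1,1),(-2,-3)$ , so the stencil reaches two to three pixels away, in line with the observation that the stencil radius grows with the anisotropy.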

The rest of this section is devoted to the description of the three dimensional AD-LBR stencils. In contrast with the two dimensional case, the construction of the 3D stencil $V(z)$ at a point $z\in\Omega$ involves a $\operatorname{\mathbf{D}}(z)$ -obtuse superbase, instead of an $\operatorname{\mathbf{M}}(z)$ -obtuse superbase.

Proposition 1.

Let $D\in S_{3}^{+}$ , and let $(e_{1},e_{2},e_{3})$ be a $D$ -reduced basis. Let $b_{i}:=\varepsilon_{i}e_{\sigma(i)}$ , for all $1\leq i\leq 3$ , where the signs $\varepsilon_{1},\varepsilon_{2},\varepsilon_{3}\in\{-1,1\}$ , and the permutation $\sigma$ of $\{1,2,3\}$ are chosen so that

|\langle b_{1},Db_{2}\rangle|\leq\min\{-\langle b_{1},Db_{3}\rangle,\,-\langle b_{2},Db_{3}\rangle\}.

(23)

Then the following is a $D$ -obtuse superbase:

\left\{\begin{array}[]{cl}(b_{1},b_{2},b_{3},-b_{1}-b_{2}-b_{3})&\text{if }\langle b_{1},Db_{2}\rangle\leq 0,\\ (-b_{1},b_{2},b_{1}+b_{3},-b_{2}-b_{3})&\text{otherwise.}\end{array}\right.

(24)

Proof.

To achieve (23), one can choose $\sigma$ such that $b^{\prime}_{i}:=e_{\sigma(i)}$ satisfies $|\langle b^{\prime}_{1},Db^{\prime}_{2}\rangle|\leq|\langle b^{\prime}_{1},Db^{\prime}_{3}\rangle|\leq|\langle b^{\prime}_{2},Db^{\prime}_{3}\rangle|$ . Then choose the signs $(\varepsilon_{i})_{i=1}^{3}$ such that $b_{i}:=\varepsilon_{i}b^{\prime}_{i}$ satisfies $\langle b_{1},Db_{3}\rangle\leq 0$ and $\langle b_{2},Db_{3}\rangle\leq 0$ .

The two families of vectors appearing in (24) are clearly superbases. We thus only need to show that they are $D$ -obtuse; in other words that $\langle e,Df\rangle\leq 0$ for any two distinct elements $e,f$ of these families. Note that for all distinct $i,j\in\{1,2,3\}$ , using (21),

2|\langle b_{i},Db_{j}\rangle|\leq\|b_{i}\|_{D}^{2}.

In the case where $\langle b_{1},Db_{2}\rangle\leq 0$ , the pairwise scalar products between $b_{1},b_{2},b_{3}$ are non-positive by construction. In addition

2\langle b_{1}+b_{2}+b_{3},Db_{1}\rangle\geq(\|b_{1}\|_{D}^{2}-2|\langle b_{1},Db_{2}\rangle|)+(\|b_{1}\|^{2}_{D}-2|\langle b_{1},Db_{3}\rangle|)\geq 0.

Likewise $\langle b_{1}+b_{2}+b_{3},Db_{i}\rangle\geq 0$ for all $i\in\{1,2,3\}$ , which concludes the proof in this first case.

We next turn to the second case, where $\langle b_{1},Db_{2}\rangle\geq 0$ . Enumerating all scalar products we obtain

\begin{aligned}\langle b_{1},D(b_{1}+b_{3})\rangle&\geq\|b_{1}\|_{D}^{2}-|\langle b_{1},Db_{3}\rangle|\geq 0,\\ \langle b_{1},D(-b_{2}-b_{3})\rangle&=-\langle b_{1},Db_{2}\rangle-\langle b_{1},Db_{3}\rangle\geq 0,\\ -\langle b_{2},D(b_{1}+b_{3})\rangle&=-\langle b_{2},Db_{1}\rangle-\langle b_{2},Db_{3}\rangle\geq 0,\\ \langle b_{2},D(b_{2}+b_{3})\rangle&\geq\|b_{2}\|^{2}_{D}-|\langle b_{2},Db_{3}\rangle|\geq 0,\end{aligned}

and finally

2\langle b_{1}+b_{3},D(b_{2}+b_{3})\rangle\geq 2\langle b_{1},Db_{2}\rangle+(\|b_{3}\|_{D}^{2}-2|\langle b_{1},Db_{3}\rangle|)+(\|b_{3}\|_{D}^{2}-2|\langle b_{2},Db_{3}\rangle|)\geq 0.

This concludes the proof. ∎

In view of the previous Proposition, obtaining a $D$ -obtuse superbase of ${\mathbb{Z}}^{3}$ has numerical cost ${\mathcal{O}}(\ln\kappa(D))$ . Indeed a $D$ -reduced basis needs to be computed in a preliminary step, after which Proposition 1 is applied for a negligible ${\mathcal{O}}(1)$ cost. An alternative method for the construction of $D$ -obtuse superbases of ${\mathbb{Z}}^{3}$ is presented in [4] and in appendix B of [3], but its numerical complexity is not known to the authors.
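The ${\mathcal{O}}(1)$ post-processing of Proposition 1 can be sketched as follows. This Python fragment assumes that a $D$ -reduced basis is already available and does not compute one; it simply enumerates the $48$ signed permutations until (23) holds, which is a brute force substitute for the sorting argument of the proof. All function names are hypothetical. The two sample matrices are chosen so that the canonical basis can be checked by hand to be $D$ -reduced: their smallest eigenvalue is at least $0.55$ by Gershgorin's theorem, hence every integer vector other than plus or minus a canonical vector has squared $D$ -norm at least $1.1$ .

```python
from itertools import permutations, product

def ip(D, u, v):
    """Return <u, D v> for a 3x3 matrix D."""
    return sum(u[r] * D[r][c] * v[c] for r in range(3) for c in range(3))

def neg(u):
    return tuple(-x for x in u)

def add(u, v):
    return tuple(x + y for x, y in zip(u, v))

def obtuse_superbase_3d(D, reduced_basis):
    """Apply (23)-(24) to a basis that is ASSUMED to be D-reduced."""
    for sigma in permutations(range(3)):
        for eps in product((1, -1), repeat=3):
            b1, b2, b3 = [tuple(eps[i] * x for x in reduced_basis[sigma[i]])
                          for i in range(3)]
            p12, p13, p23 = ip(D, b1, b2), ip(D, b1, b3), ip(D, b2, b3)
            if abs(p12) <= min(-p13, -p23):           # condition (23)
                if p12 <= 0:                          # first case of (24)
                    return (b1, b2, b3, neg(add(add(b1, b2), b3)))
                return (neg(b1), b2, add(b1, b3), neg(add(b2, b3)))
    raise ValueError("no choice satisfies (23); is the input basis D-reduced?")

def is_obtuse(D, family, tol=1e-12):
    """Check <e_i, D e_j> <= 0 for all distinct members of the family."""
    return all(ip(D, family[i], family[j]) <= tol
               for i in range(len(family)) for j in range(i))

canonical = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
D_first = [[1.0, 0.2, -0.1], [0.2, 1.0, 0.25], [-0.1, 0.25, 1.0]]   # first case of (24)
D_second = [[1.0, 0.2, 0.1], [0.2, 1.0, 0.25], [0.1, 0.25, 1.0]]    # second case of (24)
sb_first = obtuse_superbase_3d(D_first, canonical)
sb_second = obtuse_superbase_3d(D_second, canonical)
```

The second matrix has three positive off-diagonal entries, so no choice of signs makes the three pairwise products non-positive at once, and the second line of (24) is used.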

The three dimensional AD-LBR is defined by the following stencils and coefficients. Let $z\in\Omega$ , let $D:=\operatorname{\mathbf{D}}(z)$ , and let $(e_{i})_{i=0}^{3}$ be a $D$ -obtuse superbase of ${\mathbb{Z}}^{3}$ . We set

V(z):=\{e_{k}\times e_{l};\ k,l\in\{0,1,2,3\},\ k\neq l\},

and if $\{i,j,k,l\}=\{0,1,2,3\}$ , $i\neq j$ and $k\neq l$ , then

\gamma_{z}(e_{k}\times e_{l}):=-\frac{1}{2}\langle e_{i},De_{j}\rangle.

As announced, $\#(V(z))=12$ , and the weights $\gamma_{z}$ are non-negative. The proof of the scheme consistency (5), due to Selling [22], is reproduced in the next lemma for completeness. A generalization, appearing in Appendix B of [3], allows in arbitrary dimension to build a non-negative decomposition of the form (25) from a $D$ -obtuse superbase of ${\mathbb{Z}}^{d}$ . However the non existence of such a superbase, for some matrices $D\in S_{4}^{+}$ , forbids a straightforward extension of AD-LBR to higher dimension.
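The construction of the stencil and of its weights can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `cross`, `bilinear` and `stencil_3d` are ours, the superbase is supplied by the caller and is assumed to be $D$ -obtuse, and the diagonal tensor below is chosen only because the canonical superbase is then $D$ -obtuse.

```python
from itertools import permutations

def cross(a, b):
    """Cross product of two vectors of Z^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def bilinear(u, D, v):
    """Return <u, D v> for a 3x3 matrix D given as nested tuples."""
    return sum(u[r] * D[r][c] * v[c] for r in range(3) for c in range(3))

def stencil_3d(superbase, D):
    """Map each offset e_k x e_l (k != l) to the weight -<e_i, D e_j>/2,
    where {i, j} is the complement of {k, l} in {0, 1, 2, 3}."""
    weights = {}
    for k, l in permutations(range(4), 2):
        i, j = (m for m in range(4) if m not in (k, l))
        offset = cross(superbase[k], superbase[l])
        weights[offset] = -0.5 * bilinear(superbase[i], D, superbase[j])
    return weights

# Illustrative data (assumption): a diagonal tensor and the canonical superbase.
D = ((1, 0, 0), (0, 2, 0), (0, 0, 3))
superbase = ((-1, -1, -1), (1, 0, 0), (0, 1, 0), (0, 0, 1))
weights = stencil_3d(superbase, D)
```

For this diagonal tensor only the six offsets along the coordinate axes carry a positive weight, so that the classical seven point stencil is recovered; the six remaining offsets have a zero weight.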

Lemma 4 (Selling [22]).

Let $(e_{i})_{i=0}^{3}$ be a superbase of ${\mathbb{Z}}^{3}$ . For all $i,j,k,l$ such that $\{i,j,k,l\}=\{0,1,2,3\}$ , $i<j$ , and $k<l$ , let $c_{ij}:=e_{k}\times e_{l}$ . Then, for any $D\in S_{3}^{+}$ :

D=-\sum_{0\leq i<j\leq 3}\langle e_{i},De_{j}\rangle c_{ij}c_{ij}^{\mathrm{T}}.

(25)

Proof.

Let $i,j,k,l$ be as in the definition of $c_{ij}$ . Then

\langle e_{i},c_{ij}\rangle=\langle e_{i},e_{k}\times e_{l}\rangle=\det(e_{i},e_{k},e_{l})\in\{-1,1\},

since $(e_{i},e_{k},e_{l})$ is a basis of ${\mathbb{Z}}^{3}$ . Also

	$\displaystyle\langle e_{j},c_{ij}\rangle$	$\displaystyle=\langle-e_{i}-e_{k}-e_{l},\,e_{k}\times e_{l}\rangle$
		$\displaystyle=-\langle e_{i},\ e_{k}\times e_{l}\rangle=-\langle e_{i},c_{ij}\rangle.$

In addition, clearly, $\langle e_{k},c_{ij}\rangle=\langle e_{l},c_{ij}\rangle=0.$ Denoting by $D^{\prime}$ the right hand side of (25), we obtain as a result

	$\displaystyle\langle e_{0},D^{\prime}e_{0}\rangle$	$\displaystyle=-\langle e_{0},De_{1}\rangle-\langle e_{0},De_{2}\rangle-\langle e_{0},De_{3}\rangle$
		$\displaystyle=\langle e_{0},D(-e_{1}-e_{2}-e_{3})\rangle=\langle e_{0},De_{0}\rangle.$
	$\displaystyle\langle e_{0},D^{\prime}e_{1}\rangle$	$\displaystyle=-\langle e_{0},De_{1}\rangle\langle e_{0},c_{01}\rangle\langle e_{1},c_{01}\rangle=\langle e_{0},De_{1}\rangle.$

Likewise $\langle e_{i},D^{\prime}e_{j}\rangle=\langle e_{i},De_{j}\rangle$ for all $i,j\in\{0,1,2,3\}$ . Since $(e_{1},e_{2},e_{3})$ is a basis of ${\mathbb{R}}^{3}$ , it follows as announced that $D=D^{\prime}$ . ∎
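Identity (25) can also be checked numerically in exact integer arithmetic. The sketch below is not part of the original argument: the function name `selling_sum`, the matrix $D$ and the superbase are arbitrary illustrative choices (the superbase is deliberately not $D$ -obtuse, since the lemma does not require it).

```python
from itertools import combinations

def cross(a, b):
    """Cross product of two vectors of Z^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def bilinear(u, D, v):
    """Return <u, D v> for a 3x3 matrix D given as nested lists."""
    return sum(u[r] * D[r][c] * v[c] for r in range(3) for c in range(3))

def selling_sum(superbase, D):
    """Right hand side of (25): -sum_{i<j} <e_i, D e_j> c_ij c_ij^T."""
    total = [[0] * 3 for _ in range(3)]
    for i, j in combinations(range(4), 2):
        k, l = (m for m in range(4) if m not in (i, j))
        c = cross(superbase[k], superbase[l])
        coeff = -bilinear(superbase[i], D, superbase[j])
        for r in range(3):
            for s in range(3):
                total[r][s] += coeff * c[r] * c[s]
    return total

# Illustrative data (assumption): a symmetric positive definite matrix, and
# a superbase built from a basis of Z^3 of determinant 1.
D = [[4, 1, 0], [1, 3, 1], [0, 1, 2]]
superbase = [(-2, -2, -1), (1, 0, 0), (1, 1, 0), (0, 1, 1)]
reconstructed = selling_sum(superbase, D)
```

Since all quantities are integers, the comparison `reconstructed == D` is exact and does not involve any floating point tolerance.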

3 Equivalence to a finite element discretization

This section is devoted to the proof of Theorem 1: the asymptotic equivalence of AD-LBR with a finite element discretization on an Anisotropic Delaunay Triangulation (ADT). We use the notations of §1. The existence of the ADT ${\mathcal{T}}_{h}$ is established in the first subsection, for $h$ sufficiently small, as well as a few of its properties. The second subsection is devoted to the study of $M$ -reduced bases. Theorem 1 is proved in the third subsection, by comparing the stencils of the AD-LBR and of the finite element discretization.

We denote by ${\boldsymbol{\kappa}}$ the maximum anisotropy ratio (9) of the diffusion tensor

{\boldsymbol{\kappa}}:=\max_{z\in\Omega}\kappa(\operatorname{\mathbf{D}}(z)).

(26)

Observing that $\kappa(\operatorname{\mathbf{D}}(z))=\kappa(\operatorname{\mathbf{M}}(z))$ , and recalling that $\det(\operatorname{\mathbf{M}}(z))=1$ , one easily checks that

{\boldsymbol{\kappa}}^{-\frac{1}{2}}\|e\|\leq\|e\|_{\operatorname{\mathbf{M}}(z)}\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}\|e\|,

(27)

for all $z\in\Omega$ and all $e\in{\mathbb{R}}^{2}$ .

3.1 Existence of an ADT

Our first lemma provides a uniform bound on the size of the Voronoi regions, see Figure 3, involved in the construction of the ADT.

Lemma 5.

(i)

For all $r\in{\mathbb{R}}^{2}$ , one has $\Delta_{h}(r)\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}h$ .
(ii)

If $p,q\in h{\mathbb{Z}}^{2}$ , and $r\in\operatorname{Vor}_{h}(p)\cap\operatorname{Vor}_{h}(q)$ , then ${\|p-r\|\leq{\boldsymbol{\kappa}}h}$ and $\|p-q\|\leq 2{\boldsymbol{\kappa}}h$ .

Proof.

Point (i). Rounding the coordinates of $r$ to a nearest multiple of $h$ , we obtain a point $p\in h{\mathbb{Z}}^{2}$ such that $\|p-r\|\leq h$ . Recalling (27) we obtain $\delta_{p}(r)\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}h$ , and therefore $\Delta_{h}(r)\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}h$ in view of (15).

Point (ii). We have ${\boldsymbol{\kappa}}^{-\frac{1}{2}}\|p-r\|\leq\delta_{p}(r)=\Delta_{h}(r)\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}h$ . Thus $\|p-r\|\leq{\boldsymbol{\kappa}}h$ , and likewise $\|q-r\|\leq{\boldsymbol{\kappa}}h$ . Finally, by the triangle inequality, $\|p-q\|\leq\|p-r\|+\|q-r\|\leq 2{\boldsymbol{\kappa}}h$ . ∎

Following the notations of [10], we denote by $\tau(p,q)$ , $p,q\in{\mathbb{R}}^{2}$ , the smallest constant $\tau\geq 1$ such that

\tau^{-1}\delta_{p}(r)\leq\delta_{q}(r)\leq\tau\delta_{p}(r),\quad\text{for all }r\in{\mathbb{R}}^{2}.

Equivalently, in the sense of symmetric matrices,

\tau^{-2}\operatorname{\mathbf{M}}(p)\leq\operatorname{\mathbf{M}}(q)\leq\tau^{2}\operatorname{\mathbf{M}}(p).

(28)

We also define a quantity $\tau_{h}\geq 1$ , closely related to the modulus of continuity of the metric $\operatorname{\mathbf{M}}$ :

\tau_{h}:=\max\{\tau(p,q);\,\|p-q\|\leq 2{\boldsymbol{\kappa}}h\}.

(29)

One has $\tau_{h}\to 1$ as $h\to 0$ , for any continuous metric $\operatorname{\mathbf{M}}$ (indeed $\operatorname{\mathbf{M}}$ is periodic and therefore uniformly continuous). If $\operatorname{\mathbf{M}}$ is Lipschitz, as assumed in Theorem 1, then $\tau_{h}=1+{\mathcal{O}}(h)$ .
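For two given $2\times 2$ symmetric positive definite matrices, the constant $\tau$ in (28) can be computed in closed form. The sketch below relies on a standard linear algebra reformulation which is not taken from the text: $\tau^{2}$ is the largest of $\lambda_{\max}$ and $1/\lambda_{\min}$ , where $\lambda_{\min}\leq\lambda_{\max}$ are the roots of $\det(B-\lambda A)=0$ . The function name `tau` and the sample matrices are illustrative assumptions.

```python
import math

def tau(A, B):
    """Smallest tau >= 1 such that tau^-2 A <= B <= tau^2 A, in the sense
    of symmetric matrices, for 2x2 symmetric positive definite A and B."""
    (a11, a12), (_, a22) = A
    (b11, b12), (_, b22) = B
    det_a = a11 * a22 - a12 * a12
    det_b = b11 * b22 - b12 * b12
    # det(B - lam A) = det_a lam^2 - mixed lam + det_b.
    mixed = a11 * b22 + a22 * b11 - 2 * a12 * b12
    disc = math.sqrt(max(mixed * mixed - 4 * det_a * det_b, 0.0))
    lam_max = (mixed + disc) / (2 * det_a)
    lam_min = det_b / (det_a * lam_max)  # product of the roots is det_b/det_a
    return math.sqrt(max(lam_max, 1 / lam_min))

# Illustrative metrics (assumption), both of determinant 1.
identity = ((1.0, 0.0), (0.0, 1.0))
stretched = ((4.0, 0.0), (0.0, 0.25))
tau_example = tau(identity, stretched)
```

When both matrices have determinant $1$ , as is the case for the metric $\operatorname{\mathbf{M}}$ , one has $\lambda_{\min}=1/\lambda_{\max}$ and thus simply $\tau^{2}=\lambda_{\max}$ .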

We show in the next lemma the existence of an ADT, by applying the main result of [10], under the assumption that $\tau_{h}$ is sufficiently close to $1$ . More precisely, we assume in the rest of this subsection that

\tau_{h}<\sqrt{1+{\boldsymbol{\kappa}}^{-2}}.

(30)

Lemma 6.

(i)

If $p,q\in h{\mathbb{Z}}^{2}$ , $p\neq q$ , and $r\in\operatorname{Vor}_{h}(p)\cap\operatorname{Vor}_{h}(q)$ , then $\delta_{p}(r)<\delta_{p}(q)/\sqrt{\tau(p,q)^{2}-1}.$
(ii)

The geometric dual ${\mathcal{Q}}_{h}$ of the Voronoi diagram is, as announced in §1, a polygonization of ${\mathbb{R}}^{2}$ into strictly convex polygons, with vertices $h{\mathbb{Z}}^{2}$ .

Proof.

Point (i). We may assume that $\tau(p,q)>1$ , otherwise there is nothing to prove. Point (ii) of Lemma 5 implies that $\|p-q\|\leq 2{\boldsymbol{\kappa}}h$ , thus

\sqrt{\tau(p,q)^{2}-1}\leq\sqrt{\tau_{h}^{2}-1}<{\boldsymbol{\kappa}}^{-1}.

On the other hand $\delta_{p}(q)\geq{\boldsymbol{\kappa}}^{-\frac{1}{2}}\|q-p\|\geq{\boldsymbol{\kappa}}^{-\frac{1}{2}}h$ , and $\delta_{p}(r)\leq\Delta_{h}(r)\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}h$ . The announced inequality follows.

Point (ii). We apply Theorem 7 (Dual Triangulation Theorem) in [10]. Since the domain ${\mathbb{R}}^{2}$ has no boundary, it suffices to check that all the Voronoi arcs and vertices are wedged, see [10]. This condition means that for any $p,q\in h{\mathbb{Z}}^{2}$ such that $p\neq q$ , and any $r\in\operatorname{Vor}_{h}(p)\cap\operatorname{Vor}_{h}(q)$ , one has $(r-q)\operatorname{\mathbf{M}}(q)(p-q)>0$ , and likewise exchanging the roles of $p$ and $q$ . Heuristically, it expresses the acuteness of some angles measured in the local metric. Lemma 5 in [10] shows that this condition follows from point (i) of this lemma, which concludes the proof. ∎

We recall that ${\mathcal{T}}_{h}$ is the triangulation obtained by arbitrarily triangulating the polygonization ${\mathcal{Q}}_{h}$ of the previous lemma, respecting periodicity, see Definition 3. Generically ${\mathcal{Q}}_{h}$ is already a triangulation, hence ${\mathcal{T}}_{h}={\mathcal{Q}}_{h}$ , see §1. The Voronoi regions $\operatorname{Vor}_{h}$ , and the triangulation ${\mathcal{T}}_{h}$ , are illustrated in Figures 2 and 3.

The next lemma provides estimates of the diameter, the area, and the angles of the elements of ${\mathcal{T}}_{h}$ . These geometrical properties also have an interpretation in the context of lattices: (ii) shows that the edges of any triangle $T\in{\mathcal{T}}_{h}$ define a superbase $(e,f,g)$ of ${\mathbb{Z}}^{2}$ , and (iii) that this superbase is almost $\operatorname{\mathbf{M}}(z)$ -obtuse, for any $z\in T$ .

Note that the vertices $p,q,r$ of any triangle $T\in{\mathcal{T}}_{h}$ satisfy by construction

\operatorname{Vor}_{h}(p)\cap\operatorname{Vor}_{h}(q)\cap\operatorname{Vor}_{h}(r)\neq\emptyset.

(31)

Lemma 7.

Denote by $he,hf,hg$ the edges of a triangle $T\in{\mathcal{T}}_{h}$ , where $e,f,g\in{\mathbb{Z}}^{2}$ are oriented so that $e+f+g=0$ . Then

(i)

$\max\{\|e\|,\|f\|,\|g\|\}\leq 2{\boldsymbol{\kappa}}$ .
(ii)

$|\det(e,f)|=1$ , thus $|T|=h^{2}/2$ .
(iii)

$\langle e,\operatorname{\mathbf{M}}(z)f\rangle\leq\theta_{h}$ , for any $z\in T$ , where $\theta_{h}\to 0$ as $h\to 0$ . (Explicitly: $\theta_{h}={\boldsymbol{\kappa}}(3+9\tau_{2h}^{2})(\tau_{2h}^{2}-1)$ )

Proof.

Point (i). We denote by $p,q,r$ the vertices of $T$ , ordered in such a way that $p+he=q$ , $q+hf=r$ , $r+hg=p$ . The announced estimate follows from (31), and from point (ii) of Lemma 5.

Point (ii). Since ${\mathcal{T}}_{h}$ is a conforming triangulation, the intersection of $T$ with the collection $h{\mathbb{Z}}^{2}$ of all vertices of ${\mathcal{T}}_{h}$ consists of only three points: the vertices $p,q,r$ of $T$ . Thus the triangle of vertices $-e,0,f$ , homothetic to $T$ , contains no point of integer coordinates but its vertices. This implies that $(e,f)$ is a basis of ${\mathbb{Z}}^{2}$ , hence $|\det(e,f)|=1$ , as announced.

Point (iii). The pairwise distances between $p,q,r$ are bounded by $2{\boldsymbol{\kappa}}h$ , see point (i), and since $z\in T$ so are the pairwise distances between $p,q,r,z$ . Defining $s:=p-q+r\in h{\mathbb{Z}}^{2}$ , and observing that $\|s-p\|=\|r-q\|\leq 2{\boldsymbol{\kappa}}h$ , we find that the pairwise distances between $p,q,r,z,s$ are bounded by $4{\boldsymbol{\kappa}}h$ .

Let $x\in\operatorname{Vor}_{h}(p)\cap\operatorname{Vor}_{h}(q)\cap\operatorname{Vor}_{h}(r)$ . We have $\delta_{p}(x)=\delta_{q}(x)=\delta_{r}(x)=\Delta_{h}(x)\leq\delta_{s}(x)$ , thus

\delta_{s}(x)^{2}\geq\delta_{p}(x)^{2}-\delta_{q}(x)^{2}+\delta_{r}(x)^{2}.

(32)

(For intuition: in a classical Delaunay triangulation, $x$ would be the circumcenter of $T$ , and (32) would state that $s$ is outside the circumcircle of $T$ .) Denoting $M:=\operatorname{\mathbf{M}}(z)$ , and $\delta:=\Delta_{h}(x)$ , we obtain

	$\displaystyle|\delta_{p}(x)^{2}-\|x-p\|^{2}_{M}|$	$\displaystyle\leq\delta_{p}(x)^{2}(\tau(p,z)^{2}-1)$
		$\displaystyle\leq\delta^{2}(\tau_{2h}^{2}-1),$		(33)

using Lemma 5, and likewise for $q,r$ . We also have

	$\displaystyle\delta_{s}(x)$	$\displaystyle=\|p-q+r-x\|_{\operatorname{\mathbf{M}}(s)}$
		$\displaystyle\leq\|p-x\|_{\operatorname{\mathbf{M}}(s)}+\|q-x\|_{\operatorname{\mathbf{M}}(s)}+\|r-x\|_{\operatorname{\mathbf{M}}(s)}$
		$\displaystyle\leq 3\delta\tau_{2h}.$

Thus, proceeding as in (33),

|\delta_{s}(x)^{2}-\|s-x\|^{2}_{M}|\leq\delta_{s}(x)^{2}(\tau_{2h}^{2}-1)\leq 9\delta^{2}\tau^{2}_{2h}(\tau_{2h}^{2}-1).

Inserting in (32) these estimates of $\delta_{\star}(x)$ , $\star\in\{p,q,r,s\}$ , and using the fact that $\delta\leq{\boldsymbol{\kappa}}^{\frac{1}{2}}h$ , see Lemma 5, we obtain after expansion the announced estimate of $\langle e,Mf\rangle$ . ∎

We next rewrite the finite element energy ${\mathcal{E}}^{\prime}_{h}$ (12) in a form similar to that of the AD-LBR energy ${\mathcal{E}}_{h}$ (3). Let $\varphi_{p}^{h}:{\mathbb{R}}^{2}\to{\mathbb{R}}$ be the piecewise linear function on ${\mathcal{T}}_{h}$ such that $\varphi_{p}^{h}(p)=1$ , and $\varphi_{p}^{h}(q)=0$ for any vertex $q\in h{\mathbb{Z}}^{2}$ distinct from $p$ . This is the classical “hat function” encountered in finite element analysis. For all $p\in h{\mathbb{Z}}^{2}$ , $e\in{\mathbb{Z}}^{2}\setminus\{0\}$ , let

\gamma^{h}_{p}(e):=-\frac{1}{2}\int_{{\mathbb{R}}^{2}}\langle\nabla\varphi_{p}^{h}(z),\operatorname{\mathbf{D}}(z)\nabla\varphi_{p+he}^{h}(z)\rangle dz

(34)

Clearly, $\gamma_{p}^{h}(e)=0$ if $[p,p+he]$ is not an edge of ${\mathcal{T}}_{h}$ , in other words if $e$ does not belong to the stencil $V_{h}(p)$ , defined in (17). We express in the next lemma the finite element energy ${\mathcal{E}}^{\prime}_{h}$ (12) in terms of the stencils $V_{h}$ and of the (potentially negative) weights $\gamma^{h}_{p}$ .

Lemma 8.

For any $u\in L^{2}(\Omega_{h})$ , extended by periodicity to $h{\mathbb{Z}}^{2}$ , one has

{\mathcal{E}}^{\prime}_{h}(u)=\sum_{p\in\Omega_{h}}\sum_{e\in V_{h}(p)}\gamma_{p}^{h}(e)|u(p+he)-u(p)|^{2}.

(35)

Proof.

For any triangle $T\in{\mathcal{T}}_{h}$ , and any $p,q\in h{\mathbb{Z}}^{2}$ , we denote

s_{T}(p,q):=\int_{T}\langle\nabla\varphi_{p}^{h}(z),\,\operatorname{\mathbf{D}}(z)\nabla\varphi_{q}^{h}(z)\rangle\,dz.

Clearly $s_{T}(p,q)=0$ if $q$ or $p$ is not a vertex of $T$ . The coefficient $\gamma_{p}^{h}(e)$ , $e\in{\mathbb{Z}}^{2}$ , is thus given by the following sum with at most two non-zero terms:

\gamma_{p}^{h}(e)=-\frac{1}{2}\sum_{T\in{\mathcal{T}}_{h}}s_{T}(p,p+he).

(36)

Let $p,q,r\in h{\mathbb{Z}}^{2}$ be the vertices of a triangle $T\in{\mathcal{T}}_{h}$ . Since the sum $\varphi_{p}^{h}+\varphi_{q}^{h}+\varphi_{r}^{h}$ is constant on $T$ , equal to $1$ , it has a null gradient on $T$ , and therefore

s_{T}(p,p)+s_{T}(p,q)+s_{T}(p,r)=0.

Using this relation, and the two similar ones obtained by a cyclic permutation of $p,q,r$ , we obtain

		$\displaystyle\int_{T}\|\nabla(\operatorname{I}_{{\mathcal{T}}_{h}}u)(z)\|_{\operatorname{\mathbf{D}}(z)}^{2}dz$
	$\displaystyle=$	$\displaystyle\,u(p)^{2}s_{T}(p,p)+u(q)^{2}s_{T}(q,q)+u(r)^{2}s_{T}(r,r)$
		$\displaystyle+2u(p)u(q)s_{T}(p,q)+2u(q)u(r)s_{T}(q,r)$
		$\displaystyle+2u(r)u(p)s_{T}(r,p),$
	$\displaystyle=$	$\displaystyle-s_{T}(p,q)(u(p)-u(q))^{2}-s_{T}(q,r)(u(q)-u(r))^{2}$
		$\displaystyle-s_{T}(r,p)(u(r)-u(p))^{2}.$

Summing this expression over all $T\in{\mathcal{T}}_{h}$ , and combining it with (36), we obtain (35), which concludes the proof. ∎
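The algebraic identity used in this proof, on a single triangle, only relies on the fact that the rows of the local stiffness matrix sum to zero. It can be checked on arbitrary data, as in the following sketch; the function name `energy_two_ways` and the sample values are illustrative assumptions, and the off-diagonal entries are not derived from an actual triangle.

```python
def energy_two_ways(s_pq, s_qr, s_rp, u_p, u_q, u_r):
    """Evaluate the local energy as a quadratic form, and as a weighted sum
    of squared differences, the diagonal entries being fixed by zero row sums."""
    s_pp = -(s_pq + s_rp)
    s_qq = -(s_pq + s_qr)
    s_rr = -(s_qr + s_rp)
    quadratic = (u_p ** 2 * s_pp + u_q ** 2 * s_qq + u_r ** 2 * s_rr
                 + 2 * u_p * u_q * s_pq
                 + 2 * u_q * u_r * s_qr
                 + 2 * u_r * u_p * s_rp)
    differences = -(s_pq * (u_p - u_q) ** 2
                    + s_qr * (u_q - u_r) ** 2
                    + s_rp * (u_r - u_p) ** 2)
    return quadratic, differences

# Arbitrary integer data (assumption), so that the comparison is exact.
quadratic, differences = energy_two_ways(-3, 1, -2, 5, -1, 2)
```

Note that the entry $s_{T}(q,r)$ is positive in this example: the identity holds regardless of the sign of the coefficients, which is why the finite element weights $\gamma_{p}^{h}$ are only potentially non-negative.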

Finally, we provide an approximation of the coefficients $\gamma_{p}^{h}$ which will be easily compared with the AD-LBR weights $\gamma_{p}$ (11).

Lemma 9.

Consider an edge $[p,p+he]$ of ${\mathcal{T}}_{h}$ , shared by the two distinct triangles $T,T^{\prime}\in{\mathcal{T}}_{h}$ . Let $hf,hg$ (resp. $hf^{\prime},hg^{\prime}$ ) be the two other vector edges of $T$ (resp. $T^{\prime}$ ), oriented so that $e+f+g=0$ (resp. $e+f^{\prime}+g^{\prime}=0$ ). Then

\left|\gamma_{p}^{h}(e)+\frac{1}{4}\left(\langle f^{\perp},\operatorname{\mathbf{D}}(p)\,g^{\perp}\rangle+\langle f^{\prime\perp},\operatorname{\mathbf{D}}(p)\,g^{\prime\perp}\rangle\right)\right|\leq\varepsilon_{h},

where $\varepsilon_{h}:=2{\boldsymbol{\kappa}}^{2}\max\{\|\operatorname{\mathbf{D}}(x)-\operatorname{\mathbf{D}}(y)\|;\,\|x-y\|\leq 2{\boldsymbol{\kappa}}h\}$ .

Proof.

We assume, up to exchanging $f$ and $g$ , that $[p,p-hg]$ is an edge of $T$ . Let $\alpha:=\det(e,f)\in\{-1,1\}$ , see point (ii) of Lemma 7; note that $\alpha=\det(f,g)=\det(g,e)$ . Let $\gamma$ be the constant value of $\nabla\varphi_{p}^{h}$ on $T$ . Then $\langle\gamma,he\rangle=-1$ and $\langle\gamma,hg\rangle=1$ . These two independent linear identities are also satisfied by $\alpha f^{\perp}/h$ , hence $\nabla\varphi_{p}^{h}=\gamma=\alpha f^{\perp}/h$ on $T$ .

Denoting $q:=p+he$ , we obtain likewise $\nabla\varphi_{q}^{h}=\alpha g^{\perp}/h$ on $T$ . Hence recalling that $|T|=h^{2}/2$ :

	$\displaystyle\int_{T}\langle\nabla\varphi_{p}^{h},\operatorname{\mathbf{D}}(p)\nabla\varphi_{q}^{h}\rangle$	$\displaystyle=\frac{h^{2}}{2}\left\langle\frac{\alpha f^{\perp}}{h},\operatorname{\mathbf{D}}(p)\frac{\alpha g^{\perp}}{h}\right\rangle$
		$\displaystyle=\frac{1}{2}\left\langle f^{\perp},\operatorname{\mathbf{D}}(p)g^{\perp}\right\rangle$

Therefore, using point (i) of Lemma 7 in the last step,

	$\displaystyle\left|\int_{T}\langle\nabla\varphi_{p}^{h},\operatorname{\mathbf{D}}(z)\nabla\varphi_{q}^{h}\rangle dz-\frac{1}{2}\langle f^{\perp},\operatorname{\mathbf{D}}(p)g^{\perp}\rangle\right|$
	$\displaystyle=\left|\int_{T}\langle\nabla\varphi_{p}^{h},(\operatorname{\mathbf{D}}(z)-\operatorname{\mathbf{D}}(p))\nabla\varphi_{q}^{h}\rangle dz\right|$
	$\displaystyle\leq\frac{h^{2}}{2}\frac{2{\boldsymbol{\kappa}}}{h}\frac{2{\boldsymbol{\kappa}}}{h}\max\{\|\operatorname{\mathbf{D}}(z)-\operatorname{\mathbf{D}}(p)\|;\,z\in T\}\leq\varepsilon_{h}.$

Proceeding likewise on $T^{\prime}$ , and recalling (34) (or (36)), we conclude the proof. ∎
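The gradient formulas used in this proof can be verified on a concrete triangle with exact rational arithmetic. The sketch below makes several assumptions which are not in the text: the symbol $\perp$ is taken to be the rotation by $+\pi/2$ , i.e. $(x,y)^{\perp}=(-y,x)$ , the tensor is constant on the triangle, and the names `perp`, `hat_gradient`, as well as the values of $h,e,f$ , are illustrative. The vertices of the triangle are $p$ , $p+he$ and $p-hg$ .

```python
from fractions import Fraction

def perp(v):
    """Rotate v by +90 degrees (assumed convention for the perp symbol)."""
    return (-v[1], v[0])

def det(u, v):
    return u[0] * v[1] - u[1] * v[0]

def dot(u, D, v):
    """Return <u, D v> for a 2x2 matrix D."""
    return sum(u[r] * D[r][c] * v[c] for r in range(2) for c in range(2))

def hat_gradient(vertex, other1, other2):
    """Gradient of the affine map equal to 1 at vertex and 0 at the others."""
    a = (other1[0] - vertex[0], other1[1] - vertex[1])
    b = (other2[0] - vertex[0], other2[1] - vertex[1])
    d = Fraction(det(a, b))
    # Cramer's rule for the system <grad, a> = -1, <grad, b> = -1.
    return ((a[1] - b[1]) / d, (b[0] - a[0]) / d)

h = Fraction(1, 4)
e, f = (2, 1), (-1, 0)                    # det(e, f) = 1
g = (-e[0] - f[0], -e[1] - f[1])          # e + f + g = 0
alpha = det(e, f)
p = (Fraction(0), Fraction(0))
q = (p[0] + h * e[0], p[1] + h * e[1])    # q = p + h e
r = (p[0] - h * g[0], p[1] - h * g[1])    # r = p - h g

grad_p = hat_gradient(p, q, r)
grad_q = hat_gradient(q, p, r)
D = ((2, 1), (1, 3))                      # constant tensor (assumption)
s_T = (h * h / 2) * dot(grad_p, D, grad_q)
```

With these values `grad_p` and `grad_q` coincide with $\alpha f^{\perp}/h$ and $\alpha g^{\perp}/h$ respectively, and `s_T` equals $\frac{1}{2}\langle f^{\perp},Dg^{\perp}\rangle$ , independently of $h$ .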

3.2 Some properties of $M$ -reduced bases

We establish some technical properties of $M$ -reduced bases, thanks to which we will be able to compare in §3.3 the “geometric” construction of the ADT finite element stencils $V_{h}$ , with the lattice based construction of the AD-LBR stencils $V$ .

Lemma 10.

Let $M\in S_{2}^{+}$ , let $e_{1},\cdots,e_{n}\in{\mathbb{Z}}^{2}$ , $n>2$ , and let $\varepsilon\in\{-1,1\}$ . Assume that for all $1\leq i\leq n$ , with the convention $e_{n+1}:=e_{1}$ :

	$\displaystyle\det(e_{i},e_{i+1})$	$\displaystyle=\varepsilon,$		(37)
	$\displaystyle\langle e_{i},Me_{i+1}\rangle$	$\displaystyle>-\frac{1}{2}\min\left\{\|e_{i}\|_{M}^{2},\|e_{i+1}\|_{M}^{2}\right\}.$		(38)

Then any $M$ -reduced basis $(e,f)$ satisfies

\{e,f\}\subset\{e_{1},\cdots,e_{n}\}.

Proof.

Let $z\in{\mathbb{Z}}^{2}\setminus\{e_{1},\cdots,e_{n}\}$ . Our objective is to show that $z$ cannot be an element of an $M$ -reduced basis, and we may therefore assume that $z$ has co-prime coordinates.

It follows from (37) that the closed polygonal line of consecutive vertices $e_{1},\cdots,e_{n}$ , circles at least once around the origin, see Figure 5. Hence $z=\alpha e_{i}+\beta e_{i+1}$ , for some $1\leq i\leq n$ and some $\alpha,\beta\geq 0$ . Since $(e_{i},e_{i+1})$ is a basis of ${\mathbb{Z}}^{2}$ (indeed $|\det(e_{i},e_{i+1})|=1$ ), the coefficients $\alpha$ and $\beta$ are integers. Since $z\notin\{e_{i},e_{i+1}\}$ , $\alpha+\beta\geq 2$ . Since $z$ has co-prime coordinates, $\alpha\beta\neq 0$ .

Assuming without loss of generality that $\|e_{i}\|_{M}\geq\|e_{i+1}\|_{M}$ , we obtain using (38):

	$\displaystyle\|z\|_{M}^{2}=\alpha^{2}\|e_{i}\|_{M}^{2}+\beta^{2}\|e_{i+1}\|_{M}^{2}+2\alpha\beta\langle e_{i},Me_{i+1}\rangle$
	$\displaystyle>\alpha^{2}\|e_{i}\|_{M}^{2}+\beta^{2}\|e_{i+1}\|_{M}^{2}-\alpha\beta\min\{\|e_{i}\|_{M}^{2},\|e_{i+1}\|_{M}^{2}\}$
	$\displaystyle\geq\|e_{i}\|^{2}_{M}+(\alpha^{2}+\beta^{2}-1-\alpha\beta)\|e_{i+1}\|_{M}^{2}.$

Observing that $\alpha^{2}+\beta^{2}-1-\alpha\beta\geq 0$ for all $\alpha,\beta\in[1,\infty[$ , we obtain $\|z\|_{M}>\|e_{i}\|_{M}$ . Since $e_{i}$ and $e_{i+1}$ are linearly independent, we have $\|e_{i}\|_{M}\geq\lambda_{2}(M)$ . Finally $\|z\|_{M}>\lambda_{2}(M)$ , hence $z$ cannot be an element of an $M$ -reduced basis, which concludes the proof. ∎

The next corollary reverses the construction, presented in Corollary 1, of an $M$ -obtuse superbase from an $M$ -reduced basis.

Corollary 2.

Let $M\in S_{2}^{+}$ and let $(e,f,g)$ be an $M$ -obtuse superbase of ${\mathbb{Z}}^{2}$ , ordered so that $\|e\|_{M}\leq\|f\|_{M}\leq\|g\|_{M}$ . Then $(e,f)$ is an $M$ -reduced basis.

Proof.

The family $(e,-g,f,-e,g,-f)$ satisfies by construction the conditions of the previous lemma. Hence any $M$ -reduced basis $(e^{\prime},f^{\prime})$ of ${\mathbb{Z}}^{2}$ satisfies $\{e^{\prime},f^{\prime}\}\subset\{e,f,g,-e,-f,-g\}$ . Observing that $e^{\prime}$ and $f^{\prime}$ are linearly independent, that $\|e^{\prime}\|_{M}\leq\|f^{\prime}\|_{M}$ , and that $\|e\|_{M}\leq\|f\|_{M}\leq\|g\|_{M}$ , we obtain that $\|e\|_{M}\leq\|e^{\prime}\|_{M}$ and $\|f\|_{M}\leq\|f^{\prime}\|_{M}$ . Recalling that $M$ -reduced bases are defined by the minimality of their $\|\cdot\|_{M}$ -norms, see Definition 4, we obtain as announced that $(e,f)$ is an $M$ -reduced basis. ∎
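Corollary 2 can be illustrated by a brute force computation on a small example. The sketch below is only a sanity check under stated assumptions: the matrix $M$ and the superbase are arbitrary choices, the function names are ours, and the Minkowski minima are obtained by enumerating the integer points of a box whose radius is assumed to be large enough for this mildly anisotropic $M$ .

```python
from itertools import product

def inner(M, u, v):
    """Return <u, M v> for a 2x2 matrix M."""
    return (u[0] * (M[0][0] * v[0] + M[0][1] * v[1])
            + u[1] * (M[1][0] * v[0] + M[1][1] * v[1]))

def is_obtuse_superbase(M, superbase):
    """Check e + f + g = 0, |det(e, f)| = 1, and non-positive scalar products."""
    e, f, g = superbase
    sums_to_zero = all(e[i] + f[i] + g[i] == 0 for i in range(2))
    unimodular = abs(e[0] * f[1] - e[1] * f[0]) == 1
    obtuse = all(inner(M, u, v) <= 0 for u, v in ((e, f), (f, g), (g, e)))
    return sums_to_zero and unimodular and obtuse

def minkowski_minima_squared(M, radius=10):
    """Brute force (lambda_1^2, lambda_2^2) over a box (radius is an assumption)."""
    box = range(-radius, radius + 1)
    vectors = sorted((v for v in product(box, repeat=2) if v != (0, 0)),
                     key=lambda v: inner(M, v, v))
    first = vectors[0]
    second = next(v for v in vectors if first[0] * v[1] - first[1] * v[0] != 0)
    return inner(M, first, first), inner(M, second, second)

def reduced_basis_from_superbase(M, superbase):
    """Keep the two elements of smallest M-norm, as in Corollary 2."""
    e, f, _ = sorted(superbase, key=lambda v: inner(M, v, v))
    return e, f

M = ((6, 2), (2, 3))
superbase = ((0, 1), (-1, 0), (1, -1))
e, f = reduced_basis_from_superbase(M, superbase)
```

Here the squared norms of the superbase elements are $3$ , $6$ and $5$ , so that the two shortest elements are $(0,1)$ and $(1,-1)$ , whose norms match the Minkowski minima found by enumeration.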

The previous corollary shows that for any $z\in\Omega$ , there exists an $\operatorname{\mathbf{M}}(z)$ -reduced basis $(e,f)$ such that

V(z)=\{e,f,e+f,-e,-f,-e-f\}.

(39)

Given $M\in S_{2}^{+}$ , and an $M$ -reduced basis $(e,f)$ of ${\mathbb{Z}}^{2}$ , we denote $\mu(M):=|\langle e,Mf\rangle|$ . This value can be expressed in terms of the Minkowski minima (19) and thus does not depend on the particular choice of $M$ -reduced basis. Indeed, recalling the identity

\langle e,Mf\rangle^{2}+\det(M)\det(e,f)^{2}=\|e\|_{M}^{2}\|f\|_{M}^{2},

we obtain

\mu(M)=|\langle e,Mf\rangle|=\sqrt{\lambda_{1}(M)^{2}\lambda_{2}(M)^{2}-\det(M)}.

(40)

In addition one has

0\leq 2\mu(M)\leq\lambda_{1}(M)^{2},

(41)

where the right hand side follows from Lemma 3. A vanishing value, $\mu(M)=0$ , indicates that the lattice ${\mathbb{Z}}^{2}$ admits an $M$ -orthogonal basis. In contrast, when the upper bound is met, $2\mu(M)=\lambda_{1}(M)^{2}$ , one has $\|f\|_{M}=\|f+\varepsilon e\|_{M}$ for $\varepsilon:=-{\rm sign}\langle e,Mf\rangle$ , hence the reduced basis $(e,f)$ is not unique even up to sign changes. See Figure 5.
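The quantities in (40) and (41) are easily evaluated once a reduced basis is known. The sketch below computes one with the classical Lagrange (Gauss) reduction algorithm in dimension two, which is our choice for the illustration and is not necessarily the procedure used elsewhere in this paper; the function names and the two integer matrices are likewise assumptions. Exact rational arithmetic is used for the rounding step.

```python
from fractions import Fraction

def inner(M, u, v):
    """Return <u, M v> for a 2x2 matrix M."""
    return (u[0] * (M[0][0] * v[0] + M[0][1] * v[1])
            + u[1] * (M[1][0] * v[0] + M[1][1] * v[1]))

def lagrange_reduce(M, e, f):
    """Lagrange (Gauss) reduction of a basis (e, f) of Z^2 for the norm of M."""
    while True:
        if inner(M, e, e) > inner(M, f, f):
            e, f = f, e
        k = round(Fraction(inner(M, e, f), inner(M, e, e)))
        if k == 0:
            # Now |e|_M <= |f|_M and 2 |<e, M f>| <= |e|_M^2.
            return e, f
        f = (f[0] - k * e[0], f[1] - k * e[1])

def mu(M, e, f):
    """The quantity |<e, M f>| of (40), for a reduced basis (e, f)."""
    return abs(inner(M, e, f))

def det(u, v):
    return u[0] * v[1] - u[1] * v[0]

M = ((6, 2), (2, 3))              # det(M) = 14
e, f = lagrange_reduce(M, (1, 0), (0, 1))
lam1_sq, lam2_sq = inner(M, e, e), inner(M, f, f)

M2 = ((101, 70), (70, 49))        # a more anisotropic example, det(M2) = 49
e2, f2 = lagrange_reduce(M2, (1, 0), (0, 1))
```

For the first matrix one finds $\lambda_{1}(M)^{2}=3$ , $\lambda_{2}(M)^{2}=5$ and $\mu(M)=1=\sqrt{3\times 5-14}$ , in agreement with (40), and the bound (41) reads $2\leq 3$ .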

We next show that the stencils of the AD-LBR do not depend on the choices of reduced bases, as was announced in the introduction.

Lemma 11.

The weights $\gamma_{z}:{\mathbb{Z}}^{2}\to{\mathbb{R}}_{+}$ used in the AD-LBR at a point $z\in\Omega$ (defined on $V(z)$ by (11) and extended to ${\mathbb{Z}}^{2}$ by $0$ ), do not depend on the choice of $\operatorname{\mathbf{M}}(z)$ -obtuse superbase of ${\mathbb{Z}}^{2}$ .

Proof.

We denote $M:=\operatorname{\mathbf{M}}(z)$ and $D:=\operatorname{\mathbf{D}}(z)$ . Let $(e,f,g)$ and $(e^{\prime},f^{\prime},g^{\prime})$ be two $M$ -obtuse superbases, and let $V,V^{\prime}$ and $\gamma,\gamma^{\prime}:{\mathbb{Z}}^{2}\to{\mathbb{R}}_{+}$ be the corresponding AD-LBR stencils and weights defined by (10) and (11). We may assume, using Corollary 2 and up to reordering, that $(e,f)$ and $(e^{\prime},f^{\prime})$ are $M$ -reduced bases.

Corollary 1 states that the scalar products $\langle e,Mg\rangle$ , $\langle f,Mg\rangle$ , $\langle e^{\prime},Mg^{\prime}\rangle$ and $\langle f^{\prime},Mg^{\prime}\rangle$ are (strictly) negative. On the other hand

\langle e,Mf\rangle=\langle e^{\prime},Mf^{\prime}\rangle=-\mu(M)\leq 0.

(42)

Applying Lemma 10 to the family

(e^{\prime},-g^{\prime},f^{\prime},-e^{\prime},g^{\prime},-f^{\prime})

we obtain that

\{e,f\}\subset\{e^{\prime},f^{\prime},g^{\prime},-e^{\prime},-f^{\prime},-g^{\prime}\}.

(43)

If $\mu(M)\neq 0$ , then $\langle e,Mf\rangle$ and $\langle e^{\prime},Mf^{\prime}\rangle$ are negative, and not merely non-positive, thus $\{e,f\}\subset\{e^{\prime},f^{\prime},g^{\prime}\}$ , or $\{e,f\}\subset\{-e^{\prime},-f^{\prime},-g^{\prime}\}$ . Since $e+f+g=0=e^{\prime}+f^{\prime}+g^{\prime}$ , it follows that $\{e,f,g\}=\{e^{\prime},f^{\prime},g^{\prime}\}$ , or $\{e,f,g\}=\{-e^{\prime},-f^{\prime},-g^{\prime}\}$ . The stencils $V,V^{\prime}$ are thus identical, see (10), and so are the weights $\gamma,\gamma^{\prime}$ .

If $\mu(M)=0$ , then the stencils $V,V^{\prime}$ may not be identical. Observe however that $\langle e^{\perp},Df^{\perp}\rangle=0=\langle e^{\prime\perp},Df^{\prime\perp}\rangle$ , using (8). Hence using the weights expression (11):

	$\displaystyle\gamma(\pm g)$	$\displaystyle=-\langle e^{\perp},Df^{\perp}\rangle/2=0,$				(44)
	$\displaystyle\gamma(\pm e)$	$\displaystyle=\|f^{\perp}\|_{D}^{2}/2,$	$\displaystyle\gamma(\pm f)$	$\displaystyle=\|e^{\perp}\|_{D}^{2}/2,$

and likewise for $\gamma^{\prime},e^{\prime},f^{\prime},g^{\prime}$ . Note also that $\|g^{\prime}\|^{2}_{M}=\|e^{\prime}\|_{M}^{2}+\|f^{\prime}\|_{M}^{2}>\lambda_{2}(M)^{2}$ , hence $e$ and $f$ are different from $g^{\prime}$ and $-g^{\prime}$ . It follows from (43) that $\{e,f\}=\{\varepsilon_{1}e^{\prime},\varepsilon_{2}f^{\prime}\}$ for some $\varepsilon_{1},\varepsilon_{2}\in\{-1,1\}$ . This implies $\gamma=\gamma^{\prime}$ in view of (44), and concludes the proof. ∎

The next lemma establishes weak uniqueness and stability properties for $M$ -reduced bases, in the case of a strict inequality $2\mu(M)<\lambda_{1}(M)^{2}.$

Lemma 12.

Consider $M,M^{\prime}\in S_{2}^{+}$ , an $M$ -reduced basis $(e,f)$ , and an $M^{\prime}$ -reduced basis $(e^{\prime},f^{\prime})$ . Let $\tau\geq 1$ be such that $\tau^{-2}M\leq M^{\prime}\leq\tau^{2}M$ , in the sense of symmetric matrices. Assume either:

(i)

$2\mu(M)<\lambda_{1}(M)^{2}$ , and $\tau=1$ (i.e. $M^{\prime}=M$ ).
(ii)

$4\mu(M)\leq\lambda_{1}(M)^{2}$ , and $\tau^{4}\leq 1+\frac{1}{3}{\kappa(M)^{-2}}$ .

Then $\{e^{\prime},f^{\prime}\}\subset\{e,f,-e,-f\}.$

Proof.

Denoting $\alpha:=2\mu(M)/\lambda_{1}(M)^{2}$ , we obtain:

	$\displaystyle 4\langle e,M^{\prime}f\rangle=\|e+f\|_{M^{\prime}}^{2}-\|e-f\|_{M^{\prime}}^{2}$
	$\displaystyle\leq\tau^{2}\|e+f\|_{M}^{2}-\tau^{-2}\|e-f\|_{M}^{2}$
	$\displaystyle=(\tau^{2}-\tau^{-2})(\|e\|_{M}^{2}+\|f\|_{M}^{2})+2(\tau^{2}+\tau^{-2})\langle e,Mf\rangle$
	$\displaystyle\leq((\tau^{2}-\tau^{-2})(1+\kappa(M)^{2})+\alpha(\tau^{2}+\tau^{-2}))\|e\|_{M}^{2}$
	$\displaystyle\leq((\tau^{4}-1)(1+\kappa(M)^{2})+\alpha(\tau^{4}+1))\|e\|_{M^{\prime}}^{2}.$

In the fourth line we used Lemma 2, which implies that $\|f\|_{M}=\lambda_{2}(M)\leq\kappa(M)\lambda_{1}(M)=\kappa(M)\|e\|_{M}$ , and Lemma 3 to bound $2\langle e,Mf\rangle$ . Replacing $\alpha$ and $\tau$ with their assumed upper bounds, we obtain $2\langle e,M^{\prime}f\rangle<\|e\|_{M^{\prime}}^{2}$ . Proceeding likewise, we obtain $2|\langle e,M^{\prime}f\rangle|<\min\{\|e\|_{M^{\prime}}^{2},\|f\|_{M^{\prime}}^{2}\}$ . We may therefore apply Lemma 10 to $M^{\prime}$ and $(e,f,-e,-f)$ , which implies $\{e^{\prime},f^{\prime}\}\subset\{e,f,-e,-f\}$ as announced. ∎

3.3 Comparison of the stencils

We assume in this subsection that the scale parameter $h$ is sufficiently small. Our assumption is stronger than the one used in §3.1, see (30), hence in particular there exists an Anisotropic Delaunay Triangulation ${\mathcal{T}}_{h}$ . More precisely we assume that

\tau_{h}\leq\sqrt[4]{1+1/(3{\boldsymbol{\kappa}}^{2})}\ \text{ and }\ \theta_{h}\leq\theta_{0}:=1/(4{\boldsymbol{\kappa}}).

(45)

See (26), (29), and Lemma 7 for the definition of ${\boldsymbol{\kappa}}$ , $\tau_{h}$ and $\theta_{h}$ respectively. For Lipschitz metrics, $\tau_{h}=1+{\mathcal{O}}(h)$ and $\theta_{h}={\mathcal{O}}(h)$ .

Our objective is to compare the stencils $V(p)$ , $V_{h}(p)$ , of the AD-LBR (10) and of the ADT finite element discretization (17) respectively, at a point $p\in h{\mathbb{Z}}^{2}$ . The next lemma shows that they are equal unless the lattice ${\mathbb{Z}}^{2}$ is almost orthogonal with respect to the local metric; a property quantified via $\mu(\operatorname{\mathbf{M}}(p))$ , see (40).

Lemma 13.

Let $p\in h{\mathbb{Z}}^{2}$ , and let $M:=\operatorname{\mathbf{M}}(p)$ . If $\mu(M)>\theta_{h}$ , then $V_{h}(p)=V(p)$ . In any case, one has for any $M$ -reduced basis $(e,f)$ :

	$\displaystyle V_{h}(p)\supset$	$\displaystyle\{e,f,-e,-f\}$		(46)
	$\displaystyle V_{h}(p)\subset$	$\displaystyle\{e,f,e+f,e-f,\ -e,-f,-e-f,f-e\}$		(47)

Proof.

We assume that $\langle e,Mf\rangle\leq 0$ , up to replacing $f$ with $-f$ . Let $T\in{\mathcal{T}}_{h}$ be a triangle containing $p$ , and let $he_{1},he_{2},he_{3}$ be the edges of $T$ , oriented so that $e_{1}+e_{2}+e_{3}=0$ . Using point (iii) of Lemma 7, and (27), we obtain for all $1\leq i\leq 3$ , with the convention $e_{4}:=e_{1}$

\langle e_{i},Me_{i+1}\rangle\leq\theta_{h}\leq\theta_{0}<\frac{1}{2{\boldsymbol{\kappa}}}\leq\frac{1}{2}\min\{\|e_{i}\|_{M}^{2},\|e_{i+1}\|_{M}^{2}\}.

(48)

Denote $E:=\{e_{1},e_{2},e_{3}\}$ , and $-E:=\{-e_{1},-e_{2},-e_{3}\}$ . Applying Lemma 10 to $M$ and the points $(e_{1},-e_{3},e_{2},-e_{1},e_{3},-e_{2})$ , we obtain that $\{e,f\}\subset E\cup(-E)$ . Up to exchanging $E$ with $-E$ , we thus have $\{e,f\}\subset E$ or $\{e,-f\}\subset E$ . Since the elements of $E$ sum to zero, we conclude that

E=\{e,f,-e-f\}\ \text{ or }\ E=\{e,-f,-e+f\},

(49)

which implies (47).

If $\mu(M)=|\langle e,Mf\rangle|>\theta_{h}$ , then (48) forbids the second case in (49). Thus $E=\{e,f,-e-f\}$ , and therefore $V_{h}(p)\subset V(p)$ , using (39).

Let $T\in{\mathcal{T}}_{h}$ be a triangle containing $p$ and intersecting the half line $L:=\{p+re;\,r>0\}$ . We know from (49) that $he$ is a vector edge of $T$ (i.e. the difference between two vertices of $T$ ). The corresponding edge segment must be $[p,p+he]$ , since otherwise $T\cap L$ would be empty. Thus $e\in V_{h}(p)$ . Applying the same argument to $-e,f,-f$ , we obtain (46).

If $\mu(M)>\theta_{h}$ , then $h(e+f)$ is also a vector edge of any triangle $T\in{\mathcal{T}}_{h}$ containing $p$ , since we eliminated the second case in (49). Reasoning as above we find that $\{e+f,-e-f\}\subset V_{h}(p)$ , and therefore $V(p)\subset V_{h}(p)$ . Thus $V(p)=V_{h}(p)$ . This concludes the proof. ∎

We introduce new stencils $W(p),W^{\prime}(p)$ , for $p\in{\mathbb{R}}^{2}$ , defined as follows. Let $M:=\operatorname{\mathbf{M}}(p)$ . If $\mu(M)\leq\theta_{0}$ , then denoting by $(e,f)$ an $M$ -reduced basis,

	$\displaystyle W(p)$	$\displaystyle:=\{e,f,-e,-f\},$		(50)
	$\displaystyle W^{\prime}(p)$	$\displaystyle:=\{e,f,e+f,e-f,\ -e,-f,-e-f,f-e\}.$		(51)

On the other hand, if $\mu(M)>\theta_{0}$ , then

W(p):=V(p)=:W^{\prime}(p).

(52)

The previous lemma implies that $W(p)\subset V_{h}(p)\subset W^{\prime}(p)$ for any $p\in h{\mathbb{Z}}^{2}$ .
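The definitions (50), (51) and (52) translate directly into code. In the following sketch, the function name `comparison_stencils` is ours, the threshold $\theta_{0}$ is passed as a parameter rather than derived from ${\boldsymbol{\kappa}}$ , the AD-LBR stencil is built from the reduced basis as in (39) after a sign normalization ensuring $\langle e,Mf\rangle\leq 0$ , and the matrices are illustrative (the second one is not normalized to determinant $1$ ).

```python
def inner(M, u, v):
    """Return <u, M v> for a 2x2 matrix M."""
    return (u[0] * (M[0][0] * v[0] + M[0][1] * v[1])
            + u[1] * (M[1][0] * v[0] + M[1][1] * v[1]))

def neg(v):
    return (-v[0], -v[1])

def add(u, v):
    return (u[0] + v[0], u[1] + v[1])

def comparison_stencils(M, e, f, theta0):
    """Return (W, W') as in (50)-(52), given an M-reduced basis (e, f)."""
    if inner(M, e, f) > 0:
        f = neg(f)                      # normalize so that <e, M f> <= 0
    mu = -inner(M, e, f)
    g = add(e, f)
    V = {e, f, g, neg(e), neg(f), neg(g)}   # AD-LBR stencil, see (39)
    if mu > theta0:
        return V, V                     # case (52)
    W = {e, f, neg(e), neg(f)}          # case (50)
    d = add(e, neg(f))
    W_prime = V | {d, neg(d)}           # case (51)
    return W, W_prime

identity = ((1, 0), (0, 1))
W_iso, W_iso_prime = comparison_stencils(identity, (1, 0), (0, 1), 0.25)

M = ((6, 2), (2, 3))                    # illustrative, with mu = 1
W_aniso, W_aniso_prime = comparison_stencils(M, (0, 1), (1, -1), 0.25)
```

In the first case $\mu(M)=0$ and the two stencils have $4$ and $8$ elements respectively; in the second case $\mu(M)$ exceeds the threshold and both stencils coincide with the six point AD-LBR stencil.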

Lemma 14.

The stencils $W(p)$ , $W^{\prime}(p)$ , do not depend on the choice of $\operatorname{\mathbf{M}}(p)$ -reduced basis.

Proof.

Let $M:=\operatorname{\mathbf{M}}(p)$ . If $\mu(M)>\theta_{0}$ , then $W(p)$ , $W^{\prime}(p)$ are defined by (52), hence there is nothing to prove. Otherwise we obtain $\mu(M)\leq\theta_{0}\leq 1/(4{\boldsymbol{\kappa}})\leq\lambda_{1}(M)^{2}/4$ . Hence, by Lemma 12, any two $M$ -reduced bases $(e,f)$ , $(e^{\prime},f^{\prime})$ , need to satisfy $\{e^{\prime},f^{\prime}\}\subset\{e,f,-e,-f\}$ . In view of (50) and (51), they thus yield the same stencils $W(p)$ , $W^{\prime}(p)$ . ∎

Let ${\mathcal{F}}_{h},{\mathcal{F}}^{\prime}_{h}$ be the energies associated to the stencils $W,W^{\prime}$ : for $u\in L^{2}(\Omega_{h})$ , extended to $h{\mathbb{Z}}^{2}$ by periodicity,

	$\displaystyle{\mathcal{F}}_{h}(u)$	$\displaystyle:=\sum_{z\in\Omega_{h}}\sum_{g\in W(z)}|u(z+hg)-u(z)|^{2},$
	$\displaystyle{\mathcal{F}}^{\prime}_{h}(u)$	$\displaystyle:=\sum_{z\in\Omega_{h}}\sum_{g\in W^{\prime}(z)}|u(z+hg)-u(z)|^{2}.$

The outline of the proof of Theorem 1 is as follows. We prove in Lemmas 16, 17 and 15 respectively that for any $u\in L^{2}(\Omega_{h})$ :

$\displaystyle|{\mathcal{E}}^{\prime}_{h}(u)-{\mathcal{E}}_{h}(u)|$	$\displaystyle\leq(\varepsilon_{h}+C_{0}\theta_{h}){\mathcal{F}}^{\prime}_{h}(u)$	(53)
$\displaystyle{\mathcal{F}}^{\prime}_{h}(u)$	$\displaystyle\leq C_{1}{\mathcal{F}}_{h}(u)$	(54)
$\displaystyle{\mathcal{F}}_{h}(u)$	$\displaystyle\leq C_{2}{\mathcal{E}}_{h}(u),$	(55)

where the constants $C_{0},C_{1},C_{2}$ only depend on the metric $\operatorname{\mathbf{M}}$ . Combining these inequalities, and recalling that $\theta_{h}={\mathcal{O}}(h)$ and $\varepsilon_{h}={\mathcal{O}}(h)$ for Lipschitz metrics ( $\varepsilon_{h}$ is defined in Lemma 9), we obtain

|{\mathcal{E}}^{\prime}_{h}(u)-{\mathcal{E}}_{h}(u)|\leq ch{\mathcal{E}}_{h}(u),

for some constant $c=c(\operatorname{\mathbf{M}})$ . This establishes (13), and concludes the proof of Theorem 1.

For each $p\in{\mathbb{R}}^{2}$ , we denote by $\eta_{p}$ , $\eta^{\prime}_{p}:{\mathbb{Z}}^{2}\to\{0,1\}$ , the characteristic functions of $W(p)$ and $W^{\prime}(p)$ respectively. The proofs of (53) and (55) immediately result from the comparison, in Lemmas 16 and 15 respectively, of the coefficients $\gamma_{p}$ , $\gamma^{h}_{p}$ , $\eta_{p}$ , $\eta^{\prime}_{p}$ appearing in the expressions of ${\mathcal{E}}_{h},{\mathcal{E}}^{\prime}_{h},{\mathcal{F}}_{h},{\mathcal{F}}^{\prime}_{h}$ .

In the following, it will be convenient to express the AD-LBR weights, and others, in terms of the scalar product associated to the Riemannian metric. We thus recall (8): for any $z\in\Omega$ , and any $e,f\in{\mathbb{R}}^{2}$ ,

\langle e^{\perp},\operatorname{\mathbf{D}}(z)f^{\perp}\rangle=\operatorname{\mathbf{d}}(z)\langle e,\operatorname{\mathbf{M}}(z)f\rangle.

We also define the bounds ( $0<\underline{\operatorname{\mathbf{d}}}\leq\overline{\operatorname{\mathbf{d}}}<\infty$ )

\underline{\operatorname{\mathbf{d}}}:=\min_{z\in\Omega}\operatorname{\mathbf{d}}(z),\quad\overline{\operatorname{\mathbf{d}}}:=\max_{z\in\Omega}\operatorname{\mathbf{d}}(z).

Lemma 15.

For any $p\in{\mathbb{R}}^{2}$ , one has on ${\mathbb{Z}}^{2}$

\eta_{p}\leq C_{2}\gamma_{p},\quad\text{ with }C_{2}:=2\overline{\operatorname{\mathbf{d}}}/\theta_{0}.

Proof.

Let $M:=\operatorname{\mathbf{M}}(p)$ , and let $(e,f,g)$ be an $M$ -obtuse superbase of ${\mathbb{Z}}^{2}$ . We can assume, thanks to Corollary 2, that $(e,f)$ is an $M$ -reduced basis. Then using (22)

2\operatorname{\mathbf{d}}(p)\gamma_{p}(\pm f)\geq\frac{1}{2}\|e\|_{M}^{2}\geq\frac{1}{2{\boldsymbol{\kappa}}}=2\theta_{0},

hence $\gamma_{p}(\pm f)\geq\theta_{0}/\overline{\operatorname{\mathbf{d}}}$ , and likewise $\gamma_{p}(\pm e)\geq\theta_{0}/\overline{\operatorname{\mathbf{d}}}$ . If $\mu(M)\leq\theta_{0}$ , then $W(p)=\{e,f,-e,-f\}$ , and this concludes the proof.

Assume now that $\mu(M)>\theta_{0}$ . Then

2\operatorname{\mathbf{d}}(p)\gamma_{p}(\pm g)=-\langle e,Mf\rangle=\mu(M)\geq\theta_{0},

hence $\gamma_{p}(\pm g)\geq\theta_{0}/(2\overline{\operatorname{\mathbf{d}}})$ . The result follows since $W(p)=\{e,f,g,-e,-f,-g\}$ . ∎

Let $p\in h{\mathbb{Z}}^{2}$ and let $e_{1},\cdots,e_{k}$ be the consecutive elements of $V_{h}(p)$ , in trigonometric order. We define for all $1\leq i\leq k$ , denoting $M:=\operatorname{\mathbf{M}}(p)$ ,

\tilde{\gamma}_{p}^{h}(e_{i}):=-\frac{\operatorname{\mathbf{d}}(p)}{4}(\langle e_{i}-e_{i-1},\,Me_{i-1}\rangle+\langle e_{i}-e_{i+1},\,Me_{i+1}\rangle),

with the periodic conventions $e_{k+1}:=e_{1}$ , $e_{0}:=e_{k}$ . We also set $\tilde{\gamma}_{p}^{h}=0$ on ${\mathbb{Z}}^{2}\setminus\{e_{1},\cdots,e_{k}\}$ .

Lemma 16.

For any $p\in h{\mathbb{Z}}^{2}$ , one has on ${\mathbb{Z}}^{2}$

|\gamma^{h}_{p}-\tilde{\gamma}^{h}_{p}|\leq\varepsilon_{h}\eta^{\prime}_{p},\quad\text{ and }\quad|\tilde{\gamma}^{h}_{p}-\gamma_{p}|\leq C_{0}\theta_{h}\eta^{\prime}_{p},

(56)

where $\varepsilon_{h}$ is given in Lemma 9, and $C_{0}=1/\underline{\operatorname{\mathbf{d}}}$ .

Proof.

The coefficients $\gamma_{p}$ , $\gamma^{h}_{p}$ , $\tilde{\gamma}^{h}_{p}$ , are all equal to zero outside of $W^{\prime}(p)$ . This holds by construction of $\gamma_{p}$ , and by Lemma 13 for $\gamma^{h}_{p}$ , $\tilde{\gamma}^{h}_{p}$ . We may therefore forget about the presence of $\eta^{\prime}_{p}$ in (56).

First inequality. Lemma 9 states that $|\gamma^{h}_{p}-\tilde{\gamma}^{h}_{p}|\leq\varepsilon_{h}$ on ${\mathbb{Z}}^{2}$ , which concludes the proof.

Second inequality. If $\mu(M)>\theta_{h}$ , then $V_{h}(p)=V(p)$ . Comparing the definition of $\tilde{\gamma}^{h}_{p}$ with that of $\gamma_{p}$ (11) we observe that $\tilde{\gamma}^{h}_{p}=\gamma_{p}$ on ${\mathbb{Z}}^{2}$ , which concludes the proof in this case.

Assume now that $\mu(M)\leq\theta_{h}$ . Let $(e,f,g)$ be an $M$ -obtuse superbase of ${\mathbb{Z}}^{2}$ . We can assume, thanks to Corollary 2, that $(e,f)$ is an $M$ -reduced basis. Looking at (11) and denoting $\delta:=2\operatorname{\mathbf{d}}(p)$ , we find that

|\delta\gamma_{p}(\pm e)-\|f\|_{M}^{2}|=|\langle e,Mf\rangle|=\mu(M)\leq\theta_{h}.

Likewise $|\delta\gamma_{p}(\pm f)-\|e\|_{M}^{2}|\leq\theta_{h}$ . In addition

\delta\gamma_{p}(\pm(e+f))=\mu(M)\leq\theta_{h},\text{ and }\gamma_{p}(\pm(e-f))=0.

Combining the definition of $\tilde{\gamma}^{h}_{p}$ with the description of the stencil $V_{h}(p)$ in Lemma 13, we obtain that

2\delta\,\tilde{\gamma}^{h}_{p}(e)=\left\{\begin{array}[]{c}\langle f-e,Mf\rangle\\ \text{or}\\ \langle f+e,Mf\rangle\end{array}\right\}+\left\{\begin{array}[]{c}\langle f-e,Mf\rangle\\ \text{or}\\ \langle f+e,Mf\rangle\end{array}\right\}.

In any case $|\delta\,\tilde{\gamma}^{h}_{p}(e)-\|f\|_{M}^{2}|\leq\theta_{h}$ . The expressions and estimates of $\tilde{\gamma}^{h}_{p}$ at the points $-e,f,-f$ are obtained similarly. Likewise, using Lemma 13,

2\delta\,\tilde{\gamma}^{h}_{p}(e+f)=\left\{\begin{array}[]{ll}\langle e,Mf\rangle+\langle e,Mf\rangle&\text{ if }e+f\in V_{h}(p),\\ 0&\text{ otherwise.}\end{array}\right.

In any case $|\delta\,\tilde{\gamma}^{h}_{p}(e+f)|\leq\theta_{h}$ . The expressions and estimates of $\tilde{\gamma}_{p}^{h}$ at the points $-(e+f),e-f,-(e-f)$ are similar. Comparing the above estimates of $\gamma_{p}$ , $\tilde{\gamma}^{h}_{p}$ , we obtain that $\delta|\gamma_{p}-\tilde{\gamma}^{h}_{p}|\leq 2\theta_{h}$ on $\{e,f,e+f,e-f,-e,-f,-e-f,f-e\}=W^{\prime}(p)$ . Since $\delta=2\operatorname{\mathbf{d}}(p)\geq 2\underline{\operatorname{\mathbf{d}}}=2/C_{0}$ , this concludes the proof. ∎

In the last lemma of this section, we control the contribution to the energy ${\mathcal{F}}^{\prime}_{h}$ of a stencil $W^{\prime}(p)$ , $p\in h{\mathbb{Z}}^{2}$ , in terms of the contributions to ${\mathcal{F}}_{h}$ of $W(p)$ and of the neighboring stencils $W(p+he)$ , $e\in W(p)$ . This leads to an estimate of ${\mathcal{F}}^{\prime}_{h}$ in terms of ${\mathcal{F}}_{h}$ , which concludes the proof of Theorem 1.

Lemma 17.

One has ${\mathcal{F}}^{\prime}_{h}(u)\leq C_{1}{\mathcal{F}}_{h}(u)$ , for any $u\in L^{2}(\Omega_{h})$ , with $C_{1}:=17$ .

Proof.

Consider a grid point $p\in h{\mathbb{Z}}^{2}$ , and denote $M:=\operatorname{\mathbf{M}}(p)$ . Assume first that $\mu(M)\leq\theta_{0}$ , so that $W(p)\subsetneq W^{\prime}(p)$ . Consider also an arbitrary $g\in W^{\prime}(p)\setminus W(p)$ , and observe that $g=e+f$ for some $M$ -reduced basis $(e,f)$ .

We set $p^{\prime}:=p+e$ and $M^{\prime}:=\operatorname{\mathbf{M}}(p^{\prime})$ . Applying point (ii) of Lemma 12, we find that $(e,f)$ is also an $M^{\prime}$ -reduced basis. Indeed we have as required

4\mu(M)\leq 4\theta_{0}={\boldsymbol{\kappa}}^{-1}\leq\lambda_{1}(M)^{2},

using (27), and the assumption on $\tau$ follows from (45) and (28). Therefore

f\in W(p^{\prime}),\text{ and }h^{-1}(p^{\prime}-p)=e\in W(p^{\prime}).

(57)

We obtain

		$\displaystyle|u(p+g)-u(p)|^{2}$		(58)
		$\displaystyle=|u(p+e+f)-u(p)|^{2}$
		$\displaystyle\leq 2(|u(p+e+f)-u(p+e)|^{2}+|u(p+e)-u(p)|^{2})$
		$\displaystyle=2(|u(p^{\prime}+f)-u(p^{\prime})|^{2}+|u(p+e)-u(p)|^{2}).$

Denote, for all $q\in h{\mathbb{Z}}^{2}$ ,

	$\displaystyle{\mathcal{F}}_{h}(u;q)$	$\displaystyle:=\sum_{g\in W(q)}|u(q+hg)-u(q)|^{2},$
	$\displaystyle{\mathcal{F}}^{\prime}_{h}(u;q)$	$\displaystyle:=\sum_{g\in W^{\prime}(q)}|u(q+hg)-u(q)|^{2}.$

Using (58), we obtain

{\mathcal{F}}^{\prime}_{h}(u;p)-{\mathcal{F}}_{h}(u;p)\leq{\mathcal{G}}_{h}(u;p)

(59)

where ${\mathcal{G}}_{h}(u;p)$ is given by

\left\{\begin{array}[]{ll}\displaystyle 4{\mathcal{F}}_{h}(u;p)+2\sum_{g\in W(p)}{\mathcal{F}}_{h}(u;p+g),&\text{ if }\mu(\operatorname{\mathbf{M}}(p))\leq\theta_{0}\\ 0,&\text{ if }\mu(\operatorname{\mathbf{M}}(p))>\theta_{0}\end{array}\right.

When ${\mathcal{F}}_{h}(u;p^{\prime})$ appears in ${\mathcal{G}}_{h}(u;p)$ , with $p,p^{\prime}\in h{\mathbb{Z}}^{2}$ , $p\neq p^{\prime}$ , we have $h^{-1}(p^{\prime}-p)\in W(p^{\prime})$ , see (57). For each $p^{\prime}\in h{\mathbb{Z}}^{2}$ , there are thus at most $\#(W(p^{\prime}))\leq 6$ points $p\in h{\mathbb{Z}}^{2}\setminus\{p^{\prime}\}$ such that ${\mathcal{F}}_{h}(u;p^{\prime})$ appears in ${\mathcal{G}}_{h}(u;p)$ . Summing (59) over $p\in\Omega_{h}$ , we thus obtain ${\mathcal{F}}^{\prime}_{h}(u)-{\mathcal{F}}_{h}(u)\leq(4+2\times 6){\mathcal{F}}_{h}(u)$ (the constant could easily be improved), which concludes the proof. ∎

4 Numerical experiments

We compare our scheme AD-LBR with a family of other schemes: finite differences, finite elements, and two schemes from the image processing literature. We begin with a quantitative comparison for the discretization of the restoration equation, in a synthetic case where the exact solution is analytically available for reference. The second test case is a qualitative comparison on Coherence-Enhancing Diffusion (CED) [25], applied to a real image, where quality is assessed by visual inspection. Finally we present a 3D implementation of AD-LBR as a proof of feasibility, featuring a synthetic CED experiment and an application of Edge-Enhancing Diffusion to MRI data.

4.1 The different schemes

Our two dimensional numerical experiments feature the following six numerical schemes for anisotropic diffusion.

➣ AD-LBR: the scheme presented in this work.

➣ Finite Differences (FD). The gradient and the divergence are discretized using standard centered finite differences [15], see Remark 3 for details. This approach, arguably the most straightforward, leads to a 9-point stencil.

➣ Bilinear Finite Elements (Q1). Bilinear finite elements, also referred to as Q1 finite elements, are linear with respect to each space direction. This amounts to using a 9-point stencil, whose coefficients differ from those of the previous scheme.

➣ Weickert-Scharr scheme (WS). This scheme, introduced in [27], is based on a second order approximation of the gradient using a $3\times 3$ centered stencil. As a result, it offers good accuracy and rotation invariance when applied to sufficiently smooth functions, but lacks robustness guarantees such as the discrete maximum principle and spectral correctness (see §1), even for $\operatorname{\mathbf{D}}={\rm Id}$ . The stencil for this scheme has size $5\times 5$ .

➣ Weickert’s Non-Negative scheme (W-NN). The coefficients of this scheme, detailed in [25] page 95, are non-negative as long as the anisotropy ratio (9) satisfies $\kappa\leq 1+\sqrt{2}\approx 2.41$ .

➣ Axes-directed Non-Negative scheme (A-NN). This six-point non-negative scheme is implicitly defined in the proof of Theorem 6 in [25], and can be regarded as a generalisation of W-NN. See Remark 1 below for details. Among the 6 points of the stencil, 4 lie on the coordinate axes.

Note that other schemes exist, see for instance [16, 28]. While an exhaustive comparison is in principle desirable, it could not be done here due to time and space constraints.

To fix the ideas and illustrate the difference between the schemes, we propose to compute the stencil and the coefficients for different constant diffusion tensors $\operatorname{\mathbf{D}}$ , in isotropic and anisotropic cases. Denoting by $R$ the matrix of rotation by the angle $\theta=\pi/6$ , and by $\kappa\geq 1$ the chosen anisotropy ratio, we set, identically on ${\mathbb{R}}^{2}$ :

\operatorname{\mathbf{D}}:=R\left(\begin{array}[]{cc}1&0\\ 0&\kappa^{-2}\end{array}\right)R^{\mathrm{T}}.

(60)

The results are presented in Tables 1 and 2. Note that for the last two cases (anisotropy $\kappa=\sqrt{10}$ and $\kappa=\sqrt{50}$ ) the AD-LBR stencil contains points that are outside the $3\times 3$ neighborhood of the pixel. However the stencil contains 6 points, as expected. This contrasts with the schemes FD, Q1, W-NN where only the $3\times 3$ neighborhood is involved. Another observation is that the off-center stencil coefficients of the AD-LBR are non-positive (this gives non-negative off-diagonal coefficients for $\operatorname{div}({\bf D}\nabla)$ ), in contrast with schemes FD, Q1, WS, and with scheme W-NN for anisotropy $\kappa>1+\sqrt{2}$ . This is an essential property of AD-LBR (and A-NN), and as a consequence our scheme satisfies, unconditionally, the discrete maximum principle [1, 6].

The largest eigenvalue of the discrete operator $-\operatorname{div}({\bf D}\nabla)$ is given in Table 3, for the different schemes. It turns out that AD-LBR has in most cases the smallest eigenvalues among all schemes, except for scheme WS and occasionally A-NN. This property allows the use of larger time steps for AD-LBR than for the other schemes (although this was not done in our numerical experiments), when solving parabolic equations (2) or (64) with an explicit time discretization.
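For a constant tensor on a periodic grid, the largest eigenvalue of a symmetric stencil can be read off its Fourier symbol. The following Python sketch is only an illustration under stated assumptions, not the code used to produce Table 3: it assumes unit grid spacing and periodic boundary conditions, and it takes the off-center coefficients of the $\kappa=\sqrt{2}$ AD-LBR stencil in the closed form $a-b$, $c-b$, $b$ (with $a,b,c$ the entries of $\operatorname{\mathbf{D}}$ ), which reproduces the values 0.66, 0.41, 0.22 of Table 1. The function names `rotated_tensor` and `largest_eigenvalue` are ours.

```python
import math

def rotated_tensor(kappa, theta):
    """Entries (a, b, c) of D = R diag(1, kappa^-2) R^T = [[a, b], [b, c]], see (60)."""
    cs, sn = math.cos(theta), math.sin(theta)
    small = kappa ** -2
    a = cs * cs + small * sn * sn
    b = (1.0 - small) * cs * sn
    c = sn * sn + small * cs * cs
    return a, b, c

def largest_eigenvalue(weights, n=64):
    """Maximum of the stencil symbol over the n x n periodic frequency grid.

    `weights` maps one offset e of each pair +-e to its coefficient w(e) >= 0;
    the operator is u -> sum over both signs of w(e) * (u(z) - u(z + e)).
    """
    best = 0.0
    for k1 in range(n):
        for k2 in range(n):
            x1 = 2 * math.pi * k1 / n
            x2 = 2 * math.pi * k2 / n
            # Each pair +-e contributes 2 w(e) (1 - cos <e, xi>).
            symbol = sum(2 * w * (1 - math.cos(e[0] * x1 + e[1] * x2))
                         for e, w in weights.items())
            best = max(best, symbol)
    return best

a, b, c = rotated_tensor(math.sqrt(2), math.pi / 6)
stencil_sqrt2 = {(1, 0): a - b, (0, 1): c - b, (1, 1): b}
stencil_id = {(1, 0): 1.0, (0, 1): 1.0}
lam_sqrt2 = largest_eigenvalue(stencil_sqrt2)
lam_id = largest_eigenvalue(stencil_id)
```

Under these assumptions the sketch gives 8 for the identity and about 4.27 for $\kappa=\sqrt{2}$ , in agreement with the AD-LBR row of Table 3.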

Operator splitting is a classical approach to further increase the time step in (potentially anisotropic and non-linear) diffusion PDEs [25, 26, 2]. The AD-LBR is compatible with Additive Operator Splitting, by applying Remark (e) page 111 in [25], although the efficiency of this technique is here compromised by the potentially large number of directions in our adaptive stencils. Let us also mention Multiplicative Operator Splittings, and Additive-Multiplicative Operator Splittings, which allow different time steps to be combined [2, 8]. None of these methods was used in our experiments.

Remark 1 (Axes-directed non negative six point scheme).

The following six point scheme A-NN is, in our belief, the best possible implementation of the constructive proof in [25] of the existence of non-negative schemes for two dimensional anisotropic diffusion. Like AD-LBR, this scheme is defined by the data at each point $z\in\Omega$ of a stencil $V(z)$ , and of non-negative weights $\gamma_{z}$ .

Let $\operatorname{\mathbf{D}}(z)=\left(\begin{array}[]{cc}a&b\\ b&c\end{array}\right).$ In the diagonal case $b=0$ , the scheme A-NN relies on the classical four points stencil. Otherwise note that

\frac{a}{|b|}-\frac{|b|}{c}=\frac{ac-b^{2}}{|b|c}>0.

Let $p,q\in{\mathbb{Z}}\setminus\{0\}$ be such that $bpq\geq 0$ ,

\frac{|b|}{c}\leq\left|\frac{p}{q}\right|\leq\frac{a}{|b|},

(61)

and $\max(|p|,|q|)$ is minimal. The scheme A-NN is defined by the six point stencil

V(z):=\{(\pm 1,0),(0,\pm 1),\pm(p,q)\}

and the non-negative weights

	$\displaystyle 2\gamma(\pm 1,0)$	$\displaystyle=a-\frac{p}{q}b,$	$\displaystyle 2\gamma(0,\pm 1)$	$\displaystyle=c-\frac{q}{p}b,$
	$\displaystyle 2\gamma(\pm(p,q))$	$\displaystyle=\frac{b}{pq}.$

These coefficients are non-negative by construction, and consistency (5) is easily checked. Contrary to AD-LBR, the coordinate axes play a privileged role in A-NN. This introduces axis aligned artifacts which are visible in Figure 12 (g).
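The construction of Remark 1 can be sketched in a few lines of Python. This is a hedged illustration rather than a reference implementation: the names `ann_stencil`, `second_moment` and `rotated_tensor` are ours, the search bound `max_radius` is arbitrary, ties between admissible pairs $(p,q)$ of equal radius are broken by scan order (our choice), and we read consistency as the identity $\sum_{e\in V}\gamma(e)\,ee^{\mathrm{T}}=\operatorname{\mathbf{D}}$ , which is our interpretation of (5).

```python
import math

def rotated_tensor(kappa, theta):
    """Entries (a, b, c) of D = R diag(1, kappa^-2) R^T, see (60)."""
    cs, sn = math.cos(theta), math.sin(theta)
    small = kappa ** -2
    return (cs * cs + small * sn * sn,
            (1.0 - small) * cs * sn,
            sn * sn + small * cs * cs)

def ann_stencil(a, b, c, max_radius=1000):
    """Dict {offset: gamma} of the A-NN scheme for D = [[a, b], [b, c]]."""
    if b == 0:
        # Four point stencil, obtained by setting b = 0 in the weight formulas.
        return {(1, 0): a / 2, (-1, 0): a / 2, (0, 1): c / 2, (0, -1): c / 2}
    sign = 1 if b > 0 else -1          # ensures b * p * q >= 0 with q > 0
    lo, hi = abs(b) / c, a / abs(b)    # admissible range (61) for |p / q|
    for m in range(1, max_radius + 1):
        # All pairs with max(|p|, |q|) = m, taking q > 0.
        candidates = ([(m, q) for q in range(1, m + 1)]
                      + [(p, m) for p in range(1, m)])
        for p_abs, q in candidates:
            if lo <= p_abs / q <= hi:
                p = sign * p_abs
                g_x = (a - (p / q) * b) / 2
                g_y = (c - (q / p) * b) / 2
                g_d = b / (p * q) / 2
                return {(1, 0): g_x, (-1, 0): g_x,
                        (0, 1): g_y, (0, -1): g_y,
                        (p, q): g_d, (-p, -q): g_d}
    raise ValueError("no admissible (p, q) within max_radius")

def second_moment(stencil):
    """Entries (m11, m12, m22) of the sum over offsets e of gamma(e) e e^T."""
    m11 = sum(g * e[0] * e[0] for e, g in stencil.items())
    m12 = sum(g * e[0] * e[1] for e, g in stencil.items())
    m22 = sum(g * e[1] * e[1] for e, g in stencil.items())
    return m11, m12, m22

tensor_10 = rotated_tensor(math.sqrt(10), math.pi / 6)
stencil_10 = ann_stencil(*tensor_10)
tensor_50 = rotated_tensor(math.sqrt(50), math.pi / 6)
stencil_50 = ann_stencil(*tensor_50)
```

For the tensors (60) with $\kappa=\sqrt{10}$ and $\kappa=\sqrt{50}$ , this search selects the far offsets $\pm(3,2)$ and $\pm(5,3)$ reported for A-NN in Table 2.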

Remark 2 (Stencil radius).

The two dimensional stencils of AD-LBR coincide with those of FM-LBR, a numerical scheme for anisotropic static Hamilton-Jacobi PDEs introduced in [14] by the second author. As shown in Proposition 1.6 of [14], the euclidean radius

r=\max\{\|v\|;v\in V(z)\}

of this stencil is bounded by $\kappa(\operatorname{\mathbf{D}}(z))$ .

In contrast, consider for $0<\varepsilon<1/4$ the matrix

D:=\left(\begin{array}[]{cc}1&1-2\varepsilon\\ 1-2\varepsilon&1-3\varepsilon\end{array}\right).

It follows from (61) that $1+\varepsilon\leq p/q+{\mathcal{O}}(\varepsilon^{2})\leq 1+2\varepsilon$ . From this point, one easily obtains that $q\gtrsim\varepsilon^{-1}\approx\kappa(D)^{2}$ . The radius of the A-NN stencil, at a point $z\in\Omega$ , may thus be of the order of $\kappa(\operatorname{\mathbf{D}}(z))^{2}$ . The radii of the AD-LBR and A-NN stencils, computed for diffusion tensors of anisotropy $\kappa=10$ and of various orientations, are illustrated in Figure 7.
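The growth of the A-NN radius for the matrix $D$ above can be checked with exact rational arithmetic. The sketch below is an illustration only; the helper names `ann_direction` and `example_direction` are ours, and exact fractions are used because the bounds in (61) can be attained with equality for this family.

```python
from fractions import Fraction

def ann_direction(a, b, c, max_radius=10**4):
    """Pair (p, q), q > 0, satisfying (61) with minimal max(|p|, |q|), for b != 0."""
    sign = 1 if b > 0 else -1
    lo, hi = abs(b) / c, a / abs(b)
    for m in range(1, max_radius + 1):
        candidates = ([(m, q) for q in range(1, m + 1)]
                      + [(p, m) for p in range(1, m)])
        for p_abs, q in candidates:
            if lo <= Fraction(p_abs, q) <= hi:
                return sign * p_abs, q
    raise ValueError("no admissible (p, q) within max_radius")

def example_direction(eps):
    """A-NN direction for D = [[1, 1 - 2 eps], [1 - 2 eps, 1 - 3 eps]]."""
    eps = Fraction(eps)
    return ann_direction(Fraction(1), 1 - 2 * eps, 1 - 3 * eps)

dir_tenth = example_direction(Fraction(1, 10))
dir_hundredth = example_direction(Fraction(1, 100))
```

One finds $(p,q)=(5,4)$ for $\varepsilon=1/10$ and $(p,q)=(50,49)$ for $\varepsilon=1/100$ , so that $q$ indeed grows proportionally to $\varepsilon^{-1}$ .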

Remark 3 (Scheme FD).

The operator $\operatorname{div}(\operatorname{\mathbf{D}}\nabla\cdot)$ is discretized using centered finite differences [15]. This involves quantities defined at half integer indices, and in particular the diffusion tensor is here given on the offset grid $(i+1/2,\,j+1/2)$ , $(i,j)\in{\mathbb{Z}}^{2}$ . For the sake of readability, we thus define $i^{+}:=i+1/2$ and $i^{-}:=i-1/2$ . The gradient operator is discretized by:

(\partial_{x}u)_{i^{+},j}=u_{i+1,j}-u_{i,j},\quad(\partial_{y}u)_{i,j^{+}}=u_{i,j+1}-u_{i,j}.

The divergence is defined as follows:

	$\displaystyle\operatorname{div}({\bf D}\nabla u)_{i,j}$	$\displaystyle=\partial_{x}({\bf D}^{11}\partial_{x}u+{\bf D}^{12}\partial_{y}u)_{i,j}$
		$\displaystyle+\partial_{y}({\bf D}^{21}\partial_{x}u+{\bf D}^{22}\partial_{y}u)_{i,j},$

with

	$\displaystyle({\bf D}^{11}\partial_{x}u)_{i^{+},j}$	$\displaystyle=\frac{1}{2}\left({\bf D}^{11}_{i^{+},j^{+}}+{\bf D}^{11}_{i^{+},j^{-}}\right)(\partial_{x}u)_{i^{+},j},$
	$\displaystyle\partial_{x}({\bf D}^{11}\partial_{x}u)_{i,j}$	$\displaystyle=({\bf D}^{11}\partial_{x}u)_{i^{+},j}-({\bf D}^{11}\partial_{x}u)_{i^{-},j}$
	$\displaystyle({\bf D}^{21}\partial_{x}u)_{i^{+},j^{+}}$	$\displaystyle=\dfrac{1}{2}{\bf D}^{21}_{i^{+},j^{+}}\left((\partial_{x}u)_{i^{+},j}+(\partial_{x}u)_{i^{+},j+1}\right),$
	$\displaystyle\partial_{y}({\bf D}^{21}\partial_{x}u)_{i,j}$	$\displaystyle=\dfrac{1}{2}\left(({\bf D}^{21}\partial_{x}u)_{i^{+},j^{+}}-({\bf D}^{21}\partial_{x}u)_{i^{+},j^{-}}\right.$
		$\displaystyle\phantom{=}\left.+({\bf D}^{21}\partial_{x}u)_{i^{-},j^{+}}-({\bf D}^{21}\partial_{x}u)_{i^{-},j^{-}}\right),$

and similar terms involving $\partial_{y}u$ .

Table 1: The stencil coefficients for different constant diffusion tensors, and the different schemes presented. The value of the anisotropy ratio $\kappa$ is given in the second row, and the orientation of the principal axis is $\theta=\pi/6$ , see (60). The bold coefficient indicates the center node. In some examples we present for clarity reasons only half of the stencil (the other half can be deduced by symmetry). Stencil entries are highlighted when they are positive and off-center, an undesirable property which gives rise to stability issues. For small anisotropies, $\kappa\leq 1+\sqrt{2}$ , one has AD-LBR = W-NN = A-NN.

$\kappa$	$\kappa=1$ ( $\operatorname{\mathbf{D}}={\rm Id}$ )	$\kappa=\sqrt{2}$
stencil for AD-LBR	$\begin{array}[]{ccc}0&-1&0\\ -1&{\bf 4}&-1\\ 0&-1&0\end{array}$	$\begin{array}[]{ccc}0&-0.41&-0.22\\ -0.66&{\bf 2.57}&-0.66\\ -0.22&-0.41&0\end{array}$
stencil for FD	$\begin{array}[]{ccc}0&-1&0\\ -1&{\bf 4}&-1\\ 0&-1&0\end{array}$	$\begin{array}[]{ccc}{\color[rgb]{0.72,0,0}0.11}&-0.63&-0.11\\ -0.88&{\bf 3}&-0.88\\ -0.11&-0.63&{\color[rgb]{0.72,0,0}0.11}\end{array}$
stencil for Q1	$\dfrac{1}{3}\left(\begin{array}[]{ccc}-1&-1&-1\\ -1&{\bf 8}&-1\\ -1&-1&-1\end{array}\right)$	$\begin{array}[]{ccc}-0.14&-0.13&-0.36\\ -0.38&{\bf 2}&-0.38\\ -0.36&-0.13&-0.14\end{array}$
stencil for WS	$\begin{array}[]{ccc}-0.1&-0.06&-0.02\\ {\color[rgb]{0.72,0,0}0.12}&0&-0.06\\ {\bf 0.46}&{\color[rgb]{0.72,0,0}0.12}&-0.1\\ {\color[rgb]{0.72,0,0}0.12}&0&-0.06\\ -0.1&-0.06&-0.02\end{array}$	$\begin{array}[]{ccc}-0.06&-0.05&-0.02\\ {\color[rgb]{0.72,0,0}0.01}&-0.04&-0.06\\ {\bf 0.35}&{\color[rgb]{0.72,0,0}0.07}&-0.09\\ {\color[rgb]{0.72,0,0}0.01}&{\color[rgb]{0.72,0,0}0.04}&-0.04\\ -0.06&-0.02&-0.01\end{array}$
stencil for W-NN	$\begin{array}[]{ccc}0&-1&0\\ -1&{\bf 4}&-1\\ 0&-1&0\end{array}$	$\begin{array}[]{ccc}0&-0.41&-0.22\\ -0.66&{\bf 2.57}&-0.66\\ -0.22&-0.41&0\end{array}$
stencil for A-NN	$\begin{array}[]{ccc}0&-1&0\\ -1&{\bf 4}&-1\\ 0&-1&0\end{array}$	$\begin{array}[]{ccc}0&-0.41&-0.22\\ -0.66&{\bf 2.57}&-0.66\\ -0.22&-0.41&0\end{array}$

Table 2: The stencil coefficients for different metrics and the different schemes presented, similarly to Table 1 but with more pronounced anisotropies. For the scheme A-NN some points of the stencil are too far from the center node to be represented here, so we indicate the coordinates of these points and the associated coefficient.

$\kappa$	$\kappa=\sqrt{10}$	$\kappa=\sqrt{50}$
stencil for AD-LBR	$\begin{array}[]{ccc}0&-0.26&-0.06\\ {\bf 1.16}&-0.26&0\end{array}$	$\begin{array}[]{ccc}0&-0.11&-0.16\\ {\bf 0.55}&-0.01&0\end{array}$
stencil for FD	$\begin{array}[]{ccc}{\color[rgb]{0.72,0,0}0.19}&-0.32&-0.19\\ -0.77&{\bf 2.2}&-0.77\\ -0.19&-0.32&{\color[rgb]{0.72,0,0}0.19}\end{array}$	$\begin{array}[]{ccc}{\color[rgb]{0.72,0,0}0.21}&-0.27&-0.21\\ -0.76&{\bf 2.04}&-0.76\\ -0.21&-0.27&{\color[rgb]{0.72,0,0}0.21}\end{array}$
stencil for Q1	$\begin{array}[]{ccc}{\color[rgb]{0.72,0,0}0.01}&{\color[rgb]{0.72,0,0}0.04}&-0.38\\ -0.41&{\bf 1.47}&-0.41\\ -0.38&{\color[rgb]{0.72,0,0}0.04}&{\color[rgb]{0.72,0,0}0.01}\end{array}$	$\begin{array}[]{ccc}{\color[rgb]{0.72,0,0}0.04}&{\color[rgb]{0.72,0,0}0.08}&-0.38\\ -0.42&{\bf 1.36}&-0.42\\ -0.38&{\color[rgb]{0.72,0,0}0.08}&{\color[rgb]{0.72,0,0}0.04}\end{array}$
stencil for WS	$\begin{array}[]{ccc}-0.02&-0.04&-0.02\\ {\color[rgb]{0.72,0,0}0.09}&-0.08&-0.07\\ {\bf 0.25}&{\color[rgb]{0.72,0,0}0.04}&-0.08\\ {\color[rgb]{0.72,0,0}0.09}&{\color[rgb]{0.72,0,0}0.08}&-0.02\\ -0.02&{\color[rgb]{0.72,0,0}0.004}&-0.003\end{array}$	$\begin{array}[]{ccc}-0.02&-0.04&-0.02\\ {\color[rgb]{0.72,0,0}0.09}&-0.08&-0.07\\ {\bf 0.24}&{\color[rgb]{0.72,0,0}0.03}&-0.08\\ {\color[rgb]{0.72,0,0}0.09}&{\color[rgb]{0.72,0,0}0.08}&-0.02\\ -0.02&{\color[rgb]{0.72,0,0}0.01}&-0.002\end{array}$
stencil for W-NN	$\begin{array}[]{ccc}0&{\color[rgb]{0.72,0,0}0.06}&-0.39\\ -0.39&{\bf 1.42}&-0.39\\ -0.39&{\color[rgb]{0.72,0,0}0.06}&0\end{array}$	$\begin{array}[]{ccc}0&{\color[rgb]{0.72,0,0}0.16}&-0.42\\ -0.33&{\bf 1.19}&-0.33\\ -0.42&{\color[rgb]{0.72,0,0}0.16}&0\end{array}$
stencil for A-NN	$\begin{array}[]{ccc}-0.07&0\\ {\bf 0.64}&-0.19\end{array}$	$\begin{array}[]{ccc}-0.01&0\\ {\bf 0.17}&-0.05\end{array}$
	$\gamma(3,2)=-0.06$	$\gamma(5,3)=-0.03$

Table 3: Largest eigenvalue of the discretized operator $-\operatorname{div}({\bf D}\nabla)$ , for the constant metric $\operatorname{\mathbf{D}}=D$ , where the matrix $D$ is given in Tables 1 and 2. The time step, in the explicit discretization of (64), should not exceed the inverse of this value.

$\kappa$	$\kappa=1$	$\kappa=\sqrt{2}$	$\kappa=\sqrt{10}$	$\kappa=\sqrt{50}$
eigenvalue AD-LBR	8	4.27	2.06	1.06
eigenvalue FD	8	6.22	5.06	4.85
eigenvalue Q1	5.7	4.94	4.32	4.20
eigenvalue WS	1	1	1	1
eigenvalue W-NN	8	4.27	3.1	3.02
eigenvalue A-NN	8	4.27	1.04	0.3

4.2 A test case with an explicit solution

Consider an image $v\in L^{2}(\Omega)$ , defined on a domain $\Omega$ , and a diffusion tensor field ${\bf D}:\Omega\to S_{2}^{+}$ . A classical approach to restore the image $v$ , if it has been corrupted by additive noise, is to find $u\in H^{1}(\Omega)$ which minimizes:

j(u)=\int_{\Omega}|u-v|^{2}+\lambda\int_{\Omega}\|\nabla u\|^{2}_{{\bf D}}.

(62)

In other words, $u$ is a penalized least squares approximation of $v$ . The parameter $\lambda>0$ should be adjusted so as to avoid excessive smoothing (for large $\lambda$ ), or insufficient denoising (for small $\lambda$ ). The solution $u$ can be characterized as the solution to the static elliptic PDE:

\left\{\begin{array}[]{ll}-\lambda\operatorname{div}({\bf D}\nabla u)+u=v,&\text{ on }\Omega.\\ \langle\nabla u,n\rangle=0,&\text{ on }\partial\Omega.\end{array}\right.

(63)

In applications [20, 24] the diffusion tensor $\operatorname{\mathbf{D}}$ is usually adapted to the local image structure, in order to avoid smoothing the edges of $v$ . We construct below a test case (image $v$ and tensor field ${\bf D}$ ), for which the solution $u$ is known analytically.

In order to obtain an analytic solution, we first consider a separable problem where the image is invariant by translation along the horizontal axis, and the metric is constant with axes parallel to the coordinate axes. Being invariant under translations along the $x$ -axis, this first problem boils down to a 1-dimensional problem. It is then transported by a diffeomorphism in order to obtain a new problem where the axes of the metric are no longer parallel to the coordinate axes.

The analytical image is composed of a black and a white stripe: $v_{0}(x,y)={\bf 1}_{y<0.5}$ , see Figure 9. Given $\kappa\geq 1$ , we consider the constant diffusion tensor

{\bf D}_{0}=\left(\begin{array}[]{cc}1&0\\ 0&\kappa^{-2}\end{array}\right).

The analytical solution $u_{0}$ of (63), applied to $\operatorname{\mathbf{D}}_{0}$ and $v_{0}$ , is known in the case of the infinite domain $\Omega={\mathbb{R}}^{2}$ . In the Fourier domain, all the coefficients are real and:

\widehat{u_{0}}(\xi)=\widehat{v_{0}}(\xi)/(1+\lambda\langle\xi,{\bf D}_{0}\xi\rangle).
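Since $v_{0}$ and ${\bf D}_{0}$ do not depend on $x$ , the problem (63) on ${\mathbb{R}}^{2}$ reduces to the one-dimensional equation $-\lambda\kappa^{-2}u^{\prime\prime}+u={\bf 1}_{y<0.5}$ , whose bounded solution can be written in closed form with the length scale $\ell:=\sqrt{\lambda}/\kappa$ . The following Python sketch, with function names of our own choosing, evaluates this profile and checks the equation by finite differences away from the jump; it is an illustration under these assumptions, not the reference solution code used in the experiments.

```python
import math

def u0_profile(y, lam, kappa):
    """Bounded solution of -lam * kappa**-2 * u'' + u = 1_{y < 0.5} on the real line."""
    ell = math.sqrt(lam) / kappa
    if y < 0.5:
        return 1.0 - 0.5 * math.exp((y - 0.5) / ell)
    return 0.5 * math.exp(-(y - 0.5) / ell)

def residual(y, lam, kappa, step=1e-4):
    """Finite-difference residual of the ODE at y (meaningful away from y = 0.5)."""
    ell2 = lam / kappa ** 2
    second = (u0_profile(y + step, lam, kappa)
              - 2 * u0_profile(y, lam, kappa)
              + u0_profile(y - step, lam, kappa)) / step ** 2
    rhs = 1.0 if y < 0.5 else 0.0
    return -ell2 * second + u0_profile(y, lam, kappa) - rhs

res_left = residual(0.45, 1e-3, 2.0)
res_right = residual(0.55, 1e-3, 2.0)
```

The profile is continuous with value $1/2$ at $y=0.5$ , and it relaxes to 1 and 0 over a distance of a few $\ell$ , which is consistent with the error being concentrated near the discontinuity.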

This separable problem is transformed using the following diffeomorphism: for $(x,y)\in\Omega$

f(x,y)=(x,y+\alpha\cos(2\pi x)).

The Jacobian of $f$ is

J(x,y)=\left(\begin{array}[]{cc}1&0\\ -2\pi\alpha\sin(2\pi x)&1\end{array}\right)

We apply the different restoration schemes to the image $v=v_{0}\circ f$ , and the following diffusion tensor:

	$\displaystyle{\bf D}(z)$	$\displaystyle=|\det J(z)|\ J(z)^{-1}\,{\bf D}_{0}\,(J(z)^{-1})^{\mathrm{T}}$
		$\displaystyle=J(z)^{-1}\,{\bf D}_{0}\,(J(z)^{-1})^{\mathrm{T}}=\left(\begin{array}[]{cc}1&s\\ s&s^{2}+\kappa^{-2}\end{array}\right),$

where we denoted $z=(x,y)\in\Omega$ and $s=2\pi\alpha\sin(2\pi x)$ . The numerical solution is compared to the analytical function $u=u_{0}\circ f$ , which is the exact solution in the case of the infinite domain $\Omega={\mathbb{R}}^{2}$ . This numerical solution was obtained on the bounded domain $\Omega=[0,1[^{2}$ , equipped with reflecting boundary conditions. Numerical evidence suggests that this change of domain and of boundary conditions has only a negligible impact on the solution of (63), with the parameters chosen in this test case.
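The closed form of ${\bf D}(z)$ can be verified numerically from the Jacobian. The short Python sketch below, with helper names of our own, multiplies out $|\det J|\,J^{-1}{\bf D}_{0}(J^{-1})^{\mathrm{T}}$ and compares it with the matrix with entries $1$ , $s$ , $s^{2}+\kappa^{-2}$ ; it is meant as a sanity check only.

```python
import math

def transported_tensor(x, alpha, kappa):
    """|det J| J^{-1} D0 J^{-T} for f(x, y) = (x, y + alpha cos(2 pi x))."""
    s = 2 * math.pi * alpha * math.sin(2 * math.pi * x)
    jac = [[1.0, 0.0], [-s, 1.0]]
    det = jac[0][0] * jac[1][1] - jac[0][1] * jac[1][0]
    inv = [[jac[1][1] / det, -jac[0][1] / det],
           [-jac[1][0] / det, jac[0][0] / det]]
    d0 = [[1.0, 0.0], [0.0, kappa ** -2]]
    # tmp = inv * d0, then result = |det| * tmp * inv^T.
    tmp = [[sum(inv[i][k] * d0[k][j] for k in range(2)) for j in range(2)]
           for i in range(2)]
    return [[abs(det) * sum(tmp[i][k] * inv[j][k] for k in range(2))
             for j in range(2)] for i in range(2)]

def closed_form(x, alpha, kappa):
    """The matrix [[1, s], [s, s^2 + kappa^-2]] with s = 2 pi alpha sin(2 pi x)."""
    s = 2 * math.pi * alpha * math.sin(2 * math.pi * x)
    return [[1.0, s], [s, s * s + kappa ** -2]]

max_gap = max(abs(transported_tensor(x, 1 / 3, 10.0)[i][j]
                  - closed_form(x, 1 / 3, 10.0)[i][j])
              for x in (0.0, 0.1, 0.25, 0.6, 0.9)
              for i in range(2) for j in range(2))
```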

We used $\alpha:=1/3$ in the numerical experiments. The maximum value of $\kappa(\operatorname{\mathbf{D}}(x))$ , among all $x\in\Omega$ , is equivalent to $\kappa_{\max}:=\kappa\sqrt{1+(2\pi\alpha)^{2}}\simeq 2.3\kappa$ .

4.3 Results for the synthetic test case

We present in Figure 10 the performance results of the different schemes, for different values of the anisotropy $\kappa$ , obtained on a series of grids of size ranging from $100\times 100$ to $1200\times 1200$ . The anisotropy varies from $\kappa=2$ to $\kappa=10$ , which are relevant values for imaging applications, see the numerical experiments in §4.4. The quality of a scheme is measured by the $L^{2}$ difference and the $H^{1}$ semi-norm difference between the numerical solution and the analytical solution. Note that the error is concentrated close to the discontinuity, since the solution tends rapidly to a constant (0 or 1) far from the discontinuity. We chose the smoothing parameter $\lambda=10^{-3}$ in (62). The linear equation obtained by the discretization of (63) is solved using Conjugate Gradient.

We also tested extreme anisotropies, $\kappa\geq 100$ (thus $\kappa_{\max}\geq 230$ ), which can be relevant in physics related applications. None of the tested schemes showed convincing results: methods based on fixed stencils fail because the discrete operator loses positivity, while the AD-LBR (and A-NN even more) suffers from under-sampling due to the large radius of its stencils. We thus refer to [7] for a radically different approach tailored for this setting. This method introduces an auxiliary one-dimensional unknown, which is constant on the field lines (obtained in a preprocessing step) of the anisotropy direction field, and varies orthogonally to them.

The performance advantage of the AD-LBR is particularly clear when the error is measured in the $H^{1}$ semi-norm: for the anisotropy $\kappa=10$ and the resolution $500\times 500$ , which are relevant values in image processing, AD-LBR outperforms its alternatives by a factor ranging from 3 to 5.

4.4 Coherence-enhancing diffusion

In order to document the interest of our discretization, we implement Coherence-Enhancing Diffusion [25] using the different numerical schemes at our disposal. The following parabolic equation is considered:

\partial_{t}u=\operatorname{div}({\bf D}(J_{\rho}(\nabla u_{\sigma}))\nabla u).

(64)

This equation is non-linear since the diffusion tensor depends on the solution $u$ . This tensor also depends on four user defined parameters $\sigma,\rho,C\in{\mathbb{R}}_{+}$ , $\alpha\in]0,1[$ . Let $K_{\sigma}$ (resp. $K_{\rho}$ ), be the Gaussian kernel of variance $\sigma$ (resp. $\rho$ ). Define the convolution $u_{\sigma}:=K_{\sigma}\star u$ , and the structure tensor $J_{\rho}:=K_{\rho}\star(\nabla u_{\sigma}\nabla u_{\sigma}^{T})$ . The diffusion tensor ${\bf D}(J_{\rho})$ possesses the same eigenvectors $(v_{1},v_{2})$ as $J_{\rho}$ , and if the eigenvalues of $J_{\rho}$ are $\mu_{1}\geq\mu_{2}$ then the eigenvalues of ${\bf D}(J_{\rho})$ are

	$\displaystyle\lambda_{1}$	$\displaystyle:=\alpha$
	$\displaystyle\lambda_{2}$	$\displaystyle:=\alpha+(1-\alpha)\exp\left(\dfrac{-C}{(\mu_{1}-\mu_{2})^{2}}\right).$

This ensures that one smooths preferentially along the coherence direction $v_{2}$ , with a diffusivity that increases with respect to the coherence $(\mu_{1}-\mu_{2})^{2}$ . When the time parameter $t$ becomes large, the image tends to a constant image; it is therefore necessary to stop the process at some finite time $T$ . The ratio of the eigenvalues is bounded by $\lambda_{2}/\lambda_{1}\leq 1/\alpha$ , hence $\kappa\leq 1/\sqrt{\alpha}$ .
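The map from a structure tensor to the diffusion tensor can be sketched as follows in two dimensions. This Python illustration uses our own function name `ced_tensor`, takes the structure tensor entries as given (the Gaussian smoothing steps are not reproduced), and assumes the convention $\lambda_{2}:=\alpha$ when $\mu_{1}=\mu_{2}$ , which is the limit of the formula above.

```python
import math

def ced_tensor(j11, j12, j22, alpha=1e-2, C=1e-5):
    """Entries (d11, d12, d22) of D(J) for a 2x2 structure tensor [[j11, j12], [j12, j22]]."""
    mean = 0.5 * (j11 + j22)
    radius = math.hypot(0.5 * (j11 - j22), j12)
    mu1, mu2 = mean + radius, mean - radius          # eigenvalues, mu1 >= mu2
    phi = 0.5 * math.atan2(2.0 * j12, j11 - j22)     # orientation of v1
    v1 = (math.cos(phi), math.sin(phi))
    v2 = (-math.sin(phi), math.cos(phi))
    lam1 = alpha
    gap2 = (mu1 - mu2) ** 2
    # Assumed convention: lam2 = alpha in the isotropic case mu1 == mu2.
    lam2 = alpha if gap2 == 0.0 else alpha + (1.0 - alpha) * math.exp(-C / gap2)
    d11 = lam1 * v1[0] * v1[0] + lam2 * v2[0] * v2[0]
    d12 = lam1 * v1[0] * v1[1] + lam2 * v2[0] * v2[1]
    d22 = lam1 * v1[1] * v1[1] + lam2 * v2[1] * v2[1]
    return d11, d12, d22

edge_tensor = ced_tensor(1.0, 0.0, 0.0)   # gradient along x: diffuse mostly along y
flat_tensor = ced_tensor(1.0, 0.0, 1.0)   # no preferred direction
```

For a structure tensor with dominant direction $v_{1}=(1,0)$ , the result is close to the diagonal matrix with entries $\alpha$ and $1$ , so that diffusion acts mostly along $v_{2}=(0,1)$ .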

We used an explicit time discretization for (64), with time step $\Delta t$ . The image $u^{n+1}$ at time $(n+1)\Delta t$ is defined by the explicit equation:

\dfrac{u^{n+1}-u^{n}}{\Delta t}=\operatorname{div}({\bf D}(J_{\rho}(\nabla u_{\sigma}^{n}))\nabla u^{n}).
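The role of non-negative weights in this explicit scheme can be illustrated on a simplified setting. The Python sketch below is not the CED solver used in our experiments: it assumes a constant tensor (the $\kappa=\sqrt{2}$ case of Table 1, with off-center coefficients written as $a-b$ , $c-b$ , $b$ ), unit grid spacing, periodic boundary conditions, and an illustrative time step; the name `explicit_step` is ours. Each update is then a convex combination of neighboring values, so the range of the image cannot grow and its mean is preserved.

```python
import math

def explicit_step(u, weights, dt):
    """One explicit Euler step of du/dt = div(D grad u) on a periodic grid.

    `weights` maps each offset e (both signs listed) to the non-negative
    coefficient multiplying u(z + e) - u(z).
    """
    n, m = len(u), len(u[0])
    new = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            acc = 0.0
            for (ei, ej), w in weights.items():
                acc += w * (u[(i + ei) % n][(j + ej) % m] - u[i][j])
            new[i][j] = u[i][j] + dt * acc
    return new

# Constant tensor (60) with kappa = sqrt(2), theta = pi / 6.
a, c = 0.875, 0.625
b = 0.5 * math.sqrt(3) / 4
weights = {(1, 0): a - b, (-1, 0): a - b,
           (0, 1): c - b, (0, -1): c - b,
           (1, 1): b, (-1, -1): b}
dt = 0.3  # illustrative; chosen so that dt * sum(weights) <= 1

# Deterministic binary test image.
image = [[1.0 if (7 * i + 3 * j) % 5 < 2 else 0.0 for j in range(16)]
         for i in range(16)]
initial_sum = sum(map(sum, image))
for _ in range(20):
    image = explicit_step(image, weights, dt)
final_min = min(map(min, image))
final_max = max(map(max, image))
final_sum = sum(map(sum, image))
```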

The parameters used in our simulation were: $\sigma=0.5$ , $\rho=4$ , $C=10^{-5}$ , $\alpha=10^{-2}$ and $\Delta t=0.02$ . This gives a maximum anisotropy of $\kappa=10$ . The algorithm was applied to a fingerprint image. The results obtained for $T=10$ are shown in Figures 11 and 12, and they document the ability of our scheme to close interrupted lines more efficiently than the other schemes. The largest eigenvalue of the discrete operator $-\operatorname{div}({\bf D}\nabla)$ at $t=0$ is given in Table 4 for the different schemes. As was already noticed in the constant metric case, it turns out that AD-LBR has the smallest eigenvalues among all schemes, except for scheme WS. This property allows the use of larger time steps for AD-LBR than for the other schemes (although this was not done in our numerical experiments).

Note also that ridges are clearer, and valleys are darker, with AD-LBR than with the other schemes (the gray-scale range is the same for all images, see also Figure 13). This reflects the fact that AD-LBR avoids, better than the other schemes, smoothing transversally to the orientation encoded in the continuous anisotropic PDE (64).

Remark 4 (Computation time).

Numerical solvers of the parabolic PDE (64) combine three main components: (i) Constructing the diffusion tensor. (ii) Assembling the discretization stencils and the operator sparse matrix. (iii) Performing an explicit time step. Components (i) and (ii) are executed exactly the same number of times, while step (iii) is generally more frequent: in order to save CPU time, one typically does not update the diffusion operator at each time step. We produced a C++ implementation of AD-LBR, within the Insight Toolkit open source library. Although our code is neither parallel nor aggressively optimized, we believe that comparing the CPU times for steps (i), (ii) and (iii) is informative, and allows one to estimate the additional cost of AD-LBR, which is essentially contained in step (ii).

For our 2D Coherence-Enhancing Diffusion (CED) experiment, on the $512\times 512$ fingerprint image, (i) takes 0.21s, (ii) 0.027s, (iii) 0.005s. For our 3D CED experiment, on $100\times 100\times 100$ synthetic data, (i) takes 1.35s, (ii) 0.51s, (iii) 0.035s. In both cases, the AD-LBR specific step (ii) is dominated by the construction of the diffusion tensor (i). Step (ii) may also be dominated by the mere cost (iii) of iterations, provided the operator is updated less than once every 6 explicit steps in 2D (14 in 3D). In our view, the limited additional cost (ii) of AD-LBR is acceptable in view of the strong theoretical guarantees, and qualitative improvements, brought by this scheme.

Table 4: Largest eigenvalue of the discretized operator $-\operatorname{div}({\bf D}\nabla)$ , where ${\bf D}={\bf D}(J_{\rho}(\nabla u_{\sigma}))$ , at $t=0$ .

scheme	AD-LBR	FD	Q1	WS	W-NN	A-NN
eigenvalue	3.75	5.67	5.09	0.96	3.83	6.23

4.5 3-dimensional experiments

In order to illustrate the feasibility of our scheme in 3D space, we present the action of anisotropic diffusion PDEs on two examples. The first example is a 3D analog of the synthetic test case presented in [27], featuring Coherence-Enhancing Diffusion. The second one is the application of Edge-Enhancing Diffusion to an MRI scan.

Synthetic example

The original, radially varying image is defined on the cube $[0,1]^{3}$. The gray-level at a point $x$ is defined by

$$u^{0}(x)=\cos\left(2(r/R)^{3}\right),$$

where $r:=|x|$ and $R:=1/2$. This image presents a series of concentric level-sets. We present in Figure 14 the level set $\{u^{0}=0\}$, and a slice through the plane $z=0.7$.
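As an illustration, the synthetic image can be sampled as follows. This is a minimal sketch: the sampling convention (equispaced points including the endpoints of $[0,1]$) and the function name are assumptions, and a small grid is used here only to keep the example light.

```python
import numpy as np


def synthetic_image(n=100, R=0.5):
    """Sample u0(x) = cos(2 (|x|/R)^3) on an n^3 grid covering [0,1]^3."""
    # Assumption: n equispaced samples per axis, endpoints included.
    t = np.linspace(0.0, 1.0, n)
    x, y, z = np.meshgrid(t, t, t, indexing="ij")
    r = np.sqrt(x**2 + y**2 + z**2)
    return np.cos(2.0 * (r / R) ** 3)


u0 = synthetic_image(n=32)
print(u0.shape, u0.min(), u0.max())
```

Calling `synthetic_image(n=100)` yields the resolution used in the experiment.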

The image $u^{0}$ is perturbed by

$$u:=u^{0}+n,$$

where $n$ is an additive Gaussian noise of variance $\sigma=0.5$. The reconstructed image is obtained using a 3D Coherence-Enhancing Diffusion PDE [25], similar to the 2D one in section 4.4:

$$\partial_{t}u=\operatorname{div}({\bf D}(J_{\rho}(\nabla u_{\sigma}))\nabla u),$$

where $J_{\rho}$ is the structure tensor defined by $J_{\rho}:=K_{\rho}\star(\nabla u_{\sigma}\nabla u_{\sigma}^{T})$, $u_{\sigma}:=K_{\sigma}\star u$. The tensor ${\bf D}(J_{\rho})$ possesses the same eigenvectors $(v_{1},v_{2},v_{3})$ as $J_{\rho}$, and if the eigenvalues of $J_{\rho}$ are $\mu_{1}\geq\mu_{2}\geq\mu_{3}$ then the eigenvalues of ${\bf D}(J_{\rho})$ are

$$\begin{aligned}
\lambda_{1}&:=\alpha,\\
\lambda_{2}&:=\alpha+(1-\alpha)\exp\left(\dfrac{-C}{(\mu_{1}-\mu_{2})^{2}}\right),\\
\lambda_{3}&:=\alpha+(1-\alpha)\exp\left(\dfrac{-C}{(\mu_{1}-\mu_{3})^{2}}\right),
\end{aligned}$$

where $\alpha=10^{-2}$. The anisotropy ratio is bounded by $\kappa=1/\sqrt{\alpha}=10$. We used the values $\sigma=0.5$, $\rho=4$. The problem is discretized using $100^{3}$ voxels. We present in Figure 14 the noisy image $u$ (level set 0 and planar slice) and the result after 20 time-steps of $\Delta t=10^{-3}$.
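The construction of ${\bf D}(J_{\rho})$ from the structure tensor can be sketched as follows, at a single voxel. This is an illustration under stated assumptions, not an excerpt of our implementation: the function names are hypothetical, the default value of the constant $C$ is a placeholder, and a vanishing eigenvalue gap is handled by taking the limit $\exp(-C/0^{+})=0$.

```python
import numpy as np


def ced_tensor(J, alpha=1e-2, C=1.0):
    """Diffusion tensor D(J) for 3D coherence-enhancing diffusion.

    J is a symmetric 3x3 structure tensor. C is the constant of the
    formulas above; its default value here is a placeholder.
    """
    mu, V = np.linalg.eigh(J)        # eigenvalues in ascending order
    mu, V = mu[::-1], V[:, ::-1]     # reorder so that mu1 >= mu2 >= mu3

    def lam(gap):
        # exp(-C / gap^2) tends to 0 as gap -> 0: the eigenvalue tends to alpha.
        if gap <= 0.0:
            return alpha
        return alpha + (1.0 - alpha) * np.exp(-C / gap**2)

    lambdas = np.array([alpha, lam(mu[0] - mu[1]), lam(mu[0] - mu[2])])
    D = (V * lambdas) @ V.T          # V diag(lambdas) V^T
    return D, lambdas


def anisotropy_ratio(D):
    """Square root of the ratio of the extreme eigenvalues of D."""
    w = np.linalg.eigvalsh(D)
    return np.sqrt(w[-1] / w[0])


J = np.diag([3.0, 1.0, 0.0])
D, lambdas = ced_tensor(J)
print(lambdas, anisotropy_ratio(D))
```

Since all eigenvalues lie in $[\alpha,1]$, the ratio returned by `anisotropy_ratio` cannot exceed $1/\sqrt{\alpha}$, consistently with the bound stated above.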

3D MRI data

The data is a $256\times 256\times 100$ Magnetic Resonance Imaging scan of a skull, and was obtained from the "University of North Carolina Volume Rendering Test Data Set" archive.

The reconstructed image is obtained using a 3D Edge-Enhancing Diffusion PDE [25], which differs from the above Coherence-Enhancing Diffusion one by the choice of the diffusion tensor eigenvalues. The optimal choice of these eigenvalues indeed depends on the application, and is still an active subject of research [13]. With the above notations, the eigenvalues of ${\bf D}(J_{\rho})$ are

$$\begin{aligned}
\lambda_{1}&:=1-\exp\left(\dfrac{-C}{\mu_{1}^{2}}\right),\\
\lambda_{2}&:=1-\exp\left(\dfrac{-C}{\mu_{2}^{2}}\right),\\
\lambda_{3}&:=1.
\end{aligned}$$

We used the values $\sigma=0.5$, $\rho=4$. In our experiment, the maximum anisotropy ratio was $\kappa=11.2$. We present in Figure 15 the original image and two slices of the result after 10 time-steps of $\Delta t=10^{-4}$.
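The Edge-Enhancing eigenvalues above can be evaluated in the same spirit. The sketch below is an illustration only: the function name is hypothetical, the value of $C$ is a placeholder, and the input values of $\mu_{1},\mu_{2}$ are arbitrary test values rather than data from the MRI experiment.

```python
import math


def eed_eigenvalues(mu1, mu2, C=1.0):
    """Eigenvalues (lambda1, lambda2, lambda3) for edge-enhancing diffusion.

    mu1 >= mu2 are the two largest eigenvalues of the structure tensor.
    C is the constant of the formulas above; 1.0 is a placeholder value.
    """
    def lam(mu):
        # exp(-C / mu^2) tends to 0 as mu -> 0: the eigenvalue tends to 1.
        if mu == 0.0:
            return 1.0
        return 1.0 - math.exp(-C / mu**2)

    return lam(mu1), lam(mu2), 1.0


l1, l2, l3 = eed_eigenvalues(mu1=5.0, mu2=0.5)
kappa = math.sqrt(max(l1, l2, l3) / min(l1, l2, l3))
print(l1, l2, l3, kappa)
```

Unlike the Coherence-Enhancing case, $\lambda_{1}$ is not bounded below by a fixed $\alpha$, so the anisotropy ratio depends on the data; this is why a measured maximum is reported above rather than an a-priori bound.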

Conclusion

We introduced in this paper a new numerical scheme, AD-LBR, for anisotropic diffusion in image processing. This scheme is non-negative, and its stencils have a limited support: 6 points in 2D, 12 points in 3D. The former property implies that our scheme respects the maximum principle of Alvarez, Guichard, Lions and Morel, which is an essential feature of parabolic PDEs.

AD-LBR outperformed all tested alternatives in a quantitative numerical experiment: a test case in which approximate numerical solutions are compared against a known analytical solution. In a second, qualitative test case, different schemes were used to enhance a fingerprint image. Our scheme appears to close the lines of the fingerprint more efficiently, and to diffuse less in the direction orthogonal to the lines. This is precisely the purpose of the implemented PDE, coherence-enhancing diffusion. We also presented a 3-dimensional implementation as a proof of feasibility.

The construction of the stencils of AD-LBR is both original and non-trivial. The computational load for this aspect of the algorithm is fortunately not dominant, thanks to the use of a tool from discrete geometry: lattice basis reduction. AD-LBR also permits larger time steps than most of its counterparts, in explicit discretizations of parabolic equations.

AD-LBR trivially extends to vector valued and matrix valued images, by applying it to each image component independently. (In other words, the coupling between image components lies in the construction of the common diffusion tensor $\operatorname{\mathbf{D}}$, which AD-LBR regards as user input.) Future work will be devoted to the application of AD-LBR to the regularization of diffusion tensor fields, arising for instance from diffusion MRI, for which we expect it to be particularly appropriate: thanks to the non-negativity of the scheme, positive-definiteness is naturally preserved.

References

[1] L. Alvarez, F. Guichard, P.-L. Lions, J.-M. Morel, Axioms and fundamental equations of Image processing, Arch. Rational Mech. Anal., vol. 123, 199–257 (1993)
[2] D. Barash, T. Schlick, M. Israeli, and R. Kimmel, Multiplicative operator splittings in non-linear diffusion: from spatial splitting to multiplicative timesteps, Journal of Mathematical Imaging and Vision, 19:33-48, (2003).
[3] J.-B. Bost and K. Künnemann, Hermitian vector bundles and extension groups on arithmetic schemes. I. Geometry of numbers, 2010
[4] J. H. Conway, N. J. A. Sloane, Low-dimensional lattices. VI. Voronoi reduction of three-dimensional lattices., Proceedings of the Royal Society of London. Series A: Mathematical and Physical Sciences 436.1896 (1992): 55-68.
[5] G.-H. Cottet, L. Germain, Image processing through reaction combined with nonlinear diffusion, Math. Comp., vol. 61, 659–673 (1993).
[6] L. Dascal, A. Ditkowski, and N. Sochen, On the Discrete Maximum Principle for the Beltrami Color Flow, J. Math. Imaging Vision, vol. 29, 63–77 (2007)
[7] P. Degond, A. Lozinski, J. Narski, C. Negulescu, An Asymptotic-Preserving method for highly anisotropic elliptic equations based on a micro-macro decomposition, J. Comput. Phys., vol. 231(7), 2724–2740 (2012)
[8] S. Grewenig, J. Weickert, and A. Bruhn, From box filtering to fast explicit diffusion, Pattern Recognition 533-542, (2010).
[9] W. Huang, Discrete maximum principle and a Delaunay-type mesh condition for linear finite element approximations of two-dimensional anisotropic diffusion problems, arXiv preprint arXiv:1008.0562, (2010)
[10] F. Labelle, J. R. Shewchuk, Anisotropic Voronoi Diagrams and Guaranteed-Quality Anisotropic Mesh Generation, Proceedings of the Nineteenth Annual Symposium on Computational Geometry, 191-200 (2003)
[11] J. L. Lagrange, Recherches d’arithmétique, Nouveaux Mémoires de l’Académie de Berlin, (1773)
[12] A. K. Lenstra, H. W. Lenstra, and L. Lovász, Factoring polynomials with rational coefficients, Mathematische Annalen 261, 513–534, (1982)
[13] A. M. Mendrik, E. J. Vonken, A. Rutten, M. A. Viergever, and B. van Ginneken, Noise reduction in computed tomography scans using 3-d anisotropic hybrid diffusion with continuous switch, Medical Imaging, IEEE Transactions on, 28(10), 1585-1594. (2009)
[14] J.-M. Mirebeau, Anisotropic Fast Marching on Cartesian Grids, using Lattice Basis Reduction, preprint, 2012.
[15] A. Mitchell and D. Griffiths, The Finite Difference Method in Partial Differential Equations, Chichester: Wiley (1980).
[16] P. Mrázek and M. Navara, Consistent positive directional splitting of anisotropic diffusion, Proc. Sixth Computer Vision Winter Workshop, (2001)
[17] P. Q. Nguyen, and D. Stehlé, Low-dimensional lattice basis reduction revisited, ACM Transactions on Algorithms, Article 46 (2009).
[18] P. Q. Nguyen and J. Stern, The two faces of lattices in cryptology, In Proceedings of the 2001 Cryptography and Lattices Conference (CALC’01). Lecture Notes in Computer Science, vol. 2146. Springer-Verlag, 146–180, (2001)
[19] S. Osher, L. Rudin, Feature-oriented image enhancement using shock filters, SIAM J. Numer. Anal., vol. 27, 919–940 (1990)
[20] P. Perona and J. Malik, Scale-Space and Edge Detection Using Anisotropic Diffusion, IEEE Trans. Patt. Anal. Mach. Int., vol. 12, 629–639 (1990)
[21] B. Schmid, J. Schindelin, A. Cardona, M. Longair, M. Heisenberg, A high-level 3D visualization API for Java and ImageJ. BMC Bioinformatics, 11:274 (2010)
[22] E. Selling, Über die binären und ternären quadratischen Formen, J. reine angew. Math., 77:143–229, (1874).
[23] I. Semaev, A 3-dimensional lattice reduction algorithm, In Proceedings of the 2001 Cryptography and Lattices Conference (CALC’01). Lecture Notes in Computer Science, vol. 2146. Springer-Verlag, 181–193, (2001)
[24] J. Weickert, Theoretical foundations of anisotropic diffusion in image processing, Computing, Suppl. 11, 221–236 (1996)
[25] J. Weickert, Anisotropic Diffusion in Image Processing, Teubner, Stuttgart (1998)
[26] J. Weickert, B. Romeny, and M. Viergever, Efficient and reliable schemes for nonlinear diffusion filtering, IEEE Trans. Image Proc., vol. 7, 398–410 (1998)
[27] J. Weickert, and H. Scharr, A scheme for coherence-enhancing diffusion filtering with optimized rotation invariance, J. Visual Comm. Image Rep., Vol. 13, 103–118, (2002).
[28] M. Welk and G. Steidl and J. Weickert, Locally analytic schemes: A link between diffusion filtering and wavelet shrinkage, Applied and Computational Harmonic Analysis, 24, pp 195–224, (2008)

$$\begin{aligned}
4\langle e,M^{\prime}f\rangle&=\|e+f\|_{M^{\prime}}^{2}-\|e-f\|_{M^{\prime}}^{2}\\
&\leq\tau^{2}\|e+f\|_{M}^{2}-\tau^{-2}\|e-f\|_{M}^{2}\\
&=(\tau^{2}-\tau^{-2})(\|e\|_{M}^{2}+\|f\|_{M}^{2})+2(\tau^{2}+\tau^{-2})\langle e,Mf\rangle\\
&\leq((\tau^{2}-\tau^{-2})(1+\kappa(M)^{2})+\alpha(\tau^{2}+\tau^{-2}))\|e\|_{M}^{2}\\
&\leq((\tau^{4}-1)(1+\kappa(M)^{2})+\alpha(\tau^{4}+1))\|e\|_{M^{\prime}}^{2}.
\end{aligned}$$

$$\begin{aligned}
\|u(p+g)-u(p)\|^{2}&=\|u(p+e+f)-u(p)\|^{2}\\
&\leq 2(\|u(p+e+f)-u(p+e)\|^{2}+\|u(p+e)-u(p)\|^{2})\\
&=2(\|u(p^{\prime}+f)-u(p^{\prime})\|^{2}+\|u(p+e)-u(p)\|^{2}).
\end{aligned}\tag{58}$$
