This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Sub-Finsler geometry and nonholonomic mechanics

Layth M. Alabdulsada Dep. of Math, College of Science, University of Al Qadisiyah, Al-Qadisiyah 58001, Iraq layth.muhsin@qu.edu.iq
Abstract.

We discuss a variational approach to the length functional and its relation to sub-Hamiltonian equations on sub-Finsler manifolds. Then, we introduce the notion of the nonholonomic sub-Finslerian structure and prove that the distributions are geodesically invariant concerning the Barthel non-linear connection. We provide necessary and sufficient conditions for the existence of the curves that are abnormal extremals; likewise, we provide necessary and sufficient conditions for normal extremals to be the motion of a free nonholonomic mechanical system, and vice versa. Moreover, we show that a coordinate-free approach for a free particle is a comparison between the solutions of the nonholonomic mechanical problem and the solutions of the Vakonomic dynamical problem for the nonholonomic sub-Finslerian structure. In addition, we provide an example of the nonholonomic sub-Finslerian structure. Finally, we show that the sub-Laplacian measures the curvature of the nonholonomic sub-Finslerian structure.

Key words and phrases:
Sub-Finsler geometry, Sub-Hamiltonian vector field, Sub-Hamiltonian equations, Non-linear connection, Nonholonomic Free Particle, Sub-Laplacian
2020 Mathematics Subject Classification:
53C05, 53C60, 70F25, 53C17

1. Introduction

Sub-Finsler geometry and nonholonomic mechanics have attracted much attention recently; they are rich subjects with many applications.

Sub-Finsler geometry is a natural generalization of sub-Riemannian geometry. The sub-Riemannian metric was initially referred to as the Carnot-Carathéodory metric. J. Mitchell, [23], investigated the Carnot-Carathéodory distance between two points by considering a smooth Riemannian nn-manifold (M,g)(M,g) equipped with a kk-rank distribution 𝒟\mathcal{D} of the tangent bundle TMTM. A decade later, M. Gromov [17] provided a comprehensive study of the above concepts. V. N. Berestovskii [9] identified the Carnot-Caratheodory Finsler metric version as the Finsler counterpart of this metric, now commonly known as the sub-Finsler metric. In this study, our definition of the sub-Finsler metric closely aligns with the definition presented in previous works [3, 14]. The motivation behind studying sub-Finsler geometry lies in its pervasive presence within various branches of pure mathematics, particularly in differential geometry and applied fields like geometric mechanics, control theory, and robotics. We refer the readers to [1, 4, 8, 20].

Nonholonomic mechanics is currently a very active area of the so-called geometric mechanics [21]. Constraints on mechanical systems are typically classified into two categories: integrable and nonintegrable constraints. Nonholonomic mechanics: constraints that are not holonomic; these might be constraints that are expressed in terms of the velocity of the coordinates that cannot be derived from the constraints of the coordinates (thereby nonintegrable) or the constraints that are not given as an equation at all [19]. Nonholonomic control systems exhibit unique characteristics, allowing control of underactuated systems due to constraint nonintegrability. These problems arise in physical contexts like wheel systems, cars, robotics, and manipulations, with more insights found in [10, 21].

In [18], B. Langerock considered a general notion of connections over a vector bundle map and applied it to the study of mechanical systems with linear nonholonomic constraints and a Lagrangian of kinetic energy type. A. D. Lewis in [19], investigated various consequences of a natural restriction of a given affine connection to distribution. The basic construction comes from the dynamics of a class of mechanical systems with nonholonomic constraints. In a previous paper in collaboration with L. Kozma [3], constructed a generalized non-linear connection for a sub-Finslerian manifold, called \mathcal{L}-connection by the Legendre transformation which characterizes normal extremals of a sub-Finsler structure as geodesics of this connection. In this paper, [3] and [4] play an important role in calculating our main results. These results are divided into two parts: sub-Hamiltonian systems and nonholonomic sub-Finslerian structures on the nonintegrable distributions.

The paper is organized according to the following: In Section 2, we review some standard facts about sub-Finslerian settings. In Section 3, we define a sub-Finsler metric on 𝒟\mathcal{D} by using a sub-Hamiltonian function η(x,p)\eta(x,p) and show the correspondence between the solutions of sub-Hamiltonian equations and the solution of a variational problem. Section 4 introduces the notion of nonholonomic sub-Finslerian structures and presents the main results, including conditions for the motion of a free mechanical system under linear nonholonomic constraints to be normal extremal with respect to the linked sub-Finslerian structure. Section 5 provides an example of the nonholonomic sub-Finslerian structure, and Section 6 discusses the curvature of the sub-Finslerian structure. We conclude that if the sub-Laplacian ΔF\Delta_{F} is zero, then the sub-Finslerian structure is flat and locally isometric to a Riemannian manifold, while if ΔF\Delta_{F} is nonzero, the sub-Finslerian structure is curved and the shortest paths between two points on the manifold are not necessarily straight lines.

2. Preliminaries

Let MM be an nn-dimensional smooth (CC^{\infty}) manifold, and let TxMT_{x}M represent its tangent space at a point xMx\in M. We denote the module of vector fields over C(M)C^{\infty}(M) by 𝔛(M)\mathfrak{X}(M), and the module of 11-forms by 𝔛(M)\mathfrak{X}^{*}(M).

Consider 𝒟\mathcal{D}, a regular distribution on MM, defined as a subbundle of the tangent bundle TMTM with a constant rank of kk. Locally, in coordinates, this distribution can be expressed as 𝒟=span{X1,,Xk}\mathcal{D}=\textrm{span}\{X_{1},\ldots,X_{k}\}, where Xi(x)𝔛(M)X_{i}(x)\in\mathfrak{X}(M) are linearly independent vector fields.

A non-negative function F:𝒟+F:\mathcal{D}\rightarrow\mathbb{R}_{+} is called a sub-Finsler metric if it satisfies the following conditions:

  • 1.

    Smoothness: FF is a smooth function over 𝒟0\mathcal{D}\setminus{0};

  • 2.

    Positive Homogeneity: F(λv)=|λ|F(v)F(\lambda v)=|\lambda|F(v) for all λ\lambda\in\mathbb{R} and v𝒟0v\in\mathcal{D}\setminus{0};

  • 3.

    Positive Definiteness: The Hessian matrix of F2F^{2} is positive definite at every v𝒟x0v\in\mathcal{D}_{x}\setminus{0}.

A differential manifold MM equipped with a sub-Finsler metric FF is recognized as a sub-Finsler manifold, denoted by (M,𝒟,F)(M,\mathcal{D},F).

An absolutely continuous curve, denoted as σ:[0,1]M\sigma:[0,1]\rightarrow M, is considered horizontal if its tangent vector field σ˙(t)\dot{\sigma}(t) lies within 𝒟σ(t)\mathcal{D}_{\sigma(t)} for all t[0,1]t\in[0,1], whenever it is defined. This condition reflects the nonholonomic constraints imposed on the curve.

The length functional of such a horizontal curve σ\sigma possesses a derivative for almost all t[0,1]t\in[0,1], with the components of the derivative, σ˙\dot{\sigma}, representing measurable curves. The length of σ\sigma is usually defined as:

(σ)=01F(σ˙(t))𝑑t.\ell(\sigma)=\int_{0}^{1}F(\dot{\sigma}(t))dt.

This length structure gives rise to a distance function, denoted as d:M×M+d:M\times M\rightarrow\mathbb{R}_{+}, defined by:

d(x0,x1)=inf(σ),x0,x1M,d(x_{0},x_{1})=\inf\ell(\sigma),\qquad x_{0},x_{1}\in M,

and the infimum is taken over all horizontal curves connecting σ(0)=x0\sigma(0)=x_{0} to σ(1)=x1\sigma(1)=x_{1}. This distance metric captures the minimal length among all possible horizontal paths between two points on the manifold MM.

A geodesic, also known as a minimizing geodesic, refers to a horizontal curve σ:[0,1]M\sigma:[0,1]\rightarrow M that realizes the distance between two points, i.e., (σ)=d(σ(0),σ(1))\ell(\sigma)=d(\sigma(0),\sigma(1)).

Throughout this paper, it is consistently assumed that 𝒟\mathcal{D} is bracket-generating. A distribution 𝒟\mathcal{D}, is characterized as bracket-generating if every local frame XiX_{i} of 𝒟\mathcal{D}, along with all successive Lie brackets involving these frames, collectively span the entire tangent bundle TMTM. If 𝒟\mathcal{D} represents a bracket-generating distribution on a connected manifold MM, it follows that any two points within MM can be joined by a horizontal curve. This foundational concept was initially established by C. Carathéodory [12] and later reaffirmed by W. L. Chow [13] and P. K. Rashevskii [25]. However, for a comprehensive explanation of the bracket-generating concept, one can turn to R. Montgomery’s book, [24].

3. Sub-Hamiltonian associated with sub-Finslerian manifolds

3.1. The Legendre transformation and Finsler dual of sub-Finsler metrics

Let 𝒟\mathcal{D}^{*} be a rank-ss codistribution on a smooth manifold MM, assigning to each point xUMx\in U\subset M a linear subspace 𝒟xTxM\mathcal{D}^{*}_{x}\subset T^{*}_{x}M. This codistribution is a smooth subbundle, and spanned locally by ss pointwise linearly independent smooth differential 1-forms:

𝒟x=span{α1(x),,αs(x)}, withαi(x)𝔛(M).\mathcal{D}_{x}^{*}=\mathrm{span}\{\alpha_{1}(x),\ldots,\alpha_{s}(x)\},\ \text{ with}\ \alpha_{i}(x)\in\mathfrak{X}^{*}(M).

We define the annihilator of a distribution 𝒟\mathcal{D} on MM as (𝒟)0(\mathcal{D}^{\bot})^{0}, a subbundle of TMT^{*}M consisting of covectors that vanish on 𝒟\mathcal{D}:

(𝒟)0={αTM:α(v)=0 for all v𝒟},(\mathcal{D}^{\bot})^{0}=\{\alpha\in T^{*}M:\alpha(v)=0\text{ for all }v\in\mathcal{D}\},

such that v,α:=α(v)\langle v,\alpha\rangle:=\alpha(v). Similarly, we define the annihilator of the orthogonal complement of 𝒟\mathcal{D}, denoted by 𝒟0\mathcal{D}^{0}, as the subbundle of TMT^{*}M consisting of covectors that vanish on TMTM^{\bot}.

Using these notions, we can define a sub-Finslerian function denoted by F𝒟TM𝒟0F^{*}\in\mathcal{D}^{*}\sim T^{*}M\setminus\mathcal{D}^{0}, where FF^{*} is a positive function. This function shares similar properties with FF, but is based on 𝒟\mathcal{D}^{*} instead of 𝒟\mathcal{D}.

In our previous work [4], we established the relationship:

F(p)=F(v),wherep=L(v),for everyp𝒟xandv𝒟x,F^{*}(p)=F(v),\ \mathrm{where}\ p=\mathcal{L}_{L}(v),\quad\mbox{for every}\quad p\in\mathcal{D}^{*}_{x}\quad\mbox{and}\quad v\in\mathcal{D}_{x}, (1)

such that L\mathcal{L}_{L} is the Legendre transformation of the sub-Lagrangian function L:𝒟TML:\mathcal{D}\subset TM\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\mathbb{R}, a diffeomorphism between 𝒟\mathcal{D} and 𝒟\mathcal{D}^{*}.

In this context, to express FF^{*} in terms of FF, we consider the Legendre transformation of FF with respect to the sub-Lagrangian function L(v)=12F(v,v)L(v)=\frac{1}{2}F(v,v), where F(v,v)F(v,v) is the square of the Finsler norm of vv. The Legendre transformation L\mathcal{L}_{L} maps v𝒟v\in\mathcal{D} to p=Lv(v)p=\frac{\partial L}{\partial v}(v).

Utilizing the definition of the Legendre transformation, we observe that

p=Lv(v)=v(12F(v,v))=F(v,),p=\frac{\partial L}{\partial v}(v)=\frac{\partial}{\partial v}\left(\frac{1}{2}F(v,v)\right)=F(v,\cdot),

where F(v,)F(v,\cdot) denotes the differential of FF with respect to its first argument evaluated at vv. Note that F(v,)F(v,\cdot) is a linear function on 𝒟x\mathcal{D}_{x}.

Given a covector p𝒟p\in\mathcal{D}^{*}, with xx the base point of 𝒟\mathcal{D}, we can express the dual sub-Finsler metric FF^{*} in terms of FF as

F(p)=supv𝒟x{p,vF(v)},F^{*}(p)=\sup_{v\in\mathcal{D}_{x}}\left\{\frac{\langle p,v\rangle}{F(v)}\right\},

where p,v\langle p,v\rangle represents the dual pairing between the covector pp and the vector vv.

3.2. The Sub-Hamiltonian Function and Sub-Hamilton’s Equations for Sub-Finsler Manifolds

The sub-Hamiltonian function associated with a sub-Finsler metric FF given by

η:=12(F)2.\eta:=\frac{1}{2}(F^{*})^{2}.

Here, FF^{*} denotes the dual metric to FF, defined by

F(p)=supv𝒟x,F(v)=1p,v,F^{*}(p)=\sup_{v\in\mathcal{D}_{x},F(v)=1}\langle p,v\rangle, (2)

where pp represents a momentum vector in 𝒟x\mathcal{D}^{*}_{x} associated with the point xx in the manifold MM, and ,\langle\cdot,\cdot\rangle denotes the inner product induced by a Riemannian metric gg. The sub-Finslerian metric defined by (2) is known as the Legendre transform of FF, i.e., satisfying the relationship in (1). It is worth noting that the sub-Hamiltonian function associated with a Finsler metric is not unique, and different choices of Hamiltonians may lead to different dynamics for the associated geodesics.

The sub-Hamiltonian formalism is a method of constructing a sub-Finsler metric on a subbundle 𝒟\mathcal{D} by defining a sub-Hamiltonian function η(x,p)\eta(x,p) on the subbundle 𝒟\mathcal{D}^{*}, where xx denotes a point in MM and pp denotes a momentum vector in 𝒟\mathcal{D}^{*}, as explained in the following remark:

Remark 1.

The sub-Finsler vector bundle, introduced in [4] and expanded upon in [5], plays a pivotal role in formulating sub-Hamiltonians in sub-Finsler geometry. Consider the covector subbundle (𝒟,τ,M)(\mathcal{D}^{*},\tau,M) with projection τ:𝒟M\tau:\mathcal{D}^{*}\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M, forming a rank-kk subbundle in the cotangent bundle of TMT^{*}M. The pullback bundle τ(τ)=(𝒟×𝒟,pr1,𝒟)\tau^{*}(\tau)=(\mathcal{D}^{*}\times\mathcal{D}^{*},\mathrm{pr}_{1},\mathcal{D}^{*}) is obtained by pulling back τ\tau through itself and is denoted as the sub-Finsler bundle over 𝒟x\mathcal{D}^{*}_{x}. This bundle allows the introduction of kk orthonormal covector fields X1,X2,,XkX_{1},X_{2},\dots,X_{k} with respect to the induced Riemannian metric gg. The sub-Hamiltonian η\eta induces a metric gg on the sub-Finsler bundle. In terms of this metric, the sub-Hamiltonian function η\eta can be expressed as a function of components pip_{i}. Specifically, η(x,p)=12i,j=1ngijpipj\eta(x,p)=\frac{1}{2}\sum_{i,j=1}^{n}{g}^{ij}p_{i}p_{j}, where gijg^{ij} is the inverse of the metric tensor gijg_{ij} for the extended Finsler metric F^\hat{F} on TMTM, kindly check Remark 2. This defines a sub-Finsler metric on a subbundle 𝒟\mathcal{D} of TMTM that is determined by a distribution on MM.

Now fixing a point xMx\in M, for any covector p𝒟p\in\mathcal{D}^{*}, there exists a unique sub-Hamiltonian vector field on 𝒟\mathcal{D}^{*}, denoted by H\vec{H}, described by

H=ηpixiηxipi.\vec{H}=\frac{\partial\eta}{\partial p_{i}}\frac{\partial}{\partial x^{i}}-\frac{\partial\eta}{\partial x^{i}}\frac{\partial}{\partial p_{i}}. (3)

where the partial derivatives are taken with respect to the local coordinates (xi,pi)(x^{i},p_{i}) on 𝒟TM\mathcal{D}^{*}\subset T^{*}M.

Definition 1.

The sub-Hamiltonian equations on 𝒟\mathcal{D}^{*} are then given by

ηpi\displaystyle\frac{\partial\eta}{\partial p_{i}} =gijpj,\displaystyle=g^{ij}p_{j}, (4a)
ηxi\displaystyle\frac{\partial\eta}{\partial x^{i}} =12gjkxipjpk.\displaystyle=-\frac{1}{2}\frac{\partial g^{jk}}{\partial x^{i}}p_{j}p_{k}. (4b)

These equations express the fact that the sub-Hamiltonian vector field H\vec{H} preserves the sub-Finsler metric FF^{*} on 𝒟\mathcal{D}^{*}. If the Hamiltonian is independent of the cotangent variables pip_{i}, then the second equation above reduces to the Hamilton-Jacobi equation for the sub-Finsler manifold (M,𝒟,F)(M,\mathcal{D},F).

Remark 2.

We extended sub-Finsler metrics to full Finsler metrics using an orthogonal complement subbundle in [3]. However, here are more details and evidence.

Given a subbundle 𝒟\mathcal{D} of the tangent bundle TMTM, its direct complement 𝒟\mathcal{D}^{\perp} is a subbundle of TMTM such that TM=𝒟𝒟TM=\mathcal{D}\oplus\mathcal{D}^{\perp}, and at every point xMx\in M, 𝒟x𝒟x=0\mathcal{D}_{x}\cap\mathcal{D}_{x}^{\perp}={0} and 𝒟x+𝒟x=TxM\mathcal{D}_{x}+\mathcal{D}_{x}^{\perp}=T_{x}M.

One canonical way to obtain a direct complement to 𝒟\mathcal{D} is to use the notion of an orthogonal complement. Given a subbundle 𝒟\mathcal{D} of TMTM, we define the orthogonal complement bundle 𝒟\mathcal{D}^{\perp} as follows:

𝒟x={vTxM:v,w=0 for all w𝒟x},\mathcal{D}^{\perp}_{x}=\{v\in T_{x}M:\langle v,w\rangle=0\text{ for all }w\in\mathcal{D}_{x}\},

such that v,wv,w are orthogonal with respect to the inner product induced by the Riemannian metric. It can be shown that 𝒟\mathcal{D}^{\perp} is a subbundle of TMTM and satisfies the conditions for being a direct complement to 𝒟\mathcal{D}. Moreover, it can be shown that any two direct complements to 𝒟\mathcal{D} are isomorphic bundles, so the orthogonal complement is unique up to bundle isomorphism.

Note that if MM is equipped with a sub-Finsler metric, then the metric induces a non-degenerate inner product on 𝒟\mathcal{D}, so we can use this inner product to define the orthogonal complement. However, if MM is not equipped with a Riemannian metric, then the notion of an orthogonal complement may not be well-defined. So, to extend a given sub-Finsler metric FF on a subbundle 𝒟\mathcal{D} of TMTM to a full Finsler metric on TMTM, one can use an orthogonal complement subbundle 𝒟\mathcal{D}^{\perp}. This is a regular subbundle of TMTM that is orthogonal to 𝒟\mathcal{D} with respect to the Riemannian metric gijg_{ij}. Locally, 𝒟\mathcal{D}^{\perp} can be written as:

𝒟=span{X1,,Xnk},\mathcal{D}^{\perp}=\text{span}\{X^{\prime}_{1},\ldots,X^{\prime}_{n-k}\}, (5)

where kk is the rank of the subbundle 𝒟\mathcal{D} and X1,,XnkX^{\prime}_{1},\ldots,X^{\prime}_{n-k} are local vector fields that form a basis for 𝒟\mathcal{D}^{\perp}. Then, one can define a Finsler metric F^\hat{F} on TMTM by:

F^(v)=F2(P(v))+F~2(Pc(v)),\hat{F}(v)=\sqrt{F^{2}(P(v))+\widetilde{F}^{2}(P^{c}(v))}, (6)

where PP is the projection onto 𝒟\mathcal{D}, PcP^{c} is the projection onto 𝒟\mathcal{D}^{\perp}, and F~\widetilde{F} is a Finsler metric on 𝒟\mathcal{D}^{\perp}. This construction yields a full Finsler metric on TMTM that extends the sub-Finsler metric FF on 𝒟\mathcal{D}. Note that the Finsler metric F~\widetilde{F} on 𝒟\mathcal{D}^{\perp} is not unique, so the choice of F~\widetilde{F} is arbitrary. However, the resulting Finsler metric F^\hat{F} on TMTM is unique and independent of the choice of F~\widetilde{F}.

To see this, suppose we have two choices of Finsler metrics F~\widetilde{F} and F~\widetilde{F}^{\prime} on 𝒟\mathcal{D}^{\perp}. Let F^\hat{F} and F^\hat{F}^{\prime} be the corresponding extensions of FF to TMTM using Equation 6. Then for any vTMv\in TM, we have

F^2(v)\displaystyle\hat{F}^{2}(v) =F2(P(v))+F~2(Pc(v))\displaystyle=F^{2}(P(v))+\widetilde{F}^{2}(P^{c}(v))
F^2(v)\displaystyle\hat{F}^{\prime 2}(v) =F2(P(v))+F~2(Pc(v)).\displaystyle=F^{2}(P(v))+\widetilde{F}^{\prime 2}(P^{c}(v)).

Subtracting these two equations, we obtain

F^2(v)F^2(v)=F~2(Pc(v))F~2(Pc(v)).\hat{F}^{\prime 2}(v)-\hat{F}^{2}(v)=\widetilde{F}^{\prime 2}(P^{c}(v))-\widetilde{F}^{2}(P^{c}(v)).

Since vv can be decomposed uniquely as v=v+vv=v_{\parallel}+v_{\perp} with v𝒟v_{\parallel}\in\mathcal{D} and v𝒟v_{\perp}\in\mathcal{D}^{\perp}, we have Pc(v)=vP^{c}(v)=v_{\perp}, and the right-hand side of the above equation depends only on vv_{\perp}. Since the choice of F~\widetilde{F} on 𝒟\mathcal{D}^{\perp} is arbitrary, we can choose F~\widetilde{F} and F~\widetilde{F}^{\prime} to be equal except on a single vector vv_{\perp}, in which case F~2(Pc(v))F~2(Pc(v))\widetilde{F}^{\prime 2}(P^{c}(v))-\widetilde{F}^{2}(P^{c}(v)) will be nonzero only for that vector. Therefore, we have F^2(v)F^2(v)0\hat{F}^{\prime 2}(v)-\hat{F}^{2}(v)\neq 0 only for that vector, and hence F^=F^\hat{F}=\hat{F}^{\prime}.

Therefore, we have shown that the resulting Finsler metric F^\hat{F} on TMTM is unique and independent of the choice of F~\widetilde{F}.

Let us turn to define the normal and abnormal extremals:

The projection x(t)x(t) to MM is called a normal extremal. One can see that every sufficiently short subarc of the normal extremal x(t)x(t) is a minimizer sub-Finslerian geodesic. This subarc is the unique minimizer joining its endpoints (see [4, 7]). In the sub-Finslerian manifold, not all the sub-Finslerian geodesics are normal (contrary to the Finsler manifold). This is because the sub-Finslerian geodesics, which admit a minimizing geodesic, might not solve the sub-Hamiltonian equations. Those minimizers that are not normal extremals are called singular or abnormal extremals, (see for instance [24]). Even in the sub-Finslerian case, Pontryagin’s maximum principle implies that every minimizer of the arc length of the horizontal curves is a normal or abnormal extremal.

3.3. Non-Linear Connections on a sub-Finsler manifolds

Definition 2.

An \mathcal{L}-connection \nabla on a sub-Finsler manifold is a generalized non-linear connection over the induced mapping

E:TMTM,E(α(x))=𝐢(η(𝐢(α(x))))TM,E:T^{*}M\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>TM,\ \ \ E(\alpha(x))=\mathbf{i}(\mathcal{L}_{\eta}(\mathbf{i}^{*}(\alpha(x))))\in TM, (7)

constructed by Legendre transformation η:𝒟TM𝒟TM\mathcal{L}_{\eta}:\mathcal{D}^{*}\subset T^{*}M\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\mathcal{D}\subset TM by (7), where 𝐢:TM𝒟\mathbf{i}^{*}:T^{*}M\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\mathcal{D}^{*} is the adjoint mapping of 𝐢:𝒟TM\mathbf{i}:\mathcal{D}\rightarrow TM, i.e. for any α(x)𝔛(M),\alpha(x)\in\mathfrak{X}^{*}(M), 𝐢(α(x))\mathbf{i}^{*}(\alpha(x)) is determined by

X(x),𝐢(α(x))=𝐢(X(x)),α(x)for allX(x)𝔛(M),\langle X(x),\mathbf{i}^{*}(\alpha(x))\rangle=\langle\mathbf{i}(X(x)),\alpha(x)\rangle\ \text{for all}\ X(x)\in\mathfrak{X}(M),

such that v,α:=α(v)\langle v,\alpha\rangle:=\alpha(v) for all v𝒟,α𝒟v\in\mathcal{D},\alpha\in\mathcal{D}^{*}. For more details about the settings of the \mathcal{L}- connection \nabla, we refer the reader to [3]. Obviously, EE is a bundle mapping whose image set is precisely the subbundle 𝒟\mathcal{D} of TMTM and whose kernel is the annihilator 𝒟0\mathcal{D}^{0} of 𝒟\mathcal{D}.

Moreover, we recall the Barthel non-linear connection ¯B\overline{\nabla}^{B} of the cotangent bundle as follows

¯XBα(Y)=X(α(Y))α(XBY),\overline{\nabla}^{B}_{X}\alpha(Y)=X(\alpha(Y))-\alpha(\nabla^{B}_{X}Y),

where the Berwald connection B\nabla^{B} on the tangent bundle was locally given by

Nji=12Givj;Gi=gij(2LvjxkvkLxj).N_{j}^{i}=\frac{1}{2}\frac{\partial G^{i}}{\partial v^{j}};\quad G^{i}=g^{ij}\left(\frac{\partial^{2}L}{\partial v^{j}\partial x^{k}}v^{k}-\frac{\partial L}{\partial x^{j}}\right). (8)

The Barthel nonlinear connection plays the same role in the positivity homogeneous case as the Levi-Civita connection in Riemannian geometry, see [22].

Definition 3.

A curve α:[0,1]TM\alpha:[0,1]\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>T^{*}M is said to be EE-admissible if E(α(t))=σ˙(t)t[0,1]E(\alpha(t))=\dot{\sigma}(t)\ \forall t\in[0,1] such that πM:TMM\pi_{M}:T^{*}M\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M is the natural cotangent bundle projection. An auto-parallel curve is the EE-admissible curve with respect to \mathcal{L}-connection \nabla if it satisfies αα(t)=0\nabla_{\alpha}\alpha(t)=0 for all t[0,1]t\in[0,1]. The geodesic of \nabla is just the base curve γ=πMα\gamma=\pi_{M}\circ\alpha of the auto-parallel curve.

In coordinates, an auto-parallel curve α(t)=(xi(t),pi(t))\alpha(t)=(x^{i}(t),p_{i}(t)) satisfies the equations

x˙i(t)=gij(x(t),p(t))pj(t),p˙i(t)=Γijk(x(t),p(t))pj(t)pk(t),\dot{x}^{i}(t)=g^{ij}(x(t),p(t))p_{j}(t),\qquad\dot{p}_{i}(t)=-\Gamma^{jk}_{i}(x(t),p(t))p_{j}(t)p_{k}(t),

such that gijg^{ij} and Γjik\Gamma^{ik}_{j} are the local components of the contravariant tensor field of TMTMMTM\otimes TM\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M associated with the sub-Hamiltonian structure and the connection coefficients of \nabla, respectively. In fact, given a non-linear \mathcal{L}-connection \nabla we can always introduce a smooth vector field Γ\Gamma^{\nabla} on 𝒟\mathcal{D}^{*}, in addition, their integral curves are auto-parallel curves in relation to \nabla. In canonical coordinates, this vector field given by

Γ(x,p)=gij(x,p)pjxiΓjik(x,p)pipkpj.\Gamma^{\nabla}(x,p)=g^{ij}(x,p)p_{j}\frac{\partial}{\partial x^{i}}-\Gamma^{ik}_{j}(x,p)p_{i}p_{k}\frac{\partial}{\partial p_{j}}.

In [3], we proved that every geodesic of \nabla is a normal extremal, and vice versa. More precisely, we have shown that the coordinate expression for the sub-Hamiltonian vector field (this is another form of (3)) H\vec{H} equals:

H(x,p)=gij(x,p)pjxi12gijxk(x,p)pipjpk.\vec{H}(x,p)=g^{ij}(x,p)p_{j}\frac{\partial}{\partial x^{i}}-\frac{1}{2}\frac{\partial g^{ij}}{\partial x^{k}}(x,p)p_{i}p_{j}\frac{\partial}{\partial p_{k}}.

Comparing the latter formula with the definition of Γ\Gamma^{\nabla}, yields that Γ(x,p)=H(x,p)\Gamma^{\nabla}(x,p)=\vec{H}(x,p).

3.4. Variational approach to the length functional and its relation to sub-Hamiltonian equations on sub-Finsler manifolds

We can consider a small variation ψ(s,t)\psi(s,t) of the curve σ(t)\sigma(t) such that ψ(s,0)\psi(s,0) and ψ(s,1)\psi(s,1) are fixed at x0x_{0} and x1x_{1}, respectively, and ψ(0,t)=σ(t)\psi(0,t)=\sigma(t) for all t[0,1]t\in[0,1]. We can think of ψ(s,t)\psi(s,t) as a one-parameter family of curves in the set of all curves joining x0x_{0} and x1x_{1}, and we can consider the variation vector field v(t)=ψs(0,t)v(t)=\frac{\partial\psi}{\partial s}(0,t), which is tangent to the curve σ(t)\sigma(t).

Then, we can define the directional derivative of the length functional \ell along the variation vector field vv as

𝐝(σ)v=dds|s=0(ψ(s,)).\mathbf{d}\ell(\sigma)\cdot v=\frac{d}{ds}\Big{|}_{s=0}\ell(\psi(s,\cdot)). (9)

Note that (ψ(s,))\ell(\psi(s,\cdot)) is the length of the curve ψ(s,)\psi(s,\cdot), which starts at x0x_{0} and ends at x1x_{1}. Therefore, dds|s=0(ψ(s,))\frac{d}{ds}\Big{|}_{s=0}\ell(\psi(s,\cdot)) is the rate of change of the length of the curve as we vary it along the vector field vv.

By chain rule, we can write

dds|s=0(ψ(s,))=01xa(σ(t))ψas(0,t)𝑑t,\frac{d}{ds}\Big{|}_{s=0}\ell(\psi(s,\cdot))=\int_{0}^{1}\frac{\partial\ell}{\partial x^{a}}(\sigma(t))\frac{\partial\psi^{a}}{\partial s}(0,t)dt,

where xa\frac{\partial\ell}{\partial x^{a}} is the gradient of the length functional. Using the fact that ψ(s,t)\psi(s,t) is a variation of σ(t)\sigma(t) and ψ(0,t)=σ(t)\psi(0,t)=\sigma(t), we can express ψas(0,t)\frac{\partial\psi^{a}}{\partial s}(0,t) in terms of the variation vector field vv as

ψas(0,t)=s|s=0ψa(s,t)=s|s=0σa(t)+st|t=tva(t)=va(t).\frac{\partial\psi^{a}}{\partial s}(0,t)=\frac{\partial}{\partial s}\Big{|}_{s=0}\psi^{a}(s,t)=\frac{\partial}{\partial s}\Big{|}_{s=0}\sigma^{a}(t)+s\frac{\partial}{\partial t}\Big{|}{t=t}v^{a}(t)=v^{a}(t).

Therefore, we obtain

dds|s=0(ψ(s,))=01xa(σ(t))va(t)𝑑t=𝐝(σ)v,\frac{d}{ds}\Big{|}_{s=0}\ell(\psi(s,\cdot))=\int_{0}^{1}\frac{\partial\ell}{\partial x^{a}}(\sigma(t))v^{a}(t)dt=\mathbf{d}\ell(\sigma)\cdot v,

which gives the desired equation (9).

Let us clarify the correct relationship between the sub-Hamiltonian equations and the length functional.

Given a sub-Finsler manifold (M,𝒟,F)(M,\mathcal{D},F), the sub-Hamiltonian equations on MM are given by

ddt(Fpa(σ(t)))=Fxa(σ(t)),\frac{d}{dt}\left(\frac{\partial F}{\partial p_{a}}(\sigma(t))\right)=-\frac{\partial F}{\partial x^{a}}(\sigma(t)), (10)

where σ:[0,1]M\sigma:[0,1]\rightarrow M is a horizontal curve in MM with σ(0)=x0\sigma(0)=x_{0} and σ(1)=x1\sigma(1)=x_{1}.

On the other hand, the length functional on MM is defined as

(σ)=01F(σ(t),σ˙(t)),dt,\ell(\sigma)=\int_{0}^{1}F(\sigma(t),\dot{\sigma}(t)),dt,

where σ\sigma is a horizontal curve in MM with σ(0)=x0\sigma(0)=x_{0} and σ(1)=x1\sigma(1)=x_{1}.

It will be shown (see Proposition 6) that a curve σ\sigma is a solution to the sub-Hamiltonian equations if and only if it is a critical point of the length functional \ell. In other words, if σ\sigma satisfies the sub-Hamiltonian equations, then 𝐝(σ)=0\mathbf{d}\ell(\sigma)=0, and conversely, if σ\sigma is a critical point of \ell, then it satisfies the sub-Hamiltonian equations.

Proposition 1.

A horizontal curve σ:[0,1]M\sigma:[0,1]\rightarrow M joining σ(0)=x0\sigma(0)=x_{0} with σ(1)=x1\sigma(1)=x_{1} in MM is a solution to the sub-Hamiltonian equations if and only if it is a critical point of the length functional \ell. That is, if and only if 𝐝(σ)=0\mathbf{d}\ell(\sigma)=0.

Proof.

We will begin by proving the first direction:

Assume that σ\sigma satisfies the sub-Hamiltonian equations. Then, we have

ddt(Fpa(σ(t)))=Fxa(σ(t)),\frac{d}{dt}\left(\frac{\partial F}{\partial p_{a}}(\sigma(t))\right)=-\frac{\partial F}{\partial x^{a}}(\sigma(t)),

for all a=1,,ma=1,\ldots,m and t[0,1]t\in[0,1]. Note that Fpa\frac{\partial F}{\partial p_{a}} is the conjugate momentum of xax^{a}, and we can write the sub-Finsler Lagrangian as

L(x,x˙)=F(x,x˙)det(gij(x)),L(x,\dot{x})=F(x,\dot{x})\sqrt{\det(g_{ij}(x))},

where gij(x)=2F2x˙ix˙j(x,x˙)g_{ij}(x)=\frac{\partial^{2}F^{2}}{\partial\dot{x}^{i}\partial\dot{x}^{j}}(x,\dot{x}) is the sub-Finsler metric tensor. Then, the length functional can be written as

(σ)=01L(σ(t),σ˙(t)),dt.\ell(\sigma)=\int_{0}^{1}L(\sigma(t),\dot{\sigma}(t)),dt.

Using the Euler-Lagrange equation for the Lagrangian LL, we have

ddt(Lx˙a(σ(t),σ˙(t)))Lxa(σ(t),σ˙(t))=0,\frac{d}{dt}\left(\frac{\partial L}{\partial\dot{x}^{a}}(\sigma(t),\dot{\sigma}(t))\right)-\frac{\partial L}{\partial x^{a}}(\sigma(t),\dot{\sigma}(t))=0,

for all a=1,,ma=1,\ldots,m and t[0,1]t\in[0,1]. Since LL depends only on x˙\dot{x} and not on xx explicitly, we can write this as

ddt(Fx˙a(σ(t))det(gij(σ(t))))Fxa(σ(t))det(gij(σ(t)))=0,\frac{d}{dt}\left(\frac{\partial F}{\partial\dot{x}^{a}}(\sigma(t))\sqrt{\det(g_{ij}(\sigma(t)))}\right)-\frac{\partial F}{\partial x^{a}}(\sigma(t))\sqrt{\det(g_{ij}(\sigma(t)))}=0,

for all a=1,,ma=1,\ldots,m and t[0,1]t\in[0,1]. Using the chain rule and the fact that σ\sigma is horizontal curve, we can write this as

ddt(Fpa(σ(t)))Fxa(σ(t))=0,\frac{d}{dt}\left(\frac{\partial F}{\partial p_{a}}(\sigma(t))\right)-\frac{\partial F}{\partial x^{a}}(\sigma(t))=0,

for all a=1,,ma=1,\ldots,m and t[0,1]t\in[0,1]. This is exactly the condition for σ\sigma to be a critical point of \ell, i.e., 𝐝(σ)=0\mathbf{d}\ell(\sigma)=0.

Now, let us proceed to prove the second direction:

Assume that σ\sigma is a critical point of \ell, i.e., 𝐝(σ)=0\mathbf{d}\ell(\sigma)=0. Then, for any smooth variation δσ:[0,1]TM\delta\sigma:[0,1]\rightarrow TM with δσ(0)=δσ(1)=0\delta\sigma(0)=\delta\sigma(1)=0, we have

0=𝐝(σ)(δσ)=01Fxa(σ(t)),δxa(t)𝑑t,0=\mathbf{d}\ell(\sigma)(\delta\sigma)=\int_{0}^{1}\left\langle\frac{\partial F}{\partial x^{a}}(\sigma(t)),\delta x^{a}(t)\right\rangle dt,

where δxa(t)=dds|s=0xa(σ(t)+sδσ(t))\delta x^{a}(t)=\frac{d}{ds}\bigg{|}_{s=0}x^{a}(\sigma(t)+s\delta\sigma(t)) is the variation of the coordinates xax^{a} induced by δσ\delta\sigma. Note that we have used the fact that δσ(0)=δσ(1)=0\delta\sigma(0)=\delta\sigma(1)=0 to get rid of boundary terms.

Since δσ\delta\sigma is arbitrary, this implies that

Fxa(σ(t))=0,\frac{\partial F}{\partial x^{a}}(\sigma(t))=0,

for all a=1,,ma=1,\ldots,m and t[0,1]t\in[0,1]. Using the sub-Hamiltonian equations, we can write this as

ddt(Fpa(σ(t)))=0,\frac{d}{dt}\left(\frac{\partial F}{\partial p_{a}}(\sigma(t))\right)=0,

for all a=1,,ma=1,\ldots,m and t[0,1]t\in[0,1]. This implies that Fpa\frac{\partial F}{\partial p_{a}} is constant along σ\sigma. Since σ\sigma is horizontal curve, we can choose a partition 0=t0<t1<<tn=10=t_{0}<t_{1}<\cdots<t_{n}=1 such that σ\sigma is smooth on each subinterval [ti1,ti][t_{i-1},t_{i}]. Let cac_{a} be the constant value of Fpa\frac{\partial F}{\partial p_{a}} on σ\sigma.

Then, for each i=1,,ni=1,\ldots,n, we have

ddt(Fpa(σ(t)))=0,\frac{d}{dt}\left(\frac{\partial F}{\partial p_{a}}(\sigma(t))\right)=0,

for all a=1,,ma=1,\ldots,m and t[ti1,ti]t\in[t_{i-1},t_{i}]. This implies that

Fpa(σ(t))=ca,\frac{\partial F}{\partial p_{a}}(\sigma(t))=c_{a},

for all a=1,,ma=1,\ldots,m and t[ti1,ti]t\in[t_{i-1},t_{i}]. Since Fpa\frac{\partial F}{\partial p_{a}} is the conjugate momentum of xax^{a}, this implies that σ\sigma satisfies the sub-Hamiltonian equations on each subinterval [ti1,ti][t_{i-1},t_{i}].

Therefore, σ\sigma satisfies the sub-Hamiltonian equations on the whole interval [0,1][0,1], which completes the proof of second direction. ∎

Corollary 1.

If σ:[0,1]M\sigma:[0,1]\rightarrow M is a horizontal curve that minimizes the length functional \ell between two points x0x_{0} and x1x_{1} on a sub-Finsler manifold (M,𝒟,F)(M,\mathcal{D},F), then σ\sigma is a smooth geodesic between x0x_{0} and x1x_{1}. Conversely, if σ\sigma is a smooth geodesic between x0x_{0} and x1x_{1}, then its length (σ)\ell(\sigma) is locally minimized.

Proof.

The proof of this corollary follows directly from Proposition 1. ∎

The Proposition 1 establish the significance of the results in the context of sub-Hamiltonian equations and curve optimization on a sub-Finsler manifold. The corollary highlights the connection between curve optimization, geodesics, and the length functional on sub-Finsler manifolds. Collectively, these results provide deep insights into the geometric behavior of curves on sub-Finsler manifolds, linking the sub-Hamiltonian equations, length minimization, and the concept of geodesics in this context.

4. Nonholonomic sub-Finslerian structure

A sub-Finslerian structure is a generalization of a Finslerian structure, where the metric on the tangent space at each point is only required to be positive-definite on a certain subbundle of tangent vectors.

A nonholonomic sub-Finslerian structure is a triple (M,𝒟,F)(M,\mathcal{D},F) where MM is a smooth manifold of dimension nn, 𝒟\mathcal{D} is a non-integrable distribution of rank k<nk<n on MM, which means that it cannot be generated by taking the Lie bracket of vector fields. This property leads to the nonholonomicity of the structure and has important implications for the geometry and dynamics of the system. The regularity condition on 𝒟\mathcal{D} means that it can be locally generated by smooth vector fields, and the nonholonomic condition means that it cannot be integrable to a smooth submanifold of MM. The sub-Finslerian metric FF is a positive-definite inner product on the tangent space of 𝒟\mathcal{D} at each point of MM. It is often expressed as a norm that satisfies the triangle inequality but does not necessarily have the homogeneity property of a norm. The metric FF induces a distance function on MM, known as the sub-Riemannian distance or Carnot-Carathéodory distance, which is a natural generalization of the Riemannian distance. Mechanically, sub-Riemannian manifolds (M,𝒟,g)(M,\mathcal{D},g) and their generalization, sub-Finslerian manifolds (M,𝒟,F)(M,\mathcal{D},F) are classified as configuration spaces [6].

Nonholonomic sub-Finslerian structures arise in the study of control theory and robotics, where they model the motion of nonholonomic systems, i.e., systems that cannot achieve arbitrary infinitesimal motions despite being subject to arbitrary small forces. The motivation for this generalization comes from the need to provide a framework that captures the complexities of motion in such systems beyond what sub-Riemannian geometry alone can achieve. The study of these structures involves geometric methods, such as the theory of connections and curvature, and leads to interesting mathematical problems. This generalization not only extends the applicability of the theory to a wider class of problems but also paves the way for new insights into the geometric mechanics of nonholonomic systems.

4.1. Nonholonomic Free Particle Motion under a Non-Linear Connection and Projection Operators

We have the projection operator P:TM𝒟0P^{*}:T^{*}M\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\mathcal{D}^{0} that projects any covector αTM\alpha\in T^{*}M onto its horizontal component with respect to the non-linear connection induced by the distribution 𝒟\mathcal{D}. More precisely, for any YTMY\in TM, we define P(Y)P(Y) to be the projection of YY onto 𝒟\mathcal{D}, and then P(α)(Y)=α(YP(Y))P^{*}(\alpha)(Y)=\alpha(Y-P(Y)).

Next, we have the complement projection (P)c:TM(𝒟)0(P^{*})^{c}:T^{*}M\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>(\mathcal{D}^{\bot})^{0}, which projects any covector αTM\alpha\in T^{*}M onto its vertical component with respect to the non-linear connection induced by the distribution 𝒟\mathcal{D}. More precisely, for any YTMY\in TM, we define P(Y)P^{\bot}(Y) to be the projection of YY onto 𝒟\mathcal{D}^{\bot}, and then (P)c(α)(Y)=α(P(Y))(P^{*})^{c}(\alpha)(Y)=\alpha(P^{\bot}(Y)).

Now, we consider a nonholonomic free particle moving along a horizontal curve σ:[0,1]M\sigma:[0,1]\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M. Let ¯B\overline{\nabla}^{B} be a Barthel non-linear connection, (see [3, 18]), and the condition P(¯σ˙(t)Bσ˙(t))=0P^{*}(\overline{\nabla}^{B}_{\dot{\sigma}(t)}\dot{\sigma}(t))=0 expresses the fact that the velocity vector σ˙(t)\dot{\sigma}(t) is constrained to be horizontal, while the constraint condition σ˙(t)𝒟0\dot{\sigma}(t)\in\mathcal{D}^{0} expresses the fact that the velocity vector lies in the distribution 𝒟\mathcal{D}.

Using the fact that TMT^{*}M can be decomposed into its horizontal and vertical components with respect to the non-linear connection induced by the distribution 𝒟\mathcal{D}, we can express any covector αTM\alpha\in T^{*}M as α=P(α)+(P)c(α)\alpha=P^{*}(\alpha)+(P^{*})^{c}(\alpha). Then, the constraint condition σ˙(t)𝒟0\dot{\sigma}(t)\in\mathcal{D}^{0} can be written as (P)c(dσ/dt)=0(P^{*})^{c}(\mathrm{d}\sigma/\mathrm{d}t)=0.

Using the above decomposition of α\alpha, we can rewrite the condition P(¯σ˙(t)Bσ˙(t))=0P^{*}(\overline{\nabla}^{B}_{\dot{\sigma}(t)}\dot{\sigma}(t))=0 as P(¯σ˙(t)Bσ˙(t))=P(dσ˙/dt)=d(P(σ˙))/dt=0P^{*}(\overline{\nabla}^{B}_{\dot{\sigma}(t)}\dot{\sigma}(t))=P^{*}(\mathrm{d}\dot{\sigma}/\mathrm{d}t)=\mathrm{d}(P^{*}(\dot{\sigma}))/\mathrm{d}t=0, where we have used the fact that P(dσ˙/dt)P^{*}(\mathrm{d}\dot{\sigma}/\mathrm{d}t) is the derivative of the horizontal component of σ˙\dot{\sigma} with respect to time, and hence is zero if σ˙\dot{\sigma} is constrained to be horizontal.

Therefore, the conditions P(¯σ˙(t)Bσ˙(t))=0P^{*}(\overline{\nabla}^{B}_{\dot{\sigma}(t)}\dot{\sigma}(t))=0 and (P)c(dσ/dt)=0(P^{*})^{c}(\mathrm{d}\sigma/\mathrm{d}t)=0 together express the fact that the velocity vector σ˙(t)\dot{\sigma}(t) of the nonholonomic free particle is constrained to be horizontal and lie in the distribution 𝒟\mathcal{D}, respectively.

Since TMT^{*}M is identified with TMTM via a Riemannian metric gg, we have a natural isomorphism between (𝒟)0(\mathcal{D}^{\bot})^{0} and 𝒟0\mathcal{D}^{0} given by the orthogonal projection. In particular, we have a direct sum decomposition of the cotangent bundle TMT^{*}M as

TM(𝒟)0𝒟0.T^{*}M\cong(\mathcal{D}^{\bot})^{0}\oplus\mathcal{D}^{0}.

Note that any covector αTM\alpha\in T^{*}M can be uniquely decomposed as α=(P)c(α)+P(α)\alpha=(P^{*})^{c}(\alpha)+P^{*}(\alpha).

We can define a new non-linear connection ¯\overline{\nabla} on (M,𝒟,F)(M,\mathcal{D},F) according to

¯X(P(α))(Y)=¯XB(P(α))(Y)+¯XB((P)c(α))(Y)\overline{\nabla}_{X}(P^{*}(\alpha))(Y)=\overline{\nabla}^{B}_{X}(P^{*}(\alpha))(Y)+\overline{\nabla}^{B}_{X}((P^{*})^{c}(\alpha))(Y) (11)

for all X𝔛(M)X\in\mathfrak{X}(M) and α𝔛(M)\alpha\in\mathfrak{X}^{*}(M). We restrict this connection to 𝒟0\mathcal{D}^{0} and the equations of motion of the nonholonomic free particle can be re-written as ¯σ˙(t)σ˙(t)=0\overline{\nabla}_{\dot{\sigma}(t)}{\dot{\sigma}(t)}=0, together with the initial velocity taken in 𝒟0\mathcal{D}^{0} (see [18, 19]).

Given a nonholonomic sub-Finsler structure (M,𝒟,F)(M,\mathcal{D},F) one can always construct a normal and 𝒟\mathcal{D}-adapted \mathcal{L}-connection [3, Proposition 16]. Furthermore, we can construct a generalized non-linear connection over the vector bundle 𝐢:𝒟TM\mathbf{i}:\mathcal{D}\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>TM, we will set XΓ(𝒟)X\in\Gamma(\mathcal{D}) with η(𝐢X)𝔛(M)\mathcal{L}_{\eta}(\mathbf{i}\circ X)\in\mathfrak{X}^{*}(M). So, attached to (M,𝒟,F)(M,\mathcal{D},F) there is a non-linear connection H:Γ(𝒟)×Γ(𝒟0)Γ(𝒟0)\nabla^{H}:\Gamma(\mathcal{D})\times\Gamma(\mathcal{D}^{0})\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\Gamma(\mathcal{D}^{0}) called the nonholonomic connection over the adjoint mapping 𝐢:𝒟TM\mathbf{i}:\mathcal{D}\rightarrow TM on natural projection τ:𝒟0M\tau:\mathcal{D}^{0}\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M given by

XHα(Y)=P(¯XBα(Y)).\nabla^{H}_{X}\alpha(Y)=P^{*}(\overline{\nabla}^{B}_{X}\alpha(Y)).

Moreover, there is no doubt this indeed determines a non-linear connection, namely,

XHα(Y)=¯X(P(α))(Y),\nabla^{H}_{X}\alpha(Y)=\overline{\nabla}_{X}(P^{*}(\alpha))(Y),

such that ¯\overline{\nabla} is the non-linear connection given in (11), for all XΓ(𝒟)X\in\Gamma(\mathcal{D}) and αΓ(𝒟0)\alpha\in\Gamma(\mathcal{D}^{0}). In the nonholonomic setting, the horizontal curves are σ^\hat{\sigma} in 𝒟\mathcal{D} that are extensions of curves in MM, i.e. σ^(t)=σ˙(t)\hat{\sigma}(t)=\dot{\sigma}(t) for some curve σ\sigma in MM.

Definition 4.

Let (M,𝒟,F)(M,\mathcal{D},F) be a nonholonomic sub-Finsler structure. A nonholonomic bracket

[,]:Γ(π𝒟)Γ(τ)Γ(τ)[\cdot,\cdot]:\Gamma(\pi_{\mathcal{D}})\otimes\Gamma(\tau)\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\Gamma(\tau)

is defined as [X,α]=(P)c[X,α][X,\alpha]=(P^{*})^{c}[X,\alpha] for all XΓ(π𝒟)X\in\Gamma(\pi_{\mathcal{D}}), αΓ(τ)\alpha\in\Gamma(\tau), π𝒟:𝒟M\pi_{\mathcal{D}}:\mathcal{D}\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M and τ:𝒟M\tau:\mathcal{D}^{*}\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M. This Lie bracket satisfies all the regular properties of the Lie bracket with the exception of the Jacobi identity. It may happen that the nonholonomic bracket [X,α]Γ(τ)[X,\alpha]\notin\Gamma(\tau) because 𝒟\mathcal{D}^{*} is nonintegrable.

Now, we can formally define the torsion operator

T(X,α):=XHααHXP[X,α].T(X,\alpha):=\nabla^{H}_{X}\alpha-\nabla^{H}_{\alpha}X-P^{*}[X,\alpha].

In this setting, due to the symmetry of the non-linear connection XHα=αHX\nabla^{H}_{X}\alpha=\nabla^{H}_{\alpha}X, the torsion T(X,α)=0T(X,\alpha)=0 for all XΓ(𝒟)X\in\Gamma(\mathcal{D}) and αΓ(𝒟0)\alpha\in\Gamma(\mathcal{D}^{0}). Moreover, [4, Lemma 5], implies that the non-linear connection H\nabla^{H} preserves the sub-Finsler metric FF on 𝒟\mathcal{D}, i.e. XHF=0\nabla^{H}_{X}F=0 for all XΓ(𝒟)X\in\Gamma(\mathcal{D}). Therefore, there exists a unique conservative homogeneous nonlinear connection H\nabla^{H} with zero torsion and we can write the equations of motion for the given nonholonomic problem as σ˙(t)Hσ˙(t)=0\nabla^{H}_{\dot{\sigma}(t)}\dot{\sigma}(t)=0, in such a way that σ\sigma is a curve in MM tangent to 𝒟\mathcal{D}.

There is a close relationship between nonholonomic constraints and the controllability of non-linear systems. More precisely, there is a beautiful link between optimal control of nonholonomic systems and sub-Finsler geometry. In the case of a large class of physically interesting systems, the optimal control problem is reduced to finding geodesics with respect to the sub-Finslerian metric. The geometry of such geodesic flows is exceptionally rich and provides guidance for the design of control laws, for more information see Montgomery [24]. We have seen in Section 2 that for each point xMx\in M, we have the following distribution of rank kk

𝒟=span{X1,,Xk},Xi(x)TxM,\mathcal{D}=\textrm{span}\{X_{1},\ldots,X_{k}\},\qquad X_{i}(x)\in T_{x}M,

such that for any control function u(t)=(u1(t),,ut(t))ku(t)=(u_{1}(t),\ldots,u_{t}(t))\in\mathbb{R}^{k} the control system is defined as

x˙=i=1kuiXi(x),xM,\dot{x}=\sum_{i=1}^{k}u_{i}X_{i}(x),\qquad x\in M,

is called a nonholonomic control system or driftless control system in the quantum mechanical sense, see [6].

4.2. Results

The subsequent findings enhance comprehension of nonholonomic sub-Finslerian structures and their relevance in geometric mechanics. These insights offer essential tools for addressing and resolving issues concerning restricted movement within mathematical and physical domains. Specifically, these results shed light on the behavior of nonholonomic structures and their utility in analyzing constrained motion, particularly within the realm of geometric mechanics.

Remark 3.

We call the distribution 𝒟\mathcal{D} a geodesically invariant if for every geodesic σ:[0,1]M\sigma:[0,1]\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M of ¯B\overline{\nabla}^{B}, σ˙(0)𝒟σ(0)\dot{\sigma}(0)\in\mathcal{D}_{{\sigma}(0)} implies that σ˙(t)𝒟σ(t)\dot{\sigma}(t)\in\mathcal{D}_{{\sigma}(t)} for every t(0,1]t\in(0,1].

One can prove that if (M,𝒟,F)(M,\mathcal{D},F) is a sub-Finslerian manifold such that for any xMx\in M, 𝒟x\mathcal{D}_{x} is a vector subspace of TxMT_{x}M. The distribution 𝒟\mathcal{D} is geodesically invariant if and only if, for any xMx\in M and any v𝒟xv\in\mathcal{D}_{x}, the Jacobi field along any geodesic γ(t)\gamma(t) with initial conditions γ(0)=x\gamma(0)=x and γ˙(0)=v\dot{\gamma}(0)=v is also in 𝒟\mathcal{D}.

In other words, if the Jacobi fields along any geodesic with initial conditions in 𝒟\mathcal{D} remain in 𝒟\mathcal{D}, then 𝒟\mathcal{D} is geodesically invariant. Conversely, if 𝒟\mathcal{D} is geodesically invariant, then any Jacobi field along a geodesic with initial conditions in 𝒟\mathcal{D} must also remain in 𝒟\mathcal{D}. We leave the proof of this statement for future work.

The following Proposition implies, in particular, that 𝒟\mathcal{D} is geodesically invariant with respect to Barthel’s non-linear connection ¯B\overline{\nabla}^{B}.

Proposition 2.
  • (I)

    For each X𝔛(M)X\in\mathfrak{X}(M) and αΓ(𝒟0)\alpha\in\Gamma(\mathcal{D}^{0}), ¯X(P(α))(Y)Γ(𝒟0)\overline{\nabla}_{X}(P^{*}(\alpha))(Y)\in\Gamma(\mathcal{D}^{0}).

  • (II)

    For each X𝔛(M)X\in\mathfrak{X}(M) and αΓ(𝒟0)\alpha\in\Gamma(\mathcal{D}^{0}), ¯XB((P)c(α))(Y)Γ(𝒟0)\overline{\nabla}^{B}_{X}((P^{*})^{c}(\alpha))(Y)\in\Gamma(\mathcal{D}^{0}).

  • (III)

    For each X𝔛(M)X\in\mathfrak{X}(M) and αΓ(𝒟)0\alpha\in\Gamma(\mathcal{D}^{\bot})^{0}, ¯XB((P)c(α))(Y)Γ(𝒟)0\overline{\nabla}^{B}_{X}((P^{*})^{c}(\alpha))(Y)\in\Gamma(\mathcal{D}^{\bot})^{0}.

Proof.
  • (I)

    Let X𝔛(M)X\in\mathfrak{X}(M) and αΓ(𝒟0)\alpha\in\Gamma(\mathcal{D}^{0}). Then, by the definition of the pullback connection, given in (11), and the Leibniz rule, we have

    ¯X(P(α))(Y)\displaystyle\overline{\nabla}_{X}(P^{*}(\alpha))(Y) =X(P(α)(Y))P(α)(¯X(Y))\displaystyle=X(P^{*}(\alpha)(Y))-P^{*}(\alpha)(\overline{\nabla}_{X}(Y))
    =X(α(P(Y)))α(¯X(Y))\displaystyle=X(\alpha(P(Y)))-\alpha(\overline{\nabla}_{X}(Y))
    =α(X(P(Y)))α(¯X(Y))\displaystyle=\alpha(X(P(Y)))-\alpha(\overline{\nabla}_{X}(Y))
    =α(P(X(Y)))α(¯X(Y))\displaystyle=\alpha(P(\mathcal{L}_{X}(Y)))-\alpha(\overline{\nabla}_{X}(Y))
    =P(α(X(Y)))α(¯X(Y))\displaystyle=P(\alpha(\mathcal{L}_{X}(Y)))-\alpha(\overline{\nabla}_{X}(Y))
    =P(X(α(Y)))α(¯X(Y))\displaystyle=P(\mathcal{L}_{X}(\alpha(Y)))-\alpha(\overline{\nabla}_{X}(Y))
    =P(X(P(α)(Y)))α(¯X(Y))\displaystyle=P(\mathcal{L}_{X}(P^{*}(\alpha)(Y)))-\alpha(\overline{\nabla}_{X}(Y))
    =P(¯X(P(α))(Y))α(¯X(Y)).\displaystyle=P(\overline{\nabla}_{X}(P^{*}(\alpha))(Y))-\alpha(\overline{\nabla}_{X}(Y)).

    Since P(¯X(P(α))(Y))P(\overline{\nabla}_{X}(P^{*}(\alpha))(Y)) and α(¯X(Y))\alpha(\overline{\nabla}_{X}(Y)) both lie in Γ(𝒟0)\Gamma(\mathcal{D}^{0}), it follows that ¯X(P(α))(Y)\overline{\nabla}_{X}(P^{*}(\alpha))(Y) also lies in Γ(𝒟0)\Gamma(\mathcal{D}^{0}).

  • (II)

    Using the definition of the connection ¯B\overline{\nabla}^{B}, we have:

    ¯XB((P)c(α))(Y)\displaystyle\overline{\nabla}^{B}_{X}((P^{*})^{c}(\alpha))(Y) =X((P)c(α)(Y))(P)c(α)(XBY)\displaystyle=X((P^{*})^{c}(\alpha)(Y))-(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y)
    +(P)c(α)(XBY).\displaystyle\quad+(P^{\bot})^{c}(\alpha)(\nabla_{X}^{B}Y).

    Now, let us analyze each term on the right-hand side individually:

    First, consider X((P)c(α)(Y))X((P^{*})^{c}(\alpha)(Y)). Since (P)c(α)(Y)(P^{*})^{c}(\alpha)(Y) is a section of 𝒟0\mathcal{D}^{0} and XX is a vector field on MM, X((P)c(α)(Y))X((P^{*})^{c}(\alpha)(Y)) is a section of 𝒟0\mathcal{D}^{0}.

    Next, we have (P)c(α)(XBY)-(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y). Here, (P)c(α)(P^{*})^{c}(\alpha) is a bundle map from \mathcal{E} to 𝒟0\mathcal{D}^{0}, so (P)c(α)(XBY)(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y) is a section of 𝒟0\mathcal{D}^{0}. The negative sign in front ensures that the result remains in 𝒟0\mathcal{D}^{0}.

    Finally, we consider (P)c(α)(XBY)(P^{\bot})^{c}(\alpha)(\nabla_{X}^{B}Y). Since (P)c(α)(P^{\bot})^{c}(\alpha) is a bundle map from \mathcal{E} to the orthogonal complement of 𝒟0\mathcal{D}^{0}, (P)c(α)(XBY)(P^{\bot})^{c}(\alpha)(\nabla_{X}^{B}Y) is a section of Γ(𝒟)\Gamma(\mathcal{D}^{\bot}). However, we need it to be a section of Γ(𝒟0)\Gamma(\mathcal{D}^{0}).

    To ensure that (P)c(α)(XBY)(P^{\bot})^{c}(\alpha)(\nabla_{X}^{B}Y) lies in Γ(𝒟)0\Gamma(\mathcal{D)^{0}}, we can use the projection operator PP to project it back onto 𝒟0\mathcal{D}^{0}. This projection ensures that the final result remains within Γ(𝒟0)\Gamma(\mathcal{D}^{0}).

    Combining these results, we see that ¯XB((P)c(α))(Y)\overline{\nabla}^{B}_{X}((P^{*})^{c}(\alpha))(Y) is a section of Γ(𝒟0)\Gamma(\mathcal{D}^{0}), as desired.

  • (III)

    Using the definition of the connection ¯B\overline{\nabla}^{B}, we have

    ¯XB((P)c(α))(Y)\displaystyle\overline{\nabla}^{B}_{X}((P^{*})^{c}(\alpha))(Y) =X[(P)c(α)(Y)](P)c(α)(¯XY)+(P)c(¯XBα)(Y)\displaystyle=X[(P^{*})^{c}(\alpha)(Y)]-(P^{*})^{c}(\alpha)(\overline{\nabla}_{X}Y)+(P^{*})^{c}(\overline{\nabla}^{B}_{X}\alpha)(Y)
    =X[(P)c(α)(Y)](P)c(α)(¯XY)+(P)c((¯Xα))(Y)\displaystyle=X[(P^{*})^{c}(\alpha)(Y)]-(P^{*})^{c}(\alpha)(\overline{\nabla}_{X}Y)+(P^{*})^{c}((\overline{\nabla}_{X}\alpha)^{\top})(Y)
    =X[(P)c(α)(Y)](P)c(α)(XBY)+(P)c((¯Xα))(Y)\displaystyle=X[(P^{*})^{c}(\alpha)(Y)]-(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y)+(P^{*})^{c}((\overline{\nabla}_{X}\alpha)^{\top})(Y)

    where in the last step we used the fact that

    (P)c(α)(XBY)=(P)c(α)(¯XY),(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y)=-(P^{*})^{c}(\alpha)(\overline{\nabla}_{X}Y),

    which follows from the definition of the codifferential operator and the fact that (P)c=(P)c(P^{*})^{c}=-(P^{*})^{c}.

    Now we need to show that the three terms on the right-hand side of this expression lie in Γ(𝒟)0\Gamma(\mathcal{D}^{\bot})^{0}. We will do this term by term. First, note that (P)c(α)(Y)Γ(𝒟)0(P^{*})^{c}(\alpha)(Y)\in\Gamma(\mathcal{D}^{\bot})^{0} since (P)c(α)(P^{*})^{c}(\alpha) maps Γ(𝒟)\Gamma(\mathcal{D}^{\bot}) to itself and YΓ(𝒟)0Y\in\Gamma(\mathcal{D}^{\bot})^{0}.

    Next, we need to show that (P)c(α)(XBY)Γ(𝒟)0(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y)\in\Gamma(\mathcal{D}^{\bot})^{0}. Note that

    (P)c(α)(XBY)=(P)c(α)(¯XY),(P^{*})^{c}(\alpha)(\nabla_{X}^{B}Y)=-(P^{*})^{c}(\alpha)(\overline{\nabla}_{X}Y),

    so it suffices to show that (P)c(α)(¯XY)Γ(𝒟)0(P^{*})^{c}(\alpha)(\overline{\nabla}_{X}Y)\in\Gamma(\mathcal{D}^{\bot})^{0}. To see this, note that ¯XYΓ(𝒟)0\overline{\nabla}_{X}Y\in\Gamma(\mathcal{D}^{\bot})^{0} since XX and YY are both sections of 𝒟\mathcal{D}^{\bot}, and that (P)c(α)(P^{*})^{c}(\alpha) maps Γ(𝒟)0\Gamma(\mathcal{D}^{\bot})^{0} to itself.

    Finally, we need to show that (P)c((¯Xα))(Y)Γ(𝒟)0(P^{*})^{c}((\overline{\nabla}_{X}\alpha)^{\top})(Y)\in\Gamma(\mathcal{D}^{\bot})^{0}. To see this, note that (¯Xα)(\overline{\nabla}_{X}\alpha)^{\top} is a tensor of type (1,1)(1,1) that maps vectors tangent to MM to vectors tangent to MM, so (P)c((¯Xα))(Y)(P^{*})^{c}((\overline{\nabla}_{X}\alpha)^{\top})(Y) is a section of 𝒟\mathcal{D}^{\bot}. Moreover, (P)c((¯Xα))(P^{*})^{c}((\overline{\nabla}_{X}\alpha)^{\top}) maps Γ(𝒟)0\Gamma(\mathcal{D}^{\bot})^{0} to itself since (¯Xα)(\overline{\nabla}_{X}\alpha)^{\top} maps Γ(TM)\Gamma(TM) to itself and (P)c(P^{*})^{c} maps Γ(𝒟)\Gamma(\mathcal{D}^{\bot}) to itself.

    Therefore, we have shown that ¯XαΓ(𝒟)0\overline{\nabla}_{X}\alpha\in\Gamma(\mathcal{D}^{\bot})^{0}, which implies that α\alpha is a harmonic one-form with respect to the induced metric on M\partial M.

    To summarize, we showed that if α\alpha is a closed one-form on MM such that α|M=0\alpha|_{\partial M}=0, then α\alpha is a harmonic one-form with respect to the induced metric on M\partial M.

In the following, we shall present the nonholonomic sub-Finslerian structure results. To begin, we define coordinate independent conditions for the motion of a free mechanical system subjected to linear nonholonomic constraints to be normal extremal with respect to the connected sub-Finslerian manifold, and vice versa. Then, we address the problem of characterizing the normal and abnormal extremals that validate both nonholonomic and Vakonomic equations for a free particle subjected to certain kinematic constraints.

Let (M,𝒟,F)(M,\mathcal{D},F) be a nonholonomic sub-Finslerian structure and σ:[0,1]M\sigma:[0,1]\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M be a horizontal curve tangent to 𝒟\mathcal{D}, then σ\sigma is said to be a normal extremal if there exists EE-admissible curve α\alpha with base curve σ\sigma that is auto-parallel with respect to a normal \mathcal{L}-connection (Definition 3). While the curve σ\sigma is said to be an abnormal extremal if there exists γΓ(𝒟0)\gamma\in\Gamma(\mathcal{D}^{0}) along σ\sigma such that αγ(t)=0\nabla_{\alpha}\gamma(t)=0 for all t[0,1]t\in[0,1], with α\alpha a EE-admissible curve with base curve σ\sigma.

Remark 4.

Cortés et al. [15], made a comparison between the solutions of the nonholonomic mechanical problem and the solutions of the Vakonomic dynamical problem for the general Lagrangian system. The Vakonomic dynamical problem, associated with a free particle with linear nonholonomic constraints, consists of finding normal extremals with respect to the sub-Finsleriann structure (M,𝒟,F)(M,\mathcal{D},F). It is an interesting comparison because the equations of motion for the mechanical problem are derived by means of d’Alembert’s principle, while the normal extremals are derived from a variation principle. Our next results are an alternate approach to the Cortés results, that is a coordinate-free approach, for the free particle case in the sub-Finslerian settings.

Definition 5.

Let (M,𝒟,F)(M,\mathcal{D},F) be a nonholonomic sub-Finslerian structure, one can establish new tensorial operators according to the following:

TB\displaystyle T^{B} :Γ(𝒟)Γ(𝒟)Γ(𝒟0),(X,α)P(¯XBα);\displaystyle:\Gamma(\mathcal{D})\otimes\Gamma(\mathcal{D}^{*})\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\Gamma(\mathcal{D}^{0}),\quad(X,\alpha)\mapsto P^{*}(\overline{\nabla}_{X}^{B}\alpha);
T\displaystyle T :Γ(𝒟)Γ(𝒟0)Γ(𝒟)0,(X,γ)(P)c(δXγ);\displaystyle:\Gamma(\mathcal{D})\otimes\Gamma(\mathcal{D}^{0})\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\Gamma(\mathcal{D}^{\bot})^{0},\quad(X,\gamma)\mapsto(P^{*})^{c}(\delta_{X}\gamma);

such that

δ:Γ(𝒟)×Γ(𝒟0)𝔛(M),(X,γ)δXγ=iXdγ.\delta:\Gamma(\mathcal{D})\times\Gamma(\mathcal{D}^{0})\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>\mathfrak{X}^{*}(M),\quad(X,\gamma)\mapsto\delta_{X}\gamma=i_{X}d\gamma.

In addition, these tensorial operators have the following properties:

  • (I)

    TBT^{B} and TT are (M)\mathcal{F}(M)-bilinear in their independent variables;

  • (II)

    The behavior of TBT^{B} and TT can be identified pointwise;

  • (III)

    TxB(X,α)T_{x}^{B}(X,\alpha) and Tx(X,γ)T_{x}(X,\gamma) have a clear and unequivocal meaning for all X𝒟,α𝒟X\in\mathcal{D},\alpha\in\mathcal{D}^{*} and γ𝒟0\gamma\in\mathcal{D}^{0}.

In the following, w show the relation between the operator TBT^{B} and the curvature of the distribution 𝒟\mathcal{D} using the following condition:

Suppose X𝒟,α𝒟X\in\mathcal{D},\alpha\in\mathcal{D}^{*}, then one has

T(X,γ),α=δXγ,α=γ,[X,α],\langle T(X,\gamma),\alpha\rangle=\langle\delta_{X}\gamma,\alpha\rangle=-\langle\gamma,[X,\alpha]\rangle,

for any γΓ(𝒟0).\gamma\in\Gamma(\mathcal{D}^{0}). Therefore, TT is trivial if and only if 𝒟\mathcal{D} is involutive.

Definition 6.

Let T\nabla^{T} denote the non-linear connection over i:𝒟TMi:\mathcal{D}\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>TM on 𝒟0\mathcal{D}^{0} by the following formula

XTγ=P(δXγ),\nabla^{T}_{X}\gamma=P^{*}(\delta_{X}\gamma),

such that XΓ(𝒟)X\in\Gamma(\mathcal{D}) and γΓ(𝒟0).\gamma\in\Gamma(\mathcal{D}^{0}).

Proposition 3.

Let (M,𝒟,F)(M,\mathcal{D},F) be a nonholonomic sub-Finslerian structure, assume that σ:[0,1]M\sigma:[0,1]\>\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 0.0pt\hbox{\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}\ignorespaces{\hbox{\lx@xy@droprule}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces\ignorespaces{\hbox{\kern 4.49588pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{$\scriptstyle{}$}}}\kern 3.0pt}}}}}}\ignorespaces{\hbox{\kern 14.99176pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces{\hbox{\lx@xy@droprule}}{\hbox{\lx@xy@droprule}}\ignorespaces}}}}\>M is a horizontal curve on 𝒟\mathcal{D} and let \nabla be a 𝒟\mathcal{D}-adapted \mathcal{L}-connection. Then, the following properties are satisfied:

  • (I)

    If p0𝒟σ(0)p_{0}\in\mathcal{D}^{*}_{\sigma(0)} is a given initial point, then p(t)=p~(t)p(t)=\tilde{p}(t) for each t[0,1]t\in[0,1] if and only if TB(σ˙(t),p~(t))=0T^{B}(\dot{\sigma}(t),\tilde{p}(t))=0, such that p(t)p(t) and p~(t)\tilde{p}(t) are parallel transported curves along σ\sigma w.r.t. ¯B\overline{\nabla}^{B} and H\nabla^{H}, respectively.

  • (II)

    If γ0𝒟σ(0)0\gamma_{0}\in\mathcal{D}^{0}_{\sigma(0)} is a given initial point, then γ(t)=γ~(t)\gamma(t)=\tilde{\gamma}(t) for each t[0,1]t\in[0,1] if and only if T(σ˙(t),γ~(t))=0T(\dot{\sigma}(t),\tilde{\gamma}(t))=0, such that γ(t)\gamma(t) and γ~(t)\tilde{\gamma}(t) are parallel transported curves along σ\sigma w.r.t. \nabla and T\nabla^{T}, respectively.

Proof.

It is sufficient to prove that the first case and the second one follow similar arguments.

As a consequence of the definition of the tensorial operator TBT^{B}, for any section S(t)S(t) of 𝒟\mathcal{D}^{*} along σ\sigma, the next expression is true

σ˙(t)HS(t)=¯σ˙(t)BS(t)TB(σ˙(t),S(t)).\nabla^{H}_{{\dot{\sigma}(t)}}S(t)=\overline{\nabla}^{B}_{{\dot{\sigma}(t)}}S(t)-T^{B}({\dot{\sigma}(t)},S(t)).

Now, suppose that S(t)=p~(t)=p(t)S(t)=\tilde{p}(t)=p(t), then we get,

TB(σ˙(t),S(t))=0.T^{B}({\dot{\sigma}(t)},S(t))=0.

Conversely, it is well known that, regarding any connection, the parallel transported curves are uniquely determined by their initial conditions. ∎

It is clear that the second property of the above Proposition yields necessary and sufficient conditions for the existence of the curves that have abnormal extremals. In other words, σ\sigma is an abnormal extremal if and only if there exists a parallel transported section γ~\tilde{\gamma} of 𝒟0\mathcal{D}^{0} along σ\sigma with respect to T\nabla^{T} such that T(σ˙(t),γ~(t))=0T(\dot{\sigma}(t),\tilde{\gamma}(t))=0. Now, by the next Proposition, one can derive the necessary and sufficient condition for normal extremals to be a motion of a free nonholonomic mechanical system and vice versa.

Lemma 1.

Let (M,𝒟,F)(M,\mathcal{D},F) be nonholonomic sub-Finslerian structures, and \nabla be a normal non-linear \mathcal{L}-connection. Then for any α𝔛(M)\alpha\in\mathfrak{X}^{*}(M) we have that αα=0\nabla_{\alpha}\alpha=0 if and only if

E(α)Hα(P)=\displaystyle\nabla^{H}_{E(\alpha)}\alpha(P)= T(E(α),(P)c(α));\displaystyle-T(E(\alpha),(P^{*})^{c}(\alpha));
E(α)TP(α)=\displaystyle\nabla^{T}_{E(\alpha)}P^{*}(\alpha)= TB(E(α),α(P)).\displaystyle-T^{B}(E(\alpha),\alpha(P)).
Proof.

We proved in [3], that αα=0\nabla_{\alpha}\alpha=0 if and only if αα=¯E(α)B(P)c(α)+δE(α)P(α)=0.\nabla_{\alpha}\alpha=\overline{\nabla}^{B}_{E(\alpha)}(P^{*})^{c}(\alpha)+\delta_{E(\alpha)}P^{*}(\alpha)=0. Moreover, P(α)=α(P)P^{*}(\alpha)=\alpha(P) and the Barthel non-linear connection preserves the metric, i.e. Bη=η¯B\nabla^{B}\circ\mathcal{L}_{\eta}=\mathcal{L}_{\eta}\circ\overline{\nabla}^{B}, therefore

¯E(α)BP(α)=\displaystyle\overline{\nabla}^{B}_{E(\alpha)}P^{*}(\alpha)= E(α)HP(α)+TB(E(α),P(α)),\displaystyle\nabla^{H}_{E(\alpha)}P^{*}(\alpha)+T^{B}(E(\alpha),P^{*}(\alpha)),
δE(α)P(α)=\displaystyle\delta_{E(\alpha)}P^{*}(\alpha)= E(α)TP(α)+T(E(α),(P)c(α)).\displaystyle\nabla^{T}_{E(\alpha)}P^{*}(\alpha)+T(E(\alpha),(P^{*})^{c}(\alpha)).

According to the fact that TMT^{*}M can be written as the direct sum of (𝒟)0(\mathcal{D}^{\bot})^{0} and 𝒟0\mathcal{D}^{0}, so the equivalence is pretty clear. ∎

Theorem 1.

If σ:[0,1]M\sigma:[0,1]\rightarrow M is a solution of a free nonholonomic system given by nonholonomic sub-Finslerian structures, then it is also a solution of the corresponding Vakonomic problem, and vice versa, if and only if there exists γΓ(𝒟0)\gamma\in\Gamma(\mathcal{D}^{0}) along σ\sigma such that

σ˙Tγ(t)=TB(σ˙(t),L(σ˙(t))),\nabla^{T}_{\dot{\sigma}}\gamma(t)=-T^{B}(\dot{\sigma}(t),\mathcal{L}_{L}(\dot{\sigma}(t))), (12)

further, for all tt, γ(t)(𝒟σ(t)+[σ˙(t),𝒟σ(t)])0\gamma(t)\in{(\mathcal{D}_{\sigma(t)}+[\dot{\sigma}(t),\mathcal{D}_{\sigma(t)}])}^{0}.

Proof.

αα(t)=0\nabla_{\alpha}\alpha(t)=0 is the condition for any EE-admissible curve α(t)=L(σ˙(t))+γ(t)\alpha(t)=\mathcal{L}_{L}(\dot{\sigma}(t))+\gamma(t) to be parallel transported with respect to a normal \mathcal{L}-connection. In other words,

σ˙HL(σ˙(t))=T(σ˙(t),γ(t))\nabla^{H}_{\dot{\sigma}}\mathcal{L}_{L}(\dot{\sigma}(t))=-T(\dot{\sigma}(t),\gamma(t))

and

σ˙Tγ(t)=TB(σ˙(t),L(σ˙(t))).\nabla^{T}_{\dot{\sigma}}\gamma(t)=-T^{B}(\dot{\sigma}(t),\mathcal{L}_{L}(\dot{\sigma}(t))).

Therefore, σ˙HL(σ˙(t))=0\nabla^{H}_{\dot{\sigma}}\mathcal{L}_{L}(\dot{\sigma}(t))=0 if and only if T(σ˙(t),γ(t))=0T(\dot{\sigma}(t),\gamma(t))=0, such that γ(t)\gamma(t) is a solution of (12). Since Remark 3 and Proposition 2 guaranteed that 𝒟\mathcal{D} is geodesically invariant, therefore, given any γ(t)\gamma(t) in (𝒟σ(t)+[σ˙(t),𝒟σ(t)])0{(\mathcal{D}_{\sigma(t)}+[\dot{\sigma}(t),\mathcal{D}_{\sigma(t)}])}^{0}, then (12) ensure that there is always a solution for all t[0,1]t\in[0,1] not only for γ(0)\gamma(0) in (𝒟σ(0)+[σ˙(0),𝒟σ(0)])0{(\mathcal{D}_{\sigma(0)}+[\dot{\sigma}(0),\mathcal{D}_{\sigma(0)}])}^{0}. ∎

5. Examples from Robotics

Typically, nonholonomic systems occur when velocity restrictions are applied, such as the constraint that bodies move on a surface without slipping. Bicycles, cars, unicycles, and anything with rolling wheels are all examples of nonholonomic sub-Finslerian structures.

We will discuss the simplest wheeled mobile robot, which is a single upright rolling wheel, or unicycle, which is known as a kinematic penny rolling on a plane. Assume this wheel is of radius 1 and does not allow sideways sliding. Its configuration MM consists of the heading angle ϕ\phi, the wheel’s point or the contact position (x1,x2)(x_{1},x_{2}), and the rolling angle ψ\psi (see Figure 1). Consequently, the space concerned has dimensions four, i.e., M=2×S1×S1M=\mathbb{R}^{2}\times S^{1}\times S^{1}. There are two control functions deriving the wheel [14, 21]:

Refer to caption
Figure 1. A kinematic penny rolling on a plane
  • (I)

    u1u_{1} [rolling speed], the forward-backward rolling angular,

  • (II)

    u2u_{2} [turning speed], the speed of turning the heading direction ϕ\phi.

With these controls, the rate of change of the coordinates can be expressed as follows:

M˙\displaystyle\dot{M} =[ϕ˙x˙1x˙2ψ˙]=[01cosϕ0sinϕ010][u1u2]=X(M)u.\displaystyle=\begin{bmatrix}\dot{\phi}\\ \dot{x}_{1}\\ \dot{x}_{2}\\ \dot{\psi}\end{bmatrix}=\begin{bmatrix}0&1\\ \cos{\phi}&0\\ \sin{\phi}&0\\ 1&0\\ \end{bmatrix}\begin{bmatrix}u_{1}\\ u_{2}\end{bmatrix}=X(M)u. (13)

As we generally do not worry about the wheel’s rolling angle, we could drop the fourth row from the above equation to get a simpler control system

M˙\displaystyle\dot{M} =[ϕ˙x˙1x˙2]=[01cosϕ0sinϕ0][u1u2]=X(M)u,\displaystyle=\begin{bmatrix}\dot{\phi}\\ \dot{x}_{1}\\ \dot{x}_{2}\end{bmatrix}=\begin{bmatrix}0&1\\ \cos{\phi}&0\\ \sin{\phi}&0\end{bmatrix}\begin{bmatrix}u_{1}\\ u_{2}\end{bmatrix}=X(M)u, (14)

which can be written as the following equation:

X(M)u=X1(M)u1+X2(M)u2,X(M)u=X_{1}(M)u_{1}+X_{2}(M)u_{2},

such that u1,u2u_{1},u_{2} are called the controls and X1(w),X2(w)X_{1}(w),X_{2}(w) are called vector fields. Moreover, each vector field assigns a velocity to every point ww in the configuration space, so these vector fields are sometimes called velocity vector fields. Hence the velocity vector fields of any solution curve should lie in 𝒟\mathcal{D} spanned by the following vector fields:

X1(M)\displaystyle X_{1}(M) =cosϕx1+sinϕx2+ψ\displaystyle=\cos{\phi}\frac{\partial}{\partial x_{1}}+\sin{\phi}\frac{\partial}{\partial x_{2}}+\frac{\partial}{\partial\psi}
X2(M)\displaystyle X_{2}(M) =ϕ.\displaystyle=\frac{\partial}{\partial\phi}.

In a natural way, a sub-Riemannian metric on 𝒟\mathcal{D} is gained by asserting the vector fields X1(M),X2(M)X_{1}(M),X_{2}(M) to be orthonormal vectors,

u1X1(M)+u2X2(M),u1X1(M)+u2X2(M)=u12+u22.\langle u_{1}X_{1}(M)+u_{2}X_{2}(M),u_{1}X_{1}(M)+u_{2}X_{2}(M)\rangle=u_{1}^{2}+u_{2}^{2}.

The integral of this quadratic form measures the work completed in rolling the heading angle ϕ\phi at the rate ϕ˙\dot{\phi} and propelling the wheel ahead at the rate of ψ˙\dot{\psi}. The sub-Riemannian structure will be adjusted as specified by the notion that curvature is costly: namely, it takes more attempts to steer the wheel in a tight circle with little forward or backward movement than to steer it in a wide arc. Therefore, the curvature of the projection σ\sigma given by κ=ϕ˙ψ˙\kappa=\frac{\dot{\phi}}{\dot{\psi}} this brings us to assume sub-Finsler metrics of the body

F=f(κ)dψ2+dϕ2,F=f(\kappa)\sqrt{d{\psi}^{2}+d{\phi}^{2}},

such that ff grows larger but remains constrained as κ\mid\kappa\mid increases. After we check the sub-Finslerian property, one finds the nonholonomic of the rolling wheel, often known as a unicycle, by the equation M˙=X(M)u\dot{M}=X(M)u which is the kinematic model of the unicycle.

6. The sub-Laplacian associated with nonholonomic sub-Finslerian structures

The sub-Laplacian is a differential operator that arises naturally in the study of nonholonomic sub-Finslerian structures. These are geometric structures that generalize Riemannian manifolds, allowing for non-integrable distributions of tangent spaces.

On a sub-Finslerian manifold MM, there is a distinguished distribution of tangent spaces 𝒟\mathcal{D}, which corresponds to the directions that are accessible by moving along curves with bounded sub-Finsler length. The sub-Finsler metric FF on MM measures the sub-Finsler length of curves with respect to this distribution.

The sub-Laplacian is defined as a second-order differential operator that acts on functions on MM and is defined in terms of the metric FF and the distribution 𝒟\mathcal{D}. It is given by

ΔF=div𝒟(gradF),\Delta_{F}=\mathrm{div}_{\mathcal{D}}(\mathrm{grad}_{F}),

where gradF\mathrm{grad}_{F} is the gradient vector field associated with FF which is the unique vector field satisfying dF(gradF,X)=X(F)\mathrm{d}F(\mathrm{grad}_{F},X)=X(F) for all vector fields XX on MM, and div𝒟\mathrm{div}_{\mathcal{D}} is the divergence operator with respect to the distribution 𝒟\mathcal{D}, which is defined as the trace of the tangential part of the connection on 𝒟\mathcal{D}.

Our goal in this section is to show that the sub-Laplacian measures the curvature of the sub-Finslerian structure. It captures the interplay between the sub-Finsler metric FF and the distribution 𝒟\mathcal{D}, and plays a crucial role in many geometric and analytic problems on nonholonomic sub-Finslerian manifolds.

For example, the heat kernel associated with the sub-Laplacian provides a way to study the long-term behavior of solutions to the heat equation on sub-Finslerian manifolds. The Hodge theory on sub-Finslerian manifolds is also intimately related to the sub-Laplacian, and involves the study of differential forms that are harmonic with respect to the sub-Laplacian.

Remark 5.

To see that the sub-Laplacian measures the curvature of the sub-Finslerian structure, let us first recall some basic facts about Riemannian manifolds, see [16]. On a Riemannian manifold (M,g)(M,g), the Laplace-Beltrami operator is defined as

Δg=div(gradg),\Delta_{g}=\mathrm{div}(\mathrm{grad}_{g}),

where gradg\mathrm{grad}_{g} is the gradient vector field associated with the Riemannian metric gg, and div\mathrm{div} is the divergence operator. It is a well-known fact that the Laplace-Beltrami operator measures the curvature of the Riemannian structure in the sense that it is zero if and only if the Riemannian manifold is flat.

The sub-Finslerian case is more complicated due to the presence of the distribution 𝒟\mathcal{D} that is not integrable in general. However, the sub-Laplacian ΔF\Delta_{F} can still be understood as a curvature operator. To see this, we need to introduce the notion of a horizontal vector field.

A vector field XX on MM is called horizontal if it is tangent to the distribution 𝒟\mathcal{D}. Equivalently, XX is horizontal if it is locally of the form X=i=1khiXiX=\sum_{i=1}^{k}h_{i}X_{i}, where hih_{i} are smooth functions and X1,,XkX_{1},\ldots,X_{k} are smooth vector fields that form a basis for 𝒟\mathcal{D}.

Given a horizontal vector field XX, we can define its sub-Finsler length |X|F|X|_{F} as the infimum of the lengths of horizontal curves that are tangent to XX at each point. Equivalently, |X|F|X|_{F} is the supremum of the scalar products g(X,Y)g(X,Y) over all horizontal vector fields YY with |Y|F1|Y|_{F}\leq 1.

With these definitions in place, we can now show that the sub-Laplacian measures the curvature of the sub-Finslerian structure. More precisely, we have the following result:

Theorem 2.

The sub-Laplacian ΔF\Delta_{F} is zero if and only if the sub-Finslerian manifold (M,F,𝒟)(M,F,\mathcal{D}) is locally isometric to a Riemannian manifold.

Proof.

First, suppose that (M,F,𝒟)(M,F,\mathcal{D}) is locally isometric to a Riemannian manifold (M,g)(M,g). Then we can choose a local frame of orthonormal horizontal vector fields X1,,XkX_{1},\ldots,X_{k} with respect to the Riemannian metric gg. In this frame, we have

gradFh=i=1kg(gradh,Xi)Xi\mathrm{grad}_{F}h=\sum_{i=1}^{k}g(\mathrm{grad}_{h},X_{i})X_{i}

for any function hh on MM, and hence

ΔFh=i=1kdiv𝒟(g(gradh,Xi)Xi).\Delta_{F}h=\sum_{i=1}^{k}\mathrm{div}_{\mathcal{D}}(g(\mathrm{grad}_{h},X_{i})X_{i}).

Using the fact that the XiX_{i} form a basis for 𝒟\mathcal{D}, we can rewrite this as

ΔFh=div(gradh)=Δgh,\Delta_{F}h=\mathrm{div}(\mathrm{grad}_{h})=\Delta_{g}h,

where Δg\Delta_{g} is the Laplace-Beltrami operator associated with the Riemannian metric gg. Since Δg\Delta_{g} is zero if and only if (M,g)(M,g) is flat, it follows that ΔF\Delta_{F} is zero if and only if (M,F,𝒟)(M,F,\mathcal{D}) is locally isometric to a Riemannian manifold, which implies that the sub-Finslerian structure is also flat.

Conversely, suppose that ΔF\Delta_{F} is zero. Let X1,,XkX_{1},\ldots,X_{k} be a local frame of horizontal vector fields such that F(Xi)=1F(X_{i})=1 for all ii, and let ωij=g(Xi,Xj)\omega_{ij}=g(X_{i},X_{j}) be the Riemannian metric induced by FF on 𝒟\mathcal{D}. Using the definition of the sub-Laplacian and the fact that ΔF\Delta_{F} is zero, we have

0=ΔFF=div𝒟(gradFF)=i=1ki=1k2Fxixjωij,0=\Delta_{F}F=\mathrm{div}_{\mathcal{D}}(\mathrm{grad}_{F}F)=\sum_{i=1}^{k}\sum_{i=1}^{k}\frac{\partial^{2}F}{\partial x_{i}\partial x_{j}}\omega_{ij},

where x1,,xkx_{1},\ldots,x_{k} are local coordinates on MM that are adapted to 𝒟\mathcal{D} (i.e., X1,,XkX_{1},\ldots,X_{k} form a basis for the tangent space at each point). This implies that the Hessian of FF with respect to the Riemannian metric ω\omega is zero, so FF is locally affine with respect to ω\omega. In other words, (M,F,𝒟)(M,F,\mathcal{D}) is locally isometric to a Riemannian manifold. ∎

Remark 6.

In the above Theorem 2, we have shown that the sub-Laplacian ΔF\Delta_{F} measures the curvature of the sub-Finslerian structure. If ΔF\Delta_{F} is zero, then the sub-Finslerian manifold is locally isometric to a Riemannian manifold, and hence the sub-Finslerian structure is flat. If ΔF\Delta_{F} is nonzero, then the sub-Finslerian manifold is not locally isometric to a Riemannian manifold, and the sub-Finslerian structure is curved. This means that the shortest paths between two points on the manifold are not necessarily straight lines, and the geometry of the manifold is more complex than that of a Riemannian manifold.

Acknowledgements

The author gratefully acknowledges the helpful suggestions given by Dr. László Kozma during the preparation of the paper as well as his support for hosting me as a visiting research scholar at the University of Debrecen.

References

  • [1] A. Agrachev, D. Barilari, and U. Boscain, A comprehensive introduction to sub-Riemannian geometry. Cambridge Studies in Advanced Mathematics (2019).
  • [2] D. Bao, S.-S. Chern, and Z. Shen, An introduction to Riemann-Finsler geometry, Graduate Texts in Mathematics 200. Springer-Verlag, New York, (2000).
  • [3] L. M. Alabdulsada, L. Kozma, On the connection of sub-Finslerian geometry. Int. J. Geom. Methods Mod. Phys.16, no. supp02, 1941006, (2019).
  • [4] L. M. Alabdulsada, L. Kozma, Hopf-Rinow theorem of sub-Finslerian geometry. Rom. J. Math. Comput. Sci., 13(2), (2023).
  • [5] L. M. Alabdulsada, Geodesically complete sub-Finsler manifolds and sub-Hamiltonian dynamics. Submitted
  • [6] L. M. Alabdulsada, A note on the distributions in quantum mechanical systems. J. Phys.: Conf. Ser. 1999, 012112, (2021).
  • [7] L. M. Alabdulsada, Sub-Finsler Geometry and Non-positive Curvature in Hilbert Geometry. PhD thesis, University of Debrecen, Hungary, (2019).
  • [8] D. Barilari, U. Boscain, E. Le Donne, and M. Sigalotti, Sub-Finsler structures from the time-optimal control viewpoint for some nilpotent distributions. J. Dyn. Control Syst. 23(3), 547-575, (2017).
  • [9] V. N. Berestovskii, Homogeneous manifolds with intrinsic metric. I., Sib. Math. J. 29, 887-897, (1988).
  • [10] A. Bloch, L. Colombo, R. Gupta, and D. Martín de Diego, A geometric approach to the optimal control of nonholonomic mechanical systems, Analysis and Geometry in Control Theory and its Applications. INdAM 11, (Springer), pp. 35-64, (2015).
  • [11] O. Calin, D.-C. Chang, Sub-Riemannian Geometry: General Theory and Examples. Cambridge University Press, New York, (2009).
  • [12] C. Carathéodory, Untersuchungen über die Grundlagen der Termodynamik, Math. Ann., 67, 93-161, (1909).
  • [13] W.-L. Chow, Über Systeme von linearen partiellen Differentialgleichungen erster Ordnung. (German) Math. Ann. 117, 98-105, (1939).
  • [14] J.N. Clelland, C.G. Moseley, and G.R. Wilkens, Geometry of sub-Finsler Engel manifolds. Asian J. Math 11, no. 4, 699-726, (2007).
  • [15] J. Cortés, M. de León, D. Martín de Diego, and S. Martínez, Geometric description of Vakonomic and nonholonomic dynamics. Comparison of Solutions. SIAM J. Control Optim. 41, no. 5, 1389-1412, (2002).
  • [16] M. Gordina, T. Laetsch, Sub-Laplacians on Sub-Riemannian manifolds. Potential Anal 44, 811-837, (2016).
  • [17] Gromov, M. Carnot-Caratheodory spaces seen from within. In Sub-Riemannian geometry, vol. 144 of Progr. Math. Birkhäuser, Basel, 79-323, (1996).
  • [18] B. Langerock, Nonholonomic mechanics and connections over a bundle map. J. Phys. A, 34, 609-615, (2001).
  • [19] A. D. Lewis, Affine connections and distributions with applications to nonholonomic mechanics. Rep. Math. Phys. 42, no. 1-2, 135-164, (1998).
  • [20] C. López; E. Martínez, Sub-Finslerian metric associated to an optimal control system. SIAM J. Control Optim. 39, no. 3, 798-811, (2000).
  • [21] Kevin M. Lynch, Frank C. Park, Modern robotics. Cambridge University Press, (2017).
  • [22] L. Kozma, Holonomy structures in Finsler geometry, Handbook of Finsler Geometry ed. Antonelli (Kluwer), (2003).
  • [23] J. Mitchell, On Carnot-Carathéodory metrics, J. Different. Geom., 21, No. 1, 35-45, (1985).
  • [24] R. Montgomery, A tour of subriemannian geometries, their geodesics and applications. Mathematical Surveys and Monographs 91, Amer. Math. Soc., Providence, RI, (2002).
  • [25] P. K. Rashevsky, Any two points of a totally nonholonomic space may be connected by an admissible line. Uch. Zap. Ped. Inst. Im. Liebknechta, Ser. Phys. Math. (in Russian). 2: 83-94, (1938).