This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Three-to-one internal resonances in coupled harmonic oscillators with cubic nonlinearity

L. Di Gregorio, W. Lacarbonara
Abstract

We investigate a general system of two coupled harmonic oscillators with cubic nonlinearity, a model relevant to various structural engineering applications. As a concrete example, we consider the case of two oscillators obtained from the reduction of the wave propagation equations representing a cellular hosting structure with 1-dof resonators in each cell. Without damping, the system is Hamiltonian, with the origin as an elliptic equilibrium characterized by two distinct linear frequencies. To understand the dynamics, it is crucial to derive explicit analytic formulae for the nonlinear frequencies as functions of the physical parameters involved. In the small amplitude regime (perturbative case), we provide the first-order nonlinear correction to the linear frequencies. While this analytic expression was already derived for non-resonant cases, it is novel in the context of resonant or nearly resonant scenarios. Specifically, we focus on the 3:1 resonance, the only resonance involved in the first-order correction. Utilizing the Hamiltonian structure, we employ Perturbation Theory methods to transform the system into Birkhoff Normal Form up to order four. This involves converting the system into action-angle variables (symplectically rescaled polar coordinates), where the truncated Hamiltonian at order four depends on the actions and, due to the resonance, on one “slow” angle. By constructing suitable nonlinear and not close-to-the-identity coordinate transformations, we identify new sets of symplectic action-angle variables. In these variables, the resulting system is integrable up to higher-order terms, meaning it does not depend on the angles, and the frequencies are obtained from the derivatives of the energy with respect to the actions. This construction is highly dependent on the physical parameters, necessitating a detailed case analysis of the phase portrait, revealing up to six topologically distinct behaviors. In each configuration, we describe the nonlinear normal modes (elliptic/hyperbolic periodic orbits, invariant tori) and their stable and unstable manifolds of the truncated Hamiltonian. As an application, we examine wave propagation in metamaterial honeycombs with periodically distributed nonlinear resonators, evaluating the nonlinear effects on the bandgap particularly in the presence of resonances.


Acknowledgments Project ECS 0000024 Rome Technopole, CUP B83C22002820006, National Recovery and Resilience Plan (NRRP) Mission 4 Component 2 Investment 1.5, funded by the European Union - NextGenerationEU.


Funder Project funded under the National Recovery and Resilience Plan (NRRP), Mission 4 Component 2 Investment 1.5 - Call for tender No. 3277 of 30 December 2021 of the Italian Ministry of University and Research funded by the European Union - NextGenerationEU.

1 Introduction

Let us briefly recall the model introduced in [SW23jsv]. Figure 1 shows schematic view of the orthotropic plate model with the periodically distributed spider-web resonators. Each multi-frequency resonator should be meant as the multi-mass-spring system resulting from the multi-dof modal reduction of the infinite-dimensional resonator (i.e., the spider webs with a central mass, here represented in the figure, for the sake of graphical clarity, by a single mass-spring system instead of a set of mass-spring systems). The modal reduction is performed via the Galerkin projection method employing a number of mode shapes of the distributed-parameter resonators. Each resonator is represented by equivalent modal masses and modal springs.

Refer to caption
Figure 1: Schematic view of the orthotropic plate model with the periodically distributed spider-web resonators, see [SW23jsv] as reference.

The adopted plate theory (see [W]) with the elastic constants of the equivalent, homogenized orthotropic material describes the motion of the honeycomb with the attached resonators. By the Floquet-Bloch Theorem, which states that the solutions of the corresponding linear periodic resonators-plate system are quasi-periodic in space with the fundamental periodicity provided by the lattice period, the plate equation of motion can be projected onto the unit cell domain (i.e., the periodically repeated lattice unit). Then one obtains a system of 2N2N coupled second order ODEs, NN being the number of retained resonators modes. For the metamaterial lattice with an array of equally spaced single-dof resonators, i.e., N = 1, equations reduce to the following system of second order ODEs

(M~H(k~1,k~2)M~M~M~)(w~¨0z~¨0)+(K~H(k~1,k~2)00K~)(w~0z~0)=(0N~(3)z~03),\left(\begin{array}[]{cc}\tilde{M}_{H}(\tilde{k}_{1},\tilde{k}_{2})&\tilde{M}\\ \tilde{M}&\tilde{M}\end{array}\right)\left(\begin{array}[]{c}\ddot{\tilde{w}}_{0}\\ \ddot{\tilde{z}}_{0}\end{array}\right)+\left(\begin{array}[]{cc}\tilde{K}_{H}(\tilde{k}_{1},\tilde{k}_{2})&0\\ 0&\tilde{K}\end{array}\right)\left(\begin{array}[]{c}{\tilde{w}}_{0}\\ {\tilde{z}}_{0}\end{array}\right)=-\left(\begin{array}[]{c}0\\ \tilde{N}^{(3)}{\tilde{z}}_{0}^{3}\end{array}\right)\,, (1)

where w~0{\tilde{w}}_{0} and z~0{\tilde{z}}_{0} denote the nondimensional plate deflection and resonator relative motion at the origin of the fixed frame;

M~H(k~1,k~2):=43sin(k~12)sin(14(k~1+3k~2))k~1(k~1+3k~2)\tilde{M}_{H}(\tilde{k}_{1},\tilde{k}_{2}):=\frac{4\sqrt{3}\sin\left(\frac{\tilde{k}_{1}}{2}\right)\sin\left(\frac{1}{4}\left(\tilde{k}_{1}+\sqrt{3}\tilde{k}_{2}\right)\right)}{\tilde{k}_{1}\left(\tilde{k}_{1}+\sqrt{3}\tilde{k}_{2}\right)} (2)

and

K~H(k~1,k~2)=K~H(k~1,k~2;D~12,D~66,D~22):=M~H(k~1,k~2)[k~14+2k~12k~22(D~12+2D~66)+k~24D~22]\begin{split}&\tilde{K}_{H}(\tilde{k}_{1},\tilde{k}_{2})=\tilde{K}_{H}(\tilde{k}_{1},\tilde{k}_{2};\tilde{D}_{12},\tilde{D}_{66},\tilde{D}_{22}):=\tilde{M}_{H}(\tilde{k}_{1},\tilde{k}_{2})\left[\tilde{k}_{1}^{4}+2\tilde{k}_{1}^{2}\tilde{k}_{2}^{2}(\tilde{D}_{12}+2\tilde{D}_{66})+\tilde{k}_{2}^{4}\tilde{D}_{22}\right]\end{split} (3)

are the nondimensional modal mass and stiffness as functions of the nondimensional wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}), which stay within the irreducible Brillouin triangle \triangle (see Figure 2):

Refer to caption
Figure 2: The irreducible Brillouin triangle :=𝚪𝐗𝐌\triangle:=\bf{\Gamma}\overset{{\triangle}}{\bf{X}}\bf{M}. 𝚪=(0,0)\mathbf{\Gamma}=(0,0), 𝐗=(43π,0)\mathbf{X}=(\frac{4}{3}\pi,0), 𝐌=(π,π3)\mathbf{M}=(\pi,\frac{\pi}{\sqrt{3}}).

moreover

D~12=0.0815599,D~22=12.48,D~66=0.0000247357,\tilde{D}_{12}=0.0815599,\qquad\tilde{D}_{22}=12.48,\qquad\tilde{D}_{66}=0.0000247357\,,

are the nondimensional plate bending stiffness coefficients; finally N~(3)\tilde{N}^{(3)} is the nondimensional nonlinearity.

Actually we consider the more general system of ODEs

𝙼(v¨y¨)+𝙺(vy)=(M3v3N3y3),\mathtt{M}\left(\begin{array}[]{c}\ddot{v}\\ \ddot{y}\end{array}\right)+\mathtt{K}\left(\begin{array}[]{c}v\\ y\end{array}\right)=-\left(\begin{array}[]{c}M_{3}v^{3}\\ N_{3}y^{3}\end{array}\right)\,, (4)

where v(t),y(t)v(t),y(t) are unknown scalar functions, M3,N3M_{3},N_{3} are real coefficients, 𝙼\mathtt{M} is a symmetric positive definite 2×22\times 2 real matrix and 𝙺\mathtt{K} is a diagonal positive definite 2×22\times 2 real matrix.

Note that (1) is a particular case of (4) taking v=w~0v=\tilde{w}_{0}, z=z~0z=\tilde{z}_{0}, M3=0,M_{3}=0, N3=N~(3)N_{3}=\tilde{N}^{(3)} and

𝙼=(M~H(k~1,k~2)M~M~M~),𝙺=(K~H(k~1,k~2)00K~),\mathtt{M}=\left(\begin{array}[]{cc}\tilde{M}_{H}(\tilde{k}_{1},\tilde{k}_{2})&\tilde{M}\\ \tilde{M}&\tilde{M}\end{array}\right)\,,\qquad\mathtt{K}=\left(\begin{array}[]{cc}\tilde{K}_{H}(\tilde{k}_{1},\tilde{k}_{2})&0\\ 0&\tilde{K}\end{array}\right)\,, (5)

with M~H(k~1,k~2)\tilde{M}_{H}(\tilde{k}_{1},\tilde{k}_{2}) and K~H(k~1,k~2)\tilde{K}_{H}(\tilde{k}_{1},\tilde{k}_{2}) defined in (2) and (3), respectively.


The existing literature on Hamiltonian and dissipative systems covers various topics, including bifurcations, invariant manifolds, and homoclinic and heteroclinic orbits. In [Fontich23], the authors study a one-parameter family of 2-DOF Hamiltonian systems with an equilibrium point undergoing a Hamiltonian-Hopf bifurcation. They focus on invariant manifolds and the behavior of the splitting of 2D invariant manifolds in the presence of homoclinic orbits. Similarly, [Celletti13] presents a KAM theory for conformally symplectic dissipative systems, demonstrating that solutions with a fixed n-dimensional (Diophantine) frequency can be found by an a-posteriori approach adjusting the parameters.

In [Llave06], the authors develop numerical algorithms to compute invariant manifolds in quasi-periodically forced systems, focusing on invariant tori and their asymptotic invariant manifolds (whiskers). These algorithms utilize Newton’s method and power-matching expansions of parameterizations. [Cabre05] describes a method to establish the existence and regularity of invariant manifolds, simplifying the proof of the stable manifold theorem near hyperbolic points by using the implicit function theorem in Banach spaces.

[H16] proposes a unified approach to nonlinear modal analysis in dissipative oscillatory systems. This approach defines nonlinear normal modes (NNMs) and spectral submanifolds, emphasizing the importance of damping for accurate conclusions about them, and the reduced-order models they produce. Lastly, [HW95], [HW96] and [HW93] develop methods to detect orbits asymptotic to slow manifolds in perturbed Hamiltonian systems, revealing complex chaotic behaviors and the creation of homoclinic orbits in resonant Hamiltonian systems through geometric singular perturbation theory and Melnikov-type methods.

1.1 Main results

We are interested here in small amplitude solutions of (4). In the first approximation the system is linear with linear frequencies ω\omega_{-} and ω+\omega_{+} and the nonlinearity is a third order perturbation. If the linear frequencies are non vanishing, distinct and satisfy the non resonance condition 3ωω+3\omega_{-}\neq\omega_{+} the system can be integrated, for instance, using the multiple scales method, up to a smaller fifth order nonlinear remainder, see [SW23jsv]. In particular, [SW23jsv] provides explicit expressions for the nonlinear frequencies of the truncated system (obtained by disregarding the fifth order perturbation) as functions of the initial amplitudes. Moreover the effects on the bandgap were explored.
In [DL], we analytically estimated the applicability threshold of the perturbative argument, specifically the maximal admissible amplitude for which the above formula is valid. It was found that this applicability threshold decays to zero in the presence of resonances, more precisely when the ratio between the optical and acoustic frequencies is close to 3; indeed the 3:1 resonance is the only involved resonance in the first order correction.
The methodology used is based on techniques from Hamiltonian Perturbation Theory. Since the system is conservative, we study it as a Hamiltonian system. The origin is an elliptic equilibrium and we put the system in (complete) Birkhoff Normal Form up to order 4 (3 in the equations of motion). The Birkhoff Normal Form is a powerful tool in Hamiltonian Perturbation Theory that, through a suitable symplectic, close-to-the-identity nonlinear change of coordinates, simplifies the Hamiltonian. More precisely, after introducing action-angle variables111Essentially rescaled polar coordinates., in the non resonant case, the truncated system at order four is integrated, meaning its Hamiltonian depends only on the actions, which are constant of motion, and not on the angles. As a consequence the phase space of the truncated Hamiltonian is completely foliated by nonlinear normal modes (NNMs), which are two dimensional invariant tori filled with periodic/quasi-periodic orbits depending on whether the frequency ratio is rational/irrational. Moreover such tori are (constant) graphs over the angles. Finally the nonlinear frequencies of the truncated Hamiltonian are easily evaluated as the derivatives of the Hamiltonian, i.e. the energy, with respect to the two actions. This procedure, being perturbative in nature, only works in a ball of small radius ε\varepsilon around the origin. More precisely in [DL] we proved that there exists a constant c1c_{1}, which was explicitly estimated as function of the physical parameters, such that the smallness condition reads

εc1|σ|,whereσ:=ω+3ω.\varepsilon\leq c_{1}\sqrt{|\sigma|}\,,\qquad\mbox{where}\qquad\sigma:=\omega_{+}-3\omega_{-}\,. (6)

In contrast, the main aim of the present paper is to investigate what happens in the complementary regime, namely when the linear frequencies are in, or almost in, 3:1 resonance, specifically when

c1|σ|<εc_{1}\sqrt{|\sigma|}<\varepsilon (7)

and ε\varepsilon is small enough. In this case, only a resonant BNF is available. This means that, after introducing action-angle variables and a linear symplectic change of coordinates, the truncated Hamiltonian at order four, ^res\hat{\mathbb{H}}_{\rm res} (see (38)), depends on the actions and on one “slow” angle (as its associated frequency is small). The phase portrait becomes more complicated and interesting; its topology strongly depends on the values of the physical parameters. The phase space is still foliated by two dimensional NNMs (invariant tori) but many of them are no longer graphs over the angles as in the nonresonant case, exhibiting different topologies. Moreover, one dimensional NNMs appear such as: elliptic periodic orbits or even hyperbolic ones with their two dimensional (coinciding) stable and unstable manifolds. As the parameters vary, six possible topologically different phase portraits appear. An example is given in Figure 3.

Let us denote by J2{J_{2}} the action conjugated to the other angle, the “fast” one, which does not appear in ^res\hat{\mathbb{H}}_{\rm res}. Then J2{J_{2}} is a constant of motion for ^res\hat{\mathbb{H}}_{\rm res}. For every fixed value of J2{J_{2}}, ^res\hat{\mathbb{H}}_{\rm res} evaluated at J2=const{J_{2}}=const in the reduced bidimensional phase space containing only the slow angle and its conjugated action is a 1-degree-of-freedom Hamiltonian system. In this reduced system, the above two dimensional NNMs (invariant tori) correspond to one dimensional NNMs (periodic orbits), one dimensional NNMs (elliptic/hyperbolic periodic orbits) correspond to zero dimensional NNMs (elliptic/hyperbolic fixed points) and, finally, two dimensional (coinciding) stable and unstable manifolds correspond to one dimensional (coinciding) stable and unstable separatrices, respectively. Some examples are shown in Figures 3, 4 and 5.

Up to the singular222We call it singular since it is formed by all the points whose energy is singular, namely corresponds to some critical value of the Hamiltonian. set formed by the union of zero dimensional NNMs (equilibria) and one dimensional separatrices, the phase space of the reduced Hamiltonian is separated into two or four333According to the different values of the parameters. In Figure 3 a case with four regions is shown. open connected components having different topologies. Since the reduced system has one degree of freedom, on such connected components one can introduce suitable new action-angle coordinates, integrating the system. Recollecting, in these new variables, ^res\hat{\mathbb{H}}_{\rm res} depends only on the new actions and the nonlinear frequencies are simply obtained as the derivatives of the Hamiltonian with respect to the actions.

However, we note that, at this stage, the nonlinear frequencies take the form of elliptic integrals, which are not simple to explicitly evaluate since both the integrating functions and the domains strongly depend on parameters. Nevertheless, we calculate them by using suitable Moebius transformations.

Finally, having the explicit formulas available, we study the nonlinear bandgap in the resonant regime. We found that, while the nonlinearity far from resonances can significantly change the bandgap, in the resonant case, the effect of resonances results in a less pronounced variation in the bandgap.

Here we study in details the truncated Hamiltonian giving a very precise description of its phase space and explicitly integrating the system. The case of the complete Hamiltonian is different since the system is genuinely two dimensional and, therefore, not integrable444Since the fast angle appears at higher order terms and, therefore, its conjugated action J2{J_{2}} is not more a constant of motion.. However, using methods of KAM Theory one can prove the persistence of hyperbolic periodic orbits with their (local) stable and unstable manifolds as well as of the majority of invariant tori. Indeed, our analysis can bee seen as a necessary preparatory step toward applying KAM techniques in the resonant zones (see Remark 9).

Finally, we stress that our analysis is not limited to the case of the honeycomb metamaterials but applies directly to a wide range of problems modeled by two harmonic oscillators coupled with cubic nonlinearity as in equation (4).

Refer to captionRefer to caption
Refer to caption
Figure 3: (Left top) Level curves of the phase space of the reduced Hamiltonian obtained by the fourth order resonant ^res\hat{\mathbb{H}}_{\rm res} (see (38)) fixing the constant of motion J2=104{J_{2}}=10^{-4} (here, e.g., we have chosen the physical parameters as follows: k~1=4π/3,k~2=0,M~=0.005862,K~=1.73,M3=0,N3=104\tilde{k}_{1}=4\pi/3,\tilde{k}_{2}=0,\tilde{M}=0.005862,\tilde{K}=1.73,M_{3}=0,N_{3}=-10^{4}). The slow angle is on the horizontal axis and its (rescaled) conjugated action is on the vertical axis, so that the phase space is actually the cylinder shown on the right. Up to the green and black curves that act as separatrices, the phase space is divided into four connected components. Every component is completely foliated by one dimensional NNMs (periodic orbits). Such NNMs have different topology: the orbits in the zone above the green curve or between the green and solid black curves wrap around the cylinder (dash-dotted curves); the orbits inside the green or the black curves do not wrap around the cylinder and are contractible (dotted, blue and yellow curves). The red point and the green curve are, respectively, a zero dimensional NNM (a hyperbolic equilibrium) and its one dimensional (coinciding) stable and unstable separatrices. (Right) The cylindrical phase portrait immersed in the three dimensional space.
(Left bottom) A representation of the phase space of the truncated Hamiltonian ^res\hat{\mathbb{H}}_{\rm res}, once we have fixed the constant of motion J2=104{J_{2}}=10^{-4}. The image is obtained by rotating the picture on the top by the fast angle from 0 to 4π/34\pi/3. In particular, by rotation, the blue and yellow curves become two dimensional NNMs (invariant tori) and the red point and the green curve become, respectively, a one dimensional NNM (a hyperbolic periodic orbit) and its two dimensional (coinciding) stable and unstable manifolds.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: The same yellow and blue two dimensional NNMs of Figure 3 are plotted here in the modal subspaces (q˙1,q1,q2)(\dot{q}_{1},q_{1},q_{2}) on the left and (q˙2,q1,q2)(\dot{q}_{2},q_{1},q_{2}) on the right (see (9)). Note that, being manifolds, they do not have self-intersections in the complete four dimensional modal phase space (q˙1,q˙2,q1,q2)(\dot{q}_{1},\dot{q}_{2},q_{1},q_{2}). However one can plot only a projection on a three dimensional subspace, where self-intersections may occur.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5: The same red one dimensional NNM and its green separatrix of Figure 3 are plotted here in the modal subspaces (q˙1,q1,q2)(\dot{q}_{1},q_{1},q_{2}) on the left and (q˙2,q1,q2)(\dot{q}_{2},q_{1},q_{2}) on the right (see (9)). Note the triangular symmetry, which is particular evident in the green separatrix. It is due to the 3:1 resonance.
Refer to captionRefer to caption
Refer to caption
Figure 6: (Left top) Level curves of the reduced Hamiltonian obtained by the fourth order resonant ^res\hat{\mathbb{H}}_{\rm res} (see (38)) fixing the constant of motion J2=104{J_{2}}=10^{-4} (here, e.g., we have chosen the physical parameters as follows: k~1=2.27,k~2=0,M~=0.2,K~=1.1,M3=0,N3=104\tilde{k}_{1}=2.27,\tilde{k}_{2}=0,\tilde{M}=0.2,\tilde{K}=1.1,M_{3}=0,N_{3}=-10^{4}). The slow angle is on the horizontal axis and its (rescaled) conjugated action is on the vertical axis, so that the phase space is actually the cylinder shown on the right. Except for the solid black curve that acts as separatrix, the phase space is divided into two connected components. Every component is completely foliated by one dimensional NNMs (periodic orbits). Such NNMs have different topology: the orbits in the zone above the separatrix wrap around the cylinder (dashed curves); the orbits inside the separatrix do not wrap around the cylinder and are contractible (dotted and blue curves). (Right) The cylindrical phase portrait immersed in the three dimensional space.
(Left bottom) A representation of the phase space of the truncated Hamiltonian ^res\hat{\mathbb{H}}_{\rm res}, once we have fixed the constant of motion J2=104{J_{2}}=10^{-4}. The image is obtained by rotating the picture on the left by the fast angle from 0 to 4π/34\pi/3. In particular, by rotation, the blue and yellow curves become two dimensional NNMs (invariant tori) and the red point and the green curve become, respectively, a one dimensional NNM (a hyperbolic periodic orbit) and its two dimensional (coinciding) stable and unstable manifolds.
Refer to caption
Refer to caption
Figure 7: (On the left) For M~=0.2\tilde{M}=0.2, K~=1.1\tilde{K}=1.1, N3=104N_{3}=-10^{4} and ε104\varepsilon\sim 10^{-4}, according to the values of the wave numbers in the Brillouin triangle, we have different BNFs. In the white regions, one can construct a nonresonant BNF. In the blue and green regions, only a resonant BNF is available. In particular, in the green regions, the phase portrait of the Hamiltonian in BNF is as in Figure 3, while, in the blue regions, the phase portrait of the Hamiltonian in BNF is as in Figure 6. The red curves represent the pairs (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}) for which σ=0\sigma=0, namely when the exact 3:1 resonance occurs. (On the right) Here we fix k~1=2.58\tilde{k}_{1}=2.58, k~2=0\tilde{k}_{2}=0, and let (M~,K~)(\tilde{M},\tilde{K}) vary in the rectangle [0.05,0.3]×[1,5][0.05,0.3]\times[1,5]. A more quantitative and precise description will be given in Figure 11.
Refer to caption
Figure 8: Referring to Figure 7, six different phase portraits are shown as σ\sigma varies from 0.1-0.1 to 1.8-1.8. Note that in Figure 7 the red curves on the right corresponds to the exact 3:1 resonance σ=0\sigma=0. Moving on the left and passing through the green, blue, and, finally, white zone, the value of σ\sigma decreases and the topology of the phase space changes. For σ=0.1,0.4,0.7,1.1\sigma=-0.1,-0.4,-0.7,-1.1 (corresponding to wave numbers in the green region) the phase space has the same topology shown in Figure 3 with a hyperbolic point with its coinciding stable and unstable manifolds (green curve) and two periodic orbits not wrapping on the cylinder: the yellow curve, that soon contracts and disappears, and the blue curve, that reduces. For σ=1.4\sigma=-1.4 (corresponding to wave numbers in the blue region), the phase space has the same topology shown in Figure 6: the hyperbolic point does not exist since the green curve does not self-intersect anymore and becomes a simple periodic curve wrapping on the cylinder, while the blue curve becomes smaller and smaller. Finally for σ=1.8\sigma=-1.8 (corresponding to wave numbers in the white region), the blue curve disappears and only periodic curves wrapping on the cylinder survive, showing the typical behavior of integrable systems.

1.2 Summary of the paper

Section 2: the resonant Birkhoff Normal Form

We reinterpret the problem as a Hamiltonian system (see (11)). In Subsection 2.1, we put the system, close to the origin, in resonant BNF. Then, we examine the Hamiltonian truncated at fourth order, which is equivalent to third order in the equations of motion, as it captures the essential characteristics of the overall motion. Upon introducing action-angle variables it becomes evident that the truncated, or “effective”, Hamiltonian, after a suitable linear change of variables (see (36)), also depends on one angle, known as the “slow” angle (see (38)), as its associated frequency is small or even zero on the exact resonance.

After a suitable rescaling, the effective Hamiltonian, depending on the slow angle ψ[0,2π)\psi\in[0,2\pi) and on the non-dimensional action x(0,1)x\in(0,1), takes the form F(ψ,x)=12a2x2+a1x+b(x)cosψF(\psi,x)=\frac{1}{2}a_{2}x^{2}+a_{1}x+b(x)\cos\psi, where b(x)=(1x)3xb(x)=\sqrt{(1-x)^{3}x} and a1,a2a_{1},a_{2} depend on the physical parameters and on the other action (which is a constant of motion); see Subsection 2.2.

Section 3: the six possible phase portraits

The behavior of the system depends on the number and on the nature of the critical points of FF, which, in turn, depends on the values of a1a_{1} and a2a_{2}. The gradient of FF can vanish only on the lines {ψ=0}\{\psi=0\}, when a2x+a1+b(x)=0a_{2}x+a_{1}+b^{\prime}(x)=0, or {ψ=π}\{\psi=\pi\}, when a2x+a1b(x)=0a_{2}x+a_{1}-b^{\prime}(x)=0. At this point studying the solutions of these equations, as a1a_{1} and a2a_{2} vary, is crucial (see Figure 9). This identifies six zones in the plane (a1,a2)(a_{1},a_{2}), as detailed in Proposition 1, Lemma 5 and Figure 15. Correspondingly we have six possible configurations. When reached, the maximum of FF is attained on the line {ψ=0}\{\psi=0\}, conversely the minimum is attained on the line {ψ=π}\{\psi=\pi\}. E.g. let us briefly describe the scenario a1+a2<0a_{1}+a_{2}<0. By studying xF(0,x)x\to F(0,x) we have three possible cases: no critical points, a maximum and a minimum with negative energy, a maximum and a minimum with positive energy. On the other hand xF(π,x)x\to F(\pi,x) has a minimum. Note that the maximum of F(0,x)F(0,x) corresponds to a maximum for F(ψ,x)F(\psi,x), the minimum of F(0,x)F(0,x) corresponds to a saddle for F(ψ,x)F(\psi,x) and the minimum of F(π,x)F(\pi,x) corresponds to a minimum of F(ψ,x)F(\psi,x). Analogously, the complementary case a1+a2>0a_{1}+a_{2}>0 gives rise to three additional configurations.

Section 4: construction of the integrating action variable

Since the action conjugated to the “fast” angle is a constant of motion, the truncated system has two independent conserved quantities (the other one is the energy) and, therefore, is integrable (by the Arnold-Liouville Theorem), in the sense that one can find a new set of symplectic action-angle variables in which the new Hamiltonian depends only on the actions. Although the theoretical construction of the integrating action is classic, finding an explicit analytical expression as a function of all the physical parameters involved is rather complicated.

For every value of the energy EE, the new integrating action I1{\rm I}_{1} is given by the area enclosed by the level curve F(ψ,x)=EF(\psi,x)=E divided by 2π2\pi, see Section 4. Such level curves are closed and can either wrap around the cylinder [0,2π)×(0,1)[0,2\pi)\times(0,1) or remain confined to its surface without wrapping around it; see Figures 3 and 6.

Since FF is even in ψ\psi we can restrict to consider (ψ,x)[0,π]×(0,1)(\psi,x)\in[0,\pi]\times(0,1). In this set the level curves are graphs over xx and the area enclosed by them can be computed by an integral over xx, whose endpoints are the xx-coordinate of their intersections with the lines {ψ=0}\{\psi=0\} and {ψ=π}\{\psi=\pi\}. It turns out that these correspond to the roots 0<xj(E)<10<x_{j}(E)<1, with j=1,2,3,4j=1,2,3,4, of the quartic polynomial 𝐏(x;E)=(12a2x2+a1xE)2(b(x))2\mathbf{P}(x;E)=\big{(}\frac{1}{2}a_{2}x^{2}+a_{1}x-E\big{)}^{2}-(b(x))^{2}, see (55) and Figure 25. As the energy EE varies, it is necessary to distinguish whether 𝐏\mathbf{P} has 4,24,2 or 0 real roots555Note that, excluding the degenerate case of multiple roots, the number of real roots is even. and whether a root corresponds to an intersection with {ψ=0}\{\psi=0\} or {ψ=π}\{\psi=\pi\}. Explicit formulae for the roots are given in Subsection 3.5, see Figure 14.

Once we have defined the integrating action I1{\rm I}_{1} as a function of EE (and of the “dumb” action, let us say, I2{\rm I}_{2}), the resulting integrated Hamiltonian will be its inverse E=E(I1,I2)E=E({\rm I}_{1},{\rm I}_{2}). The nonlinear frequencies are given by the derivatives of the energy with respect to the actions, see (100), expressed through integrals, see Proposition 2. Such integrals are evaluated by suitable Moebius transformations in terms of elliptic functions, see Subsections 4.3 and 4.4.

Section 5: evaluation of the nonlinear bandgap for the honeycomb metamaterial

Finally, having the explicit formulas for the nonlinear frequencies available, we discuss the nonlinear bandgap for the honeycomb metamaterial, especially in the resonant regime. We found that, while nonlinear effects far from resonances can significantly alter the bandgap, in the resonant case the nonlinear frequencies, especially the acoustic one, closely align with the linear frequencies, resulting in a less pronounced variation in the bandgap.

2 The Hamiltonian structure and resonant BNF

In this section, after introducing optical and acoustic modes, we identify the system in (4) as Hamiltonian, see (11) below, and we evaluate the coefficients of the Hamiltonian, see (22). Set

𝚲:=(ω200ω+2),\mathbf{\Lambda}:=\left(\begin{array}[]{cc}\omega_{-}^{2}&0\\ 0&\omega_{+}^{2}\end{array}\right)\,,

where ω2<ω+2\omega_{-}^{2}<\omega_{+}^{2} are the positive eigenvalues of 𝙼1𝙺\mathtt{M}^{-1}\mathtt{K} and 0<ω<ω+0<\omega_{-}<\omega_{+}. Since 𝙼\mathtt{M} is symmetric and 𝙺\mathtt{K} is diagonal, there exists a 2×22\times 2 matrix 𝚽\mathbf{\Phi} such that

𝚽T𝙼𝚽=𝐈,𝚽T𝙺𝚽=𝚲,𝚽=(ϕ1ϕ1+ϕ2ϕ2+),\mathbf{\Phi}^{T}\mathtt{M}\mathbf{\Phi}=\mathbf{I}\,,\qquad\mathbf{\Phi}^{T}\mathtt{K}\mathbf{\Phi}=\mathbf{\Lambda}\,,\qquad\mathbf{\Phi}=\left(\begin{array}[]{cc}\phi_{1}^{-}&\phi_{1}^{+}\\ \phi_{2}^{-}&\phi_{2}^{+}\end{array}\right)\,, (8)

where 𝐈\mathbf{I} is the identity matrix. Consider the change of variables

(vy)=𝚽𝐪,𝐪:=(q1q2).\left(\begin{array}[]{c}v\\ y\end{array}\right)=\mathbf{\Phi}\mathbf{q}\,,\qquad\mathbf{q}:=\left(\begin{array}[]{c}q_{1}\\ q_{2}\end{array}\right)\,. (9)

By Lemma 8 the system in (4) is transformed into

𝐪¨+𝚲𝐪=𝐜(𝐪),𝐜(𝐪)=(c1c2):=𝚽T(M3(ϕ1q1+ϕ1+q2)3N3(ϕ2q1+ϕ2+q2)3).\ddot{\mathbf{q}}+\mathbf{\Lambda}\mathbf{q}=\mathbf{c}(\mathbf{q})\,,\qquad\mathbf{c}(\mathbf{q})=\left(\begin{array}[]{c}c_{1}\\ c_{2}\end{array}\right):=-\mathbf{\Phi}^{T}\left(\begin{array}[]{c}M_{3}(\phi_{1}^{-}q_{1}+\phi_{1}^{+}q_{2})^{3}\\ N_{3}(\phi_{2}^{-}q_{1}+\phi_{2}^{+}q_{2})^{3}\\ \end{array}\right)\,. (10)

In particular

c1=ϕ1M3(ϕ1q1+ϕ1+q2)3ϕ2N3(ϕ2q1+ϕ2+q2)3c2=ϕ1+M3(ϕ1q1+ϕ1+q2)3ϕ2+N3(ϕ2q1+ϕ2+q2)3.\begin{array}[]{ll}c_{1}=-\phi_{1}^{-}M_{3}(\phi_{1}^{-}q_{1}+\phi_{1}^{+}q_{2})^{3}-\phi_{2}^{-}N_{3}(\phi_{2}^{-}q_{1}+\phi_{2}^{+}q_{2})^{3}\\ c_{2}=-\phi_{1}^{+}M_{3}(\phi_{1}^{-}q_{1}+\phi_{1}^{+}q_{2})^{3}-\phi_{2}^{+}N_{3}(\phi_{2}^{-}q_{1}+\phi_{2}^{+}q_{2})^{3}\,.\end{array}

Introducing the momenta 𝐪˙=𝐩=(p1p2)\dot{\mathbf{q}}=\mathbf{p}=\left(\begin{array}[]{c}p_{1}\\ p_{2}\end{array}\right), the system in (10) is Hamiltonian with Hamiltonian

H(𝐩,𝐪)=12(p12+p22)+12ω2q12+12ω+2q22+f(q),H(\mathbf{p},\mathbf{q})=\frac{1}{2}(p_{1}^{2}+p_{2}^{2})+\frac{1}{2}\omega_{-}^{2}q_{1}^{2}+\frac{1}{2}\omega_{+}^{2}q_{2}^{2}+f(q)\,, (11)

where

f(𝐪):=14M3(ϕ1q1+ϕ1+q2)4+14N3(ϕ2q1+ϕ2+q2)4.f(\mathbf{q}):=\frac{1}{4}M_{3}(\phi_{1}^{-}q_{1}+\phi_{1}^{+}q_{2})^{4}+\frac{1}{4}N_{3}(\phi_{2}^{-}q_{1}+\phi_{2}^{+}q_{2})^{4}\,. (12)

Indeed it is immediate to see that the Hamilton’s equations 𝐩˙=𝐪H\dot{\mathbf{p}}=-\partial_{\mathbf{q}}H, 𝐪˙=𝐩H=𝐩\dot{\mathbf{q}}=\partial_{\mathbf{p}}H=\mathbf{p} are equivalent to the system in (10). Since f(𝐪)f(\mathbf{q}) is a homogeneous polynomial of degree 44 we write

f(𝐪)=i+j=4fi,jq1iq2j,withfi,j:=6i!j!((ϕ1)i(ϕ1+)jM3+(ϕ2)i(ϕ2+)jN3).f(\mathbf{q})=\sum_{i+j=4}f_{i,j}q_{1}^{i}q_{2}^{j}\,,\qquad\mbox{with}\quad f_{i,j}:=\frac{6}{i!j!}\Big{(}(\phi_{1}^{-})^{i}(\phi_{1}^{+})^{j}M_{3}+(\phi_{2}^{-})^{i}(\phi_{2}^{+})^{j}N_{3}\Big{)}\,. (13)

Introducing coordinates 𝐐=(Q1,Q2)\mathbf{Q}=(Q_{1},Q_{2}), 𝐏=(P1,P2)\mathbf{P}=(P_{1},P_{2}) through

p1=ωP1p2=ω+P2q1=1ωQ1q2=1ω+Q2p_{1}=\sqrt{\omega_{-}}P_{1}\quad p_{2}=\sqrt{\omega_{+}}P_{2}\quad q_{1}=\frac{1}{\sqrt{\omega_{-}}}Q_{1}\quad q_{2}=\frac{1}{\sqrt{\omega_{+}}}Q_{2} (14)

we have that the Hamiltonian in the new variables reads

𝙷(𝐏,𝐐):=ωP12+Q122+ω+P22+Q222+f(Q1ω,Q2ω+).\mathtt{H}(\mathbf{P},\mathbf{Q}):=\omega_{-}\frac{P_{1}^{2}+Q_{1}^{2}}{2}+\omega_{+}\frac{P_{2}^{2}+Q_{2}^{2}}{2}+f\left(\frac{Q_{1}}{\sqrt{\omega_{-}}},\frac{Q_{2}}{\sqrt{\omega_{+}}}\right)\,. (15)

In complex coordinates, i=1\mathrm{i}=\sqrt{-1}\in\mathbb{C}, 𝐳=(z1,z2)2\mathbf{z}=(z_{1},z_{2})\in\mathbb{C}^{2}

zj=Qj+iPj2z¯j=QjiPj2j=1,2z_{j}=\frac{Q_{j}+\mathrm{i}P_{j}}{\sqrt{2}}\quad\bar{z}_{j}=\frac{Q_{j}-\mathrm{i}P_{j}}{\sqrt{2}}\qquad j=1,2 (16)

the Hamiltonian reads

𝙷(𝐳,𝐳¯)=𝙽(𝐳,𝐳¯)+𝙶(𝐳,𝐳¯)\mathtt{H}(\mathbf{z},\bar{\mathbf{z}})=\mathtt{N}(\mathbf{z},\bar{\mathbf{z}})+{\mathtt{G}}(\mathbf{z},\bar{\mathbf{z}}) (17)

where

𝙽(𝐳,𝐳¯):=ωz1z¯1+ω+z2z¯2,𝙶(𝐳,𝐳¯):=f(z1+z¯12ω,z2+z¯22ω+).\mathtt{N}(\mathbf{z},\bar{\mathbf{z}}):=\omega_{-}z_{1}\bar{z}_{1}+\omega_{+}z_{2}\bar{z}_{2}\,,\qquad{\mathtt{G}}(\mathbf{z},\bar{\mathbf{z}}):=f\left(\frac{z_{1}+\bar{z}_{1}}{\sqrt{2\omega_{-}}},\frac{z_{2}+\bar{z}_{2}}{\sqrt{2\omega_{+}}}\right)\,. (18)

Note that in complex coordinates the Hamilton’s equations of motion are

z˙j=iz¯j𝙷,z¯˙j=izj𝙷.\dot{z}_{j}=-\mathrm{i}\partial_{\bar{z}_{j}}\mathtt{H}\,,\ \ \dot{\bar{z}}_{j}=\mathrm{i}\partial_{z_{j}}\mathtt{H}\,. (19)

In the following we use the multi-index notation

P(𝐳,𝐳¯)=(𝜶,𝜷)2×2P𝜶,𝜷𝐳𝜶𝐳¯𝜷P(\mathbf{z},\bar{\mathbf{z}})=\sum_{({\bm{\alpha}},{\bm{\beta}})\in\mathbb{N}^{2}\times\mathbb{N}^{2}}P_{{\bm{\alpha}},{\bm{\beta}}}\mathbf{z}^{\bm{\alpha}}\bar{\mathbf{z}}^{\bm{\beta}} (20)

for suitable coefficients P𝜶,𝜷P_{{\bm{\alpha}},{\bm{\beta}}}\in\mathbb{C} with 𝐳𝜶=z1α1z2α2\mathbf{z}^{\bm{\alpha}}=z_{1}^{\alpha_{1}}z_{2}^{\alpha_{2}} (analogously for 𝐳¯𝜷\bar{\mathbf{z}}^{\bm{\beta}}). In these notation, recalling (13) and (18), we rewrite 𝙶{\mathtt{G}} as666Where, for integer vectors 𝜶=(α1,α2),𝜷=(β1,β2){\bm{\alpha}}=(\alpha_{1},\alpha_{2}),{\bm{\beta}}=(\beta_{1},\beta_{2}) we set |𝜶+𝜷|:=α1+α2+β1+β2|{\bm{\alpha}}+{\bm{\beta}}|:=\alpha_{1}+\alpha_{2}+\beta_{1}+\beta_{2}.

𝙶(𝐳,𝐳¯)=i+j=4fi,j4(ω)i(ω+)j(z1+z¯1)i(z2+z¯2)j=|𝜶+𝜷|=4𝙶𝜶,𝜷𝐳𝜶𝐳¯𝜷{\mathtt{G}}(\mathbf{z},\bar{\mathbf{z}})=\sum_{i+j=4}\frac{f_{i,j}}{4(\sqrt{\omega_{-}})^{i}(\sqrt{\omega_{+}})^{j}}(z_{1}+\bar{z}_{1})^{i}(z_{2}+\bar{z}_{2})^{j}=\sum_{|{\bm{\alpha}}+{\bm{\beta}}|=4}{\mathtt{G}}_{{\bm{\alpha}},{\bm{\beta}}}\mathbf{z}^{\bm{\alpha}}\bar{\mathbf{z}}^{\bm{\beta}} (21)

where

𝙶𝜶,𝜷:=fα1+β1,α2+β24(ω)α1+β1(ω+)α2+β2(α1+β1)!α1!β1!(α2+β2)!α2!β2!.{\mathtt{G}}_{{\bm{\alpha}},{\bm{\beta}}}:=\frac{f_{\alpha_{1}+\beta_{1},\alpha_{2}+\beta_{2}}}{4(\sqrt{\omega_{-}})^{\alpha_{1}+\beta_{1}}(\sqrt{\omega_{+}})^{\alpha_{2}+\beta_{2}}}\frac{(\alpha_{1}+\beta_{1})!}{\alpha_{1}!\beta_{1}!}\frac{(\alpha_{2}+\beta_{2})!}{\alpha_{2}!\beta_{2}!}\,. (22)

Note that 𝙶𝜶,𝜷=𝙶𝜷,𝜶{\mathtt{G}}_{{\bm{\alpha}},{\bm{\beta}}}={\mathtt{G}}_{{\bm{\beta}},{\bm{\alpha}}}\in\mathbb{R}.

2.1 Resonant BNF

The aim of the BNF is to construct a symplectic change of variables that “simplifies” the Hamiltonian 𝙷\mathtt{H} in (17). First note that a Hamiltonian HH depending only on |z1|2|z_{1}|^{2} and |z2|2|z_{2}|^{2} writes H=𝜶H𝜶,𝜶|𝐳|2𝜶H=\sum_{{\bm{\alpha}}}H_{{\bm{\alpha}},{\bm{\alpha}}}|\mathbf{z}|^{2{\bm{\alpha}}} and is integrable; in particular |z1|2|z_{1}|^{2} and |z2|2|z_{2}|^{2} are constants of motion. In light of the above considerations we guess if it is possible to find, in a sufficiently small neighborhood of the origin

𝐳ϵ,\|\mathbf{z}\|\leq\epsilon\,, (23)

a close-to-the-identity symplectic transformation that “integrates” 𝙷\mathtt{H} up to terms of degree 6 in (𝐳,𝐳¯)(\mathbf{z},\bar{\mathbf{z}}), which are smaller. This amounts to transform 𝙷\mathtt{H} into 𝙽+𝙷¯4+O(𝐳6){\mathtt{N}}+\bar{\mathtt{H}}_{4}+O(\|\mathbf{z}\|^{6}), with

𝙷¯4:=|𝜶|=2𝙶𝜶,𝜶|𝐳|2𝜶=𝙶(2,0),(2,0)|z1|4+𝙶(1,1),(1,1)|z1|2|z2|2+𝙶(0,2),(0,2)|z2|4,\bar{\mathtt{H}}_{4}:=\sum_{|{\bm{\alpha}}|=2}{\mathtt{G}}_{{\bm{\alpha}},{\bm{\alpha}}}|\mathbf{z}|^{2{\bm{\alpha}}}={\mathtt{G}}_{(2,0),(2,0)}|z_{1}|^{4}+{\mathtt{G}}_{(1,1),(1,1)}|z_{1}|^{2}|z_{2}|^{2}+{\mathtt{G}}_{(0,2),(0,2)}|z_{2}|^{4}\,, (24)

where, recalling (22),

𝙶(2,0),(2,0)\displaystyle{\mathtt{G}}_{(2,0),(2,0)} =\displaystyle= 3f4,02ω2=38ω2((ϕ1)4M3+(ϕ2)4N3),\displaystyle\frac{3f_{4,0}}{2\omega_{-}^{2}}=\frac{3}{8\omega_{-}^{2}}\Big{(}(\phi_{1}^{-})^{4}M_{3}+(\phi_{2}^{-})^{4}N_{3}\Big{)}\,,
𝙶(1,1),(1,1)\displaystyle{\mathtt{G}}_{(1,1),(1,1)} =\displaystyle= f2,2ωω+=32ωω+((ϕ1)2(ϕ1+)2M3+(ϕ2)2(ϕ2+)2N3),\displaystyle\frac{f_{2,2}}{\omega_{-}\omega_{+}}=\frac{3}{2\omega_{-}\omega_{+}}\Big{(}(\phi_{1}^{-})^{2}(\phi_{1}^{+})^{2}M_{3}+(\phi_{2}^{-})^{2}(\phi_{2}^{+})^{2}N_{3}\Big{)}\,,
𝙶(0,2),(0,2)\displaystyle{\mathtt{G}}_{(0,2),(0,2)} =\displaystyle= 3f0,42ω+2=38ω+2((ϕ1+)4M3+(ϕ2+)4N3).\displaystyle\frac{3f_{0,4}}{2\omega_{+}^{2}}=\frac{3}{8\omega_{+}^{2}}\Big{(}(\phi_{1}^{+})^{4}M_{3}+(\phi_{2}^{+})^{4}N_{3}\Big{)}\,. (25)

As well known, this is possible if the nonresonance condition ω+k1+ωk20\omega_{+}k_{1}+\omega_{-}k_{2}\neq 0 is satisfied for every couple of integers k1,k2k_{1},k_{2} with |k1|+|k2|=4|k_{1}|+|k_{2}|=4 and ϵ\epsilon is small enough. It is simple to show (see, e.g. Proposition 1 in [DL]) that

min|k1|+|k2|=4|ω+k1+ωk2|min{ω,ω+ω,|3ωω+|}.\min_{|k_{1}|+|k_{2}|=4}|\omega_{+}k_{1}+\omega_{-}k_{2}|\geq\min\{\omega_{-},\omega_{+}-\omega_{-},|3\omega_{-}-\omega_{+}|\}\,.

While, by hypothesis, ω,ω+ω>0\omega_{-},\omega_{+}-\omega_{-}>0, σ=ω+3ω\sigma=\omega_{+}-3\omega_{-} (introduced in (6)) could be zero or small. It turns out that there exists a constant C1C_{1} (see [DL] for a proof and the evaluation of C1C_{1}) such that, if

ϵC1|σ|,\epsilon\leq C_{1}\sqrt{|\sigma|}\,, (26)

then it is possible to construct a symplectic transformation putting 𝙷\mathtt{H} in (complete) BNF up to order 4, namely 𝙽+𝙷¯4+O(𝐳6){\mathtt{N}}+\bar{\mathtt{H}}_{4}+O(\|\mathbf{z}\|^{6}). Otherwise, if |σ||\sigma| is too small with respect to777In particular we can assume that |σ|min{ω,ω+ω}|\sigma|\leq\min\{\omega_{-},\omega_{+}-\omega_{-}\}. ϵ\epsilon, namely if ϵ>C1|σ|\epsilon>C_{1}\sqrt{|\sigma|}, but ϵ\epsilon still satisfies a suitable (weaker888With C2>C1|σ|C_{2}>C_{1}\sqrt{|\sigma|}.) smallness condition ϵC2\epsilon\leq C_{2}, only a resonant BNF is available. This means that, in the case

C1|σ|ϵC2,C_{1}\sqrt{|\sigma|}\leq\epsilon\leq C_{2}\,, (27)

through a symplectic transformation, the Hamiltonian takes the form 𝙽+𝙷¯4,res+O(𝐳6){\mathtt{N}}+\bar{\mathtt{H}}_{4,{\rm res}}+O(\|\mathbf{z}\|^{6}), where

𝙷¯4,res:=𝙷¯4+𝙶(0,1),(3,0)z2z¯13+𝙶(3,0),(0,1)z13z¯2=(22)𝙷¯4+f3,14(ω)3ω+(z2z¯13+z13z¯2).\bar{\mathtt{H}}_{4,{\rm res}}:=\bar{\mathtt{H}}_{4}+\mathtt{G}_{(0,1),(3,0)}z_{2}\bar{z}_{1}^{3}+\mathtt{G}_{(3,0),(0,1)}z_{1}^{3}\bar{z}_{2}\stackrel{{\scriptstyle\eqref{Gab}}}{{=}}\bar{\mathtt{H}}_{4}+\frac{f_{3,1}}{4(\sqrt{\omega_{-}})^{3}\sqrt{\omega_{+}}}(z_{2}\bar{z}_{1}^{3}+z_{1}^{3}\bar{z}_{2})\,. (28)
Remark 1.

The construction of the above symplectic transformation in the resonant case was given in [DL], where the remainder O(𝐳6)O(\|\mathbf{z}\|^{6}) was explicitly estimated. This means that we found a concrete constant cc_{*} depending on the parameters such that O(𝐳6)cϵ6O(\|\mathbf{z}\|^{6})\leq c_{*}\epsilon^{6}.

Remark 2.

ϵ\epsilon introduced in (23) is simply related to ε\varepsilon introduced in (6) by the change of variables (9), (14), (16). This means that there exist two constants c¯<c¯\underline{c}<\bar{c} such that c¯ϵ/εc¯\underline{c}\leq\epsilon/\varepsilon\leq\bar{c}. Then (26) and (27) justify (6) and (7), respectively.

We now introduce action-angle variables999𝕋:=/2π\mathbb{T}:=\mathbb{R}/{2\pi\mathbb{Z}}, 𝕋2:=2/2π2\mathbb{T}^{2}:=\mathbb{R}^{2}/{2\pi\mathbb{Z}^{2}}. (𝐈,𝝋)=(I1,I2,φ1,φ2)2×𝕋2(\mathbf{I},{\bm{\varphi}})=(I_{1},I_{2},\varphi_{1},\varphi_{2})\in\mathbb{R}^{2}\times\mathbb{T}^{2} through the transformation

zj=Ijeiφj,Ij>0,j=1,2.z_{j}=\sqrt{I_{j}}e^{-\mathrm{i}\varphi_{j}}\,,\qquad I_{j}>0\,,\qquad j=1,2. (29)
Remark 3.

Note that the above map is singular at z1\mathrm{z}_{1} or z2=0\mathrm{z}_{2}=0 and is defined for I1,I2>0I_{1},I_{2}>0.

In the symplectic variables in (29) the truncated Hamiltonians 𝙽+𝙷4{\mathtt{N}}+{\mathtt{H}}_{4} and 𝙽+𝙷¯4,res{\mathtt{N}}+\bar{\mathtt{H}}_{4,{\rm res}} take the final forms

^res(𝐈,𝝋)\displaystyle\hat{\mathcal{H}}_{\rm res}(\mathbf{I},{\bm{\varphi}}) :=\displaystyle:= ^(𝐈,𝝋)+f3,12(ω)3ω+I13I2cos(φ23φ1),,\displaystyle\hat{\mathcal{H}}(\mathbf{I},{\bm{\varphi}})+\frac{f_{3,1}}{2(\sqrt{\omega_{-}})^{3}\sqrt{\omega_{+}}}\sqrt{I_{1}^{3}I_{2}}\cos(\varphi_{2}-3\varphi_{1})\,,\,, (30)
^(𝐈,𝝋)\displaystyle\hat{\mathcal{H}}(\mathbf{I},{\bm{\varphi}}) :=\displaystyle:= ωI1+ω+I2+4(𝐈)\displaystyle\omega_{-}I_{1}+\omega_{+}I_{2}+\mathcal{H}_{4}(\mathbf{I}) (31)
4(𝐈)\displaystyle\mathcal{H}_{4}(\mathbf{I}) :=\displaystyle:= 𝙶(2,0),(2,0)I12+𝙶(1,1),(1,1)I1I2+𝙶(0,2),(0,2)I22.\displaystyle{\mathtt{G}}_{(2,0),(2,0)}I_{1}^{2}+{\mathtt{G}}_{(1,1),(1,1)}I_{1}I_{2}+{\mathtt{G}}_{(0,2),(0,2)}I_{2}^{2}\,. (32)

The frequencies of the integrable nonresonant truncated Hamiltonian ^\hat{\mathcal{H}} in (31) are the derivatives of the energy with respect to the actions, namely, by (32),

ωnl\displaystyle\omega_{-}^{\rm nl} :=\displaystyle:= I1^=ω+2𝙶(2,0),(2,0)I1+𝙶(1,1),(1,1)I2,\displaystyle\partial_{I_{1}}\hat{\mathcal{H}}=\omega_{-}+2{\mathtt{G}}_{(2,0),(2,0)}I_{1}+{\mathtt{G}}_{(1,1),(1,1)}I_{2}\,,
ω+nl\displaystyle\omega_{+}^{\rm nl} :=\displaystyle:= I2^=ω++2𝙶(0,2),(0,2)I2+𝙶(1,1),(1,1)I1,\displaystyle\partial_{I_{2}}\hat{\mathcal{H}}=\omega_{+}+2{\mathtt{G}}_{(0,2),(0,2)}I_{2}+{\mathtt{G}}_{(1,1),(1,1)}I_{1}\,,

In particular, when M3=0M_{3}=0, by (25) we have

ωnl\displaystyle\omega_{-}^{\rm nl} =\displaystyle= ω+N3(38ω(ϕ2)4a2+34ω(ϕ2)2(ϕ2+)2a+2),\displaystyle\omega_{-}+N_{3}\left(\frac{3}{8\omega_{-}}(\phi_{2}^{-})^{4}a_{-}^{2}+\frac{3}{4\omega_{-}}(\phi_{2}^{-})^{2}(\phi_{2}^{+})^{2}a_{+}^{2}\right)\,,
ω+nl\displaystyle\omega_{+}^{\rm nl} =\displaystyle= ω++N3(38ω+(ϕ2+)4a+2+34ω+(ϕ2)2(ϕ2+)2a2),\displaystyle\omega_{+}+N_{3}\left(\frac{3}{8\omega_{+}}(\phi_{2}^{+})^{4}a_{+}^{2}+\frac{3}{4\omega_{+}}(\phi_{2}^{-})^{2}(\phi_{2}^{+})^{2}a_{-}^{2}\right)\,, (33)

where a,a+>0a_{-},a_{+}>0 are the initial amplitudes. Note that in the original variables q1q_{1} and q2q_{2}, one has

q1(0)=a,q2(0)=a+,p˙1(0)=p˙2(0)=0,q_{1}(0)=a_{-}\,,\quad q_{2}(0)=a_{+}\,,\quad\dot{p}_{1}(0)=\dot{p}_{2}(0)=0\,, (34)

that correspond, by (14), (16) and (29), in initial action-angle variables:

I1(0)=12ωa2,I2(0)=12ω+a+2,φ1(0)=φ2(0)=0.I_{1}(0)=\frac{1}{2}\omega_{-}a_{-}^{2}\,,\qquad I_{2}(0)=\frac{1}{2}\omega_{+}a_{+}^{2}\,,\qquad\varphi_{1}(0)=\varphi_{2}(0)=0\,. (35)

Formula (2.1) was already known (see [SW23jsv] or [DL]), but it does not hold close to resonances. To obtain the analogous of formula (2.1) in the resonant case is much more complicated since one has to integrate the Hamiltonian ^res\hat{\mathcal{H}}_{\rm res} in (30). This is exactly what we are going to do in the following sections. The analogous of (2.1) in the resonant case are the formula (153)-(156) below.

Remark 4 (Reversibility).

Since the Hamiltonian ^res(𝐈,𝛗)\hat{\mathcal{H}}_{\rm res}(\mathbf{I},{\bm{\varphi}}) in (30) is even in 𝛗{\bm{\varphi}} the system is reversible, namely if (𝐈(t),𝛗(t))\big{(}\mathbf{I}(t),{\bm{\varphi}}(t)\big{)} is a solution the same holds true for (𝐈(t),𝛗(t))\big{(}\mathbf{I}(-t),-{\bm{\varphi}}(-t)\big{)}. In particular if 𝛗(0)=0{\bm{\varphi}}(0)=0 the solution is even in the actions and odd in the angles, namely 𝐈(t)=𝐈(t)\mathbf{I}(t)=\mathbf{I}(-t) and 𝛗(t)=𝛗(t){\bm{\varphi}}(t)=-{\bm{\varphi}}(-t).

2.2 The slow angle and the effective Hamiltonian

It is convenient to introduce the adimensional effective Hamiltonian FF depending solely on one angle ψ1\psi_{1}, namely the “slow angle”. Let us consider the canonical transformation

Φ:2×𝕋22×𝕋2Φ(J,ψ)=(I,𝝋):=(TJ,1ψ)=(3110)\Phi_{*}:\mathbb{R}^{2}\times\mathbb{T}^{2}\to\mathbb{R}^{2}\times\mathbb{T}^{2}\qquad\Phi_{*}(J,\psi)=(I,{\bm{\varphi}}):=(\mathcal{M}^{T}J,\mathcal{M}^{-1}\psi)\qquad\mathcal{M}=\left(\begin{array}[]{cc}-3&1\\ 1&0\end{array}\right)

so that

{I1=J23J1I2=J1,{φ1=ψ2φ2=ψ1+3ψ2.\left\{\begin{array}[]{l}I_{1}=J_{2}-3J_{1}\\ I_{2}=J_{1}\,,\end{array}\right.\qquad\left\{\begin{array}[]{l}\varphi_{1}=\psi_{2}\\ \varphi_{2}=\psi_{1}+3\psi_{2}\,.\end{array}\right. (36)

Note that \mathcal{M} has integer entries and det=1\mathrm{det}\mathcal{M}=-1 so that the inverse 1\mathcal{M}^{-1} has also integer entries. This implies that ψ=φ\psi=\mathcal{M}\varphi and its inverse φ=1ψ\varphi=\mathcal{M}^{-1}\psi are well defined on the torus 𝕋2\mathbb{T}^{2}. Note also that, by (29), we have

J2>3J1,J1>0.J_{2}>3J_{1}\,,\qquad J_{1}>0\,. (37)

Let us write ^res\hat{\mathcal{H}}_{{\rm res}} in (30) in the (J,ψ)(J,\psi)-variables

^res(J,ψ1):=^res(Φ(J,ψ1)):=ωJ2+σJ1+4,res(J,ψ1),\hat{\mathbb{H}}_{\rm res}(J,\psi_{1}):=\hat{\mathcal{H}}_{{\rm res}}\big{(}\Phi_{*}(J,\psi_{1})\big{)}:=\omega_{-}J_{2}+\sigma J_{1}+\mathbb{H}_{4,{\rm res}}(J,\psi_{1})\,, (38)

where

4,res(J,ψ1):=4(J23J1,J1)+f3,12(ω)3ω+(J23J1)3J1cos(ψ1).\mathbb{H}_{4,{\rm res}}(J,\psi_{1}):=\mathcal{H}_{4}(J_{2}-3J_{1},J_{1})+\frac{f_{3,1}}{2(\sqrt{\omega_{-}})^{3}\sqrt{\omega_{+}}}\sqrt{(J_{2}-3J_{1})^{3}J_{1}}\cos(\psi_{1})\,. (39)

Note that ^res\hat{\mathbb{H}}_{\rm res} is reversible in the sense of Remark 4. Moreover it depends only on the “slow angle” ψ1\psi_{1}, that evolves by a small frequency σ+O(|J|)0\sigma+O(|J|)\sim 0 (recall (7)), but does not depend on the “fast angle” ψ2\psi_{2}, that, on the contrary, evolves by a frequency ω>0\omega_{-}>0, which is definitively different from zero. So the partial derivative w.r.t. ψ2\psi_{2} of ^res\hat{\mathbb{H}}_{\rm res} vanishes and, by the Hamilton’s equations, J˙2=0\dot{J}_{2}=0, so that J2J_{2} is a constant of motion, namely

J2(t)=J2(0)=:J2.J_{2}(t)=J_{2}(0)=:{J_{2}}\,.

Moreover the fast angle ψ2\psi_{2} simply evolves as ψ2(t)=ωt+ψ2(0)\psi_{2}(t)=\omega_{-}t+\psi_{2}(0). It remains to study the evolution of the (J1,ψ1)(J_{1},\psi_{1}) variables.

Being J2J_{2} a constant of motion the dynamic of the “resonant truncated Hamiltonian” ^res\hat{\mathbb{H}}_{\rm res} in (38) is simply generated by the one-degree-of-freedom “effective Hamiltonian”

J2(J1,ψ1):=σJ1+4,res(J1,J2,ψ1),\mathbb{H}_{{J_{2}}}(J_{1},\psi_{1}):=\sigma J_{1}+\mathbb{H}_{4,{\rm res}}(J_{1},{J_{2}},\psi_{1})\,, (40)

with 4,res\mathbb{H}_{4,{\rm res}} defined in (39). At this point it is convenient to introduce the “adimensional Hamiltonian”101010Also χ\chi is adimensional.

H^(J1,ψ1)=H^J2(J1,ψ1):=1χJ22J2(J1,ψ1),withχ:=f3,123(ω)3ω+0,\hat{H}(J_{1},\psi_{1})=\hat{H}_{J_{2}}(J_{1},\psi_{1}):=\frac{1}{\chi{J_{2}^{2}}}\mathbb{H}_{{J_{2}}}(J_{1},\psi_{1})\,,\qquad\mbox{with}\ \ \chi:=\frac{f_{3,1}}{2\sqrt{3}(\sqrt{\omega_{-}})^{3}\sqrt{\omega_{+}}}\neq 0\,, (41)

and rewrite H^J2\hat{H}_{J_{2}} as a function of the “adimensional action”

x:=3J1/J2with0<x<1,x:=3J_{1}/{J_{2}}\qquad\mbox{with}\quad 0<x<1\,, (42)

by (37). We have the following

Lemma 1.

It results that

H^(J1,ψ1)=H^J2(J1,ψ1)=F(ψ1,3J1/J2;J2)+a0,\hat{H}(J_{1},\psi_{1})=\hat{H}_{J_{2}}(J_{1},\psi_{1})=F(\psi_{1},3J_{1}/{J_{2}};{J_{2}})+a_{0}\,, (43)

where

F(ψ,x)=F(ψ,x;J2)\displaystyle F(\psi,x)=F(\psi,x;{J_{2}}) :=\displaystyle:= a(x;J2)+b(x)cosψ,\displaystyle a(x;{J_{2}})+b(x)\cos\psi\,,
a(x)=a(x;J2)\displaystyle a(x)=a(x;J_{2}) :=\displaystyle:= 12a2x2+a1x,\displaystyle\frac{1}{2}a_{2}x^{2}+a_{1}x\,,
b(x)\displaystyle b(x) :=\displaystyle:= (1x)3x>0,0<x<1,\displaystyle\sqrt{(1-x)^{3}x}>0\,,\qquad 0<x<1\,,
a0\displaystyle a_{0} :=\displaystyle:= 𝙶(2,0),(2,0)χ,\displaystyle\frac{{\mathtt{G}}_{(2,0),(2,0)}}{\chi}\,,
a1\displaystyle a_{1} =\displaystyle= a1(J2):=2𝙶(2,0),(2,0)χ+𝙶(1,1),(1,1)3χ+ω+3ω3J2χ,\displaystyle a_{1}({J_{2}}):=-2\frac{{\mathtt{G}}_{(2,0),(2,0)}}{\chi}+\frac{{\mathtt{G}}_{(1,1),(1,1)}}{3\chi}+\frac{\omega_{+}-3\omega_{-}}{3{J_{2}}\chi}\,,
a2\displaystyle a_{2} :=\displaystyle:= 2𝙶(2,0),(2,0)χ2𝙶(1,1),(1,1)3χ+2𝙶(0,2),(0,2)9χ.\displaystyle 2\frac{{\mathtt{G}}_{(2,0),(2,0)}}{\chi}-2\frac{{\mathtt{G}}_{(1,1),(1,1)}}{3\chi}+2\frac{{\mathtt{G}}_{(0,2),(0,2)}}{9\chi}\,. (44)

proof. Multiplying the right hand side of (43) by χJ22\chi{J_{2}^{2}} we have, by (1),

χJ22F(ψ1,3J1/J2)+χJ22a0=χJ22a(3J1/J2)+χJ22a0+χJ22b(3J1/J2)cosψ1\displaystyle\chi{J_{2}^{2}}\,F(\psi_{1},3J_{1}/{J_{2}})+\chi{J_{2}^{2}}\,a_{0}=\chi{J_{2}^{2}}\,a(3J_{1}/{J_{2}})+\chi{J_{2}^{2}}\,a_{0}+\chi{J_{2}^{2}}\,b(3J_{1}/{J_{2}})\cos\psi_{1}
=92χa2J12+3χa1J1J2+χJ22a0+χ3(J23J1)3J1cosψ1\displaystyle\qquad=\frac{9}{2}\chi a_{2}J_{1}^{2}+3\chi a_{1}J_{1}{J_{2}}+\chi{J_{2}^{2}}\,a_{0}+\chi\sqrt{3({J_{2}}-3J_{1})^{3}J_{1}}\cos\psi_{1}
=𝙶(2,0),(2,0)(J23J1)2+𝙶(1,1),(1,1)J1(J23J1)+𝙶(0,2),(0,2)J22\displaystyle\qquad={\mathtt{G}}_{(2,0),(2,0)}({J_{2}}-3J_{1})^{2}+{\mathtt{G}}_{(1,1),(1,1)}J_{1}({J_{2}}-3J_{1})+{\mathtt{G}}_{(0,2),(0,2)}{J_{2}^{2}}
+σJ1+χ3(J23J1)3J1cosψ1\displaystyle\qquad\quad+\sigma J_{1}+\chi\sqrt{3({J_{2}}-3J_{1})^{3}J_{1}}\cos\psi_{1}
=(32)4(J23J1,J1)+σJ1+χ3(J23J1)3J1cosψ1\displaystyle\qquad\stackrel{{\scriptstyle\eqref{pollo}}}{{=}}\mathcal{H}_{4}(J_{2}-3J_{1},J_{1})+\sigma J_{1}+\chi\sqrt{3({J_{2}}-3J_{1})^{3}J_{1}}\cos\psi_{1}
=(39),(41)4,res(J,ψ1)+σJ1\displaystyle\quad\ \stackrel{{\scriptstyle\eqref{connessione},\eqref{pluto}}}{{=}}\mathbb{H}_{4,{\rm res}}(J,\psi_{1})+\sigma J_{1}
=(40)J2(J1,ψ1)=(41)χJ22H^J2(J1,ψ1),\displaystyle\qquad\stackrel{{\scriptstyle\eqref{1DHAM}}}{{=}}\mathbb{H}_{{J_{2}}}(J_{1},\psi_{1})\stackrel{{\scriptstyle\eqref{pluto}}}{{=}}\chi{J_{2}^{2}}\hat{H}_{J_{2}}(J_{1},\psi_{1})\,,

proving (43). \Box

Remark 5.

Note that F(ψ,x;J2)F(\psi,x;{J_{2}}) depends on J2{J_{2}} only through a(x;J2)a(x;{J_{2}}), which depends on J2{J_{2}} only through a1(J2)a_{1}({J_{2}}). Moreover at the exact resonance ω+=3ω\omega_{+}=3\omega_{-} the dependence on J2{J_{2}} disappears.

Since ψ2\psi_{2} does not appear in H^\hat{H}, the conjugated action J2{J_{2}} is a constant of motion and H^\hat{H} is actually a one degree of freedom Hamiltonian system depending on J2{J_{2}} as a parameter. From now on we consider the one degree of freedom Hamiltonian H^(J1,ψ1)=H^J2(J1,ψ1)\hat{H}(J_{1},\psi_{1})=\hat{H}_{J_{2}}(J_{1},\psi_{1}) on the phase space (0,3J2)×𝕋(J1,ψ1)(0,3{J_{2}})\times\mathbb{T}\ni(J_{1},\psi_{1}) with 𝕋:=/2π\mathbb{T}:=\mathbb{R}/2\pi\mathbb{Z}.

3 The phase portrait

In this section we study the phase portrait of the adimensional Hamiltonian H^\hat{H} in (41) describing level curves, critical points and extrema. An important remark, that simplifies the treatment, is the fact that, thanks to (43), H^\hat{H} has, up to the rescalings x=3J1/J2x=3J_{1}/J_{2} and F=H^+a0F=\hat{H}+a_{0}, the same level curves, critical points and extrema as the auxiliary function FF in (1). Such objects are studied in Subsections 3.1, 3.2 and 3.3, respectively. As usual, the new action coordinates, that integrate the system, are defined as the areas enclosed by the level curves. In order to evaluate them its important to determine the intersections between the level curves and the lines {ψ=0}\{\psi=0\} and {ψ=π}\{\psi=\pi\} since they appear as endpoints of the involved integrals. It turns out that such intersections correspond to the real roots of the quartic polynomial 𝐏(x)\mathbf{P}(x), see (55) and Figure 25. As the energy EE varies, it is necessary to distinguish whether 𝐏\mathbf{P} has 4,24,2 or 0 real roots111111Note that, excluding the degenerate case of multiple roots, the number of real roots is even. and whether a root corresponds to an intersection with {ψ=0}\{\psi=0\} or {ψ=π}\{\psi=\pi\}. Explicit formulae for the roots are given in Subsection 3.5, see Figure 14. In Subsections 3.6 and 3.7 as the parameters vary, six topologically different scenarios appear.

3.1 Critical points, elliptic and hyperbolic zones

We now describe how the critical point of H^\hat{H} depends on the values of the parameters a2a_{2} and a1a_{1} in (1). First we note that (J1,ψ1)(J_{1},\psi_{1}) is a critical point of H^\hat{H} if and only if (ψ,x):=(ψ1,3J1/J2)(\psi,x):=(\psi_{1},3J_{1}/J_{2}) is a critical point of the auxiliary function F(ψ,x)=a(x)+b(x)cosψF(\psi,x)=a(x)+b(x)\cos\psi defined on 𝕋×(0,1)\mathbb{T}\times(0,1) (recall (1)). Moreover the nature of a critical point (maximum, minimum or saddle) is the same for H^\hat{H} and FF. Then in the following we will study critical points of FF as the parameters a2a_{2} and a1a_{1} vary.

It is immediate to see that, since

xF(ψ,x)=a(x)+b(x)cosψ,ψF(ψ,x)=b(x)sinψ\partial_{x}F(\psi,x)=a^{\prime}(x)+b^{\prime}(x)\cos\psi\,,\qquad\partial_{\psi}F(\psi,x)=-b(x)\sin\psi

and b(x)>0b(x)>0, the critical points of FF have the form (x,0)(x,0) with a(x)+b(x)=0a^{\prime}(x)+b^{\prime}(x)=0 or (x,π)(x,\pi) with a(x)b(x)=0a^{\prime}(x)-b^{\prime}(x)=0. Namely

F(0,x)=0\displaystyle\nabla F(0,x)=0\ \displaystyle\iff a2xa1=b(x),\displaystyle\ -a_{2}x-a_{1}=b^{\prime}(x)\,, (45)
F(π,x)=0\displaystyle\nabla F(\pi,x)=0\ \displaystyle\iff a2x+a1=b(x)\displaystyle\ a_{2}x+a_{1}=b^{\prime}(x)\, (46)

where

b(x)=(14x)1x2x.b^{\prime}(x)=\frac{(1-4x)\sqrt{1-x}}{2\sqrt{x}}. (47)

The number of solutions of equations (45),(46) depends on the parameters a1,a2a_{1},a_{2}.

Set

g(a1):=127(9+4a122a1)(94a124a19+4a12)g(a_{1}):=\frac{1}{27}\textstyle\big{(}\sqrt{9+4a_{1}^{2}}-2a_{1}\big{)}\big{(}9-4a_{1}^{2}-4a_{1}\sqrt{9+4a_{1}^{2}}\big{)} (48)

and121212Note that g(a1)>a1g(a_{1})>-a_{1}.

Z10\displaystyle Z_{10} :=\displaystyle:= {(a1,a2):a2<g(a1)},\displaystyle\{(a_{1},a_{2})\ \ :\ \ a_{2}<-g(-a_{1})\}\,,
Z12\displaystyle Z_{12} :=\displaystyle:= {(a1,a2):g(a1)<a2<a1},\displaystyle\{(a_{1},a_{2})\ \ :\ \ -g(-a_{1})<a_{2}<-a_{1}\}\,,
Z21\displaystyle Z_{21} :=\displaystyle:= {(a1,a2):a1<a2<g(a1)},\displaystyle\{(a_{1},a_{2})\ \ :\ \ -a_{1}<a_{2}<g(a_{1})\}\,,
Z01\displaystyle Z_{01} :=\displaystyle:= {(a1,a2):a2>g(a1).\displaystyle\{(a_{1},a_{2})\ \ :\ \ a_{2}>g(a_{1})\,. (49)
Refer to caption
Figure 9: Intersections between the function b(x)b^{\prime}(x) and the straight line a2x+a1a_{2}x+a_{1} for a fixed a1a_{1} and different values of a2a_{2}. The green line, passing through (1,0)(1,0), and the purple one, which is tangent to b(x)b^{\prime}(x), separate the half plane x>0x>0 in three regions, in which the lines intersect 1,2 or 0 times the curve b(x)b^{\prime}(x). More precisely for a2>a1a_{2}>-a_{1} there is one intersection, for g(a1)<a2<a1-g(-a_{1})<a_{2}<-a_{1} there are two intersections and none for a2<g(a1)a_{2}<-g(-a_{1}).
Refer to caption
Figure 10: The four zones Z10,Z12,Z21,Z01Z_{10},Z_{12},Z_{21},Z_{01} in the space (a1,a2)(a_{1},a_{2}). See formula (49).

In particular the following result holds

Proposition 1.

If (a1,a2)Zij(a_{1},a_{2})\in Z_{ij} then F(0,x)F(0,x) has ii critical points and F(π,x)F(\pi,x) jj critical points. More precisely:

  • If (a1,a2)Z10(a_{1},a_{2})\in Z_{10} then F(0,x)F(0,x) has a positive maximum at some x1(0)x_{1}^{(0)} and F(π,x)F(\pi,x) is strictly decreasing;

  • If (a1,a2)Z01(a_{1},a_{2})\in Z_{01} then F(π,x)F(\pi,x) has a negative minimum at some x1(π)x_{1}^{(\pi)} and F(0,x)F(0,x) is strictly increasing;

  • If (a1,a2)Z12(a_{1},a_{2})\in Z_{12} then F(0,x)F(0,x) has a positive maximum at some x1(0)x_{1}^{(0)}, while F(π,x)F(\pi,x) has a negative minimum at some x1(π)x_{1}^{(\pi)} and a maximum at some x2(π)x_{2}^{(\pi)}, with x1(π)<x2(π)x_{1}^{(\pi)}<x_{2}^{(\pi)};

  • If (a1,a2)Z21(a_{1},a_{2})\in Z_{21} then F(0,x)F(0,x) has a positive maximum at some x1(0)x_{1}^{(0)} and a minimum at some x2(0)x_{2}^{(0)}, with x1(0)<x2(0)x_{1}^{(0)}<x_{2}^{(0)}, while F(π,x)F(\pi,x) has a negative minimum at some x1(π)x_{1}^{(\pi)}.

As a corollary, if (a1,a2)Zij(a_{1},a_{2})\in Z_{ij} then FF has ii critical points of the form (0,x)(0,x) and jj critical points of the form (π,x)(\pi,x). More precisely:

  • If (a1,a2)Z10(a_{1},a_{2})\in Z_{10} then FF has a positive maximum at (0,x1(0))(0,x_{1}^{(0)});

  • If (a1,a2)Z01(a_{1},a_{2})\in Z_{01} then FF has a negative minimum at (π,x1(π))(\pi,x_{1}^{(\pi)});

  • If (a1,a2)Z12(a_{1},a_{2})\in Z_{12} then FF has a positive maximum at (0,x1(0))(0,x_{1}^{(0)}), a negative minimum at (π,x1(π))(\pi,x_{1}^{(\pi)}) and a saddle at (π,x2(π))(\pi,x_{2}^{(\pi)});

  • If (a1,a2)Z21(a_{1},a_{2})\in Z_{21} then FF has a positive maximum at (0,x1(0))(0,x_{1}^{(0)}), a saddle at (0,x2(0))(0,x_{2}^{(0)}) and a negative minimum at (π,x1(π))(\pi,x_{1}^{(\pi)}).

proof. See Appendix. \Box

We call Z21Z_{21}, Z12Z_{12} hyperbolic zones, since they contain hyperbolic equilibria, and Z01Z_{01}, Z10Z_{10} elliptic zones, since they contain only elliptic equilibria. For any fixed pair (M~,K~)(\tilde{M},\tilde{K}), it is possible to identify which wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}) in the Brillouin triangle give rise to resonant normal forms with different phase portraits. In particular if the corresponding values of a1a_{1} and a2a_{2} belong to Z21Z_{21}, Z12Z_{12}, then the phase portrait contains one hyperbolic and two elliptic equilibria, while, for a1a_{1} and a2a_{2} belonging to Z01Z_{01}, Z10Z_{10} only elliptic equilibria appear (see Figure 11). For brevity we denote by BNF of type ZijZ_{ij} the corresponding Birkhoff Normal Form.

Refer to caption
Refer to caption
Figure 11: (On the left) For M~=0.2\tilde{M}=0.2, K~=1.1\tilde{K}=1.1, N3=104N_{3}=-10^{4}, one can take C1C_{1} and C2C_{2} in (27) as C1=5×104C_{1}=5\times 10^{-4} and C2=2.5×103C_{2}=2.5\times 10^{-3}. Choose ϵ=6.5×104\epsilon=6.5\times 10^{-4}. Then the smallness condition (26) for the nonresonant BNF is satisfied only when |σ|1.7|\sigma|\geq 1.7. On the other hand for |σ|<1.7|\sigma|<1.7 only a resonant BNF is available. The first condition is satisfied only by wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}) in the white regions in the Brillouin triangle. On the contrary, in the green regions we have a resonant BNF of type Z12Z_{12} while, in the blue regions, we have a resonant BNF of type Z01Z_{01} or Z10Z_{10}. Actually, close to the red curves, representing the exact 3:1 resonance, there is also a very tiny strip (of width 10610^{-6}, not shown in the figure) corresponding to a BNF of type Z21Z_{21}. (On the right) Here we fix k~1=2.58\tilde{k}_{1}=2.58, k~2=0\tilde{k}_{2}=0, and let (M~,K~)(\tilde{M},\tilde{K}) vary in the rectangle [0.05,0.3]×[1,5][0.05,0.3]\times[1,5].
Refer to caption
Figure 12: For M~=0.146\tilde{M}=0.146, K~=5.73\tilde{K}=5.73, N3=104N_{3}=-10^{4}, one can take C1C_{1} and C2C_{2} in (27) as C1=9×104C_{1}=9\times 10^{-4} and C2=4.5×103C_{2}=4.5\times 10^{-3}. Choose ϵ=1.2×103\epsilon=1.2\times 10^{-3}. Then the smallness condition (26) for the nonresonant BNF is satisfied only when |σ|1.78|\sigma|\geq 1.78. On the other hand for |σ|<1.78|\sigma|<1.78 only a resonant BNF is available. The first condition is satisfied only by wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}) in the white regions in the Brillouin triangle. On the contrary, in the green regions we have a resonant BNF of type Z12Z_{12} while, in the blue regions, we have a resonant BNF of type Z01Z_{01} or Z10Z_{10}. Actually, close to the red curves, representing the exact 3:1 resonance, there is also a very tiny strip (of width 10610^{-6}, not shown in the figure) corresponding to a BNF of type Z21Z_{21}. Note that the point 𝐗\bf X is on an exact 3:1 resonance.
Remark 6.

In the following for simplicity we restrict to the case in which (a1,a2)Zij(a_{1},a_{2})\in Z_{ij} for some 0i,j20\leq i,j\leq 2. This means that we avoid the degenerate cases a2+a1=0a_{2}+a_{1}=0, when x=1x=1 is a solution of (45)-(46), a2g(a1)=0a_{2}-g(a_{1})=0 and a2+g(a1)=0a_{2}+g(-a_{1})=0, when two solutions coincide. We will briefly discuss such degenerate cases in Subsection 3.8.

Refer to caption
Figure 13: Frequency response curve: critical values of xx as function of σ\sigma and their linear stability. The black/blue lines correspond to marginally stable equilibria (maxima/minima). The red dotted curve corresponds to unstable equilibria (saddles). Here J2=104,k1=4π/3,k2=0,N3=104,M3=0,M~=0.15{J_{2}}=10^{-4},k_{1}=4\pi/3,k_{2}=0,N_{3}=-10^{4},M_{3}=0,\tilde{M}=0.15 and K~\tilde{K} is chosen as a suitable function of σ\sigma; more precisely K~(σ)\tilde{K}(\sigma) is the inverse function of K~(ω+3ω)\tilde{K}\to(\omega_{+}-3\omega_{-}). The four different zones correspond, from left to right, to Z10,Z12,Z21,Z01Z_{10},Z_{12},Z_{21},Z_{01}.

3.2 Extrema

We now discuss the extrema of FF in (1) and their dependence on the parameters a1,a2a_{1},a_{2}. Following the notation of Proposition 1 we set

Emax\displaystyle E_{\rm max} :=\displaystyle:= F(0,x1(0)),if(a1,a2)Z10,Z12,Z21,\displaystyle F(0,x_{1}^{(0)})\,,\qquad{\rm if}\ \ (a_{1},a_{2})\in Z_{10},\ Z_{12},\ Z_{21}\,,
Emin\displaystyle E_{\rm min} :=\displaystyle:= F(π,x1(π)),if(a1,a2)Z01,Z12,Z21,\displaystyle F(\pi,x_{1}^{(\pi)})\,,\qquad{\rm if}\ \ (a_{1},a_{2})\in Z_{01},\ Z_{12},\ Z_{21}\,,
Esad\displaystyle E_{\rm sad} :=\displaystyle:= F(0,x2(0)),if(a1,a2)Z21,\displaystyle F(0,x_{2}^{(0)})\,,\qquad\,{\rm if}\ \ (a_{1},a_{2})\in Z_{21}\,,
Esad\displaystyle E_{\rm sad} :=\displaystyle:= F(π,x2(π)),if(a1,a2)Z12.\displaystyle F(\pi,x_{2}^{(\pi)})\,,\qquad{\rm if}\ \ (a_{1},a_{2})\in Z_{12}\,. (50)

Then define

E+:=sup𝕋×(0,1)F,E:=inf𝕋×(0,1)F.E_{+}:=\sup_{\mathbb{T}\times(0,1)}F\,,\qquad E_{-}:=\inf_{\mathbb{T}\times(0,1)}F\,. (51)

Since F(π,x)<F(ψ,x)<F(0,x)F(\pi,x)<F(\psi,x)<F(0,x) for every 0<x<10<x<1, 0<ψ<2π0<\psi<2\pi, ψπ\psi\neq\pi, we have that

E+:=sup(0,1)F(0,x),E:=inf(0,1)F(π,x).E_{+}:=\sup_{(0,1)}F(0,x)\,,\qquad E_{-}:=\inf_{(0,1)}F(\pi,x)\,.

Note that

E<0<E+,E_{-}<0<E_{+}\,,

since F(0,0)=F(0,π)=0F(0,0)=F(0,\pi)=0 and F(π,x)<0<F(0,x)F(\pi,x)<0<F(0,x) for x>0x>0 small enough since limx0+xF(0,x)=+\lim_{x\to 0^{+}}\partial_{x}F(0,x)=+\infty and limx0+xF(π,x)=\lim_{x\to 0^{+}}\partial_{x}F(\pi,x)=-\infty. Note that in the cases Z10,Z12Z_{10},Z_{12} we have E+=EmaxE_{+}=E_{\rm max}, since the function xF(0,x)x\to F(0,x) has only one critical point (a maximum); analogously in the cases Z01,Z21Z_{01},Z_{21} we have E=EminE_{-}=E_{\rm min}, since the function xF(π,x)x\to F(\pi,x) has only one critical point (a minimum). Moreover E+=a(1)E_{+}=a(1) in the case Z01Z_{01}; indeed the function xF(0,x)=a(x)+b(x)x\to F(0,x)=a(x)+b(x) has no critical points then E+=max{a(0)+b(0),a(1)+b(1)}=max{a(0),a(1)}E_{+}=\max\{a(0)+b(0),a(1)+b(1)\}=\max\{a(0),a(1)\}, moreover a(x)+b(x)a(x)+b(x) is increasing close to zero since limx0+(a(x)+b(x))=+\lim_{x\to 0^{+}}\big{(}a^{\prime}(x)+b^{\prime}(x)\big{)}=+\infty. Analogously E=a(1)E_{-}=a(1) in the case Z10Z_{10}. Finally in the case Z21Z_{21} we have E+=max{a(1),Emax}E_{+}=\max\{a(1),E_{\rm max}\}, since the function xF(0,x)x\to F(0,x) has a maximum at x1(0)x_{1}^{(0)} and a saddle at x2(0)x_{2}^{(0)} with x1(0)<x2(0)x_{1}^{(0)}<x_{2}^{(0)}. Analogously in the case Z12Z_{12} we have E=min{a(1),Emin}E_{-}=\min\{a(1),E_{\rm min}\}.

3.3 Level curves

Since FF is even with respect to ψ\psi we can reduce to consider the “half phase space” [0,π]×(0,1)[0,\pi]\times(0,1). Take an energy E<E<E+E_{-}<E<E_{+} with EEmax,Emin,EsadE\neq E_{\rm max},E_{\rm min},E_{\rm sad}, and consider the level set {F=E}\{F=E\}. If (ψ0,x0){F=E}(\psi_{0},x_{0})\in\{F=E\}, namely F(ψ0,x0)=EF(\psi_{0},x_{0})=E, since (ψ0,x0)(\psi_{0},x_{0}) is not a critical point (being EEmax,Emin,EsadE\neq E_{\rm max},E_{\rm min},E_{\rm sad} and recalling Proposition 1 and (3.2)), we can locally131313Namely in a sufficiently small neighborood of (ψ0,x0)(\psi_{0},x_{0}). express {F=E}\{F=E\} as a curve by the implicit function theorem. In particular, in the half phase space [0,π]×(0,1)[0,\pi]\times(0,1), we can always express ψ\psi as a function of xx, indeed the equation F(ψ,x)=a(x)+b(x)cosψ=EF(\psi,x)=a(x)+b(x)\cos\psi=E has the unique solution

ψ(x)=ψ(x;E;J2)=arccos(Ea(x)b(x)).\psi(x)=\psi(x;E;{J_{2}})=\arccos\left(\frac{E-a(x)}{b(x)}\right)\,. (52)

Since the domain of definition of the arccos\arccos is [1,1][-1,1], the domain of ψ(x)\psi(x) is

D:={x(0,1)|b(x)Ea(x)b(x)}.D:=\{x\in(0,1)\ |\ -b(x)\leq E-a(x)\leq b(x)\}\,.

We now discuss the structure of DD. Consider first the case in which 0 is an accumulation point for DD; then it must be E=0E=0. Indeed, taking the limit for x0+x\to 0^{+}, xDx\in D in the inequality b(x)Ea(x)b(x)-b(x)\leq E-a(x)\leq b(x), we get E=0E=0. Moreover, when E=0E=0,

limx0+ψ(x;0)=limx0+arccos(a(x)/b(x))=arccos(0)=π/2.\lim_{x\to 0^{+}}\psi(x;0)=\lim_{x\to 0^{+}}\arccos\big{(}-a(x)/b(x)\big{)}=\arccos(0)=\pi/2\,. (53)
Claim 1.

1 cannot be an accumulation point for DD, since we are assuming that a2+a10a_{2}+a_{1}\neq 0 (recall Remark 6).

proof. Indeed assume, by contradiction, that 1 is an accumulation point for DD. Then taking the limit for x1x\to 1^{-}, xDx\in D, in the inequality b(x)Ea(x)b(x)-b(x)\leq E-a(x)\leq b(x) we get E=a(1)E=a(1). Substituting E=a(1)E=a(1) in the above inequality and dividing by 1x1-x we get

x(1x)a(1)a(x)1xx(1x),x[x0,1).-\sqrt{x(1-x)}\leq\frac{a(1)-a(x)}{1-x}\leq\sqrt{x(1-x)}\,,\qquad\forall x\in[x_{0},1)\,.

Taking again the limit for x1x\to 1^{-} we get 0=a(1)=a2+a10=a^{\prime}(1)=a_{2}+a_{1}, which contradicts the assumption a2+a10a_{2}+a_{1}\neq 0. \Box

As a consequence, assuming E0E\neq 0, we have that DD is a compact set contained in (0,1)(0,1); moreover it is not difficult to see that it is formed by a finite number of closed intervals (possibly isolated points), whose endpoints satisfy one of the equations

a(x)E=b(x).a(x)-E=\mp b(x)\,. (54)

This amounts to find the roots of the quartic polynomial

𝐏(x)=(a(x)E)2(b(x))2=(12a2x2+a1xE)2(1x)3x=0,\mathbf{P}(x)=(a(x)-E)^{2}-(b(x))^{2}=\left(\frac{1}{2}a_{2}x^{2}+a_{1}x-E\right)^{2}-(1-x)^{3}x=0\,, (55)

with 0<x<10<x<1.

Lemma 2.

If EE is not a critical energy for141414Namely the energy of a critical point of FF. FF, the roots of the quartic polynomial 𝐏(x)\mathbf{P}(x) in (55) with 0<x<10<x<1 are simple.

proof. By contradiction, if 0<x0<10<x_{0}<1 is a multiple root of 𝐏\mathbf{P}, then 𝐏(x0)=𝐏(x0)=0\mathbf{P}(x_{0})=\mathbf{P}^{\prime}(x_{0})=0. Write

𝐏(x)=(a(x)Eb(x))(a(x)E+b(x)).\mathbf{P}(x)=\big{(}a(x)-E-b(x)\big{)}\big{(}a(x)-E+b(x)\big{)}\,.

Assume that a(x0)Eb(x0)=0a(x_{0})-E-b(x_{0})=0, the case a(x0)E+b(x0)=0a(x_{0})-E+b(x_{0})=0 being analogous. By 𝐏(x0)=0\mathbf{P}^{\prime}(x_{0})=0 it follows that

𝐏(x0)=(a(x0)b(x0))(a(x0)E+b(x0))=0.\mathbf{P}^{\prime}(x_{0})=\big{(}a^{\prime}(x_{0})-b^{\prime}(x_{0})\big{)}\big{(}a(x_{0})-E+b(x_{0})\big{)}=0\,. (56)

Since a(x0)Eb(x0)=0a(x_{0})-E-b(x_{0})=0 and b(x0)>0b(x_{0})>0, by (56) we get a(x0)b(x0)=0a^{\prime}(x_{0})-b^{\prime}(x_{0})=0. This means that (x0,π)(x_{0},\pi) is a critical point of FF, which is a contradiction since EE is a not critical energy. \Box

From now on we will assume that EE is not a critical energy of FF. We denote the roots of 𝐏(x;E)\mathbf{P}(x;E) with 0<x<10<x<1 by xi=xi(E)x_{i}=x_{i}(E) with i{1,2,3,4}i\in\{1,2,3,4\}. We label the roots in increasing order, namely xi<xi+1x_{i}<x_{i+1}.

3.4 The quartic equation

In studying the solutions of (54) (equivalently of (55)) on 0<x<10<x<1, it is convenient to consider the real variable tt\in\mathbb{R} and make the substitution

x=t21+t2.x=\frac{t^{2}}{1+t^{2}}\,.

Since

a(x)E=1(1+t2)2[12a2t4+a1t2(1+t2)E(1+t2)2],b(x)=|t|(1+t2)2a(x)-E=\frac{1}{(1+t^{2})^{2}}\left[\frac{1}{2}a_{2}t^{4}+a_{1}t^{2}(1+t^{2})-E(1+t^{2})^{2}\right]\,,\qquad b(x)=\frac{|t|}{(1+t^{2})^{2}}

and

12a2t4+a1t2(1+t2)E(1+t2)2=(12a2+a1E)t4+(a12E)t2E,\frac{1}{2}a_{2}t^{4}+a_{1}t^{2}(1+t^{2})-E(1+t^{2})^{2}=\left(\frac{1}{2}a_{2}+a_{1}-E\right)t^{4}+(a_{1}-2E)t^{2}-E\,,

the two equations in (54) are equivalent to

(12a2+a1E)t4+(a12E)t2E=|t|.\left(\frac{1}{2}a_{2}+a_{1}-E\right)t^{4}+(a_{1}-2E)t^{2}-E=\mp|t|\,. (57)
Lemma 3.

Let t0t_{0} be a root of the polynomial

P(t):=(12a2+a1E)t4+(a12E)t2tEP(t):=\left(\frac{1}{2}a_{2}+a_{1}-E\right)t^{4}+(a_{1}-2E)t^{2}-t-E (58)

and set

x0:=t021+t02.x_{0}:=\frac{t_{0}^{2}}{1+t_{0}^{2}}\,. (59)

If t0<0t_{0}<0, resp. t0>0t_{0}>0, then x0x_{0} solves F(0,x0)=EF(0,x_{0})=E, resp. F(π,x0)=EF(\pi,x_{0})=E. Conversely if 0<x0<10<x_{0}<1 solve the equation in (54) with the \mp sign, then t0:=x0/(1x02)t_{0}:=\mp x_{0}/(1-x_{0}^{2}) solves (57) with the \mp sign and, therefore, is a root of P(t)P(t).

proof. If P(t0)=0P(t_{0})=0 for some t0>0t_{0}>0, then t0t_{0} satisfies (57) and, therefore (54), with the plus sign. As a consequence F(π,x0)=EF(\pi,x_{0})=E. The proof in the case t0<0t_{0}<0 is analogous. \Box

When E=12a2+a1E=\frac{1}{2}a_{2}+a_{1} the polynomial P(t)P(t) reduces to (a2+a1)t2t+a1+a2/2(a_{2}+a_{1})t^{2}-t+a_{1}+a_{2}/2, whose two roots are easily evaluated. Then we can reduce to the case E12a2+a1E\neq\frac{1}{2}a_{2}+a_{1} and consider the equivalent monic polynomial P(t):=P(t)/(12a2+a1E){\rm P}(t):=P(t)/(\frac{1}{2}a_{2}+a_{1}-E), namely

P(t)=t4+pt2+qt+r,p:=a12E12a2+a1E,q:=112a2+a1E,r:=E12a2+a1E.{\rm P}(t)=t^{4}+{\rm p}t^{2}+{\rm q}t+{\rm r},\quad{\rm p}:=\frac{a_{1}-2E}{\frac{1}{2}a_{2}+a_{1}-E},\ \ {\rm q}:=\frac{-1}{\frac{1}{2}a_{2}+a_{1}-E},\ \ {\rm r}:=\frac{-E}{\frac{1}{2}a_{2}+a_{1}-E}. (60)

The above quartic polynomial is called “depressed” since it is monic and its third order coefficient vanishes. Obviously P(t)P(t) and P(t){\rm P}(t) have the same roots. An immediate corollary of Lemma 3 is the following

Lemma 4.

Fix E12a2+a1E\neq\frac{1}{2}a_{2}+a_{1}. Let t0t_{0} be a root of P(t){\rm P}(t) in (60). If t0<0t_{0}<0, resp. t0>0t_{0}>0, then x0x_{0} in (59) solves F(0,x0)=EF(0,x_{0})=E, resp. F(π,x0)=EF(\pi,x_{0})=E. In particular x0x_{0} is a root of 𝐏(x)\mathbf{P}(x) in (55).

Remark 7.

If P(t){\rm P}(t) has four real distinct roots and E0E\neq 0 (so that t=0t=0 is not a root), then the number of positive/negative roots depends on the sign of r{\rm r} defined in (60). Indeed, since limt±P(t)=+\lim_{t\to\pm\infty}{\rm P}(t)=+\infty, the number of positive/negative roots is even if r>0{\rm r}>0 and odd otherwise.

3.5 Finding the roots of the quartic equation

Following [CP23] we find the roots of the quartic polynomial P(t){\rm P}(t) in (60). First set151515Compare formulas (20) and (10) in [CP23].

p:=p2+12r3,q:=2p372pr+27q227,Δ:=4p327q2.p_{*}:=-\frac{{\rm p}^{2}+12{\rm r}}{3}\,,\quad q_{*}:=-\frac{2{\rm p}^{3}-72{\rm p}{\rm r}+27{\rm q}^{2}}{27}\,,\quad\Delta:=-4p_{*}^{3}-27q_{*}^{2}\,.

Let us define the positive161616Compare Theorem 8 in [CP23]. number s>0s_{*}>0 as

s:={q2+Δ1083+q2Δ10832p3ifΔ0,2p3cos(13arccos(q2(3p)3))2p3ifΔ>0.s_{*}:=\left\{\begin{array}[]{ll}\sqrt[3]{-\frac{q_{*}}{2}+\sqrt{-\frac{\Delta}{108}}}+\sqrt[3]{-\frac{q_{*}}{2}-\sqrt{-\frac{\Delta}{108}}}-\frac{2{\rm p}}{3}&{\rm if}\ \ \Delta\leq 0\,,\\ 2\sqrt{-\frac{p_{*}}{3}}\cos\left(\frac{1}{3}\arccos\left(-\frac{q_{*}}{2}\sqrt{\left(-\frac{3}{p_{*}}\right)^{3}}\right)\right)-\frac{2{\rm p}}{3}&{\rm if}\ \ \Delta>0\,.\end{array}\right. (61)

Then the roots of P(t){\rm P}(t) are given by171717Compare formula (9) in [CP23].

tς±:=ςs±δς2,δς:=ς2q(s)1/22ps,ς=±.t^{\pm}_{\varsigma}:=\frac{-\varsigma\sqrt{s_{*}}\pm\sqrt{\delta_{\varsigma}}}{2}\,,\qquad\delta_{\varsigma}:=\varsigma 2{\rm q}(s_{*})^{-1/2}-2{\rm p}-s_{*}\,,\qquad\varsigma=\pm\,. (62)

The number of real roots of P(t){\rm P}(t) is:

4 if δ±>0\delta_{\pm}>0,

2 if δ+δ<0\delta_{+}\delta_{-}<0,

0 if δ±<0\delta_{\pm}<0.

Let us now define

xς±:=(tς±)21+(tς±)2=|tς±|21+|tς±|2.x^{\pm}_{\varsigma}:=\frac{(t^{\pm}_{\varsigma})^{2}}{1+(t^{\pm}_{\varsigma})^{2}}=\frac{|t^{\pm}_{\varsigma}|^{2}}{1+|t^{\pm}_{\varsigma}|^{2}}\,. (63)

Note that xς±x^{\pm}_{\varsigma} is an increasing function of |tς±||t^{\pm}_{\varsigma}|. By Lemma 4 xς±x^{\pm}_{\varsigma} are the roots of 𝐏(x)\mathbf{P}(x) in (55). We now want to order the real roots xς±x^{\pm}_{\varsigma} in increasing order x1<x2<x_{1}<x_{2}<\ldots. We have different cases (see Figure 14):

x1\displaystyle x_{1} :=\displaystyle:= {min{x++,x}ifδ±>0,min{x++,x+}ifδ0δ+,min{x+,x}ifδ+0δ,\displaystyle\left\{\begin{array}[]{ll}\min\{x_{+}^{+},x_{-}^{-}\}&{\rm if}\ \ \delta_{\pm}>0\,,\\ \min\{x_{+}^{+},x_{+}^{-}\}&{\rm if}\ \ \delta_{-}\leq 0\leq\delta_{+}\,,\\ \min\{x_{-}^{+},x_{-}^{-}\}&{\rm if}\ \ \delta_{+}\leq 0\leq\delta_{-}\,,\end{array}\right. (67)
x2\displaystyle x_{2} :=\displaystyle:= {max{x++,x}ifδ±>0,max{x++,x+}ifδ0δ+,max{x+,x}ifδ+0δ,\displaystyle\left\{\begin{array}[]{ll}\max\{x_{+}^{+},x_{-}^{-}\}&{\rm if}\ \ \delta_{\pm}>0\,,\\ \max\{x_{+}^{+},x_{+}^{-}\}&{\rm if}\ \ \delta_{-}\leq 0\leq\delta_{+}\,,\\ \max\{x_{-}^{+},x_{-}^{-}\}&{\rm if}\ \ \delta_{+}\leq 0\leq\delta_{-}\,,\end{array}\right. (71)
x3\displaystyle x_{3} :=\displaystyle:= min{x+,x+}ifδ±>0,\displaystyle\ \quad\min\{x_{-}^{+},x_{+}^{-}\}\quad{\rm if}\quad\delta_{\pm}>0\,,
x4\displaystyle x_{4} :=\displaystyle:= max{x+,x+}ifδ±>0.\displaystyle\ \quad\max\{x_{-}^{+},x_{+}^{-}\}\quad{\rm if}\quad\delta_{\pm}>0\,. (72)
Refer to caption
Figure 14: The mutual position of the roots xς±x^{\pm}_{\varsigma}, ς{1,+1}\varsigma\in\{-1,+1\}, as in (63), used in (72) to define the roots of the quartic polynomial for a1=1a_{1}=1, a2=2a_{2}=-2 and different value of the energy EE.

3.6 The separatrices at the saddle points

Recall the definition of the zones ZijZ_{ij} given in (49). We now consider the curve with zero energy bifurcating from the point (π/2,0)(\pi/2,0) (recall (53)) in the “half phase space” [0,π]×(0,1)[0,\pi]\times(0,1). In the case Z10Z_{10} such curve “turns left” and touches the line {ψ=0}\{\psi=0\} at some point >x1(0)>x_{1}^{(0)}. Analogously in the case Z01Z_{01} such curve “turns right” and touches the line {ψ=π}\{\psi=\pi\} at some point >x1(π)>x_{1}^{(\pi)}.
The situation in the cases Z21Z_{21} and Z12Z_{12} is more involved; more precisely it depends on the sign of EsadE_{\rm sad}. In particular for (i,j){(2,1),(1,2)}(i,j)\in\left\{(2,1),(1,2)\right\} we set

Zij±:={(a1,a2)Zij:±Esad>0},Zij0:={(a1,a2)Zij:Esad=0},Z_{ij}^{\pm}:=\{(a_{1},a_{2})\in Z_{ij}\ :\ \pm E_{\rm sad}>0\}\,,\qquad Z_{ij}^{0}:=\{(a_{1},a_{2})\in Z_{ij}\ :\ E_{\rm sad}=0\}\,, (73)

so that

Zij=Zij+ZijZij0.Z_{ij}=Z_{ij}^{+}\cup Z_{ij}^{-}\cup Z_{ij}^{0}\,.

The next result characterises the sets in (73)

Lemma 5.

Setting

g~(a1):=227a1(4a12+27)\tilde{g}(a_{1}):=-\frac{2}{27}a_{1}(4a_{1}^{2}+27) (74)

we have

Z21+=Z21{a2>g~(a1)},Z12+=Z12{a2>g~(a1)},\displaystyle Z_{21}^{+}=Z_{21}\cap\{a_{2}>\tilde{g}(a_{1})\}\,,\quad\quad Z_{12}^{+}=Z_{12}\cap\{a_{2}>\tilde{g}(a_{1})\}\,,
Z21=Z21{a2<g~(a1)},Z12=Z12{a2<g~(a1)},
\displaystyle Z_{21}^{-}=Z_{21}\cap\{a_{2}<\tilde{g}(a_{1})\}\,,\qquad Z_{12}^{-}=Z_{12}\cap\{a_{2}<\tilde{g}(a_{1})\}\,,\vskip 12.0pt plus 4.0pt minus 4.0pt
Z210=Z21{a2=g~(a1)}={a2=g~(a1),a1<0},\displaystyle Z_{21}^{0}=Z_{21}\cap\{a_{2}=\tilde{g}(a_{1})\}=\{a_{2}=\tilde{g}(a_{1}),\ \ a_{1}<0\}\,,
Z120=Z12{a2=g~(a1)}={a2=g~(a1),a1>0}.\displaystyle Z_{12}^{0}=Z_{12}\cap\{a_{2}=\tilde{g}(a_{1})\}=\{a_{2}=\tilde{g}(a_{1}),\ \ a_{1}>0\}\,. (75)

Note that, since g~\tilde{g} is odd and g~(a1)a1\tilde{g}(a_{1})\leq-a_{1} for a10a_{1}\geq 0, by the definition of Z21Z_{21} and Z12Z_{12} it follows that Z21{a1<0}Z_{21}^{-}\subset\{a_{1}<0\} and Z12+{a1>0}Z_{12}^{+}\subset\{a_{1}>0\}.

proof. We discuss only the case Z21Z_{21}, the study of Z12Z_{12} being analogous. As we said above, the picture of the phase space in the case Z21Z_{21} strongly depends on the sign of the energy of the saddle point Esad=F(0,x2(0))E_{\rm sad}=F(0,x_{2}^{(0)}), where x2(0)x_{2}^{(0)} is the minimum of the function xF(0,x)x\to F(0,x). In particular we claim that Esad0E_{\rm sad}\lesseqqgtr 0 if and only if a2g~(a1)a_{2}\lesseqqgtr\tilde{g}(a_{1}). In particular we note that x=x2(0)x=x_{2}^{(0)} satisfies the system

{F(0,x)=12a2x2+a1x+b(x)=0,xF(0,x)=a2x+a1+b(x)=0.\left\{\begin{array}[]{ll}F(0,x)=\frac{1}{2}a_{2}x^{2}+a_{1}x+b(x)=0\,,\\ \partial_{x}F(0,x)=a_{2}x+a_{1}+b^{\prime}(x)=0\,.\end{array}\right.

By algebraic manipulation we get

{12a2x2+xb(x)b(x)=0,a1x+2b(x)xb(x)=0,\left\{\begin{array}[]{ll}\frac{1}{2}a_{2}x^{2}+xb^{\prime}(x)-b(x)=0\,,\\ a_{1}x+2b(x)-xb^{\prime}(x)=0\,,\end{array}\right.

by which we finally have

a2=2b(x)xb(x)x2,a1=xb(x)2b(x)x.a_{2}=2\frac{b(x)-xb^{\prime}(x)}{x^{2}}\,,\qquad\quad a_{1}=\frac{xb^{\prime}(x)-2b(x)}{x}\,.

By using (47)

a1=321x1,a2=1x1(1x+2).a_{1}=-\frac{3}{2}\sqrt{\frac{1}{x}-1}\,,\qquad a_{2}=\sqrt{\frac{1}{x}-1}\left(\frac{1}{x}+2\right)\,. (76)

Note that a1<0a_{1}<0. By inverting the first expression in (76) we get 1x=49a12+1\frac{1}{x}=\frac{4}{9}a_{1}^{2}+1; substituting in the second expression we obtain that a2=g~(a1)a_{2}=\tilde{g}(a_{1}) defined in (74). Therefore in Z210Z_{21}^{0}, namely when a2=g~(a1)a_{2}=\tilde{g}(a_{1}), the value of the function xF(0,x)=12a2x2+a1x+b(x)x\to F(0,x)=\frac{1}{2}a_{2}x^{2}+a_{1}x+b(x) at its minimum x2(0)x_{2}^{(0)} is exactly 0. On the other hand in Z21+Z_{21}^{+}, namely when a2>g~(a1)a_{2}>\tilde{g}(a_{1}), one has F(0,x2(0))>0F(0,x_{2}^{(0)})>0. Finally in Z21Z_{21}^{-} it is F(0,x2(0))<0F(0,x_{2}^{(0)})<0. \Box

Refer to caption
Figure 15: The six zones Z01,Z10,Z21+,Z21,Z12+,Z12Z_{01},Z_{10},Z_{21}^{+},Z_{21}^{-},Z_{12}^{+},Z_{12}^{-}.

3.7 Different topologies of the level curves

Let us consider the energy level sets in the phase spase

𝒫:=𝕋×(0,1),{\mathcal{P}}:=\mathbb{T}\times(0,1)\,,

which is a cylinder. The points where the level curves {F=E}\{F=E\} touch the lines ψ=0\psi=0 or ψ=π\psi=\pi are the solutions of the equation F(0,x)=a(x)+b(x)=EF(0,x)=a(x)+b(x)=E and F(π,x)=a(x)b(x)=EF(\pi,x)=a(x)-b(x)=E, respectively; equivalently they are the roots of the quartic polynomial in (54).

We note that in the cases Z10Z_{10}, Z01Z_{01} the set {F=E}\{F=E\} has only one connected component. The same holds in the case Z21Z_{21} except for Esad<E<min{a(1),Emax}E_{\rm sad}<E<\min\{a(1),E_{\rm max}\} when {F=E}\{F=E\} possesses two connected components. Analogously in the case Z12Z_{12} the level set {F=E}\{F=E\} possesses two connected component for max{a(1),Emin}<E<Esad\max\{a(1),E_{\rm min}\}<E<E_{\rm sad} and only one otherwise.

Remark 8.

Up to the energy level corresponding to E=0E=0 and to the critical energies181818Namely the energy of critical points of FF. In the case a1+a2=0a_{1}+a_{2}=0, that we are actually excluding (recall Remark 6), there is also a curve which touches the line x=1x=1., the level sets are curves of three types:
(i) a homotopically trivial, namely contractible, curve making a loop around the maximum (0,x1(0))(0,x_{1}^{(0)}) intersecting twice the line ψ=0\psi=0;
(ii) a curve wrapping on the cylinder; in particular it intersects once the line ψ=0\psi=0 and once the line ψ=π\psi=\pi;
(iii) a homotopically trivial curve making a loop around the minimum (π,x1(π))(\pi,x_{1}^{(\pi)}) intersecting twice the line ψ=π\psi=\pi.

In the following we will always label the roots of the quartic polynomial in (54) so that xi(E)<xi+1(E)x_{i}(E)<x_{i+1}(E). Recall Proposition 1.

Case Z10Z_{10}.

Refer to caption
Figure 16: Phase portrait of Z10Z_{10}: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). The zone Z10Z_{10} for a1=1a_{1}=-1 and a2=2a_{2}=-2 filled by the level curves of xx2+b(x)cosψ=E-x-x^{2}+b(x)\cos\psi=E, for different values of the energy EE.

The zero level separatrix actually separates the phase space 𝒫{\mathcal{P}} into two open connected components 𝒫10I{\mathcal{P}}_{10}^{{\rm I}} and 𝒫10II{\mathcal{P}}_{10}^{{\rm I\!I}} supporting two different kind of motions191919Where {F>0}:={(ψ,x)𝒫:F(ψ,x)>0}\{F>0\}:=\{(\psi,x)\in{\mathcal{P}}\ :\ F(\psi,x)>0\}.

𝒫10I:={F>0},𝒫10II:={F<0},{\mathcal{P}}_{10}^{{\rm I}}:=\{F>0\}\,,\qquad{\mathcal{P}}_{10}^{{\rm I\!I}}:=\{F<0\}\,, (77)

with 𝒫=𝒫10I𝒫10II{F=0}{\mathcal{P}}={\mathcal{P}}_{10}^{{\rm I}}\cup{\mathcal{P}}_{10}^{{\rm I\!I}}\cup\{F=0\}. Indeed in 𝒫10I{\mathcal{P}}_{10}^{{\rm I}} the level curves have the form in case (ii) above, while in 𝒫10II{\mathcal{P}}_{10}^{{\rm I\!I}} they have the form in case (i). In the present case the quartic polynomial in (54) possesses, for E0E\neq 0 and not critical, only two real roots x1(E)<x2(E)x_{1}(E)<x_{2}(E). Note that x1(E)=x1(E;J2)x_{1}(E)=x_{1}(E;{J_{2}}) and x2(E)=x2(E;J2)x_{2}(E)=x_{2}(E;{J_{2}}). If E>0E>0 the EE–level curve starts at (0,x1(E))(0,x_{1}(E)) and come back on the line ψ=0\psi=0 at (0,x2(E))(0,x_{2}(E)), otherwise, for E<0E<0, it joints the line ψ=π\psi=\pi at (π,x1(E))(\pi,x_{1}(E)) and the line ψ=0\psi=0 at (π,x2(E))(\pi,x_{2}(E)). Recalling (52), the level curve {F=E}\{F=E\} can be expressed as a graph over x1(E)<x<x2(E)x_{1}(E)<x<x_{2}(E) by the function ψ(x;J2)\psi(x;{J_{2}}).

Case Z01Z_{01}.

Refer to caption
Figure 17: Phase portrait of Z01Z_{01}: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). The zone Z01Z_{01} for a1=1a_{1}=1 and a2=2a_{2}=2 filled by the level curves of x+x2+b(x)cosψ=Ex+x^{2}+b(x)\cos\psi=E, for different values of the energy EE.

We set

𝒫01II:={F>0},𝒫01III:={F<0},{\mathcal{P}}_{01}^{{\rm I\!I}}:=\{F>0\}\,,\qquad{\mathcal{P}}_{01}^{{\rm I\!I\!I}}:=\{F<0\}\,, (78)

with 𝒫=𝒫01III𝒫01II{F=0}{\mathcal{P}}={\mathcal{P}}_{01}^{{\rm I\!I\!I}}\cup{\mathcal{P}}_{01}^{{\rm I\!I}}\cup\{F=0\}. Again the zero level separatrix actually separates the two different kind of motions: in 𝒫01III{\mathcal{P}}_{01}^{{\rm I\!I\!I}} the level curves have the form in case (iii) above, while in 𝒫01II{\mathcal{P}}_{01}^{{\rm I\!I}} they have the form in case (ii).

Case Z21+Z_{21}^{+}.

Refer to caption
Figure 18: Phase portrait of Z21+Z_{21}^{+}: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). The zone Z21+Z_{21}^{+} for a1=1a_{1}=-1 and a2=3a_{2}=3 filled by the level curves of x+32x2+b(x)cosψ=E-x+\frac{3}{2}x^{2}+b(x)\cos\psi=E, for different values of the energy EE.

The zero level separatrix and the two separatrices emanating from the saddle point (0,x2(0))(0,x_{2}^{(0)}) with energy202020Recall Proposition 1 and (3.2). F(0,x2(0))=Esad>0F(0,x_{2}^{(0)})=E_{\rm sad}>0 (recall (73)) separate the phase space 𝒫{\mathcal{P}} into 4 open connected components:

𝒫21+,I:={F>Esadcontaining(0,x1(0))},𝒫21+,II:={0<F<Esad},\displaystyle\!{\mathcal{P}}_{21}^{+,{\rm I}}:=\{F>E_{\rm sad}\ \mbox{containing}\ (0,x_{1}^{(0)})\}\,,\qquad{\mathcal{P}}_{21}^{+,{\rm I\!I}}:=\{0<F<E_{\rm sad}\}\,, (79)
𝒫21+,III:={F<0},𝒫21+,IV:={F>Esadnot containing(x1(0),0)}\displaystyle\!{\mathcal{P}}_{21}^{+,{\rm I\!I\!I}}:=\{F<0\}\,,\quad{\mathcal{P}}_{21}^{+,{\rm I\!V}}:=\{F>E_{\rm sad}\ \mbox{not containing}\ (x_{1}^{(0)},0)\}

with 𝒫=𝒫21+,I𝒫21+,II𝒫21+,III𝒫21+,IV{F=0}{F=Esad}{\mathcal{P}}={\mathcal{P}}_{21}^{+,{\rm I}}\cup{\mathcal{P}}_{21}^{+,{\rm I\!I}}\cup{\mathcal{P}}_{21}^{+,{\rm I\!I\!I}}\cup{\mathcal{P}}_{21}^{+,{\rm I\!V}}\cup\{F=0\}\cup\{F=E_{\rm sad}\}. The level curves in 𝒫21+,II{\mathcal{P}}_{21}^{+,{\rm I\!I}} and 𝒫21+,IV{\mathcal{P}}_{21}^{+,{\rm I\!V}} have the form case (ii) above, the ones in 𝒫21+,I{\mathcal{P}}_{21}^{+,{\rm I}} have the form in case (i), finally the ones in 𝒫21+,III{\mathcal{P}}_{21}^{+,{\rm I\!I\!I}} are as in (iii). In particular the level curves in 𝒫21+,I{\mathcal{P}}_{21}^{+,{\rm I}} pass through the points (0,x1(E))(0,x_{1}(E)) and (0,x2(E))(0,x_{2}(E)); the ones in 𝒫21+,II{\mathcal{P}}_{21}^{+,{\rm I\!I}} through (0,x1(E))(0,x_{1}(E)) and (π,x2(E))(\pi,x_{2}(E)); the ones in 𝒫21+,III{\mathcal{P}}_{21}^{+,{\rm I\!I\!I}} through (π,x1(E))(\pi,x_{1}(E)) and (π,x2(E))(\pi,x_{2}(E)); the ones in 𝒫21+,IV{\mathcal{P}}_{21}^{+,{\rm I\!V}} through (0,x3(E))(0,x_{3}(E)) and (π,x4(E))(\pi,x_{4}(E)).

Case Z21Z_{21}^{-}.

Refer to caption
Figure 19: Phase portrait of Z21Z_{21}^{-}: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). The zone Z21Z_{21}^{-} for a1=1a_{1}=-1 and a2=2a_{2}=2 filled by the level curves of x+x2+b(x)cosψ=E-x+x^{2}+b(x)\cos\psi=E, for different values of the energy EE.

The zero level separatrix and the two separatrices emanating from the saddle point (0,x2(0))(0,x_{2}^{(0)}) with energy F(0,x2(0))=Esad<0F(0,x_{2}^{(0)})=E_{\rm sad}<0 (recall (73)) separate the phase space 𝒫{\mathcal{P}} into 4 open connected components:

𝒫21,I:={F>0},𝒫21,III:={F<Esad},{\mathcal{P}}_{21}^{-,{\rm I}}:=\{F>0\}\,,\qquad{\mathcal{P}}_{21}^{-,{\rm I\!I\!I}}:=\{F<E_{\rm sad}\}\,, (80)

while 𝒫21,II{\mathcal{P}}_{21}^{-,{\rm I\!I}} and 𝒫21,IV{\mathcal{P}}_{21}^{-,{\rm I\!V}} are the two open connected components of {Esad<F<0}\{E_{\rm sad}<F<0\} with 𝒫21,II{\mathcal{P}}_{21}^{-,{\rm I\!I}} containing (0,π)(0,\pi) in its closure. We immediately see that

𝒫=𝒫21,I𝒫21,II𝒫21,III𝒫21,IV{F=0}{F=Esad}.{\mathcal{P}}={\mathcal{P}}_{21}^{-,{\rm I}}\cup{\mathcal{P}}_{21}^{-,{\rm I\!I}}\cup{\mathcal{P}}_{21}^{-,{\rm I\!I\!I}}\cup{\mathcal{P}}_{21}^{-,{\rm I\!V}}\cup\{F=0\}\cup\{F=E_{\rm sad}\}\,.

The level curves in 𝒫21,II{\mathcal{P}}_{21}^{-,{\rm I\!I}} and 𝒫21,IV{\mathcal{P}}_{21}^{-,{\rm I\!V}} have the form in case (ii) above, the ones in 𝒫21,I{\mathcal{P}}_{21}^{-,{\rm I}} have the form in case (i), finally the ones in 𝒫21,III{\mathcal{P}}_{21}^{-,{\rm I\!I\!I}} are as in (iii). In particular the level curves in 𝒫21,I{\mathcal{P}}_{21}^{-,{\rm I}} pass through the points (0,x1(E))(0,x_{1}(E)) and (0,x2(E))(0,x_{2}(E)); the ones in 𝒫21,II{\mathcal{P}}_{21}^{-,{\rm I\!I}} through (π,x1(E))(\pi,x_{1}(E)) and (0,x2(E))(0,x_{2}(E)); the ones in 𝒫21,III{\mathcal{P}}_{21}^{-,{\rm I\!I\!I}} through (π,x1(E))(\pi,x_{1}(E)) and (π,x2(E))(\pi,x_{2}(E)); the ones in 𝒫21,IV{\mathcal{P}}_{21}^{-,{\rm I\!V}} through (0,x3(E))(0,x_{3}(E)) and (π,x4(E))(\pi,x_{4}(E)).

Case Z12+Z_{12}^{+}.

Refer to caption
Figure 20: Phase portrait of Z12+Z_{12}^{+}: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). The zone Z12+Z_{12}^{+} for a1=1a_{1}=1 and a2=2a_{2}=-2 filled by the level curves of xx2+b(x)cosψ=Ex-x^{2}+b(x)\cos\psi=E, for different values of the energy EE.

The zero level separatrix and the two separatrices emanating from the saddle point (π,x2(π))(\pi,x_{2}^{(\pi)}) with energy F(π,x2(π))=Esad>0F(\pi,x_{2}^{(\pi)})=E_{\rm sad}>0 (recall (73)) separate the phase space 𝒫{\mathcal{P}} into 4 open connected components:

𝒫12+,I:={F>Esad},𝒫12+,III:={F<0},{\mathcal{P}}_{12}^{+,{\rm I}}:=\{F>E_{\rm sad}\}\,,\qquad{\mathcal{P}}_{12}^{+,{\rm I\!I\!I}}:=\{F<0\}\,, (81)

while 𝒫12+,II{\mathcal{P}}_{12}^{+,{\rm I\!I}} and 𝒫12+,IV{\mathcal{P}}_{12}^{+,{\rm I\!V}} are the two open connected components of {0<F<Esad}\{0<F<E_{\rm sad}\} with 𝒫12+,II{\mathcal{P}}_{12}^{+,{\rm I\!I}} containing (0,0)(0,0) in its closure. We note that

𝒫=𝒫12+,I𝒫12+,II𝒫12+,III𝒫12+,IV{F=0}{F=Esad}.{\mathcal{P}}={\mathcal{P}}_{12}^{+,{\rm I}}\cup{\mathcal{P}}_{12}^{+,{\rm I\!I}}\cup{\mathcal{P}}_{12}^{+,{\rm I\!I\!I}}\cup{\mathcal{P}}_{12}^{+,{\rm I\!V}}\cup\{F=0\}\cup\{F=E_{\rm sad}\}\,.

The level curves in 𝒫12+,II{\mathcal{P}}_{12}^{+,{\rm I\!I}} and 𝒫12+,IV{\mathcal{P}}_{12}^{+,{\rm I\!V}} have the form case (ii) above, the ones in 𝒫12+,I{\mathcal{P}}_{12}^{+,{\rm I}} have the form in case (i), finally the ones in 𝒫12+,III{\mathcal{P}}_{12}^{+,{\rm I\!I\!I}} are as in (iii). In particular the level curves in 𝒫12+,I{\mathcal{P}}_{12}^{+,{\rm I}} pass through the points (0,x1(E))(0,x_{1}(E)) and (0,x2(E))(0,x_{2}(E)); the ones in 𝒫12+,II{\mathcal{P}}_{12}^{+,{\rm I\!I}} through (0,x1(E))(0,x_{1}(E)) and (π,x2(E))(\pi,x_{2}(E)); the ones in 𝒫12+,III{\mathcal{P}}_{12}^{+,{\rm I\!I\!I}} through (π,x1(E))(\pi,x_{1}(E)) and (π,x2(E))(\pi,x_{2}(E)); the ones in 𝒫12+,IV{\mathcal{P}}_{12}^{+,{\rm I\!V}} through (π,x3(E))(\pi,x_{3}(E)) and (0,x4(E))(0,x_{4}(E)).

Case Z12Z_{12}^{-}.

Refer to caption
Figure 21: Phase portrait of Z12Z_{12}^{-}: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). The zone Z12Z_{12}^{-} for a1=1a_{1}=1 and a2=3a_{2}=-3 filled by the level curves of x32x2+b(x)cosψ=Ex-\frac{3}{2}x^{2}+b(x)\cos\psi=E, for different values of the energy EE.

The zero level separatrix and the two separatrices emanating from the saddle point (π,x2(π))(\pi,x_{2}^{(\pi)}) with energy F(π,x2(π))=Esad<0F(\pi,x_{2}^{(\pi)})=E_{\rm sad}<0 (recall (73)) separate the phase space 𝒫{\mathcal{P}} into 4 open connected components:

𝒫12,I:={F>0},𝒫12,III:={F<Esadcontaining(x1(π),π)},\displaystyle{\mathcal{P}}_{12}^{-,{\rm I}}:=\{F>0\}\,,\qquad{\mathcal{P}}_{12}^{-,{\rm I\!I\!I}}:=\{F<E_{\rm sad}\ \mbox{containing}\ (x_{1}^{(\pi)},\pi)\}\,,
𝒫12,II:={Esad<F<0},𝒫12,IV:={F<Esadnot containing(x1(π),π)},\displaystyle{\mathcal{P}}_{12}^{-,{\rm I\!I}}:=\{E_{\rm sad}<F<0\}\,,\quad{\mathcal{P}}_{12}^{-,{\rm I\!V}}:=\{F<E_{\rm sad}\ \mbox{not containing}\ (x_{1}^{(\pi)},\pi)\}\,, (82)

with 𝒫=𝒫12,I𝒫12,II𝒫12,III𝒫12,IV{F=0}{F=Esad}{\mathcal{P}}={\mathcal{P}}_{12}^{-,{\rm I}}\cup{\mathcal{P}}_{12}^{-,{\rm I\!I}}\cup{\mathcal{P}}_{12}^{-,{\rm I\!I\!I}}\cup{\mathcal{P}}_{12}^{-,{\rm I\!V}}\cup\{F=0\}\cup\{F=E_{\rm sad}\}. The level curves in 𝒫12,II{\mathcal{P}}_{12}^{-,{\rm I\!I}} and 𝒫12,IV{\mathcal{P}}_{12}^{-,{\rm I\!V}} have the form in case (ii) above, the ones in 𝒫12,I{\mathcal{P}}_{12}^{-,{\rm I}} have the form in case (i), finally the ones in 𝒫12,III{\mathcal{P}}_{12}^{-,{\rm I\!I\!I}} are as in (iii). In particular the level curves in 𝒫12,I{\mathcal{P}}_{12}^{-,{\rm I}} pass through the points (0,x1(E))(0,x_{1}(E)) and (0,x2(E))(0,x_{2}(E)); the ones in 𝒫12,II{\mathcal{P}}_{12}^{-,{\rm I\!I}} through (π,x1(E))(\pi,x_{1}(E)) and (0,x2(E))(0,x_{2}(E)); the ones in 𝒫12,III{\mathcal{P}}_{12}^{-,{\rm I\!I\!I}} through (π,x1(E))(\pi,x_{1}(E)) and (π,x2(E))(\pi,x_{2}(E)); the ones in 𝒫12,IV{\mathcal{P}}_{12}^{-,{\rm I\!V}} through (π,x3(E))(\pi,x_{3}(E)) and (0,x4(E))(0,x_{4}(E)).

3.8 Degenerate cases

Recalling Remark 6, we briefly illustrate in Figures 22-24 the degenerate cases: a2=a1a_{2}=-a_{1}, when x=1x=1 is a solution of (45)-(46), a2=g(a1)a_{2}=g(a_{1}) and a2=g(a1)a_{2}=-g(-a_{1}), when two solutions coincide, finally a2=g~(a1)a_{2}=\tilde{g}(a_{1}), when the separatrix and the stable and unstable manifolds of the saddle point coincide and have zero energy.

Refer to caption
Figure 22: The degenerate case a1=a2a_{1}=-a_{2}. Phase portraits: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). On the left, the case with a1=1a_{1}=1, a2=1a_{2}=-1, filled by the level curves x1/2x2+b(x)cosψ=Ex-1/2x^{2}+b(x)\cos\psi=E. On the right, the case with a1=1a_{1}=-1, a2=1a_{2}=1, filled by the level curves x+1/2x2+b(x)cosψ=E-x+1/2x^{2}+b(x)\cos\psi=E. In both cases a new separatrix appears approaching the line x=1x=1 at ψ=π/2,3π/2\psi=\pi/2,3\pi/2.
Refer to caption
Figure 23: The degenerate cases a2=g(a1)a_{2}=g(a_{1}) on the left and a2=g(a1)a_{2}=-g(-a_{1}). Phase portraits: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). On the left, a1=2a_{1}=2, a2=g~(2)=4727a_{2}=\tilde{g}(2)=-\frac{47}{27}, filled by the level curves 2x4754x2+b(x)cosψ=E2x-\frac{47}{54}x^{2}+b(x)\cos\psi=E. On the right, a1=2a_{1}=-2, a2=g~(2)=4727a_{2}=-\tilde{g}(2)=\frac{47}{27}, filled by the level curves 2x+4754x2+b(x)cosψ=E-2x+\frac{47}{54}x^{2}+b(x)\cos\psi=E. Note that a non-smooth curve appears with a cusp.
Refer to caption
Figure 24: The degenerate case a2=g~(a1)a_{2}=\tilde{g}(a_{1}). Phase portraits: (ψ,x)[0,2π]×(0,1)(\psi,x)\in[0,2\pi]\times(0,1). On the left, the zone Z210Z_{21}^{0} with a1=1a_{1}=-1, a2=g~(1)=6227a_{2}=\tilde{g}(-1)=\frac{62}{27}, filled by the level curves x+3127x2+b(x)cosψ=E-x+\frac{31}{27}x^{2}+b(x)\cos\psi=E. On the right, the zone Z120Z_{12}^{0} with a1=1a_{1}=1, a2=g~(1)=6227a_{2}=\tilde{g}(1)=-\frac{62}{27}, filled by the level curves x3127x2+b(x)cosψ=Ex-\frac{31}{27}x^{2}+b(x)\cos\psi=E. In both cases the stable and unstable manifolds of the saddle point have zero energy.

4 Explicit formulae of the nonlinear frequencies

In this section we first write the integrating action I1{\rm I}_{1} as a function of the energy EE in terms of integrals in the xx variables with endpoints given by the roots of the quartic polynomial 𝐏(x)\mathbf{P}(x) in (55), studied in the previous section. In addition to energy, these representation formulae depend on the values of the parameters a1a_{1} and a2a_{2}, according to the resulting different topologies of the phase space described above.

The final integrated Hamiltonian is the inverse :I1(I1;I2)\mathcal{E}:{\rm I}_{1}\to\mathcal{E}({\rm I}_{1};{\rm I}_{2}) of the function EI1(E;I2)E\to{\rm I}_{1}(E;{\rm I}_{2}) in (88). Its derivatives with respect to I1{\rm I}_{1} and I2{\rm I}_{2} are the nonlinear frequencies and can be written in terms of the derivatives of I1(E;I2){\rm I}_{1}(E;{\rm I}_{2}) with respect to EE and I2{\rm I}_{2}, see (95). These derivatives are expressed in terms of elliptic integrals in Proposition 2. The integrals are explicitly evaluated by means of suitable Moebius transformations in Subsections 4.3 and 4.4, in the case that 𝐏(x)\mathbf{P}(x) has four or two real roots, respectively. In the last subsection we consider the exact 3:1 resonance case, where the above formulae simplify a bit.

Refer to caption
Figure 25: The zone Z21+{0ψπ}Z_{21}^{+}\cap\{0\leq\psi\leq\pi\} for a1=1a_{1}=1 and a2=3a_{2}=3 filled by the level curves of x32x2+b(x)cosψ=Ex-\frac{3}{2}x^{2}+b(x)\cos\psi=E, for different values of the energy EE. The intersections x1,x2,x3x_{1},x_{2},x_{3}, respectively x4x_{4}, of the two level curves of energy E=0.16E=0.16 with the line ψ=0\psi=0, respectively ψ=π\psi=\pi, are shown.

4.1 Construction of the integrating action variables

Since H^\hat{H} has two independent integrals of motions: the Hamiltonian itself and J2{J_{2}}, by the Arnold-Liouville theorem the Hamiltonian H^\hat{H} is integrable. A part from I2:=J2{\rm I}_{2}:={J_{2}} the construction of the other action I1{\rm I}_{1} as function of the energy EE is as follows. I1(E){\rm I}_{1}(E) is simply the area enclosed by the level curves of H^=E\hat{H}=E divided by 2π2\pi. Such level curves coincide with the ones of FF.

Our aim is to find a symplectic map Ψ:(I,θ)=(I1,I2,θ1,θ2)(J1,J2,ψ1,ψ2)\Psi:({\rm I},\theta)=({\rm I}_{1},{\rm I}_{2},\theta_{1},\theta_{2})\to(J_{1},{J_{2}},\psi_{1},\psi_{2}), fixing

I2=J2,{\rm I}_{2}={J_{2}}\,, (83)

such that, in the new coordinates, the Hamiltonian H^\hat{H} is integrated, namely212121Recalling (41) note that \mathcal{E} is adimensional.

H^Ψ=:(I)\hat{H}\circ\Psi=:\mathcal{E}({\rm I}) (84)

depends only on the new actions I=(I1,I2){\rm I}=({\rm I}_{1},{\rm I}_{2}).

Note that the same transformation Ψ\Psi also integrates J2=I2\mathbb{H}_{{J_{2}}}=\mathbb{H}_{{\rm I}_{2}} in (41) and ^res\hat{\mathbb{H}}_{\rm res} in (38). Indeed

I2Ψ=χI22(I)and^resΨ=𝔼(I):=ωI2+χI22(I).\mathbb{H}_{{\rm I}_{2}}\circ\Psi=\chi\,{\rm I}_{2}^{2}\,\mathcal{E}({\rm I})\qquad\mbox{and}\qquad\hat{\mathbb{H}}_{\rm res}\circ\Psi=\mathbb{E}({\rm I}):=\omega_{-}{\rm I}_{2}+\chi\,{\rm I}_{2}^{2}\,\mathcal{E}({\rm I})\,. (85)

In the new coordinates, the actions are constants of motion and the angles perform a linear motion θ(t)=θ(0)+ωt\theta(t)=\theta(0)+\omega t with frequencies

ω=(ω1,ω2):=(I1𝔼,I2𝔼)=(χI22I1,ω+2χI2+χI22I2).\omega=(\omega_{1},\omega_{2}):=(\partial_{{\rm I}_{1}}\mathbb{E},\partial_{{\rm I}_{2}}\mathbb{E})=(\chi\,{\rm I}_{2}^{2}\,\partial_{{\rm I}_{1}}\mathcal{E}\,,\ \ \omega_{-}+2\chi\,{\rm I}_{2}\mathcal{E}+\chi\,{\rm I}_{2}^{2}\,\partial_{{\rm I}_{2}}\mathcal{E})\,. (86)

The classical construction of the Hamiltonian \mathcal{E}, “the adimensional energy”, is as follows. First one constructs, for every fixed value of I2=J2{\rm I}_{2}={J_{2}}, the action function I1:EI1(E;J2){\rm I}_{1}:E\to{\rm I}_{1}(E;{J_{2}}) defined as the area enclosed by the level curve γE:={H^J2=E+a0}\gamma_{E}:=\{\hat{H}_{J_{2}}=E+a_{0}\} normalised by 2π2\pi. Then, since the function I1:EI1(E;J2){\rm I}_{1}:E\to{\rm I}_{1}(E;{J_{2}}) turns out to be monotone (being |EI1(E;J2)|>0|\partial_{E}{\rm I}_{1}(E;{J_{2}})|>0), one defines :I1(I1,J2)\mathcal{E}:{\rm I}_{1}\to\mathcal{E}({\rm I}_{1},{J_{2}}) as its inverse. Namely, in view of (43),

(I1(E;J2),J2)=(I1(E;I2),I2)=E+a0.\mathcal{E}\big{(}{\rm I}_{1}(E;{J_{2}}),{J_{2}}\big{)}=\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}=E+a_{0}\,. (87)

So the level curves of H^J2\hat{H}_{J_{2}} play a crucial role here. Note that by (41) the level curves of H^J2\hat{H}_{J_{2}} are the same as the ones of J2\mathbb{H}_{{J_{2}}}, moreover by (43) they are simple related to the ones of FF.
More precisely the new action is defined as222222Recall (43).

I1(E)=I1(E;I2):=I23𝒜(E;I2),{\rm I}_{1}(E)={\rm I}_{1}(E;{\rm I}_{2}):=\frac{{\rm I}_{2}}{3}{\mathcal{A}}(E;{\rm I}_{2})\,, (88)

where, recalling the notation introduced in Remark 8, 𝒜{\mathcal{A}} is the area (normalised by 2π2\pi) enclosed by the EE-level curve in the cases (i) and (iii), and below the level curve in the case (ii). In particular we have four cases indexed by I,II,III,IV{\rm I},{\rm I\!I},{\rm I\!I\!I},{\rm I\!V}, according if one is in the zones 𝒫ijI,𝒫ijII,𝒫ijIII,𝒫ij±,I,𝒫ij±,II,𝒫ij±,III,𝒫ij±,IV{\mathcal{P}}_{ij}^{{\rm I}},{\mathcal{P}}_{ij}^{{\rm I\!I}},{\mathcal{P}}_{ij}^{{\rm I\!I\!I}},{\mathcal{P}}_{ij}^{\pm,{\rm I}},{\mathcal{P}}_{ij}^{\pm,{\rm I\!I}},{\mathcal{P}}_{ij}^{\pm,{\rm I\!I\!I}},{\mathcal{P}}_{ij}^{\pm,{\rm I\!V}}.
Case I{\rm I}. The level curve makes a loop around the maximum (0,x1(0))(0,x_{1}^{(0)}) then232323Recall (52).

𝒜(E)=𝒜I(E;I2):=Areaπ=1πx1(E)x2(E)ψ(x;E)𝑑x.{\mathcal{A}}(E)={\mathcal{A}}^{{\rm I}}(E;{\rm I}_{2}):=\frac{\mathrm{Area}}{\pi}=\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\psi(x;E)\,dx\,. (89)

This holds in the zones: 𝒫10I,𝒫ij±,I.{\mathcal{P}}_{10}^{{\rm I}},{\mathcal{P}}_{ij}^{\pm,{\rm I}}.
Case II{\rm I\!I}. The level curve wraps on the cylinder

𝒜(E)=𝒜II(E;I2):=Areaπ={x1(E)+1πx1(E)x2(E)ψ(x;E)𝑑x,ifψ(x1)=πx2(E)1πx1(E)x2(E)ψ(x;E)𝑑x,ifψ(x1)=0.{\mathcal{A}}(E)={\mathcal{A}}^{\rm I\!I}(E;{\rm I}_{2}):=\frac{\mathrm{Area}}{\pi}=\left\{\begin{array}[]{ll}x_{1}(E)+\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\psi(x;E)\,dx\,,&{\rm if}\ \ \psi(x_{1})=\pi\\ x_{2}(E)-\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\psi(x;E)\,dx\,,&{\rm if}\ \ \psi(x_{1})=0\,.\end{array}\right. (90)

in the cases 𝒫ijII,𝒫ij±,II{\mathcal{P}}_{ij}^{{\rm I\!I}},{\mathcal{P}}_{ij}^{\pm,{\rm I\!I}}.
Case III{\rm I\!I\!I}. The level curve makes a loop around the minimum (π,x1(π))(\pi,x_{1}^{(\pi)})

𝒜(E)=𝒜III(E;I2):=Areaπ=1πx1(E)x2(E)(πψ(x;E))𝑑x.{\mathcal{A}}(E)={\mathcal{A}}^{\rm I\!I\!I}(E;{\rm I}_{2}):=\frac{\mathrm{Area}}{\pi}=\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\big{(}\pi-\psi(x;E)\big{)}\,dx\,. (91)

This holds in the zones: 𝒫01III,𝒫ij±,III{\mathcal{P}}_{01}^{{\rm I\!I\!I}},{\mathcal{P}}_{ij}^{\pm,{\rm I\!I\!I}}.
Case IV{\rm I\!V}. The level curve wraps on the cylinder

𝒜(E)=𝒜IV(E;I2):=Areaπ={x3(E)+1πx3(E)x4(E)ψ(x;E)𝑑x,ifψ(x3)=πx4(E)1πx3(E)x4(E)ψ(x;E)𝑑x,ifψ(x3)=0.{\mathcal{A}}(E)={\mathcal{A}}^{\rm I\!V}(E;{\rm I}_{2}):=\frac{\mathrm{Area}}{\pi}=\left\{\begin{array}[]{ll}x_{3}(E)+\frac{1}{\pi}\int_{x_{3}(E)}^{x_{4}(E)}\psi(x;E)\,dx\,,&{\rm if}\ \ \psi(x_{3})=\pi\\ x_{4}(E)-\frac{1}{\pi}\int_{x_{3}(E)}^{x_{4}(E)}\psi(x;E)\,dx\,,&{\rm if}\ \ \psi(x_{3})=0\,.\end{array}\right. (92)

in the cases 𝒫ij±,IV{\mathcal{P}}_{ij}^{\pm,{\rm I\!V}}.

Remark 9 (KAM Theory).

The above integrating construction holds for the truncated Hamiltonian ^res\hat{\mathbb{H}}_{\rm res} in (38) but it does not work for the complete Hamiltonian. In fact the complete system is genuinely two dimensional and, therefore, not integrable. In particular J2{J_{2}} is not more a constant of motion. One might wonder whether, for ϵ\epsilon small enough, the invariant structures, both NNMs and stable and unstable manifolds, that exist for the truncated Hamiltonian survive, slightly deformed, for the full Hamiltonian. The answer is substantially positive thanks to KAM Theory. More precisely, the hyperbolic periodic orbit and its (local) stable and unstable manifolds survive as can be demonstrated following, e.g., [Graff] and [Val]. The conservation of two dimensional invariant tori is ensured when the frequencies are strongly rationally independent. This implies that the majority of invariant tori still exist in the complete system, whereas a minority is destroyed. However we note that, in this resonant case, the application of KAM Theory is not straightforward. In fact the standard KAM theory only regards the persistence of the so called primary tori, namely tori that are graphs over the angles. However, as we have already shown, in the resonant case also the so called secondary tori appear (the blue and the yellow tori in Figure 5). All our analysis can bee seen as a necessary preparatory step in view of the application of KAM techniques, since it integrates the resonant BNF up to order four. This means that, in the final action angle variables, the invariant tori are graphs over the angles and KAM methods can be applied. For a KAM result in presence of resonances and the persistence of secondary tori see [MNT].
Finally we note that, since the complete system is, in general, not integrable, KAM tori do not completely fill the phase space but some gaps appear between them. In these gaps chaotic behaviour may occur. However one has to notice that, since we are in two degrees of freedom, every orbit is perpetually stable in the sense that the solutions exist for all times and the values of the action variables remain close to the initial ones forever. The argument is standard in KAM Theory: the orbits evolve on the three dimensional energy surface we have two cases. 1) If on orbit starts on a KAM torus, then it remains on it forever, since the torus is invariant for the Hamiltonian flow. 2) If an orbit starts in a gap between two KAM tori then, since the tori are invariant and bidimensional and the energy surface is three dimensional, the orbit cannot cross them and it remains trapped between them forever.

4.2 Evaluation of the nonlinear frequencies as functions of the energy

In evaluating the new frequencies in (86), it is convenient to use (E;I2)(E;{\rm I}_{2}) as independent variables, rather than (I1,I2)({\rm I}_{1},{\rm I}_{2}). In particular, we have to evaluate I1(I1(E;I2),I2)\partial_{{\rm I}_{1}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)} and I2(I1(E;I2),I2)\partial_{{\rm I}_{2}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}. Deriving (87) with respect to EE we get

I1(I1(E;I2),I2)EI1(E;I2)=1.\partial_{{\rm I}_{1}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}\partial_{E}{\rm I}_{1}(E;{\rm I}_{2})=1\,.

Then

I1(I1(E;I2),I2)=1EI1(E;I2).\partial_{{\rm I}_{1}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}=\frac{1}{\partial_{E}{\rm I}_{1}(E;{\rm I}_{2})}\,. (93)

Analogously, deriving (87) with respect to I2{\rm I}_{2}, we get

I1(I1(E;I2),I2)I2I1(E;I2)+I2(I1(E;I2),I2)=0,\partial_{{\rm I}_{1}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}\partial_{{\rm I}_{2}}{\rm I}_{1}(E;{\rm I}_{2})+\partial_{{\rm I}_{2}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}=0\,,

and, therefore,

I2(I1(E;I2),I2)=I1(I1(E;I2),I2)I2I1(E;I2)=(93)I2I1(E;I2)EI1(E;I2).\partial_{{\rm I}_{2}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}=-\partial_{{\rm I}_{1}}\mathcal{E}\big{(}{\rm I}_{1}(E;{\rm I}_{2}),{\rm I}_{2}\big{)}\partial_{{\rm I}_{2}}{\rm I}_{1}(E;{\rm I}_{2})\stackrel{{\scriptstyle\eqref{peperosa}}}{{=}}-\frac{\partial_{{\rm I}_{2}}{\rm I}_{1}(E;{\rm I}_{2})}{\partial_{E}{\rm I}_{1}(E;{\rm I}_{2})}\,. (94)

Then, using (87), we rewrite (86) as

ω1(E,I2)=χI221EI1(E;I2),ω2(E,I2)=ω+2χI2(E+a0)χI22I2I1(E;I2)EI1(E;I2),\omega_{1}(E,{\rm I}_{2})=\chi\,{\rm I}_{2}^{2}\,\frac{1}{\partial_{E}{\rm I}_{1}(E;{\rm I}_{2})}\,,\qquad\omega_{2}(E,{\rm I}_{2})=\omega_{-}+2\chi\,{\rm I}_{2}(E+a_{0})-\chi\,{\rm I}_{2}^{2}\,\frac{\partial_{{\rm I}_{2}}{\rm I}_{1}(E;{\rm I}_{2})}{\partial_{E}{\rm I}_{1}(E;{\rm I}_{2})}\,,

namely, recalling (88),

ω1(E,I2)=3χI21E𝒜(E;I2),ω2(E,I2)=ω+2χI2(E+a0)χI22I2𝒜(E;I2)E𝒜(E;I2).\omega_{1}(E,{\rm I}_{2})=3\chi\,{\rm I}_{2}\,\frac{1}{\partial_{E}{\mathcal{A}}(E;{\rm I}_{2})}\,,\qquad\omega_{2}(E,{\rm I}_{2})=\omega_{-}+2\chi\,{\rm I}_{2}(E+a_{0})-\chi\,{\rm I}_{2}^{2}\,\frac{\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2})}{\partial_{E}{\mathcal{A}}(E;{\rm I}_{2})}\,. (95)

As a final symplectic change of variables we consider the inverse of the map in (36), namely the map Φ~:(I~,φ~)(I,θ)\tilde{\Phi}:(\tilde{I},\tilde{\varphi})\to({\rm I},\theta)

{I1=I~2I2=I~1+3I~2,{θ1=φ~23φ~1θ2=φ~1.\left\{\begin{array}[]{l}{\rm I}_{1}=\tilde{I}_{2}\\ {\rm I}_{2}=\tilde{I}_{1}+3\tilde{I}_{2}\,,\end{array}\right.\qquad\left\{\begin{array}[]{l}\theta_{1}=\tilde{\varphi}_{2}-3\tilde{\varphi}_{1}\\ \theta_{2}=\tilde{\varphi}_{1}\,.\end{array}\right. (96)

Applying the above map to the Hamiltonian 𝔼\mathbb{E} in (85) we get 𝔼~:=𝔼Φ~\tilde{\mathbb{E}}:=\mathbb{E}\circ\tilde{\Phi}, namely

𝔼~(I~)=𝔼(I~2,I~1+3I~2).\tilde{\mathbb{E}}(\tilde{I})=\mathbb{E}(\tilde{I}_{2},\tilde{I}_{1}+3\tilde{I}_{2})\,. (97)

In order to describe the frequencies of 𝔼~(I~)\tilde{\mathbb{E}}(\tilde{I}) it is convenient to use (E,I2)(E,{\rm I}_{2}) as variables instead of (I~1,I~2)(\tilde{I}_{1},\tilde{I}_{2}). The (invertible) relation between the two set of variable is the following

E=𝔼(I~2,I~1+3I~2)ω(I~1+3I~2)χ(I~1+3I~2)2a0,I2=I~1+3I~2,E=\frac{\mathbb{E}(\tilde{I}_{2},\tilde{I}_{1}+3\tilde{I}_{2})-\omega_{-}(\tilde{I}_{1}+3\tilde{I}_{2})}{\chi\,(\tilde{I}_{1}+3\tilde{I}_{2})^{2}}-a_{0}\,,\qquad{\rm I}_{2}=\tilde{I}_{1}+3\tilde{I}_{2}\,, (98)

(recalling (85), (87)). We are now able to evaluate the final nonlinear frequencies, namely the partial derivatives of 𝔼~(I~)\tilde{\mathbb{E}}(\tilde{I}) in (97), namely

ωnlr:=I~1𝔼~=ω2,ω+nlr:=I~2𝔼~=ω1+3ω2.\omega_{-}^{\rm nlr}:=\partial_{\tilde{I}_{1}}\tilde{\mathbb{E}}=\omega_{2}\,,\qquad\omega_{+}^{\rm nlr}:=\partial_{\tilde{I}_{2}}\tilde{\mathbb{E}}=\omega_{1}+3\omega_{2}\,. (99)

Indeed, recalling (86) and (95), we have

ωnlr(E,I2)\displaystyle\omega_{-}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω+χI2(2(E+a0)I2I2𝒜(E;I2)E𝒜(E;I2)),\displaystyle\omega_{-}+\chi\,{\rm I}_{2}\left(2(E+a_{0})-{\rm I}_{2}\,\frac{\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2})}{\partial_{E}{\mathcal{A}}(E;{\rm I}_{2})}\right)\,,
ω+nlr(E,I2)\displaystyle\omega_{+}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= 3ω+3χI2(2(E+a0)+1E𝒜(E;I2)I2I2𝒜(E;I2)E𝒜(E;I2)).\displaystyle 3\omega_{-}+3\chi\,{\rm I}_{2}\left(2(E+a_{0})+\frac{1}{\partial_{E}{\mathcal{A}}(E;{\rm I}_{2})}-{\rm I}_{2}\,\frac{\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2})}{\partial_{E}{\mathcal{A}}(E;{\rm I}_{2})}\right)\,. (100)

It remains to evaluate E𝒜(E;I2)\partial_{E}{\mathcal{A}}(E;{\rm I}_{2}) and I2𝒜(E;I2)\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2}).

Proposition 2.

Set242424𝐏(x)\mathbf{P}(x) was defined in (55). Note that 𝐏(xi)=0\mathbf{P}(x_{i})=0 for i=1,2,3,4i=1,2,3,4.

W(x,E,I2):=1π(1x)3x(E12a2x2a1x)2=1π𝐏(x).W(x,E,{\rm I}_{2}):=\frac{1}{\pi\sqrt{(1-x)^{3}x-\big{(}E-\frac{1}{2}a_{2}x^{2}-a_{1}x\big{)}^{2}}}=\frac{1}{\pi\sqrt{-\mathbf{P}(x)}}\,. (101)

In the zones labelled by I,II,III{\rm I},{\rm I\!I},{\rm I\!I\!I}

E𝒜(E;I2)\displaystyle\partial_{E}{\mathcal{A}}(E;{\rm I}_{2}) =\displaystyle= ±x1(E,I2)x2(E,I2)W(x,E,I2)𝑑x,\displaystyle\pm\int_{x_{1}(E,{\rm I}_{2})}^{x_{2}(E,{\rm I}_{2})}W(x,E,{\rm I}_{2})\,dx\,,
I2𝒜(E;I2)\displaystyle\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2}) =\displaystyle= ±σ3χI22x1(E,I2)x2(E,I2)xW(x,E,I2)𝑑x,\displaystyle\pm\frac{\sigma}{3\chi{\rm I}_{2}^{2}}\int_{x_{1}(E,{\rm I}_{2})}^{x_{2}(E,{\rm I}_{2})}x\,W(x,E,{\rm I}_{2})\,dx\,, (102)

where the ++ sign holds in the zones labelled by III{\rm I\!I\!I} and 𝒫01II,𝒫21+,II,𝒫12+,II{\mathcal{P}}_{01}^{{\rm I\!I}},{\mathcal{P}}_{21}^{+,{\rm I\!I}},{\mathcal{P}}_{12}^{+,{\rm I\!I}}, while the - sign in the zones labelled by I{\rm I} and 𝒫10II,𝒫21,II,𝒫12,II{\mathcal{P}}_{10}^{{\rm I\!I}},{\mathcal{P}}_{21}^{-,{\rm I\!I}},{\mathcal{P}}_{12}^{-,{\rm I\!I}}. Finally

E𝒜IV(E;I2)\displaystyle\partial_{E}{\mathcal{A}}^{{\rm I\!V}}(E;{\rm I}_{2}) =\displaystyle= ±x3(E,I2)x4(E,I2)W(x,E,I2)𝑑x,\displaystyle\pm\int_{x_{3}(E,{\rm I}_{2})}^{x_{4}(E,{\rm I}_{2})}W(x,E,{\rm I}_{2})\,dx\,,
I2𝒜IV(E;I2)\displaystyle\partial_{{\rm I}_{2}}{\mathcal{A}}^{{\rm I\!V}}(E;{\rm I}_{2}) =\displaystyle= ±σ3χI22x3(E,I2)x4(E,I2)xW(x,E,I2)𝑑x.\displaystyle\pm\frac{\sigma}{3\chi{\rm I}_{2}^{2}}\int_{x_{3}(E,{\rm I}_{2})}^{x_{4}(E,{\rm I}_{2})}x\,W(x,E,{\rm I}_{2})\,dx\,. (103)

where the ++ sign holds in the zones 𝒫21±,IV{\mathcal{P}}_{21}^{\pm,{\rm I\!V}} and the - one in 𝒫12±,IV{\mathcal{P}}_{12}^{\pm,{\rm I\!V}}.

proof. First note that from (52) and (1) we get

Eψ(x,I2)\displaystyle\partial_{E}\psi(x,{\rm I}_{2}) =\displaystyle= 1b(x)11(Ea(x)b(x))2=1b(x)2(Ea(x))2=W(x,E,I2),\displaystyle-\frac{1}{b(x)}\cdot\frac{1}{\sqrt{1-\left(\displaystyle\frac{E-a(x)}{b(x)}\right)^{2}}}=-\frac{1}{\sqrt{b(x)^{2}-(E-a(x))^{2}}}=-W(x,E,{\rm I}_{2})\,,
I2ψ(x,I2)\displaystyle\partial_{{\rm I}_{2}}\psi(x,{\rm I}_{2}) =\displaystyle= 3ωω+3χI22xW(x,E,I2).\displaystyle\frac{3\omega_{-}-\omega_{+}}{3\chi{\rm I}_{2}^{2}}\,x\,W(x,E,{\rm I}_{2})\,. (104)

Case I{\rm I}. Since ψ(x2(E,I2);E,I2)=ψ(x1(E,I2);E,I2)=0\psi(x_{2}(E,{\rm I}_{2});E,{\rm I}_{2})=\psi(x_{1}(E,{\rm I}_{2});E,{\rm I}_{2})=0, we have252525For brevity we omit to write the dependence on I2{\rm I}_{2}.

E𝒜I(E)\displaystyle\partial_{E}{\mathcal{A}}^{\rm I}(E) =\displaystyle= 1π[ψ(x2(E);E)Ex2(E)ψ(x1(E);E)Ex1(E)]+1πx1(E)x2(E)Eψ(x;E)dx\displaystyle\frac{1}{\pi}\left[\psi(x_{2}(E);E)\partial_{E}x_{2}(E)-\psi(x_{1}(E);E)\partial_{E}x_{1}(E)\right]+\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\partial_{E}\psi(x;E)\,dx
=\displaystyle= 1πx1(E)x2(E)Eψ(x;E)dx=(4.2)x1(E)x2(E)W(x,E,I2)𝑑x,,\displaystyle\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\partial_{E}\psi(x;E)\,dx\stackrel{{\scriptstyle\eqref{derivopsi(x)}}}{{=}}-\int_{x_{1}(E)}^{x_{2}(E)}W(x,E,{\rm I}_{2})\,dx\,,\,,

and, analogously,

I2𝒜I(E)\displaystyle\partial_{{\rm I}_{2}}{\mathcal{A}}^{\rm I}(E) =\displaystyle= 1π[ψ(x2(E);E)I2x2(E)ψ(x1(E);E)I2x1(E)]+1πx1(E)x2(E)I2ψ(x;E)dx\displaystyle\frac{1}{\pi}\left[\psi(x_{2}(E);E)\partial_{{\rm I}_{2}}x_{2}(E)-\psi(x_{1}(E);E)\partial_{{\rm I}_{2}}x_{1}(E)\right]+\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\partial_{{\rm I}_{2}}\psi(x;E)\,dx
=\displaystyle= 1πx1(E)x2(E)I2ψ(x;E)dx.\displaystyle\frac{1}{\pi}\int_{x_{1}(E)}^{x_{2}(E)}\partial_{{\rm I}_{2}}\psi(x;E)\,dx\,.

Then (2) follows by (4.2).
Case II{\rm I\!I}. We have two sub-cases: ψ(x2(E,I2);E,I2)=0,ψ(x1(E,I2);E,I2)=π\psi(x_{2}(E,{\rm I}_{2});E,{\rm I}_{2})=0\,,\psi(x_{1}(E,{\rm I}_{2});E,{\rm I}_{2})=\pi or ψ(x2(E,I2);E,I2)=π,ψ(x1(E,I2);E,I2)=0\psi(x_{2}(E,{\rm I}_{2});E,{\rm I}_{2})=\pi\,,\psi(x_{1}(E,{\rm I}_{2});E,{\rm I}_{2})=0. In the first sub-case by the first formula in (90) we have262626For brevity we omit to write the dependence on EE and I2{\rm I}_{2}.

E𝒜II(E)\displaystyle\partial_{E}{\mathcal{A}}^{\rm I\!I}(E) =\displaystyle= Ex1+1π[ψ(x2)Ex2ψ(x1)Ex1]+1πx1x2Eψdx\displaystyle\partial_{E}x_{1}+\frac{1}{\pi}\left[\psi(x_{2})\partial_{E}x_{2}-\psi(x_{1})\partial_{E}x_{1}\right]+\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{E}\psi\,dx
=\displaystyle= 1πx1x2Eψdx=(4.2)x1x2W(x)𝑑x,,\displaystyle\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{E}\psi\,dx\stackrel{{\scriptstyle\eqref{derivopsi(x)}}}{{=}}-\int_{x_{1}}^{x_{2}}W(x)\,dx\,,\,,

and, analogously,

I2𝒜II(E)\displaystyle\partial_{{\rm I}_{2}}{\mathcal{A}}^{\rm I\!I}(E) =\displaystyle= I2x1+1π[ψ(x2)I2x2ψ(x1)I2x1]+1πx1x2I2ψ(x)dx\displaystyle\partial_{{\rm I}_{2}}x_{1}+\frac{1}{\pi}\left[\psi(x_{2})\partial_{{\rm I}_{2}}x_{2}-\psi(x_{1})\partial_{{\rm I}_{2}}x_{1}\right]+\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{{\rm I}_{2}}\psi(x)\,dx
=\displaystyle= 1πx1x2I2ψ(x)dx.\displaystyle\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{{\rm I}_{2}}\psi(x)\,dx\,.

In the second sub-case by the second formula in (90) we have

E𝒜II(E)\displaystyle\partial_{E}{\mathcal{A}}^{\rm I\!I}(E) =\displaystyle= Ex21π[ψ(x2)Ex2ψ(x1)Ex1]1πx1x2Eψdx\displaystyle\partial_{E}x_{2}-\frac{1}{\pi}\left[\psi(x_{2})\partial_{E}x_{2}-\psi(x_{1})\partial_{E}x_{1}\right]-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{E}\psi\,dx
=\displaystyle= 1πx1x2Eψdx=(4.2)x1x2W(x)𝑑x,,\displaystyle-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{E}\psi\,dx\stackrel{{\scriptstyle\eqref{derivopsi(x)}}}{{=}}\int_{x_{1}}^{x_{2}}W(x)\,dx\,,\,,

and, analogously,

I2𝒜II(E)\displaystyle\partial_{{\rm I}_{2}}{\mathcal{A}}^{\rm I\!I}(E) =\displaystyle= I2x11π[ψ(x2)I2x2ψ(x1)I2x1]1πx1x2I2ψ(x)dx\displaystyle\partial_{{\rm I}_{2}}x_{1}-\frac{1}{\pi}\left[\psi(x_{2})\partial_{{\rm I}_{2}}x_{2}-\psi(x_{1})\partial_{{\rm I}_{2}}x_{1}\right]-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{{\rm I}_{2}}\psi(x)\,dx
=\displaystyle= 1πx1x2I2ψ(x)dx.\displaystyle-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{{\rm I}_{2}}\psi(x)\,dx\,.

We conclude by (4.2).
Case III{\rm I\!I\!I}. Since ψ(x2(E,I2);E,I2)=ψ(x1(E,I2);E,I2)=π\psi(x_{2}(E,{\rm I}_{2});E,{\rm I}_{2})=\psi(x_{1}(E,{\rm I}_{2});E,{\rm I}_{2})=\pi, by (91) we have

E𝒜III(E)\displaystyle\partial_{E}{\mathcal{A}}^{\rm I\!I\!I}(E) =\displaystyle= 1π[(πψ(x2))Ex2(πψ(x1))Ex1]1πx1x2Eψ(x)dx\displaystyle\frac{1}{\pi}\left[\big{(}\pi-\psi(x_{2})\big{)}\partial_{E}x_{2}-\big{(}\pi-\psi(x_{1})\big{)}\partial_{E}x_{1}\right]-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{E}\psi(x)\,dx
=\displaystyle= 1πx1x2Eψ(x)dx=(4.2)x1x2W(x)𝑑x,,\displaystyle-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{E}\psi(x)\,dx\stackrel{{\scriptstyle\eqref{derivopsi(x)}}}{{=}}\int_{x_{1}}^{x_{2}}W(x)\,dx\,,\,,

and, analogously,

I2𝒜III(E)\displaystyle\partial_{{\rm I}_{2}}{\mathcal{A}}^{\rm I\!I\!I}(E) =\displaystyle= 1π[(πψ(x2))I2x2(πψ(x1))I2x1]1πx1x2I2ψ(x)dx\displaystyle\frac{1}{\pi}\left[\big{(}\pi-\psi(x_{2})\big{)}\partial_{{\rm I}_{2}}x_{2}-\big{(}\pi-\psi(x_{1})\big{)}\partial_{{\rm I}_{2}}x_{1}\right]-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{{\rm I}_{2}}\psi(x)\,dx
=\displaystyle= 1πx1x2I2ψ(x)dx.\displaystyle-\frac{1}{\pi}\int_{x_{1}}^{x_{2}}\partial_{{\rm I}_{2}}\psi(x)\,dx\,.

Again we conclude by (4.2).
Case IV{\rm I\!V} is analogous to case II{\rm I\!I} sending 131\to 3 and 242\to 4. \Box

Remark 10.

In the case of exact 3:1 resonance, namely when ω+=3ω\omega_{+}=3\omega_{-} the functions FF and aa in (1), ψ\psi in (52), 𝐏\mathbf{P} in (55) with its roots xix_{i}, do not depend on I2{\rm I}_{2}. As a consequence the functions WW and 𝒜{\mathcal{A}} in Proposition 2 do not depend on I2{\rm I}_{2}. In particular I2𝒜(E;I2)=0\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2})=0, 𝒜(E;I2)=𝒜(E){\mathcal{A}}(E;{\rm I}_{2})={\mathcal{A}}(E) and formula (100) simplifies

ωnlr(E,I2)\displaystyle\omega_{-}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω+χI2(2(E+a0)),\displaystyle\omega_{-}+\chi\,{\rm I}_{2}\left(2(E+a_{0})\right)\,,
ω+nlr(E,I2)\displaystyle\omega_{+}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω++3χI2(2(E+a0)+1E𝒜(E)).\displaystyle\omega_{+}+3\chi\,{\rm I}_{2}\left(2(E+a_{0})+\frac{1}{\partial_{E}{\mathcal{A}}(E)}\right)\,. (105)

Let us now practically evaluate the elliptic integrals272727For a wide treatment of elliptic integrals see, e.g., [Elliptic]. W(x)𝑑x\int W(x)dx and xW(x)𝑑x\int xW(x)dx in (2). Assume that the polynomial 𝐏\mathbf{P} in (55) has 4 distinct roots: x1,x2,x3,x4x_{1},x_{2},x_{3},x_{4}, namely282828(1+a22/4)(1+a_{2}^{2}/4) is the coefficient of the fourth order term of 𝐏(x)\mathbf{P}(x).

𝐏(x)=(1+a224)(xx1)(xx2)(xx3)(xx4).\mathbf{P}(x)=\left(1+\frac{a_{2}^{2}}{4}\right)(x-x_{1})(x-x_{2})(x-x_{3})(x-x_{4})\,. (106)

We have two cases:
i) the four roots are real, x1<x2<x3<x4x_{1}<x_{2}<x_{3}<x_{4};
ii) we have two real roots, x1<x2x_{1}<x_{2} and two complex conjugated roots x3=x¯4x_{3}=\bar{x}_{4}.

4.3 Elliptic integrals: the case of four real roots

Let us define the cross ratio292929Note that λ0,1,\lambda\neq 0,1,\infty, since xjx_{j}, j=1,2,3,4,j=1,2,3,4, are distinct.:

λ:=(x2x1)(x4x3)(x3x1)(x4x2).\lambda:=\frac{(x_{2}-x_{1})(x_{4}-x_{3})}{(x_{3}-x_{1})(x_{4}-x_{2})}\,. (107)

Note that 0<λ<10<\lambda<1. Define the elliptic modulus:

𝚔:=1λ1+λ.{\mathtt{k}}:=\frac{1-\sqrt{\lambda}}{1+\sqrt{\lambda}}\,. (108)

Note that 0<𝚔<10<{\mathtt{k}}<1. We now construct a change of variable x=𝚃(z)x={\mathtt{T}}(z) given by a Möbius transformation

𝚃(z):=𝙰z+𝙱𝙲z+𝙳,{\mathtt{T}}(z):=\frac{{\mathtt{A}}z+{\mathtt{B}}}{{\mathtt{C}}z+{\mathtt{D}}}\,, (109)

such that303030As is well known the cross ratio is invariant under Möbius transformations. Then, by (110), we get λ=(𝚔1)2(𝚔+1)2\lambda=\frac{({\mathtt{k}}-1)^{2}}{({\mathtt{k}}+1)^{2}}, which is consistent with (108). See Lemma 2.3 and Exercise 2.4 of [Elliptic].

𝚃(1/𝚔)=x4,𝚃(1)=x3,𝚃(1)=x2,𝚃(1/𝚔)=x1.{\mathtt{T}}(-1/{\mathtt{k}})=x_{4}\,,\quad{\mathtt{T}}(-1)=x_{3}\,,\quad{\mathtt{T}}(1)=x_{2}\,,\quad{\mathtt{T}}(1/{\mathtt{k}})=x_{1}\,. (110)

It is simple to show (see formula (2.7) of [Elliptic]) that the transformation x=𝚃(z)x={\mathtt{T}}(z) can be construct as the solution of equation

(xx1)(x3x4)(xx4)(x3x1)=(z1/𝚔)(1+1/𝚔)(z+1/𝚔)(11/𝚔).\frac{(x-x_{1})(x_{3}-x_{4})}{(x-x_{4})(x_{3}-x_{1})}=\frac{(z-1/{\mathtt{k}})(-1+1/{\mathtt{k}})}{(z+1/{\mathtt{k}})(-1-1/{\mathtt{k}})}\,. (111)

Then the (real) coefficients of 𝚃{\mathtt{T}} are given by

𝙰\displaystyle{\mathtt{A}} :=\displaystyle:= 𝚔x1x3𝚔2x1x3+2𝚔x1x4𝚔x3x4+𝚔2x3x4,\displaystyle-{\mathtt{k}}x_{1}x_{3}-{\mathtt{k}}^{2}x_{1}x_{3}+2{\mathtt{k}}x_{1}x_{4}-{\mathtt{k}}x_{3}x_{4}+{\mathtt{k}}^{2}x_{3}x_{4}\,,
𝙱\displaystyle{\mathtt{B}} :=\displaystyle:= x1x3𝚔x1x3+2𝚔x1x4+x3x4𝚔x3x4,\displaystyle-x_{1}x_{3}-{\mathtt{k}}x_{1}x_{3}+2{\mathtt{k}}x_{1}x_{4}+x_{3}x_{4}-{\mathtt{k}}x_{3}x_{4}\,,
𝙲\displaystyle{\mathtt{C}} :=\displaystyle:= 𝚔x1𝚔2x12𝚔x3+𝚔x4+𝚔2x4,\displaystyle{\mathtt{k}}x_{1}-{\mathtt{k}}^{2}x_{1}-2{\mathtt{k}}x_{3}+{\mathtt{k}}x_{4}+{\mathtt{k}}^{2}x_{4}\,,
𝙳\displaystyle{\mathtt{D}} :=\displaystyle:= x1+𝚔x12𝚔x3+x4+𝚔x4.\displaystyle-x_{1}+{\mathtt{k}}x_{1}-2{\mathtt{k}}x_{3}+x_{4}+{\mathtt{k}}x_{4}\,. (112)

Note that, since 𝚔>0{\mathtt{k}}>0 and x1<x2<x3<x4x_{1}<x_{2}<x_{3}<x_{4} we have313131Indeed x1𝚔x12x3+x4+𝚔x4=0x_{1}-{\mathtt{k}}x_{1}-2x_{3}+x_{4}+{\mathtt{k}}x_{4}=0 implies, by (108), that λx1(1+λ)x3+x4=0\sqrt{\lambda}x_{1}-(1+\sqrt{\lambda})x_{3}+x_{4}=0, namely λ=(x4x3)/(x3x1)\sqrt{\lambda}=(x_{4}-x_{3})/(x_{3}-x_{1}). Squaring, by (107), we get the right hand side of (113).

𝙲=0(x2x1)(x3x1)=(x4x2)(x4x3).{\mathtt{C}}=0\qquad\iff\qquad(x_{2}-x_{1})(x_{3}-x_{1})=(x_{4}-x_{2})(x_{4}-x_{3})\,. (113)

Note also that 𝚃{\mathtt{T}} is invertible (on the Riemann sphere {}\mathbb{C}\cup\{\infty\}) and 𝚃()={\mathtt{T}}(\mathbb{R})=\mathbb{R}. Note that, since x1<x2<x3<x4x_{1}<x_{2}<x_{3}<x_{4} and 0<𝚔<10<{\mathtt{k}}<1, then

𝙰𝙳𝙱𝙲=2𝚔(1𝚔2)(x1x3)(x1x4)(x3x4)<0.{\mathtt{A}}{\mathtt{D}}-{\mathtt{B}}{\mathtt{C}}=2{\mathtt{k}}(1-{\mathtt{k}}^{2})(x_{1}-x_{3})(x_{1}-x_{4})(x_{3}-x_{4})<0\,. (114)

We have

d𝚃dz(z)=𝙰𝙳𝙱𝙲(𝙲z+𝙳)2<0.\frac{d{\mathtt{T}}}{dz}(z)=\frac{{\mathtt{A}}{\mathtt{D}}-{\mathtt{B}}{\mathtt{C}}}{({\mathtt{C}}z+{\mathtt{D}})^{2}}<0\,. (115)

Since

𝚃(z)𝚃(ζ)=(𝙰𝙲𝚃(ζ))zζ𝙲z+𝙳,{\mathtt{T}}(z)-{\mathtt{T}}(\zeta)=\big{(}{\mathtt{A}}-{\mathtt{C}}\,{\mathtt{T}}(\zeta)\big{)}\frac{z-\zeta}{{\mathtt{C}}z+{\mathtt{D}}}\,,

recalling (106) and (110), the substitution x=𝚃(z)x={\mathtt{T}}(z) gives

𝐏(𝚃(z))\displaystyle\mathbf{P}({\mathtt{T}}(z)) =\displaystyle= (1+a224)[𝚃(z)𝚃(1/𝚔)][𝚃(z)𝚃(1)][𝚃(z)𝚃(1)][𝚃(z)𝚃(1/𝚔)]\displaystyle(1+\frac{a_{2}^{2}}{4})[{\mathtt{T}}(z)-{\mathtt{T}}(1/{\mathtt{k}})][{\mathtt{T}}(z)-{\mathtt{T}}(1)][{\mathtt{T}}(z)-{\mathtt{T}}(-1)][{\mathtt{T}}(z)-{\mathtt{T}}(-1/{\mathtt{k}})] (116)
=\displaystyle= 𝚌p𝚔(z)(𝙲z+𝙳)4\displaystyle{\mathtt{c}}\frac{p_{\mathtt{k}}(z)}{({\mathtt{C}}z+{\mathtt{D}})^{4}}

where

p𝚔(z):=(1z2)(1𝚔2z2),𝚌:=(1+a22/4)𝚔21j4(𝙰𝙲xj).p_{\mathtt{k}}(z):=(1-z^{2})(1-{\mathtt{k}}^{2}z^{2})\,,\qquad{\mathtt{c}}:=(1+a_{2}^{2}/4){\mathtt{k}}^{-2}\prod_{1\leq j\leq 4}\big{(}{\mathtt{A}}-{\mathtt{C}}\,x_{j}\big{)}\,. (117)

Note that

1j4(𝙰𝙲xj)\displaystyle\prod_{1\leq j\leq 4}\big{(}{\mathtt{A}}-{\mathtt{C}}\,x_{j}\big{)} =\displaystyle= 16λ(1+λ)7(1+λ)4(x1x3)2(x1x4)2(x3x4)2\displaystyle 16\sqrt{\lambda}\big{(}1+\sqrt{\lambda}\big{)}^{-7}\big{(}-1+\sqrt{\lambda}\big{)}^{4}(x_{1}-x_{3})^{2}(x_{1}-x_{4})^{2}(x_{3}-x_{4})^{2}\cdot
((x1x2)(x3x4)+(x1x3)(x2x4)λ)> 0,\displaystyle\cdot\Big{(}(x_{1}-x_{2})(x_{3}-x_{4})+(x_{1}-x_{3})(x_{2}-x_{4})\sqrt{\lambda}\Big{)}\,>\,0\,,

which implies that 𝚌>0{\mathtt{c}}>0.

By (101), (115), (110) and (116) we get

x1x2W(x)𝑑x=𝙱𝙲𝙰𝙳π𝚌11/𝚔dzp𝚔(z)=𝙱𝙲𝙰𝙳π𝚌1/𝚔1dzp𝚔(z)=x3x4W(x)𝑑x,\int_{x_{1}}^{x_{2}}W(x)dx=\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}\int_{1}^{1/{\mathtt{k}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}\int_{-1/{\mathtt{k}}}^{-1}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\int_{x_{3}}^{x_{4}}W(x)dx\,, (118)

where the second equality holds since p𝚔(z)p_{\mathtt{k}}(z) is even. It remains to evaluate 11/𝚔dzp𝚔(z)\int_{1}^{1/{\mathtt{k}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}, which is an elliptic integral. We get the complete elliptic integral of the first kind323232Note that 𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺:(,1)(0,+){\tt EllipticK}:(-\infty,1)\to(0,+\infty) is an analytic strictly increasing function with limx𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(x)=0\lim_{x\to-\infty}{\tt EllipticK}(x)=0 and limx1𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(x)=+\lim_{x\to 1^{-}}{\tt EllipticK}(x)=+\infty.

11/𝚔dzp𝚔(z)=01ds(1s2)(1m1s2)=:𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(m1),m1:=1𝚔2,\int_{1}^{1/{\mathtt{k}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\int_{0}^{1}\frac{ds}{\sqrt{(1-s^{2})(1-m_{1}s^{2})}}=:{\tt EllipticK}(m_{1})\,,\qquad m_{1}:=1-{\mathtt{k}}^{2}\,, (119)

by the change of variable z=11m1s2z=\frac{1}{\sqrt{1-m_{1}s^{2}}}. Note that, since 0<𝚔<10<{\mathtt{k}}<1 we have 0<m1<10<m_{1}<1. By (118) and (119) we get

x1x2W(x)𝑑x=x3x4W(x)𝑑x=𝙱𝙲𝙰𝙳π𝚌𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(1𝚔2).\int_{x_{1}}^{x_{2}}W(x)dx=\int_{x_{3}}^{x_{4}}W(x)dx=\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}{\tt EllipticK}(1-{\mathtt{k}}^{2})\,. (120)

Similarly

x1x2xW(x)𝑑x=𝙱𝙲𝙰𝙳π𝚌11/𝚔𝙰z+𝙱𝙲z+𝙳dzp𝚔(z).\int_{x_{1}}^{x_{2}}x\,W(x)dx=\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}\int_{1}^{1/{\mathtt{k}}}\frac{{\mathtt{A}}z+{\mathtt{B}}}{{\mathtt{C}}z+{\mathtt{D}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}\,. (121)

We have two cases: 𝙲0{\mathtt{C}}\neq 0 and 𝙲=0{\mathtt{C}}=0. In the first case setting

𝚊:=𝙳𝙲,𝚋:=𝙱𝙲𝙰𝙳𝙲2,{\mathtt{a}}:=\frac{{\mathtt{D}}}{{\mathtt{C}}}\,,\qquad{\mathtt{b}}:=\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{{\mathtt{C}}^{2}}\,, (122)

we have

11/𝚔𝙰z+𝙱𝙲z+𝙳dzp𝚔(z)=𝙰𝙲11/𝚔dzp𝚔(z)+𝚋11/𝚔1z+𝚊dzp𝚔(z).\int_{1}^{1/{\mathtt{k}}}\frac{{\mathtt{A}}z+{\mathtt{B}}}{{\mathtt{C}}z+{\mathtt{D}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\frac{{\mathtt{A}}}{{\mathtt{C}}}\int_{1}^{1/{\mathtt{k}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}+{\mathtt{b}}\int_{1}^{1/{\mathtt{k}}}\frac{1}{z+{\mathtt{a}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}\,. (123)

Note that the real number 𝚊{\mathtt{a}} satisfies |𝚊|>1/𝚔|{\mathtt{a}}|>1/{\mathtt{k}}. Otherwise, by contradiction, assume that |𝚊|1/𝚔|{\mathtt{a}}|\leq 1/{\mathtt{k}}. Since 𝚃(𝚊)={\mathtt{T}}(-{\mathtt{a}})=\infty (in the Riemann sphere), by (110) we have that |𝚊|=|𝚊|<1/𝚔|{\mathtt{a}}|=|-{\mathtt{a}}|<1/{\mathtt{k}}. Since the real function 𝚃(z){\mathtt{T}}(z) has a vertical asymptote at z=𝚊z=-{\mathtt{a}}, has 𝙰/𝙲{\mathtt{A}}/{\mathtt{C}} as horizontal asymptote and is decreasing (recall (115)) in the intervals (,𝚊)(-\infty,-{\mathtt{a}}) and (𝚊,+)(-{\mathtt{a}},+\infty), we have that 𝚃(1/𝚔)<𝙰/𝙲<𝚃(1/𝚔){\mathtt{T}}(-1/{\mathtt{k}})<{\mathtt{A}}/{\mathtt{C}}<{\mathtt{T}}(1/{\mathtt{k}}). Then by (110) we obtain x4<x1x_{4}<x_{1}, which is a contradiction. We conclude that |𝚊|>1/𝚔|{\mathtt{a}}|>1/{\mathtt{k}}.

The first integral on the right hand side of (123) has been evaluated in (119). Regarding the second one we have

11/𝚔1z+𝚊dzp𝚔(z)=11/𝚔zz2𝚊2dzp𝚔(z)11/𝚔𝚊z2𝚊2dzp𝚔(z).\int_{1}^{1/{\mathtt{k}}}\frac{1}{z+{\mathtt{a}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\int_{1}^{1/{\mathtt{k}}}\frac{z}{z^{2}-{\mathtt{a}}^{2}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}-\int_{1}^{1/{\mathtt{k}}}\frac{{\mathtt{a}}}{z^{2}-{\mathtt{a}}^{2}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}\,. (124)

We have

11/𝚔zz2𝚊2dzp𝚔(z)=z2=t1211/𝚔21t𝚊2dt(t1)(1𝚔2t)=π2(𝚊21)(𝚊2𝚔21)\int_{1}^{1/{\mathtt{k}}}\frac{z}{z^{2}-{\mathtt{a}}^{2}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}\stackrel{{\scriptstyle z^{2}=t}}{{=}}\frac{1}{2}\int_{1}^{1/{\mathtt{k}}^{2}}\frac{1}{t-{\mathtt{a}}^{2}}\frac{dt}{\sqrt{(t-1)(1-{\mathtt{k}}^{2}t)}}=-\frac{\pi}{2\sqrt{({\mathtt{a}}^{2}-1)({\mathtt{a}}^{2}{\mathtt{k}}^{2}-1)}} (125)

and, by the change of variable z=11m1s2z=\frac{1}{\sqrt{1-m_{1}s^{2}}}, we obtain

11/𝚔𝚊z2𝚊2dzp𝚔(z)\displaystyle\int_{1}^{1/{\mathtt{k}}}\frac{{\mathtt{a}}}{z^{2}-{\mathtt{a}}^{2}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}} =\displaystyle= 𝚊011m1s2(1𝚊2+m1𝚊2s2)(1s2)(1m1s2)𝑑s\displaystyle{\mathtt{a}}\int_{0}^{1}\frac{1-m_{1}s^{2}}{(1-{\mathtt{a}}^{2}+m_{1}{\mathtt{a}}^{2}s^{2})\sqrt{(1-s^{2})(1-m_{1}s^{2})}}\,ds (126)
=\displaystyle= 1𝚊011(1s2)(1m1s2)𝑑s\displaystyle-\frac{1}{{\mathtt{a}}}\int_{0}^{1}\frac{1}{\sqrt{(1-s^{2})(1-m_{1}s^{2})}}\,ds (128)
+1m1𝚊30111n1s21(1s2)(1m1s2)𝑑s,\displaystyle+\frac{1}{m_{1}{\mathtt{a}}^{3}}\int_{0}^{1}\frac{1}{\frac{1}{n_{1}}-s^{2}}\frac{1}{\sqrt{(1-s^{2})(1-m_{1}s^{2})}}\,ds\,,

where

0<n1:=m1𝚊2𝚊21=(119)𝚊2𝚔2𝚊2𝚊21<1,0<n_{1}:=\frac{m_{1}{\mathtt{a}}^{2}}{{\mathtt{a}}^{2}-1}\stackrel{{\scriptstyle\eqref{EllipticK}}}{{=}}\frac{{\mathtt{a}}^{2}-{\mathtt{k}}^{2}{\mathtt{a}}^{2}}{{\mathtt{a}}^{2}-1}<1\,,

since |𝚊|>1/𝚔>1|{\mathtt{a}}|>1/{\mathtt{k}}>1. Recalling that

𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙿𝚒(n1,m1):=0111n1s21(1s2)(1m1s2)𝑑s\mathtt{EllipticPi}(n_{1},m_{1}):=\int_{0}^{1}\frac{1}{1-n_{1}s^{2}}\frac{1}{\sqrt{(1-s^{2})(1-m_{1}s^{2})}}\,ds (129)

is the complete elliptic integral of the third kind, by (119) and (121)-(128) and noting that 𝙰𝙲+𝚋𝚊=𝙱𝙳,\frac{{\mathtt{A}}}{{\mathtt{C}}}+\frac{{\mathtt{b}}}{{\mathtt{a}}}=\frac{{\mathtt{B}}}{{\mathtt{D}}}, we get

x1x2xW(x)𝑑x=\displaystyle\int_{x_{1}}^{x_{2}}x\,W(x)dx= (130)
𝙱𝙲𝙰𝙳π𝚌(𝙱𝙳𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(m1)π𝚋2(𝚊21)(𝚊2𝚔21)n1𝚋m1𝚊3𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙿𝚒(n1,m1)).\displaystyle\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}\left(\frac{{\mathtt{B}}}{{\mathtt{D}}}{\tt EllipticK}(m_{1})-\frac{\pi{\mathtt{b}}}{2\sqrt{({\mathtt{a}}^{2}-1)({\mathtt{a}}^{2}{\mathtt{k}}^{2}-1)}}-\frac{n_{1}{\mathtt{b}}}{m_{1}{\mathtt{a}}^{3}}\mathtt{EllipticPi}(n_{1},m_{1})\right)\,.

Let us now consider the case 𝙲=0{\mathtt{C}}=0. By (121) and since

11/𝚔zdzp𝚔(z)=z2=t1211/𝚔2dt(t1)(1𝚔2t)=π2𝚔,\int_{1}^{1/{\mathtt{k}}}\frac{z\,dz}{\sqrt{-p_{\mathtt{k}}(z)}}\stackrel{{\scriptstyle z^{2}=t}}{{=}}\frac{1}{2}\int_{1}^{1/{\mathtt{k}}^{2}}\frac{dt}{\sqrt{(t-1)(1-{\mathtt{k}}^{2}t)}}=\frac{\pi}{2{\mathtt{k}}}\,,

we have

x1x2xW(x)𝑑x\displaystyle\int_{x_{1}}^{x_{2}}x\,W(x)dx =\displaystyle= 𝙰𝙱π𝚌𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(m1)𝙰2π𝚌11/𝚔zdzp𝚔(z)\displaystyle-\frac{{\mathtt{A}}{\mathtt{B}}}{\pi\sqrt{\mathtt{c}}}{\tt EllipticK}(m_{1})-\frac{{\mathtt{A}}^{2}}{\pi\sqrt{\mathtt{c}}}\int_{1}^{1/{\mathtt{k}}}\frac{z\,dz}{\sqrt{-p_{\mathtt{k}}(z)}} (131)
=\displaystyle= 𝙰𝙱π𝚌𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(m1)𝙰22𝚔𝚌,\displaystyle-\frac{{\mathtt{A}}{\mathtt{B}}}{\pi\sqrt{\mathtt{c}}}{\tt EllipticK}(m_{1})-\frac{{\mathtt{A}}^{2}}{2{\mathtt{k}}\sqrt{\mathtt{c}}}\,,

which is exactly the limit for 𝙲0{\mathtt{C}}\to 0 of (130).

Let us now evaluate

x3x4xW(x)𝑑x=𝙱𝙲𝙰𝙳π𝚌1/𝚔1𝙰z+𝙱𝙲z+𝙳dzp𝚔(z).\int_{x_{3}}^{x_{4}}x\,W(x)dx=\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}\int_{-1/{\mathtt{k}}}^{-1}\frac{{\mathtt{A}}z+{\mathtt{B}}}{{\mathtt{C}}z+{\mathtt{D}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}\,. (132)

Changing variable zzz\to-z we get

1/𝚔1𝙰z+𝙱𝙲z+𝙳dzp𝚔(z)=11/𝚔𝙰z𝙱𝙲z𝙳dzp𝚔(z)\int_{-1/{\mathtt{k}}}^{-1}\frac{{\mathtt{A}}z+{\mathtt{B}}}{{\mathtt{C}}z+{\mathtt{D}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\int_{1}^{1/{\mathtt{k}}}\frac{{\mathtt{A}}z-{\mathtt{B}}}{{\mathtt{C}}z-{\mathtt{D}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}

When 𝙲0{\mathtt{C}}\neq 0, recalling (122), we have

11/𝚔𝙰z+𝙱𝙲z+𝙳dzp𝚔(z)=𝙰𝙲11/𝚔dzp𝚔(z)𝚋11/𝚔1z𝚊dzp𝚔(z).\int_{1}^{1/{\mathtt{k}}}\frac{{\mathtt{A}}z+{\mathtt{B}}}{{\mathtt{C}}z+{\mathtt{D}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}=\frac{{\mathtt{A}}}{{\mathtt{C}}}\int_{1}^{1/{\mathtt{k}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}-{\mathtt{b}}\int_{1}^{1/{\mathtt{k}}}\frac{1}{z-{\mathtt{a}}}\frac{dz}{\sqrt{-p_{\mathtt{k}}(z)}}\,. (133)

Reasoning as in derivation of (130) we get

x3x4xW(x)𝑑x=\displaystyle\int_{x_{3}}^{x_{4}}x\,W(x)dx= (134)
𝙱𝙲𝙰𝙳π𝚌(𝙱𝙳𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(m1)+π𝚋2(𝚊21)(𝚊2𝚔21)n1𝚋m1𝚊3𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙿𝚒(n1,m1)).\displaystyle\frac{{\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}}{\pi\sqrt{\mathtt{c}}}\left(\frac{{\mathtt{B}}}{{\mathtt{D}}}{\tt EllipticK}(m_{1})+\frac{\pi{\mathtt{b}}}{2\sqrt{({\mathtt{a}}^{2}-1)({\mathtt{a}}^{2}{\mathtt{k}}^{2}-1)}}-\frac{n_{1}{\mathtt{b}}}{m_{1}{\mathtt{a}}^{3}}\mathtt{EllipticPi}(n_{1},m_{1})\right)\,.

The case 𝙲=0{\mathtt{C}}=0 can be obtained taking the limit for 𝙲0{\mathtt{C}}\to 0 of (134).

4.4 Elliptic integrals: the case of two real roots

In this case define the cross ratio and the elliptic modulus as:333333Setting w:=(x1x4)(x2x3)w:=(x_{1}-x_{4})(x_{2}-x_{3}) we have that λ=w¯/w\lambda_{*}=\bar{w}/w since x1,x2x_{1},x_{2}\in\mathbb{R} and x¯3=x4\bar{x}_{3}=x_{4}. Then λ:=|w|/w\sqrt{\lambda}_{*}:=|w|/w satisfies (λ)2=|w|2/w2=λ(\sqrt{\lambda}_{*})^{2}=|w|^{2}/w^{2}=\lambda_{*}.

λ:=(x1x3)(x2x4)(x1x4)(x2x3),𝚔:=1λ1+λ,λ:=|x1x4||x2x3|(x1x4)(x2x3).\lambda_{*}:=\frac{(x_{1}-x_{3})(x_{2}-x_{4})}{(x_{1}-x_{4})(x_{2}-x_{3})}\,,\quad{\mathtt{k}}_{*}:=\frac{1-\sqrt{\lambda}_{*}}{1+\sqrt{\lambda}_{*}}\,,\quad\sqrt{\lambda}_{*}:=\frac{|x_{1}-x_{4}||x_{2}-x_{3}|}{(x_{1}-x_{4})(x_{2}-x_{3})}\,. (135)

Since |λ|=1|\sqrt{\lambda}_{*}|=1 there exists a real θ\theta such that λ=eiθ\sqrt{\lambda}_{*}=e^{\mathrm{i}\theta}, so that 𝚔=itan(θ/2){\mathtt{k}}_{*}=-\mathrm{i}\tan(\theta/2), namely 𝚔{\mathtt{k}}_{*} is purely imaginary and 𝚔2<0{\mathtt{k}}_{*}^{2}<0 (see page 40 of [Elliptic] for details). We now construct a Möbius transformation

𝚃(z):=𝙰z+𝙱𝙲z+𝙳,{\mathtt{T}}_{*}(z):=\frac{{\mathtt{A}}_{*}z+{\mathtt{B}}_{*}}{{\mathtt{C}}_{*}z+{\mathtt{D}}_{*}}\,, (136)

such that

𝚃(1/𝚔)=x4,𝚃(1)=x2,𝚃(1)=x1,𝚃(1/𝚔)=x3.{\mathtt{T}}_{*}(-1/{\mathtt{k}}_{*})=x_{4}\,,\quad{\mathtt{T}}_{*}(-1)=x_{2}\,,\quad{\mathtt{T}}_{*}(1)=x_{1}\,,\quad{\mathtt{T}}_{*}(1/{\mathtt{k}}_{*})=x_{3}\,. (137)

It is simple to show (see formula (2.7) of [Elliptic]) that the transformation x=𝚃(z)x={\mathtt{T}}_{*}(z) can be construct as the solution of equation

(xx3)(x2x4)(xx4)(x2x3)=(z1/𝚔)(1+1/𝚔)(z+1/𝚔)(11/𝚔).\frac{(x-x_{3})(x_{2}-x_{4})}{(x-x_{4})(x_{2}-x_{3})}=\frac{(z-1/{\mathtt{k}}_{*})(-1+1/{\mathtt{k}}_{*})}{(z+1/{\mathtt{k}}_{*})(-1-1/{\mathtt{k}}_{*})}\,. (138)

Note that 𝚃{\mathtt{T}}_{*} is invertible (on the Riemann sphere {}\mathbb{C}\cup\{\infty\}) and 𝚃()={\mathtt{T}}_{*}(\mathbb{R})=\mathbb{R}. Indeed the last claim is equivalent to show that if xx\in\mathbb{R} in (138) then also zz\in\mathbb{R}. This can be proven taking the complex conjugate of (138) and inverting both sides343434More precisely denoting by \ell and rr, respectively, the left and right hand side of (138), we have that, if xx\in\mathbb{R} then =1/¯\ell=1/\bar{\ell} (recall x¯3=x4\bar{x}_{3}=x_{4}), which implies r=1/r¯r=1/\bar{r} (recall 𝚔¯=𝚔\bar{\mathtt{k}}_{*}=-{\mathtt{k}}_{*}), namely zaz+a=z¯az¯+a\frac{z-a}{z+a}=\frac{\bar{z}-a}{\bar{z}+a} denoting for brevity a:=1/𝚔a:=1/{\mathtt{k}}_{*}. Then z=z¯z=\bar{z}, namely zz\in\mathbb{R}. . The coefficients of 𝚃{\mathtt{T}}_{*}, which are given by

𝙰\displaystyle{\mathtt{A}}_{*} :=\displaystyle:= x2(x3+x4)+𝚔x2(x4x3)+2x3x4,\displaystyle-x_{2}(x_{3}+x_{4})+{\mathtt{k}}_{*}x_{2}(x_{4}-x_{3})+2x_{3}x_{4}\,,
𝙱\displaystyle{\mathtt{B}}_{*} :=\displaystyle:= x2(x3+x4)+x2(x4x3)/𝚔+2x3x4,\displaystyle-x_{2}(x_{3}+x_{4})+x_{2}(x_{4}-x_{3})/{\mathtt{k}}_{*}+2x_{3}x_{4}\,,
𝙲\displaystyle{\mathtt{C}}_{*} :=\displaystyle:= 2x2+x3+x4+𝚔(x4x3),\displaystyle-2x_{2}+x_{3}+x_{4}+{\mathtt{k}}_{*}(x_{4}-x_{3})\,,
𝙳\displaystyle{\mathtt{D}}_{*} :=\displaystyle:= 2x2+x3+x4+(x4x3)/𝚔,\displaystyle-2x_{2}+x_{3}+x_{4}+(x_{4}-x_{3})/{\mathtt{k}}_{*}\,, (139)

are real since, x2,x_{2}\in\mathbb{R}, x¯3=x4\bar{x}_{3}=x_{4} and 𝚔{\mathtt{k}}_{*} is purely imaginary. We have that

d𝚃dz(z)=𝙰𝙳𝙱𝙲(𝙲z+𝙳)2<0forz,\frac{d{\mathtt{T}}_{*}}{dz}(z)=\frac{{\mathtt{A}}_{*}{\mathtt{D}}_{*}-{\mathtt{B}}_{*}{\mathtt{C}}_{*}}{({\mathtt{C}}_{*}z+{\mathtt{D}}_{*})^{2}}<0\qquad\mbox{for}\qquad z\in\mathbb{R}\,, (140)

since 𝚃(1)=x2>𝚃(1)=x1{\mathtt{T}}_{*}(-1)=x_{2}>{\mathtt{T}}_{*}(1)=x_{1} by (137). It follows that

𝙰𝙳𝙱𝙲=2(𝚔21)(x2x3)(x2x4)(x3x4)/𝚔<0.{\mathtt{A}}_{*}{\mathtt{D}}_{*}-{\mathtt{B}}_{*}{\mathtt{C}}_{*}=2({\mathtt{k}}_{*}^{2}-1)(x_{2}-x_{3})(x_{2}-x_{4})(x_{3}-x_{4})/{\mathtt{k}}_{*}<0\,. (141)

Arguing as in (116), the substitution x=𝚃(z)x={\mathtt{T}}_{*}(z) gives

𝐏(𝚃(z))=𝚌p𝚔(z)(𝙲z+𝙳)4\mathbf{P}({\mathtt{T}}_{*}(z))=-{\mathtt{c}}_{*}\frac{p_{{\mathtt{k}}_{*}}(z)}{({\mathtt{C}}_{*}z+{\mathtt{D}}_{*})^{4}} (142)

where p𝚔(z):=(1z2)(1𝚔2z2)p_{{\mathtt{k}}_{*}}(z):=(1-z^{2})(1-{\mathtt{k}}_{*}^{2}z^{2}) and

𝚌:=(1+a22/4)𝚔21j4(𝙰𝙲xj).{\mathtt{c}}_{*}:=-(1+a_{2}^{2}/4){\mathtt{k}}_{*}^{-2}\prod_{1\leq j\leq 4}\big{(}{\mathtt{A}}_{*}-{\mathtt{C}}_{*}\,x_{j}\big{)}\,. (143)

Note that 𝚌>0{\mathtt{c}}_{*}>0; indeed 𝚔2<0{\mathtt{k}}_{*}^{-2}<0, (𝙰𝙲x3)(𝙰𝙲x4)=|𝙰𝙲x3|2>0({\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{3})({\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{4})=|{\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{3}|^{2}>0 (since353535Note that 𝙰𝙲xj0{\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{j}\neq 0 for j=1,2,3,4j=1,2,3,4, since 𝙰𝙲xj=0{\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{j}=0 implies 𝚃1(xj)={\mathtt{T}}_{*}^{-1}(x_{j})=\infty that contradicts (137). 𝙰,𝙲{\mathtt{A}}_{*},{\mathtt{C}}_{*}\in\mathbb{R} and x¯3=x4\bar{x}_{3}=x_{4}), finally, denoting for brevity w:=(x1x4)(x2x3)w:=(x_{1}-x_{4})(x_{2}-x_{3}), we have

(𝙰𝙲x1)(𝙰𝙲x2)=4(x2x3)2(x1x4)(x2x4)1+w¯/|w|1+w/|w|\displaystyle({\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{1})({\mathtt{A}}_{*}-{\mathtt{C}}_{*}x_{2})=4(x_{2}-x_{3})^{2}(x_{1}-x_{4})(x_{2}-x_{4})\frac{1+\bar{w}/|w|}{1+w/|w|}
=4|x2x3|2w1+w¯/|w|1+w/|w|=4|x2x3|2|w|>0.\displaystyle=4|x_{2}-x_{3}|^{2}w\frac{1+\bar{w}/|w|}{1+w/|w|}=4|x_{2}-x_{3}|^{2}|w|>0\,.

By (101), (140), (137) and (142) we get

x1x2W(x)𝑑x=𝙱𝙲𝙰𝙳π𝚌11dzp𝚔(z)=2𝙱𝙲𝙰𝙳π𝚌01dzp𝚔(z),\int_{x_{1}}^{x_{2}}W(x)dx=\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{\pi\sqrt{{\mathtt{c}}_{*}}}\int_{-1}^{1}\frac{dz}{\sqrt{p_{\mathtt{k}}(z)}}=2\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{\pi\sqrt{{\mathtt{c}}_{*}}}\int_{0}^{1}\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}\,, (144)

since p𝚔(z)p_{{\mathtt{k}}_{*}}(z) is even; in particular

01dzp𝚔(z)=01dz(1z2)(1𝚔2z2)=:𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(𝚔2).\int_{0}^{1}\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}=\int_{0}^{1}\frac{dz}{\sqrt{(1-z^{2})(1-{\mathtt{k}}_{*}^{2}z^{2})}}=:{\tt EllipticK}({\mathtt{k}}_{*}^{2})\,. (145)

By (144) and (145) we get

x1x2W(x)𝑑x=2𝙱𝙲𝙰𝙳π𝚌𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(𝚔2).\int_{x_{1}}^{x_{2}}W(x)dx=2\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{\pi\sqrt{{\mathtt{c}}_{*}}}{\tt EllipticK}({\mathtt{k}}_{*}^{2})\,. (146)

Arguing as in (144) and recalling the definition of p𝚔(z)p_{{\mathtt{k}}_{*}}(z) in (117) we obtain

x1x2xW(x)𝑑x=𝙱𝙲𝙰𝙳π𝚌11𝙰z+𝙱𝙲z+𝙳dzp𝚔(z)=𝙱𝙲𝙰𝙳π𝚌11𝚃(z)dzp𝚔(z),\int_{x_{1}}^{x_{2}}xW(x)dx=\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{\pi\sqrt{{\mathtt{c}}_{*}}}\int_{-1}^{1}\frac{{\mathtt{A}}_{*}z+{\mathtt{B}}_{*}}{{\mathtt{C}}_{*}z+{\mathtt{D}}_{*}}\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}=\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{\pi\sqrt{{\mathtt{c}}_{*}}}\int_{-1}^{1}{\mathtt{T}}_{*}(z)\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}\,, (147)

Since the last integration interval is symmetric and p𝚔(z)p_{{\mathtt{k}}_{*}}(z) is an even function we can substitute 𝚃(z){\mathtt{T}}_{*}(z) with its even part, namely

12(𝚃(z)+𝚃(z))=𝙰𝙲z2𝙱𝙳𝙲2z2𝙳2=𝙰𝙲+𝙱𝙲𝙰𝙳𝙲𝙳11(𝙲/𝙳)2z2,\frac{1}{2}\big{(}{\mathtt{T}}_{*}(z)+{\mathtt{T}}_{*}(-z)\big{)}=\frac{{\mathtt{A}}_{*}{\mathtt{C}}_{*}z^{2}-{\mathtt{B}}_{*}{\mathtt{D}}_{*}}{{\mathtt{C}}_{*}^{2}z^{2}-{\mathtt{D}}_{*}^{2}}=\frac{{\mathtt{A}}_{*}}{{\mathtt{C}}_{*}}+\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{{\mathtt{C}}_{*}{\mathtt{D}}_{*}}\frac{1}{1-({\mathtt{C}}_{*}/{\mathtt{D}}_{*})^{2}z^{2}}\,,

obtaining (since the integrands are even)

11𝚃(z)dzp𝚔(z)=2𝙰𝙲01dzp𝚔(z)+2𝙱𝙲𝙰𝙳𝙲𝙳0111(𝙲/𝙳)2z2dzp𝚔(z).\int_{-1}^{1}{\mathtt{T}}_{*}(z)\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}=2\frac{{\mathtt{A}}_{*}}{{\mathtt{C}}_{*}}\int_{0}^{1}\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}+2\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{{\mathtt{C}}_{*}{\mathtt{D}}_{*}}\int_{0}^{1}\frac{1}{1-({\mathtt{C}}_{*}/{\mathtt{D}}_{*})^{2}z^{2}}\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}\,.

Then, by (119) and (129)

11𝚃(z)dzp𝚔(z)=2𝙰𝙲𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(𝚔2)+2𝙱𝙲𝙰𝙳𝙲𝙳𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙿𝚒(𝙲2𝙳2,𝚔2).\int_{-1}^{1}{\mathtt{T}}_{*}(z)\frac{dz}{\sqrt{p_{{\mathtt{k}}_{*}}(z)}}=2\frac{{\mathtt{A}}_{*}}{{\mathtt{C}}_{*}}{\tt EllipticK}({\mathtt{k}}_{*}^{2})+2\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{{\mathtt{C}}_{*}{\mathtt{D}}_{*}}\mathtt{EllipticPi}({\mathtt{C}}_{*}^{2}{\mathtt{D}}_{*}^{-2},{\mathtt{k}}_{*}^{2})\,. (148)

Recalling (2), (146), (147), (148), in the case of two real roots, the last term in (100) writes

χI22I2𝒜(E;I2)E𝒜(E;I2)=ω+3ω3(𝙰𝙲+𝙱𝙲𝙰𝙳𝙲𝙳𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙿𝚒(𝙲2𝙳2,𝚔2)𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(𝚔2)).\chi\,{\rm I}_{2}^{2}\,\frac{\partial_{{\rm I}_{2}}{\mathcal{A}}(E;{\rm I}_{2})}{\partial_{E}{\mathcal{A}}(E;{\rm I}_{2})}=\frac{\omega_{+}-3\omega_{-}}{3}\left(\frac{{\mathtt{A}}_{*}}{{\mathtt{C}}_{*}}+\frac{{\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}}{{\mathtt{C}}_{*}{\mathtt{D}}_{*}}\frac{\mathtt{EllipticPi}({\mathtt{C}}_{*}^{2}{\mathtt{D}}_{*}^{-2},{\mathtt{k}}_{*}^{2})}{{\tt EllipticK}({\mathtt{k}}_{*}^{2})}\right)\,. (149)

4.5 Explicit expression of the nonlinear frequencies for the exact 3:1 resonance

In this subsection we consider only the case of exact 3:1 resonance, namely when ω+=3ω\omega_{+}=3\omega_{-}. Let the energy EE be such that the polynomial 𝐏\mathbf{P} in (55) has 4 distinct roots: x1(E),x2(E),x_{1}(E),x_{2}(E), x3(E),x4(E).x_{3}(E),x_{4}(E). Recalling the definitions of 𝚔{\mathtt{k}} in (108), of 𝙰,𝙱,𝙲,𝙳{\mathtt{A}},{\mathtt{B}},{\mathtt{C}},{\mathtt{D}} in (112), of 𝚌{\mathtt{c}} in (117), of 𝚔{\mathtt{k}}_{*} in (135), of 𝙰,𝙱,𝙲,𝙳{\mathtt{A}}_{*},{\mathtt{B}}_{*},{\mathtt{C}}_{*},{\mathtt{D}}_{*} in (139), of 𝚌{\mathtt{c}}_{*} in (143), note that all these quantities depend on EE. Recalling Proposition 2, (2), (2), (120) and (146), formula (105) in Remark 10 becomes

ωnlr(E,I2)\displaystyle\omega_{-}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω+2χI2(E+a0),\displaystyle\omega_{-}+2\chi\,{\rm I}_{2}(E+a_{0})\,,
ω+nlr(E,I2)\displaystyle\omega_{+}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω++6χI2(E+a0+V(E)),\displaystyle\omega_{+}+6\chi\,{\rm I}_{2}\left(E+a_{0}+V(E)\right)\,, (150)

where the function V(E)V(E) is defined as follows:

V(E):=±π𝚌2(𝙱𝙲𝙰𝙳)𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(𝚔2)V(E):=\pm\frac{\pi\sqrt{{\mathtt{c}}_{*}}}{2({\mathtt{B}}_{*}{\mathtt{C}}_{*}-{\mathtt{A}}_{*}{\mathtt{D}}_{*}){\tt EllipticK}({\mathtt{k}}_{*}^{2})} (151)

with the ++ sign in the zones 𝒫01,{\mathcal{P}}_{01}, 𝒫21+,II{\mathcal{P}}_{21}^{+,{\rm I\!I}}, 𝒫21+,III{\mathcal{P}}_{21}^{+,{\rm I\!I\!I}}, 𝒫21,III{\mathcal{P}}_{21}^{-,{\rm I\!I\!I}}, 𝒫12+,III{\mathcal{P}}_{12}^{+,{\rm I\!I\!I}}, and with - sign in the zones 𝒫10,{\mathcal{P}}_{10}, 𝒫21,I{\mathcal{P}}_{21}^{-,{\rm I}}, 𝒫12+,I{\mathcal{P}}_{12}^{+,{\rm I}}, 𝒫12,I{\mathcal{P}}_{12}^{-,{\rm I}}, 𝒫12,II{\mathcal{P}}_{12}^{-,{\rm I\!I}}, moreover

V(E):=±π𝚌(𝙱𝙲𝙰𝙳)𝙴𝚕𝚕𝚒𝚙𝚝𝚒𝚌𝙺(1𝚔2)V(E):=\pm\frac{\pi\sqrt{\mathtt{c}}}{({\mathtt{B}}{\mathtt{C}}-{\mathtt{A}}{\mathtt{D}}){\tt EllipticK}(1-{\mathtt{k}}^{2})} (152)

with the ++ sign in the zones 𝒫21+,IV{\mathcal{P}}_{21}^{+,{\rm I\!V}}, 𝒫21,IV{\mathcal{P}}_{21}^{-,{\rm I\!V}}, 𝒫12+,II{\mathcal{P}}_{12}^{+,{\rm I\!I}}, 𝒫12,III{\mathcal{P}}_{12}^{-,{\rm I\!I\!I}}, and with - sign in the zones 𝒫21+,I{\mathcal{P}}_{21}^{+,{\rm I}}, 𝒫21,II{\mathcal{P}}_{21}^{-,{\rm I\!I}}, 𝒫12+,IV{\mathcal{P}}_{12}^{+,{\rm I\!V}}, 𝒫12,IV{\mathcal{P}}_{12}^{-,{\rm I\!V}}.
Note that, recalling (1), in (150) we have that

χa0=𝙶(2,0),(2,0).\chi a_{0}={\mathtt{G}}_{(2,0),(2,0)}\,.

Then we can rewrite (150) as

ωnlr(E,I2)\displaystyle\omega_{-}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω+2I2(χE+𝙶(2,0),(2,0)),\displaystyle\omega_{-}+2{\rm I}_{2}(\chi E+{\mathtt{G}}_{(2,0),(2,0)})\,,
ω+nlr(E,I2)\displaystyle\omega_{+}^{\rm nlr}(E,{\rm I}_{2}) :=\displaystyle:= ω++6I2(χE+𝙶(2,0),(2,0)+χV(E)).\displaystyle\omega_{+}+6{\rm I}_{2}\left(\chi E+{\mathtt{G}}_{(2,0),(2,0)}+\chi V(E)\right)\,. (153)

Finally we can see the nonlinear resonant frequencies as functions of the initial amplitudes aa_{-} and a+a_{+}. By (35) and (36) we get

J1(0)=12ω+a+2,J2(0)=12(ωa2+3ω+a+2),ψ1(0)=ψ2(0)=0.J_{1}(0)=\frac{1}{2}\omega_{+}a_{+}^{2}\,,\qquad J_{2}(0)=\frac{1}{2}(\omega_{-}a_{-}^{2}+3\omega_{+}a_{+}^{2})\,,\qquad\psi_{1}(0)=\psi_{2}(0)=0\,. (154)

By (83) we have

I2=I2(0)=12(ωa2+3ω+a+2){\rm I}_{2}={\rm I}_{2}(0)=\frac{1}{2}(\omega_{-}a_{-}^{2}+3\omega_{+}a_{+}^{2}) (155)

and by (43) and (1) we get

E=F(3J1(0)/J2(0),ψ1(0);J2(0))=a(x;I2)+b(x),withx:=3ω+a+2ωa2+3ω+a+2.E=F\Big{(}3J_{1}(0)/{J_{2}}(0),\psi_{1}(0);{J_{2}}(0)\Big{)}=a(x_{\dagger};{\rm I}_{2})+b(x_{\dagger})\,,\quad\mbox{with}\quad x_{\dagger}:=\frac{3\omega_{+}a_{+}^{2}}{\omega_{-}a_{-}^{2}+3\omega_{+}a_{+}^{2}}\,. (156)

5 Nonlinear bandgap for the honeycomb metamaterial

In this section we present some outcomes of our analysis and discuss its application to the honeycomb metamaterial described in the introduction. In particular we investigate the effect of nonlinearity on the bandgap size, highlighting the differences between the resonant and non resonant cases. First, we briefly recall what we proved in [DL].

For a given pair (M~,K~)(\tilde{M},\tilde{K}), the bandgap is defined as the interval between the maximum of the acoustic frequency and the minimum of the optical frequency as the wave numbers run over the Brillouin triangle. In the linear case, since the gradients of ω\omega_{-} and ω+\omega_{+} (with respect to (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2})) never vanish in the interior of \triangle, maxima and minima are attained on the boundary \partial\triangle. In particular, for every pair (M~,K~)(\tilde{M},\tilde{K}), the maximum of the linear acoustic frequency is attained at 𝐗{\bf X}, while the minimum of the linear optical frequency is attained at 𝚪{\bf\Gamma}. We anticipate that, in evaluating the nonlinear bandgap, the point 𝐗{\bf X} plays a crucial role, more important than 𝚪{\bf\Gamma}. Indeed, typically, in the set of parameters we are considering, namely the rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20] in the (M~,K~)(\tilde{M},\tilde{K})-plane, the displacement of the maximum of the acoustic frequency due to the nonlinearity is more relevant than that of the minimum of the optical frequency.

Resonant parameters

As in [DL], within the reference rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20], we identify the curve \mathcal{R} formed by the pairs (M~,K~)(\tilde{M},\tilde{K}) such that the linear acoustic and optical frequencies evaluated at (k~1,k~2)=𝐗(\tilde{k}_{1},\tilde{k}_{2})={\bf X} are in 3:1 resonance, namely satisfy 3ω=ω+3\omega_{-}=\omega_{+}. \mathcal{R} is shown in Figure 26. In [DL], we identify the set of nonresonant pairs (M~,K~)(\tilde{M},\tilde{K}) within the rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20] (represented by the light yellow region in Figure 26 (left)), for which the maximum/minimum of the nonlinear acoustic/optical frequencies on the boundary of the Brillouin triangle are attained at non resonant wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}), i.e. at points where the quantity |3ωω+||3\omega_{-}-\omega_{+}| is not small.

Formula (2.1) is valid in this nonresonant set, allowing us to directly evaluate the bandgap in [DL]. In contrast, in the complementary light purple zone in Figure 26, formula (2.1) is not applicable due to resonances and one has to use (153) as we will show here.

The final result of our analysis is presented in Figure 27, where the maximum percentage increment between the nonlinear and linear bandgap363636Namely 100×(Wnl/W1)100\times(W^{\rm nl}/W-1), where WnlW^{\rm nl} and WW denote the width of the nonlinear and linear bandgap, respectively. is plotted as the pair (M~,K~)(\tilde{M},\tilde{K}) varies over the rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20] in the softening case (N3=104N_{3}=-10^{4}). We emphasize that, while in [DL] we derived Figure 27 using (2.1) only for the pairs (M~,K~)(\tilde{M},\tilde{K}) belonging to the light yellow set in Figure 26, in this section, we show how to derive it in the light purple set by (153).

Refer to caption
Figure 26: (Left) The reference rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20] with the curve \mathcal{R} plotted in red. The light yellow region indicates the non resonant set where the representation formula (2.1) is valid. In contrast, the light purple region shows the set of resonant parameters, where formula (153) is needed. (Right) The reference rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20]. For every pair (M~,K~)(\tilde{M},\tilde{K}) in the blue region, there exist two resonant curves intersecting \triangle (as shown in Figure 28), while for pairs in the green region, there exists only one resonant curve intersecting \triangle. The red curve \mathcal{R} separates the two regions.
Refer to caption
Figure 27: Level curves of the maximum percentage difference between the nonlinear and linear bandgap are shown in the (M~,K~)(\tilde{M},\tilde{K})-plane. The curve \mathcal{R} is plotted in red. Here, N3=104N_{3}=-10^{4} (softening). In this softening case, the majority of parameter pairs above \mathcal{R} result in an increase in the bandgap width, while those below \mathcal{R} either show a decrease or, at most, a very slight increase. The region where the increase is most pronounced closely coincides with the set of nonresonant parameters highlighted in light yellow in Figure 26.

Let us first recall how in [DL] we identified the two regions in Figure 26 (left). Given a pair (M~,K~)(\tilde{M},\tilde{K}), we define a set in the (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2})-plane as resonant if every point in the set satisfies the 3:1 resonance condition 3ω(M~,K~,k~1,k~2)=ω+(M~,K~,k~1,k~2)3\omega_{-}(\tilde{M},\tilde{K},\tilde{k}_{1},\tilde{k}_{2})=\omega_{+}(\tilde{M},\tilde{K},\tilde{k}_{1},\tilde{k}_{2}). For a fixed pair (M~,K~)(\tilde{M},\tilde{K}) within the rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20] (see Figure 26, (right)) there are always one or two resonant curves in the (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2})-plane, that intersect the Brillouin triangle \triangle (see Figure 28). The curve \mathcal{R} divides the rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20] into two regions: the one above and the one below \mathcal{R}, corresponding to the green region and the blue region in Figure 26 (right), respectively. For every fixed pair (M~,K~)(\tilde{M},\tilde{K}) in the green region, there is only one resonant curve in the plane of wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}), that intersects the Brillouin triangle (the green curve in Figure 28). Conversely, for every fixed pair (M~,K~)(\tilde{M},\tilde{K}) in the blue region, there are two resonant curves in the plane of wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}), that intersect the Brillouin triangle (the blue curves in Figure 28). Finally, in the limit case when the pair (M~,K~)(\tilde{M},\tilde{K}) belongs to the curve \mathcal{R}, there are two resonant curves in the (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2})-plane, that intersect the Brillouin triangle, but one intersects \triangle only at 𝐗{\bf X} (see the red curves in Figure 28).

Refer to caption
Figure 28: Resonant curves in the (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2})-plane for different fixed values of the pairs (M~,K~)(\tilde{M},\tilde{K}) and their intersections with the boundary of the Brillouin triangle \triangle. The six blue curves correspond to three different points (M~,K~)(\tilde{M},\tilde{K}) in the blue region of Figure 26 (right). In particular when (M~,K~)=(0.146,2)(\tilde{M},\tilde{K})=(0.146,2) the corresponding two blue curves, the dot-dashed ones, have 4 intersections. If (M~,K~)=(0.146,3.6)(\tilde{M},\tilde{K})=(0.146,3.6) the corresponding two blue curves, the solid ones, have 6 intersections. When (M~,K~)=(0.146,5)(\tilde{M},\tilde{K})=(0.146,5) the corresponding two blue curves, the dashed ones, have 4 intersections. The red curves, corresponding to (M~,K~)=(0.146,5.73)(\tilde{M},\tilde{K})=(0.146,5.73)\in\mathcal{R}, have 3 intersections. Finally the green curves, corresponding to (M~,K~)=(0.146,10.79)(\tilde{M},\tilde{K})=(0.146,10.79), have only 2 intersections.

Admissible amplitudes

Both in formula (2.1) and in formula (153), (recall also (155) and (156)), the nonlinear corrections to the frequencies are essentially proportional to the squares of the amplitudes a+a_{+} and aa_{-}. Thus, the larger the amplitudes a±a_{\pm}, the greater the displacement of the nonlinear bandgap relative to the linear one. On the other hand, (2.1) and (153) are perturbative in nature, as they are derived from the non resonant and resonant BNF, respectively. Therefore, a±a_{\pm} must be sufficiently small for the formulae to remain valid. As shown in [DL], where they are analytically evaluated, the “admissible” amplitudes are smaller in the nonresonant case than in the resonant one. Indeed, since the nonresonant BNF cancels more terms, it is “stronger” than the resonant one. In particular the admissible amplitudes in the nonresonant case approach zero as the quantity |3ωω+||3\omega_{-}-\omega_{+}| vanishes. For example, when taking (k~1,k~2)=𝐗(\tilde{k}_{1},\tilde{k}_{2})=\mathbf{X}, the admissible amplitudes vanish for parameters values (M~,K~)(\tilde{M},\tilde{K}) on the curve \mathcal{R}. This is not the case of the admissible amplitudes in the resonant case, namely the ones appearing in formulae (153), (155) and (156)). Indeed they are bounded away from zero on the resonances.

Shifting perspective, we can fix (M~,K~)(\tilde{M},\tilde{K}) and observe at the variation of a±a_{\pm} in the nonresonant case, as the wave numbers (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}) vary along the boundary \partial\triangle of the Brillouin triangle \triangle. Notably, a±a_{\pm} decreases to zero at certain resonant points, denoted 𝐑i{\bf R}_{i}. These points correspond to the intersections of the boundary of the Brillouin triangle with the resonant curves plotted in Figure 28. Formula (2.1) loses validity in the vicinity of any point 𝐑i{\bf R}_{i}. The values of the admissible initial amplitude a+a_{+} (in the nonresonant case) as (k~1,k~2)(\tilde{k}_{1},\tilde{k}_{2}) traverses \partial\triangle are shown in Figure 29 for three different pairs of (M~,K~)(\tilde{M},\tilde{K}).

Refer to caption
Figure 29: Admissible initial amplitude (nonresonant case), a+a_{+}, on the optical mode as a function of the wave numbers on \partial\triangle for (M~,K~)=(0.146,2)(\tilde{M},\tilde{K})=(0.146,2) (on the left), (M~,K~)=(0.146,5.73)(\tilde{M},\tilde{K})=(0.146,5.73) (in the middle) (M~,K~)=(0.09,8)(\tilde{M},\tilde{K})=(0.09,8) (on the right). Here, N3=104N_{3}=-10^{4}. Note that the value decreases to zero at the four (on the left), three (in the middle), and two (on the right), resonant points (denoted by 𝐑i{\bf R}_{i}), respectively. Referring to Figure 28, these four, three and two points correspond to the intersections of the dot-dashed blue, red, and green curves with \partial\triangle, respectively. The two points in the image on the right are not shown in Figure 28 but they correspond to the same type of intersection that the green curves have with \partial\triangle.
Refer to caption
Figure 30: Linear, ω±\omega_{\pm}, and resonant, ω±nlr\omega_{\pm}^{\rm nlr}, as well as nonresonant, ω±nl\omega_{\pm}^{\rm nl}, nonlinear dispersion curves versus wave numbers on \partial\triangle for three different pairs of parameters (M~,K~)(\tilde{M},\tilde{K}) in the softening case, N3=104N_{3}=-10^{4}. The points 𝐑i{\bf R}_{i} are the resonant points, as seen in Figure 29. In small neighborhoods of these points, the expression ω±nl\omega_{\pm}^{\rm nl} is replaced by the resonant representation ω±nlr\omega_{\pm}^{\rm nlr}. Note that in all three cases, the minimum of the nonlinear optical frequency essentially coincides with ω+nl\omega_{+}^{\rm nl} evaluated at 𝚪{\bf\Gamma}.
(On the left) Case (i): (M~,K~)=(0.09,8)(\tilde{M},\tilde{K})=(0.09,8) belonging to the light yellow region in Figure 26; admissible initial amplitudes a=0.0036a_{-}=0.0036, a+=0.0025a_{+}=0.0025. As in the linear case, the maximum of the acoustic frequency is attained at 𝐗{\bf X}, which is nonresonant. The resulting percentage bandgap increment is around 30%30\%.
(In the middle) Case (ii) (M~,K~)=(0.146,5.73)(\tilde{M},\tilde{K})=(0.146,5.73) belonging to the light purple region, more precisely to the red curve, in Figure 26; admissible initial amplitudes a=0.002a_{-}=0.002, a+=0.0012a_{+}=0.0012. As in the linear case the maximum of the acoustic frequency is attained at 𝐗{\bf X}, which, however, is now resonant. Since, at 𝐗{\bf X}, ωnlr\omega_{-}^{\rm nlr} is very close to ω\omega_{-}, the resulting nonlinear bandgap undergoes a slight decrement compared to the linear case.
(On the right) Case (iii) (M~,K~)=(0.2,1.1)(\tilde{M},\tilde{K})=(0.2,1.1) belonging to the light purple region in Figure 26; admissible initial amplitudes are a=0.002a_{-}=0.002, a+=0.001a_{+}=0.001. Since ω\omega_{-} is almost flat around its maximum, the maximum of the nonlinear acoustic frequency is attained far from 𝐗{\bf X}, more precisely near the resonant point 𝐑2{\bf R}_{2}. Since, close to 𝐗{\bf X}, ωnlr\omega_{-}^{\rm nlr} is very close to ω\omega_{-}, the resulting nonlinear bandgap undergoes a decrement compared to the linear case.

In conclusion, due to the presence of the 3:1 resonance, formula (2.1) becomes invalid in the vicinity of the points 𝐑i{\bf R}_{i}, when the parameters are resonant or nearly resonant. Specifically, this occurs when they give rise to an exact, or nearly exact, 3:1 resonance between acoustic and optical frequencies. In this resonant case the correct expression for the nonlinear frequencies is ω±nlr\omega_{\pm}^{\rm nlr}, as given by (153).

Nonlinear bandgap

Let us consider the softening case; the hardening case can be treated analogously, leading to a general decrement of the bandgap. We note that, since we are considering pairs (M~,K~)(\tilde{M},\tilde{K}) belonging to the rectangle [0.05,0.3]×[1,20][0.05,0.3]\times[1,20], the point 𝚪{\bf\Gamma}, where the minimum of the linear acoustic frequency is attained, is always far from being resonant. Therefore, in the following discussion, we will focus on the maximum of acoustic frequency because it undergoes the most significant displacements and may be resonant. It turns out that, for the calculation of the nonlinear bandgap, there are essentially three cases:
i) the maximum of the acoustic frequency and the minimum of the optical frequency are attained away from resonant points,
ii) 𝐗\mathbf{X} is resonant or nearly resonant,
iii) ω\omega_{-} has an almost flat maximum, so that, even if 𝐗{\bf X} is away from resonance, the nonlinear acoustic frequency may attain its maximum at some resonant (or nearly resonant) point away from 𝐗{\bf X}.
Note that case i) corresponds to the light yellow region in Figure 26, while cases ii) and iii) correspond to the light purple one. These three cases are shown in Figure 30.

To summarize, one applies formula (2.1) in case (i), as we did in [DL], and formula (153) in cases (ii) and (iii), as we do here. Using the expressions for the admissible amplitudes evaluated in [DL], we are able to compute the bandgap, thereby obtaining Figure 27 in its entirety.

6 Conclusions

In this study, we investigated a broad range of structural engineering models by analyzing a general system of two coupled harmonic oscillators with cubic nonlinearity. Our examination revealed that, in the absence of damping, the system exhibits Hamiltonian dynamics, with an elliptic equilibrium at the origin characterized by two distinct linear frequencies. In particular, we focused on the resonant or nearly resonant case, specifically when the two frequencies are close to a 3:1 resonance.
Our investigation involved employing Hamiltonian Perturbation Theory to transform the system into (resonant) Birkhoff Normal Form up to order 4. This transformation provided a new set of symplectic action-angle variables, on which the Hamiltonian, up to six-order terms, depends only on the actions and the slow angle. Notably, our analysis highlighted the dependency of the construction on the system’s physical parameters, necessitating a meticulous case analysis of the phase portrait in the 3:1 resonant case. We found that the system can exhibit up to six topologically different behaviors, depending on the values of the physical parameters. In each of these configurations, we described the nonlinear normal modes (elliptic/hyperbolic periodic orbits, invariant tori) and their stable and unstable manifolds of the truncated Hamiltonian (neglecting order six or higher terms). This is a fundamental step for proving the persistence of the majority of these structures for the complete Hamiltonian by KAM Theory.

By using elliptic integrals, we derived explicit analytic formulas for the nonlinear frequencies. While this analytic expression was already known away from resonances, it is, as far as we know, new in this context for the resonant or nearly resonant case.

As an application of our findings, we explored wave propagation in metamaterial honeycombs equipped with periodically distributed nonlinear resonators. Our investigation allowed us to examine the bandgap phenomenon in the presence of resonance. We found that while nonlinear effects far from resonances can significantly alter the bandgap, in the resonant case, the nonlinear frequencies, especially the acoustic one, closely align with the linear ones, resulting in a less pronounced variation in the bandgap.

7 Appendix

7.1 Proof of Proposition 1

We first count the solutions of equation (46), namely the intersections between the line (x):=a2x+a1\ell(x):=a_{2}x+a_{1} and the function b(x)b^{\prime}(x) in (47). We note that, since bb^{\prime} is strictly convex, if (1)>0\ell(1)>0 there is only one intersection. Note that condition (1)=a2+a1>0\ell(1)=a_{2}+a_{1}>0 is equivalent to a2>a1a_{2}>-a_{1}. Since g(a1)>a1g(a_{1})>-a_{1}, condition a2>a1a_{2}>-a_{1} implies that we are in the zones Z01Z_{01} or Z21Z_{21} in which we have, indeed, one intersection that we call x1(π)x_{1}^{(\pi)}.
Moreover, in this case a2>a1a_{2}>-a_{1}, the function xF(π,x)=a(x)b(x)x\to F(\pi,x)=a(x)-b(x) with x(0,1)x\in(0,1) has only one critical point, which is exactly x1(π)x_{1}^{(\pi)}. This critical point is a minimum since limx0+xF(π,x)=limx0+a(x)b(x)=\lim_{x\to 0^{+}}\partial_{x}F(\pi,x)=\lim_{x\to 0^{+}}a^{\prime}(x)-b^{\prime}(x)=-\infty and limx1xF(π,x)=limx1a(x)b(x)=a2+a1>0\lim_{x\to 1^{-}}\partial_{x}F(\pi,x)=\lim_{x\to 1^{-}}a^{\prime}(x)-b^{\prime}(x)=a_{2}+a_{1}>0.
Assume now that a2<a1a_{2}<-a_{1}. Note that for every fixed a1a_{1}\in\mathbb{R} there exists a unique a2=h(a1)a_{2}=h(a_{1}) such that a2x+a1a_{2}x+a_{1} is tangent to b(x)b^{\prime}(x) at some point 0<x0<10<x_{0}<1. In order to evaluate the function h(a1)h(a_{1}) above let us consider the tangent r(x)r(x) in a point x0x_{0} to b(x)b^{\prime}(x); namely:

r(x)=b(x0)+b′′(x0)(xx0).r(x)=b^{\prime}(x_{0})+b^{\prime\prime}(x_{0})(x-x_{0})\,.

Since we want that r(x)=a2x+a1r(x)=a_{2}x+a_{1} we have to impose r(0)=a1r(0)=a_{1} and b′′(x0)=a2b^{\prime\prime}(x_{0})=a_{2}. Since

b′′(x)=8x24x14x3/21x,b^{\prime\prime}(x)=\frac{8x^{2}-4x-1}{4x^{3/2}\sqrt{1-x}}\,, (157)

imposing r(0)=a1r(0)=a_{1} we have

a1=r(0)\displaystyle a_{1}=r(0) =\displaystyle= b(x0)b′′(x0)x0=(14x0)1x02x08x024x014x03/21x0x0\displaystyle b^{\prime}(x_{0})-b^{\prime\prime}(x_{0})x_{0}=\frac{(1-4x_{0})\sqrt{1-x_{0}}}{2\sqrt{x_{0}}}-\frac{8x_{0}^{2}-4x_{0}-1}{4x_{0}^{3/2}\sqrt{1-x_{0}}}x_{0} (158)
=\displaystyle= 2(14x0)(1x0)(8x024x01)41x0x0\displaystyle\frac{2(1-4x_{0})(1-x_{0})-(8x_{0}^{2}-4x_{0}-1)}{4\sqrt{1-x_{0}}\sqrt{x_{0}}}
=\displaystyle= 28x02x0+8x028x02+4x0+141x0x0\displaystyle\frac{2-8x_{0}-2x_{0}+8x_{0}^{2}-8x_{0}^{2}+4x_{0}+1}{4\sqrt{1-x_{0}}\sqrt{x_{0}}}
=\displaystyle= 6x0+341x0x0=3(12x0)41x0x0.\displaystyle\frac{-6x_{0}+3}{4\sqrt{1-x_{0}}\sqrt{x_{0}}}=\frac{3(1-2x_{0})}{4\sqrt{1-x_{0}}\sqrt{x_{0}}}\,.

Note that:

a1>0,<0,=0x0<12,>12,=12.a_{1}>0,<0,=0\qquad\implies\qquad x_{0}<\frac{1}{2},>\frac{1}{2},=\frac{1}{2}\,. (159)

Squaring we get

a12=9(1+4x024x0)16(1x0)x0a_{1}^{2}=\frac{9(1+4x_{0}^{2}-4x_{0})}{16(1-x_{0})x_{0}}

namely

(36+16a12)x02(36+16a12)x0+9=0.(36+16a_{1}^{2})x_{0}^{2}-(36+16a_{1}^{2})x_{0}+9=0\,.

The solutions of the above second order equation are

x0=12±a19+4a12,x_{0}=\frac{1}{2}\pm\frac{a_{1}}{\sqrt{9+4a_{1}^{2}}}\,,

but by (159) we have to choose the minus sign. Since by (158) we have

141x0x0=a13(12x0)\frac{1}{4\sqrt{1-x_{0}}\sqrt{x_{0}}}=\frac{a_{1}}{3(1-2x_{0})}

by (157) and denoting for brevity s:=4a12+9s:=\sqrt{4a_{1}^{2}+9}, we get373737Note that 2x01=2a1/4a12+9=2a1/s2x_{0}-1=-2a_{1}/\sqrt{4a_{1}^{2}+9}=-2a_{1}/s.

a2\displaystyle a_{2} =\displaystyle= b′′(x0)=8x024x01x0a13(12x0)=2(2x01)2+2(2x01)1x0a13(12x0)\displaystyle b^{\prime\prime}(x_{0})=\frac{8x_{0}^{2}-4x_{0}-1}{x_{0}}\frac{a_{1}}{3(1-2x_{0})}=\frac{2(2x_{0}-1)^{2}+2(2x_{0}-1)-1}{x_{0}}\frac{a_{1}}{3(1-2x_{0})} (160)
=\displaystyle= 8a124a1ss23(s2a1)=8a124a1ss227(s+2a1)\displaystyle\frac{8a_{1}^{2}-4a_{1}s-s^{2}}{3(s-2a_{1})}=\frac{8a_{1}^{2}-4a_{1}s-s^{2}}{27}(s+2a_{1})
=\displaystyle= 127(4a124a14a12+99)(2a1+4a12+9)=:h(a1).\displaystyle\frac{1}{27}\textstyle(4a_{1}^{2}-4a_{1}\sqrt{4a_{1}^{2}+9}-9)(2a_{1}+\sqrt{4a_{1}^{2}+9})=:h(a_{1})\,.

Note that h(a1)=g(a1)<a1h(a_{1})=-g(-a_{1})<-a_{1}. Since we are in the case a2<a1a_{2}<-a_{1} and we have proved that the line h(a1)x+a1h(a_{1})x+a_{1} is tangent to b(x)b^{\prime}(x), we have that for a2<h(a1)a_{2}<h(a_{1}) there are not intersections (zone Z10Z_{10}) while for h(a1)<a2<a1h(a_{1})<a_{2}<-a_{1} there are two intersections (zone Z12Z_{12}), that we call 0<x1(π)<x2(π)<10<x_{1}^{(\pi)}<x_{2}^{(\pi)}<1.
In this last case, the function xF(π,x)=a(x)b(x)x\to F(\pi,x)=a(x)-b(x) with x(0,1)x\in(0,1) has two critical points, which are exactly x1(π)x_{1}^{(\pi)} and x2(π)x_{2}^{(\pi)}. Since limx0+xF(π,x)=limx0+a(x)b(x)=\lim_{x\to 0^{+}}\partial_{x}F(\pi,x)=\lim_{x\to 0^{+}}a^{\prime}(x)-b^{\prime}(x)=-\infty and limx1xF(π,x)=limx1a(x)b(x)=a2+a1<0\lim_{x\to 1^{-}}\partial_{x}F(\pi,x)=\lim_{x\to 1^{-}}a^{\prime}(x)-b^{\prime}(x)=a_{2}+a_{1}<0, x1(π)x_{1}^{(\pi)} must be a minimum and x2(π)x_{2}^{(\pi)} a maximum.
Finally the case of equation (45) and the critical points of the function F(0,x)F(0,x) can be studied in the same way sending a2a2a_{2}\to-a_{2} and a1a1a_{1}\to-a_{1}. ∎

References

  • [B20] Bukhari M., Barry O. Spectro-spatial analyses of a nonlinear metamaterial with multiple nonlinear local resonators, Nonlinear Dynamics, 99, pp. 1539–1560, 2020.
  • [F22] Fortunati A., Bacigalupo A., Lepidi M., Arena A., Lacarbonara W. Nonlinear wave propagation in locally dissipative metamaterials via Hamiltonian perturbation approach, Nonlinear Dynamics 108, n.2, pp.765–787, 2022.
  • [M23] Murer M., Guruva S. K., Formica G., Lacarbonara W. A multi-bandgap metamaterial with multi-frequency resonators, Journal of Composite Materials 57(4), 783-804 (2023).
  • [SW23mssp] Shen Y., Lacarbonara Y. Nonlinear dispersion properties of metamaterial beams hosting nonlinear resonators and stop band optimization, Mechanical Systems and Signal Processing 187, 2023.
  • [SW23jsv] Shen Y., Lacarbonara W. Nonlinearity-enhanced wave stop bands in honeycombs embedding spider web-like resonators, Journal of Sound and Vibration 562, 2023.
  • [Guo22] Wenjie G., Zhou Yang, Qingsong Feng, Chengxin Dai, Jian Yang, Xiaoyan Lei A new method for band gap analysis of periodic structures using virtual spring model and energy functional variational principle, Mechanical Systems and Signal Processing 168, 2022.
  • [Liu21] Liu Lei, Sridhar A., Geers M.G.D., Kouznetsova V.G. Computational homogenization of locally resonant acoustic metamaterial panels towards enriched continuum beam/shell structures, Computer Methods in Applied Mechanics and Engineering 387, 2021.
  • [Cai22] Cai Changqi, Zhou Jiaxi, Wang Kai, Pan Hongbin, Tan Dongguo, Xu Daolin, Wen Guilin Flexural wave attenuation by metamaterial beam with compliant quasi-zero-stiffness resonators, Mechanical Systems and Signal Processing 174, 2022.
  • [B16] Bacigalupo A., Gambarotta L. Simplified modelling of chiral lattice materials with local resonators, International Journal of Solids and Structures 83, 126–141, 2016.
  • [Comi18] Comi C., Driemeier L. Wave propagation in cellular locally resonant metamaterials, Latin American Journal of Solids and Structures 15, 2018.
  • [M22] Miranda Jr. E.J.P., Rodrigues S.F., Aranas Jr. C., Dos Santos, J.M.C. Plane wave expansion and extended plane wave expansion formulations for Mindlin-Reissner elastic metamaterial thick plates, Journal of Mathematical Analysis and Applications 2, 505, 2022.
  • [Fan21] Fan Lei, He Ye, Chen Xiao-an, Zhao Xue A frequency response function-based optimization for metamaterial beams considering both location and mass distributions of local resonators, Journal of Applied Physics 11, 130, 2021.
  • [Wang21] Wang Qiang, Li Jinqiang, Zhang Yao, Xue Yu, Li Fengming A frequency response function-based optimization for metamaterial beams considering both location and mass distributions of local resonators, Mechanical Systems and Signal Processing 151, 2021.
  • [CP23] Chàvez-Pichardo M., Martìnez-Cruz M.A., Trejo-Martìnez A., Vega-Cruz A.B., Arenas-Resendiz T. On the Practicality of the Analytical Solutions for all Third- and Fourth-Degree Algebraic Equations with Real Coefficients Mathematics 11, 1147, 2023.
  • [W] Lacarbonara W. Nonlinear Structural Mechanics: Theory, Dynamical Phenomena and Modeling, Springer, New-York, 2013.
  • [Elliptic] Takebe T. Elliptic Integrals and Elliptic Functions, Moscow Lectures, Springer, 2022.
  • [V07] Sanders J.A., Verhulst F., Murdock J. Averaging Methods in Nonlinear Dynamical Systems, Revised 2nd Edition, Springer, New York, 2007.
  • [L19jsv] Fronk M. D., Leamy M. J. Direction-dependent invariant waveforms and stability in two-dimensional, weakly nonlinear lattices, Journal of Sound and Vibration 447, pp. 137–154, 2019.
  • [M15] Malek S., Gibson L. Effective elastic properties of periodic hexagonal honeycombs, Mechanics of Materials 91, pp. 226–240, 2015.
  • [S18] Sorohan S., Constantinescu D.M., Sandu M., Sandu A.G. On the homogenization of hexagonal honeycombs under axial and shear loading. Part I: Analytical formulation for free skin effect, Mechanics of Materials 119, pp. 74–91, 2018.
  • [G97] Gibson L.J., Ashby M.F. Cellular solids: structure and properties, Cambridge Solid State Science Series, Cambridge University Press, 1997.
  • [Graff] Graff S.M. On the conservation of hyperbolic invariant tori for Hamiltonian systems, J. Differential Equations 15, 1-69, 1974.
  • [Val] Valdinoci, E. Families of whiskered tori for a-priori stable/unstable Hamiltonian systems and construction of unstable orbits, Math. Phys. Electron. J. 6, Paper 2, 31 pp., 2000.
  • [MNT] Medvedev A.G., Neishtadt A.I., Treschev D.V. Lagrangian tori near resonances of near–integrable Hamiltonian systems, Nonlinearity 28 (7), pp. 2105–2130, 2015.
  • [DL] Di Gregorio L., Lacarbonara W. On bandgaps sensitivity to 3:1 interactions between acoustic and optical waves, Preprint 2024.
  • [H16] Haller G., Ponsioen S. Nonlinear normal modes and spectral submanifolds: existence, uniqueness and use in model reduction, Nonlinear Dynamics 86, pp. 1493–1534, 2016.
  • [Cabre05] Cabre X., Fontich E., de la Llave R. The parametrization method for invariant manifolds III: overview and applications, J. Differential Equations 218, pp. 444–515, 2005.
  • [Celletti13] Calleja R.C., Celletti A., de la Llave R. A KAM theory for conformally symplectic systems: Efficient algorithms and their validation, J. Differential Equations 255, pp. 978–1049, 2013.
  • [Llave05] Haro A., de la Llave R. A parameterization method for the computation of invariant tori and their whiskers in quasi-periodic maps: Rigorous results, J. Differential Equations 228, pp. 230–279, 2005.
  • [Fontich23] Fontich E., Vierio A. Dynamics near the invariant manifolds after a Hamiltonian-Hopf bifurcation, Communications in Nonlinear Science and Numerical Simulation 117, 2023, 106971.
  • [Llave06] Haro A., de la Llave R. A parameterization method for the computation of invariant tori and their whiskers in quasi-periodic maps: numerical algorithms, Discrete and continuous dynamical systems Series B, 6 (6), pp. 1261–1300, 2006.
  • [HW96] Haller G., Wiggins S. Geometry and chaos near resonant equilibria of 3-DOF Hamiltonian systems, Physica D 90, pp. 319–365, 1996.
  • [HW95] Haller G., Wiggins S. N-pulse homoclinic orbits in perturbations of resonant Hamiltonian systems, Arch. Rat. Mech. Anal. 130, pp. 25–101, 1995.
  • [HW93] Haller G., Wiggins S. Orbits homoclinic to resonances: the Hamiltonian case, Physica D 66, pp. 298–346, 1993.