This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A uni-directional optical pulse propagation equation for materials with both electric and magnetic responses

Paul Kinsler Dr.Paul.Kinsler@physics.org Blackett Laboratory, Imperial College London, Prince Consort Road, London SW7 2AZ, United Kingdom.
(July 30, 2025)
Abstract

I derive uni-directional wave equations for fields propagating in materials with both electric and magnetic dispersion and nonlinearity. The derivation imposes no conditions on the pulse profile except that the material modulates the propagation only slowly: i.e. that loss, dispersion, and nonlinearity have only a small effect over the scale of a wavelength. It also allows a direct term-to-term comparison of the exact bi-directional theory with its approximate uni-directional counterpart.

I Introduction

In the past few years, composite materials (“metamaterials”) that demonstrate both an electric and magnetic response have been the subject of both experimental and theoretical investigation. Often the motivation for this research is the potential for exotic applications: for example, superresolution Pendry (2000), or the possibility of “trapped rainbow” light storage Tsakmakidis et al. (2007). Despite these interesting possibilities, there is also the more basic need for efficient methods for propagating optical pulses in such metamaterials, in particular in one-dimensional (1D) waveguide geometries. Indeed, methods for doing so have already led to interesting predictions (see, e.g., Scalora et al. (2005); Wen et al. (2007); D’Aguanno et al. (2008)). However, these methods, and earlier ones, (e.g. Agrawal (2007); Brabec and Krausz (1997); Geissler et al. (1999); Kinsler and New (2003)) tend to rely on mechanisms such as the introduction of a co-moving frame, and assumptions that the pulse profile has negligible second-order temporal or spatial derivatives. Assuming second-order derivatives are small may well be reasonable, but it means that the pulse profile must remain well behaved. This approximation therefore might well become poorly controlled Kinsler and New (2003); Kinsler (2002), particularly for ultrashort or otherwise ultrawideband pulses, or exotic or extreme material parameters. Ideally we would prefer to make approximations based solely on the material parameters of our device, so as to avoid making assumptions about the state of an ever-changing propagating pulse.

Here I derive 1D wave equations for a waveguide with both electric and magnetic dispersion, and electric and magnetic nonlinearity. I use the directional fields approach Kinsler et al. (2005); Kinsler (2007a), which allows us to directly write down a first-order wave equation for pulse propagation without complicated derivation or approximation. We simply look at the coupled forward and backward wave equations that are a direct re-expression of Maxwell’s curl equations, and substitute in the appropriate dispersion and nonlinearity. I also show separate examples for second- and third-order nonlinearities in both electric and magnetic responses, although the effects can be combined if desired. Note that these directional fields are applicable to more than just pulse propagation, as they have been used to simplify Poynting-vector-based approaches to electromagnetic continuity equations Kinsler et al. (2009).

The derivation makes only a single, well-defined approximation to reduce the bi-directional forward-backward coupled model down to a single first-order wave equation – that of assuming small changes over the scale of a wavelength. This approximation is remarkably robust for all physically realistic parameter values – see Kinsler (2007b) for an analysis focused on nonlinear effects; more general considerations have been dealt with in terms of factorized wave equations Kinsler (2008). The resulting wave equation retains all the usual intuitive and analytical simplicity of ordinary wave propagation equations, unlike the computationally intensive approach of a direct numerical solution of Maxwell’s equations (see, e.g., Yee-1966tap; Flesch-PM-1996prl; Gilles-MV-1999pre; Brabec and Krausz (1997); Tarasishin-MZ-2001oc; Tyrrell-KN-2005jmo).

This paper is structured as follows: Directional fields and their re-expression of the Maxwell curl equations is outlined in Section II, followed by the reduction of the bi-directional wave equation into a uni-directional form in Section III. Section IV shows wave equations for a doubly-nonlinear third-order nonlinearity material, and Section V does the same for a second-order case. In Section VI propagation under the influence of typical metamaterial responses is discussed, and I conclude in Section VII.

II Directional fields

The directional fields approach Kinsler et al. (2005) allows us to write down wave equations for hybrid electromagnetic fields G±\textbf{G}^{\pm}. Note that here I define G±\textbf{G}^{\pm} with an alternate (and more sensible) sign convention than previously. Further, I also allow for more general types of polarization and magnetization in such a way as to provide a simpler presentation. For propagation along the unit vector u, the propagation (curl) equation for G±\textbf{G}^{\pm} is written in the frequency domain as111See derivation in Appendix A.

×G±\displaystyle\nabla\times\textbf{G}^{\pm} =±ıωαrβru×G±±ıωβru×Pc+ıωαrμ0Mc\displaystyle=\pm\imath\omega\alpha_{r}\beta_{r}\textbf{u}\times\textbf{G}^{\pm}~\pm\imath\omega\beta_{r}\textbf{u}\times\textbf{P}_{c}~+\imath\omega\alpha_{r}\mu_{0}\textbf{M}_{c} (1)

with

G±(ω)\displaystyle\textbf{G}^{\pm}(\omega) =αr(ω)E(ω)±βr(ω)H(ω)×u.\displaystyle=\alpha_{r}(\omega)\textbf{E}(\omega)\pm\beta_{r}(\omega)\textbf{H}(\omega)\times\textbf{u}. (2)

Here the electric response of the material is encoded in two parts: a spatially invariant linear response component αr\alpha_{r}, and the remaining contributions (of any type) in Pc\textbf{P}_{c}. Similarly, the magnetic response is divided up in the same way between βr\beta_{r} and μ0Mc\mu_{0}\textbf{M}_{c}. Generally we will put the entire non-lossy linear response of the medium (i.e. the dispersion) into the reference parameters αr\alpha_{r} and βr\beta_{r}, although it may also be convenient to specify only that the product αrβr\alpha_{r}\beta_{r} is real (cf. Kinsler-2009pra). All the nonlinear responses and other complications (“corrections”), such as spatial variations in the material parameters, remain in Pc\textbf{P}_{c} and Mc\textbf{M}_{c}. As an example, in Kinsler et al. (2005) this approach was applied to second-harmonic generation in a periodically poled dielectric crystal. The time derivatives of these corrections Pc\textbf{P}_{c} and Mc\textbf{M}_{c} correspond to bound electric and magnetic currents respectively Kinsler et al. (2009). These Pc\textbf{P}_{c} and Mc\textbf{M}_{c} are functions of both fields E and H, i.e. PcPc(E,H)\textbf{P}_{c}\equiv\textbf{P}_{c}(\textbf{E},\textbf{H}) and McMc(E,H)\textbf{M}_{c}\equiv\textbf{M}_{c}(\textbf{E},\textbf{H}). If we choose instead to have frequency-independent αr\alpha_{r} and βr\beta_{r}, then the remaining linear response can simply be included in Pc\textbf{P}_{c} and Mc\textbf{M}_{c}; in this case the “weak loss and nonlinearity” condition I use later to decouple forward and backward fields would then need to be broadened to include weak dispersion as well (also see Kinsler et al. (2005) for more discussion). However, neither version imposes any requirements on the pulse profile.

It is useful to give a simple example of the directional fields to provide some insight into their nature. In the pure transverse plane-polarized case, with fields propagating along the zz direction, and frequency-independent (material parameters) permitivitty ϵr\epsilon_{r} and permeability μr\mu_{r}, we can write

Gx±\displaystyle G_{x}^{\pm} =ϵrEx±μrHy,\displaystyle=\sqrt{\epsilon_{r}}E_{x}\pm\sqrt{\mu_{r}}H_{y}, (3)
Gy±\displaystyle G_{y}^{\pm} =ϵrEyμrHx,\displaystyle=\sqrt{\epsilon_{r}}E_{y}\mp\sqrt{\mu_{r}}H_{x}, (4)

where this simple Gx±G_{x}^{\pm} definition matches the original proposal of Fleck Fleck (1970).

It is worth considering how reflections arise in this picture based on spatially invariant reference parameters augmented by corrections terms. Leaving aside for now the distinctions between spatially propagated fields and temporally propagated ones (see the discussion in Kinsler (2008)), transition to a new media can be handled in two ways. First, we could map the existing fields G±\textbf{G}^{\pm} onto new ones Gn±\textbf{G}_{n}^{\pm} based on new reference parameters αrn\alpha_{rn} and βrn\beta_{rn}. Here a pure G+\textbf{G}^{+} field would separate into two pieces, one a forward propagating Gn+\textbf{G}_{n}^{+}, and the other a “reflected” backward propagating Gn\textbf{G}_{n}^{-}. Second, we might retain the existing reference parameters, and have modified corrections Pc\textbf{P}_{c}^{\prime} and Mc\textbf{M}_{c}^{\prime}. These altered correction terms then couple the forward and backward directed fields, inducing the necessary reflection in G\textbf{G}^{-}; although as a side-effect of our now no longer optimal αr\alpha_{r} and βr\beta_{r}, the forward evolving field is made up of coupled G+\textbf{G}^{+} and G\textbf{G}^{-} components Kinsler et al. (2005).

II.1 Material response

We define the electric and magnetic material response in the frequency domain, as it greatly simplifies the description of the linear components. Let us chose a reference behaviour given by ϵr(ω),μr(ω)\epsilon_{r}(\omega),\mu_{r}(\omega), and use them to define reference parameters αr(ω)=ϵr(ω)\alpha_{r}(\omega)=\sqrt{\epsilon_{r}(\omega)} and βr(ω)=μr(ω)\beta_{r}(\omega)=\sqrt{\mu_{r}(\omega)}. Note that these are allowed to have a frequency dependence Kinsler et al. (2005), and that αrβr\alpha_{r}\beta_{r} is just the reciprocal of the (reference) speed of light in the medium (i.e. nr/cn_{r}/c). We therefore have that the electric displacement and magnetic fields are

D(ω)\displaystyle\textbf{D}(\omega) =ϵ0E(ω)+P(ω)=\displaystyle=\epsilon_{0}\textbf{E}(\omega)+\textbf{P}(\omega)~= ϵr(ω)E(ω)+Pc(ω),\displaystyle\epsilon_{r}(\omega)\textbf{E}(\omega)+\textbf{P}_{c}(\omega), (5)
B(ω)\displaystyle\textbf{B}(\omega) =μ0H(ω)+M(ω)=\displaystyle=\mu_{0}\textbf{H}(\omega)+\textbf{M}(\omega)= μr(ω)H(ω)+μ0Mc(ω).\displaystyle\mu_{r}(\omega)\textbf{H}(\omega)+\mu_{0}\textbf{M}_{c}(\omega). (6)

To give a specific example, we can define frequency-dependent loss and dispersive corrections by κϵ(ω)\kappa_{\epsilon}(\omega) and κμ(ω)\kappa_{\mu}(\omega), along with (e.g.) independent third-order nonlinearities χϵ,χμ\chi_{\epsilon},\chi_{\mu} to both the material responses; although any appropriate expression can be used – even magneto-electric or other types. Thus we can write the frequency domain expressions

Pc(ω)\displaystyle\textbf{P}_{c}(\omega) =αr2κϵE+ϵ0[χϵE2(t)]E\displaystyle=\alpha_{r}^{2}\kappa_{\epsilon}\textbf{E}+\epsilon_{0}\mathscr{F}\left[\chi_{\epsilon}E^{2}(t)\right]\star\textbf{E} (7)
μ0Mc(ω)\displaystyle\mu_{0}\textbf{M}_{c}(\omega) =βr2κμH+μ0[χμH2(t)]H,\displaystyle=\beta_{r}^{2}\kappa_{\mu}\textbf{H}+\mu_{0}\mathscr{F}\left[\chi_{\mu}H^{2}(t)\right]\star\textbf{H}, (8)

where []\mathscr{F}[...] takes the Fourier transform (which is necessary because nonlinear effects are defined in the time domain as powers of the field) and \star denotes a convolution [i.e., ab=a(ω)b(ωω)𝑑ωa\star b=\int a(\omega)b(\omega-\omega^{\prime})d\omega^{\prime}]. If the nonlinearity is time dependent, then the simple χϵE2\chi_{\epsilon}E^{2} type terms can be replaced with the appropriate convolution. In general, it is best to pick αr,βr\alpha_{r},\beta_{r} subject to the condition that the sizes of Pc\textbf{P}_{c} and Mc\textbf{M}_{c} are minimised.

In a double-negative material (with both ϵ,μ<0\epsilon,\mu<0) we would get imaginary αr,βr\alpha_{r},\beta_{r}, changing the complex phase of G±(ω)\textbf{G}^{\pm}(\omega) away from that given by the original E and H. Since this is in the frequency domain, it converts into a phase shift in the time domain, so although imaginary-valued αr,βr\alpha_{r},\beta_{r} might seem inconvenient, it does not give unphysical results.

III Wave equations

Starting with the vectorial curl equation (1), I first take the 1D, but bi-directional, limit, and describe the approximation necessary to produce a simpler uni-directional form. After this, I discuss how the common transformations used in optical wave equations can be applied in this context. All equations and field quantities are in the frequency (ω\omega) domain, unless explicitly noted otherwise.

III.1 Bi-directional wave equations

Here we set u along the zz axis without loss of generality, and consider just an xx polarized wave (i.e., consisting of Ex,HyE_{x},H_{y}). This means we use the yy component of eqn. (1) with z=d/dz\partial_{z}=d/dz, so that the wave equations for the full spectrum fields Gx±(ω){G}_{x}^{\pm}(\omega), coupled by corrections PxPc,x(ω)P_{x}\equiv{P}_{c,x}(\omega) and MyMc,y(ω)M_{y}\equiv{M}_{c,y}(\omega) are

zGx±\displaystyle\partial_{z}{G}_{x}^{\pm} =±ıωαrβrGx±±ıωβrPx+ıωαrMy.\displaystyle=\pm\imath\omega\alpha_{r}\beta_{r}{G}_{x}^{\pm}\quad\pm\imath\omega\beta_{r}{P}_{x}\quad+\imath\omega\alpha_{r}{M}_{y}. (9)

Following the detailed discussion in Kinsler (2008)222Section III, we say that this wave equation propagates (“steps”) the fields forward along the zz direction using oppositely directed fields Gx+{G}_{x}^{+} and Gx{G}_{x}^{-}. These fields can be written as functions of either time or frequency, and pulses they describe therefore evolve (travel) forward or backward in time.

Consider the example case with parameters κϵ(ω)\kappa_{\epsilon}(\omega) and κμ(ω)\kappa_{\mu}(\omega), and χϵ,χμ\chi_{\epsilon},\chi_{\mu} in eqns. (7) and (8). Defining kr(ω)=ωαr(ω)βr(ω)k_{r}(\omega)=\omega\alpha_{r}(\omega)\beta_{r}(\omega), we get

zGx±\displaystyle\partial_{z}{G}_{x}^{\pm} =±ıkrGx±±ıkrκϵ2(Gx++Gx)+ıkrκμ2(Gx+Gx)\displaystyle=\pm\imath k_{r}{G}_{x}^{\pm}~~\pm\frac{\imath k_{r}\kappa_{\epsilon}}{2}\left({G}_{x}^{+}+{G}_{x}^{-}\right)+\frac{\imath k_{r}\kappa_{\mu}}{2}\left({G}_{x}^{+}-{G}_{x}^{-}\right)
±ıkr2ϵ0ϵr[χϵE2](Gx++Gx)\displaystyle\qquad\pm\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\mathscr{F}\left[\chi_{\epsilon}E^{2}\right]\star\left({G}_{x}^{+}+{G}_{x}^{-}\right)
+ıkr2μ0μr[χμH2](Gx+Gx).\displaystyle\qquad\quad+\frac{\imath k_{r}}{2}\frac{\mu_{0}}{\mu_{r}}\mathscr{F}\left[\chi_{\mu}H^{2}\right]\star\left({G}_{x}^{+}-{G}_{x}^{-}\right). (10)

Note that even for a frequency-independent choice of the reference parameters αr\alpha_{r} and βr\beta_{r}, the reference wave vector krk_{r} retains a (linear) frequency dependence. Also, the dispersion and/or loss parameters κϵ,κμ\kappa_{\epsilon},\kappa_{\mu} are directly related to ϵ\epsilon and μ\mu respectively, and not to a refractive index nn or wavevector kk. This is why there is a factor of 1/21/2 associated with their appearance in eqn. (10) and subsequently.

If written in the time domain, these wave equations are seen to propagate the full temporal history of a field forward in space. There, the reference propagation given by ±ıkrGx±\pm\imath k_{r}G_{x}^{\pm} becomes a convolution if krk_{r} retains a nontrivial frequency dependence. However, if we expand kr(ω)k_{r}(\omega) around a central frequency ω1\omega_{1} in powers of ωω1\omega-\omega_{1}, we can instead convert it (in the time domain) into a Taylor series in time derivatives, which is a popular alternative to the frequency domain form used here. However, if implementing a split-step Fourier method of solving these wave equations, dispersion is applied in the frequency domain, so that in general such an expansion is an unnecessary complication.

III.2 Uni-directional approximation

Now we apply the approximation: that the effect of any correction terms is small over propagation distances of one wavelength – or, if you prefer, over time intervals of one optical period. This translates into a weak loss and nonlinearity assumption; and if the correction terms Pc\textbf{P}_{c} and Mc\textbf{M}_{c} include dispersion, a weak dispersion assumption is also made. These are rarely very stringent approximations. If |Pc|<<|D||\textbf{P}_{c}|<<|\textbf{D}| and |μ0Mc|<<|B||\mu_{0}\textbf{M}_{c}|<<|\textbf{B}|, then a forward G+\textbf{G}^{+} has minimal co-propagating G\textbf{G}^{-} Kinsler et al. (2005). Further, the forward field has a wave vector krk_{r} evolving as exp(+ıkrz)\exp(+\imath k_{r}z), but any generated backward component will evolve as exp(ıkrz)\exp(-\imath k_{r}z). This gives a very rapid relative oscillation exp(2ıkrz)\exp(-2\imath k_{r}z), which will quickly average to zero. Nevertheless, although achievable optical nonlinearity coefficients fall well within this approximation, care may need to be taken with the dispersion, particularly if near a band edge or in the vicinity of a narrow resonance.

A directly comparable approximation is treated exhaustively in Kinsler (2008), where although applied to bi-directional factorizations of the second-order wave equation, the physical considerations are exactly the same: Deviations from the reference behaviour over a propagation distance of one wavelength should be small. Note that the slow evolution approximation applied here is not the same as other “slowly varying” types of approximation [e.g., the slowly varying envelope approximation (SVEA)] – although the physical motivation is similar, the approach used here is far less restrictive.

After we apply this weak correction or “slow evolution” approximation, we set the initial value of Gx0{G}_{x}^{-}\equiv 0, and can be sure that it will stay negligible. Thus eqn. (9) for the full spectrum, forward directed field Gx+(ω){G}_{x}^{+}(\omega) can be written as

zGx+\displaystyle\partial_{z}{G}_{x}^{+} +ıωαrβrGx++ıωβrPx+ıωαrμ0My.\displaystyle\simeq+\imath\omega\alpha_{r}\beta_{r}{G}_{x}^{+}~~+\imath\omega\beta_{r}{P}_{x}~~+\imath\omega\alpha_{r}\mu_{0}{M}_{y}. (11)

Alternatively, we can scale the Gx+{G}_{x}^{+} field so that it has the same units and scaling as the electric field, using Fx+(ω)=Gx+(ω)/2αr(ω){F}_{x}^{+}(\omega)={G}_{x}^{+}(\omega)/2\alpha_{r}(\omega). This gives

zFx+\displaystyle\partial_{z}{F}_{x}^{+} +ıωαrβrFx++ıωβr2αrPx++ıω2μ0My+.\displaystyle\simeq+\imath\omega\alpha_{r}\beta_{r}{F}_{x}^{+}~~+\imath\frac{\omega\beta_{r}}{2\alpha_{r}}{P}_{x}^{+}~~+\imath\frac{\omega}{2}\mu_{0}{M}_{y}^{+}. (12)

Note that Ex+=Gx+/2αrFx+E_{x}^{+}={G}_{x}^{+}/2\alpha_{r}\equiv{F}_{x}^{+} and Hy+=Gx+/2βrFx+αr/βrH_{y}^{+}={G}_{x}^{+}/2\beta_{r}\equiv{F}_{x}^{+}\alpha_{r}/\beta_{r}, since Gx=0{G}_{x}^{-}=0. In either version of these uni-directional equations, Px+P(Ex+,Hy+)P_{x}^{+}\equiv P(E_{x}^{+},H_{y}^{+}) and My+M(Hy+,Ex+)M_{y}^{+}\equiv M(H_{y}^{+},E_{x}^{+}) – the uni-directional (residual) polarization Px+{P}_{x}^{+} and (residual) magnetization My+{M}_{y}^{+} should not be written as functions of the total fields Ex{E}_{x} and Hy{H}_{y}.

III.3 Modifications

Either of eqns. (11) or (12) by themselves are sufficient to model the propagation of the electric and magnetic fields. However, there are many traditional simplifications which can be applied, and which in other treatments are even sometimes required in order obtain a simple evolution equation. In particular, the various envelope equations Brabec and Krausz (1997); Kinsler and New (2003); Geissler et al. (1999); Scalora and Crenshaw (1994) all use co-moving and/or envelopes as a preparation for discarding inconvenient derivatives: Here such steps are optional extras.

These are all considered in more detail for a factorised wave equation approach in Kinsler (2008), but here I have adapted them for this context.

  1. 1.

    A co-moving frame can now be added, using t=tz/vft^{\prime}=t-z/v_{f}. This is a simple linear process that causes no extra complications; the leading right-hand side (RHS) ıαrβrω=ıkr\imath\alpha_{r}\beta_{r}\omega=\imath k_{r} term is replaced by ı(αrβrωkf)\imath(\alpha_{r}\beta_{r}\omega\mp k_{f}), for frame speed vf=ω0/kfv_{f}=\omega_{0}/k_{f}. Setting kf=kr(ω0)k_{f}=k_{r}(\omega_{0}) will cancel the phase velocity vpv_{p} of the pulse at ω0\omega_{0}, not the group velocity.

  2. 2.

    The field can be split up into pieces localized at certain frequencies, as done in descriptions of optical parametric amplifiers or Raman combs (as in, e.g., Kinsler and New (2003, 2004, 2005)). The wave equation can then be separated into one equation for each piece, coupled by the appropriate frequency-matched polarization terms (see, e.g., Kinsler et al. (2006)).

  3. 3.

    A carrier-envelope description of the field can easily be implemented with the usual prescription of Gabor (1946); Boyd (2003) Fx+(t)=A(t)exp[ı(ω1tk1z)]+A(t)exp[ı(ω1tk1z)]{F_{x}^{+}}(t)=A(t)\exp[\imath(\omega_{1}t-k_{1}z)]+A^{*}(t)\exp[-\imath(\omega_{1}t-k_{1}z)] defining an envelope A(t)A(t) with respect to carrier frequency ω1\omega_{1} and wave vector k1k_{1}; this also provides a built-in a co-moving frame vf=ω1/k1v_{f}=\omega_{1}/k_{1}. Multiple envelopes centred at different carrier frequencies and wave vectors (ωi\omega_{i}, kik_{i}) can also be used Kinsler et al. (2006); Boyd (2003).

  4. 4.

    Bandwidth restrictions might be added (see below), either to ensure a smooth envelope or to simplify the wave equations; in addition they might be used to separate out or neglect frequency mixing terms or harmonic generation. As it stands, no bandwidth restrictions were applied when deriving eqns. (11) or (12) – there are only the limitations introduced by the dispersion and/or polarization models to consider. Typically we would expand the model parameters to the first few orders about some convenient reference frequency ω0\omega_{0}.

  5. 5.

    Mode averaging is where the transverse extent of a propagating beam is not explicitly modeled, but is subsumed into a description of a transverse mode profile; as such it is typically applied to situations involving optical fibres or other waveguides. Thus we could use mode averaging when calculating the effective dispersion or nonlinear parameters. See, for example, Laegsgaard (2007) for a recent approach, which goes beyond a simple addition of a frequency dependence to the “effective area” of the mode, and generalizes the effective area concept itself.

III.4 Diffraction

One important feature lacking in this approach is the handling of transverse effects such as diffraction, although they can be inserted by hand (at least in the paraxial limit) by adding the term ı(x2+y2)Fx+/2kr\imath(\partial_{x}^{2}+\partial_{y}^{2})F_{x}^{+}/2k_{r} to the RHS Kinsler (2007a). However, no treatment of transverse effects has been achieved in a native directional fields description on the basis of the first-order equations – although transverse terms arise naturally in the second-order equation resulting from taking the curl of eqn. (1). Treating nonlinear diffraction Boardman et al. (2000) suffers the same difficulties, although presumably it might be incorporated in an analogous way as to ordinary diffraction.

IV Third-order nonlinearity

Third-order nonlinearities are common in many materials, for example in the silica used to make optical fibres (see, e.g., Agrawal (2007)). There are many applications of significant scientific interest, for example, white light supercontinnua Alfano and Shapiro (1970a, b); Dudley et al. (2006), optical rogue waves Solli et al. (2007); or filamentation Braun et al. (1995); Berge and Skupin (2009).

Here we study propagation through such a material, with non-reference linear responses κϵ,κμ\kappa_{\epsilon},\kappa_{\mu} (describing e.g. loss and dispersion), and instantaneous magnetic third-order nonlinearity χμ\chi_{\mu} Klein-WFL-2007oe along with the more common electric type χϵ\chi_{\epsilon}. For such a system, and for plane-polarized fields, the propagation equation is

zFx+\displaystyle\partial_{z}{F}_{x}^{+} =+ıkr[1+κϵ2+κμ2]Fx+\displaystyle=+\imath k_{r}\left[1+\frac{\kappa_{\epsilon}}{2}+\frac{\kappa_{\mu}}{2}\right]{F}_{x}^{+}
+ıkr2ϵ0ϵr[χϵ+μ0ϵ0ϵr2μr2χμ][Fx+2(t)]Fx+.\displaystyle\qquad~+\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[\chi_{\epsilon}+\frac{\mu_{0}}{\epsilon_{0}}\frac{\epsilon_{r}^{2}}{\mu_{r}^{2}}\chi_{\mu}\right]\mathscr{F}\left[{F}_{x}^{+2}(t)\right]\star{F}_{x}^{+}. (13)

This is a generalized nonlinear Schrödinger (NLS) equation, and it retains both the full field (i.e. uses no envelope description) and the full nonlinearity (i.e. includes third-harmonic generation). The only assumptions made are that of transverse fields and weak dispersive and nonlinear responses; these latter assumptions allow us to decouple the forward and backward wave equations. This decoupling allows us, without any extra approximation, to reduce our description to one of forward-only pulse propagation. The specific example chosen here is for a cubic nonlinearity, but it is easily generalized to the noninstantaneous case or even other scalar nonlinearities.

We can transform eqn. (13) into one closer to the ordinary NLS equation by representing the field in terms of an envelope and carrier

Fx+(t)\displaystyle{F}_{x}^{+}(t) =A(t)exp[ı(ω0tk0z)]+A(t)exp[ı(ω0tk0z)],\displaystyle=A(t)\exp\left[\imath\left(\omega_{0}t-k_{0}z\right)\right]+A^{*}(t)\exp\left[-\imath\left(\omega_{0}t-k_{0}z\right)\right], (14)

where we choose the carrier wave vector to be k0=kr(ω0)=ω0αr(ω0)βr(ω0)k_{0}=k_{r}(\omega_{0})=\omega_{0}\alpha_{r}(\omega_{0})\beta_{r}(\omega_{0}). After separating into a pair of complex-conjugate equations (one for AA and one for AA^{*}), and ignoring the off-resonant third-harmonic generation term, this gives us the expected NLS equation without diffraction. The chosen carrier effectively moves us into a frame that freezes the carrier oscillations, but this phase velocity (vp=ω/kv_{p}=\omega/k) frame differs from one that is co-moving with the pulse envelope (i.e., one moving at the group velocity vg=ω/kv_{g}=\partial\omega/\partial k). After we transform into a frame co-moving with the group velocity at ω0\omega_{0}, where Δg=ω0[vg1(ω0)vp1(ω0)]\Delta_{g}=\omega_{0}[v_{g}^{-1}(\omega_{0})-v_{p}^{-1}(\omega_{0})], the wave equation for A(ω){A}(\omega) is

zA\displaystyle\partial_{z}{A} =+ıK(ω)A+ıkr2ϵ0ϵr[2χ|A(t)|2A(t)],\displaystyle=+\imath K(\omega){A}~~+\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\mathscr{F}\left[2\chi{\left|A(t)\right|^{2}}{A}(t)\right], (15)

where K(ω)=kr[κϵ(ω)+κμ(ω)]/2+ΔgK(\omega)=k_{r}[\kappa_{\epsilon}(\omega)+\kappa_{\mu}(\omega)]/2+\Delta_{g}; and χ=χϵ(μ0ϵr2/ϵ0μr2)χμ\chi=\chi_{\epsilon}-(\mu_{0}\epsilon_{r}^{2}/\epsilon_{0}\mu_{r}^{2})\chi_{\mu}. All that has been assumed to derive this standard envelope NLS equation is uni-directional propagation and negligible third-harmonic generation. The self-steepening term, often seen in (or added to) NLS equations arises from the frequency dependence of krk_{r}. This self-steepening has both electric and magnetic contributions, which can be adjusted independently, as has been pointed out by Wen et al. Wen et al. (2007) for the case of the SVEA limit. In Section VI, I discuss how the importance of each contribution varies with frequency for both a double-plasmon model (as in Wen et al. (2007)), and a wire-array and split-ring model more typical of practical metamaterials.

It is worth comparing this eqn. (15) to D’Aguanno et al.’s D’Aguanno et al. (2008) eqn. (5) [hereafter eqn. (DMB5)]. Although in many respects they appear to be the same, mine is far more general and can be applied (at least in principle) to an arbitrarily wide pulse bandwidth, whereas theirs is subject to the rather restrictive SVEA. For example, my eqn. (15) results from only one “slow evolution” approximation, as opposed to the numerous steps, substitutions, and approximations in Section 2 of D’Aguanno et al. (2008). I also retain the possibility of arbitrary dispersion K(ω)K(\omega), whereas theirs retains only the second-order part (i.e. as t2\propto\partial_{t}^{2}, which in the frequency domain would be ω2\propto\omega^{2}). Indeed, with the dispersion and nonlinear factors in my eqn. (13) combined, that full-spectrum wave propagation equation is scarecly more complicated than eqn. (DMB5). Similar remarks also hold when comparing eqn. (15) to Wen et al.’s Wen et al. (2007): but although Wen et al.’s result is also restricted by the SVEA, it does at least allow for diffraction. Both, however, along with Scalora et al.’s form Scalora et al. (2005), cannot model the full non-envelope field, nor revert to an exact and explicitly bi-directional form, as in my eqn. (9) or (10).

V Second-order nonlinearity

Treating a second-order nonlinearity is more complicated than the third-order case, since it typically couples the two possible polarization states of the field together. Such interactions occur in materials used for optical parametric amplification, and have long been used for a wide variety of applications (see, e.g., Boyd (2003); Various (1993); Danielius et al. (1993)). To model the cross-coupling between the orthogonally polarised fields, it is necessary to solve for both field polarizations; and to allow for the birefringence we need two pairs of (non-reference) linear responses, i.e. κϵx,κϵy\kappa_{\epsilon x},\kappa_{\epsilon y} and κμx,κμy\kappa_{\mu x},\kappa_{\mu y}.

As an example, I choose a magnetic nonlinearity that couples HyH_{y} and HxH_{x} in the same way as the electric nonlinearity couples ExE_{x} and EyE_{y}, although other configurations are possible. This means that the βru×P\beta_{r}\textbf{u}\times\textbf{P} term in eqn. (1), which represents the non-reference part of the electric response, needs to include those for the standard second-order nonlinear terms (here PxExEyP_{x}\propto E_{x}E_{y} and PyEx2P_{y}\propto E_{x}^{2}). Similarly, the αrμ0M\alpha_{r}\mu_{0}\textbf{M} term has ones for the complementary second-order nonlinear magnetic response. Note that second-order nonlinear magnetic effects have been measured in split ring resonators by Klein et al. Klein-EWL-2006s; Klein-WFL-2007oe.

Since it is convenient, I split the vector form of the G±\textbf{G}^{\pm} wave equation up into its transverse xx and yy components. By noting that the definition of Gy±G^{\pm}_{y} means that Hx+=Fy+αr/βrH_{x}^{+}=-F_{y}^{+}\alpha_{r}/\beta_{r}, the 1D wave equations can be written as

zFx+\displaystyle\partial_{z}{F}_{x}^{+} =+ıkr[1+κϵx2+κμy2]Fx+\displaystyle=+\imath k_{r}\left[1+\frac{\kappa_{\epsilon x}}{2}+\frac{\kappa_{\mu y}}{2}\right]{F}_{x}^{+}
+2ıkr2ϵ0ϵr[χϵμ0ϵ0(ϵrμr)32χμ][Fy+(t)Fx+(t)]\displaystyle\qquad+2\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[\chi_{\epsilon}-\frac{\mu_{0}}{\epsilon_{0}}\left(\frac{\epsilon_{r}}{\mu_{r}}\right)^{\frac{3}{2}}\chi_{\mu}\right]\mathscr{F}\left[{F}_{y}^{+}(t){F}_{x}^{+}(t)\right] (16)
zFy+\displaystyle\partial_{z}{F}_{y}^{+} =+ıkr[1+κϵy2+κμx2]Fy+\displaystyle=+\imath k_{r}\left[1+\frac{\kappa_{\epsilon y}}{2}+\frac{\kappa_{\mu x}}{2}\right]{F}_{y}^{+}
+ıkr2ϵ0ϵr[χϵ+μ0ϵ0(ϵrμr)32χμ][Fx+(t)2].\displaystyle\qquad~+\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[\chi_{\epsilon}+\frac{\mu_{0}}{\epsilon_{0}}\left(\frac{\epsilon_{r}}{\mu_{r}}\right)^{\frac{3}{2}}\chi_{\mu}\right]\mathscr{F}\left[{F}_{x}^{+}(t)^{2}\right]. (17)

These wave equations for the field are strikingly similar to the usual SVEA equations used to propagate narrowband pulses; the main differences are the addition of terms for magnetic dispersion (κμx,κμy\kappa_{\mu x},\kappa_{\mu y}) and nonlinearity (χμ\chi_{\mu}), and the lack of a co-moving frame.

We can transform eqns. (16) and (17) into a form close to the usual equations for a parametric amplifier by representing the xx and yy polarized fields in terms of three envelope and carrier pairs:

Fx+(t)\displaystyle{F}_{x}^{+}(t) =A1(t)exp[ı(ω1tk1z)]+A1(t)exp[ı(ω1tk1z)]\displaystyle=A_{1}(t)\exp\left[\imath\left(\omega_{1}t-k_{1}z\right)\right]+A_{1}^{*}(t)\exp\left[-\imath\left(\omega_{1}t-k_{1}z\right)\right]
+A2(t)exp[ı(ω2tk2z)]+A2(t)exp[ı(ω2tk2z)]\displaystyle\quad+A_{2}(t)\exp\left[\imath\left(\omega_{2}t-k_{2}z\right)\right]+A_{2}^{*}(t)\exp\left[-\imath\left(\omega_{2}t-k_{2}z\right)\right] (18)
Fy+(t)\displaystyle{F}^{+}_{y}(t) =A3(t)exp[ı(ω3tk3z)]+A3(t)exp[ı(ω3tk3z)],\displaystyle=A_{3}(t)\exp\left[\imath\left(\omega_{3}t-k_{3}z\right)\right]+A_{3}^{*}(t)\exp\left[-\imath\left(\omega_{3}t-k_{3}z\right)\right], (19)

where ω3=ω1+ω2\omega_{3}=\omega_{1}+\omega_{2}. After separating into pairs of complex-conjugate equations (one each for all AiA_{i} and AiA_{i}^{*}), and ignoring the off-resonant polarization terms, we transform into a frame co-moving with the group velocity, although here we select the group velocity of a preferred frequency component, with Δg=ω(vg1vp1)\Delta_{g}=\omega(v_{g}^{-1}-v_{p}^{-1}). The wave equations for the Ai(ω){A}_{i}(\omega) are then

zA1\displaystyle\partial_{z}{A}_{1} =+ıK1(ω)A1+ık022k1χ[2A3(t)A2(t)]eıΔkz\displaystyle=+\imath K_{1}(\omega){A}_{1}~~+\frac{\imath k_{0}^{2}}{2k_{1}}\chi^{-}\mathscr{F}\left[2{A}_{3}(t){A}_{2}^{*}(t)\right]e^{-\imath\Delta kz} (20)
zA2\displaystyle\partial_{z}{A}_{2} =+ıK2(ω)A2+ık022k2χ[2A3(t)A1(t)]eıΔkz\displaystyle=+\imath K_{2}(\omega){A}_{2}~~+\frac{\imath k_{0}^{2}}{2k_{2}}\chi^{-}\mathscr{F}\left[2{A}_{3}(t){A}_{1}^{*}(t)\right]e^{-\imath\Delta kz} (21)
zA3\displaystyle\partial_{z}{A}_{3} =+ıK3(ω)A3+ık022k3χ+[A1(t)A2(t)]e+ıΔkz.\displaystyle=+\imath K_{3}(\omega){A}_{3}~~+\frac{\imath k_{0}^{2}}{2k_{3}}\chi^{+}\mathscr{F}\left[{A}_{1}(t){A}_{2}(t)\right]e^{+\imath\Delta kz}. (22)

Here K1,2(ω)=k1,2[κϵx(ω)+κμy(ω)]/2+ΔgK_{1,2}(\omega)=k_{1,2}[\kappa_{\epsilon x}(\omega)+\kappa_{\mu y}(\omega)]/2+\Delta_{g} and K3(ω)=k3[κϵy(ω)+κμx(ω)]/2+ΔgK_{3}(\omega)=k_{3}[\kappa_{\epsilon y}(\omega)+\kappa_{\mu x}(\omega)]/2+\Delta_{g}; we choose krk_{r} for each equation differently (i.e., with kr{k1,k2,k3}k_{r}\in\{k_{1},k_{2},k_{3}\}); also the phase mismatch term is Δk=k3k2k1\Delta k=k_{3}-k_{2}-k_{1}. The combined nonlinear coefficient is χ±=χϵ±(μ0/ϵ0)(ϵr/μr)3/2χμ\chi^{\pm}=\chi_{\epsilon}\pm(\mu_{0}/\epsilon_{0})(\epsilon_{r}/\mu_{r})^{3/2}\chi_{\mu}.

VI Discussion

Examining the respective roles of the reference permittivity ϵr\epsilon_{r} and permeability μr\mu_{r} in eqns. (13) and (16), (17), we see that as far as dispersion and other linear effects are concerned, the two components simply add. In contrast, their effect on nonlinear terms is more dramatic: with the ratio Y2=ϵr/μrY^{2}=\epsilon_{r}/\mu_{r} scaling the nonlinear corrections to the magnetization into the electric field units of F±\textbf{F}^{\pm}. This is because Y2Y^{2} determines how much of a given directional field F±\textbf{F}^{\pm} is electric field and how much magnetic field; large values of Y2Y^{2} correspond to cases where the magnetic field is most prominent. Indeed, YY is just the reciprocal of the electromagnetic impedance of our chosen reference medium, and only if YY is real-valued do propagating fields exist, since otherwise the fields become evanescent.

Figures 1 and 2 show how YY varies with frequency for two different metamaterial types, with the dispersions encoded on ϵr\epsilon_{r} and μr\mu_{r} and scaled by ω2\omega^{2} to moderate the low-frequency singularity of the Drude response. The extreme limits of large YY occur when |μr||ϵr||\mu_{r}|\ll|\epsilon_{r}|, that is, usually just at an edge of a non-propagating band, where μr\mu_{r} is about to change sign. In such a region, it would be better to revert to the G±\textbf{G}^{\pm} fields, or to rescale the propagation equations into units of magnetic field (e.g., with some K±=G±/2βr\textbf{K}^{\pm}=\textbf{G}^{\pm}/2\beta_{r}).

In previous work Scalora et al. (2005); Wen et al. (2007); D’Aguanno et al. (2008), a Drude type response for both ϵ\epsilon and μ\mu was assumed, where ϵr,μr1ωϵ,μ2/(ω2ıγϵ,μω)\epsilon_{r},\mu_{r}\propto 1-\omega_{\epsilon,\mu}^{2}/(\omega^{2}-\imath\gamma_{\epsilon,\mu}\omega); and this situation is shown on Fig. 1. However, although the dielectric response in metamaterials (e.g., a wire grid array Pendry-HSY-1996prl; Pendry-HRS-1998jpcm) often has this behaviour, the magnetic response of split ring resonators (SRRs) differs. SRR magnetization is best described by a pseudo-Lorentz model333Also known as the “F-model” Pendry-HRS-1999ieeemtt with μr1+Fμω2/(ω2ωμ2ıγμω)\mu_{r}\propto 1+F_{\mu}\omega^{2}/(\omega^{2}-\omega_{\mu}^{2}-\imath\gamma_{\mu}\omega), although sometimes a true Lorentz response is used instead, μr1+Fμωμ2/(ω2ωμ2ıγμω)\mu_{r}\propto 1+F_{\mu}\omega_{\mu}^{2}/(\omega^{2}-\omega_{\mu}^{2}-\imath\gamma_{\mu}\omega). Note the difference between the numerators in these latter two expressions: a frequency-dependent ω2\omega^{2} versus a constant material parameter ωμ2\omega_{\mu}^{2}. The pseudo-Lorentz model has an incorrect high-frequency behaviour, and so it is incompatible with the Kramers-Kronig relations that enforce causality. However, at low and medium frequency it is a better match to the physical response of SRRs, and so I use it for Figs. 2 and 3.

Refer to caption
Figure 1: Normalised electromagnetic field ratio and ϵ,μ\epsilon,\mu curves for Drude responses in both permittivity ϵr\epsilon_{r} and permeability μr\mu_{r}. The magnetic resonance at ωμ=ωϵ/2\omega_{\mu}=\omega_{\epsilon}/\sqrt{2} is lower than the (dielectric) plasma frequency ωϵ\omega_{\epsilon}, and γϵ=γμ=0.01ωϵ\gamma_{\epsilon}=\gamma_{\mu}=0.01\omega_{\epsilon}. Large |Y||Y| corresponds to mostly magnetic G±\textbf{G}^{\pm} fields, small |Y||Y| corresponds to mostly electric G±\textbf{G}^{\pm}. The frequency ranges of propagating negative phase velocity (NPV) and positive phase velocity (PPV) light are shown.
Refer to caption
Figure 2: Normalised electromagnetic field ratio and ϵ,μ\epsilon,\mu curves for Drude response permittivity ϵr\epsilon_{r} (dot-dashed line), with a pseudo-Lorentz response for the permeability μr\mu_{r} (dashed line). The magnetic resonance at ωμ=ωϵ/2\omega_{\mu}=\omega_{\epsilon}/\sqrt{2} is lower than the (dielectric) plasma frequency ωϵ\omega_{\epsilon}, and γϵ=γμ/5=0.01ωϵ\gamma_{\epsilon}=\gamma_{\mu}/5=0.01\omega_{\epsilon}, Fμ=5F_{\mu}=5. The high-frequency behaviour of the pseudo-Lorentz model “illegally” increases faster than that of the Drude model, whereas a properly causal form should match it. Apart from detail, and the high-frequency behaviour, replacing the pseudo-Lorentz model with the (causal) Lorentz one gives a figure of similar appearance. The frequency ranges of propagating NPV and PPV light are shown.

There are frequency ranges over which the linear material responses vary dramatically, and in particular on Fig. 3 (which uses the same model as Fig. 2) this is evident for both the refractive index nn and group velocity vgv_{g}. If we aim to operate in such regions, this leads to two potential complications. First, if we have chosen reference parameters that do not match the linear material responses exactly, then the correction terms will become large, meaning that our wavelength-scale “slow evolution” approximation may come under threat. Second, even if our reference parameters do match the linear material responses exactly, our wavelength-scale will have become frequency dependent, and so again our “slow evolution” approximation may be threatened. In either case the solution is simple – we just need to revert to the bi-directional wave equations [i.e., eqns. (9) or (10)]. However, this does not necessarily mean that any backward evolving fields are generated (as explained in Kinsler et al. (2005), and following a different approach in Kinsler (2008)), so that in principle one could optimize the propagation by reinstating it only over those frequency ranges where it becomes necessary.

Refer to caption
Figure 3: Normalised refractive index nn (solid line) and group velocity vg=dk/dωv_{g}=dk/d\omega (dashed line) for the system defined in fig. 2, for only those frequencies where the field propagates. By comparing with fig. 2, we can see that the edges of the NPV band are dominated by the magnetic field, whereas the lower edge of the PPV band is dominated by the electric field. Note that in the NPV band, the sign of the group velocity is not tied to the sign of the phase velocity.

VII Conclusions

I have derived a uni-directional optical pulse propagation equation for media with both electric and magnetic responses, based on the directional fields approach Kinsler et al. (2005). This involved a re-expression of Maxwell’s equations, and required only a single approximation to reduce a one dimensional bi-directional model, to a uni-directional first-order wave equation. The simplicity of this approach makes it very convenient in waveguides, optical fibres, or other collinear situations. The important approximation is that the pulse evolves only slowly on the scale of a wavelength; and indeed this is a valid assumption in a wide variety of cases – note in particular that nonlinear effects have to be unrealizably strong to violate it Kinsler (2007b). The result has no intrinsic bandwidth restrictions, makes no demands on the pulse profile, and does not require a co-moving frame – unlike other common types of derivation Agrawal (2007); Brabec and Krausz (1997); Geissler et al. (1999); Kinsler and New (2003).

The resulting equations have the advantage that they are straightforward to write down, despite containing the complications of both electric and magnetic responses, and that a carrier-envelope representation or co-moving frames are easy to apply if desired, requiring no further approximation. In this, they match the clarity and flexibility of factorized second-order wave equations Ferrando et al. (2005); Genty et al. (2007); Kinsler (2008), but they can more easily incorporate the effects of magnetic material responses – albiet at the cost of being restricted to one dimensional propagation.

Acknowledgements.
I acknowledge financial support from the Engineering and Physical Sciences Research Council (EP/E031463/1).

References

Appendix A Derivation of eqn. (1)

The derivation in this paper is simpler, more general, and defines G±\textbf{G}^{\pm} using a better sign convention than those in Kinsler et al. (2005); Kinsler (2006). I start with the Maxwell curl equations, and transform into frequency space:

×H(t)\displaystyle\nabla\times\textbf{H}(t) =+tϵrE(t)+J(t),\displaystyle=+\partial_{t}\epsilon_{r}\star\textbf{E}(t)+\textbf{J}(t),
×E(t)\displaystyle\nabla\times\textbf{E}(t) =tμrH(t)μ0K(t)\displaystyle=-\partial_{t}\mu_{r}\star\textbf{H}(t)-\mu_{0}\textbf{K}(t) (23)
×H(ω)\displaystyle\nabla\times\textbf{H}(\omega) =ıωαr(ω)2E(ω)+J(ω),\displaystyle=-\imath\omega~\alpha_{r}(\omega)^{2}\textbf{E}(\omega)+\textbf{J}(\omega),
×E(ω)\displaystyle\nabla\times\textbf{E}(\omega) =+ıωβr(ω)2H(ω)μ0K(ω).\displaystyle=+\imath\omega~\beta_{r}(\omega)^{2}\textbf{H}(\omega)-\mu_{0}\textbf{K}(\omega). (24)

I now rotate the ×H\nabla\times\textbf{H} equation by taking the cross product with u,

u×(×H)\displaystyle~\textbf{u}\times\left(\nabla\times\textbf{H}\right) =ıωαr2(u×E)+u×J,\displaystyle=-\imath\omega~\alpha_{r}^{2}\left(\textbf{u}~\times\textbf{E}\right)+\textbf{u}\times\textbf{J}, (25)

scale each part by βr\beta_{r} and αr\alpha_{r} respectively, while insisting that these parameters do not depend on position. Thus,

u×(×βrH)\displaystyle\textbf{u}\times\left(\nabla\times\beta_{r}\textbf{H}\right) =ıωβrαr2(u×E)+u×βrJ,\displaystyle=-\imath\omega~\beta_{r}\alpha_{r}^{2}\left(\textbf{u}~\times\textbf{E}\right)+\textbf{u}\times\beta_{r}\textbf{J},
×αrE\displaystyle\nabla\times\alpha_{r}\textbf{E} =+ıωαrβr2Hαrμ0K,\displaystyle=+\imath\omega~\alpha_{r}\beta_{r}^{2}\textbf{H}-\alpha_{r}\mu_{0}\textbf{K}, (26)

and then take the sum and difference –

×αrE\displaystyle\nabla\times\alpha_{r}\textbf{E} ±u×(×βrH)\displaystyle\pm~\textbf{u}\times\left(\nabla\times\beta_{r}\textbf{H}\right)
=+ıωαrβr2Hıωβrαr2(u×E)\displaystyle=+\imath\omega~\alpha_{r}\beta_{r}^{2}\textbf{H}~\mp~\imath\omega~\beta_{r}\alpha_{r}^{2}\left(\textbf{u}~\times\textbf{E}\right)~~
±u×βrJαrμ0K.\displaystyle\qquad\qquad\qquad\pm~~\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}. (27)

The vector G±\textbf{G}^{\pm} fields are defined in eqn. (2), but that neglects the longitudinal part of H. Thus, for completeness, we also need to define G=uβrH{G}^{\circ}=\textbf{u}\cdot\beta_{r}\textbf{H} (as in Kinsler et al. (2005); Kinsler (2006)). Their form means that I need to convert both the second term on the LHS of the sum-and-difference equation above, as well as the RHS. It is most important for the LHS to be simple, because this will define the type of propagation specified by the RHS. In the following I use the vector identity.

u×(×H)(uH)\displaystyle\textbf{u}\times\left(\nabla\times\textbf{H}\right)-\nabla\left(\textbf{u}\cdot\textbf{H}\right) =\displaystyle= ×(u×H),\displaystyle\nabla\times\left(\textbf{u}\times\textbf{H}\right), (28)

along with u×[u×H]=[uH]uH\textbf{u}\times[\textbf{u}\times\textbf{H}]=[\textbf{u}\cdot\textbf{H}]\textbf{u}-\textbf{H}, so that

u×G\displaystyle\textbf{u}\times\textbf{G}^{\mp} =u×αrE±u×[u×βrH]\displaystyle=\textbf{u}\times\alpha_{r}\textbf{E}\pm\textbf{u}\times\left[\textbf{u}\times\beta_{r}\textbf{H}\right] (29)
=u×αrEβrH±[uβrH]u\displaystyle=\textbf{u}\times\alpha_{r}\textbf{E}\mp\beta_{r}\textbf{H}~~\pm\left[\textbf{u}\cdot\beta_{r}\textbf{H}\right]\textbf{u} (30)
=u×αrEβrH±uG.\displaystyle=\textbf{u}\times\alpha_{r}\textbf{E}\mp\beta_{r}\textbf{H}~~\pm\textbf{u}{G}^{\circ}. (31)

Continuing the derivation,

×αrE±u×(×βrH)\displaystyle\nabla\times\alpha_{r}\textbf{E}~~\pm~~\textbf{u}\times\left(\nabla\times\beta_{r}\textbf{H}\right) =+ıωαrβr2Hıωβrαr2(u×E)±u×βrJαrμ0K,\displaystyle=+\imath\omega\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\imath\omega\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}, (32)
×αrE±×(u×βrH)±(uβrH)\displaystyle\nabla\times\alpha_{r}\textbf{E}~~\pm~~\nabla\times\left(\textbf{u}\times\beta_{r}\textbf{H}\right)\pm\nabla\left(\textbf{u}\cdot\beta_{r}\textbf{H}\right) =+ıωαrβr2Hıωβrαr2(u×E)±u×βrJαrμ0K\displaystyle=+\imath\omega\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\imath\omega\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K} (33)
×[αrE±(u×βrH)]\displaystyle\nabla\times\left[\alpha_{r}\textbf{E}~~\pm~~\left(\textbf{u}\times\beta_{r}\textbf{H}\right)\right] =+ıωαrβr2Hıωβrαr2(u×E)(uβrH)±u×βrJαrμ0K\displaystyle=+\imath\omega\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\imath\omega\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\mp\nabla\left(\textbf{u}\cdot\beta_{r}\textbf{H}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}~~~~~~~~ (34)
×G\displaystyle\nabla\times\textbf{G}^{\mp} =ıω{αrβr2Hβrαr2(u×E)}(uβrH)±u×βrJαrμ0K\displaystyle=\imath\omega\left\{\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\right\}\mp\nabla\left(\textbf{u}\cdot\beta_{r}\textbf{H}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K} (35)

I now rearrange eqn. (35) to give the final form, in which I substitute tP=J(t)\partial_{t}\textbf{P}=\textbf{J}(t) and tM=K(t)\partial_{t}\textbf{M}=\textbf{K}(t) to match eqn. (1). Thus

×G\displaystyle\nabla\times\textbf{G}^{\mp} =ıω{αrβruGαrβr2(u×[u×H])βrαr2(u×E)}G±u×βrJαrμ0K\displaystyle=\imath\omega\left\{\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~-\alpha_{r}\beta_{r}^{2}\left(\textbf{u}\times\left[\textbf{u}\times\textbf{H}\right]\right)~~\mp\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\right\}\mp\nabla{G}^{\circ}~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K} (36)
=ıω{βrαr2(u×E)±αrβr2(u×[u×H])}+ıωαrβruGG±u×βrJαrμ0K,\displaystyle=\mp\imath\omega\left\{\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\pm\alpha_{r}\beta_{r}^{2}\left(\textbf{u}\times\left[\textbf{u}\times\textbf{H}\right]\right)\right\}~~+\imath\omega\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~\mp\nabla{G}^{\circ}~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}, (37)

and finally

×G±\displaystyle\nabla\times\textbf{G}^{\pm} =±ıωαrβru×G±+ıωαrβruG±Gu×βrJαrμ0K\displaystyle=\pm\imath\omega~\alpha_{r}\beta_{r}\textbf{u}\times\textbf{G}^{\pm}~~+\imath\omega\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~\pm\nabla{G}^{\circ}~~\mp\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K} (38)
=±ıωαrβru×G±+ıωαrβruG±G±ıωβru×P+ıωαrμ0M.\displaystyle=\pm\imath\omega~\alpha_{r}\beta_{r}\textbf{u}\times\textbf{G}^{\pm}~~+\imath\omega\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~\pm\nabla{G}^{\circ}~~\pm\imath\omega\beta_{r}\textbf{u}\times\textbf{P}+\imath\omega\alpha_{r}\mu_{0}\textbf{M}. (39)

Note that for J(t)=tP=tκϵE(t)\textbf{J}(t)=\partial_{t}\textbf{P}=\partial_{t}\kappa_{\epsilon}\textbf{E}(t), where κϵ\kappa_{\epsilon} is some complicated but scalar dielectric response function, we have

u×βrJ(ω)\displaystyle\mp\textbf{u}\times\beta_{r}\textbf{J}(\omega) =±ıωβru×P=±ıωαr2βrκϵu×E\displaystyle=\pm\imath\omega\beta_{r}\textbf{u}\times\textbf{P}\quad=\pm\imath\omega\alpha_{r}^{2}\beta_{r}\kappa_{\epsilon}\star\textbf{u}\times\textbf{E} (40)
=±ıωαr2βr2κϵ(u×[G++G]),\displaystyle=\pm\imath\omega\frac{\alpha_{r}^{2}\beta_{r}}{2}\kappa_{\epsilon}\star\left(\textbf{u}\times\left[\textbf{G}^{+}+\textbf{G}^{-}\right]\right), (41)

and for K(t)=tM=tκμH(t)\textbf{K}(t)=\partial_{t}\textbf{M}=\partial_{t}\kappa_{\mu}\textbf{H}(t), where κμ\kappa_{\mu} is some complicated but scalar magnetic response function, we have

αrμ0K(ω)\displaystyle-\alpha_{r}\mu_{0}\textbf{K}(\omega) =+ıωμ0αrM=+ıωμ0αrκμH\displaystyle=+\imath\omega\mu_{0}\alpha_{r}\textbf{M}\quad=+\imath\omega\mu_{0}\alpha_{r}\kappa_{\mu}\star\textbf{H} (42)
=+ıωμ0αrκμ(u[uH]u×u×H)\displaystyle=+\imath\omega\mu_{0}\alpha_{r}\kappa_{\mu}\star\left(\textbf{u}\left[\textbf{u}\cdot\textbf{H}\right]-\textbf{u}\times\textbf{u}\times\textbf{H}\right) (43)
=+ıωμ0αrβrκμ(uGu×[u×βrH])\displaystyle=+\imath\omega\mu_{0}\frac{\alpha_{r}}{\beta_{r}}\kappa_{\mu}\star\left(\textbf{u}{G}^{\circ}-\textbf{u}\times\left[\textbf{u}\times\beta_{r}\textbf{H}\right]\right) (44)
=ıωμ0αr2βrκμ(u×[G+G]2uG).\displaystyle=-\imath\omega\mu_{0}\frac{\alpha_{r}}{2\beta_{r}}\kappa_{\mu}\star\left(\textbf{u}\times\left[\textbf{G}^{+}-\textbf{G}^{-}\right]-2\textbf{u}{G}^{\circ}\right). (45)

Finally, when generating eqn. (25), we lost the longitudinal part of ×H=ϵrE+J\nabla\times\textbf{H}=\partial\epsilon_{r}\textbf{E}+\textbf{J} (i.e. that parallel to u). This is

u×H\displaystyle\textbf{u}\cdot\nabla\times\textbf{H} =ıωαr2uE+uJ\displaystyle=-\imath\omega\alpha_{r}^{2}\textbf{u}\cdot\textbf{E}+\textbf{u}\cdot\textbf{J} (46)
u×(G+G+2uG)\displaystyle\textbf{u}\cdot\nabla\times\left(\textbf{G}^{+}-\textbf{G}^{-}+2\textbf{u}G^{\circ}\right) =ıωαrβru(G+G)\displaystyle=-\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\left(\textbf{G}^{+}-\textbf{G}^{-}\right)
+2βruJ\displaystyle\qquad+2\beta_{r}\textbf{u}\cdot\textbf{J} (47)
2u(Gu×G)\displaystyle 2\textbf{u}\cdot\left(\nabla{G}^{\circ}-\textbf{u}\times\nabla{G}^{\circ}\right) =ıωαrβru(G+G)\displaystyle=-\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\left(\textbf{G}^{+}-\textbf{G}^{-}\right)
+2βruJ\displaystyle\qquad+2\beta_{r}\textbf{u}\cdot\textbf{J} (48)
2uG\displaystyle 2\textbf{u}\cdot\nabla{G}^{\circ} =ıωαrβru(G+G)+2βruJ,\displaystyle=-\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\left(\textbf{G}^{+}-\textbf{G}^{-}\right)+2\beta_{r}\textbf{u}\cdot\textbf{J}, (49)

since ×Gu=G×uu×G=u×G\nabla\times{G}^{\circ}\textbf{u}={G}^{\circ}\nabla\times\textbf{u}-\textbf{u}\times\nabla{G}^{\circ}=-\textbf{u}\times\nabla{G}^{\circ}, and

u×(G+G)\displaystyle\textbf{u}\cdot\nabla\times\left(\textbf{G}^{+}-\textbf{G}^{-}\right) =ıωαrβruu×(G++G)\displaystyle=\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\textbf{u}\times\left(\textbf{G}^{+}+\textbf{G}^{-}\right)
+2uG+2ıωβruu×P\displaystyle\qquad+2\textbf{u}\cdot\nabla{G}^{\circ}+2\imath\omega\beta_{r}\textbf{u}\cdot\textbf{u}\times\textbf{P} (50)
=2uG.\displaystyle=2\textbf{u}\cdot\nabla{G}^{\circ}. (51)

Appendix B Correction terms

In this appendix I work through the details of how the polarization and magnetization terms scale with respect to one another. To simplify matters, I assume all corrections are scalar since when ϵr\epsilon_{r} and μr\mu_{r} are not field-polarization or orientation sensitive, the scalings remain the same, even if the specific field terms may vary (e.g. ExEy{E}_{x}{E}_{y} instead of Ex2{E}_{x}^{2}).

Consider the general unidirectional equation for Fx±{F}_{x}^{\pm} (i.e. eqn. (10)), and replace the polarization and magnetization terms with dimensionless response parameters qϵq_{\epsilon} and qμq_{\mu} multiplied by the appropriate field Ex{E}_{x} or Hy{H}_{y}. Then replace Ex{E}_{x} and Hy{H}_{y} with their representation in terms of Fx+{F}_{x}^{+}, so that

ıω2βrαrPx+ıω2μ0My\displaystyle\frac{\imath\omega}{2}\frac{\beta_{r}}{\alpha_{r}}{P}_{x}+\frac{\imath\omega}{2}\mu_{0}{M}_{y} =ıω2βrαrqϵϵ0Ex+ıω2qμμ0Hy\displaystyle=\frac{\imath\omega}{2}\frac{\beta_{r}}{\alpha_{r}}q_{\epsilon}\epsilon_{0}{E}_{x}~~+\frac{\imath\omega}{2}q_{\mu}\mu_{0}{H}_{y} (52)
=ıω2[βrαrqϵϵ0Fx++qμμ0αrβrFx+]\displaystyle=\frac{\imath\omega}{2}\left[\frac{\beta_{r}}{\alpha_{r}}q_{\epsilon}\epsilon_{0}{F}_{x}^{+}~~+q_{\mu}\mu_{0}\frac{\alpha_{r}}{\beta_{r}}{F}_{x}^{+}\right] (53)
=ıωαrβr2[qϵϵ0αr2Fx++qμμ0βr2Fx+]\displaystyle=\frac{\imath\omega\alpha_{r}\beta_{r}}{2}\left[\frac{q_{\epsilon}\epsilon_{0}}{\alpha_{r}^{2}}{F}_{x}^{+}~~+\frac{q_{\mu}\mu_{0}}{\beta_{r}^{2}}{F}_{x}^{+}\right] (54)
=ıkr2ϵ0ϵr[qϵ+qμμ0ϵ0ϵrμr]Fx+,\displaystyle=\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[q_{\epsilon}~~+q_{\mu}\frac{\mu_{0}}{\epsilon_{0}}\frac{\epsilon_{r}}{\mu_{r}}\right]{F}_{x}^{+}, (55)

remembering that F+=E=(βr/αr)H\textbf{F}^{+}=\textbf{E}=(\beta_{r}/\alpha_{r})\textbf{H}, and that ϵr=αr2\epsilon_{r}=\alpha_{r}^{2}, and μr=βr2\mu_{r}=\beta_{r}^{2}.

Since we consider the electric-field-like field F+{F}^{+}, the polarization corrections are trivial to write down; as for an mm-th order nonlinear term, qϵ=χϵF+(m1)q_{\epsilon}=\chi_{\epsilon}F^{+(m-1)}. This means we need only concentrate on the magnetization correction. If qμq_{\mu} is that for an mm-th order nonlinear term, then qμ=χμHn1=χμ(αr/βr)(m1)F+(m1)q_{\mu}=\chi_{\mu}H^{n-1}=\chi_{\mu}(\alpha_{r}/\beta_{r})^{(m-1)}F^{+(m-1)}. Writing down only the term in square brackets from eqn. (55) gives us

[qϵ+χμμ0ϵ0ϵrμr(αrβr)m1Fx+(m1)]Fx+\displaystyle\left[q_{\epsilon}~+\chi_{\mu}\frac{\mu_{0}}{\epsilon_{0}}\frac{\epsilon_{r}}{\mu_{r}}\left(\frac{\alpha_{r}}{\beta_{r}}\right)^{m-1}{F}_{x}^{+(m-1)}\right]{F}_{x}^{+} (56)
=[qϵ+χμμ0ϵ0(ϵrμr)m+12Fx+(m1)]Fx+.\displaystyle\qquad\qquad=\left[q_{\epsilon}~+\chi_{\mu}\frac{\mu_{0}}{\epsilon_{0}}\left(\frac{\epsilon_{r}}{\mu_{r}}\right)^{\frac{m+1}{2}}{F}_{x}^{+(m-1)}\right]{F}_{x}^{+}. (57)

Note that corrections for linear loss or gain are first-order processes (i.e. with m=1m=1), where for loss we need qıγq\sim\imath\gamma, with γ>0\gamma>0; Thus for loss the whole correction term will be proportional to γFx+-\gamma{F}_{x}^{+}, as would be expected.