A uni-directional optical pulse propagation equation for materials with both electric and magnetic responses

Paul Kinsler Dr.Paul.Kinsler@physics.org Blackett Laboratory, Imperial College London, Prince Consort Road, London SW7 2AZ, United Kingdom.

(July 30, 2025)

Abstract

I derive uni-directional wave equations for fields propagating in materials with both electric and magnetic dispersion and nonlinearity. The derivation imposes no conditions on the pulse profile except that the material modulates the propagation only slowly: i.e. that loss, dispersion, and nonlinearity have only a small effect over the scale of a wavelength. It also allows a direct term-to-term comparison of the exact bi-directional theory with its approximate uni-directional counterpart.

I Introduction

In the past few years, composite materials (“metamaterials”) that demonstrate both an electric and magnetic response have been the subject of both experimental and theoretical investigation. Often the motivation for this research is the potential for exotic applications: for example, superresolution Pendry (2000), or the possibility of “trapped rainbow” light storage Tsakmakidis et al. (2007). Despite these interesting possibilities, there is also the more basic need for efficient methods for propagating optical pulses in such metamaterials, in particular in one-dimensional (1D) waveguide geometries. Indeed, methods for doing so have already led to interesting predictions (see, e.g., Scalora et al. (2005); Wen et al. (2007); D’Aguanno et al. (2008)). However, these methods, and earlier ones, (e.g. Agrawal (2007); Brabec and Krausz (1997); Geissler et al. (1999); Kinsler and New (2003)) tend to rely on mechanisms such as the introduction of a co-moving frame, and assumptions that the pulse profile has negligible second-order temporal or spatial derivatives. Assuming second-order derivatives are small may well be reasonable, but it means that the pulse profile must remain well behaved. This approximation therefore might well become poorly controlled Kinsler and New (2003); Kinsler (2002), particularly for ultrashort or otherwise ultrawideband pulses, or exotic or extreme material parameters. Ideally we would prefer to make approximations based solely on the material parameters of our device, so as to avoid making assumptions about the state of an ever-changing propagating pulse.

Here I derive 1D wave equations for a waveguide with both electric and magnetic dispersion, and electric and magnetic nonlinearity. I use the directional fields approach Kinsler et al. (2005); Kinsler (2007a), which allows us to directly write down a first-order wave equation for pulse propagation without complicated derivation or approximation. We simply look at the coupled forward and backward wave equations that are a direct re-expression of Maxwell’s curl equations, and substitute in the appropriate dispersion and nonlinearity. I also show separate examples for second- and third-order nonlinearities in both electric and magnetic responses, although the effects can be combined if desired. Note that these directional fields are applicable to more than just pulse propagation, as they have been used to simplify Poynting-vector-based approaches to electromagnetic continuity equations Kinsler et al. (2009).

The derivation makes only a single, well-defined approximation to reduce the bi-directional forward-backward coupled model down to a single first-order wave equation – that of assuming small changes over the scale of a wavelength. This approximation is remarkably robust for all physically realistic parameter values – see Kinsler (2007b) for an analysis focused on nonlinear effects; more general considerations have been dealt with in terms of factorized wave equations Kinsler (2008). The resulting wave equation retains all the usual intuitive and analytical simplicity of ordinary wave propagation equations, unlike the computationally intensive approach of a direct numerical solution of Maxwell’s equations (see, e.g., Yee-1966tap; Flesch-PM-1996prl; Gilles-MV-1999pre; Brabec and Krausz (1997); Tarasishin-MZ-2001oc; Tyrrell-KN-2005jmo).

This paper is structured as follows: Directional fields and their re-expression of the Maxwell curl equations is outlined in Section II, followed by the reduction of the bi-directional wave equation into a uni-directional form in Section III. Section IV shows wave equations for a doubly-nonlinear third-order nonlinearity material, and Section V does the same for a second-order case. In Section VI propagation under the influence of typical metamaterial responses is discussed, and I conclude in Section VII.

II Directional fields

The directional fields approach Kinsler et al. (2005) allows us to write down wave equations for hybrid electromagnetic fields $\textbf{G}^{\pm}$ . Note that here I define $\textbf{G}^{\pm}$ with an alternate (and more sensible) sign convention than previously. Further, I also allow for more general types of polarization and magnetization in such a way as to provide a simpler presentation. For propagation along the unit vector u, the propagation (curl) equation for $\textbf{G}^{\pm}$ is written in the frequency domain as¹¹1See derivation in Appendix A.

\displaystyle\nabla\times\textbf{G}^{\pm}

\displaystyle=\pm\imath\omega\alpha_{r}\beta_{r}\textbf{u}\times\textbf{G}^{\pm}~\pm\imath\omega\beta_{r}\textbf{u}\times\textbf{P}_{c}~+\imath\omega\alpha_{r}\mu_{0}\textbf{M}_{c}

(1)

with

\displaystyle\textbf{G}^{\pm}(\omega)

\displaystyle=\alpha_{r}(\omega)\textbf{E}(\omega)\pm\beta_{r}(\omega)\textbf{H}(\omega)\times\textbf{u}.

(2)

Here the electric response of the material is encoded in two parts: a spatially invariant linear response component $\alpha_{r}$ , and the remaining contributions (of any type) in $\textbf{P}_{c}$ . Similarly, the magnetic response is divided up in the same way between $\beta_{r}$ and $\mu_{0}\textbf{M}_{c}$ . Generally we will put the entire non-lossy linear response of the medium (i.e. the dispersion) into the reference parameters $\alpha_{r}$ and $\beta_{r}$ , although it may also be convenient to specify only that the product $\alpha_{r}\beta_{r}$ is real (cf. Kinsler-2009pra). All the nonlinear responses and other complications (“corrections”), such as spatial variations in the material parameters, remain in $\textbf{P}_{c}$ and $\textbf{M}_{c}$ . As an example, in Kinsler et al. (2005) this approach was applied to second-harmonic generation in a periodically poled dielectric crystal. The time derivatives of these corrections $\textbf{P}_{c}$ and $\textbf{M}_{c}$ correspond to bound electric and magnetic currents respectively Kinsler et al. (2009). These $\textbf{P}_{c}$ and $\textbf{M}_{c}$ are functions of both fields E and H, i.e. $\textbf{P}_{c}\equiv\textbf{P}_{c}(\textbf{E},\textbf{H})$ and $\textbf{M}_{c}\equiv\textbf{M}_{c}(\textbf{E},\textbf{H})$ . If we choose instead to have frequency-independent $\alpha_{r}$ and $\beta_{r}$ , then the remaining linear response can simply be included in $\textbf{P}_{c}$ and $\textbf{M}_{c}$ ; in this case the “weak loss and nonlinearity” condition I use later to decouple forward and backward fields would then need to be broadened to include weak dispersion as well (also see Kinsler et al. (2005) for more discussion). However, neither version imposes any requirements on the pulse profile.

It is useful to give a simple example of the directional fields to provide some insight into their nature. In the pure transverse plane-polarized case, with fields propagating along the $z$ direction, and frequency-independent (material parameters) permitivitty $\epsilon_{r}$ and permeability $\mu_{r}$ , we can write

	$\displaystyle G_{x}^{\pm}$	$\displaystyle=\sqrt{\epsilon_{r}}E_{x}\pm\sqrt{\mu_{r}}H_{y},$		(3)
	$\displaystyle G_{y}^{\pm}$	$\displaystyle=\sqrt{\epsilon_{r}}E_{y}\mp\sqrt{\mu_{r}}H_{x},$		(4)

where this simple $G_{x}^{\pm}$ definition matches the original proposal of Fleck Fleck (1970).

It is worth considering how reflections arise in this picture based on spatially invariant reference parameters augmented by corrections terms. Leaving aside for now the distinctions between spatially propagated fields and temporally propagated ones (see the discussion in Kinsler (2008)), transition to a new media can be handled in two ways. First, we could map the existing fields $\textbf{G}^{\pm}$ onto new ones $\textbf{G}_{n}^{\pm}$ based on new reference parameters $\alpha_{rn}$ and $\beta_{rn}$ . Here a pure $\textbf{G}^{+}$ field would separate into two pieces, one a forward propagating $\textbf{G}_{n}^{+}$ , and the other a “reflected” backward propagating $\textbf{G}_{n}^{-}$ . Second, we might retain the existing reference parameters, and have modified corrections $\textbf{P}_{c}^{\prime}$ and $\textbf{M}_{c}^{\prime}$ . These altered correction terms then couple the forward and backward directed fields, inducing the necessary reflection in $\textbf{G}^{-}$ ; although as a side-effect of our now no longer optimal $\alpha_{r}$ and $\beta_{r}$ , the forward evolving field is made up of coupled $\textbf{G}^{+}$ and $\textbf{G}^{-}$ components Kinsler et al. (2005).

II.1 Material response

We define the electric and magnetic material response in the frequency domain, as it greatly simplifies the description of the linear components. Let us chose a reference behaviour given by $\epsilon_{r}(\omega),\mu_{r}(\omega)$ , and use them to define reference parameters $\alpha_{r}(\omega)=\sqrt{\epsilon_{r}(\omega)}$ and $\beta_{r}(\omega)=\sqrt{\mu_{r}(\omega)}$ . Note that these are allowed to have a frequency dependence Kinsler et al. (2005), and that $\alpha_{r}\beta_{r}$ is just the reciprocal of the (reference) speed of light in the medium (i.e. $n_{r}/c$ ). We therefore have that the electric displacement and magnetic fields are

	$\displaystyle\textbf{D}(\omega)$	$\displaystyle=\epsilon_{0}\textbf{E}(\omega)+\textbf{P}(\omega)~=$	$\displaystyle\epsilon_{r}(\omega)\textbf{E}(\omega)+\textbf{P}_{c}(\omega),$		(5)
	$\displaystyle\textbf{B}(\omega)$	$\displaystyle=\mu_{0}\textbf{H}(\omega)+\textbf{M}(\omega)=$	$\displaystyle\mu_{r}(\omega)\textbf{H}(\omega)+\mu_{0}\textbf{M}_{c}(\omega).$		(6)

To give a specific example, we can define frequency-dependent loss and dispersive corrections by $\kappa_{\epsilon}(\omega)$ and $\kappa_{\mu}(\omega)$ , along with (e.g.) independent third-order nonlinearities $\chi_{\epsilon},\chi_{\mu}$ to both the material responses; although any appropriate expression can be used – even magneto-electric or other types. Thus we can write the frequency domain expressions

	$\displaystyle\textbf{P}_{c}(\omega)$	$\displaystyle=\alpha_{r}^{2}\kappa_{\epsilon}\textbf{E}+\epsilon_{0}\mathscr{F}\left[\chi_{\epsilon}E^{2}(t)\right]\star\textbf{E}$		(7)
	$\displaystyle\mu_{0}\textbf{M}_{c}(\omega)$	$\displaystyle=\beta_{r}^{2}\kappa_{\mu}\textbf{H}+\mu_{0}\mathscr{F}\left[\chi_{\mu}H^{2}(t)\right]\star\textbf{H},$		(8)

where $\mathscr{F}[...]$ takes the Fourier transform (which is necessary because nonlinear effects are defined in the time domain as powers of the field) and $\star$ denotes a convolution [i.e., $a\star b=\int a(\omega)b(\omega-\omega^{\prime})d\omega^{\prime}$ ]. If the nonlinearity is time dependent, then the simple $\chi_{\epsilon}E^{2}$ type terms can be replaced with the appropriate convolution. In general, it is best to pick $\alpha_{r},\beta_{r}$ subject to the condition that the sizes of $\textbf{P}_{c}$ and $\textbf{M}_{c}$ are minimised.

In a double-negative material (with both $\epsilon,\mu<0$ ) we would get imaginary $\alpha_{r},\beta_{r}$ , changing the complex phase of $\textbf{G}^{\pm}(\omega)$ away from that given by the original E and H. Since this is in the frequency domain, it converts into a phase shift in the time domain, so although imaginary-valued $\alpha_{r},\beta_{r}$ might seem inconvenient, it does not give unphysical results.

III Wave equations

Starting with the vectorial curl equation (1), I first take the 1D, but bi-directional, limit, and describe the approximation necessary to produce a simpler uni-directional form. After this, I discuss how the common transformations used in optical wave equations can be applied in this context. All equations and field quantities are in the frequency ( $\omega$ ) domain, unless explicitly noted otherwise.

III.1 Bi-directional wave equations

Here we set u along the $z$ axis without loss of generality, and consider just an $x$ polarized wave (i.e., consisting of $E_{x},H_{y}$ ). This means we use the $y$ component of eqn. (1) with $\partial_{z}=d/dz$ , so that the wave equations for the full spectrum fields ${G}_{x}^{\pm}(\omega)$ , coupled by corrections $P_{x}\equiv{P}_{c,x}(\omega)$ and $M_{y}\equiv{M}_{c,y}(\omega)$ are

\displaystyle\partial_{z}{G}_{x}^{\pm}

\displaystyle=\pm\imath\omega\alpha_{r}\beta_{r}{G}_{x}^{\pm}\quad\pm\imath\omega\beta_{r}{P}_{x}\quad+\imath\omega\alpha_{r}{M}_{y}.

(9)

Following the detailed discussion in Kinsler (2008)²²2Section III, we say that this wave equation propagates (“steps”) the fields forward along the $z$ direction using oppositely directed fields ${G}_{x}^{+}$ and ${G}_{x}^{-}$ . These fields can be written as functions of either time or frequency, and pulses they describe therefore evolve (travel) forward or backward in time.

Consider the example case with parameters $\kappa_{\epsilon}(\omega)$ and $\kappa_{\mu}(\omega)$ , and $\chi_{\epsilon},\chi_{\mu}$ in eqns. (7) and (8). Defining $k_{r}(\omega)=\omega\alpha_{r}(\omega)\beta_{r}(\omega)$ , we get

$\displaystyle\partial_{z}{G}_{x}^{\pm}$	$\displaystyle=\pm\imath k_{r}{G}_{x}^{\pm}~~\pm\frac{\imath k_{r}\kappa_{\epsilon}}{2}\left({G}_{x}^{+}+{G}_{x}^{-}\right)+\frac{\imath k_{r}\kappa_{\mu}}{2}\left({G}_{x}^{+}-{G}_{x}^{-}\right)$
	$\displaystyle\qquad\pm\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\mathscr{F}\left[\chi_{\epsilon}E^{2}\right]\star\left({G}_{x}^{+}+{G}_{x}^{-}\right)$
	$\displaystyle\qquad\quad+\frac{\imath k_{r}}{2}\frac{\mu_{0}}{\mu_{r}}\mathscr{F}\left[\chi_{\mu}H^{2}\right]\star\left({G}_{x}^{+}-{G}_{x}^{-}\right).$	(10)

Note that even for a frequency-independent choice of the reference parameters $\alpha_{r}$ and $\beta_{r}$ , the reference wave vector $k_{r}$ retains a (linear) frequency dependence. Also, the dispersion and/or loss parameters $\kappa_{\epsilon},\kappa_{\mu}$ are directly related to $\epsilon$ and $\mu$ respectively, and not to a refractive index $n$ or wavevector $k$ . This is why there is a factor of $1/2$ associated with their appearance in eqn. (10) and subsequently.

If written in the time domain, these wave equations are seen to propagate the full temporal history of a field forward in space. There, the reference propagation given by $\pm\imath k_{r}G_{x}^{\pm}$ becomes a convolution if $k_{r}$ retains a nontrivial frequency dependence. However, if we expand $k_{r}(\omega)$ around a central frequency $\omega_{1}$ in powers of $\omega-\omega_{1}$ , we can instead convert it (in the time domain) into a Taylor series in time derivatives, which is a popular alternative to the frequency domain form used here. However, if implementing a split-step Fourier method of solving these wave equations, dispersion is applied in the frequency domain, so that in general such an expansion is an unnecessary complication.

III.2 Uni-directional approximation

Now we apply the approximation: that the effect of any correction terms is small over propagation distances of one wavelength – or, if you prefer, over time intervals of one optical period. This translates into a weak loss and nonlinearity assumption; and if the correction terms $\textbf{P}_{c}$ and $\textbf{M}_{c}$ include dispersion, a weak dispersion assumption is also made. These are rarely very stringent approximations. If $|\textbf{P}_{c}|<<|\textbf{D}|$ and $|\mu_{0}\textbf{M}_{c}|<<|\textbf{B}|$ , then a forward $\textbf{G}^{+}$ has minimal co-propagating $\textbf{G}^{-}$ Kinsler et al. (2005). Further, the forward field has a wave vector $k_{r}$ evolving as $\exp(+\imath k_{r}z)$ , but any generated backward component will evolve as $\exp(-\imath k_{r}z)$ . This gives a very rapid relative oscillation $\exp(-2\imath k_{r}z)$ , which will quickly average to zero. Nevertheless, although achievable optical nonlinearity coefficients fall well within this approximation, care may need to be taken with the dispersion, particularly if near a band edge or in the vicinity of a narrow resonance.

A directly comparable approximation is treated exhaustively in Kinsler (2008), where although applied to bi-directional factorizations of the second-order wave equation, the physical considerations are exactly the same: Deviations from the reference behaviour over a propagation distance of one wavelength should be small. Note that the slow evolution approximation applied here is not the same as other “slowly varying” types of approximation [e.g., the slowly varying envelope approximation (SVEA)] – although the physical motivation is similar, the approach used here is far less restrictive.

After we apply this weak correction or “slow evolution” approximation, we set the initial value of ${G}_{x}^{-}\equiv 0$ , and can be sure that it will stay negligible. Thus eqn. (9) for the full spectrum, forward directed field ${G}_{x}^{+}(\omega)$ can be written as

\displaystyle\partial_{z}{G}_{x}^{+}

\displaystyle\simeq+\imath\omega\alpha_{r}\beta_{r}{G}_{x}^{+}~~+\imath\omega\beta_{r}{P}_{x}~~+\imath\omega\alpha_{r}\mu_{0}{M}_{y}.

(11)

Alternatively, we can scale the ${G}_{x}^{+}$ field so that it has the same units and scaling as the electric field, using ${F}_{x}^{+}(\omega)={G}_{x}^{+}(\omega)/2\alpha_{r}(\omega)$ . This gives

\displaystyle\partial_{z}{F}_{x}^{+}

\displaystyle\simeq+\imath\omega\alpha_{r}\beta_{r}{F}_{x}^{+}~~+\imath\frac{\omega\beta_{r}}{2\alpha_{r}}{P}_{x}^{+}~~+\imath\frac{\omega}{2}\mu_{0}{M}_{y}^{+}.

(12)

Note that $E_{x}^{+}={G}_{x}^{+}/2\alpha_{r}\equiv{F}_{x}^{+}$ and $H_{y}^{+}={G}_{x}^{+}/2\beta_{r}\equiv{F}_{x}^{+}\alpha_{r}/\beta_{r}$ , since ${G}_{x}^{-}=0$ . In either version of these uni-directional equations, $P_{x}^{+}\equiv P(E_{x}^{+},H_{y}^{+})$ and $M_{y}^{+}\equiv M(H_{y}^{+},E_{x}^{+})$ – the uni-directional (residual) polarization ${P}_{x}^{+}$ and (residual) magnetization ${M}_{y}^{+}$ should not be written as functions of the total fields ${E}_{x}$ and ${H}_{y}$ .

III.3 Modifications

Either of eqns. (11) or (12) by themselves are sufficient to model the propagation of the electric and magnetic fields. However, there are many traditional simplifications which can be applied, and which in other treatments are even sometimes required in order obtain a simple evolution equation. In particular, the various envelope equations Brabec and Krausz (1997); Kinsler and New (2003); Geissler et al. (1999); Scalora and Crenshaw (1994) all use co-moving and/or envelopes as a preparation for discarding inconvenient derivatives: Here such steps are optional extras.

These are all considered in more detail for a factorised wave equation approach in Kinsler (2008), but here I have adapted them for this context.

1.

A co-moving frame can now be added, using $t^{\prime}=t-z/v_{f}$ . This is a simple linear process that causes no extra complications; the leading right-hand side (RHS) $\imath\alpha_{r}\beta_{r}\omega=\imath k_{r}$ term is replaced by $\imath(\alpha_{r}\beta_{r}\omega\mp k_{f})$ , for frame speed $v_{f}=\omega_{0}/k_{f}$ . Setting $k_{f}=k_{r}(\omega_{0})$ will cancel the phase velocity $v_{p}$ of the pulse at $\omega_{0}$ , not the group velocity.
2.

The field can be split up into pieces localized at certain frequencies, as done in descriptions of optical parametric amplifiers or Raman combs (as in, e.g., Kinsler and New (2003, 2004, 2005)). The wave equation can then be separated into one equation for each piece, coupled by the appropriate frequency-matched polarization terms (see, e.g., Kinsler et al. (2006)).
3.

A carrier-envelope description of the field can easily be implemented with the usual prescription of Gabor (1946); Boyd (2003) ${F_{x}^{+}}(t)=A(t)\exp[\imath(\omega_{1}t-k_{1}z)]+A^{*}(t)\exp[-\imath(\omega_{1}t-k_{1}z)]$ defining an envelope $A(t)$ with respect to carrier frequency $\omega_{1}$ and wave vector $k_{1}$ ; this also provides a built-in a co-moving frame $v_{f}=\omega_{1}/k_{1}$ . Multiple envelopes centred at different carrier frequencies and wave vectors ( $\omega_{i}$ , $k_{i}$ ) can also be used Kinsler et al. (2006); Boyd (2003).
4.

Bandwidth restrictions might be added (see below), either to ensure a smooth envelope or to simplify the wave equations; in addition they might be used to separate out or neglect frequency mixing terms or harmonic generation. As it stands, no bandwidth restrictions were applied when deriving eqns. (11) or (12) – there are only the limitations introduced by the dispersion and/or polarization models to consider. Typically we would expand the model parameters to the first few orders about some convenient reference frequency $\omega_{0}$ .
5.

Mode averaging is where the transverse extent of a propagating beam is not explicitly modeled, but is subsumed into a description of a transverse mode profile; as such it is typically applied to situations involving optical fibres or other waveguides. Thus we could use mode averaging when calculating the effective dispersion or nonlinear parameters. See, for example, Laegsgaard (2007) for a recent approach, which goes beyond a simple addition of a frequency dependence to the “effective area” of the mode, and generalizes the effective area concept itself.

III.4 Diffraction

One important feature lacking in this approach is the handling of transverse effects such as diffraction, although they can be inserted by hand (at least in the paraxial limit) by adding the term $\imath(\partial_{x}^{2}+\partial_{y}^{2})F_{x}^{+}/2k_{r}$ to the RHS Kinsler (2007a). However, no treatment of transverse effects has been achieved in a native directional fields description on the basis of the first-order equations – although transverse terms arise naturally in the second-order equation resulting from taking the curl of eqn. (1). Treating nonlinear diffraction Boardman et al. (2000) suffers the same difficulties, although presumably it might be incorporated in an analogous way as to ordinary diffraction.

IV Third-order nonlinearity

Third-order nonlinearities are common in many materials, for example in the silica used to make optical fibres (see, e.g., Agrawal (2007)). There are many applications of significant scientific interest, for example, white light supercontinnua Alfano and Shapiro (1970a, b); Dudley et al. (2006), optical rogue waves Solli et al. (2007); or filamentation Braun et al. (1995); Berge and Skupin (2009).

Here we study propagation through such a material, with non-reference linear responses $\kappa_{\epsilon},\kappa_{\mu}$ (describing e.g. loss and dispersion), and instantaneous magnetic third-order nonlinearity $\chi_{\mu}$ Klein-WFL-2007oe along with the more common electric type $\chi_{\epsilon}$ . For such a system, and for plane-polarized fields, the propagation equation is

	$\displaystyle\partial_{z}{F}_{x}^{+}$	$\displaystyle=+\imath k_{r}\left[1+\frac{\kappa_{\epsilon}}{2}+\frac{\kappa_{\mu}}{2}\right]{F}_{x}^{+}$
		$\displaystyle\qquad~+\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[\chi_{\epsilon}+\frac{\mu_{0}}{\epsilon_{0}}\frac{\epsilon_{r}^{2}}{\mu_{r}^{2}}\chi_{\mu}\right]\mathscr{F}\left[{F}_{x}^{+2}(t)\right]\star{F}_{x}^{+}.$		(13)

This is a generalized nonlinear Schrödinger (NLS) equation, and it retains both the full field (i.e. uses no envelope description) and the full nonlinearity (i.e. includes third-harmonic generation). The only assumptions made are that of transverse fields and weak dispersive and nonlinear responses; these latter assumptions allow us to decouple the forward and backward wave equations. This decoupling allows us, without any extra approximation, to reduce our description to one of forward-only pulse propagation. The specific example chosen here is for a cubic nonlinearity, but it is easily generalized to the noninstantaneous case or even other scalar nonlinearities.

We can transform eqn. (13) into one closer to the ordinary NLS equation by representing the field in terms of an envelope and carrier

\displaystyle{F}_{x}^{+}(t)

\displaystyle=A(t)\exp\left[\imath\left(\omega_{0}t-k_{0}z\right)\right]+A^{*}(t)\exp\left[-\imath\left(\omega_{0}t-k_{0}z\right)\right],

(14)

where we choose the carrier wave vector to be $k_{0}=k_{r}(\omega_{0})=\omega_{0}\alpha_{r}(\omega_{0})\beta_{r}(\omega_{0})$ . After separating into a pair of complex-conjugate equations (one for $A$ and one for $A^{*}$ ), and ignoring the off-resonant third-harmonic generation term, this gives us the expected NLS equation without diffraction. The chosen carrier effectively moves us into a frame that freezes the carrier oscillations, but this phase velocity ( $v_{p}=\omega/k$ ) frame differs from one that is co-moving with the pulse envelope (i.e., one moving at the group velocity $v_{g}=\partial\omega/\partial k$ ). After we transform into a frame co-moving with the group velocity at $\omega_{0}$ , where $\Delta_{g}=\omega_{0}[v_{g}^{-1}(\omega_{0})-v_{p}^{-1}(\omega_{0})]$ , the wave equation for ${A}(\omega)$ is

\displaystyle\partial_{z}{A}

\displaystyle=+\imath K(\omega){A}~~+\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\mathscr{F}\left[2\chi{\left|A(t)\right|^{2}}{A}(t)\right],

(15)

where $K(\omega)=k_{r}[\kappa_{\epsilon}(\omega)+\kappa_{\mu}(\omega)]/2+\Delta_{g}$ ; and $\chi=\chi_{\epsilon}-(\mu_{0}\epsilon_{r}^{2}/\epsilon_{0}\mu_{r}^{2})\chi_{\mu}$ . All that has been assumed to derive this standard envelope NLS equation is uni-directional propagation and negligible third-harmonic generation. The self-steepening term, often seen in (or added to) NLS equations arises from the frequency dependence of $k_{r}$ . This self-steepening has both electric and magnetic contributions, which can be adjusted independently, as has been pointed out by Wen et al. Wen et al. (2007) for the case of the SVEA limit. In Section VI, I discuss how the importance of each contribution varies with frequency for both a double-plasmon model (as in Wen et al. (2007)), and a wire-array and split-ring model more typical of practical metamaterials.

It is worth comparing this eqn. (15) to D’Aguanno et al.’s D’Aguanno et al. (2008) eqn. (5) [hereafter eqn. (DMB5)]. Although in many respects they appear to be the same, mine is far more general and can be applied (at least in principle) to an arbitrarily wide pulse bandwidth, whereas theirs is subject to the rather restrictive SVEA. For example, my eqn. (15) results from only one “slow evolution” approximation, as opposed to the numerous steps, substitutions, and approximations in Section 2 of D’Aguanno et al. (2008). I also retain the possibility of arbitrary dispersion $K(\omega)$ , whereas theirs retains only the second-order part (i.e. as $\propto\partial_{t}^{2}$ , which in the frequency domain would be $\propto\omega^{2}$ ). Indeed, with the dispersion and nonlinear factors in my eqn. (13) combined, that full-spectrum wave propagation equation is scarecly more complicated than eqn. (DMB5). Similar remarks also hold when comparing eqn. (15) to Wen et al.’s Wen et al. (2007): but although Wen et al.’s result is also restricted by the SVEA, it does at least allow for diffraction. Both, however, along with Scalora et al.’s form Scalora et al. (2005), cannot model the full non-envelope field, nor revert to an exact and explicitly bi-directional form, as in my eqn. (9) or (10).

V Second-order nonlinearity

Treating a second-order nonlinearity is more complicated than the third-order case, since it typically couples the two possible polarization states of the field together. Such interactions occur in materials used for optical parametric amplification, and have long been used for a wide variety of applications (see, e.g., Boyd (2003); Various (1993); Danielius et al. (1993)). To model the cross-coupling between the orthogonally polarised fields, it is necessary to solve for both field polarizations; and to allow for the birefringence we need two pairs of (non-reference) linear responses, i.e. $\kappa_{\epsilon x},\kappa_{\epsilon y}$ and $\kappa_{\mu x},\kappa_{\mu y}$ .

As an example, I choose a magnetic nonlinearity that couples $H_{y}$ and $H_{x}$ in the same way as the electric nonlinearity couples $E_{x}$ and $E_{y}$ , although other configurations are possible. This means that the $\beta_{r}\textbf{u}\times\textbf{P}$ term in eqn. (1), which represents the non-reference part of the electric response, needs to include those for the standard second-order nonlinear terms (here $P_{x}\propto E_{x}E_{y}$ and $P_{y}\propto E_{x}^{2}$ ). Similarly, the $\alpha_{r}\mu_{0}\textbf{M}$ term has ones for the complementary second-order nonlinear magnetic response. Note that second-order nonlinear magnetic effects have been measured in split ring resonators by Klein et al. Klein-EWL-2006s; Klein-WFL-2007oe.

Since it is convenient, I split the vector form of the $\textbf{G}^{\pm}$ wave equation up into its transverse $x$ and $y$ components. By noting that the definition of $G^{\pm}_{y}$ means that $H_{x}^{+}=-F_{y}^{+}\alpha_{r}/\beta_{r}$ , the 1D wave equations can be written as

	$\displaystyle\partial_{z}{F}_{x}^{+}$	$\displaystyle=+\imath k_{r}\left[1+\frac{\kappa_{\epsilon x}}{2}+\frac{\kappa_{\mu y}}{2}\right]{F}_{x}^{+}$
		$\displaystyle\qquad+2\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[\chi_{\epsilon}-\frac{\mu_{0}}{\epsilon_{0}}\left(\frac{\epsilon_{r}}{\mu_{r}}\right)^{\frac{3}{2}}\chi_{\mu}\right]\mathscr{F}\left[{F}_{y}^{+}(t){F}_{x}^{+}(t)\right]$		(16)

	$\displaystyle\partial_{z}{F}_{y}^{+}$	$\displaystyle=+\imath k_{r}\left[1+\frac{\kappa_{\epsilon y}}{2}+\frac{\kappa_{\mu x}}{2}\right]{F}_{y}^{+}$
		$\displaystyle\qquad~+\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[\chi_{\epsilon}+\frac{\mu_{0}}{\epsilon_{0}}\left(\frac{\epsilon_{r}}{\mu_{r}}\right)^{\frac{3}{2}}\chi_{\mu}\right]\mathscr{F}\left[{F}_{x}^{+}(t)^{2}\right].$		(17)

These wave equations for the field are strikingly similar to the usual SVEA equations used to propagate narrowband pulses; the main differences are the addition of terms for magnetic dispersion ( $\kappa_{\mu x},\kappa_{\mu y}$ ) and nonlinearity ( $\chi_{\mu}$ ), and the lack of a co-moving frame.

We can transform eqns. (16) and (17) into a form close to the usual equations for a parametric amplifier by representing the $x$ and $y$ polarized fields in terms of three envelope and carrier pairs:

$\displaystyle{F}_{x}^{+}(t)$	$\displaystyle=A_{1}(t)\exp\left[\imath\left(\omega_{1}t-k_{1}z\right)\right]+A_{1}^{*}(t)\exp\left[-\imath\left(\omega_{1}t-k_{1}z\right)\right]$
	$\displaystyle\quad+A_{2}(t)\exp\left[\imath\left(\omega_{2}t-k_{2}z\right)\right]+A_{2}^{*}(t)\exp\left[-\imath\left(\omega_{2}t-k_{2}z\right)\right]$	(18)
$\displaystyle{F}^{+}_{y}(t)$	$\displaystyle=A_{3}(t)\exp\left[\imath\left(\omega_{3}t-k_{3}z\right)\right]+A_{3}^{*}(t)\exp\left[-\imath\left(\omega_{3}t-k_{3}z\right)\right],$	(19)

where $\omega_{3}=\omega_{1}+\omega_{2}$ . After separating into pairs of complex-conjugate equations (one each for all $A_{i}$ and $A_{i}^{*}$ ), and ignoring the off-resonant polarization terms, we transform into a frame co-moving with the group velocity, although here we select the group velocity of a preferred frequency component, with $\Delta_{g}=\omega(v_{g}^{-1}-v_{p}^{-1})$ . The wave equations for the ${A}_{i}(\omega)$ are then

$\displaystyle\partial_{z}{A}_{1}$	$\displaystyle=+\imath K_{1}(\omega){A}_{1}~~+\frac{\imath k_{0}^{2}}{2k_{1}}\chi^{-}\mathscr{F}\left[2{A}_{3}(t){A}_{2}^{*}(t)\right]e^{-\imath\Delta kz}$	(20)
$\displaystyle\partial_{z}{A}_{2}$	$\displaystyle=+\imath K_{2}(\omega){A}_{2}~~+\frac{\imath k_{0}^{2}}{2k_{2}}\chi^{-}\mathscr{F}\left[2{A}_{3}(t){A}_{1}^{*}(t)\right]e^{-\imath\Delta kz}$	(21)
$\displaystyle\partial_{z}{A}_{3}$	$\displaystyle=+\imath K_{3}(\omega){A}_{3}~~+\frac{\imath k_{0}^{2}}{2k_{3}}\chi^{+}\mathscr{F}\left[{A}_{1}(t){A}_{2}(t)\right]e^{+\imath\Delta kz}.$	(22)

Here $K_{1,2}(\omega)=k_{1,2}[\kappa_{\epsilon x}(\omega)+\kappa_{\mu y}(\omega)]/2+\Delta_{g}$ and $K_{3}(\omega)=k_{3}[\kappa_{\epsilon y}(\omega)+\kappa_{\mu x}(\omega)]/2+\Delta_{g}$ ; we choose $k_{r}$ for each equation differently (i.e., with $k_{r}\in\{k_{1},k_{2},k_{3}\}$ ); also the phase mismatch term is $\Delta k=k_{3}-k_{2}-k_{1}$ . The combined nonlinear coefficient is $\chi^{\pm}=\chi_{\epsilon}\pm(\mu_{0}/\epsilon_{0})(\epsilon_{r}/\mu_{r})^{3/2}\chi_{\mu}$ .

VI Discussion

Examining the respective roles of the reference permittivity $\epsilon_{r}$ and permeability $\mu_{r}$ in eqns. (13) and (16), (17), we see that as far as dispersion and other linear effects are concerned, the two components simply add. In contrast, their effect on nonlinear terms is more dramatic: with the ratio $Y^{2}=\epsilon_{r}/\mu_{r}$ scaling the nonlinear corrections to the magnetization into the electric field units of $\textbf{F}^{\pm}$ . This is because $Y^{2}$ determines how much of a given directional field $\textbf{F}^{\pm}$ is electric field and how much magnetic field; large values of $Y^{2}$ correspond to cases where the magnetic field is most prominent. Indeed, $Y$ is just the reciprocal of the electromagnetic impedance of our chosen reference medium, and only if $Y$ is real-valued do propagating fields exist, since otherwise the fields become evanescent.

Figures 1 and 2 show how $Y$ varies with frequency for two different metamaterial types, with the dispersions encoded on $\epsilon_{r}$ and $\mu_{r}$ and scaled by $\omega^{2}$ to moderate the low-frequency singularity of the Drude response. The extreme limits of large $Y$ occur when $|\mu_{r}|\ll|\epsilon_{r}|$ , that is, usually just at an edge of a non-propagating band, where $\mu_{r}$ is about to change sign. In such a region, it would be better to revert to the $\textbf{G}^{\pm}$ fields, or to rescale the propagation equations into units of magnetic field (e.g., with some $\textbf{K}^{\pm}=\textbf{G}^{\pm}/2\beta_{r}$ ).

In previous work Scalora et al. (2005); Wen et al. (2007); D’Aguanno et al. (2008), a Drude type response for both $\epsilon$ and $\mu$ was assumed, where $\epsilon_{r},\mu_{r}\propto 1-\omega_{\epsilon,\mu}^{2}/(\omega^{2}-\imath\gamma_{\epsilon,\mu}\omega)$ ; and this situation is shown on Fig. 1. However, although the dielectric response in metamaterials (e.g., a wire grid array Pendry-HSY-1996prl; Pendry-HRS-1998jpcm) often has this behaviour, the magnetic response of split ring resonators (SRRs) differs. SRR magnetization is best described by a pseudo-Lorentz model³³3Also known as the “F-model” Pendry-HRS-1999ieeemtt with $\mu_{r}\propto 1+F_{\mu}\omega^{2}/(\omega^{2}-\omega_{\mu}^{2}-\imath\gamma_{\mu}\omega)$ , although sometimes a true Lorentz response is used instead, $\mu_{r}\propto 1+F_{\mu}\omega_{\mu}^{2}/(\omega^{2}-\omega_{\mu}^{2}-\imath\gamma_{\mu}\omega)$ . Note the difference between the numerators in these latter two expressions: a frequency-dependent $\omega^{2}$ versus a constant material parameter $\omega_{\mu}^{2}$ . The pseudo-Lorentz model has an incorrect high-frequency behaviour, and so it is incompatible with the Kramers-Kronig relations that enforce causality. However, at low and medium frequency it is a better match to the physical response of SRRs, and so I use it for Figs. 2 and 3.

Refer to caption — Figure 1: Normalised electromagnetic field ratio and $\epsilon,\mu$ curves for Drude responses in both permittivity $\epsilon_{r}$ and permeability $\mu_{r}$ . The magnetic resonance at $\omega_{\mu}=\omega_{\epsilon}/\sqrt{2}$ is lower than the (dielectric) plasma frequency $\omega_{\epsilon}$ , and $\gamma_{\epsilon}=\gamma_{\mu}=0.01\omega_{\epsilon}$ . Large $|Y|$ corresponds to mostly magnetic $\textbf{G}^{\pm}$ fields, small $|Y|$ corresponds to mostly electric $\textbf{G}^{\pm}$ . The frequency ranges of propagating negative phase velocity (NPV) and positive phase velocity (PPV) light are shown.

There are frequency ranges over which the linear material responses vary dramatically, and in particular on Fig. 3 (which uses the same model as Fig. 2) this is evident for both the refractive index $n$ and group velocity $v_{g}$ . If we aim to operate in such regions, this leads to two potential complications. First, if we have chosen reference parameters that do not match the linear material responses exactly, then the correction terms will become large, meaning that our wavelength-scale “slow evolution” approximation may come under threat. Second, even if our reference parameters do match the linear material responses exactly, our wavelength-scale will have become frequency dependent, and so again our “slow evolution” approximation may be threatened. In either case the solution is simple – we just need to revert to the bi-directional wave equations [i.e., eqns. (9) or (10)]. However, this does not necessarily mean that any backward evolving fields are generated (as explained in Kinsler et al. (2005), and following a different approach in Kinsler (2008)), so that in principle one could optimize the propagation by reinstating it only over those frequency ranges where it becomes necessary.

VII Conclusions

I have derived a uni-directional optical pulse propagation equation for media with both electric and magnetic responses, based on the directional fields approach Kinsler et al. (2005). This involved a re-expression of Maxwell’s equations, and required only a single approximation to reduce a one dimensional bi-directional model, to a uni-directional first-order wave equation. The simplicity of this approach makes it very convenient in waveguides, optical fibres, or other collinear situations. The important approximation is that the pulse evolves only slowly on the scale of a wavelength; and indeed this is a valid assumption in a wide variety of cases – note in particular that nonlinear effects have to be unrealizably strong to violate it Kinsler (2007b). The result has no intrinsic bandwidth restrictions, makes no demands on the pulse profile, and does not require a co-moving frame – unlike other common types of derivation Agrawal (2007); Brabec and Krausz (1997); Geissler et al. (1999); Kinsler and New (2003).

The resulting equations have the advantage that they are straightforward to write down, despite containing the complications of both electric and magnetic responses, and that a carrier-envelope representation or co-moving frames are easy to apply if desired, requiring no further approximation. In this, they match the clarity and flexibility of factorized second-order wave equations Ferrando et al. (2005); Genty et al. (2007); Kinsler (2008), but they can more easily incorporate the effects of magnetic material responses – albiet at the cost of being restricted to one dimensional propagation.

Acknowledgements.

I acknowledge financial support from the Engineering and Physical Sciences Research Council (EP/E031463/1).

References

Pendry (2000) J. B. Pendry,
Phys. Rev. Lett. 85, 3966 (2000),
doi:10.1103/PhysRevLett.85.3966.
Tsakmakidis et al. (2007) K. L. Tsakmakidis, A. D. Boardman, and O. Hess,
Nature 450, 397 (2007),
doi:10.1038/nature06285.
Scalora et al. (2005) M. Scalora, M. S. Syrchin, N. Akozbek, E. Y. Poliakov, G. D’Aguanno, N. Mattiucci, M. J. Bloemer, and A. M. Zheltikov,
Phys. Rev. Lett. 95, 013902 (2005),
doi:10.1103/PhysRevLett.95.013902.
Wen et al. (2007) S. Wen, Y. Xiang, X. Dai, Z. Tang, W. Su, and D. Fan,
Phys. Rev. A 75, 033815 (2007),
doi:10.1103/PhysRevA.75.033815.
D’Aguanno et al. (2008) G. D’Aguanno, N. Mattiucci, and M. J. Bloemer,
J. Opt. Soc. Am. B 25, 1236 (2008),
doi:10.1364/JOSAB.25.001236.
Kinsler and New (2003) P. Kinsler and G. H. C. New,
Phys. Rev. A 67, 023813 (2003),
doi:10.1103/PhysRevA.67.023813,
arXiv:physics/0212016.
Brabec and Krausz (1997) T. Brabec and F. Krausz,
Phys. Rev. Lett. 78, 3282 (1997),
doi:10.1103/PhysRevLett.78.3282.
Geissler et al. (1999) M. Geissler, G. Tempea, A. Scrinzi, M. Schnürer, F. Krausz, and T. Brabec,
Phys. Rev. Lett. 83, 2930 (1999),
doi:10.1103/PhysRevLett.83.2930.
Agrawal (2007) G. P. Agrawal, Nonlinear Fiber Optics
(Academic Press, Boston, 2007), 4th ed., ISBN 978-0-12-369516-1.
Kinsler (2002) P. Kinsler (2002), arXiv:physics/0212014.
Kinsler et al. (2005) P. Kinsler, S. B. P. Radnor, and G. H. C. New,
Phys. Rev. A 72, 063807 (2005),
doi:10.1103/PhysRevA.72.063807, arXiv:physics/0611215.
Kinsler (2007a) P. Kinsler (2007a),
“Pulse propagation methods in nonlinear optics”,
arXiv:0707.0982.
Kinsler et al. (2009) P. Kinsler, A. Favaro, and M. W. McCall,
Eur. J. Phys. 30, 983 (2009),
doi:10.1088/0143-0807/30/5/007, arXiv:0908.1721.
Kinsler (2007b) P. Kinsler,
J. Opt. Soc. Am. B 24, 2363 (2007b),
doi:10.1364/JOSAB.24.002363, arXiv:0707.0986.
Kinsler (2008) P. Kinsler,
Phys. Rev. A 81, 013819 (2008),
doi:10.1103/PhysRevA.81.013819, arXiv:0810.5689.
Fleck (1970) J. A. Fleck,
Phys. Rev. B 1, 84 (1970),
doi:10.1103/PhysRevB.1.84.
Scalora and Crenshaw (1994) M. Scalora and M. E. Crenshaw,
Opt. Commun. 108, 191 (1994),
doi:10.1016/0030-4018(94)90647-5.
Kinsler and New (2004) P. Kinsler and G. H. C. New,
Phys. Rev. A 69, 013805 (2004),
doi:10.1103/PhysRevA.69.013805,
arXiv:physics/0411120.
Kinsler and New (2005) P. Kinsler and G. H. C. New,
Phys. Rev. A 72, 033804 (2005),
doi:10.1103/PhysRevA.72.033804,
arXiv:physics/0606111.
Kinsler et al. (2006) P. Kinsler, G. H. C. New, and J. C. A. Tyrrell
(2006), arXiv:physics/0611213.
Boyd (2003) R. W. Boyd, Nonlinear Optics
(Academic Press, New York, 2003), 2nd ed., 1st ed. 1994.
Gabor (1946) D. Gabor,
J. Inst. Electr. Eng. (London) 93, 429 (1946).
Laegsgaard (2007) J. Laegsgaard,
Opt. Express 15, 16110 (2007),
doi:10.1364/OE.15.016110.
Boardman et al. (2000) A. D. Boardman, K. Marinov, D. I. Pushkarov, and A. Shivarova,
Optical and Quantum Electronics 32, 49 (2000),
doi:10.1023/A:1007045304098.
Alfano and Shapiro (1970a) R. R. Alfano and S. L. Shapiro,
Phys. Rev. Lett. 24, 584 (1970a), doi:10.1103/PhysRevLett.24.584.
Alfano and Shapiro (1970b) R. R. Alfano and S. L. Shapiro,
Phys. Rev. Lett. 24, 592 (1970b), doi:10.1103/PhysRevLett.24.592.
Dudley et al. (2006) J. M. Dudley, G. Genty, and S. Coen,
Rev. Mod. Phys. 78, 1135 (2006),
doi:10.1103/RevModPhys.78.1135.
Solli et al. (2007) D. R. Solli, C. Ropers, P. Koonath, and B. Jalali,
Nature 450, 1054 (2007),
doi:10.1038/nature06402.
Braun et al. (1995) A. Braun, G. Korn, X. Liu, D. Du, J. Squier, and G. Mourou,
Opt. Lett. 20, 7375 (1995),
doi:10.1364/OL.20.000073.
Berge and Skupin (2009) L. Berge and S. Skupin,
Discrete and Continuous Dynamical Systems (DCDS-A) 23, 1099 (2009),
doi:10.3934/dcds.2009.23.1099.
Various (1993) Various, J. Opt. Soc. Am. B 10, 1656–1791 (1993),
”Special Issue on optical parametric oscillation and amplification”,
URL http://www.opticsinfobase.org/josab/issue.cfm?volume=10&issue%=9.
Danielius et al. (1993) R. Danielius, A. Piskarskas, A. Stabinis, G. P. Banfi, P. D. Trapani, and R. Righini,
J. Opt. Soc. Am. B 10, 2222 (1993),
doi:10.1364/JOSAB.10.002222.
Genty et al. (2007) G. Genty, P. Kinsler, B. Kibler, and J. M. Dudley,
Opt. Express 15, 5382 (2007),
doi:10.1364/OE.15.005382.
Ferrando et al. (2005) A. Ferrando, M. Zacares, P. F. de Cordoba, D. Binosi, and A. Montero,
Phys. Rev. E 71, 016601 (2005),
doi:10.1103/PhysRevE.71.016601.
Kinsler (2006) P. Kinsler,
(2006), arXiv:physics/0611216.

Appendix A Derivation of eqn. (1)

The derivation in this paper is simpler, more general, and defines $\textbf{G}^{\pm}$ using a better sign convention than those in Kinsler et al. (2005); Kinsler (2006). I start with the Maxwell curl equations, and transform into frequency space:

$\displaystyle\nabla\times\textbf{H}(t)$	$\displaystyle=+\partial_{t}\epsilon_{r}\star\textbf{E}(t)+\textbf{J}(t),$
$\displaystyle\nabla\times\textbf{E}(t)$	$\displaystyle=-\partial_{t}\mu_{r}\star\textbf{H}(t)-\mu_{0}\textbf{K}(t)$	(23)
$\displaystyle\nabla\times\textbf{H}(\omega)$	$\displaystyle=-\imath\omega~\alpha_{r}(\omega)^{2}\textbf{E}(\omega)+\textbf{J}(\omega),$
$\displaystyle\nabla\times\textbf{E}(\omega)$	$\displaystyle=+\imath\omega~\beta_{r}(\omega)^{2}\textbf{H}(\omega)-\mu_{0}\textbf{K}(\omega).$	(24)

I now rotate the $\nabla\times\textbf{H}$ equation by taking the cross product with u,

\displaystyle~\textbf{u}\times\left(\nabla\times\textbf{H}\right)

\displaystyle=-\imath\omega~\alpha_{r}^{2}\left(\textbf{u}~\times\textbf{E}\right)+\textbf{u}\times\textbf{J},

(25)

scale each part by $\beta_{r}$ and $\alpha_{r}$ respectively, while insisting that these parameters do not depend on position. Thus,

	$\displaystyle\textbf{u}\times\left(\nabla\times\beta_{r}\textbf{H}\right)$	$\displaystyle=-\imath\omega~\beta_{r}\alpha_{r}^{2}\left(\textbf{u}~\times\textbf{E}\right)+\textbf{u}\times\beta_{r}\textbf{J},$
	$\displaystyle\nabla\times\alpha_{r}\textbf{E}$	$\displaystyle=+\imath\omega~\alpha_{r}\beta_{r}^{2}\textbf{H}-\alpha_{r}\mu_{0}\textbf{K},$		(26)

and then take the sum and difference –

$\displaystyle\nabla\times\alpha_{r}\textbf{E}$	$\displaystyle\pm~\textbf{u}\times\left(\nabla\times\beta_{r}\textbf{H}\right)$
	$\displaystyle=+\imath\omega~\alpha_{r}\beta_{r}^{2}\textbf{H}~\mp~\imath\omega~\beta_{r}\alpha_{r}^{2}\left(\textbf{u}~\times\textbf{E}\right)~~$
	$\displaystyle\qquad\qquad\qquad\pm~~\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}.$	(27)

The vector $\textbf{G}^{\pm}$ fields are defined in eqn. (2), but that neglects the longitudinal part of H. Thus, for completeness, we also need to define ${G}^{\circ}=\textbf{u}\cdot\beta_{r}\textbf{H}$ (as in Kinsler et al. (2005); Kinsler (2006)). Their form means that I need to convert both the second term on the LHS of the sum-and-difference equation above, as well as the RHS. It is most important for the LHS to be simple, because this will define the type of propagation specified by the RHS. In the following I use the vector identity.

\displaystyle\textbf{u}\times\left(\nabla\times\textbf{H}\right)-\nabla\left(\textbf{u}\cdot\textbf{H}\right)

\displaystyle=

\displaystyle\nabla\times\left(\textbf{u}\times\textbf{H}\right),

(28)

along with $\textbf{u}\times[\textbf{u}\times\textbf{H}]=[\textbf{u}\cdot\textbf{H}]\textbf{u}-\textbf{H}$ , so that

$\displaystyle\textbf{u}\times\textbf{G}^{\mp}$	$\displaystyle=\textbf{u}\times\alpha_{r}\textbf{E}\pm\textbf{u}\times\left[\textbf{u}\times\beta_{r}\textbf{H}\right]$	(29)
	$\displaystyle=\textbf{u}\times\alpha_{r}\textbf{E}\mp\beta_{r}\textbf{H}~~\pm\left[\textbf{u}\cdot\beta_{r}\textbf{H}\right]\textbf{u}$	(30)
	$\displaystyle=\textbf{u}\times\alpha_{r}\textbf{E}\mp\beta_{r}\textbf{H}~~\pm\textbf{u}{G}^{\circ}.$	(31)

Continuing the derivation,

$\displaystyle\nabla\times\alpha_{r}\textbf{E}~~\pm~~\textbf{u}\times\left(\nabla\times\beta_{r}\textbf{H}\right)$	$\displaystyle=+\imath\omega\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\imath\omega\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K},$	(32)
$\displaystyle\nabla\times\alpha_{r}\textbf{E}~~\pm~~\nabla\times\left(\textbf{u}\times\beta_{r}\textbf{H}\right)\pm\nabla\left(\textbf{u}\cdot\beta_{r}\textbf{H}\right)$	$\displaystyle=+\imath\omega\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\imath\omega\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}$	(33)
$\displaystyle\nabla\times\left[\alpha_{r}\textbf{E}~~\pm~~\left(\textbf{u}\times\beta_{r}\textbf{H}\right)\right]$	$\displaystyle=+\imath\omega\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\imath\omega\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\mp\nabla\left(\textbf{u}\cdot\beta_{r}\textbf{H}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}~~~~~~~~$	(34)
$\displaystyle\nabla\times\textbf{G}^{\mp}$	$\displaystyle=\imath\omega\left\{\alpha_{r}\beta_{r}^{2}\textbf{H}~~\mp~~\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\right\}\mp\nabla\left(\textbf{u}\cdot\beta_{r}\textbf{H}\right)~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}$	(35)

I now rearrange eqn. (35) to give the final form, in which I substitute $\partial_{t}\textbf{P}=\textbf{J}(t)$ and $\partial_{t}\textbf{M}=\textbf{K}(t)$ to match eqn. (1). Thus

	$\displaystyle\nabla\times\textbf{G}^{\mp}$	$\displaystyle=\imath\omega\left\{\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~-\alpha_{r}\beta_{r}^{2}\left(\textbf{u}\times\left[\textbf{u}\times\textbf{H}\right]\right)~~\mp\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\right\}\mp\nabla{G}^{\circ}~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}$		(36)
		$\displaystyle=\mp\imath\omega\left\{\beta_{r}\alpha_{r}^{2}\left(\textbf{u}\times\textbf{E}\right)\pm\alpha_{r}\beta_{r}^{2}\left(\textbf{u}\times\left[\textbf{u}\times\textbf{H}\right]\right)\right\}~~+\imath\omega\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~\mp\nabla{G}^{\circ}~~\pm\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K},$		(37)

and finally

	$\displaystyle\nabla\times\textbf{G}^{\pm}$	$\displaystyle=\pm\imath\omega~\alpha_{r}\beta_{r}\textbf{u}\times\textbf{G}^{\pm}~~+\imath\omega\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~\pm\nabla{G}^{\circ}~~\mp\textbf{u}\times\beta_{r}\textbf{J}-\alpha_{r}\mu_{0}\textbf{K}$		(38)
		$\displaystyle=\pm\imath\omega~\alpha_{r}\beta_{r}\textbf{u}\times\textbf{G}^{\pm}~~+\imath\omega\alpha_{r}\beta_{r}\textbf{u}{G}^{\circ}~~\pm\nabla{G}^{\circ}~~\pm\imath\omega\beta_{r}\textbf{u}\times\textbf{P}+\imath\omega\alpha_{r}\mu_{0}\textbf{M}.$		(39)

Note that for $\textbf{J}(t)=\partial_{t}\textbf{P}=\partial_{t}\kappa_{\epsilon}\textbf{E}(t)$ , where $\kappa_{\epsilon}$ is some complicated but scalar dielectric response function, we have

	$\displaystyle\mp\textbf{u}\times\beta_{r}\textbf{J}(\omega)$	$\displaystyle=\pm\imath\omega\beta_{r}\textbf{u}\times\textbf{P}\quad=\pm\imath\omega\alpha_{r}^{2}\beta_{r}\kappa_{\epsilon}\star\textbf{u}\times\textbf{E}$		(40)
		$\displaystyle=\pm\imath\omega\frac{\alpha_{r}^{2}\beta_{r}}{2}\kappa_{\epsilon}\star\left(\textbf{u}\times\left[\textbf{G}^{+}+\textbf{G}^{-}\right]\right),$		(41)

and for $\textbf{K}(t)=\partial_{t}\textbf{M}=\partial_{t}\kappa_{\mu}\textbf{H}(t)$ , where $\kappa_{\mu}$ is some complicated but scalar magnetic response function, we have

$\displaystyle-\alpha_{r}\mu_{0}\textbf{K}(\omega)$	$\displaystyle=+\imath\omega\mu_{0}\alpha_{r}\textbf{M}\quad=+\imath\omega\mu_{0}\alpha_{r}\kappa_{\mu}\star\textbf{H}$	(42)
	$\displaystyle=+\imath\omega\mu_{0}\alpha_{r}\kappa_{\mu}\star\left(\textbf{u}\left[\textbf{u}\cdot\textbf{H}\right]-\textbf{u}\times\textbf{u}\times\textbf{H}\right)$	(43)
	$\displaystyle=+\imath\omega\mu_{0}\frac{\alpha_{r}}{\beta_{r}}\kappa_{\mu}\star\left(\textbf{u}{G}^{\circ}-\textbf{u}\times\left[\textbf{u}\times\beta_{r}\textbf{H}\right]\right)$	(44)
	$\displaystyle=-\imath\omega\mu_{0}\frac{\alpha_{r}}{2\beta_{r}}\kappa_{\mu}\star\left(\textbf{u}\times\left[\textbf{G}^{+}-\textbf{G}^{-}\right]-2\textbf{u}{G}^{\circ}\right).$	(45)

Finally, when generating eqn. (25), we lost the longitudinal part of $\nabla\times\textbf{H}=\partial\epsilon_{r}\textbf{E}+\textbf{J}$ (i.e. that parallel to u). This is

$\displaystyle\textbf{u}\cdot\nabla\times\textbf{H}$	$\displaystyle=-\imath\omega\alpha_{r}^{2}\textbf{u}\cdot\textbf{E}+\textbf{u}\cdot\textbf{J}$	(46)
$\displaystyle\textbf{u}\cdot\nabla\times\left(\textbf{G}^{+}-\textbf{G}^{-}+2\textbf{u}G^{\circ}\right)$	$\displaystyle=-\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\left(\textbf{G}^{+}-\textbf{G}^{-}\right)$
	$\displaystyle\qquad+2\beta_{r}\textbf{u}\cdot\textbf{J}$	(47)
$\displaystyle 2\textbf{u}\cdot\left(\nabla{G}^{\circ}-\textbf{u}\times\nabla{G}^{\circ}\right)$	$\displaystyle=-\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\left(\textbf{G}^{+}-\textbf{G}^{-}\right)$
	$\displaystyle\qquad+2\beta_{r}\textbf{u}\cdot\textbf{J}$	(48)

\displaystyle 2\textbf{u}\cdot\nabla{G}^{\circ}

\displaystyle=-\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\left(\textbf{G}^{+}-\textbf{G}^{-}\right)+2\beta_{r}\textbf{u}\cdot\textbf{J},

(49)

since $\nabla\times{G}^{\circ}\textbf{u}={G}^{\circ}\nabla\times\textbf{u}-\textbf{u}\times\nabla{G}^{\circ}=-\textbf{u}\times\nabla{G}^{\circ}$ , and

$\displaystyle\textbf{u}\cdot\nabla\times\left(\textbf{G}^{+}-\textbf{G}^{-}\right)$	$\displaystyle=\imath\omega\alpha_{r}\beta_{r}\textbf{u}\cdot\textbf{u}\times\left(\textbf{G}^{+}+\textbf{G}^{-}\right)$
	$\displaystyle\qquad+2\textbf{u}\cdot\nabla{G}^{\circ}+2\imath\omega\beta_{r}\textbf{u}\cdot\textbf{u}\times\textbf{P}$	(50)
	$\displaystyle=2\textbf{u}\cdot\nabla{G}^{\circ}.$	(51)

Appendix B Correction terms

In this appendix I work through the details of how the polarization and magnetization terms scale with respect to one another. To simplify matters, I assume all corrections are scalar since when $\epsilon_{r}$ and $\mu_{r}$ are not field-polarization or orientation sensitive, the scalings remain the same, even if the specific field terms may vary (e.g. ${E}_{x}{E}_{y}$ instead of ${E}_{x}^{2}$ ).

Consider the general unidirectional equation for ${F}_{x}^{\pm}$ (i.e. eqn. (10)), and replace the polarization and magnetization terms with dimensionless response parameters $q_{\epsilon}$ and $q_{\mu}$ multiplied by the appropriate field ${E}_{x}$ or ${H}_{y}$ . Then replace ${E}_{x}$ and ${H}_{y}$ with their representation in terms of ${F}_{x}^{+}$ , so that

$\displaystyle\frac{\imath\omega}{2}\frac{\beta_{r}}{\alpha_{r}}{P}_{x}+\frac{\imath\omega}{2}\mu_{0}{M}_{y}$	$\displaystyle=\frac{\imath\omega}{2}\frac{\beta_{r}}{\alpha_{r}}q_{\epsilon}\epsilon_{0}{E}_{x}~~+\frac{\imath\omega}{2}q_{\mu}\mu_{0}{H}_{y}$	(52)
	$\displaystyle=\frac{\imath\omega}{2}\left[\frac{\beta_{r}}{\alpha_{r}}q_{\epsilon}\epsilon_{0}{F}_{x}^{+}~~+q_{\mu}\mu_{0}\frac{\alpha_{r}}{\beta_{r}}{F}_{x}^{+}\right]$	(53)
	$\displaystyle=\frac{\imath\omega\alpha_{r}\beta_{r}}{2}\left[\frac{q_{\epsilon}\epsilon_{0}}{\alpha_{r}^{2}}{F}_{x}^{+}~~+\frac{q_{\mu}\mu_{0}}{\beta_{r}^{2}}{F}_{x}^{+}\right]$	(54)
	$\displaystyle=\frac{\imath k_{r}}{2}\frac{\epsilon_{0}}{\epsilon_{r}}\left[q_{\epsilon}~~+q_{\mu}\frac{\mu_{0}}{\epsilon_{0}}\frac{\epsilon_{r}}{\mu_{r}}\right]{F}_{x}^{+},$	(55)

remembering that $\textbf{F}^{+}=\textbf{E}=(\beta_{r}/\alpha_{r})\textbf{H}$ , and that $\epsilon_{r}=\alpha_{r}^{2}$ , and $\mu_{r}=\beta_{r}^{2}$ .

Since we consider the electric-field-like field ${F}^{+}$ , the polarization corrections are trivial to write down; as for an $m$ -th order nonlinear term, $q_{\epsilon}=\chi_{\epsilon}F^{+(m-1)}$ . This means we need only concentrate on the magnetization correction. If $q_{\mu}$ is that for an $m$ -th order nonlinear term, then $q_{\mu}=\chi_{\mu}H^{n-1}=\chi_{\mu}(\alpha_{r}/\beta_{r})^{(m-1)}F^{+(m-1)}$ . Writing down only the term in square brackets from eqn. (55) gives us

	$\displaystyle\left[q_{\epsilon}~+\chi_{\mu}\frac{\mu_{0}}{\epsilon_{0}}\frac{\epsilon_{r}}{\mu_{r}}\left(\frac{\alpha_{r}}{\beta_{r}}\right)^{m-1}{F}_{x}^{+(m-1)}\right]{F}_{x}^{+}$		(56)
	$\displaystyle\qquad\qquad=\left[q_{\epsilon}~+\chi_{\mu}\frac{\mu_{0}}{\epsilon_{0}}\left(\frac{\epsilon_{r}}{\mu_{r}}\right)^{\frac{m+1}{2}}{F}_{x}^{+(m-1)}\right]{F}_{x}^{+}.$		(57)

Note that corrections for linear loss or gain are first-order processes (i.e. with $m=1$ ), where for loss we need $q\sim\imath\gamma$ , with $\gamma>0$ ; Thus for loss the whole correction term will be proportional to $-\gamma{F}_{x}^{+}$ , as would be expected.