
Numerical solution of the linear semiclassical Schrödinger equation on the real line

Arieh Iserles
Department of Applied Mathematics and Theoretical Physics
Centre for Mathematical Sciences
University of Cambridge
Wilberforce Rd, Cambridge CB4 1LE
United Kingdom
   Karolina Kropielnicka
Institute of Mathematics
Polish Academy of Sciences
Antoniego Abrahama 18, 81-825 Sopot
Poland
   Katharina Schratz
Laboratoire Jacques-Louis Lions
Sorbonne Université
4 place Jussieu, 75252 Paris
France
   Marcus Webb
Department of Mathematics
University of Manchester
Alan Turing Building
Manchester M13 9PL
United Kingdom
Abstract

The numerical solution of a linear Schrödinger equation in the semiclassical regime is very well understood on a torus $\mathbb{T}^{d}$. A raft of modern computational methods are precise and affordable, while conserving energy and resolving high oscillations very well. This, however, is far from the case for its solution in $\mathbb{R}^{d}$, a setting more suitable for many applications. In this paper we extend the theory of splitting methods to this end. The main idea is to derive the solution using a spectral method from a combination of solutions of the free Schrödinger equation and of linear scalar ordinary differential equations, in a symmetric Zassenhaus splitting method. This necessitates a detailed analysis of certain orthonormal spectral bases on the real line and of their evolution under the free Schrödinger operator.

1 Introduction

1.1 Why the real line?

This paper is concerned with the numerical solution of the linear Schrödinger equation in the semiclassical regime, describing the motion of an electron in a quantum system,

\mathrm{i}\varepsilon\frac{\partial u}{\partial t}=-\varepsilon^{2}\Delta u+V(\boldsymbol{x})u,\qquad t\geq 0,\quad \boldsymbol{x}\in\mathbb{R}^{d}, \qquad(1.1)

where the initial condition $u(\boldsymbol{x},0)=u_{0}(\boldsymbol{x})\in\mathrm{L}_{2}(\mathbb{R}^{d})$ is given. The semiclassical parameter $\varepsilon>0$ is a small number, the square root of the ratio between the mass of an electron and the total mass of the system, and $V:\mathbb{R}^{d}\rightarrow\mathbb{R}$ is the interaction potential, which is assumed to be smooth for the purposes of this paper. Since $|u(\boldsymbol{x},t)|^{2}$ gives the probability density of the electron residing at $\boldsymbol{x}$ at time $t$, the system is required to satisfy

\int_{\mathbb{R}^{d}}|u(\boldsymbol{x},t)|^{2}\,\mathrm{d}\boldsymbol{x}\equiv 1, \qquad(1.2)

and any physically relevant numerical solution must be consistent with this conservation law. For further reading, (?) is an excellent, up-to-date review of both the equation (1.1) and its numerical solution.

Respecting the unitarity property (1.2) underlies the importance of geometric numerical integration methodologies in this context and has been central to the modern treatment of the linear Schrödinger equation in the semiclassical, $0<\varepsilon\ll 1$, and the atomistic, $\varepsilon=1$, regimes alike (?????). However, all these publications focus on a subtly different problem: instead of being defined in $\mathbb{R}^{d}$, the equation (1.1) is set on a torus, typically $\mathbb{T}^{d}$, with periodic boundary conditions. This is of crucial importance to splitting techniques, a common denominator of all these methodologies, because the free Schrödinger equation

\mathrm{i}\frac{\partial u}{\partial t}=-\varepsilon\Delta u, \qquad(1.3)

given with periodic boundary conditions, can be approximated very rapidly, affordably and precisely by means of the Fast Fourier Transform (FFT).
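To illustrate, on a uniform periodic grid each Fourier mode $\mathrm{e}^{\mathrm{i}kx}$ of (1.3) evolves independently by the phase $\mathrm{e}^{-\mathrm{i}\varepsilon k^{2}t}$, so one exact step costs two FFTs and a diagonal scaling. A minimal Python sketch (grid size, $\varepsilon$ and step size are our own illustrative choices, not taken from the paper):

```python
import numpy as np

def free_schroedinger_step(u, dt, eps, L=2*np.pi):
    """One exact step of i u_t = -eps u_xx with periodic boundary
    conditions, by diagonalising in Fourier space."""
    N = len(u)
    k = 2*np.pi*np.fft.fftfreq(N, d=L/N)   # integer wavenumbers when L = 2*pi
    u_hat = np.fft.fft(u)
    u_hat *= np.exp(-1j*eps*dt*k**2)       # exact phase for each mode
    return np.fft.ifft(u_hat)

# a single plane wave e^{3ix} merely acquires the phase e^{-9i*eps*dt}
N, eps, dt = 128, 0.01, 0.1
x = 2*np.pi*np.arange(N)/N
u0 = np.exp(3j*x)
u1 = free_schroedinger_step(u0, dt, eps)
```

The evolution is unitary, so the magnitude of each mode is conserved exactly; this exactness is precisely what is lost once the problem leaves the torus.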

Our contention is that the periodic setting imposes unwelcome limitations on the solution, which might lead to altogether false outcomes, and this becomes problematic once a solution over a long time interval is sought (e.g. in quantum control). The underlying reason is the tension between the nature of the differential equation and that of the initial condition, both dictated by quantum-mechanical considerations. The differential equation itself is dispersive: different waves travel at different speeds, depending on their wavelengths, which can span a very wide range, all the way from $\mathcal{O}(1)$ to $\mathcal{O}(\varepsilon^{-1})$. The initial condition is typically a linear combination of highly localised (and rapidly oscillating) wave packets. Recall that $|u(\boldsymbol{x},t)|^{2}$ represents the probability of a particle residing at $\boldsymbol{x}$ at time $t$: while it is a central tenet of quantum mechanics that a particle cannot be completely localised, typically $|u(\boldsymbol{x},t)|^{2}$ is a linear combination of narrowly concentrated Gaussian-like structures. These structures travel at different speeds and, provided the equation is solved for sufficiently long time, some of them eventually reach the boundary. At this very moment periodicity becomes a foe: the wave packet reaches the boundary and ‘pops out’ at the other end, which is not physical!

An alternative to periodic boundary conditions is to impose zero Dirichlet or zero Neumann boundary conditions. However, the following argument shows that this approach is also problematic. Consider an initial condition $u_{0}\in\mathrm{H}^{1}_{0}(0,1)$ and a potential $V\in\mathrm{H}^{1}(0,1)$. Now consider the following two initial boundary value problems, the first with zero Dirichlet boundary conditions, the second with periodic boundary conditions:

\mathrm{i}\varepsilon\frac{\partial v}{\partial t} = -\varepsilon^{2}\frac{\partial^{2}v}{\partial x^{2}}+V(x)v,\qquad x\in[0,1], \qquad(1.4)
v(0,t) = 0,\qquad v(1,t)=0,\qquad t>0,
v(x,0) = u_{0}(x),\qquad x\in[0,1],

and,

\mathrm{i}\varepsilon\frac{\partial w}{\partial t} = -\varepsilon^{2}\frac{\partial^{2}w}{\partial x^{2}}+V(|x|)w,\qquad x\in[-1,1], \qquad(1.5)
w(-1,t) = w(1,t),\qquad \partial_{x}w(-1,t)=\partial_{x}w(1,t),\qquad t>0,
w(x,0) = \mathrm{sign}(x)\,u_{0}(|x|),\qquad x\in[-1,1].

The relationship between $v(x,t)$ and $w(x,t)$ for $x\in[0,1]$ and $t>0$ is rather simple. Clearly the oddness of $w(x,0)$ is preserved, since the second derivative and multiplication by $V(|x|)$ both preserve oddness. Combining oddness with periodicity implies that $w(0,t)=0=w(1,t)$ for all time. It therefore follows from uniqueness of the solution of (1.4) that $w(x,t)=v(x,t)$ for $x\in[0,1]$ and $t>0$. So let us return to the notion of a wave packet moving towards the boundary, but this time with zero Dirichlet boundary conditions imposed. The equivalence with the odd extension implies that the wave packet will be reflected back with its sign reversed: while this physically happens in the case of an infinite potential barrier, it is not the correct behaviour when the problem is posed in free space! A similar construction can be made for Neumann boundary conditions.
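The oddness argument is easy to confirm numerically: evolving the odd extension under the periodic flow (with $V\equiv 0$ for simplicity) preserves oddness, and hence the zero at $x=0$. In the sketch below the wave packet and all parameters are illustrative choices of ours:

```python
import numpy as np

# Odd extension of an approximate wave packet on [-1, 1]
N, eps, dt = 256, 0.05, 0.3
x = -1 + 2*np.arange(N)/N
bump = np.exp(-50*(np.abs(x) - 0.5)**2)*np.sin(10*np.abs(x)/eps)
w0 = np.sign(x)*bump                       # w(x,0) = sign(x) u0(|x|)

# exact periodic evolution of i eps w_t = -eps^2 w_xx via FFT
k = 2*np.pi*np.fft.fftfreq(N, d=2.0/N)
w = np.fft.ifft(np.exp(-1j*eps*dt*k**2)*np.fft.fft(w0))
# w stays odd, so w(0,t) = 0: a Dirichlet "reflection" in disguise
```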

Figure 1.1: Top: The evolution of (1.4) with $u_{0}(x)$ an approximate wave packet (so that zero boundary conditions are satisfied) and $V(x)=0$. The wave packet moves rightwards towards the right boundary until time $t=10$, after which it moves leftwards, returning unscathed by the encounter. Such “reflections” contradict the behaviour of a wave packet in free space. Bottom: The evolution of the corresponding extension in (1.5). The reflection behaviour of the Dirichlet initial boundary value problem can be explained by the periodic behaviour of this one.

We hope this has convinced the reader: no matter what we do, and no matter how rapidly and accurately we can solve Schrödinger’s equation posed on a bounded set, truncating the domain from $\mathbb{R}^{d}$ to such a set destroys the physics of the problem over a large enough time interval. This is the raison d’être for this paper: solve (1.1) without compromising its setting in $\mathbb{R}^{d}$. Throughout the remainder of the paper we assume that (1.1) is presented in a single space dimension, $d=1$. A generalisation to a modest number of space dimensions can be accomplished with tensor products along the lines of (?), while generalisation to a large number of dimensions would require a raft of additional techniques and is beyond the scope of the current paper.

To achieve this aim, we will extend the framework of symmetric Zassenhaus splittings, which has been developed for (1.1) on the torus $\mathbb{T}$ (??), to (1.1) posed on the whole real line. This is not a straightforward exercise, because we cannot use special properties of the Fourier basis. In Section 2 we derive these Zassenhaus splittings under more general assumptions, allowing for bases other than the Fourier basis to be used. In Section 3, we discuss the solution of the free Schrödinger equation (1.3), focusing on two bases which are orthonormal on the real line: Hermite functions and Malmquist–Takenaka functions. In Section 5 we demonstrate how these pieces can be put together to construct practical numerical solvers on the real line.

2 Splitting techniques

For clarity of exposition we write $\partial_{x}^{2}$ instead of $\Delta$ in (1.1). The simplest splitting methodology is to separate the potential and kinetic parts of (1.1), $\mathrm{i}\varepsilon\partial_{x}^{2}u-\mathrm{i}\varepsilon^{-1}V(x)u$, building upon the fact that separate solutions of

\frac{\partial u}{\partial t}=-\mathrm{i}\varepsilon^{-1}V(x)u\qquad\text{and}\qquad\frac{\partial u}{\partial t}=\mathrm{i}\varepsilon\partial_{x}^{2}u

are (at least in a torus or a parallelepiped) much less expensive to compute than those of the full problem. We abuse notation for the exponential and write

u(x,t)=\mathrm{e}^{-\mathrm{i}t\varepsilon^{-1}V(x)}u(x,0)\qquad\text{and}\qquad u(x,t)=\mathrm{e}^{\mathrm{i}t\varepsilon\partial_{x}^{2}}u(x,0)

for their respective solutions. Splitting methods produce a sequence of functions $u^{0}(x)$, $u^{1}(x)$, $u^{2}(x),\ldots$, intended to satisfy $u^{k}(x)\approx u(x,kh)$, where $h$ is the time-step parameter. These functions of $x$ can be discretised by any approach, for example by a spectral method.

The two simplest splitting methods are the Lie–Trotter formula

u^{k+1}(x)=\mathrm{e}^{\mathrm{i}\varepsilon h\partial_{x}^{2}}\,\mathrm{e}^{-\mathrm{i}h\varepsilon^{-1}V(x)}u^{k}(x), \qquad(2.6)

and

u^{k+1}(x)=\mathrm{e}^{-\mathrm{i}h\varepsilon^{-1}V(x)/2}\,\mathrm{e}^{\mathrm{i}\varepsilon h\partial_{x}^{2}}\,\mathrm{e}^{-\mathrm{i}h\varepsilon^{-1}V(x)/2}u^{k}(x). \qquad(2.7)

Of course, the roles of $\varepsilon^{-1}V(x)$ and $\varepsilon\partial_{x}^{2}$ can be reversed. The latter approach, advocated in (?) in tandem with spectral methods, is the famous Strang splitting (known also as the Strang–Marchuk splitting in the Russian literature).
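A minimal realisation of the Strang splitting (2.7) on a uniform periodic grid reads as follows; the FFT is for illustration only, since the point of this paper is to replace it with real-line bases, and the grid, potential and parameters are our own choices:

```python
import numpy as np

def strang_step(u, x, dt, eps, V):
    """One step of (2.7) for i eps u_t = -eps^2 u_xx + V(x) u on a
    uniform periodic grid x."""
    N = len(u)
    k = 2*np.pi*np.fft.fftfreq(N, d=x[1] - x[0])
    half_pot = np.exp(-0.5j*dt*V(x)/eps)                     # e^{-i h V/(2 eps)}
    u = half_pot*u
    u = np.fft.ifft(np.exp(-1j*eps*dt*k**2)*np.fft.fft(u))   # e^{i h eps d_x^2}
    return half_pot*u

eps, h, N = 0.1, 0.01, 256
x = 2*np.pi*np.arange(N)/N
u = np.exp(1j*np.sin(x)/eps)/np.sqrt(N)    # unit-norm oscillatory state
u_next = strang_step(u, x, h, eps, np.cos)
```

Each factor is a unitary map, so the $\ell_2$ norm, the discrete analogue of (1.2), is preserved exactly regardless of $h$.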

Formally, the Strang splitting is known to produce time-stepping methods bearing an error of $\mathcal{O}(h^{3})$. However, this is misleading, because the error constant is of size $\mathcal{O}(\varepsilon^{-1})$, as we show below using Theorem 3. A more effective measure of error should incorporate the small parameter $\varepsilon$, which may be even smaller in magnitude than the time step $h$. To calculate the effective error of the splitting (2.7), where the error constant does not depend on the small semiclassical parameter $\varepsilon$, let us have a closer look at the symmetric Baker–Campbell–Hausdorff (sBCH) formula (?, Sec. III.4.2),

\mathrm{e}^{\frac{1}{2}\tau A}\mathrm{e}^{\tau B}\mathrm{e}^{\frac{1}{2}\tau A}=\mathrm{e}^{\mathrm{sBCH}(\tau A,\tau B)}, \qquad(2.8)

where $A=-\varepsilon^{-1}V$, $B=\varepsilon\partial_{x}^{2}$ and $\tau=\mathrm{i}h$, with

\mathrm{sBCH}(\tau A,\tau B)=\tau A+\tau B-\tfrac{1}{24}\tau^{3}[[B,A],A]-\tfrac{1}{12}\tau^{3}[[B,A],B]
+\tfrac{7}{5760}\tau^{5}[[[[B,A],A],A],A]+\tfrac{7}{1440}\tau^{5}[[[[B,A],A],A],B]
+\tfrac{1}{180}\tau^{5}[[[[B,A],A],B],B]+\tfrac{1}{720}\tau^{5}[[[[B,A],B],B],B]
+\tfrac{1}{480}\tau^{5}[[[B,A],A],[B,A]]-\tfrac{1}{360}\tau^{5}[[[B,A],B],[B,A]]+\text{h.o.t.}

Given that $A$ and $B$ are unbounded operators and also contain powers of $\varepsilon$, we now proceed to clarify the meaning of “h.o.t.” (higher order terms).
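Since (2.8) becomes an identity of matrix exponentials once the operators are discretised, its local third-order accuracy can be sanity-checked on small random matrices: halving $\tau$ should shrink the defect by a factor of roughly $2^{3}=8$. A quick sketch (matrix size and random seed are arbitrary choices of ours):

```python
import numpy as np
from scipy.linalg import expm

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4))
B = rng.standard_normal((4, 4))

def defect(tau):
    """Frobenius norm of e^{tau A/2} e^{tau B} e^{tau A/2} - e^{tau(A+B)}."""
    S = expm(0.5*tau*A) @ expm(tau*B) @ expm(0.5*tau*A)
    return np.linalg.norm(S - expm(tau*(A + B)))

# leading error term is O(tau^3), so the ratio of defects should be close to 8
ratio = defect(1e-2)/defect(5e-3)
```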

2.1 A new analysis of the sBCH formula for the semiclassical Schrödinger equation

As was shown in (?), Schrödinger equations in the semiclassical regime produce spatial oscillations of frequency $\mathcal{O}(\varepsilon^{-1})$. This places restrictions on the spatial discretisation, whichever basis is used, because the discretisation must be sufficiently fine to resolve these oscillations. If the spatial variable is discretised using the Fourier basis then this necessitates $\mathcal{O}(\varepsilon^{-1})$ basis elements, which in turn leads to the conclusion that, after discretisation, operators of the form $\partial_{x}^{n}$ have a spectral radius which scales like $\mathcal{O}(\varepsilon^{-n})$. As we discuss in Section 3, for other bases it is not necessarily the case that $\mathcal{O}(\varepsilon^{-1})$ basis elements can resolve spatial oscillations of frequency $\mathcal{O}(\varepsilon^{-1})$ (indeed, the Fourier basis is optimal for resolving periodic oscillations). As such, we will not make assumptions about the number of basis elements, but rather make an assumption directly on the spectral radius of the differentiation operator (an assumption which holds for both of the examples discussed in Section 3).

Assumption 1

Throughout this paper we will assume that, after spatial discretisation, the operator $\partial_{x}$ has spectral radius $\mathcal{O}(\varepsilon^{-1})$.

Since the potential $V(x)$ can in principle be an unbounded function on the real line, we must be careful to treat our expansions locally in $x$.

Assumption 2

The potential $V:\mathbb{R}\to\mathbb{R}$ is infinitely differentiable, which we write $V\in\mathrm{C}^{\infty}_{\mathrm{loc}}(\mathbb{R})$. As a result, all its derivatives are locally bounded on $\mathbb{R}$.

We can now make sense of “h.o.t.” in the sBCH formula by bounding the magnitude of each element of the Hall basis for the free Lie algebra generated by $A=-\varepsilon^{-1}V$ and $B=\varepsilon\partial_{x}^{2}$ (i.e. $A$, $B$, $[A,B]$, $[[B,A],A]$, $[[B,A],B]$, \ldots\ (??)).

Theorem 3

Let $A=-\varepsilon^{-1}V$ and $B=\varepsilon\partial_{x}^{2}$, assume they have been discretised following Assumption 1, and stipulate Assumption 2. Then every term $C$ of the Hall basis constructed from the letters $A$ and $B$ either vanishes (i.e. $C\equiv 0$) or is $\mathcal{O}(\varepsilon^{-1})$.

Before we proceed with the proof of the theorem, note that all elements of the Hall basis (?) of commutators constructed from the letters $A$ and $B$ live in the set

\mathfrak{G}=\left\{\sum_{k=0}^{K}y_{k}(x)\partial_{x}^{k}\,:\,K\in\mathbb{Z}_{+},\ y_{0},\ldots,y_{K}\in\mathrm{C}^{\infty}_{\mathrm{loc}}(\mathbb{R})\right\},

as follows by applying the product rule for differentiation. For example,

[B,A]=-[\partial_{x}^{2},V]=-\left(V^{(2)}+2V^{(1)}\partial_{x}+V\partial_{x}^{2}-V\partial_{x}^{2}\right)=-V^{(2)}-2V^{(1)}\partial_{x},
[[B,A],A]=\varepsilon^{-1}[[\partial_{x}^{2},V],V]=2\varepsilon^{-1}(V^{(1)})^{2},
[[B,A],B]=-\varepsilon[[\partial_{x}^{2},V],\partial_{x}^{2}]=\varepsilon\left(V^{(4)}+4V^{(3)}\partial_{x}+4V^{(2)}\partial_{x}^{2}\right),
[[[B,A],A],A]=0,

where $V^{(k)}=\partial_{x}^{k}V$. We define the height of a commutator $C\in\mathfrak{G}$ as the largest index $K$ with non-zero coefficient $y_{K}(x)$:

\mathrm{ht}(C)=\mathrm{ht}\!\left(\sum_{k=0}^{K}y_{k}(x)\partial_{x}^{k}\right)=K,\qquad\text{where }y_{K}(x)\not\equiv 0.

One can observe that $\mathrm{ht}(A)=0$, $\mathrm{ht}(B)=2$, $\mathrm{ht}([B,A])=1$, $\mathrm{ht}([[B,A],A])=0$ and $\mathrm{ht}([[B,A],B])=2$.
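The commutators computed above, and their heights, can be checked symbolically by applying each operator to a test function $f$; the representation via Python lambdas below is our own device, not the paper's:

```python
import sympy as sp

x = sp.symbols('x')
eps = sp.symbols('epsilon', positive=True)
V = sp.Function('V')(x)
f = sp.Function('f')(x)

A = lambda g: -V/eps*g                    # A = -eps^{-1} V (multiplication)
B = lambda g: eps*sp.diff(g, x, 2)        # B = eps d^2/dx^2
comm = lambda P, Q: (lambda g: sp.expand(P(Q(g)) - Q(P(g))))

BA = comm(B, A)
# [B,A] f = (-V'' - 2 V' d/dx) f : height 1, the powers of eps cancel
lhs = BA(f)
rhs = sp.expand(-sp.diff(V, x, 2)*f - 2*sp.diff(V, x)*sp.diff(f, x))

# [[B,A],A] f = 2 eps^{-1} (V')^2 f : height 0, a single eps^{-1} survives
BAA = comm(BA, A)
```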

In the proof we will also refer to the formula elaborated in (?)

\left[\sum_{i=0}^{n}f_{i}(x)\partial^{i}_{x},\,\sum_{j=0}^{m}g_{j}(x)\partial^{j}_{x}\right]=\sum_{i=0}^{n}\sum_{j=0}^{m}\sum_{\ell=0}^{i}\binom{i}{\ell}f_{i}(x)\left(\partial^{i-\ell}_{x}g_{j}(x)\right)\partial^{\ell+j}_{x}
-\sum_{j=0}^{m}\sum_{i=0}^{n}\sum_{\ell=0}^{j}\binom{j}{\ell}g_{j}(x)\left(\partial^{j-\ell}_{x}f_{i}(x)\right)\partial^{\ell+i}_{x}. \qquad(2.9)
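Formula (2.9) translates directly into code once an operator $\sum_k y_k(x)\partial_x^k$ is stored as a map from the order $k$ to the coefficient $y_k$; this representation is our own choice. As a sanity check, the sketch reproduces $[\partial_x^{2},V]=V^{(2)}+2V^{(1)}\partial_x$ from the examples above:

```python
import sympy as sp

x = sp.symbols('x')

def op_commutator(F, G):
    """Commutator of sum_i F[i](x) d^i and sum_j G[j](x) d^j, following
    (2.9); operators are dicts {order: coefficient}."""
    C = {}
    def acc(order, coeff):
        C[order] = sp.expand(C.get(order, 0) + coeff)
    for i, fi in F.items():
        for j, gj in G.items():
            for l in range(i + 1):
                acc(l + j, sp.binomial(i, l)*fi*sp.diff(gj, x, i - l))
            for l in range(j + 1):
                acc(l + i, -sp.binomial(j, l)*gj*sp.diff(fi, x, j - l))
    return {k: v for k, v in C.items() if v != 0}

V = sp.Function('V')(x)
res = op_commutator({2: sp.Integer(1)}, {0: V})   # [d^2, V]
```

Note how the top-order terms cancel automatically, mirroring the height bound (2.10) proved below.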

Proof (of Theorem 3). Let us assume that a certain non-zero commutator $C$ in the Hall basis is built from $N_{A}$ letters $A$ and $N_{B}$ letters $B$. We show by induction on $N_{A}+N_{B}$ that

\mathrm{ht}(C)\leq N_{B}-N_{A}+1. \qquad(2.10)

The cases in which $N_{A}+N_{B}=1$ are obtained explicitly as $\mathrm{ht}(A)=0$ and $\mathrm{ht}(B)=2$, thus (2.10) is satisfied for the generators of the free Lie algebra. Now let us assume that a given non-zero commutator $C$ satisfies (2.10), so it can be written as

C=\sum_{k=0}^{K}y_{k}(x)\partial_{x}^{k},

where $0\leq K\leq N_{B}-N_{A}+1$ and $y_{K}\not\equiv 0$. Then by (2.9),

[A,C]=\varepsilon^{-1}\sum_{k=0}^{K-1}\left(\sum_{j=k+1}^{K}\binom{j}{k}y_{j}(x)V^{(j-k)}(x)\right)\partial_{x}^{k},
[B,C]=\varepsilon\sum_{k=0}^{K}\left(y_{k}''(x)\partial_{x}^{k}+2y_{k}'(x)\partial_{x}^{k+1}\right).

Therefore, ignoring the cases where these commutators vanish identically, we see that (2.10) is satisfied for $[A,C]$ and $[B,C]$ by the inductive hypothesis. This, in fact, completes the induction step for the entire Hall basis, because any commutator in the Hall basis can be written as a linear combination of words of the form

[a_{1},[a_{2},[\ldots,[a_{n-1},a_{n}]\ldots]]],

where $a_{k}\in\{A,B\}$ for all $k$, by the Jacobi identity (this is known as the Dynkin basis).

Next we show that every non-zero commutator $C$ in the Hall basis scales like $\mathcal{O}(\varepsilon^{-1})$. Indeed, when $C$ is made up of $N_{A}$ letters $A=-\varepsilon^{-1}V$ and $N_{B}$ letters $B=\varepsilon\partial_{x}^{2}$, the linearity of commutators implies that $C=\varepsilon^{N_{B}-N_{A}}\bar{C}$, where $\bar{C}$ has the same structure as $C$ but with $\bar{A}=-V$ and $\bar{B}=\partial_{x}^{2}$ in place of $A$ and $B$. Obviously $\mathrm{ht}(C)=\mathrm{ht}(\bar{C})$. Now, by Assumption 1, $\partial_{x}$ scales like $\varepsilon^{-1}$ after discretisation and, by Assumption 2, all variable coefficients $y_{k}$ lie in $\mathrm{C}^{\infty}_{\mathrm{loc}}(\mathbb{R})$ (so all derivatives are locally bounded). Therefore $\bar{C}=\mathcal{O}(\varepsilon^{-\mathrm{ht}(\bar{C})})$. Since for non-zero $C$ we have $\mathrm{ht}(C)\leq N_{B}-N_{A}+1$, we conclude that

C=\varepsilon^{N_{B}-N_{A}}\bar{C}=\mathcal{O}(\varepsilon^{N_{B}-N_{A}-\mathrm{ht}(\bar{C})})=\mathcal{O}(\varepsilon^{N_{B}-N_{A}-(N_{B}-N_{A}+1)})=\mathcal{O}(\varepsilon^{-1}),

which concludes the proof of the theorem. $\Box$

An immediate consequence of Theorem 3 and (2.8) is that

\mathrm{e}^{\frac{1}{2}\tau A}\mathrm{e}^{\tau B}\mathrm{e}^{\frac{1}{2}\tau A}=\mathrm{e}^{\tau(A+B)+\mathcal{O}(h^{3}\varepsilon^{-1})}=\mathrm{e}^{\tau(A+B)}+\mathcal{O}(h^{3}\varepsilon^{-1}).

This means that taking a time step of size $h=\mathcal{O}(\varepsilon)$ in the Strang splitting (2.7) yields a local truncation error of $\mathcal{O}(h^{2})$ or, equivalently, $\mathcal{O}(\varepsilon^{2})$. However, a time step $h=\mathcal{O}(\varepsilon)$ is overly expensive. If instead one took a more reasonable $h=\mathcal{O}(\varepsilon^{1/2})$, the local truncation error would effectively be $\mathcal{O}(h)$ or, equivalently, $\mathcal{O}(\varepsilon^{1/2})$. In summary, unless the time step is unacceptably reduced, the effective error of the Strang splitting is larger than that suggested by an analysis which ignores the smallness of $\varepsilon$.

2.2 Symmetric Zassenhaus splittings

This order reduction for the Strang splitting in the case of Hamiltonians in a semiclassical setting motivates the quest for higher-order splittings. A systematic approach is to calculate higher-order symmetric Zassenhaus splittings, first proposed in (?). Using this methodology we will derive two splittings for the solution operator $\exp(\tau(A+B))$, where $A=-\varepsilon^{-1}V$, $B=\varepsilon\partial_{x}^{2}$ and $\tau=\mathrm{i}h$, of orders $\mathcal{O}(h^{3}\varepsilon^{-1})$ and $\mathcal{O}(h^{5}\varepsilon^{-1})$ respectively, in the family of symmetric Zassenhaus splittings.

  1.

    To derive the first symmetric Zassenhaus splitting, we apply the sBCH formula in the following way.

    \mathrm{e}^{-\frac{1}{2}\tau A}\mathrm{e}^{\tau A+\tau B}\mathrm{e}^{-\frac{1}{2}\tau A}=\mathrm{e}^{\mathrm{sBCH}(-\tau A,\tau A+\tau B)}, \qquad(2.11)

    where

    \mathrm{sBCH}(-\tau A,\tau A+\tau B)=\tau B+\tfrac{1}{24}\tau^{3}[[B,A],A]+\tfrac{1}{12}\tau^{3}[[B,A],B] \qquad(2.12)
    -\tfrac{1}{720}\tau^{5}[[[[B,A],A],B],B]-\tfrac{1}{720}\tau^{5}[[[[B,A],B],B],B]
    -\tfrac{1}{480}\tau^{5}[[[B,A],A],[B,A]]-\tfrac{1}{240}\tau^{5}[[[B,A],B],[B,A]]+\mathcal{O}(h^{7}\varepsilon^{-1}).

    Substituting (2.12) into (2.11) results in the first symmetric Zassenhaus splitting, which coincides with the Strang splitting,

    \mathrm{e}^{\tau(A+B)}=\mathrm{e}^{\frac{1}{2}\tau A}\mathrm{e}^{\mathrm{sBCH}(-\tau A,\tau A+\tau B)}\mathrm{e}^{\frac{1}{2}\tau A} \qquad(2.13)
    =\mathrm{e}^{\frac{1}{2}\tau A}\mathrm{e}^{\tau B}\mathrm{e}^{\frac{1}{2}\tau A}+\mathcal{O}(h^{3}\varepsilon^{-1}).
  2.

    To derive the second symmetric Zassenhaus splitting, we split the inner term of (2.13) by the same approach as above, that is

    \mathrm{e}^{-\frac{1}{2}\tau B}\mathrm{e}^{\mathrm{sBCH}(-\tau A,\tau A+\tau B)}\mathrm{e}^{-\frac{1}{2}\tau B}=\mathrm{e}^{\mathrm{sBCH}(-\tau B,\,\mathrm{sBCH}(-\tau A,\tau A+\tau B))},

    which leads to,

    \mathrm{e}^{\tau A+\tau B}=\mathrm{e}^{\frac{1}{2}\tau A}\mathrm{e}^{\frac{1}{2}\tau B}\mathrm{e}^{\mathrm{sBCH}(-\tau B,\,\mathrm{sBCH}(-\tau A,\tau A+\tau B))}\mathrm{e}^{\frac{1}{2}\tau B}\mathrm{e}^{\frac{1}{2}\tau A}, \qquad(2.14)

    where

    \mathrm{sBCH}(-\tau B,\,\mathrm{sBCH}(-\tau A,\tau B+\tau A))
    =\tfrac{1}{24}\tau^{3}[[B,A],A]+\tfrac{1}{12}\tau^{3}[[B,A],B]
    -\tfrac{19}{2880}\tau^{5}[[[[B,A],A],B],B]-\tfrac{17}{1440}\tau^{5}[[[[B,A],B],B],B]
    -\tfrac{1}{480}\tau^{5}[[[B,A],A],[B,A]]-\tfrac{1}{240}\tau^{5}[[[B,A],B],[B,A]]+\mathcal{O}(h^{7}\varepsilon^{-1}).

    Observe that, by Theorem 3, the first two commutators (which involve three letters) scale like $\mathcal{O}(h^{3}\varepsilon^{-1})$ and the remainder scales like $\mathcal{O}(h^{5}\varepsilon^{-1})$. Therefore, these first two terms are what will appear in this Zassenhaus splitting. However, the commutator,

    [[B,A],B]=[[\varepsilon\partial_{x}^{2},-\varepsilon^{-1}V],\varepsilon\partial_{x}^{2}]=\varepsilon\left(V^{(4)}+4V^{(3)}\partial_{x}+4V^{(2)}\partial_{x}^{2}\right),

    will not be skew-Hermitian after discretisation (which would result in loss of unitarity of the method), and therefore cannot be substituted into (2.14). For this reason, as proposed in (?), we use a substitution rule of the following kind:

    y(x)\partial_{x}=-\frac{1}{2}\left[\int_{x_{0}}^{x}y(s)\,\mathrm{d}s\right]\partial_{x}^{2}-\frac{1}{2}y'(x)+\frac{1}{2}\partial_{x}^{2}\left[\int_{x_{0}}^{x}y(s)\,\mathrm{d}s\,\cdot\,\right],

    and obtain terms that remain skew-Hermitian after discretisation:

    \mathrm{sBCH}(-\tau B,\,\mathrm{sBCH}(-\tau A,\tau B+\tau A))
    =\tfrac{1}{12}\tau^{3}\varepsilon^{-1}(V^{(1)})^{2}+\tfrac{1}{12}\tau^{3}\varepsilon V^{(4)}+\tfrac{1}{3}\tau^{3}\varepsilon V^{(3)}\partial_{x}+\tfrac{1}{3}\tau^{3}\varepsilon V^{(2)}\partial_{x}^{2}+\mathcal{O}(h^{5}\varepsilon^{-1})
    =\tfrac{1}{12}\tau^{3}\varepsilon^{-1}(V^{(1)})^{2}+\tfrac{1}{6}\tau^{3}\varepsilon\underbrace{\left\{V^{(2)}\partial_{x}^{2}+\partial_{x}^{2}\left[V^{(2)}\,\cdot\,\right]\right\}}_{\mathcal{O}(\varepsilon^{-2})}-\tfrac{1}{12}\tau^{3}\varepsilon V^{(4)}+\mathcal{O}(h^{5}\varepsilon^{-1}).

    In the final form of the splitting (2.14) the small $\mathcal{O}(h^{3}\varepsilon)$ term involving $V^{(4)}$ can be discarded.

Summing up these two derivations, we have the splittings,

u^{k+1}(x)=\mathrm{e}^{\mathcal{R}_{0}}\mathrm{e}^{2\mathcal{R}_{1}}\mathrm{e}^{\mathcal{R}_{0}}u^{k}(x)+\mathcal{O}(h^{3}\varepsilon^{-1}) \qquad(2.15)

and

u^{k+1}(x)=\mathrm{e}^{\mathcal{R}_{0}}\mathrm{e}^{\mathcal{R}_{1}}\mathrm{e}^{2\mathcal{R}_{2}}\mathrm{e}^{\mathcal{R}_{1}}\mathrm{e}^{\mathcal{R}_{0}}u^{k}(x)+\mathcal{O}(h^{5}\varepsilon^{-1}), \qquad(2.16)

where, letting $\tau=\mathrm{i}h$,

\mathcal{R}_{0} = -\tfrac{1}{2}\tau\varepsilon^{-1}V,
\mathcal{R}_{1} = \tfrac{1}{2}\tau\varepsilon\partial_{x}^{2},
\mathcal{R}_{2} = \tfrac{1}{12}\tau^{3}\varepsilon\left\{\partial_{x}^{2}[V^{(2)}\,\cdot\,]+V^{(2)}\partial_{x}^{2}\right\}+\tfrac{1}{24}\tau^{3}\varepsilon^{-1}(V^{(1)})^{2}.

Note that $\mathcal{R}_{0}=\mathcal{O}(h\varepsilon^{-1})$, $\mathcal{R}_{1}=\mathcal{O}(h\varepsilon^{-1})$ and $\mathcal{R}_{2}=\mathcal{O}(h^{3}\varepsilon^{-1})$.
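For a small problem the splitting (2.16) can be assembled verbatim with dense matrix exponentials, a useful correctness check even though it bypasses fast methods for the exponentials. The grid, potential and parameters below are purely illustrative choices of ours, and the grid is periodic only for convenience:

```python
import numpy as np
from scipy.linalg import expm

N, eps, h = 64, 0.1, 0.01
x = 2*np.pi*np.arange(N)/N
tau = 1j*h
V1 = np.cos(x)                 # V'(x)  for the sample potential V(x) = sin(x)
V2 = -np.sin(x)                # V''(x)

# dense spectral differentiation matrix for d^2/dx^2
k = 2*np.pi*np.fft.fftfreq(N, d=2*np.pi/N)
F = np.fft.fft(np.eye(N), axis=0)
D2 = np.real(np.linalg.inv(F) @ np.diag(-k**2) @ F)

R0 = np.diag(-0.5*tau/eps*np.sin(x))
R1 = 0.5*tau*eps*D2
R2 = (tau**3*eps/12)*(D2 @ np.diag(V2) + np.diag(V2) @ D2) \
   + (tau**3/(24*eps))*np.diag(V1**2)

u0 = np.exp(1j*np.sin(x)/eps)/np.sqrt(N)
u1 = expm(R0) @ expm(R1) @ expm(2*R2) @ expm(R1) @ expm(R0) @ u0
# every exponent is skew-Hermitian, so the step is unitary
```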

It is also possible to derive even higher order methods, such as

u^{n+1}(x)=\mathrm{e}^{\mathcal{R}_{0}}\mathrm{e}^{\mathcal{R}_{1}}\mathrm{e}^{\mathcal{R}_{2}}\mathrm{e}^{2\mathcal{R}_{3}}\mathrm{e}^{\mathcal{R}_{2}}\mathrm{e}^{\mathcal{R}_{1}}\mathrm{e}^{\mathcal{R}_{0}}u^{n}(x)+\mathcal{O}(h^{7}\varepsilon^{-1}), \qquad(2.17)

where

\mathcal{R}_{3} = -\tfrac{1}{120}\tau^{5}\varepsilon^{-1}V^{(2)}(V^{(1)})^{2}+\tfrac{1}{24}\tau^{3}\varepsilon V^{(4)}
+\tfrac{1}{120}\tau^{5}\varepsilon\left\{\partial_{x}^{2}\left[\left(7(V^{(2)})^{2}+V^{(3)}V^{(1)}\right)\,\cdot\,\right]+\left(7(V^{(2)})^{2}+V^{(3)}V^{(1)}\right)\partial_{x}^{2}\right\}
+\tfrac{1}{60}\tau^{5}\varepsilon^{3}\left\{\partial_{x}^{4}\left[V^{(4)}\,\cdot\,\right]+V^{(4)}\partial_{x}^{4}\right\}.

Note that $\mathcal{R}_{3}=\mathcal{O}(h^{5}\varepsilon^{-1})$. We refer the reader to (?) for a discussion of deriving such higher-order methods via a sequence of skew-Hermitian operators $\mathcal{R}_{0},\mathcal{R}_{1},\ldots$. Our new analysis, encapsulated in Theorem 3, shows that each term $\mathcal{R}_{\ell}$ is actually of size $\mathcal{O}(h^{2\ell-1}\varepsilon^{-1})$ for $\ell=1,2,\ldots$. In Section 5 we will discuss how to go about computing $\mathrm{e}^{\mathcal{R}_{\ell}}$ for each $\ell$.

3 Orthonormal systems and free Schrödinger evolutions

3.1 Orthogonal systems with tridiagonal differentiation matrices

Solving (1.1) by spectral methods based upon the symmetric Zassenhaus splittings (2.15) or (2.16) involves three ingredients: the splitting itself into $\mathcal{R}_{0},\mathcal{R}_{1},\mathcal{R}_{2},\ldots$, the choice of spectral basis, and the means to compute the exponentials $\mathrm{e}^{\mathcal{R}_{\ell}}$. The generalisation of each to the new setting requires new ideas and substantial effort. In this subsection we are concerned with the choice of the spectral basis.

We seek a set $\Phi=\{\varphi_{n}\}_{n=0}^{\infty}$ which forms an orthonormal basis of $\mathrm{L}_{2}(\mathbb{R})$; this means that any $f\in\mathrm{L}_{2}(\mathbb{R})$ can be expanded in the form

f(x)=\sum_{n=0}^{\infty}\hat{f}_{n}\varphi_{n}(x),\qquad\text{where}\qquad\hat{f}_{n}=\int_{-\infty}^{\infty}f(x)\overline{\varphi_{n}(x)}\,\mathrm{d}x,\quad n\in\mathbb{Z}_{+}.

For the time being we require the $\varphi_{n}$s to be real, although this requirement will be lifted as necessary (with suitable changes). In addition we require that $\Phi$ has a tridiagonal differentiation matrix (which, it is easy to prove, must be skew-symmetric),

\varphi_{n}'=-b_{n-1}\varphi_{n-1}+b_{n}\varphi_{n+1},\qquad n\in\mathbb{Z}_{+}, \qquad(3.1)

where $b_{-1}=0$ and $b_{n}>0$, $n\in\mathbb{Z}_{+}$. This makes both computation and analysis considerably easier.
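A concrete instance of (3.1) is furnished by the Hermite functions $\varphi_n(x)=\mathrm{H}_n(x)\,\mathrm{e}^{-x^2/2}/(2^n n!\sqrt{\pi})^{1/2}$ taken up later in this section, for which $b_n=\sqrt{(n+1)/2}$, up to the sign convention chosen for the basis. The following check (grid and index $n$ are arbitrary choices of ours) verifies the relation $\varphi_n'=\sqrt{n/2}\,\varphi_{n-1}-\sqrt{(n+1)/2}\,\varphi_{n+1}$ numerically:

```python
import numpy as np
from scipy.special import eval_hermite, factorial

def hermite_fun(n, x):
    """Orthonormal Hermite function H_n(x) e^{-x^2/2} / sqrt(2^n n! sqrt(pi))."""
    c = np.sqrt(2.0**n*factorial(n)*np.sqrt(np.pi))
    return eval_hermite(n, x)*np.exp(-x**2/2)/c

x = np.linspace(-5, 5, 2001)
n = 4
dphi = np.gradient(hermite_fun(n, x), x)          # numerical derivative
rhs = np.sqrt(n/2)*hermite_fun(n - 1, x) \
    - np.sqrt((n + 1)/2)*hermite_fun(n + 1, x)
err = np.max(np.abs(dphi - rhs))                  # O(dx^2) discretisation error
```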

A comprehensive theory of such orthogonal systems has been developed in (??). The main issue, making (3.1) compatible with orthonormality, can be explicated by considering Fourier transforms of the $\varphi_{n}$s. Specifically, let $w(\xi)\,\mathrm{d}\xi$ be a Borel measure over $\mathbb{R}$ whose Radon–Nikodym derivative $w\geq 0$ is absolutely continuous and even, and assume that all the moments of this measure are finite. Such a measure generates a system of orthonormal polynomials $\{p_{n}\}_{n=0}^{\infty}$,

pn(ξ)pm(ξ)w(ξ)dξ=0,mn,pn2(ξ)w(ξ)dξ=1.\int_{-\infty}^{\infty}p_{n}(\xi)p_{m}(\xi)w(\xi)\,\mathrm{d}\xi=0,\quad m\neq n,\qquad\int_{-\infty}^{\infty}p_{n}^{2}(\xi)w(\xi)\,\mathrm{d}\xi=1.

Then the scaled inverse Fourier transform,

φn(x)=(i)n2πpn(ξ)g(ξ)eixξdξ,n+,\varphi_{n}(x)=\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{-\infty}^{\infty}p_{n}(\xi)g(\xi){\mathrm{e}}^{{\mathrm{i}}x\xi}\,\mathrm{d}\xi,\qquad n\in\mbox{\Bbb Z}_{+}, (3.2)

where gg is any function satisfying |g(ξ)|2=w(ξ)|g(\xi)|^{2}=w(\xi), forms an orthonormal system on the real line which satisfies (3.1). Note that this system is real-valued if and only if gg has even real part and odd imaginary part, for example g(ξ)=w(ξ)g(\xi)=\sqrt{w(\xi)}. The constants bnb_{n} in (3.1) are inherited from the recurrence relation for orthonormal polynomials,

bnpn+1(ξ)=ξpn(ξ)bn1pn1(ξ),n+.b_{n}p_{n+1}(\xi)=\xi p_{n}(\xi)-b_{n-1}p_{n-1}(\xi),\qquad n\in\mbox{\Bbb Z}_{+}.
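As a concrete check, the following Python sketch (taking the Hermite weight w(ξ)=eξ2w(\xi)={\mathrm{e}}^{-\xi^{2}} as an assumed example, for which bn=(n+1)/2b_{n}=\sqrt{(n+1)/2} and the zeroth moment is π\sqrt{\pi}) evaluates p0,,pNp_{0},\ldots,p_{N} through the recurrence and confirms their orthonormality with a Gauss–Hermite rule:

```python
import numpy as np

# Hermite weight w(xi) = exp(-xi^2), an illustrative choice: its orthonormal
# polynomials obey b_n p_{n+1} = xi p_n - b_{n-1} p_{n-1}, b_n = sqrt((n+1)/2)
N = 6
b = np.sqrt((np.arange(N) + 1) / 2.0)

nodes, weights = np.polynomial.hermite.hermgauss(20)   # Gauss rule for w

P = np.zeros((N + 1, len(nodes)))
P[0] = np.pi ** (-0.25)                  # p_0 = mu_0^{-1/2}, mu_0 = sqrt(pi)
P[1] = nodes * P[0] / b[0]
for n in range(1, N):
    P[n + 1] = (nodes * P[n] - b[n - 1] * P[n - 1]) / b[n]

G = (P * weights) @ P.T                  # Gram matrix: int p_m p_n w dxi
```

The 20-point rule is exact for polynomials of degree up to 39, so the Gram matrix equals the identity to machine precision.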

The orthonormal system given by (3.2) need not be dense in L2()\mathrm{L}_{2}(\mbox{\Bbb R}) – as a matter of fact, it is dense in the Paley–Wiener space 𝒫𝒲supp(w)()L2()\mathcal{PW}_{\mbox{supp}(w)}(\mbox{\Bbb R})\subseteq\mathrm{L}_{2}(\mbox{\Bbb R}), the space of L2()\mathrm{L}_{2}(\mbox{\Bbb R}) functions whose Fourier transforms vanish outside the support of ww. Therefore, the system is a basis of L2()\mathrm{L}_{2}(\mbox{\Bbb R}) if and only if the weight function ww is positive on the whole real line.

Complete orthonormal bases can be formed also from polynomials P={pn}n=0P=\{p_{n}\}_{n=0}^{\infty} orthogonal on the half-line [0,)[0,\infty) (?), e.g. the Laguerre polynomials whose orthogonality measure is eξdξ{\mathrm{e}}^{-\xi}\,\mathrm{d}\xi, ξ0\xi\geq 0: The representation (3.2) survives intact but, to render the system dense in L2()\mathrm{L}_{2}(\mbox{\Bbb R}), we need to complement PP with orthogonal polynomials with respect to the mirror image of ww in the left half-line, w(ξ)dξw(-\xi)\,\mathrm{d}\xi for ξ0\xi\leq 0. The new system Φ\Phi is enumerated by nn\in\mbox{\Bbb Z} and in place of (3.1) we have

φn=bn1φn1+icnφn+bnφn+1,n,\varphi_{n}^{\prime}=-b_{n-1}\varphi_{n-1}+{\mathrm{i}}c_{n}\varphi_{n}+b_{n}\varphi_{n+1},\qquad n\in\mbox{\Bbb Z},

with bn>0b_{n}>0, n0n\neq 0, b0=0b_{0}=0 and real cnc_{n} – note that the new differentiation matrix is skew-Hermitian.

3.2 Free Schrödinger evolutions

Given an orthonormal system Φ\Phi on the real line, we denote by ψn\psi_{n}, n+n\in\mbox{\Bbb Z}_{+}, the solution of the free Schrödinger equation (1.3) with the initial condition φn\varphi_{n} – in other words,

ψnt=iε2ψnx2,ψn(x,0)=φn(x),x.\frac{\partial\psi_{n}}{\partial t}=-{\mathrm{i}}\varepsilon\frac{\partial^{2}\psi_{n}}{\partial x^{2}},\qquad\psi_{n}(x,0)=\varphi_{n}(x),\;x\in\mbox{\Bbb R}. (3.3)

We call Ψ(t)={ψn(,t)}n=0\Psi(t)=\{\psi_{n}(\cdot,t)\}_{n=0}^{\infty} the free Schrödinger evolution (FSE) of Φ\Phi.

The exact solution of (3.3) via the Fourier transform is well known and can be easily verified by direct differentiation:

ψn(x,t)=12πφ^n(η)eiη2εt+iηxdη,\psi_{n}(x,t)=\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}\hat{\varphi}_{n}(\eta){\mathrm{e}}^{{\mathrm{i}}\eta^{2}\varepsilon t+{\mathrm{i}}\eta x}\,\mathrm{d}\eta, (3.4)

where

φ^n(η)=12πφn(ξ)eiηξdξ\hat{\varphi}_{n}(\eta)=\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}\varphi_{n}(\xi){\mathrm{e}}^{-{\mathrm{i}}\eta\xi}\,\mathrm{d}\xi

is the familiar Fourier transform of φn\varphi_{n}.

On the face of it, our job is done: any mention of the phrase “Fourier transform” elicits from a numerical analyst the instinctive response “Fast Fourier Transform!”. This, however, is somewhat rash. An FFT computes rapidly the discrete Fourier transform which, in turn, is a very precise (at any rate, for very smooth functions) approximation of the Fourier transform of a periodic function in a compact interval, while our setting is the entire real line. One possibility is to clip the real line, approximating it by a sufficiently large interval and disregarding the Gibbs effect at the endpoints. This immediately raises the question “how large?” which, while not beyond the ken of numerical reasoning, presents its own challenges. In this paper we adopt an alternative – and arguably more effective – point of view, seeking the exact solution of (3.4) for specific orthonormal systems Φ\Phi. While this approach cannot be expected to apply to each and every Φ\Phi consistent with the setting of Subsection 3.1, it does so for the two most interesting orthonormal systems: Hermite functions and Malmquist–Takenaka functions.

Once FSEs Ψ(t)\Psi(t) are known, the solution of the free Schrödinger equation (1.3) with the initial condition u(x,kh)u(x,kh) proceeds as follows: The function u(x,kh)u(x,kh) is expanded in the orthonormal basis Φ\Phi,

u(x,kh)n=0Nu^nφn(x)u(x,kh)\approx\sum_{n=0}^{N}\hat{u}_{n}\varphi_{n}(x) (3.5)

for a sufficiently large truncation parameter NN. Having done so, linearity of (1.3) implies that

u(x,(k+1)h)n=0Nu^nψn(x,h).u(x,(k+1)h)\approx\sum_{n=0}^{N}\hat{u}_{n}\psi_{n}(x,h). (3.6)

We get the coefficients for free because they do not change — it is the basis which changes. The choice of NN is governed by approximation properties of the spectral basis, and its ability to approximate spatial oscillations of frequency 𝒪(ε1){\cal O}\!\left(\varepsilon^{-1}\right) as discussed in the introduction.

Indeed, orthonormal systems are not all of equal value: more specifically, they can approximate functions at different speeds. While standard spectral methods on a torus are known to converge (for analytic functions) at an exponential speed, an equivalent theory does not yet exist on the real line. Recalling from Section 1 that solutions of (1.1) are typically composed of wave packets, it is instructive to enquire how well different orthonormal systems approximate wave packets. This is investigated in (?) for the two families Φ\Phi described in the sequel: in both cases we can prove exponentially fast convergence to within any prescribed error tolerance.

We note for further reference that the computation of (3.6) (once NN and hh have been appropriately chosen) requires both the knowledge of Ψ(h)\Psi(h) and the means to evaluate an expansion as in (3.5).

Theorem 4

Let Φ\Phi be as in (3.2). Then the functions,

ψn(x,t)=(i)n2πpn(ξ)g(ξ)eixξ+iεtξ2dξ,n+,\psi_{n}(x,t)=\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{-\infty}^{\infty}p_{n}(\xi)g(\xi){\mathrm{e}}^{{\mathrm{i}}x\xi+{\mathrm{i}}\varepsilon t\xi^{2}}\,\mathrm{d}\xi,\qquad n\in\mbox{\Bbb Z}_{+}, (3.7)

where {pn}n=0\{p_{n}\}_{n=0}^{\infty} is the system of orthonormal polynomials with respect to the measure |g(ξ)|2dξ|g(\xi)|^{2}\,\mathrm{d}\xi, satisfies (3.3) (in particular ψn(x,0)=φn(x)\psi_{n}(x,0)=\varphi_{n}(x)) and for all tt is itself an orthonormal basis of L2()\mathrm{L}_{2}(\mbox{\Bbb R}) satisfying,

ψn(x,t)x=bn1ψn1(x,t)+bnψn+1(x,t),n+,\frac{\partial\psi_{n}(x,t)}{\partial x}=-b_{n-1}\psi_{n-1}(x,t)+b_{n}\psi_{n+1}(x,t),\qquad n\in\mbox{\Bbb Z}_{+}, (3.8)

where {bn}n+\{b_{n}\}_{n\in\mbox{\sBbb Z}_{+}} are the same constants as in (3.1).

Proof Differentiating under the integral sign with respect to xx twice and tt once demonstrates that ψn(x,t)\psi_{n}(x,t) satisfies the free Schrödinger equation (3.3), and it is clear that setting t=0t=0 in this formula yields φn(x)\varphi_{n}(x).

To show that ψn\psi_{n} is an orthonormal system satisfying (3.8), note that

|g(ξ)eiεtξ2|2=|g(ξ)|2=w(ξ),\left|g(\xi){\mathrm{e}}^{{\mathrm{i}}\varepsilon t\xi^{2}}\right|^{2}=|g(\xi)|^{2}=w(\xi), (3.9)

so these functions still come under the framework of (3.2), with exactly the same polynomials {pn}n+\{p_{n}\}_{n\in\mbox{\sBbb Z}_{+}}, but with the function g(ξ)eiεtξ2g(\xi){\mathrm{e}}^{{\mathrm{i}}\varepsilon t\xi^{2}} in place of g(ξ)g(\xi).

\Box

3.3 Re-expanding an FSE expansion in the original basis

Suppose that we have an expansion in the FSE basis Ψ(t)={ψn}n=0\Psi(t)=\{\psi_{n}\}_{n=0}^{\infty},

u(x,t)=n=0anψn(x,t),u(x,t)=\sum_{n=0}^{\infty}a_{n}\psi_{n}(x,t), (3.10)

and we wish to re-expand this function in terms of the original basis Φ(=Ψ(0))\Phi(=\Psi(0)) for each tt. Let us consider time-dependent coefficients αn(t)\alpha_{n}(t) satisfying

u(x,t)=n=0αn(t)φn(x).u(x,t)=\sum_{n=0}^{\infty}\alpha_{n}(t)\varphi_{n}(x). (3.11)

The relationship between 𝜶(t)\boldsymbol{\alpha}(t) and 𝒂\boldsymbol{a} is simple when considered in terms of the polynomial basis PP. Indeed (taking ε=1\varepsilon=1 for simplicity, as in Subsection 4.1 – otherwise replace tt by εt\varepsilon t), the relationship is given by

n=0(i)nαn(t)pn(ξ)=n=0(i)nanpn(ξ)eitξ2,\sum_{n=0}^{\infty}(-{\mathrm{i}})^{n}\alpha_{n}(t)p_{n}(\xi)=\sum_{n=0}^{\infty}(-{\mathrm{i}})^{n}a_{n}p_{n}(\xi){\mathrm{e}}^{{\mathrm{i}}t\xi^{2}}, (3.12)

where the expansions are convergent in the space L2(,w(ξ)dξ)\mathrm{L}_{2}(\mbox{\Bbb R},w(\xi)\mathrm{d}\xi). Writing this in terms of operators acting on the vectors 𝒂,𝜶(t)2\boldsymbol{a},\boldsymbol{\alpha}(t)\in\ell_{2}, we have

TS𝜶(t)=M(t)TS𝒂,TS\boldsymbol{\alpha}(t)=M(t)TS\boldsymbol{a}, (3.13)

where S:22S:\ell_{2}\to\ell_{2} simply multiplies the nnth component of a sequence by (i)n(-{\mathrm{i}})^{n}, T:2L2(,w(ξ)dξ)T:\ell_{2}\to\mathrm{L}_{2}(\mbox{\Bbb R},w(\xi)\mathrm{d}\xi) is the synthesis operator for the basis PP (also known as the coefficients-to-values operator), and M(t):L2(,w(ξ)dξ)L2(,w(ξ)dξ)M(t):\mathrm{L}_{2}(\mbox{\Bbb R},w(\xi)\mathrm{d}\xi)\to\mathrm{L}_{2}(\mbox{\Bbb R},w(\xi)\mathrm{d}\xi) multiplies functions by exp(itξ2)\exp\left({\mathrm{i}}t\xi^{2}\right). Note that since PP is an orthonormal basis for L2(,w(ξ)dξ)\mathrm{L}_{2}(\mbox{\Bbb R},w(\xi)\mathrm{d}\xi), TT is a unitary operator (its inverse TT^{*} is usually called the analysis operator or values-to-coefficients operator). SS is also clearly unitary, so we can invert these operators to find,

𝜶(t)=STM(t)TS𝒂.\boldsymbol{\alpha}(t)=S^{*}T^{*}M(t)TS\boldsymbol{a}. (3.14)

Since M(t)M(t) is unitary, we see that as expected, the operation sending 𝒂\boldsymbol{a} to 𝜶(t)\boldsymbol{\alpha}(t) is unitary overall.

Now, let us project these equations onto the first N+1N+1 terms of Φ\Phi. We obtain,

𝜶[N](t)=SNTNMN(t)TNSN𝒂[N],\boldsymbol{\alpha}^{[N]}(t)=S_{N}^{*}T_{N}^{*}M_{N}(t)T_{N}S_{N}\boldsymbol{a}^{[N]}, (3.15)

where 𝜶[N](t),𝒂[N]N+1\boldsymbol{\alpha}^{[N]}(t),\boldsymbol{a}^{[N]}\in\mbox{\Bbb C}^{N+1}, and SN,TN,MN(t):N+1N+1S_{N},T_{N},M_{N}(t):\mbox{\Bbb C}^{N+1}\to\mbox{\Bbb C}^{N+1}. These discretised operators are, SN=diag((i)n)n=0NS_{N}=\mathrm{diag}((-{\mathrm{i}})^{n})_{n=0}^{N}, MN(t)=diag(exp(itξk2))k=0NM_{N}(t)=\mathrm{diag}(\exp({\mathrm{i}}t\xi_{k}^{2}))_{k=0}^{N}, and

TN=(w0p0(ξ0)w0p1(ξ0)w0pN(ξ0)w1p0(ξ1)w1p1(ξ1)w1pN(ξ1)wNp0(ξN)wNp1(ξN)wNpN(ξN)),T_{N}=\begin{pmatrix}\sqrt{w_{0}}p_{0}(\xi_{0})&\sqrt{w_{0}}p_{1}(\xi_{0})&\cdots&\sqrt{w_{0}}p_{N}(\xi_{0})\\ \sqrt{w_{1}}p_{0}(\xi_{1})&\sqrt{w_{1}}p_{1}(\xi_{1})&\cdots&\sqrt{w_{1}}p_{N}(\xi_{1})\\ \vdots&\vdots&&\vdots\\ \sqrt{w_{N}}p_{0}(\xi_{N})&\sqrt{w_{N}}p_{1}(\xi_{N})&\cdots&\sqrt{w_{N}}p_{N}(\xi_{N})\\ \end{pmatrix}, (3.16)

where w0,,wN,ξ0,,ξNw_{0},\ldots,w_{N},\xi_{0},\ldots,\xi_{N} are Gauss quadrature weights and nodes (respectively) for the measure w(ξ)dξw(\xi)\mathrm{d}\xi. First, note that the unitarity of the operators has been preserved by this discretisation. Second, note that the unitary matrix TNT_{N} and the nodes {ξk}k=0N\{\xi_{k}\}_{k=0}^{N} can be computed rapidly and stably by computing the eigendecomposition of the Jacobi matrix for the orthonormal polynomials PP, as in the Golub–Welsch algorithm (?). However, if PP is an orthonormal polynomial basis which enjoys fast transforms from coefficients to values and back, and fast computation of Gaussian quadrature nodes (Jacobi polynomials, for example (?)) then such algorithms can be used in place of the generic Golub–Welsch approach.
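The whole pipeline can be sketched in a few lines of Python for the Hermite weight (an assumed example; all names below are ours): the Jacobi matrix built from the recurrence coefficients yields, via its eigendecomposition, the Gauss nodes and weights in the spirit of Golub–Welsch; TNT_{N} is then assembled as in (3.16) and the discrete re-expansion (3.15) applied. Since every factor is unitary, the 2\ell_{2} norm of the coefficient vector is preserved.

```python
import numpy as np

N = 10
b = np.sqrt((np.arange(N) + 1) / 2.0)        # Hermite recurrence coefficients
J = np.diag(b, 1) + np.diag(b, -1)           # (N+1) x (N+1) Jacobi matrix
xi, V = np.linalg.eigh(J)                    # Golub-Welsch: nodes = eigenvalues
mu0 = np.sqrt(np.pi)                         # int w(xi) dxi for w = exp(-xi^2)
w = mu0 * V[0, :] ** 2                       # weights from first components

# T_N[k, n] = sqrt(w_k) p_n(xi_k), with p_n evaluated by the recurrence
P = np.zeros((N + 1, N + 1))
P[0] = np.pi ** (-0.25)
P[1] = xi * P[0] / b[0]
for n in range(1, N):
    P[n + 1] = (xi * P[n] - b[n - 1] * P[n - 1]) / b[n]
TN = (np.sqrt(w) * P).T

# discrete re-expansion (3.15): alpha = S* T* M(t) T S a  (here eps = 1)
t = 0.3
S = np.diag((-1j) ** np.arange(N + 1))
M = np.diag(np.exp(1j * t * xi ** 2))
a = np.random.default_rng(0).standard_normal(N + 1)
alpha = S.conj().T @ TN.conj().T @ M @ TN @ S @ a
```

The nodes coincide with those returned by a library Gauss–Hermite routine, and TNTN=IT_{N}^{*}T_{N}=I holds to machine precision by Gauss quadrature exactness.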

4 Examples of orthonormal systems

In this section we describe two systems Φ\Phi and their free Schrödinger evolutions Ψ(t)\Psi(t).

4.1 Hermite functions

Hermite functions

φn(x)=1(2nn!π1/2)1/2Hn(x)ex2/2,n+,\varphi_{n}(x)=\frac{1}{{(2^{n}n!\pi^{1/2})}^{1/2}}\mathrm{H}_{n}(x){\mathrm{e}}^{-x^{2}/2},\qquad n\in\mbox{\Bbb Z}_{+}, (4.1)

where Hn\mathrm{H}_{n} is the nnth Hermite polynomial, are eigenfunctions of the Fourier transform,

12πφn(ξ)eixξdξ=inφn(x),x,n+.\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}\varphi_{n}(\xi){\mathrm{e}}^{{\mathrm{i}}x\xi}\,\mathrm{d}\xi={\mathrm{i}}^{n}\varphi_{n}(x),\qquad x\in\mbox{\Bbb R},\quad n\in\mbox{\Bbb Z}_{+}. (4.2)

Their orthonormality in L2()\mathrm{L}_{2}(\mbox{\Bbb R}) follows from that of the familiar Hermite polynomials (?, 18.3) in L2(;eξ2)\mathrm{L}_{2}(\mbox{\Bbb R};{\mathrm{e}}^{-\xi^{2}}); they obey the differential recurrence relation (3.1) with bn=(n+1)/2b_{n}=\sqrt{(n+1)/2} and the Cramér inequality |φn(x)|π1/4|\varphi_{n}(x)|\leq\pi^{-1/4}, xx\in\mbox{\Bbb R}. (They should not be confused with the Hermite functions of (?, p. 84).)
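The following Python sketch (our own helper, not from the paper) evaluates φ0,,φN\varphi_{0},\ldots,\varphi_{N} by the stable recurrence φn+1=2/(n+1)xφnn/(n+1)φn1\varphi_{n+1}=\sqrt{2/(n+1)}\,x\varphi_{n}-\sqrt{n/(n+1)}\,\varphi_{n-1}, which follows from the classical relation xφn=n/2φn1+(n+1)/2φn+1x\varphi_{n}=\sqrt{n/2}\,\varphi_{n-1}+\sqrt{(n+1)/2}\,\varphi_{n+1}; both the Cramér bound and orthonormality can then be confirmed numerically:

```python
import numpy as np

def hermite_functions(N, x):
    """phi_0, ..., phi_N of (4.1) at the points x, via the stable recurrence
    phi_{n+1} = sqrt(2/(n+1)) x phi_n - sqrt(n/(n+1)) phi_{n-1}."""
    phi = np.zeros((N + 1, len(x)))
    phi[0] = np.pi ** (-0.25) * np.exp(-x ** 2 / 2)
    if N >= 1:
        phi[1] = np.sqrt(2.0) * x * phi[0]
    for n in range(1, N):
        phi[n + 1] = (np.sqrt(2.0 / (n + 1)) * x * phi[n]
                      - np.sqrt(n / (n + 1.0)) * phi[n - 1])
    return phi

x = np.linspace(-10, 10, 4001)
phi = hermite_functions(8, x)
```

Evaluating through the recurrence, rather than multiplying Hn\mathrm{H}_{n} by ex2/2{\mathrm{e}}^{-x^{2}/2}, avoids overflow of the polynomial against underflow of the Gaussian.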

To derive the FSE Ψ={ψn}n=0\Psi=\{\psi_{n}\}_{n=0}^{\infty} we assume the atomistic setting ε=1\varepsilon=1: to translate to the semiclassical setting, we replace tt by εt\varepsilon t in the final formula. Our starting point is the standard generating function for Hermite polynomials,

n=0Hn(x)n!zn=e2xzz2\sum_{n=0}^{\infty}\frac{\mathrm{H}_{n}(x)}{n!}z^{n}={\mathrm{e}}^{2xz-z^{2}}

(?, 18.12.15). It now follows from (4.1) that

π1/4ex2/2n=0φn(x)n!(21/2z)n=e2xzz2\pi^{1/4}{\mathrm{e}}^{x^{2}/2}\sum_{n=0}^{\infty}\frac{\varphi_{n}(x)}{\sqrt{n!}}(2^{1/2}z)^{n}={\mathrm{e}}^{2xz-z^{2}}

or, replacing z21/2zz\rightarrow 2^{-1/2}z,

n=0φn(x)n!zn=π1/4exp(x22+21/2xzz22).\sum_{n=0}^{\infty}\frac{\varphi_{n}(x)}{\sqrt{n!}}z^{n}=\pi^{-1/4}\exp\!\left(-\frac{x^{2}}{2}+2^{1/2}xz-\frac{z^{2}}{2}\right)\!.

It now follows from (3.4) and (4.2) that

n=0ψn(x,t)n!(iz)n\displaystyle\sum_{n=0}^{\infty}\frac{\psi_{n}(x,t)}{\sqrt{n!}}({\mathrm{i}}z)^{n} =\displaystyle= 12πn=0znn!φn(ξ)ei(ξ2t+ξx)dξ\displaystyle\frac{1}{\sqrt{2\pi}}\sum_{n=0}^{\infty}\frac{z^{n}}{\sqrt{n!}}\int_{-\infty}^{\infty}\varphi_{n}(\xi){\mathrm{e}}^{{\mathrm{i}}(\xi^{2}t+\xi x)}\,\mathrm{d}\xi
=\displaystyle= 12π[n=0φn(ξ)n!zn]ei(ξ2t+ξx)dξ\displaystyle\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}\left[\sum_{n=0}^{\infty}\frac{\varphi_{n}(\xi)}{\sqrt{n!}}z^{n}\right]\!{\mathrm{e}}^{{\mathrm{i}}(\xi^{2}t+\xi x)}\,\mathrm{d}\xi
=\displaystyle= 121/2π3/4exp(12ξ2+21/2ξz12z2+iξ2t+iξx)dξ\displaystyle\frac{1}{2^{1/2}\pi^{3/4}}\int_{-\infty}^{\infty}\exp\!\left(-\frac{1}{2}\xi^{2}+2^{1/2}\xi z-\frac{1}{2}z^{2}+{\mathrm{i}}\xi^{2}t+{\mathrm{i}}\xi x\right)\!\,\mathrm{d}\xi
=\displaystyle= 1π1/4(12it)1/2exp(z2+23/2ixzx2+2itz22(2it1)).\displaystyle\frac{1}{\pi^{1/4}(1-2{\mathrm{i}}t)^{1/2}}\exp\!\left(-\frac{z^{2}+2^{3/2}{\mathrm{i}}xz-x^{2}+2{\mathrm{i}}tz^{2}}{2(2{\mathrm{i}}t-1)}\right)\!.

We conclude that

n=0ψn(x,t)n!(iz)n=1π1/4(12it)1/2exp(x22(2it1))exp(21/2ixz2it1122it+12it1z2).\sum_{n=0}^{\infty}\frac{\psi_{n}(x,t)}{\sqrt{n!}}({\mathrm{i}}z)^{n}=\frac{1}{\pi^{1/4}(1-2{\mathrm{i}}t)^{1/2}}\exp\!\left(\frac{x^{2}}{2(2{\mathrm{i}}t-1)}\right)\exp\!\left(-\frac{2^{1/2}{\mathrm{i}}xz}{2{\mathrm{i}}t-1}-\frac{1}{2}\frac{2{\mathrm{i}}t+1}{2{\mathrm{i}}t-1}z^{2}\right)\!.

Set

X=x(1+4t2)1/2,Z=121/2(2it+12it1)1/2z,X=-\frac{x}{(1+4t^{2})^{1/2}},\qquad Z=\frac{1}{2^{1/2}}\left(\frac{2{\mathrm{i}}t+1}{2{\mathrm{i}}t-1}\right)^{\!1/2}z,

which satisfy,

2XZZ2=21/2ixz2it1122it+12it1z2,2XZ-Z^{2}=-\frac{2^{1/2}{\mathrm{i}}xz}{2{\mathrm{i}}t-1}-\frac{1}{2}\frac{2{\mathrm{i}}t+1}{2{\mathrm{i}}t-1}z^{2},

and we deduce, using again the generating function for Hermite polynomials, that

exp(21/2ixz2it1122it+12it1z2)=n=0Hn(X)n!Zn.\exp\!\left(-\frac{2^{1/2}{\mathrm{i}}xz}{2{\mathrm{i}}t-1}-\frac{1}{2}\frac{2{\mathrm{i}}t+1}{2{\mathrm{i}}t-1}z^{2}\right)=\sum_{n=0}^{\infty}\frac{\mathrm{H}_{n}(X)}{n!}Z^{n}.

All we thus need is to compare the powers of zz in

n=0ψn(x,t)n!(iz)n\displaystyle\sum_{n=0}^{\infty}\frac{\psi_{n}(x,t)}{\sqrt{n!}}({\mathrm{i}}z)^{n}
=\displaystyle= 1π1/4(12it)1/2exp(x22(2it1))n=0Hn(X)n!Zn\displaystyle\frac{1}{\pi^{1/4}(1-2{\mathrm{i}}t)^{1/2}}\exp\!\left(\frac{x^{2}}{2(2{\mathrm{i}}t-1)}\right)\sum_{n=0}^{\infty}\frac{\mathrm{H}_{n}(X)}{n!}Z^{n}
=\displaystyle= 1π1/4(12it)1/2exp(x22(2it1))n=01n!Hn(x(1+4t2)1/2)(121/2(2it+12it1)1/2z)n.\displaystyle\frac{1}{\pi^{1/4}(1-2{\mathrm{i}}t)^{1/2}}\exp\!\left(\frac{x^{2}}{2(2{\mathrm{i}}t-1)}\right)\sum_{n=0}^{\infty}\frac{1}{n!}\mathrm{H}_{n}\!\left(\!-\frac{x}{(1+4t^{2})^{1/2}}\right)\!\!\left(\frac{1}{2^{1/2}}\left(\frac{2{\mathrm{i}}t\!+\!1}{2{\mathrm{i}}t\!-\!1}\right)^{\!1/2}\!z\!\right)^{\!n}\!\!\!.

The outcome is

ψn(x,t)=in(2nn!π1/2)1/2(12it)1/2exp(x22(2it1))(2it+12it1)n/2Hn(x(1+4t2)1/2).\psi_{n}(x,t)=\frac{{\mathrm{i}}^{n}}{(2^{n}n!\pi^{1/2})^{1/2}(1-2{\mathrm{i}}t)^{1/2}}\exp\!\left(\frac{x^{2}}{2(2{\mathrm{i}}t-1)}\right)\!\left(\frac{2{\mathrm{i}}t+1}{2{\mathrm{i}}t-1}\right)^{\!n/2}\!\mathrm{H}_{n}\!\left(\frac{x}{(1+4t^{2})^{1/2}}\right)\!.

Finally, since

Hn(x(1+4t2)1/2)=(2nn!)1/2π1/4exp(x22(1+4t2))φn(x(1+4t2)1/2),\mathrm{H}_{n}\!\left(\frac{x}{(1+4t^{2})^{1/2}}\right)=(2^{n}n!)^{1/2}\pi^{1/4}\exp\!\left(\frac{x^{2}}{2(1+4t^{2})}\right)\!\varphi_{n}\!\left(\frac{x}{(1+4t^{2})^{1/2}}\right)\!,

we deduce, restoring the semiclassical setting, that

Lemma 5

The explicit form of the Hermite FSE is

ψn(x,t)=(1+2iεt)n/2(12iεt)(n+1)/2exp(itεx21+4ε2t2)φn(x(1+4ε2t2)1/2).\psi_{n}(x,t)=\frac{(1+2{\mathrm{i}}\varepsilon t)^{n/2}}{(1-2{\mathrm{i}}\varepsilon t)^{(n+1)/2}}\exp\!\left(-\frac{{\mathrm{i}}t\varepsilon x^{2}}{1+4\varepsilon^{2}t^{2}}\right)\!\varphi_{n}\!\left(\frac{x}{(1+4\varepsilon^{2}t^{2})^{1/2}}\right)\!. (4.3)

Moreover, the functions ψn\psi_{n} are subject to the bound

|ψn(x,t)|1[π(1+4ε2t2)]1/4,t0,x.|\psi_{n}(x,t)|\leq\frac{1}{[\pi(1+4\varepsilon^{2}t^{2})]^{1/4}},\qquad t\geq 0,\;\;x\in\mbox{\Bbb R}. (4.4)

Proof The expression (4.3) follows from the preceding analysis, while (4.4) is an immediate consequence of the Cramér inequality.        \Box

Refer to caption
Figure 4.1: The Hermite FSE: the functions |ψn(x,t)||\psi_{n}(x,t)| for n=0,,3n=0,\ldots,3, x[12,12]x\in[-12,12] and t[0,4]t\in[0,4].

Fig. 4.1 displays the magnitude of the first four ψn\psi_{n}s. It is evident that they are consistent with the inequality (4.4). There are two facts to bear in mind. Firstly, examining the modulus hides the oscillations in (4.3): in reality, the ψn\psi_{n}s are considerably more violent. Secondly, while the functions ψn\psi_{n} appear to spread energy and |ψn||\psi_{n}| seems to approach a steady state, in reality we are interested only in small values of tt, a single time step, so that t=h=𝒪(ε1/2)t=h={\cal O}\!\left(\varepsilon^{1/2}\right).

An implementation of FSEs based on Hermite functions necessitates in each time step the expansion of the initial value in Hermite functions. There exist powerful algorithms to this end, many based upon the fast multipole algorithm and generalisable to higher spatial dimensions (?).
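A direct implementation of (4.3) is straightforward. The Python sketch below (using the principal branch for the complex powers, which is adequate for small εt\varepsilon t, and a helper for the Hermite functions) reproduces φn\varphi_{n} at t=0t=0, respects the bound (4.4) and remains an orthonormal set for t>0t>0, consistently with Theorem 4:

```python
import numpy as np

def hermite_functions(N, x):
    # stable three-term recurrence for the Hermite functions (4.1)
    phi = np.zeros((N + 1, len(x)))
    phi[0] = np.pi ** (-0.25) * np.exp(-x ** 2 / 2)
    phi[1] = np.sqrt(2.0) * x * phi[0]
    for n in range(1, N):
        phi[n + 1] = (np.sqrt(2.0 / (n + 1)) * x * phi[n]
                      - np.sqrt(n / (n + 1.0)) * phi[n - 1])
    return phi

def hermite_fse(N, x, t, eps):
    """psi_0, ..., psi_N of the Hermite FSE (4.3); principal branches."""
    s2 = 1 + 4 * eps ** 2 * t ** 2
    n = np.arange(N + 1)
    prefac = (1 + 2j * eps * t) ** (n / 2) / (1 - 2j * eps * t) ** ((n + 1) / 2)
    chirp = np.exp(-1j * t * eps * x ** 2 / s2)
    return prefac[:, None] * chirp[None, :] * hermite_functions(N, x / np.sqrt(s2))

eps, t = 0.05, 0.5
x = np.linspace(-12, 12, 4801)
psi = hermite_fse(6, x, t, eps)
```

The modulus of the prefactor is (1+4ε2t2)1/4(1+4\varepsilon^{2}t^{2})^{-1/4} for every nn, which is precisely the mechanism behind (4.4).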

Lemma 6

The functions of the Hermite FSE in Lemma 5 satisfy the three-term recurrence,

x\psi_{n}(x,t)=\sqrt{\frac{n}{2}}\,(1+2{\mathrm{i}}\varepsilon t)\,\psi_{n-1}(x,t)+\sqrt{\frac{n+1}{2}}\,(1-2{\mathrm{i}}\varepsilon t)\,\psi_{n+1}(x,t). (4.5)

This three-term recurrence allows us to evaluate finite expansions in this basis in a stable manner using Clenshaw’s algorithm (?).

4.2 Malmquist–Takenaka functions

The Malmquist–Takenaka system is a complex-valued rational basis of L2()\mathrm{L}_{2}(\mbox{\Bbb R}), introduced independently by Malmquist and Takenaka and repeatedly rediscovered: we refer to (?) for its brief history. It is instructive to introduce it within the narrative of Subsection 3.1, while extending it to complex-valued bases. The starting point is the Laguerre measure eξdξ{\mathrm{e}}^{-\xi}\,\mathrm{d}\xi, ξ0\xi\geq 0. We can use (3.2) to generate an orthonormal system on the real line, but this system is not dense in L2()\mathrm{L}_{2}(\mbox{\Bbb R}): it is a basis of 𝒫𝒲[0,)()\mathcal{PW}_{[0,\infty)}(\mbox{\Bbb R}), the space of fL2()f\in\mathrm{L}_{2}(\mbox{\Bbb R}) whose Fourier transform is supported inside [0,)[0,\infty). To recover the orthogonal complement of 𝒫𝒲[0,)()\mathcal{PW}_{[0,\infty)}(\mbox{\Bbb R}) in L2()\mathrm{L}_{2}(\mbox{\Bbb R}), namely 𝒫𝒲(,0]()\mathcal{PW}_{(-\infty,0]}(\mbox{\Bbb R}), thereby ensuring that the system is dense in L2()\mathrm{L}_{2}(\mbox{\Bbb R}), we complement it by the orthonormal system generated by the measure eξdξ{\mathrm{e}}^{\xi}\,\mathrm{d}\xi on ξ(,0]\xi\in(-\infty,0] which, conveniently, we label by φn\varphi_{n}, n1n\leq-1. The outcome, the MT system, is

φn(x)=2πin(1+2ix)n(12ix)n+1,n,\varphi_{n}(x)=\sqrt{\frac{2}{\pi}}{\mathrm{i}}^{n}\frac{(1+2{\mathrm{i}}x)^{n}}{(1-2{\mathrm{i}}x)^{n+1}},\qquad n\in\mbox{\Bbb Z}, (4.6)

(?). The MT system has a number of elegant features:

φn\displaystyle\varphi_{n}^{\prime} =\displaystyle= nφn1+i(2n+1)φn+(n+1)φn+1,n,\displaystyle-n\varphi_{n-1}+{\mathrm{i}}(2n+1)\varphi_{n}+(n+1)\varphi_{n+1},\qquad n\in\mbox{\Bbb Z},
|φn(x)|\displaystyle|\varphi_{n}(x)| \displaystyle\leq 2π1(1+4x2)1/2,x,\displaystyle\sqrt{\frac{2}{\pi}}\frac{1}{(1+4x^{2})^{1/2}},\qquad x\in\mbox{\Bbb R},
φmφn\displaystyle\varphi_{m}\varphi_{n} =\displaystyle= 12π(φm+niφm+n+1),m,n,\displaystyle\frac{1}{\sqrt{2\pi}}(\varphi_{m+n}-{\mathrm{i}}\varphi_{m+n+1}),\qquad m,n\in\mbox{\Bbb Z},
2xφn\displaystyle 2x\varphi_{n}^{\prime} =\displaystyle= inφn1φni(n+1)φn+1,\displaystyle-{\mathrm{i}}n\varphi_{n-1}-\varphi_{n}-{\mathrm{i}}(n+1)\varphi_{n+1},
φn+1(x)\displaystyle\varphi_{n+1}(x) =\displaystyle= i(1+2ix12ix)φn(x),\displaystyle{\mathrm{i}}\left(\frac{1+2{\mathrm{i}}x}{1-2{\mathrm{i}}x}\right)\varphi_{n}(x),
φ1n(x)\displaystyle\varphi_{-1-n}(x) =\displaystyle= i2n1φn(x).\displaystyle{\mathrm{i}}^{2n-1}\varphi_{n}(-x).

– which make its implementation as a spectral basis considerably easier. However, the most valuable feature of the MT system is that, subject to the change of variables x=12tan(θ/2)x=\frac{1}{2}\tan(\theta/2), we have

\hat{f}_{n}=\int_{-\infty}^{\infty}f(x)\overline{\varphi_{n}(x)}\,\mathrm{d}x=\frac{(-{\mathrm{i}})^{n}}{2\sqrt{2\pi}}\int_{-\pi}^{\pi}\!\left(1-{\mathrm{i}}\tan\frac{\theta}{2}\right)\!f\!\left(\frac{1}{2}\tan\frac{\theta}{2}\right)\!{\mathrm{e}}^{-{\mathrm{i}}n\theta}\,\mathrm{d}\theta,\qquad n\in\mbox{\Bbb Z}. (4.7)

In other words, the computation of expansion coefficients is equivalent to the evaluation of standard Fourier coefficients of a modified function, a task that can be accomplished (for sufficiently smooth functions) to very high accuracy using the Fast Fourier Transform.
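The following Python sketch implements this coefficient computation (a plain 𝒪(MK){\cal O}\!\left(MK\right) sum is used for clarity; replacing it with an FFT gives the fast transform described above). The normalisation constant is fixed by orthonormality, i.e. by requiring f^m=1\hat{f}_{m}=1 when f=φmf=\varphi_{m}, and the assertions below check exactly that:

```python
import numpy as np

def mt(n, x):
    """Malmquist-Takenaka function phi_n of (4.6); n may be negative."""
    return np.sqrt(2 / np.pi) * 1j ** n * (1 + 2j * x) ** n / (1 - 2j * x) ** (n + 1)

def mt_coefficients(f, K, M=256):
    """Coefficients hat f_n, |n| <= K, via the substitution x = tan(theta/2)/2.

    Midpoint nodes avoid theta = +-pi; the constant in front of the sum is the
    one that makes hat f_m = 1 for f = phi_m (orthonormality)."""
    theta = -np.pi + (np.arange(M) + 0.5) * 2 * np.pi / M
    g = (1 - 1j * np.tan(theta / 2)) * f(0.5 * np.tan(theta / 2))
    ns = np.arange(-K, K + 1)
    quad = np.exp(-1j * np.outer(ns, theta)) @ g * (2 * np.pi / M)
    return ns, (-1j) ** ns / (2 * np.sqrt(2 * np.pi)) * quad

# under the substitution, phi_m becomes a pure exponential in theta, so the
# discrete transform recovers the coefficients of this f exactly
f = lambda x: mt(1, x) + 0.5 * mt(-2, x)
ns, fhat = mt_coefficients(f, 4)
```

Since each φm\varphi_{m} transforms into a single Fourier mode in θ\theta, the midpoint rule is exact here up to aliasing, which is the mechanism exploited by the FFT-based fast transform.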

We note in passing that this feature – the computation of the first NN expansion coefficients in 𝒪(NlogN){\cal O}\!\left(N\log N\right) operations – is highly unusual in the setting of Section 3.1: it can be accomplished only for the MT basis (or its minor generalisation) using FFT and for four other ‘tanh-Chebyshev’ bases using Fast Cosine (or Sine) Transform (?).

Let us now investigate the FSEs Ψ(t)\Psi(t). For simplicity we consider this only for n+n\in\mbox{\Bbb Z}_{+}, noting that an extension to n1n\leq-1 is straightforward by the symmetry: ψ1n(x,t)=i2n1ψn(x,t)\psi_{-1-n}(x,t)={\mathrm{i}}^{2n-1}\psi_{n}(-x,t). As before, we assume for the time being that ε=1\varepsilon=1. Using (3.2) we have

ψn(x,t)=(i)n2π0Ln(ξ)exp(ξ2+itξ2+ixξ)dξ,n+,\psi_{n}(x,t)=\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{0}^{\infty}\mathrm{L}_{n}(\xi)\exp\!\left(-\tfrac{\xi}{2}+{\mathrm{i}}t\xi^{2}+{\mathrm{i}}x\xi\right)\!\,\mathrm{d}\xi,\qquad n\in\mbox{\Bbb Z}_{+}, (4.8)

where Ln\mathrm{L}_{n} is the nnth Laguerre polynomial. We can remove the oscillation from exp(ixξ)\exp({\mathrm{i}}x\xi) by deforming the contour of integration to the line {z:Im(z)=2xRe(z)}\{z\in\mbox{\Bbb C}:\mathrm{Im}(z)=2x\mathrm{Re}(z)\} (technically by integrating over the boundary of a sector of radius RR, where the contribution from the arc decays exponentially in RR), which yields the formula

ψn(x,t)=2π(i)n12ix0Ln(2s12ix)exp(4its2(12ix)2)esds.\psi_{n}(x,t)=\sqrt{\frac{2}{\pi}}\frac{(-{\mathrm{i}})^{n}}{1-2{\mathrm{i}}x}\int_{0}^{\infty}\mathrm{L}_{n}\left(\frac{2s}{1-2{\mathrm{i}}x}\right)\exp\left(\frac{4{\mathrm{i}}ts^{2}}{(1-2{\mathrm{i}}x)^{2}}\right){\mathrm{e}}^{-s}\mathrm{d}s. (4.9)

For small values of tt, this integral is not particularly oscillatory and, thanks to the factor es{\mathrm{e}}^{-s}, it converges rapidly. It is possible to produce an explicit expression for any specific value of nn, e.g.

ψ0(x,t)\displaystyle\psi_{0}(x,t) =\displaystyle= i8texp((2x+i)216it)erfc((2x+i)16it),\displaystyle\sqrt{\frac{{\mathrm{i}}}{8t}}\exp\!\left(\frac{(2x+{\mathrm{i}})^{2}}{16{\mathrm{i}}t}\right)\mathrm{erfc}\!\left(\frac{(2x+{\mathrm{i}})}{\sqrt{16{\mathrm{i}}t}}\right),
ψ1(x,t)\displaystyle\psi_{1}(x,t) =\displaystyle= iψ0(x,t)+(12ix)ψ0(x,t)ψ0(x,0)4t.\displaystyle-{\mathrm{i}}\psi_{0}(x,t)+\left(1-2{\mathrm{i}}x\right)\frac{\psi_{0}(x,t)-\psi_{0}(x,0)}{4t}.

There is no need to fear the power of tt in the denominator, which cancels as t0t\rightarrow 0. We discuss handling this removable singularity in the next subsection.
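For the evaluation of ψ0\psi_{0} it is convenient to use the Faddeeva function: writing a=(2x+i)/16ta=(2x+{\mathrm{i}})/\sqrt{16{\mathrm{i}}t}, we have ea2erfc(a)=w(ia){\mathrm{e}}^{a^{2}}\mathrm{erfc}(a)=w({\mathrm{i}}a), which avoids multiplying an overflowing exponential by a tiny complementary error function. A Python sketch (with ε=1\varepsilon=1), validated against direct quadrature of (4.8) with n=0n=0:

```python
import numpy as np
from scipy.special import wofz   # Faddeeva function w(z) = exp(-z^2) erfc(-iz)

def psi0(x, t):
    """psi_0(x, t) for eps = 1: with a = (2x + i)/sqrt(16 i t),
    sqrt(i/(8t)) e^{a^2} erfc(a) = sqrt(i/(8t)) w(i a)."""
    a = (2 * x + 1j) / np.sqrt(16j * t)
    return np.sqrt(1j / (8 * t)) * wofz(1j * a)

def psi0_quadrature(x, t, L=80.0, M=200000):
    """Trapezoidal evaluation of the integral (4.8) with n = 0, eps = 1."""
    xi = np.linspace(0.0, L, M)
    integrand = np.exp(-xi / 2 + 1j * t * xi ** 2 + 1j * x * xi)
    val = (integrand.sum() - 0.5 * (integrand[0] + integrand[-1])) * (xi[1] - xi[0])
    return val / np.sqrt(2 * np.pi)
```

As t0+t\to 0^{+} the large-argument asymptotics of ww recover φ0(x)\varphi_{0}(x), in line with the removable singularity discussed above.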

Fig. 4.2 displays |ψn||\psi_{n}|, n=0,,3n=0,\ldots,3, for the MT functions in a setting identical to Fig. 4.1. Note that the magnitude for small t>0t>0 varies much more violently for x>0x>0 – obviously, this is reversed for n1n\leq-1 – and that, as for the Hermite FSE, the magnitude tends to an increasingly regular profile as tt grows.

4.3 A four-term recurrence for the Malmquist–Takenaka FSE basis

While a closed-form expression for the ψn\psi_{n}s is complicated, and may well not exist, we can derive a useful recurrence formula. We begin from the following differential–difference relation for the Laguerre polynomials (which follows by differentiating (?, 18.17.1)):

Ln(ξ)=Ln(ξ)Ln+1(ξ).\mathrm{L}_{n}(\xi)=\mathrm{L}_{n}^{\prime}(\xi)-\mathrm{L}_{n+1}^{\prime}(\xi). (4.10)

From this it follows immediately that,

ψn(x,t)=(i)n2π0(Ln(ξ)Ln+1(ξ))exp(ξ2+itξ2+ixξ)dξ.\psi_{n}(x,t)=\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{0}^{\infty}\left(\mathrm{L}_{n}^{\prime}(\xi)-\mathrm{L}_{n+1}^{\prime}(\xi)\right)\exp\!\left(-\tfrac{\xi}{2}+{\mathrm{i}}t\xi^{2}+{\mathrm{i}}x\xi\right)\!\,\mathrm{d}\xi. (4.11)

Integrating by parts, noting that Ln(0)=1=Ln+1(0)\mathrm{L}_{n}(0)=1=\mathrm{L}_{n+1}(0) so the boundary terms vanish,

ψn(x,t)=(i)n2π0(Ln+1(ξ)Ln(ξ))(2itξ+ix12)exp(ξ2+itξ2+ixξ)dξ.\psi_{n}(x,t)=\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{0}^{\infty}\left(\mathrm{L}_{n+1}(\xi)-\mathrm{L}_{n}(\xi)\right)(2{\mathrm{i}}t\xi+{\mathrm{i}}x-\tfrac{1}{2})\exp\!\left(-\tfrac{\xi}{2}+{\mathrm{i}}t\xi^{2}+{\mathrm{i}}x\xi\right)\!\,\mathrm{d}\xi. (4.12)

We can then use the three-term recurrence,

(n+1)Ln+1(ξ)=(2n+1ξ)Ln(ξ)nLn1(ξ),(n+1)\mathrm{L}_{n+1}(\xi)=(2n+1-\xi)\mathrm{L}_{n}(\xi)-n\mathrm{L}_{n-1}(\xi), (4.13)

to obtain,

ψn(x,t)\displaystyle\psi_{n}(x,t) =\displaystyle= (i)n2π0[2it((2n+3)Ln+1(n+1)Ln(n+2)Ln+2)\displaystyle\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{0}^{\infty}\bigg{[}2{\mathrm{i}}t\left((2n+3)\mathrm{L}_{n+1}-(n+1)\mathrm{L}_{n}-(n+2)\mathrm{L}_{n+2}\right)
2it((2n+1)LnnLn1(n+1)Ln+1)\displaystyle\qquad\qquad-2{\mathrm{i}}t\left((2n+1)\mathrm{L}_{n}-n\mathrm{L}_{n-1}-(n+1)\mathrm{L}_{n+1}\right)
+(ix12)(Ln+1Ln)]exp(ξ2+itξ2+ixξ)dξ\displaystyle\qquad\qquad+({\mathrm{i}}x-\tfrac{1}{2})(\mathrm{L}_{n+1}-\mathrm{L}_{n})\bigg{]}\exp\!\left(-\tfrac{\xi}{2}+{\mathrm{i}}t\xi^{2}+{\mathrm{i}}x\xi\right)\!\,\mathrm{d}\xi
=\displaystyle= (i)n2π0[2it(n+2)Ln+2+(2it(3n+4)+ix12)Ln+1\displaystyle\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{0}^{\infty}\bigg{[}-2{\mathrm{i}}t(n+2)\mathrm{L}_{n+2}+(2{\mathrm{i}}t(3n+4)+{\mathrm{i}}x-\tfrac{1}{2})\mathrm{L}_{n+1}
(2it(3n+2)+ix12)Ln+2itnLn1)]exp(ξ2+itξ2+ixξ)dξ\displaystyle-(2{\mathrm{i}}t(3n+2)+{\mathrm{i}}x-\tfrac{1}{2})\mathrm{L}_{n}+2{\mathrm{i}}tn\mathrm{L}_{n-1})\bigg{]}\exp\!\left(-\tfrac{\xi}{2}+{\mathrm{i}}t\xi^{2}+{\mathrm{i}}x\xi\right)\!\,\mathrm{d}\xi
=\displaystyle= (i)n2π0[(i)22it(n+2)Ln+2(i)(2t(3n+4)+x+12i)Ln+1\displaystyle\frac{(-{\mathrm{i}})^{n}}{\sqrt{2\pi}}\int_{0}^{\infty}\bigg{[}(-{\mathrm{i}})^{2}2{\mathrm{i}}t(n+2)\mathrm{L}_{n+2}-(-{\mathrm{i}})(2t(3n+4)+x+\tfrac{1}{2}{\mathrm{i}})\mathrm{L}_{n+1}
(2it(3n+2)+ix12)Ln+(i)12tnLn1)]exp(ξ2+itξ2+ixξ)dξ\displaystyle-(2{\mathrm{i}}t(3n+2)+{\mathrm{i}}x-\tfrac{1}{2})\mathrm{L}_{n}+(-{\mathrm{i}})^{-1}2tn\mathrm{L}_{n-1})\bigg{]}\exp\!\left(-\tfrac{\xi}{2}+{\mathrm{i}}t\xi^{2}+{\mathrm{i}}x\xi\right)\!\,\mathrm{d}\xi
=\displaystyle= 2it(n+2)ψn+2(x,t)(2t(3n+4)+x+12i)ψn+1(x,t)\displaystyle 2{\mathrm{i}}t(n+2)\psi_{n+2}(x,t)-(2t(3n+4)+x+\tfrac{1}{2}{\mathrm{i}})\psi_{n+1}(x,t)
(2it(3n+2)+ix12)ψn(x,t)+2tnψn1(x,t).\displaystyle-(2{\mathrm{i}}t(3n+2)+{\mathrm{i}}x-\tfrac{1}{2})\psi_{n}(x,t)+2tn\psi_{n-1}(x,t).

Collecting terms yields,

2it(n+2)ψn+2=(2t(3n+4)+x+12i)ψn+1+(2it(3n+2)+ix+12)ψn2tnψn1.\displaystyle 2{\mathrm{i}}t(n+2)\psi_{n+2}=(2t(3n+4)+x+\tfrac{1}{2}{\mathrm{i}})\psi_{n+1}+(2{\mathrm{i}}t(3n+2)+{\mathrm{i}}x+\tfrac{1}{2})\psi_{n}-2tn\psi_{n-1}.

We now undo the assignment ε=1\varepsilon=1 to obtain the following lemma.

Lemma 7

The FSE corresponding to the MT system obeys the recurrence for n1n\geq 1,

ψ0(x,t)\displaystyle\psi_{0}(x,t) =\displaystyle= i8εtexp((2x+i)216iεt)erfc((2x+i)16iεt),\displaystyle\sqrt{\frac{{\mathrm{i}}}{8\varepsilon t}}\exp\!\left(\frac{(2x+{\mathrm{i}})^{2}}{16{\mathrm{i}}\varepsilon t}\right)\mathrm{erfc}\!\left(\frac{(2x+{\mathrm{i}})}{\sqrt{16{\mathrm{i}}\varepsilon t}}\right),
ψ1(x,t)\displaystyle\psi_{1}(x,t) =\displaystyle= iψ0+(12ix)ψ0(x,t)ψ0(x,0)4εt\displaystyle-{\mathrm{i}}\psi_{0}+\left(1-2{\mathrm{i}}x\right)\frac{\psi_{0}(x,t)-\psi_{0}(x,0)}{4\varepsilon t}
i(n+1)ψn+1\displaystyle{\mathrm{i}}(n+1)\psi_{n+1} =\displaystyle= (3n+1+2x+i4εt)ψn+i(3n1+2xi4εt)ψn1(n1)ψn2.\displaystyle\left(3n+1+\frac{2x+{\mathrm{i}}}{4\varepsilon t}\right)\psi_{n}+{\mathrm{i}}\left(3n-1+\frac{2x-{\mathrm{i}}}{4\varepsilon t}\right)\psi_{n-1}-(n-1)\psi_{n-2}.
Refer to caption
Figure 4.2: The Malmquist–Takenaka FSE: the functions |ψn(x,t)||\psi_{n}(x,t)| for n=0,,3n=0,\ldots,3, x[20,20]x\in[-20,20] and t[0,4]t\in[0,4]

Lemma 7 indicates the possibility of computing an expansion in the Malmquist–Takenaka FSE basis using the (generalized) Clenshaw algorithm (?). The functions ψn\psi_{n} for n1n\leq-1 can be addressed using the symmetry ψ1n(x,t)=i2n1ψn(x,t)\psi_{-1-n}(x,t)={\mathrm{i}}^{2n-1}\psi_{n}(-x,t), the details of which we omit. Clenshaw’s algorithm is best known for bases satisfying three-term recurrences; in the case of a two-term recurrence it reduces to Horner’s algorithm. The following lemma spells out the Clenshaw algorithm for a basis with a four-term recurrence (such as the Malmquist–Takenaka FSE).

Lemma 8

Let Φ={φn}n=0\Phi=\{\varphi_{n}\}_{n=0}^{\infty} be a basis which satisfies the four-term recurrence,

φn+1(x)=An(x)φn(x)+Bn(x)φn1(x)+Cn(x)φn2(x),\varphi_{n+1}(x)=A_{n}(x)\varphi_{n}(x)+B_{n}(x)\varphi_{n-1}(x)+C_{n}(x)\varphi_{n-2}(x), (4.14)

for n1n\geq 1, where C1(x)=0C_{1}(x)=0, then the finite expansion,

f(x)=n=0Nanφn(x),f(x)=\sum_{n=0}^{N}a_{n}\varphi_{n}(x),

is equal to

v0(x)φ0(x)+v1(x)φ1(x),v_{0}(x)\varphi_{0}(x)+v_{1}(x)\varphi_{1}(x),

where 𝐯(x)=(v0(x),v1(x),,vN(x))\mbox{\boldmath$v$\unboldmath}(x)=(v_{0}(x),v_{1}(x),\ldots,v_{N}(x))^{\top} satisfies the backwards recurrence,

vN(x)\displaystyle v_{N}(x) =\displaystyle\!\!\!=\!\!\! aN\displaystyle a_{N}
vN1(x)\displaystyle v_{N-1}(x) =\displaystyle\!\!\!=\!\!\! aN1+AN1(x)vN(x)\displaystyle a_{N-1}+A_{N-1}(x)v_{N}(x)
vN2(x)\displaystyle v_{N-2}(x) =\displaystyle\!\!\!=\!\!\! aN2+AN2(x)vN1(x)+BN1(x)vN(x)\displaystyle a_{N-2}+A_{N-2}(x)v_{N-1}(x)+B_{N-1}(x)v_{N}(x)
vn(x)\displaystyle v_{n}(x) =\displaystyle\!\!\!=\!\!\! an+An(x)vn+1(x)+Bn+1(x)vn+2(x)+Cn+2(x)vn+3(x),\displaystyle a_{n}+A_{n}(x)v_{n+1}(x)+B_{n+1}(x)v_{n+2}(x)+C_{n+2}(x)v_{n+3}(x),

for n=N3,N4,,0n=N-3,N-4,\ldots,0, where we set A0=0A_{0}=0.

Proof We follow the derivation of Clenshaw’s algorithm in (?), but with an extra band below the diagonal in the associated linear system. Indeed, the vector 𝝋(x)=(φ0(x),,φN(x))T\boldsymbol{\varphi}(x)=(\varphi_{0}(x),\ldots,\varphi_{N}(x))^{T} satisfies

(1A01B1A11C2B2A210C3B3A310CN1BN1AN11)(φ0(x)φ1(x)φ2(x)φ3(x)φ4(x)φN(x))=(φ0(x)φ1(x)0000),\begin{pmatrix}1&&&&&&\\ -A_{0}&1&&&&&\\ -B_{1}&-A_{1}&1&&&&\\ -C_{2}&-B_{2}&-A_{2}&1&&&\\ 0&-C_{3}&-B_{3}&-A_{3}&1&&\\ &\ddots&\ddots&\ddots&\ddots&\ddots&\\ &&0&-C_{N-1}&-B_{N-1}&-A_{N-1}&1\end{pmatrix}\begin{pmatrix}\varphi_{0}(x)\\ \varphi_{1}(x)\\ \varphi_{2}(x)\\ \varphi_{3}(x)\\ \varphi_{4}(x)\\ \vdots\\ \varphi_{N}(x)\end{pmatrix}=\begin{pmatrix}\varphi_{0}(x)\\ \varphi_{1}(x)\\ 0\\ 0\\ 0\\ \vdots\\ 0\end{pmatrix},

since A0=0A_{0}=0. Let us write this as L(x)𝝋(x)=𝝆(x)L(x)\boldsymbol{\varphi}(x)=\boldsymbol{\rho}(x). Clearly L(x)L(x) is invertible, so

f(x)=𝒂T𝝋(x)=𝒂TL(x)1𝝆(x)=(L(x)T𝒂)T𝝆(x).f(x)=\boldsymbol{a}^{T}\boldsymbol{\varphi}(x)=\boldsymbol{a}^{T}L(x)^{-1}\boldsymbol{\rho}(x)=\left(L(x)^{-T}\boldsymbol{a}\right)^{T}\boldsymbol{\rho}(x).

The result is proved by noting that the backward recurrence for 𝒗(x)\boldsymbol{v}(x) merely computes 𝒗(x)=L(x)T𝒂\boldsymbol{v}(x)=L(x)^{-T}\boldsymbol{a} by back substitution.        \Box
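As a sanity check on Lemma 8, the backward recurrence is straightforward to implement. The sketch below evaluates a finite expansion at a fixed point; the coefficient functions `A`, `B`, `C` used in the usage example are illustrative placeholders, not those of the Malmquist–Takenaka FSE basis.

```python
def clenshaw4(a, A, B, C, phi0, phi1):
    """Evaluate sum_{n=0}^N a_n * phi_n(x) for a basis with the four-term
    recurrence phi_{n+1} = A_n phi_n + B_n phi_{n-1} + C_n phi_{n-2} (n >= 1),
    given phi0 = phi_0(x) and phi1 = phi_1(x) at the evaluation point.
    A, B, C are callables returning the recurrence coefficients at x."""
    N = len(a) - 1
    v = [0.0] * (N + 4)                # v[N+1] = v[N+2] = v[N+3] = 0
    for n in range(N, -1, -1):
        An = 0.0 if n == 0 else A(n)   # Lemma 8 sets A_0 = 0
        v[n] = a[n] + An * v[n + 1] + B(n + 1) * v[n + 2] + C(n + 2) * v[n + 3]
    return v[0] * phi0 + v[1] * phi1
```

The result can be checked against direct summation, generating $\varphi_2,\ldots,\varphi_N$ by the forward recurrence (4.14).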

In order to evaluate ψ0\psi_{0} without trouble from the removable singularity, we rewrite equation (4.2) in the form

ψ0(x,t)=φ0(x)G0(2ix116iεt),\psi_{0}(x,t)=\varphi_{0}(x)G_{0}\left(\frac{2{\mathrm{i}}x-1}{\sqrt{16{\mathrm{i}}\varepsilon t}}\right), (4.15)

where $G_0(z)=-{\mathrm{i}}\sqrt{\pi}\,z\,{\mathrm{e}}^{-z^2}\mathrm{erfc}(-{\mathrm{i}}z)$. This function is related to $w(z)={\mathrm{e}}^{-z^2}\mathrm{erfc}(-{\mathrm{i}}z)$, known as the Faddeeva function or plasma dispersion function (??). Note that $x\in\mbox{\Bbb R}$ and $t>0$ correspond to evaluating $G_0$ in the sector $\{z\in\mbox{\Bbb C}:\arg(z)\in(\pi/4,5\pi/4)\}$ of the complex plane; we are particularly interested in small positive $t$, which corresponds to large $z$ in this sector. The fact that $G_0(z)\to1$ as $|z|\to\infty$ within this sector shows the recovery of $\varphi_0(x)$ as $t\to0$.

Following (??), $G_0$ admits a continued fraction expansion at $z=\infty$ which is convergent in the upper half-plane (?, 7.9.3),

G0(z)=1112z21z2132z212z21.G_{0}(z)=\cfrac{1}{1-\cfrac{\tfrac{1}{2}z^{-2}}{1-\cfrac{z^{-2}}{1-\cfrac{\tfrac{3}{2}z^{-2}}{1-\cfrac{2z^{-2}}{1-\cdots}}}}}. (4.16)

Truncating this continued fraction yields an extremely good approximation for large zz in the upper half-plane, and for the lower half-plane we can use the relation (?, 7.4.3)

G0(z)=G0(z)2iπzez2,G_{0}(z)=G_{0}(-z)-2{\mathrm{i}}\sqrt{\pi}z{\mathrm{e}}^{-z^{2}}, (4.17)

but note that accuracy can be lost near the complex roots of $\mathrm{erfc}(-{\mathrm{i}}z)$, since this relation relies on heavy cancellation (??).
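Evaluating the truncated continued fraction (4.16) bottom-up is a one-line loop. The sketch below is a minimal illustration; the truncation depth `K` is a tunable assumption, and the partial numerators are $\tfrac{k}{2}z^{-2}$, $k=1,2,3,\ldots$.

```python
def G0_cf(z, K=30):
    """Evaluate G0(z) from the continued fraction (4.16), truncated after K
    partial numerators (k/2) * z**-2, k = 1..K, and evaluated bottom-up.
    Intended for large z in the upper half-plane."""
    w = 1.0
    for k in range(K, 0, -1):
        w = 1.0 - (0.5 * k) * z**-2 / w
    return 1.0 / w
```

For large $z$ the result agrees with the asymptotic expansion $G_0(z)\sim 1+\tfrac12 z^{-2}+\tfrac34 z^{-4}+\cdots$, and in particular $G_0(z)\to1$ as $|z|\to\infty$.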

In order to evaluate ψ1\psi_{1} without trouble from the removable singularity, we rewrite the formula in Lemma 7 in the form

ψ1(x,t)=iψ0(x,t)+2π2i(12ix)2G1(2ix116iεt),\psi_{1}(x,t)=-{\mathrm{i}}\psi_{0}(x,t)+\sqrt{\frac{2}{\pi}}\frac{2{\mathrm{i}}}{(1-2{\mathrm{i}}x)^{2}}G_{1}\left(\frac{2{\mathrm{i}}x-1}{\sqrt{16{\mathrm{i}}\varepsilon t}}\right), (4.18)

where G1(z)=2z2(G0(z)1)G_{1}(z)=2z^{2}(G_{0}(z)-1). While this covers the evaluation of ψ0(x,t)\psi_{0}(x,t) and ψ1(x,t)\psi_{1}(x,t) for small tt, the full implementation of Clenshaw’s algorithm may still experience loss of numerical accuracy due to the 1/t1/t terms in the recurrence relation. However, numerical issues like this are beyond the scope of this paper.

5 Bringing the elements together

We bring together the different results of the paper into a cohesive whole. In Section 2, we reduced the problem of solving the semiclassical Schrödinger equation to combining time-steps of the form,

uk+1(x)=euk(x),u^{k+1}(x)={\mathrm{e}}^{\mathcal{R}_{\ell}}u^{k}(x),

where

0\displaystyle\mathcal{R}_{0} =\displaystyle= 12τε1V,\displaystyle-\frac{1}{2}\tau\varepsilon^{-1}V,
1\displaystyle\mathcal{R}_{1} =\displaystyle= 12τεx2,\displaystyle\frac{1}{2}\tau\varepsilon\partial_{x}^{2},
2\displaystyle\mathcal{R}_{2} =\displaystyle= 112τ3ε{x2[V(2)]+V(2)x2}+124τ3ε1(V(1))2,\displaystyle\color[rgb]{0,0,0}\frac{1}{12}\tau^{3}\varepsilon\left\{\partial_{x}^{2}[V^{(2)}\,\cdot\,]+V^{(2)}\partial_{x}^{2}\right\}+\frac{1}{24}\tau^{3}\varepsilon^{-1}(V^{(1)})^{2},\ldots

Here $\tau={\mathrm{i}}h$ and $\mathcal{R}_\ell={\cal O}\!\left(h^{2\ell-1}\varepsilon^{-1}\right)$ for $\ell=1,2,\ldots$. We propose that the numerical solution be represented implicitly by

uk(x)=n=0Nu^nφn(x),u^{k}(x)=\sum_{n=0}^{N}\hat{u}_{n}\varphi_{n}(x),

where $\varphi_n$ is either the Hermite function basis or the Malmquist–Takenaka basis (in the latter case the index runs from $n=-N-1$ to $n=N$). In practice, however, we propose that the numerical solution be represented explicitly by its values on a grid appropriate to the basis: for Hermite functions these are the Hermite quadrature points, and for Malmquist–Takenaka functions they are mapped equispaced points (?),

xj[N]=12tan(θj[N]/2),j=N1,N,x_{j}^{[N]}=\tfrac{1}{2}\tan\left(\theta_{j}^{[N]}/2\right),\qquad j=-N-1,\ldots N, (5.19)
θj[N]=jπN+1,j=N1,N.\theta_{j}^{[N]}=\frac{j\pi}{N+1},\qquad j=-N-1,\ldots N. (5.20)

We call these Malmquist–Takenaka points or MT points.
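The grid (5.19)–(5.20) is a Möbius-mapped equispaced grid: the angles $\theta_j$ are equispaced in $[-\pi,\pi)$ and $x=\tfrac12\tan(\theta/2)$ maps them to the real line. The sketch below generates the finite nodes $j=-N,\ldots,N$; the remaining index $j=-N-1$ gives $\theta=-\pi$, which maps to the point at infinity under the Möbius transformation, so it is excluded here as an implementation choice.

```python
import numpy as np

def mt_points(N):
    """Malmquist-Takenaka (MT) points x_j = tan(theta_j/2)/2 with
    theta_j = j*pi/(N+1), for the finite nodes j = -N..N.  The node
    j = -N-1 (theta = -pi) maps to infinity and is omitted."""
    j = np.arange(-N, N + 1)
    theta = j * np.pi / (N + 1)
    return 0.5 * np.tan(theta / 2)
```

The resulting grid is antisymmetric about the origin, clustering near $x=0$ and spreading out towards $\pm\infty$.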

The reasons for these choices of grid points are threefold. First, the mapping from the values of a finite expansion at these specific grid points, weighted appropriately, to the coefficients in the finite expansion is unitary, hence invertible and perfectly stable. Second, there are known algorithms for this mapping and its inverse which, in the case of Malmquist–Takenaka, can be performed rapidly by the Fast Fourier Transform (FFT) and its inverse. Third, at the end of a full time step we have the solution given by its values on this grid; the solution can then be evaluated stably at arbitrary points on the real line by the barycentric interpolation formula. The barycentric weights for Hermite quadrature points and for equispaced points on the unit circle (which map to MT points) are known explicitly (??).

When our solution is represented by values at the grid points, the case $\ell=0$ is straightforward: we simply multiply the function value at grid point $x_k^{[N]}$ by $\exp(-\tfrac12\tau\varepsilon^{-1}V(x_k^{[N]}))$.

The case $\ell=1$ is more subtle, and we propose using the free Schrödinger evolutions developed in Section 3. We first compute the coefficients in the $\Phi$ basis, and then evaluate the linear combination of those coefficients with the free Schrödinger evolution $\Psi(\tfrac12 h\varepsilon)$ at the grid points. This is a two-step process, as follows.

  • Compute the coefficients $a_0,a_1,\ldots,a_N$ in the $\Phi$ basis (indexed from $-N-1$ to $N$ in the case of the MT basis) from the values on the grid, using the FFT in the case of the MT basis.

  • Evaluate the sum $\sum_{k=0}^N a_k\psi_k(x,\tfrac12 h\varepsilon)$ at the grid points using Clenshaw’s algorithm (for the MT basis, the four-term version of Lemma 8).
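On the real line these two steps require the bases of Sections 3–4, but the structure is the same as in the familiar periodic setting, where $\Phi$ is the Fourier basis, the transform is the FFT, and the free Schrödinger evolution acts diagonally on the coefficients. The toy torus sketch below illustrates only that structure; on $\mbox{\Bbb R}$ the diagonal phase is replaced by evaluating $\sum_k a_k\psi_k(x,\tfrac12 h\varepsilon)$ via Clenshaw's algorithm.

```python
import numpy as np

def free_schrodinger_step_torus(u, eps, t):
    """Toy analogue on the 2*pi-periodic torus of the two-step procedure:
    (1) grid values -> coefficients, (2) apply the free evolution and return
    to grid values.  For the mode e^{ikx}, i*eps*u_t = -eps^2*u_xx gives
    the diagonal phase exp(-1j*eps*t*k**2)."""
    k = np.fft.fftfreq(len(u), d=1.0 / len(u))  # integer wavenumbers
    c = np.fft.fft(u)                           # step 1: values -> coefficients
    c *= np.exp(-1j * eps * t * k**2)           # free evolution (diagonal here)
    return np.fft.ifft(c)                       # step 2: coefficients -> values
```

A single Fourier mode evolves by an exact phase, which makes the sketch easy to verify, and stepping by $-t$ undoes the evolution, reflecting the unitarity of the free flow.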

In the case $\ell\geq2$ we propose the use of Krylov subspace methods. This approach was first proposed in (?) and later generalised to time-dependent potentials (??), as well as to the quasi-Magnus exponential integrators of (?). Two facts make this approach work well. First, $\mathcal{R}_\ell={\cal O}\!\left(h^{2\ell-1}\varepsilon^{-1}\right)$ for $\ell>1$, so we are computing the exponential of a matrix which is small in spectral norm; as a result, a Krylov subspace of minuscule dimension can be used (?). Second, the sparse differentiation matrix (see (3.1)) implies that the matrices which must be applied to a vector in the Krylov subspace method are a sum of compositions of: diagonal matrices coming from derivatives of the potential function $V$, pentadiagonal matrices coming from the discretisation of $\partial_x^2$ in coefficient space, and transforms between function values on the grid and coefficients (which can be performed using the FFT in the case of the MT basis).
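A minimal Arnoldi sketch of this idea is given below, assuming only a routine `A_mul` that applies the (small-norm) matrix to a vector. Because the projected Hessenberg matrix inherits the small norm, its exponential can be taken by a plain truncated Taylor series; this is an illustrative sketch, not the production algorithm of the references above.

```python
import numpy as np

def expm_taylor(H, terms=30):
    """exp(H) by truncated Taylor series; adequate here since ||H|| is small."""
    E = np.eye(H.shape[0], dtype=complex)
    T = np.eye(H.shape[0], dtype=complex)
    for k in range(1, terms):
        T = T @ H / k
        E = E + T
    return E

def krylov_expv(A_mul, v, m=12):
    """Approximate exp(A) @ v from an m-dimensional Krylov subspace,
    where A_mul(w) returns A @ w.  Standard Arnoldi iteration."""
    n = v.shape[0]
    V = np.zeros((n, m + 1), dtype=complex)
    H = np.zeros((m + 1, m), dtype=complex)
    beta = np.linalg.norm(v)
    V[:, 0] = v / beta
    k = m
    for j in range(m):
        w = A_mul(V[:, j])
        for i in range(j + 1):               # modified Gram-Schmidt
            H[i, j] = np.vdot(V[:, i], w)
            w = w - H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        if H[j + 1, j] < 1e-12:              # happy breakdown
            k = j + 1
            break
        V[:, j + 1] = w / H[j + 1, j]
    E = expm_taylor(H[:k, :k])
    return beta * (V[:, :k] @ E[:, 0])       # beta * V_k exp(H_k) e_1
```

For a matrix of small spectral norm, a subspace dimension of a dozen or so already yields accuracy near machine precision, which is the point of the Hochbruck–Lubich analysis cited above.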

Acknowledgments

This work was partially supported by the Simons Foundation Award No. 663281 granted to the Institute of Mathematics of the Polish Academy of Sciences for the years 2021–2023.

The authors thank the Isaac Newton Institute for Mathematical Sciences for support and hospitality during the programme “Geometry, compatibility and structure preservation in computational differential equations”, supported by EPSRC grant EP/R014604/1, where this work has been initiated.

Katharina Schratz has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 850941).

The work of Karolina Kropielnicka and of Marcus Webb on this project was financed by the National Science Centre (NCN), Poland, Grant No. 2019/34/E/ST1/00390.

References

  • Bader, P., Iserles, A., Kropielnicka, K. & Singh, P. (2014), ‘Effective approximation for the semiclassical Schrödinger equation’, Found. Comput. Math. 14(4), 689–720.
  • Berrut, J.-P. & Trefethen, L. N. (2004), ‘Barycentric Lagrange interpolation’, SIAM Rev. 46(3), 501–517.
  • Blanes, S., Casas, F. & Thalhammer, M. (2017), ‘High-order commutator-free quasi-Magnus exponential integrators for non-autonomous linear evolution equations’, Comput. Phys. Commun. 220, 243–262.
  • Clenshaw, C. W. (1955), ‘A note on the summation of Chebyshev series’, Math. Tables Aids Comput. 9, 118–120.
  • Dutt, A., Gu, M. & Rokhlin, V. (1996), ‘Fast algorithms for polynomial interpolation, integration, and differentiation’, SIAM J. Numer. Anal. 33(5), 1689–1711.
  • Gautschi, W. (1970), ‘Efficient computation of the complex error function’, SIAM J. Numer. Anal. 7(1), 187–198.
  • Gautschi, W. (2004), Orthogonal Polynomials: Computation and Approximation, Oxford University Press, Oxford.
  • Golub, G. H. & Welsch, J. H. (1969), ‘Calculation of Gauss quadrature rules’, Math. Comp. 23(106), 221–230.
  • Hairer, E., Lubich, C. & Wanner, G. (2006), Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations, Vol. 31, Springer, Berlin.
  • Hall, M. (1950), ‘A basis for free Lie rings and higher commutators in free groups’, Proc. Amer. Math. Soc. 1(5), 575–581.
  • Hochbruck, M. & Lubich, C. (1997), ‘On Krylov subspace approximations to the matrix exponential operator’, SIAM J. Numer. Anal. 34(5), 1911–1925.
  • Iserles, A. & Webb, M. (2019), ‘Orthogonal systems with a skew-symmetric differentiation matrix’, Found. Comput. Math. 19(6), 1191–1221.
  • Iserles, A. & Webb, M. (2020a), ‘A differential analogue of Favard’s theorem’, arXiv preprint arXiv:2012.07400.
  • Iserles, A. & Webb, M. (2020b), ‘A family of orthogonal rational functions and other orthogonal systems with a skew-Hermitian differentiation matrix’, J. Fourier Anal. Appl. 26(1), Paper No. 19.
  • Iserles, A. & Webb, M. (2021), ‘Fast computation of orthogonal systems with a skew-symmetric differentiation matrix’, Comm. Pure Appl. Math. 74(3), 478–506.
  • Iserles, A., Kropielnicka, K. & Singh, P. (2018), ‘Magnus–Lanczos methods with simplified commutators for the Schrödinger equation with a time-dependent potential’, SIAM J. Numer. Anal. 56(3), 1547–1569.
  • Iserles, A., Kropielnicka, K. & Singh, P. (2019), ‘Solving Schrödinger equation in semiclassical regime with highly oscillatory time-dependent potentials’, J. Comput. Phys. 376, 564–584.
  • Iserles, A., Luong, K. & Webb, M. (2021), ‘Approximation of wave packets on the real line’, arXiv preprint arXiv:2101.02566.
  • Ismail, M. E. H., ed. (2020), Univariate Orthogonal Polynomials, Encyclopedia of Special Functions: The Askey–Bateman Project, Cambridge University Press, Cambridge.
  • Jin, S., Markowich, P. & Sparber, C. (2011), ‘Mathematical and computational methods for semiclassical Schrödinger equations’, Acta Numer. 20, 121–209.
  • Lasser, C. & Lubich, C. (2020), ‘Computing quantum dynamics in the semiclassical regime’, Acta Numer. 29, 229–401.
  • Olver, F. W. J., Lozier, D. W., Boisvert, R. F. & Clark, C. W., eds (2010), NIST Handbook of Mathematical Functions, Cambridge University Press, Cambridge.
  • Poppe, G. P. & Wijers, C. M. (1990), ‘More efficient computation of the complex error function’, ACM Trans. Math. Software 16(1), 38–46.
  • Reutenauer, C. (1993), Free Lie Algebras, London Maths Soc. Monographs 7, Oxford University Press, Oxford.
  • Singh, P. (2016), ‘High accuracy computational methods for the semiclassical Schrödinger equation’.
  • Townsend, A., Webb, M. & Olver, S. (2018), ‘Fast polynomial transforms based on Toeplitz and Hankel matrices’, Math. Comp. 87(312), 1913–1934.
  • Wang, H., Huybrechs, D. & Vandewalle, S. (2014), ‘Explicit barycentric weights for polynomial interpolation in the roots or extrema of classical orthogonal polynomials’, Math. Comp. 83(290), 2893–2914.
  • Weideman, J. A. C. (1994), ‘Computation of the complex error function’, SIAM J. Numer. Anal. 31(5), 1497–1518.