This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Ising Hamiltonian Minimization: Gain-Based Computing with Manifold Reduction of Soft-Spins vs Quantum Annealing

James S. Cummins1, Hayder Salman2,3 and Natalia G. Berloff1, N.G.Berloff@damtp.cam.ac.uk 1Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, United Kingdom
2School of Engineering, Mathematics, and Physics, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK,
3Centre for Photonics and Quantum Science, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
(May 8, 2025)
Abstract

We investigate the minimization of Ising Hamiltonians, comparing the performance of gain-based computing paradigms based on the dynamics of semi-classical soft-spin models with quantum annealing. We systematically analyze how the energy landscape for the circulant couplings of a Möbius graph evolves with increased annealing parameters. Our findings indicate that these semi-classical models face challenges due to a widening dimensionality landscape. To counteract this issue, we introduce the ‘manifold reduction’ method, which restricts the soft-spin amplitudes to a defined phase space region. Concurrently, quantum annealing demonstrates a natural capability to navigate the Ising Hamiltonian’s energy landscape due to its operation within the comprehensive Hilbert space. Our study indicates that physics-inspired or physics-enhanced optimizers will likely benefit from combining classical and quantum annealing techniques.

I Introduction

Pursuing enhanced computing speed and power efficiency has led to exploring alternatives to traditional electronic systems in solving complex tasks. Optical Neural Networks (ONNs) promise unprecedented parallelism, potentially superior speeds, and reduced power consumption. ONNs encode neural weights as phase shifts or changes in light intensity, with activation functions instantiated via nonlinear optical materials or components, or via a strong hybridization to matter excitations [1]. They offer the potential to operate in the terahertz range, vastly surpassing the gigahertz frequencies of conventional electronic systems that can be exploited in machine learning and combinatorial optimization. The common feature of ONNs is to utilize a network of optical oscillators dynamically described by a coupled system of soft-spin models on complex-valued fields ψi=riexp[iθi]\psi_{i}=r_{i}\exp[i\theta_{i}] that have amplitude rir_{i} (referred to as ‘soft mode’) and phase θi\theta_{i} (discrete e.g. θi{0,π}\theta_{i}\in\{0,\pi\} or continuous ‘spin’) degrees of freedom. Each spin in the network can be associated with the quadrature of the optical complex-valued fields, thereby in the classical limit reducing the system to a model of real soft-spins given by ricosθir_{i}\cos\theta_{i} which we analyze hereafter.

Optical parametric oscillator based coherent Ising machines (CIMs) [2, 3, 4, 5], lasers [6, 7, 8], spatial light modulators (SLMs) [9], lattices of polariton [10, 11] and photon condensates [12], Microsoft’s analog iterative machine [13], and Toshiba’s simulated bifurcation machine [14] can all minimize the classical hard-spin Ising Hamiltonians HI=i,jJijsisjH_{\rm I}=-\sum_{i,j}J_{ij}s_{i}s_{j} with si=±1s_{i}=\pm 1 for a coupling matrix 𝐉\mathbf{J}, and other spin Hamiltonians (e.g. XY Hamiltonians HXY=i,jJij𝐬i𝐬jH_{\rm XY}=-\sum_{i,j}J_{ij}\mathbf{s}_{i}\cdot\mathbf{s}_{j} with 𝐬i=(cosφi,sinφi)\mathbf{s}_{i}=(\cos\varphi_{i},\sin\varphi_{i})) using soft-spin bifurcation dynamics via the Aharonov-Hopf bifurcation [15]. This principle of operation leads to an exciting new paradigm known as “gain-based computing”. The concept behind gain-based computing is that computational problems can be encoded in the gain and loss rates of driven-dissipative systems, which are then driven through a symmetry-breaking transition (bifurcation), selecting a mode that minimizes losses. Such soft-spin models exploit enhanced dimensionality, marked by small energy barriers during amplitude bifurcation, but also complicate the energy landscape with numerous local minima. In parallel to these methods, quantum annealing is another approach to minimize the hard-spin Ising Hamiltonian. Despite numerous studies contrasting classical and quantum methods, the limitations of currently available hardware and the limitations of simulating quantum systems classically have led to contrasting conclusions as to whether a quantum advantage can potentially be realized using quantum annealing [16, 17, 18, 19, 20, 21, 22, 23] and in particular, how quantum annealers such as D-Wave perform in comparison with CIM [24]. In the latter, the connectivity of the coupling matrix was assumed to be a key factor in performance differences between these machines [24].

An all-optical scalable ONN was recently proposed for cyclic graphs; SLMs are used to discretize the optical field where each pixel defines a different pulse amplitude [25]. The SLM with Mx×MyM_{x}\times M_{y} pixels is set up with a transmission function J~k\tilde{J}_{k} which multiplies the Fourier transform of the amplitudes at each round trip. The SLM couples the fields with coupling matrix JijJ~ji+1J_{ij}\equiv\tilde{J}_{j-i+1}, which corresponds to a circulant graph. An alternative setup allows for any general coupling matrix. However, there is an N=MxN=M_{x} limit to the number of pulses. Circulant graphs such as Möbius ladders therefore lend themselves well to optical solvers, where N=Mx×My106N=M_{x}\times M_{y}\sim 10^{6} spins can be defined.

The couplings are often geometrical in polariton condensates, photon condensates, and laser cavities (e.g. the sign and amplitude of the coupling strength depend on the distance between condensates and outflow wavenumber [26]). The condensates arranged in a circle interact with the nearest neighbors, but the interactions beyond this decay exponentially. Previously, various ways of establishing long-range interactions in polariton-based XY-Ising machines were discussed. An easier way to achieve the couplings between remote sites is to use digital micro-mirror devices (DMDs) to direct the light across the ring. DMDs were shown to perform complex (amplitude and phase) modulation. By splitting the complex field into real and imaginary parts and using the time modulation scheme of the DMD, a complex signal could be synthesized [27]. Reflecting the entire ring of condensates on itself with a radial displacement implements a 3-regular cyclic graph. Cyclic graphs are known to be computationally intractable for classical computers for sampling probability distributions of quantum walks [28].

Using ONNs for optimization has shown promise, yet key questions remain: ‘what are suitable benchmarks for optical machines, how to guide annealing to aid optimization, what are the ONN energy landscape dynamics during annealing to ensure the optimal state is achieved, and what are the distinguishing features between quantum and classical annealing?’. Answers often rely on the coupling matrix 𝐉\mathbf{J}. An instructive problem encoded in 𝐉\mathbf{J} should be technologically feasible, have controllable couplings, possess non-trivial structures resistant to simple local perturbations, and be mathematically tractable. Moreover, it is better to have deterministic rather than random couplings to avoid issues of statistical convergence [29].

Here, we analyse and contrast gain-based computing for soft-spin Ising models (SSIM) with quantum annealing for circulant coupling matrices, which allow complete control of frustration, energy gaps, and the structure of critical points. Furthermore, the potential to realize them in future optical systems [25] make them more suitable to consider over previously reported benchmarks [30, 31, 32]. A highlighted challenge for SSIM annealing lies in the opposing relationship between local and global minima when mapping the Ising Hamiltonian to the energy of the soft-spin system [29]. Notably, we demonstrate that quantum annealing within the whole Hilbert space of the hard-spin system navigates this challenge. Additionally, we suggest that ‘manifold reduction’, aligning amplitudes to the mean, is needed to augment the likelihood of SSIM finding the global minima.

ONNs based on laser operation leverage quantum-inspired principles such as coherence, interference, and parallelism. They are dissipative systems that tend to minimize losses on their route to coherence. The losses can be written as an ‘energy’ (‘cost’) function to be minimized. For instance, in the classical limit, CIM’s energy landscape to be minimized is

E=C4i=1N(p(t)xi2)212i,j=1NJijxixj,E=\frac{C}{4}\sum_{i=1}^{N}\left(p(t)-x_{i}^{2}\right)^{2}-\frac{1}{2}\sum_{i,j=1}^{N}J_{ij}x_{i}x_{j}, (1)

where xix_{i} are quadratures of the OPOs, p(t)p(t) describes the effective laser pumping power (injection minus linear losses), and CC corresponds to the strength of saturable nonlinearity. As p(t)p(t) grows from a large negative to large positive p(t)=pp(t)=p_{\infty} values, EE anneals from the dominant convex first term on the right-hand side of Eq. (1) that is minimized at xi=0x_{i}=0 for all ii, to the minimum of the second term which is the scaled target Ising Hamiltonian with xi=±px_{i}=\pm\sqrt{p_{\infty}}. The temporal change of p(t)p(t), therefore, is the annealing parameter combined with gradient descent as

x˙i=Exi=C(p(t)xixi3)+i=1NJijxj.{\dot{x}}_{i}=-\frac{\partial{E}}{\partial x_{i}}=C\left(p(t)x_{i}-x_{i}^{3}\right)+\sum_{i=1}^{N}J_{ij}x_{j}. (2)

The operation of CIM, therefore, relies on the gradient descent of an annealed energy landscape. All ONN soft spin optimizers exploit this central principle, while the details of the nonlinearity or the gradient dynamics can vary from platform to platform [15]. In particular, CIM dynamics is an example of the Hopfield-Tank (HT) network x˙i=p(t)xi+i=1NJijxj{\dot{x}}_{i}=p(t)x_{i}+\sum_{i=1}^{N}J_{ij}x_{j} also used for Ising Hamiltonian minimization [33, 34]. Another approach uses second-order resonance to project the XY onto the Ising dynamics [35]. In the next section, we describe the principles of gain-based computing and contrast it with simulated and quantum annealing.

II Principles of Operation of Gain-Based Computing, Quantum Annealing, and Simulated Annealing

Gain-based computing is a computational paradigm where problems are encoded in the gain and loss rates of driven-dissipative systems, as illustrated in Fig. (1)(a). These systems undergo a symmetry-breaking transition when various physical modes are excited from the vacuum state. As these modes grow, the loss function evolves until a coherent state that minimizes losses emerges. The mode that achieves the minimum of the loss function is amplified, as shown in Fig. (1)(a). Gain-based computing leverages soft-spin models, which provide enhanced dimensionality and small energy barriers during amplitude bifurcation. Although these models create a complex energy landscape with numerous local minima, making optimization challenging, they are also rich in potential solutions.

Refer to caption
Figure 1: Schematics of the operation of (a) gain-based computing, (b) simulated annealing, and (c) quantum annealing.

Simulated annealing (SA), on the other hand, is a classical optimization technique; see Fig. (1)(b). SA probabilistically explores the solution space by simulating the cooling of a material to reach a state of minimum energy. It uses thermal fluctuations to escape local minima, as the system trajectory shown in blue indicates, with the probability of accepting worse solutions decreasing over time. This simulates a cooling process that gradually refines the search towards the global minimum. Implemented on classical computing systems using stochastic algorithms, SA explores the energy landscape by thermal fluctuations, with a gradual reduction in temperature controlling the balance between exploration and exploitation. The performance of simulated annealing is influenced by the cooling schedule, which determines how the temperature is reduced over time, as well as the specific parameters of the algorithm.

In contrast, quantum annealing (QA) is a quantum computation method used to find the ground state of a system’s energy; see Fig. (1)(c). QA operates by evolving the system from an initial Hamiltonian, which is usually simple and convex, to the target Hamiltonian that encodes the optimization problem. This evolution relies on the principles of quantum mechanics, specifically quantum tunneling, to explore the energy landscape. Quantum annealing utilizes quantum fluctuations to escape local minima and tunnel through energy barriers, potentially leading to faster convergence to the global minimum. This approach can be advantageous in navigating complex landscapes with high barriers between local minima. In Fig. (1)(c), the varying energy landscape is shown as the annealing from the initial convex Hamiltonian to the target Hamiltonian takes place in time. The system starts at the ground state of the initial Hamiltonian and remains in the ground state if annealing is sufficiently slow. The blue line shows the system state at each moment.

III Möbius Ladder Graphs

Cyclic graphs with NN nodes are characterized by the circulant coupling matrix 𝐉N×N\mathbf{J}\in\mathbb{R}^{N\times N}, constructed through cyclical permutations of an NN-vector. These graphs inherently have vertex permutation symmetry, signifying boundary periodicity and uniform neighborhoods. The structure of a circulant matrix is contained in any row, and its eigenvalues and eigenvectors can be analytically derived using the NN roots of unity of a polynomial, where the row components of the matrix act as coefficients: λn=j=1NJ1,jcos[2πnN(j1)]\lambda_{n}=\sum_{j=1}^{N}J_{1,j}\cos[\frac{2\pi n}{N}(j-1)] [36, 37, 38]. We consider the minimization of the Ising Hamiltonian on a particular form of cyclic graph – Möbius ladder graphs with tunable hardness. These have even NN such that the ii-th vertex has two edges connecting it to vertices i±1i\pm 1 with antiferromagnetic coupling with strength Ji,i±1=1J_{i,i\pm 1}=-1 (circle couplings), and an additional antiferromagnetic coupling with vertex i+N/2i+N/2 with strength Ji,i+N/2=JJ_{i,i+N/2}=-J for J>0J>0 (cross-circle couplings). We denote by S0S_{0} the state where the spins alternate along the ring so that sisi±1=1s_{i}s_{i\pm 1}=-1 for all ii (Fig. (2)(a)), and by S1S_{1} the state where the spins alternate everywhere except at two positions on the opposite sides of the ring: sisi+1=1s_{i}s_{i+1}=-1 for all ii0i\neq i_{0} and si0si0+1=si0+N/2si01+N/2=1s_{i_{0}}s_{i_{0}+1}=s_{i_{0}+N/2}s_{i_{0}-1+N/2}=1 (Fig. (2)(b)). When N/2N/2 is odd, S0S_{0} is always the ground state with energy HI(J)=(J+2)N/2H_{\rm I}(J)=-(J+2)N/2. When N/2N/2 is even, the S0S_{0} configuration has energy HI(0)(J)=(J2)N/2H_{\rm I}^{(0)}(J)=(J-2)N/2 and S1S_{1} has HI(1)(J)=4(J+2)N/2H_{\rm I}^{(1)}(J)=4-(J+2)N/2. Therefore, S0S_{0} [S1S_{1}] is the global minimum (while S1S_{1} [S0S_{0}] is the excited state) if J<Jcrit4/NJ<J_{\rm crit}\equiv 4/N [J>JcritJ>J_{\rm crit}]. The eigenvalues of the coupling matrix 𝐉\bf{J} for the Möbius ladder with J1,j{1,0,J}J_{1,j}\in\{-1,0,-J\} are λn=2cos(2πn/N)J(1)n\lambda_{n}=-2\cos(2\pi n/N)-J(-1)^{n}. Equating the two largest eigenvalues 2cos(2π/N)+J2\cos(2\pi/N)+J and 2J2-J gives the value of J=Je=1cos(2π/N)J=J_{\rm e}=1-\cos(2\pi/N) at which the leading eigenvectors change. When Je<J<JcritJ_{\rm e}<J<J_{\rm crit} the eigenvalues for S0S_{0} are less than that for S1S_{1}, despite S0S_{0} being the lower energy state (see Appendix A for the derivation of the spectra). This is in contrast to computationally simple problem instances, in which the ground state minimizer is located at the hypercube corner of the projected eigenvector corresponding to the largest eigenvalue [39].

IV Soft-Spin Ising Model

Refer to caption
Figure 2: (a-c) Schematic representation of the states realized by soft-spin models of Eq. (1) on Möbius ladder graphs with varying cross-circle couplings (shown in red). (a), (b) and (c) depict states that map onto S0S_{0}, S1S_{1} Ising states for N=8N=8 and S1S_{1} state for N=12N=12, respectively. (c) uses a different node arrangement that illustrates the graph relationship with the topology of the Möbius strip. The same colors are used to show equal intensities; the larger sizes correspond to larger intensities. (d) Regions of different global minima of Eq. (1): E1E_{1} in the blue region and E0E_{0} in the pink region, in JpJ-p space for N=8N=8 and C=1C=1. Two critical values of JJ are shown as solid black lines. Between these lines, S0S_{0} is expected as the hard-spin Ising model global minimizer. Thin lines show the contours E1=E0E_{1}=E_{0} for C=1C=1, 1.21.2, 1.51.5, 22, and 44 in that region. (e) Success probability of reaching E0E_{0} (labeled as SP0SP_{0}) and E1E_{1} (labeled as SP1SP_{1}) states of the soft-spin energy in Eq. (1) from a point 𝐱{\bf x} with randomly chosen components xix_{i} in [1,1][-1,1] for different values of pp and J=0.4J=0.4, N=8N=8, and C=1C=1. For larger values of pp, a third state of higher energy appears with success probability SP2SP_{2}; when projected on spins si=xi/|xi|s_{i}=x_{i}/|x_{i}| this state corresponds to S1S_{1}. (f) The height of the minimum energy barrier between E1E_{1} and E0E_{0} calculated as the energy difference between E1E_{1} and the energy of the nearest saddle point is shown as a black dashed line for J=0.4J=0.4 and C=1C=1. The difference between E0E_{0} and E1E_{1} is shown in red.
Refer to caption
Figure 3: (a) Evolution of N=8N=8 soft spins for J=0.35J=0.35 and (b) J=0.55J=0.55 according to Eq. (2). In each case, the ground state is recovered. The amplitudes connected by the frustrated edges are lower than in the rest of the system and are shown in red. In all runs, C=1C=1, p0=J2p_{0}=J-2, ε=0.003\varepsilon=0.003, Δt=0.1\Delta t=0.1, and each xi(0)x_{i}(0) is chosen randomly from a uniform distribution in the range [0.001,0.001][-0.001,0.001].

SSIM in Eq. (1) has real amplitudes xix_{i}. As the laser pumping p(t)p(t) increases from negative values, the minimizers 𝐱{\bf x}^{*} of Eq. (1) and minima of EE change. We associate Ising spins with xix_{i} via si=xi/|xi|s_{i}=x_{i}/|x_{i}|. We expect that the soft-spin energy state E0E_{0} that corresponds to the hard-spin Ising state S0S_{0}, and depicted in Fig. (2)(a), is symmetric in amplitudes as all spins experience the same frustration of the cross-circle coupling, so all amplitudes have the same modulus |xi|=X.|x_{i}|=X. From Eq. (2), XX satisfies X=p(t)+(2J)/CX=\sqrt{p(t)+(2-J)/C}, with the corresponding soft-spin energy E0=(J2)N(2J+2Cp)/4.E_{0}=(J-2)N(2-J+2Cp)/4. This state can be realized from a vacuum state when p(t)p(t) exceeds (J2)/C.(J-2)/C. The soft-spin energy state E1E_{1} corresponding to S1S_{1}, when two side edges are frustrated, is asymmetric in amplitudes. This asymmetry is shown schematically in Figs. (2)(b) and (c), in agreement with the dynamical simulations presented in Fig. (3). This occurs because the lower energy is achieved if the amplitudes connected by the frustrated edges |xi|=XL|x_{i}|=X_{L} are lower than in the rest of the system. For N=8N=8 in Fig. (2)(b), there are two types of amplitudes: 4 nodes with ±|XL|\pm|X_{L}| and 4 with amplitudes |xi|=XB|x_{i}|=X_{B}, where XB=(1JCp)XL+CXL3X_{B}=(1-J-Cp)X_{L}+CX_{L}^{3} as obtained from the steady states of equation Eq. (2) governing the dynamics of XLX_{L}, while the steady-state on the evolution of XLX_{L} gives (p+1+J)XB+XL=XB3(p+1+J)X_{B}+X_{L}=X_{B}^{3}. By solving the polynomial equation for XLX_{L}, we can compute E1E_{1} across any pp, JJ, NN, and CC. This allows us to discern regions in this parameter space where the global minimum aligns with either E0E_{0} or E1E_{1} and confirm if these states correspond with the hard-spin Ising Hamiltonian’s global minimum. Figure (2)(d) depicts distinct regions in the JpJ-p parameter space. Within the Je<J<JcritJ_{\rm e}<J<J_{\rm crit} interval S0S_{0} emerges as the hard-spin Ising model’s lowest energy state. For the soft-spin model, however, only the region shown in pink corresponds to this state (E0E_{0}). Figure (2)(d) shows that for values Je<J<JcritJ_{\rm e}<J<J_{\rm crit}, as laser power pp rises, the E0E_{0} state becomes the energy minimum for the soft-spin model, aligning with the hard-spin Ising Hamiltonian’s S0S_{0}. Yet, the success probability of converging to the true ground state does not increase beyond 0.20.2 as shown in Fig. (2)(e). This is a consequence of increasing amplitudes that bring the increased height of the energy barriers that prevent the system from transitioning to the state S0S_{0} (see Fig. (2)(f)). Figure (4) depicts the basins of attraction for various pp and fixed Je<J<Jcrit.J_{\rm e}<J<J_{\rm crit}. The basins of attraction are defined as the sets of points randomly distributed on [1,1][-1,1] from which gradient descent leads to different minima. At the threshold of large negative pp, the basin of attraction of E1E_{1}, which is the ground state of EE as given by Eq. (1), dominates. As pp increases, the basin of attraction of the excited state E0E_{0} increases while at small positive values of pp, E0E_{0} becomes the ground state. With a further increase of pp, other states with even higher energy appear.

The space structure of soft spin models can be further understood by considering the critical points of their energy landscape for different annealing parameter values [40]. We can determine the critical (minima and the saddle) points by setting E/xi=0{\partial E}/\partial x_{i}=0 for all i=1,,Ni=1,\ldots,N and classify them using the Hessian matrix. The number of critical points grows exponentially fast with pp; however, not in terms of energy and the distance from the state xi=0x_{i}=0 \forall ii as Fig. (5) illustrates.

Refer to caption
Figure 4: Basins of attraction of the soft spin energy in Eq. (1) as defined in the main text. We take N=8N=8, J=0.4J=0.4, C=1C=1, various laser powers pp, and 2000020000 randomly distributed xix_{i} in [1,1][-1,1] to show which minimum is reached via gradient descent using Newton’s method. To characterize points, the average magnetization m=ixi/Nm=\sum_{i}x_{i}/N (vertical axis) and the correlations along the circle between xix_{i} defined as Xcorr=i(xim)(xi+1m)/i(xim)2X_{\rm corr}=\sum_{i}(x_{i}-m)(x_{i+1}-m)/\sum_{i}(x_{i}-m)^{2} (horizontal axis), are used. For small pp, the basin of attraction is dominated by the S1S_{1} state as any initial state descends to E1E_{1}. As pp grows, the ratio of the volume of the basins of attraction of the S1S_{1} state to the volume of the basins of attraction of the S0S_{0} approaches 44. At the critical value of p0.08715p\approx-0.08715, both S1S_{1} and S0S_{0} states have the same energy, and after that S0S_{0} state becomes the ground state: this is indicated by the switch between ground (blue) and excited (red) states.
Refer to caption
Figure 5: Critical points of the CIM energy (1) for N=8N=8, C=1C=1 and different values of laser power pp. Minima are shown in green, saddles with one, two, three and 4+ unstable directions are shown as red, light blue, blue and black points, respectively. The S0S_{0} state is furthest from the origin, which becomes the global minimum for p>pc=0.0872p>p_{\rm c}=-0.0872. The minima for p=2p=2 are S0S_{0}, S1S_{1}, (,,+,,+,,,+)(-,-,+,-,+,-,-,+), (+,,,+,+,,+,)(+,-,-,+,+,-,+,-) and (+,,,+,+,,,+)(+,-,-,+,+,-,-,+) in increasing energy.

The S1S_{1} state is always further away from the origin than other critical points (S0S_{0} and saddle points). At the same time, the transition between minima E0E_{0} and E1E_{1} is possible only through a saddle point whose energy relative to E1E_{1} and E0E_{0} defines the height of the energy barriers; see Fig. (2)(f).

V Manifold Reduction

The aforementioned considerations suggest that amplitude heterogeneities have a severely detrimental effect on the optimization process in some regions of parameter space as they allow the soft-spin energy landscape to find and follow its ground state, which is quite different from the ground state of the hard-spin Ising Hamiltonian. This problem has been recognized before [41, 42], but in the context of the final state, so various feedback schemes were suggested to bring all amplitudes to the same value, say ±1\pm 1, at the end of the simulations. This could be achieved, for instance, by changing the laser intensity individually for each spin as pi˙(t)=ε(1xi2)\dot{p_{i}}(t)=\varepsilon(1-x_{i}^{2}), where ε\varepsilon is a small constant parameter. However, as our results on the simple circulant graphs illustrate, this feedback does not change the most essential part of the dynamics during the pitchfork bifurcation from the vacuum state. Moreover, this feedback becomes important only for amplitudes sufficiently close to ±1\pm 1 when the barriers between states are already too high.

Instead, we suggest introducing feedback restricting the soft spin energy landscape to keep them close to the average value. This restriction can be achieved by modifying the signal intensities bringing them towards the average mass per particle defined by the squared radius of the quadrature R(𝐱)i=1xi2/NR({\bf x})\equiv\sum_{i=1}x_{i}^{2}/N as

xi(1δ)xi+δRxi/|xi|.x_{i}\rightarrow(1-\delta)x_{i}+\delta Rx_{i}/|x_{i}|. (3)

If δ=0\delta=0, then no adjustment is made. If δ=1\delta=1, then all amplitudes are set to the same (average) value. For 0<δ<10<\delta<1, 1/δ1/\delta determines the proportion of the effective space for the restricted evolution.

Figure (6) shows the probability of finding the ground state of the Ising Hamiltonian using the HT networks: Eq. (2) (CIM-I), Eq. (2) with individual pumping adjustments ppip\rightarrow p_{i} according to pi˙=ε(1xi2)\dot{p_{i}}=\varepsilon(1-x_{i}^{2}) (CIM-II), and Eq. (2) with manifold reduction by Eq. (3) (CIM-III). For CIM-I and CIM-III, we set p(t)=(1p0)tanh(εt)+p0p(t)=(1-p_{0})\tanh(\varepsilon\,t)+p_{0}. CIM-III shows a significant improvement in finding the ground state compared with other models. Thus, in soft-spin models, the imperative to constrain the manifold implies that dimensional annealing should be tailored according to the energy landscape’s characteristics. Quantum annealing, on the other hand, harnesses dimensional annealing within an extended Hilbert space. By utilizing only linear dynamics at the expense of operating within this higher-dimensional phase space, it can effectively navigate energy barriers. Next, we study the quantum evolution on the Ising energy landscape of circulant coupling matrices in order to contrast its performance with soft-spin nonlinear models.

Refer to caption
Figure 6: Ground state probability for HT, CIM-I, CIM-II, CIM-III, and quantum annealing (QA) for the Möbius ladder graph with N=8N=8. For CIM-III, for each value of JJ, the optimum value of 0<δ<10<\delta<1 is chosen based on a set of preliminary runs in which δ\delta is varied. Two thousand runs are used to calculate the probability of finding the ground state PGSP_{\rm GS} for each value of JJ. For QA, B=5B=5 and Δt=0.1\Delta t=0.1. Inset shows the same plots but for N=100N=100.

VI Quantum Annealing

We consider the transverse field Ising model given by

H^\displaystyle\hat{H} =12i,j=1ijN,NJijS^izS^jzi=1NhiS^izγ(t)i=1NS^ix,\displaystyle=-\frac{1}{2}\sum_{\begin{subarray}{c}i,j=1\\ i\neq j\end{subarray}}^{N,N}J_{ij}\hat{S}_{i}^{z}\hat{S}_{j}^{z}-\sum_{i=1}^{N}h_{i}\hat{S}_{i}^{z}-\gamma(t)\sum_{i=1}^{N}\hat{S}_{i}^{x}\,,
S^iα\displaystyle\hat{S}_{i}^{\alpha} =𝟙𝟙𝟙S^α𝟙𝟙𝟙i1terms,α=x,y,z,\displaystyle=\mathds{1}\otimes\mathds{1}\otimes\cdots\mathds{1}\otimes\hat{S}^{\alpha}\underbrace{\otimes\mathds{1}\otimes\cdots\otimes\mathds{1}\otimes\mathds{1}}_{i-1\text{terms}},\,\,\alpha=x,y,z\,, (4)

where S^α\hat{S}^{\alpha} are the spin-1/21/2 Pauli matrices, 𝟙\mathds{1} is the 2×22\times 2 identity matrix, and \otimes denotes a tensor product. The first term, H^0\hat{H}_{0}, is diagonal and corresponds to the operator representation of the classical Ising Hamiltonian HIH_{\rm I}; the second term is a symmetry-breaking longitudinal magnetic field; the third term is a transverse field that results in a non-diagonal Hamiltonian operator and gives rise to the quantum Ising model. We will take the annealing term to be of the form γ(t)=B/t+t0\gamma(t)=B/\sqrt{t+t_{0}} for some constant BB [43] and set t0=0.5t_{0}=0.5. Our quantum system is made up of NN spin-1/21/2 subsystems each having a basis |,|{\lvert\downarrow\rangle,\lvert\uparrow\rangle}. A general state |Ψ\lvert\Psi\rangle of the NN-spin system can then be written as

|Ψ=ξCξ|ξwithξ|Cξ|2=1,\displaystyle|\Psi\rangle=\sum_{\xi}C_{\xi}|\xi\rangle\quad\text{with}\quad\sum_{\xi}\left|C_{\xi}\right|^{2}=1, (5)

where the CξC_{\xi}’s are complex numbers and the basis element

|ξ|ξ1ξN=|ξN|ξ1,ξk={|,|},\displaystyle|\xi\rangle\equiv\left|\xi_{1}\cdots\xi_{N}\right\rangle=\left|\xi_{N}\right\rangle\otimes\cdots\otimes\left|\xi_{1}\right\rangle,\quad\xi_{k}=\{\lvert\downarrow\rangle,\,\,\lvert\uparrow\rangle\}\,, (6)

for k=1,,Nk=1,\cdots,N. We begin with an initial state, which is the ground state of the transverse field Hamiltonian. The initial state at time tit_{i} can then be expressed as

|Ψ(ti)=|ψ|ψ,\displaystyle|\Psi(t_{i})\rangle=\lvert\psi_{\rightarrow}\rangle\otimes\cdots\otimes\lvert\psi_{\rightarrow}\rangle\,, (7)

where for each subsystem |ψ=(|+|)/2\lvert\psi_{\rightarrow}\rangle=(\lvert\uparrow\rangle+\lvert\downarrow\rangle)/\sqrt{2}. The wavefunction is then evolved according to the time-dependent Schrödinger equation (see Appendix B for details) [44]. As tt\rightarrow\infty, γ(t)0\gamma(t)\rightarrow 0 and the contribution of the last term decays to bring about the target Hamiltonian. Provided γ(t)\gamma(t) is varied adiabatically, the state evolves whilst remaining in the true ground state of the system, and settles into the target Hamiltonian’s desired ground state at sufficiently long times.

To determine the probability of finding the ground state, we compute the projection of |Ψ(t)\lvert\Psi(t)\rangle onto the ground state |ϕGS\lvert\phi_{{}_{\rm GS}}\rangle of the classical Hamiltonian, H^0\hat{H}_{0}, given by PGS=|ϕGS|Ψ(t)|2P_{{}_{\rm GS}}=|\langle\phi_{{}_{\rm GS}}\,|\,\mathopen{}\Psi(t)\rangle|^{2}. In Fig. (7), we present numerical simulations of the time evolution of the success probability for finding the ground state of an N=8N=8 spin system with J=0J=0 and B=5B=5. For comparison, we have also included the results for simulated annealing [45] and classical annealing by evolving a master equation [43] (see Appendix D for details). In the former, we allow transition probabilities for single-spin flips only, whereas, in the latter, we allow for all spin flips to reveal the importance of spin correlations on the success probability of finding the ground states. Such collective transitions can be important when topological constraints associated with particular spin configurations can render certain single-spin transitions ineffective at escaping local energy minima.

Refer to caption
Figure 7: Time-evolution of ground state probability of target Hamiltonian with J=0J=0 and B=5B=5 for quantum annealing (QA), single-spin simulated annealing (SA), classical annealing (CA) and corresponding probabilities expected for adiabatic simulated (SA-ad) and adiabatic quantum (QA-ad) annealing. (a) and (b) correspond to simulation without symmetry breaking term which corresponds to a doubly degenerate ground state. Each figure corresponds to the projection of the probability density onto each one of the ground states; (c) and (d) correspond to a simulation with a symmetry breaking term added which lifts the degeneracy and leads to a unique ground state.
Refer to caption
Figure 8: Evolution of N=8N=8 spins, with J=0.35J=0.35, B=5B=5 and Δt=0.1\Delta t=0.1; (a) Ground state probability of target Hamiltonian for quantum annealing (QA), single-spin simulated annealing (SA), classical annealing (CA) and corresponding probabilities expected for adiabatic simulated (SA-ad) and adiabatic quantum (QA-ad) annealing. Insets show Bloch vector for single-spin and magnitude of Bloch vector |𝐮||{\mathbf{u}}|; (b) similar results as (a) but with symmetry-breaking terms added to Hamiltonian and Bloch sphere shows typical trajectories of two neighboring spins; ; (c) evolution for probability amplitude of |\lvert\uparrow\rangle state in quantum annealing simulation with J=0.35J=0.35 and (d) J=0.6J=0.6. The amplitudes colored in red correspond to the frustrated spins as in Figs. (2)(b) and (3)(b).

For J<JcritJ<J_{\rm crit}, the S0S_{0} ground state has a two-fold degeneracy. Therefore, in the absence of a symmetry breaking term, we can expect that the probability of finding one of the ground states is PGS=1/2P_{{}_{\rm GS}}=1/2. Figures (7)(a) and (b) present simulations for the case where no symmetry breaking term is included so that the system contains two degenerate energy minima. Time evolution of the probabilities for finding the system in one of the two degenerate ground states show an equal probability for the system to be found in either one of these states. Moreover, the results are relatively similar regardless of which numerical method is considered. Therefore, quantum annealing, simulated annealing and classical annealing show a similar performance in tracking the ground states as indicated by the curves representing adiabatic evolution of the system.

In Figs. (7)(c) and (d), we present simulations for the case where a symmetry breaking term is included. To introduce the symmetry breaking term, we used 0.05|ξS0+0.05|ξS10.05\lvert\xi\rangle_{S_{0}}+0.05\lvert\xi\rangle_{S_{1}}, where |ξS0\lvert\xi\rangle_{S_{0}} and |ξS1\lvert\xi\rangle_{S_{1}} correspond to the S0S_{0} and S1S_{1} states, respectively. In contrast to the previous case, the behavior of the different models is now markedly different. In particular, we observe that both quantum annealing and classical annealing correctly evolve with the true ground state as indicated by the curves corresponding to the adiabatic evolution. Moreover, due to the symmetry breaking, there is one unique ground state that the two methods can follow. In contrast, simulated annealing is not always successful at tracking the true ground state even for this case where J=0J=0. We found a success probability of only 67%, whereas the remaining probability is associated with the system converging to what is now a metastable state. These results demonstrate the importance of the symmetry breaking terms and how they affect the time-evolution of the ground state probabilities.

In Figs. (8)(a) and (8)(b), we present results with and without symmetry breaking terms for the case with J=0.35J=0.35 and B=5B=5 that corresponds to a hard region of the parameter space for the soft-spin models. As can be seen, now the success probability for simulated annealing degrades even in the absence of symmetry breaking terms. In contrast, classical annealing and quantum annealing continue to perform well. Although the convergence of simulated annealing can be enhanced for slower annealing rates, in general, the success probabilities are lower than the other algorithms we have investigated over a range of annealing schedules (see also Ref. [43]).

To compare the quantum annealing and semi-classical soft-spin simulations, we computed the single-spin reduced density matrix ρ^1,i\hat{\rho}_{1,i} from the pure state |Ψ(t)\lvert\Psi(t)\rangle. In general, the single-spin density matrix will correspond to entangled states. This is illustrated by recovering the Bloch vector from ρ^1,i(t)\hat{\rho}_{1,i}(t) (see Appendix C for details). In the inset of Fig. (8)(a), we show the evolution of the Bloch vector with time evaluated for one of the spins (other spins show similar behavior) for a simulation with J=0.35J=0.35 and B=5B=5 in the absence of a symmetry breaking term. We see that the spin is initially aligned along the equator (consistent with the form of |\lvert\rightarrow\rangle) but shrinks towards the origin as the state evolves. The departure of the Bloch vector from the surface of the Bloch sphere is indicative of quantum entanglement while its dynamics towards the origin signals a spin state that is maximally entangled with the rest of the system. A definite state emerges only upon measurement, which then subsequently collapses the corresponding wavefunction to one specific configuration.

These results demonstrate the striking differences between the states of a fully quantum mechanical description, and a semi-classical description considered earlier. To facilitate comparison with the deterministic semi-classical simulations, we removed the ground state degeneracy in our quantum annealing simulations by introducing the symmetry-breaking term H^1\hat{H}_{1} in Eq. (4). We set hih_{i} to correspond to 0.05|ξS0+0.05|ξS10.05\lvert\xi\rangle_{S_{0}}+0.05\lvert\xi\rangle_{S_{1}}. This enforces the evolution towards a specific ground state as can be seen by contrasting the success probability for the ground states presented in Figs. (8)(a) and (8)(b). The resulting Bloch vector is shown in the inset of Fig. (8)(b), and now indicates evolution that ends at the surface of the Bloch sphere, reaching either the |\lvert\uparrow\rangle or |\lvert\downarrow\rangle state which represents a final state that is not in quantum superposition. Since the Bloch vector does not remain on the surface of the Bloch sphere during the evolution, it clearly demonstrates that though individual spins converge towards a non-correlated value, their evolution bears the imprint of inter-spin correlations. Unlike the semi-classical models, our quantum annealing algorithm consistently identifies the correct ground state for a wide range of parameters in the interval Je<J<JcritJ_{\rm e}<J<J_{\rm crit}; (see Fig. (6)) and demonstrates that correlations play a key role in facilitating the system to converge to the true ground state. However, its performance appears to degrade near JcritJ_{\rm crit}. In contrast, the CIM gain-based algorithm is less sensitive near JcritJ_{\rm crit} and indicates an advantage of gain-based computing on such a Möbius ladder graph. The corresponding time-dependent probability of finding each spin, ii, in the |\lvert\uparrow\rangle state is presented in Fig. (8)(c) for J=0.35J=0.35 (and for J=0.6J=0.6 in Fig. (8)(d)). As can be seen from the initial evolution of the single-spin probability amplitude, we strongly perturb the system towards state S1S_{1} through the form of the symmetry breaking terms used. Despite this, the results emphasize the quantum annealing algorithm’s capacity to find the correct ground state during gradual γ(t)\gamma(t) quenches, leveraging the quantum system’s expanded phase space.

In order to perform a more systematic study of the impact of including the symmetry breaking terms on the results presented, in Fig. (9) we present results for simulations performed by using a symmetry breaking term of the form 0.05|ξS0+h1|ξS10.05\lvert\xi\rangle_{S_{0}}+h_{1}\lvert\xi\rangle_{S_{1}}, where h1h_{1} is varied from 0.0050.005 to 0.10.1. For each value of h1h_{1}, we have evaluated the time evolution of the probability amplitudes |ρ^1,k|\langle\uparrow\,|\,\mathopen{}\hat{\rho}_{1,k}|\,\uparrow\rangle for each spin kk, as well as the time evolution of the magnitude of the corresponding Bloch vectors |𝐮k||{\mathbf{u}}_{k}|. The results demonstrate that as h1h_{1} is increased, the initial evolution of the probability amplitudes is to align the spins towards the S1S_{1} state which is caused by the increasing contribution of the symmetry breaking term. However, as the system navigates the energy landscape, quantum correlations develop as indicated by the decreasing amplitude of the Bloch vectors of the individual spins. This emerging quantum entanglement of the spins prevents the system from becoming stuck in local energy minima and subsequently allows the spins to readjust in order to track the true ground state. Subsequently, the system converges to the true ground state that is well described by a product state as the magnitude of the Bloch vectors converge to unity. We note that during the evolution, the maximal entanglement occurs at the time when the projection of some of the spins flips to the opposite direction. This time also coincides with the time where simulated annealing fails to track the correct ground state in comparison to quantum annealing as reflected in the results of Figs. (7) and (8). We, therefore, conclude that quantum correlations play a key role in allowing quantum annealing to outperform other methods in this region of the parameter space of the Möbius circulant graph.

Refer to caption
Figure 9: Time-evolution of probability amplitude of |\lvert\uparrow\rangle state (left panels) and magnitude of Bloch vectors (right panels) in quantum annealing computation with J=0.35J=0.35 and B=5B=5. The dashed lines correspond to spins that at the end of the annealing align along the |\lvert\downarrow\rangle state whereas solid lines correspond to spins that align with the |\lvert\uparrow\rangle state. The red and blue colored lines correspond to the color of the spins shown in Fig. (2)(b) and (3)(b).

VII Conclusions

In summary, we analyzed the optimization of Ising Hamiltonians, contrasting the classical dynamics of semi-classical soft-spin models with quantum annealing. We discussed the challenges that arise with using semi-classical models, which are due to a broadening dimensionality landscape, especially when the models’ global minimum maps to the Ising Hamiltonian’s excited state. A solution, termed ‘manifold reduction’, is presented, constraining the soft-spin amplitudes and restricting the dimensionality landscape. On the other hand, we showed that quantum annealing inherently can traverse the Ising Hamiltonian’s energy terrain, operating within an extensive Hilbert space. The findings highlight the importance of understanding the influence of dimensionality and the energy landscape overall on optimizing physical systems. Furthermore, they demonstrate how extensions of semi-classical models to include quantum effects has the potential to assist the annealing in navigating the system towards the true ground state.

Acknowledgements

J.S.C. acknowledges the PhD support from the UKRI EPSRC DTP EP/T517847/1, N.G.B acknowledges the support from the Julian Schwinger Foundation Grant No. JSF-19-02-0005, HORIZON EIC-2022-PATHFINDERCHALLENGES-01 HEISINGBERG Project 101114978, and Weizmann-UK Make Connection grant 142568.

Appendix A Eigenvectors and Eigenvalues of the Möbius Ladder Coupling Matrix

Refer to caption
Figure 10: Eigenvalues of an N=8N=8 Möbius ladder graph as a function of JJ. JeJ_{\rm e} is the value of JJ where the red dashed line shows the two largest eigenvalues crossing. JcritJ_{\rm crit} shows where their energies are equal E1=E0E_{1}=E_{0}. The ground state corresponds to S0S_{0} for J<JcritJ<J_{\rm crit} and S1S_{1} for J>JcritJ>J_{\rm crit}.

To find the eigenvalues and eigenvectors of the N×NN\times N matrix 𝐉\mathbf{J} for even NN, we use the roots of unity, so the solutions of ωN=1\omega^{N}=1 are ωk=exp(i2πk/N)\omega_{k}=\exp(i2\pi k/N) for k=0,,N1k=0,\ldots,N-1. The corresponding eigenvectors are (1,ωk,ωk2,,ωkN1)(1,\omega_{k},\omega_{k}^{2},\ldots,\omega_{k}^{N-1}) which can be verified by direct computation. Then, from the first row of 𝐉\mathbf{J}, we form the polynomial f(ω)=ωJωN/2ωN1f(\omega)=-\omega-J\omega^{N/2}-\omega^{N-1} and evaluate it at the unit roots ωk=exp[i2πk/N]\omega_{k}=\exp[i2\pi k/N] to obtain the eigenvalues λk=f(ωk)=2cos(2πk/N)J(1)k\lambda_{k}=f(\omega_{k})=-2\cos(2\pi k/N)-J(-1)^{k} with the corresponding eigenvectors vk=(1,ωk,ωk2,,ωkN1).v_{k}=(1,\omega_{k},\omega_{k}^{2},\ldots,\omega_{k}^{N-1}). The largest of f(ωk)f(\omega_{k}) is either λN/2=2J\lambda_{N/2}=2-J or λN/2±1=J+2cos(2π/N)\lambda_{N/2\pm 1}=J+2\cos(2\pi/N) depending on whether J<JeJ<J_{\rm e} or J>JeJ>J_{\rm e} with Je1cos(2π/N)J_{\rm e}\equiv 1-\cos(2\pi/N) as the value where these two eigenvalues cross. The corresponding real-valued and mutually orthogonal eigenvectors μk\mu_{k} can be formed from v(ωk)v(\omega_{k}) as μk=Re[v(ωk)]+Im[v(ωk)]\mu_{k}={\rm Re}[v(\omega_{k})]+{\rm Im}[v(\omega_{k})] [37]. For the two largest eigenvalues, the eigenvectors are μN/2=(1,1,1,1,,1)\mu_{N/2}=(1,-1,1,-1,\ldots,-1) and μN/2±1=(1,±cos(2π/N)±sin(2π/N),,±cos(2πk/N)±sin(2πk/N),,±cos(2π(N1)/N)±sin(2π(N1)/N))\mu_{N/2\pm 1}=(1,\pm\cos(2\pi/N)\pm\sin(2\pi/N),\ldots,\pm\cos(2\pi k/N)\pm\sin(2\pi k/N),\ldots,\pm\cos(2\pi(N-1)/N)\pm\sin(2\pi(N-1)/N)). If N/2N/2 is even, then μN/2±1\mu_{N/2\pm 1} have the components with two zero values at the positions separated by N/21N/2-1 sign alternating components. We illustrate this construction for the Möbius ladder coupling matrix 𝐉\mathbf{J} with N=8N=8 considered in the main text

𝐉=(0100J00110100J00010100J00010100JJ00101000J00101000J00101100J0010).{\bf J}=\left(\begin{array}[]{cccccccc}0&-1&0&0&-J&0&0&-1\\ -1&0&-1&0&0&-J&0&0\\ 0&-1&0&-1&0&0&-J&0\\ 0&0&-1&0&-1&0&0&-J\\ -J&0&0&-1&0&-1&0&0\\ 0&-J&0&0&-1&0&-1&0\\ 0&0&-J&0&0&-1&0&-1\\ -1&0&0&-J&0&0&-1&0\\ \end{array}\right). (8)

The eigenvalues are λ0=f(ω0)=f(1)=2J,\lambda_{0}=f(\omega_{0})=f(1)=-2-J, λ1=f(ω1)=2+J,\lambda_{1}=f(\omega_{1})=-\sqrt{2}+J, λ2=f(ω2)=J,\lambda_{2}=f(\omega_{2})=-J, λ3=f(ω3)=2+J,\lambda_{3}=f(\omega_{3})=\sqrt{2}+J, λ4=f(ω4)=2J,\lambda_{4}=f(\omega_{4})=2-J, λ5=f(ω5)=2+J,\lambda_{5}=f(\omega_{5})=\sqrt{2}+J, λ6=f(ω6)=J,\lambda_{6}=f(\omega_{6})=-J, and λ7=f(ω7)=2+J\lambda_{7}=f(\omega_{7})=-\sqrt{2}+J. The eigenvector that corresponds to λ3\lambda_{3} is μ3=(1,1,1,1,1,1,1,1)\mu_{3}=(1,-1,1,-1,1,-1,1,-1) and the eigenvector that corresponds to, say, λ4\lambda_{4} is μ4=(1,2,1,0,1,2,1,0)\mu_{4}=(1,-\sqrt{2},1,0,-1,\sqrt{2},-1,0). The soft spin system, therefore, follows (+,,+,+,,+,,)(+,-,+,+,-,+,-,-) or (+,,+,,,+,,+)(+,-,+,-,-,+,-,+) direction at the onset of the pitch-fork bifurcation when J>JeJ>J_{\rm e} while λ4\lambda_{4} becomes the dominant eigenvalue of matrix 𝐉\mathbf{J}. Figure (10) illustrates how these eigenvalues vary as a function of JJ.

Appendix B Solution of the Time-Dependent Schrödinger Equation

The wavefunction is evolved according to the time-dependent Schrödinger equation (with =1\hbar=1) given by

iddt|Ψ(t)\displaystyle i\frac{\mbox{d}}{\mbox{d}t}\,\lvert\Psi(t)\rangle =H^(t)|Ψ(t),\displaystyle=\hat{H}(t)\,\lvert\Psi(t)\rangle\,, (9)
H^(ti)|Ψ(ti)\displaystyle\hat{{H}}(t_{i})\,|\Psi(t_{i})\rangle =εgs|Ψ(ti),\displaystyle=\varepsilon_{gs}\,|\Psi(t_{i})\rangle\,, (10)

where εgs\varepsilon_{gs} denotes the ground state energy of the system at the initial time. To evolve the time-dependent Hamiltonian given by Eq. (9), we use a second order accurate Strang time-splitting method where H^0\hat{H}_{0} is evolved for half a time-step Δt\Delta t followed by H^1\hat{H}_{1} for a full time-step and then H^0\hat{H}_{0} for another half a time-step. The resulting time-integration scheme can then be written as

|Ψ(tn+1)=\displaystyle\lvert\Psi(t_{n+1})\rangle= exp(iΔt2H^D)(i2tntn+1H^2(t~)dt~)\displaystyle\exp\left(-i\frac{\Delta t}{2}\hat{H}_{D}\right)\left(-\frac{i}{2}\int_{t_{n}}^{t_{n+1}}\hat{H}_{2}(\tilde{t})\mbox{d}\tilde{t}\right)
×\displaystyle\times exp(iΔt2H^D)|Ψ(tn),\displaystyle\exp\left(-i\frac{\Delta t}{2}\hat{H}_{D}\right)\lvert\Psi(t_{n})\rangle\,, (11)

where H^D=H^0+H^1\hat{H}_{D}=\hat{H}_{0}+\hat{H}_{1} is the diagonal part of the Hamiltonian operator. By placing the Hamiltonian operator H^2(t)\hat{H}_{2}(t) containing the time-dependent term in the middle of the split-step algorithm, we ensure that we have a symmetric time-splitting method. The time integral appearing in Eq. (11) was evaluated analytically. In our simulations, we set Δttn+1tn=0.1\Delta t\equiv t_{n+1}-t_{n}=0.1. The simulations were performed in MATLAB. The exponentials of the diagonal and non-diagonal Hamiltonian can then be readily evaluated using the expm function [44].

Appendix C Computation of Bloch Vectors in Quantum Annealing Simulations

The single-spin reduced density matrix ρ^1,k\hat{\rho}_{1,k} is obtained by taking the partial trace of the 2N×2N2^{N}\times 2^{N} density matrix ρ^\hat{\rho} over the Hilbert space of the other N1N-1 spins. For the kk-th spin, this is defined as

ρ^1,k(t)=Tr{N\k}ρ^,\displaystyle\hat{\rho}_{1,k}(t)=\text{Tr}_{\{N\backslash k\}}\hat{\rho}\,, (12)

where {N\k}\{N\backslash k\} denotes the NN spin Hilbert space excluding the kk-th spin. The single-spin density matrix can then be parameterized as

ρ^1,k\displaystyle\hat{\rho}_{1,k} =12(𝟙+𝐮k𝐒^)=12(1+wkukivkuk+ivk1wk),\displaystyle=\frac{1}{2}\left(\mathds{1}+\mathbf{u}_{k}\cdot\hat{\mathbf{S}}\right)=\frac{1}{2}\left(\begin{array}[]{cc}1+w_{k}&u_{k}-iv_{k}\\ u_{k}+iv_{k}&1-w_{k}\end{array}\right)\,, (15)

where 𝐮k=(uk,vk,wk){\mathbf{u}}_{k}=(u_{k},v_{k},w_{k}) defines the corresponding Bloch vector and 𝐒^=(S^x,S^y,S^z)\hat{\mathbf{S}}=(\hat{S}^{x},\hat{S}^{y},\hat{S}^{z}) correspond to the vector of spin-1/21/2 Pauli matrices. For pure states the single-spin reduced density matrix has rank 1, with a magnitude of the Bloch vector |𝐮k|=1|\mathbf{u}_{k}|=1. The surface of the Bloch sphere, therefore, represents all the possible pure states whereas the interior of the sphere corresponds to mixed states. The collapse of the Bloch vector towards the origin implies that the state represents a maximally entangled Bell-like state.

Appendix D Master Equation for Classical/Simulated Annealing

To model simulated annealing, we follow the method described in Ref. [43], and introduce the Master equation for the transition probability Pi(t)P_{i}(t) for each spin configuration as

dPi(t)dt=j=12NAij(t)Pj(t).\displaystyle\frac{\mbox{d}P_{i}(t)}{\mbox{d}t}=\sum_{j=1}^{2^{N}}A_{ij}(t)P_{j}(t). (16)

The 2N×2N2^{N}\times 2^{N} matrix Aij(t)A_{ij}(t) describes the transition rates. The master equation can be written in the form

dPi(t)dt\displaystyle\frac{\mbox{d}P_{i}(t)}{\mbox{d}t} =ijAij(t)Pj(t)+Aii(t)Pi(t),\displaystyle=\sum_{i\neq j}A_{ij}(t)P_{j}(t)+A_{ii}(t)P_{i}(t), (17)
=ij(Aij(t)Pj(t)Aji(t)Pi(t)),\displaystyle=\sum_{i\neq j}\left(A_{ij}(t)P_{j}(t)-A_{ji}(t)P_{i}(t)\right), (18)

where we have made use of the conservation of probability given by

ddtjPj(t)=i,j(AjiPi)=0,\displaystyle\frac{\mbox{d}}{\mbox{d}t}\sum_{j}P_{j}(t)=\sum_{i,j}(A_{ji}P_{i})=0, (19)

to arrive at the final equality. Since the normalization condition must hold for any probabilities PjP_{j}, it follows that

jAji=0,orAii=ijAji.\displaystyle\sum_{j}A_{ji}=0\,,\qquad\text{or}\qquad A_{ii}=-\sum_{i\neq j}A_{ji}\,. (20)

Using Eq. (20) to represent the diagonal terms of the Master equation ensures that a numerical integration of this equation continues to conserve the normalization of the probabilities. The precise form of the transition probabilities is problem specific, although it is common to use the Boltzmann distribution. In our work, we follow Ref. [43] and use the Bose-Einstein distribution such that

Aij(t)={{1+exp[(EiEj)T(t)]}1,( single-spin flip) kiAki,(i=j)0,(otherwise) .\displaystyle A_{ij}(t)=\begin{cases}\left\{1+\exp\left[\dfrac{\left(E_{i}-E_{j}\right)}{T(t)}\right]\right\}^{-1}\,\,,&(\text{ single-spin flip) }\\ -\sum_{k\neq i}A_{ki}\,\,,&(i=j)\\ 0\,,&\text{(otherwise) }.\end{cases}

The form given above that is used for our simulated annealing simulations means that entries of AijA_{ij} are non-zero only for transitions corresponding to single-spin flips. The annealing is performed by varying the temperature T(t)T(t) with time. To maintain consistency with our quantum annealing simulations, we have varied the temperature according to T(t)=D/t+t0T(t)=D/\sqrt{t+t_{0}}, where t0=0.5t_{0}=0.5, and DD is a free parameter which we set to D=5D=5.

For our classical annealing (CA) simulations, we do not zero out any of the transition probabilities in order to infer how collective transitions of spins at each time-step, as opposed to only single-spin transitions, affects the performance of classical algorithms. It is useful to make the observation that quantum and classical annealing can be closely related to one another if one formulates quantum annealing in imaginary time following a Wick rotation. It then follows that the evolution of the NN-spin wavefunction |Ψ(t)|\Psi(t)\rangle is given by

ddt|Ψ(t)=(μ(t)H^(t))|Ψ(t).\displaystyle\frac{\mbox{d}}{\mbox{d}t}|\Psi(t)\rangle=\left(\mu(t)-\hat{{H}}(t)\right)|\Psi(t)\rangle\,\,. (21)

Here, μ(t)\mu(t) plays the role of a Lagrange multiplier which ensures that the normalisation of the wavefunction is conserved in analogy with the modification introduced above to the diagonal term of the Master equation. Therefore, by comparing quantum annealing, simulated annealing, and classical annealing, we can distinguish between the effects of retaining all-spin transitions from the difference of evolving our equations in real and imaginary time.

References

  • Kasprzak et al. [2006] J. Kasprzak, M. Richard, S. Kundermann, A. Baas, P. Jeambrun, J. Keeling, F. Marchetti, M. Szymańska, R. André, J. Staehli, et al., Nature 443, 409 (2006).
  • Yamamoto et al. [2017] Y. Yamamoto, K. Aihara, T. Leleu, K.-i. Kawarabayashi, S. Kako, M. Fejer, K. Inoue,  and H. Takesue, npj Quantum Information 3, 1 (2017).
  • Inagaki et al. [2016] T. Inagaki, Y. Haribara, K. Igarashi, T. Sonobe, S. Tamate, T. Honjo, A. Marandi, P. L. McMahon, T. Umeki, K. Enbutsu, et al., Science 354, 603 (2016).
  • McMahon et al. [2016] P. L. McMahon, A. Marandi, Y. Haribara, R. Hamerly, C. Langrock, S. Tamate, T. Inagaki, H. Takesue, S. Utsunomiya, K. Aihara, et al., Science 354, 614 (2016).
  • Honjo et al. [2021] T. Honjo, T. Sonobe, K. Inaba, T. Inagaki, T. Ikuta, Y. Yamada, T. Kazama, K. Enbutsu, T. Umeki, R. Kasahara, K. ichi Kawarabayashi,  and H. Takesue, Science Advances 7, eabh0952 (2021).
  • Pal et al. [2020] V. Pal, S. Mahler, C. Tradonsky, A. A. Friesem,  and N. Davidson, Physical Review Research 2, 033008 (2020).
  • Babaeian et al. [2019] M. Babaeian, D. T. Nguyen, V. Demir, M. Akbulut, P.-A. Blanche, Y. Kaneda, S. Guha, M. A. Neifeld,  and N. Peyghambarian, Nature communications 10, 1 (2019).
  • Parto et al. [2020] M. Parto, W. Hayenga, A. Marandi, D. N. Christodoulides,  and M. Khajavikhan, Nature materials 19, 725 (2020).
  • Pierangeli et al. [2019] D. Pierangeli, G. Marcucci,  and C. Conti, Physical review letters 122, 213902 (2019).
  • Berloff et al. [2017] N. G. Berloff, M. Silva, K. Kalinin, A. Askitopoulos, J. D. Töpfer, P. Cilibrizzi, W. Langbein,  and P. G. Lagoudakis, Nature materials  (2017).
  • Kalinin et al. [2020] K. P. Kalinin, A. Amo, J. Bloch,  and N. G. Berloff, Nanophotonics 9, 4127 (2020).
  • Vretenar et al. [2021] M. Vretenar, B. Kassenberg, S. Bissesar, C. Toebes,  and J. Klaers, Physical Review Research 3, 023167 (2021).
  • Kalinin et al. [2023] K. Kalinin, G. Mourgias-Alexandris, H. Ballani, N. G. Berloff, J. H. Clegg, D. Cletheroe, C. Gkantsidis, I. Haller, V. Lyutsarev, F. Parmigiani, L. Pickup, et al., arXiv preprint arXiv:2304.12594  (2023).
  • Goto et al. [2021] H. Goto, K. Endo, M. Suzuki, Y. Sakai, T. Kanao, Y. Hamakawa, R. Hidaka, M. Yamasaki,  and K. Tatsumura, Science Advances 7, eabe7953 (2021).
  • Syed and Berloff [2022] M. Syed and N. G. Berloff, “Physics-enhanced bifurcation optimisers: All you need is a canonical complex network,”  (2022).
  • Liu et al. [2015] C.-W. Liu, A. Polkovnikov,  and A. W. Sandvik, Phys. Rev. Lett. 114, 147203 (2015).
  • King et al. [2022] A. D. King, S. Suzuki, J. Raymond, A. Zucca, T. Lanting, F. Altomare, A. J. Berkley, S. Ejtemaee, E. Hoskinson, S. Huang, E. Ladizinsky, A. J. R. MacDonald, G. Marsden, T. Oh, G. Poulin-Lamarre, M. Reis, C. Rich, Y. Sato, J. D. Whittaker, J. Yao, R. Harris, D. A. Lidar, H. Nishimori,  and M. H. Amin, Nature Physics 18, 1324 (2022).
  • Heim et al. [2015] B. Heim, T. F. Rønnow, S. V. Isakov,  and M. Troyer, Science 348, 215 (2015).
  • King et al. [2023] A. D. King, J. Raymond, T. Lanting, R. Harris, A. Zucca, F. Altomare, A. J. Berkley, K. Boothby, S. Ejtemaee, C. Enderud, E. Hoskinson, S. Huang, E. Ladizinsky, A. J. R. MacDonald, G. Marsden, R. Molavi, T. Oh, G. Poulin-Lamarre, M. Reis, C. Rich, Y. Sato, N. Tsai, M. Volkmann, J. D. Whittaker, J. Yao, A. W. Sandvik,  and M. H. Amin, Nature 617, 61 (2023).
  • Albash and Lidar [2018a] T. Albash and D. A. Lidar, Phys. Rev. X 8, 031016 (2018a).
  • Yan and Sinitsyn [2022] B. Yan and N. A. Sinitsyn, Nature Communications 13, 2212 (2022).
  • Albash and Lidar [2018b] T. Albash and D. A. Lidar, Phys. Rev. X 8, 031016 (2018b).
  • Muthukrishnan et al. [2016] S. Muthukrishnan, T. Albash,  and D. A. Lidar, Phys. Rev. X 6, 031010 (2016).
  • Hamerly et al. [2019] R. Hamerly, T. Inagaki, P. L. McMahon, D. Venturelli, A. Marandi, T. Onodera, E. Ng, C. Langrock, K. Inaba, T. Honjo, K. Enbutsu, T. Umeki, R. Kasahara, S. Utsunomiya, S. Kako, K. ichi Kawarabayashi, R. L. Byer, M. M. Fejer, H. Mabuchi, D. Englund, E. Rieffel, H. Takesue,  and Y. Yamamoto, Science Advances 5, eaau0823 (2019).
  • Calvanese Strinati et al. [2021] M. Calvanese Strinati, D. Pierangeli,  and C. Conti, Phys. Rev. Appl. 16, 054022 (2021).
  • Ohadi et al. [2016] H. Ohadi, R. Gregory, T. Freegarde, Y. Rubo, A. Kavokin, N. Berloff,  and P. Lagoudakis, Physical Review X 6, 031032 (2016).
  • Ayoub and Psaltis [2021] A. B. Ayoub and D. Psaltis, Scientific Reports 11, 1 (2021).
  • Qiang et al. [2016] X. Qiang, T. Loke, A. Montanaro, K. Aungskunsiri, X. Zhou, J. L. O’Brien, J. B. Wang,  and J. C. Matthews, Nature communications 7, 1 (2016).
  • Marsh et al. [2024] B. P. Marsh, R. M. Kroeze, S. Ganguli, S. Gopalakrishnan, J. Keeling,  and B. L. Lev, Physical Review X 14, 011026 (2024).
  • Hamze et al. [2020] F. Hamze, J. Raymond, C. A. Pattison, K. Biswas,  and H. G. Katzgraber, Physical Review E 101, 052102 (2020).
  • Roberts et al. [2020] D. Roberts, L. Cincio, A. Saxena, A. Petukhov,  and S. Knysh, Physical Review A 101, 042317 (2020).
  • Mandrà et al. [2023] S. Mandrà, G. Mossi,  and E. G. Rieffel, arXiv preprint arXiv:2308.09704  (2023).
  • Hopfield [1982] J. J. Hopfield, Proceedings of the national academy of sciences 79, 2554 (1982).
  • Hopfield and Tank [1985] J. J. Hopfield and D. W. Tank, Biological cybernetics 52, 141 (1985).
  • Kalinin and Berloff [2018a] K. P. Kalinin and N. G. Berloff, arXiv preprint arXiv:1806.11457  (2018a).
  • Kalman and White [2001] D. Kalman and J. E. White, The American Mathematical Monthly 108, 821 (2001).
  • Zhang and Yang [2007] H. Zhang and Y. Yang, International journal of quantum chemistry 107, 330 (2007).
  • Gancio and Rubido [2022] J. Gancio and N. Rubido, Chaos, Solitons & Fractals 158, 112001 (2022).
  • Kalinin and Berloff [2020] K. P. Kalinin and N. G. Berloff, arXiv preprint arXiv:2008.00466  (2020).
  • Yamamura et al. [2023] A. Yamamura, H. Mabuchi,  and S. Ganguli, arXiv preprint arXiv:2309.08119  (2023).
  • Kalinin and Berloff [2018b] K. P. Kalinin and N. G. Berloff, New Journal of Physics 20, 113023 (2018b).
  • Leleu et al. [2020] T. Leleu, F. Khoyratee, T. Levi, R. Hamerly, T. Kohno,  and K. Aihara, arXiv e-prints , arXiv (2020).
  • Kadowaki and Nishimori [1998] T. Kadowaki and H. Nishimori, Phys. Rev. E 58, 5355 (1998).
  • Norambuena et al. [2020] A. Norambuena, D. Tancara,  and R. Coto, European Journal of Physics 41, 045404 (2020).
  • Kirkpatrick et al. [1983] S. Kirkpatrick, C. D. Gelatt,  and M. P. Vecchi, Science 220, 671 (1983).