
Stability of energy landscape for Ising models

Bruno Hideki Fukushima-Kimura (Faculty of Science, Hokkaido University, Japan), Akira Sakai (Faculty of Science, Hokkaido University, Japan; https://orcid.org/0000-0003-0943-7842), Hisayoshi Toyokawa (Institute of Mathematics for Industry, Kyushu University, Japan), Yuki Ueda (Department of Mathematics, Hokkaido University of Education, Japan)
Abstract

In this paper, we explore the stability of the energy landscape of an Ising Hamiltonian subjected to two kinds of perturbations: a perturbation of the coupling coefficients and external fields, and a perturbation of the underlying graph structure. We give sufficient conditions under which the ground states of a given Hamiltonian are stable under perturbations of the first kind in terms of order preservation. By order preservation we mean that the energy ordering of two spin configurations under a perturbed Hamiltonian is preserved in the original Hamiltonian up to a given error margin. We also estimate the probability that the energy gap between the ground states of the original Hamiltonian and of the perturbed Hamiltonian is bounded by a given error margin when the coupling coefficients and local external magnetic fields of the original Hamiltonian are i.i.d. Gaussian random variables. Finally, we present a concrete example of a system which is stable under perturbations of the second kind.

1 Introduction

Finding optimal solutions for combinatorial optimization problems, some of which are known to be NP-hard, is a very important problem. Among many possible approaches to such problems, the application of Ising models to solve real social problems has been getting attention due to its versatility (see [1]). More precisely, a given social combinatorial optimization problem can be mapped into a Hamiltonian $H$ on a graph $G=(V,E)$, whose expression is given by

H(\sigma) = -\sum_{b=\{x,y\}\in E} J_b \sigma_x \sigma_y - \sum_{x\in V} h_x \sigma_x

for every Ising spin configuration $\sigma \in \{-1,1\}^V$, where $\{J_b\}_{b\in E}$ are coupling coefficients and $\{h_x\}_{x\in V}$ are local magnetic fields. In this approach, an optimal solution for the intended combinatorial problem corresponds to a ground state (or global minimum) $\sigma_G$ of $H$, that is, $\sigma_G \in \mathrm{arg\,min}\, H$. There are some well-known methods that can be applied to obtain a ground state. Implementing a Markov chain Monte Carlo method (such as Glauber dynamics and stochastic cellular automata) is a known way to approximate the Gibbs distribution, whose highest peaks correspond to the ground states of $H$. We refer to [2, 3, 4, 5] and also [6] for details.
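To make the setup concrete, the following minimal sketch (not taken from the paper) evaluates the Hamiltonian above and enumerates ground states by brute force; the triangle graph, couplings, and fields used here are illustrative assumptions.

```python
import itertools

def ising_energy(J, h, sigma):
    """Energy H(sigma) = -sum_b J_b s_x s_y - sum_x h_x s_x."""
    coupling = sum(Jb * sigma[x] * sigma[y] for (x, y), Jb in J.items())
    field = sum(hx * sigma[x] for x, hx in h.items())
    return -coupling - field

def ground_states(J, h, vertices):
    """Brute-force argmin of H over all 2^|V| spin configurations."""
    best, argmin = float("inf"), []
    for spins in itertools.product([-1, 1], repeat=len(vertices)):
        sigma = dict(zip(vertices, spins))
        e = ising_energy(J, h, sigma)
        if e < best - 1e-12:
            best, argmin = e, [sigma]
        elif abs(e - best) <= 1e-12:
            argmin.append(sigma)
    return best, argmin

# Toy example: a ferromagnetic triangle with a small field on one vertex.
V = [0, 1, 2]
J = {(0, 1): 1.0, (1, 2): 1.0, (0, 2): 1.0}
h = {0: 0.5, 1: 0.0, 2: 0.0}
emin, gs = ground_states(J, h, V)  # unique ground state: all spins +1
```

Brute-force enumeration is only feasible for small graphs, but it provides a ground truth against which the perturbation bounds discussed below can be checked numerically.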

However, as long as we use Ising machines or any computer to perform numerical simulations to find a ground state, we cannot avoid errors arising from the analog nature of such devices or from the difficulty of representing real numbers (see [7]). For these reasons, we should account for the error in the coupling coefficients and local magnetic fields by introducing a perturbed Hamiltonian. Hence, our original Hamiltonian $H$ will be perturbed, giving rise to a perturbed Hamiltonian $H_\delta$ whose coupling coefficients and local magnetic fields have a maximal error $\delta$. Then, the following natural questions arise:

(1) For any pair of configurations which are ordered in terms of energy with respect to the perturbed Hamiltonian $H_\delta$, is that ordering preserved in the original Hamiltonian $H$, up to a given error margin?

(2) Given a Hamiltonian $H$ with coupling coefficients and local magnetic fields distributed as i.i.d. Gaussian random variables, what is the probability that the energy gap in $H$ between two ground states, for $H$ and for the corresponding perturbed Hamiltonian $H_\delta$, respectively, is sufficiently small?

In addition to questions (1) and (2) above, the following problem is also important when using Ising machines and computers. Taking all coupling coefficients and local magnetic fields into account may be somewhat wasteful. It may be useful to “eliminate” vertices of a given graph whose contribution to the total energy is relatively small, in order to save computer memory. Hence, we also have the following natural question:

(3) Can we find a subset of a given graph such that, for an arbitrary choice of configuration outside of that region, the energy variation can be controlled?

In this paper, we investigate the stability of the energy landscape of a given Hamiltonian under perturbations from the viewpoint of order preservation, aiming to answer the questions addressed above. Thanks to the order preservation property, we can obtain better estimates for the success probability of finding a ground state compared to the result given in [7].

This paper is organized as follows. In Section 2, we give a precise formulation of the questions just posed and restate them. In Section 3, we answer questions (1’) and (2’) from Section 2. In Section 4, we provide an example together with a sufficient condition that guarantees a positive answer to question (3’).

2 Setting and the main questions

In this section, we introduce the definitions and terminology necessary for discussing the stability of the energy landscape. Furthermore, we introduce the notion of order preservation for a perturbed system, which plays a central role in this paper. Roughly speaking, order preservation means that if we take a ground state of a perturbed Hamiltonian (implemented by a device), then it should be close in energy to the ground state of the original Hamiltonian (the intended mathematical problem), up to a given error margin.

Let us begin by introducing the precise setting. Let $G=(V,E)$ be a finite simple graph with vertex set $V$ and edge set $E$. The so-called original Hamiltonian $H$ with coupling coefficients $\{J_b\}_{b\in E}$ and external magnetic fields $\{h_x\}_{x\in V}$ on $G$ is defined by

H(\sigma) = -\sum_{b=\{x,y\}\in E} J_b \sigma_x \sigma_y - \sum_{x\in V} h_x \sigma_x \qquad (2.1)

for each $\sigma = \{\sigma_x\}_{x\in V} \in \{-1,1\}^V$. Such a function $H$ can be regarded as the cost function of an intended problem. Given $\delta > 0$, we denote by $H_\delta$ the perturbed Hamiltonian with coupling coefficients $\{J'_b\}_{b\in E}$ and external fields $\{h'_x\}_{x\in V}$, i.e.,

H_\delta(\sigma) = -\sum_{b=\{x,y\}\in E} J'_b \sigma_x \sigma_y - \sum_{x\in V} h'_x \sigma_x \qquad (2.2)

where the $J'_b$’s and $h'_x$’s satisfy the bounds $\sup_b |J_b - J'_b| \le \delta$ and $\sup_x |h_x - h'_x| \le \delta$. This perturbation will often be interpreted as a round-off in the following way. Let $(J_b^{(1)}, J_b^{(2)}, \dots)$ and $(h_x^{(1)}, h_x^{(2)}, \dots)$ be the binary expansions of the fractional parts of $J_b$ and $h_x$, i.e.,

J_b = J_b^{(0)} + \sum_{i\ge 1} \frac{J_b^{(i)}}{2^i}, \qquad h_x = h_x^{(0)} + \sum_{i\ge 1} \frac{h_x^{(i)}}{2^i} \qquad (2.3)

where $J_b^{(0)}, h_x^{(0)} \in \mathbb{Z}$ and $J_b^{(i)}, h_x^{(i)} \in \{0,1\}$ for $i \ge 1$. If we set $J'_b = J_b^{(0)} + \sum_{i=1}^N \frac{J_b^{(i)}}{2^i}$ and $h'_x = h_x^{(0)} + \sum_{i=1}^N \frac{h_x^{(i)}}{2^i}$ in equation (2.2), then the error $\delta$ can be taken as $2^{-N}$. This means that the perturbed Hamiltonian $H_\delta$ is obtained by rounding off the given parameters $J_b$ and $h_x$ uniformly from the $(N+1)$-th digit of their binary expansions.
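The round-off in (2.3) can be sketched in a few lines of code; this is an illustrative implementation, assuming truncation toward minus infinity, so that the error is nonnegative and at most $2^{-N}$.

```python
import math

def truncate_binary(x, N):
    """Keep the integer part and the first N binary digits of the
    fractional part of x, so that 0 <= x - x' <= 2**-N."""
    return math.floor(x * 2**N) / 2**N

# Example: 0.7 has an infinite binary expansion, so truncation
# introduces an error bounded by delta = 2**-N.
x = 0.7
for N in [1, 4, 8]:
    xp = truncate_binary(x, N)
    assert 0 <= x - xp <= 2**-N
```

Applying `truncate_binary` to every $J_b$ and $h_x$ yields exactly the perturbed parameters $J'_b$ and $h'_x$ described above, with $\delta = 2^{-N}$.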

The main purpose of this paper is to clarify the stability of the ground states for a given Hamiltonian under a perturbation in terms of order preservation. In this paper, we will answer the following questions:

(1’) Find a $\delta > 0$ corresponding to a given $\varepsilon > 0$ so that, for any pair $(\sigma, \tau)$ satisfying $H_\delta(\sigma) \ge H_\delta(\tau)$, the ordering is preserved in $H$ up to the error margin $\varepsilon \sup_{\xi,\eta} |H(\xi) - H(\eta)|$, i.e.,

H(\sigma) \ge H(\tau) - \varepsilon \sup_{\xi,\eta} |H(\xi) - H(\eta)|. \qquad (2.4)

Here, $\sup_{\xi,\eta} |H(\xi) - H(\eta)|$ is the total margin of the original Hamiltonian.

(2’) Let $(\Omega, \mathcal{F}, \mathbb{P})$ be a probability space and let $\{J_b\}_{b\in E}$ and $\{h_x\}_{x\in V}$ be mutually independent standard Gaussian random variables on this probability space. Estimate the probability that the energy gap in $H$ between ground states of $H$ and $H_\delta$, say $\sigma_G$ and $\tilde{\sigma}_G$, respectively, is controlled by the given error margin; explicitly,

\mathbb{P}\left( 0 \le H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon \sup_{\xi,\eta} |H(\xi) - H(\eta)| \right). \qquad (2.5)

A different aspect of the stability of a given system is to find a nontrivial subsystem such that the energy gap between any two spin configurations whose spins coincide on the subgraph is bounded above by a given error margin. At the same time, we require that the number of vertices that can be disregarded is at least of order $N^\alpha$, where $N = |V|$ and $\alpha \in [0,1)$, so that this number can go to infinity as $N \to \infty$. In the later part of this paper, we answer the following question in a particular case:

(3’) Let $|V| = N$, and let $\{J_b\}_{b\in E}$ and $\{h_x\}_{x\in V}$ be mutually independent standard Gaussian random variables. For given $\varepsilon > 0$ and $\alpha \in [0,1)$, find a subset $V_0 \subset V$ such that

\mathbb{P}\left( \sup_{\sigma,\tau\in\{-1,1\}^N} |H(\sigma) - H(\sigma_{V_0}, \tau_{V\setminus V_0})| < \varepsilon \sup_{\xi,\eta} |H(\xi) - H(\eta)| \;\&\; C N^\alpha \le |V \setminus V_0| < N \right)

is close to $1$, where $\sigma_{V_0} \in \{-1,1\}^{V_0}$ is the spin configuration $\sigma$ restricted to $V_0$ and $\tau_{V\setminus V_0} \in \{-1,1\}^{V\setminus V_0}$ is the restriction of the spin configuration $\tau$ to $V \setminus V_0$.

Questions (1’), (2’) and (3’) above correspond to questions (1), (2) and (3) from Section 1, respectively. In Section 3, we investigate the first two questions, where for the second one we adopt two different approaches. We obtain answers to question (2’) by means of a method involving the $L^\infty$-distance and an approach based on the graph structure, and we compare these two methods on three different graphs. Specifically, we consider sufficient conditions on the perturbation $\delta$ for order preservation to hold, and calculate the probability that such a sufficient condition holds. In Section 4, we obtain an answer to question (3’) when the graph is a one-dimensional torus $\mathbb{Z}/N\mathbb{Z}$ without external fields.

3 Stability under a Hamiltonian perturbation

This part is dedicated to providing solutions to questions (1’) and (2’), posed at the end of the previous section. Before proceeding, let us introduce the quantity $R_H$ defined by

R_H \coloneqq \max_{\xi,\eta} |H(\xi) - H(\eta)|, \qquad (3.1)

which is defined whenever a Hamiltonian $H$ is given. Moreover, if $G=(V,E)$ is a finite simple graph, then we define $k_G$ by

k_G := |E| + |V|. \qquad (3.2)

Keeping in mind the mathematical setting introduced at the beginning of Section 2, we start by showing that the order preservation property holds; that is, we first answer question (1’), which consists in finding a $\delta > 0$ corresponding to a given $\varepsilon > 0$ such that $H_\delta(\sigma) \ge H_\delta(\tau)$ implies $H(\sigma) \ge H(\tau) - \varepsilon R_H$. Later on, assuming some randomness in the spin-spin couplings and external fields, we adopt two different approaches to answer question (2’) and estimate the probability that the condition $H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H$ is satisfied.

In order to solve the second problem, we will adopt two distinct approaches: a method that relies on uniform estimates and a method where combinatorial estimates are considered instead, which will be presented in Sections 3.2 and 3.3, respectively. In the last part of this section, we compare these two methods and conclude that depending on the underlying graph structure of the problem, one of them will give us a better lower bound for the probability from equation (2.5).

3.1 Order preservation of energy

The answer to question (1’) from Section 2 is provided by Theorem 3.2; first, however, we establish a preliminary result.

In [5], we already established a lower bound for the total margin $R_H$ of the Hamiltonian $H$; for the reader’s convenience, we include the proof in the present paper.

Lemma 3.1 (See [5]).

Let us consider a finite simple graph $G=(V,E)$ and a Hamiltonian $H$ written in the form

H(\sigma) = -\sum_{b=\{x,y\}\in E} J_b \sigma_x \sigma_y - \sum_{x\in V} h_x \sigma_x

for each $\sigma \in \{-1,1\}^V$. Then, we have

R_H \ge \sqrt{v_H}, \qquad \text{where } v_H \coloneqq \sum_b J_b^2 + \sum_x h_x^2.
Proof.

For any probability measure $\mu$ on the configuration space $\{-1,1\}^V$, we have

R_H \ge \left( \mathbb{E}_\mu[H^2] - \mathbb{E}_\mu[H]^2 \right)^{1/2},

where $\mathbb{E}_\mu$ stands for the expectation with respect to the probability measure $\mu$. If $\mu$ is chosen as the uniform distribution on $\{-1,1\}^V$, then we have

\mathbb{E}_\mu[H] \coloneqq \frac{1}{2^{|V|}} \sum_\sigma H(\sigma) = 0,

and

\mathbb{E}_\mu[H^2] \coloneqq \frac{1}{2^{|V|}} \sum_\sigma H(\sigma)^2
= \frac{1}{2^{|V|}} \sum_\sigma \left( -\sum_{b=\{x,y\}\in E} J_b \sigma_x \sigma_y - \sum_{x\in V} h_x \sigma_x \right)^2
= \frac{1}{2^{|V|}} \sum_\sigma \left( \sum_{b,b'\in E} J_b J_{b'} \sigma_x \sigma_y \sigma_{x'} \sigma_{y'} + \sum_{x,x'\in V} h_x h_{x'} \sigma_x \sigma_{x'} \right)
= \frac{1}{2^{|V|}} \sum_\sigma \left( \sum_{b\in E} J_b^2 + \sum_{x\in V} h_x^2 \right) = v_H,

where the cross terms with $b \ne b'$ or $x \ne x'$ vanish upon averaging over $\sigma$. Therefore, $R_H \ge \left( \mathbb{E}_\mu[H^2] - \mathbb{E}_\mu[H]^2 \right)^{1/2} = \sqrt{v_H}$. ∎
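The bound $R_H \ge \sqrt{v_H}$ holds for any choice of parameters, so it can be checked by exhaustive enumeration on a small instance. Below is an illustrative sketch (the path graph and the Gaussian parameters are assumptions for the demonstration, not part of the paper).

```python
import itertools
import math
import random

random.seed(0)
# A small path graph with random Gaussian couplings and fields.
V = [0, 1, 2, 3]
E = [(0, 1), (1, 2), (2, 3)]
J = {b: random.gauss(0, 1) for b in E}
h = {x: random.gauss(0, 1) for x in V}

def H(sigma):
    return (-sum(J[(x, y)] * sigma[x] * sigma[y] for (x, y) in E)
            - sum(h[x] * sigma[x] for x in V))

# R_H = max H - min H over all 2^|V| configurations.
energies = [H(dict(zip(V, s)))
            for s in itertools.product([-1, 1], repeat=len(V))]
R_H = max(energies) - min(energies)
v_H = sum(Jb**2 for Jb in J.values()) + sum(hx**2 for hx in h.values())
assert R_H >= math.sqrt(v_H)  # Lemma 3.1
```

The same check passes for any seed, consistent with the fact that the lemma is deterministic in the parameters.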

In order to prove the next result, it is convenient to use the following notation introduced in [7]. For any Ising spin configurations $\sigma$ and $\tau$, we consider the sets $D_{\sigma,\tau}$ and $W_{\sigma,\tau}$ defined by

D_{\sigma,\tau} \coloneqq \{ x \in V : \sigma_x \tau_x = -1 \}

and

W_{\sigma,\tau} \coloneqq \{ \{x,y\} \in E : \sigma_x \sigma_y \tau_x \tau_y = -1 \},

where the products $\sigma_x \tau_x$ and $\sigma_x \sigma_y \tau_x \tau_y$ are called the spin overlap and the link overlap, respectively.

Theorem 3.2.

Given $\varepsilon > 0$ and configurations $\sigma$ and $\tau$, if the condition

0 < \delta k_G \le \frac{1}{2} \varepsilon \sqrt{v_H}

is satisfied, then $H_\delta(\sigma) \ge H_\delta(\tau)$ implies $H(\sigma) \ge H(\tau) - \varepsilon R_H$.

Proof.

If we suppose that $H_\delta(\sigma) \ge H_\delta(\tau)$, then we have

H(\tau) - H(\sigma) = \left( H(\tau) - H_\delta(\tau) \right) + H_\delta(\tau) - H_\delta(\sigma) + \left( H_\delta(\sigma) - H(\sigma) \right)
\le \left( H_\delta(\sigma) - H(\sigma) \right) - \left( H_\delta(\tau) - H(\tau) \right)
\le \sum_{b=\{x,y\}\in E} |J_b - J'_b| \, |\sigma_x \sigma_y - \tau_x \tau_y| + \sum_{x\in V} |h_x - h'_x| \, |\sigma_x - \tau_x|
= 2 \sum_{b\in W_{\sigma,\tau}} |J_b - J'_b| + 2 \sum_{x\in D_{\sigma,\tau}} |h_x - h'_x|
\le 2\delta \left( |W_{\sigma,\tau}| + |D_{\sigma,\tau}| \right).

Since $|W_{\sigma,\tau}| \le |E|$, $|D_{\sigma,\tau}| \le |V|$, $k_G = |E| + |V|$, and $R_H \ge \sqrt{v_H}$, by our assumption we obtain

H(\tau) - H(\sigma) \le 2\delta k_G \le \varepsilon \sqrt{v_H} \le \varepsilon R_H.

Therefore, the conclusion of this theorem follows. ∎
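The key intermediate bound in the proof, $H(\tau) - H(\sigma) \le 2\delta k_G$ whenever $H_\delta(\sigma) \ge H_\delta(\tau)$, holds unconditionally and can be verified exhaustively on a toy instance. The following sketch uses an assumed triangle graph and a bounded random perturbation of size $\delta$.

```python
import itertools
import random

random.seed(1)
V = [0, 1, 2]
E = [(0, 1), (1, 2), (0, 2)]
J = {b: random.gauss(0, 1) for b in E}
h = {x: random.gauss(0, 1) for x in V}
delta = 0.05
# Perturbed parameters, each within distance delta of the originals.
Jp = {b: J[b] + random.uniform(-delta, delta) for b in E}
hp = {x: h[x] + random.uniform(-delta, delta) for x in V}

def energy(Jd, hd, sigma):
    return (-sum(Jd[(x, y)] * sigma[x] * sigma[y] for (x, y) in E)
            - sum(hd[x] * sigma[x] for x in V))

k_G = len(E) + len(V)
configs = [dict(zip(V, s)) for s in itertools.product([-1, 1], repeat=len(V))]
# Whenever the perturbed ordering holds, the original ordering can fail
# by at most 2 * delta * k_G, as derived in the proof above.
ok = all(energy(J, h, s) >= energy(J, h, t) - 2 * delta * k_G
         for s in configs for t in configs
         if energy(Jp, hp, s) >= energy(Jp, hp, t))
assert ok
```

Under the additional condition $2\delta k_G \le \varepsilon \sqrt{v_H}$, the slack $2\delta k_G$ is bounded by $\varepsilon R_H$, which is exactly the statement of Theorem 3.2.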

3.2 Stability of ground states: first approach

In the previous subsection, we did not assume any randomness in the spin-spin couplings $J_b$ and local external fields $h_x$. In this subsection, we consider the same setting as stated in question (2’) from Section 2. Precisely speaking, we assume that $\{J_b\}_{b\in E}$ and $\{h_x\}_{x\in V}$ are mutually independent random variables distributed according to a standard Gaussian distribution.

Under such assumptions, we estimate the probability that the inequality $H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H$ holds, where $\varepsilon$ is a given positive constant, by using a method that relies on uniform bounds with respect to certain spin configurations. In the following lemma, we provide an upper bound for the difference $H(\tilde{\sigma}_G) - H(\sigma_G)$.

Lemma 3.3.

Given $\delta > 0$, if $\sigma_G$ and $\tilde{\sigma}_G$ are ground states of $H$ and $H_\delta$, respectively, then we have

H(\tilde{\sigma}_G) - H(\sigma_G) \le 2\delta k_G. \qquad (3.3)
Proof.

It follows from the definition of a ground state that $H_\delta(\tilde{\sigma}_G) - H_\delta(\sigma_G) \le 0$, hence

H(\tilde{\sigma}_G) - H(\sigma_G) = \left( H(\tilde{\sigma}_G) - H_\delta(\tilde{\sigma}_G) \right) + H_\delta(\tilde{\sigma}_G) - H_\delta(\sigma_G) + \left( H_\delta(\sigma_G) - H(\sigma_G) \right)
\le 2 \lVert H_\delta - H \rVert_\infty,

where $\lVert \cdot \rVert_\infty$ stands for the uniform norm, as usual. Furthermore, for any spin configuration $\sigma$, we have

|H_\delta(\sigma) - H(\sigma)| = \left| \sum_{b=\{x,y\}\in E} (J_b - J'_b) \sigma_x \sigma_y + \sum_{x\in V} (h_x - h'_x) \sigma_x \right|
\le \sum_{b\in E} |J_b - J'_b| + \sum_{x\in V} |h_x - h'_x|
\le \delta (|E| + |V|) = \delta k_G.

Then $\lVert H_\delta - H \rVert_\infty \le \delta k_G$, which concludes the proof. ∎

By the lemma above, it follows that

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge \mathbb{P}\left( \delta k_G \le \frac{1}{2} \varepsilon R_H \right),

and by using the fact that $R_H \ge \sqrt{v_H}$ (see Lemma 3.1), we conclude that

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge \mathbb{P}\left( \delta k_G \le \frac{1}{2} \varepsilon \sqrt{v_H} \right). \qquad (3.4)

Finally, we have the following estimate for the probability that $H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H$ holds, which constitutes one answer to question (2’).

Theorem 3.4.

Let $\{J_b\}_{b\in E}$ and $\{h_x\}_{x\in V}$ be mutually independent standard Gaussian random variables. It follows that

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge 1 - \gamma\left( k_G; \left( \frac{2\delta k_G}{\varepsilon} \right)^2 \right), \qquad (3.5)

where $\gamma(s;x)$ is the distribution function of the chi-square distribution with $s > 0$ degrees of freedom, that is,

\gamma(s;x) \coloneqq \frac{1}{2^{s/2} \Gamma(s/2)} \int_0^x t^{s/2-1} e^{-t/2} \, dt

for $x \ge 0$, and $\gamma(s;x) \coloneqq 0$ for $x < 0$.

Proof.

It follows from the above discussion that

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge \mathbb{P}\left( \delta k_G \le \frac{1}{2} \varepsilon \sqrt{v_H} \right)
= \mathbb{P}\left( v_H \ge \left( \frac{2\delta k_G}{\varepsilon} \right)^2 \right)
= 1 - \mathbb{P}\left( v_H < \left( \frac{2\delta k_G}{\varepsilon} \right)^2 \right).

Since $\{J_b\}_{b\in E}$ and $\{h_x\}_{x\in V}$ are mutually independent standard Gaussian random variables, the random variable $v_H$ is distributed according to the chi-square distribution with $k_G$ degrees of freedom. Therefore,

\mathbb{P}\left( v_H < \left( \frac{2\delta k_G}{\varepsilon} \right)^2 \right) = \gamma\left( k_G; \left( \frac{2\delta k_G}{\varepsilon} \right)^2 \right).

Thus, we obtain the lower bound on the target probability. ∎
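The lower bound (3.5) is straightforward to evaluate numerically. Below is a sketch, not from the paper, that computes $\gamma(s;x)$ via the standard series for the regularized lower incomplete gamma function, using only the Python standard library; `theorem_3_4_bound` is a hypothetical helper name introduced for illustration.

```python
import math

def chi2_cdf(s, x):
    """CDF gamma(s; x) of the chi-square distribution with s degrees of
    freedom, via the series for the regularized lower incomplete gamma
    function P(a, z) with a = s/2, z = x/2."""
    if x <= 0:
        return 0.0
    a, z = s / 2.0, x / 2.0
    term = math.exp(a * math.log(z) - z - math.lgamma(a + 1.0))
    total = term
    n = 0
    while term > 1e-16 * total:
        n += 1
        term *= z / (a + n)
        total += term
    return total

def theorem_3_4_bound(k_G, delta, eps):
    """Lower bound (3.5) on P(H(sigma~_G) - H(sigma_G) <= eps * R_H)."""
    return 1.0 - chi2_cdf(k_G, (2.0 * delta * k_G / eps) ** 2)

# Sanity check: for s = 2 the chi-square CDF is 1 - exp(-x/2).
assert abs(chi2_cdf(2, 3.0) - (1 - math.exp(-1.5))) < 1e-12
# Example evaluation: complete graph on 3 vertices (k_G = 6),
# round-off with N = 10 digits (delta = 2**-10), eps = 0.01.
bound = theorem_3_4_bound(6, 2**-10, 0.01)
assert 0.0 <= bound <= 1.0
```

This is the kind of computation behind the curves labeled "first method" in Figure 1: for given $\varepsilon$ one increases $N$ (decreasing $\delta = 2^{-N}$) until the bound exceeds the desired probability.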

3.3 Stability of ground states: second approach

Before we proceed, let us point out the fundamental difference between the uniform approach and the current approach to question (2’). Note that, if we repeat the computations from the proof of Theorem 3.2 in the particular case where $\tau = \tilde{\sigma}_G$ and $\sigma = \sigma_G$ and use the fact that $H_\delta(\sigma_G) \ge H_\delta(\tilde{\sigma}_G)$, then it follows that

H(\tilde{\sigma}_G) - H(\sigma_G) \le 2\delta \left( |W_{\sigma_G,\tilde{\sigma}_G}| + |D_{\sigma_G,\tilde{\sigma}_G}| \right). \qquad (3.6)

Recall that the proof of Theorem 3.4 fundamentally relied on the fact that, by using $L^\infty$-distance estimates, the left-hand side of equation (3.6) could be bounded above by $2\delta k_G$. The right-hand side of equation (3.6) is also bounded above by $2\delta k_G$; therefore, let us explore the geometry of the underlying graph $G$ in order to see whether better bounds are possible.

The value of $|W_{\sigma_G,\tilde{\sigma}_G}| + |D_{\sigma_G,\tilde{\sigma}_G}|$ depends on the underlying graph structure and on the relationship between the ground states $\sigma_G$ and $\tilde{\sigma}_G$. Therefore, we should check the value of $|W_{\sigma_G,\tilde{\sigma}_G}| + |D_{\sigma_G,\tilde{\sigma}_G}|$ for the intended problem. In general, we look for a uniform estimate of $|W_{\sigma,\tau}| + |D_{\sigma,\tau}|$ over all $\sigma$ and $\tau$, since the ground states $\sigma_G$ and $\tilde{\sigma}_G$ are unknown in practice. First, let us show the following lemma.

Lemma 3.5.

For any two configurations $\sigma$ and $\tau$, we have

|W_{\sigma,\tau}| \le (\deg G) \cdot \min\left\{ |D_{\sigma,\tau}|, |V \setminus D_{\sigma,\tau}| \right\},

where $\deg G$ stands for the maximum degree of $G$.

Proof.

Let us assume that $|D_{\sigma,\tau}| = s$ for some $s$ with $0 \le s \le |V|$, and enumerate $D_{\sigma,\tau} = \{x_1, \dots, x_s\} \subset V$. Moreover, $|V \setminus D_{\sigma,\tau}| = |V| - s$, so we can write $V \setminus D_{\sigma,\tau} = \{y_1, \dots, y_{|V|-s}\} \subset V$. By the definition of $D_{\sigma,\tau}$, we have $\sigma_{x_i} \tau_{x_i} = -1$ for every $i = 1, \dots, s$ and $\sigma_{y_j} \tau_{y_j} = 1$ for all $j = 1, \dots, |V|-s$. If $\{x_i, x_j\} \in E$ for distinct $i$ and $j$ in $\{1, \dots, s\}$, then

\sigma_{x_i} \sigma_{x_j} \tau_{x_i} \tau_{x_j} = (\sigma_{x_i} \tau_{x_i})(\sigma_{x_j} \tau_{x_j}) = (-1)^2 = 1.

Thus, $\{x_i, x_j\} \notin W_{\sigma,\tau}$. In a similar way, if $\{y_i, y_j\} \in E$ for distinct $i$ and $j$ in $\{1, \dots, |V|-s\}$, then $\{y_i, y_j\} \notin W_{\sigma,\tau}$. If $\{x_i, y_j\} \in E$ for some $i \in \{1, \dots, s\}$ and some $j \in \{1, \dots, |V|-s\}$, then

\sigma_{x_i} \sigma_{y_j} \tau_{x_i} \tau_{y_j} = (\sigma_{x_i} \tau_{x_i})(\sigma_{y_j} \tau_{y_j}) = (-1) \times 1 = -1.

Hence, $\{x_i, y_j\} \in W_{\sigma,\tau}$. It follows that

W_{\sigma,\tau} = \{ \{x,y\} \in E : x = x_i, \, y = y_j \text{ for some } i, j \}.

Since every edge in $W_{\sigma,\tau}$ joins $D_{\sigma,\tau}$ to its complement, counting these edges from the smaller of the two sides gives

|W_{\sigma,\tau}| \le (\deg G) \min\{ |D_{\sigma,\tau}|, |V \setminus D_{\sigma,\tau}| \}.
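Lemma 3.5 is easy to verify exhaustively on a small graph. The sketch below uses an assumed toy graph (a 4-cycle with one diagonal) and checks the bound over all pairs of configurations.

```python
import itertools

# Toy graph: a 4-cycle with one diagonal (an illustrative assumption).
V = [0, 1, 2, 3]
E = [(0, 1), (1, 2), (2, 3), (0, 3), (0, 2)]
deg_G = max(sum(1 for e in E if v in e) for v in V)  # maximum degree

violations = 0
for s in itertools.product([-1, 1], repeat=len(V)):
    for t in itertools.product([-1, 1], repeat=len(V)):
        # D: vertices with spin overlap -1;  W: edges with link overlap -1.
        D = [x for x in V if s[x] * t[x] == -1]
        W = [e for e in E if s[e[0]] * s[e[1]] * t[e[0]] * t[e[1]] == -1]
        if len(W) > deg_G * min(len(D), len(V) - len(D)):
            violations += 1
assert violations == 0  # Lemma 3.5 holds for every pair (sigma, tau)
```

The check confirms that every edge of $W_{\sigma,\tau}$ crosses between $D_{\sigma,\tau}$ and its complement, so the count is limited by the maximum degree times the smaller side.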

Proposition 3.6.

For any graph $G$ and any two spin configurations $\sigma$ and $\tau$, we have

|W_{\sigma,\tau}| + |D_{\sigma,\tau}| \le \frac{(\deg G + 1) |V|}{2}. \qquad (3.7)
Proof.

Using Lemma 3.5, if $|D_{\sigma,\tau}| \le |V|/2$, then

|D_{\sigma,\tau}| + |W_{\sigma,\tau}| \le (\deg G + 1) |D_{\sigma,\tau}| \le \frac{\deg G + 1}{2} |V|;

otherwise, if $|D_{\sigma,\tau}| > |V|/2$, it follows that

|D_{\sigma,\tau}| + |W_{\sigma,\tau}| \le (\deg G) |V \setminus D_{\sigma,\tau}| + |D_{\sigma,\tau}|
= (\deg G) |V| - (\deg G - 1) |D_{\sigma,\tau}|
\le (\deg G) |V| - \frac{\deg G - 1}{2} |V|
= \frac{\deg G + 1}{2} |V|.

Thus, we have the following theorem, which provides another answer to question (2’) (see Theorem 3.4 for an alternative approach).

Theorem 3.7.

Let $\{J_b\}_{b\in E}$ and $\{h_x\}_{x\in V}$ be mutually independent standard Gaussian random variables. Then, we have

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge 1 - \gamma\left( k_G; \left( \frac{\delta |V| (\deg G + 1)}{\varepsilon} \right)^2 \right). \qquad (3.8)
Proof.

Analogously to the proof of Theorem 3.4, it follows from equation (3.6), $R_H \ge \sqrt{v_H}$, and Proposition 3.6 that

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge \mathbb{P}\left( \delta \le \frac{\varepsilon \sqrt{v_H}}{2 (|W_{\sigma_G,\tilde{\sigma}_G}| + |D_{\sigma_G,\tilde{\sigma}_G}|)} \right)
= \mathbb{P}\left( v_H \ge \left( \frac{2\delta (|W_{\sigma_G,\tilde{\sigma}_G}| + |D_{\sigma_G,\tilde{\sigma}_G}|)}{\varepsilon} \right)^2 \right)
\ge \mathbb{P}\left( v_H \ge \left( \frac{\delta |V| (\deg G + 1)}{\varepsilon} \right)^2 \right)
= 1 - \gamma\left( k_G; \left( \frac{\delta |V| (\deg G + 1)}{\varepsilon} \right)^2 \right),

where we used the fact that $v_H$ is distributed according to a chi-square distribution with $k_G$ degrees of freedom. ∎

3.4 Comparison between approaches

In the rest of this section, we compare the methods presented in Sections 3.2 and 3.3 by means of several examples to which we apply Proposition 3.6.

The first example is the case of complete graphs, which includes the SK model. For complete graphs, Theorem 3.7 provides better results than Theorem 3.4.

Example 3.8.

If $G$ is a complete graph (that is, all vertices are connected to each other) with $N$ vertices, then we have

\frac{\deg G + 1}{2} |V| = \frac{N^2}{2}.

On the other hand, the value of $k_G$ is given by

k_G := |E| + |V| = \frac{N(N-1)}{2} + N = \frac{N(N+1)}{2}.

Therefore,

\frac{\deg G + 1}{2} |V| < k_G.

Hence the uniform upper bound for $|W_{\sigma,\tau}| + |D_{\sigma,\tau}|$ obtained in Proposition 3.6 is always better than $k_G$. Furthermore, we can calculate the explicit value of $|W_{\sigma,\tau}| + |D_{\sigma,\tau}|$ when $G$ is a complete graph. From the proof of Lemma 3.5, for a complete graph we have $|W_{\sigma,\tau}| = |D_{\sigma,\tau}| (|V| - |D_{\sigma,\tau}|)$. Therefore,

|W_{\sigma,\tau}| + |D_{\sigma,\tau}| = |D_{\sigma,\tau}| (N + 1 - |D_{\sigma,\tau}|) \le \frac{(N+1)^2}{4},

and the proof of Theorem 3.7 implies

\mathbb{P}\left( H(\tilde{\sigma}_G) - H(\sigma_G) \le \varepsilon R_H \right) \ge \mathbb{P}\left( v_H \ge \left( \frac{2\delta (|W_{\sigma_G,\tilde{\sigma}_G}| + |D_{\sigma_G,\tilde{\sigma}_G}|)}{\varepsilon} \right)^2 \right)
\ge \mathbb{P}\left( v_H \ge \frac{\delta^2 (N+1)^4}{4 \varepsilon^2} \right)
= 1 - \gamma\left( k_G; \frac{\delta^2 (N+1)^4}{4 \varepsilon^2} \right).

The following example considers King’s graphs, for which Theorem 3.7 again works better than Theorem 3.4, as in the previous example.

Example 3.9.

Let GG be an N×MN\times M King’s graph. The N×MN\times M King’s graph can be visualized as an N×MN\times M chessboard where each of its squares corresponds to a vertex of the graph, and each edge represents a legal move of a king in a chess game. In that way, the inner vertices of the graph have 88 neighbors each, while the vertices in the corners have 33 neighbors each, and each of the remaining vertices on the sides of the graph has 55 neighbors. For an N×MN\times M King’s graph, we have

degG+12|V|=92MN,\displaystyle\frac{\deg G+1}{2}\lvert V\rvert=\frac{9}{2}MN,

since $\deg G=8$. Moreover, we have

\[
k_{G}=\lvert E\rvert+\lvert V\rvert=5MN-3(M+N)+2.
\]

If $M$ and $N$ are sufficiently large, then we have

\[
\frac{\deg G+1}{2}\lvert V\rvert<k_{G}.
\]
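The counts above can be checked programmatically. The following sketch (the function name and test sizes are ours) builds the edge set of an $n\times m$ King's graph directly and verifies $k_{G}=\lvert E\rvert+\lvert V\rvert=5nm-3(n+m)+2$, as well as the comparison with $\frac{9}{2}nm$ for a large board.

```python
def kings_graph_counts(n, m):
    """Vertex and edge counts for the n x m King's graph (n rows, m columns)."""
    vertices = [(i, j) for i in range(n) for j in range(m)]
    # Four move directions so that each undirected edge is generated exactly once.
    moves = [(0, 1), (1, 0), (1, 1), (1, -1)]
    edges = [((i, j), (i + di, j + dj))
             for (i, j) in vertices for (di, dj) in moves
             if 0 <= i + di < n and 0 <= j + dj < m]
    return len(vertices), len(edges)

for n, m in [(3, 3), (5, 8), (20, 20)]:
    v, e = kings_graph_counts(n, m)
    assert e + v == 5 * n * m - 3 * (n + m) + 2  # k_G = |E| + |V|

v, e = kings_graph_counts(20, 20)
assert 9 * v // 2 < e + v  # (deg G + 1)/2 * |V| < k_G once the board is large
print(v, e)  # → 400 1482
```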

In the following example, unlike the previous ones, the estimate provided by Theorem 3.4 is better suited than that of Theorem 3.7.

Example 3.10.

If $G$ is a star graph with degree $k\geq 3$, that is, $G$ consists of one vertex placed in the center and $k$ other vertices connected only with the center, then

\[
\frac{\deg G+1}{2}\lvert V\rvert=\frac{(k+1)^{2}}{2}.
\]

Furthermore, we have

\[
k_{G}=\lvert E\rvert+\lvert V\rvert=2k+1.
\]

Therefore, we obtain

\[
\frac{\deg G+1}{2}\lvert V\rvert>k_{G}.
\]
Figure 1: Minimal number of digits to be considered in the binary expansions of the parameters so that, with probability higher than $99\%$, the difference $H(\tilde{\sigma}_{G})-H(\sigma_{G})$ represents a value smaller than $1\%$ of $R_{H}$, as a function of the size of the graph. (a) Complete graph. (b) $N\times N$ King's graph. (c) Star graph.

According to the above examples, we conclude that it is not always possible to guarantee that the uniform upper bound of $\lvert W_{\sigma,\tau}\rvert+\lvert D_{\sigma,\tau}\rvert$ provided by Proposition 3.6 works better than $k_{G}=\lvert E\rvert+\lvert V\rvert$. Thus, we may have to consider such bounds separately for different graphs in order to obtain an optimal estimate for the probability that the inequality $H(\tilde{\sigma}_{G})-H(\sigma_{G})<\varepsilon R_{H}$ holds.

Let us consider again the problem of stability where we take into account only a finite number of terms in the binary expansions of the parameters $(J_{b})_{b\in E}$ and $(h_{x})_{x\in V}$, as illustrated at the beginning of Section 2. In Figure 1, corresponding to the sizes of different graphs, we show the minimum number of digits in the binary expansions of such parameters that must be considered so that, with probability at least $99\%$, the difference $H(\tilde{\sigma}_{G})-H(\sigma_{G})$ represents a value smaller than $1\%$ of $R_{H}$. In each plot we compare the different methods developed in this paper, where the first method corresponds to the estimate from Theorem 3.4 and the second method corresponds to the estimate from Theorem 3.7. In Figure 1(a), we also included a third estimate, from Example 3.8, which is sharper and gives better results than the other methods. As expected, the second method provides better results than the first one for complete graphs and for $N\times N$ King's graphs when $N$ is sufficiently large. On the other hand, for star graphs the first method is more appropriate; moreover, a clear discrepancy in performance is easily observed.

4 Stability under a perturbed graph

In this section, we consider the stability of the energy landscape when a given spin system defined on a graph is compressed into a smaller subsystem. Unlike in the previous sections, we fix a given Hamiltonian and assume a sufficient condition that guarantees the existence of a nontrivial subset of the entire vertex set outside of which we can randomly assign any spin configuration while the energy of the system is kept under control up to a certain error margin.

Let $G=(V,E)$ be a finite simple graph, and let $H$ be the Hamiltonian on $G$ given by

\[
H(\sigma)=-\sum_{\{x,y\}\in E}J_{x,y}\sigma_{x}\sigma_{y}
\]

for every configuration $\sigma\in\{-1,1\}^{V}$, where $\{J_{x,y}\}_{\{x,y\}\in E}$ is a collection of mutually independent standard Gaussian random variables. What we would like to show is that we can compress the whole system into a nontrivial subsystem in such a way that the energy landscape of that subsystem is close to the original one up to a given error margin. More precisely, our goal is to find a class of examples for which, given a positive constant $\varepsilon$, there is a positive $\delta$ such that the subset $V_{0}=V_{0}(\delta)$ of $V$, defined from the relation

\[
V\setminus V_{0}\coloneqq\left\{x\in V:\text{$\lvert J_{x,y}\rvert<\delta$ holds for every $y$ such that $\{x,y\}\in E$}\right\},
\]

is non-trivial, has size comparable to that of $V$, and satisfies

\[
\sup_{\sigma,\eta\in\{-1,1\}^{V}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\eta_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H} \quad (4.1)
\]

with high probability; see Figure 2.

Figure 2: We want to approximate the energy of a configuration $\sigma$ defined on the whole vertex set $V$ by the energy of a configuration that coincides with $\sigma$ on $V_{0}$ and whose spins $\eta_{i}$ in the set $V\setminus V_{0}$ are arbitrary.

4.1 One-dimensional discrete torus

Let us solve the problem stated above in the particular case where the graph $G$ is a one-dimensional discrete torus.

Theorem 4.1.

Let $G=(V,E)$ be a one-dimensional discrete torus with $N$ vertices, that is, $V=\{1,2,\dots,N\}$ and $E=\{\{1,2\},\{2,3\},\dots,\{N-1,N\},\{N,1\}\}$. Given $\varepsilon>0$, let $\delta$ be a positive number such that $\delta<\varepsilon/\sqrt{2\pi}$. Then, if $A$ is a subset of the event $\{0<\lvert V\setminus V_{0}\rvert<N\}$, it follows that

\[
\mathbb{P}\left(\left\{\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\right\}\cap A\right)\geq\mathbb{P}(A)-\frac{1-\frac{2}{\pi}}{N\left(\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right)^{2}} \quad (4.2)
\]

holds for each $N\geq 3$. In particular, given constants $C>0$ and $\alpha\in[0,1)$, we have

\begin{align*}
&\mathbb{P}\left(\left\{\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\right\}\cap\left\{CN^{\alpha}\leq\lvert V\setminus V_{0}\rvert<N\right\}\right)\\
&\geq\left(1-\frac{C}{N^{1-\alpha}\theta^{2}}\right)^{2}\frac{1}{1+\frac{1+2\theta-3\theta^{2}}{N\theta^{2}}}-\theta^{N}-\frac{1-\frac{2}{\pi}}{N\left(\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right)^{2}} \quad (4.3)
\end{align*}

for $N$ sufficiently large, where

\[
\theta=\int_{-\delta}^{\delta}\frac{e^{-\xi^{2}/2}}{\sqrt{2\pi}}\,d\xi. \quad (4.4)
\]

Before proceeding to the proof of the result above, let us clarify the theorem by providing the reader with some practical results. Let us consider the particular case where $C=1$ and $\alpha\in(0,1)$. Corresponding to different values of $\varepsilon$ and $\delta$, we obtain lower bounds for the probability that the size of $V\setminus V_{0}$ is at least $N^{\alpha}$ and condition (4.1) holds; see Table 1 below.

\begin{tabular}{cccccc}
$N$ & $\varepsilon$ & $\delta$ & $\alpha$ & Minimum size of $V\setminus V_{0}$ & Right-hand side of (4.3)\\
$10^{8}$ & $0.05$ & $0.0198$ & $0.4$ & $1584$ & $0.877$\\
$10^{8}$ & $0.05$ & $0.0198$ & $0.5$ & $10^{4}$ & $0.361$\\
$10^{8}$ & $0.1$ & $0.0398$ & $0.5$ & $10^{4}$ & $0.810$\\
$10^{12}$ & $0.01$ & $0.00398$ & $0.5$ & $10^{6}$ & $0.811$\\
$10^{12}$ & $0.05$ & $0.0199$ & $0.5$ & $10^{6}$ & $0.992$\\
$10^{12}$ & $0.1$ & $0.0399$ & $0.5$ & $10^{6}$ & $0.998$\\
$10^{12}$ & $0.05$ & $0.0199$ & $0.6$ & $\approx 1.58\times 10^{7}$ & $0.879$\\
$10^{12}$ & $0.05$ & $0.0199$ & $0.65$ & $\approx 6.31\times 10^{7}$ & $0.563$\\
\end{tabular}

Table 1: Applications of Theorem 4.1.
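The last column of Table 1 can be reproduced by evaluating the right-hand side of (4.3) directly, computing $\theta$ in (4.4) through the error function as $\theta=\operatorname{erf}(\delta/\sqrt{2})$. The sketch below (function name ours) does this for two rows of the table.

```python
import math

def rhs_43(N, eps, delta, alpha, C=1.0):
    """Right-hand side of (4.3) for the given parameters."""
    theta = math.erf(delta / math.sqrt(2))  # theta = P(|J| < delta), cf. (4.4)
    main = (1 - C * N ** alpha / (N * theta ** 2)) ** 2 \
           / (1 + (1 + 2 * theta - 3 * theta ** 2) / (N * theta ** 2))
    chebyshev = (1 - 2 / math.pi) \
                / (N * (math.sqrt(2 / math.pi) - 2 * delta / eps) ** 2)
    return main - theta ** N - chebyshev

print(round(rhs_43(1e8, 0.05, 0.0198, 0.4), 3))   # → 0.877 (first row of Table 1)
print(round(rhs_43(1e12, 0.05, 0.0199, 0.5), 3))  # → 0.992
```

For the parameter ranges of the table, the term $\theta^{N}$ underflows to zero and the Chebyshev correction is of order $10^{-4}$ or smaller, so the Paley–Zygmund factor dominates.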

Let us observe that, for any pair $\sigma,\tau$ of spin configurations, we have

\begin{align*}
\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\backslash V_{0}})\rvert
&=\left|\sum_{x\in V_{0}}\sum_{\substack{y\in V\backslash V_{0}\\ \{x,y\}\in E}}J_{x,y}\sigma_{x}(\sigma_{y}-\tau_{y})+\sum_{\substack{\{x,y\}\subseteq V\backslash V_{0}\\ \{x,y\}\in E}}J_{x,y}(\sigma_{x}\sigma_{y}-\tau_{x}\tau_{y})\right|\\
&=\left|\sum_{x\in V_{0}}\sum_{\substack{y\in V\backslash V_{0}\\ \{x,y\}\in E}}J_{x,y}\sigma_{x}\sigma_{y}(1-\sigma_{y}\tau_{y})+\sum_{\substack{\{x,y\}\subseteq V\backslash V_{0}\\ \{x,y\}\in E}}J_{x,y}\sigma_{x}\sigma_{y}(1-\sigma_{x}\tau_{x}\sigma_{y}\tau_{y})\right|\\
&=\left|\sum_{y\in V\backslash V_{0}}\sum_{\substack{x\in V\\ \{x,y\}\in E}}J_{x,y}\sigma_{x}\sigma_{y}\left[\mathbbm{1}_{x\in V_{0}}(1-\sigma_{y}\tau_{y})+\mathbbm{1}_{x\in V\backslash V_{0}}(1-\sigma_{x}\tau_{x}\sigma_{y}\tau_{y})/2\right]\right|\\
&\leq 2\sum_{y\in V\backslash V_{0}}\sum_{\substack{x\in V\\ \{x,y\}\in E}}\lvert J_{x,y}\rvert\leq 2\delta\sum_{y\in V\backslash V_{0}}\deg(y).
\end{align*}

In particular, if $G$ is the one-dimensional torus as in Theorem 4.1, it follows that

\[
\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert\leq 4\delta\lvert V\setminus V_{0}\rvert. \quad (4.5)
\]
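As a sanity check of (4.5), the following sketch brute-forces the supremum on a small one-dimensional torus; the size $N=8$, the seed, and the threshold $\delta=1$ are arbitrary choices of ours, the threshold being taken large so that $V\setminus V_{0}$ is typically nonempty.

```python
import itertools
import random

random.seed(7)
N, delta = 8, 1.0
J = {x: random.gauss(0.0, 1.0) for x in range(N)}  # J[x] couples x and x+1 (mod N)

def H(s):
    return -sum(J[x] * s[x] * s[(x + 1) % N] for x in range(N))

# V \ V0: sites whose two incident couplings are both smaller than delta in modulus.
outside = [x for x in range(N)
           if abs(J[(x - 1) % N]) < delta and abs(J[x]) < delta]

sup_diff = 0.0
for s in itertools.product([-1, 1], repeat=N):
    for eta in itertools.product([-1, 1], repeat=len(outside)):
        t = list(s)
        for x, v in zip(outside, eta):
            t[x] = v  # overwrite the spins outside V0 arbitrarily
        sup_diff = max(sup_diff, abs(H(s) - H(tuple(t))))

assert sup_diff <= 4 * delta * len(outside)  # inequality (4.5)
print(len(outside), sup_diff)
```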

Now, let us prepare two lemmas in order to prove Theorem 4.1.

Lemma 4.2.

For $R_{H}=\sup_{\xi,\eta}\lvert H(\xi)-H(\eta)\rvert$, we have

\[
2\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-4\min_{x=1,\dots,N}\lvert J_{x,x+1}\rvert\leq R_{H}\leq 2\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert, \quad (4.6)
\]

hence, with probability 1,

\[
R_{H}\sim 2\sqrt{\frac{2}{\pi}}\,N\quad\text{as $N$ approaches infinity}.
\]
Proof.

Without loss of generality, we assume $\min_{x}\lvert J_{x,x+1}\rvert=\lvert J_{N,1}\rvert$. Let us fix $\sigma_{1}=1$. Then, depending on the sign of $J_{1,2}$, we can determine $\sigma_{2}$ to minimize (or maximize) $H(\sigma)$. We continue this procedure up to $\sigma_{N}$ and we have

\begin{align*}
\min_{\sigma\in\{-1,1\}^{N}}H(\sigma) &\leq-\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert+2\min_{x=1,\dots,N}\lvert J_{x,x+1}\rvert,\\
\max_{\sigma\in\{-1,1\}^{N}}H(\sigma) &\geq\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-2\min_{x=1,\dots,N}\lvert J_{x,x+1}\rvert
\end{align*}

(the additional terms appear if frustration exists between $\sigma_{N}$ and $\sigma_{1}$). Hence the inequality of the lemma is proven.

To show the last statement, we divide all terms by $N$ and use the law of large numbers for the folded normal distribution. ∎
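For small $N$, inequality (4.6) can be verified exactly by enumerating all $2^{N}$ configurations; the sketch below (the size $N=10$ and the seed are arbitrary) compares the brute-force value of $R_{H}$ with both bounds.

```python
import itertools
import random

random.seed(1)
N = 10
J = [random.gauss(0.0, 1.0) for _ in range(N)]  # J[x] couples x and x+1 (mod N)

def H(s):
    return -sum(J[x] * s[x] * s[(x + 1) % N] for x in range(N))

# Exact range of the energy landscape on the one-dimensional torus.
energies = [H(s) for s in itertools.product([-1, 1], repeat=N)]
R_H = max(energies) - min(energies)

total = 2 * sum(abs(j) for j in J)
lower = total - 4 * min(abs(j) for j in J)
assert lower <= R_H <= total  # inequality (4.6)
print(R_H)
```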

Lemma 4.3.

If we assume $N\geq 3$, then it follows that

\[
\mathbb{E}\left[\lvert V\setminus V_{0}\rvert\right]=N\theta^{2}
\]

and

\[
\mathbb{E}\left[\lvert V\setminus V_{0}\rvert^{2}\right]=N\theta^{2}\left(1+2\theta-3\theta^{2}+N\theta^{2}\right).
\]
Proof.

For each $i=1,\dots,N$, let us define a random variable $X_{i}$ by letting

\[
X_{i}=\begin{cases}1&\text{if $\lvert J_{i-1,i}\rvert<\delta$ and $\lvert J_{i,i+1}\rvert<\delta$,}\\ 0&\text{otherwise}.\end{cases}
\]

Then, by the definition of $V_{0}$, the condition $i\in V\setminus V_{0}$ is equivalent to $X_{i}=1$. Therefore, the expected value of the size of $V\setminus V_{0}$ is given by

\[
\mathbb{E}\left[\lvert V\setminus V_{0}\rvert\right]=\mathbb{E}\left[\sum_{i=1}^{N}X_{i}\right]=\sum_{i=1}^{N}\mathbb{E}\left[X_{i}\right]=N\theta^{2}.
\]

Furthermore, we write

\[
\lvert V\setminus V_{0}\rvert^{2}=\sum_{i=1}^{N}X_{i}^{2}+2\sum_{i=1}^{N}X_{i}X_{i+1}+\sum_{i=1}^{N}\sum_{j\notin\{i-1,\,i,\,i+1\}}X_{i}X_{j}.
\]

Here, the random variables $X_{i}$ and $X_{i+1}$ are not mutually independent, but we have

\[
X_{i}X_{i+1}=\begin{cases}1&\text{if $\lvert J_{i-1,i}\rvert<\delta$, $\lvert J_{i,i+1}\rvert<\delta$ and $\lvert J_{i+1,i+2}\rvert<\delta$,}\\ 0&\text{otherwise}.\end{cases}
\]

Thus, it follows that the identity

\begin{align*}
\mathbb{E}\left[\lvert V\setminus V_{0}\rvert^{2}\right] &=N\theta^{2}+2N\theta^{3}+N(N-3)\theta^{4}\\
&=N\theta^{2}\left(1+2\theta-3\theta^{2}+N\theta^{2}\right)
\end{align*}

holds, and we complete the proof. ∎
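A quick Monte Carlo experiment agrees with the two moments computed in Lemma 4.3, with $\theta=\operatorname{erf}(\delta/\sqrt{2})$ as in (4.4); the parameters below ($N=50$, $\delta=0.5$, the seed, and the number of trials) are our own choices.

```python
import math
import random

random.seed(0)
N, delta, trials = 50, 0.5, 20000
theta = math.erf(delta / math.sqrt(2))  # P(|J| < delta) for a standard Gaussian

m1 = m2 = 0.0
for _ in range(trials):
    J = [random.gauss(0.0, 1.0) for _ in range(N)]  # J[x] couples x and x+1 (mod N)
    # X_i = 1 iff both couplings incident to site i are below delta in modulus.
    X = [1 if abs(J[i - 1]) < delta and abs(J[i]) < delta else 0 for i in range(N)]
    size = sum(X)  # |V \ V0|
    m1 += size / trials
    m2 += size ** 2 / trials

assert abs(m1 - N * theta ** 2) < 0.1 * N * theta ** 2
assert abs(m2 - N * theta ** 2 * (1 + 2 * theta - 3 * theta ** 2 + N * theta ** 2)) \
       < 0.1 * m2
print(round(m1, 2), round(N * theta ** 2, 2))
```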

Proof of Theorem 4.1.

Let us start by splitting the probability on the left-hand side of equation (4.2) as

\begin{align*}
&\mathbb{P}\left(\left\{\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\right\}\cap A\right)\\
&\geq\mathbb{P}\left(\left\{\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\right\}\cap\left\{\left|\frac{1}{N}\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-\sqrt{\frac{2}{\pi}}\right|<\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right\}\cap A\right)\\
&=\mathbb{P}\left(\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\,\middle|\,B\right)\mathbb{P}(B),
\end{align*}

where $B$ is the event given by

\[
B=\left\{\left|\frac{1}{N}\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-\sqrt{\frac{2}{\pi}}\right|<\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right\}\cap A. \quad (4.7)
\]

From equation (4.5), Lemma 4.2, and the fact that, on the event $B$ (a subset of $A$), we have $\min_{x=1,\dots,N}\lvert J_{x,x+1}\rvert\leq\delta$, it follows that the conditional probability above satisfies

\begin{align*}
\mathbb{P}\left(\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\,\middle|\,B\right)
&\geq\mathbb{P}\left(2\delta\lvert V\setminus V_{0}\rvert<\varepsilon\left(\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-2\min_{x=1,\dots,N}\lvert J_{x,x+1}\rvert\right)\,\middle|\,B\right)\\
&\geq\mathbb{P}\left(\frac{2\delta}{\varepsilon}\lvert V\setminus V_{0}\rvert<\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-2\delta\,\middle|\,B\right)\\
&=\mathbb{P}\left(\frac{2\delta}{\varepsilon}\,\frac{\lvert V\setminus V_{0}\rvert+\varepsilon}{N}<\frac{1}{N}\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert\,\middle|\,B\right)\\
&\geq\mathbb{P}\left(\frac{2\delta}{\varepsilon}<\frac{1}{N}\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert\,\middle|\,B\right)=1,
\end{align*}

and therefore

\[
\mathbb{P}\left(\sup_{\sigma,\tau\in\{-1,1\}^{N}}\left\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\setminus V_{0}})\right\rvert<\varepsilon R_{H}\,\middle|\,B\right)=1. \quad (4.8)
\]

The rest of the proof consists of estimating the probability of the event $B$. Let us write

\[
\mathbb{P}(B)\geq\mathbb{P}(A)+\mathbb{P}\left(\left|\frac{1}{N}\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-\sqrt{\frac{2}{\pi}}\right|<\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right)-1. \quad (4.9)
\]

It follows from Chebyshev’s inequality that

\[
\mathbb{P}\left(\left|\frac{1}{N}\sum_{x=1}^{N}\lvert J_{x,x+1}\rvert-\sqrt{\frac{2}{\pi}}\right|\geq\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right)\leq\frac{\sigma_{FG}^{2}}{N\left(\sqrt{\frac{2}{\pi}}-\frac{2\delta}{\varepsilon}\right)^{2}}, \quad (4.10)
\]

where $\sigma_{FG}^{2}$ is the variance of the folded Gaussian random variable $Y=\lvert J_{1,2}\rvert$, which is equal to $1-\frac{2}{\pi}$. By combining equations (4.8), (4.9) and (4.10), equation (4.2) follows.

In particular, let $A$ be the event given by $A=\{CN^{\alpha}\leq\lvert V\setminus V_{0}\rvert<N\}$. Note that

\[
\mathbb{P}(A)=\mathbb{P}\left(\lvert V\setminus V_{0}\rvert\geq CN^{\alpha}\right)-\mathbb{P}\left(\lvert V\setminus V_{0}\rvert=N\right), \quad (4.11)
\]

where $\mathbb{P}(\lvert V\setminus V_{0}\rvert=N)=\theta^{N}$. By the Paley–Zygmund inequality and Lemma 4.3, we have

\begin{align*}
\mathbb{P}\left(\lvert V\setminus V_{0}\rvert\geq CN^{\alpha}\right)
&\geq\left(1-\frac{CN^{\alpha}}{\mathbb{E}[\lvert V\setminus V_{0}\rvert]}\right)^{2}\frac{\mathbb{E}[\lvert V\setminus V_{0}\rvert]^{2}}{\mathbb{E}[\lvert V\setminus V_{0}\rvert^{2}]}\\
&=\left(1-\frac{CN^{\alpha}}{N\theta^{2}}\right)^{2}\frac{1}{1+\frac{1+2\theta-3\theta^{2}}{N\theta^{2}}}
\end{align*}

for $N$ sufficiently large; therefore, inequality (4.3) holds. ∎

4.2 Generalizations

The most natural step in further investigations is to extend the results obtained in Section 4.1 to the case where we include i.i.d. standard Gaussian external fields, and also to a larger class of examples, such as the $d$-dimensional torus or even finite graphs with bounded degree. Note that, assuming the absence of external fields, in the same way as we obtained inequality (4.5), one can show that

\[
\lvert H(\sigma)-H(\sigma_{V_{0}},\tau_{V\backslash V_{0}})\rvert\leq 2\delta\sum_{y\in V\backslash V_{0}}\deg(y) \quad (4.12)
\]

holds for any graph. So, analogously to the one-dimensional torus case, it is expected that if we find a lower bound for $R_{H}$, as we did in Lemma 4.2, which is comparable to the right-hand side of equation (4.12), then we may derive an extension of our results to a larger class of graphs. Some numerical results suggest that, for an Ising spin system on a $d$-dimensional torus with i.i.d. standard Gaussian spin-spin couplings and without external fields, $R_{H}$ is still of order $N$, but a rigorous proof of that observation is still lacking due to the difficulty of dealing with frustrated configurations in a higher-dimensional torus.

The simulations presented in this section were performed by using a modified version of the stochastic cellular automata algorithm studied in [5, 6] to estimate the maximum and minimum values of the Hamiltonian $H$, in order to find an approximation of $R_{H}$ corresponding to different values of $N$. Note that, in such plots, each dot represents the value of $R_{H}$ (resp. $R_{H}/N$) corresponding to a torus with $N$ vertices for one realization of the random spin-spin couplings (i.i.d. standard Gaussian random variables). In the one-dimensional case (see Figure 3), we see that the value of $R_{H}/N$ approaches $2\sqrt{2/\pi}\approx 1.5957$, as expected from Lemma 4.2.

Figure 3: The dependence of $R_{H}$ on the size of the system $N$ in the one-dimensional case and its asymptotic behavior as $N$ grows.

Now, for the two- and three-dimensional cases (see Figure 4), when we consider larger values of $N$, the value of $R_{H}/N$ seems to approach the values $2.564$ and $3.329$, respectively. Note that such simulated values represent lower bounds for the true value of the limit of $R_{H}/N$ as $N$ approaches infinity, so the true limits are still unknown. Furthermore, we conjecture that such a limit exists in any dimension and that the random variable $R_{H}/N$ converges almost surely, since, in higher dimensions, its simulated values seem to fluctuate less around an asymptotic limit than in the one-dimensional case.

It is straightforward to show that, for the $d$-dimensional torus, we have

\[
R_{H}\leq 2\sum_{k=1}^{d}\sum_{i\in V}\lvert J_{i,i+\mathbf{e}_{k}}\rvert,
\]

where $\mathbf{e}_{k}$ stands for the $k$-th canonical vector of the $d$-dimensional Euclidean space; hence

\[
\limsup_{N\to\infty}\frac{R_{H}}{N}\leq 2d\sqrt{\frac{2}{\pi}}. \quad (4.13)
\]

Moreover, it follows from the fact that $R_{H}\geq\sqrt{\sum_{b}J_{b}^{2}}$ (see Lemma 3.1) and the Cauchy–Schwarz inequality that

\[
\frac{1}{\sqrt{N}}\sum_{b}\lvert J_{b}\rvert\leq R_{H}. \quad (4.14)
\]
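While the limit of $R_{H}/N$ in higher dimensions remains open, the upper bound $R_{H}\leq 2\sum_{k}\sum_{i}\lvert J_{i,i+\mathbf{e}_{k}}\rvert$ can at least be checked by brute force on a very small two-dimensional torus; the $3\times 3$ size and the seed below are arbitrary choices of ours.

```python
import itertools
import random

random.seed(3)
L = 3  # 3 x 3 two-dimensional torus: 9 vertices, 18 edges
V = [(i, j) for i in range(L) for j in range(L)]
E = [((i, j), ((i + 1) % L, j)) for (i, j) in V] + \
    [((i, j), (i, (j + 1) % L)) for (i, j) in V]
J = {e: random.gauss(0.0, 1.0) for e in E}

def H(assign):
    return -sum(J[(u, v)] * assign[u] * assign[v] for (u, v) in E)

# Exact energy range over all 2^9 configurations.
energies = []
for spins in itertools.product([-1, 1], repeat=len(V)):
    energies.append(H(dict(zip(V, spins))))
R_H = max(energies) - min(energies)

assert R_H <= 2 * sum(abs(j) for j in J.values())  # the upper bound behind (4.13)
print(round(R_H / len(V), 3))
```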

Therefore, we see that there is still room for improvement, and rigorous proofs concerning the existence and determination of the limit $\lim_{N\to\infty}R_{H}/N$ are still needed, giving rise to a mathematical problem that is interesting in its own right.

Figure 4: The dependence of $R_{H}$ on the size of the system $N$ in the two- and three-dimensional cases and their asymptotic behavior as $N$ grows. (a), (b) Two-dimensional. (c), (d) Three-dimensional.

Acknowledgment

This work was supported by JST CREST Grant Number JP22180021, Japan. We would like to thank Takashi Takemoto and Normann Mertig of Hitachi, Ltd., for providing us with a stimulating platform for the weekly meetings at the Global Research Center for Food & Medical Innovation (FMI) of Hokkaido University. We would also like to thank Hiroshi Teramoto of Kansai University, as well as Masamitsu Aoki, Yoshinori Kamijima, Katsuhiro Kamakura, Suguru Ishibashi and Takuka Saito of the Department of Mathematics, for valuable comments and encouragement at the aforementioned meetings at FMI.

References

  • [1] A. Lucas, Ising formulations of many NP problems, Front. Phys. 12 (2014). https://doi.org/10.3389/fphy.2014.00005
  • [2] P. Dai Pra, B. Scoppola, E. Scoppola, Sampling from a Gibbs measure with pair interaction by means of PCA, J. Stat. Phys. 149 (2012): 722–737.
  • [3] B. Hajek, Cooling schedules for optimal annealing, Math. Oper. Res. 13 (2) (1988): 311–329.
  • [4] M. C. Robini, Theoretically grounded acceleration techniques for simulated annealing, Handbook of Optimization (2013).
  • [5] S. Handa, K. Kamakura, Y. Kamijima, A. Sakai, Finding optimal solutions by stochastic cellular automata, arXiv:1906.06645.
  • [6] B. H. Fukushima-Kimura, S. Handa, K. Kamakura, Y. Kamijima, A. Sakai, Mixing time and simulated annealing for the stochastic cellular automata, arXiv:2007.11287v2.
  • [7] T. Albash, V. M. Mayor, I. Hen, Analog errors in Ising machines, arXiv:1806.03744.