Necessary and Sufficient Conditions for Convergence to the Semicircle Distribution

Calvin Wooyoung Chin

Abstract.

We consider random Hermitian matrices with independent upper triangular entries. Wigner’s semicircle law says that under certain additional assumptions, the empirical spectral distribution converges to the semicircle distribution. We characterize convergence to semicircle in terms of the variances of the entries, under natural assumptions such as the Lindeberg condition. The result extends to certain matrices with entries having infinite second moments. As a corollary, another characterization of semicircle convergence is given in terms of convergence in distribution of the row sums to the standard normal distribution.

1. Introduction

Let $W_{n}$ , for each $n\in\mathbf{N}$ , be a random $n\times n$ Hermitian matrix whose upper triangular entries are independent. We call $(W_{n})_{n\in\mathbf{N}}$ a Hermitian Wigner ensemble. In case $W_{n}$ is real symmetric for all $n\in\mathbf{N}$ , we call $(W_{n})_{n\in\mathbf{N}}$ a symmetric Wigner ensemble. We write $W_{n}=(w_{ij})_{i,j=1}^{n}$ throughout this paper. If $\lambda_{1}(W_{n})\geq\cdots\geq\lambda_{n}(W_{n})$ are the eigenvalues of $W_{n}$ counted with multiplicity, then the empirical spectral distribution $\mu_{W_{n}}$ of $W_{n}$ is defined by

\mu_{W_{n}}:=\frac{1}{n}\sum_{i=1}^{n}\delta_{\lambda_{i}(W_{n})}.

Since $\mu_{W_{n}}$ is a random measure, we can think of the mean measure $\operatorname{\mathbf{E}}\mu_{W_{n}}$ , which is defined and treated in Appendix A.

Let us use the term semicircle law to refer to a class of theorems that state, under certain conditions, that $\mu_{W_{n}}$ converges in some sense to the semicircle distribution $\mu_{\mathrm{sc}}$ on $\mathbf{R}$ given by

\mu_{\mathrm{sc}}(dx):=\sqrt{(4-x^{2})_{+}}\,dx.

(We let $x_{+}:=\max\{x,0\}$ .) Wigner initiated the spectral study of random matrices by proving the following very first version of the semicircle law in [Wig55, Wig58].

Theorem 1.1 (semicircle law, Wigner).

Let $(W_{n})_{n\in\mathbf{N}}$ be a symmetric Wigner ensemble such that the upper triangular entries of $W_{n}$ have identical symmetric distribution with mean zero and variance $1/n$ . If for each $k\in\mathbf{N}$ we have

(1.1)

\operatorname{\mathbf{E}}|w_{ij}|^{k}\leq B_{k}n^{-k/2}\qquad\text{for some $B_{k}<\infty$ independent of $n,i,j$,}

then $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ . Here $\Rightarrow$ denotes convergence in distribution.

Subsequent works by [Arn71], [Pas73], and others led to the following much more general semicircle law.

Theorem 1.2 (semicircle law, [BS10, Theorem 2.9]).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble such that the upper triangular entries of $W_{n}$ are of mean zero and variance $1/n$ . If

(1.2)

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>\epsilon]\to 0\qquad\text{for all $\epsilon>0$,}

then $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s.

Note that (1.1) for $k=3$ implies (1.2) since

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>\epsilon]\leq\frac{1}{\epsilon n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{3}\leq\frac{B_{3}}{\epsilon\sqrt{n}}\to 0.

Let us call (1.2) the Lindeberg condition, following the Lindeberg–Feller central limit theorem. Girko [Gir90, Theorem 9.4.1] states that the converse of Theorem 1.2 holds.

Rather surprisingly, we have the following:

Lemma 1.3 (a.s. convergence).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble. Then

\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}\quad\text{if and only if}\quad\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}\text{ a.s.}

A proof of this fact using a concentration-of-measure inequality is given in Appendix A. Thanks to this equivalence, we will be able to go back and forth freely between the two types of convergences throughout the paper.

Theorem 1.2 suggests an extension of the semicircle law to the case where the entries of $W_{n}$ have variances other than $1/n$ . Here is one possible approach to such an extension. Assume that the underlying probability space is the product

(\Omega_{1}\times\Omega_{2},\mathcal{F}_{1}\times\mathcal{F}_{2},\operatorname{\mathbf{P}}_{1}\times\operatorname{\mathbf{P}}_{2})

of two probability spaces $(\Omega_{1},\mathcal{F}_{1},\operatorname{\mathbf{P}}_{1})$ and $(\Omega_{2},\mathcal{F}_{2},\operatorname{\mathbf{P}}_{2})$ . Then let $X_{n}=(x_{ij})_{i,j=1}^{n}$ and $Y_{n}=(y_{ij})_{i,j=1}^{n}$ be random real symmetric matrices defined on $\Omega_{1}$ and $\Omega_{2}$ having i.i.d. upper triangular entries. If $x_{11}$ is standard normal and

\operatorname{\mathbf{P}}_{2}(y_{11}=1)=\operatorname{\mathbf{P}}_{2}(y_{11}=-1)=1/2,

then it is not difficult to show that $(W_{n})_{n\in\mathbf{N}}$ given by

w_{ij}(\omega_{1},\omega_{2}):=x_{ij}(\omega_{1})y_{ij}(\omega_{2})/\sqrt{n}

satisfies the conditions of Theorem 1.2.

Since $\mu_{W_{n}(\omega_{1},\omega_{2})}\Rightarrow\mu_{\mathrm{sc}}$ for $\operatorname{\mathbf{P}}$ -a.e. $(\omega_{1},\omega_{2})$ , Tonelli’s theorem implies that for $\operatorname{\mathbf{P}}_{1}$ -a.e. $\omega_{1}\in\Omega_{1}$ , we have $\mu_{W_{n}(\omega_{1},\cdot)}\Rightarrow\mu_{\mathrm{sc}}$ $\operatorname{\mathbf{P}}_{2}$ -a.s. Note that the $(i,j)$ -entry of the random matrix $W_{n}(\omega_{1},\cdot)$ defined on $(\Omega_{2},\mathcal{F}_{2},\mathcal{P}_{2})$ has variance $x_{ij}(\omega_{1})^{2}/n,$ which can deviate by any amount from $1/n$ .

A problem with this approach is that we do not know for which $\omega_{1}$ we have the a.s. convergence $\mu_{W_{n}(\omega_{1},\cdot)}\Rightarrow\mu_{\mathrm{sc}}$ , even though we know this happens for almost all $\omega_{1}$ . For instance, the above discussion does not tell us whether $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. is true when $(W_{n})_{n\in\mathbf{N}}$ is a symmetric Wigner ensemble such that

\operatorname{\mathbf{P}}(w_{ij}=\sqrt{2/n})=\operatorname{\mathbf{P}}(w_{ij}=-\sqrt{2/n})=1/2\qquad\text{if $i+j$ is even}

and $w_{ij}=0$ if $i+j$ is odd.

Götze, Naumov, and Tikhomirov [GNT15] covered this case by proving the following:

Theorem 1.4 (semicircle law, [GNT15, Corollary 1]).

Let $(W_{n})_{n\in\mathbf{N}}$ be a symmetric Wigner ensemble such that $\operatorname{\mathbf{E}}w_{ij}=0$ and $\operatorname{\mathbf{E}}|w_{ij}|^{2}<\infty$ for $i,j=1,\ldots,n$ . If the Lindeberg condition (1.2) holds, and

(1.3)

\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}\to 0,

and

(1.4)

\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\leq C\qquad\text{for some $C<\infty$ independent of $n$ and $i$,}

then $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s.

From our main result (Theorem 1.6) it will follow that (1.4) is not needed in Theorem 1.4, and that $(W_{n})_{n\in\mathbf{N}}$ can be assumed to be Hermitian, not necessarily real symmetric.

To illustrate that (1.3) is needed in Theorem 1.4, the authors of [GNT15] considered the random symmetric block matrix

W_{n}=\begin{pmatrix}A&B\\ B^{T}&D\end{pmatrix}

where $A$ and $D$ are of size $\lfloor{n/2}\rfloor\times\lfloor{n/2}\rfloor$ and $\lceil{n/2}\rceil\times\lceil{n/2}\rceil$ , and the upper triangular entries of $W_{n}$ are independent. They let all entries of $W_{n}$ except the non-diagonal entries of $D$ be normal with mean $0$ and variance $1/n$ , and simulated the spectrum of $W_{n}$ for $n=2000$ to see that $\mu_{W_{n}}$ does not look like a semicircle. Note that (1.3) does not hold.

Our main theorem will let us prove what was suggested by the simulation in [GNT15], namely that $\operatorname{\mathbf{E}}\mu_{W_{n}}\not\Rightarrow\mu_{\mathrm{sc}}$ . More generally, we will prove for a large class of Hermitian Wigner ensembles $(W_{n})_{n\in\mathbf{N}}$ that $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ (or $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s., equivalently) holds if and only if (1.3) is true.

One thing we should notice is that changing $o(n)$ rows of $W_{n}$ has no effect on the limit of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ due to the following:

Lemma 1.5 (rank inequality).

Let $A$ and $B$ be $n\times n$ Hermitian matrices. If $F_{A}$ and $F_{B}$ are the distribution functions of $\mu_{A}$ and $\mu_{B}$ (defined in the same way as $\mu_{W_{n}}$ ), then

\sup_{x\in\mathbf{R}}\|F_{A}(x)-F_{B}(x)\|\leq\frac{\operatorname{rank}(A-B)}{n}.

Proof.

See [BS10, Theorem A.43]. ∎

We want to say that for certain Hermitian Wigner ensembles $(W_{n})_{n\in\mathbf{N}}$ with $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ , we have (1.3). However, without further restriction on $(W_{n})_{n\in\mathbf{N}}$ , we can always change $o(n)$ rows and columns of $W_{n}$ so that (1.3) becomes false, while leaving the limiting distribution of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ unchanged. To avoid this problem, we assume that

(1.5)

\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\to 0\qquad\text{for any $J_{n}\subset\{1,\ldots,n\}$ with $|J_{n}|/n\to 0$.}

Notice that this condition is weaker than (1.4). If $W_{n}$ satisfies (1.3) and (1.5), and we change $o(n)$ rows and columns of it to obtain Hermitian $W_{n}^{\prime}$ which also satisfies (1.5), then $W_{n}^{\prime}$ also satisfies (1.3). The following is our first main theorem:

Theorem 1.6 (characterization of semicircle convergence).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble with $\operatorname{\mathbf{E}}w_{ij}=0$ and $\operatorname{\mathbf{E}}|w_{ij}|^{2}<\infty$ for $i,j=1,\ldots,n$ satisfying (1.5) and the Lindeberg condition (1.2). Then $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ if and only if (1.3) holds.

Remark 1.7.

Since (1.3) implies

\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\leq\frac{1}{n}\sum_{i\in J_{n}}\biggl{(}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}+1\biggr{)}\to 0

for any $J_{n}\subset\{1,\ldots,n\}$ with $|J_{n}|/n\to 0$ , the sufficiency direction of Theorem 1.6 does not require (1.5). This proves the claim right after Theorem 1.4.

By Lindeberg’s universality scheme [GNT15, Theorem 2] for random matrices, it follows that we can remove the condition (1.4) (which is (5) in [GNT15]) from the semicircle law [GNT15, Theorem 1] for certain random symmetric matrices with dependent upper triangular entries.

We can actually go beyond Theorem 1.6 and allow the entries of $W_{n}$ to have infinite variances, for example when $w_{ij}=c_{ij}x_{ij}/\sqrt{n\log n}$ where $x_{ij}$ has a density

f(x)=\begin{cases}1/|x|^{3}&\text{if $|x|>1$}\\ 0&\text{if $|x|\leq 1$}\end{cases}

and $c_{ij}$ is a real number close to $1$ . To achieve this, instead of $\operatorname{\mathbf{E}}w_{ij}=0$ and the Lindeberg condition (1.2), we assume

(1.6)

\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq 1]\bigr{)}^{2}\to 0

and

(1.7)

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\epsilon)\to 0\qquad\text{for all $\epsilon>0$.}

If $\operatorname{\mathbf{E}}w_{ij}=0$ and (1.2) hold, then (1.6) follows due to

\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq 1]\bigr{)}^{2}=\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|>1]\bigr{)}^{2}\leq\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>1],

and (1.7) follows by

\operatorname{\mathbf{P}}(|w_{ij}|>\epsilon)\leq\epsilon^{-2}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>\epsilon].

Finally, (1.5) is replaced by

(1.8)

\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]\to 0\\ \text{for any $J_{n}\subset\{1,\ldots,n\}$ with $|J_{n}|/n\to 0$.}

The following is our second main theorem:

Theorem 1.8 (characterization, general version).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble satisfying (1.6), (1.7), and (1.8). Then $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ if and only if

(1.9)

\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]-1\Bigr{|}\to 0.

Remark 1.9.

(1)

As in Theorem 1.6, the sufficiency direction of Theorem 1.8 does not require 1.8. (See Remark 1.7.)
(2)

Theorem 1.6 follows easily from Theorem 1.8. See Lemma B.1 in the appendix.
(3)

In case the entries of $W_{n}$ are real, the sufficiency part of Theorem 1.8 can be easily proved using Theorem 1.4 and Lemma 1.5. This is covered in Section 2.
(4)

Our full proof of the sufficiency is a careful consideration of Wigner’s moment method proof of the original semicircle law. This is arguably more elementary than the proof of Theorem 1.4 in [GNT15], which first deals with matrices with Gaussian entries using combinatorial arguments, and then generalizes the result to symmetric Wigner ensembles using Lindeberg’s universality scheme for random matrices.

The following corollary relates $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ to the convergence in distribution of the sum of a row of $W_{n}$ to the standard normal random variable. The sufficiency direction when the entries of $W_{n}$ are identically distributed was covered by [Jun18]. We denote the Lévy metric by $L$ .

Corollary 1.10 (characterization, Gaussian convergence).

Under the hypotheses of Theorem 1.8, we have $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ if and only if

(1.10)

\frac{1}{n}\sum_{i=1}^{n}L(F_{ni},G)\to 0,

where $F_{ni}$ and $G$ are the distribution functions of $\sum_{j=1}^{n}\pm|w_{ij}|$ and the standard normal random variable. The signs $\pm$ are independent Rademacher random variables independent from $W_{n}$ .

The rest of the paper is organized as follows. Section 2 is a short section that introduces Theorem 2.1. This theorem is a reduction of Theorems 1.6 and 1.8, and will ultimately imply them as shown in Appendix B. This section also proves the sufficiency part of Theorem 2.1 in the case when the entries of $W_{n}$ are real.

Section 3 is the essence of the proof of the necessity part of Theorem 2.1. We add an assumption that the sixth moment of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ is bounded, but as a return we obtain a clean proof that is right to the point. The idea is to express the second and the fourth moments of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ in terms of the variances of the entries of $W_{n}$ .

Section 4 shows that we can remove the additional assumption on the sixth moments, assuming that a certain lemma (Lemma 4.1) holds. The condition (1.5) is used in this section.

Section 5 proves Lemma 4.1 by a systematic computation of the moments of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ . The computation is a variant of Wigner’s original moment method, but it can handle the case when the entries have non-identical variances.

In Section 6, we prove the sufficiency part of Theorem 2.1 using the results of Section 5. The classical argument involving Dyck paths is discussed for completeness.

Finally in Section 7, we derive Corollary 1.10 from Theorem 1.8 by an elementary argument involving the Lindeberg–Feller central limit theorem.

Appendix A defines the mean of a random probability measure, and proves some of their properties that we need. Appendix B contains the fairly standard proof that Theorem 1.8 implies Theorem 1.6, and that Theorem 2.1 implies Theorem 1.8.

2. Proof of sufficiency for symmetric Wigner ensembles

It is enough to prove the following in order to prove Theorems 1.6 and 1.8. The justification for the reduction is fairly standard, and is covered by Lemmas B.1 and B.4 in the appendix.

Theorem 2.1 (characterization, reduced form).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble such that

(2.1)

\operatorname{\mathbf{E}}w_{ij}=0\quad\text{and}\quad|w_{ij}|\leq\epsilon_{n}\qquad\text{for all $n\in\mathbf{N}$ and $i,j=1,\ldots,n$}\\ \text{where}\quad 1\geq\epsilon_{1}\geq\epsilon_{2}\geq\cdots\quad\text{and}\quad\epsilon_{n}\to 0,

and (1.5) is true. Then $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ if and only if (1.3). Note that (1.2) is automatically satisfied due to $|w_{ij}|\leq\epsilon_{n}$ .

The following shows that we can assume (1.4) in the proof of the sufficiency part of Theorem 2.1.

Lemma 2.2.

Assume that $(W_{n})_{n\in\mathbf{N}}$ satisfies (1.3) and the conditions of Theorem 2.1. Then there exists a Hermitian Wigner ensemble $(W^{\prime}_{n})_{n\in\mathbf{N}}$ satisfying (1.3), (1.4), and the conditions of Theorem 2.1 such that $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if and only if $\mu_{W^{\prime}_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s.

Proof.

If we let $J_{n}$ be the set of all $i\in\{1,\ldots,n\}$ such that $\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}>2$ , then

\frac{|J_{n}|}{n}\leq\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}

where the right side goes to $0$ by (1.3). Let $W^{\prime}_{n}=(w_{ij}^{\prime})_{i,j=1}^{n}$ be given by $w_{ij}^{\prime}=w_{ij}$ if $i,j\not\in J_{n}$ and $w_{ij}^{\prime}=0$ otherwise.

Then,

\begin{split}\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}^{\prime}|^{2}-1\Bigr{|}&\leq\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}+\frac{2}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\\ &\to 0\end{split}

where the right side goes to $0$ by (1.3) and (1.5). Notice that $(W_{n}^{\prime})_{n\in\mathbf{N}}$ satisfies (1.4) for $C=2$ . The conditions of Theorem 2.1 can be easily shown for $(W_{n}^{\prime})_{n\in\mathbf{N}}$ .

Since

\frac{\operatorname{rank}(W_{n}-W^{\prime}_{n})}{n}\leq\frac{2|J_{n}|}{n}\to 0,

we have $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if and only if $\mu_{W^{\prime}_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. by Lemma 1.5. ∎

If $W_{n}$ is real symmetric, then the sufficiency part of Theorem 2.1 follows from Lemma 2.2 and Theorem 1.4. In Section 6, we will present a direct proof of the sufficiency that applies to Hermitian Wigner ensembles and does not depend on the result of [GNT15].

3. Proof of necessity under bounded sixth moments

Assume (2.1) throughout this section. In this section, we present a relatively simple proof of necessity in Theorem 2.1 under the following additional assumption:

(3.1)

\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n}}<\infty.

The number $6$ comes out just because it is an even number greater than $4$ . Our proof is based on an examination of the second and the fourth moments of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ . If $\lambda_{1},\ldots,\lambda_{n}$ are the eigenvalues of $W_{n}$ , then for each $k\in\mathbf{N}$ we have

\int_{\mathbf{R}}x^{k}\,\mu_{W_{n}}(dx)=\frac{1}{n}\sum_{i=1}^{n}\lambda_{i}^{k}=\frac{1}{n}\operatorname{tr}W_{n}^{k},

and thus

(3.2)

\int_{\mathbf{R}}x^{k}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)=\frac{1}{n}\operatorname{\mathbf{E}}\operatorname{tr}W_{n}^{k}.

(See Lemma A.1.)

The second moment of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ can be easily expressed in terms of the variances of $w_{ij}$ .

Lemma 3.1 (computation of the second moment).

We have

\int_{\mathbf{R}}x^{2}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)=\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}.

Proof.

It follows from (3.2) and

\operatorname{tr}W_{n}^{2}=\sum_{i,j=1}^{n}|w_{ij}|^{2}.\qed

Computing the fourth moment requires more effort, but is still tractable.

Lemma 3.2 (computation of the fourth moment).

(3.3)

\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{2}\,\operatorname{\mathbf{E}}\mu_{W_{n}}<\infty,

then

\int_{\mathbf{R}}x^{4}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)-\frac{2}{n}\sum_{i=1}^{n}\biggl{(}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\biggr{)}^{2}\to 0.

We remark that (1.5) implies (3.3). The proof for that is similar to that of Lemma 4.2 below.

Proof.

Note that

\int_{\mathbf{R}}x^{4}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)=\frac{1}{n}\operatorname{\mathbf{E}}\operatorname{tr}W_{n}^{4}=\frac{1}{n}\sum_{i_{1},i_{2},i_{3},i_{4}=1}^{n}\operatorname{\mathbf{E}}[w_{i_{1}i_{2}}w_{i_{2}i_{3}}w_{i_{3}i_{4}}w_{i_{4}i_{1}}].

Since the upper triangular entries of $W_{n}$ are independent and have mean zero, in order for $\operatorname{\mathbf{E}}[w_{i_{1}i_{2}}w_{i_{2}i_{3}}w_{i_{3}i_{4}}w_{i_{4}i_{1}}]$ not to vanish,

\{i_{1},i_{2}\},\{i_{2},i_{3}\},\{i_{3},i_{4}\},\{i_{4},i_{1}\}

should either be all the same, or be partitioned into two groups, where each group consists of two identical sets. This implies either $i_{1}=i_{3}$ or $i_{2}=i_{4}$ or both. Notice that, for instance, $i_{1}=i_{2}\neq i_{3}=i_{4}$ cannot happen because $\{i_{1},i_{2}\}$ would then appear only once among $\{i_{1},i_{2}\}$ , $\{i_{2},i_{3}\}$ , $\{i_{3},i_{4}\}$ , and $\{i_{4},i_{1}\}$ .

Thus, the sum on the right side equals

\sum_{i,j,k=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2}|w_{ik}|^{2}]+\sum_{i,j,k=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2}|w_{jk}|^{2}]-\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{4},

where the last term corresponds to the case where both $i_{1}=i_{3}$ and $i_{2}=i_{4}$ are true. Since the first and the second sum both equal

\sum_{i=1}^{n}\biggl{(}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\biggr{)}^{2}+\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}|w_{ij}|^{4}-(\operatorname{\mathbf{E}}|w_{ij}|^{2})^{2}\bigr{)},

we have

\int_{\mathbf{R}}x^{4}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)=\frac{2}{n}\sum_{i=1}^{n}\biggl{(}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\biggr{)}^{2}+\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}|w_{ij}|^{4}-2(\operatorname{\mathbf{E}}|w_{ij}|^{2})^{2}\bigr{)}.

Since both $\operatorname{\mathbf{E}}|w_{ij}|^{4}$ and $(\operatorname{\mathbf{E}}|w_{ij}|^{2})^{2}$ are bounded above by $\epsilon_{n}^{2}\operatorname{\mathbf{E}}|w_{ij}|^{2}$ , we have

\biggl{|}\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}|w_{ij}|^{4}-2(\operatorname{\mathbf{E}}|w_{ij}|^{2})^{2}\bigr{)}\biggr{|}\leq\frac{3\epsilon_{n}^{2}}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\to 0

by (3.3) and Lemma 3.1. ∎

Now we are ready to prove the necessity part of Theorem 2.1 assuming (3.1). Assume $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ . By Skorokhod’s representation theorem [Bil12, Theorem 25.6], we can take real-valued random variables $X,X_{1},X_{2},\ldots$ on a common probability space such that $\operatorname{\mathbf{E}}\mu_{W_{n}}$ is the distribution of $X_{n}$ , $\mu_{\mathrm{sc}}$ is the distribution of $X$ , and $X_{n}\to X$ a.s.

Since

(3.4)

\sup_{n\in\mathbf{N}}\operatorname{\mathbf{E}}X_{n}^{6}=\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)<\infty,

$(X_{n}^{2})_{n\in\mathbf{N}}$ and $(X_{n}^{4})_{n\in\mathbf{N}}$ are uniformly integrable. Thus, $X_{n}\to X$ a.s. implies $\operatorname{\mathbf{E}}X_{n}^{2}\to\operatorname{\mathbf{E}}X^{2}$ and $\operatorname{\mathbf{E}}X_{n}^{4}\to\operatorname{\mathbf{E}}X^{4}$ . By Lemma 3.1 and Lemma 3.2, we have

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\to\operatorname{\mathbf{E}}X^{2}=\int_{\mathbf{R}}x^{2}\,\mu_{\mathrm{sc}}(dx)=1

and

\frac{2}{n}\sum_{i=1}^{n}\biggl{(}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\biggr{)}^{2}\to\operatorname{\mathbf{E}}X^{4}=\int_{\mathbf{R}}x^{4}\,\mu_{\mathrm{sc}}(dx)=2.

See [AGZ10, 2.1.1] for a computation of the moments of $\mu_{\mathrm{sc}}$ .

Here is the punchline: using $(\operatorname{\mathbf{E}}Y)^{2}\leq\operatorname{\mathbf{E}}Y^{2}$ in the first line, and then applying the two convergences we have just established, we have

\begin{split}\biggl{(}\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}\biggr{)}^{2}&\leq\frac{1}{n}\sum_{i=1}^{n}\biggl{(}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\biggr{)}^{2}\\ &=\frac{1}{n}\sum_{i=1}^{n}\biggl{(}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\biggr{)}^{2}-\frac{2}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}+1\\ &\to 1-2+1=0.\end{split}

Thus, the necessity in Theorem 2.1 is proved under the assumption (3.1).

4. Lifting the bounded sixth moment condition

In this section, we prove the necessity part of Theorem 2.1 without assuming the bounded sixth moment condition (3.1). We rely on the following lemma, which will be proved in the next section.

Lemma 4.1 (bounded eighth moments).

If (1.4) and (2.1) hold, then

\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{8}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)<\infty.

The number $8$ is here just because it is even and greater than $6$ . In fact, our proof easily extends to any even natural number. Given $K_{n}\subset\{1,\ldots,n\}$ for all $n\in\mathbf{N}$ , let $W_{n}^{K}$ be the matrix obtained from $W_{n}$ by replacing $w_{ij}$ with $0$ for all $(i,j)\not\in K_{n}\times K_{n}$ .

Lemma 4.2.

Assume (1.5). For any $\epsilon>0$ , there exist $K_{n}\subset\{1,\ldots,n\}$ with $|K_{n}|\geq(1-\epsilon)n$ such that

\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\leq C\qquad\text{for all $n\in\mathbf{N}$ and $i\in K_{n}$}

for some $C<\infty$ .

Proof.

We may assume $\epsilon<1$ . Suppose that the claim is false, and let $K_{n}$ be the set of size $\lceil{(1-\epsilon)n}\rceil$ consisting of $i\in\{1,\ldots,n\}$ with $\lceil{(1-\epsilon)n}\rceil$ smallest $\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}$ . Notice that

\min_{i\in K_{n}^{c}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\geq\max_{i\in K_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\to\infty

along some subsequence $(n_{k})_{k\in\mathbf{N}}$ , where $K_{n}^{c}:=\{1,\ldots,n\}\setminus K_{n}$ . For all $n=n_{1},n_{2},\ldots$ such that the left side in the previous display is at least $1$ , let $J_{n}$ be any subset of $K_{n}^{c}$ such that

|J_{n}|=\biggl{\lceil}\frac{\epsilon n}{\min_{i\in K_{n}^{c}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}}\biggr{\rceil}.

Then $|J_{n_{k}}|/n_{k}\to 0$ follows from $\lceil{x}\rceil\leq x+1$ . However,

\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\geq\frac{|J_{n}|}{n}\min_{i\in K_{n}^{c}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}\geq\epsilon

for $n=n_{1},n_{2},\ldots$ for which $J_{n}$ is defined. If we let $J_{n}:=\emptyset$ for all $n$ for which $J_{n}$ is undefined, then $J_{n}$ contradicts (1.5). ∎

Given $K_{n}\subset\{1,\ldots,n\}$ for each $n\in\mathbf{N}$ , let $W_{n}^{K}$ be the matrix obtained from $W_{n}$ by replacing $w_{ij}$ with $0$ for all $(i,j)\not\in K_{n}\times K_{n}$ .

Lemma 4.3.

If (1.5) and (2.1) hold, then there are $K_{n}\subset\{1,\ldots,n\}$ with $|K_{n}|/n\to 1$ such that

(4.1)

\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n}^{K}}(dx)<\infty.

Proof.

Let $\epsilon\in(0,1)$ , and $K_{n}$ and $C$ be as in the preceding lemma. Suppose that

\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n_{k}}^{K}}(dx)\to c>3^{6}\qquad\text{as $k\to\infty$}

for some $n_{1}<n_{2}<\cdots$ . By Lemmas 4.1 and 4.2,

\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{8}\,\operatorname{\mathbf{E}}\mu_{W_{n}^{K}}(dx)<\infty.

This implies that $(\operatorname{\mathbf{E}}\mu_{W_{n_{k}}^{K}})_{k\in\mathbf{N}}$ is tight, thus it has a subsequence weakly convergent to some $\mu$ , which we still, by abuse of notation, denote by $(\operatorname{\mathbf{E}}\mu_{W_{n_{k}}^{K}})_{k\in\mathbf{N}}$ . By Skorokhod’s theorem and the uniform integrability argument that followed (3.4), we have

\int_{\mathbf{R}}x^{6}\,\mu(dx)=\lim_{k\to\infty}\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n_{k}}^{K}}(dx)>3^{6},

and thus $\mu([-3,3]^{c})>0$ .

If $W_{n}^{K}$ has $k$ eigenvalues outside $[-3,3]$ , then the Cauchy interlacing law [Tao12, Exercise 1.3.14] implies that $W_{n}$ has at least $k$ eigenvalues outside $[-3,3]$ . Thus

\mu_{W_{n}}([-3,3]^{c})\geq(1-\epsilon)\mu_{W_{n}^{K}}([-3,3]^{c}),

and therefore

\operatorname{\mathbf{E}}\mu_{W_{n}}([-3,3]^{c})\geq(1-\epsilon)\operatorname{\mathbf{E}}\mu_{W_{n}^{K}}([-3,3]^{c})

by Lemma A.1. Since $\operatorname{\mathbf{E}}\mu_{W_{n_{k}}}\Rightarrow\mu_{\mathrm{sc}}$ , the portmanteau theorem implies

\begin{split}\mu_{\mathrm{sc}}((-3,3)^{c})&\geq\limsup_{k\to\infty}\operatorname{\mathbf{E}}\mu_{W_{n_{k}}}((-3,3)^{c})\\ &\geq(1-\epsilon)\limsup_{k\to\infty}\operatorname{\mathbf{E}}\mu_{W_{n_{k}}^{K}}((-3,3)^{c})\\ &\geq(1-\epsilon)\liminf_{k\to\infty}\operatorname{\mathbf{E}}\mu_{W_{n_{k}}^{K}}([-3,3]^{c})\\ &\geq(1-\epsilon)\mu([-3,3]^{c})>0,\end{split}

but this contradicts the fact that $\mu_{\mathrm{sc}}$ is supported on $[-2,2]$ . Thus, we have

\limsup_{n\to\infty}\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n}^{K}}(dx)\leq 3^{6}.

Since $\epsilon>0$ is arbitrary, we have actually proved that for each $\epsilon>0$ we can choose $K_{n}^{\epsilon}\subset\{1,\ldots,n\}$ such that $|K^{\epsilon}_{n}|\geq(1-\epsilon)n$ and

\limsup_{n\to\infty}\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n}^{K^{\epsilon}}}(dx)\leq 3^{6}.

Choose positive integers $m_{1}<m_{2}<\cdots$ so that

\int_{\mathbf{R}}x^{6}\,\operatorname{\mathbf{E}}\mu_{W_{n}^{K^{1/k}}}\leq 3^{6}+1\qquad\text{for all $n\geq m_{k}$,}

and let $K_{n}:=K_{n}^{1/k}$ for $n=m_{k},\ldots,m_{k+1}-1$ and $K_{n}:=\emptyset$ for $n=1,\ldots,m_{1}-1$ . (We are redefining $K_{n}$ by abuse of notation.) Then we have $|K_{n}|/n\to 1$ and (4.1). ∎

We are ready to prove the necessity part of Theorem 2.1. Assume (1.5), (2.1), and $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ . By Lemma 1.3, we have $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. If $K_{1},K_{2},\ldots$ are as in Lemma 4.3, then

\frac{\operatorname{rank}(W_{n}-W_{n}^{K})}{n}\leq\frac{2(n-|K_{n}|)}{n}\to 0,

and thus $\mu_{W_{n}^{K}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. by Lemma 1.5. By another application of Lemma 1.3, $\operatorname{\mathbf{E}}\mu_{W_{n}^{K}}\Rightarrow\mu_{\mathrm{sc}}$ . As we have (4.1), the previous section tells us that

\frac{1}{n}\sum_{i\in K_{n}}\Bigl{|}\sum_{j\in K_{n}}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}\to 0.

Since $|K_{n}^{c}|/n\to 0$ , the assumption (1.5) implies (1.3). Thus, the necessity part of Theorem 2.1 is proved assuming that Lemma 4.1 holds.

5. Computation of moments

The goal of this section is to prove Lemma 4.1 and also establish some arguments needed in the next section. We use a variant of Wigner’s original moment method that can handle entries with non-identical variances. Those that are very familiar with these arguments may want to jump ahead to the proof of Lemma 4.1.

Assume (1.4) and (2.1) throughout this section. Recall that

(3.2)

\int_{\mathbf{R}}x^{k}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)=\frac{1}{n}\operatorname{\mathbf{E}}\operatorname{tr}W_{n}^{k}

for all $k\in\mathbf{N}$ . In this section, we compute the asymptotics of $n^{-1}\operatorname{\mathbf{E}}\operatorname{tr}W_{n}^{k}$ as $n\to\infty$ .

Fix $k\in\mathbf{N}$ . The boldface lower case letters $\mathbf{i},\mathbf{j},\ldots$ will denote $(i_{0},\ldots,i_{k})$ , $(j_{0},\ldots,j_{k})$ , and so on. Let us call a $(k+1)$ -tuple $\mathbf{i}$ with $i_{0}=i_{k}$ a closed walk of length $k$ . For any closed walk $\mathbf{i}$ with $i_{0},\ldots,i_{k}\in\{1,\ldots,n\}$ , let

w_{\mathbf{i}}:=\prod_{s=1}^{k}w_{i_{s-1}i_{s}}.

Notice that

\frac{1}{n}\operatorname{\mathbf{E}}\operatorname{tr}W_{n}^{k}=\frac{1}{n}\sum_{\mathbf{i}}\operatorname{\mathbf{E}}w_{\mathbf{i}}

where $\mathbf{i}$ ranges over all closed walks $\mathbf{i}$ (of length $k$ ) with $i_{0},\ldots,i_{k}\in\{1,\ldots,n\}$ .

Now we gather together the closed walks which have the same “shape.” Let us say that two closed walks $\mathbf{i}$ and $\mathbf{j}$ are isomorphic if for any $s,t=0,\ldots,k$ we have $i_{s}=i_{t}$ if and only if $j_{s}=j_{t}$ . A canonical closed walk of length $k$ on $t\in\mathbf{N}$ vertices is a closed walk $\mathbf{c}$ such that

(1)

$c_{0}=c_{k}=1$ ,
(2)

$\{c_{0},\ldots,c_{k}\}=\{1,\ldots,t\}$ , and
(3)

$c_{i}\leq\max\{c_{0},\ldots,c_{i-1}\}+1$ for each $i=1,\ldots,k$ .

Let $\gamma(k,t)$ denote the set of such walks. It is straightforward to show that any closed walk is isomorphic to exactly one canonical closed walk. For each $\mathbf{c}\in\gamma(k,t)$ , let $L(n,\mathbf{c})$ denote the set of all closed walks $\mathbf{i}$ with $i_{0},\ldots,i_{k}\in\{1,\ldots,n\}$ which are isomorphic to $\mathbf{c}$ . Then we have

(5.1)

\frac{1}{n}\operatorname{\mathbf{E}}\operatorname{tr}W_{n}^{k}=\frac{1}{n}\sum_{i_{0},\ldots,i_{k}=1}^{n}\operatorname{\mathbf{E}}w_{\mathbf{i}}=\frac{1}{n}\sum_{t=1}^{k}\sum_{\mathbf{c}\in\gamma(k,t)}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}},

where the upper bound of $t$ is (rather arbitrarily) set to $k$ since $\gamma(k,t)$ is empty for all $t>k$ .

We will fix $t\in\mathbf{N}$ and $\mathbf{c}\in\gamma(k,t)$ , and compute $n^{-1}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}}$ . As a first step, we get an easy case out of the way.

Lemma 5.1 (zeroed out terms).

If $\mathbf{c}$ crosses some edge $\{i,j\}$ exactly once, i.e., $\{c_{s-1},c_{s}\}=\{i,j\}$ for exactly one $s\in\{1,\ldots,k\}$ , then $\operatorname{\mathbf{E}}w_{\mathbf{i}}=0$ for any $n\in\mathbf{N}$ and $\mathbf{i}\in L(n,\mathbf{c})$ .

Proof.

Since we have (2.1) and the upper triangular entries of $W_{n}$ are independent, $w_{\mathbf{i}}$ is the product of $w_{ij}$ (or $w_{ji}$ ) and a bounded random variable which is independent from $w_{ij}$ . Since $\operatorname{\mathbf{E}}w_{ij}=0$ , we have $\operatorname{\mathbf{E}}w_{\mathbf{i}}=0$ . ∎

Now assume that $\mathbf{c}$ does not cross any edge exactly once, i.e., for each $s\in\{1,\ldots,k\}$ there is some $r\in\{1,\ldots,k\}$ distinct from $s$ such that $\{c_{s-1},c_{s}\}=\{c_{r-1},c_{r}\}$ . To compute $n^{-1}\sum_{i\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}}$ , we introduce some notation. Let $G(\mathbf{c})$ be the graph (possibly having loops but no multiple edges) with the vertex set

\{i_{0},\ldots,i_{k}\}

and the edge set

\bigl{\{}\{i_{t-1},i_{t}\}\bigm{|}t=1,\ldots,k\bigr{\}}.

For a tree $T$ , let $V(T)$ and $E(T)$ denote the vertex set and the edge set of $T$ . Given a finite tree $T$ and $n\in\mathbf{N}$ , we let $I(T,n)$ denote the set of injections from the vertex set $V(T)$ of $T$ to $\{1,\ldots,n\}$ . For each $F\in I(T,n)$ , let us write

\Pi(F):=\prod_{e\in E(T)}\operatorname{\mathbf{E}}|w_{F(x_{e})F(y_{e})}|^{2}

where $x_{e},y_{e}\in V(T)$ denote the endpoints of $e\in E(T)$ . We are omitting the dependence of $\Pi(F)$ on $T$ , but there should be no confusion. Note that the value $\Pi(F)$ is well-defined because $W_{n}$ is Hermitian.

Now we get back to the problem of computing $n^{-1}\sum_{i\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}}$ . As each edge of $G(\mathbf{c})$ is crossed at least twice by $\mathbf{c}$ , there are at most $k/2$ edges in $G(\mathbf{c})$ . As $G(\mathbf{c})$ is a connected graph with $t$ vertices, we have $t\leq k/2+1$ , and $G(\mathbf{c})$ has a spanning tree $S$ with $t-1$ edges. Choose some $S$ . For each $\mathbf{i}\in L(n,\mathbf{c})$ , consider the injection $F_{\mathbf{i}}\colon\{1,\ldots,t\}\to\{1,\ldots,n\}$ given by $F_{\mathbf{i}}(c_{s}):=i_{s}$ for each $s=0,\ldots,k$ .

First assume $t=k/2+1$ . Since $S$ has $k/2$ edges and each edge of $S$ is crossed twice by $\mathbf{c}$ , we have $S=G(\mathbf{c})$ . As each edge of $G(\mathbf{c})$ is traversed exactly once in each direction, and the map $L(n,\mathbf{c})\to I(S,n)$ given by $\mathbf{i}\mapsto F_{\mathbf{i}}$ is a bijection, we have

(5.2)

\frac{1}{n}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}}=\frac{1}{n}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\Pi(F_{\mathbf{i}})=\frac{1}{n}\sum_{F\in I(S,n)}\Pi(F).

Now assume $t<k/2+1$ . By $|w_{ij}|\leq\epsilon_{n}$ and the fact that $\mathbf{c}$ crosses any edge of $G(\mathbf{c})$ at least twice, we have

|\operatorname{\mathbf{E}}w_{\mathbf{i}}|\leq\epsilon_{n}^{k-2(t-1)}\Pi(F_{\mathbf{i}}).

Note that $\epsilon_{n}^{k-2(t-1)}\to 0$ since $t<k/2+1$ . By using the bijection $L(n,\mathbf{c})\to I(S,n)$ again, we have

(5.3)

\begin{split}\frac{1}{n}\sum_{\mathbf{i}\in L(n,\mathbf{c})}|\operatorname{\mathbf{E}}w_{\mathbf{i}}|&\leq\frac{\epsilon_{n}^{k-2(t-1)}}{n}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\Pi(F_{\mathbf{i}})\\ &=\frac{\epsilon_{n}^{k-2(t-1)}}{n}\sum_{F\in I(S,n)}\Pi(F).\end{split}

The right side tends to $0$ by the following.

Lemma 5.2 (contribution of a tree).

If $T$ is a finite tree with $m$ edges, $x\in V(T)$ , $n\in\mathbf{N}$ , and $i\in\{1,\ldots,n\}$ , then

(5.4)

\sum_{\begin{subarray}{c}F\in I(T,n)\\ F(x)=i\end{subarray}}\Pi(F)\leq C^{m}

where $C$ is as in (1.4). Note that it follows that

\frac{1}{n}\sum_{F\in I(T,n)}\Pi(F)\leq C^{m}.

Proof.

There is nothing to prove if $m=0$ . To proceed by induction, assume that (5.4) holds, and let $T$ be a tree with $m+1$ edges. Pick any leaf $y\in V(T)$ distinct from $x$ , and let $z\in V(T)$ be the only vertex that is adjacent to $y$ . Since

\begin{split}\sum_{\begin{subarray}{c}F\in I(T,n)\\ F(x)=i,F(z)=j\end{subarray}}\Pi(F)&\leq\sum_{\begin{subarray}{c}H\in I(T\setminus y,n)\\ H(x)=i,H(z)=j\end{subarray}}\biggl{(}\Pi(H)\sum_{l=1}^{n}\operatorname{\mathbf{E}}|w_{jl}|^{2}\biggr{)}\\ &\leq C\sum_{\begin{subarray}{c}H\in I(T\setminus y,n)\\ H(x)=i,H(z)=j\end{subarray}}\Pi(H)\end{split}

by (1.4) for all $j=1,\ldots,n$ , we have

\begin{split}\sum_{\begin{subarray}{c}F\in I(T,n)\\ F(x)=i\end{subarray}}\Pi(F)&=\sum_{j=1}^{n}\sum_{\begin{subarray}{c}F\in I(T,n)\\ F(x)=i,F(z)=j\end{subarray}}\Pi(F)\\ &\leq C\sum_{j=1}^{n}\sum_{\begin{subarray}{c}H\in I(T\setminus y,n)\\ H(x)=i,H(z)=j\end{subarray}}\Pi(H)=C\sum_{\begin{subarray}{c}H\in I(T\setminus y,n)\\ H(x)=i\end{subarray}}\Pi(H)\leq C^{m+1}\end{split}

by the induction hypothesis. ∎

We have now shown the following; see (5.2) and (5.3).

Lemma 5.3 (contribution of a canonical walk).

Let $t\in\mathbf{N}$ and $\mathbf{c}\in\gamma(k,t)$ . Assume that $\mathbf{c}$ does not cross any edge exactly one, i.e., for each $s\in\{1,\ldots,k\}$ there is some $r\in\{1,\ldots,k\}$ distinct from $s$ such that $\{c_{s-1},c_{s}\}=\{c_{r-1},c_{r}\}$ . Then we have $t\leq k/2+1$ . If $t<k/2+1$ , we have

\frac{1}{n}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}}\to 0.

If $t=k/2+1$ , then $G(\mathbf{c})$ is a tree, and we have

\frac{1}{n}\sum_{\mathbf{i}\in L(n,\mathbf{c})}\operatorname{\mathbf{E}}w_{\mathbf{i}}=\frac{1}{n}\sum_{F\in I(G(\mathbf{c}),n)}\Pi(F).

Combining (3.2), (5.1), Lemma 5.1, and Lemma 5.3, we obtain the following approximation to the moments of $\operatorname{\mathbf{E}}\mu_{W_{n}}$ .

Lemma 5.4 (computation of moments).

Let $\Gamma_{k}$ be the set of all $\mathbf{c}\in\gamma(k,k/2+1)$ which crosses each edge of $G(\mathbf{c})$ twice. (Note that $G(\mathbf{c})$ should be a tree, and that $\Gamma_{k}$ is finite.) Then,

\int_{\mathbf{R}}x^{k}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)-\frac{1}{n}\sum_{\mathbf{c}\in\Gamma_{k}}\sum_{F\in I(G(\mathbf{c}),n)}\Pi(F)\to 0.

We now prove Lemma 4.1 as promised.

Proof of Lemma 4.1.

Let $\Gamma_{k}$ be as in Lemma 5.4. By Lemma 5.2, we have

\frac{1}{n}\sum_{F\in I(G(\mathbf{c}),n)}\Pi(F)\leq C^{4}\qquad\text{for all $\mathbf{c}\in\Gamma_{8}$.}

Since $\Gamma_{8}$ is finite, we have

\sup_{n\in\mathbf{N}}\biggl{(}\frac{1}{n}\sum_{\mathbf{c}\in\Gamma_{8}}\sum_{F\in I(G(\mathbf{c}),n)}\Pi(F)\biggr{)}<\infty,

and thus

\sup_{n\in\mathbf{N}}\int_{\mathbf{R}}x^{8}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)<\infty

by Lemma 5.4. ∎

6. Proof of sufficiency

We continue to use the notation introduced in Section 5. On top of (1.3) and (2.1), we assume (1.4), which is possible by Lemma 2.2. We are one lemma away from proving the sufficiency part of Theorem 2.1. Our proof is essentially a manifestation of Wigner’s original idea.

Lemma 6.1 (each tree contributes one).

For any finite tree $T$ we have

(6.1)

\lim_{n\to\infty}\frac{1}{n}\sum_{F\in I(T,n)}\Pi(F)=1.

(Compare it to Lemma 5.2.)

Proof.

There is nothing to prove if $T$ has no edges. To proceed by induction, assume that (6.1) holds if $T$ has $m$ edges, and let $T$ be a tree with $m+1$ edges. Let $x\in V(T)$ be a leaf of $T$ , and $y$ be the only vertex of $T$ that is adjacent to $x$ . Note that

\begin{split}\biggl{|}\frac{1}{n}\sum_{F\in I(T,n)}\Pi(F)-\frac{1}{n}\sum_{H\in I(T\setminus x,n)}&\biggl{(}\Pi(H)\sum_{i=1}^{n}\operatorname{\mathbf{E}}|w_{H(y)i}|^{2}\biggr{)}\biggr{|}\\ &\leq\frac{(m+1)\epsilon_{n}^{2}}{n}\sum_{H\in I(T\setminus x,n)}\Pi(H)\to 0\end{split}

by $|w_{ij}|\leq\epsilon_{n}$ and Lemma 5.2 (or the induction hypothesis). Applying Lemma 5.2 once again, we have

\begin{split}\biggl{|}\frac{1}{n}\sum_{H\in I(T\setminus x,n)}&\biggl{(}\Pi(H)\sum_{i=1}^{n}\biggl{(}\operatorname{\mathbf{E}}|w_{H(y)i}|^{2}-\frac{1}{n}\biggr{)}\biggr{)}\biggr{|}\\ &=\frac{1}{n}\biggl{|}\sum_{j=1}^{n}\biggl{[}\sum_{i=1}^{n}\biggl{(}\operatorname{\mathbf{E}}|w_{ji}|^{2}-\frac{1}{n}\biggr{)}\cdot\sum_{\begin{subarray}{c}H\in I(T\setminus x,n)\\ H(y)=j\end{subarray}}\Pi(H)\biggr{]}\biggr{|}\\ &\leq\frac{C^{m}}{n}\sum_{j=1}^{n}\biggl{|}\sum_{i=1}^{n}\biggl{(}\operatorname{\mathbf{E}}|w_{ji}|^{2}-\frac{1}{n}\biggr{)}\biggr{|}\to 0\end{split}

by (1.3). The claimed result follows from the previous two displays and

\lim_{n\to\infty}\frac{1}{n}\sum_{H\in I(T\setminus x,n)}\Pi(H)=1.\qed

By Lemma 5.4 and Lemma 6.1, we have

\int_{\mathbf{R}}x^{k}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)\to|\Gamma_{k}|.

If $k$ is odd, then $|\Gamma_{k}|=0$ because $k/2+1$ is not an integer.

Assume that $k$ is even. A Dyck path of length $k$ is a finite sequence $(x_{0},\ldots,x_{k})$ satisfying

(1)

$x_{0}=x_{k}=0$ ,
(2)

$|x_{s}-x_{s-1}|=1$ for all $s=1,\ldots,k$ , and
(3)

$x_{s}\geq 0$ for all $s=0,\ldots,k$ .

Given $\mathbf{c}\in\Gamma_{k}$ , let $D(\mathbf{c}):=(x_{0},\ldots,x_{k})$ where $x_{s}$ is the distance between $c_{0}$ and $c_{s}$ in $G(\mathbf{c})$ . Then it is clear that $D(\mathbf{c})$ is indeed a Dyck path, and it is not difficult to see that $D$ is a bijection from $\Gamma_{k}$ to the set of all Dyck paths of length $k$ . It is well-known that there are exactly $\frac{1}{k/2+1}\binom{k}{k/2}$ Dyck paths of length $k$ ; see [vLW01, Example 14.8]. Thus, we have $|\Gamma_{k}|=\frac{1}{k/2+1}\binom{k}{k/2}$ .

A direct computation (see [AGZ10, 2.1.1]) yields

\int_{\mathbf{R}}x^{k}\,\mu_{\mathrm{sc}}(dx)=\frac{1}{k/2+1}\binom{k}{k/2}\qquad\text{for all even $k\in\mathbf{N}$,}

where the odd moments of $\mu_{\mathrm{sc}}$ are all zero due to the symmetry. Thus,

\int_{\mathbf{R}}x^{k}\,\operatorname{\mathbf{E}}\mu_{W_{n}}(dx)\to\int_{\mathbf{R}}x^{k}\,\mu_{\mathrm{sc}}(dx)\qquad\text{for all $k\in\mathbf{N}$.}

Since

\biggl{|}\sum_{k=1}^{\infty}\frac{1}{k!}\int_{\mathbf{R}}x^{k}\,\mu_{\mathrm{sc}}(dx)\,r^{k}\biggr{|}\leq\sum_{k=1}^{\infty}\frac{|2r|^{k}}{k!}<\infty

by the ratio test for all $r\in\mathbf{R}$ , the probability measure $\mu_{\mathrm{sc}}$ is determined by its moments by [Bil12, Theorem 30.1]. Therefore, the moment convergence theorem [Bil12, Theorem 30.2] tells us that $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ .

7. Gaussian convergence

Assume that $(W_{n})_{n\in\mathbf{N}}$ satisfies the conditions of Theorem 1.8. In this section, we prove Corollary 1.10 by showing that (1.9) and (1.10) are equivalent. We need the following two simple facts.

Lemma 7.1 (converging averages).

For each $n\in\mathbf{N}$ , let $x_{1}^{(n)},\ldots,x_{n}^{(n)}\geq 0$ . If

\frac{1}{n}\sum_{i=1}^{n}x_{i}^{(n)}\to 0,

then we can take nonempty $K_{n}\subset\{1,\ldots,n\}$ for each $n\in\mathbf{N}$ so that

\frac{|K_{n}|}{n}\to 1\qquad\text{and}\qquad\max_{i\in K_{n}}|x_{i}^{(n)}|\to 0.

Proof.

For each $\epsilon>0$ , we have

\frac{|\{i:x_{i}^{(n)}>\epsilon\}|}{n}\leq\frac{1}{n\epsilon}\sum_{i=1}^{n}x_{i}^{(n)}\to 0.

We can take positive $\epsilon_{1},\epsilon_{2},\ldots$ with $\epsilon_{n}\to 0$ such that

\frac{|\{i:x_{i}^{(n)}>\epsilon_{n}\}|}{n}\to 0.

Let $K_{n}$ be $\{i:x_{i}^{(n)}\leq\epsilon_{n}\}$ if it is nonempty, and let $K_{n}:=\{1\}$ otherwise. ∎

Lemma 7.2 (uniform convergence).

Let $(E,d)$ be a metric space, $A_{1},A_{2},\ldots\subset E$ , and $x\in E$ . Then the following are equivalent:

(1)

$x_{n}\to x$ for any choice of $x_{1}\in A_{1},x_{2}\in A_{2},\ldots\,$ .
(2)

$\sup_{y\in A_{n}}d(x,y)\to 0$ .

Proof.

We omit the easy proof. ∎

By (1.7) and Lemma 7.1, for each $\epsilon>0$ we have nonempty $K_{n}^{\epsilon}\subset\{1,\ldots,n\}$ with

\frac{|K_{n}^{\epsilon}|}{n}\to 1\qquad\text{and}\qquad\max_{i\in K_{n}^{\epsilon}}\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\epsilon)\to 0.

We can take $\epsilon_{1}\geq\epsilon_{2}\geq\cdots$ with $\epsilon_{n}\to 0$ such that

(7.1)

\frac{|K_{n}^{\epsilon_{n}}|}{n}\to 1\qquad\text{and}\qquad\max_{i\in K_{n}^{\epsilon_{n}}}\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\epsilon_{n})\to 0.

First assume (1.9). By Lemma 7.1, we have nonempty $K_{n}\subset\{1,\ldots,n\}$ such that

\frac{|K_{n}|}{n}\to 1\qquad\text{and}\qquad\max_{i\in K_{n}}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]-1\Bigr{|}\to 0.

By (7.1), we can make $K_{n}$ smaller so that

\max_{i\in K_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\epsilon)\to 0

also holds while retaining $|K_{n}|/n\to 1$ .

Let $i_{n}\in K_{n}$ for each $n\in\mathbf{N}$ . By Lemma 7.2, we have

\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{i_{n}j}|^{2};|w_{i_{n}j}|\leq 1]\to 1\qquad\text{and}\qquad\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{i_{n}j}|>\epsilon_{n})\to 0.

Since

\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{i_{n}j}|^{2};\epsilon<|w_{i_{n}j}|\leq 1]\leq\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{i_{n}j}|>\epsilon)\to 0

for all $\epsilon>0$ , the Lindeberg–Feller central limit theorem [Kal02, Theorem 5.12] implies

\sum_{j=1}^{n}\pm|w_{i_{n}j}|\operatorname{\mathbf{1}}\{|w_{i_{n}j}|\leq 1\}\Rightarrow Z

where $Z$ is standard normal. As $\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{i_{n}j}|>1)\to 0$ , it follows that $\sum_{j=1}^{n}\pm|w_{i_{n}j}|\Rightarrow Z$ .

Since $i_{1},i_{2},\ldots$ are arbitrary, Lemma 7.2 implies

\max_{i\in K_{n}}L(F_{ni_{n}},G)\to 0.

As the Lévy distance is bounded above by $1$ , we conclude that

\frac{1}{n}\sum_{i=1}^{n}L(F_{ni},G)\leq\max_{i\in K_{n}}L(F_{ni_{n}},G)+\frac{n-|K_{n}|}{n}\to 0.

Now we assume (1.10). By Lemma 7.1 and (7.1), we have nonempty $K_{n}\subset\{1,\ldots,n\}$ with $|K_{n}|/n\to 1$ ,

\max_{i\in K_{n}}L(F_{ni},G)\to 0,\qquad\text{and}\qquad\max_{i\in K_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\epsilon_{n})\to 0.

Let $i_{n}\in K_{n}$ for each $n\in\mathbf{N}$ . By Lemma 7.2, we have

\sum_{j=1}^{n}\pm|w_{i_{n}j}|\Rightarrow Z\qquad\text{and}\qquad\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{i_{n}j}|>\epsilon_{n})\to 0.

Since $\sum_{j=1}^{n}\operatorname{\mathbf{P}}(|w_{i_{n}j}|>1)\to 0$ , we have

(7.2)

\sum_{j=1}^{n}\pm|w_{i_{n}j}|\operatorname{\mathbf{1}}\{|w_{i_{n}j}|\leq 1\}\Rightarrow Z.

Let

c_{n}:=\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{i_{n}j}|^{2};|w_{i_{n}j}|\leq 1].

If $c_{n}\to 0$ along some subsequence, then

\operatorname{\mathbf{E}}\Bigl{(}\sum_{j=1}^{n}\pm|w_{i_{n}j}|\operatorname{\mathbf{1}}\{|w_{i_{n}j}|\leq 1\}\Bigr{)}^{2}\to 0

along that subsequence, but it contradicts (7.2). If $c_{n}\to c\in(0,\infty]$ along some subsequence, then

\frac{1}{\sqrt{c_{n}}}\sum_{j=1}^{n}\pm|w_{i_{n}j}|\operatorname{\mathbf{1}}\{|w_{i_{n}j}|\leq 1\}\Rightarrow Z

along that subsequence by the Lindeberg–Feller central limit theorem, and so $c=1$ . Thus, we have $c_{n}\to 1$ .

Since $i_{1},i_{2},\ldots$ are arbitrary, Lemma 7.2 and (1.8) imply

\begin{split}\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2}&;|w_{ij}|\leq 1]-1\Bigr{|}\\ &\leq\max_{i\in K_{n}}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]-1\Bigr{|}\\ &\quad+\frac{1}{n}\sum_{i\in K_{n}^{c}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]+\frac{n-|K_{n}|}{n}\to 0.\end{split}

Appendix A Mean probability measures

In this section, we clarify what we mean by $\operatorname{\mathbf{E}}\mu_{W_{n}}$ , and prove that $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ is equivalent to $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if $(W_{n})_{n\in\mathbf{N}}$ is a Hermitian Wigner ensemble.

Let $\Pr(\mathbf{R})$ be the set of all Borel probability measures on $\mathbf{R}$ . Equip $\Pr(\mathbf{R})$ with the smallest $\sigma$ -field that makes $\Pr(\mathbf{R})\ni\mu\mapsto\mu(-\infty,x]$ measurable for all $x\in\mathbf{R}$ .

For any random element $\mu$ of $\Pr(\mathbf{R})$ , it is straightforward to show that $\mathbf{R}\ni x\mapsto\operatorname{\mathbf{E}}\mu(-\infty,x]$ is a distribution function of some Borel probability measure on $\mathbf{R}$ . Let $\operatorname{\mathbf{E}}\mu$ denote that measure. Then $\operatorname{\mathbf{E}}\mu$ has the following property:

Lemma A.1 (change of order).

Let $\mu$ be a random element of $\Pr(\mathbf{R})$ , and $f\colon\mathbf{R}\to\mathbf{R}$ be (Borel) measurable.

(1)

If $f$ is nonnegative, then $\int_{\mathbf{R}}f\,d\mu$ is measurable, and we have

(A.1) $\int_{\mathbf{R}}f\,d\operatorname{\mathbf{E}}\mu=\operatorname{\mathbf{E}}\biggl{[}\int_{\mathbf{R}}f\,d\mu\biggr{]}.$
(2)

If $\int_{\mathbf{R}}|f|\,\operatorname{\mathbf{E}}\mu<\infty$ , then $\int_{\mathbf{R}}f\,d\mu$ is a.s. finite and measurable, and we have (A.1).

Proof.

Since (2) follows immediately from (1), we will prove (1) only. As the statement of (1) holds for $f=1_{(-\infty,x]}$ for all $x\in\mathbf{R}$ , Dynkin’s $\pi$ - $\lambda$ theorem implies that the statement holds for all measurable $A\subset\mathbf{R}$ . By the simple function approximation argument, the statement extends to all nonnegative measurable $f$ . ∎

In order to talk about $\operatorname{\mathbf{E}}\mu_{W_{n}}$ , we first need to establish the measurability of $\mu_{W_{n}}$ for each $x\in\mathbf{R}$ .

Lemma A.2 (measurability).

If $W$ is a random $n\times n$ Hermitian matrix, then $\mu_{W}$ is measurable.

Proof.

Let $f(x):=\det(xI-W)$ . Given an interval $[a,b]$ where $a<b$ , the event of having an eigenvalue of $W$ with multiplicity at least $k$ (where $k\leq n$ ) in $[a,b]$ is equal to

\Bigl{\{}\inf_{q\in[a,b]\cap\mathbf{Q}}\bigl{(}(f(q))^{2}+(f^{\prime}(q))^{2}+\cdots+(f^{(k-1)}(q))^{2}\bigr{)}=0\Bigr{\}},

which is indeed measurable. Using this and by partitioning $\mathbf{R}$ into many small intervals, one can show that $\{\lambda_{j}(W)\leq x\}$ is measurable for each $j=1,\ldots,n$ and $x\in\mathbf{R}$ . Since

\mu_{W}(-\infty,x]=\frac{1}{n}\sum_{j=1}^{n}\operatorname{\mathbf{1}}\{\lambda_{j}(W)\leq x\},

$\mu_{W}$ is measurable. ∎

Now we turn to the proof of Lemma 1.3. We need the following inequality, which was found independently by Guntuboyina and Leeb [GL09], and Bordenave, Caputo, and Chafaï [BCC11].

Lemma A.3 (concentration for spectral measures).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble. If the total variation of $f\colon\mathbf{R}\to\mathbf{R}$ is less than or equal to $1$ , then

\operatorname{\mathbf{P}}\biggl{(}\Bigl{|}\int_{\mathbf{R}}f\,d\mu_{W_{n}}-\operatorname{\mathbf{E}}\int_{\mathbf{R}}f\,d\mu_{W_{n}}\Bigr{|}\geq t\biggr{)}\leq 2\exp(-nt^{2}/2).

Proof.

See [BCC11, Lemma C.2]. ∎

Proof of Lemma 1.3.

Assume $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ . For each $p,q\in\mathbf{Q}$ with $p<q$ , let $f_{p,q}\colon\mathbf{R}\to\mathbf{R}$ be $1$ on $(-\infty,p]$ , $0$ on $[q,\infty)$ , and linear and continuous on $[p,q]$ . Then

\operatorname{\mathbf{E}}\biggl{[}\int_{\mathbf{R}}f_{p,q}\,d\mu_{W_{n}}\biggr{]}\to\int_{\mathbf{R}}f_{p,q}\,d\mu_{\mathrm{sc}}.\qquad\text{for all rational $p<q$.}

By Lemma A.3 and the Borel-Cantelli lemma,

\int_{\mathbf{R}}f_{p,q}\,d\mu_{W_{n}}\to\int_{\mathbf{R}}f_{p,q}\,d\mu_{\mathrm{sc}}\qquad\text{for all rational $p<q$, a.s.}

This proves $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s.

To show the converse, assume that $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s., and let $f\colon\mathbf{R}\to\mathbf{R}$ be continuous and bounded. Since $f$ is bounded, we can apply the dominated convergence theorem to obtain

\operatorname{\mathbf{E}}\biggl{[}\int_{\mathbf{R}}f\,d\mu_{W_{n}}\biggr{]}\to\int_{\mathbf{R}}f\,d\mu_{\mathrm{sc}}.

This shows $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ . ∎

Appendix B Reductions

In this section, we prove that Theorem 1.6 follows from Theorem 1.8 (Lemma B.1), and that Theorem 1.8 follows from Theorem 2.1 (Lemma B.4).

Lemma B.1.

Theorem 1.6 follows from Theorem 1.8.

To prove this, we use the following two lemmas.

Lemma B.2 (perturbation inequality).

If $A$ and $B$ are $n\times n$ Hermitian matrices, and $F_{A}$ and $F_{B}$ are the distribution functions of $\mu_{A}$ and $\mu_{B}$ , then

\bigl{(}L(F_{A},F_{B})\bigr{)}^{3}\leq\operatorname{tr}\bigl{(}(A-B)^{2}\bigr{)}

where $L$ is the Lévy metric.

Proof.

See [BS10, Theorem A.41]. ∎

Lemma B.3.

If (1.2) holds, then the following are true:

(1)

(1.5) holds if and only if (1.8) holds.
(2)

(1.3) holds if and only if (1.9) holds.

Proof.

From (1.2) it follows that

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>1]\to 0.

(1) For any $J_{n}\subset\{1,\ldots,n\}$ with $|J_{n}|/n\to 0$ , we have

(B.1)

\biggl{|}\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]\biggr{|}\\ \leq\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>1]\to 0.

Thus, (1.5) holds if and only if (1.8) holds.

(2) Since

(B.2)

\biggl{|}\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}|^{2}-1\Bigr{|}-\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]-1\Bigr{|}\biggr{|}\\ \leq\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|>1]\to 0,

(1.3) holds if and only if (1.9) holds. ∎

Proof of Lemma B.1.

We first show the sufficiency direction of Theorem 1.6. Recall that we proved that (1.6) and (1.7) follows from (1.2) and $\operatorname{\mathbf{E}}w_{ij}=0$ . By Lemma B.3, we have (1.8) and (1.9). Thus, $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ follows from Theorem 1.8.

Now let us show the necessity. On top of (1.6) and (1.7), we have (1.8) by Lemma B.3. Thus, (1.9) follows from Theorem 1.8, and therefore we have (1.3) by Lemma B.3. ∎

Next we reduce Theorem 1.8 to Theorem 2.1.

Lemma B.4.

(1)

The sufficiency part of Theorem 2.1 implies the sufficiency part of Theorem 1.8.
(2)

The necessity part of Theorem 2.1 implies the necessity part of Theorem 1.8.

We need the following lemma.

Lemma B.5 (reduction to vanishing bounds).

Let $(W_{n})_{n\in\mathbf{N}}$ be a Hermitian Wigner ensemble satisfying (1.6) and (1.7). Then there exist $1/2\geq\eta_{1}\geq\eta_{2}\geq\cdots$ with $\eta_{n}\to 0$ such that if we let

W_{n}^{\prime}\equiv(w_{ij}^{\prime})_{i,j=1}^{n}:=\bigl{(}w_{ij}\operatorname{\mathbf{1}}\{|w_{ij}|\leq\eta_{n}\}-\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq\eta_{n}]\bigr{)}_{i,j=1}^{n},

then the following are true:

(1)

$\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ if and only if $\operatorname{\mathbf{E}}\mu_{W_{n}^{\prime}}\Rightarrow\mu_{\mathrm{sc}}$ .
(2)

(1.9) if and only if

(B.3) $\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}^{\prime}|^{2}-1\Bigr{|}\to 0.$

(3)

(1.8) if and only if

(B.4)

\frac{1}{n}\sum_{i\in J_{n}}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}^{\prime}|^{2}\to 0\quad\text{for any $J_{n}\subset\{1,\ldots,n\}$ with $|J_{n}|/n\to 0$.}

The proof of (1) will be based on the following lemma.

Lemma B.6 (Bernstein’s inequality).

Suppose that $X_{1},\ldots,X_{n}$ are independent real-valued random variables with $|X_{i}|\leq 1$ and $\operatorname{\mathbf{E}}X_{i}=0$ for $i=1,\ldots,n$ . If $S:=X_{1}+\cdots+X_{n}$ , then

\operatorname{\mathbf{P}}(S\geq x)\leq\exp\biggl{(}\frac{-x^{2}}{2(\operatorname{\mathbf{E}}S^{2}+x)}\biggr{)}\qquad\text{for all $x>0$.}

Proof.

The proof of [Bil99, M20] with a slight modification works. ∎

Proof of Lemma B.5.

Choose $1/2\geq\eta_{1}\geq\eta_{2}\cdots$ with $\eta_{n}\to 0$ such that

(B.5)

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\eta_{n})\to 0.

Proof of (1). By Lemma 1.3, it is enough to show that $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if and only if $\mu_{W_{n}^{\prime}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. Let $\widetilde{W}:=(w_{ij}\operatorname{\mathbf{1}}\{|w_{ij}|\leq\eta_{n}\})_{i,j=1}^{n}$ . We will first show that $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if and only if $\mu_{\widetilde{W}_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. Note that

\frac{\operatorname{rank}(W_{n}-\widetilde{W}_{n})}{n}\leq\frac{2}{n}\sum_{1\leq i\leq j\leq n}\operatorname{\mathbf{1}}\{|w_{ij}|>\eta_{n}\}.

By Lemma 1.5, it is enough to show that the right side tends to $0$ a.s.

Let $\epsilon>0$ be given. By (B.5), we have some $n_{0}\in\mathbf{N}$ such that

\sum_{1\leq i\leq j\leq n}\operatorname{\mathbf{P}}(|w_{ij}|>\eta_{n})\leq\epsilon n/2\qquad\text{for all $n\geq n_{0}$.}

Since $\operatorname{\mathbf{1}}\{|w_{ij}|>\eta_{n}\}$ , $1\leq i\leq j\leq n$ , are independent, Bernstein’s inequality (Lemma B.6) implies

\begin{split}\operatorname{\mathbf{P}}\biggl{(}\sum_{1\leq i\leq j\leq n}&\operatorname{\mathbf{1}}\{|w_{ij}|>\eta_{n}\}\geq\epsilon n\biggr{)}\\ &\leq\operatorname{\mathbf{P}}\biggl{(}\sum_{1\leq i\leq j\leq n}\bigl{(}\operatorname{\mathbf{1}}\{|w_{ij}|>\eta_{n}\}-\operatorname{\mathbf{P}}(|w_{ij}|>\eta_{n})\bigr{)}\geq\epsilon n/2\biggr{)}\\ &\leq\exp\biggl{(}\frac{-\epsilon^{2}n^{2}/8}{\sum_{1\leq i\leq j\leq n}\operatorname{\mathbf{P}}(|w_{ij}|>\eta_{n})+\epsilon n/2}\biggr{)}\\ &\leq\exp\biggl{(}\frac{-\epsilon^{2}n^{2}/8}{\epsilon n}\biggr{)}=\exp(-\epsilon n/8).\end{split}

\sum_{n=1}^{\infty}\exp(-\epsilon n/8)<\infty\qquad\text{for all $\epsilon>0$,}

the Borel-Cantelli lemma implies

\frac{1}{n}\sum_{1\leq i\leq j\leq n}\operatorname{\mathbf{1}}\{|w_{ij}|>\eta_{n}\}\to 0\qquad\text{a.s.}

This implies that $\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if and only if $\mu_{\widetilde{W}_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. as explained above.

To show that $\mu_{\widetilde{W}_{n}}\Rightarrow\mu_{\mathrm{sc}}$ a.s. if and only if $\mu_{W_{n}^{\prime}}\Rightarrow\mu_{\mathrm{sc}}$ a.s., use Lemma B.2 to note that

\bigl{(}L(\mu_{\widetilde{W}_{n}},\mu_{W_{n}^{\prime}})\bigr{)}^{3}\leq\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq\eta_{n}]\bigr{)}^{2}.

Since $|a^{2}-b^{2}|=|a-b||a+b|$ , the difference between the right side and

\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq 1]\bigr{)}^{2}

is bounded above by

\begin{split}\frac{1}{n}\sum_{i,j=1}^{n}\bigl{|}\operatorname{\mathbf{E}}[w_{ij};\eta_{n}<|w_{ij}|\leq 1]\bigr{|}&\cdot\bigl{|}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq 1]+\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq\eta_{n}]\bigr{|}\\ &\leq\frac{2}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\eta_{n})\to 0.\end{split}

Thus, by (1.6), we have $L(\mu_{\widetilde{W}_{n}},\mu_{W_{n}^{\prime}})\to 0$ a.s.

Proof of (2). Since $\bigl{|}|a|-|b|\bigr{|}\leq|a-b|$ , we have

\begin{split}\biggl{|}\frac{1}{n}\sum_{i=1}^{n}&\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]-1\Bigr{|}-\frac{1}{n}\sum_{i=1}^{n}\Bigl{|}\sum_{j=1}^{n}\operatorname{\mathbf{E}}|w_{ij}^{\prime}|^{2}-1\Bigr{|}\biggr{|}\\ &\leq\frac{1}{n}\sum_{i,j=1}^{n}\Bigl{|}\operatorname{\mathbf{E}}[|w_{ij}|^{2};|w_{ij}|\leq 1]-\operatorname{\mathbf{E}}|w^{\prime}_{ij}|^{2}\Bigr{|}\\ &=\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};\eta_{n}<|w_{ij}|\leq 1]+\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq\eta_{n}]\bigr{)}^{2}.\end{split}

The first term on the right side is bounded above by

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{P}}(|w_{ij}|>\eta_{n})\to 0,

and we have shown just above that the second term also tends to $0$ .

Proof of (3). The difference between the left sides of (1.8) and (B.4) is also bounded above by

\frac{1}{n}\sum_{i,j=1}^{n}\operatorname{\mathbf{E}}[|w_{ij}|^{2};\eta_{n}<|w_{ij}|\leq 1]+\frac{1}{n}\sum_{i,j=1}^{n}\bigl{(}\operatorname{\mathbf{E}}[w_{ij};|w_{ij}|\leq\eta_{n}]\bigr{)}^{2},

which tends to $0$ as $n\to\infty$ . ∎

Proof of Lemma B.4.

Assume that $W_{n}$ is given as in Theorem 1.8, and define $W^{\prime}_{n}$ as in Lemma B.5. If we let $\epsilon_{n}:=2\eta_{n}$ , then $(W_{n}^{\prime})_{n\in\mathbf{N}}$ satisfies the conditions of Theorem 2.1. In particular, (1.5) for $W_{n}^{\prime}$ follows from (3) of Lemma B.5.

Proof of (1). Assume that the sufficiency part of Theorem 2.1 holds. If (1.9) holds, then (B.3) holds by (2) of Lemma B.5, and thus $\operatorname{\mathbf{E}}\mu_{W_{n}^{\prime}}\Rightarrow\mu_{\mathrm{sc}}$ by the sufficiency part of Theorem 2.1. Then (1) of Lemma B.5 tells us that $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ .

Proof of (2). Assume that the necessity part of Theorem 2.1 holds. If $\operatorname{\mathbf{E}}\mu_{W_{n}}\Rightarrow\mu_{\mathrm{sc}}$ , then $\operatorname{\mathbf{E}}\mu_{W_{n}^{\prime}}\Rightarrow\mu_{\mathrm{sc}}$ by (1) of Lemma B.5, and thus (B.3) holds by Theorem 2.1. This in turn implies (1.9) by (2) of Lemma B.5. ∎

References

[AGZ10] Greg W. Anderson, Alice Guionnet, and Ofer Zeitouni. An introduction to random matrices, volume 118 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 2010.
[Arn71] Ludwig Arnold. On wigner’s semicircle law for the eigenvalues of random matrices. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 19(3):191–198, 1971.
[BCC11] Charles Bordenave, Pietro Caputo, and Djalil Chafaï. Spectrum of non-Hermitian heavy tailed random matrices. Comm. Math. Phys., 307(2):513–560, 2011.
[Bil99] Patrick Billingsley. Convergence of probability measures. Wiley Series in Probability and Statistics: Probability and Statistics. John Wiley & Sons, Inc., New York, second edition, 1999. A Wiley-Interscience Publication.
[Bil12] Patrick Billingsley. Probability and measure. Wiley Series in Probability and Statistics. John Wiley & Sons, Inc., Hoboken, NJ, 2012. Anniversary edition [of MR1324786], With a foreword by Steve Lalley and a brief biography of Billingsley by Steve Koppes.
[BS10] Zhidong Bai and Jack W. Silverstein. Spectral analysis of large dimensional random matrices. Springer Series in Statistics. Springer, New York, second edition, 2010.
[Gir90] V. L. Girko. Theory of random determinants, volume 45 of Mathematics and its Applications (Soviet Series). Kluwer Academic Publishers Group, Dordrecht, 1990. Translated from the Russian.
[GL09] Adityanand Guntuboyina and Hannes Leeb. Concentration of the spectral measure of large Wishart matrices with dependent entries. Electron. Commun. Probab., 14:334–342, 2009.
[GNT15] F. Götze, A. A. Naumov, and A. N. Tikhomirov. Limit theorems for two classes of random matrices with dependent entries. Theory Probab. Appl., 59(1):23–39, 2015.
[Jun18] Paul Jung. Lévy-Khintchine random matrices and the Poisson weighted infinite skeleton tree. Trans. Amer. Math. Soc., 370(1):641–668, 2018.
[Kal02] Olav Kallenberg. Foundations of modern probability. Probability and its Applications (New York). Springer-Verlag, New York, second edition, 2002.
[Pas73] Leonid A Pastur. Spectra of random self adjoint operators. Russian mathematical surveys, 28(1):1, 1973.
[Tao12] Terence Tao. Topics in random matrix theory, volume 132 of Graduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2012.
[vLW01] J. H. van Lint and R. M. Wilson. A course in combinatorics. Cambridge University Press, Cambridge, second edition, 2001.
[Wig55] Eugene P. Wigner. Characteristic vectors of bordered matrices with infinite dimensions. Ann. of Math. (2), 62:548–564, 1955.
[Wig58] Eugene P Wigner. On the distribution of the roots of certain symmetric matrices. Annals of Mathematics, pages 325–327, 1958.