\RS@ifundefined

subsecref \newrefsubsecname = \RSsectxt \RS@ifundefinedthmref \newrefthmname = theorem \RS@ifundefinedlemref \newreflemname = lemma \newrefclaimrefcmd=Claim LABEL:#1 \newreffactrefcmd=Fact LABEL:#1 \newrefdefrefcmd=Definition LABEL:#1 \newrefthmrefcmd=Theorem LABEL:#1 \newrefcorrefcmd=Corollary LABEL:#1 \newreflemrefcmd=Lemma LABEL:#1

Boosting uniformity in quasirandom groups: fast and simple

Harm Derksen
Northeastern University
ha.derksen@northeastern.edu Partially supported by NSF grant DMS 2147769. Chin Ho Lee
North Carolina State University
chinho.lee@ncsu.edu Emanuele Viola
Northeastern University
viola@ccs.neu.edu Supported by NSF grant CCF-2114116.

Abstract

We study the communication complexity of multiplying $k\times t$ elements from the group $H=\text{SL}(2,q)$ in the number-on-forehead model with $k$ parties. We prove a lower bound of $(t\log H)/c^{k}$ . This is an exponential improvement over previous work, and matches the state-of-the-art in the area.

Relatedly, we show that the convolution of $k^{c}$ independent copies of a 3-uniform distribution over $H^{m}$ is close to a $k$ -uniform distribution. This is again an exponential improvement over previous work which needed $c^{k}$ copies.

The proofs are remarkably simple; the results extend to other quasirandom groups.

We also show that for any group $H$ , any distribution over $H^{m}$ whose weight- $k$ Fourier coefficients are small is close to a $k$ -uniform distribution. This generalizes previous work in the abelian setting, and the proof is simpler.

1 Introduction and our results

Iterated multiplication of elements in a group is a fundamental problem that has a long history and wide-ranging applications, and is linked to long-standing open problems. Already in [LZ77] it has been pivotal to provide space-efficient algorithms for Dyck languages. Depending on the underlying group, iterated multiplication is complete for various complexity classes [KMR66, MC87, Mix89, BC92, IL95, Mil14]. For example, Barrington’s famous result [Mix89] shows that it is complete for $\text{NC}^{1}$ if and only if the underlying group is non-solvable. This in particular disproved a conjecture about the complexity of majority [BDFP83]. This type of results has then been taken further in the study of catalytic computation [BCK⁺14], leading to other surprising discoveries [BCK⁺14, CM20].

The focus of this paper is on number-on-forehead communication complexity [CFL83]. For a survey on the communication complexity of group products, see [Vio19], and see [KN97, RY19] for general background on communication complexity. Concretely, the input is a matrix of $k\times t$ elements $a_{ij}$ from a group $H$ , and the goal is computing $\prod_{j=1}^{t}a_{1j}\cdots a_{kj}$ . There are $k$ collaborating parties, with Party $i$ knowing all the input except row $i$ .

This problem is also linked to central open problems in communication complexity. Specifically, [GV19] conjectured that over certain groups this problem remains hard even for $k$ larger than $\log n$ . Establishing such bounds is arguably the most significant open problem in the area. A number of candidates have been put forward over the years, but many have been ruled out via ingenious protocols, e.g. in [PRS97, BGKL03, BC08, ACFN15]. Interestingly, for the iterated-product candidate proposed in [GV19], no non-trivial protocol is known.

Iterated group products are also candidate for providing strong separations between randomized and deterministic number-on-forehead communication. The current bounds (see [Vio19]) give a separation matching one in [BDPW07]. Stronger bounds could simplify and strengthen the recent exciting separation [KLM23].

Returning to the problem, we note that its complexity heavily depends on the underlying group. If it is abelian, then the problem can be solved with constant communication, using the public-coin protocol for equality. Over certain other groups a communication lower bound of $t/c^{k}$ follows via [Mix89] from the landmark lower bound in [BNS92] for generalized inner product; cf. [MV13]. However, this bound does not improve with the size of the group. In particular it is far from the (trivial) upper bound of $t\log H$ , and it gives nothing when $t$ is constant. Motivated by a cryptographic application, [MV13] asked whether a lower bound that grows with the size of the group, ideally $ct\log H$ , can be established over some group $H$ .

Gowers and Viola [GV15, GV19] proved a bound of $(t\log H)/c^{c^{k}}$ for the group $\text{SL}(2,q)$ of $2\times 2$ invertible matrices over $\mathbb{F}_{q}$ , which enables the motivating application from [MV13]. Subsequent work [DV23] simplified the proof and generalized it to any quasi-random group [Gow08], see also [GV19, Sha16]. While such bounds do grow with the size of the group, thus answering the question in [MV13] and enabling the motivating application in cryptography, the dependency on the number $k$ of parties is weak: One can only afford $k$ doubly-logarithmic in the input length.

In this work we give an exponential improvement and obtain bounds of the form $(t\log H)/c^{k}$ , thus matching the state-of-the-art in number-on-forehead communication [BNS92]. As in [GV19], we prove stronger results that even bound the advantage such protocols have when the input is promised to multiply to one of two fixed elements.

Theorem 1.

Let $H=\mathrm{SL}(2,q)$ . Let $P\colon H^{k\times t}\to[2]$ be a number-on-forehead communication protocol with $k$ parties and communication $b$ bits. For $g\in H$ denote by $p_{g}$ the probability that $P$ outputs $1$ over a uniform input $(a_{i,j})_{i\leq k,j\leq t}$ such that $\prod_{j=1}^{t}a_{1j}\cdots a_{kj}=g$ . For any $k$ and any two $g,h\in H$ , if $t\geq c^{k}$ then $|p_{g}-p_{h}|\leq 2^{b}\cdot H^{-t/c^{k}}$ .

The high-level proof technique is the same as in [GV19]. They reduced the problem to boosting uniformity over $m$ copies of $H$ .

Definition 2.

A distribution $p$ over a set $S$ is $\mathrm{\epsilon}$ -uniform if $|p(x)-1/S|\leq\mathrm{\epsilon}/S$ . If $S=H^{t}$ and $k\leq t$ we say $p$ is $(\mathrm{\epsilon},k)$ -uniform if for any $k$ coordinates, the induced distribution over those coordinates is $\mathrm{\epsilon}$ -uniform. We say $p$ is $k$ -uniform if it is $(0,k)$ -uniform.

[GV19] showed that if $s$ is a $2$ -uniform distribution over $H^{m}$ then the convolution (a.k.a. component-wise product) of some $\ell$ independent copies of $s$ is $H^{-m}$ -uniform over the whole space $H^{m}$ . Note that such a result is false for abelian groups – the convolution can remain only $2$ -uniform. Quantitatively, they show that $\ell=c^{m}$ copies suffice. In the application to 1 one has $m=2^{k}$ , which gives the doubly logarithmic dependence on $k$ .

In this work we give a corresponding exponential improvement on the number of copies required to boost uniformity: we show that in fact $\ell=m^{c}$ copies suffice. Our proof is remarkably simple, especially if we start with $3$ -uniform distributions, which we note suffices for 1. (We discuss below extensions to $2$ -uniform and other groups.) We state this result next.

Theorem 3.

Let $H=\mathrm{SL}(2,q)$ . Let $p$ be a $3$ -uniform distribution over $H^{m}$ . The convolution of $m^{c}$ independent copies of $p$ is $H^{-m}$ -uniform.

Our approach allows us to double the uniformity, i.e., go from $k$ -uniform to $2k$ -uniform using only a constant number of convolutions, independently of $k$ , whereas [GV19] would use $\geq k$ convolutions. This points to a key difference in the techniques. In [GV19], boosting uniformity is achieved by reduction to interleaved products, and appears tailored to going from $2$ -uniform to $3$ -uniform. Our approach is different, and simpler, even taking into account the simple proof of interleaved mixing from [DV23]. It can be seen as a $k$ -uniform version of the flattening lemmas discovered in [Gow08, BNP08, GV19]. In a nutshell, the $k$ -uniformity assumption allows us to remove all “low-degree” Fourier coefficients, leaving only those of degree $>k$ . Then the quasi-randomness assumption, combined with the tensor-product structure of the Fourier coefficients allows us to “flatten” distributions at a rate proportional to $H^{-ck}$ , instead of $H^{-c}$ as in previous work. We note that while using $k$ -uniformity to remove low-degree coefficients is a common proof technique, we are not aware of previous work where this is done in the non-abelian setting. This might indicate that our techniques might find other applications, and in general we advocate a systematic study of non-abelian analogues of the Fourier toolkit. Another step in this direction is discussed next.

$(\mathrm{\epsilon},k)$ -uniformity vs. $k$ -uniformity.

Extending the classic work [AGM03], Rubinfeld and Xie [RX13] showed that every almost $k$ -uniform distribution over any Abelian product group is statistically close to some $k$ -uniform distribution. We generalize their result to any product group. Our approach is significantly simpler. [RX13] decomposes the given $k$ -uniform distribution in a real orthogonal basis instead of the Fourier basis; we show that in fact the same argument can be carried out directly over the Fourier basis. A critical observation is that removing Fourier coefficients of a fixed weight from a real function keeps the function real.

This generalization, in combination with 3, gives a refinement of 3 where the number of copies is $k^{c}$ and the final distribution is statistically close to a $k$ -uniform distribution (whereas a direct application of 3 would just give an $(H^{-k^{c}},k)$ -uniform distribution).

Extensions.

1 and 3 above can be generalized to any quasi-random group and to distributions which are $2$ -uniform. This can be done by first using the results in [GV19, DV23] to boost $2$ -uniformity to $v$ -uniformity for a sufficiently large constant $v$ depending on the quasirandomness of the group (for $\text{SL}(2,q)$ , $v=3$ suffices). This requires a number of convolutions that is exponential in $v$ , but since $v$ is constant it can be afforded. After that, our results kick in and allow to boost faster.

2 Preliminaries

In this section we fix some notation, especially about representation theory.

For a set $X$ , we also write $X$ for its size $|X|$ . We write $[i]$ for the set $\{0,1,\ldots,i-1\}$ . Every occurrence of “ $c$ ” denotes a possibly different universal constant. Replacing “ $c$ ” with $O(1)$ everywhere is consistent with a common interpretation of the latter. For a function $f\colon G\to\mathbb{C}$ we denote by $|f|_{2}^{2}$ the un-normalized quantity $\sum_{x\in G}|f(x)|^{2}$ .

Next we present the standard framework of representation theory. The books by Serre [Ser77], Diaconis [Dia88], and Terras [Ter99] are good references for representation theory and non-abelian Fourier analysis. The Barbados notes [Wig10] or Section 13 of [Gow17] or [GV22] provide briefer introductions. The exposition in these sources is not always consistent, and often has different aims from ours. So let us give a quick account of the theory that is most relevant for this work.

Matrices.

Let $M$ be a square complex matrix. We denote by $\operatorname{tr}(M)$ the trace of $M$ , by $\overline{M}$ the conjugate of $M$ , by $M^{T}$ the transpose of $M$ , and by $M^{*}$ the conjugate transpose $\overline{M^{T}}$ (aka adjoint, Hermitian conjugate, etc.). The matrix $M$ is unitary if the rows and the columns are orthonormal; equivalently $M^{-1}=M^{*}$ .

We denote by

|M|_{2}^{2}:=\sum_{i,j}|M_{i,j}|^{2}=\operatorname{tr}(MM^{*}).

This is known as the Frobenius norm, or Schatten 2-norm, or Hilbert-Schmidt operator, etc.

If $M=AB$ we have

|M|_{2}^{2}=\sum_{i,j}\Bigl{|}\sum_{k}A_{i,k}B_{k,j}\Bigr{|}^{2}\leq\sum_{i,j}\Bigl{(}\sum_{k}|A_{i,k}|^{2}\Bigr{)}\Bigl{(}\sum_{k}|B_{k,j}|^{2}\Bigr{)}=|A|_{2}^{2}|B|_{2}^{2},

(1)

where the inequality is Cauchy–Schwarz.

Representation theory.

Let $G$ be a group. A representation $\rho$ of $G$ with dimension $d$ maps elements of $G$ to $d\times d$ unitary, complex matrices so that $\rho(xy)=\rho(x)\rho(y)$ . Thus, $\rho$ is a homomorphism from $G$ to the group of linear transformations of the vector space $\mathbb{C}^{d}$ . We denote by $d_{\rho}$ the dimension of $\rho$ .

If there is a non-trivial subspace $W$ of $\mathbb{C}^{d}$ that is invariant under $\rho$ , that is, $\rho(x)W\subseteq W$ for every $x\in G$ , then $\rho$ is reducible; otherwise it is irreducible. Irreducible representations are abbreviated irreps and play a critical role in Fourier analysis. We denote by $\widehat{G}$ a complete set of inequivalent irreducible representations of $G$ .

In every group we have

\sum_{\rho\in\widehat{G}}d_{\rho}^{2}=G.

(2)

We have the following fundamental orthogonality principle.

Lemma 4 (Schur’s lemma, see [Dia88], Page 11 or Lemma 2.3.3 in [Wig10]).

Let $\rho,\psi$ be irreps. Then $\mathrm{\mathbb{E}}_{x}\rho(x)_{k,h}\overline{\psi(x)}_{i,j}$ is $0$ unless $\rho=\psi$ and $k=i$ and $h=j$ , in which case it is $1/d_{\rho}$ . In particular, $\mathrm{\mathbb{E}}_{x}|\rho(x)_{i,j}|^{2}=1/d_{\rho}$ .

Let $f\colon G\to\mathbb{C}$ . The $\rho$ -th Fourier coefficient of $f$ is

\widehat{f}(\rho):=\mathrm{\mathbb{E}}_{x}f(x)\overline{\rho(x)}.

The Fourier inversion formula is then

f(x)=\sum_{\rho\in\widehat{G}}d_{\rho}\operatorname{tr}\bigl{(}\widehat{f}(\rho)\rho(x)^{T}\bigr{)}.

We define the convolution as follows:

p*q(x):=\sum_{y}p(y)q(y^{-1}x).

Note that if $p$ and $q$ are distributions then $p*q$ is the distribution obtained by sampling $x$ from $p$ , $y$ from $q$ , and then outputting $xy$ .

We note that under this normalization we have

\widehat{p*q}(\alpha)=G\cdot\widehat{p}(\alpha)\widehat{q}(\alpha).

Combining this with 1 we obtain

|\widehat{p*q}(\alpha)|_{2}^{2}\leq G^{2}\cdot|\widehat{p}(\alpha)|_{2}^{2}|\widehat{q}(\alpha)|_{2}^{2}.

(3)

Parseval’s identity is

\mathrm{\mathbb{E}}f(x)\overline{g(x)}=\sum_{\rho}d_{\rho}\operatorname{tr}\bigl{(}\widehat{f}(\rho)\widehat{g}(\rho)^{*}\bigr{)}.

In case $f=g$ this becomes

\mathrm{\mathbb{E}}|f(x)|^{2}=\sum_{\rho}d_{\rho}\operatorname{tr}\bigl{(}\widehat{f}(\rho)\widehat{f}(\rho)^{*}\bigr{)}=\sum_{\rho}d_{\rho}|\widehat{f}(\rho)|_{2}^{2}.

Fact 5 (Theorem 10 in Section 3.2 in [Ser77], or Theorem 9 in [Dia88]).

Any irrep $\rho$ of $H^{n}$ is the tensor product $\otimes_{i=1}^{n}\rho_{i}$ of $n$ irreps $\rho_{i}$ of $H$ . In particular, the dimension of $\rho$ is the product of the dimensions of the $\rho_{i}$ .

For $\rho=\otimes_{i=1}^{n}\rho_{i}$ we denote by $|\rho|$ the number of $i$ s.t. $\rho_{i}$ is not the trivial representation $1$ .

Definition 6 ([Gow08]).

A group $H$ is $d$ -quasirandom if every non-trivial irrep of $H$ has dimension $\geq d$ .

3 Boosting uniformity

In this section we prove 3. The proof follows by repeated applications of the following theorem.

Theorem 7.

Let $H=\mathrm{SL}(2,q)$ . Let $p$ be a distribution over $H^{t}$ that is $(H^{-k},k)$ -uniform for $k\geq 3$ and $m=\lceil(1+c)k\rceil$ . Then the convolution of $c$ independent copies of $p$ is $H^{-m}$ -uniform.

Note for small $k$ we may have $m=k+1$ . But if $k\geq c$ then $m$ is a constant factor larger than $k$ .

The choice of the error parameter is not too important because it can be boosted with convolutions:

Lemma 8 (Lemma 3.3 in [GV19]).

Let $p$ and $q$ be $(\mathrm{\epsilon},k)$ -uniform distributions over $H^{m}$ . Then $p*q$ is $(\mathrm{\epsilon}^{2},k)$ -uniform.

Proof.

It is enough to consider the case $m=k$ . We have

\bigl{|}p*q(x)-1/H^{t}\bigr{|}=\Bigl{|}\sum_{y}\bigl{(}p(y^{-1})-1/H^{t}\bigr{)}\bigl{(}q(yx)-1/H^{t}\bigr{)}\Bigr{|}\leq\sum_{y}(\mathrm{\epsilon}/G)^{2}=\mathrm{\epsilon}^{2}/H^{t}.\qed

In the rest of this section, we prove 7. The proof involves an excursion to 2-norms. The main step is the following new flattening lemma which can be seen as a $k$ -wise variant of the flattening lemmas discovered in [Gow08, BNP08, GV19].

Lemma 9.

Let $p$ be a distribution over $H^{m}$ where $H$ is $d$ -quasirandom. Suppose $p$ is $(H^{-k},k)$ -uniform. Then $|p*p-u|_{2}^{2}\leq|p-u|_{2}^{2}\cdot 2\cdot H^{m-k}d^{-(k+1)}$ .

We need the following couple of claims to go back-and-forth between $\mathrm{\epsilon}$ -uniform and 2-norms.

Claim 10.

$|p*p-u|_{\infty}\leq|p-u|_{2}^{2}$ .

Proof.

$(p*p-1/G)(x)=\sum_{y}(p(y)-1/G)(p(y^{-1}x)-1/G)\leq\sum_{x}(p(x)-1/G)^{2}.$ The last inequality is Cauchy–Schwarz. ∎

Claim 11.

Let $p$ be an $\mathrm{\epsilon}$ -uniform distribution over a group $G$ , and let $\rho$ be a non-trivial representation of $\rho$ with dimension $d_{\rho}$ . Then $|\widehat{p}(\rho)|_{2}^{2}\leq d_{\rho}\mathrm{\epsilon}^{2}G^{-2}$

Proof.

The LHS is

$\displaystyle\sum_{i,j}\|\widehat{p}(\rho)_{i,j}\|^{2}$	$\displaystyle=\sum_{i,j}\|\mathbb{E}_{x}\bigl{[}p(x)\overline{\rho(x)}]_{i,j}\bigr{\|}^{2}$
	$\displaystyle=\sum_{i,j}\bigl{\|}\mathbb{E}_{x}\bigl{[}(G^{-1}+\mathrm{\epsilon}_{x}G^{-1})\overline{\rho(x)}\bigr{]}_{i,j}\bigr{\|}^{2}$	$\displaystyle\text{ (for some }\mathrm{\epsilon}_{x}\text{ with }\|\mathrm{\epsilon}_{x}\|\leq\mathrm{\epsilon}\text{)}$
	$\displaystyle=\sum_{i,j}\bigl{\|}\mathbb{E}_{x}\bigl{[}\mathrm{\epsilon}_{x}G^{-1}\overline{\rho(x)}_{i,j}\bigr{]}\bigr{\|}^{2}$	(by Lemma 4 with $\psi:=1$ , using that $\rho$ is non-trivial)
	$\displaystyle\leq G^{-2}\sum_{i,j}\mathbb{E}_{x}\bigl{[}\mathrm{\epsilon}_{x}^{2}\cdot\|\overline{\rho(x)}_{i,j}\|\bigr{]}^{2}$
	$\displaystyle\leq G^{-2}\sum_{i,j}\mathrm{\epsilon}^{2}\mathbb{E}_{x}\|\overline{\rho(x)}_{i,j}\|^{2}$
	$\displaystyle=G^{-2}\sum_{i,j}\mathrm{\epsilon}^{2}/d_{\rho}$	(by Lemma 4, see “in particular” part)
	$\displaystyle=G^{-2}d_{\rho}\mathrm{\epsilon}^{2}.\qed$

Proof of 9.

Write $G$ for the group $H^{m}$ . For any distribution $q$ we have

|q-u|_{2}^{2}=|q|_{2}^{2}-1/G=G\sum_{\rho}d_{\rho}|\widehat{q}(\rho)|_{2}^{2}-1/G=G\sum_{\rho\neq 1}d_{\rho}|\widehat{q}(\rho)|_{2}^{2}.

In our case $q=p*p$ , and using 3 and the above equality we bound the RHS by

\leq G^{3}\sum_{\rho\neq 1}d_{\rho}|\widehat{p}(\rho)|_{2}^{4}\leq G^{2}\cdot|p-u|_{2}^{2}\cdot\max_{\rho\neq 1}|\widehat{p}(\rho)|_{2}^{2}.

It remains to bound $G^{2}\max_{\rho\neq 1}|\widehat{p}(\rho)|_{2}^{2}$ . We consider two cases:

If $|\rho|>k$ , then $d_{\rho}\geq d^{k+1}$ by 5, so we simply use Parseval to bound

G^{2}|\widehat{p}(\rho)|_{2}^{2}\leq G|p|_{2}^{2}/d_{\rho}\leq G|p|_{2}^{2}/d^{k+1}.

We also have $|p|_{2}^{2}\leq(\max_{x}p(x))\cdot\sum_{x}p(x)=\max_{x}p(x)\leq 2/H^{k}$ , because $p$ is in particular $(1,k)$ -uniform. Hence, we get a bound of $G\cdot 2\cdot H^{-k}d^{-(k+1)}$ , as desired.

If $|\rho|\leq k$ , then restrict to the non-trivial coordinates of $\rho$ . On those coordinates, $p$ induces a distribution that is $H^{-k}$ -uniform. By 11, we have

G^{2}|\widehat{p}(\rho)|_{2}^{2}\leq d_{\rho}H^{-2k}.

Note $d_{\rho}\leq H^{k/2}$ by 2. Thus, we obtain a bound of $H^{-1.5k}\leq d^{-(k+1)}$ . ∎

We can now present the proof of 7.

Proof of 7..

It is known that $H$ is $\geq cH^{1/3}$ -quasirandom, a proof can be found in [DSV03]. Hence, the parameter $d^{-(k+1)}$ in 9 is $\leq cH^{-(k+1)/3}\leq H^{-ck}$ for any $k\geq 3$ . Also, we have $|p-u|_{2}^{2}=|p|_{2}^{2}-1/G$ . If $p$ is $(H^{-k},k)$ -uniform then $|p|_{2}^{2}\leq\max_{x}p(x)\leq 2/H^{k}$ . Moreover, the uniformity parameter is maintained when taking convolutions. So one can apply the lemma a constant number of times to drive the $L_{2}$ norm to $H^{-m}$ , and then convolve one more time to obtain a distribution that is $H^{-m}$ -uniform via 10. ∎

4 Proof of 1

Let $m:=2^{k}$ . As noted in [GV19], an application of the box norm (Corollary 3.11 in [VW08]) shows that the LHS in the conclusion is $\leq cH2^{d}$ times the statistical distance between the uniform distribution over $H^{m}$ and the convolution of $t$ independent copies of the following distribution $s$ over $H^{m}$ .

Definition 12.

Pick $u_{i}^{0},u_{i}^{1}$ for $i\in[k]$ uniformly from $H$ . For $x\in[2]^{k}$ the $x$ coordinate $s(x)$ of $s$ is defined to be $\prod_{i\in[k]}u_{i}^{x_{i}}$ .

Claim 13.

$s$ is $3$ -uniform.

Proof.

Pick a coordinate $i$ s.t. $x_{i}\neq y_{i}$ . W.l.og. let $i=0$ , $x_{0}=0$ , and $y_{0}=1$ . Now $z_{0}$ is equal to either $x_{0}$ or $y_{0}$ . Assume w.l.o.g. that $z_{0}=y_{0}$ . Consider a coordinate $j$ where $z_{j}\neq y_{j}$ . Assume again w.l.o.g. that $j=1$ . We can fix all other $u_{i}$ with $i\geq 2$ and prove 3-uniformity just considering those two coordinates. For concreteness, details follow.

Up to swapping $y$ and $z$ there are only two cases to consider. The first is

	$\displaystyle x$	$\displaystyle=00$
	$\displaystyle y$	$\displaystyle=10$
	$\displaystyle z$	$\displaystyle=11.$

In this case we can fix arbitrarily the $u$ corresponding to $y$ , and then $x$ is uniform because of $u_{0}^{0}$ and $z$ because of $u_{1}^{1}$ .

Alternatively,

	$\displaystyle x$	$\displaystyle=01$
	$\displaystyle y$	$\displaystyle=10$
	$\displaystyle z$	$\displaystyle=11.$

In this case we can similarly fix arbitrarily the $u$ corresponding to $z$ . ∎

We note that $s$ is not $4$ -uniform, again just considering two coordinates.

To conclude the proof of 1, note that the convolution of $m^{c}$ copies of $s$ is $(H^{-m},m)$ -uniform by 3. By 8 the convolution of $t$ copies is then $(H^{-m\cdot t/m^{c}},m)$ -uniform, and the result follows.

5 From $(\mathrm{\epsilon},k)$ -uniform to $k$ -uniform

In this section we prove the following generalization of [RX13].

Theorem 14.

Let $p$ be a distribution on $G=H^{m}$ s.t. $|\widehat{p}(\rho)|_{2}\leq\mathrm{\epsilon}/G$ for every $\rho:$ $|\rho|\in[1,k]$ . Then $p$ has distance at most $3(mH)^{2k}\mathrm{\epsilon}$ from a $k$ -uniform distribution.

First we note the following converse to 11.

Claim 15.

Let $p$ be a distribution over $H^{m}$ . Suppose $\widehat{p}(\rho)=0$ whenever $|\rho|\in\{1,\ldots,k\}$ . Then $p$ is $k$ -uniform.

Proof.

Consider any $k$ coordinates; assume they are the first $k$ w.l.o.g. The probability of a string $x\in H^{k}$ is

\sum_{y\in H^{m-k}}p(xy).

By the inversion formula and the assumption this is

\sum_{y}\sum_{\rho}d_{\rho}\operatorname{tr}\bigl{(}\widehat{p}(\rho)\rho(xy)^{T}\bigr{)}=\sum_{y}\widehat{p}(1)+\sum_{\rho:|\rho|>k}d_{\rho}\operatorname{tr}\Bigl{(}\widehat{p}(\rho)\sum_{y}\rho(xy)\Bigr{)}.

We have $\widehat{p}(1)=\mathbb{E}_{x}p(x)=1/H^{m}$ and so the first summand is $1/H^{k}$ . We show that the second summand is $0$ by showing that $\sum_{y}\rho(xy)$ is the zero matrix. To verify this, write $\rho$ as a tensor product of $\rho_{i}$ using 5. Then one entry of $\rho(xy)$ is the product of the entries of the $\rho_{i}$ . There is a non-trivial $\rho_{i}$ corresponding to a $y$ coordinate. The sum over that coordinate of $y$ yields $0$ by 4. ∎

Proof of 14..

Let

\ell(x):=\sum_{\rho:|\rho|\in[1,k]}d_{\rho}\operatorname{tr}\bigl{(}\widehat{p}(\rho)\rho(x)^{T}\bigr{)}

be the “low-degree” part of $p$ , and let

p^{\prime}(x):=p(x)-\ell(x)=\sum_{\rho:|\rho|\not\in[1,k]}d_{\rho}\operatorname{tr}{\bigl{(}\widehat{p}(\rho)\rho(x)^{T}}\bigr{)}.

We first observe that $\ell$ and hence $p^{\prime}$ is real. This is because the conjugate $\overline{\rho}$ of an irrep is also an irrep, and over $H^{t}$ the number of non-trivial coordinates is the same. Hence the sum over $\rho$ is the same as the sum over $\overline{\rho}$ . Moreover, $\widehat{p}(\overline{\rho})\cdot\overline{\rho}=\overline{\widehat{p}(\rho)\cdot\rho}$ . Therefore, we can write

2\ell(x)=\sum_{\rho:|\rho|\in[1,k]}\Bigl{(}d_{\rho}\operatorname{tr}\bigl{(}\widehat{p}(\rho)\rho(x)^{T}\bigr{)}+d_{\rho}\operatorname{tr}\bigl{(}\overline{\widehat{p}(\rho)\rho(x)^{T}}\bigr{)}\Bigr{)}.

The expression inside the brackets is real and so $p^{\prime}(x)$ is real as well.

Also, $\sum_{x}p^{\prime}(x)=G\cdot\widehat{p^{\prime}}(1)=G\cdot\widehat{p}(1)=1$ by 4.

However, $p^{\prime}$ may be $<0$ on some $x$ . To remedy that, following previous work, we will “mix” $p^{\prime}$ with the uniform distribution so that the mixture becomes a distribution. Note that the mixture is $k$ -uniform by 15 as the low-degree non-trivial Fourier coefficients of both $p^{\prime}$ and uniform are zero.

Concretely, let

q:=(1-\beta)p^{\prime}+\beta\frac{1}{H^{m}}.

for $\beta$ to be determined. Note that $q$ sums to $1$ as $p^{\prime}$ and $1/H^{m}$ both do.

To determine $\beta$ , first note that by definition

p^{\prime}(x)\geq-\ell(x).

Crudely, $\ell(x)\leq(mH)^{k}\max_{\rho}d_{\rho}|\operatorname{tr}(\widehat{p}(\rho)\rho(x)^{T})|$ . Now, each absolute value is $\leq|\widehat{p}(\rho)|_{2}|\rho(x)|_{2}.$ The first term is $\mathrm{\epsilon}/G$ by assumption. For the second, we use the fact that $\rho(x)$ is unitary, and so $|\rho(x)|_{2}=|I|_{2}=\sqrt{d_{\rho}}$ . So $d_{\rho}|\operatorname{tr}(\widehat{p}(\rho)\rho(x)^{T})|\leq d_{\rho}^{3/2}\mathrm{\epsilon}/G\leq H^{3k/4}\mathrm{\epsilon}/G$ , using, in the last inequality, 2 over the underlying group $H^{k}$ .

Hence, $|\ell(x)|\leq(mH)^{2k}\mathrm{\epsilon}/G$ and we can set $\beta:=(tH)^{2k}\mathrm{\epsilon}/G$ and obtain that $q\geq 0$ . As remarked earlier, $q$ sums to 1, and so $q$ is a distribution. It remains to bound the distance between $p$ and $q$ . We have

|p-q|_{1}=\bigl{|}p-(1-\beta)p^{\prime}+\beta/G\bigr{|}_{1}\leq|\ell(x)|_{1}+\beta|p^{\prime}|_{1}+\beta\cdot|1/G|_{1}.

The last two summands are $\beta$ each. The first one is $\leq(mH)^{2k}\mathrm{\epsilon}=\beta$ by the bound on $|\ell(x)|$ above. Hence the distance is $\leq 3\beta=3(mH)^{2k}\mathrm{\epsilon}$ . ∎

References

[ACFN15] Anil Ada, Arkadev Chattopadhyay, Omar Fawzi, and Phuong Nguyen. The NOF multiparty communication complexity of composed functions. Comput. Complex., 24(3):645–694, 2015.
[AGM03] Noga Alon, Oded Goldreich, and Yishay Mansour. Almost k-wise independence versus k-wise independence. Inf. Process. Lett., 88(3):107–110, 2003.
[BC92] Michael Ben-Or and Richard Cleve. Computing algebraic formulas using a constant number of registers. SIAM J. on Computing, 21(1):54–58, 1992.
[BC08] Joshua Brody and Amit Chakrabarti. Sublinear communication protocols for multi-party pointer jumping and a related lower bound. In 25th Symp. on Theoretical Aspects of Computer Science (STACS), pages 145–156, 2008.
[BCK⁺14] Harry Buhrman, Richard Cleve, Michal Koucký, Bruno Loff, and Florian Speelman. Computing with a full memory: catalytic space. In ACM Symp. on the Theory of Computing (STOC), pages 857–866, 2014.
[BDFP83] Allan Borodin, Danny Dolev, Faith E. Fich, and Wolfgang J. Paul. Bounds for width two branching programs. In Proceedings of the 15th Annual ACM Symposium on Theory of Computing, 25-27 April, 1983, Boston, Massachusetts, USA, pages 87–93, 1983.
[BDPW07] Paul Beame, Matei David, Toniann Pitassi, and Philipp Woelfel. Separating deterministic from nondeterministic nof multiparty communication complexity. In 34th Coll. on Automata, Languages and Programming (ICALP), pages 134–145. Springer, 2007.
[BGKL03] László Babai, Anna Gál, Peter G. Kimmel, and Satyanarayana V. Lokam. Communication complexity of simultaneous messages. SIAM J. on Computing, 33(1):137–166, 2003.
[BNP08] László Babai, Nikolay Nikolov, and László Pyber. Product growth and mixing in finite groups. In ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 248–257, 2008.
[BNS92] László Babai, Noam Nisan, and Márió Szegedy. Multiparty protocols, pseudorandom generators for logspace, and time-space trade-offs. J. of Computer and System Sciences, 45(2):204–232, 1992.
[CFL83] Ashok K. Chandra, Merrick L. Furst, and Richard J. Lipton. Multi-party protocols. In 15th ACM Symp. on the Theory of Computing (STOC), pages 94–99, 1983.
[CM20] James Cook and Ian Mertz. Catalytic approaches to the tree evaluation problem. In STOC, pages 752–760. ACM, 2020.
[Dia88] Persi Diaconis. Group representations in probability and statistics, volume 11 of Institute of Mathematical Statistics Lecture Notes—Monograph Series. Institute of Mathematical Statistics, Hayward, CA, 1988.
[DSV03] G. Davidoff, P. Sarnak, and A. Valette. Elementary Number Theory, Group Theory and Ramanujan Graphs. Elementary Number Theory, Group Theory, and Ramanujan Graphs. Cambridge University Press, 2003.
[DV23] Harm Derksen and Emanuele Viola. Quasirandom groups enjoy interleaved mixing. Discrete Analysis, 2023.
[Gow08] W. T. Gowers. Quasirandom groups. Combinatorics, Probability & Computing, 17(3):363–387, 2008.
[Gow17] W. T. Gowers. Generalizations of Fourier analysis, and how to apply them. Bull. Amer. Math. Soc. (N.S.), 54(1):1–44, 2017.
[GV15] W. T. Gowers and Emanuele Viola. The communication complexity of interleaved group products. In ACM Symp. on the Theory of Computing (STOC), 2015.
[GV19] W. T. Gowers and Emanuele Viola. Interleaved group products. SIAM J. on Computing, 48(3):554–580, 2019. Special issue of FOCS 2016.
[GV22] W. T. Gowers and Emanuele Viola. Mixing in non-quasirandom groups. In ACM Innovations in Theoretical Computer Science conf. (ITCS), 2022.
[IL95] Neil Immerman and Susan Landau. The complexity of iterated multiplication. Inf. Comput., 116(1):103–116, 1995.
[KLM23] Zander Kelley, Shachar Lovett, and Raghu Meka. Explicit separations between randomized and deterministic number-on-forehead communication. CoRR, abs/2308.12451, 2023.
[KMR66] Kenneth Krohn, W. D. Maurer, and John Rhodes. Realizing complex Boolean functions with simple groups. Information and Control, 9:190–195, 1966.
[KN97] Eyal Kushilevitz and Noam Nisan. Communication complexity. Cambridge University Press, 1997.
[LZ77] Richard J. Lipton and Yechezkel Zalcstein. Word problems solvable in logspace. J. ACM, 24(3):522–526, 1977.
[MC87] Pierre McKenzie and Stephen A. Cook. The parallel complexity of abelian permutation group problems. SIAM J. Comput., 16(5):880–909, 1987.
[Mil14] Eric Miles. Iterated group products and leakage resilience against $NC^{1}$ . In ACM Innovations in Theoretical Computer Science conf. (ITCS), 2014.
[Mix89] David A. Mix Barrington. Bounded-width polynomial-size branching programs recognize exactly those languages in NC¹. J. of Computer and System Sciences, 38(1):150–164, 1989.
[MV13] Eric Miles and Emanuele Viola. Shielding circuits with groups. In ACM Symp. on the Theory of Computing (STOC), 2013.
[PRS97] Pavel Pudlák, Vojtěch Rödl, and Jiří Sgall. Boolean circuits, tensor ranks, and communication complexity. SIAM J. on Computing, 26(3):605–633, 1997.
[RX13] Ronitt Rubinfeld and Ning Xie. Robust characterizations of k-wise independence over product spaces and related testing results. Random Struct. Algorithms, 43(3):265–312, 2013.
[RY19] Anup Rao and Amir Yehudayoff. Communication complexity. 2019. https://homes.cs.washington.edu/ anuprao/pubs/book.pdf.
[Ser77] Jean Pierre Serre. Linear Representations of Finite Groups. Springer, 1977.
[Sha16] Aner Shalev. Mixing, communication complexity and conjectures of Gowers and Viola. Combinatorics, Probability and Computing, pages 1–13, 6 2016. arXiv:1601.00795.
[Ter99] Audrey Terras. Fourier analysis on finite groups and applications, volume 43 of London Mathematical Society Student Texts. Cambridge University Press, Cambridge, 1999.
[Vio19] Emanuele Viola. Non-abelian combinatorics and communication complexity. SIGACT News, Complexity Theory Column, 50(3), 2019.
[VW08] Emanuele Viola and Avi Wigderson. Norms, XOR lemmas, and lower bounds for polynomials and protocols. Theory of Computing, 4:137–168, 2008.
[Wig10] Avi Wigderson. Representation theory of finite groups, and applications. Available at http://www.math.ias.edu/ $\sim$ avi/TALKS/Green_Wigderson_lecture.pdf, 2010.

$\displaystyle\sum_{i,j}\|\widehat{p}(\rho)_{i,j}\|^{2}$	$\displaystyle=\sum_{i,j}\|\mathbb{E}_{x}\bigl{[}p(x)\overline{\rho(x)}]_{i,j}\bigr{\|}^{2}$
	$\displaystyle=\sum_{i,j}\bigl{\|}\mathbb{E}_{x}\bigl{[}(G^{-1}+\mathrm{\epsilon}_{x}G^{-1})\overline{\rho(x)}\bigr{]}_{i,j}\bigr{\|}^{2}$	$\displaystyle\text{ (for some }\mathrm{\epsilon}_{x}\text{ with }\|\mathrm{\epsilon}_{x}\|\leq\mathrm{\epsilon}\text{)}$
	$\displaystyle=\sum_{i,j}\bigl{\|}\mathbb{E}_{x}\bigl{[}\mathrm{\epsilon}_{x}G^{-1}\overline{\rho(x)}_{i,j}\bigr{]}\bigr{\|}^{2}$	(by Lemma 4 with $\psi:=1$ , using that $\rho$ is non-trivial)
	$\displaystyle\leq G^{-2}\sum_{i,j}\mathbb{E}_{x}\bigl{[}\mathrm{\epsilon}_{x}^{2}\cdot\|\overline{\rho(x)}_{i,j}\|\bigr{]}^{2}$
	$\displaystyle\leq G^{-2}\sum_{i,j}\mathrm{\epsilon}^{2}\mathbb{E}_{x}\|\overline{\rho(x)}_{i,j}\|^{2}$
	$\displaystyle=G^{-2}\sum_{i,j}\mathrm{\epsilon}^{2}/d_{\rho}$	(by Lemma 4, see “in particular” part)
	$\displaystyle=G^{-2}d_{\rho}\mathrm{\epsilon}^{2}.\qed$

Boosting uniformity in quasirandom groups: fast and simple

Abstract

1 Introduction and our results

Theorem 1.

Definition 2.

Theorem 3.

(ϵ,k)(\mathrm{\epsilon},k)-uniformity vs. kk-uniformity.

Extensions.

2 Preliminaries

Matrices.

Representation theory.

Lemma 4 (Schur’s lemma, see [Dia88], Page 11 or Lemma 2.3.3 in [Wig10]).

Fact 5 (Theorem 10 in Section 3.2 in [Ser77], or Theorem 9 in [Dia88]).

Definition 6 ([Gow08]).

3 Boosting uniformity

Theorem 7.

Lemma 8 (Lemma 3.3 in [GV19]).

Proof.

Lemma 9.

Claim 10.

Proof.

Claim 11.

Proof.

Proof of 9.

Proof of 7..

4 Proof of 1

Definition 12.

Claim 13.

Proof.

5 From (ϵ,k)(\mathrm{\epsilon},k)-uniform to kk-uniform

Theorem 14.

Claim 15.

Proof.

Proof of 14..

References

$(\mathrm{\epsilon},k)$ -uniformity vs. $k$ -uniformity.

5 From $(\mathrm{\epsilon},k)$ -uniform to $k$ -uniform