This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\RS@ifundefined

subsecref \newrefsubsecname = \RSsectxt \RS@ifundefinedthmref \newrefthmname = theorem  \RS@ifundefinedlemref \newreflemname = lemma  \newrefclaimrefcmd=Claim LABEL:#1 \newreffactrefcmd=Fact LABEL:#1 \newrefdefrefcmd=Definition LABEL:#1 \newrefthmrefcmd=Theorem LABEL:#1 \newrefcorrefcmd=Corollary LABEL:#1 \newreflemrefcmd=Lemma LABEL:#1

Boosting uniformity in quasirandom groups: fast and simple

Harm Derksen
Northeastern University
ha.derksen@northeastern.edu
Partially supported by NSF grant DMS 2147769.
   Chin Ho Lee
North Carolina State University
chinho.lee@ncsu.edu
   Emanuele Viola
Northeastern University
viola@ccs.neu.edu
Supported by NSF grant CCF-2114116.
Abstract

We study the communication complexity of multiplying k×tk\times t elements from the group H=SL(2,q)H=\text{SL}(2,q) in the number-on-forehead model with kk parties. We prove a lower bound of (tlogH)/ck(t\log H)/c^{k}. This is an exponential improvement over previous work, and matches the state-of-the-art in the area.

Relatedly, we show that the convolution of kck^{c} independent copies of a 3-uniform distribution over HmH^{m} is close to a kk-uniform distribution. This is again an exponential improvement over previous work which needed ckc^{k} copies.

The proofs are remarkably simple; the results extend to other quasirandom groups.

We also show that for any group HH, any distribution over HmH^{m} whose weight-kk Fourier coefficients are small is close to a kk-uniform distribution. This generalizes previous work in the abelian setting, and the proof is simpler.

1 Introduction and our results

Iterated multiplication of elements in a group is a fundamental problem that has a long history and wide-ranging applications, and is linked to long-standing open problems. Already in [LZ77] it has been pivotal to provide space-efficient algorithms for Dyck languages. Depending on the underlying group, iterated multiplication is complete for various complexity classes [KMR66, MC87, Mix89, BC92, IL95, Mil14]. For example, Barrington’s famous result [Mix89] shows that it is complete for NC1\text{NC}^{1} if and only if the underlying group is non-solvable. This in particular disproved a conjecture about the complexity of majority [BDFP83]. This type of results has then been taken further in the study of catalytic computation [BCK+14], leading to other surprising discoveries [BCK+14, CM20].

The focus of this paper is on number-on-forehead communication complexity [CFL83]. For a survey on the communication complexity of group products, see [Vio19], and see [KN97, RY19] for general background on communication complexity. Concretely, the input is a matrix of k×tk\times t elements aija_{ij} from a group HH, and the goal is computing j=1ta1jakj\prod_{j=1}^{t}a_{1j}\cdots a_{kj}. There are kk collaborating parties, with Party ii knowing all the input except row ii.

This problem is also linked to central open problems in communication complexity. Specifically, [GV19] conjectured that over certain groups this problem remains hard even for kk larger than logn\log n. Establishing such bounds is arguably the most significant open problem in the area. A number of candidates have been put forward over the years, but many have been ruled out via ingenious protocols, e.g. in [PRS97, BGKL03, BC08, ACFN15]. Interestingly, for the iterated-product candidate proposed in [GV19], no non-trivial protocol is known.

Iterated group products are also candidate for providing strong separations between randomized and deterministic number-on-forehead communication. The current bounds (see [Vio19]) give a separation matching one in [BDPW07]. Stronger bounds could simplify and strengthen the recent exciting separation [KLM23].

Returning to the problem, we note that its complexity heavily depends on the underlying group. If it is abelian, then the problem can be solved with constant communication, using the public-coin protocol for equality. Over certain other groups a communication lower bound of t/ckt/c^{k} follows via [Mix89] from the landmark lower bound in [BNS92] for generalized inner product; cf. [MV13]. However, this bound does not improve with the size of the group. In particular it is far from the (trivial) upper bound of tlogHt\log H, and it gives nothing when tt is constant. Motivated by a cryptographic application, [MV13] asked whether a lower bound that grows with the size of the group, ideally ctlogHct\log H, can be established over some group HH.

Gowers and Viola [GV15, GV19] proved a bound of (tlogH)/cck(t\log H)/c^{c^{k}} for the group SL(2,q)\text{SL}(2,q) of 2×22\times 2 invertible matrices over 𝔽q\mathbb{F}_{q}, which enables the motivating application from [MV13]. Subsequent work [DV23] simplified the proof and generalized it to any quasi-random group [Gow08], see also [GV19, Sha16]. While such bounds do grow with the size of the group, thus answering the question in [MV13] and enabling the motivating application in cryptography, the dependency on the number kk of parties is weak: One can only afford kk doubly-logarithmic in the input length.

In this work we give an exponential improvement and obtain bounds of the form (tlogH)/ck(t\log H)/c^{k}, thus matching the state-of-the-art in number-on-forehead communication [BNS92]. As in [GV19], we prove stronger results that even bound the advantage such protocols have when the input is promised to multiply to one of two fixed elements.

Theorem 1.

Let H=SL(2,q)H=\mathrm{SL}(2,q). Let P:Hk×t[2]P\colon H^{k\times t}\to[2] be a number-on-forehead communication protocol with kk parties and communication bb bits. For gHg\in H denote by pgp_{g} the probability that PP outputs 11 over a uniform input (ai,j)ik,jt(a_{i,j})_{i\leq k,j\leq t} such that j=1ta1jakj=g\prod_{j=1}^{t}a_{1j}\cdots a_{kj}=g. For any kk and any two g,hHg,h\in H, if tckt\geq c^{k} then |pgph|2bHt/ck|p_{g}-p_{h}|\leq 2^{b}\cdot H^{-t/c^{k}}.

The high-level proof technique is the same as in [GV19]. They reduced the problem to boosting uniformity over mm copies of HH.

Definition 2.

A distribution pp over a set SS is ϵ\mathrm{\epsilon}-uniform if |p(x)1/S|ϵ/S|p(x)-1/S|\leq\mathrm{\epsilon}/S. If S=HtS=H^{t} and ktk\leq t we say pp is (ϵ,k)(\mathrm{\epsilon},k)-uniform if for any kk coordinates, the induced distribution over those coordinates is ϵ\mathrm{\epsilon}-uniform. We say pp is kk-uniform if it is (0,k)(0,k)-uniform.

[GV19] showed that if ss is a 22-uniform distribution over HmH^{m} then the convolution (a.k.a. component-wise product) of some \ell independent copies of ss is HmH^{-m}-uniform over the whole space HmH^{m}. Note that such a result is false for abelian groups – the convolution can remain only 22-uniform. Quantitatively, they show that =cm\ell=c^{m} copies suffice. In the application to 1 one has m=2km=2^{k}, which gives the doubly logarithmic dependence on kk.

In this work we give a corresponding exponential improvement on the number of copies required to boost uniformity: we show that in fact =mc\ell=m^{c} copies suffice. Our proof is remarkably simple, especially if we start with 33-uniform distributions, which we note suffices for 1. (We discuss below extensions to 22-uniform and other groups.) We state this result next.

Theorem 3.

Let H=SL(2,q)H=\mathrm{SL}(2,q). Let pp be a 33-uniform distribution over HmH^{m}. The convolution of mcm^{c} independent copies of pp is HmH^{-m}-uniform.

Our approach allows us to double the uniformity, i.e., go from kk-uniform to 2k2k-uniform using only a constant number of convolutions, independently of kk, whereas [GV19] would use k\geq k convolutions. This points to a key difference in the techniques. In [GV19], boosting uniformity is achieved by reduction to interleaved products, and appears tailored to going from 22-uniform to 33-uniform. Our approach is different, and simpler, even taking into account the simple proof of interleaved mixing from [DV23]. It can be seen as a kk-uniform version of the flattening lemmas discovered in [Gow08, BNP08, GV19]. In a nutshell, the kk-uniformity assumption allows us to remove all “low-degree” Fourier coefficients, leaving only those of degree >k>k. Then the quasi-randomness assumption, combined with the tensor-product structure of the Fourier coefficients allows us to “flatten” distributions at a rate proportional to HckH^{-ck}, instead of HcH^{-c} as in previous work. We note that while using kk-uniformity to remove low-degree coefficients is a common proof technique, we are not aware of previous work where this is done in the non-abelian setting. This might indicate that our techniques might find other applications, and in general we advocate a systematic study of non-abelian analogues of the Fourier toolkit. Another step in this direction is discussed next.

(ϵ,k)(\mathrm{\epsilon},k)-uniformity vs. kk-uniformity.

Extending the classic work [AGM03], Rubinfeld and Xie [RX13] showed that every almost kk-uniform distribution over any Abelian product group is statistically close to some kk-uniform distribution. We generalize their result to any product group. Our approach is significantly simpler. [RX13] decomposes the given kk-uniform distribution in a real orthogonal basis instead of the Fourier basis; we show that in fact the same argument can be carried out directly over the Fourier basis. A critical observation is that removing Fourier coefficients of a fixed weight from a real function keeps the function real.

This generalization, in combination with 3, gives a refinement of 3 where the number of copies is kck^{c} and the final distribution is statistically close to a kk-uniform distribution (whereas a direct application of 3 would just give an (Hkc,k)(H^{-k^{c}},k)-uniform distribution).

Extensions.

1 and 3 above can be generalized to any quasi-random group and to distributions which are 22-uniform. This can be done by first using the results in [GV19, DV23] to boost 22-uniformity to vv-uniformity for a sufficiently large constant vv depending on the quasirandomness of the group (for SL(2,q)\text{SL}(2,q), v=3v=3 suffices). This requires a number of convolutions that is exponential in vv, but since vv is constant it can be afforded. After that, our results kick in and allow to boost faster.

2 Preliminaries

In this section we fix some notation, especially about representation theory.

For a set XX, we also write XX for its size |X||X|. We write [i][i] for the set {0,1,,i1}\{0,1,\ldots,i-1\}. Every occurrence of “cc” denotes a possibly different universal constant. Replacing “cc” with O(1)O(1) everywhere is consistent with a common interpretation of the latter. For a function f:Gf\colon G\to\mathbb{C} we denote by |f|22|f|_{2}^{2} the un-normalized quantity xG|f(x)|2\sum_{x\in G}|f(x)|^{2}.

Next we present the standard framework of representation theory. The books by Serre [Ser77], Diaconis [Dia88], and Terras [Ter99] are good references for representation theory and non-abelian Fourier analysis. The Barbados notes [Wig10] or Section 13 of [Gow17] or [GV22] provide briefer introductions. The exposition in these sources is not always consistent, and often has different aims from ours. So let us give a quick account of the theory that is most relevant for this work.

Matrices.

Let MM be a square complex matrix. We denote by tr(M)\operatorname{tr}(M) the trace of MM, by M¯\overline{M} the conjugate of MM, by MTM^{T} the transpose of MM, and by MM^{*} the conjugate transpose MT¯\overline{M^{T}} (aka adjoint, Hermitian conjugate, etc.). The matrix MM is unitary if the rows and the columns are orthonormal; equivalently M1=MM^{-1}=M^{*}.

We denote by

|M|22:=i,j|Mi,j|2=tr(MM).|M|_{2}^{2}:=\sum_{i,j}|M_{i,j}|^{2}=\operatorname{tr}(MM^{*}).

This is known as the Frobenius norm, or Schatten 2-norm, or Hilbert-Schmidt operator, etc.

If M=ABM=AB we have

|M|22=i,j|kAi,kBk,j|2i,j(k|Ai,k|2)(k|Bk,j|2)=|A|22|B|22,|M|_{2}^{2}=\sum_{i,j}\Bigl{|}\sum_{k}A_{i,k}B_{k,j}\Bigr{|}^{2}\leq\sum_{i,j}\Bigl{(}\sum_{k}|A_{i,k}|^{2}\Bigr{)}\Bigl{(}\sum_{k}|B_{k,j}|^{2}\Bigr{)}=|A|_{2}^{2}|B|_{2}^{2}, (1)

where the inequality is Cauchy–Schwarz.

Representation theory.

Let GG be a group. A representation ρ\rho of GG with dimension dd maps elements of GG to d×dd\times d unitary, complex matrices so that ρ(xy)=ρ(x)ρ(y)\rho(xy)=\rho(x)\rho(y). Thus, ρ\rho is a homomorphism from GG to the group of linear transformations of the vector space d\mathbb{C}^{d}. We denote by dρd_{\rho} the dimension of ρ\rho.

If there is a non-trivial subspace WW of d\mathbb{C}^{d} that is invariant under ρ\rho, that is, ρ(x)WW\rho(x)W\subseteq W for every xGx\in G, then ρ\rho is reducible; otherwise it is irreducible. Irreducible representations are abbreviated irreps and play a critical role in Fourier analysis. We denote by G^\widehat{G} a complete set of inequivalent irreducible representations of GG.

In every group we have

ρG^dρ2=G.\sum_{\rho\in\widehat{G}}d_{\rho}^{2}=G. (2)

We have the following fundamental orthogonality principle.

Lemma 4 (Schur’s lemma, see [Dia88], Page 11 or Lemma 2.3.3 in [Wig10]).

Let ρ,ψ\rho,\psi be irreps. Then 𝔼xρ(x)k,hψ(x)¯i,j\mathrm{\mathbb{E}}_{x}\rho(x)_{k,h}\overline{\psi(x)}_{i,j} is 0 unless ρ=ψ\rho=\psi and k=ik=i and h=jh=j, in which case it is 1/dρ1/d_{\rho}. In particular, 𝔼x|ρ(x)i,j|2=1/dρ\mathrm{\mathbb{E}}_{x}|\rho(x)_{i,j}|^{2}=1/d_{\rho}.

Let f:Gf\colon G\to\mathbb{C}. The ρ\rho-th Fourier coefficient of ff is

f^(ρ):=𝔼xf(x)ρ(x)¯.\widehat{f}(\rho):=\mathrm{\mathbb{E}}_{x}f(x)\overline{\rho(x)}.

The Fourier inversion formula is then

f(x)=ρG^dρtr(f^(ρ)ρ(x)T).f(x)=\sum_{\rho\in\widehat{G}}d_{\rho}\operatorname{tr}\bigl{(}\widehat{f}(\rho)\rho(x)^{T}\bigr{)}.

We define the convolution as follows:

pq(x):=yp(y)q(y1x).p*q(x):=\sum_{y}p(y)q(y^{-1}x).

Note that if pp and qq are distributions then pqp*q is the distribution obtained by sampling xx from pp, yy from qq, and then outputting xyxy.

We note that under this normalization we have

pq^(α)=Gp^(α)q^(α).\widehat{p*q}(\alpha)=G\cdot\widehat{p}(\alpha)\widehat{q}(\alpha).

Combining this with 1 we obtain

|pq^(α)|22G2|p^(α)|22|q^(α)|22.|\widehat{p*q}(\alpha)|_{2}^{2}\leq G^{2}\cdot|\widehat{p}(\alpha)|_{2}^{2}|\widehat{q}(\alpha)|_{2}^{2}. (3)

Parseval’s identity is

𝔼f(x)g(x)¯=ρdρtr(f^(ρ)g^(ρ)).\mathrm{\mathbb{E}}f(x)\overline{g(x)}=\sum_{\rho}d_{\rho}\operatorname{tr}\bigl{(}\widehat{f}(\rho)\widehat{g}(\rho)^{*}\bigr{)}.

In case f=gf=g this becomes

𝔼|f(x)|2=ρdρtr(f^(ρ)f^(ρ))=ρdρ|f^(ρ)|22.\mathrm{\mathbb{E}}|f(x)|^{2}=\sum_{\rho}d_{\rho}\operatorname{tr}\bigl{(}\widehat{f}(\rho)\widehat{f}(\rho)^{*}\bigr{)}=\sum_{\rho}d_{\rho}|\widehat{f}(\rho)|_{2}^{2}.
Fact 5 (Theorem 10 in Section 3.2 in [Ser77], or Theorem 9 in [Dia88]).

Any irrep ρ\rho of HnH^{n} is the tensor product i=1nρi\otimes_{i=1}^{n}\rho_{i} of nn irreps ρi\rho_{i} of HH. In particular, the dimension of ρ\rho is the product of the dimensions of the ρi\rho_{i}.

For ρ=i=1nρi\rho=\otimes_{i=1}^{n}\rho_{i} we denote by |ρ||\rho| the number of ii s.t. ρi\rho_{i} is not the trivial representation 11.

Definition 6 ([Gow08]).

A group HH is dd-quasirandom if every non-trivial irrep of HH has dimension d\geq d.

3 Boosting uniformity

In this section we prove 3. The proof follows by repeated applications of the following theorem.

Theorem 7.

Let H=SL(2,q)H=\mathrm{SL}(2,q). Let pp be a distribution over HtH^{t} that is (Hk,k)(H^{-k},k)-uniform for k3k\geq 3 and m=(1+c)km=\lceil(1+c)k\rceil. Then the convolution of cc independent copies of pp is HmH^{-m}-uniform.

Note for small kk we may have m=k+1m=k+1. But if kck\geq c then mm is a constant factor larger than kk.

The choice of the error parameter is not too important because it can be boosted with convolutions:

Lemma 8 (Lemma 3.3 in [GV19]).

Let pp and qq be (ϵ,k)(\mathrm{\epsilon},k)-uniform distributions over HmH^{m}. Then pqp*q is (ϵ2,k)(\mathrm{\epsilon}^{2},k)-uniform.

Proof.

It is enough to consider the case m=km=k. We have

|pq(x)1/Ht|=|y(p(y1)1/Ht)(q(yx)1/Ht)|y(ϵ/G)2=ϵ2/Ht.\bigl{|}p*q(x)-1/H^{t}\bigr{|}=\Bigl{|}\sum_{y}\bigl{(}p(y^{-1})-1/H^{t}\bigr{)}\bigl{(}q(yx)-1/H^{t}\bigr{)}\Bigr{|}\leq\sum_{y}(\mathrm{\epsilon}/G)^{2}=\mathrm{\epsilon}^{2}/H^{t}.\qed

In the rest of this section, we prove 7. The proof involves an excursion to 2-norms. The main step is the following new flattening lemma which can be seen as a kk-wise variant of the flattening lemmas discovered in [Gow08, BNP08, GV19].

Lemma 9.

Let pp be a distribution over HmH^{m} where HH is dd-quasirandom. Suppose pp is (Hk,k)(H^{-k},k)-uniform. Then |ppu|22|pu|222Hmkd(k+1)|p*p-u|_{2}^{2}\leq|p-u|_{2}^{2}\cdot 2\cdot H^{m-k}d^{-(k+1)}.

We need the following couple of claims to go back-and-forth between ϵ\mathrm{\epsilon}-uniform and 2-norms.

Claim 10.

|ppu||pu|22|p*p-u|_{\infty}\leq|p-u|_{2}^{2}.

Proof.

(pp1/G)(x)=y(p(y)1/G)(p(y1x)1/G)x(p(x)1/G)2.(p*p-1/G)(x)=\sum_{y}(p(y)-1/G)(p(y^{-1}x)-1/G)\leq\sum_{x}(p(x)-1/G)^{2}. The last inequality is Cauchy–Schwarz. ∎

Claim 11.

Let pp be an ϵ\mathrm{\epsilon}-uniform distribution over a group GG, and let ρ\rho be a non-trivial representation of ρ\rho with dimension dρd_{\rho}. Then |p^(ρ)|22dρϵ2G2|\widehat{p}(\rho)|_{2}^{2}\leq d_{\rho}\mathrm{\epsilon}^{2}G^{-2}

Proof.

The LHS is

i,j|p^(ρ)i,j|2\displaystyle\sum_{i,j}|\widehat{p}(\rho)_{i,j}|^{2} =i,j|𝔼x[p(x)ρ(x)¯]i,j|2\displaystyle=\sum_{i,j}|\mathbb{E}_{x}\bigl{[}p(x)\overline{\rho(x)}]_{i,j}\bigr{|}^{2}
=i,j|𝔼x[(G1+ϵxG1)ρ(x)¯]i,j|2\displaystyle=\sum_{i,j}\bigl{|}\mathbb{E}_{x}\bigl{[}(G^{-1}+\mathrm{\epsilon}_{x}G^{-1})\overline{\rho(x)}\bigr{]}_{i,j}\bigr{|}^{2} (for some ϵx with |ϵx|ϵ)\displaystyle\text{ (for some }\mathrm{\epsilon}_{x}\text{ with }|\mathrm{\epsilon}_{x}|\leq\mathrm{\epsilon}\text{)}
=i,j|𝔼x[ϵxG1ρ(x)¯i,j]|2\displaystyle=\sum_{i,j}\bigl{|}\mathbb{E}_{x}\bigl{[}\mathrm{\epsilon}_{x}G^{-1}\overline{\rho(x)}_{i,j}\bigr{]}\bigr{|}^{2} (by Lemma 4 with ψ:=1\psi:=1, using that ρ\rho is non-trivial)
G2i,j𝔼x[ϵx2|ρ(x)¯i,j|]2\displaystyle\leq G^{-2}\sum_{i,j}\mathbb{E}_{x}\bigl{[}\mathrm{\epsilon}_{x}^{2}\cdot|\overline{\rho(x)}_{i,j}|\bigr{]}^{2}
G2i,jϵ2𝔼x|ρ(x)¯i,j|2\displaystyle\leq G^{-2}\sum_{i,j}\mathrm{\epsilon}^{2}\mathbb{E}_{x}|\overline{\rho(x)}_{i,j}|^{2}
=G2i,jϵ2/dρ\displaystyle=G^{-2}\sum_{i,j}\mathrm{\epsilon}^{2}/d_{\rho} (by Lemma 4, see “in particular” part)
=G2dρϵ2.\displaystyle=G^{-2}d_{\rho}\mathrm{\epsilon}^{2}.\qed
Proof of 9.

Write GG for the group HmH^{m}. For any distribution qq we have

|qu|22=|q|221/G=Gρdρ|q^(ρ)|221/G=Gρ1dρ|q^(ρ)|22.|q-u|_{2}^{2}=|q|_{2}^{2}-1/G=G\sum_{\rho}d_{\rho}|\widehat{q}(\rho)|_{2}^{2}-1/G=G\sum_{\rho\neq 1}d_{\rho}|\widehat{q}(\rho)|_{2}^{2}.

In our case q=ppq=p*p, and using 3 and the above equality we bound the RHS by

G3ρ1dρ|p^(ρ)|24G2|pu|22maxρ1|p^(ρ)|22.\leq G^{3}\sum_{\rho\neq 1}d_{\rho}|\widehat{p}(\rho)|_{2}^{4}\leq G^{2}\cdot|p-u|_{2}^{2}\cdot\max_{\rho\neq 1}|\widehat{p}(\rho)|_{2}^{2}.

It remains to bound G2maxρ1|p^(ρ)|22G^{2}\max_{\rho\neq 1}|\widehat{p}(\rho)|_{2}^{2}. We consider two cases:

If |ρ|>k|\rho|>k, then dρdk+1d_{\rho}\geq d^{k+1} by 5, so we simply use Parseval to bound

G2|p^(ρ)|22G|p|22/dρG|p|22/dk+1.G^{2}|\widehat{p}(\rho)|_{2}^{2}\leq G|p|_{2}^{2}/d_{\rho}\leq G|p|_{2}^{2}/d^{k+1}.

We also have |p|22(maxxp(x))xp(x)=maxxp(x)2/Hk|p|_{2}^{2}\leq(\max_{x}p(x))\cdot\sum_{x}p(x)=\max_{x}p(x)\leq 2/H^{k}, because pp is in particular (1,k)(1,k)-uniform. Hence, we get a bound of G2Hkd(k+1)G\cdot 2\cdot H^{-k}d^{-(k+1)}, as desired.

If |ρ|k|\rho|\leq k, then restrict to the non-trivial coordinates of ρ\rho. On those coordinates, pp induces a distribution that is HkH^{-k}-uniform. By 11, we have

G2|p^(ρ)|22dρH2k.G^{2}|\widehat{p}(\rho)|_{2}^{2}\leq d_{\rho}H^{-2k}.

Note dρHk/2d_{\rho}\leq H^{k/2} by 2. Thus, we obtain a bound of H1.5kd(k+1)H^{-1.5k}\leq d^{-(k+1)}. ∎

We can now present the proof of 7.

Proof of 7..

It is known that HH is cH1/3\geq cH^{1/3}-quasirandom, a proof can be found in [DSV03]. Hence, the parameter d(k+1)d^{-(k+1)} in 9 is cH(k+1)/3Hck\leq cH^{-(k+1)/3}\leq H^{-ck} for any k3k\geq 3. Also, we have |pu|22=|p|221/G|p-u|_{2}^{2}=|p|_{2}^{2}-1/G. If pp is (Hk,k)(H^{-k},k)-uniform then |p|22maxxp(x)2/Hk|p|_{2}^{2}\leq\max_{x}p(x)\leq 2/H^{k}. Moreover, the uniformity parameter is maintained when taking convolutions. So one can apply the lemma a constant number of times to drive the L2L_{2} norm to HmH^{-m}, and then convolve one more time to obtain a distribution that is HmH^{-m}-uniform via 10. ∎

4 Proof of 1

Let m:=2km:=2^{k}. As noted in [GV19], an application of the box norm (Corollary 3.11 in [VW08]) shows that the LHS in the conclusion is cH2d\leq cH2^{d} times the statistical distance between the uniform distribution over HmH^{m} and the convolution of tt independent copies of the following distribution ss over HmH^{m}.

Definition 12.

Pick ui0,ui1u_{i}^{0},u_{i}^{1} for i[k]i\in[k] uniformly from HH. For x[2]kx\in[2]^{k} the xx coordinate s(x)s(x) of ss is defined to be i[k]uixi\prod_{i\in[k]}u_{i}^{x_{i}}.

Claim 13.

ss is 33-uniform.

Proof.

Pick a coordinate ii s.t. xiyix_{i}\neq y_{i}. W.l.og. let i=0i=0, x0=0x_{0}=0, and y0=1y_{0}=1. Now z0z_{0} is equal to either x0x_{0} or y0y_{0}. Assume w.l.o.g. that z0=y0z_{0}=y_{0}. Consider a coordinate jj where zjyjz_{j}\neq y_{j}. Assume again w.l.o.g. that j=1j=1. We can fix all other uiu_{i} with i2i\geq 2 and prove 3-uniformity just considering those two coordinates. For concreteness, details follow.

Up to swapping yy and zz there are only two cases to consider. The first is

x\displaystyle x =00\displaystyle=00
y\displaystyle y =10\displaystyle=10
z\displaystyle z =11.\displaystyle=11.

In this case we can fix arbitrarily the uu corresponding to yy, and then xx is uniform because of u00u_{0}^{0} and zz because of u11u_{1}^{1}.

Alternatively,

x\displaystyle x =01\displaystyle=01
y\displaystyle y =10\displaystyle=10
z\displaystyle z =11.\displaystyle=11.

In this case we can similarly fix arbitrarily the uu corresponding to zz. ∎

We note that ss is not 44-uniform, again just considering two coordinates.

To conclude the proof of 1, note that the convolution of mcm^{c} copies of ss is (Hm,m)(H^{-m},m)-uniform by 3. By 8 the convolution of tt copies is then (Hmt/mc,m)(H^{-m\cdot t/m^{c}},m)-uniform, and the result follows.

5 From (ϵ,k)(\mathrm{\epsilon},k)-uniform to kk-uniform

In this section we prove the following generalization of [RX13].

Theorem 14.

Let pp be a distribution on G=HmG=H^{m} s.t. |p^(ρ)|2ϵ/G|\widehat{p}(\rho)|_{2}\leq\mathrm{\epsilon}/G for every ρ:\rho:|ρ|[1,k]|\rho|\in[1,k]. Then pp has distance at most 3(mH)2kϵ3(mH)^{2k}\mathrm{\epsilon} from a kk-uniform distribution.

First we note the following converse to 11.

Claim 15.

Let pp be a distribution over HmH^{m}. Suppose p^(ρ)=0\widehat{p}(\rho)=0 whenever |ρ|{1,,k}|\rho|\in\{1,\ldots,k\}. Then pp is kk-uniform.

Proof.

Consider any kk coordinates; assume they are the first kk w.l.o.g. The probability of a string xHkx\in H^{k} is

yHmkp(xy).\sum_{y\in H^{m-k}}p(xy).

By the inversion formula and the assumption this is

yρdρtr(p^(ρ)ρ(xy)T)=yp^(1)+ρ:|ρ|>kdρtr(p^(ρ)yρ(xy)).\sum_{y}\sum_{\rho}d_{\rho}\operatorname{tr}\bigl{(}\widehat{p}(\rho)\rho(xy)^{T}\bigr{)}=\sum_{y}\widehat{p}(1)+\sum_{\rho:|\rho|>k}d_{\rho}\operatorname{tr}\Bigl{(}\widehat{p}(\rho)\sum_{y}\rho(xy)\Bigr{)}.

We have p^(1)=𝔼xp(x)=1/Hm\widehat{p}(1)=\mathbb{E}_{x}p(x)=1/H^{m} and so the first summand is 1/Hk1/H^{k}. We show that the second summand is 0 by showing that yρ(xy)\sum_{y}\rho(xy) is the zero matrix. To verify this, write ρ\rho as a tensor product of ρi\rho_{i} using 5. Then one entry of ρ(xy)\rho(xy) is the product of the entries of the ρi\rho_{i}. There is a non-trivial ρi\rho_{i} corresponding to a yy coordinate. The sum over that coordinate of yy yields 0 by 4. ∎

Proof of 14..

Let

(x):=ρ:|ρ|[1,k]dρtr(p^(ρ)ρ(x)T)\ell(x):=\sum_{\rho:|\rho|\in[1,k]}d_{\rho}\operatorname{tr}\bigl{(}\widehat{p}(\rho)\rho(x)^{T}\bigr{)}

be the “low-degree” part of pp, and let

p(x):=p(x)(x)=ρ:|ρ|[1,k]dρtr(p^(ρ)ρ(x)T).p^{\prime}(x):=p(x)-\ell(x)=\sum_{\rho:|\rho|\not\in[1,k]}d_{\rho}\operatorname{tr}{\bigl{(}\widehat{p}(\rho)\rho(x)^{T}}\bigr{)}.

We first observe that \ell and hence pp^{\prime} is real. This is because the conjugate ρ¯\overline{\rho} of an irrep is also an irrep, and over HtH^{t} the number of non-trivial coordinates is the same. Hence the sum over ρ\rho is the same as the sum over ρ¯\overline{\rho}. Moreover, p^(ρ¯)ρ¯=p^(ρ)ρ¯\widehat{p}(\overline{\rho})\cdot\overline{\rho}=\overline{\widehat{p}(\rho)\cdot\rho}. Therefore, we can write

2(x)=ρ:|ρ|[1,k](dρtr(p^(ρ)ρ(x)T)+dρtr(p^(ρ)ρ(x)T¯)).2\ell(x)=\sum_{\rho:|\rho|\in[1,k]}\Bigl{(}d_{\rho}\operatorname{tr}\bigl{(}\widehat{p}(\rho)\rho(x)^{T}\bigr{)}+d_{\rho}\operatorname{tr}\bigl{(}\overline{\widehat{p}(\rho)\rho(x)^{T}}\bigr{)}\Bigr{)}.

The expression inside the brackets is real and so p(x)p^{\prime}(x) is real as well.

Also, xp(x)=Gp^(1)=Gp^(1)=1\sum_{x}p^{\prime}(x)=G\cdot\widehat{p^{\prime}}(1)=G\cdot\widehat{p}(1)=1 by 4.

However, pp^{\prime} may be <0<0 on some xx. To remedy that, following previous work, we will “mix” pp^{\prime} with the uniform distribution so that the mixture becomes a distribution. Note that the mixture is kk-uniform by 15 as the low-degree non-trivial Fourier coefficients of both pp^{\prime} and uniform are zero.

Concretely, let

q:=(1β)p+β1Hm.q:=(1-\beta)p^{\prime}+\beta\frac{1}{H^{m}}.

for β\beta to be determined. Note that qq sums to 11 as pp^{\prime} and 1/Hm1/H^{m} both do.

To determine β\beta, first note that by definition

p(x)(x).p^{\prime}(x)\geq-\ell(x).

Crudely, (x)(mH)kmaxρdρ|tr(p^(ρ)ρ(x)T)|\ell(x)\leq(mH)^{k}\max_{\rho}d_{\rho}|\operatorname{tr}(\widehat{p}(\rho)\rho(x)^{T})|. Now, each absolute value is |p^(ρ)|2|ρ(x)|2.\leq|\widehat{p}(\rho)|_{2}|\rho(x)|_{2}. The first term is ϵ/G\mathrm{\epsilon}/G by assumption. For the second, we use the fact that ρ(x)\rho(x) is unitary, and so |ρ(x)|2=|I|2=dρ|\rho(x)|_{2}=|I|_{2}=\sqrt{d_{\rho}}. So dρ|tr(p^(ρ)ρ(x)T)|dρ3/2ϵ/GH3k/4ϵ/Gd_{\rho}|\operatorname{tr}(\widehat{p}(\rho)\rho(x)^{T})|\leq d_{\rho}^{3/2}\mathrm{\epsilon}/G\leq H^{3k/4}\mathrm{\epsilon}/G, using, in the last inequality, 2 over the underlying group HkH^{k}.

Hence, |(x)|(mH)2kϵ/G|\ell(x)|\leq(mH)^{2k}\mathrm{\epsilon}/G and we can set β:=(tH)2kϵ/G\beta:=(tH)^{2k}\mathrm{\epsilon}/G and obtain that q0q\geq 0. As remarked earlier, qq sums to 1, and so qq is a distribution. It remains to bound the distance between pp and qq. We have

|pq|1=|p(1β)p+β/G|1|(x)|1+β|p|1+β|1/G|1.|p-q|_{1}=\bigl{|}p-(1-\beta)p^{\prime}+\beta/G\bigr{|}_{1}\leq|\ell(x)|_{1}+\beta|p^{\prime}|_{1}+\beta\cdot|1/G|_{1}.

The last two summands are β\beta each. The first one is (mH)2kϵ=β\leq(mH)^{2k}\mathrm{\epsilon}=\beta by the bound on |(x)||\ell(x)| above. Hence the distance is 3β=3(mH)2kϵ\leq 3\beta=3(mH)^{2k}\mathrm{\epsilon}. ∎

References

  • [ACFN15] Anil Ada, Arkadev Chattopadhyay, Omar Fawzi, and Phuong Nguyen. The NOF multiparty communication complexity of composed functions. Comput. Complex., 24(3):645–694, 2015.
  • [AGM03] Noga Alon, Oded Goldreich, and Yishay Mansour. Almost k-wise independence versus k-wise independence. Inf. Process. Lett., 88(3):107–110, 2003.
  • [BC92] Michael Ben-Or and Richard Cleve. Computing algebraic formulas using a constant number of registers. SIAM J. on Computing, 21(1):54–58, 1992.
  • [BC08] Joshua Brody and Amit Chakrabarti. Sublinear communication protocols for multi-party pointer jumping and a related lower bound. In 25th Symp. on Theoretical Aspects of Computer Science (STACS), pages 145–156, 2008.
  • [BCK+14] Harry Buhrman, Richard Cleve, Michal Koucký, Bruno Loff, and Florian Speelman. Computing with a full memory: catalytic space. In ACM Symp. on the Theory of Computing (STOC), pages 857–866, 2014.
  • [BDFP83] Allan Borodin, Danny Dolev, Faith E. Fich, and Wolfgang J. Paul. Bounds for width two branching programs. In Proceedings of the 15th Annual ACM Symposium on Theory of Computing, 25-27 April, 1983, Boston, Massachusetts, USA, pages 87–93, 1983.
  • [BDPW07] Paul Beame, Matei David, Toniann Pitassi, and Philipp Woelfel. Separating deterministic from nondeterministic nof multiparty communication complexity. In 34th Coll. on Automata, Languages and Programming (ICALP), pages 134–145. Springer, 2007.
  • [BGKL03] László Babai, Anna Gál, Peter G. Kimmel, and Satyanarayana V. Lokam. Communication complexity of simultaneous messages. SIAM J. on Computing, 33(1):137–166, 2003.
  • [BNP08] László Babai, Nikolay Nikolov, and László Pyber. Product growth and mixing in finite groups. In ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 248–257, 2008.
  • [BNS92] László Babai, Noam Nisan, and Márió Szegedy. Multiparty protocols, pseudorandom generators for logspace, and time-space trade-offs. J. of Computer and System Sciences, 45(2):204–232, 1992.
  • [CFL83] Ashok K. Chandra, Merrick L. Furst, and Richard J. Lipton. Multi-party protocols. In 15th ACM Symp. on the Theory of Computing (STOC), pages 94–99, 1983.
  • [CM20] James Cook and Ian Mertz. Catalytic approaches to the tree evaluation problem. In STOC, pages 752–760. ACM, 2020.
  • [Dia88] Persi Diaconis. Group representations in probability and statistics, volume 11 of Institute of Mathematical Statistics Lecture Notes—Monograph Series. Institute of Mathematical Statistics, Hayward, CA, 1988.
  • [DSV03] G. Davidoff, P. Sarnak, and A. Valette. Elementary Number Theory, Group Theory and Ramanujan Graphs. Elementary Number Theory, Group Theory, and Ramanujan Graphs. Cambridge University Press, 2003.
  • [DV23] Harm Derksen and Emanuele Viola. Quasirandom groups enjoy interleaved mixing. Discrete Analysis, 2023.
  • [Gow08] W. T. Gowers. Quasirandom groups. Combinatorics, Probability & Computing, 17(3):363–387, 2008.
  • [Gow17] W. T. Gowers. Generalizations of Fourier analysis, and how to apply them. Bull. Amer. Math. Soc. (N.S.), 54(1):1–44, 2017.
  • [GV15] W. T. Gowers and Emanuele Viola. The communication complexity of interleaved group products. In ACM Symp. on the Theory of Computing (STOC), 2015.
  • [GV19] W. T. Gowers and Emanuele Viola. Interleaved group products. SIAM J. on Computing, 48(3):554–580, 2019. Special issue of FOCS 2016.
  • [GV22] W. T. Gowers and Emanuele Viola. Mixing in non-quasirandom groups. In ACM Innovations in Theoretical Computer Science conf. (ITCS), 2022.
  • [IL95] Neil Immerman and Susan Landau. The complexity of iterated multiplication. Inf. Comput., 116(1):103–116, 1995.
  • [KLM23] Zander Kelley, Shachar Lovett, and Raghu Meka. Explicit separations between randomized and deterministic number-on-forehead communication. CoRR, abs/2308.12451, 2023.
  • [KMR66] Kenneth Krohn, W. D. Maurer, and John Rhodes. Realizing complex Boolean functions with simple groups. Information and Control, 9:190–195, 1966.
  • [KN97] Eyal Kushilevitz and Noam Nisan. Communication complexity. Cambridge University Press, 1997.
  • [LZ77] Richard J. Lipton and Yechezkel Zalcstein. Word problems solvable in logspace. J. ACM, 24(3):522–526, 1977.
  • [MC87] Pierre McKenzie and Stephen A. Cook. The parallel complexity of abelian permutation group problems. SIAM J. Comput., 16(5):880–909, 1987.
  • [Mil14] Eric Miles. Iterated group products and leakage resilience against NC1NC^{1}. In ACM Innovations in Theoretical Computer Science conf. (ITCS), 2014.
  • [Mix89] David A. Mix Barrington. Bounded-width polynomial-size branching programs recognize exactly those languages in NC1. J. of Computer and System Sciences, 38(1):150–164, 1989.
  • [MV13] Eric Miles and Emanuele Viola. Shielding circuits with groups. In ACM Symp. on the Theory of Computing (STOC), 2013.
  • [PRS97] Pavel Pudlák, Vojtěch Rödl, and Jiří Sgall. Boolean circuits, tensor ranks, and communication complexity. SIAM J. on Computing, 26(3):605–633, 1997.
  • [RX13] Ronitt Rubinfeld and Ning Xie. Robust characterizations of k-wise independence over product spaces and related testing results. Random Struct. Algorithms, 43(3):265–312, 2013.
  • [RY19] Anup Rao and Amir Yehudayoff. Communication complexity. 2019. https://homes.cs.washington.edu/ anuprao/pubs/book.pdf.
  • [Ser77] Jean Pierre Serre. Linear Representations of Finite Groups. Springer, 1977.
  • [Sha16] Aner Shalev. Mixing, communication complexity and conjectures of Gowers and Viola. Combinatorics, Probability and Computing, pages 1–13, 6 2016. arXiv:1601.00795.
  • [Ter99] Audrey Terras. Fourier analysis on finite groups and applications, volume 43 of London Mathematical Society Student Texts. Cambridge University Press, Cambridge, 1999.
  • [Vio19] Emanuele Viola. Non-abelian combinatorics and communication complexity. SIGACT News, Complexity Theory Column, 50(3), 2019.
  • [VW08] Emanuele Viola and Avi Wigderson. Norms, XOR lemmas, and lower bounds for polynomials and protocols. Theory of Computing, 4:137–168, 2008.
  • [Wig10] Avi Wigderson. Representation theory of finite groups, and applications. Available at http://www.math.ias.edu/\simavi/TALKS/Green_Wigderson_lecture.pdf, 2010.