This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Random Restrictions of High-Rank Tensors and Polynomial Maps

Jop Briët Supported by the Dutch Research Council (NWO) as part of the NETWORKS programme (grant no. 024.002.003).    Davi Castro-Silva Supported by the Dutch Research Council (NWO) as part of the NETWORKS programme (grant no. 024.002.003).
Abstract

Motivated by a problem in computational complexity, we consider the behavior of rank functions for tensors and polynomial maps under random coordinate restrictions. We show that, for a broad class of rank functions called natural rank functions, random coordinate restriction to a dense set will typically reduce the rank by at most a constant factor.

\dajAUTHORdetails

title = Random restrictions of high-rank tensors and polynomial maps, author = Jop Briët and Davi Castro-Silva, plaintextauthor = Jop Briët, Davi Castro-Silva, runningtitle = Random restrictions of tensors and polynomials, keywords = slice rank, analytic rank, Schmidt rank, Talagrand inequality, analysis of boolean functions, \dajEDITORdetailsyear=2024, number=9, received=23 February 2023, published=4 November 2024, doi=10.19086/da.124610,

[classification=text]

1 Introduction

Different but equivalent definitions of matrix rank have been generalized to truly different rank functions for tensors. Although they have proved useful in a variety of applications, the basic theory of these rank functions, describing for instance their interrelations and elementary properties, is still far from complete.

Without going into the definitions (which are given in Section 2), we mention a number of these rank functions to indicate some of the contexts in which they have appeared. The slice rank of a tensor was introduced by Tao [34, 35] to reformulate the breakthrough proof of the cap set conjecture due to Croot, Lev and Pach [8] and Ellenberg and Gijswijt [11]. Slice rank is generalized by the partition rank, which was introduced by Naslund to prove bounds on the size of subsets of 𝔽qn\mathbb{F}_{q}^{n} without kk-right corners [27], as well as provide exponential improvements on the Erdős–Ginzburg–Ziv constant [26]. The analytic rank is based on a measure of equidistribution for multilinear forms associated to tensors over finite fields, and was introduced by Gowers and Wolf to study solutions to linear systems of equations in large subsets of finite vectors spaces [15]. Geometric rank, defined and studied by Kopparty, Moshkovitz and Zuiddam in the context of algebraic complexity theory [23], gives a natural analogue of analytic rank for tensors over infinite fields.

Closely related to these rank functions for tensors are notions of rank for multivariate polynomials. As quadratic forms on finite-dimensional vector spaces are naturally represented by matrices after choosing a basis, matrix rank gives a corresponding notion of rank for quadratic forms. This, in turn, may be generalized to rank functions for arbitrary polynomials by considering their associated homogeneous multilinear forms. Specific problems concerning multivariate polynomials might also give rise to other notions of rank more well-suited to the application at hand.

A notion of polynomial rank akin to the partition rank of tensors was used already in the ’80s by Schmidt in work on number theory [32]. This notion has been re-discovered and proven useful on several occasions (see Section 3), and is referred to by several different names in the literature; here we will refer to it as Schmidt rank. Work on the Inverse Theorem for the Gowers uniformity norms led Green and Tao to define the notion of degree rank [16], which quantifies how hard it is to express the considered polynomial as a function of lower-degree polynomials; this notion was shown to be closely linked to equidistribution properties of multivariate polynomials over prime fields 𝔽p\mathbb{F}_{p}. Tao and Ziegler [36] later studied the relationship between the degree rank of a polynomial111The notion of degree rank studied by Tao and Ziegler was actually a slight modification of the original one, where they also allow for “non-classical polynomials” when expressing higher-degree polynomials in terms of lower-degree ones. and its analytic rank, defined as the (tensor) analytic rank of its associated homogeneous multilinear form, and exploited their close connection in order to prove the general case of the Gowers Inverse Theorem over 𝔽pn\mathbb{F}_{p}^{n}.

Recent work on constant-depth Boolean circuits by Buhrman, Neumann and the present authors gave rise to a problem on equidistribution properties of higher-dimensional polynomial maps under biased input distributions [5]. This motivated a new notion of analytic rank for (high-dimensional) polynomial maps and prompted the study of rank under random coordinate restrictions, which is the topic of this paper.

Common to the tensors, polynomials and polynomial maps considered here is that they can be viewed as maps on 𝔽X\mathbb{F}^{X}, where 𝔽\mathbb{F} is a given field and XX is a finite set indexing the variables. The main question addressed in this paper is whether, if a map ϕ\phi on 𝔽X\mathbb{F}^{X} has high rank, most of its coordinate restrictions ϕ|I\phi_{|I} on 𝔽I\mathbb{F}^{I} also have high rank for dense subsets IXI\subseteq X (where we also respect the product structure of XX in the case of tensors). Our main results show that this is the case for all “natural” rank functions, which include all those mentioned above.

1.1 The matrix case

It is instructive to first consider the case of matrices, which is simpler and illustrates the spirit of our main results. For a matrix A𝔽n×nA\in\mathbb{F}^{n\times n} and subsets I,J[n]I,J\subseteq[n], denote by A|I×JA_{|I\times J} the sub-matrix of AA induced by the rows in II and columns in JJ. Given σ(0,1)\sigma\in(0,1), consider a random set I[n]I\subseteq[n] containing each element independently with probability σ\sigma; we write I[n]σI\sim[n]_{\sigma} when II is distributed as such. Note that, if I[n]ρI\sim[n]_{\rho} and J[n]σJ\sim[n]_{\sigma} are independent, then IJ[n]ηI\cup J\sim[n]_{\eta} with η=1(1ρ)(1σ)\eta=1-(1-\rho)(1-\sigma).

Proposition 1.1.

For every σ(0,1]\sigma\in(0,1] there exists κ(0,1]\kappa\in(0,1] such that for every matrix A𝔽n×nA\in\mathbb{F}^{n\times n} we have

PrI[n]σ[rk(A|I×I)κrk(A)]12eκrk(A).\mbox{\rm Pr}_{I\sim[n]_{\sigma}}\big{[}\operatorname{rk}(A_{|I\times I})\geq\kappa\cdot\operatorname{rk}(A)\big{]}\geq 1-2e^{-\kappa\operatorname{rk}(A)}.
  • Proof:

    Write ρ=11σ\rho=1-\sqrt{1-\sigma} and let J,J[n]ρJ,J^{\prime}\sim[n]_{\rho} be independent random sets; note that JJ[n]σJ\cup J^{\prime}\sim[n]_{\sigma}. Let r=rk(A)r=\operatorname{rk}(A), and fix a set S[n]S\subseteq[n] of rr linearly independent rows of AA. By the Chernoff bound [17], the probability that the set JJ satisfies |JS|<ρr/2|J\cap S|<\rho r/2 is at most eρr/8e^{-\rho r/8}.

    Now let B:=A|(JS)×[n]B:=A_{|(J\cap S)\times[n]} be the (random) sub-matrix of AA formed by the rows in JSJ\cap S. Since its rows are linearly independent, the rank of BB is precisely |JS||J\cap S|; let T[n]T\subseteq[n] be a set of |JS||J\cap S| linearly independent columns of BB. Then the probability that |JT|<ρ|T|/2|J^{\prime}\cap T|<\rho|T|/2 is at most eρ|T|/8e^{-\rho|T|/8}, and the rank of B|(JS)×(JT)=A|(JS)×(JT)B_{|(J\cap S)\times(J^{\prime}\cap T)}=A_{|(J\cap S)\times(J^{\prime}\cap T)} is equal to |JT||J^{\prime}\cap T|. It follows from the union bound and monotonicity of rank under restrictions that, with probability at least 12eρ2r/161-2e^{-\rho^{2}r/16}, the principal sub-matrix of AA induced by JJJ\cup J^{\prime} has rank at least ρ2r/4\rho^{2}r/4. The result now follows since JJ[n]σJ\cup J^{\prime}\sim[n]_{\sigma}. \Box

1.2 Main results and outline of the paper

Here we generalize Proposition 1.1 to tensors and polynomial maps for rank functions that satisfy a few natural properties, namely “sub-additivity”, “monotonicity”, a “Lipschitz condition” and, in the case of polynomial maps, “symmetry” (see Section 2 and Section 3 for the precise definitions). Those functions which satisfy these properties are called natural rank functions; we note that all notions of rank mentioned above are natural rank functions.

Since our results are independent of the field considered (which can be finite or infinite), we will always denote it by 𝔽\mathbb{F} and suppress statements of the form “let 𝔽\mathbb{F} be a field” or “for every field 𝔽\mathbb{F}”. We begin by considering the case of tensors.

Definition 1.2 (Tensors).

For finite sets X1,,XdX_{1},\dots,X_{d}\subset\mathbb{N}, a dd-tensor is a map T:X1××Xd𝔽T:X_{1}\times\cdots\times X_{d}\to\mathbb{F}. We will associate with any dd-tensor TT a multilinear map 𝔽X1××𝔽Xd𝔽\mathbb{F}^{X_{1}}\times\cdots\times\mathbb{F}^{X_{d}}\to\mathbb{F} and an element of 𝔽X1𝔽Xd\mathbb{F}^{X_{1}}\otimes\cdots\otimes\mathbb{F}^{X_{d}} in the obvious way, and also denote these objects by TT.

Definition 1.3 (Restriction of tensors).

For a tensor TT as in Definition 1.2 and subsets I1X1,,IdXdI_{1}\subseteq X_{1},\dots,I_{d}\subseteq X_{d}, denote I[d]=I1××IdI_{[d]}=I_{1}\times\cdots\times I_{d} and write T|I[d]T_{|I_{[d]}} for the restriction of TT to I[d]I_{[d]}. If TT is viewed as an element of 𝔽X1𝔽Xd\mathbb{F}^{X_{1}}\otimes\cdots\otimes\mathbb{F}^{X_{d}}, then T|I[d]T_{|I_{[d]}} is simply a sub-tensor.

We define (𝔽)d(\mathbb{F}^{\infty})^{\otimes d} to be the set of dd-tensors over 𝔽\mathbb{F} with finite support. Note that the tensors defined on finite sets naturally embed into this set, and that the rank functions for tensors discussed above are invariant under this embedding. Our main result regarding tensors is then as follows:

Theorem 1.4.

For every dd\in\mathbb{N} and σ(0,1]\sigma\in(0,1], there exist constants C,κ>0C,\kappa>0 such that the following holds. For every natural rank function rk:(𝔽)d+\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R}_{+} and every dd-tensor Ti=1d𝔽[ni]T\in\bigotimes_{i=1}^{d}\mathbb{F}^{[n_{i}]} we have

PrI1[n1]σ,,Id[nd]σ[rk(T|I[d])κrk(T)]1Ceκrk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma},\dots,I_{d}\sim[n_{d}]_{\sigma}}\big{[}\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}\geq\kappa\cdot\operatorname{rk}(T)\big{]}\geq 1-Ce^{-\kappa\operatorname{rk}(T)}.

As noted before, the union of independent Bernoulli-random subsets of [n][n] is again a Bernoulli-random subset. As a consequence of this and monotonicity under restrictions, in the standard case of “cubic” tensors where every row is indexed by the same set, one also obtains the following symmetric version of the last theorem:

Corollary 1.5.

For every dd\in\mathbb{N} and σ(0,1]\sigma\in(0,1], there exist constants C,κ>0C,\kappa>0 such that the following holds. For every natural rank function rk:(𝔽)d+\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R}_{+} and every dd-tensor T(𝔽n)dT\in(\mathbb{F}^{n})^{\otimes d} we have

PrI[n]σ[rk(T|Id)κrk(T)]1Ceκrk(T).\mbox{\rm Pr}_{I\sim[n]_{\sigma}}\big{[}\operatorname{rk}\big{(}T_{|I^{d}}\big{)}\geq\kappa\cdot\operatorname{rk}(T)\big{]}\geq 1-Ce^{-\kappa\operatorname{rk}(T)}.

Whereas the proof of the matrix case (Proposition 1.1) uses the fact that a rank-rr matrix contains a full-rank r×rr\times r submatrix, the proof of the general case of Theorem 1.4 proceeds differently and instead uses ideas from probability theory, in particular concerning concentration inequalities on product spaces. It will be presented in Section 2.

Recently, the topic of high-rank restrictions of tensors was explored in detail by Karam [20], and Gowers provided an example of a 3-tensor of slice rank 44 which does not have a full-rank 4×4×44\times 4\times 4 subtensor (see [20, Proposition 3.1]). There are some interesting parallels between our results and those of Karam, which will be discussed later on in Section 4; to the best of our knowledge, his results do not suffice to establish our main theorem above, nor the other way around.

Next we consider the setting of polynomial maps, which are formally defined as follows:

Definition 1.6 (Polynomial map).

A polynomial map is an ordered tuple ϕ(x)=(f1(x),,fk(x))\phi(x)=\big{(}f_{1}(x),\dots,f_{k}(x)\big{)} of polynomials f1,,fk𝔽[x1,,xn]f_{1},\dots,f_{k}\in\mathbb{F}[x_{1},\dots,x_{n}]. We identify with ϕ\phi a map 𝔽n𝔽k\mathbb{F}^{n}\to\mathbb{F}^{k} in the natural way. The degree of ϕ\phi is the maximum degree of the fif_{i}.

Definition 1.7 (Restriction of polynomial maps).

For a polynomial map ϕ:𝔽n𝔽k\phi:\mathbb{F}^{n}\to\mathbb{F}^{k} and a set I[n]I\subseteq[n], define the restriction ϕ|I:𝔽I𝔽k\phi_{|I}:\mathbb{F}^{I}\to\mathbb{F}^{k} to be the map given by ϕ|I(y)=ϕ(y¯)\phi_{|I}(y)=\phi(\bar{y}), where y¯𝔽n\bar{y}\in\mathbb{F}^{n} agrees with yy on the coordinates in II and is zero elsewhere.

We denote the space of all polynomial maps ϕ:𝔽n𝔽k\phi:\mathbb{F}^{n}\to\mathbb{F}^{k} of degree at most dd by Pold(𝔽n,𝔽k)\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}), and write

Pold(𝔽,𝔽k)=nPold(𝔽n,𝔽k).\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k})=\bigcup_{n\in\mathbb{N}}\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}).

Our main result in this setting is the following:

Theorem 1.8.

For every dd\in\mathbb{N} and σ,ε(0,1]\sigma,\varepsilon\in(0,1], there exist constants κ=κ(d,σ)>0\kappa=\kappa(d,\sigma)>0 and R=R(d,σ,ε)R=R(d,\sigma,\varepsilon)\in\mathbb{N} such that the following holds. For every natural rank function rk:Pold(𝔽,𝔽k)+\operatorname{rk}:\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k})\to\mathbb{R}_{+} and every map ϕPold(𝔽n,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}) with rk(ϕ)R\operatorname{rk}(\phi)\geq R, we have

PrI[n]σ[rk(ϕ|I)κrk(ϕ)]1ε.\mbox{\rm Pr}_{I\sim[n]_{\sigma}}\big{[}\operatorname{rk}(\phi_{|I})\geq\kappa\cdot\operatorname{rk}(\phi)\big{]}\geq 1-\varepsilon.

The proof of this theorem will be given in Section 3. For reasons that will be made clear in that section, this proof will be (at least superficially) quite different from that of the tensor case, Theorem 1.4; it relies instead on results from analysis of Boolean functions taken together with elementary combinatorial arguments.

Remark 1.9.

The quantitative bounds obtained by Theorem 1.8 are much weaker than those of Theorem 1.4. When d!d! is invertible in 𝔽\mathbb{F}, however, many rank functions for polynomial maps ϕ\phi are closely related to rank functions for the symmetric tensor associated to ϕ\phi (see for instance [15, Section 3]). In such cases it might be possible to replace an application of Theorem 1.8 by Theorem 1.4 so as to obtain good bounds.

In Section 4 we discuss the relationship between our work and other works present in the literature. In particular, we will explain the specific problem which led to the study of rank under random coordinate restrictions. We will also give a simple conditional proof of our main theorems mimicking the case of matrices given in Proposition 1.1, as long as we assume the existence of a so-called “linear core” which would generalize (in a somewhat weak sense) the existence of a full-rank r×rr\times r submatrix inside matrices of rank rr. Finally, we propose some open problems to further our understanding of rank functions.

1.3 Notation

Here we collect some notation that will be used throughout the paper. Given sets I1,,IdI_{1},\dots,I_{d}, we write I[d]=I1××IdI_{[d]}=I_{1}\times\cdots\times I_{d}. For a set JJ and some σ[0,1]\sigma\in[0,1], we denote by πσJ\pi_{\sigma}^{J} the product distribution on {0,1}J\{0,1\}^{J} where each coordinate is independently set to 11 with probability σ\sigma, or to 0 with probability 1σ1-\sigma; when J=[n]J=[n], we write simply πσn\pi_{\sigma}^{n}. We write JσJ_{\sigma} for the probability distribution over subsets of JJ where each element is present independently with probability σ\sigma. To denote that a random variable XX is distributed according to a distribution μ\mu, we write XμX\sim\mu.

2 Tensors

Recall that, for a field 𝔽\mathbb{F} and integer d2d\geq 2, we denote by (𝔽)d(\mathbb{F}^{\infty})^{\otimes d} the set of all dd-tensors of finite support over 𝔽\mathbb{F}. All our results (including the values of the implied constants) will hold independently of the field considered, so we will always denote it by 𝔽\mathbb{F} without further comment.

The notions of tensor rank we will consider here are those called natural rank functions as defined below:

Definition 2.1 (Natural rank).

We say that rk:(𝔽)d+\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R}_{+} is a natural rank function if rk(𝟎)=0\operatorname{rk}({\bf 0})=0 and it satisfies the following properties:

  1. 1.

    Sub-additivity:
    rk(T+S)rk(T)+rk(S)\operatorname{rk}(T+S)\leq\operatorname{rk}(T)+\operatorname{rk}(S) for all T,S(𝔽)dT,S\in(\mathbb{F}^{\infty})^{\otimes d}.

  2. 2.

    Monotonicity under restrictions:
    rk(T|I[d])rk(T)\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}\leq\operatorname{rk}(T) for all T(𝔽)dT\in(\mathbb{F}^{\infty})^{\otimes d} and all sets I1,,IdI_{1},\dots,I_{d}\subset\mathbb{N}.

  3. 3.

    Restriction Lipschitz property:
    rk(T|J[d])rk(T|I[d])+i=1d|JiIi|\operatorname{rk}\big{(}T_{|J_{[d]}}\big{)}\leq\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}+\sum_{i=1}^{d}|J_{i}\setminus I_{i}| for all T(𝔽)dT\in(\mathbb{F}^{\infty})^{\otimes d} and all sets I1J1,,IdJdI_{1}\subseteq J_{1},\dots,I_{d}\subseteq J_{d}.

In order to motivate this definition, we now discuss several examples (and one non-example) of natural rank functions that have been studied in the literature. In what follows, given i[d]i\in[d] and an element uXiu\in X_{i}, the restriction of  T:X1××Xd𝔽T:X_{1}\times\cdots\times X_{d}\to\mathbb{F} to the set X1×Xi1×{u}×Xi+1××XdX_{1}\times\cdots X_{i-1}\times\{u\}\times X_{i+1}\times\cdots\times X_{d} is referred to as a slice. Note that property (3) implies that a slice has rank at most 1; conversely, under the assumption of property (1), property (3) is satisfied provided that slices have rank at most 1.

Slice rank

The notion of slice rank was introduced by Tao [34] to give a more symmetric version of Ellenberg and Gijswijt’s proof [11] of the cap set conjecture, and was later further studied by Sawin and Tao [35]. It has been used in the study of several extremal combinatorics problems, such as bounding the maximal sizes of tri-colored sum-free sets [2] and sunflower-free sets [28], as well as obtaining essentially tight bounds for Green’s arithmetic triangle removal lemma [12].

A nonzero dd-tensor TT (viewed as a multi-linear form) has slice rank 1 if there exist i[d]i\in[d], R:𝔽Xi𝔽R:\mathbb{F}^{X_{i}}\to\mathbb{F} and S:j[d]{i}𝔽Xj𝔽S:\prod_{j\in[d]\setminus\{i\}}\mathbb{F}^{X_{j}}\to\mathbb{F} such that TT can be factored as T=RST=RS. In general, the slice rank of TT, denoted srank(T)\operatorname{srank}(T), is then defined as the least rr\in\mathbb{N} such that there is a decomposition T=T1++TrT=T_{1}+\cdots+T_{r} where each TiT_{i} has slice rank 1. Slice rank is sub-additive since the sum of decompositions of two tensors S,TS,T gives a decomposition of S+TS+T. It is monotone under restrictions since a decomposition of TT induces a decomposition of its restrictions. The restriction Lipschitz property can easily be verified inductively slice-by-slice using sub-additivity.

Partition rank

The partition rank was introduced by Naslund [27] as a more general version of the slice rank which allows one to handle problems that require variables to be distinct. It was first used to provide bounds on the size of subsets of 𝔽qn\mathbb{F}_{q}^{n} not containing kk-right corners [27], as well as an upper bound for the Erdős-Ginzburg-Ziv constant of 𝔽pn\mathbb{F}_{p}^{n} [26].

A nonzero dd-tensor TT is defined to have partition rank 1 if there is a nonempty strict subset I[d]I\subset[d] and tensors R:iI𝔽Xi𝔽R:\prod_{i\in I}\mathbb{F}^{X_{i}}\to\mathbb{F} and S=j[d]I𝔽Xj𝔽S=\prod_{j\in[d]\setminus I}\mathbb{F}^{X_{j}}\to\mathbb{F} such that TT can be factored as T=RST=RS . In general, the partition rank of TT, denoted prank(T)\operatorname{prank}(T), is defined as the least rr\in\mathbb{N} such that there is a decomposition T=T1++TrT=T_{1}+\cdots+T_{r} where each TiT_{i} has partition rank 1. The properties of Definition 2.1 for partition rank follow for the same reasons as for slice rank.

Analytic rank

The notion of analytic rank was introduced by Gowers and Wolf [15] to study an arithmetic notion of complexity for linear systems of equations over 𝔽pn\mathbb{F}_{p}^{n}. It gives a quantitative measure of equidistribution for the values taken by a tensor, and as such are well suited for arguments relying on the dichotomy between structure and randomness. The analytic rank was further studied by Lovett [25], and more recently it was used by the first author in a problem concerning a random version of Szemerédi’s Theorem over finite fields [4].

The analytic rank is defined only if 𝔽\mathbb{F} is finite. For a non-trivial additive character χ:𝔽\chi:\mathbb{F}\to\mathbb{C}^{*}, the bias of a dd-tensor TT is defined by

bias(T)=𝔼x1𝔽X1,,xd𝔽Xdχ(T(x1,,xd));\operatorname{bias}(T)={\mathbb{E}}_{x_{1}\in\mathbb{F}^{X_{1}},\dots,x_{d}\in\mathbb{F}^{X_{d}}}\chi\big{(}T(x_{1},\dots,x_{d})\big{)};

this definition is easily shown to be independent of the character χ\chi chosen. The analytic rank of TT is then defined by

arank(T)=log|𝔽|bias(T).\operatorname{arank}(T)=-\log_{|\mathbb{F}|}\operatorname{bias}(T).

Lovett proved that the analytic rank is sub-additive [25, Theorem 1.5]. Monotonicity follows from [25, Lemma 2.1]. The restriction Lipschitz property now follows from sub-additivity and the fact that a single slice has analytic rank at most 1.

Geometric rank

Motivated by applications in algebraic complexity theory and extremal combinatorics, as well as an open problem posed by Lovett, the geometric rank was introduced by Kopparty, Moshkovitz and Zuiddam [23] as a natural extension of analytic rank beyond finite fields. While its definition is algebraic geometric in nature, it turns out to have deep connections with partition rank and analytic rank, as shown by Cohen and Moshkovitz [6, 7].

For an algebraically closed field 𝔽\mathbb{F}, the geometric rank of a dd-tensor TT is defined as

GR(T)=codim{(x1,,xd1):T(x1,,xd1,)=0},\operatorname{GR}(T)=\operatorname{codim}\{(x_{1},\dots,x_{d-1}):\,T(x_{1},\dots,x_{d-1},\cdot)=0\},

where codim\operatorname{codim} denotes the codimension of an algebraic variety. If the field 𝔽\mathbb{F} considered is not algebraically closed, then the geometric rank is naturally defined via the embedding of 𝔽\mathbb{F} in its algebraic closure. Monotonicity and sub-additivity of this rank function follow directly from Lemma 4.2 and Lemma 4.4 of [23], respectively. The restriction Lipschitz property follows from these properties and the fact that a single slice has geometric rank at most 1.

Tensor rank

It is also instructive to remark on an important non-example, the notion usually known simply as tensor rank, which is an important notion in the context of computational complexity (see for instance [24, 30]). A nonzero dd-tensor TT has tensor rank 1 if it decomposes as T=i[d]uiT=\bigotimes_{i\in[d]}u_{i} for some functions ui:𝔽Xi𝔽u_{i}:\mathbb{F}^{X_{i}}\to\mathbb{F}. The tensor rank of a general dd-tensor TT is then defined as the least rr\in\mathbb{N} such that T=T1++TrT=T_{1}+\cdots+T_{r}, where each TiT_{i} has rank 1.

This rank function is not natural according to our Definition 2.1 because it fails the restriction Lipschitz property. Consider for instance the 3-tensor T𝔽[n]×[n]×[2]T\in\mathbb{F}^{[n]\times[n]\times[2]} consisting of the identity matrix stacked on top of an all-ones matrix:

T(i,j,1)=1[i=j] and T(i,j,2)=1for i,j[n].\displaystyle T(i,j,1)=1\big{[}i=j\big{]}\,\text{ and }\,T(i,j,2)=1\quad\text{for $i,j\in[n]$.}

The identity slice has tensor rank nn (which implies that TT has tensor rank at least nn), while the all-ones slice has tensor rank 1. Thus, removing the identity slice T(,,1)T(\cdot,\cdot,1) reduces the tensor rank of TT by at least n1n-1, instead of reducing it by at most 1 as required by the restriction Lipschitz condition. This should not be taken as an indication that our definition is too restrictive, however, as this example also shows that Theorem 1.4 does not hold for tensor rank: a σ\sigma-random restriction of TT has tensor rank at most 1 with probability 1σ1-\sigma. (This simple example can also be generalized in many ways.)


The main result of this section concerns how natural rank functions behave under random coordinate restrictions. Intuitively, it shows that random restrictions of high-rank tensors will also have high rank with high probability. For convenience, we repeat below its formal statement as given in the Introduction.

Theorem 2.2 (Theorem 1.4 restated).

For every dd\in\mathbb{N} and σ(0,1]\sigma\in(0,1], there exist constants C,κ>0C,\kappa>0 such that the following holds. For every natural rank function rk:(𝔽)d+\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R}_{+} and every dd-tensor Ti=1d𝔽[ni]T\in\bigotimes_{i=1}^{d}\mathbb{F}^{[n_{i}]} we have

PrI1[n1]σ,,Id[nd]σ[rk(T|I[d])κrk(T)]1Ceκrk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma},\dots,I_{d}\sim[n_{d}]_{\sigma}}\big{[}\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}\geq\kappa\cdot\operatorname{rk}(T)\big{]}\geq 1-Ce^{-\kappa\operatorname{rk}(T)}.

We note that our proof also obtains good quantitative bounds for the parameters in the theorem, namely

C=d32σ\displaystyle C=d\sqrt{\frac{3}{2\sigma}}\quad andκ=ln23(σ4)dif σ(0,1/2),\displaystyle\text{and}\quad\kappa=\frac{\ln 2}{3}\Big{(}\frac{\sigma}{4}\Big{)}^{d}\quad\text{if }\sigma\in(0,1/2),
C=d2\displaystyle C=d\sqrt{2}\quad andκ=ln22(16)dif σ[1/2,1].\displaystyle\text{and}\quad\kappa=\frac{\ln 2}{2}\Big{(}\frac{1}{6}\Big{)}^{d}\quad\text{if }\sigma\in[1/2,1].

Note that one must always have κσd\kappa\leq\sigma^{d}, as can be seen by considering a diagonal tensor TT and the slice rank function srank\operatorname{srank}, which for diagonal tensors equals the size of their support [34, Lemma 1].

2.1 Concentration for monotone sub-additive functions

The main step in our proof of Theorem 1.4 is a type of concentration inequality for monotone sub-additive functions on the hypercube.

In order to obtain such a result we will consider a notion of two-point distance from sets A{0,1}nA\subseteq\{0,1\}^{n}, which intuitively measures how many coordinates of some given point xx cannot be captured by any two elements of AA.

Definition 2.3.

Given a point x{0,1}nx\in\{0,1\}^{n} and a set A{0,1}nA\subseteq\{0,1\}^{n}, the two-point distance between xx and AA is

h2(x;A)=miny,zA|{i[n]:xiyi&xizi}|.h_{2}(x;A)=\min_{y,z\in A}\big{|}\big{\{}i\in[n]:\,x_{i}\neq y_{i}\And x_{i}\neq z_{i}\big{\}}\big{|}.

Note that this “distance” h2(x;A)h_{2}(x;A) can be zero even if xAx\notin A; for instance, h2(00;{01,10})=0h_{2}\big{(}00;\,\{01,10\}\big{)}=0. The reason for considering this notion is that one gets much better concentration for h2h_{2} than one does for the usual Hamming distance. This is shown by the following result, which is a special case of an inequality of Talagrand [33, Theorem 3.1.1]:

Theorem 2.4.

For any parameter σ[0,1]\sigma\in[0,1] and set A{0,1}nA\subseteq\{0,1\}^{n}, we have that

Prxπσn[h2(x;A)k]2kπσn(A)2for all k0.\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}h_{2}(x;A)\geq k\big{]}\leq 2^{-k}\pi_{\sigma}^{n}(A)^{-2}\quad\text{for all $k\geq 0$.}

The next lemma is the key result needed for proving our main theorem for tensors, and it deals more abstractly with monotone, sub-additive Lipschitz functions on the hypercube. We endow {0,1}n\{0,1\}^{n} with the usual partial order, where xyx\leq y if the support of xx is contained in the support of yy. A function f:{0,1}nf:\{0,1\}^{n}\to\mathbb{R} is monotone if f(x)f(y)f(x)\leq f(y) whenever xyx\leq y, and it is sub-additive if f(x+y)f(x)+f(y)f(x+y)\leq f(x)+f(y) for all x,y{0,1}nx,y\in\{0,1\}^{n} with disjoint supports. Finally, ff is 11-Lipschitz if

|f(x)f(y)||{i[n]:xiyi}|for all x,y{0,1}n.\big{|}f(x)-f(y)\big{|}\leq\big{|}\big{\{}i\in[n]:\,x_{i}\neq y_{i}\big{\}}\big{|}\quad\text{for all $x,y\in\{0,1\}^{n}$}.

For a string x{0,1}nx\in\{0,1\}^{n} and set I[n]I\subseteq[n], we let xI{0,1}nx_{I}\in\{0,1\}^{n} be the string that equals xx on the indices in II and is zero elsewhere. The lemma that follows and its proof are inspired by an argument of Schechtman [31, Corollary 12].

Lemma 2.5 (Concentration inequality).

Let f:{0,1}n+f:\{0,1\}^{n}\to\mathbb{R}_{+} be a monotone, sub-additive 11-Lipschitz function such that f(𝟎)=0f({\bf 0})=0 and f(𝟏)=rf({\bf 1})=r. Then,

Prxπσn[f(x)σr/4]\displaystyle\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\leq\sigma r/4\big{]} 32σ 2σr/12\displaystyle\leq\sqrt{\frac{3}{2\sigma}}\,2^{-\sigma r/12}\quad if  0<σ<1/2\,0<\sigma<1/2,
Prxπσn[f(x)r/6]\displaystyle\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\leq r/6\big{]} 2 2r/12\displaystyle\leq\sqrt{2}\,2^{-r/12} if  1/2σ1\,1/2\leq\sigma\leq 1.
  • Proof:

    We first prove the first inequality above. Let σ(0,1/2)\sigma\in(0,1/2), and consider the set

    A={x{0,1}n:f(x)σr/4}.A=\big{\{}x\in\{0,1\}^{n}:\,f(x)\leq\sigma r/4\big{\}}.

    By Talagrand’s Inequality (Theorem 2.4) we have

    Prxπσn[h2(x;A)σr/6]2σr/6Prxπσn[f(x)σr/4]2,\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}h_{2}(x;A)\geq\sigma r/6\big{]}\leq 2^{-\sigma r/6}\,\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\leq\sigma r/4\big{]}^{-2},

    so to finish the proof it suffices to show that

    Prxπσn[h2(x;A)σr/6]2σ/3.\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}h_{2}(x;A)\geq\sigma r/6\big{]}\geq 2\sigma/3.

    We claim that

    {x{0,1}n:f(x)2σr/3}{x{0,1}n:h2(x;A)σr/6}.\big{\{}x\in\{0,1\}^{n}:\,f(x)\geq 2\sigma r/3\big{\}}\subseteq\big{\{}x\in\{0,1\}^{n}:\,h_{2}(x;A)\geq\sigma r/6\big{\}}. (1)

    Indeed, if h2(x;A)<σr/6h_{2}(x;A)<\sigma r/6 then there are y,zAy,z\in A such that

    |{i[n]:xiyi&xizi}|<σr/6.\big{|}\big{\{}i\in[n]:\,x_{i}\neq y_{i}\And x_{i}\neq z_{i}\big{\}}\big{|}<\sigma r/6.

    Denote I={i[n]:xiyi&xizi}I=\big{\{}i\in[n]:\,x_{i}\neq y_{i}\And x_{i}\neq z_{i}\big{\}}, J={i[n]:xi=yi}J=\big{\{}i\in[n]:\,x_{i}=y_{i}\big{\}} and J={i[n]:xiyi&xi=zi}J^{\prime}=\big{\{}i\in[n]:\,x_{i}\neq y_{i}\And x_{i}=z_{i}\big{\}}; note that these sets partition [n][n], and by assumption |I|<σr/6|I|<\sigma r/6. We then have

    f(x)\displaystyle f(x) f(xI+xJ)+f(xJ)\displaystyle\leq f(x_{I}+x_{J})+f(x_{J^{\prime}})
    =f(xI+yJ)+f(zJ)\displaystyle=f(x_{I}+y_{J})+f(z_{J^{\prime}})
    |I|+f(yJ)+f(zJ)\displaystyle\leq|I|+f(y_{J})+f(z_{J^{\prime}})
    |I|+f(y)+f(z)\displaystyle\leq|I|+f(y)+f(z)
    <2σr/3,\displaystyle<2\sigma r/3,

    so {h2(x;A)<σr/6}{f(x)<2σr/3}\big{\{}h_{2}(x;A)<\sigma r/6\big{\}}\subseteq\big{\{}f(x)<2\sigma r/3\big{\}} and inclusion (1) follows. It thus suffices to show that Prxπσn[f(x)2σr/3]2σ/3\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\geq 2\sigma r/3\big{]}\geq 2\sigma/3.

    We will next prove that, for any integer k1k\geq 1, we have

    Prxπ1/kn[f(x)r/k]1/k.\mbox{\rm Pr}_{x\sim\pi_{1/k}^{n}}\big{[}f(x)\geq r/k\big{]}\geq 1/k. (2)

    This is done by a simple coupling argument, which will be important for us again later on. Consider a uniformly random ordered kk-partition (I1,,Ik)(I_{1},\dots,I_{k}) of [n][n]; thus the kk sets IiI_{i} are pairwise disjoint and have union [n][n], with each of the knk^{n} possible such kk-tuples having the same probability. Denoting by 𝟏I\mathbf{1}_{I} the indicator function of set II, we have

    r=f(𝟏[n])=f(𝟏I1++𝟏Ik)i=1kf(𝟏Ii),r=f(\mathbf{1}_{[n]})=f(\mathbf{1}_{I_{1}}+\dots+\mathbf{1}_{I_{k}})\leq\sum_{i=1}^{k}f(\mathbf{1}_{I_{i}}),

    so f(𝟏Ii)r/kf(\mathbf{1}_{I_{i}})\geq r/k holds for at least one i[k]i\in[k] in every ordered partition (I1,,Ik)(I_{1},\dots,I_{k}). By symmetry, it follows that Pr[f(𝟏I1)r/k]1/k\mbox{\rm Pr}\big{[}f(\mathbf{1}_{I_{1}})\geq r/k\big{]}\geq 1/k. Since the marginal distribution of 𝟏I1\mathbf{1}_{I_{1}} (and every other 𝟏Ii\mathbf{1}_{I_{i}}) is precisely π1/kn\pi_{1/k}^{n}, we conclude that

    Prxπ1/kn[f(x)r/k]=Pr[f(𝟏I1)r/k]1/k,\mbox{\rm Pr}_{x\sim\pi_{1/k}^{n}}\big{[}f(x)\geq r/k\big{]}=\mbox{\rm Pr}\big{[}f(\mathbf{1}_{I_{1}})\geq r/k\big{]}\geq 1/k,

    as wished.

    Now let k=1/σk=\lceil 1/\sigma\rceil. Since 0<σ<1/20<\sigma<1/2, we have 2σ/31/kσ2\sigma/3\leq 1/k\leq\sigma, and so by monotonicity of ff

    Prxπσn[f(x)2σr/3]\displaystyle\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\geq 2\sigma r/3\big{]} Prxπ1/kn[f(x)2σr/3]\displaystyle\geq\mbox{\rm Pr}_{x\sim\pi_{1/k}^{n}}\big{[}f(x)\geq 2\sigma r/3\big{]}
    Prxπ1/kn[f(x)r/k].\displaystyle\geq\mbox{\rm Pr}_{x\sim\pi_{1/k}^{n}}\big{[}f(x)\geq r/k\big{]}.

    From inequality (2) we then conclude that

    Prxπσn[f(x)2σr/3]1/k2σ/3,\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\geq 2\sigma r/3\big{]}\geq 1/k\geq 2\sigma/3,

    finishing the proof of the first inequality in the statement of the lemma.

    The second inequality is proven using the same arguments, now applied to the parameter σ=1/2\sigma=1/2 and the set A={x{0,1}n:f(x)r/6}A=\big{\{}x\in\{0,1\}^{n}:\,f(x)\leq r/6\big{\}}. By the same argument as above,

    {x{0,1}n:f(x)r/2}{x{0,1}n:h2(x;A)r/6}.\big{\{}x\in\{0,1\}^{n}:\,f(x)\geq r/2\big{\}}\subseteq\big{\{}x\in\{0,1\}^{n}:\,h_{2}(x;A)\geq r/6\big{\}}.

    Talagrand’s inequality and coupling imply that

    2r/6Prxπ1/2n[f(x)r/6]2\displaystyle 2^{-r/6}\mbox{\rm Pr}_{x\sim\pi_{1/2}^{n}}\big{[}f(x)\leq r/6\big{]}^{-2} Prxπ1/2n[h2(x;A)r/6]\displaystyle\geq\mbox{\rm Pr}_{x\sim\pi_{1/2}^{n}}\big{[}h_{2}(x;A)\geq r/6\big{]}
    Prxπ1/2n[f(x)r/2]\displaystyle\geq\mbox{\rm Pr}_{x\sim\pi_{1/2}^{n}}\big{[}f(x)\geq r/2\big{]}
    12.\displaystyle\geq\frac{1}{2}.

    Rearranging then gives the result for σ=1/2\sigma=1/2; we obtain slightly better bounds since in this case 1/σ=21/\sigma=2 is an integer, and so no rounding errors occur. By monotonicity of ff, the same bound continues to hold for all σ>1/2\sigma>1/2. \Box

2.2 The Random Restriction Theorem

It is now a simple matter to use Lemma 2.5 to prove the Random Restriction Theorem for tensors.

  • Proof of Theorem 1.4:

    Let rk:(𝔽)d\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R} be a natural rank function, and Ti=1d𝔽[ni]T\in\bigotimes_{i=1}^{d}\mathbb{F}^{[n_{i}]} be a tensor. We wish to show that

    PrI1[n1]σ,,Id[nd]σ[rk(T|I[d])κ(σ)rk(T)]1C(σ)eκ(σ)rk(T)\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma},\dots,I_{d}\sim[n_{d}]_{\sigma}}\big{[}\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}\geq\kappa(\sigma)\cdot\operatorname{rk}(T)\big{]}\geq 1-C(\sigma)e^{-\kappa(\sigma)\operatorname{rk}(T)}

    for some well-chosen constants C(σ),κ(σ)>0C(\sigma),\kappa(\sigma)>0 depending only on the value of σ>0\sigma>0 and the order dd of the tensor. We consider here the case where σ<1/2\sigma<1/2, as the case where σ1/2\sigma\geq 1/2 is analogous.222This second case is also an immediate consequence of first case together with monotonicity of rk\operatorname{rk} under restrictions.

    Define the function f1:{0,1}n1+f_{1}:\{0,1\}^{n_{1}}\to\mathbb{R}_{+} as follows: for x{0,1}n1x\in\{0,1\}^{n_{1}} with support JJ, f1(x)f_{1}(x) is the rank of the subtensor of TT whose indices of the first row belong to JJ. In formula:

    f1(𝟏J)=rk(T|J×j=2d[nj])for all J[n1].f_{1}(\mathbf{1}_{J})=\operatorname{rk}\big{(}T_{|J\times\prod_{j=2}^{d}[n_{j}]}\big{)}\quad\text{for all }J\subseteq[n_{1}].

    By the definition of natural rank, f1f_{1} is a monotone, sub-additive 11-Lipschitz function with maximum value rk(T)\operatorname{rk}(T). Since 𝟏Jπσn1\mathbf{1}_{J}\sim\pi_{\sigma}^{n_{1}} when J[n1]σJ\sim[n_{1}]_{\sigma}, it follows from Lemma 2.5 that

    PrI1[n1]σ[rk(T|I1×j=2d[nj])σrk(T)4]132σ 2σ12rk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma}}\bigg{[}\operatorname{rk}\big{(}T_{|I_{1}\times\prod_{j=2}^{d}[n_{j}]}\big{)}\geq\frac{\sigma\operatorname{rk}(T)}{4}\bigg{]}\geq 1-\sqrt{\frac{3}{2\sigma}}\,2^{-\frac{\sigma}{12}\operatorname{rk}(T)}. (3)

    For any fixed I1[n1]I_{1}\subseteq[n_{1}] satisfying f1(𝟏I1)σrk(T)/4f_{1}(\mathbf{1}_{I_{1}})\geq\sigma\operatorname{rk}(T)/4, we define the function f2:{0,1}n2+f_{2}:\{0,1\}^{n_{2}}\to\mathbb{R}_{+} by

    f2(𝟏J)=rk(T|I1×J×j=3d[nj])for all J[n2].f_{2}(\mathbf{1}_{J})=\operatorname{rk}\big{(}T_{|I_{1}\times J\times\prod_{j=3}^{d}[n_{j}]}\big{)}\quad\text{for all }J\subseteq[n_{2}].

    This function is again monotone, sub-additive and 11-Lipschitz, and it has maximum value at least σrk(T)/4\sigma\operatorname{rk}(T)/4. By Lemma 2.5 we have

    PrI2[n2]σ[rk(T|I1×I2×j=3d[nj])(σ4)2rk(T)]132σ 2σ12σ4rk(T).\mbox{\rm Pr}_{I_{2}\sim[n_{2}]_{\sigma}}\bigg{[}\operatorname{rk}\big{(}T_{|I_{1}\times I_{2}\times\prod_{j=3}^{d}[n_{j}]}\big{)}\geq\Big{(}\frac{\sigma}{4}\Big{)}^{2}\operatorname{rk}(T)\bigg{]}\geq 1-\sqrt{\frac{3}{2\sigma}}\,2^{-\frac{\sigma}{12}\frac{\sigma}{4}\operatorname{rk}(T)}.

    Since this holds whenever f1(𝟏I1)σrk(T)/4f_{1}(\mathbf{1}_{I_{1}})\geq\sigma\operatorname{rk}(T)/4, taking the union bound together with inequality (3) we obtain

    PrI1[n1]σ,I2[n2]σ[rk(T|I1×I2×j=3d[nj])(σ4)2rk(T)]1232σ 2σ12σ4rk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma},I_{2}\sim[n_{2}]_{\sigma}}\bigg{[}\operatorname{rk}\big{(}T_{|I_{1}\times I_{2}\times\prod_{j=3}^{d}[n_{j}]}\big{)}\geq\Big{(}\frac{\sigma}{4}\Big{)}^{2}\operatorname{rk}(T)\bigg{]}\geq 1-2\sqrt{\frac{3}{2\sigma}}\,2^{-\frac{\sigma}{12}\frac{\sigma}{4}\operatorname{rk}(T)}.

    Proceeding in this same way for each row of the tensor, and always taking the union bound, we eventually conclude that

    PrI1[n1]σ,,Id[nd]σ[rk(T|I[d])(σ4)drk(T)]1d32σ 2σ12(σ4)d1rk(T),\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma},\dots,I_{d}\sim[n_{d}]_{\sigma}}\bigg{[}\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}\geq\Big{(}\frac{\sigma}{4}\Big{)}^{d}\operatorname{rk}(T)\bigg{]}\geq 1-d\sqrt{\frac{3}{2\sigma}}\,2^{-\frac{\sigma}{12}(\frac{\sigma}{4})^{d-1}\operatorname{rk}(T)},

    which finishes the proof. \Box

3 Polynomial maps

Next we consider the setting of polynomials and higher-dimensional polynomial maps. Recall that Pold(𝔽n,𝔽k)\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}) is the space of all polynomial maps ϕ:𝔽n𝔽k\phi:\mathbb{F}^{n}\to\mathbb{F}^{k} of degree at most dd, and Pold(𝔽,𝔽k)=nPold(𝔽n,𝔽k)\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k})=\bigcup_{n\in\mathbb{N}}\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}) is the space of all kk-dimensional polynomial maps over 𝔽\mathbb{F} of degree at most dd.

Definition 3.1 (Natural rank).

We say that rk:Pold(𝔽,𝔽k)+\operatorname{rk}:\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k})\to\mathbb{R}_{+} is a natural rank function if rk(𝟎)=0\operatorname{rk}({\bf 0})=0 and it satisfies the following properties:

  1. 1.

    Symmetry:
    rk(ϕ)=rk(ϕ)\operatorname{rk}(\phi)=\operatorname{rk}(-\phi) for all ϕPold(𝔽,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k}).

  2. 2.

    Sub-additivity:
    rk(ϕ+γ)rk(ϕ)+rk(γ)\operatorname{rk}(\phi+\gamma)\leq\operatorname{rk}(\phi)+\operatorname{rk}(\gamma) for all ϕ,γPold(𝔽,𝔽k)\phi,\gamma\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k}).

  3. 3.

    Monotonicity under restrictions:
    rk(ϕ|I)rk(ϕ)\operatorname{rk}(\phi_{|I})\leq\operatorname{rk}(\phi) for all ϕPold(𝔽,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k}) and all sets II\subset\mathbb{N}.

  4. 4.

    Restriction Lipschitz property:
    rk(ϕ|IJ)rk(ϕ|I)+|J|\operatorname{rk}(\phi_{|I\cup J})\leq\operatorname{rk}(\phi_{|I})+|J| for all ϕPold(𝔽,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k}) and all sets I,JI,J\subset\mathbb{N}.

Below we discuss some natural rank functions for polynomial maps which have appeared in the literature. Symmetry holds trivially for each of the ranks discussed.

Degree rank

Motivated by proving a version of the Gowers Inverse Theorem for polynomial phase functions, Green and Tao defined the notion of degree rank for functions over 𝔽pn\mathbb{F}_{p}^{n} and showed that it is closely related to equidistribution properties of polynomials [16]. For an integer d1d\geq 1, the degree-dd rank of a function f:𝔽n𝔽f:\mathbb{F}^{n}\to\mathbb{F}, denoted rkd(f)\operatorname{rk}_{d}(f), is the least rr\in\mathbb{N} such that there exist polynomials Q1,,Qr𝔽[x1,,xn]Q_{1},\dots,Q_{r}\in\mathbb{F}[x_{1},\dots,x_{n}] of degree at most dd and a function Γ:𝔽r𝔽\Gamma:\mathbb{F}^{r}\to\mathbb{F} such that ff can be decomposed as f(x)=Γ(Q1(x),,Qr(x))f(x)=\Gamma\big{(}Q_{1}(x),\dots,Q_{r}(x)\big{)}. A slightly modified version of this rank function, where one also allows “nonclassical polynomials” to enter the decomposition, was instrumental in the proof of the general Gowers Inverse Theorem over 𝔽pn\mathbb{F}_{p}^{n} by Tao and Ziegler [36].

It turns out that, restricted to polynomials of degree at most dd, the function 12rkd1\tfrac{1}{2}\operatorname{rk}_{d-1} is a natural rank function. (The need to divide by 22 is a normalization matter which has no important impact.) Sub-additivity of this rank function follows directly since the sum of decompositions of two polynomials P,QP,Q gives a decomposition of their sum P+QP+Q. Monotonicity follows since a decomposition of PP induces a decomposition of any restriction P|IP_{|I}. The restriction Lipschitz property can be verified inductively using the fact that for a polynomial P𝔽[x1,,xn]P\in\mathbb{F}[x_{1},\dots,x_{n}] of degree at most dd, we have that P=P|[n1]+xnPP=P_{|[n-1]}+x_{n}P^{\prime} for some polynomial PP^{\prime} of degree at most d1d-1. This shows that rkd1(P)rkd1(P|[n1])+2\operatorname{rk}_{d-1}(P)\leq\operatorname{rk}_{d-1}(P_{|[n-1]})+2, where the extra 2 comes from the polynomials PP^{\prime} and xnx_{n}.

Schmidt rank

A refinement of the notion of degree-dd rank was introduced by Schmidt in [32], which was rediscovered independently on multiple occasions later on. The same notion, specialized to cubic polynomials, was defined in [9] and referred to as q-rank, while in [21] it was defined for homogeneous polynomials and referred to simply as rank. It appeared under the name strength in [1], it was studied in [18] without being explicitly defined and then again in [19], where it was called strong rank.

The Schmidt rank of a degree-dd polynomial P𝔽[x1,,xn]P\in\mathbb{F}[x_{1},\dots,x_{n}], which we will denote prank(P)\operatorname{prank}(P) to reflect its similarity with partition rank for tensors, is the least rr\in\mathbb{N} such that PP has a decomposition of the form P=Q1R1+QrRr+Qr+1P=Q_{1}R_{1}+\cdots Q_{r}R_{r}+Q_{r+1} where Qi,RiQ_{i},R_{i} are non-constant polynomials such that deg(Qi)+deg(Ri)d\deg(Q_{i})+\deg(R_{i})\leq d and deg(Qr+1)d1\deg(Q_{r+1})\leq d-1. Sub-additivity and monotonicity follow for the same reasons as for slice and partition rank of tensors. The argument used for degree-dd rank shows the restriction Lipschitz property.

Analytic rank

The next notion of rank is defined more generally for higher-dimensional polynomial maps, and requires the field 𝔽\mathbb{F} to be finite. It was introduced in the context of circuit complexity and error-correcting codes [5]; see Section 4 for more details. For a finite field 𝔽\mathbb{F}, the analytic dd-rank of a map ϕPold(𝔽n;𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{n};\mathbb{F}^{k}) is defined by

arankd(ϕ)=log|𝔽|(maxψ:𝔽n𝔽k,deg(ψ)<dPrx𝔽n[ϕ(x)=ψ(x)]).\operatorname{arank}_{d}(\phi)=-\log_{|\mathbb{F}|}\bigg{(}\max_{\psi:\mathbb{F}^{n}\to\mathbb{F}^{k},\,\deg(\psi)<d}\mbox{\rm Pr}_{x\in\mathbb{F}^{n}}\big{[}\phi(x)=\psi(x)\big{]}\bigg{)}.

Sub-additivity, monotonicity and the restriction Lipschitz property for this rank function were proven in [5, Section 5].

Finally, we note that there is also another notion of analytic rank specific to polynomials, which equals the tensor analytic rank of their associated homogeneous multilinear form. It was introduced together with the notion of tensor analytic rank by Gowers and Wolf [15], and can be equivalently defined for polynomials PP of degree dd by

log|𝔽|χ(P)Ud2d,-\log_{|\mathbb{F}|}\|\chi(P)\|_{U^{d}}^{2^{d}},

where χ\chi is a nontrivial additive character of 𝔽\mathbb{F} and Ud\|\cdot\|_{U^{d}} is the Gowers uniformity norm of order dd. Sub-additivity and monotonicity under restrictions follow from the corresponding properties for tensor analytic rank. Upon normalization by a factor 1/d1/d, the restriction Lipschitz property follows from the same property for the analytic rank of tensors. This notion of rank was also important in Tao and Ziegler’s proof of the Gowers Inverse Theorem over 𝔽pn\mathbb{F}_{p}^{n} [36].


Our main result shows that random restrictions of a high-rank polynomial map will also have high rank with high probability. We recall below its formal statement as given in the Introduction.

Theorem 3.2 (Theorem 1.8 restated).

For every dd\in\mathbb{N} and σ,ε(0,1]\sigma,\varepsilon\in(0,1], there exist constants κ=κ(d,σ)>0\kappa=\kappa(d,\sigma)>0 and R=R(d,σ,ε)R=R(d,\sigma,\varepsilon)\in\mathbb{N} such that the following holds. For every natural rank function rk:Pold(𝔽,𝔽k)\operatorname{rk}:\operatorname{Pol}_{\leq d}(\mathbb{F}^{\infty},\mathbb{F}^{k})\to\mathbb{R} and every map ϕPold(𝔽n,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}) with rk(ϕ)R\operatorname{rk}(\phi)\geq R, we have

PrI[n]σ[rk(ϕ|I)κrk(ϕ)]1ε.\mbox{\rm Pr}_{I\sim[n]_{\sigma}}\big{[}\operatorname{rk}(\phi_{|I})\geq\kappa\cdot\operatorname{rk}(\phi)\big{]}\geq 1-\varepsilon.

The proof of this theorem proceeds differently from the proof of Theorem 1.4, which gives the analogous result for tensors. The reason for this is that the natural choice of function f:{0,1}n+f:\{0,1\}^{n}\to\mathbb{R}_{+} to which one might apply the concentration inequality in Lemma 2.5, given by f(𝟏I)=rk(ϕ|I)f(\mathbf{1}_{I})=\operatorname{rk}(\phi_{|I}) (for some rank function rk\operatorname{rk}), might fail to be sub-additive as a function on the boolean hypercube while rk\operatorname{rk} is sub-additive as a natural rank function. For instance, consider (in dimension k=1k=1) a 2n2n-variate polynomial p(x1,,xn,y1,,yn)p(x_{1},\dots,x_{n},y_{1},\dots,y_{n}) such that each monomial contains both an xx variable and a yy variable. Then the rank of pp restricted only to its xx variables is zero, and similarly for the restriction to the yy variables, while rk(p)\operatorname{rk}(p) can be of order Θ(n)\Theta(n). This shows that rk(p|IJ)\operatorname{rk}(p_{|I\cup J}) can be much larger than rk(p|I)+rk(p|J)\operatorname{rk}(p_{|I})+\operatorname{rk}(p_{|J}), and the argument used in the tensor case breaks down.

3.1 The proof in expectation

The first step in our proof of Theorem 1.8 is to show that the desired result is true in expectation, rather than with high probability. More precisely, we wish to show an inequality of the form

𝔼I[n]σrk(ϕ|I)c(σ,d)rk(ϕ){\mathbb{E}}_{I\sim[n]_{\sigma}}\operatorname{rk}(\phi_{|I})\geq c(\sigma,d)\operatorname{rk}(\phi)

which is valid for all degree-dd maps ϕPold(𝔽n,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}) and all natural rank functions.

Towards proving the inequality above by induction on the degree, it will be convenient to generalize it to polynomial maps over commutative rings rather than only fields. The reason for doing so is that, by considering some subset of variables {xa:aA}\{x_{a}:a\in A\} to be constants, we might be able to decrease the degree of the associated polynomial map over the remaining variables {xb:bA}\{x_{b}:b\notin A\}.

More formally, given a polynomial map ϕPold(𝔽n,𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{n},\mathbb{F}^{k}) and two disjoint subsets A,B[n]A,B\subset[n], denote by ϕ[A,B]\phi[A,B] the sum of the monomials in ϕ\phi involving at least one variable from each AA and BB. Note that ϕ[A,B]\phi[A,B] can be regarded as a polynomial map in three different ways: either as having coefficients in 𝔽\mathbb{F} and variables in {xi:i[n]}\{x_{i}:i\in[n]\}; or having coefficients in the ring 𝔽[xb:bB]\mathbb{F}[x_{b}:b\in B] and variables in {xa:aA}\{x_{a}:a\in A\}; or having coefficients in 𝔽[xa:aA]\mathbb{F}[x_{a}:a\in A] and variables in {xb:bB}\{x_{b}:b\in B\}. The advantage of these last two representations is that the degree of ϕ[A,B]\phi[A,B] decreases, since every monomial will contain formal variables which are now elements of the ring (and are thus regarded as constants).

For a commutative ring RR, denote by R[x1,,xn]dR[x_{1},\dots,x_{n}]_{\leq d} the set of formal polynomials in variables x1,,xnx_{1},\dots,x_{n} with coefficients over RR and degree at most dd. For a bipartition [n]=AB[n]=A\uplus B, define the polynomial ring RA=R[xa:aA]R_{A}=R[x_{a}:\,a\in A] and consider the natural identification of R[x1,,xn]R[x_{1},\dots,x_{n}] and RA[xb:bB]R_{A}[x_{b}:\,b\in B]. Under this identification, a function rk:(R[x1,,xn])k+\operatorname{rk}:(R[x_{1},\dots,x_{n}])^{k}\to\mathbb{R}_{+} induces a function rk:(RA[xb:bB])k+\operatorname{rk}:(R_{A}[x_{b}:\,b\in B])^{k}\to\mathbb{R}_{+} in a way that preserves symmetry, sub-additivity and monotonicity under restrictions.

Lemma 3.3 (Linear expectation).

For every σ(0,1]\sigma\in(0,1] and every integer d1d\geq 1 there exists a constant c(σ,d)>0c(\sigma,d)>0 such that the following holds. Let n,kn,k be positive integers, RR be a commutative ring and rk:(R[x1,,xn]d)k+\operatorname{rk}:(R[x_{1},\dots,x_{n}]_{\leq d})^{k}\to\mathbb{R}_{+} be symmetric, sub-additive and monotone under restrictions. Then, for any map ϕ(R[x1,,xn]d)k\phi\in(R[x_{1},\dots,x_{n}]_{\leq d})^{k} we have

𝔼J[n]σrk(ϕ|J)c(σ,d)rk(ϕ).{\mathbb{E}}_{J\sim[n]_{\sigma}}\operatorname{rk}(\phi_{|J})\geq c(\sigma,d)\operatorname{rk}(\phi).
  • Proof:

    For simplicity, we will prove the result in the special case where σ=1/2\sigma=1/2. Since ϕ|IJ=(ϕ|I)|J\phi_{|I\cap J}=(\phi_{|I})_{|J} and IJ[n]1/2j+1I\cap J\sim[n]_{1/2^{j+1}} when I[n]1/2I\sim[n]_{1/2}, J[n]1/2jJ\sim[n]_{1/2^{j}}, the general case easily follows from this special case by taking constant

    c(σ,d)=c(1/2,d)log(1/σ)c(\sigma,d)=c(1/2,d)^{\lceil\log(1/\sigma)\rceil}

    and using monotonicity under restrictions.

    Denote c(d):=c(1/2,d)c(d):=c(1/2,d) for ease of notation. The proof will proceed by induction on the degree dd, with the base case where d=0d=0 holding trivially with c(0)=1c(0)=1. For the inductive step, let

    c(d)=c(d1)220c(d)=\frac{c(d-1)^{2}}{20}

    and consider two different possibilities.

    First, suppose that for every bipartition [n]=AB[n]=A\uplus B we have that

    max{rk(ϕ|A),rk(ϕ|B)}2c(d)rk(ϕ).\max\big{\{}\operatorname{rk}(\phi_{|A}),\,\operatorname{rk}(\phi_{|B})\big{\}}\geq 2c(d)\operatorname{rk}(\phi).

    Using a similar coupling argument as was used in the proof of Lemma 2.5, we get that

    PrI[n]1/2[rk(ϕ|I)2c(d)rk(ϕ)]12,\displaystyle\mbox{\rm Pr}_{I\sim[n]_{1/2}}\big{[}\operatorname{rk}\big{(}\phi_{|I}\big{)}\geq 2c(d)\operatorname{rk}(\phi)\big{]}\geq\frac{1}{2},

    which implies

    𝔼I[n]1/2rk(ϕ|I)c(d)rk(ϕ){\mathbb{E}}_{I\sim[n]_{1/2}}\operatorname{rk}\big{(}\phi_{|I}\big{)}\geq c(d)\operatorname{rk}(\phi)

    as desired.

    The second possibility is that there exists a bipartition [n]=AB[n]=A\uplus B such that

    rk(ϕ|A)<2c(d)rk(ϕ)andrk(ϕ|B)<2c(d)rk(ϕ).\operatorname{rk}(\phi_{|A})<2c(d)\operatorname{rk}(\phi)\quad\text{and}\quad\operatorname{rk}(\phi_{|B})<2c(d)\operatorname{rk}(\phi). (4)

    Fix such a bipartition for the rest of the proof. For any given subsets IAI\subseteq A, JBJ\subseteq B, let ϕ[I,J]\phi[I,J] be the sum of the monomials in ϕ\phi involving at least one variable from each II and JJ. It is easy to see that

    ϕ[I,J]=ϕ|IJϕ|Iϕ|J+ϕ(0).\phi[I,J]=\phi_{|I\cup J}-\phi_{|I}-\phi_{|J}+\phi(0).

    Define the commutative rings RA:=R[xa:aA]R_{A}:=R[x_{a}:a\in A] and RB:=R[xb:bB]R_{B}:=R[x_{b}:b\in B], and note that ϕ[I,J]\phi[I,J] can be seen both as an element of (RA[xb:bB])k(R_{A}[x_{b}:b\in B])^{k} and as an element of (RB[xa:aA])k(R_{B}[x_{a}:a\in A])^{k}. Moreover, the degree of ϕ[I,J]\phi[I,J] is at most d1d-1 in each of these representations.

    From the identity ϕ=ϕ|A+ϕ|Bϕ(0)+ϕ[A,B]\phi=\phi_{|A}+\phi_{|B}-\phi(0)+\phi[A,B] and sub-additivity, it follows that

    rk(ϕ[A,B])rk(ϕ)rk(ϕ|A)rk(ϕ|B)rk(ϕ(0)).\operatorname{rk}(\phi[A,B])\geq\operatorname{rk}(\phi)-\operatorname{rk}(\phi_{|A})-\operatorname{rk}(\phi_{|B})-\operatorname{rk}(-\phi(0)).

    Since rk(ϕ(0))=rk(ϕ(0))=rk(ϕ|)\operatorname{rk}(-\phi(0))=\operatorname{rk}(\phi(0))=\operatorname{rk}(\phi_{|\emptyset}), we conclude from monotonicity under restrictions and our assumption (4) that

    rk(ϕ[A,B])>rk(ϕ)6c(d)rk(ϕ)>rk(ϕ)/2.\operatorname{rk}(\phi[A,B])>\operatorname{rk}(\phi)-6c(d)\operatorname{rk}(\phi)>\operatorname{rk}(\phi)/2. (5)

    In an analogous way, we obtain the inequalities

    rk(ϕ|IJ)\displaystyle\operatorname{rk}(\phi_{|I\cup J}) rk(ϕ[I,J])rk(ϕ|I)rk(ϕ|J)rk(ϕ(0))\displaystyle\geq\operatorname{rk}(\phi[I,J])-\operatorname{rk}(\phi_{|I})-\operatorname{rk}(\phi_{|J})-\operatorname{rk}(\phi(0)) (6)
    >rk(ϕ[I,J])6c(d)rk(ϕ)\displaystyle>\operatorname{rk}(\phi[I,J])-6c(d)\operatorname{rk}(\phi)

    valid for all IAI\subseteq A, JBJ\subseteq B.

    Applying the inductive hypothesis to ϕ[A,B](RB[xa:aA]d1)k\phi[A,B]\in(R_{B}[x_{a}:a\in A]_{\leq d-1})^{k} we obtain

    𝔼IA1/2rk(ϕ[I,B])=𝔼IA1/2rk(ϕ[A,B]|I)c(d1)rk(ϕ[A,B]).{\mathbb{E}}_{I\sim A_{1/2}}\operatorname{rk}(\phi[I,B])={\mathbb{E}}_{I\sim A_{1/2}}\operatorname{rk}(\phi[A,B]_{|I})\geq c(d-1)\operatorname{rk}(\phi[A,B]).

    Moreover, for any fixed IAI\subseteq A, the inductive hypothesis applied to the map ϕ[I,B](RA[xb:bB]d1)k\phi[I,B]\in(R_{A}[x_{b}:b\in B]_{\leq d-1})^{k} gives

    𝔼JB1/2rk(ϕ[I,J])=𝔼JB1/2rk(ϕ[I,B]|J)c(d1)rk(ϕ[I,B]).{\mathbb{E}}_{J\sim B_{1/2}}\operatorname{rk}(\phi[I,J])={\mathbb{E}}_{J\sim B_{1/2}}\operatorname{rk}(\phi[I,B]_{|J})\geq c(d-1)\operatorname{rk}(\phi[I,B]).

    Combining the two inequalities above we conclude that

    𝔼IA1/2,JB1/2rk(ϕ[I,J])c(d1)2rk(ϕ[A,B]),{\mathbb{E}}_{I\sim A_{1/2},\,J\sim B_{1/2}}\operatorname{rk}(\phi[I,J])\geq c(d-1)^{2}\operatorname{rk}(\phi[A,B]),

    and thus by (5) we have 𝔼IA1/2,JB1/2rk(ϕ[I,J])>c(d1)2rk(ϕ)/2{\mathbb{E}}_{I\sim A_{1/2},\,J\sim B_{1/2}}\operatorname{rk}(\phi[I,J])>c(d-1)^{2}\operatorname{rk}(\phi)/2. Together with equation (6)\eqref{eq:phi_IJ}, we conclude that

    𝔼IA1/2,JB1/2rk(ϕ|IJ)\displaystyle{\mathbb{E}}_{I\sim A_{1/2},\,J\sim B_{1/2}}\operatorname{rk}(\phi_{|I\cup J}) >𝔼IA1/2,JB1/2rk(ϕ[I,J])6c(d)rk(ϕ)\displaystyle>{\mathbb{E}}_{I\sim A_{1/2},\,J\sim B_{1/2}}\operatorname{rk}(\phi[I,J])-6c(d)\operatorname{rk}(\phi)
    >c(d1)2rk(ϕ)/26c(d)rk(ϕ)\displaystyle>c(d-1)^{2}\operatorname{rk}(\phi)/2-6c(d)\operatorname{rk}(\phi)
    >c(d)rk(ϕ),\displaystyle>c(d)\operatorname{rk}(\phi),

    which is precisely what we wanted to prove. \Box

3.2 Monotone functions on the hypercube and boosting

Our next result is a lemma which allows us to boost the probability of some events (such as having high rank under random restrictions) from ε\varepsilon to 1ε1-\varepsilon by paying a relatively small price.

It will again be convenient to take a more abstract approach and deal with Boolean functions rather than restrictions of polynomials. For a Boolean function g:{0,1}n{0,1}g:\{0,1\}^{n}\to\{0,1\}, the total influence of gg under distribution πσn\pi_{\sigma}^{n} is given by

𝐈(σ)(g)=i=1n𝔼xπσn|g(x)g(xi)|,\mathbf{I}^{(\sigma)}(g)=\sum_{i=1}^{n}{\mathbb{E}}_{x\sim\pi_{\sigma}^{n}}\big{|}g(x)-g(x^{i})\big{|},

where xix^{i} differs from xx only in the iith coordinate. Denote by μσ[g]:=𝔼xπσn[g(x)]\mu_{\sigma}[g]:={\mathbb{E}}_{x\sim\pi_{\sigma}^{n}}\big{[}g(x)\big{]} its expectation.

Lemma 3.4 (Boosting lemma).

For every σ>0\sigma>0 there is a constant Cσ>0C_{\sigma}>0 such that the following holds. Let nn\in\mathbb{N}, f:{0,1}n+f:\{0,1\}^{n}\to\mathbb{R}_{+} be a monotone 11-Lipschitz function and ε(0,1/2]\varepsilon\in(0,1/2]. If r1r\geq 1 satisfies

Prxπσn[f(x)r]>ε,\mbox{\rm Pr}_{x\sim\pi_{\sigma}^{n}}\big{[}f(x)\geq r\big{]}>\varepsilon,

then Prxπ1.01σn[f(x)rCσ1/ε2]>1ε\mbox{\rm Pr}_{x\sim\pi_{1.01\sigma}^{n}}\big{[}f(x)\geq r-C_{\sigma}^{1/\varepsilon^{2}}\big{]}>1-\varepsilon.

  • Proof:

    Consider the Boolean function g(x):=1[f(x)r]g(x):=1\big{[}f(x)\geq r\big{]}. Note that there exists q[σ, 1.01σ]q\in[\sigma,\,1.01\sigma] such that 𝐈(q)(g)100/σ\mathbf{I}^{(q)}(g)\leq 100/\sigma, as otherwise by the Margulis-Russo formula we would have

    μ1.01σ[g]=μσ[g]+σ1.01σdμq[g]dq𝑑qσ1.01σ𝐈(q)(g)𝑑q>1.\mu_{1.01\sigma}[g]=\mu_{\sigma}[g]+\int_{\sigma}^{1.01\sigma}\frac{d\mu_{q}[g]}{dq}dq\geq\int_{\sigma}^{1.01\sigma}\mathbf{I}^{(q)}(g)dq>1.

    Let γ(0,ε/2)\gamma\in(0,\varepsilon/2) be a constant to be chosen later. By Friedgut’s Junta Theorem [13], there is a C(q)100/γσC(q)^{100/\gamma\sigma}-junta h:{0,1}n{0,1}h:\{0,1\}^{n}\to\{0,1\} such that

    Prxπqn[g(x)h(x)]<γ.\mbox{\rm Pr}_{x\sim\pi_{q}^{n}}\big{[}g(x)\neq h(x)\big{]}<\gamma.

    Let J[n]J\subseteq[n], |J|C(q)100/γσ|J|\leq C(q)^{100/\gamma\sigma}, be the set of variables on which hh depends. Then

    μq[h]μq[g]γμσ[g]γ>ε/2,\displaystyle\mu_{q}[h]\geq\mu_{q}[g]-\gamma\geq\mu_{\sigma}[g]-\gamma>\varepsilon/2,

    which implies PrzπqJ[h(z)=1]>ε/2\mbox{\rm Pr}_{z\sim\pi_{q}^{J}}\big{[}h(z)=1\big{]}>\varepsilon/2. Since

    𝔼zπqJPryπqJc[g(y,z)h(z)]<γ,{\mathbb{E}}_{z\sim\pi_{q}^{J}}\mbox{\rm Pr}_{y\sim\pi_{q}^{J^{c}}}\big{[}g(y,z)\neq h(z)\big{]}<\gamma,

    it follows that

    PrzπqJ[PryπqJc[g(y,z)h(z)]ε]<γ/ε.\mbox{\rm Pr}_{z\sim\pi_{q}^{J}}\big{[}\mbox{\rm Pr}_{y\sim\pi_{q}^{J^{c}}}\big{[}g(y,z)\neq h(z)\big{]}\geq\varepsilon\big{]}<\gamma/\varepsilon.

    Taking γ=ε2/2\gamma=\varepsilon^{2}/2, we conclude there exists z{0,1}Jz\in\{0,1\}^{J} such that h(z)=1h(z)=1 and

    PryπqJc[g(y,z)=0]=PryπqJc[g(y,z)h(z)]<ε.\mbox{\rm Pr}_{y\sim\pi_{q}^{J^{c}}}\big{[}g(y,z)=0\big{]}=\mbox{\rm Pr}_{y\sim\pi_{q}^{J^{c}}}\big{[}g(y,z)\neq h(z)\big{]}<\varepsilon.

    Since ff is 11-Lipschitz by assumption, we have

    f(y,z)rf(y,z)r|J|z{0,1}J,f(y,z)\geq r\implies f(y,z^{\prime})\geq r-|J|\quad\forall z^{\prime}\in\{0,1\}^{J},

    and thus by monotonicity

    Prxπ1.01σn[f(x)r|J|]\displaystyle\mbox{\rm Pr}_{x\sim\pi_{1.01\sigma}^{n}}\big{[}f(x)\geq r-|J|\big{]} Prxπqn[f(x)r|J|]\displaystyle\geq\mbox{\rm Pr}_{x\sim\pi_{q}^{n}}\big{[}f(x)\geq r-|J|\big{]}
    =𝔼zπqJPryπqJc[f(y,z)r|J|]\displaystyle={\mathbb{E}}_{z^{\prime}\sim\pi_{q}^{J}}\mbox{\rm Pr}_{y\sim\pi_{q}^{J^{c}}}\big{[}f(y,z^{\prime})\geq r-|J|\big{]}
    maxz{0,1}JPryπqJc[f(y,z)r]\displaystyle\geq\max_{z\in\{0,1\}^{J}}\mbox{\rm Pr}_{y\sim\pi_{q}^{J^{c}}}\big{[}f(y,z)\geq r\big{]}
    >1ε.\displaystyle>1-\varepsilon.

    This is precisely what we wanted to prove, with constant

    Cσ=supq[σ, 1.01σ]C(q)200/σ.C_{\sigma}=\sup_{q\in[\sigma,\,1.01\sigma]}C(q)^{200/\sigma}.

    Note that the above supremum is finite: in [29, Section 10.3] it is shown that we can take

    C(q)=max{qc,(1q)c}C(q)=\max\big{\{}q^{-c},\,(1-q)^{-c}\big{\}}

    for some absolute constant c>0c>0. \Box

3.3 The Random Restriction Theorem

Our main result, Theorem 1.8, follows easily from the lemmas given above. Recall that we wish to show that

PrJ[n]σ[rk(ϕ|J)κ(d,σ)rk(ϕ)]1ε\mbox{\rm Pr}_{J\sim[n]_{\sigma}}\big{[}\operatorname{rk}(\phi_{|J})\geq\kappa(d,\sigma)\cdot\operatorname{rk}(\phi)\big{]}\geq 1-\varepsilon

whenever rk(ϕ)R(d,σ,ε)\operatorname{rk}(\phi)\geq R(d,\sigma,\varepsilon), for some well-chosen constants R(d,σ,ε)R(d,\sigma,\varepsilon) and κ(d,σ)>0\kappa(d,\sigma)>0.

  • Proof of Theorem 1.8:

    Applying Lemma 3.3 with σ\sigma substituted by 0.9σ0.9\sigma, we obtain

    𝔼J[n]0.9σrk(ϕ|J)c(0.9σ,d)rk(ϕ).{\mathbb{E}}_{J\sim[n]_{0.9\sigma}}\operatorname{rk}(\phi_{|J})\geq c(0.9\sigma,d)\operatorname{rk}(\phi).

    Since rk(ϕ|J)rk(ϕ)\operatorname{rk}(\phi_{|J})\leq\operatorname{rk}(\phi) for all subsets J[n]J\subseteq[n], we conclude that

    PrJ[n]0.9σ[rk(ϕ|J)c(0.9σ,d)2rk(ϕ)]c(0.9σ,d)2.\mbox{\rm Pr}_{J\sim[n]_{0.9\sigma}}\bigg{[}\operatorname{rk}(\phi_{|J})\geq\frac{c(0.9\sigma,d)}{2}\operatorname{rk}(\phi)\bigg{]}\geq\frac{c(0.9\sigma,d)}{2}.

    By possibly decreasing ε\varepsilon a little, we may assume that ε<c(0.9σ,d)/2\varepsilon<c(0.9\sigma,d)/2. Denote

    κ(d,σ)=c(0.9σ,d)4andR(d,σ,ε)=4C0.9σ1/ε2c(0.9σ,d),\kappa(d,\sigma)=\frac{c(0.9\sigma,d)}{4}\quad\text{and}\quad R(d,\sigma,\varepsilon)=\frac{4C_{0.9\sigma}^{1/\varepsilon^{2}}}{c(0.9\sigma,d)},

    where C0.9σ>0C_{0.9\sigma}>0 is the constant guaranteed by the boosting lemma, Lemma 3.4. Applying the boosting lemma to the function Jrk(ϕ|J)J\mapsto\operatorname{rk}(\phi_{|J}) with r=c(0.9σ,d)rk(ϕ)/2r=c(0.9\sigma,d)\operatorname{rk}(\phi)/2, we get

    PrJ[n]0.9σ[rk(ϕ|J)r]\displaystyle\mbox{\rm Pr}_{J\sim[n]_{0.9\sigma}}\big{[}\operatorname{rk}(\phi_{|J})\geq r\big{]} c(0.9σ,d)/2>ε\displaystyle\geq c(0.9\sigma,d)/2>\varepsilon
    PrJ[n]σ[rk(ϕ|J)rC0.9σ1/ε2]>1ε.\displaystyle\implies\mbox{\rm Pr}_{J\sim[n]_{\sigma}}\big{[}\operatorname{rk}(\phi_{|J})\geq r-C_{0.9\sigma}^{1/\varepsilon^{2}}\big{]}>1-\varepsilon.

    Since c(0.9σ,d)rk(ϕ)/2C0.9σ1/ε2κ(σ,d)rk(ϕ)c(0.9\sigma,d)\operatorname{rk}(\phi)/2-C_{0.9\sigma}^{1/\varepsilon^{2}}\geq\kappa(\sigma,d)\operatorname{rk}(\phi) whenever rk(ϕ)R(d,σ,ε)\operatorname{rk}(\phi)\geq R(d,\sigma,\varepsilon), the result follows. \Box

4 Discussion and open problems

The motivation for this work stems from a problem from theoretical computer science which concerns decoding corrupted error-correcting codes (ECCs) with NC[]0{}^{0}[\oplus] circuits [5]. An ECC is a map E:𝔽2k𝔽2nE:\mathbb{F}_{2}^{k}\to\mathbb{F}_{2}^{n} with the property that the points in its image (codewords) are well-separated in Hamming distance, enabling retrieval of encoded messages provided codewords are not too badly corrupted during transmission or storage. A standard noise model for corruption is the symmetric channel: for a parameter σ[0,1]\sigma\in[0,1] and given an element y𝔽2ny\in\mathbb{F}_{2}^{n}, this channel samples a set I[n]σI\sim[n]_{\sigma} and, for each iIi\in I, replaces the coordinate yiy_{i} with a uniformly distributed element over 𝔽2\mathbb{F}_{2}. The problem is to determine whether NC[]0{}^{0}[\oplus] circuits are capable of correctly decoding corrupted codewords with good probability over this noise distribution.

It turns out that the mappings such circuits can effect are precisely constant-degree polynomial maps ϕ:𝔽2n𝔽2k\phi:\mathbb{F}_{2}^{n}\to\mathbb{F}_{2}^{k}. One of the main results in [5] shows that the fraction of messages such maps can correctly decode with non-negligible probability (over the symmetric channel noise model) tends to zero as the size kk of the messages grows. The proof of this result uses a structure-versus-randomness strategy to analyze the decoding capability of ϕ\phi based on the value arankd(ϕ)\operatorname{arank}_{d}(\phi) for d=deg(ϕ)d=\deg(\phi) (the proof uses no assumptions on the specific ECC). The key to analyzing the case when this rank is high – the pseudorandom setting – is to understand how this rank behaves under random restrictions originating from the symmetric channel noise; this is what motivated Theorem 1.8. While in this application the degree of ϕ\phi may exceed the field size, a similar result but with quantitatively stronger bounds can be obtained in the high-characteristic setting, where char(𝔽)>deg(ϕ)\operatorname{char}(\mathbb{F})>\deg(\phi), by using Theorem 1.4.

As already remarked in the Introduction, our proof of Theorem 1.4 for higher-order tensors proceeds quite differently from the matrix case (Proposition 1.1). The reason for this is that an analogous proof would require the existence of a high-rank sub-tensor; to explain this more precisely, we introduce the following definition.

Definition 4.1 (Core property).

Let A,B:++A,B:\mathbb{R}_{+}\to\mathbb{R}_{+} be unbounded increasing functions, and let rk:(𝔽)d+\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R}_{+}. We say that rk\operatorname{rk} satisfies the (A,B)(A,B)-core property if, for every dd-tensor T(𝔽)dT\in(\mathbb{F}^{\infty})^{\otimes d} of high enough rank rk(T)\operatorname{rk}(T), there exist sets J1,,JdJ_{1},\dots,J_{d}\subset\mathbb{N} of size at most A(rk(T))A(\operatorname{rk}(T)) such that rk(T|J[d])B(rk(T))\operatorname{rk}(T_{|J_{[d]}})\geq B(\operatorname{rk}(T)). We say that rk\operatorname{rk} satisfies the linear core property if it satisfies the (A,B)(A,B)-core property for linear functions A(r)=LrA(r)=Lr and B(r)=βrB(r)=\beta r, with L,β>0L,\beta>0.

Draisma [10] proved that slice rank has a core-like property, up to local linear transformation and provided 𝔽\mathbb{F} is infinite. In particular his result shows that, for every d,rd,r\in\mathbb{N} and every dd-tensor Ti=1d𝔽XiT\in\bigotimes_{i=1}^{d}\mathbb{F}^{X_{i}} of slice rank at least rr, there are linear maps φi:𝔽Xi𝔽n\varphi_{i}:\mathbb{F}^{X_{i}}\to\mathbb{F}^{n} such that the tensor (φ1φd)T(\varphi_{1}\otimes\cdots\otimes\varphi_{d})T also has slice rank at least rr, where n=n(d,r)n=n(d,r) is a constant depending only on dd and rr.

More in line with Definition 4.1, Karam [20] recently proved that several rank functions for tensors defined in terms of decompositions, including the slice rank, partition rank and tensor rank, satisfy the (A,B)(A,B)-core property for some functions AA and BB. For partition rank over finite fields, for instance, he obtains explicit functions A(r)=exp(Od,𝔽(r))A(r)=\exp(O_{d,\mathbb{F}}(r)) and B(r)=Ωd(r/(logr)d)B(r)=\Omega_{d}(r/(\log r)^{d}); for the slice rank of 3-tensors he shows that one can take A(r)=O(r)A(r)=O(r) and B(r)=Ω(r1/3)B(r)=\Omega(r^{1/3}); and for tensor rank, he shows the “perfect” linear core property A(r)=B(r)=rA(r)=B(r)=r. Karam conjectures, moreover, that all these rank functions in fact satisfy the linear core property [20, Conjecture 13.1]. In this case, a similar argument to the one we used for matrices in the introduction allows us to easily deduce a Random Restriction Theorem:

Theorem 4.2.

Suppose rk:(𝔽)d+\operatorname{rk}:(\mathbb{F}^{\infty})^{\otimes d}\to\mathbb{R}_{+} satisfies the linear core property, monotonicity under restrictions and the restriction Lipschitz property. Then for every σ(0,1]\sigma\in(0,1] there exist constants C,κ>0C,\kappa>0 such that, for every dd-tensor Ti=1d𝔽[ni]T\in\bigotimes_{i=1}^{d}\mathbb{F}^{[n_{i}]}, we have

PrI1[n1]σ,,Id[nd]σ[rk(T|I[d])κrk(T)]1Ceκrk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{\sigma},\dots,I_{d}\sim[n_{d}]_{\sigma}}\big{[}\operatorname{rk}\big{(}T_{|I_{[d]}}\big{)}\geq\kappa\cdot\operatorname{rk}(T)\big{]}\geq 1-Ce^{-\kappa\operatorname{rk}(T)}.
  • Proof:

    Let L,β>0L,\beta>0 be the constants in the linear core property of rk\operatorname{rk}, and denote λ=β/(3dL)\lambda=\beta/(3dL). For a given dd-tensor Ti=1d𝔽[ni]T\in\bigotimes_{i=1}^{d}\mathbb{F}^{[n_{i}]}, fix sets J1[n1],,Jd[nd]J_{1}\subseteq[n_{1}],\dots,J_{d}\subseteq[n_{d}] of size Lrk(T)L\operatorname{rk}(T) such that rk(T|J[d])βrk(T)\operatorname{rk}(T_{|J_{[d]}})\geq\beta\operatorname{rk}(T).

    Let I1(J1)1λ,,Id(Jd)1λI_{1}\sim(J_{1})_{1-\lambda},\dots,I_{d}\sim(J_{d})_{1-\lambda} be random sets and consider the random event

    E={|Ii|(12λ)|Ji| for all 1id}.E=\big{\{}|I_{i}|\geq(1-2\lambda)|J_{i}|\,\text{ for all }1\leq i\leq d\big{\}}.

    Whenever EE holds, by the restriction Lipschitz property we have

    rk(T|I[d])\displaystyle\operatorname{rk}(T_{|I_{[d]}}) rk(T|J[d])i=1d|JiIi|\displaystyle\geq\operatorname{rk}(T_{|J_{[d]}})-\sum_{i=1}^{d}|J_{i}\setminus I_{i}|
    βrk(T)d2λLrk(T)\displaystyle\geq\beta\operatorname{rk}(T)-d\cdot 2\lambda L\operatorname{rk}(T)
    =βrk(T)/3.\displaystyle=\beta\operatorname{rk}(T)/3.

    By the Chernoff bound and union bound, the probability of EE is at least 1de(β/12d)rk(T)1-d\,e^{-(\beta/12d)\operatorname{rk}(T)}; it then follows from monotonicity that

    PrI1[n1]1λ,,Id[nd]1λ[rk(T|I[d])β3rk(T)]1deβ12drk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{1-\lambda},\dots,I_{d}\sim[n_{d}]_{1-\lambda}}\bigg{[}\operatorname{rk}(T_{|I_{[d]}})\geq\frac{\beta}{3}\operatorname{rk}(T)\bigg{]}\geq 1-d\,e^{-\frac{\beta}{12d}\operatorname{rk}(T)}.

    Now we apply the same argument to the (random) tensor T~=T|I[d]\tilde{T}=T_{|I_{[d]}}, and union-bound with the event {rk(T|I[d])βrk(T)/3}\big{\{}\operatorname{rk}(T_{|I_{[d]}})\geq\beta\operatorname{rk}(T)/3\big{\}}. Since (Ii)1λ(I_{i})_{1-\lambda} is distributed as [ni](1λ)2[n_{i}]_{(1-\lambda)^{2}} when Ii[ni]1λI_{i}\sim[n_{i}]_{1-\lambda}, we conclude that

    PrI1[n1](1λ)2,,Id[nd](1λ)2[rk(T|I[d])(β3)2rk(T)]12deβ12dβ3rk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{(1-\lambda)^{2}},\dots,I_{d}\sim[n_{d}]_{(1-\lambda)^{2}}}\bigg{[}\operatorname{rk}(T_{|I_{[d]}})\geq\Big{(}\frac{\beta}{3}\Big{)}^{2}\operatorname{rk}(T)\bigg{]}\geq 1-2d\,e^{-\frac{\beta}{12d}\frac{\beta}{3}\operatorname{rk}(T)}.

    In general, applying this same argument recursively tt times in total, we get

    PrI1[n1](1λ)t,,Id[nd](1λ)t[rk(T|I[d])(β3)trk(T)]1tdeβ12d(β3)t1rk(T).\mbox{\rm Pr}_{I_{1}\sim[n_{1}]_{(1-\lambda)^{t}},\dots,I_{d}\sim[n_{d}]_{(1-\lambda)^{t}}}\Big{[}\operatorname{rk}(T_{|I_{[d]}})\geq\Big{(}\frac{\beta}{3}\Big{)}^{t}\operatorname{rk}(T)\Big{]}\geq 1-td\,e^{-\frac{\beta}{12d}(\frac{\beta}{3})^{t-1}\operatorname{rk}(T)}.

    The theorem now follows by taking tt to be the smallest integer for which (1λ)tσ(1-\lambda)^{t}\leq\sigma, and using monotonicity under restrictions. \Box

We quickly remark on another interesting connection between our results and recent work on high-rank maps by Gowers and Karam [14]. These authors studied equidistribution properties of polynomials and multilinear forms on 𝔽pn\mathbb{F}_{p}^{n} when the variables are restricted to subsets of their domain; this setting is quite similar to what motivated our studies, as explained in the beginning of this section. A crucial step in their arguments was a result (Proposition 3.5 in [14]) showing that the values taken by multilinear forms of high partition rank must be close to uniformly distributed under a wide range of non-uniform input distributions. Under the well-known conjecture (within additive combinatorics) that the partition rank and analytic rank of tensors are equivalent up to a multiplicative constant, their result would straightforwardly imply the conclusion of our Random Restriction Theorem (Theorem 1.4) when restricted to either the partition rank or the analytic rank of tensors, although making use of very different arguments.


Analogous to Definition 4.1, it also makes sense to define a core property for polynomial maps (which include single polynomials by setting k=1k=1).

Definition 4.3.

Let A,B:++A,B:\mathbb{R}_{+}\to\mathbb{R}_{+} be unbounded increasing functions, and let rk:Pold(𝔽n;𝔽k)+\operatorname{rk}:\operatorname{Pol}_{\leq d}(\mathbb{F}^{n};\mathbb{F}^{k})\to\mathbb{R}_{+}. We say that rk\operatorname{rk} satisfies the (A,B)(A,B)-core property if, for every polynomial map ϕPold(𝔽n;𝔽k)\phi\in\operatorname{Pol}_{\leq d}(\mathbb{F}^{n};\mathbb{F}^{k}) of high enough rank rk(ϕ)\operatorname{rk}(\phi), there exists a set I[n]I\subseteq[n] of size at most A(rk(ϕ))A(\operatorname{rk}(\phi)) such that rk(ϕ|I)B(rk(ϕ))\operatorname{rk}(\phi_{|I})\geq B(\operatorname{rk}(\phi)). We say that rk\operatorname{rk} satisfies the linear core property if it satisfies the (A,B)(A,B)-core property for linear functions A(r)=LrA(r)=Lr and B(r)=βrB(r)=\beta r, with L,β>0L,\beta>0.

A property comparable to a core property for polynomial maps was proved by Kazhdan and Ziegler [22]. For simplicity, we state it here only for polynomials. They showed that a polynomial P𝔽[x1,,xn]P\in\mathbb{F}[x_{1},\dots,x_{n}] with high Schmidt rank and deg(P)<char(𝔽)\deg(P)<\operatorname{char}(\mathbb{F}) is universal in the following sense: If prank(P)r\operatorname{prank}(P)\geq r, then for any Q𝔽[x1,,xm]Q\in\mathbb{F}[x_{1},\dots,x_{m}] of degree deg(P)\deg(P) there is an affine map ϕ:𝔽m𝔽n\phi:\mathbb{F}^{m}\to\mathbb{F}^{n} such that Q=PϕQ=P\circ\phi, provided rr is large enough in terms of mm. Taking QQ to be a polynomial of maximal Schmidt rank mm shows that, up to an affine transformation, PP restricts to an mm-variate polynomial of maximal Schmidt rank.

It would be of interest to know which notions of rank for polynomial maps have a core property, and especially if any of the rank functions discussed here have the linear core property (see [3] for recent progress on this). The same proof as given for Theorem 4.2 shows that a linear core property implies a Random Restriction Theorem for polynomial maps that is quantitatively stronger than Theorem 1.8.

Acknowledgements

The authors would like to thank Victor Souza for bringing Karam’s paper [20] to their attention, and Jan Draisma and Tamar Ziegler for a number of helpful pointers to the literature. We also thank Thomas Karam for helpful comments on a preliminary version of this manuscript. We are indebted to the anonymous reviewer for simplifying our original proof of Lemma 3.3, and for suggestions which improved the presentation of the paper.

References

  • [1] Tigran Ananyan and Melvin Hochster. Small subalgebras of polynomial rings and Stillman’s conjecture. J. Amer. Math. Soc., 33:291–309, 2020. \hrefhttps://doi.org/10.1090/jams/932 \pathdoi:10.1090/jams/932.
  • [2] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A. Grochow, Eric Naslund, William F. Sawin, and Chris Umans. On cap sets and the group-theoretic approach to matrix multiplication. Discrete Anal., pages Paper No. 3, 27, 2017. \hrefhttps://doi.org/10.19086/da.1245 \pathdoi:10.19086/da.1245.
  • [3] Andreas Blatter, Jan Draisma, and Filip Rupniewski. A tensor restriction theorem over finite fields, 2022. arXiv:2211.12319. \hrefhttps://doi.org/10.48550/ARXIV.2211.12319 \pathdoi:10.48550/ARXIV.2211.12319.
  • [4] Jop Briët. Subspaces of tensors with high analytic rank. Online J. Anal. Comb., (16):Paper No. 6, 9, 2021.
  • [5] Jop Briët, Harry Buhrman, Davi Castro-Silva, and Niels M. P. Neumann. Noisy decoding by shallow circuits with parities: classical and quantum, 2023. \hrefhttps://doi.org/10.48550/ARXIV.2302.02870 \pathdoi:10.48550/ARXIV.2302.02870.
  • [6] Alex Cohen and Guy Moshkovitz. Structure vs. randomness for bilinear maps. In STOC ’21—Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 800–808. ACM, New York, 2021. \hrefhttps://doi.org/10.1145/3406325.3451007 \pathdoi:10.1145/3406325.3451007.
  • [7] Alex Cohen and Guy Moshkovitz. Partition and analytic rank are equivalent over large fields. Duke Math. J., 172(12):2433–2470, 2023. \hrefhttps://doi.org/10.1215/00127094-2022-0086 \pathdoi:10.1215/00127094-2022-0086.
  • [8] Ernie Croot, Vsevolod F. Lev, and Péter Pál Pach. Progression-free sets in 4n\mathbb{Z}^{n}_{4} are exponentially small. Ann. of Math. (2), 185(1):331–337, 2017. \hrefhttps://doi.org/10.4007/annals.2017.185.1.7 \pathdoi:10.4007/annals.2017.185.1.7.
  • [9] Harm Derksen, Rob H. Eggermont, and Andrew Snowden. Topological noetherianity for cubic polynomials. Algebra Number Theory, 11(9):2197–2212, 2017. \hrefhttps://doi.org/10.2140/ant.2017.11.2197 \pathdoi:10.2140/ant.2017.11.2197.
  • [10] Jan Draisma. Topological Noetherianity of polynomial functors. J. Amer. Math. Soc., 32(3):691–707, 2019. \hrefhttps://doi.org/10.1090/jams/923 \pathdoi:10.1090/jams/923.
  • [11] Jordan S. Ellenberg and Dion Gijswijt. On large subsets of 𝔽qn\mathbb{F}^{n}_{q} with no three-term arithmetic progression. Ann. of Math. (2), 185(1):339–343, 2017. \hrefhttps://doi.org/10.4007/annals.2017.185.1.8 \pathdoi:10.4007/annals.2017.185.1.8.
  • [12] Jacob Fox and László Miklós Lovász. A tight bound for Green’s arithmetic triangle removal lemma in vector spaces. Adv. Math., 321:287–297, 2017. \hrefhttps://doi.org/10.1016/j.aim.2017.09.037 \pathdoi:10.1016/j.aim.2017.09.037.
  • [13] Ehud Friedgut. Boolean functions with low average sensitivity depend on few coordinates. Combinatorica, 18(1):27–35, 1998. \hrefhttps://doi.org/10.1007/PL00009809 \pathdoi:10.1007/PL00009809.
  • [14] W. T. Gowers and Thomas Karam. Equidistribution of high-rank polynomials with variables restricted to subsets of 𝔽p\mathbb{F}_{p}, 2022. arXiv:2209.04932. \hrefhttps://doi.org/10.48550/ARXIV.2209.04932 \pathdoi:10.48550/ARXIV.2209.04932.
  • [15] W. T. Gowers and J. Wolf. Linear forms and higher-degree uniformity for functions on 𝔽pn\mathbb{F}^{n}_{p}. Geom. Funct. Anal., 21(1):36–69, 2011. \hrefhttps://doi.org/10.1007/s00039-010-0106-3 \pathdoi:10.1007/s00039-010-0106-3.
  • [16] Ben Green and Terence Tao. The distribution of polynomials over finite fields, with applications to the Gowers norms. Contrib. Discrete Math., 4(2):1–36, 2009.
  • [17] Torben Hagerup and Christine Rüb. A guided tour of Chernoff bounds. Information Processing Letters, 33(6):305–308, 1990. \hrefhttps://doi.org/10.1016/0020-0190(90)90214-I \pathdoi:10.1016/0020-0190(90)90214-I.
  • [18] Elad Haramaty and Amir Shpilka. On the structure of cubic and quartic polynomials. In STOC’10—Proceedings of the 2010 ACM International Symposium on Theory of Computing, pages 331–340. ACM, New York, 2010. \hrefhttps://doi.org/10.1145/1806689.1806736 \pathdoi:10.1145/1806689.1806736.
  • [19] Pooya Hatami. On the structure of quintic polynomials. In Approximation, randomization, and combinatorial optimization. Algorithms and techniques, volume 60 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 33, 18. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2016. \hrefhttps://doi.org/10.4230/LIPIcs.APPROX-RANDOM.2016.33 \pathdoi:10.4230/LIPIcs.APPROX-RANDOM.2016.33.
  • [20] Thomas Karam. High-rank subtensors of high-rank tensors, 2022. arXiv:2207.08030. \hrefhttps://doi.org/10.48550/ARXIV.2207.08030 \pathdoi:10.48550/ARXIV.2207.08030.
  • [21] David Kazhdan and Tamar Ziegler. Approximate cohomology. Selecta Math. (N.S.), 24(1):499–509, 2018. \hrefhttps://doi.org/10.1007/s00029-017-0335-5 \pathdoi:10.1007/s00029-017-0335-5.
  • [22] David Kazhdan and Tamar Ziegler. Properties of high rank subvarieties of affine spaces. Geom. Funct. Anal., 30(4):1063–1096, 2020. \hrefhttps://doi.org/10.1007/s00039-020-00542-4 \pathdoi:10.1007/s00039-020-00542-4.
  • [23] Swastik Kopparty, Guy Moshkovitz, and Jeroen Zuiddam. Geometric rank of tensors and subrank of matrix multiplication. In 35th Computational Complexity Conference, volume 169 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 35, 21. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2020. \hrefhttps://doi.org/10.4230/LIPIcs.CCC.2020.35 \pathdoi:10.4230/LIPIcs.CCC.2020.35.
  • [24] J. M. Landsberg. Geometry and the complexity of matrix multiplication. Bull. Amer. Math. Soc. (N.S.), 45(2):247–284, 2008. \hrefhttps://doi.org/10.1090/S0273-0979-08-01176-2 \pathdoi:10.1090/S0273-0979-08-01176-2.
  • [25] Shachar Lovett. The analytic rank of tensors and its applications. Discrete Anal., pages Paper No. 7, 10, 2019. \hrefhttps://doi.org/10.19086/da \pathdoi:10.19086/da.
  • [26] Eric Naslund. Exponential bounds for the Erdős-Ginzburg-Ziv constant. J. Combin. Theory Ser. A, 174:105185, 19, 2020. \hrefhttps://doi.org/10.1016/j.jcta.2019.105185 \pathdoi:10.1016/j.jcta.2019.105185.
  • [27] Eric Naslund. The partition rank of a tensor and kk-right corners in 𝔽qn\mathbb{F}_{q}^{n}. J. Combin. Theory Ser. A, 174:105190, 25, 2020. \hrefhttps://doi.org/10.1016/j.jcta.2019.105190 \pathdoi:10.1016/j.jcta.2019.105190.
  • [28] Eric Naslund and Will Sawin. Upper bounds for sunflower-free sets. Forum Math. Sigma, 5:Paper No. e15, 10, 2017. \hrefhttps://doi.org/10.1017/fms.2017.12 \pathdoi:10.1017/fms.2017.12.
  • [29] Ryan O’Donnell. Analysis of Boolean functions. Cambridge University Press, New York, 2014. \hrefhttps://doi.org/10.1017/CBO9781139814782 \pathdoi:10.1017/CBO9781139814782.
  • [30] Ran Raz. Tensor-rank and lower bounds for arithmetic formulas. J. ACM, 60(6):Art. 40, 15, 2013. \hrefhttps://doi.org/10.1145/2535928 \pathdoi:10.1145/2535928.
  • [31] Gideon Schechtman. Concentration results and applications. In Handbook of the geometry of Banach spaces, Vol. 2, pages 1603–1634. North-Holland, Amsterdam, 2003. \hrefhttps://doi.org/10.1016/S1874-5849(03)80044-X \pathdoi:10.1016/S1874-5849(03)80044-X.
  • [32] Wolfgang M. Schmidt. The density of integer points on homogeneous varieties. Acta Math., 154(3-4):243–296, 1985. \hrefhttps://doi.org/10.1007/BF02392473 \pathdoi:10.1007/BF02392473.
  • [33] Michel Talagrand. Concentration of measure and isoperimetric inequalities in product spaces. Inst. Hautes Études Sci. Publ. Math., (81):73–205, 1995.
  • [34] Terence Tao. \urlhttps://terrytao.wordpress.com/2016/05/18/a-symmetric-formulation-of-the-croot-lev-pach-ellenberg-gijswijt-capset-bound/, 2016.
  • [35] Terence Tao and Will Sawin. \urlhttps://terrytao.wordpress.com/2016/08/24/notes-on-the-slice-rank-of-tensors/, 2016.
  • [36] Terence Tao and Tamar Ziegler. The inverse conjecture for the Gowers norm over finite fields in low characteristic. Ann. Comb., 16(1):121–188, 2012. \hrefhttps://doi.org/10.1007/s00026-011-0124-3 \pathdoi:10.1007/s00026-011-0124-3.
{dajauthors}{authorinfo}

[jb] Jop Briët
CWI & QuSoft
Amsterdam, The Netherlands
j.briet\imageatcwi\imagedotnl
\urlhttps://www.cwi.nl/ jop {authorinfo}[dcs] Davi Castro-Silva
CWI & QuSoft
Amsterdam, The Netherlands
davisilva15\imageatgmail\imagedotcom
\urlhttps://sites.google.com/view/davicastrosilva