This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Eleanor Anthony 22institutetext: Department of Mathematics, University of Mississippi, Hume Hall 305, P. O. Box 1848, University, Mississippi  38677-1848, 22email: ecanthon@go.olemiss.edu . Partially supported by NSF REU grant DMS-1156589. 33institutetext: Sheridan Grant 44institutetext: Department of Mathematics, 640 North College Avenue, Claremont, Calif. 91711, 44email: sheridan.grant@pomona.edu . Partially supported by NSF REU grant DMS-1156589. 55institutetext: Peter Gritzmann 66institutetext: Fakultät für Mathematik, Technische Universität München, D-80290 München, Germany, 66email: gritzmann@tum.de . Work supported in part by the German Research Foundation (DFG). 77institutetext: J. Maurice Rojas 88institutetext: Department of Mathematics, Texas A&M University TAMU 3368, College Station, Texas  77843-3368, USA, 88email: rojas@math.tamu.edu . Partially supported by NSF MCS grant DMS-0915245 and Sandia National Laboratories.

Polynomial-Time Amoeba Neighborhood Membership and Faster Localized Solving

Eleanor Anthony    Sheridan Grant    Peter Gritzmann    J. Maurice Rojas
Abstract

We derive efficient algorithms for coarse approximation of algebraic hypersurfaces, useful for estimating the distance between an input polynomial zero set and a given query point. Our methods work best on sparse polynomials of high degree (in any number of variables) but are nevertheless completely general. The underlying ideas, which we take the time to describe in an elementary way, come from tropical geometry. We thus reduce a hard algebraic problem to high-precision linear optimization, proving new upper and lower complexity estimates along the way.

July 29, 2025


Dedicated to Tien-Yien Li, in honor of his birthday.

1 Introduction

As students, we are often asked to draw, hopefully without a calculator, real zero sets of low degree polynomials in few variables. As scientists and engineers, we are often asked to count or approximate, hopefully with some computational assistance, real and complex solutions of arbitrary systems of polynomial equations in many variables. If one allows sufficiently coarse approximations, then the latter problem is as easy as the former. Our main results clarify this transition from hardness to easiness. In particular, we significantly speed up certain queries involving distances between points and algebraic hypersurfaces (see Theorems 1.41.6 and Remark 1.9 below).

Polynomial equations are ubiquitous in numerous applications, such as algebraic statistics HRS (13), chemical reaction kinetics MFRCSD (13), discretization of partial differential equations HHHLSZ (13), satellite orbit design NAM (11), circuit complexity KPR (13), and cryptography BFP (13). The need to solve larger and larger equations, in applications as well as for theoretical purposes, has helped shape algebraic geometry and numerical analysis for centuries. More recent work in algebraic complexity tells us that many basic questions involving polynomial equations are 𝐍𝐏{\mathbf{NP}}-hard (see, e.g., Pla (84); Koi (96); BL (07); BS (09)). This is by no means an excuse to consider polynomial equation solving hopeless: computational scientists solve problems of near-exponential complexity every day.

More to the point, thanks to recent work on Smale’s 17th Problem BP (09); BC (10), we have learned that randomization and approximation are the key to avoiding the bottlenecks present in hard deterministic questions involving roots of polynomial systems. Smale’s 17th Problem concerns the complexity of approximating a single complex root of a random polynomial system and is well-discussed in Sma (98, 00); SS92a ; SS92b ; SS (93, 96, 94). Our ultimate goal is to extend this philosophy to the harder problem of localized solving: estimating how far the nearest root of a given system of polynomials (or intersection of several zero sets) is from a given point. We make some initial steps by first approximating the shape of a single zero set, and we then outline a tropical-geometric approach to localized solving in Section 3.

Toward this end, let us first recall the natural idea Vir (01) of drawing zero sets on log-paper. In what follows, we let \mathbb{C}^{*} denote the non-zero complex numbers and write [x1±1,,xn±1]\mathbb{C}\!\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right] for the ring of Laurent polynomials with complex coefficients, i.e., polynomials with negative exponents allowed.

[Uncaptioned image]
Definition 1.1

We use the abbreviations x:=(x1,,xn)x\!:=\!(x_{1},\ldots,x_{n}) and Log|x|:=(log|x1|,,log|xn|)\mathrm{Log}|x|\!:=\!(\log|x_{1}|,\ldots,\log|x_{n}|), and, for any


f[x1±1,,xn±1]f\!\in\!\mathbb{C}\!\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right], we define Amoeba(f):={Log|x||f(x)=0,x()n}\mathrm{Amoeba}(f)\!:=\!\{\mathrm{Log}|x|\;|\;f(x)\!=\!0\ ,\ x\!\in\!{(\mathbb{C}^{*})}^{n}\}. We call ff an nn-variate


tt-nomial when we can write f(x)=i=1tcixaif(x)\!=\!\sum^{t}_{i=1}c_{i}x^{a_{i}} with ci0c_{i}\!\neq\!0, ai:=(a1,i,,an,i)a_{i}\!:=(a_{1,i},\ldots,a_{n,i}), and xai:=x1a1,ix2a2,ixnan,ix^{a_{i}}\!:=\!x^{a_{1,i}}_{1}x^{a_{2,i}}_{2}\cdots x^{a_{n,i}}_{n}


for all ii. Finally, we define the Archimedean tropical variety of ff, denoted ArchTrop(f)\mathrm{ArchTrop}(f), to be


the set of all wnw\!\in\!\mathbb{R}^{n} for which maxi|cieaiw|\max_{i}\left|c_{i}e^{a_{i}\cdot w}\right| is attained for at least two distinct indices ii.1 \diamond


Example 1.2

Taking f(x)=1+x13+x223x1x2f(x)\!=\!1+x^{3}_{1}+x^{2}_{2}-3x_{1}x_{2}, an illustration of Amoeba(f)\mathrm{Amoeba}(f) and ArchTrop(f)\mathrm{ArchTrop}(f),


truncated to [7,7]2[-7,7]^{2}, appears to the right. Amoeba(f)\mathrm{Amoeba}(f) is lightly shaded, while ArchTrop(f)\mathrm{ArchTrop}(f) is


the piecewise-linear curve. \diamond

One may be surprised that Amoeba(f)\mathrm{Amoeba}(f) and ArchTrop(f)\mathrm{ArchTrop}(f) are so highly structured: Amoeba(f)\mathrm{Amoeba}(f) has

tentacles reminiscent of a living amoeba, and ArchTrop(f)\mathrm{ArchTrop}(f) is a polyhedral complex, i.e., a union

of polyhedra intersecting only along common faces. One may also be surprised that Amoeba(f)\mathrm{Amoeba}(f)

and ArchTrop(f)\mathrm{ArchTrop}(f) are so closely related: for our example above, one set is strictly contained in the

11footnotetext: Throughout this paper, for any two vectors u:=(u1,,uN)u\!:=\!(u_{1},\ldots,u_{N}) and v:=(v1,,vN)v\!:=\!(v_{1},\ldots,v_{N}) in N\mathbb{R}^{N}, we use uvu\cdot v to denote the standard dot product u1v1++uNvNu_{1}v_{1}+\cdots+u_{N}v_{N}.

other, every point of one set is close to some point of the other, and both sets have topologically
similar complements (4 open connected components, exactly one of which is bounded).

Proving that Amoeba(f)\mathrm{Amoeba}(f) and ArchTrop(f)\mathrm{ArchTrop}(f) are in fact equal when ff has two or fewer monomial terms is a simple exercise (see Proposition 2.1 below). More generally, to quantify exactly how close Amoeba(f)\mathrm{Amoeba}(f) and ArchTrop(f)\mathrm{ArchTrop}(f) are, one can recall the Hausdorff distance, denoted Δ(U,V)\Delta(U,V), between two subsets U,VnU,V\!\subseteq\!\mathbb{R}^{n}: it is defined to be the maximum of supuUinfvV|uv|\sup_{u\in U}{}\inf_{\begin{subarray}{c}\mbox{}\\ v\in V\end{subarray}}|u-v| and supvVinfuU|uv|\sup_{v\in V}{}\inf_{\begin{subarray}{c}\mbox{}\\ u\in U\end{subarray}}|u-v|. We then have the following recent result of Avendaño, Kogan, Nisse, and Rojas.

Theorem 1.3

AKNR (13) For any nn-variate tt-nomial ff we have Δ(Amoeba(f),ArchTrop(f))(2t3)log(t1)\displaystyle{\Delta(\mathrm{Amoeba}(f),\mathrm{ArchTrop}(f))\leq(2t-3)\log(t-1)}. In particular, we also have supuAmoeba(f)infvArchTrop(f)|uv|log(t1)\displaystyle{\sup\limits_{\text{\scalebox{0.7}[1.0]{$u\in\mathrm{Amoeba}(f)$}}}\inf\limits_{\begin{subarray}{c}\mbox{}\\ \text{\scalebox{0.7}[1.0]{$v\in\mathrm{ArchTrop}(f)$}}\end{subarray}}|u-v|\leq\log(t-1)}. Finally, for any t>n1t\!>\!n\!\geq\!1, there is an nn-variate tt-nomial ff with Δ(Amoeba(f),ArchTrop(f))log(t1)\Delta(\mathrm{Amoeba}(f),\mathrm{ArchTrop}(f))\!\geq\!\log(t-1). \blacksquare

Note that the preceding upper bounds are completely independent of the coefficients, degree, and number of variables of ff

We conjecture that an O(logt)O(\log t) upper bound on the above Hausdorff distance is possible. More practically, as we will see in later examples, Amoeba(f)\mathrm{Amoeba}(f) and ArchTrop(f)\mathrm{ArchTrop}(f) are often much closer than guaranteed by any proven upper bound.

Given the current state of numerical algebraic geometry and algorithmic polyhedral geometry, the preceding metric result suggests that it might be useful to apply Archimedean tropical varieties to speed up polynomial system solving. Our first two main results help set the stage for such speed-ups. Recall that [1]\mathbb{Q}[\sqrt{-1}] denotes those complex numbers whose real and imaginary parts are both rational. Our complexity results will all be stated relative to the classical Turing (bit) model, with the underlying notion of input size clarified below in Definition 1.7.

Theorem 1.4

Suppose f[x1±1,,xn±1]f\!\in\!\mathbb{C}\!\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right] and wnw\!\in\!\mathbb{R}^{n}. Then
log(t1)Δ(Amoeba(f),w)Δ(ArchTrop(f),w)(2t3)log(t1)-\log(t-1)\!\leq\!\Delta(\mathrm{Amoeba}(f),w)-\Delta(\mathrm{ArchTrop}(f),w)\!\leq\!(2t-3)\log(t-1).
In particular, if we also assume that nn is fixed and (f,w)[1][x1±1,,xn±1]×n(f,w)\!\in\!\mathbb{Q}[\sqrt{-1}]\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right]\times\mathbb{Q}^{n} with ff a tt-nomial, then we can compute polynomially many bits of Δ(ArchTrop(f),w)\Delta(\mathrm{ArchTrop}(f),w) in polynomial-time, and there is a polynomial-timealgorithm that declares either (a) Δ(Amoeba(f),w)(2t2)log(t1)\Delta(\mathrm{Amoeba}(f),w)\!\leq\!(2t-2)\log(t-1) or
            (b) wAmoeba(f)w\!\not\in\!\mathrm{Amoeba}(f) and Δ(Amoeba(f),w)Δ(ArchTrop(f),w)log(t1)>0\Delta(\mathrm{Amoeba}(f),w)\!\geq\!\Delta(\mathrm{ArchTrop}(f),w)-\log(t-1)\!>\!0.

Theorem 1.5

Suppose nn is fixed. Then there is a polynomial-time algorithm that, for any input(f,w)[1][x1±1,,xn±1]×n(f,w)\!\in\!\mathbb{Q}[\sqrt{-1}]\!\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right]\times\mathbb{Q}^{n} with ff a tt-nomial, outputs the closure of the unique cell σw\sigma_{w} of nArchTrop(f)\mathbb{R}^{n}\!\setminus\!\mathrm{ArchTrop}(f) (or ArchTrop(f)\mathrm{ArchTrop}(f)) containing ww, described as an explicit intersection of O(t2)O(t^{2}) half-spaces.

The importance of Theorem 1.4 is that deciding whether an input point ww lies in an input Amoeba(f)\mathrm{Amoeba}(f), even restricting to the special case n=1n\!=\!1, is already 𝐍𝐏{\mathbf{NP}}-hard AKNR (13). Theorem 1.5 enables us to find explicit regions, containing a given query point ww, where ff can not vanish. As we will see later in Sections 2.2 and 2.3, improving Theorems 1.4 and 1.5 to polynomial dependence in nn leads us to deep questions in Diophantine approximation and the complexity of linear optimization.

It is thus natural to speculate that tropical varieties can be useful for localized polynomial system solving, i.e., estimating how far the nearest root of a given system of nn-variate polynomials f1,,fkf_{1},\ldots,f_{k} is from an input point xx\!\in\!\mathbb{C}. Our framework indeed enables new positive and negative results on this problem.

Theorem 1.6

Suppose nn is fixed. Then there is a polynomial-time algorithm that, for any input kk and(f1,,fk,w)([1][x1±1,,xn±1])k×n(f_{1},\ldots,f_{k},w)\!\in\!\left(\mathbb{Q}[\sqrt{-1}]\!\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right]\right)^{k}\times\mathbb{Q}^{n}, outputs the closure of the unique cell σw\sigma_{w} of ni=1kArchTrop(fi)\mathbb{R}^{n}\setminus\bigcup^{k}_{i=1}\mathrm{ArchTrop}(f_{i}) (or ArchTrop(fi)wArchTrop(fi)\displaystyle{\bigcap\limits_{\mathrm{ArchTrop}(f_{i})\ni w}\!\!\!\!\!\!\!\!\!\!\mathrm{ArchTrop}(f_{i})}) containing ww, described as an explicit intersection of half-spaces. However, if nn is allowed to

vary, then deciding whether σw\sigma_{w} has a vertex in i=1nArchTrop(fi)\displaystyle{\bigcap\limits^{n}_{i=1}\mathrm{ArchTrop}(f_{i})} is 𝐍𝐏{\mathbf{NP}}-hard.

We will see in Section 3 how the first assertion is useful for finding special start-points for Newton Iteration andHomotopy Continuation that sometimes enable the approximation of just the roots with norm vector near (ew1,,ewn)(e^{w_{1}},\ldots,e^{w_{n}}). The second assertion can be considered as a refined tropical analogue to a classical algebraic complexity result: deciding whether an arbitrary input system of polynomials equations (with integer coefficients) has a complex root is 𝐍𝐏{\mathbf{NP}}-hard GJ (79). However, in light of the recent partial solutions to Smale’s 17th Problem BP (09); BC (10) (showing that randomization and approximation help us evade 𝐍𝐏{\mathbf{NP}}-hardness for average-case inputs), we suspect that an analogous speed-up is possible in the tropical case as well.

On the practical side, we point out that the algorithms underlying Theorems 1.41.6 are quite easily implementable. (A preliminary Matlab implementation of our algorithms is available upon request.) Initial experiments, discussed in Section 3 below, indicate that a large-scale implementation could be a worthwhile companion to existing polynomial system solving software.

Theorems 1.4, 1.5, and 1.6 are respectively proved in Sections 5, 4, and 6. Before moving on to the necessary technical background, let us first clarify our underlying input size and point out some historical context.

Definition 1.7

We define the input size of a polynomial f[x1,,xn]f\!\in\!\mathbb{Z}[x_{1},\ldots,x_{n}], written f(x)=i=1tcixaif(x)\!=\!\sum^{t}_{i=1}c_{i}x^{a_{i}}, to be size(f):=i=1tlog((2+|ci|)j=1n(2+|ai,j|))\mathrm{size}(f)\!:=\!\sum^{t}_{i=1}\log\left((2+|c_{i}|)\prod^{n}_{j=1}(2+|a_{i,j}|)\right), where ai=(ai,1,,ai,n)a_{i}\!=\!(a_{i,1},\ldots,a_{i,n}) for all ii. Similarly, we define the input size of a point (v1,,vn)n(v_{1},\ldots,v_{n})\!\in\!\mathbb{Q}^{n} as the sum of sizes of the numerators and denominators of the viv_{i} (written in lowest terms), and thus extend the notion of input size to polynomials in [x1,,xn]\mathbb{Q}[x_{1},\ldots,x_{n}]. Considering real and imaginary parts, and summing the respect sizes, we then extend the definition of input size further still to polynomials in [1][x1,,xn]\mathbb{Q}\!\left[\sqrt{-1}\right]\![x_{1},\ldots,x_{n}]. \diamond

Remark 1.8

Note that size(f)\mathrm{size}(f) is, up to a bounded multiple, the sum of the bit-sizes of all the coefficients and exponents of ff. Put even more simply, assuming we write integers as usual in some fixed base, and we write rational numbers as fractions in lowest terms, size(f)\mathrm{size}(f) is asymptotically the same as the amount of ink needed to write out ff as a sum of monomial terms. We extend our definition of size to a system of polynomials F:=(f1,,fk)F\!:=\!(f_{1},\ldots,f_{k}) in the obvious way by setting size(F):=i=1ksize(fi)\mathrm{size}(F)\!:=\!\sum^{k}_{i=1}\mathrm{size}(f_{i}). Thus, for example, the size of an input in Theorem 1.6 is size(w)+i=1ksize(fi)\mathrm{size}(w)+\sum^{k}_{i=1}\mathrm{size}(f_{i}). \diamond

Via a slight modification of the classical Horner’s Rule CKS (99), it is easy to see that the number of ring operations needed to evaluate an arbitrary ff at an arbitrary xnx\!\in\!\mathbb{C}^{n} easily admits an O(size(f)2)O(\mathrm{size}(f)^{2}) upper bound.222When just counting ring operations we can in fact ignore the contribution of the coefficient sizes.

Remark 1.9

The definition of input size we use implies that our preceding algorithms yield a significant speed-up over earlier techniques: for an nn-variate tt-nomial ff of degree dd, with nn and tt fixed, our algorithms have complexity polynomial in 𝐥𝐨𝐠𝐝\boldsymbol{\log d}. The best previous techniques from computational algebra, including recent advances on Smale’s 17th Problem BP (09); BC (10), have complexity polynomial in (d+n)!d!n!min{dn,nd}\frac{(d+n)!}{d!n!}\!\geq\!\min\{d^{n},n^{d}\}. \diamond

Historical Notes Using convex and/or piecewise-linear geometry to understand solutions of algebraic equations can be traced back to work of Newton around 1676 New (76). The earliest precursor we know to the n=1n\!=\!1 case of the metric estimate of Theorem 1.3 can be found in work of Ostrowski from around 1940 (Ost, 40, Cor. IX, pg. 143).

More recently, tropical geometry EKL (06); LS (09); IMS (09); BR (10); MS (13) has emerged as a rich framework for reducing deep questions in algebraic geometry to more tractable questions in polyhedral and piecewise-linear geometry. For instance, the combinatorial structure of amoebae was first observed by Gelfand, Kapranov, and Zelevinsky around 1994 GKZ (94). \diamond

Remark 1.10

The reader may wonder why we have not considered the phases of the root coordinates and focussed just on norms. The phase analogue of an amoeba is the co-amoeba, which has only recently been studied HHP (08); NP (10); NS (13, 14). While it is known that the phases of the coordinates of the roots of polynomial systems satisfy certain equidistribution laws (see, e.g., (Kho, 91, Thm. 1 (pp. 82–83), Thm. 2 (pp. 87–88), and Cor. 3 (pg. 88)) and AGS (13)), there does not yet appear to be a phase analogue of ArchTrop(f)\mathrm{ArchTrop}(f). Nevertheless, we will see in Section 3 that our techniques sometimes allow us to approximate actual complex roots, in addition to norms. \diamond

2 Background

2.1 Convex, Piecewise-Linear, and Tropical Geometrical Notions

Let us first recall the origin of the phrase “tropical geometry”, according to Pin (98): the tropical semifield trop\mathbb{R}_{\mathrm{trop}} is the set {}\mathbb{R}\cup\{-\infty\}, endowed with with the operations xy:=x+yx\odot y\!:=\!x+y and xy:=max{x,y}x\oplus y\!:=\!\max\{x,y\}. The adjective “tropical” was coined by French computer scientists, in honor of Brazilian computer scientist Imre Simon, who did pioneering work with algebraic structures involving trop\mathbb{R}_{\mathrm{trop}}. Just as algebraic geometry relates geometric properties of zero sets of polynomials to the structure of ideals in commutative rings, tropical geometry relates the geometric properties of certain polyhedral complexes (see Definition 2.7 below) to the structure of ideals in trop\mathbb{R}_{\mathrm{trop}}.

In our setting, we work with a particular kind of tropical variety that, thanks to Theorem 1.3, approximates Amoeba(f)\mathrm{Amoeba}(f) quite well. For example, one can see directly that Amoeba(0)=ArchTrop(0)=n\mathrm{Amoeba}(0)\!=\!\mathrm{ArchTrop}(0)\!=\!\mathbb{R}^{n} and, for any cc\!\in\!\mathbb{C}^{*} and ana\!\in\!\mathbb{Z}^{n}, Amoeba(cxa)=ArchTrop(cxa)=\mathrm{Amoeba}(cx^{a})\!=\!\mathrm{ArchTrop}(cx^{a})\!=\!\emptyset. The binomial case is almost as easy.

Proposition 2.1

For any ana\!\in\!\mathbb{Z}^{n} and non-zero complex c1c_{1} and c2c_{2}, we have
Amoeba(c1+c2xa)=ArchTrop(c1+c2xa)={wn|aw=log|c1/c2|}\mathrm{Amoeba}(c_{1}+c_{2}x^{a})\!=\!\mathrm{ArchTrop}(c_{1}+c_{2}x^{a})\!=\!\{w\!\in\!\mathbb{R}^{n}\;|\;a\cdot w\!=\!\log|c_{1}/c_{2}|\}.

Proof: If c1+c2xa=0c_{1}+c_{2}x^{a}\!=\!0 then |c2xa|=|c1||c_{2}x^{a}|\!=\!|c_{1}|. We then obtain aw=log|c1/c2|a\cdot w\!=\!\log|c_{1}/c_{2}| upon taking logs and setting w=Log|x|w\!=\!\mathrm{Log}|x|. This proves that Amoeba(c1+c2xa)\mathrm{Amoeba}(c_{1}+c_{2}x^{a}) is exactly the stated affine hyperplane. Similarly, since the definition of ArchTrop(c1+c2xa)\mathrm{ArchTrop}(c_{1}+c_{2}x^{a}) implies that we are looking for ww with |c2eaw|=|c1||c_{2}e^{a\cdot w}|\!=\!|c_{1}|, we see that ArchTrop(c1+c2xa)\mathrm{ArchTrop}(c_{1}+c_{2}x^{a}) defines the same hyperplane. \blacksquare

While ArchTrop(f)\mathrm{ArchTrop}(f) and Amoeba(f)\mathrm{Amoeba}(f) are always metrically close, ArchTrop(f)\mathrm{ArchTrop}(f) need not be contained in, nor even have the same homotopy type as Amoeba(f)\mathrm{Amoeba}(f), in general.
[Uncaptioned image][Uncaptioned image]Example 2.2.

Letting f:=1+x22+x24+x1x22+x1x24+x12x2+x12x22+x13f\!:=\!1+x^{2}_{2}+x^{4}_{2}+x_{1}x^{2}_{2}+x_{1}x^{4}_{2}+x^{2}_{1}x_{2}+x^{2}_{1}x^{2}_{2}+x^{3}_{1} and
g:=0.1+0.2x22+0.1x24+10x1x22+0.001x1x24+0.01x12x2+0.1x12x22+0.000005x13g\!:=\!0.1+0.2x^{2}_{2}+0.1x^{4}_{2}+10x_{1}x^{2}_{2}+0.001x_{1}x^{4}_{2}+0.01x^{2}_{1}x_{2}+0.1x^{2}_{1}x^{2}_{2}+0.000005x^{3}_{1}
we obtain the amoebae and tropical varieties (and more lightly shaded neighborhoods), restricted to[11,11]×[9,9][-11,11]\times[-9,9], as respectively drawn on the left and right above. The outermost shape in the left-hand (resp. right-hand) illustration is a neighborhood of ArchTrop(f)\mathrm{ArchTrop}(f) (resp. Amoeba(g)\mathrm{Amoeba}(g)).

We thus see that every point of Amoeba(f)\mathrm{Amoeba}(f) (resp. ArchTrop(g)\mathrm{ArchTrop}(g)) lies well within a distance of 0.650.65 (resp. 0.490.49) of some point of ArchTrop(f)\mathrm{ArchTrop}(f) (resp. Amoeba(g)\mathrm{Amoeba}(g)), safely within the distance log7<1.946\log 7\!<\!1.946 (resp. 13log7<25.313\log 7\!<\!25.3) guaranteed by the second (resp. first) bound of Theorem 1.3. Note in particular that ArchTrop(g)\mathrm{ArchTrop}(g) has two holes while Amoeba(g)\mathrm{Amoeba}(g) has only a single hole.333 For our purposes, a hole of a subset SnS\!\subseteq\!\mathbb{R}^{n} will simply be a bounded connected component of the complement nS\mathbb{R}^{n}\setminus S. \diamond

Given any ff, one can always easily construct a family of deformations whose amoebae tend to ArchTrop(f)\mathrm{ArchTrop}(f) in a suitable sense. This fact can be found in earlier papers of Viro and Mikhalkin, e.g., Vir (01); Mik (04). However, employing Theorem 1.3 here, we can give a 44-line proof.

Theorem 2.3

For any nn-variate tt-nomial ff written i=1tcixai\sum^{t}_{i=1}c_{i}x^{a_{i}}, and s>0s\!>\!0, define fs(x):=i=1tcisxaif^{*s}(x)\!:=\!\sum^{t}_{i=1}c^{s}_{i}x^{a_{i}}. ThenΔ(1sAmoeba(fs),ArchTrop(f))0\Delta\!\left(\frac{1}{s}\mathrm{Amoeba}(f^{*s}),\mathrm{ArchTrop}(f)\right)\!\rightarrow\!0 as s+s\rightarrow+\infty.

Proof: By Theorem 1.3, Δ(Amoeba(fs),ArchTrop(fs))(2t3)log(t1)\Delta\!\left(\mathrm{Amoeba}(f^{*s}),\mathrm{ArchTrop}(f^{*s})\right)\!\leq\!(2t-3)\log(t-1) for all s>0s\!>\!0. Since |cieaiw|=|cjeajw||c_{i}e^{a_{i}\cdot w}|\!=\!|c_{j}e^{a_{j}\cdot w}|\Longleftrightarrow

|cieaiw|s=|cjeajw|s|c_{i}e^{a_{i}\cdot w}|^{s}\!=\!|c_{j}e^{a_{j}\cdot w}|^{s}, and similarly when “==” is replaced by “>>”, we immediately obtain thatArchTrop(fs)=sArchTrop(f)\mathrm{ArchTrop}(f^{*s})\!=\!s\mathrm{ArchTrop}(f). So then Δ(Amoeba(fs),ArchTrop(fs))=sΔ(1sAmoeba(fs),ArchTrop(f))\Delta\!\left(\mathrm{Amoeba}(f^{*s}),\mathrm{ArchTrop}(f^{*s})\right)\!=\!s\Delta\!\left(\frac{1}{s}\mathrm{Amoeba}(f^{*s}),\mathrm{ArchTrop}(f)\right) and thus Δ(1sAmoeba(fs),ArchTrop(f))(2t3)log(t1)s\Delta\!\left(\frac{1}{s}\mathrm{Amoeba}(f^{*s}),\mathrm{ArchTrop}(f)\right)\!\leq\!\frac{(2t-3)\log(t-1)}{s} for all s>0s\!>\!0. \blacksquare

To more easily link ArchTrop(f)\mathrm{ArchTrop}(f) with polyhedral geometry we will need two variations of the classical Newton polygon. First, let us use Conv(S)\mathrm{Conv}(S) to denote the convex hull of444i.e., smallest convex set containing… a subset SnS\!\subseteq\!\mathbb{R}^{n}, 𝐎:=(0,,0)\mathbf{O}\!:=\!(0,\ldots,0), and [N]:={1,,N}[N]\!:=\!\{1,\ldots,N\}. Recall also that a polytope is the convex hull of a finite point set, a (closed) half-space is any set of the form{wn|awb}\{w\!\in\!\mathbb{R}^{n}\;|\;a\cdot w\!\leq\!b\} (for some bb\!\in\!\mathbb{R} and an{𝐎}a\!\in\!\mathbb{R}^{n}\setminus\{\mathbf{O}\}), and a (closed) polyhedron is any finite intersection of (closed) half-spaces. It is a basic fact from convex geometry that every polytope is a polyhedron, but not vice-versa Grü (03); Zie (95).

Definition 2.4

Given any nn-variate tt-nomial ff written i=1tcixai\sum^{t}_{i=1}c_{i}x^{a_{i}}, we define its (ordinary) Newton polytope to be Newt(f):=Conv({ai}i[t])\mathrm{Newt}(f)\!:=\!\mathrm{Conv}\!\left(\{a_{i}\}_{i\in[t]}\right), and the Archimedean Newton polytope of ff to be ArchNewt(f):=Conv({(ai,log|ci|)}i[t])\mathrm{ArchNewt}(f)\!:=\!\mathrm{Conv}\!\left(\{(a_{i},-\log|c_{i}|)\}_{i\in[t]}\right). Also, for any polyhedron PNP\!\subset\!\mathbb{R}^{N} and vNv\!\in\!\mathbb{R}^{N}, we define the face of PP with outer normal vv to bePv:={xP|vx is maximized}P^{v}\!:=\!\{x\!\in\!P\;|\;v\cdot x\text{ is maximized}\}. The dimension of PP, written dimP\dim P, is simply the dimension of the smallest affine linear subspace containing PP. Faces of PP of dimension 0, 11, and dimP1\dim P-1 are respectively called vertices, edges, and facets. (PP is called the improper face of PP and we set dim=1\dim\emptyset\!=\!-1.) Finally, we call any face of PP lower if and only if it has an outer normal (w1,,wN)(w_{1},\ldots,w_{N}) with wN<0w_{N}\!<\!0, and we let the lower hull of ArchNewt(f)\mathrm{ArchNewt}(f) be the union of the lower faces of ArchNewt(f)\mathrm{ArchNewt}(f). \diamond

Note that ArchNewt(f)\mathrm{ArchNewt}(f) usually has dimension 11 greater than that of Newt(f)\mathrm{Newt}(f). ArchNewt(f)\mathrm{ArchNewt}(f) enables us to relate ArchTrop(f)\mathrm{ArchTrop}(f) to linear programming, starting with the following observation.

Proposition 2.5

For any nn-variate tt-nomial ff, ArchTrop(f)\mathrm{ArchTrop}(f) also has the equivalent definition
{wn|(w,1) is an outer normal of a positive-dimensional face of ArchNewt(f)}\{w\!\in\!\mathbb{R}^{n}\;|\;(w,-1)\text{ is an outer normal of a positive-dimensional face of }\mathrm{ArchNewt}(f)\}.

Proof: The quantity |cixaiw||c_{i}x^{a_{i}\cdot w}| being maximized at at least two indices ii is equivalent to the linear form with coefficients (w,1)(w,-1) being maximized at at least two difference points in {(ai,log|ci|)}i[t]\{(a_{i},-\log|c_{i}|)\}_{i\in[t]}. Since a face of a polytope is positive-dimensional if and only if it has at least two vertices, we are done. \blacksquare

Example 2.6

The Newton polytope of our first example, f=1+x13+x223x1x2f\!=\!1+x^{3}_{1}+x^{2}_{2}-3x_{1}x_{2}, is simply the convex hull of the exponent vectors of the monomial terms: Conv({(0,0),(3,0),(0,2),(1,1)})\mathrm{Conv}(\{(0,0),(3,0),(0,2),(1,1)\}). For the Archimedean Newton polytope, we take the coefficients into account via an extra coordinate: ArchNewt(f)=Conv({(0,0,0),(3,0,0),(0,2,0),(1,1,log3)})\mathrm{ArchNewt}(f)\!=\!\mathrm{Conv}(\{(0,0,0),(3,0,0),(0,2,0),(1,1,-\log 3)\}). In particular, Newt(f)\mathrm{Newt}(f) is a triangle and ArchNewt(f)\mathrm{ArchNewt}(f) is a triangular

pyramid with base Newt(f)×{0}\mathrm{Newt}(f)\times\{0\} and apex lying beneath Newt(f)×{0}\mathrm{Newt}(f)\times\{0\}. Note also that the image of the orthogonal projection of the lower hull of ArchNewt(f)\mathrm{ArchNewt}(f) onto 2×{0}\mathbb{R}^{2}\times\{0\} naturally induces a triangulation of Newt(f)\mathrm{Newt}(f), as illustrated to the right. \diamond

[Uncaptioned image]

Our last example motivates us to consider more general subdivisions and duality. (An outstanding reference is dLRS (10).) Recall that a kk-simplex is the convex hull of k+1k+1 points in N\mathbb{R}^{N} not lying in any (k1)(k-1)-dimensional affine linear subspace of N\mathbb{R}^{N}. A simplex is then simply a kk-simplex for some kk.

Definition 2.7

A polyhedral complex is a collection of polyhedra Σ={σi}i\Sigma\!=\!\{\sigma_{i}\}_{i} such that for all ii we have (a) every face of σi\sigma_{i} is in Σ\Sigma and (b) for all jj we have that σiσj\sigma_{i}\cap\sigma_{j} is a face of both σi\sigma_{i} and σj\sigma_{j}. (We allow empty and improper faces.) The σi\sigma_{i} are the cells of the complex, and the underlying space of Σ\Sigma is |Σ|:=iσi|\Sigma|\!:=\!\bigcup_{i}\sigma_{i}.

A polyhedral subdivision of a polyhedron PP is then simply a polyhedral complex Σ={σi}i\Sigma\!=\!\{\sigma_{i}\}_{i} with |Σ|=P|\Sigma|\!=\!P. We call Σ\Sigma a triangulation if and only if every σi\sigma_{i} is a simplex. Given any finite subset AnA\!\subset\!\mathbb{R}^{n}, a polyhedral subdivision of AA is then just a polyhedral subdivision of Conv(A)\mathrm{Conv}(A) where the vertices of the σi\sigma_{i} all lie in AA. Finally, the polyhedral subdivision of Newt(f)\mathrm{Newt}(f) induced by ArchNewt(f)\mathrm{ArchNewt}(f), denoted Σf\Sigma_{f}, is simply the polyhedral subdivision whose cells are {π(Q)|Q is a lower face of ArchNewt(f)}\{\pi(Q)\;|\;Q\text{ is a lower face of }\mathrm{ArchNewt}(f)\}, where π:n+1n\pi:\mathbb{R}^{n+1}\longrightarrow\mathbb{R}^{n} denotes the orthogonal projection forgetting the last coordinate. \diamond

Recall that a (pointed polyhedral) cone is just the set of all nonnegative linear combinations of a finite set of points. Such cones are easily seen to always be polyhedra Grü (03); Zie (95). Recall also that a bijection, ϕ\phi, between two finite sets AA and BB is just a function ϕ:AB\phi:A\longrightarrow B such that the cardinalities of AA, BB, and f(A)f(A) are all equal.

Example 2.8

The illustration from Example 2.6 shows a triangulation of the point set {(0,0),(3,0),(0,2),(1,1)}\{(0,0),(3,0),(0,2),(1,1)\} which happens to be Σf\Sigma_{f} for f=1+x13+x223x1x2f\!=\!1+x^{3}_{1}+x^{2}_{2}-3x_{1}x_{2}. More to the point, it is easily checked that the outer normals to a face of dimension kk of ArchNewt(f)\mathrm{ArchNewt}(f) form a cone of dimension 3k3-k. In this way, thanks to the natural partial ordering of cells in any polyhedral complex by inclusion, we get an order-reversing bijection between the cells of Σf\Sigma_{f} and pieces of ArchTrop(f)\mathrm{ArchTrop}(f). \diamond

That ArchTrop(f)\mathrm{ArchTrop}(f) is always a polyhedral complex follows directly from Proposition 2.5 above. It is then easy to show that there is always an order-reversing bijection between the cells Σf\Sigma_{f} and the cells of ArchTrop(f)\mathrm{ArchTrop}(f) — an incarnation of polyhedral duality Zie (95).

Example 2.9

We illustrate the preceding order-reversing bijection of cells through our first three tropical varieties, and corresponding subdivisions Σf\Sigma_{f} of Newt(f)\mathrm{Newt}(f), below:
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Note that the vertices of ArchTrop(f)\mathrm{ArchTrop}(f) correspond bijectively to the 22-dimensional cells of Σf\Sigma_{f}, and the 11-dimensional cells of ArchTrop(f)\mathrm{ArchTrop}(f) correspond bijectively to the edges of Σf\Sigma_{f}. (In particular, the rays of ArchTrop(f)\mathrm{ArchTrop}(f) are perpendicular to the edges of Newt(f)\mathrm{Newt}(f).) Note also that the vertices of Σf\Sigma_{f} correspond bijectively to connected components of the complement 2ArchTrop(f)\mathbb{R}^{2}\!\setminus\!\mathrm{ArchTrop}(f). We have taken the liberty of slightly distorting the right-most illustration to make the bijections clearer. \diamond

2.2 The Complexity of Linear Programming

Let us first point out that Pap (95); AB (09); Sip (12) are outstanding references for further background on the classical Turing model and 𝐍𝐏{\mathbf{NP}}-completeness. Let us now focus on some well-known late-20th{}^{\text{\lx@text@underline{th}}} century results on the complexity of linear optimization. These results are covered at much greater length in Sch (86); GLS (93).

Definition 2.10

Let N:={(x1,,xN)n|x1,,xN0}\mathbb{R}^{N}_{\geq}\!:=\!\{(x_{1},\ldots,x_{N})\!\in\!\mathbb{R}^{n}\;|\;x_{1},\ldots,x_{N}\!\geq\!0\} denote the nonnegative orthant. Given Mk×NM\!\in\!\mathbb{R}^{k\times N}with linearly independent rows, c=(c1,,cN)Nc\!=\!(c_{1},\ldots,c_{N})\!\in\!\mathbb{R}^{N}, and b=(b1,,bk)kb\!=\!(b_{1},\ldots,b_{k})\!\in\!\mathbb{R}^{k}, the (standard form) linear optimization problem (M,b,c)\mathcal{L}(M,b,c) is the following problem:

                Maximize cxc\cdot x subject to:
                 Mx=bMx=b
                 xNx\!\in\!\mathbb{R}^{N}_{\geq}

We then define size((M,b,c)):=size(M)+size(b)+size(c)\mathrm{size}(\mathcal{L}(M,b,c))\!:=\!\mathrm{size}(M)+\mathrm{size}(b)+\mathrm{size}(c). The set of all xNx\!\in\!\mathbb{R}^{N}_{\geq} satisfying Mx=bMx\!=\!b is the feasible region of (M,b,c)\mathcal{L}(M,b,c). We call (M,b,c)\mathcal{L}(M,b,c) infeasible if and only if there is no xNx\!\in\!\mathbb{R}^{N}_{\geq} satisfying Mx=bMx\!=\!b. Finally, if (M,b,c)\mathcal{L}(M,b,c) is feasible but does not admit a well-defined maximum, then we call (M,b,c)\mathcal{L}(M,b,c) unbounded. \diamond

Theorem 2.11

Given any linear optimization problem (M,b,c)\mathcal{L}(M,b,c) as defined above, we can decide infeasibility, unboundedness, or (if (M,b,c)\mathcal{L}(M,b,c) is feasible) find an optimal solution xx^{*}, all within time polynomial in size((M,b,c))\mathrm{size}(\mathcal{L}(M,b,c)). In particular, if (M,b,c)\mathcal{L}(M,b,c) is feasible, we can find an optimal solution xx^{*} of size polynomial in size((M,b,c))\mathrm{size}(\mathcal{L}(M,b,c)). \blacksquare

Theorem 2.11 goes back to work of Khachiyan in the late 1970s on the Ellipsoid Method, building upon earlier work of Shor, Yudin, and Nemirovskii Sch (86). Since then, Interior Point Methods have emerged as one of the most practical methods attaining the complexity bound asserted in Theorem 2.11. For simplicity, we will not focus on the best current complexity bounds, since we simply want to prove polynomiality for our algorithms in this paper. Further discussion on improved complexity bounds for linear optimization can be found in MT (02).

Any system of linear inequalities, at the expense of a minor increase in size, is essentially equivalent to the feasible region of some (M,b,c)\mathcal{L}(M,b,c). In what follows, MxbMx\!\leq\!b is understood to mean that M1xb1,,MkxbkM_{1}\cdot x\!\leq\!b_{1},\ldots,M_{k}\cdot x\!\leq\!b_{k} all hold, where MiM_{i} denotes the ithi^{\text{\lx@text@underline{th}}} row of MM.

Proposition 2.12

Given Mk×NM\!\in\!\mathbb{R}^{k\times N} and any collection of inequalities of the form MxbMx\!\leq\!b, there is a standard form linear optimization problem (M¯,b¯,𝐎)\mathcal{L}(\bar{M},\bar{b},\mathbf{O}), satisfying size((M¯,b¯,𝐎))2(size(M)+size(b))+k\mathrm{size}(\mathcal{L}(\bar{M},\bar{b},\mathbf{O}))\!\leq\!2(\mathrm{size}(M)+\mathrm{size}(b))+k, that is feasible if and only if {xn|Mxb}\{x\!\in\!\mathbb{R}^{n}\;|\;Mx\!\leq\!b\} is non-empty. \blacksquare

There is thus no loss of generality in restricting to standard form.

We will frequently work with polyhedra given explicitly in the form P={xn|Mxb}P\!=\!\{x\!\in\!\mathbb{R}^{n}\;|\;Mx\!\leq\!b\} (usually called \mathcal{H}-polytopes), and use Proposition 2.12 and Theorem 2.11 together to rapidly decide various basic questions about PP. For instance, we call a constraint MixbiM_{i}\cdot x\!\leq\!b_{i} of MxbMx\!\leq\!b redundant if and only if the corresponding row of MM can be deleted from MM without affecting PP.

Lemma 2.13

Given any system of linear inequalities MxbMx\!\leq\!b we can, in time polynomial insize(M)+size(b)+size(c)\mathrm{size}(M)+\mathrm{size}(b)+\mathrm{size}(c), find a submatrix MM^{\prime} of MM (and a subvector bb^{\prime} obtained by deleting the correspondingentries from bb) such that {xN|Mxb}={xN|Mxb}\{x\!\in\!\mathbb{R}^{N}\;|\;M^{\prime}x\!\leq\!b^{\prime}\}\!=\!\{x\!\in\!\mathbb{R}^{N}\;|\;M^{\prime}x\!\leq\!b^{\prime}\} and MxbM^{\prime}x\!\leq\!b^{\prime} has no redundant constraints. \blacksquare

The new set of inequalities MxbM^{\prime}x\!\leq\!b^{\prime} is called an irredundant representation of MxbMx\!\leq\!b.

A deep subtlety underlying linear optimization is whether (M,b,c)\mathcal{L}(M,b,c) can be solved in strongly polynomial-time, i.e., is there an analogue of Theorem 2.11 where we instead count arithmetic operations to measure complexity, and obtain complexity polynomial in k+Nk+N?

One of the first successful algorithms for linear optimization — the Simplex Method — has arithmetic complexity O(Nk)O(N^{k}), and there are now variations of the Simplex Method (using sophisticated pivoting rules) that attain arithmetic complexity sub-exponential in kk. (It was also discovered in the 1970s by Borgwardt and Smale that the simplex method is strongly polynomial provided one averages over a suitable distribution of inputs Sch (86).) Strong polynomiality remains an important open problem and is in fact Problem 9 on Fields Medalist Steve Smale’s list of mathematical problems for the 21st{}^{\text{\lx@text@underline{st}}} Century Sma (98, 00).

These issues are actually relevant to polynomial system solving since the linear optimization problems we ultimately solve will have irrational “right-hand sides”: bb will usually be a (rational) linear combination of logarithms of integers in our setting.

In particular, as is well-known in Diophantine Approximation Bak (77), it is far from trivial to efficiently decide the sign of such an irrational number. This problem is also easily seen to be equivalent to deciding inequalities of the form α1β1αNβN>?1\alpha^{\beta_{1}}_{1}\cdots\alpha^{\beta_{N}}_{N}\!\stackrel{{\scriptstyle?}}{{>}}\!1, where the αi\alpha_{i} and βi\beta_{i} are integers. Note, in particular, that while the number of arithmetic operations necessary to decide such an inequality is easily seen to be O((i=1Nlog|βi|)2)O((\sum^{N}_{i=1}\log|\beta_{i}|)^{2}) (via the classical binary method of exponentiation), taking bit-operations into account naively results in a problem that appears to have complexity exponential in log|β1|++log|βN|\log|\beta_{1}|+\cdots+\log|\beta_{N}|. Fortunately, another Fields Medalist, Alan Baker, made major progress on this problem in the late 20th century.

2.3 Irrational Linear Optimization and Approximating Logarithms Well Enough

Recall the following result on comparing monomials in rational numbers.

Theorem 2.14

(BRS, 09, Sec. 2.4) Suppose α1,,αn\alpha_{1},\ldots,\alpha_{n}\!\in\!\mathbb{Q} are positive and β1,,βn\beta_{1},\ldots,\beta_{n}\!\in\!\mathbb{Z}. Also let AA be the maximum of the numerators and denominators of the αi\alpha_{i} (when written in lowest terms) and B:=maxi{|βi|}B\!:=\!\max_{i}\{|\beta_{i}|\}. Then, within
O(n30nlog(B)(loglogB)2logloglog(B)(log(A)(loglogA)2logloglogA)n)O\!\left(n30^{n}\log(B)(\log\log B)^{2}\log\log\log(B)(\log(A)(\log\log A)^{2}\log\log\log A)^{n}\right)
bit operations, we can determine the sign of α1β1αnβn1\alpha^{\beta_{1}}_{1}\cdots\alpha^{\beta_{n}}_{n}-1. \blacksquare

While the underlying algorithm is a simple application of Arithmetic-Geometric Mean Iteration (see, e.g., Ber (03)), its complexity bound hinges on a deep estimate of Nesterenko Nes (03), which in turn refines seminal work of Matveev Mat (00) and Alan Baker Bak (77) on linear forms in logarithms. Whether the dependence on nn in the bound above can be improved to polynomial is a very deep open question related to the famous abcabc-Conjecture Bak (98); Nit .

Via the Simplex Method, or even a brute force search through all basic feasible solutions of (M,b,c)\mathcal{L}(M^{\prime},b^{\prime},c^{\prime}), we can obtain the following consequence of Theorems 2.11 and 2.14.

Corollary 2.15

Suppose nn is fixed, knk\!\leq\!n, Mk×nM\!\in\!\mathbb{Q}^{k\times n}, and bi:=log|βi|b_{i}\!:=\!\log|\beta_{i}| with βi\beta_{i}\!\in\!\mathbb{Q}^{*} for all i[k]i\!\in\![k], and we setb:=(b1,,bk)b\!:=\!(b_{1},\ldots,b_{k}). Then we can decide feasibility for MxbMx\!\leq\!b, and compute an irredundant representationMxbM^{\prime}x\!\leq\!b^{\prime} for MxbMx\!\leq\!b, in time polynomial in size(M)+size(b)\mathrm{size}(M)+\mathrm{size}(b). \blacksquare

The key trick behind the proof of Corollary 2.15 is that, after converting to standard form, any basic feasible solution of the underlying linear optimization problem has all its irrationalities concentrated on the right-hand side. In particular, standard linear algebra bounds tell us that the right-hand side involves a linear combination of logarithms with coefficients of size polynomial in the input size.

3 Tropical Start-Points for Numerical Iteration and an Example

We begin by outlining a method for picking start-points for Newton Iteration (see, e.g., (BCSS, 98, Ch. 8) for a modern perspective) and Homotopy Continuation HL (95); SW (05); Ver (10); LL (11); BHSW (13). While we do not discuss these methods for solving polynomial equations in detail, let us point out that Homotopy Continuation (combined with Smale’s α\alpha-Theory for certifying roots BCSS (98); BHSW (13)) is currently the fastest and most reliable method for numerically solving polynomial systems in complete generality. Other important methods include Resultants EC (95) and Gröbner Bases FHP (03). However, while these alternative methods are of great importance in certain algebraic and theoretical applications AKS (13); FGHR (13), Homotopy Continuation is currently the method of choice for practical large-scale numerical computation.

While the boxed steps below admit a simple and easily parallelizable brute-force search, they form the portion of the algorithm that is the most challenging to speed up to complexity polynomial in nn.

Algorithm 3.1

(Coarse Approximation to Roots with Log-Norm Vector Near a Query Point)
Input. Polynomials f1,,fn[x1±1,,xn±1]f_{1},\ldots,f_{n}\!\in\!\mathbb{C}\!\left[x^{\pm 1}_{1},\ldots,x^{\pm 1}_{n}\right], with fi(x)=j=1tici,jxaj(i)f_{i}(x)\!=\!\sum^{t_{i}}_{j=1}c_{i,j}x^{a_{j}(i)} a tit_{i}-nomial for all ii, and a query point
    wnw\!\in\!\mathbb{R}^{n}.
Output. An ordered nn-tuple of sets of indices (Ji)i=1n(J_{i})^{n}_{i=1} such that gi:=jJici,jxaj(i)g_{i}\!:=\!\sum_{j\in J_{i}}c_{i,j}x^{a_{j}(i)} is a sub-summand of fif_{i}, and the
     roots of G:=(g1,,gn)G\!:=\!(g_{1},\ldots,g_{n}) are near the roots of F:=(f1,,fn)F\!:=\!(f_{1},\ldots,f_{n}) with log-norm vector near ww.
Description.
1. Let σw\sigma_{w} be the closure of the unique cell of ni=1nArchTrop(fi)\mathbb{R}^{n}\setminus\bigcup^{n}_{i=1}\mathrm{ArchTrop}(f_{i}) or ArchTrop(fi)wArchTrop(fi)\bigcap\limits_{\mathrm{ArchTrop}(f_{i})\ni w}\mathrm{ArchTrop}(f_{i}) containing ww.
2. If σw\sigma_{w} has no vertices in i=1nArchTrop(fi)\bigcap^{n}_{i=1}\mathrm{ArchTrop}(f_{i}) then output an irredundant collection of facet inequalities for σw\sigma_{w},
   output
‘‘There are no roots of FF in σw\sigma_{w}.’’, and STOP.
3. Otherwise, fix a vertex vv of σw\sigma_{w} and, for each i[n]i\!\in\![n], let EiE_{i} be any edge of ArchNewt(f)\mathrm{ArchNewt}(f) generating a facet of
   ArchTrop(fi)\mathrm{ArchTrop}(f_{i}) containing vv.
4. For all i[n]i\!\in\![n], let Ji:={j|(aj(i),log|ci,j|)Ei}J_{i}\!:=\!\{j\;|\;(a_{j}(i),-\log|c_{i,j}|)\!\in\!E_{i}\}.
5. Output (Ji)i=1n(J_{i})^{n}_{i=1}. \blacksquare

Remark 3.2

The output system GG is useful because, with probability 11 (for most reasonable distributions on the coefficients), all the gig_{i} are binomials, and binomial systems are particularly easy to solve: they are equivalent to linear equations in the logarithms of the original variables. In particular, an n×nn\times n binomial system output by our algorithm always results in a collection of roots all sharing a single vector of norms.

The connection to Newton Iteration is then easy to state: use any root of GG as a start-point z(0)z(0) for the iteration
z(n+1):=z(n)Jac(F)1|z(n)F(z(n))z(n+1)\!:=\!z(n)-\mathrm{Jac}(F)^{-1}|_{z(n)}F(z(n)).
The connection to Homotopy Continuation is also simple: use the pair (G,ζ)(G,\zeta) (for any root ζ\zeta of GG) to start a path converging (under the usual numerical conditioning assumptions on whatever predictor-corrector method one is using) to a root of FF with log-norm vector near ww. Note that it is safer to do the extra work of Homotopy Continuation, but there will be cases where the tropical start-points from Algorithm 3.1 are sufficiently good that Newton Iteration is enough to converge to a true root.

Note in particular that we have the freedom to follow as few start-points, or as few paths, as we want. When our start-points (resp. paths) indeed converge to nearby roots, we obtain a tremendous savings over having to follow all start-points (resp. paths). \diamond

Definition 3.3

Following the notation of Theorem 1.6 and Algorithm 3.1, we call a vertex vv of σw\sigma_{w} mixed if and only if it lies in i=1nArchTrop(fi)\bigcap^{n}_{i=1}\mathrm{ArchTrop}(f_{i}). \diamond

Example 3.4

Let us make a 2×22\times 2 polynomial system out of our first and third examples:
f1:=1+x13+x223x1x2f_{1}\!:=\!1+x^{3}_{1}+x^{2}_{2}-3x_{1}x_{2}
f2:=0.1+0.2x22+0.1x24+10x1x22+0.001x1x24+0.01x12x2+0.1x12x22+0.000005x13f_{2}\!:=\!0.1+0.2x^{2}_{2}+0.1x^{4}_{2}+10x_{1}x^{2}_{2}+0.001x_{1}x^{4}_{2}+0.01x^{2}_{1}x_{2}+0.1x^{2}_{1}x^{2}_{2}+0.000005x^{3}_{1}
[Uncaptioned image]         [Uncaptioned image]
The system F:=(f1,f2)F\!:=\!(f_{1},f_{2}) has exactly 1212 roots in ()2(\mathbb{C}^{*})^{2}, the coordinate-wise log-norms of which form the small clusters near certain intersections of ArchTrop(f1)\mathrm{ArchTrop}(f_{1}) and ArchTrop(f2)\mathrm{ArchTrop}(f_{2}).555The root count was verified via an exact Gröbner basis calculation using the commercial software package Maple 14 . Numerical approximation of the log-norm vectors to accuracy 10410^{-4} per coordinate was then done via the publically available package Bertini BHSW (13), using default settings. Both calculations took a fraction of a second. The cell σw\sigma_{w} was computed via Matlab 7.11.0 (R2010b). In particular, there is a heptagonal cell, which we have magnified, with 22 vertices close to the log-norm vectors of some of the roots. This cell, which looks hexagonal because it has a pair of vertices that are too close to distinguish visually, happens to be σw\sigma_{w} for w=(2,1)w\!=\!(2,1). Note that σw\sigma_{w} has exactly 22 mixed vertices.

Applying Algorithm 3.1 to our (f1,f2,w)(f_{1},f_{2},w) we then have 22 possible outputs, depending on which mixed vertex of σw\sigma_{w} we pick. The output corresponding to the circled vertex is the pair of index sets ({2,3},{3,4})(\{2,3\},\{3,4\}). More concretely, Algorithm 3.1 alleges that the system
G:=(g1,g2):=(x13+x22,0.1x24+10x1x22)G\!:=\!(g_{1},g_{2})\!:=\!(x^{3}_{1}+x^{2}_{2},0.1x^{4}_{2}+10x_{1}x^{2}_{2})
has roots with log-norm vector near a log-norm vector of a root of FF that is in turn close to ww. Indeed, the sole log-norm vector coming from the roots of GG is (log10,32log10)\left(\log 10,\frac{3}{2}\log 10\right) and the roots themselves are {(±10,1000)}\{(\pm 10,\sqrt{\mp 1000})\} (with both values of the square root allowed). All 44 roots in fact converge (under Newton iteration, with no need for Homotopy Continuation) to true roots of FF: (10,10001/2)(-10,1000^{1/2}) and (10,10001/2)(-10,-1000^{1/2}) respectively converge to the roots of FF with closest and third closest log-norm vector to ww. The other two roots of GG converge to a conjugate pair of roots of FF with log-norm vector (2.4139,3.5103)(2.4139,3.5103) (to 44 decimal places) lying in the small circle. \diamond

Remark 3.5

The cell σw\sigma_{w} from Step 1 can be found in polynomial-time, thanks to Theorem 1.5, and its underlying algorithm contained in Corollary 2.15.

As for Steps 2 and 3, thanks to duality, the facets of ArchTrop(fi)\mathrm{ArchTrop}(f_{i}) correspond exactly to lower edges of ArchNewt(fi)\mathrm{ArchNewt}(f_{i}). So, to find the vertex vv (or decide that it doesn’t exist), it suffices to do a brute-force search through all nn-tuples of lower edges, one coming from each of ArchNewt(f1),,ArchNewt(fn)\mathrm{ArchNewt}(f_{1}),\ldots,\mathrm{ArchNewt}(f_{n}). This particular kind of geometric computation has its origins in the algorithmic study of mixed volume EC (95); LL (11). There are various ways of speeding up this search and there is much interesting computational geometry to be studied in this direction. \diamond

Let us be clear that we have not yet proved a metric guarantee for Algorithm 3.1 in the spirit of Theorem 1.3. Rigorous results in this direction, as well as a broad experimental understanding of our techniques, are of the utmost importance and we hope to address these points in the near future.

Remark 3.6

We have intentionally written Algorithm 3.1 in terms of a more general class of inputs than necessary for our examples. For such general inputs, it makes more sense to measure complexity in terms of arithmetic operations instead of bit operations. \diamond

Remark 3.7

The reader should be aware that while we have relied upon Diophantine approximation and subtle aspects of the Simplex Method to prove our bit-complexity bounds, one can certainly be more flexible when using our approach in practical, floating-point computations. For instance, heuristically, it appears that one can get away with less accuracy than stipulated by Theorem 2.14 when comparing linear combinations of logarithms. Similarly, one should feel free to use the fastest (but still reasonably accurate) algorithms for linear optimization when applying our methods to large-scale polynomial systems. \diamond

4 Proof of Theorem 1.5

Using t1t-1 comparisons, we can isolate all indices ii such that maxi|cieaiw|\max_{i}|c_{i}e^{a_{i}\cdot w}| is attained. Thanks to Theorem 2.14, this can be done in polynomial-time. We then obtain, say, JJ equations of the form aiw=log|ci|a_{i}\cdot w\!=\!-\log|c_{i}| and KK inequalities of the form aiw>log|ci|a_{i}\cdot w\!>\!-\log|c_{i}| or aiw<log|ci|a_{i}\cdot w\!<\!-\log|c_{i}|.

Thanks to Lemma 2.13, combined with Corollary 2.15, we can determine the exact cell of ArchTrop(f)\mathrm{ArchTrop}(f) containing ww if J2J\!\geq\!2. Otherwise, we obtain the unique cell of nArchTrop(f)\mathbb{R}^{n}\!\setminus\!\mathrm{ArchTrop}(f) containing ww. Note also that an (n1)(n-1)-dimensional face of either kind of cell must be the dual of an edge of ArchNewt(f)\mathrm{ArchNewt}(f). Since every edge has exactly 22 vertices, there are at most t(t1)/2t(t-1)/2 such (n1)(n-1)-dimensional faces, and thus σw\sigma_{w} is the intersection of at most t(t1)/2t(t-1)/2 half-spaces. So we are done. \blacksquare

Remark 4.1

Theorem 1.5 also generalizes an earlier complexity bound for deciding membership in ArchTrop(f)\mathrm{ArchTrop}(f) from AKNR (13). \diamond

5 Proof of Theorem 1.4

Since ArchTrop(f)\mathrm{ArchTrop}(f) and Amoeba(f)\mathrm{Amoeba}(f) are closed, Δ(w,ArchTrop(f))=|wv|\Delta(w,\mathrm{ArchTrop}(f))\!=\!|w-v| for some point vArchTrop(f)v\!\in\!\mathrm{ArchTrop}(f) and Δ(w,Amoeba(f))=|wu|\Delta(w,\mathrm{Amoeba}(f))\!=\!|w-u| for some point uAmoeba(f)u\!\in\!\mathrm{Amoeba}(f).

Now, by the second upper bound of Theorem 1.3, there is a point vArchTrop(f)v^{\prime}\!\in\!\mathrm{ArchTrop}(f) within distance log(t1)\log(t-1) of uu. Clearly, |wv||wv||w-v|\!\leq\!|w-v^{\prime}|. Also, by the Triangle Inequality, |wv||wu|+|uv||w-v^{\prime}|\!\leq\!|w-u|+|u-v^{\prime}|. So then,
Δ(w,ArchTrop(f))Δ(w,Amoeba(f))+log(t1)\Delta(w,\mathrm{ArchTrop}(f))\!\leq\!\Delta(w,\mathrm{Amoeba}(f))+\log(t-1),
and thus Δ(w,Amoeba(f))Δ(w,ArchTrop(f))log(t1)\Delta(w,\mathrm{Amoeba}(f))-\Delta(w,\mathrm{ArchTrop}(f))\!\geq\!-\log(t-1).

Similarly, by the first upper bound of Theorem 1.3, there is a point uAmoeba(f)u^{\prime}\!\in\!\mathrm{Amoeba}(f) within distance(2t3)log(t1)(2t-3)\log(t-1) of vv. Clearly, |wu||wu||w-u|\!\leq\!|w-u^{\prime}|. Also, by the Triangle Inequality, |wu||wv|+|vu||w-u^{\prime}|\!\leq\!|w-v|+|v-u^{\prime}|. So then, Δ(w,Amoeba(f))Δ(w,ArchTrop(f))+(2t3)log(t1)\Delta(w,\mathrm{Amoeba}(f))\!\leq\!\Delta(w,\mathrm{ArchTrop}(f))+(2t-3)\log(t-1), and thus
Δ(w,Amoeba(f))Δ(w,ArchTrop(f))(2t3)log(t1)\Delta(w,\mathrm{Amoeba}(f))-\Delta(w,\mathrm{ArchTrop}(f))\!\leq\!(2t-3)\log(t-1).
So our first assertion is proved.

Now, if ff has coefficients with rational real and imaginary parts, Theorem 1.5 tells us that we have an explicit description of σw\sigma_{w} as the intersection of a number of half-spaces polynomial in the input size. Moreover, the bit-sizes of the coefficients of the underlying inequalities are also polynomial in the input size. So we can compute the distance DD from ww to ArchTrop(f)\mathrm{ArchTrop}(f) by finding which facet of σw\sigma_{w} has minimal distance to ww. The distance from ww to any such facet can be computed in polynomial-time via the classical formula for distance between a point and an affine hyperplane, and Theorem 2.14:
Δ(w,{x|αx=β})=|αw|sign(αw)β|α|\Delta(w,\{x\;|\;\alpha\cdot x=\beta\})=\frac{|\alpha\cdot w|-\mathrm{sign}(\alpha\cdot w)\beta}{|\alpha|}
In particular, we may efficiently approximate DD by efficiently approximating the underlying square-roots and logarithms. The latter can be accomplished by Arithmetic-Geometric Iteration, as detailed in Ber (03). So our statement on leading bits is proved.

The final assertion then follows easily: we merely decide whether Δ(w,ArchTrop(f))\Delta(w,\mathrm{ArchTrop}(f)) strictly exceeds log(t1)\log(t-1) or not, via the algorithm we just outlined. Thanks to our initial observations using the Triangle Inequality, it is clear that Output (b) or Output (a) occurs according as Δ(w,ArchTrop(f))>log(t1)\Delta(w,\mathrm{ArchTrop}(f))\!>\!\log(t-1) or not. \blacksquare

6 Proving of Theorem 1.6

6.1 Fast Cell Computation: Proof of the First Assertion

First, we apply Theorem 1.5 to (fi,w)(f_{i},w) for each i[k]i\!\in\![k] to find which ArchTrop(fi)\mathrm{ArchTrop}(f_{i}) contain ww.

If ww lies in no ArchTrop(fi)\mathrm{ArchTrop}(f_{i}), then we simply use Corollary 2.15 (as in our proof of Theorem 1.5) to find an explicit description of the closure of the cell of ni=1kArchTrop(fi)\mathbb{R}^{n}\!\setminus\!\bigcup^{k}_{i=1}\mathrm{ArchTrop}(f_{i}) containing ww. Otherwise, we find the cells of ArchTrop(fi)\mathrm{ArchTrop}(f_{i}) (over those ii with ArchTrop(fi)\mathrm{ArchTrop}(f_{i}) containing ww) that contain ww. Then, applying Corollary 2.15 once again, we find the unique cell of ArchTrop(fi)wArchTrop(fi)\bigcap\limits_{\mathrm{ArchTrop}(f_{i})\ni w}\mathrm{ArchTrop}(f_{i}) containing ww.

Assume that fif_{i} has exactly tit_{i} monomial terms for all ii. In either of the preceding cases, the total number of half-spaces involved is no more than i=1kti(ti1)/2\sum^{k}_{i=1}t_{i}(t_{i}-1)/2. So the over-all complexity of our redundancy computations is polynomial in the input size and we are done. \blacksquare

6.2 Hardness of Detecting Mixed Vertices: Proving the Second Assertion

It will clarify matters if we consider a related 𝐍𝐏{\mathbf{NP}}-hard problem for rational polytopes first, before moving on to cells with irrationalities.

Preparation over \mathbb{Q}

In the notation of Definition 3.3, let us first consider the following decision problem. We assume all polyhedra are given explicitly as finite collections of rational linear inequalities, with size defined as in Section 2.2.

Mixed-Vertex:
Given nn\in\mathbb{N} and polyhedra P1,,PnP_{1},\ldots,P_{n} in n\mathbb{R}^{n}, does P:=i=1nPiP:=\bigcap_{i=1}^{n}P_{i} have a mixed vertex? \blacksquare

While Mixed-Vertex can be solved in polynomial time when the dimension is fixed, we will show that, for nn varying, the problem is 𝐍𝐏{\mathbf{NP}}-complete, even when restricting to the case where all polytopes are full-dimensional and P1,,Pn1P_{1},\ldots,P_{n-1} are axes-parallel bricks.

Let eie_{i} denote the ithi^{\text{\lx@text@underline{th}}} standard basis vector in n\mathbb{R}^{n}. Also, given αn\alpha\!\in\!\mathbb{R}^{n} and β\beta\!\in\!\mathbb{R}, we will use the following notation for certain hyperplanes and halfspaces in n\mathbb{R}^{n} determined by α\alpha and β\beta:

H(α,β):={xn|αx=β},H(α,β):={xn|αxβ}.H_{(\alpha,\beta)}:=\{x\in\mathbb{R}^{n}\;|\;\alpha\cdot x=\beta\},\qquad H_{(\alpha,\beta)}^{\leq}:=\{x\in\mathbb{R}^{n}\;|\;\alpha\cdot x\leq\beta\}.

For i[n]i\in[n], let n,sin,s_{i}\in\mathbb{N},

Mi:=[mi,1,,mi,si]Tsi×n,βi:=(βi,1,,βi,si)si, and Pi={xn|Mixbi}.M_{i}:=[m_{i,1},\ldots,m_{i,s_{i}}]^{T}\in\mathbb{Z}^{s_{i}\times n},\quad\beta_{i}:=(\beta_{i,1},\ldots,\beta_{i,s_{i}})\in\mathbb{Z}^{s_{i}},\text{ and }\quad P_{i}=\{x\in\mathbb{R}^{n}\;|\;M_{i}x\leq b_{i}\}.

Since linear programming can be solved in polynomial-time (in the cases we consider) we may assume that the presentations (n,si;Mi,bi)(n,s_{i};M_{i},b_{i}) are irredundant, i.e., PiP_{i} has exactly sis_{i} facets and the sets PiH(ai,j,βi,j)P_{i}\cap H_{(a_{i,j},\beta_{i,j})}, for j[si]j\!\in\![s_{i}], are precisely the facets of PiP_{i} for all i[n]i\!\in\![n].

Now set P:=i=1nPiP:=\bigcap_{i=1}^{n}P_{i} and let vnv\in\mathbb{Q}^{n}. Note that size(P)\mathrm{size}(P) is thus linear in i=1nsize(Pi)\sum^{n}_{i=1}\mathrm{size}(P_{i}).

Lemma 6.1

Mixed-Vertex 𝐍𝐏\in{\mathbf{NP}}.

Proof: Since the binary sizes of the coordinates of the vertices of PP are bounded by a polynomial in the input size, we can use vectors vnv\in\mathbb{Q}^{n} of polynomial size as certificates. We can check in polynomial-time whether such a vector vv is a vertex of PP simply by exhibiting nn facets (with linearly independent normal vectors), one from each PiP_{i}, containing vv. If this is not the case, vv cannot be a mixed-vertex of PP. Otherwise, vv is a mixed-vertex of PP if and only if for each i[n]i\!\in\![n] there exists a facet FiF_{i} of PiP_{i} with vFiv\!\in\!F_{i}. Since the facets of the polytopes PiP_{i} admit polynomial-time decriptions as \mathcal{H}-polytopes, this can be checked by a total of m1++mnm_{1}+\ldots+m_{n} polytope membership tests.

So, we can check in polynomial-time whether a given certificate vv is a mixed-vertex of PP. Hence Mixed-Vertex is in 𝐍𝐏{\mathbf{NP}}. \blacksquare

Since, in fixed dimensions we can actually list all vertices of PP in polynomial-time, one by one, it is clear that Mixed-Vertex can be solved in polynomial-time when nn is fixed. When nn is allowed to vary we obtain hardness:

Theorem 6.2

Mixed-Vertex is 𝐍𝐏{\mathbf{NP}}-hard.

Recall that \sqcup denotes disjoint union. The proof of Theorem 6.2 will be based on a transformation from the following decision problem:

Partition
Given dd\in\mathbb{N}, α1,,αd\alpha_{1},\dots,\alpha_{d}\in\mathbb{N}, is there a partition d=IJd\!=\!I\sqcup J such that iIαi=jJαj\sum_{i\in I}\alpha_{i}=\sum_{j\in J}\alpha_{j}? \blacksquare

Recall that Partition was on the original list of 𝐍𝐏{\mathbf{NP}}-complete problems from Kar (72).

Let an instance (d;α1,,αd)(d;\alpha_{1},\dots,\alpha_{d}) of Partition be given, and set α:=(α1,,αd)\alpha:=(\alpha_{1},\dots,\alpha_{d}). Then we are looking for a point x{1,1}dx\in\{-1,1\}^{d} with αx=0\alpha\cdot x=0.

We will now construct an equivalent instance of Mixed-Vertex. With n:=d+1n\!:=\!d+1, x:=(ξ1,,ξn1)x:=(\xi_{1},\ldots,\xi_{n-1}) and11n:=(1,,1)n{\text{\rm 1}\kern-4.10004pt\text{\rm 1}}_{n}:=(1,\ldots,1)\in\mathbb{R}^{n} let

Pi:={[xξn]|1ξi1,2ξj2 for all j[n]{i}}P_{i}:=\left\{\left.\begin{bmatrix}x\\ \xi_{n}\end{bmatrix}\right|-1\leq\xi_{i}\leq 1,\,-2\leq\xi_{j}\leq 2\text{ for all }j\in[n]\setminus\{i\}\right\}

for i[n1]i\in[n-1],

Pn:={[xξn]|211n1x211n1, 1ξn1, 02αx1},P_{n}:=\left\{\left.\begin{bmatrix}x\\ \xi_{n}\end{bmatrix}\;\right|\;-2\cdot{\text{\rm 1}\kern-4.10004pt\text{\rm 1}}_{n-1}\leq x\leq 2\cdot{\text{\rm 1}\kern-4.10004pt\text{\rm 1}}_{n-1},\,1\leq\xi_{n}\leq 1,\,0\leq 2\alpha\cdot x\leq 1\right\},

and set P:=i=1nPiP\!:=\!\bigcap_{i=1}^{n}P_{i}, α^:=[α0]\widehat{\alpha}:=\begin{bmatrix}\alpha\\ 0\end{bmatrix}.

The next lemma shows that Pn{1,1}nP_{n}\cap\{-1,1\}^{n} still captures the solutions of the given instance of partition.

Lemma 6.3

(d;α1,,αd)(d;\alpha_{1},\dots,\alpha_{d}) is a “no”-instance of Partition if and only if Pn{1,1}nP_{n}\cap\{-1,1\}^{n} is empty.

Proof: Suppose, first, that (d;α1,,αd)(d;\alpha_{1},\dots,\alpha_{d}) is a “no”-instance of Partition. If PnP_{n} is empty there is nothing left to prove. So, let yPny\in P_{n} and w{1,1}n1×w\in\{-1,1\}^{n-1}\times\mathbb{R}. Since αd\alpha\in\mathbb{N}^{d} we have |α^w|1|\widehat{\alpha}\cdot w|\geq 1. Hence, with the aid of the Cauchy-Schwarz inequality, we have

1\displaystyle 1 |α^w|=|α^y+α^(wy)||α^y|+|α^(wy)|\displaystyle\leq|\widehat{\alpha}\cdot w|=|\widehat{\alpha}\cdot y+\widehat{\alpha}\cdot(w-y)|\leq|\widehat{\alpha}\cdot y|+|\widehat{\alpha}\cdot(w-y)|
12+|α^||wy|=12+|a||wy|\displaystyle\leq\frac{1}{2}+|\widehat{\alpha}|\cdot|w-y|=\frac{1}{2}+|a|\cdot|w-y|

and thus |wy|12|a|>0|w-y|\geq\frac{1}{2|a|}>0. Therefore Pn({1,1}n1×)P_{n}\cap\bigl{(}\{-1,1\}^{n-1}\times\mathbb{R}\bigr{)} is empty.

Now, let Pn{1,1}n=P_{n}\cap\{-1,1\}^{n}=\emptyset. Since α^n1×{0}\widehat{\alpha}\in\mathbb{R}^{n-1}\times\{0\} we have Pn{1,1}n=P_{n}\cap\{-1,1\}^{n}=\emptyset. \blacksquare

The next lemma reduces the possible mixed-vertices to the vertical edges of the standard cube.

Lemma 6.4

Following the preceding notation, let vv be a mixed-vertex of PP. Then v{1,1}n1×[1,1]v\!\in\!\{-1,1\}^{n-1}\times[-1,1].

Proof: First note that Q:=i=1n1Pi=[1,1]n1×[2,2]Q\!:=\!\bigcap_{i=1}^{n-1}P_{i}=[-1,1]^{n-1}\times[-2,2]. Therefore, for each i[n1]i\!\in\![n-1], the only facets of PiP_{i} that meet QQ are those in H(ei,±1)H_{(e_{i},\pm 1)} and H(en,±2)H_{(e_{n},\pm 2)}. Since P[1,1]nP\subset[-1,1]^{n}, and for each i[n1]i\in[n-1] the mixed-vertex vv must be contained in a facet of PiP_{i}, we have

v[1,1]ni=1n1(δi{1,1}H(ei,δi))={1,1}n1×[1,1],v\in[-1,1]^{n}\cap\bigcap_{i=1}^{n-1}\left(\bigcup_{\delta_{i}\in\{-1,1\}}H_{(e_{i},\delta_{i})}\right)=\{-1,1\}^{n-1}\times[-1,1],

which proves the assertion. \blacksquare

The next lemma adds PnP_{n} to the consideration.

Lemma 6.5

Let vv be a mixed-vertex of PP. Then v{1,1}nv\in\{-1,1\}^{n}.

Proof: By Lemma 6.4, v{1,1}n1×[1,1]v\subset\{-1,1\}^{n-1}\times[-1,1]. Since the hyperplanes H(en,±2)H_{(e_{n},\pm 2)} do not meet [1,1]n[-1,1]^{n},

vH(ei,2)H(ei,2) for all i[n1].v\not\in H_{(e_{i},-2)}\cup H_{(e_{i},2)}\quad\text{ for all }i\!\in\![n-1].

Hence, vv can only be contained in the constraint hyperplanes H(α^,0),H(2α^,1),H(en,1),H(en,1)H_{(\widehat{\alpha},0)},H_{(2\widehat{\alpha},1)},H_{(e_{n},-1)},H_{(e_{n},1)}. Since α^n1×{0}\widehat{\alpha}\in\mathbb{R}^{n-1}\times\{0\}, the vector α^\widehat{\alpha} is linearly dependent on e1,,en1e_{1},\ldots,e_{n-1}. Hence, vH(en,1)H(en,1)v\!\in\!H_{(e_{n},-1)}\cup H_{(e_{n},1)}, i.e., v{1,1}nv\in\{-1,1\}^{n}. \blacksquare

Now we can prove the 𝐍𝐏{\mathbf{NP}}-hardness of Mixed-Vertex.

Proof of Theorem 6.2: First, let (d;α1,,αd)(d;\alpha_{1},\dots,\alpha_{d}) be a “yes”-instance of Partition, let x:=(ξ1,,ξn1){1,1}n1x^{*}:=(\xi^{*}_{1},\ldots,\xi_{n-1}^{*})\!\in\!\{-1,1\}^{n-1} be a solution, and set

ξn:=1,v:=[xξn],Fi:=H(ei,ξi)Pi for all i[n], and F^n:=H(α^,0)Pn.\xi_{n}^{*}:=1,\quad v:=\begin{bmatrix}x^{*}\\ \xi_{n}^{*}\end{bmatrix},\quad F_{i}:=H_{(e_{i},\xi_{i}^{*})}\cap P_{i}\quad\text{ for all }i\!\in\![n],\text{ and }\hat{F}_{n}:=H_{(\widehat{\alpha},0)}\cap P_{n}.

Then vF^nPnv\in\hat{F}_{n}\subset P_{n}, hence vPv\in P and, in fact, vv is a vertex of PP. Furthermore, FiF_{i} is a facet of PiP_{i} for all i[n]i\!\in\![n], vi=1nFiv\!\in\!\bigcap_{i=1}^{n}F_{i}, and thus vv is a mixed-vertex of PP.

Conversely, let (d;α1,,αd)(d;\alpha_{1},\dots,\alpha_{d}) be a “no”-instance of Partition, and suppose that vnv\in\mathbb{R}^{n} is a mixed-vertex of PP. By Lemma 6.5, v{1,1}nv\in\{-1,1\}^{n}. Furthermore, vv lies in a facet of PP. Hence, in particular, vPnv\in P_{n}, i.e., Pn{1,1}nP_{n}\cap\{-1,1\}^{n} is empty. Therefore, by Lemma 6.3, (d;α1,,αd)(d;\alpha_{1},\dots,\alpha_{d}) is a “yes”-instance of Partition. This contradiction shows that PP does not have a mixed-vertex.

Clearly, the transformation works in polynomial-time. \blacksquare

6.3 Proof of the Second Assertion of Theorem 1.6

We call a polyhedron \ell-rational if and only if it is of the form {xn|Mxb}\{x\!\in\!\mathbb{R}^{n}\;|\;Mx\!\leq\!b\} with Mk×nM\!\in\!\mathbb{Q}^{k\times n} and b=(b1,,bk)b\!=\!(b_{1},\ldots,b_{k}) satisfying bi=β1,ilog|α1|++βk,ilog|αk|b_{i}\!=\!\beta_{1,i}\log|\alpha_{1}|+\cdots+\beta_{k,i}\log|\alpha_{k}|, with βi,j,αj\beta_{i,j},\alpha_{j}\!\in\!\mathbb{Q} for all ii and jj. We measure the size of such a polyhedron as size(M)+size([bi,j])+i=1ksize(αi)\mathrm{size}(M)+\mathrm{size}([b_{i,j}])+\sum^{k}_{i=1}\mathrm{size}(\alpha_{i}). Clearly, it suffices to show that the following variant of Mixed-Vertex is 𝐍𝐏{\mathbf{NP}}-hard:

Logarithmic-Mixed-Vertex:
Given nn\in\mathbb{N} and \ell-rational polyhedra P1,,PnnP_{1},\ldots,P_{n}\!\subset\!\mathbb{R}^{n}, does P:=i=1nPiP:=\bigcap_{i=1}^{n}P_{i} have a mixed vertex? \blacksquare

Via an argument completely parallel to the last section, the 𝐍𝐏{\mathbf{NP}}-hardness of Logarithmic-Mixed-Vertex follows immediately from the 𝐍𝐏{\mathbf{NP}}-hardness of the following variant of Partition:

Logarithmic-Partition
Given dd\in\mathbb{N}, α1,,αd{0}\alpha_{1},\dots,\alpha_{d}\in\mathbb{N}\setminus\{0\}, is there a partition d=IJd\!=\!I\sqcup J such that iIlogαi=jJlogαj\sum_{i\in I}\log\alpha_{i}=\sum_{j\in J}\log\alpha_{j}? \blacksquare

We measure size in Logarithmic-Partition just as in the original Partition Problem: i=1dlogαd\sum^{d}_{i=1}\log\alpha_{d}. Note that Logarithmic-Partition is equivalent to the obvious variant of Partition where we ask for a partition making the two resulting products be identical. The latter problem is easily seen to be 𝐍𝐏{\mathbf{NP}}-hard as well, via an argument mimicking the original proof of the 𝐍𝐏{\mathbf{NP}}-hardness of Partition in Kar (72). \blacksquare

References

  • ABK (09) Allender, E.; Bürgisser, P.; Kjeldgaard-Pedersen, J.; and Miltersen, P. B., “On the Complexity of Numerical Analysis,” SIAM J. Comput. 38(5), pp. 1987–2006, 2009.
  • AKS (13) D’Andrea, Carlos; Krick, Teresa; Sombra, Martin, “Heights of varieties in multiprojective spaces and arithmetic Nullstellensatze,” Math ArXiV preprint 1103.4561 .
  • AGS (13) D’Andrea, Carlos; Galligo, André; and Sombra, Martin, “Quantitative Equidistribution for the Solution of a System of Sparse Polynomial Equations,” Advances in Mathematics, to appear.
  • AB (09) Arora, Sanjeev and Barak, Boaz, Computational complexity. A modern approach. Cambridge University Press, Cambridge, 2009.
  • AKNR (13) Avendaño, Martín; Kogan, Roman; Nisse, Mounir; and Rojas, J. Maurice, “Metric Estimates and Membership Complexity for Archimedean Amoebae and Tropical Hypersurfaces,” submitted for publication, also available as Math ArXiV preprint 1307.3681
  • Bak (77) Baker, Alan, “The theory of linear forms in logarithms,” in Transcendence Theory: Advances and Applications: proceedings of a conference held at the University of Cambridge, Cambridge, Jan.–Feb., 1976, Academic Press, London, 1977.
  • Bak (98) Baker, Alan, “Logarithmic forms and the abcabc-conjecture,” Number theory (Eger, 1996), pp. 37–44, de Gruyter, Berlin, 1998.
  • BR (10) Baker, Matthew and Rumely, Robert, Potential theory and dynamics on the Berkovich projective line,” Mathematical Surveys and Monographs, 159, American Mathematical Society, Providence, RI, 2010.
  • BHSW (13) Bates, Daniel J.; Hauenstein, Jonathan D.; Sommese, Andrew J.; and Wampler, Charles W., Numerically Solving Polynomial Systems with Bertini, Software, Environments and Tools series, Society for Industrial and Applied Mathematics, 2013.
  • BP (09) Beltrán, Carlos and Pardo, Luis M., “Smale’s 17th Problem: Average Polynomial Time to compute affine and projective solutions,” Journal of the American Mathematical Society, 22 (2009), pp. 363–385.
  • Ber (03) Bernstein, Daniel J., “Computing Logarithm Intervals with the Arithmetic-Geometric Mean Iterations,” available from http://cr.yp.to/papers.html .
  • BFP (13) Bettale, Luk; Faugére, Jean-Charles; and Perret, Ludovic, “Cryptanalysis of HFE, multi-HFE and variants for odd and even characteristic,” Designs, Codes and Cryptography, Oct. 2013, vol. 69, issue 1, pp. 1–52.
  • BRS (09) Bihan, Frédéric; Rojas, J. Maurice; and Stella, Casey, “Faster Real Feasibility via Circuit Discriminants,” proceedings of ISSAC 2009 (July 28–31, Seoul, Korea), pp. 39–46, ACM Press, 2009.
  • BCSS (98) Blum, Lenore; Cucker, Felipe; Shub, Mike; and Smale, Steve, Complexity and Real Computation, Springer-Verlag, 1998.
  • CKS (99) Cucker, Felipe; Koiran, Pascal; and Smale, Steve, “A polynomial time algorithm for Diophantine equations in one variable,” J. Symbolic Comput. 27 (1999), pp. 21–29.
  • BL (07) Bürgisser, P. and Lotz, M., “The complexity of computing the Hilbert polynomial of smooth equidimensional complex projective varieties,” Foundations of Computational Mathematics 7(1), pp. 51–86, 2007.
  • BS (09) Bürgisser, P. and Scheiblechner, P., “On the Complexity of Counting Components of Algebraic Varieties,” Journal of Symbolic Computation 44(9), pp. 1114–1136, 2009.
  • BC (10) Bürgisser, P. and Cucker, F., “Solving Polynomial Equations in Smoothed Polynomial Time and a Near Solution to Smale’s 17th Problem,” Proceedings STOC (Symposium on the the Theory of Computation) 2010, pp. 503-512, ACM Press, 2010.
  • dLRS (10) De Loera, Jesús A.; Rambau, Jörg; Santos, Francisco, Triangulations, Structures for algorithms and applications, Algorithms and Computation in Mathematics, 25, Springer-Verlag, Berlin, 2010.
  • EKL (06) Einsiedler, Manfred; Kapranov, Mikhail; and Lind, Douglas, “Non-archimedean amoebas and tropical varieties,” Journal für die reine und angewandte Mathematik (Crelles Journal), Vol. 2006, no. 601, pp. 139–157, December 2006.
  • EC (95) Emiris, Ioannis Z. and Canny, John, “Efficient Incremental Algorithms for the Sparse Resultant and Mixed Volume,” J. Symbolic Comput. 20 (1995), no. 2, pp. 117–149.
  • FGHR (13) Faugère, Jean-Charles; Gaudry, Pierrick; Huot, Louise; and Renault, Guénaël, “Using Symmetries in the Index Calculus for Elliptic Curves Discrete Logarithm,” Journal of Cryptology, pages 1-40, May 2013.
  • FHP (03) Faugère, Jean-Charles; Hering, Milena; and Phan, Jeffrey, “The membrane inclusions curvature equations,” Advances in Applied Mathematics, 31(4):643-658, June 2003.
  • GJ (79) Garey, Michael R. and Johnson, David S., Computers and Intractability: A Guide to the Theory of NP-Completeness, A Series of Books in the Mathematical Sciences, W. H. Freeman and Co., San Francisco, Calif., 1979.
  • GKZ (94) Gel’fand, Israel Moseyevitch; Kapranov, Misha M.; and Zelevinsky, Andrei V.; Discriminants, Resultants and Multidimensional Determinants, Birkhäuser, Boston, 1994.
  • GLS (93) Grötschel, Martin; Lovász, Laszlo; and Schrijver, Alex, Geometric Algorithms and Combinatorial Optimization, Springer-Verlag, New York, 1993.
  • Grü (03) Grünbaum, Branko, Convex Polytopes, 2nd ed. (edited by Zieger, G.), Graduate Texts in Mathematics, vol. 221, Springer-Verlag, 2003.
  • HHHLSZ (13) Hao, Wenrui; Hauenstein, Jonathan D.; Hu, Bei; Liu, Yuan; Sommese, Andrew; and Zhang, Yong-Tao, “ Continuation Along Bifurcation Branches for a Tumor Model with a Necrotic Core,” Journal of Scientific Computation, to appear.
  • HRS (13) Hauenstein, Jonathan; Rodriguez, Jose; and Sturmfels, Bernd, “Maximum likelihood for matrices with rank constraints,” Journal of Algebraic Statistics, to appear. Also available as Math ArXiV preprint 1210.0198 .
  • HHP (08) Herbst, Manfred; Hori, Kentaro; and Page, David, “Phases of N=2N\!=\!2 theories in 1+11+1 dimensions with boundary,” High Energy Physics ArXiV preprint 0803.2045v1 .
  • HL (95) Huan, Liang Jiao and Li, Tien-Yien, “Parallel homotopy algorithm for symmetric large sparse eigenproblems,” Journal of Computational and Applied Mathematics, Volume 60, Issues 1–2, 20 June 1995, pp. 77–100.
  • IMS (09) Itenberg, Ilia; Mikhalkin, Grigory; and Shustin, Eugenii, Tropical algebraic geometry, Second edition, Oberwolfach Seminars, 35, Birkhäuser Verlag, Basel, 2009.
  • Kar (72) Karp, Richard M., “Reducibility among combinatorial problems,” in Complexity of Computer Computations, (Miller, R. E. and Thatcher, J. W., eds.), Plenum Press, New York, pp. 85–103, 1972.
  • Kho (91) Khovanskii, Askold G., Fewnomials, AMS Press, Providence, Rhode Island, 1991.
  • Koi (96) Koiran, Pascal, “Hilbert’s Nullstellensatz is in the Polynomial Hierarchy,” DIMACS Technical Report 96-27, July 1996. (This preprint considerably improves the published version which appeared Journal of Complexity 12 (1996), no. 4, pp. 273–286.)
  • KPR (13) Koiran, Pascal; Portier, Natacha; and Rojas, J. Maurice, “Counting Tropically Degenerate Valuations and pp-adic Approaches to the Hardness of the Permanent,” submitted for publication, also available as Math ArXiV preprint 1309.0486 .
  • LL (11) Lee, Tsung-Lin and Li, Tien-Yien, “Mixed volume computation in solving polynomial systems,” in Randomization, Relaxation, and Complexity in Polynomial Equation Solving, Contemporary Mathematics, vol. 556, pp. 97–112, AMS Press, 2011.
  • LS (09) Tropical and Idempotent Mathematics, International Workshop TROPICAL-07 (Tropical and Idempotent Mathematics, Aug. 25—30, 2007, Independent University, edited by G. L. Litvinov and S. N. Sergeev), Contemporary Mathematics, vol. 495, AMS Press, 2009.
  • MS (13) Maclagan, Diane and Sturmfels, Bernd, Introduction to Tropical Geometry, in progress.
  • Mat (00) Matveev, E. M., “An explicit lower bound for a homogeneous rational linear form in logarithms of algebraic numbers. II.” Izv. Ross. Akad. Nauk Ser. Mat. 64 (2000), no. 6, pp. 125–180; translation in Izv. Math. 64 (2000), no. 6, pp. 1217–1269.
  • MM (90) Meintjes, Keith and Morgan, Alexander P., “Chemical equilibrium systems as numerical test problems,” ACM Toms, Vol. 16, No. 2, pp. 143–151, 1990.
  • Mik (04) Mikhalkin, Grigory, “Decomposition into pairs-of-pants for complex algebraic hypersurfaces,” Topology , vol. 43, no. 5, pp. 1035–1065, 2004
  • MT (02) Monteiro, Renato D.C. and Tsuchiya, Takashi, “A variant of the Vavasis-Ye layered-step interior-point algorithm for linear programming,” SIAM J. Optim., 13(4), pp. 1054–1079, 2002.
  • MFRCSD (13) Müller, Stefan; Feliu, Elisenda; Regensburger, Georg; Conradi, Carsten; Shiu, Anne; and Dickenstein, Alicia, “Sign conditions for injectivity of generalized polynomial maps with applications to chemical reaction networks and real algebraic geometry,” Math ArXiV preprint 1311.5493 .
  • Nes (03) Nesterenko, Yuri, “Linear forms in logarithms of rational numbers,” Diophantine approximation (Cetraro, 2000), pp. 53–106, Lecture Notes in Math., 1819, Springer, Berlin, 2003.
  • New (76) Newton, Isaac, letter to Oldenburg dated 1676 Oct 24, the correspondence of Isaac Newton, II, pp. 126–127, Cambridge University Press, 1960.
  • NP (10) Nilsson, Lisa and Passare, Mikael, “Discriminant coamoebas in dimension two,” J. Commut. Algebra, vol. 2, No. 4 (2010), pp. 447–471.
  • NAM (11) Ning, Y.; Avendaño, M. E.; and Mortari, D., “Sequential Design of Satellite Formations with Invariant Distances,” AIAA Journal of Spacecraft and Rockets, Vol. 48, No. 6, 2011, pp. 1025–1032.
  • NS (13) Nisse, Mounir and Sottile, Frank, “Phase limit set of a variety,” J. of Algebra and Number Theory, 7-2 (2013), pp. 339–352.
  • NS (14) Nisse, Mounir and Sottile, Frank, “Non-Archimedean Coamoebae,” in Tropical and Non-Archimedean Geometry, Contemporary Mathematics, vol. 605, AMS Press, 2014.
  • (51) Nitaj, Abderrahmane, The abc Conjecture Home Page, http://www.math.unicaen.fr/ nitaj/abc.html .
  • Ost (40) Ostrowski, Alexandre, “Recherches sur la méthode de Graeffe et les zéros des polynomes et des séries de Laurent,” Acta Math. 72, (1940), pp. 99–155.
  • Pap (95) Papadimitriou, Christos H., Computational Complexity, Addison-Wesley, 1995.
  • Pin (98) Pin, J.-E., “Tropical semirings,” Idempotency (Bristol, 1994), pp. 50–69, Publ. Newton Inst., 11, Cambridge Univ. Press, Cambridge, 1998.
  • Pla (84) Plaisted, David A., “New NP-Hard and NP-Complete Polynomial and Integer Divisibility Problems,” Theoret. Comput. Sci. 31 (1984), no. 1–2, pp. 125–138.
  • Sch (86) Schrijver, Alex, Theory of Linear and Integer Programming, John Wiley & Sons, 1986.
  • (57) Shub, Mike and Smale, Steve, “The Complexity of Bezout’s Theorem I: Geometric Aspects,” Journal of the American Mathematical Society 6 (1992), pp. 459–501.
  • (58) Shub, Mike and Smale, Steve, “The Complexity of Bezout’s Theorem II: Volumes and Probabilities,” Computational Algebraic Geometry (F. Eyssette and A. Galligo, Eds.), pp. 267–285, Birkhauser, 1992.
  • SS (93) Shub, Mike and Smale, Steve, “The Complexity of Bezout’s Theorem III: Condition Number and Packing,” Journal of Complexity 9 (1993), pp. 4–14.
  • SS (96) Shub, Mike and Smale, Steve, “The Complexity of Bezout’s Theorem IV: Probability of Success; Extensions,” SIAM J. Numer. Anal., Vol. 33, No. 1 (Feb., 1996), pp. 128–148.
  • SS (94) Shub, Mike and Smale, Steve, “The Complexity of Bezout’s Theorem V: Polynomial Time,” Theoretical Computer Science, Vol. 133, Issue 1, 10, October 1994, pp. 141–164.
  • Sip (12) Sipser, Michael, Introduction to the Theory of Computation, 3rd edition, Cengage Learning, 2012.
  • Sma (98) Smale, Steve, “Mathematical Problems for the Next Century,” Math. Intelligencer 20 (1998), no. 2, pp. 7–15.
  • Sma (00) Smale, Steve, “Mathematical Problems for the Next Century,” Mathematics: Frontiers and Perspectives, pp. 271–294, Amer. Math. Soc., Providence, RI, 2000.
  • SW (05) Sommese, Andrew J. and Wampler, Charles W., “The Numerical Solution to Systems of Polynomials Arising in Engineering and Science,” World Scientific, Singapore, 2005.
  • Ver (10) Verschelde, Jan, “Polynomial Homotopy Continuation with PHCpack,” ACM Communications in Computer Algebra 44(4):217-220, 2010.
  • Vir (01) Viro, Oleg Ya., “Dequantization of Real Algebraic Geometry on a Logarithmic Paper,” Proceedings of the 3rd European Congress of Mathematicians, Birkhäuser, Progress in Math, 201, (2001), pp. 135–146.
  • Zie (95) Ziegler, Gunter M., Lectures on Polytopes, Graduate Texts in Mathematics, Springer Verlag, 1995.