This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

3-Manifolds Everywhere

Danny Calegari Department of Mathematics
University of Chicago
Chicago, Illinois, 60637
dannyc@math.uchicago.edu
 and  Henry Wilton DPMMS
Centre for Mathematical Sciences
Wilberforce Road
Cambridge
CB3 0WB
UK
h.wilton@maths.cam.ac.uk
Abstract.

A random group contains many subgroups which are isomorphic to the fundamental group of a compact hyperbolic 3-manifold with totally geodesic boundary. These subgroups can be taken to be quasi-isometrically embedded. This is true both in the few relators model, and the density model of random groups (at any density less than a half).

1. Introduction

Geometric group theory was born in low-dimensional topology, in the collective visions of Klein, Poincaré and Dehn. Stallings used key ideas from 3-manifold topology (Dehn’s lemma, the sphere theorem) to prove theorems about free groups, and as a model for how to think about groups geometrically in general. The pillars of modern geometric group theory — (relatively) hyperbolic groups and hyperbolic Dehn filling, NPC cube complexes and their relations to LERF, the theory of JSJ splittings of groups and the structure of limit groups — all have their origins in the geometric and topological theory of 2- and 3-manifolds.

Despite these substantial and deep connections, the role of 3-manifolds in the larger world of group theory has been mainly to serve as a source of examples — of specific groups, and of rich and important phenomena and structure. Surfaces (especially Riemann surfaces) arise naturally throughout all of mathematics (and throughout science more generally), and are as ubiquitous as the complex numbers. But the conventional view is surely that 3-manifolds per se do not spontaneously arise in other areas of geometry (or mathematics more broadly) amongst the generic objects of study. We challenge this conventional view: 3-manifolds are everywhere.

1.1. Random groups

The “generic” objects in the world of finitely presented groups are the random groups, in the sense of Gromov. There are two models of what one means by a random group, and we shall briefly discuss them both.

First, fix k2k\geq 2 and fix a free generating set x1,x2,,xkx_{1},x_{2},\cdots,x_{k} for FkF_{k}, a free group of rank kk. A kk-generator group GG can be given by a presentation

G:=x1,x2,,xk|r1,r2,,rG:=\langle x_{1},x_{2},\cdots,x_{k}\;|\;r_{1},r_{2},\cdots,r_{\ell}\rangle

where the rir_{i} are cyclically reduced cyclic words in the xix_{i} and their inverses.

In the few relators model of a random group, one fixes 1\ell\geq 1, and then for a given integer nn, selects the rir_{i} independently and randomly (with the uniform distribution) from the set of all reduced cyclic words of length nn.

In the density model of a random group, one fixes 0<D<10<D<1, and then for a given integer nn, define =(2k1)Dn\ell=\lfloor(2k-1)^{Dn}\rfloor and select the rir_{i} independently and randomly (with the uniform distribution) from the set of all reduced cyclic words of length nn.

Thus, the difference between the two models is how the number of relations (\ell) depends on their length (nn). In the few relators model, the absolute number of relations is fixed, whereas in the density model, the (logarithmic) density of the relations among all words of the given length is fixed.

For fixed k,,nk,\ell,n in the few relators model, or fixed k,D,nk,D,n in the density model, we obtain in this way a probability distribution on finitely presented groups (actually, on finite presentations). For some property of groups of interest, the property will hold for a random group with some probability depending on nn. We say that the property holds for a random group with overwhelming probability if the probability goes to 1 as nn goes to infinity.

As remarked above, a “random group” really means a “random presentation”. Associated to a finite presentation of a group GG as above, one can build a 2-complex KK with one 0 cell, with one 1 cell for each generator xix_{i}, and with one 2 cell for each relation rjr_{j}, so that π1(K)=G\pi_{1}(K)=G. We are very interested in the geometry and combinatorics of KK (and its universal cover) in what follows.

1.2. Properties of random groups

All few-relator random groups are alike (with overwhelming probability). There is a phase transition in the behavior of density random groups, discovered by [Gromov(1993)] § 9: for D>1/2D>1/2, a random group is either trivial or isomorphic to /2\mathbb{Z}/2\mathbb{Z}, whereas at any fixed density 0<D<1/20<D<1/2, a random group is infinite, hyperbolic, and 1-ended, and the presentation determining the group is aspherical — i.e. the 2-complex KK defined from the presentation has contractible universal cover. Furthermore, [Dahmani–Guirardel–Przytycki(2011)] showed that a random group with density less than a half does not split, and has boundary homeomorphic to the Menger sponge.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 1. The boundary of a random group is a Menger sponge

The Menger sponge is obtained from the unit cube by subdividing it into 27 smaller cubes each with one third the side length, then removing the central cube and the six cubes centered on each face; and then inductively performing the same procedure with each of the remaining 20 smaller cubes; see Figure 1.

The Menger sponge has topological dimension 1, and is the universal compact space with this property, in the sense that any compact Hausdorff space of topological dimension 1 embeds in it.

It is important to understand what kinds of abstract groups HH arise as subgroups of a random group GG. However, not all subgroups of a hyperbolic group are of equal importance: the most useful subgroups, and those that tell us the most about the geometry of GG, are the subgroups HH with the following properties:

  1. (1)

    the group HH itself is something whose intrinsic topological and geometric properties we understand very well; and

  2. (2)

    the intrinsic geometry of HH can be uniformly compared with the extrinsic geometry of its embedding in GG.

In other words, we are interested in well-understood groups HH which are quasi-isometrically embedded in GG.

A finitely generated quasi-isometrically embedded subgroup HH of a hyperbolic group GG is itself hyperbolic, and therefore finitely presented. The inclusion of HH into GG induces an embedding of Gromov boundaries HG\partial_{\infty}H\to\partial_{\infty}G. Thus if GG is a random group, the boundary of HH has dimension at most 1, and by work of [Kapovich–Kleiner(2000)], there is a hierarchical description of all possible H\partial_{\infty}H.

First of all, if H\partial_{\infty}H is disconnected, then one knows by [Stallings(1968)] that HH splits over a finite group. Second of all, if H\partial_{\infty}H is connected and contains a local cut point, then one knows by [Bowditch(1998)] that either HH is virtually a surface group, or else HH splits over a cyclic group. Thus, apart from the Menger sponge itself, understanding hyperbolic groups with boundary of dimension at most 1 reduces (in some sense) to the case that H\partial_{\infty}H is a Cantor set, a circle, or a Sierpinski carpet — i.e. one of the “faces” of the Menger cube. The Sierpinski carpet is universal for 1-dimensional compact Hausdorff planar sets of topological dimension 1. Thus one is naturally led to the following fundamental question, which as far as we know was first asked explicitly by François Dahmani:

Question 1.2.1 (Dahmani).

Which of the three spaces — Cantor set, circle, Sierpinski carpet — arise as the boundary of a quasiconvex subgroup of a random group?

Combining the main theorem of this paper with the results of [Calegari–Walker(2015)] gives a complete answer to this question:

Answer.

All three spaces arise in a random group as the boundary of a quasiconvex subgroup (with overwhelming probability).

This is true in the few relators model with any positive number of relators, or the density model at any density less than a half. The situation is summarized as follows:

  1. (1)

    H\partial_{\infty}H is a Cantor set if and only if HH is virtually free (of rank at least 2); we can thus take H=π1(graph)H=\pi_{1}(\text{graph}). The existence of free quasiconvex subgroups HH in arbitrary (nonelementary) hyperbolic groups GG is due to Klein, by the ping-pong argument.

  2. (2)

    H\partial_{\infty}H is a circle if and only if HH is virtually a surface group (of genus at least 2); we can thus take H=π1(surface)H=\pi_{1}(\text{surface}). The existence of surface subgroups HH in random groups (with overwhelming probability) is proved by [Calegari–Walker(2015)], Theorem 6.4.1.

  3. (3)

    H\partial_{\infty}H is a Sierpinski carpet if HH is virtually the fundamental group of a compact hyperbolic 3-manifold with (nonempty) totally geodesic boundary; and if the Cannon conjecture is true, this is if and only if — see [Kapovich–Kleiner(2000)]. The existence of such 3-manifold subgroups HH in random groups (with overwhelming probability) is Theorem 6.2.1 in this paper.

Explicitly, we prove:

3-Manifolds Everywhere Theorem 6.2.1.

Fix k2k\geq 2. A random kk-generator group — either in the few relators model with 1\ell\geq 1 relators, or the density model with density 0<D<1/20<D<1/2 — and relators of length nn contains many quasi-isometrically embedded subgroups isomorphic to the fundamental group of a hyperbolic 3-manifold with totally geodesic boundary, with probability 1O(enC)1-O(e^{-n^{C}}) for some C>0C>0.

Our theorem applies in particular in the few relators model where =1\ell=1. In fact, if one fixes D<1/2D<1/2, and starts with a random 1-relator group G=Fk|rG=\langle F_{k}\;|\;r\rangle, one can construct a quasiconvex 3-manifold subgroup HH (of the sort which is guaranteed by the theorem) which stays injective and quasiconvex (with overwhelming probability) as a further (2k1)Dn(2k-1)^{Dn} relators are added.

1.3. Commensurability

Two groups are said to be commensurable if they have isomorphic subgroups of finite index. Commensurability is an equivalence relation, and it is natural to wonder what commensurability classes of 3-manifold groups arise in a random group.

We are able to put very strong constraints on the commensurability classes of the 3-manifold groups we construct. It is probably too much to hope to be able to construct a subgroup of a fixed commensurability class. But we can arrange for our 3-manifold groups to be commensurable with some element of a family of finitely generated groups given by presentations which differ only by varying the order of torsion of a specific element. Hence our 3-manifold subgroups are all commensurable with Kleinian groups of bounded convex covolume (i.e. the convex hulls of the quotients have uniformly bounded volume). Explicitly:

Commensurability Theorem 7.0.2.

A random group at any density <1/2<1/2 or in the few relators model contains (with overwhelming probability) a subgroup commensurable with the Coxeter group Γ(m)\Gamma(m) for some m7m\geq 7, where Γ(m)\Gamma(m) is the Coxeter group with Coxeter diagram

mm

The Coxeter group Γ(m)\Gamma(m) is commensurable with the group generated by reflections in the sides of a regular super ideal tetrahedron — one with vertices “outside” the sphere at infinity — and with dihedral angles π/m\pi/m.

1.4. Plan of the paper

We now describe the outline of the paper.

As discussed above, our random groups GG come together with the data of a finite presentation

G:=x1,,xk|r1,,rG:=\langle x_{1},\cdots,x_{k}\;|\;r_{1},\cdots,r_{\ell}\rangle

From such a presentation we can build in a canonical way a 2-complex KK with 1-skeleton XX, where XX is a wedge of kk circles, and KK is obtained by attaching \ell disks along loops in XX corresponding to the relators; so π1(K)=G\pi_{1}(K)=G.

Our 3-manifold subgroups arise as the fundamental groups of 2-complexes M¯(Z)\overline{M}(Z) that come with immersions M¯(Z)K\overline{M}(Z)\to K taking (open) cells of M¯(Z)\overline{M}(Z) homeomorphically to cells of KK. Thus, for every 2-cell of M¯(Z)\overline{M}(Z), the attaching map of its boundary to the 1-skeleton ZZ factors through a map onto one of the relators of the given presentation of GG.

One way to obtain such a complex M¯(Z)\overline{M}(Z) is to build a 1-complex ZZ as a quotient of a collection LL of circles together with an immersion ZXZ\to X where XX is the 1-skeleton of KK, and the map LXL\to X takes each component to the image of a relator. We call data of this kind a spine. In § 2 we describe the topology of spines and give sufficient combinatorial conditions on a spine to ensure that M¯(Z)\overline{M}(Z) is homotopic to a 3-manifold.

Since XX is a rose whose edges are endowed with a choice of orientation and labelling by the generators xix_{i}, we will usually encode a map of graphs ΓX\Gamma\to X by labelling (oriented) edges by xi±1x_{i}^{\pm 1}, in the spirit of [Stallings(1983)]. As usual, if an oriented edge ee of Γ\Gamma is labelled xi±1x_{i}^{\pm 1} then the oriented edge e¯\bar{e} with the reverse orientation is labelled xi1x_{i}^{\mp 1}. Note that there is a simple condition to ensure that such a map ΓX\Gamma\to X is an immersion: one simply requires that no two oriented edges of Γ\Gamma incident at the same vertex have the same label. We call such a graph Γ\Gamma folded (also in the spirit of [Stallings(1983)]).

In § 3 we prove the Thin Spine Theorem, which says that we can build such a spine LZL\to Z, satisfying the desired combinatorial conditions, and such that every edge of the 1-skeleton ZZ is long. Here we measure the length of edges of ZZ by pulling back length from XX under the immersion ZXZ\to X, where each edge of XX is normalized to have length 11. In fact, if we let G1G_{1} denote the 1-relator group

G1:=x1,,xk|r1G_{1}:=\langle x_{1},\cdots,x_{k}\;|\;r_{1}\rangle

with associated 2-complex K1K_{1} which comes with a tautological inclusion K1KK_{1}\to K, then our thin spines have the property that the immersion M¯(Z)K\overline{M}(Z)\to K factors through M¯(Z)K1\overline{M}(Z)\to K_{1}.

For technical reasons, rather than working with a random relator r1r_{1}, we work instead with a relator which is merely sufficiently pseudorandom (a condition concerning equidistribution of subwords with controlled error on certain scales), and the theorem we prove is deterministic. Of course, the definition of pseudorandom is such that a random word will be pseudorandom with very high probability.

Explicitly, we prove:

Thin Spine Theorem 3.1.2.

For any λ>0\lambda>0 there is TλT\gg\lambda and ϵ1/T\epsilon\ll 1/T so that, if rr is (T,ϵ)(T,\epsilon)-pseudorandom and KK is the 2-complex associated to the presentation G:=Fk|rG:=\langle F_{k}\;|\;r\rangle, then there is a spine f:LZf:L\to Z over KK for which LL is a union of 648 circles (or 5,832 circles if k=2k=2), and every edge of ZZ has length at least λ\lambda.

This is by far the longest section in the paper, and it involves a complicated combinatorial argument with many interdependent steps. It should be remarked that one of the key ideas we exploit in this section is the method of random matching with correction: randomness (actually, pseudorandomness) is used to show that the desired combinatorial construction can be performed with very small error. In the process, we build a reservoir of small independent pieces which may be adjusted by various local moves in such a way as to “correct” the errors that arose at the random matching step. Similar ideas were also used by [Kahn–Markovic(2011)] in their proof of the Ehrenpreis Conjecture, by [Calegari–Walker(2015)] in their construction of surface subgroups in random groups, and by [Keevash(2014)] in his construction of General Steiner Systems and Designs. Evidently this method is extremely powerful, and its full potential is far from being exhausted.

The Thin Spine Theorem can be summarized by saying that as a graph, ZZ has bounded valence, but very long edges. This means that the image of π1(Z)\pi_{1}(Z) in π1(X)\pi_{1}(X) induced by the inclusion ZXZ\to X is very “sparse”, in the sense that the ball of radius nn in π1(X)\pi_{1}(X) contains O(3n/λ)O(3^{n/\lambda}) elements of π1(Z)\pi_{1}(Z), where we can take λ\lambda as big as we like. This has the following consequence: when we obtain G1G_{1} as a quotient of π1(X)\pi_{1}(X) by adding r1r_{1} as a relator, we should not kill any “accidental” elements of π1(Z)\pi_{1}(Z), so that the image of π1(Z)\pi_{1}(Z) will be isomorphic to π1(M¯(Z))\pi_{1}(\overline{M}(Z)), a 3-manifold group.

This idea is fleshed out in § 4, and shows that random 1-relator groups contain 3-manifold subgroups, although at this stage we have not yet shown that the 3-manifold is of the desired form. The argument in this section depends on a so-called bead decomposition, which is very closely analogous to the bead decomposition used to construct surface groups by [Calegari–Walker(2015)], and the proof is very similar.

In § 5 we show that the 3-manifold homotopic to the 2-complex M¯(Z)\overline{M}(Z) is acylindrical; equivalently, that it is homeomorphic to a compact hyperbolic 3-manifold with totally geodesic boundary. This is a step with no precise analog in [Calegari–Walker(2015)], but the argument is very similar to the argument showing that M¯(Z)\overline{M}(Z) is injective in the 1-relator group GG^{\prime}. There are two kinds of annuli to rule out: those that use 2-cells of M¯(Z)\overline{M}(Z), and those that don’t. The annuli without 2-cells are ruled out by the combinatorics of the construction. Those that use 2-cells are ruled out by a small cancellation argument which uses the thinness of the spine. So at the end of this section, we have shown that random 1-relator groups contain subgroups isomorphic to the fundamental groups of compact hyperbolic 3-manifolds with totally geodesic boundary.

Finally, in § 6, we show that the subgroup π1(M¯(Z))\pi_{1}(\overline{M}(Z)) stays injective as the remaining 1\ell-1 random relators are added. The argument here stays extremely close to the analogous argument in [Calegari–Walker(2015)], and depends (as [Calegari–Walker(2015)] did) on a kind of small cancellation theory for random groups developed by [Ollivier(2007)]. This concludes the proof of the main theorem.

A further section § 7 proves the Commensurability Theorem. The proof is straightforward given the technology developed in the earlier sections.

2. Spines

2.1. Trivalent fatgraphs and spines

Before introducing spines, we first motivate them by describing the analogous, but simpler, theory of fatgraphs.

A fatgraph YY is a (simplicial) graph together with a cyclic ordering of the edges incident to each vertex. This cyclic ordering can be used to canonically “fatten” the graph YY so that it embeds in an oriented surface S(Y)S(Y) with boundary, in such a way that S(Y)S(Y) deformation retracts down to YY. Under this deformation retraction, the boundary S(Y)\partial S(Y) maps to YY in such a way that the preimage of each edge ee of YY consists of two intervals e±e^{\pm} in S(Y)\partial S(Y), each mapping homeomorphically to ee, with opposite orientations.

Abstractly, the data of a fatgraph can be given by an ordinary graph YY, a 1-manifold LL, and a locally injective simplicial map f:LYf:L\to Y of (geometric) “degree 2”; i.e. such that each edge of YY is in the image of two intervals in LL. The surface S(Y)S(Y) arises as the mapping cone of ff. If one orients LL and insists that the preimages of each edge have opposite orientations, the result is a fatgraph and an oriented surface as above. If one does not insist on the orientation condition, the mapping cone need not be orientable. Attaching a disk along its boundary to each component of LL produces a closed surface, which we denote S¯(Y)\overline{S}(Y).

We would like to discuss a more complicated object called a spine, for which the analog of S¯(Y)\overline{S}(Y) is a 2-complex homotopy equivalent to a compact 3-manifold with boundary. The 2-complex will arise by gluing 2-dimensional disks onto the components of a 1-manifold LL, and then attaching these disks to the mapping cone of an immersion f:LZf:L\to Z where ZZ is a 4-valent graph, and the map ff is subject to certain local combinatorial constraints.

The first combinatorial constraint is that the map f:LZf:L\to Z should be “degree 3”; that is, the preimage of each edge of LL should consist of three disjoint intervals in LL, each mapped homeomorphically by ff.

Since ZZ is 4-valent, at each vertex vv of ZZ we have 12 intervals in LL that map to the incident edges; these 12 intervals should be obtained by subdividing 6 disjoint intervals in LL, where the dividing point maps to vv. Since LZL\to Z is an immersion, near each dividing point the given interval in LL runs locally from one edge incident to vv to a different one. There are three local models (up to symmetry) of how six edges of LL can locally run over a 4-valent vertex vv of ZZ so that they run over each incident edge (in ZZ) to vv three times (this notion of “local model” is frequently called a Whitehead graph in the literature). These three local models are illustrated in Figure 2.

\labellist
\hair

2pt \pinlabelType A at -200 0 \endlabellistRefer to caption

Figure 2. Three local models

The third local model is distinguished by the property that for each pair of edges of ZZ adjacent to vv, there is exactly one interval in LL running over vv from one edge to the other. We say that such a local model is good. The second combinatorial constraint is that the local model at every vertex of ZZ is good.

If f:LZf:L\to Z is good, and M(Z)M(Z) is the mapping cone of ff, then the 2-complex M(Z)M(Z) can be canonically thickened to a 3-manifold, since a neighborhood of the mapping cone near a vertex vv embeds in 3\mathbb{R}^{3} in such a way that the tetrahedral symmetry of the combinatorics is realized by symmetries of the embedding. Similarly, along each edge the dihedral symmetry is realized by the symmetry of the embedding. The restriction of this thickening to each component of LL is the total space of an II-bundle over the circle; we say that f:LZf:L\to Z is co-oriented if each of these II-bundles is trivial. The third combinatorial constraint is that f:LZf:L\to Z is co-oriented.

Definition 2.1.1.

A spine is the data of a compact 1-manifold LL and a 4-valent graph ZZ, together with a co-oriented degree 3 immersion f:LZf:L\to Z whose local model at every vertex of ZZ is good. If f:LZf:L\to Z is a spine, we denote the mapping cone by M(Z)M(Z), and by M¯(Z)\overline{M}(Z) the 2-complex obtained by capping each component of LL in M(Z)M(Z) off by a disk.

Lemma 2.1.2 (Spine thickens).

Let f:LZf:L\to Z be a spine. Then M¯(Z)\overline{M}(Z) is canonically homotopy equivalent to a compact 3-manifold with boundary.

Proof.

We have already seen that M(Z)M(Z) has a canonical thickening to a compact 3-manifold in such a way that the restriction of this thickening to each component of LL is an II-bundle. The total space of this II-bundle is an annulus embedded in the boundary of M(Z)M(Z), and we may therefore attach a 2-handle with core the corresponding component of LL providing this II-bundle is trivial. But that is exactly the condition that f:LZf:L\to Z should be co-oriented. ∎

By abuse of notation, we call M¯(Z)\overline{M}(Z) the thickening of ZZ.

2.2. Tautological immersions

Now let’s fix k2k\geq 2 and a free group FkF_{k} on kk fixed generators. Let XX be a rose for FkF_{k}; i.e. a wedge of kk (oriented) circles, with a given labeling by the generators of FkF_{k}. Let GG be a random group (in whatever model) at length nn. Each relator rir_{i} is a cyclically reduced word in FkF_{k}, and is realized geometrically by an immersion of an oriented circle ιi:Si1X\iota_{i}:S^{1}_{i}\to X. Attaching a disk along each such circle gives rise to the 2-complex KK described in the introduction with π1(K)=G\pi_{1}(K)=G.

Definition 2.2.1.

A spine over KK is a spine f:LZf:L\to Z together with an immersion g:ZXg:Z\to X such that for each component LiL_{i} of LL, there is some relator rjr_{j} and a simplicial homeomorphism gi:LiSj1g_{i}:L_{i}\to S^{1}_{j} for which ιjgi=gf\iota_{j}g_{i}=gf.

The existence of the simplicial homeomorphisms gig_{i} lets us label the components LiL_{i} by the corresponding relators in such a way that the map f:LZf:L\to Z has the property that the preimages of each edge of ZZ get the same labels, at least if we choose orientations correctly, and use the convention that changing the orientation of an edge replaces the label by its inverse. So we can equivalently think of the labels as living on the oriented edges of ZZ. Notice that the maps gig_{i}, if they exist at all, are uniquely determined by g,f,ιjg,f,\iota_{j} (at least if the presentation is not redundant, so that no relator is equal to a conjugate of another relator or its inverse).

Evidently, if f:LZf:L\to Z, g:ZXg:Z\to X is a spine over KK, the immersion g:ZXg:Z\to X extends to an immersion of the thickening g¯:M¯(Z)K\overline{g}:\overline{M}(Z)\to K, that we call the tautological immersion.

Our strategy is to construct a spine over KK for which M¯(Z)\overline{M}(Z) is homotopy equivalent to a compact hyperbolic 3-manifold with geodesic boundary, and for which the tautological immersion induces a quasi-isometric embedding on π1\pi_{1}.

3. The Thin Spine Theorem

The purpose of this section is to prove the Thin Spine Theorem, the analog in our context of the Thin Fatgraph Theorem from [Calegari–Walker(2015)].

In words, this theorem says that if rr is a sufficiently long random cyclically reduced word in FkF_{k} giving rise to a random 1-relator group G:=Fk|rG:=\langle F_{k}\;|\;r\rangle with associated 2-complex KK, then with overwhelming probability, there is a good spine f:LZf:L\to Z over KK for which every edge of ZZ is as long as we like; colloquially, the spine is thin.

For technical reasons, we prove this theorem merely for sufficiently “pseudorandom” words, to be defined presently.

3.1. Pseudorandomness

Instead of working directly with random chains, we use a deterministic variant called pseudorandomness.

Definition 3.1.1.

Let Γ\Gamma be a cyclically reduced cyclic word in a free group FkF_{k} with k2k\geq 2 generators. We say Γ\Gamma is (T,ϵ)(T,\epsilon)-pseudorandom if the following is true: if we pick any cyclic conjugate of Γ\Gamma, and write it as a reduced product of reduced words {w1,,wn}\{w_{1},\ldots,w_{n}\} of length TT (and at most one word vv of length <T<T)

Γ:=w1w2w3wnv\Gamma:=w_{1}w_{2}w_{3}\cdots w_{n}v

(so n=|r|/Tn=\lfloor|r|/T\rfloor) then for every reduced word σ\sigma of length TT in FkF_{k}, there is an estimate

1ϵ#{i such that wi=σ}n(2k)(2k1)T11+ϵ1-\epsilon\leq\frac{\#\{i\text{ such that }w_{i}=\sigma\}}{n}\cdot(2k)(2k-1)^{T-1}\leq 1+\epsilon

Here the factor (2k)(2k1)T1(2k)(2k-1)^{T-1} is simply the number of reduced words in FkF_{k} of length TT. Similarly, we say that a collection of nn reduced words {wi}\{w_{i}\} each of length TT is ϵ\epsilon-pseudorandom if for every reduced word σ\sigma of length TT in FkF_{k} the estimate above holds.

For any T,ϵT,\epsilon, a random reduced word of length NN will be (T,ϵ)(T,\epsilon)-pseudorandom with probability 1O(eNc)1-O(e^{-N^{c}}) for a suitable constant c(T,ϵ)c(T,\epsilon). This follows immediately from the standard Chernoff inequality for the stationary Markov process that produces a random reduced word in a free group (cf. [Calegari–Walker(2015), Lemma 3.2.2]).

With this definition in place, the statement of the Thin Spine Theorem is:

Theorem 3.1.2 (Thin Spine Theorem).

For any λ>0\lambda>0 there is TλT\gg\lambda and ϵ1/T\epsilon\ll 1/T so that, if rr is (T,ϵ)(T,\epsilon)-pseudorandom and KK is the 2-complex associated to the presentation G:=Fk|rG:=\langle F_{k}\;|\;r\rangle, then there is a spine f:LZf:L\to Z over KK for which LL is a union of 648 circles (or 5,832 circles if k=2k=2), and every edge of ZZ has length at least λ\lambda.

The strange appearance of the number 648 (or 5,832 for k=2k=2) in the statement of this theorem reflects the method of proof. First of all, observe that if f:LZf:L\to Z is any spine, then since ff has degree 3, the total length of LL is divisible by 3. If this spine is over KK, then each component of LL has length |r||r|, and if we make no assumptions about the value of |r||r| mod 3, then it will be necessary in general for the number of components of LL to be divisible by 3.

Our argument is to gradually glue up more and more of LL, constructing ZZ as we go. At an intermediate stage, the remainder to be glued up consists of a collection of disjoint segments from LL, and the power of our method is precisely that this lets us reduce the gluing problem to a collection of independent subproblems of uniformly bounded size. But each of these subproblems must involve a subset of LL of total length divisible by 3 or 6, and therefore it is necessary to “clear denominators” (by taking 2 or 3 disjoint copies of the result of the partial construction) several times to complete the construction (in the case of rank 2 one extra move might require a further factor of 9).

Finally, at the last step of the construction, we take 2 copies of LL and perform a final adjustment to satisfy the co-orientation condition.

The remainder of this section is devoted to the proof of Theorem 3.1.2.

3.2. Graphs and types

Let LL be a labeled graph consisting of 648 disjoint cycles (or 5,832 disjoint cycles if k=2k=2), each labeled by rr. We will build the spine ZZ and the map f:LZf:L\to Z in stages. We think of ZZ as a quotient space of LL, obtained by identifying segments in LL with the same labels. So the construction of ZZ proceeds by inductively identifying more and more segments of LL, so that at each stage some portion of LL has been “glued up” to form part of the graph ZZ, and some remains still unglued.

We introduce the following notation and terminology. Let Λ0\Lambda_{0} be a single cycle labeled rr. At the iith stage of our construction, we deal with a partially glued graph Λi\Lambda_{i}, constructed from a certain number of copies of Λi1\Lambda_{i-1} via certain ungluing and gluing moves. At each stage, Λi\Lambda_{i} is equipped with a labelling, defining a map ΛiX\Lambda_{i}\to X. We will always be careful to ensure that Λi\Lambda_{i} is folded, i.e. that the map ΛiX\Lambda_{i}\to X is an immersion. The glued subgraph of Λi\Lambda_{i} is denoted by Γi\Gamma_{i}, and the unglued subgraph by Υi\Upsilon_{i}. Shortly, the unglued subgraph Υi\Upsilon_{i} will be expressed as the disjoint union of two subgraphs: the remainder Δi\Delta_{i} and the reservoir Ωi\Omega_{i}.

Each of these graphs are thought of as metric graphs, whose edges have lengths equal to the length of the words that label them. The mass of a metric graph Γ\Gamma is its total length, denoted by m(Γ)m(\Gamma). The type of a graph refers to the collection of edge labels (which are reduced words in FkF_{k}) associated to each edge. A distribution on a certain set of types of graphs is a map that assigns a non-negative number to each type; it is integral if it assigns an integer to each type. We use this terminology without comment in the sequel.

The following properties will remain true at every stage of our construction. The branch vertices of Λi\Lambda_{i} (those of valence greater than two) have valence four. The length of each edge will always be at least λ\lambda. We will be careful to ensure that any branch vertex in the interior of the glued part Γi\Gamma_{i} is good in the sense of Section 2. Vertices in the intersection of the glued and unglued parts, ΓiΥi\Gamma_{i}\cap\Upsilon_{i}, will always be branch vertices, and will be such that one adjacent edge is in Γi\Gamma_{i}, and the remaining three adjacent edges in Υi\Upsilon_{i} have distinct labels.

In particular, we start with Λ0=Υ0\Lambda_{0}=\Upsilon_{0} and Γ0=\Gamma_{0}=\varnothing. At the last stage of our construction we will have Λ8\Lambda_{8} constructed from 324 copies of Λ0\Lambda_{0} (or 2,916 if k=2k=2), which is completely glued up; that is, Γ8=Λ8\Gamma_{8}=\Lambda_{8} and Υ8=\Upsilon_{8}=\varnothing. Finally, the modification in Section 3.12 doubles the mass of Λ8\Lambda_{8} in order to ensure that the co-orientation condition is satisfied. Taking ZZ to be the result of this construction and ff to be the quotient map LZL\to Z proves the theorem.

3.3. Football bubbles

We will regard kk and λ\lambda as constants. The first step of the construction is to pick some very big constant NλN\gg\lambda where still TNT\gg N (we will explain in the sequel how to choose TT and NN big enough) so that NN is odd.

Let ss be the remainder when |r|3λ|r|-3\lambda is divided by 3(N+1)λ3(N+1)\lambda. By pseudorandomness, we may find three subsegments in rr of reduced form

a1xa2,b1xb2,c1xc2a_{1}xa_{2},\quad b_{1}xb_{2},\quad c_{1}xc_{2}

where ai,bi,cia_{i},b_{i},c_{i} are single edges, such that the labels a1,b1,c1a_{1},b_{1},c_{1} are all distinct and a2,b2,c2a_{2},b_{2},c_{2} are all distinct, and the length of xx is ss. We take three copies of Λ0\Lambda_{0}, fix one of the above subsegments in each copy, and glue the parts of these subsegments labeled xx together to obtain Λ1\Lambda_{1}. Note that the requirement that the labels ai,bi,cia_{i},b_{i},c_{i} are distinct ensures that Λ1\Lambda_{1} remains folded. We summarize this in the following lemma.

Lemma 3.3.1.

After gluing three copies of Λ0\Lambda_{0} along subsegments of length ss we obtain Λ1\Lambda_{1}, with the property that the length of each edge of the unglued subgraph Υ1\Upsilon_{1} is congruent to 3λ3\lambda modulo 3(N+1)λ3(N+1)\lambda. The glued subgraph Γ1\Gamma_{1} is a segment of length ss.

We next decompose the unglued subgraph Υ1\Upsilon_{1} into disjoint segments of length 3Nλ3N\lambda separated by segments of length 3λ3\lambda. Call the segments of length 3Nλ3N\lambda long strips and the segments of length 3λ3\lambda short strips. Now further decompose each long strip into alternating segments of length 3λ3\lambda; we call the odd numbered segments sticky and the even numbered segments free.

We will usually denote a long strip by

x1a2x3xNx_{1}a_{2}x_{3}\ldots x_{N}

where the xix_{i} are sticky, the aia_{i} (or bib_{i} etc) are free, and all are of length 3λ3\lambda. When we also need to include the neighboring short strips, we will usually extend this notation to

a0x1a2x3xNaN+1a_{0}x_{1}a_{2}x_{3}\ldots x_{N}a_{N+1}

where a0a_{0} and aN+1a_{N+1} (or b0,bN+1b_{0},b_{N+1} etc) denote the neighboring short strips.

Definition 3.3.2.

Three long strips are compatible if they (and their adjoining short strips) are of the form

a0x1a2x3aN+1,b0x1b2x3bN+1,c0x1c2x3cN+1a_{0}x_{1}a_{2}x_{3}\cdots a_{N+1},\quad b_{0}x_{1}b_{2}x_{3}\cdots b_{N+1},\quad c_{0}x_{1}c_{2}x_{3}\cdots c_{N+1}

(i.e. if their sticky segments agree) and if for even ii (i.e. for the free segments) the letters adjacent to each xi+1x_{i+1} or xi1x_{i-1} disagree.

A compatible triple of long strips can be bunched — i.e. the sticky segments can be glued together in threes, creating (N1)/2(N-1)/2 football bubbles, each football consisting of the three segments ai,bi,cia_{i},b_{i},c_{i} (for some ii) arranged as the edges of a theta graph. See Figure 3.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 3. Bunching three long strips to create football bubbles

By pseudorandomness, a proportion of approximately (1ϵ)(1-\epsilon) of the long strips in Γ1\Gamma_{1} can be partitioned into compatible triples. Then each compatible triple can be bunched, creating a reservoir of footballs (i.e. the theta graphs appearing as bubbles) and a remainder, consisting of the union of the unglued pieces (except for the footballs). In what follows, we will denote the remainder in Υi\Upsilon_{i} by Δi\Delta_{i} and the reservoir by Ωi\Omega_{i}.

We summarize this observation in the next lemma. By an extended long strip, we mean a long strip, together with half the adjacent short strips.

Lemma 3.3.3.

Suppose that T3(N+1)λT\geq 3(N+1)\lambda, and that NN is sufficiently large and ϵ\epsilon is sufficiently small. After bunching compatible triples in the unglued graph Υ1\Upsilon_{1} we obtain the partially glued graph Λ2\Lambda_{2} with the following properties.

  1. (1)

    The total mass of the unglued subgraph Υ2\Upsilon_{2} satisfies

    m(Υ2)3m(Λ0)12+O(cN)ϵ\frac{m(\Upsilon_{2})}{3m(\Lambda_{0})}\leq\frac{1}{2}+O(c^{N})\epsilon

    for a constant c=c(k,λ)c=c(k,\lambda).

  2. (2)

    We can decompose the unglued subgraph Υ2\Upsilon_{2} as a disjoint union Δ2Ω2\Delta_{2}\sqcup\Omega_{2}. The reservoir Ω2\Omega_{2} is a disjoint union of bubbles.

  3. (3)

    The mass of the remainder satisfies

    m(Δ2)3m(Λ0)=O(1/N)\frac{m(\Delta_{2})}{3m(\Lambda_{0})}=O(1/N)

    as long as ϵ<O(cN)\epsilon<O(c^{-N}).

  4. (4)

    The distribution of the types of bubbles in the reservoir is within O(ϵ)O(\epsilon) of a constant distribution (independent of NN and ϵ\epsilon).

Proof.

The number of types of extended long strips is O(cN)O(c^{N}) for some constant c=c(k,λ)c=c(k,\lambda). The unglued subgraph Υ1\Upsilon_{1} is still (T,ϵ)(T,\epsilon)-pseudorandom, containing a union of

m(Υ1)3(N+1)λ\frac{m(\Upsilon_{1})}{3(N+1)\lambda}

extended long strips. By pseudorandomness, we may restrict to a subset Υ1Υ1\Upsilon^{\prime}_{1}\subseteq\Upsilon_{1} of mass at least (1O(cN)ϵ)m(Υ1)(1-O(c^{N})\epsilon)m(\Upsilon_{1}) so that the types of extended long strips in Υ1\Upsilon^{\prime}_{1} are exactly uniformly distributed. (Here we use that T3(N+1)λT\geq 3(N+1)\lambda.) We then randomly choose a partition into compatible triples, and perform bunching, to produce Λ2\Lambda_{2}.

We estimate the mass of the unglued subgraph as follows.

m(Υ2)\displaystyle m(\Upsilon_{2}) =\displaystyle= m(Υ1Υ1)+m(Υ1)2\displaystyle m(\Upsilon_{1}\smallsetminus\Upsilon^{\prime}_{1})+\frac{m(\Upsilon^{\prime}_{1})}{2}
\displaystyle\leq O(cN)ϵm(Υ1)+(1O(cN)ϵ)2m(Υ1)\displaystyle O(c^{N})\epsilon m(\Upsilon_{1})+\frac{(1-O(c^{N})\epsilon)}{2}m(\Upsilon_{1})
=\displaystyle= (1+O(cN)ϵ)2m(Υ1)\displaystyle\frac{(1+O(c^{N})\epsilon)}{2}m(\Upsilon_{1})

and (1) follows immediately.

As described above, the unglued subgraph Υ2\Upsilon_{2} is naturally a disjoint union of the remainder Δ2\Delta_{2} and the reservoir Ω2\Omega_{2}. The remainder consists, by definition, of the union of Υ1Υ1\Upsilon_{1}\smallsetminus\Upsilon^{\prime}_{1} and the short strips in Υ1\Upsilon^{\prime}_{1} (after gluing). Hence

m(Δ2)<O(cN)ϵm(Υ1)+1O(cN)ϵN+1m(Υ1)=(1+O(cN)ϵ)N+1m(Υ1)m(\Delta_{2})<O(c^{N})\epsilon m(\Upsilon_{1})+\frac{1-O(c^{N})\epsilon}{N+1}m(\Upsilon_{1})=\frac{(1+O(c^{N})\epsilon)}{N+1}m(\Upsilon_{1})

which is O(1/N)O(1/N) as long as ϵ<O(cN)\epsilon<O(c^{-N}).

Bunching uniformly distributed long strips at random induces a fixed distribution on subgraphs of bounded size. This justifies the final assertion. ∎

3.4. Super-compatible long strips

At this point, we have bunched all but ϵ\epsilon of the long strips into triples. In particular, for sufficiently small ϵ\epsilon, the number of bunched triples is far larger than the number of unbunched long strips. We will now argue that we may, in fact, adjust the construction so that every long strip is bunched. The advantage of this is that after this step, the unglued part consists entirely of the reservoir (a union of football bubbles) and a remainder consisting of a trivalent graph in which every edge has length exactly 3λ3\lambda. The key to this operation is the idea a super-compatible 4-tuple of long segments.

Definition 3.4.1.

Four long strips are super-compatible if they are of the form

a0x1a2x3aN+1,b0x1b2x3bN+1,c0x1c2x3cN+1,d0x1d2x3dN+1a_{0}x_{1}a_{2}x_{3}\cdots a_{N+1},\quad b_{0}x_{1}b_{2}x_{3}\cdots b_{N+1},\quad c_{0}x_{1}c_{2}x_{3}\cdots c_{N+1},\quad d_{0}x_{1}d_{2}x_{3}\cdots d_{N+1}

(i.e. if their sticky segments agree) and if for even ii (i.e. for the free segments) the initial and terminal letters of ai,bi,ci,dia_{i},b_{i},c_{i},d_{i} disagree. Alternatively, such a 4-tuple is super-compatible if every sub-triple is compatible.

Remark 3.4.2.

Notice that the existence of super-compatible 4-tuples depends on rank k3k\geq 3. An alternative method to eliminate unbunched long strips in rank 2 is given in § 3.13.

Lemma 3.4.3.

Let Λ2\Lambda_{2} be as above. As long as k3k\geq 3 and ϵ\epsilon is sufficiently small (depending on NN), one can injectively assign to each long strip S0S_{0} in the remainder Δ2\Delta_{2} a bunched triple (S1,S2,S3)(S_{1},S_{2},S_{3}) in Λ2\Lambda_{2} such that the quartet (S0,S1,S2,S3)(S_{0},S_{1},S_{2},S_{3}) is super-compatible.

Proof.

Let S0S_{0} be an unglued long strip. As long as k3k\geq 3, it is clear that there is at least one type of bunched triple (S1,S2,S3)(S_{1},S_{2},S_{3}) such that (S0,S1,S2,S3)(S_{0},S_{1},S_{2},S_{3}) is super-compatible. The total number of types of bunched triples is O(DN)O(D^{N}) for some D=D(k,λ)D=D(k,\lambda). Therefore, the proportion of bunched triples that are super-compatible with S0S_{0} is at least 1/O(DN)O(ϵ)1/O(D^{N})-O(\epsilon). On the other hand, the proportion of long strips that are unbunched is O(ϵ)O(\epsilon). Therefore, we can choose a bunched triple for every unbunched long strip as long as O(ϵ)<1/O(DN)O(\epsilon)<1/O(D^{N}). ∎

By the previous lemma, we may assign to each unglued long strip S0S_{0} in the remainder Δ2\Delta_{2} a glued triple (S1,S2,S3)(S_{1},S_{2},S_{3}) such that (S0,S1,S2,S3)(S_{0},S_{1},S_{2},S_{3}) is super-compatible. We may re-bunch these into the four possible compatible triples that are subsets of our super-compatible 4-tuple, viz:

(S0,S1,S2),(S0,S1,S3),(S0,S2,S3),(S1,S2,S3)(S_{0},S_{1},S_{2}),~(S_{0},S_{1},S_{3}),~(S_{0},S_{2},S_{3}),~(S_{1},S_{2},S_{3})

The result of performing this operation on every long strip in the remainder Δ2\Delta_{2} yields the new partially glued graph Λ3\Lambda_{3}.

Lemma 3.4.4.

Take 3 copies of Λ2\Lambda_{2} as above, and suppose that NN is sufficiently large and ϵ\epsilon is sufficiently small. After choosing super-compatible triples and re-bunching so that every long strip is bunched, we produce the partially glued graph Λ3\Lambda_{3} with the following properties.

  1. (1)

    The partially glued graph Λ3\Lambda_{3} consists of a glued subgraph Γ3\Gamma_{3}, a reservoir Ω3\Omega_{3}, and a remainder Δ3\Delta_{3}.

  2. (2)

    The reservoir Ω3\Omega_{3} consists of football bubbles and its mass is bounded below by

    m(Ω3)m(Λ0)O(1)\frac{m(\Omega_{3})}{m(\Lambda_{0})}\geq O(1)
  3. (3)

    The remainder Δ3\Delta_{3} is a trivalent graph with each edge of length 3λ3\lambda, and its mass is bounded above by

    m(Δ3)m(Λ0)1/O(N)\frac{m(\Delta_{3})}{m(\Lambda_{0})}\leq 1/O(N)
  4. (4)

    Let ρ\rho be the uniform distribution on types of bubbles. Then for any type BB of bubble, the proportion of bubbles in the reservoir of type BB is within O(cN)ϵO(c^{N})\epsilon of ρ\rho.

Proof.

Consider Υ1\Upsilon_{1}, the unglued subgraph of Λ1\Lambda_{1}. Then Υ1\Upsilon_{1} is the union of three arcs, of total mass 3C(N+1)λ+9λ3C(N+1)\lambda+9\lambda, where CC is the number of long strips in Υ1\Upsilon_{1}. Recall that Λ3\Lambda_{3} is constructed from three copies of Λ1\Lambda_{1}, and that for each long strip, one short strip goes into the remainder, and (N1)/2(N-1)/2 go into the reservoir. Therefore, we have that

m(Ω3)3m(Υ1)9λ=N12N+2\frac{m(\Omega_{3})}{3m(\Upsilon_{1})-9\lambda}=\frac{N-1}{2N+2}

and

m(Δ3)9λ3m(Υ1)9λ=1N+1\frac{m(\Delta_{3})-9\lambda}{3m(\Upsilon_{1})-9\lambda}=\frac{1}{N+1}

Since m(Υ1)3m(Λ0)O(N)m(\Upsilon_{1})\geq 3m(\Lambda_{0})-O(N), the estimates (2) and (3) follow.

Finally, we need to check that the re-gluing only has a small effect on the distribution of bubble types. Recall that the unglued long strips in Υ2\Upsilon_{2} are precisely the images of the unglued long strips in Υ1Υ1\Upsilon_{1}\smallsetminus\Upsilon^{\prime}_{1}, which is of mass at most O(cN)ϵm(Υ1)O(c^{N})\epsilon m(\Upsilon_{1}).

Taking three copies of each of these and three copies of a super-compatible triple, re-bunching produces four new bunched triples. In particular, for each three unbunched long strips, we destroy (N1)/2(N-1)/2 bubbles and replace them with 2(N1)2(N-1) new bubbles of different types. The proportion of unglued long strips was at most O(cN)ϵO(c^{N})\epsilon. It follows that the proportional distribution of each bubble type was altered by at most O(cN)ϵO(c^{N})\epsilon, so the final estimate follows. ∎

3.5. Inner and outer reservoirs and slack

As their name indicates, the bubbles in the reservoir will be held in reserve until a later stage of the construction to glue up the remainder. Some intermediate operations on the remainder have “boundary effects”, which might disturb the neighboring bubbles in the reservoir in a predictable way. So it is important to insulate the remainder with a collar of bubbles which we do not disturb accidentally in subsequent operations.

Fix a constant 0<θn<10<\theta_{n}<1 and divide each long strip in Λn\Lambda_{n} into two parts: and inner reservoir, consisting of an innermost sequence of consecutive bubbles of length 1θn1-\theta_{n} times the length of the long strip, and an outer reservoir, consisting of two outer sequences of consecutive bubbles of length θn/2\theta_{n}/2. The number θn\theta_{n} is called the slack. Boundary effects associated to each step that we perform will use up bubbles from the outer reservoir and at most halve the slack. Since the number of steps we perform is uniformly bounded, it follows that — provided NN is sufficiently large — even at the end of the construction we will still have a significant outer reservoir with which to work.

3.6. Adjusting the distribution

After collecting long strips into compatible triples, the collection of football bubbles in the reservoir is “almost equidistributed”, in the sense that the mass of any two different types of football is almost equal (up to an additive error of order ϵ\epsilon). However, it is useful to be able to adjust the pattern of gluing in order to make the distribution of football bubbles conform to some other specified distribution (again, up to an additive error of order ϵ\epsilon).

This operation has an unpredictable effect on the remainder, transforming it into some new 3-valent graph (of some possibly very different combinatorial type); however, it preserves the essential features of the remainder that are known to hold at this stage of the construction: every edge of the remainder (after the operation) has length exactly 3λ3\lambda; and the total mass of the remainder before and after the operation is unchanged (so that it is still very small compared to the mass of any given football type).

Let μ\mu be a probability measure on the set of all football types, with full support — i.e. so that μ\mu is strictly positive on every football type. (In the sequel, μ\mu will be the cube distribution described below, but that is not important at this stage.) Suppose we have three long strips of the form

a0x1a2x3aN+1,b0x1b2x3bN+1,c0x1c2x3cN+1a_{0}x_{1}a_{2}x_{3}\cdots a_{N+1},\quad b_{0}x_{1}b_{2}x_{3}\cdots b_{N+1},\quad c_{0}x_{1}c_{2}x_{3}\cdots c_{N+1}

so that the result of the gluing produces (N+1)/2(N+1)/2 bunches each with the label xix_{i} (for ii odd), and (N1)/2(N-1)/2 football bubbles each with the label (ai,bi,ci)(a_{i},b_{i},c_{i}) (for ii even). We think of the (ai,bi,ci)(a_{i},b_{i},c_{i}) as unordered triples — i.e. we only think of the underlying football as an abstract graph with edge labels up to isomorphism. A given sequence of labels xix_{i} and (unordered!) football types (ai,bi,ci)(a_{i},b_{i},c_{i}) might arise from three long strips s,t,us,t,u in 6(N+3)/26^{(N+3)/2} ways, since there are 6 ways to order each triple ai,bi,cia_{i},b_{i},c_{i}.

Let ρ\rho denote the uniform probability measure on football types, and let μ\mu^{\prime} be chosen to be a multiple of μ\mu such that ρ>μ\rho>\mu^{\prime} for all types. Fix θ4>0\theta_{4}>0 such that minμ/ρ>θ4\min\mu^{\prime}/\rho>\theta_{4}. As the notation hints, θ4\theta_{4} will turn out to be the slack in the partially glued graph Λ4\Lambda_{4}, and we accordingly partition each long strip of Λ3\Lambda_{3} into an inner reservoir, of proportional length (1θ4)(1-\theta_{4}), and an outer reservoir consisting of two strips of proportional length θ4/2\theta_{4}/2.

We put all possible triples of long strips labeled as above into a bucket. Next, color each even index ii in [Nθ4/2,N(1θ4/2)][N\theta_{4}/2,N(1-\theta_{4}/2)] (i.e. those contained in the inner reservoir) black with probability

(ρμ(i))(1θ4)ρ\frac{(\rho-\mu^{\prime}(i))}{(1-\theta_{4})\rho}

and color all the remaining indices white, where μ(i)\mu^{\prime}(i) is short for the μ\mu^{\prime} measure of the football type (ai,bi,ci)(a_{i},b_{i},c_{i}) (note that our choice of θ4\theta_{4} guarantees that the assigned probabilities are never greater than 1).

Now pull apart all the triples of long strips, and match them into new triples s,t,us,t,u according to the following rule: if a given index ii is white, the corresponding labels si,ti,uis_{i},t_{i},u_{i} should all be different, and equal to ai,bi,cia_{i},b_{i},c_{i} (in some order); if a given index ii is black, the corresponding labels si,ti,uis_{i},t_{i},u_{i} should all be the same, and equal to exactly one of ai,bi,cia_{i},b_{i},c_{i}. Then we can glue up s,t,us,t,u to produce footballs (ai,bi,ci)(a_{i},b_{i},c_{i}) exactly for the white labels, and treat the black labels as part of the neighboring sticky segments, so that they are entirely glued up.

We do this operation for each bucket (i.e. for each collection of triples with a given sequence of sticky types xix_{i} and football types (ai,bi,ci)(a_{i},b_{i},c_{i})). The net effect is to eliminate a fraction of approximately (ρμ(i))/ρ(\rho-\mu^{\prime}(i))/\rho of the footballs with label ii; thus, at the end of this operation, the distribution of footballs is proportional to μ\mu, with error of order ϵ\epsilon.

Although this adjustment operation can achieve any desired distribution μ\mu, in practice we will set μ\mu equal to the cube distribution, to be described in the sequel. In any case, for a fixed choice of distribution μ\mu, the slack θ4\theta_{4} only depends on kk and λ\lambda, and therefore can be treated as a constant.

We summarize this in the following lemma.

Lemma 3.6.1.

Let μ\mu be a probability distribution on the set of types of football bubbles and let NN be sufficiently large. The adjustment described above transforms Λ3\Lambda_{3} into a new partially glued graph Λ4\Lambda_{4} with the following properties.

  1. (1)

    The mass of the reservoir Ω4\Omega_{4} is bounded below by a constant depending only on k,λk,\lambda and μ\mu.

    m(Ω4)m(Λ0)O(1)\frac{m(\Omega_{4})}{m(\Lambda_{0})}\geq O(1)
  2. (2)

    The mass of the remainder Δ4\Delta_{4} is bounded above by

    m(Δ4)9m(Λ0)1/N\frac{m(\Delta_{4})}{9m(\Lambda_{0})}\leq 1/N
  3. (3)

    The distribution of football types in the reservoir is proportional to μ\mu, with error O(cN)ϵO(c^{N})\epsilon.

  4. (4)

    The slack θ4\theta_{4} is a constant.

Proof.

As noted above, this operation may completely change the combinatorial type of the remainder, but leaves invariant its total mass, and the fact that it is a 3-valent graph with edges of mass 3λ3\lambda. In particular,

m(Δ4)9m(Λ0)1N+1\frac{m(\Delta_{4})}{9m(\Lambda_{0})}\approx\frac{1}{N+1}

and so m(Δ4)/9m(Λ0)1/Nm(\Delta_{4})/9m(\Lambda_{0})\leq 1/N for sufficiently large NN. This proves item 2.

We lose a fraction of the reservoir—those indices colored black. An index ii between Nθ4/2N\theta_{4}/2 and N(1θ4/2)N(1-\theta_{4}/2) is colored black with probability (ρμ(i))/ρ(\rho-\mu^{\prime}(i))/\rho. Therefore, the proportion of bubbles colored white is bounded below by miniμ(i)/ρ\min_{i}\mu^{\prime}(i)/\rho, and so the mass of the reservoir is bounded below by

m(Ω4)3m(Υ1)9λ(miniμ(i)/ρ)N12N+2\frac{m(\Omega_{4})}{3m(\Upsilon_{1})-9\lambda}\geq\left(\min_{i}\mu^{\prime}(i)/\rho\right)\frac{N-1}{2N+2}

Since μ\mu^{\prime} is a distribution on types of bubbles, depending only on λ\lambda and kk, the first item holds as long as NN is sufficiently large.

We next estimate the distributions of the types of bubbles. Before adjustment, the proportion of each type of bubble in the reservoir Ω3\Omega_{3} was within O(cN)ϵO(c^{N})\epsilon of the uniform distribution. These can be taken to be uniformly distributed between the inner and outer reservoirs. Therefore, after adjustment, the new distribution ν\nu on bubble types satisfies

|νμ|(μ/ρ)O(cN)ϵ|\nu-\mu^{\prime}|\leq(\mu^{\prime}/\rho)O(c^{N})\epsilon

and so, as before, since μ(i)\mu^{\prime}(i) is bounded above in terms of λ\lambda and kk, the third assertion follows.

The final assertion about the slack is immediate from the construction. ∎

3.7. Tearing up the remainder

At this stage the remainder consists of a 3-valent graph in which every edge has length exactly 3λ3\lambda. The total mass of the remainder is very small compared to the mass of the reservoir, but it is large compared to the size of a single long strip. Furthermore, there is no a priori bound on the combinatorial complexity of a component of the remainder.

We explain how to modify the gluing by a certain local move called a tear111“tear” in the sense of: “There were tears in her big brown overcoat”, which (inductively) reduces the combinatorial complexity of the remainder (which a priori is arbitrarily complicated) until it consists of a disjoint collection of simple pieces. These pieces come in three kinds:

  1. (1)

    football bubbles;

  2. (2)

    bizenes: these are graphs with 6 edges and 4 vertices, obtained from a square by doubling two (non-adjacent) edges; and

  3. (3)

    bicrowns: these are complete bipartite graphs K3,3K_{3,3}.

The bubbles, bizenes and bicrowns all have edges of length exactly 3λ3\lambda. They are depicted in Figure 4. Note that bizenes doubly cover footballs and bicrowns triply cover footballs. If the labels on a bizene or bicrown happen to be pulled back from the labels on a football bubble via the covering map, then we say that the bizene or bicrown is of covering type.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 4. Football bubbles, bizenes, and bicrowns
\labellist
\hair

2pt \endlabellistRefer to caption

Figure 5. Tearing at pp and pp^{\prime}. The “before” picture is on the left, and the “after” picture on the right. The graph Λ4Λ4\Lambda_{4}\cup\Lambda^{\prime}_{4} is transformed by being disconnected at pp and pp^{\prime} and having three new bigons inserted which connect the new six 1-valent ends of Λ4Λ4pp\Lambda_{4}\cup\Lambda^{\prime}_{4}-p-p^{\prime} in pairs. The move also destroys 8 footballs, and creates two bicrowns.

We now describe the operation of tearing. Take two copies of Λ4\Lambda_{4}, denoted by Λ4\Lambda_{4} and Λ4\Lambda^{\prime}_{4} (with vertices, edges and subgraphs of Λ4\Lambda^{\prime}_{4} denoted with primes in the obvious manner) and let pp and pp^{\prime} be branch vertices of Δ4\Delta_{4} and Δ4\Delta^{\prime}_{4} respectively. These vertices are the ends of disjoint sequences of alternating bunched triples and football bubbles. The tearing operation uses up two (appropriately labeled) sequences of alternating bunched triples and football bubbles, each of length 3. The precise definition of this operation is best given by example, and is illustrated in Figure 5.

On the left of the figure we have the vertices pp and pp^{\prime} of Λ4\Lambda_{4} and Λ4\Lambda^{\prime}_{4}, together with two strings of 33 bubbles. These strings are pulled apart and reglued according to the combinatorics indicated in the figure. Thus, the labels on the bunched triples at each horizontal level should all agree, and the labels on the footballs should be such that the result of the gluing is still folded. The existence of strings of football bubbles with these properties is guaranteed by pseudorandomness and the definition of the long strips.

Thus the operation of tearing uses up 8 footballs (as in the figure), and it has several effects on the remainder. First, Λ4Λ4\Lambda_{4}\cup\Lambda^{\prime}_{4} is pulled apart at pp and pp^{\prime}, producing six new vertices p1,p2,p3p_{1},p_{2},p_{3} and p1,p2,p3p_{1}^{\prime},p_{2}^{\prime},p_{3}^{\prime}, and adding two new edges xix_{i} and xix_{i}^{\prime} (from the footballs separating the adjacent bunched triples) joining pip_{i} to pip_{i}^{\prime}. Second, two new bicrowns are created, assembled from the pieces of three identically labeled footballs. Third, the slack at pp and pp^{\prime} is reduced to approximately half of its previous value. If the strings of three football segments are taken from the inner half of the outer reservoir, it will reduce the slack at the vertices of Λ4Λ4\Lambda_{4}\cup\Lambda^{\prime}_{4} at the end of these strips; but the size of the slack at these vertices will stay large.

Lemma 3.7.1.

Suppose that NN is sufficiently large, ϵ\epsilon is sufficiently small and the partially glued graph Λ4\Lambda_{4} is as above. By applying tearing operations to 2 copies of Λ4\Lambda_{4} we may build a partially glued graph Λ5\Lambda_{5} with the following properties.

  1. (1)

    The mass of the reservoir Ω5\Omega_{5} is bounded below by a constant depending only on k,λk,\lambda and μ\mu.

    m(Ω5)m(Λ0)O(1)\frac{m(\Omega_{5})}{m(\Lambda_{0})}\geq O(1)
  2. (2)

    The remainder Δ5\Delta_{5} is a disjoint union of bizenes, and bicrowns of covering type, with mass bounded above by

    m(Δ5)m(Λ0)O(1/N)\frac{m(\Delta_{5})}{m(\Lambda_{0})}\leq O(1/N)
  3. (3)

    The distribution of football bubbles of each type is within O(1/N)O(1/N) of the distribution μ\mu^{\prime} (proportional to μ\mu).

  4. (4)

    The slack satisfies θ5θ4/3\theta_{5}\geq\theta_{4}/3.

Proof.

As described above, we construct Λ5\Lambda_{5} from two copies of Λ4\Lambda_{4}: let us denote them by Λ4\Lambda_{4} and Λ4\Lambda^{\prime}_{4}, and likewise denote subgraphs, vertices and edges with primes as appropriate. At each branch vertex pp of Δ4\Delta_{4} we have three unglued (elementary) edges with labels a,b,ca,b,c (pointing away from pp, say), and one glued edge with label dd (also pointing away from pp). Denote by exe_{x} the short strip in Δ4\Delta_{4} incident at pp with outgoing label xx. Let pp^{\prime} be the corresponding vertex of Δ4\Delta^{\prime}_{4}, which is of course locally isomorphic to pp.

For each such pair of vertices pp of Δ4\Delta_{4} and pp^{\prime} of Δ4\Delta^{\prime}_{4}, we choose a pair of bubbles so that we may perform a tear move at pp and pp^{\prime}. In order to do this, we must choose a pair of bubbles B1B_{1}, B2B_{2}, with certain constraints on the labelings at their branch vertices. We next describe one feasible set of constraints that enables the tearing operation to be performed (there will typically be other possible configurations).

Necessarily, at each branch vertex of B1B_{1} and B2B_{2}, we need the incident glued (elementary) edge to have (outgoing) label dd. We will also require that each bubble BiB_{i} is a union of three short strips sai,sbi,scis^{i}_{a},s^{i}_{b},s^{i}_{c}, with the property that at each branch vertex of BiB_{i} the outgoing label on the short strip sxis^{i}_{x} is equal to xx. Since there are only a finite number of possible local labelings at the branch vertices, and since each type of bubble occurs with roughly equal distribution, there are many bubbles satisfying this condition.

Later in the argument, it will also prove necessary that the strips sis^{i}_{\bullet} satisfy certain other constraints (see Lemma 3.10.2 below). For the moment, it suffices that these constraints are mild enough to guarantee the existence of the bubbles BiB_{i}.

Given bubbles B1B_{1}, B2B_{2} for a vertex pp of Δ4\Delta_{4}, we can perform the tearing operation, in such a way that after tearing, eae_{a} and eae^{\prime}_{a} adjoin sb1s^{1}_{b} and sc2s^{2}_{c}, ebe_{b} and ebe^{\prime}_{b} adjoin sc1s^{1}_{c} and sa2s^{2}_{a}, and ece_{c} and ece^{\prime}_{c} adjoin sa1s^{1}_{a} and sb2s^{2}_{b}. Note that the resulting graph remains folded, and that the remainder Δ4\Delta_{4} has been replaced by a union of bizenes.

Therefore, in order to perform the tearing operation, we need to find VV pairs of bubbles B1,B2B_{1},B_{2} as above, where VV is the number of vertices of Δ4\Delta_{4}. To do this, we divide all football bubbles in the undisturbed segments of the long strips into consecutive runs of seven. For each vertex of the remainder Δ4\Delta_{4}, we need to choose two such runs of a specified type. We furthermore insist on choosing these runs of seven from within the ‘innermost’ part of the ‘outer’ reservoir.

The number VV is equal to 2m(Δ4)/9λm(L)O(1/N)2m(\Delta_{4})/9\lambda\leq m(L)O(1/N). By pseudorandomness, the number of runs of seven bubbles of a fixed type is bounded below by

θ4O(m(Ω4))θ4O(m(L))\theta_{4}O(m(\Omega_{4}))\geq\theta_{4}O(m(L))

(using that ϵ\epsilon is sufficiently small). Half of these runs of seven come from within the innermost part of the outer reservoir. Therefore, as long as NN is large enough, we can always choose two suitable runs of seven bubbles for each vertex of Δ4\Delta_{4}, as required.

Since V/m(L)V/m(L) is bounded above by a constant (depending on k,λk,\lambda and μ\mu) divided by NN, the distribution of each bubble type has only been changed by O(1/N)O(1/N).

After performing tears in this way, we obtain a new partially glued graph Λ5\Lambda_{5}, with the additional property that the remainder Δ5\Delta_{5} is a disjoint union of bizenes and bicrowns (the latter of covering type). The mass of the remainder is still bounded above by

m(Δ5)/m(L)12/Nm(\Delta_{5})/m(L)\leq 12/N

since three half-edges of Δ4\Delta_{4} are replaced by 36 half-edges of Δ5\Delta_{5}, as shown in Figure 5.

Since, by construction, we only used bubbles in the tearing operation which came from the innermost half of the outer reservoir together with one bubble from the outermost part of the outer reservoir, the slack θ5\theta_{5} is no smaller than

θ4(1/2O(1/N))θ4/3\theta_{4}(1/2-O(1/N))\geq\theta_{4}/3

as claimed. ∎

3.8. Adjusting football inventory with trades

It will be necessary at a later stage of the argument to adjust the numbers of football pieces of each kind, so that the reservoir itself can be entirely glued up. At this stage and subsequent stages we must be careful to consider not just the combinatorial graph of our pieces, but also their type — i.e. their edge labels.

We now describe a move called a trade which has the following twofold effect:

  1. (1)

    it reduces the number of footballs of a specified type by 3; and

  2. (2)

    it transforms four sets of 3 footballs, each of a specified type, into four bicrowns each with the associated covering type.

Moreover, unlike the operation described in § 3.6, the trade operation has no effect on the remainder. Thus, the trade moves can be performed after the tear moves, to correct small errors in the distribution of football types, adjusting this distribution to be exactly as desired.

The trade move is illustrated in Figure 6. We start with three strings of five footballs, each string consisting of the same sequence of five football types in the same orders. We also assume the labels on the three sets of four intermediate sticky segments agree. We pull apart the sticky segments and reglue them in the pattern indicated in the figure, in such a way that four sets of three footballs are replaced with bicrowns. If the three middle footballs are of type (a,b,c)(a,b,c) then after regluing we can assume that the aa edges are all together, and similarly for the bb and cc edges; thus these triples of edges may by glued up, eliminating the three footballs.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 6. The trade move; the “before” picture is on the left, and the “after” picture is on the right. This move eliminates 3 footballs of a given type, and transforms four sets of 3 footballs with a given type into four bicrowns of the given covering type.

To summarize this succinctly, we introduce some notation. For BB a type of football bubble, we denote by B~\widetilde{B} the corresponding covering type of bicrown. For μ\mu a distribution on football bubbles and bicrowns of covering type, we denote by μ~\tilde{\mu} the following distribution on football bubbles.

μ~(B)=μ(B)+3μ(B~)\tilde{\mu}(B)=\mu(B)+3\mu(\widetilde{B})
Lemma 3.8.1.

Let the partially glued graph Λ5\Lambda_{5} be as above and suppose that ϵ\epsilon is sufficiently small. There is a constant C5(0,1)C_{5}\in(0,1) with the following property. Let μ5\mu_{5} be the distribution of bubble types in Ω5\Omega_{5} and let ν\nu be any integral distribution on bubble types such that, for each type BB,

μ5(B)(1C5)<ν(B)μ5(B)\mu_{5}(B)(1-C_{5})<\nu(B)\leq\mu_{5}(B)

Then we may apply trades as above to 3 copies of the partially glued graph Λ5\Lambda_{5} to produce a new partially glued graph Λ6\Lambda_{6} such that:

  1. (1)

    the induced distribution μ6\mu_{6} on bubbles and bicrowns satisfies μ~6=3ν\tilde{\mu}_{6}=3\nu;

  2. (2)

    the mass of the remainder Δ6\Delta_{6} is bounded above by

    m(Δ6)m(Λ0)O(1/N);\frac{m(\Delta_{6})}{m(\Lambda_{0})}\leq O(1/N);
  3. (3)

    the slack satisfies θ6θ5/2\theta_{6}\geq\theta_{5}/2.

Proof.

Let μ5\mu_{5} be the distribution on football bubbles and bicrowns derived from Λ5\Lambda_{5}. From the upper bound on the total mass of the remainder, it follows that 1μ~5/μ51+O(1/N)1\leq\tilde{\mu}_{5}/\mu_{5}\leq 1+O(1/N).

We divide the inner half of the outer reservoir Ω5o\Omega^{o}_{5} into strips of five contiguous football bubbles, and call a football bubble fifth if it lies in the center of such a quintuple. Let μ¯5\bar{\mu}_{5} denote the distribution of fifth football bubbles in the inner half of the outer reservoir Ω5o\Omega^{o}_{5}.

Recall that, in the outer reservoir, the bubbles are distributed within O(cN)ϵO(c^{N})\epsilon of the uniform distribution. Therefore, for any football bubble type BB, |μ¯5(B)ρ|<O(cN)ϵ|\bar{\mu}_{5}(B)-\rho|<O(c^{N})\epsilon (where ρ\rho is the uniform distribution, scaled appropriately). In particular, taking NN sufficiently large and ϵ\epsilon sufficiently small, we have

μ¯5(B)>C5μ5(B)\bar{\mu}_{5}(B)>C_{5}\mu_{5}(B)

for some constant C5C_{5}. Hence the hypothesis of the lemma implies that μ¯5(B)>μ5(B)ν(B)\bar{\mu}_{5}(B)>\mu_{5}(B)-\nu(B) for every type BB of football bubble. If NN is sufficiently large then it follows further that μ¯5(B)μ~5(B)ν(B)\bar{\mu}_{5}(B)\geq\tilde{\mu}_{5}(B)-\nu(B) for every type BB.

Consider each bubble in the center of a quintuple in the outer reservoir of type BB. We color the bubble black with probability (μ~5(B)ν(B))/μ¯5(B)(\tilde{\mu}_{5}(B)-\nu(B))/\bar{\mu}_{5}(B) and white otherwise.

We now construct Λ6\Lambda_{6} from three copies of Λ5\Lambda_{5}, by performing a trade at each bubble colored black. Taking three copies of Λ5\Lambda_{5} triples the number of each bubble type. The lemma is phrased so that replacing three bubbles of a given type by a bicrown of corresponding covering type is neutral. The only remaining effect of a trade is then to remove exactly three bubbles of the central type. This proves the lemma. ∎

In the sequel, we will apply this lemma with a particular distribution ν\nu, described in Lemma 3.11.1 below.

3.9. Cube and prism moves

We have two more gluing steps: a small mass of bicrowns and bizenes must be glued up with footballs (drawn from an almost equidistributed collection of much larger mass), then the distribution of the footballs can be corrected by trades so that they are perfectly evenly distributed, and finally an evenly distributed collection of footballs (i.e. a collection with exactly the same number of footballs with each possible label) must be entirely glued up. We next describe three moves which will enable us to glue up bubbles, bizenes and bicrowns.

3.9.1. The cube move

The idea is very simple: four footballs with appropriate edge labels can be draped over the 1-skeleton of a cube in a manner invariant under the action of the Klein 4-group, and then glued up according to how they match along the edges of the cube. This is indicated in Figure 7.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 7. Four footballs draped over the 1-skeleton of a cube

We next give an algebraic description of the cube move in terms of covering spaces of graphs, which enables us to give a precise description of the various coverings of the cube moves that we will also need.

Consider two theta-graphs, Θ\Theta and HH. The three edges of Θ\Theta are denoted by α,β,γ\alpha,\beta,\gamma (oriented so they all pointing in the same direction), and likewise the three edges of HH are denoted by a,b,ca,b,c. We consider the immersion ΘH\Theta\to H which maps the edges of Θ\Theta to concatenations of edges of HH as follows:

αab¯c,βbc¯a,γca¯b\alpha\mapsto a\bar{b}c~,~\beta\mapsto b\bar{c}a~,~\gamma\mapsto c\bar{a}b

(as usual, a¯\bar{a} denotes aa with the opposite orientation etc). To enable us to reason group-theoretically, we set δ=αγ¯\delta=\alpha\bar{\gamma} and ϵ=βγ¯\epsilon=\beta\bar{\gamma}, and similarly d=ac¯d=a\bar{c} and e=bc¯e=b\bar{c}. Fixing base points at the initial vertices of all the edges, π1Θ\pi_{1}\Theta is the free group on δ,ϵ\delta,\epsilon and π1H\pi_{1}H is the free group on d,ed,e. We immediately see that the immersion ΘH\Theta\to H induces the identifications

δ=αγ¯=ab¯cb¯ac¯=(ac¯)(bc¯¯)2(ac¯)=de2d\delta=\alpha\bar{\gamma}=a\bar{b}c\bar{b}a\bar{c}=(a\bar{c})(\overline{b\bar{c}})^{2}(a\bar{c})=de^{2}d

and

ϵ=βγ¯=bc¯ab¯ac¯=(bc¯)(ac¯)(bc¯)¯(ac¯)=ede1d\epsilon=\beta\bar{\gamma}=b\bar{c}a\bar{b}a\bar{c}=(b\bar{c})(a\bar{c})\overline{(b\bar{c})}(a\bar{c})=ede^{-1}d

The graph Θ\Theta should be thought of as a bubble, and the graph HH as a pattern for gluing it up. In what follows, we will describe various covering spaces HHH_{\bullet}\to H. The fibre product Θ\Theta_{\bullet} of the maps HHH_{\bullet}\to H and ΘH\Theta\to H, together with the induced map ΘH\Theta_{\bullet}\to H_{\bullet}, will then describe various gluing patterns for (covering spaces of) unions of bubbles.

We first start with the cube move itself. Consider the natural quotient map q4:π1HH1(H;/2)Vq_{4}:\pi_{1}H\to H_{1}(H;\mathbb{Z}/2\mathbb{Z})\cong V, where VV is the Klein 4-group. The corresponding covering space H4HH_{4}\to H is a cube with quotient graph HH. Note that the deck group VV acts freely on the cube H4H_{4}, freely permuting the diagonals. Since q4(π1Θ)=1q_{4}(\pi_{1}\Theta)=1, the fibre product Θ4\Theta_{4} is a disjoint union of four copies of Θ\Theta, each spanning a diagonal in the cube H4H_{4} and freely permuted by VV. In particular, the map Θ4H4\Theta_{4}\to H_{4} precisely defines the cube move.

3.9.2. Gluing bizenes

We next describe a gluing move for bizenes. Consider the quotient map q8q_{8} from π1H\pi_{1}H to the dihedral group D8=σ,τσ4=τ2=τστσ=1D_{8}=\langle\sigma,\tau\mid\sigma^{4}=\tau^{2}=\tau\sigma\tau\sigma=1\rangle defined by dσd\mapsto\sigma and eτe\mapsto\tau, and let H8H_{8} be the covering space of HH corresponding to the kernel of q8q_{8}. Since q8q_{8} factors through q4q_{4}, H8H_{8} is a degree-two covering space of the cube H4H_{4} (the graph H8H_{8} is in fact the 1-skeleton of an octagonal prism). We next calculate the restriction of q8q_{8} to π1Θ\pi_{1}\Theta:

q8(δ)=q8(de2d)=στ2σ=σ2q_{8}(\delta)=q_{8}(de^{2}d)=\sigma\tau^{2}\sigma=\sigma^{2}

while

q8(ϵ)=q8(ede1d)=τστσ=1q_{8}(\epsilon)=q_{8}(ede^{-1}d)=\tau\sigma\tau\sigma=1

The covering space of Θ\Theta corresponding to the restriction of q8q_{8} is therefore a bizene. It follows that the fibre product map

Θ8H8\Theta_{8}\to H_{8}

describes a double cover of the cube move, which glues four bizenes along an octagonal prism. We will call this an 8-prism move.

3.9.3. Gluing bicrowns

Finally, we describe a gluing move for bicrowns. Consider the quotient map q12:π1HD12=σ,τσ6=τ2=τστσ=1q_{12}:\pi_{1}H\to D_{12}=\langle\sigma,\tau\mid\sigma^{6}=\tau^{2}=\tau\sigma\tau\sigma=1\rangle defined by dτd\mapsto\tau and eσe\mapsto\sigma. Then, as before, since q12q_{12} factors through q4q_{4}, the kernel of q12q_{12} corresponds to a regular covering space H12H4H_{12}\to H_{4} of degree three (in fact, H12H_{12} is isomorphic to the 1-skeleton of a dodecagonal prism). Again, we calculate the restriction of q12q_{12} to π1Θ\pi_{1}\Theta, and find that

q12(δ)1=q12(ϵ)=σ2q_{12}(\delta)^{-1}=q_{12}(\epsilon)=\sigma^{2}

(an element of order 3). The covering space of Θ\Theta corresponding to the restriction of q12q_{12} is therefore a bicrown. In particular, the fibre product map

Θ12H12\Theta_{12}\to H_{12}

describes a triple cover of the cube move, which glues four bicrowns along a dodecagonal prism. We will call this a 12-prism move.

3.10. Creating bizenes and bicrowns

To complete the proof of the Thin Spine Theorem, we need to glue up the remaining small mass (of order 1/N1/N) of bizenes and bicrowns using prism moves, before gluing up the remaining football bubbles using cube moves. We shall see that trades provide us with enough flexibility to do this, as long as NN is large enough. However, since prism moves require that bizenes are glued up with bizenes and our bicrowns are glued up with bicrowns, and yet the reservoir consists only of football bubbles, we will need a move that turns football bubbles into bizenes and bicrowns of given types.

3.10.1. Bicrown assembly

Since the bicrowns that we need to assemble are all of covering type, it is straightforward to construct them from football bubbles. Given a bicrown of covering type B~\widetilde{B}, covering a football bubble of type BB, there is a move which takes as input three identical triples of bubbles each of type BB, and transforms them into three bicrowns of type B~\widetilde{B}, without any other changes to the unglued subgraph.

The following lemma is an immediate consequence of the fact that a 12-prism triply covers a cube.

Lemma 3.10.1.

For every bicrown of covering type there exist three more bicrowns of covering type such that the four bicrowns together can be glued with a 12-prism move.

3.10.2. Bizene assembly

Bizene assembly is more subtle, because the bizenes that we need are more general than simply of covering type. We assemble bizenes using the following move.

Consider an adjacent pair of bubbles of type (a,b,c)(a,b,c) and (a,b,c)(a^{\prime},b^{\prime},c^{\prime}), separated by a sticky strip of type xx. Suppose also that the second bubble (of type (a,b,c)(a^{\prime},b^{\prime},c^{\prime})) is followed by a further sticky strip also of type xx. Consider also a second pair of bubbles, of the same type but with the two bubbles swapped. From these two pairs we may construct two bizenes of the same type. The pairs of edges with the same start and end points are labeled (a,b)(a,b) and (a,b)(a^{\prime},b^{\prime}), while the edges joining one pair to the other are labeled cc and cc^{\prime}.

The bizenes that we may assemble in this way satisfy some constraints, arising from the fact that both the ends of the following triples must all be compatible with the start of xx: (a,b,c)(a,b,c), (a,b,c)(a^{\prime},b^{\prime},c^{\prime}), (a,b,c)(a,b,c^{\prime}), (a,b,c)(a^{\prime},b^{\prime},c). In the rank-two case, this creates especially strong constraints, which (up to relabeling) can be simply stated as requiring that the ends of aa, bb and cc should be equal to the ends of aa^{\prime}, bb^{\prime} and cc^{\prime} respectively. We shall call such a bizene constructible.

Just as the bizenes that we can construct are constrained, so the bizenes that we need to glue up from the remainder are also of a special form. Indeed, in the proof of Lemma 3.7.1 we were free to choose the interiors of the bubbles B1B_{1} and B2B_{2} in any way.

Refer to captionee_{\bullet}ee_{\bullet}
Figure 8. Four bizenes draped over an octagonal prism. A remainder bizene is colored red; the only constraint in its construction arises from the two identical arcs labeled α\alpha. The yellow, green and blue bizenes are constructible. They are constrained only by the requirements that the circled vertices should be identically labeled.
Lemma 3.10.2.

There exist choices of the bubble types B1B_{1} and B2B_{2} in the proof of Lemma 3.7.1 such that the resulting remainder bizenes can be glued with a prism move to three constructible bizenes.

Proof.

Such a choice is illustrated in Figure 8. Note that the constrained vertices of the yellow, green and blue constructible bizenes are disjoint from each other and from the determined arcs ee_{\bullet} of the red remainder bizene. Therefore, we can start by labeling the two arcs ee_{\bullet} and the constrained vertices, and then label the rest of the 8-prism in any way we want. Doing this for each ee_{\bullet} determines the bubble types B1B_{1} and B2B_{2}. ∎

3.11. Gluing up the remainder

In this section we use the moves described above to completely glue up the remainder and the reservoir. But first we will address two important details which we have hitherto left undefined: the distributions μ\mu (of Lemma 3.6.1) and ν\nu (of Lemma 3.8.1).

We first describe ‘cubical distributions’. Consider any distribution κ\kappa on the set of types of cubes with side length λ\lambda. The cube move associates to each type of cube a collection of four types of football bubbles. The push forward of any distribution κ\kappa to the set of types of football bubbles is called a cubical distribution. In particular, if κ\kappa is the uniform distribution on the set of types of cubes then we call the push forward the uniform cubical distribution, or just the cube distribution for brevity.

In Subsection 3.6 above we may take μ\mu to be the cube distribution, so that the set of football bubbles in the reservoir Ω5\Omega_{5} is within O(1/N)O(1/N) of μ\mu^{\prime}, a distribution proportional to the cube distribution.

We next address the distribution ν\nu from Lemma 3.8.1. It consists of two parts: any cubical distribution κ\kappa, and a bizene correction distribution β\beta. That is, ν=κ+β\nu=\kappa+\beta. So we need to describe the bizene correction distribution.

The remainder Δ5\Delta_{5} consists of (remainder) bizenes and bicrowns. By Lemma 3.10.2, to each remainder bizene we associate (some choice of) three constructible bizenes. Each constructible bizene can in turn be constructed from a pair of types of football bubble. Thus, to each remainder bizene we associate six football bubbles. Summing over all bizenes in the remainder Δ5\Delta_{5} defines the distribution β\beta.

In order to apply Lemma 3.8.1, we need to check that there is a cubical distribution κ\kappa such that ν=κ+β\nu=\kappa+\beta satisfies the hypotheses of the lemma.

Lemma 3.11.1.

If NN is sufficiently large then there exists a cubical distribution κ\kappa such that the integral distribution ν=κ+β\nu=\kappa+\beta satisfies

1C5ν/μ511-C_{5}\leq\nu/\mu_{5}\leq 1

(where C5C_{5} is the constant from Lemma 3.8.1). Furthermore, as long as m(L)m(L) is sufficiently large, we may take κ\kappa to be integral.

Proof.

Since the mass of the remainder is O(1/N)m(L)O(1/N)m(L) and μ5\mu_{5} is bounded below, it follows that β(B)O(1/N)μ5(B)\beta(B)\leq O(1/N)\mu_{5}(B) for each type BB of football bubble, so it suffices to show that there is an integral cubical distribution κ\kappa satisfying

1C5κ/μ51O(1/N)1-C_{5}\leq\kappa/\mu_{5}\leq 1-O(1/N)

By the construction of Λ5\Lambda_{5}, there is a cubical distribution μ\mu^{\prime} such that |1μ/μ5|<O(1/N)|1-\mu^{\prime}/\mu_{5}|<O(1/N). Choose a rational η(1C5,1)\eta\in(1-C_{5},1). As long as NN is sufficiently large we will also have that 1C5+O(1/N)<η<1O(1/N)1-C_{5}+O(1/N)<\eta<1-O(1/N), and it follows that κ=ημ\kappa=\eta\mu^{\prime} satisfies the required condition. Furthermore, if m(L)m(L) is sufficiently large then η\eta can be chosen so that κ\kappa is integral. ∎

We can now glue up all the bizenes, using bizene assembly and the 8-prism move.

Lemma 3.11.2.

Let Λ6\Lambda_{6} be as in Lemma 3.8.1, using the distribution ν\nu from Lemma 3.11.1. Then we may apply 8-prism moves to 2 copies of Λ6\Lambda_{6} to produce a partially glued graph Λ7\Lambda_{7} such that:

  1. (1)

    every component of the remainder Δ7\Delta_{7} is a bicrown of covering type;

  2. (2)

    the total mass of the remainder Δ7\Delta_{7} satisfies m(Δ7)/m(Λ7)O(1/N)m(\Delta_{7})/m(\Lambda_{7})\leq O(1/N);

  3. (3)

    if μ7\mu_{7} is the distribution of bubbles and bicrowns in Λ7\Lambda_{7} then μ~7\tilde{\mu}_{7} is cubical;

  4. (4)

    the slack θ7\theta_{7} satisfies θ7θ6/2\theta_{7}\geq\theta_{6}/2.

Proof.

Let β~\tilde{\beta} be the distribution of constructible bizenes required to glue up the remainder bizenes in Λ6\Lambda_{6}. Take two copies of Λ6\Lambda_{6}. Using the bizene assembly move, we construct exactly 2β~(B)2\tilde{\beta}(B) new bizenes of each type BB from the inner half of the outer reservoir. From the definition of B~\tilde{B} we may now glue up all the bizenes using the 8-prism move. By the construction of Λ6\Lambda_{6}, it follows that μ~7=2μ~62β=2κ\tilde{\mu}_{7}=2\tilde{\mu}_{6}-2\beta=2\kappa and so is cubical.

Since the total mass of bicrowns was O(1/N)O(1/N) in Λ6\Lambda_{6}, the same is true in Λ7\Lambda_{7}. ∎

The next lemma completes the proof of the Thin Spine Theorem, except for a small adjustment needed to correct co-orientation, in the case when k>2k>2.

Lemma 3.11.3.

From 3 copies of Λ7\Lambda_{7} as above, we can construct a graph Λ8\Lambda_{8} in which the unglued subgraph Υ8\Upsilon_{8} is empty.

Proof.

For each bicrown B~0\widetilde{B}_{0} (of covering type), there exist bicrowns of covering type B~i\widetilde{B}_{i} (where i=1,2,3i=1,2,3) such that the B~i\widetilde{B}_{i} for i=0,1,2,3i=0,1,2,3 can be glued up using a 12-prism move. Three copies of each of these B~i\widetilde{B}_{i} can in turn be constructed from three consecutive copies of bubbles BiB_{i}, using the bicrown assembly move from Lemma 3.10.1. Let α\alpha be the distribution on bubble types that, for each bicrown of type B~0\widetilde{B}_{0} in the remainder, counts three bubbles of each type BiB_{i}. Note that, because all the bicrowns are of covering type and the 12-prism move covers the cube move, the distribution α\alpha is cubical.

The partially glued graph Λ8\Lambda_{8} is constructed from three copies of Λ7\Lambda_{7}. Since the mass of the remainder Δ7\Delta_{7} is bounded above by

m(Δ7)m(Λ7)O(1/N)\frac{m(\Delta_{7})}{m(\Lambda_{7})}\leq O(1/N)

whereas the mass of the outer reservoir Ω7o\Omega^{o}_{7} is bounded below by

m(Ω7o)m(Ω7)θ7O(1),\frac{m(\Omega^{o}_{7})}{m(\Omega_{7})}\geq\theta_{7}\geq O(1)~,

for NN sufficiently large we may use bicrown assembly to construct three times the number of bicrowns needed to glue up the remainder, using football bubbles from the outer reservoir. We can then use 12-prism moves to glue up all the bicrowns.

The distribution of the remaining football bubbles is still cubical, and so they can also be glued up with cube moves. ∎

3.12. The co-orientation condition

The result of all this gluing is to produce f:LZf:L\to Z which is degree 3, and whose local model at every vertex of ZZ is good. What remains is to check that the construction can be done while satisfying the co-orientation condition. The obstruction to this condition can be thought of as an element of H1(L;/2)H^{1}(L;\mathbb{Z}/2\mathbb{Z}). Since LL has a bounded number of components, it should not be surprising that we can adjust the gluing by local moves to ensure the vanishing of the co-orientation obstruction. In fact, it is easier to arrange this after taking 2 disjoint copies of LL, and possibly performing a finite number of moves, which we now describe.

After the first gluing step, we trivialize the II-bundles (in an arbitrary way) along the preimage of each of the short segments. This trivialization determines a relative co-orientation cocycle on each football bubble or component of the remainder; we refer to this relative cocycle as a framing. The set of framings of each component γ\gamma is a torsor for H1(γ;/2)H^{1}(\gamma;\mathbb{Z}/2\mathbb{Z}); the four possible framings of a football bubble are depicted in Figure 9.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 9. Four framings on a football bubble

Subsequent moves all make sense for framed bubbles, bizenes, bicrowns and so on. Each gluing move can be done locally in a way which embeds in 3\mathbb{R}^{3} (embeddings are illustrated in the figures throughout the last few sections); such an embedding determines a move framing on each of the pieces. The difference between a given framing and the move framing determines a class in H1(γ;/2)H^{1}(\gamma;\mathbb{Z}/2\mathbb{Z}) for each piece, and the sum over all such pieces is the (global) co-orientation cocycle in H1(L;/2)H^{1}(L;\mathbb{Z}/2\mathbb{Z}).

There is a very simple procedure to adjust this global co-orientation cocycle, which we now describe. Suppose we have a pair of footballs γ\gamma and γ\gamma^{\prime} with the same 3 labels, but with framings which differ by a single reflection at a vertex (i.e. they are of the first two types depicted in Figure 9). Swapping γ\gamma and γ\gamma^{\prime} in two cube moves that they participate in adjusts the cocycle six times, once for each of the six edges in the two bubbles; we call this a swap move. If three of these edges are in a component LiL_{i}, and three in LjL_{j}, then the global change to the cocycle is to add a fundamental class of H1(LiLj;/2)H^{1}(L_{i}\cup L_{j};\mathbb{Z}/2\mathbb{Z}). If we performed our original gluing randomly, every component LiL_{i} should contain many pairs of footballs with framings which differ in this way. So if we take two disjoint copies of LZL\to Z, we can trivialize the co-orientation cocycle by finitely many such swaps. This duplication multiplies the total number of components of LL by a factor of 2.

This completes the proof of the Thin Spine Theorem 3.1.2, at least when k3k\geq 3.

3.13. Rank 2

The move described in § 3.4 to deal with an excess of O(ϵ)O(\epsilon) long strips requires rank k3k\geq 3, so that long strips can be grouped into super-compatible 4-tuples if necessary. In this section we briefly explain how to finesse this point in the case k=2k=2.

Fix some constant CC with 1CN1\ll C\ll N; CC will need to satisfy some divisibility properties in what follows, but we leave this implicit. Define a pocket to be three equal segments of length 3(C+2)λ3(C+2)\lambda which can be glued compatibly. Recall at the very first step of our construction that we glued compatible long strips in triples by bunching sticky segments to form bubbles. We modify this construction slightly by also allowing ourselves to create some small mass of bunched pockets. That is, we bunch triples of long strips of the form

a0x1a2x3aN,b0x1b2x3bN,c0x1c2x3cNa_{0}x_{1}a_{2}x_{3}\cdots a_{N^{\prime}},\quad b_{0}x_{1}b_{2}x_{3}\cdots b_{N^{\prime}},\quad c_{0}x_{1}c_{2}x_{3}\cdots c_{N^{\prime}}

if for even ii the letters adjacent to each xi+1x_{i+1} or xi1x_{i-1} disagree, where each aia_{i}, bib_{i} or cic_{i} has length 3λ3\lambda, and where each xjx_{j} either has length 3λ3\lambda, or has length 3(C+2)λ3(C+2)\lambda. We insist that the proportion of xjx_{j} of length 3(C+2)λ3(C+2)\lambda is very small, so that most of the bunched triples are of length 3λ3\lambda, but that some small mass of bunched pockets has also been created. Note that NN^{\prime} will depend on the number of “long” xjx_{j}, but in any case NN^{\prime} will be quite close to NN.

We arrange by pseudorandomness that the mass of bunched pockets of every possible type is O(ϵ)O(\epsilon), but with a constant such that this mass is definitely larger than the number of long strips that will remain unbunched after the first step.

Now consider a long strip σ\sigma left unbunched after the first step. We partition this strip in a different way as

σ:=e0z1e2z3eM\sigma:=e_{0}z_{1}e_{2}z_{3}\cdots e_{M}

where each eje_{j} has length 3λ3\lambda, and each zjz_{j} has length CλC\lambda.

We take 9 copies of σ\sigma. Fix an index jj. Suppose we have two sets of three bunched triples of pockets (hence 18 pockets in all) of the form

aiαixiβyiγiri,biαixiβyiγisi,ciαixiβyiγiti bunched along the pocket αixiβyiγia_{i}\alpha_{i}x_{i}\beta y_{i}\gamma_{i}r_{i},\;b_{i}\alpha_{i}x_{i}\beta y_{i}\gamma_{i}s_{i},\;c_{i}\alpha_{i}x_{i}\beta y_{i}\gamma_{i}t_{i}\text{ bunched along the pocket }\alpha_{i}x_{i}\beta y_{i}\gamma_{i}
aiαixiβyiγiri,biαixiβyiγisi,ciαixiβyiγiti bunched along the pocket αixiβyiγia_{i}^{\prime}\alpha_{i}^{\prime}x_{i}^{\prime}\beta^{\prime}y_{i}^{\prime}\gamma_{i}^{\prime}r_{i}^{\prime},\;b_{i}^{\prime}\alpha_{i}^{\prime}x_{i}^{\prime}\beta^{\prime}y_{i}^{\prime}\gamma_{i}^{\prime}s_{i}^{\prime},\;c_{i}^{\prime}\alpha_{i}^{\prime}x_{i}^{\prime}\beta^{\prime}y_{i}^{\prime}\gamma_{i}^{\prime}t_{i}^{\prime}\text{ bunched along the pocket }\alpha_{i}^{\prime}x_{i}^{\prime}\beta^{\prime}y_{i}^{\prime}\gamma_{i}^{\prime}

for each of i=1,2,3i=1,2,3, and satisfying

  1. (1)

    each of ai,bi,ci,ri,si,ti,xi,yia_{i},b_{i},c_{i},r_{i},s_{i},t_{i},x_{i},y_{i} and their primed versions have length 3λ3\lambda;

  2. (2)

    each of αi,β,γi\alpha_{i},\beta,\gamma_{i} and their primed versions have length CλC\lambda;

  3. (3)

    a1,a2,a3a_{1},a_{2},a_{3} all end with the same letter and r1,r2,r3r_{1},r_{2},r_{3} all start with the same letter, and similarly for bi,ci,si,tib_{i},c_{i},s_{i},t_{i} and the primed versions;

  4. (4)

    ai,bi,cia_{i},b_{i},c_{i} end with different letters and ri,si,tir_{i},s_{i},t_{i} start with different letters for each fixed ii, and similarly for the primed versions;

  5. (5)

    x1,x2,x3x_{1},x_{2},x_{3} start with different letters and end with the same letter and similarly for the primed versions;

  6. (6)

    y1,y2,y3y_{1},y_{2},y_{3} end with different letters and start with the same letter and similarly for the primed versions;

  7. (7)

    α1,α2,α3\alpha_{1},\alpha_{2},\alpha_{3} can be partitioned into an odd number of segments of length 3λ3\lambda which can be compatibly bunched creating a strip of alternate short segments and bubbles, and similarly for the γi\gamma_{i} and the primed versions;

  8. (8)

    the common last letter of the xix_{i} is different from the common last letter of the xix_{i}^{\prime} and from the last letter of eje_{j}, and the common first letter of the yiy_{i} is different from the common first letter of the yiy_{i}^{\prime} and from the first letter of ej+2e_{j+2}; and

  9. (9)

    β\beta, β\beta^{\prime} and zjz_{j} can be partitioned into an odd number of segments of length 3λ3\lambda which can be compatibly bunched creating a strip of alternate short segments and bubbles.

Under these hypotheses, we can pull apart the six bunched pockets, bunch the αi\alpha_{i} in short strips (and similarly bunch the αi\alpha_{i}^{\prime}), bunch the γi\gamma_{i} in short strips (and similarly bunch the γi\gamma_{i}^{\prime}), and finally bunch the three sets of β\beta, β\beta^{\prime} and zjz_{j} in short strips. Explicitly, we are creating bunched segments of length CλC\lambda of the following kinds:

a1α1x1,b2α2x2,c3α3x3;b1α1x1,c2α2x2,a3α3x3;c1α1x1,a2α2x2,b3α3x3;a_{1}\alpha_{1}x_{1},\;b_{2}\alpha_{2}x_{2},\;c_{3}\alpha_{3}x_{3};\quad b_{1}\alpha_{1}x_{1},\;c_{2}\alpha_{2}x_{2},\;a_{3}\alpha_{3}x_{3};\quad c_{1}\alpha_{1}x_{1},\;a_{2}\alpha_{2}x_{2},\;b_{3}\alpha_{3}x_{3};
a1α1x1,b2α2x2,c3α3x3;b1α1x1,c2α2x2,a3α3x3;c1α1x1,a2α2x2,b3α3x3;a_{1}^{\prime}\alpha_{1}^{\prime}x_{1}^{\prime},\;b_{2}^{\prime}\alpha_{2}^{\prime}x_{2}^{\prime},\;c_{3}^{\prime}\alpha_{3}^{\prime}x_{3}^{\prime};\quad b_{1}^{\prime}\alpha_{1}^{\prime}x_{1}^{\prime},\;c_{2}^{\prime}\alpha_{2}^{\prime}x_{2}^{\prime},\;a_{3}^{\prime}\alpha_{3}^{\prime}x_{3}^{\prime};\quad c_{1}^{\prime}\alpha_{1}^{\prime}x_{1}^{\prime},\;a_{2}^{\prime}\alpha_{2}^{\prime}x_{2}^{\prime},\;b_{3}^{\prime}\alpha_{3}^{\prime}x_{3}^{\prime};
y1γ1r1,y2γ2s2,y3γ3t3;y1γ1s1,y2γ2t2,y3γ3r3;y1γ1t1,y2γ2r2,y3γ3s3;y_{1}\gamma_{1}r_{1},\;y_{2}\gamma_{2}s_{2},\;y_{3}\gamma_{3}t_{3};\quad y_{1}\gamma_{1}s_{1},\;y_{2}\gamma_{2}t_{2},\;y_{3}\gamma_{3}r_{3};\quad y_{1}\gamma_{1}t_{1},\;y_{2}\gamma_{2}r_{2},\;y_{3}\gamma_{3}s_{3};
y1γ1r1,y2γ2s2,y3γ3t3;y1γ1s1,y2γ2t2,y3γ3r3;y1γ1t1,y2γ2r2,y3γ3s3;y_{1}^{\prime}\gamma_{1}^{\prime}r_{1}^{\prime},\;y_{2}^{\prime}\gamma_{2}^{\prime}s_{2}^{\prime},\;y_{3}^{\prime}\gamma_{3}^{\prime}t_{3}^{\prime};\quad y_{1}^{\prime}\gamma_{1}^{\prime}s_{1}^{\prime},\;y_{2}^{\prime}\gamma_{2}^{\prime}t_{2}^{\prime},\;y_{3}^{\prime}\gamma_{3}^{\prime}r_{3}^{\prime};\quad y_{1}^{\prime}\gamma_{1}^{\prime}t_{1}^{\prime},\;y_{2}^{\prime}\gamma_{2}^{\prime}r_{2}^{\prime},\;y_{3}^{\prime}\gamma_{3}^{\prime}s_{3}^{\prime};

and finally three copies of

xiβyi,xiβyi,ejzj+1ej+2x_{i}\beta y_{i},\;x_{i}^{\prime}\beta^{\prime}y_{i}^{\prime},\;e_{j}z_{j+1}e_{j+2}

for each of i=1,2,3i=1,2,3.

If we do this for each zjz_{j} in turn, then the net effect is to pair up all the extra long strips, at the cost of creating a new remainder of mass O(ϵ)O(\epsilon), and using up mass O(ϵ)O(\epsilon) of the bunched pockets.

At the end of this step every vertex of the new remainder created is adjacent to a strip of C/3C/3 consecutive short bubbles; because of this, there is ample slack to apply tear moves to the new remainder as in § 3.7. Note that this move requires us to take nine copies of each excess long strip; thus we might have to take a total of 5,832 copies of LL instead of 648 for k3k\geq 3. The rest of the argument goes through as above. This completes the proof in the case k=2k=2 and thus in general.

4. Bead decomposition

The next step of the argument is modeled very closely on § 5 from [Calegari–Walker(2015)]. For the sake of completeness we explain the argument in detail. Throughout this section we fix a free group FkF_{k} with k2k\geq 2 generators and we let rr be a random cyclically reduced word of length nn, and consider the one-relator group G:=F|rG:=\langle F\;|\;r\rangle with presentation complex KK. Using the Thin Spine Theorem, we will construct (with overwhelming probability) a spine f:LZf:L\to Z over KK for which every edge of ZZ has length at least λ\lambda, for some big λ\lambda. The main result of this section is that if this construction is done carefully, the immersion M¯(Z)K\overline{M}(Z)\to K will be π1\pi_{1}-injective, again with overwhelming probability.

4.1. Construction of the beaded spine

A random 1-relator group satisfies the small cancellation property C(μ)C^{\prime}(\mu) for every positive μ\mu, with overwhelming probability. So to show that M¯(Z)K\overline{M}(Z)\to K is π1\pi_{1}-injective, it suffices to show that for any sufficiently long immersed segment γZ\gamma\to Z whose image in XX under g:ZXg:Z\to X lifts to rr, it already lifts to LL. Informally, the only long immersed segments in ZZ which are “pieces” of rr are those that are in the image of segments of LL under f:LZf:L\to Z.

Let ZZ be a 4-valent graph with total edge length |Z|=O(n)|Z|=O(n), in which every edge has length λ\geq\lambda. For any \ell, there are at most |Z|3/λ|Z|\cdot 3^{\ell/\lambda} immersed paths in ZZ of length \ell. Thus if \ell is of order nαn^{\alpha}, and λ1\lambda\gg 1, we would not expect to find any paths of length \ell in common with an independent random relator rr of length nn, for any fixed α>0\alpha>0, with probability 1O(enc)1-O(e^{-n^{c}}) for some cc depending on α\alpha.

There is a nice way to express this in terms of density; or degrees of freedom, which is summarized in the following intersection formula of Gromov; see [Gromov(1993)], § 9.A for details:

Proposition 4.1.1 (Gromov’s intersection formula).

Let CC be a finite set. For a subset AA of CC define the (multiplicative) density of AA, denoted density(A)\textnormal{density}(A), to be density(A):=log|A|/log|C|\textnormal{density}(A):=\log{|A|}/\log{|C|}. If A1A_{1} is any subset of CC, and A2A_{2} is a random subset of CC of fixed cardinality, chosen independently of A1A_{1}, then with probability 1O(|C|c)1-O(|C|^{-c}) for some c>0c>0, there is an equality

density(A1A2)=density(A1)+density(A2)1\textnormal{density}(A_{1}\cap A_{2})=\textnormal{density}(A_{1})+\textnormal{density}(A_{2})-1

with the convention that density<0\textnormal{density}<0 means a set is empty.

Note that Gromov does not actually estimate the probability that his formula holds, but this is an elementary consequence of Chernoff’s inequality. For a proof of an analogous estimate, which explicitly covers the cases of interest that we need, see [Calegari–Walker(2013)], § 2.4.

In our situation, taking =nα\ell=n^{\alpha}, we can take CC to be the set of all reduced words in FkF_{k} of length \ell, which has cardinality approximately (2k1)nα(2k-1)^{n^{\alpha}}. If λ1\lambda\gg 1, then the set of immersed paths in ZZ of length \ell has density as close to 0 as desired; similarly, the set of subwords of a random word rr of length nn has density as close to 0 as desired. Thus if these subwords were independent, Gromov’s formula would show that they were disjoint, with probability 1O(enc)1-O(e^{-n^{c}}).

Of course, the thin spines ZZ guaranteed by the Thin Spine Theorem are hardly independent of rr. Indeed, every subpath of rr appears in LL, and therefore in ZZ! Thus, we must work harder to show that these subpaths (those that are already in LL) amount to all the intersection. The idea of the bead decomposition is to subdivide rr into many subsets bib_{i} of length n1δn^{1-\delta} (for some fixed δ\delta), to build a thin spine ZiZ_{i} “bounding” the subset bib_{i}, and then to argue that no immersed path in ZiZ_{i} of length nαn^{\alpha} can be a piece in any bjb_{j} with iji\neq j.

Fix some small positive constant δ\delta, and write rr as a product

r=r1s1r2s2rmsmr=r_{1}s_{1}r_{2}s_{2}\cdots r_{m}s_{m}

where each rir_{i} has length n1δn^{1-\delta} and each sis_{i} has length approximately nδn^{\delta} (the exact values are not important, just the order of magnitude). Thus mm is approximately equal to nδn^{\delta}; we further adjust the lengths of the rir_{i} and sis_{i} slightly so that mm is divisible by 33.

We say a reduced word xx has small self-overlaps if the length of the biggest proper prefix of xx equal to a proper suffix is at most |x|/3|x|/3. Almost every reduced word of fixed big length has small overlaps. Fix some positive constant C<δ/log(2k1)C<\delta/\log(2k-1), and for each index ii mod m/3m/3 we look for the first triple of subwords of the form a1xa2,b1xb2,c1xc2a_{1}xa_{2},b_{1}xb_{2},c_{1}xc_{2} in si,si+m/3,si+2m/3s_{i},s_{i+m/3},s_{i+2m/3} such that

  1. (1)

    the ai,bi,cia_{i},b_{i},c_{i} are single edges;

  2. (2)

    a1,b1,c1a_{1},b_{1},c_{1} are distinct and a2,b2,c2a_{2},b_{2},c_{2} are distinct;

  3. (3)

    xx has length ClognC\log{n} with CC as above; and

  4. (4)

    xx has small self-overlaps.

Actually, it is not important that xx has length exactly ClognC\log{n}; it would be fine for it to have length in the interval [Clogn/2,Clogn][C\log{n}/2,C\log{n}], for example. Any reduced word of length ClognC\log{n} with C<δ/log(2k1)C<\delta/\log(2k-1) will appear many times in any random reduced word of length nδn^{\delta}, with probability 1O(enc)1-O(e^{-n^{c}}) for some cc depending on δ\delta. See e.g. [Calegari–Walker(2013)], §. 2.3. Then for each index ii mod m/3m/3, the three copies of xx can be glued to produce unusually long bunched triples lil_{i} that we call lips. The lips partition the remainder of rr into subsets which we denote bib_{i}, where the index ii is taken mod m/3m/3, so that each bib_{i} is the union of three segments of length approximately equal to n1δn^{1-\delta} consisting of ri,ri+m/3,ri+2m/3r_{i},r_{i+m/3},r_{i+2m/3} together with the part of the adjacent sjs_{j} outside the lips. We call the bib_{i} beads, and we call the partition of rr minus the lips into beads the bead decomposition.

Now we apply the Thin Spine Theorem to build a thin spine f:LZf:L\to Z such that

  1. (1)

    LL consists of 648 copies of rr (or 5,832 copies if k=2k=2);

  2. (2)

    ZZ is cyclically subdivided by the lips lil_{i} into connected subgraphs ZiZ_{i};

  3. (3)

    the 648 copies of bib_{i} in LL are precisely the part of rr mapping to the ZiZ_{i}, and the remainder of LL consists of segments mapping to the lips as above.

We call the result a beaded spine.

Lemma 4.1.2 (No common path).

For any positive α\alpha, we can construct a beaded spine with the property that no immersed path in ZiZ_{i} of length nαn^{\alpha} can be a piece in any bjb_{j} with iji\neq j mod m/3m/3, with probability 1O(enc)1-O(e^{-n^{c}}), where cc depends on α\alpha.

Proof.

The construction of a beaded spine is easy: with high probability, the labels on each bib_{i} are (T,ϵ)(T,\epsilon)-pseudorandom for any fixed (T,ϵ)(T,\epsilon), and we can simply apply the construction in the Thin Spine Theorem to each bib_{i} individually to build ZiZ_{i}, and correct the co-orientation once at the end by a local modification in Z1Z_{1} (say).

By the nature of the bead decomposition, the bib_{i} are independent of each other. By thinness, there are O(n3/λ)O(n\cdot 3^{\ell/\lambda}) immersed paths in ZiZ_{i} of length \ell. For any fixed positive α\alpha, if we set =nα\ell=n^{\alpha}, and choose λ\lambda big enough, then the density of this set of paths (in the set of all reduced words of length \ell) is as close to 0 as we like. Similarly, the set of subwords in bjb_{j} of length \ell has density as close to 0 as we like for big nn. But now these subwords are independent of the immersed paths in ZiZ_{i}, so by the intersection formula (Proposition 4.1.1), there are no such words in common, with probability 1O(enc)1-O(e^{-n^{c}}). ∎

4.2. Injectivity

We now show why a beaded spine gives rise to a π1\pi_{1}-injective map of a 3-manifold M¯(Z)K\overline{M}(Z)\to K. First we prove another lemma, which is really the key geometric point, and will be used again in § 6:

Lemma 4.2.1 (Common path lifts).

For any positive β\beta, with probability 1O(enc)1-O(e^{-n^{c}}) we can construct a beaded spine LZXL\to Z\to X with the property that any immersed segment γZ\gamma\to Z with |γ|=βn|\gamma|=\beta n whose image in XX under ZXZ\to X lifts to rr or r1r^{-1}, already lifts to LL.

In words, this lemma says that any path in rr of length βn\beta n which immerses in ZZ lifts to LL, and therefore appears in the boundary of a disk of M¯(Z)\overline{M}(Z).

Proof.

The proof follows very closely the proof of Lemma 5.2.3 from [Calegari–Walker(2015)].

First, fix some very small α\alpha with α/log2k1C\alpha/\log{2k-1}\ll C^{\prime} where ClognC^{\prime}\log{n} is the length of the lips in the beaded spine. This ensures that a random word of length nαn^{\alpha} is very unlikely to contain two copies of any word of length ClognC^{\prime}\log{n} with small self-overlaps; see e.g. [Calegari–Walker(2013)] Prop. 2.6 and Prop. 2.11 which says that the likelihood of this occurrence is O(nC)O(n^{-C}) for some CC.

Now, let fZ:γZf_{Z}:\gamma\to Z be an immersed path of length βn\beta n whose label is a subpath of rr or r1r^{-1}; this means that there is another immersion fL:γLf_{L}:\gamma\to L such that the compositions γZX\gamma\to Z\to X and γLX\gamma\to L\to X agree.

Using fLf_{L}, we decompose γ\gamma into subpaths γj\gamma_{j} which are the preimages of the segments of bjb_{j} under fLf_{L}. Apart from boundary terms, each of these γj\gamma_{j} has length approximately n1δn^{1-\delta}. By Lemma 4.1.2, no γj\gamma_{j} can immerse in ZiZ_{i} with iji\neq j mod m/3m/3 unless |γj|<nα|\gamma_{j}|<n^{\alpha}, and in fact fZf_{Z} must therefore take all of γj\gamma_{j} into ZjZ_{j} except possibly for some peripheral subwords of length at most nαn^{\alpha}.

But this means that for all jj, there is a subpath σγ\sigma\subset\gamma centered at the common endpoint of γj\gamma_{j} and γj+1\gamma_{j+1}, with |σ|nα|\sigma|\leq n^{\alpha}, for which one endpoint maps under fZf_{Z} into ZjZ_{j} and the other into Zj+1Z_{j+1}. This means that fZf_{Z} must map σ\sigma over the lip of ZZ separating ZjZ_{j} from Zj+1Z_{j+1}, and must contain a copy of the word xjx_{j} on that lip. But under the map fLf_{L}, the word σ\sigma contains another copy of xjx_{j}. By our hypothesis on α\alpha, the probability that σ\sigma contains two copies of xjx_{j} is O(nC)O(n^{-C}). If these copies are the same, then the composition fL:γLZf_{L}:\gamma\to L\to Z and fZ:γZf_{Z}:\gamma\to Z agree on σ\sigma. But since ZXZ\to X is an immersion, it follows that fL:γLZf_{L}:\gamma\to L\to Z and fZ:γZf_{Z}:\gamma\to Z must agree on all of γ\gamma; i.e. that fZf_{Z} lifts to LL, as claimed.

So the lemma is proved unless there are two distinct copies of xjx_{j} within distance nαn^{\alpha} of each lip in the image of γ\gamma. Since γ\gamma has length βn\beta n, there are nδn^{\delta} such lips; the probability of two distinct copies for each lip is O(nC)O(n^{-C}), and the probabilities for distinct lips are independent, so the total probability is O(enc)O(e^{-n^{c}}) and we are done. ∎

An immediate corollary is the existence of 3-manifold subgroups in random 1-relator groups:

Proposition 4.2.2.

Let G=Fk|rG=\langle F_{k}\;|\;r\rangle be a random 1-relator group where |r|=n|r|=n. Then with probability 1O(enc)1-O(e^{-n^{c}}) we can produce a beaded spine LZXL\to Z\to X for which the associated map M¯(Z)K\overline{M}(Z)\to K is π1\pi_{1}-injective.

Proof.

Suppose not, so that there is an immersed loop γ:S1Z\gamma:S^{1}\to Z which is nontrivial in π1(M¯(Z))\pi_{1}(\overline{M}(Z)), but trivial in KK. There is a van Kampen diagram 𝒟{\mathcal{D}} with boundary γ\gamma. If DD is a disk in this diagram with some segment in common with γ\gamma, and if DZ\partial D\to Z lifts to LL, then D\partial D bounds a disk in M¯(Z)\overline{M}(Z), and we can push 𝒟{\mathcal{D}} across DD by a homotopy, producing a diagram with fewer disks. A diagram which does not admit such a simplification is said to be efficient; without loss of generality therefore we obtain an efficient diagram whose boundary is an immersed loop γ:S1Z\gamma:S^{1}\to Z.

The group GG satisfies the small cancellation property C(μ)C^{\prime}(\mu) for any positive μ\mu. Thus by Greedlinger’s Lemma, if we take μ\mu small enough, some disk DD in the diagram has a segment of its boundary of length at least n/2n/2 in common with γ\gamma. Note that D\partial D is labeled rr or r1r^{-1}, and has length nn. Since 1/2>β1/2>\beta as in Lemma 4.2.1, the boundary of this path actually lifts to LL, whence the diagram is not efficient after all. This contradiction proves the theorem. ∎

Note that each boundary component of M¯(Z)\overline{M}(Z) is of the form S¯(Y)\overline{S}(Y) for some 3-valent fatgraph YY immersed in ZZ; thus the same argument implies that every component of M¯(Z)\partial\overline{M}(Z) is π1\pi_{1}-injective, and therefore M¯(Z)\overline{M}(Z) has incompressible boundary. Furthermore, since KK is aspherical, so is M¯(Z)\overline{M}(Z), and therefore M¯(Z)\overline{M}(Z) is irreducible, and π1(M¯(Z))\pi_{1}(\overline{M}(Z)) does not split as a free product.

To show that M¯(Z)\overline{M}(Z) is homotopic to a hyperbolic 3-manifold with totally geodesic boundary, it suffices to show that it is acylindrical, by Thurston’s hyperbolization theorem—see, for instance, [Bonahon(2002), Theorem 4.3]. We show this in § 5.

5. Acylindricity

In this section we explain why the 3-manifolds we have produced in random 1-relator groups are acylindrical.

Theorem 5.0.1 (1-Relator Acylindrical Subgroup Theorem).

Let G=Fk|rG=\langle F_{k}\;|\;r\rangle be a random 1-relator group where |r|=n|r|=n. Then with probability 1O(enc)1-O(e^{-n^{c}}) for the beaded spine LZXL\to Z\to X guaranteed by Proposition 4.2.2 the 3-manifold M¯(Z)\overline{M}(Z) is acylindrical. Thus, with overwhelming probability, random 1-relator groups contain subgroups isomorphic to the fundamental group of a hyperbolic 3-manifold with totally geodesic boundary.

Proof.

Each boundary component iM¯(Z)\partial_{i}\subset\partial\overline{M}(Z) is of the form S¯(Yi)\overline{S}(Y_{i}) for some trivalent fatgraph YiZY_{i}\to Z immersed in ZZ. Suppose M¯(Z)\overline{M}(Z) admits an essential annulus. Then there is a van Kampen diagram on an annulus 𝒜{\mathcal{A}} with boundary γ1,γ2\gamma_{1},\gamma_{2} where γi\gamma_{i} immerses in YiY_{i} for fatgraphs YiY_{i} associated to boundary components i\partial_{i} as above, and each γi\gamma_{i} is essential in S¯(Yi)\overline{S}(Y_{i}) (which in turn is essential in M¯(Z)\overline{M}(Z)).

Assume that 𝒜{\mathcal{A}} is efficient. Then by Lemma 4.2.1, for any positive β\beta we can insist that no segment of Dγi\partial D\cap\gamma_{i} has length more than βn\beta n. For big nn, with probability 1O(enc)1-O(e^{-n^{c}}) we know that KK is C(μ)C^{\prime}(\mu) for any positive μ\mu; when μ\mu is small, the annular version of Greedlinger’s Lemma (see [Lyndon–Schupp(1977)] Ch. V Thm. 5.4 and its proof) implies that if 𝒜{\mathcal{A}} contains a disk at all, then some disk DD in the diagram has a segment on its boundary of length at least n/3n/3 in common with γ1\gamma_{1} or γ2\gamma_{2}. Taking β<1/3\beta<1/3 we see that 𝒜{\mathcal{A}} can contain no disks at all; i.e. γ1\gamma_{1} and γ2\gamma_{2} have the same image γ\gamma in ZZ.

Now we use the fact that LZL\to Z is a good spine. At each vertex vv of ZZ, four local boundary components of M¯(Z)\overline{M}(Z) meet; the fact that γ\gamma lifts to paths γ1\gamma_{1} and γ2\gamma_{2} in two of these component forces γ\gamma to run between two specific edges incident to vv, and this determines a unique lift of γ\gamma to LL near vv compatible with the existence of the γi\gamma_{i}. Similarly, along each edge ee of ZZ, three local boundary components of M¯(Z)\overline{M}(Z) meet; the components containing γ1\gamma_{1} and γ2\gamma_{2} thus again determine a unique lift of γ\gamma to LL along ee. These local lifts at vertices and along edges are compatible, and determine a global lift of γ\gamma to LL. But this means γ\gamma is inessential in M¯(Z)\overline{M}(Z), contrary to hypothesis. Thus M¯(Z)\overline{M}(Z) is acylindrical after all. ∎

6. 3-Manifolds Everywhere

We now show that the acylindrical 3-manifold subgroups that we have constructed in random 1-relator groups stay essential as we add (2k1)Dn(2k-1)^{Dn} independent random relations of length nn, for any D<1/2D<1/2. Our argument follows the proof of Thm. 6.4.1 [Calegari–Walker(2015)] exactly, and depends only on the following two facts:

  1. (1)

    the beaded spine ZZ has total length O(n)O(n), has valence 4, and every segment has length at least λ\lambda, where we may choose λ\lambda as big as we like (depending on DD); and

  2. (2)

    any immersed segment γZ\gamma\to Z of length βn\beta n whose label is a subword of rr or r1r^{-1} lifts to LL, where we may choose β\beta as small as we like (depending on DD).

Beyond these facts, we use two theorems of [Ollivier(2007)], which give explicit estimates for the linear constant in the isoperimetric function and for the constant of hyperbolicity for a random group at density D<1/2D<1/2.

6.1. Ollivier’s estimates

We use the following theorems of [Ollivier(2007)]:

Theorem 6.1.1 ([Ollivier(2007)], Thm. 2).

Let GG be a random group at density DD. Then for any positive ϵ\epsilon, and any efficient van Kampen diagram 𝒟{\mathcal{D}} containing mm disks, we have

|𝒟|(12Dϵ)nm|\partial{\mathcal{D}}|\geq(1-2D-\epsilon)\cdot nm

with probability 1O(enc)1-O(e^{-n^{c}}).

Theorem 6.1.2 ([Ollivier(2007)], Cor. 3).

Let GG be a random group at density DD. Then the hyperbolicity constant δ\delta of the presentation satisfies

δ4n/(12D)\delta\leq 4n/(1-2D)

with probability 1O(enc)1-O(e^{-n^{c}}).

From this, we will deduce the following lemma, which is the exact analog of [Calegari–Walker(2015)] Lem. 6.3.2, and is deduced in exactly the same way from Ollivier’s theorems:

Lemma 6.1.3.

Let M¯(Z)\overline{M}(Z) be a 3-manifold obtained from a beaded spine, and suppose it is not π1\pi_{1}-injective in GG, a random group at density DD. Then there are constant CC and CC^{\prime} depending only on D<1/2D<1/2, a geodesic path γ\gamma in ZZ of length at most CnCn, and a van Kampen diagram 𝒟{\mathcal{D}} containing at most CC^{\prime} faces so that γ𝒟\gamma\subset\partial{\mathcal{D}} and |γ|>|𝒟|/2|\gamma|>|\partial{\mathcal{D}}|/2.

Proof.

Theorem 6.1.2 says that δ4n/(12D)\delta\leq 4n/(1-2D), and in any δ\delta-hyperbolic geodesic metric space, a kk-local geodesic is a (global) (k+4δk4δ,2δ)(\frac{k+4\delta}{k-4\delta},2\delta)-quasigeodesic for any k>8δk>8\delta (see [Bridson–Haefliger(1999)], Ch. III. H, 1.13 p. 405). A local geodesic in the 1-skeleton of M¯(Z)\overline{M}(Z) corresponding to an element of the kernel must contain a subsegment of length at most 9δ9\delta which is not a local geodesic in KK. The lift of this subsegment to the universal cover K~\widetilde{K} (i.e. the Cayley complex of GG) cobounds a van Kampen diagram 𝒟{\mathcal{D}} with an honest geodesic segment in GG. But by Theorem 6.1.1, the diagram 𝒟{\mathcal{D}} must satisfy

72n/(12D)|D|(12Dϵ)nC72n/(1-2D)\geq|\partial D|\geq(1-2D-\epsilon)\cdot nC^{\prime}

where CC^{\prime} is the number of faces; thus CC^{\prime} is bounded in terms of DD, and independent of nn. ∎

6.2. Proof of the main theorem

Theorem 6.2.1 (3-Manifolds Everywhere).

Fix k2k\geq 2. A random kk-generator group — either in the few relators model with 1\ell\geq 1 relators, or the density model with density 0<D<1/20<D<1/2 — with relators of length nn contains many quasi-isometrically embedded subgroups isomorphic to the fundamental group of a hyperbolic 3-manifold with totally geodesic boundary, with probability 1O(enC)1-O(e^{-n^{C}}) for some C>0C>0.

Proof.

The proof exactly follows the proof of Thm. 6.4.1 from [Calegari–Walker(2015)]. Pick one relation rr and build LZXL\to Z\to X and M¯(Z)\overline{M}(Z) by the method of § 3. We have already seen that M¯(Z)\overline{M}(Z) is homotopy equivalent to a hyperbolic 3-manifold with totally geodesic boundary, and that its fundamental group injects into F|r\langle F\;|\;r\rangle; we now show that it stays injective in GG when we add another (2k1)Dn(2k-1)^{Dn} independent random relations of length nn.

The argument is a straightforward application of Gromov’s intersection formula, i.e. Proposition 4.1.1. It is convenient to express it in terms of degrees of freedom, measured multiplicatively as powers of (2k1)(2k-1). Suppose M¯(Z)\overline{M}(Z) is not π1\pi_{1}-injective. Then by Lemma 6.1.3 there is an efficient van Kampen diagram 𝒟{\mathcal{D}} with mCm\leq C^{\prime} faces (where CC^{\prime} depends only on DD), and a local geodesic γ\gamma which immerses in ZZ with γ𝒟\gamma\subset\partial{\mathcal{D}} and |γ|>|𝒟|/2|\gamma|>|\partial{\mathcal{D}}|/2. The choice of γ\gamma gives nβn\beta^{\prime} degrees of freedom, where β=log(3)α/λ\beta^{\prime}=\log(3)\alpha/\lambda and where |γ|=αn|\gamma|=\alpha n, since there are |Z|3αn/λ|Z|\cdot 3^{\alpha n/\lambda} immersed paths in ZZ of length αn\alpha n, and we have |Z|=O(n)|Z|=O(n). Taking λ\lambda as big as necessary, we can make β\beta^{\prime} as small as we like.

Next, disks in 𝒟{\mathcal{D}} with boundary label rr or r1r^{-1} cannot have segments of length more than βn\beta n in common either with themselves or with γ\gamma in an efficient van Kampen diagram, where β\beta as in Lemma 4.2.1 can be taken as small as we like. Since there are at most mm disks in 𝒟{\mathcal{D}}, two distinct disks cannot have more than mm boundary segments in common. Take β\beta small enough so that βm<1/2\beta m<1/2. Then if we let 𝒟{\mathcal{D}}^{\prime} denote the result of cutting the disks labeled rr or r1r^{-1} out of the diagram, and let γ𝒟\gamma^{\prime}\subset\partial{\mathcal{D}}^{\prime} denote the union

γ:=(γ𝒟)(𝒟𝒟)\gamma^{\prime}:=(\gamma\cap\partial{\mathcal{D}}^{\prime})\cup(\partial{\mathcal{D}}^{\prime}-\partial{\mathcal{D}})

and mm^{\prime} the number of disks in 𝒟{\mathcal{D}}^{\prime}, then we have inequalities mmm^{\prime}\leq m, |γ||γ||\gamma^{\prime}|\geq|\gamma| and |γ||𝒟|/2|\gamma^{\prime}|\geq|\partial{\mathcal{D}}^{\prime}|/2 with equality if and only if 𝒟=𝒟{\mathcal{D}}^{\prime}={\mathcal{D}}.

Each remaining choice of face gives nDnD degrees of freedom, and each segment in the interior of length \ell imposes \ell degrees of constraint. Similarly, γ\gamma^{\prime} itself imposes |γ||\gamma^{\prime}| degrees of constraint. Let {\mathcal{I}} denote the union of interior edges. Then |𝒟|+2||=nm|\partial{\mathcal{D}}^{\prime}|+2|{\mathcal{I}}|=nm^{\prime} so |γ|+||nm/2|\gamma^{\prime}|+|{\mathcal{I}}|\geq nm^{\prime}/2 because |γ||𝒟|/2|\gamma^{\prime}|\geq|\partial{\mathcal{D}}^{\prime}|/2. On the other hand, the total degrees of freedom is nmD+nβ<nm/2nm^{\prime}D+n\beta^{\prime}<nm^{\prime}/2 if β\beta^{\prime} is small enough, so there is no way to assign labels to the faces to build a compatible diagram, with probability 1O(enC)1-O(e^{-n^{C}}). There are polynomial in nn ways to assign lengths to the edges, and a finite number of possible combinatorial diagrams (since each diagram has at most CC^{\prime} disks); summing the exceptional cases over all such diagrams shows that the probability of finding some such diagram is O(enC)O(e^{-n^{C}}). Otherwise M¯(Z)\overline{M}(Z) is π1\pi_{1}-injective, as claimed.

Finally, we prove that M¯(Z)\overline{M}(Z) is quasi-isometrically embedded in GG. Indeed, as observed already in [Calegari–Walker(2015)], the above argument actually shows that for any ϵ>0\epsilon>0 we can construct ZZ for which π1(M¯(Z))\pi_{1}(\overline{M}(Z)) is (1+ϵ)(1+\epsilon) quasi-isometrically embedded in GG. Controlling ϵ\epsilon depends only on applying the argument above to segments γ\gamma of length at most αn\alpha n for suitable α(ϵ)\alpha(\epsilon). The constant α\alpha then bounds the number of disks in an efficient van Kampen diagram, by Theorem 6.1.1. ∎

7. Commensurability

Once we know that random groups contain many interesting 3-manifold subgroups, it is natural to wonder exactly which 3-manifold groups arise. For any fixed non-free finitely presented group HH, there are no injective homomorphisms from HH to a random group GG at fixed density once the relators in GG get sufficiently long (with overwhelming probability). So it is probably hopeless to try to understand precisely which subgroups arise, since this will depend in a very complicated way on the length nn of the relators. However, we are in better shape if we simply try to control the commensurability class of the subgroups.

Example 7.0.1.

There are 24 simplices in the barycentric subdivision of a regular Euclidean tetrahedron. Three edges of this simplex have dihedral angles π/2\pi/2, two have dihedral angles π/3\pi/3, and one (the edge lying on the edge of the original simplex) has a dihedral angle of the form 12cos1(13)\frac{1}{2}\cos^{-1}(\frac{1}{3}), which is approximately 35.264435.2644^{\circ}. If we deform the dihedral angle α\alpha of this last edge while keeping the other dihedral angles fixed, the simplex admits a unique hyperbolic metric for all α>π/6\alpha>\pi/6 at which point one vertex of the simplex becomes ideal. If we try to deform to α<π/6\alpha<\pi/6, then three of the faces of the simplex don’t meet at all, and there is a perpendicular plane which intersects these three faces in the edges of a hyperbolic triangle with angles (π/2,π/3,α)(\pi/2,\pi/3,\alpha). If α\alpha is of the form π/m\pi/m for some integer m>6m>6, the group generated by reflections in the 4 sides of the (now-infinite) hyperbolic polyhedron is discrete, and convex cocompact. This group is a Coxeter group which we denote Γ(m)\Gamma(m), and whose Coxeter diagram is

mm

The groups Γ(m)\Gamma(m) with m<6m<6 are finite. The group Γ(6)\Gamma(6) is commensurable with the fundamental group of the figure 8 knot complement. For any m7m\geq 7 the limit set of Γ(m)\Gamma(m) is a (round) Sierpinski carpet. Figure 10 depicts a simply-connected 2-complex K~\tilde{K} stabilized by Γ(7)\Gamma(7) with cocompact fundamental domain which is a “dual” spine to the (infinite) polyhedron described above. The faces of this 2-complex are regular 7-gons, and the vertices are all 4-valent with tetrahedral symmetry.

\labellist
\hair

2pt \endlabellistRefer to caption

Figure 10. A polyhedron K~\tilde{K} on which Γ(7)\Gamma(7) acts cocompactly

As mm\to\infty, the limit sets converge to the Apollonian gasket, which has Hausdorff dimension about 1.30571.3057. The convex covolumes of Γ(m)\Gamma(m) — i.e. the volumes of the convex hulls of 3/Γ(m)\mathbb{H}^{3}/\Gamma(m) — are uniformly bounded above independently of mm.

We now observe that we can arrange for the 3-manifold groups we construct to be commensurable with some Γ(m)\Gamma(m).

Theorem 7.0.2 (Commensurability Theorem).

A random group at any density <1/2<1/2 or in the few relators model contains (with overwhelming probability) a subgroup commensurable with the Coxeter group Γ(m)\Gamma(m) for some m7m\geq 7, where Γ(m)\Gamma(m) is the Coxeter group with Coxeter diagram

mm
Proof.

The group Γ(m)\Gamma(m) acts cocompactly on a simply-connected 2-dimensional complex K~\tilde{K} in 3\mathbb{H}^{3} whose faces are totally geodesic regular hyperbolic mm-gons, and whose vertices are 4-valent and are stabilized by a tetrahedral symmetry group; the case m=7m=7 is depicted in Figure 10. So to prove the theorem it suffices to show that we can build our thin spines LZL\to Z in such a way that each component of LL maps over exactly mm edges of the 4-valent graph ZZ. But this is elementary to arrange: the only point in the construction in which the number of edges of the components of LL might vary is during the operations of super-compatible gluing, the adjustments in § 3.6, and trades. In each of these cases all that is relevant is the types of pieces being glued or traded, and not which components of LL are involved. Since types of the desired kind for each move can be found on any component, we can simply distribute the moves evenly over the different components, possibly after taking multiple copies of LL to clear denominators. The proof immediately follows ∎

There is nothing very special about the commensurability classes Γ(m)\Gamma(m), except that their fundamental domains are so small, so that their local combinatorics are very easy to describe.

Definition 7.0.3.

A geodesic spine KK is a finite 2-dimensional complex with totally geodesic edges and faces which embeds in some hyperbolic 3-manifold MM with totally geodesic boundary as a deformation retract. We say that a 2-dimensional orbifold complex KK^{\prime} is obtained by orbifolding KK if its underlying complex is homeomorphic to KK, and it is obtained by adding at most one orbifold point to each face of KK. Note that each such KK^{\prime} has an (orbifold) fundamental group which is commensurable with an acylindrical 3-manifold group.

In view of the level of control we are able to impose on the combinatorial type of the thin spines we construct in § 3, we make the following conjecture:

Conjecture 7.0.4.

For any fixed geodesic spine KK, a random group GG — either in the few relators model or the density model with density 0<D<1/20<D<1/2 — contains subgroups commensurable with the (orbifold) fundamental group of some orbifolding KK^{\prime} of KK (with overwhelming probability).

8. Acknowledgments

Danny Calegari was supported by NSF grant DMS 1005246. Henry Wilton was supported by an EPSRC Career Acceleration Fellowship. We would like to thank François Dahmani for raising the main question addressed by this paper, and for his continued interest. We would also like to thank Pierre de la Harpe for comments and corrections.

References

  • [Agol(2013)] I. Agol (with an appendix with D. Groves and J. Manning), The virtual Haken conjecture, Doc. Math. 18 (2013), 1045–1087
  • [Bonahon(2002)] F. Bonahon Geometric structures on 3-manifolds, Handbook of geometric topology, North-Holland, Amsterdam, 2002, 93–164
  • [Bowditch(1998)] B. Bowditch, Cut points and canonical splittings of hyperbolic groups, Acta Math. 180 (1998), no. 2, 145–186
  • [Bridson–Haefliger(1999)] M. Bridson and A. Haefliger, Metric spaces of nonpositive curvature, Grund. der math. Wiss. Springer-Verlag Berlin 1999
  • [Calegari–Walker(2013)] D. Calegari and A. Walker, Random rigidity in the free group, Geom. Topol. 17 (2013), 1707–1744
  • [Calegari–Walker(2015)] D. Calegari and A. Walker, Random groups contain surface subgroups, Jour. AMS 28 (2015), no. 2, 383–419
  • [Dahmani–Guirardel–Przytycki(2011)] F. Dahmani, V. Guirardel, and P. Przytycki. Random groups do not split, Math. Ann. 249 (2011), no. 3, 657–673
  • [Gromov(1993)] M. Gromov, Asymptotic invariants of infinite groups, Geometric group theory, vol. 2; LMS Lecture Notes 182 (Niblo and Roller eds.) Cambridge University Press, Cambridge, 1993
  • [Kahn–Markovic(2011)] J. Kahn and V. Markovic, The good pants homology and a proof of the Ehrenpreis conjecture, preprint, arXiv:1101.1330
  • [Kapovich–Kleiner(2000)] M. Kapovich and B. Kleiner, Hyperbolic groups with low dimensional boundary, Ann. Sci. École Norm. Sup. (4) 33 (2000), no. 5, 647–669
  • [Keevash(2014)] P. Keevash, The existence of designs, preprint, arXiv:1401.3665
  • [Lyndon–Schupp(1977)] R. Lyndon and P. Schupp, Combinatorial group theory, Ergeb. der Math. und ihr. Grenz. 89, Springer-Verlag, Berlin-New York, 1977
  • [Ollivier(2007)] Y. Ollivier, Some small cancellation properties of random groups, Internat. J. Algebra Comput. 17 (2007), no. 1, 37–51
  • [Stallings(1968)] J. Stallings, On torsion-free groups with infinitely many ends, Ann. Math. 88 (1968), 312–334
  • [Stallings(1983)] J. Stallings, Topology of finite graphs, Invent. Math. 71 (1983), no. 3, 551–565