This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

On the Diameters of Friends-and-Strangers Graphs

Ryan Jeong Department of Pure Mathematics and Mathematical Statistics, Wilberforce Road, Cambridge, CB3 0WA, UK rj450@cam.ac.uk
Abstract.

Given simple graphs XX and YY on the same number of vertices, the friends-and-strangers graph 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) has as its vertices all bijections from V(X)V(X) to V(Y)V(Y), where two bijections are adjacent if and only if they differ on two adjacent elements of V(X)V(X) with images adjacent in YY. We study the diameters of connected components of friends-and-strangers graphs: the diameter of a component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) corresponds to the largest number of swaps necessary to go from one configuration in the component to another. We show that any component of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) has O(n2)O(n^{2}) diameter and that any component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) has O(n4)O(n^{4}) diameter, improvable to O(n3)O(n^{3}) whenever 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected. These results address an open problem posed by Defant and Kravitz. Using an explicit construction, we show that there exist nn-vertex graphs XX and YY such that 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) has a component with eΩ(n)e^{\Omega(n)} diameter. This answers a question raised by Alon, Defant, and Kravitz in the negative. As a corollary, we observe that for such XX and YY, the lazy random walk on this component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) has eΩ(n)e^{\Omega(n)} mixing time. This result deviates from related classical theorems regarding rapidly mixing Markov chains and makes progress on another open problem of Alon, Defant, and Kravitz. We conclude with several suggestions for future research.

1. Introduction

1.1. Background and Motivation

Let XX and YY be nn-vertex simple graphs. Interpret the vertices of XX as positions, and the vertices of YY as people: say two people in the vertex set of YY are friends if they are adjacent and strangers if they are not. Each person picks a position to stand on, yielding a starting configuration. From here, at any point in time, two friends standing on adjacent positions may switch places: we call this operation a friendly swap. From the initial configuration, say the nn people have a final configuration in mind, and they know it can be reached from the initial configuration by some sequence of friendly swaps. What is the worst-case (over pairs of starting and final configurations) number of friendly swaps that is necessary in order for the nn people to achieve the final configuration from the starting configuration?

We may formalize the problem using the following definition.

Definition 1.1 ([DK21]).

Let XX and YY be simple graphs on nn vertices. The friends-and-strangers graph of XX and YY, denoted 𝖥𝖲(X,Y)\mathsf{FS}(X,Y), is a graph with vertices consisting of all bijections from V(X)V(X) to V(Y)V(Y), with bijections σ,τ𝖥𝖲(X,Y)\sigma,\tau\in\mathsf{FS}(X,Y) adjacent if and only if there exists an edge {a,b}\{a,b\} in XX such that

  1. (1)

    {σ(a),σ(b)}E(Y)\{\sigma(a),\sigma(b)\}\in E(Y),

  2. (2)

    σ(a)=τ(b),σ(b)=τ(a)\sigma(a)=\tau(b),\ \sigma(b)=\tau(a),

  3. (3)

    σ(c)=τ(c)\sigma(c)=\tau(c) for all cV(X){a,b}c\in V(X)\setminus\{a,b\}.

In other words, σ\sigma and τ\tau differ precisely on two adjacent vertices of XX whose images under σ\sigma (and τ\tau) are adjacent in YY. For any such bijections σ,τ\sigma,\tau, we say that τ\tau is achieved from σ\sigma by an (X,Y)(X,Y)-friendly swap.

Refer to caption
(a) The graph XX.
Refer to caption
(b) The graph YY.
Refer to caption
(c) A sequence of (X,Y)(X,Y)-friendly swaps. The transpositions between adjacent configurations denote the two vertices in XX over which the (X,Y)(X,Y)-friendly swap takes place. Red text corresponds to vertices in YY placed upon vertices of XX, in black text: using colored text for vertices in YY to distinguish them from vertices in XX in black text will be a convention throughout the work. The leftmost configuration corresponds to the bijection σ\sigma in the vertex set of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) that maps σ(x1)=y1\sigma(x_{1})=y_{1}, σ(x2)=y5\sigma(x_{2})=y_{5}, σ(x3)=y3\sigma(x_{3})=y_{3}, σ(x4)=y4\sigma(x_{4})=y_{4}, and σ(x5)=y2\sigma(x_{5})=y_{2}. The other configurations correspond analogously to vertices in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y).
Figure 1. A sequence of (X,Y)(X,Y)-friendly swaps in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) for the 55-vertex graphs XX and YY. Configurations in the bottom row correspond to vertices in V(𝖥𝖲(X,Y))V(\mathsf{FS}(X,Y)). Two consecutive configurations differ by an (X,Y)(X,Y)-friendly swap, so the corresponding vertices are adjacent in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y).

See Figure 1 for an illustration of Definition 1.1 on five-vertex graphs. Defant and Kravitz [DK21], in addition to introducing the framework of friends-and-strangers graphs, described the connected components of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) and 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) in terms of the acyclic orientations of Y¯\overline{Y} (the complement of YY), and determined both necessary conditions and sufficient conditions for 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) to be connected. In a different paper [Jeo22], we extend their results: [DK21, Corollary 4.14] states that 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected if and only if Y¯\overline{Y} is a forest with trees of jointly coprime sizes, and we establish that if XX is biconnected (i.e., connected and with no cut vertex) and YY is a graph for which 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected, then 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is connected, settling [DK21, Conjecture 7.1]. In [Jeo22], we also initiate the study of the girth of friends-and-strangers graphs. Motivated by [KMS84] and connections to molecular programming as seen in [Bra+19], the framework of friends-and-strangers was later generalized by [Mil23] to permit for multiplicities onto vertices, in which many of the main results of [DK21, Wil74] were also generalized accordingly.

A central objective in the study of friends-and-strangers graphs is to determine necessary and sufficient conditions for their connectivity. Indeed, 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) being connected corresponds exactly to the property that one can go between any two configurations in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) via some sequence of (X,Y)(X,Y)-friendly swaps. Of course, the conditions one may derive will depend upon the assumptions on XX and YY under which one works. If one elects to proceed under a regime in which 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) cannot be connected (such as when XX and YY are both bipartite; see the discussion around [DK21, Proposition 2.7] and [ADK23, Subsection 2.3] for a parity obstruction which demonstrates why this is the case), one may instead study how small the number of connected components may be under this regime, and the natural question here is to ask for further conditions on XX and YY ensuring that 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) achieves the smallest possible number of connected components. As pursued in [DK21, Sections 3 and 4] for (respectively) paths and cycles, one direction of inquiry is to fix (without loss of generality, as we will see in Proposition 2.3(1)) XX to be some particular graph, and study the structure of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) for arbitrary YY: see [Def+22, Lee22, WC23, Wil74, Zhu23]. It is also very natural to ask extremal and probabilistic questions concerning the connectivity of friends-and-strangers graphs, such as minimum degree conditions on XX and YY which ensure that 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is connected or for threshold probabilities on Erdős-Rényi random graphs X,YX,Y regarding the connectivity of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y): see [ADK23, Ban22, Jeo23, Mil23, WLC23].

The setup proposed by Definition 1.1 is quite general. Indeed, friends-and-strangers graphs serve both as a common natural generalization of many classical combinatorial objects and as a framework which embodies many important problems in discrete mathematics and theoretical computer science. We illustrate this claim with a non-exhaustive listing of relevant examples. The graph 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}) is the Cayley graph of the symmetric group on the vertex set of XX generated by the transpositions corresponding to the edges of XX; we refer the reader to [DK21] and the references therein for a comprehensive discussion regarding the relevance of friends-and-strangers graphs within algebraic combinatorics. Letting XX be the 44-by-44 grid and YY a star graph, studying 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is equivalent to studying the configurations and moves that can be performed on the famous 1515-puzzle (with the central vertex of the star graph corresponding to the empty tile); see [BK23, DR18, Par15, Yan11] for similar inquiries of a recreational flavor. The works [Naa00, Sta08] both study the structure of the graph 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) under certain restrictions on YY, while the works [BR99, Rei98] utilize 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) to investigate the acyclic orientations of Y¯\overline{Y}. Asking if XX and YY pack [BE78, KO09, SS78, Yap88, Yus07] in the graph packing literature is equivalent to asking if there exists an isolated vertex in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y). Studying the token swapping problem [Aic+22, Bin+23, BMR18, Mil+16, Yam+15] on the graph XX is equivalent to studying distances between configurations in 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}). Finally, as we will briefly touch upon in Subsection 4.4, the interchange process on the graph XX [AF02, AK13, Ang03, BD06, CLR10, ES23, Ham15, HS21, Sch05] can be phrased in terms of (continuous-time) random walks on 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}).

1.2. Main Results and Organization

Unlike the existing body of work that studies the connectivity of friends-and-strangers graphs, the present paper initiates the study of their diameters, corresponding to the length of the “longest shortest path,” with lengths of shortest paths evaluated over all pairs of vertices. Indeed, the diameter of a connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) corresponds to the largest number of (X,Y)(X,Y)-friendly swaps necessary to achieve one configuration in the component from another. In a more recreational tone, if we think of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) as a generalized 1515-puzzle, we are asking for the longest solution length for any solvable puzzle involving “board XX and rules YY.” The works [ADK23, DK21] both posed the following question, which asks whether the distance between any two configurations in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is polynomial in the size of XX and YY.

Question 1.2 ([ADK23, DK21]).

Does there exist an absolute constant C>0C>0 such that for all nn-vertex graphs XX and YY, every connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) has diameter at most nCn^{C}?

In Section 2, we introduce some background that we shall need later in the work. Before tackling the more global Question 1.2, in Section 3, we fix (without loss of generality) XX to be a complete, path, or cycle graph, and derive upper bounds on the maximum diameter of a component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) in each setting. Our results on paths and cycles address an open problem posed in [DK21, Subsection 7.3]. Furthermore, the discussion therein suggests that one must restrict their attention to rather contrived choices of graphs XX and YY in order for 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) to have a component with diameter that is superpolynomial in the size of XX and YY, suggesting that Question 1.2 may be challenging to settle via constructive means if it holds in the negative.

In Section 4, we establish the main result of this article, Theorem 1.3, which answers Question 1.2 in the negative. We prove this theorem by constructing, for all integers L1L\geq 1, graphs XLX_{L} and YLY_{L} on the same number of vertices: see Figure 2 for a schematic diagram of the construction for L=3L=3. The construction is such that the number of vertices of XLX_{L} and YLY_{L} is Θ(L)\Theta(L), and there exist two configurations σs,σfV(𝖥𝖲(XL,YL))\sigma_{s},\sigma_{f}\in V(\mathsf{FS}(X_{L},Y_{L})) which lie in the same connected component 𝒞\mathscr{C} of 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}) and for which the distance between σs\sigma_{s} and σf\sigma_{f} is eΩ(n)e^{\Omega(n)}.

Theorem 1.3.

For all n2n\geq 2, there exist nn-vertex graphs XX and YY such that 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) has a connected component with diameter eΩ(n)e^{\Omega(n)}.

Refer to caption
(a) The graph X3X_{3}.
Refer to caption
(b) The graph Y3Y_{3}.
Figure 2. The graphs X3X_{3} and Y3Y_{3}.

At the end of Section 4, we briefly discuss implications of Theorem 1.3 to the study of random walks on friends-and-strangers graphs, and deduce what might be thought of as the natural stochastic analogue of Theorem 1.3. Random walks on friends-and-strangers graphs model a variant of the interchange process where we may prohibit certain pairs of particles from swapping positions. This result contrasts many classical theorems regarding rapidly mixing Markov chains, all of which may be readily rewritten using the language of friends-and-strangers graphs, and makes progress on another open problem posed in [ADK23, Section 7]. We conclude the work with Section 5, which suggests several open problems and directions for future research.

1.3. Notation

In this article, unless stated otherwise, we assume that all graphs are simple. We employ standard asymptotic notation in this paper. Unless stated otherwise, all asymptotic notation in this paper will be with respect to nn. Purely for the sake of completeness, we state the following standard notation.

  • The vertex and edge sets of a graph GG are denoted by V(G)V(G) and E(G)E(G), respectively.

  • The complement of the graph GG is denoted G¯\overline{G}.

  • The statement that GG and HH are isomorphic is written as GHG\cong H.

  • For a subset SV(G)S\subset V(G), we let G|SG|_{S} denote the induced subgraph of GG with vertex set SS.

  • The open neighborhood of vV(G)v\in V(G), which is the collection of all neighbors of vv, is denoted by NG(v)N_{G}(v). The closed neighborhood of vV(G)v\in V(G) is denoted NG[v]=NG(v){v}N_{G}[v]=N_{G}(v)\cup\{v\}. For a subset of vertices SV(G)S\subseteq V(G), we let

    NG(S)=vSNG(v),\displaystyle N_{G}(S)=\bigcup_{v\in S}N_{G}(v), NG[S]=vSNG[v].\displaystyle N_{G}[S]=\bigcup_{v\in S}N_{G}[v].
  • The disjoint of a collection of graphs {Gi}iI\{G_{i}\}_{i\in I}, notated iIGi\bigoplus_{i\in I}G_{i}, is the graph with vertex set iIV(Gi)\bigsqcup_{i\in I}V(G_{i}) and edge set iIE(Gi)\bigsqcup_{i\in I}E(G_{i}). This readily extends to expressing a graph as the disjoint union of its connected components.

  • The distance d(v,w)d(v,w) between v,wV(G)v,w\in V(G) is the length of the shortest path from vv to ww. The diameter of a component 𝒞\mathscr{C} of GG is maxv,wV(𝒞)d(v,w)\max_{v,w\in V(\mathscr{C})}d(v,w).

Graphs with vertex set [n]:={1,,n}[n]\mathrel{\mathop{\ordinarycolon}}=\{1,\dots,n\} that will be relevant later are

  • the complete graph KnK_{n}, with E(Kn):={{i,j}:{i,j[n],ij}}E(K_{n})\mathrel{\mathop{\ordinarycolon}}=\{\{i,j\}\mathrel{\mathop{\ordinarycolon}}\{i,j\in[n],\ i\neq j\}\};

  • the complete bipartite graph Ki,jK_{i,j}, with E(Ki,j):={{v1,v2}:v1[i],v2{i+1,,i+j}}E(K_{i,j})\mathrel{\mathop{\ordinarycolon}}=\{\{v_{1},v_{2}\}\mathrel{\mathop{\ordinarycolon}}v_{1}\in[i],v_{2}\in\{i+1,\dots,i+j\}\}, which naturally partitions V(Ki,j)V(K_{i,j}) into two sets (henceforth called partite sets);

  • the path graph Pathn\textsf{{Path}}_{n}, with E(Pathn):={{i,i+1}:i[n1]}E(\textsf{{Path}}_{n})\mathrel{\mathop{\ordinarycolon}}=\{\{i,i+1\}\mathrel{\mathop{\ordinarycolon}}i\in[n-1]\};

  • the cycle graph Cyclen\textsf{{Cycle}}_{n}, with E(Cyclen):={{i,i+1}:i[n1]}{{n,1}}E(\textsf{{Cycle}}_{n})\mathrel{\mathop{\ordinarycolon}}=\{\{i,i+1\}\mathrel{\mathop{\ordinarycolon}}i\in[n-1]\}\cup\{\{n,1\}\};

  • the star graph Starn:=K1,n1\textsf{{Star}}_{n}\mathrel{\mathop{\ordinarycolon}}=K_{1,n-1}.

2. Background

In this section, we introduce some background and summarize results from prior work that will be relevant later in the present paper, particularly in Section 3. Throughout this section and Section 3, we will assume that the vertex set of all graphs is [n][n], with edge sets as in Subsection 1.3. Note that if both V(X)V(X) and V(Y)V(Y) are [n][n], then the vertices of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) are the elements of 𝔖n\mathfrak{S}_{n}, the symmetric group of degree nn.

2.1. Acyclic Orientations

An orientation of a graph GG is an assignment of a direction to every edge of GG, and an acyclic orientation of GG is an orientation with no directed cycles. Denote the set of all acyclic orientations of GG by Acyc(G)\textsf{{Acyc}}(G). We will be interested in operations on acyclic orientations of GG called flips and double-flips, as defined in [DK21]. Notably, it was shown in [DK21, Theorem 4.7] that double-flips on acyclic orientations in Acyc(Y¯)\textsf{{Acyc}}(\overline{Y}) are paramount in describing the connected components of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y).

Letting αAcyc(G)\alpha\in\textsf{{Acyc}}(G), converting a source of α\alpha into a sink or a sink of α\alpha into a source by reversing the directions of all its incident edges results in another acyclic orientation α\alpha^{\prime} of GG. We call such an operation a flip, and we say that α\alpha and α\alpha^{\prime} are flip equivalent, denoted αα\alpha\sim\alpha^{\prime}. In the literature, the equivalence classes in Acyc(G)/\textsf{{Acyc}}(G)/\!\!\sim are called toric acyclic orientations; we refer the interested reader to [Che10, DMR16, MM11, Pre86, Spe09] for related reading. We will further say that we perform an inflip on α\alpha if we convert a source into a sink (the direction of all incident edges “go into” the new sink), and an outflip if we convert a sink into a source.111In particular, we may apply these operations to isolated vertices.

Similarly, flipping a nonadjacent source and sink of α\alpha into (respectively) a sink and a source results in another acyclic orientation α′′\alpha^{\prime\prime} of GG: we call such an operation a double-flip, and we say α\alpha and α′′\alpha^{\prime\prime} are double-flip equivalent, denoted αα′′\alpha\approx\alpha^{\prime\prime}. It is easy to show that \sim and \approx are equivalence relations on Acyc(G)\textsf{{Acyc}}(G). We denote the set of double-flip equivalence classes of Acyc(G)\textsf{{Acyc}}(G) by Acyc(G)/\textsf{{Acyc}}(G)/\!\!\approx, and denote the double-flip equivalence class for which α\alpha is a representative by [α][\alpha]_{\approx}.

Assume V(G)=[n]V(G)=[n], and take αAcyc(G)\alpha\in\textsf{{Acyc}}(G). Associated to the acyclic orientation α\alpha is a poset Pα=([n],α)P_{\alpha}=([n],\leq_{\alpha}), where iαji\leq_{\alpha}j if and only if there exists a directed path from ii to jj in α\alpha. We define a linear extension of PαP_{\alpha} to be any permutation σ𝔖n\sigma\in\mathfrak{S}_{n} such that σ1(i)σ1(j)\sigma^{-1}(i)\leq\sigma^{-1}(j) whenever iαji\leq_{\alpha}j. We let (α)\mathcal{L}(\alpha) denote the collection of linear extensions of PαP_{\alpha}. For any σ𝔖n\sigma\in\mathfrak{S}_{n}, it is not hard to see that there exists a unique acyclic orientation αG(σ)Acyc(G)\alpha_{G}(\sigma)\in\textsf{{Acyc}}(G) for which σ(αG(σ))\sigma\in\mathcal{L}(\alpha_{G}(\sigma)), and that this acyclic orientation is the result of directing each edge {i,j}E(G)\{i,j\}\in E(G) from ii to jj if and only if σ1(i)<σ1(j)\sigma^{-1}(i)<\sigma^{-1}(j). It is also not hard to see that the poset PαP_{\alpha} associated to αAcyc(G)\alpha\in\textsf{{Acyc}}(G) has a linear extension (e.g., for i[n]i\in[n], we can construct a linear extension σ\sigma by setting σ1(i)\sigma^{-1}(i) to be a source of α\alpha, then removing the source and all incident edges from α\alpha; in an abuse of notation,222We will commit similar abuses of notation in Section 3. They should not raise any confusion when invoked. we understand α\alpha here as being mutated over the course of this greedy algorithm). We write

([α])=α^[α](α^).\displaystyle\mathcal{L}([\alpha]_{\approx})=\bigsqcup_{\hat{\alpha}\in[\alpha]_{\approx}}\mathcal{L}(\hat{\alpha}).

We refer the reader to [DK21, Section 4] for a more comprehensive discussion regarding why these notions are of importance in the study of friends-and-strangers graphs (though this is illuminated in passing in Subsection 2.2 and in the arguments of Section 3).

For a graph GG and acyclic orientation αAcyc(G)\alpha\in\textsf{{Acyc}}(G), we can partition the directed edges of any cycle subgraph 𝒞\mathcal{C} of GG into 𝒞α\mathcal{C}_{\alpha}^{-} and 𝒞α+\mathcal{C}_{\alpha}^{+}, corresponding to edges directed in one of two possible directions under α\alpha in 𝒞\mathcal{C}. The article [Pre86] studied precisely when an acyclic orientation could be reached from another by a sequence of inflips or outflips, while [Pro21] extends this result by providing an upper bound on the number of inflips or outflips necessary to reach α\alpha from α\alpha^{\prime} whenever αα\alpha\sim\alpha^{\prime}.

Lemma 2.1 ([Pre86, Pro21]).

For α,αAcyc(G)\alpha,\alpha^{\prime}\in\textsf{{Acyc}}(G), α\alpha^{\prime} can be reached from α\alpha by a sequence of inflips if and only if for every cycle subgraph 𝒞\mathcal{C} of GG, |𝒞α|=|𝒞α||\mathcal{C}_{\alpha}^{-}|=|\mathcal{C}_{\alpha^{\prime}}^{-}|. Furthermore, whenever this is the case, α\alpha^{\prime} can be reached from α\alpha by a sequence of at most (n2)\binom{n}{2} inflips. Similarly, α\alpha^{\prime} can be reached from α\alpha by a sequence of outflips if and only if for every cycle subgraph 𝒞\mathcal{C} of GG, |𝒞α|=|𝒞α||\mathcal{C}_{\alpha}^{-}|=|\mathcal{C}_{\alpha^{\prime}}^{-}|. Furthermore, whenever this is the case, α\alpha^{\prime} can be reached from α\alpha by a sequence of at most (n2)\binom{n}{2} outflips.

We build on Lemma 2.1. The following proposition establishes that we could have defined flip equivalence strictly with respect to inflips or outflips, as this would have resulted in the same notion.

Proposition 2.2.

Acyclic orientations α,αAcyc(G)\alpha,\alpha^{\prime}\in\textsf{{Acyc}}(G) are flip equivalent if and only if α\alpha^{\prime} can be reached from α\alpha by a sequence of inflips. Similarly, αα\alpha\sim\alpha^{\prime} if and only if α\alpha^{\prime} can be reached from α\alpha by a sequence of outflips.

Proof.

The statement that α\alpha^{\prime} is reachable from α\alpha via a sequence of inflips (or outflips) implying αα\alpha\sim\alpha^{\prime} is immediate. To prove the converse, notice that for any cycle subgraph 𝒞\mathcal{C} of GG and acyclic orientations α,αAcyc(G)\alpha,\alpha^{\prime}\in\textsf{{Acyc}}(G) for which α\alpha^{\prime} can be reached from α\alpha by a flip, |𝒞α|=|𝒞α||\mathcal{C}_{\alpha}^{-}|=|\mathcal{C}_{\alpha^{\prime}}^{-}|. Thus, if αα\alpha\sim\alpha^{\prime}, then |𝒞α|=|𝒞α||\mathcal{C}_{\alpha}^{-}|=|\mathcal{C}_{\alpha^{\prime}}^{-}|, so α\alpha^{\prime} can be reached from α\alpha via a sequence of inflips (or outflips). ∎

2.2. Background on Friends-and-Strangers Graphs

We mention those general properties of friends-and-strangers graphs that we will need later in the article. We refer the reader to [DK21, Section 2] for a thorough treatment of the general properties of friends-and-strangers graphs.

Proposition 2.3 ([DK21, Proposition 2.6]).

The following properties hold.

  1. (1)

    Definition 1.1 is symmetric with respect to XX and YY: we have that 𝖥𝖲(X,Y)𝖥𝖲(Y,X)\mathsf{FS}(X,Y)\cong\mathsf{FS}(Y,X).

  2. (2)

    The graph 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is bipartite.

  3. (3)

    If XX or YY is disconnected, or if XX and YY are connected graphs on n3n\geq 3 vertices and each have a cut vertex, then 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is disconnected.

The definitions concerning acyclic orientations that were introduced in Subsection 2.1 were observed in [DK21] to be central in describing the structure of the connected components of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) and 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y), which are the graphs we will be interested in during Section 3. Specifically, we have the following theorems.

Theorem 2.4 ([DK21, Theorem 3.1]).

Let αAcyc(Y¯)\alpha\in\textsf{{Acyc}}(\overline{Y}). Take any linear extension σ(α)\sigma\in\mathcal{L}(\alpha), and let HαH_{\alpha} denote the connected component of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) which contains σ\sigma. Then

𝖥𝖲(Pathn,Y)=αAcyc(Y¯)Hα\displaystyle\mathsf{FS}(\textsf{{Path}}_{n},Y)=\bigoplus_{\alpha\in\textsf{{Acyc}}(\overline{Y})}H_{\alpha}

and V(Hα)=(α)V(H_{\alpha})=\mathcal{L}(\alpha). In particular, HαH_{\alpha} is independent of the choice of σ\sigma.

Theorem 2.5 ([DK21, Theorem 4.7]).

Let αAcyc(Y¯)\alpha\in\textsf{{Acyc}}(\overline{Y}). Take any linear extension σ([α])\sigma\in\mathcal{L}([\alpha]_{\approx}), and let H[α]H_{[\alpha]_{\approx}} denote the connected component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) which contains σ\sigma. Then

𝖥𝖲(Cyclen,Y)=[α]Acyc(Y¯)/H[α]\displaystyle\mathsf{FS}(\textsf{{Cycle}}_{n},Y)=\bigoplus_{[\alpha]_{\approx}\in\textsf{{Acyc}}(\overline{Y})/\approx}H_{[\alpha]_{\approx}}

and V(H[α])=([α])V(H_{[\alpha]_{\approx}})=\mathcal{L}([\alpha]_{\approx}). In particular, H[α]H_{[\alpha]_{\approx}} is independent of the choice of σ\sigma.

Defant and Kravitz [DK21] also determined precisely when 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected. The coprimality condition on the sizes of the components of Y¯\overline{Y} in Theorem 2.6 may seem surprising at first glance. We refer the reader to the discussion around [DK21, Corollary 4.12] and [DK21, Corollary 4.14] to see where this condition emerges and why it is a natural one.

Theorem 2.6 ([DK21, Corollary 4.14]).

Let YY be a graph on n3n\geq 3 vertices. Then 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected if and only if Y¯\overline{Y} is a forest with trees 𝒯1,,𝒯r\mathcal{T}_{1},\dots,\mathcal{T}_{r} such that gcd(|V(𝒯1)|,,|V(𝒯r)|)=1\gcd(|V(\mathcal{T}_{1})|,\dots,|V(\mathcal{T}_{r})|)=1.

3. Diameters of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) with One Graph Fixed

Before investigating (and settling) the more global question of whether or not the diameters of connected components of friends-and-strangers graphs are polynomially bounded (in the sense posed by Question 1.2), we begin by restricting our study by choosing one of the two graphs XX and YY to come from a natural family of graphs, and then establish bounds on the diameter of any connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y).

3.1. Complete Graphs

We begin by setting Y=KnY=K_{n}. Take any two configurations σ,τV(𝖥𝖲(X,Kn))\sigma,\tau\in V(\mathsf{FS}(X,K_{n})) that lie in the same connected component. Consider the following iterative algorithm, applied starting from σ\sigma and proceeding sequentially on i[n]i\in[n]. In an abuse of notation, σ\sigma is understood to be mutated over the course of this algorithm as we perform (X,Kn)(X,K_{n})-friendly swaps to modify its mappings.

  1. (1)

    If σ(i)=τ(i)\sigma(i)=\tau(i), do nothing.

  2. (2)

    If σ(i)τ(i)\sigma(i)\neq\tau(i), swap τ(i)\tau(i) onto ii along a simple path, then swap σ(i)\sigma(i) back along the simple path that τ(i)\tau(i) traversed.

It is straightforward to prove via induction that at the beginning of any iteration i[n]i\in[n], σ(i)\sigma(i) and τ(i)\tau(i) lie upon the same connected component of XX (so that the algorithm may always proceed), and that σ(j)=τ(j)\sigma(j)=\tau(j) for all j<ij<i. Thus, σ=τ\sigma=\tau when the algorithm terminates after n1n-1 iterations (it must be that σ(n)=τ(n)\sigma(n)=\tau(n) at the beginning of the nnth iteration). For any iteration i[n]i\in[n], step (2) requires at most n1n-1 (X,Kn)(X,K_{n})-friendly swaps to move τ(i)\tau(i) onto ii, and at most n2n-2 (X,Kn)(X,K_{n})-friendly swaps to move σ(i)\sigma(i) back. This establishes that the diameter of any component of 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}) is therefore at most (n1)((n1)+(n2))=2n25n+3=O(n2)(n-1)((n-1)+(n-2))=2n^{2}-5n+3=O(n^{2}).

Finding the exact distance between two configurations in 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}) is known as the token swapping problem on XX in the theoretical computer science literature. The O(n2)O(n^{2}) bound on the diameter of any component of 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}) is well known, and we also have a bound of Ω(n2)\Omega(n^{2}) on the diameter of any component of 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}) for particular choices of XX (e.g., see Remark 3.4). In general, computing exact distances between two configurations in 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}), as well as the diameters of its connected components, is challenging, even when imposing additional assumptions on XX (e.g., see [Bin+23, Yam+15]). There do exist, however, exact polynomial-time algorithms which solve the token swapping problem for a number of choices of XX, including cliques [Cay49], paths [Jer85], stars [PV90], cycles [KSY19, van+16], and complete bipartite graphs [Yam+15]. See Subsection 5.5 for additional discussion regarding matters of hardness and approximation.

3.2. Path Graphs

In this subsection, we fix X=PathnX=\textsf{{Path}}_{n}. We begin by introducing a notion which will serve as a monovariant in the proof of Proposition 3.2.

Definition 3.1.

For σ,τ𝔖n\sigma,\tau\in\mathfrak{S}_{n}, call the ordered pair (i,j)(i,j) (i,j[n]i,j\in[n], i<ji<j) a (σ,τ)(\sigma,\tau)-inversion if either

  1. (1)

    σ1(i)<σ1(j)\sigma^{-1}(i)<\sigma^{-1}(j) and τ1(j)<τ1(i)\tau^{-1}(j)<\tau^{-1}(i),

  2. (2)

    σ1(j)<σ1(i)\sigma^{-1}(j)<\sigma^{-1}(i) and τ1(i)<τ1(j)\tau^{-1}(i)<\tau^{-1}(j).

Denote the number of (σ,τ)(\sigma,\tau)-inversions by inv(σ,τ)\textup{inv}(\sigma,\tau).

In other words, the ordered pair (i,j)(i,j) is a (σ,τ)(\sigma,\tau)-inversion if the relative ordering of the inverse images of i,ji,j under σ\sigma is opposite that of τ\tau. If (without loss of generality) τ\tau is the identity permutation, then inv(σ,τ)=inv(σ)\textup{inv}(\sigma,\tau)=\textup{inv}(\sigma), the number of inversions of σ\sigma. It also follows immediately that inv(σ,τ)=0\textup{inv}(\sigma,\tau)=0 if and only if σ=τ\sigma=\tau.

Proposition 3.2.

Take αAcyc(Y¯)\alpha\in\textsf{{Acyc}}(\overline{Y}), and let HαH_{\alpha} denote the corresponding connected component of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y). Let Pα=([n],α)P_{\alpha}=([n],\leq_{\alpha}) be the poset on [n][n] for which iαji\leq_{\alpha}j if and only if there exists a directed path from ii to jj in Y¯\overline{Y} under α\alpha. Then diam(Hα)(n2)pα\textup{diam}(H_{\alpha})\leq\binom{n}{2}-p_{\alpha}, where pαp_{\alpha} denotes the number of comparable ordered pairs (i,j)(i,j) with i,j[n]i,j\in[n], i<ji<j in PαP_{\alpha}.

Proof.

We will show for any σ,τV(Hα)\sigma,\tau\in V(H_{\alpha}) that d(σ,τ)=inv(σ,τ)d(\sigma,\tau)=\textup{inv}(\sigma,\tau). Any (Pathn,Y)(\textsf{{Path}}_{n},Y)-friendly swap reduces the number of (σ,τ)(\sigma,\tau)-inversions by at most one, so d(σ,τ)inv(σ,τ)d(\sigma,\tau)\geq\textup{inv}(\sigma,\tau). Now consider the following variant of the bubble sort algorithm, which we perform beginning from σ=σ(1)σ(2)σ(n)\sigma=\sigma(1)\sigma(2)\cdots\sigma(n). Say σ(i1)=τ(1)\sigma(i_{1})=\tau(1), and swap σ(i1)\sigma(i_{1}) down to position 11, yielding σ1\sigma_{1} with σ1(1)=τ(1)\sigma_{1}(1)=\tau(1). Now, say σ(i2)=τ(2)\sigma(i_{2})=\tau(2) (with i22i_{2}\geq 2), and swap σ(i2)\sigma(i_{2}) down to position 22, yielding σ2\sigma_{2} with σ2(j)=τ(j)\sigma_{2}(j)=\tau(j) for j[2]j\in[2]; continue until we achieve σn=τ\sigma_{n}=\tau. It is immediate that the execution of any swap performed during this algorithm would decrement inv(σ,τ)\textup{inv}(\sigma,\tau) by 11. Furthermore, any proposed swap in this algorithm can be executed, i.e., involves two elements which comprise an edge in YY. Indeed, Theorem 2.4 yields σ,τV(Hα)=(α)\sigma,\tau\in V(H_{\alpha})=\mathcal{L}(\alpha), but the existence of a swap in this algorithm that cannot be executed would yield αY¯(σ)αY¯(τ)\alpha_{\overline{Y}}(\sigma)\neq\alpha_{\overline{Y}}(\tau) (if the proposed swap fails to be an edge in YY, it is an edge in Y¯\overline{Y}, and would be directed in opposite directions under αY¯(σ)\alpha_{\overline{Y}}(\sigma) and αY¯(τ)\alpha_{\overline{Y}}(\tau) because the two elements comprising the swap constitute a (σ,τ)(\sigma,\tau)-inversion), which is a contradiction. Thus, d(σ,τ)=inv(σ,τ)d(\sigma,\tau)=\textup{inv}(\sigma,\tau). If (i,j)Pα(i,j)\in P_{\alpha}, it follows from σ,τ(α)\sigma,\tau\in\mathcal{L}(\alpha) that σ1(i)<σ1(j)\sigma^{-1}(i)<\sigma^{-1}(j) and τ1(i)<τ1(j)\tau^{-1}(i)<\tau^{-1}(j), so (i,j)(i,j) is not a (σ,τ)(\sigma,\tau)-inversion. Thus, d(σ,τ)=inv(σ,τ)(n2)pαd(\sigma,\tau)=\textup{inv}(\sigma,\tau)\leq\binom{n}{2}-p_{\alpha}, and therefore diam(Hα)(n2)pα\textup{diam}(H_{\alpha})\leq\binom{n}{2}-p_{\alpha}. ∎

Certainly, the two vertices incident to an edge of Y¯\overline{Y} are comparable in the poset Pα=([n],α)P_{\alpha}=([n],\leq_{\alpha}) for any αAcyc(Y¯)\alpha\in\textsf{{Acyc}}(\overline{Y}). This yields the following statement, as (n2)pα(n2)|E(Y¯)|=|E(Y)|\binom{n}{2}-p_{\alpha}\leq\binom{n}{2}-|E(\overline{Y})|=|E(Y)|. For simplicity, we appeal to Theorem 3.3, rather than Proposition 3.2, in forthcoming arguments.

Theorem 3.3.

The diameter of any connected component of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) is at most |E(Y)||E(Y)|.

Remark 3.4.

It is not hard to see that 𝖥𝖲(Pathn,Kn)\mathsf{FS}(\textsf{{Path}}_{n},K_{n}) is connected (e.g., for any two configurations σ,τV(𝖥𝖲(Pathn,Kn))\sigma,\tau\in V(\mathsf{FS}(\textsf{{Path}}_{n},K_{n})), the algorithm from Subsection 3.1 yields a path between σ\sigma and τ\tau). From the proof of Proposition 3.2, we have for any σ,τV(𝖥𝖲(Pathn,Kn))\sigma,\tau\in V(\mathsf{FS}(\textsf{{Path}}_{n},K_{n})) that d(σ,τ)=inv(σ,τ)(n2)d(\sigma,\tau)=\textup{inv}(\sigma,\tau)\leq\binom{n}{2}, and inv(σ,τ)=(n2)\textup{inv}(\sigma,\tau)=\binom{n}{2} when τ\tau is the “reverse” of σ\sigma (i.e., τ(i)=σ(ni+1)\tau(i)=\sigma(n-i+1) for all i[n]i\in[n]). So diam(𝖥𝖲(Pathn,Kn))=(n2)\textup{diam}(\mathsf{FS}(\textsf{{Path}}_{n},K_{n}))=\binom{n}{2}. Combined with Proposition 3.2, this establishes that the maximum diameter of a component of 𝖥𝖲(Pathn,Kn)\mathsf{FS}(\textsf{{Path}}_{n},K_{n}) is Ω(n2)\Omega(n^{2}), and thus Θ(n2)\Theta(n^{2}). Thus, there exist families of nn-vertex graphs YY for which the maximum diameter of a component of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) has diameter Θ(n2)\Theta(n^{2}). The same can be said for 𝖥𝖲(Kn,Y)\mathsf{FS}(K_{n},Y). ∎

Remark 3.5.

The upper bound of (n2)pα\binom{n}{2}-p_{\alpha} on diam(Hα)\textup{diam}(H_{\alpha}) in Theorem 3.2 corresponds to the number of ordered pairs (i,j)(i,j) (i<ji<j; i,j[n]i,j\in[n]) that are incomparable in the poset Pα=([n],α)P_{\alpha}=([n],\leq_{\alpha}). It follows from the proof of Proposition 3.2 that for arbitrary σ,τV(Hα)=(α)\sigma,\tau\in V(H_{\alpha})=\mathcal{L}(\alpha), any (σ,τ)(\sigma,\tau)-inversion must be a pair of incomparable elements in PαP_{\alpha}, and d(σ,τ)=inv(σ,τ)d(\sigma,\tau)=\textup{inv}(\sigma,\tau). We now apply these observations to show that the upper bound on diam(Hα)\textup{diam}(H_{\alpha}) fails to be sharp: Figure 3 provides an illustration of our construction. For n=6n=6, consider the graph shown in Figure 3a, whose complement is shown in Figure 3b. We will take αAcyc(Y¯)\alpha\in\textsf{{Acyc}}(\overline{Y}) to be an acyclic orientation for which the edges in this connected component are oriented as in Figure 3c.

Refer to caption
(a) The graph YY.
Refer to caption
(b) The graph Y¯\overline{Y}.
Refer to caption
(c) Direction of the edges in this component under α\alpha.
Figure 3. The construction we used to show that the bound given in Proposition 3.2 fails to be sharp in general.

Assume (towards a contradiction) that there exist σ,τV(Hα)=(α)\sigma,\tau\in V(H_{\alpha})=\mathcal{L}(\alpha) for which d(σ,τ)=inv(σ,τ)=(n2)pαd(\sigma,\tau)=\textup{inv}(\sigma,\tau)=\binom{n}{2}-p_{\alpha}, so that all pairs of incomparable elements in PαP_{\alpha} are (σ,τ)(\sigma,\tau)-inversions. Any two elements in {1,2,3}\{1,2,3\} are incomparable in PαP_{\alpha}, so the relative ordering of {1,2,3}\{1,2,3\} in σ\sigma must be the relative ordering of {1,2,3}\{1,2,3\} in τ\tau reversed. Without loss of generality, assume σ\sigma has relative ordering 1231\to 2\to 3, so τ\tau has 3213\to 2\to 1. Since σ,τ(α)\sigma,\tau\in\mathcal{L}(\alpha), the element 44 follows vertex 22 in both σ\sigma and τ\tau, so (2,4)(2,4) is not a (σ,τ)(\sigma,\tau)-inversion. But (2,4)(2,4) is incomparable in PαP_{\alpha}, a contradiction. ∎

3.3. Cycle Graphs

In this subsection, we fix X=CyclenX=\textsf{{Cycle}}_{n}. The setting Y=KnY=K_{n} has been studied in the context of circular permutations [Kim16, van+16]. In particular, [Kim16, Procedure 3.6] provides an algorithm that achieves the minimal number of (Cyclen,Kn)(\textsf{{Cycle}}_{n},K_{n})-friendly swaps between any two permutations in 𝔖n\mathfrak{S}_{n}. Extracting these results yields that the diameter of 𝖥𝖲(Cyclen,Kn)\mathsf{FS}(\textsf{{Cycle}}_{n},K_{n}) is n2/4\lfloor n^{2}/4\rfloor. In the spirit of Remark 3.4, it follows that there exist families of nn-vertex graphs YY for which 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) has diameter Θ(n2)\Theta(n^{2}), and it is worth asking what conditions on YY yield that the maximum diameter of a connected component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is at most quadratic in nn. In this direction, we have the following proposition.

Proposition 3.6.

If YY has an isolated vertex or |E(Y)|n2|E(Y)|\leq n-2, then the diameter of any connected component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is at most |E(Y)||E(Y)|.

Proof.

Consider any σ,τV(𝖥𝖲(Cyclen,Y))\sigma,\tau\in V(\mathsf{FS}(\textsf{{Cycle}}_{n},Y)) which lie in the same component. If YY has an isolated vertex vv, then it must be that σ1(v)\sigma^{-1}(v) remains fixed over any sequence of (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps from σ\sigma to τ\tau. Thus, it must be that any path from σ\sigma to τ\tau in 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is a path in

𝖥𝖲(Cyclen|V(Cyclen){σ1(v)},YV(Y){v}),\displaystyle\mathsf{FS}\left(\textsf{{Cycle}}_{n}|_{V(\textsf{{Cycle}}_{n})\setminus\{\sigma^{-1}(v)\}},Y_{V(Y)\setminus\{v\}}\right),

from which the result follows from Theorem 3.3. For the setting |E(Y)|n2|E(Y)|\leq n-2, we will show that any σ,τV(𝖥𝖲(Cyclen,Y))\sigma,\tau\in V(\mathsf{FS}(\textsf{{Cycle}}_{n},Y)) in the same connected component will remain in the same component after removing some edge from Cyclen\textsf{{Cycle}}_{n}, from which the desired result again follows immediately from Theorem 3.3. Assume (towards a contradiction) that every path from σ\sigma to τ\tau in 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) involves a swap over every edge in E(Cyclen)E(\textsf{{Cycle}}_{n}). Consider a shortest path Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} from σ\sigma or τ\tau, which has that σ0=σ\sigma_{0}=\sigma and σλ=τ\sigma_{\lambda}=\tau, and λn\lambda\geq n by the assumption. Consider the subsequence {σi}i=0n1\{\sigma_{i}\}_{i=0}^{n-1} consisting of the first n1n-1 (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps of Σ\Sigma. This must be a shortest path from σ\sigma to σn1\sigma_{n-1} in 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y), and swaps upon at most n1n-1 edges of Cyclen\textsf{{Cycle}}_{n}: say eE(Cyclen)e\in E(\textsf{{Cycle}}_{n}) is an edge upon which a swap does not occur, and let Cyclene\textsf{{Cycle}}_{n}^{-e} be Cyclen\textsf{{Cycle}}_{n} with this edge ee removed. Then {σi}i=0n1\{\sigma_{i}\}_{i=0}^{n-1} is a shortest path from σ\sigma to σn1\sigma_{n-1} in 𝖥𝖲(Cyclene,Y)\mathsf{FS}(\textsf{{Cycle}}_{n}^{-e},Y) with length n1n-1. This contradicts Theorem 3.3, which yields d(σ,σn1)|E(Y)|n2d(\sigma,\sigma_{n-1})\leq|E(Y)|\leq n-2. ∎

We were unable to extend the O(n2)O(n^{2}) bound from Proposition 3.6 to general YY, although we suspect that this is the truth (see Subsection 5.2). However, the existence of a universal constant C>0C>0 such that the maximum diameter of a component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is O(nC)O(n^{C}) remains highly desirable. In conjunction with Theorem 2.6, the following theorem yields such a result whenever 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected.

Theorem 3.7.

Let YY be a graph on n3n\geq 3 vertices, and let n1,,nrn_{1},\dots,n_{r} denote the sizes of the components of Y¯\overline{Y}. If gcd(n1,,nr)=1\gcd(n_{1},\dots,n_{r})=1, then any component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) has diameter at most 4n3+|E(Y)|4n^{3}+|E(Y)|.

Proof.

Certainly, r2r\geq 2. Without loss of generality, we assume that n1nrn_{1}\leq\dots\leq n_{r}, and we denote the corresponding components of Y¯\overline{Y} by Y1¯,,Yr¯\overline{Y_{1}},\dots,\overline{Y_{r}}, respectively. For αAcyc(Y¯)\alpha\in\textsf{{Acyc}}(\overline{Y}), we let αi\alpha_{i} denote the acyclic orientation induced by α\alpha on Yi¯\overline{Y_{i}}. We now fix α,α′′Acyc(Y¯)\alpha,\alpha^{\prime\prime}\in\textsf{{Acyc}}(\overline{Y}) such that αα′′\alpha\approx\alpha^{\prime\prime}. Before studying distances in 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y), we will first bound the number of double-flips necessary to reach α′′\alpha^{\prime\prime} from α\alpha. Certainly, αiαi′′\alpha_{i}\sim\alpha^{\prime\prime}_{i} for all i[r]i\in[r], and by Proposition 2.2, we can reach αi′′\alpha^{\prime\prime}_{i} in no more than (ni2)\binom{n_{i}}{2} inflips or outflips from αi\alpha_{i}. Observe that for any αi′′\alpha_{i}^{\prime\prime}, we may return to αi′′\alpha_{i}^{\prime\prime} by applying a different sequence of nin_{i} inflips; see Figure 4 for an illustration. Indeed, take a linear extension σ(αi′′)\sigma\in\mathcal{L}(\alpha_{i}^{\prime\prime}), labeled σ=σ(1)σ(2)σ(ni)\sigma=\sigma(1)\sigma(2)\cdots\sigma(n_{i}), and perform an inflip on αi′′\alpha^{\prime\prime}_{i} by converting the source σ(1)\sigma(1) into a sink, so that σ(2)σ(ni)σ(1)\sigma(2)\dots\sigma(n_{i})\sigma(1) is a linear extension of the poset associated to the resulting acyclic orientation in Acyc(Yi¯)\textsf{{Acyc}}(\overline{Y_{i}}). Performing nin_{i} inflips on αi′′\alpha^{\prime\prime}_{i} in this manner returns σ\sigma as a linear extension of the poset associated to the resulting acyclic orientation: since there exists a unique acyclic orientation αY¯(σ)Acyc(Yi¯)\alpha_{\overline{Y}}(\sigma)\in\textsf{{Acyc}}(\overline{Y_{i}}) for which σ(αY¯(σ))\sigma\in\mathcal{L}(\alpha_{\overline{Y}}(\sigma)), this acyclic orientation must be αi\alpha^{\prime}_{i}. Similarly, we can return to αi′′\alpha_{i}^{\prime\prime} by applying a sequence of nin_{i} outflips.

Refer to caption
Figure 4. An example of a sequence of nn inflips which takes an acyclic orientation α\alpha of an nn-vertex graph back to itself. We demonstrate on a 44-vertex graph. The permutations (12344123)\bigl{(}\begin{smallmatrix}1&2&3&4\\ 4&1&2&3\end{smallmatrix}\bigl{)}, (12341234)\bigl{(}\begin{smallmatrix}1&2&3&4\\ 1&2&3&4\end{smallmatrix}\bigl{)}, (12342341)\bigl{(}\begin{smallmatrix}1&2&3&4\\ 2&3&4&1\end{smallmatrix}\bigl{)}, (12343412)\bigl{(}\begin{smallmatrix}1&2&3&4\\ 3&4&1&2\end{smallmatrix}\bigl{)} are linear extensions of the posets associated with the first four acyclic orientations shown, respectively. The first and fifth acyclic orientations are the same.

Recalling that a double-flip applied to an acyclic orientation involves flipping a nonadjacent source and sink into (respectively) a sink and source, we thus proceed as follows. Starting from the acyclic orientation α\alpha, perform a sequence of double-flips that act as inflips on sources in αr\alpha_{r} and outflips on sinks in α1,,αr1\alpha_{1},\dots,\alpha_{r-1} until we have reached α1′′,,αr′′\alpha_{1}^{\prime\prime},\dots,\alpha_{r}^{\prime\prime} at least once. Specifically, begin by performing inflips on αr\alpha_{r} and outflips on α1\alpha_{1} until we either reach α1′′\alpha_{1}^{\prime\prime} (at which point we begin performing outflips on sinks in α2\alpha_{2}) or αr′′\alpha_{r}^{\prime\prime} (at which point we begin performing inflips on sources in αr′′\alpha_{r}^{\prime\prime} as described previously to return to αr′′\alpha_{r}^{\prime\prime} every nrn_{r} inflips). If we reach α1′′,,αr1′′\alpha_{1}^{\prime\prime},\dots,\alpha_{r-1}^{\prime\prime} prior to αr′′\alpha_{r}^{\prime\prime}, then perform outflips on sinks in α1′′\alpha_{1}^{\prime\prime} (returning to α1′′\alpha_{1}^{\prime\prime} every n1n_{1} outflips) until αr′′\alpha_{r}^{\prime\prime} is reached: from here, pair these outflips on sinks with inflips on sources in αr′′\alpha_{r}^{\prime\prime} until we retain α1′′\alpha_{1}^{\prime\prime}. Otherwise, we reach αr′′\alpha_{r}^{\prime\prime} prior to α1′′,,αr1′′\alpha_{1}^{\prime\prime},\dots,\alpha_{r-1}^{\prime\prime}, for which αr′′\alpha_{r}^{\prime\prime} will be “offset” once we have α1′′,,αr1′′\alpha_{1}^{\prime\prime},\dots,\alpha_{r-1}^{\prime\prime}, since we are performing inflips on sources which return to αr′′\alpha_{r}^{\prime\prime} every nrn_{r} inflips. In either case, call the resulting acyclic orientation α~\tilde{\alpha}, which satisfies αi~=αi′′\tilde{\alpha_{i}}=\alpha_{i}^{\prime\prime} for all i[r1]i\in[r-1] while α~r\tilde{\alpha}_{r} differs from αr′′\alpha_{r}^{\prime\prime} by some offset 0c<nr0\leq c<n_{r}. By tracing the preceding description and recalling Proposition 2.2, it follows that the number of double-flips we perform to reach α~\tilde{\alpha} from α\alpha is bounded above by

max{(nr2)+n1,i=1r1(ni2)}i=1rni2(i=1rni)2=n2.\displaystyle\max\left\{\binom{n_{r}}{2}+n_{1},\ \sum_{i=1}^{r-1}\binom{n_{i}}{2}\right\}\leq\sum_{i=1}^{r}n_{i}^{2}\leq\left(\sum_{i=1}^{r}n_{i}\right)^{2}=n^{2}.

By Bézout’s Lemma (recall that gcd(n1,,nr)=1\gcd(n_{1},\dots,n_{r})=1), there exist integers 0d1,,dr1<nr0\leq d_{1},\dots,d_{r-1}<n_{r} such that

d1n1++dr1nr1nrc(modnr).\displaystyle d_{1}n_{1}+\cdots+d_{r-1}n_{r-1}\equiv n_{r}-c\pmod{n_{r}}.

Thus, from α~\tilde{\alpha}, we can reach α′′\alpha^{\prime\prime} by performing dinid_{i}n_{i} outflips on α~i=αi′′\tilde{\alpha}_{i}=\alpha_{i}^{\prime\prime} for i[r1]i\in[r-1] (returning to α~i=αi′′\tilde{\alpha}_{i}=\alpha_{i}^{\prime\prime} every nin_{i} outflips), while performing inflips on α~r\tilde{\alpha}_{r} as discussed to reach αr′′\alpha_{r}^{\prime\prime}. The number of double-flips we perform to reach α′′\alpha^{\prime\prime} from α~\tilde{\alpha} is therefore bounded above by

i=1r1dinimax{d1,,dr1}(i=1r1ni)nrnn2,\displaystyle\sum_{i=1}^{r-1}d_{i}n_{i}\leq\max\left\{d_{1},\dots,d_{r-1}\right\}\left(\sum_{i=1}^{r-1}n_{i}\right)\leq n_{r}n\leq n^{2},

so at most 2n22n^{2} double-flips are necessary to reach α′′\alpha^{\prime\prime} from α\alpha.

We now turn to bounding d(σ,τ)d(\sigma,\tau) for configurations σ,τV(𝖥𝖲(Cyclen,Y))\sigma,\tau\in V(\mathsf{FS}(\textsf{{Cycle}}_{n},Y)) in the same connected component. By Theorem 2.5, we have that σ,τ([α])\sigma,\tau\in\mathcal{L}([\alpha]_{\approx}) for some [α]Acyc(Y¯)/[\alpha]_{\approx}\in\textsf{{Acyc}}(\overline{Y})/\!\!\approx. Denote α=αY¯(σ)\alpha=\alpha_{\overline{Y}}(\sigma) and α′′=αY¯(τ)\alpha^{\prime\prime}=\alpha_{\overline{Y}}(\tau). By the preceding discussion, we can reach α′′\alpha^{\prime\prime} from α\alpha in λ2n2\lambda\leq 2n^{2} double-flips, yielding a sequence of acyclic orientations Σ={αi}i=0λ\Sigma=\{\alpha_{i}\}_{i=0}^{\lambda} in the equivalence class [α][\alpha]_{\approx} with α0=α\alpha_{0}=\alpha and αλ=α′′\alpha_{\lambda}=\alpha^{\prime\prime}. From Σ\Sigma, we will now construct a sequence of (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps which we can apply on σ\sigma; see Figure 5 for an illustration. If the double-flip we performed to reach α1\alpha_{1} from α\alpha inflips the source vv and outflips the sink ww in α\alpha, it follows from σ(α)\sigma\in\mathcal{L}(\alpha) that for any i<σ1(v)i<\sigma^{-1}(v), {σ(i),v}E(Y)\{\sigma(i),v\}\in E(Y). Indeed, if we had that {σ(i),v}E(Y¯)\{\sigma(i),v\}\in E(\overline{Y}), vv being a source in α\alpha would imply that this edge is directed from vv to σ(i)\sigma(i) in α\alpha, contradicting σ(α)\sigma\in\mathcal{L}(\alpha). Similarly, for any j>σ1(w)j>\sigma^{-1}(w), {σ(j),w}E(Y)\{\sigma(j),w\}\in E(Y). Thus, we can swap vv to 11 and ww to nn in no more than 2n32n-3 (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps: it is easy to check that the resulting configuration remains in (α)\mathcal{L}(\alpha). Then we perform a (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swap which swaps vv and ww along the edge {1,n}\{1,n\} ({v,w}E(Y¯)\{v,w\}\notin E(\overline{Y}) by the definition of a double-flip, so {v,w}E(Y)\{v,w\}\in E(Y)). It is also straightforward to check that the configuration σ1\sigma_{1} resulting from this interchange is now in (α1)\mathcal{L}(\alpha_{1}).

Refer to caption
(a) Acyclic orientations α\alpha, α1Acyc(Y¯)\alpha_{1}\in\textsf{{Acyc}}(\overline{Y}).
Refer to caption
(b) The corresponding sequence of (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps we construct.
Figure 5. The sequence of (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps that we construct corresponding to α,α1Acyc(Y¯)\alpha,\alpha_{1}\in\textsf{{Acyc}}(\overline{Y}) that are double-flip equivalent. We demonstrate on 55-vertex graphs. The topmost bijection is in (α)\mathcal{L}(\alpha). We inflip v=2v=2 and outflip w=5w=5 to reach α′′\alpha^{\prime\prime} from α\alpha: swapping vv left to 11, then ww right to 55, then swapping vv and ww along {1,5}\{1,5\} yields a permutation in (α1)\mathcal{L}(\alpha_{1}).

Proceed similarly through all λ\lambda double-flips, and call the resulting configuration σ~\tilde{\sigma}: this configuration satisfies σ~(α′′)\tilde{\sigma}\in\mathcal{L}(\alpha^{\prime\prime}). Since σ~,τ(α′′)\tilde{\sigma},\tau\in\mathcal{L}(\alpha^{\prime\prime}), it follows from Theorem 2.4 that σ~,τ\tilde{\sigma},\tau lie in the same component of 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) (specifically, the copy of Pathn\textsf{{Path}}_{n} in Cyclen\textsf{{Cycle}}_{n} resulting from excluding the edge {1,n}\{1,n\}). By Theorem 3.3, we can now reach τ\tau from σ~\tilde{\sigma} by performing no more than |E(Y)||E(Y)| (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps. Altogether, we have that

d(σ,τ)2n22n+|E(Y)|=4n3+|E(Y)|,\displaystyle d(\sigma,\tau)\leq 2n^{2}\cdot 2n+|E(Y)|=4n^{3}+|E(Y)|,

so at most 4n3+|E(Y)|4n^{3}+|E(Y)| (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps are necessary to reach τ\tau from σ\sigma. ∎

Corollary 3.8.

For n3n\geq 3, if 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is connected, diam(𝖥𝖲(Cyclen,Y))4n3+|E(Y)|\textup{diam}(\mathsf{FS}(\textsf{{Cycle}}_{n},Y))\leq 4n^{3}+|E(Y)|.

Theorem 3.7 can now be invoked to establish the following general bound on the diameter of any connected component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y), where YY is arbitrary. This proves that, in the sense of Question 1.2, the diameter of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is polynomially bounded.

Theorem 3.9.

The diameter of any component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is at most 8n4(1+o(1))8n^{4}(1+o(1)).

Proof.

Consider two configurations σ,τV(𝖥𝖲(Cyclen,Y))\sigma,\tau\in V(\mathsf{FS}(\textsf{{Cycle}}_{n},Y)) in the same connected component. We construct an (n+1)(n+1)-vertex graph YY^{\prime} by adding a vertex vv to YY that is adjacent to all vertices in V(Y)V(Y), so YY^{\prime} has a spanning star subgraph with central vertex vv. Define bijections σ,τV(𝖥𝖲(Cyclen+1,Y))\sigma^{\prime},\tau^{\prime}\in V(\mathsf{FS}(\textsf{{Cycle}}_{n+1},Y^{\prime})) by

σ(i)={σ(i)i[n],vi=n+1,\displaystyle\sigma^{\prime}(i)=\begin{cases}\sigma(i)&i\in[n],\\ v&i=n+1,\end{cases} τ(i)={τ(i)i[n],vi=n+1.\displaystyle\tau^{\prime}(i)=\begin{cases}\tau(i)&i\in[n],\\ v&i=n+1.\end{cases}

The configurations σ,τ\sigma^{\prime},\tau^{\prime} are in the same component of 𝖥𝖲(Cyclen+1,Y)\mathsf{FS}(\textsf{{Cycle}}_{n+1},Y^{\prime}). Indeed, from a sequence of (Cyclen,Y)(\textsf{{Cycle}}_{n},Y)-friendly swaps Σ1\Sigma_{1} from σ\sigma to τ\tau of shortest length, we can construct a sequence Σ1\Sigma_{1}^{\prime} of (Cyclen+1,Y)(\textsf{{Cycle}}_{n+1},Y^{\prime})-friendly swaps from σ\sigma^{\prime} to τ\tau^{\prime} by replacing every swap in Σ1\Sigma_{1} which occurs along {1,n}E(Cyclen)\{1,n\}\in E(\textsf{{Cycle}}_{n}) by a sequence of three swaps along the following edges in E(Cyclen)E(\textsf{{Cycle}}_{n}):

{n,n+1},{1,n+1},{n,n+1}.\displaystyle\{n,n+1\},\{1,n+1\},\{n,n+1\}.

It is straightforward to confirm that Σ1\Sigma_{1}^{\prime} is a path from σ\sigma^{\prime} to τ\tau^{\prime}, constructed from Σ1\Sigma_{1} by “crossing” the vertex vv as needed. Since YY^{\prime} has a spanning star subgraph, Y¯\overline{Y^{\prime}} has an isolated vertex, so it follows immediately that the components of Y¯\overline{Y^{\prime}} have jointly coprime size. So by Theorem 3.7,

d(σ,τ)4(n+1)3+|E(Y)|4(n+1)3+(n+12)=4n3(1+f(n)),\displaystyle d(\sigma^{\prime},\tau^{\prime})\leq 4(n+1)^{3}+|E(Y^{\prime})|\leq 4(n+1)^{3}+\binom{n+1}{2}=4n^{3}(1+f(n)),

where f(n)=o(1)f(n)=o(1). Let Σ2\Sigma_{2}^{\prime} be a sequence of swaps from σ\sigma^{\prime} to τ\tau^{\prime} of length at most 4n3(1+f(n))4n^{3}(1+f(n)). We construct Σ2\Sigma_{2} from Σ2\Sigma_{2}^{\prime} by removing all (Cyclen+1,Y)(\textsf{{Cycle}}_{n+1},Y^{\prime})-friendly swaps involving vv: it is straightforward to notice that Σ2\Sigma_{2} yields a path of length at most 4n3(1+f(n))4n^{3}(1+f(n)) from σ\sigma to some cyclic rotation τ\tau_{*} of τ\tau, i.e., d(σ,τ)4n3(1+f(n))d(\sigma,\tau_{*})\leq 4n^{3}(1+f(n)). Towards a contradiction, assume d(τ,τ)8n4(1+2f(n))+nd(\tau,\tau_{*})\geq 8n^{4}(1+2f(n))+n, and let v1,,vn+1v_{1},\dots,v_{n+1} be vertices along a shortest path from τ\tau to τ\tau_{*} satisfying d(vi,vi+1)>8n3(1+2f(n))d(v_{i},v_{i+1})>8n^{3}(1+2f(n)) for all i[n]i\in[n]. Such vertices viv_{i} exist due to our assumption on d(τ,τ)d(\tau,\tau_{*}). By appealing to the same argument as above, we deduce that there exist cyclic rotations σ1,,σn+1\sigma_{1},\dots,\sigma_{n+1} of σ\sigma such that d(vi,σi)4n3(1+f(n))d(v_{i},\sigma_{i})\leq 4n^{3}(1+f(n)) for all i[n+1]i\in[n+1]. Since there exist nn distinct rotations of σ\sigma, the pigeonhole principle yields the existence of iji\neq j for which

d(vi,vj)d(vi,σ)+d(σ,vj)4n3(1+f(n))+4n3(1+f(n))8n3(1+2f(n))\displaystyle d(v_{i},v_{j})\leq d(v_{i},\sigma^{\prime})+d(\sigma^{\prime},v_{j})\leq 4n^{3}(1+f(n))+4n^{3}(1+f(n))\leq 8n^{3}(1+2f(n))

for some rotation σ\sigma^{\prime} of σ\sigma, which is a contradiction. Therefore, we conclude that

d(σ,τ)d(σ,τ)+d(τ,τ)4n3(1+f(n))+8n3(1+2f(n))(n+1)=8n4(1+o(1)).\displaystyle d(\sigma,\tau)\leq d(\sigma,\tau_{*})+d(\tau_{*},\tau)\leq 4n^{3}(1+f(n))+8n^{3}(1+2f(n))(n+1)=8n^{4}(1+o(1)).

The desired result now follows immediately. ∎

From Theorem 3.9, we can also extract the following analogue of Lemma 2.1 for double-flips.

Corollary 3.10.

If α,α′′Acyc(G)\alpha,\alpha^{\prime\prime}\in\textsf{{Acyc}}(G) satisfy αα′′\alpha\approx\alpha^{\prime\prime}, then we can reach α′′\alpha^{\prime\prime} from α\alpha in no more than 4n4(1+o(1))4n^{4}(1+o(1)) double-flips.

Proof.

Given an nn-vertex graph GG and α,α′′Acyc(G)\alpha,\alpha^{\prime\prime}\in\textsf{{Acyc}}(G) satisfying αα′′\alpha\approx\alpha^{\prime\prime}, extract linear extensions σ(α)\sigma\in\mathcal{L}(\alpha), τ(α′′)\tau\in\mathcal{L}(\alpha^{\prime\prime}), and consider σ\sigma and τ\tau as vertices of 𝖥𝖲(Cyclen,G)\mathsf{FS}(\textsf{{Cycle}}_{n},G). By Theorem 3.9, d(σ,τ)8n4(1+o(1))d(\sigma,\tau)\leq 8n^{4}(1+o(1)), so let Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} be a shortest sequence of swaps from σ\sigma to τ\tau, so λ8n4(1+o(1))\lambda\leq 8n^{4}(1+o(1)). Let Σ0={σij}j=0λ\Sigma_{0}=\{\sigma_{i_{j}}\}_{j=0}^{\lambda^{\prime}} be the subsequence of Σ\Sigma consisting of all indices iji_{j} for which σij+1\sigma_{i_{j}+1} is reached from σij\sigma_{i_{j}} by a (Cyclen,G)(\textsf{{Cycle}}_{n},G)-friendly swap across the edge {1,n}\{1,n\}. Since λ\lambda is smallest possible, two consecutive swaps of Σ\Sigma cannot both be across the edge {1,n}\{1,n\}, so λ4n4(1+o(1))\lambda^{\prime}\leq 4n^{4}(1+o(1)). We will now describe how to use Σ0\Sigma_{0} to construct a sequence Σ={αj}j=0λ+1\Sigma^{\prime}=\{\alpha_{j}\}_{j=0}^{\lambda^{\prime}+1} of acyclic orientations, with α0=α\alpha_{0}=\alpha and αλ+1=α′′\alpha_{\lambda^{\prime}+1}=\alpha^{\prime\prime}, for which αj\alpha_{j} is reachable from αj1\alpha_{j-1} by a double-flip for all j[λ+1]j\in[\lambda^{\prime}+1]. The desired result will then follow immediately.

Since we reached σi0\sigma_{i_{0}} from σ\sigma by swapping along the graph 𝖥𝖲(Pathn,G)\mathsf{FS}(\textsf{{Path}}_{n},G) (specifically, the copy of Pathn\textsf{{Path}}_{n} in Cyclen\textsf{{Cycle}}_{n} resulting from excluding the edge {1,n}\{1,n\}), it follows from Theorem 2.4 that σi0(α)\sigma_{i_{0}}\in\mathcal{L}(\alpha). Let α1\alpha_{1} be the result of taking α\alpha and performing a double-flip which involves an inflip on the source σi0(1)\sigma_{i_{0}}(1) and an outflip on the sink σi0(n)\sigma_{i_{0}}(n). Note that {σi0(1),σi0(n)}E(Y)\{\sigma_{i_{0}}(1),\sigma_{i_{0}}(n)\}\in E(Y) (we swapped these two vertices to reach σi0+1\sigma_{i_{0}+1} from σi0\sigma_{i_{0}}), so {σi0(1),σi0(n)}E(Y¯)\{\sigma_{i_{0}}(1),\sigma_{i_{0}}(n)\}\notin E(\overline{Y}), from which it follows that this is a valid double-flip.333This correspondence between double-flips and paths in 𝖥𝖲(Cyclen,G)\mathsf{FS}(\textsf{{Cycle}}_{n},G) is the same as that which was observed in the first paragraph of the proof of [DK21, Theorem 4.1]. It is easy to check that σi0+1(α1)\sigma_{i_{0}+1}\in\mathcal{L}(\alpha_{1}), and by appealing to Theorem 2.4 as before, σi1(α1)\sigma_{i_{1}}\in\mathcal{L}(\alpha_{1}). Continuing like this sequentially on j[λ+1]j\in[\lambda^{\prime}+1] (the preceding discussion being the j=1j=1 case) yields the desired sequence Σ\Sigma^{\prime}: for the case j=λ+1j=\lambda^{\prime}+1, it follows as before from Theorem 2.4 that σiλ+1\sigma_{i_{\lambda^{\prime}}+1} and τ\tau are linear extensions of the poset (i.e., associated to the same acyclic orientation of GG), so the final acyclic orientation in Σ\Sigma^{\prime} is αG(τ)=α′′\alpha_{G}(\tau)=\alpha^{\prime\prime}. ∎

4. Proof of Main Result

We devote this section to answering Question 1.2 in the negative, establishing Theorem 1.3.

4.1. The Graphs XLX_{L} and YLY_{L}

We begin with the following observation. One can understand this as the central vertex of Starn\textsf{{Star}}_{n} acting as a “knob” rotating around Cyclen\textsf{{Cycle}}_{n}, and all other vertices of V(Starn)V(\textsf{{Star}}_{n}) moving cyclically around it: n(n1)n(n-1) such swaps in the same direction are needed for all vertices of Starn\textsf{{Star}}_{n} to return to their original positions in the starting configuration. This interpretation will help motivate our construction.

Lemma 4.1.

Every connected component of 𝖥𝖲(Cyclen,Starn)\mathsf{FS}(\textsf{{Cycle}}_{n},\textsf{{Star}}_{n}) is isomorphic to Cyclen(n1)\textsf{{Cycle}}_{n(n-1)}.

Proof.

Consider a component 𝒞\mathcal{C} of 𝖥𝖲(Cyclen,Starn)\mathsf{FS}(\textsf{{Cycle}}_{n},\textsf{{Star}}_{n}) with permutation σ=σ(1)σ(n)\sigma=\sigma(1)\cdots\sigma(n) such that σ(1)\sigma(1) is the central vertex of Starn\textsf{{Star}}_{n}. With V(Cyclen(n1))=[n(n1)]V(\textsf{{Cycle}}_{n(n-1)})=[n(n-1)], construct φ:V(Cyclen(n1))V(𝒞)\varphi\mathrel{\mathop{\ordinarycolon}}V(\textsf{{Cycle}}_{n(n-1)})\to V(\mathcal{C}) by defining φ(i)\varphi(i) to be the permutation achieved by starting from σ\sigma and swapping σ(1)\sigma(1) rightward ii times (e.g., φ(1)=σ(2)σ(1)σ(n)\varphi(1)=\sigma(2)\sigma(1)\cdots\sigma(n)). It follows that φ\varphi is a graph isomorphism. ∎

We will now construct the graphs XLX_{L} and YLY_{L}, for every integer L1L\geq 1, that we study to prove Theorem 1.3. In the following description, assume we have fixed some arbitrary integer L1L\geq 1.

The Graph XLX_{L}.

The graph XLX_{L} contains an L×2L\times 2 array of cycle subgraphs, with adjacent cycles intersecting in exactly one vertex. Say XLX_{L} has LL layers, indexed by [L]\ell\in[L]; we will subscript subgraphs and vertices corresponding to the “left column” of XLX_{L} by aa, and those in the right by bb. As such, we denote the left and right cycle subgraphs in layer \ell by 𝒞a\mathcal{C}_{a}^{\ell} and 𝒞b\mathcal{C}_{b}^{\ell}, respectively. Corresponding to each 𝒞a\mathcal{C}_{a}^{\ell} and 𝒞b\mathcal{C}_{b}^{\ell} is a path subgraph of XLX_{L} extending out of it; that corresponding to 𝒞a\mathcal{C}_{a}^{\ell} is denoted 𝒫a\mathcal{P}_{a}^{\ell}, and similarly 𝒫b\mathcal{P}_{b}^{\ell} for 𝒞b\mathcal{C}_{b}^{\ell}. Denote the subgraph of XLX_{L} consisting of the \ellth layer by XX^{\ell}. The subgraph consisting of 𝒫a\mathcal{P}_{a}^{\ell} and 𝒞a\mathcal{C}_{a}^{\ell} is denoted XaX_{a}^{\ell}, and similarly XbX_{b}^{\ell} for PbP_{b}^{\ell} and 𝒞b\mathcal{C}_{b}^{\ell}. Denote, whenever they are defined for [L]\ell\in[L],

va=V(𝒫a)V(𝒞a),vb=V(𝒫b)V(𝒞b),v=V(𝒞a)V(𝒞b),\displaystyle v_{a}^{\ell}=V(\mathcal{P}_{a}^{\ell})\cap V(\mathcal{C}_{a}^{\ell}),\ v_{b}^{\ell}=V(\mathcal{P}_{b}^{\ell})\cap V(\mathcal{C}_{b}^{\ell}),\ v^{\ell}=V(\mathcal{C}_{a}^{\ell})\cap V(\mathcal{C}_{b}^{\ell}),
va,+1=V(𝒞a)V(𝒞a+1),vb,+1=V(𝒞b)V(𝒞b+1).\displaystyle v_{a}^{\ell,\ell+1}=V(\mathcal{C}_{a}^{\ell})\cap V(\mathcal{C}_{a}^{\ell+1}),\ v_{b}^{\ell,\ell+1}=V(\mathcal{C}_{b}^{\ell})\cap V(\mathcal{C}_{b}^{\ell+1}).

For each of the following sets, we place three inner vertices in the path in 𝒞a\mathcal{C}_{a}^{\ell} between the two vertices in the set:

{va,va,+1},{va,+1,v},{v,va1,},{va1,,va}.\displaystyle\{v_{a}^{\ell},v_{a}^{\ell,\ell+1}\},\{v_{a}^{\ell,\ell+1},v^{\ell}\},\{v^{\ell},v_{a}^{\ell-1,\ell}\},\{v_{a}^{\ell-1,\ell},v_{a}^{\ell}\}.

The analogous statement for 𝒞b\mathcal{C}_{b}^{\ell} holds. The exceptions are layers 11 and LL: we place seven inner vertices in the upper path from va1v_{a}^{1} to v1v^{1} in 𝒞a1\mathcal{C}_{a}^{1} and the upper path from vb1v_{b}^{1} to v1v^{1} in 𝒞b1\mathcal{C}_{b}^{1}, and seven inner vertices in the lower path from vaLv_{a}^{L} to vLv^{L} in 𝒞aL\mathcal{C}_{a}^{L} and from vbLv_{b}^{L} to vLv^{L} in 𝒞bL\mathcal{C}_{b}^{L}. It follows from our construction that for every [L]\ell\in[L],

|V(𝒞a)|=|V(𝒞b)|=16.\displaystyle|V(\mathcal{C}_{a}^{\ell})|=|V(\mathcal{C}_{b}^{\ell})|=16.

We will also set,444It will be important that, for every [L]\ell\in[L], 𝒫a\mathcal{P}_{a}^{\ell} has exactly one more vertex than 𝒫b\mathcal{P}_{b}^{\ell}. The choice of the lengths of these paths, as well as the number of inner vertices in the segments of the cycle subgraphs, is not terribly important as long as they are not too small. The values we chose here suffice. for every [L]\ell\in[L],

|V(𝒫a)|=16,|V(𝒫b)|=15,\displaystyle|V(\mathcal{P}_{a}^{\ell})|=16,|V(\mathcal{P}_{b}^{\ell})|=15,

so that the graph XLX_{L} has

n=60+58(L1)=58L+2\displaystyle n=60+58(L-1)=58L+2

vertices. (Indeed, it can be checked that layer 11 has 6060 vertices, and for each subsequent layer, we add 5858 new vertices to the graph YLY_{L}.) Figure 6 illustrates this construction for L=3L=3.

Refer to caption
Figure 6. Labeled schematic diagram of the construction for X3X_{3}. Subgraphs of X3X_{3} marked a specific color correspond to the σs\sigma_{s}-preimages of the vertices of the same color in Figure 7. We take care in appropriately coloring the vertices between two adjacent cycle subgraphs and between adjacent path and cycle subgraphs. Paths marked with one hatch mark have three inner vertices. The paths 𝒫bi\mathcal{P}_{b}^{i} with two hatch marks have 1515 vertices, while paths 𝒫ai\mathcal{P}_{a}^{i} with three hatch marks have 1616 vertices.

The Graph YLY_{L}.

We construct a complementary graph YLY_{L} for each XLX_{L}: we assign to each cycle subgraph 𝒞a\mathcal{C}_{a}^{\ell} and 𝒞b\mathcal{C}_{b}^{\ell} of XLX_{L} a corresponding “knob vertex” in V(YL)V(Y_{L}), denoted κa\kappa_{a}^{\ell} and κb\kappa_{b}^{\ell}, respectively; we set a collection of vertices of V(YL)V(Y_{L}) to swap only with each knob. The construction of YLY_{L} proceeds sequentially according to [L]\ell\in[L]. Take two disjoint copies of Star15\textsf{{Star}}_{15}, denoted 𝒮a1\mathcal{S}_{a}^{1} and 𝒮b1\mathcal{S}_{b}^{1}, with central vertices κa1\kappa_{a}^{1} and κb1\kappa_{b}^{1}, respectively, and a complete bipartite graph 𝒦1\mathcal{K}^{1} with 1515 vertices in each of its partite sets 𝒦a1\mathcal{K}_{a}^{1} and 𝒦b1\mathcal{K}_{b}^{1}. Set κa1\kappa_{a}^{1} and κb1\kappa_{b}^{1} adjacent to all the vertices in V(𝒦1)V(\mathcal{K}^{1}). If L=1L=1, this completes the construction of YLY_{L}. If L>1L>1, take one vertex each in 𝒦a1\mathcal{K}_{a}^{1} and 𝒦b1\mathcal{K}_{b}^{1}, which shall correspond to κa2\kappa_{a}^{2} and κb2\kappa_{b}^{2}, central vertices of star subgraphs (again both isomorphic to Star15\textsf{{Star}}_{15}) 𝒮a2\mathcal{S}_{a}^{2} and 𝒮b2\mathcal{S}_{b}^{2}, respectively, and also construct complete bipartite graph 𝒦2\mathcal{K}^{2} with 1515 vertices in each of its partite sets 𝒦a2\mathcal{K}_{a}^{2} and 𝒦b2\mathcal{K}_{b}^{2}. Set κa2\kappa_{a}^{2} and κb2\kappa_{b}^{2} adjacent to all the vertices in V(𝒦2)V(\mathcal{K}^{2}). Proceed similarly: for 2L2\leq\ell\leq L, take two vertices of 𝒦1\mathcal{K}^{\ell-1} in opposite partite sets and construct 𝒮a\mathcal{S}_{a}^{\ell}, 𝒮b\mathcal{S}_{b}^{\ell}, and 𝒦\mathcal{K}^{\ell}, related as before, until all n=58L+2n=58L+2 vertices are exhausted. We shall often refer to vertices κa\kappa_{a}^{\ell} and κb\kappa_{b}^{\ell} as knob vertices of YLY_{L}. Figure 7 illustrates this construction for L=3L=3, while Figure 8 provides a “collapsed” view of our construction.

Refer to caption
Figure 7. Labeled schematic diagram of the construction for Y3Y_{3}. The vertices of Y3Y_{3} marked with a particular color correspond to the σs\sigma_{s}-images of the vertices of the same color in Figure 6.

The Starting Configuration σs\sigma_{s} and its Connected Component 𝒞\mathscr{C}.

Take an arbitrary L1L\geq 1 and graphs XLX_{L}, YLY_{L}. We are now going to describe a specific starting configuration σs(XL,YL)V(𝖥𝖲(XL,YL))\sigma_{s}(X_{L},Y_{L})\in V(\mathsf{FS}(X_{L},Y_{L})) which lies in the connected component 𝒞(XL,YL)\mathscr{C}(X_{L},Y_{L}) of 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}); we will later show that there exists a different configuration in 𝒞(XL,YL)\mathscr{C}(X_{L},Y_{L}) whose distance from σs(XL,YL)\sigma_{s}(X_{L},Y_{L}) is eΩ(n)e^{\Omega(n)}. Henceforth, we abbreviate σs(XL,YL)\sigma_{s}(X_{L},Y_{L}) and 𝒞(XL,YL)\mathscr{C}(X_{L},Y_{L}) to σs\sigma_{s} and 𝒞\mathscr{C}. In forthcoming discussions, XLX_{L} and YLY_{L} will be understood to be arbitrary such graphs on the same number of vertices.

Take all 1515 vertices in V(𝒦a1)V(\mathcal{K}_{a}^{1}) and place them onto V(𝒫a1){va1}V(\mathcal{P}_{a}^{1})\setminus\{v_{a}^{1}\}, and the 1515 vertices in V(𝒦b1)V(\mathcal{K}_{b}^{1}) onto V(𝒫b1)V(\mathcal{P}_{b}^{1}); if L>1L>1, we place κa2\kappa_{a}^{2} onto the leftmost vertex of V(𝒫a1)V(\mathcal{P}_{a}^{1}) and κb2\kappa_{b}^{2} onto vb1v_{b}^{1}. Now take subgraph 𝒮a1\mathcal{S}_{a}^{1} of YLY_{L}: place κa1\kappa_{a}^{1} onto the middle vertex of the upper path between va1v_{a}^{1} and v1v^{1} (which has seven vertices), and place all 1414 leaves of 𝒮a1\mathcal{S}_{a}^{1} onto the remaining 1414 vertices of V(𝒞a1){v1}V(\mathcal{C}_{a}^{1})\setminus\{v^{1}\} in some way. Similarly, take 𝒮b1\mathcal{S}_{b}^{1}: place κb1\kappa_{b}^{1} onto the middle vertex of the upper path between v1v^{1} and vb1v_{b}^{1}, and place all 1414 leaves of 𝒮b1\mathcal{S}_{b}^{1} onto the remaining 1414 vertices of V(𝒞b1)V(\mathcal{C}_{b}^{1}). This has filled all mappings on the subgraph V(X1)V(X^{1}) of XLX_{L} by vertices in V(𝒦1)V(\mathcal{K}^{1}), V(𝒮a1)V(\mathcal{S}_{a}^{1}), and V(𝒮b1)V(\mathcal{S}_{b}^{1}), and thus yields σs\sigma_{s} if L=1L=1.

Proceed sequentially according to the layer [L]\ell\in[L]: say we placed all vertices of V(𝒦i)V(\mathcal{K}^{i}), V(𝒮ai)V(\mathcal{S}_{a}^{i}), and V(𝒮bi)V(\mathcal{S}_{b}^{i}) for i<i<\ell onto the corresponding V(Xi)V(X^{i}) of XLX_{L}. Place all 1515 vertices in V(𝒦a)V(\mathcal{K}_{a}^{\ell}) onto V(𝒫a){va}V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}, and the 1515 vertices in V(𝒦b)V(\mathcal{K}_{b}^{\ell}) onto V(𝒫b)V(\mathcal{P}_{b}^{\ell}); if L>L>\ell, place κa+1\kappa_{a}^{\ell+1} onto the leftmost vertex of V(𝒫a)V(\mathcal{P}_{a}^{\ell}) and κb+1\kappa_{b}^{\ell+1} onto vbv_{b}^{\ell}. Now take 𝒮a\mathcal{S}_{a}^{\ell}, and place its 1414 leaves onto the remaining 1414 vertices in V(𝒞a){v}V(\mathcal{C}_{a}^{\ell})\setminus\{v^{\ell}\}. Similarly take 𝒮b\mathcal{S}_{b}^{\ell}, and place its 1414 leaves onto the 1414 remaining vertices in V(𝒞b)V(\mathcal{C}_{b}^{\ell}). An illustration of this starting configuration is given in Figures 6 and 7: the vertices of a particular color in Figure 7 are placed upon the correspondingly colored subgraph in Figure 6 to achieve σsV(𝖥𝖲(XL,YL))\sigma_{s}\in V(\mathsf{FS}(X_{L},Y_{L})).

Refer to caption
Figure 8. A simplified schematic diagram of Y4Y_{4} to illustrate the neighborhoods of different kinds of vertices. Here, all subgraphs 𝒮a\mathcal{S}_{a}^{\ell}, 𝒮b\mathcal{S}_{b}^{\ell}, 𝒦a\mathcal{K}_{a}^{\ell}, and 𝒦b\mathcal{K}_{b}^{\ell} are to be understood as excluding any knob vertices.
Remark 4.2.

By the construction of σsV(𝖥𝖲(XL,YL))\sigma_{s}\in V(\mathsf{FS}(X_{L},Y_{L})), for any [L]\ell\in[L],

V(𝒮a){κa}σs(V(𝒞a)),\displaystyle V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}\subset\sigma_{s}(V(\mathcal{C}_{a}^{\ell})), V(𝒮b){κb}σs(V(𝒞b)).\displaystyle V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\}\subset\sigma_{s}(V(\mathcal{C}_{b}^{\ell})).

As such, all leaves of a star subgraph 𝒮a\mathcal{S}_{a}^{\ell} or 𝒮b\mathcal{S}_{b}^{\ell} of YLY_{L} are placed onto a corresponding cycle subgraph 𝒞a\mathcal{C}_{a}^{\ell} or 𝒞b\mathcal{C}_{b}^{\ell} of XLX_{L}, respectively. This yields that, for any [L]\ell\in[L],

|σs(V(𝒞a))(V(𝒮a){κa})|=|σs(V(𝒞b))(V(𝒮b){κb})|=2.\displaystyle|\sigma_{s}(V(\mathcal{C}_{a}^{\ell}))\setminus(V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\})|=|\sigma_{s}(V(\mathcal{C}_{b}^{\ell}))\setminus(V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\})|=2.

In other words, the number of vertices upon any cycle subgraph 𝒞a\mathcal{C}_{a}^{\ell} or 𝒞b\mathcal{C}_{b}^{\ell} of XLX_{L} which are not leaves of the corresponding star subgraph of YLY_{L}, under σs\sigma_{s}, is exactly two. ∎

We introduce the following definition for notational convenience in forthcoming arguments.

Definition 4.3.

Fix [L]\ell\in[L].

  • The boundary bd(𝒞a)\textup{bd}(\mathcal{C}_{a}^{\ell}) of 𝒞a\mathcal{C}_{a}^{\ell} is the subset of {va,va1,,va,+1,v}\{v_{a}^{\ell},v_{a}^{\ell-1,\ell},v_{a}^{\ell,\ell+1},v^{\ell}\} defined for \ell.

  • The boundary bd(𝒞b)\textup{bd}(\mathcal{C}_{b}^{\ell}) of 𝒞b\mathcal{C}_{b}^{\ell} is the subset of {vb,vb1,,vb,+1,v}\{v_{b}^{\ell},v_{b}^{\ell-1,\ell},v_{b}^{\ell,\ell+1},v^{\ell}\} defined for \ell.

In Subsections 4.2 and 4.3, unless otherwise stated, we fix an arbitrary integer L1L\geq 1 and refer to the graphs XLX_{L} and YLY_{L}, with σs\sigma_{s} denoting the corresponding starting configuration. We elect to refer to paths in 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}) as swap sequences, which are denoted by the vertices and edges in 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}) that constitute the path. More specifically, a swap sequence of length λ\lambda is a sequence of vertices Σ={σi}i=0λV(𝖥𝖲(XL,YL))\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda}\subseteq V(\mathsf{FS}(X_{L},Y_{L})) for which {σi1,σi}E(𝖥𝖲(XL,YL))\{\sigma_{i-1},\sigma_{i}\}\in E(\mathsf{FS}(X_{L},Y_{L})) for all i[λ]i\in[\lambda].

4.2. Configurations in 𝒞\mathscr{C}

In this subsection, we derive properties satisfied by all vertices in 𝒞\mathscr{C}. Intuitively, our aim in this subsection is to uncover many conditions satisfied by all of the vertices in 𝒞\mathscr{C}, which has the effect of producing strong rigidities on the corresponding swapping problem. These rigidities will allow us to argue in Subsection 4.3 that in order to move certain vertices in YLY_{L} down and across the graph XLX_{L}, we necessarily must perform very specific sequences of swaps.

Remark 4.2 observes that in the starting configuration σs\sigma_{s}, the leaves of any star graph 𝒮a\mathcal{S}_{a}^{\ell} or 𝒮b\mathcal{S}_{b}^{\ell} lie upon the vertices of 𝒞a\mathcal{C}_{a}^{\ell} and 𝒞b\mathcal{C}_{b}^{\ell}, respectively. In particular, for any cycle subgraph 𝒞a\mathcal{C}_{a}^{\ell} in XLX_{L}, exactly two vertices that are not leaves of 𝒮a\mathcal{S}_{a}^{\ell} lie upon them; an analogous statement holds for cycle subgraphs of the form 𝒞b\mathcal{C}_{b}^{\ell}. We begin our study of 𝒞\mathscr{C} by establishing that this property is maintained after any sequence of swaps in 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}) beginning at σs\sigma_{s}, i.e., that all vertices in 𝒞\mathscr{C} satisfy this property: we prove this in Proposition 4.4.

Proposition 4.4.

Any σV(𝒞)\sigma\in V(\mathscr{C}) satisfies, for all [L]\ell\in[L],

V(𝒮a){κa}σ(V(𝒞a)) and V(𝒮b){κb}σ(V(𝒞b)).\displaystyle V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}\subset\sigma(V(\mathcal{C}_{a}^{\ell}))\text{ and }V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\}\subset\sigma(V(\mathcal{C}_{b}^{\ell})).

As in Remark 4.2, this means that for any cycle subgraph 𝒞a\mathcal{C}_{a}^{\ell} or 𝒞b\mathcal{C}_{b}^{\ell} in XLX_{L} and σV(𝒞)\sigma\in V(\mathscr{C}),

|σ(V(𝒞a))(V(𝒮a){κa})|=|σ(V(𝒞b))(V(𝒮b){κb})|=2,\displaystyle|\sigma(V(\mathcal{C}_{a}^{\ell}))\setminus(V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\})|=|\sigma(V(\mathcal{C}_{b}^{\ell}))\setminus(V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\})|=2,

since |V(𝒮a){κa}|=|V(𝒮b){κb}|=14|V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}|=|V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\}|=14, and |V(𝒞a)|=|V(𝒞b)|=16|V(\mathcal{C}_{a}^{\ell})|=|V(\mathcal{C}_{b}^{\ell})|=16 for all [L]\ell\in[L].

Remark 4.5.

Although Proposition 4.4 describes a global property maintained by all configurations in 𝒞\mathscr{C}, we frequently appeal to it (for sake of brevity) as a local property satisfied by specific configurations in 𝒞\mathscr{C} during the proof of Proposition 4.4.555Making this clarification is important, as the proof proceeds by assuming (towards a contradiction) that Proposition 4.4 is satisfied by particular configurations in 𝒞\mathscr{C} and is violated by another. This practice of localizing a more global statement to a particular configuration will also be utilized for other results in later proofs in this section, and it should not raise any ambiguity whenever it is invoked. ∎

Proof of Proposition 4.4.

Assume (towards a contradiction) that the proposition is false, so there exists a swap sequence Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} with σ0=σs\sigma_{0}=\sigma_{s} in 𝒞\mathscr{C} of shortest length λ\lambda containing a vertex violating Proposition 4.4: σλ\sigma_{\lambda} violates Proposition 4.4, while all σi\sigma_{i} for i<λi<\lambda satisfy it, and λ1\lambda\geq 1. Thus, there exists a star subgraph 𝒮\mathcal{S} (of form 𝒮a\mathcal{S}_{a}^{\ell} or 𝒮b\mathcal{S}_{b}^{\ell}) of YLY_{L} and a leaf μV(𝒮)\mu\in V(\mathcal{S}) such that σλ11(μ)\sigma_{\lambda-1}^{-1}(\mu) is in the appropriate cycle subgraph, but σλ1(μ)\sigma_{\lambda}^{-1}(\mu) is not. Say 𝒮=𝒮a\mathcal{S}=\mathcal{S}_{a}^{\ell} for [L]\ell\in[L]: raising a contradiction when 𝒮=𝒮b\mathcal{S}=\mathcal{S}_{b}^{\ell} is entirely analogous. Here, μV(𝒮a){κa}\mu\in V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\} has NYL(μ)={κa}N_{Y_{L}}(\mu)=\{\kappa_{a}^{\ell}\} and σλ11(μ)V(𝒞a)\sigma_{\lambda-1}^{-1}(\mu)\in V(\mathcal{C}_{a}^{\ell}), σλ1(μ)V(𝒞a)\sigma_{\lambda}^{-1}(\mu)\notin V(\mathcal{C}_{a}^{\ell}), so σλ11(μ)bd(𝒞a)\sigma_{\lambda-1}^{-1}(\mu)\in\textup{bd}(\mathcal{C}_{a}^{\ell}) and σλ\sigma_{\lambda} is reached from σλ1\sigma_{\lambda-1} by swapping μ\mu and κa\kappa_{a}^{\ell}. Figure 9 depicts the configurations described in the following two cases.

Case 1: σλ11(μ)=va\sigma_{\lambda-1}^{-1}(\mu)=v_{a}^{\ell}.

Here, σλ11(κa)NXL(va)V(𝒫a)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\in N_{X_{L}}(v_{a}^{\ell})\cap V(\mathcal{P}_{a}^{\ell}). Let ξ<λ1\xi<\lambda-1 be the final such index with σξ1(κa)V(𝒫a){va}\sigma_{\xi}^{-1}(\kappa_{a}^{\ell})\notin V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}; ξ\xi is well-defined since

σs1(κa)V(𝒫a){va},\displaystyle\sigma_{s}^{-1}(\kappa_{a}^{\ell})\notin V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\},

which implies σsσλ1\sigma_{s}\neq\sigma_{\lambda-1}, so λ2\lambda\geq 2. By the definition of ξ\xi and σλ11(κa)NXL(va)V(𝒫a)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\in N_{X_{L}}(v_{a}^{\ell})\cap V(\mathcal{P}_{a}^{\ell}),

(4.1) σj1(κa)V(𝒫a){va} for ξ+1jλ1.\displaystyle\sigma_{j}^{-1}(\kappa_{a}^{\ell})\in V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}\text{ for }\xi+1\leq j\leq\lambda-1.

Necessarily, σξ1(κa)=va\sigma_{\xi}^{-1}(\kappa_{a}^{\ell})=v_{a}^{\ell} and σξ+11(κa)NXL(va)V(𝒫a)\sigma_{\xi+1}^{-1}(\kappa_{a}^{\ell})\in N_{X_{L}}(v_{a}^{\ell})\cap V(\mathcal{P}_{a}^{\ell}), so

σξ1(μ)=σξ+11(μ)V(𝒞a){va};\displaystyle\sigma_{\xi}^{-1}(\mu)=\sigma_{\xi+1}^{-1}(\mu)\in V(\mathcal{C}_{a}^{\ell})\setminus\{v_{a}^{\ell}\};

note that σξ\sigma_{\xi} satisfies Proposition 4.4. Since NYL(μ)={κa}N_{Y_{L}}(\mu)=\{\kappa_{a}^{\ell}\} and there are no edges between V(𝒞a){va}V(\mathcal{C}_{a}^{\ell})\setminus\{v_{a}^{\ell}\} and V(𝒫a){va}V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}, it follows from (4.1) that σj1(μ)\sigma_{j}^{-1}(\mu) is fixed for ξjλ1\xi\leq j\leq\lambda-1, so

σξ1(μ)=σλ11(μ)V(𝒞a){va},\displaystyle\sigma_{\xi}^{-1}(\mu)=\sigma_{\lambda-1}^{-1}(\mu)\in V(\mathcal{C}_{a}^{\ell})\setminus\{v_{a}^{\ell}\},

contradicting σλ11(μ)=va\sigma_{\lambda-1}^{-1}(\mu)=v_{a}^{\ell}.

Case 2: σλ11(μ)va\sigma_{\lambda-1}^{-1}(\mu)\neq v_{a}^{\ell}.

Here, σλ11(μ)bd(𝒞a){va}\sigma_{\lambda-1}^{-1}(\mu)\in\textup{bd}(\mathcal{C}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}, and

σλ11(κa)=σλ1(μ)NXL(σλ11(μ))V(𝒞a).\displaystyle\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})=\sigma_{\lambda}^{-1}(\mu)\in N_{X_{L}}(\sigma_{\lambda-1}^{-1}(\mu))\setminus V(\mathcal{C}_{a}^{\ell}).

Proceeding backwards in Σ\Sigma, it must be that either

σλ21(μ)σλ11(μ) or σλ21(κa)σλ11(κa);\displaystyle\sigma_{\lambda-2}^{-1}(\mu)\neq\sigma_{\lambda-1}^{-1}(\mu)\text{ or }\sigma_{\lambda-2}^{-1}(\kappa_{a}^{\ell})\neq\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell});

note that λ2\lambda\geq 2, since σs1(κa)σλ11(κa)\sigma_{s}^{-1}(\kappa_{a}^{\ell})\neq\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}). Indeed, if not, swapping μ\mu and κa\kappa_{a}^{\ell} directly from σλ2\sigma_{\lambda-2} raises a contradiction on λ\lambda being minimal. Now, NYL(μ)={κa}N_{Y_{L}}(\mu)=\{\kappa_{a}^{\ell}\} implies

σλ21(κa)σλ11(κa) and σλ21(μ)=σλ11(μ),\displaystyle\sigma_{\lambda-2}^{-1}(\kappa_{a}^{\ell})\neq\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\text{ and }\sigma_{\lambda-2}^{-1}(\mu)=\sigma_{\lambda-1}^{-1}(\mu),

since if both preimages differ, σλ2=σλ\sigma_{\lambda-2}=\sigma_{\lambda}. Thus, σλ21(κa)V(𝒞a)\sigma_{\lambda-2}^{-1}(\kappa_{a}^{\ell})\notin V(\mathcal{C}_{a}^{\ell}) and

σλ2(σλ11(κa))V(𝒮a){κa}\displaystyle\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\notin V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}

by Proposition 4.4 (on σλ2\sigma_{\lambda-2}), so σλ2(σλ11(κa))\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})) is not a leaf (see NYL(κa)N_{Y_{L}}(\kappa_{a}^{\ell})). We further assume that

σλ21(μ)=σλ11(μ)=v;\displaystyle\sigma_{\lambda-2}^{-1}(\mu)=\sigma_{\lambda-1}^{-1}(\mu)=v^{\ell};

raising a contradiction for the cases va1,v_{a}^{\ell-1,\ell} and va,+1v_{a}^{\ell,\ell+1} can be done analogously. So

σλ21({κa,μ})V(𝒞b) and σλ2(σλ11(κa))V(𝒞b).\displaystyle\sigma_{\lambda-2}^{-1}(\{\kappa_{a}^{\ell},\mu\})\subset V(\mathcal{C}_{b}^{\ell})\text{ and }\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{C}_{b}^{\ell}).

Altogether, we have that

|σλ2(V(𝒞b))(V(𝒮b){κb})|3,\displaystyle|\sigma_{\lambda-2}(V(\mathcal{C}_{b}^{\ell}))\setminus(V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\})|\geq 3,

and since |V(𝒞b)|=16|V(\mathcal{C}_{b}^{\ell})|=16 and |V(𝒮b){κb}|=14|V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\}|=14, V(𝒮b){κb}σλ2(V(𝒞b))V(\mathcal{S}_{b}^{\ell})\setminus\{\kappa_{b}^{\ell}\}\not\subset\sigma_{\lambda-2}(V(\mathcal{C}_{b}^{\ell})). Thus, σλ2\sigma_{\lambda-2} violates Proposition 4.4, contradicting λ\lambda being minimal.

Refer to caption
(a) Case 11. After σξ\sigma_{\xi}, κa\kappa_{a}^{\ell} does not exit V(𝒫a){va}V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}, leading to a contradiction on the placement of μ\mu in σλ1\sigma_{\lambda-1}.
Refer to caption
(b) Case 22. Here, σλ1\sigma_{\lambda-1} results by swapping κa\kappa_{a}^{\ell} along 𝒞b\mathcal{C}_{b}^{\ell}, so that σλ2\sigma_{\lambda-2} violates Proposition 4.4 on 𝒞b\mathcal{C}_{b}^{\ell}.
Figure 9. Configurations in Σ\Sigma raising a contradiction for both cases in the proof of Proposition 4.4.

Proposition 4.4 restricts the preimages of the leaves of 𝒮a\mathcal{S}_{a}^{\ell} and 𝒮b\mathcal{S}_{b}^{\ell} under any σV(𝒞)\sigma\in V(\mathscr{C}). We now derive a restriction on the preimages of all other vertices in V(YL)V(Y_{L}) under any σV(𝒞)\sigma\in V(\mathscr{C}). As Proposition 4.6 formalizes, for such σ\sigma, any vertex in V(YL)V(Y_{L}) is close to its preimage in σs\sigma_{s}.

Proposition 4.6.

Any configuration σV(𝒞)\sigma\in V(\mathscr{C}) must satisfy the following four properties.

  1. (1)

    The layer 11 knob vertices lie upon the corresponding subgraph of X1X^{1}, i.e.,

    σ1(κa1)V(Xa1) and σ1(κb1)V(Xb1).\displaystyle\sigma^{-1}(\kappa_{a}^{1})\in V(X_{a}^{1})\text{ and }\sigma^{-1}(\kappa_{b}^{1})\in V(X_{b}^{1}).
  2. (2)

    For 2L2\leq\ell\leq L, the layer \ell knob vertices lie upon the subgraphs X1X^{\ell-1} or XX^{\ell}, i.e.,

    {σ1(κa),σ1(κb)}V(X1)V(X).\displaystyle\{\sigma^{-1}(\kappa_{a}^{\ell}),\sigma^{-1}(\kappa_{b}^{\ell})\}\subset V(X^{\ell-1})\cup V(X^{\ell}).
  3. (3)

    For [L1]\ell\in[L-1], any vertex in V(𝒦)V(\mathcal{K}^{\ell}) that is not a layer +1\ell+1 knob lies upon XX^{\ell}, i.e.,

    σ1(V(𝒦){κa+1,κb+1})V(X),\displaystyle\sigma^{-1}(V(\mathcal{K}^{\ell})\setminus\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\})\subset V(X^{\ell}),

    and every vertex in V(𝒦L)V(\mathcal{K}^{L}) lies upon XLX^{L}, i.e.,

    σ1(V(𝒦L))V(XL).\displaystyle\sigma^{-1}(V(\mathcal{K}^{L}))\subset V(X^{L}).
  4. (4)

    For [L]\ell\in[L], there is at most one μV(𝒦)\mu\in V(\mathcal{K}^{\ell}) not in V(𝒫a)V(𝒫b)V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), i.e.,

    |σ1(V(𝒦))(V(𝒫a)V(𝒫b))|1.\displaystyle|\sigma^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\leq 1.

Confirming that the starting configuration σs\sigma_{s} satisfies these four properties is straightforward. Case 4 of the proof of Proposition 4.6 relies on the following Lemma 4.7, which is illustrated in Figure 10. In the statement of the lemma, we elect to index the final term of the swap sequence by λ1\lambda-1 as this is where the result applies in the proof of Proposition 4.6.

Lemma 4.7.

Let Σ={σi}i=0λ1\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda-1} with σ0=σs\sigma_{0}=\sigma_{s}, λ1\lambda\geq 1 be a swap sequence in 𝒞\mathscr{C} such that for all 1iλ11\leq i\leq\lambda-1, σi\sigma_{i} satisfies the four properties of Proposition 4.6. Then for all 0iλ10\leq i\leq\lambda-1 and [L]\ell\in[L], the following two statements hold.

  1. (1)

    If σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), then

    σi({va,vb})V(𝒦)|σi1({κa,κb})((V(𝒫a){va})(V(𝒫b){vb}))|=1.\displaystyle\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell})\implies|\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap((V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\cup(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}))|=1.
  2. (2)

    If σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), then

    σi(va)V(𝒦)σi1({κa,κb})(V(𝒫a){va}),\displaystyle\sigma_{i}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell})\implies\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\neq\emptyset,
    σi(vb)V(𝒦)σi1({κa,κb})(V(𝒫b){vb}).\displaystyle\sigma_{i}(v_{b}^{\ell})\in V(\mathcal{K}^{\ell})\implies\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\})\neq\emptyset.
Refer to caption
(a) Lemma 4.7(1) on the configuration σi\sigma_{i}: if σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) and σi({va,vb})V(𝒦)\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell}), then either κa\kappa_{a}^{\ell} or κb\kappa_{b}^{\ell} lies upon V(𝒫a){va}V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\} or V(𝒫b){vb}V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}.
Refer to caption
(b) First implication of Lemma 4.7(2) on the configuration σi\sigma_{i}: if there exists some μV(𝒦)\mu\in V(\mathcal{K}^{\ell}) with σi1(μ)V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), and σi(va)V(𝒦)\sigma_{i}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell}), then either κa\kappa_{a}^{\ell} or κb\kappa_{b}^{\ell} lies upon V(𝒫a){va}V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}.
Figure 10. Illustrations for both parts of Lemma 4.7 for some σiΣ\sigma_{i}\in\Sigma. Subgraphs/vertices colored in red correspond to σi\sigma_{i}-preimages of V(𝒦)V(\mathcal{K}^{\ell}), while σi\sigma_{i}-preimages of elements in {κa,κb}\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\} are colored in blue. For Figure 10b, note that by appealing to Proposition 4.6(4) and comparing cardinalities, we can deduce that at most two vertices of σi(V(𝒫a)V(𝒫b))\sigma_{i}(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})) can fail to lie in V(𝒦)V(\mathcal{K}^{\ell}).
Proof of Lemma 4.7.

Fix Σ={σi}i=0λ1\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda-1} to be a swap sequence satisfying the assumptions of Lemma 4.7. We prove the two statements of Lemma 4.7 hold for all [L]\ell\in[L] inductively for 0iλ10\leq i\leq\lambda-1. They can be checked to hold for all [L]\ell\in[L] when i=0i=0, so assume they are true for some 0i<λ10\leq i<\lambda-1. We prove that σi+1\sigma_{i+1} satisfies both statements for all [L]\ell\in[L]. In what follows, assume we refer (unless stated otherwise) to some fixed, arbitrary [L]\ell\in[L]. We break into cases based on whether or not σi1(μ)V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(\mu)\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Case 1: σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

We will further break into subcases based on whether or not σi({va,vb})V(𝒦)\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell}).

Subcase 1.1: σi({va,vb})V(𝒦)\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell}).

By the induction hypothesis, we have that

(4.2) |σi1({κa,κb})((V(𝒫a){va})(V(𝒫b){vb}))|=1.\displaystyle|\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap((V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\cup(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}))|=1.

If σi(va)=σi+1(va)\sigma_{i}(v_{a}^{\ell})=\sigma_{i+1}(v_{a}^{\ell}) and σi(vb)=σi+1(vb)\sigma_{i}(v_{b}^{\ell})=\sigma_{i+1}(v_{b}^{\ell}), then σi+1\sigma_{i+1} satisfies Lemma 4.7(1) since

σi+11(V(𝒦))V(𝒫a)V(𝒫b),\displaystyle\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), |σi+11({κa,κb})((V(𝒫a){va})(V(𝒫b){vb}))|=1,\displaystyle|\sigma_{i+1}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap((V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\cup(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}))|=1,

and satisfies Lemma 4.7(2) trivially.666Generally, in what follows, we do not comment on the “other statement” in Lemma 4.7 holding trivially, and only check the statement which applies, depending on whether σi+11(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) or not in the given context. So consider the setting where either σi(va)σi+1(va)\sigma_{i}(v_{a}^{\ell})\neq\sigma_{i+1}(v_{a}^{\ell}) or σi(vb)σi+1(vb)\sigma_{i}(v_{b}^{\ell})\neq\sigma_{i+1}(v_{b}^{\ell}): say σi(va)σi+1(va)\sigma_{i}(v_{a}^{\ell})\neq\sigma_{i+1}(v_{a}^{\ell}) (the setting σi(vb)σi+1(vb)\sigma_{i}(v_{b}^{\ell})\neq\sigma_{i+1}(v_{b}^{\ell}) is analogous). If

σi+11(σi(va))NXL(va)V(𝒫a),\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\in N_{X_{L}}(v_{a}^{\ell})\cap V(\mathcal{P}_{a}^{\ell}),

then σi+1\sigma_{i+1} satisfies Lemma 4.7(1). Indeed, since κa,κbV(𝒦)\kappa_{a}^{\ell},\kappa_{b}^{\ell}\notin V(\mathcal{K}^{\ell}), the hypothesis σi+1({va,vb})V(𝒦)\sigma_{i+1}(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell}) implies that σi1(κa)=σi+11(κa)\sigma_{i}^{-1}(\kappa_{a}^{\ell})=\sigma_{i+1}^{-1}(\kappa_{a}^{\ell}) and σi1(κb)=σi+11(κb)\sigma_{i}^{-1}(\kappa_{b}^{\ell})=\sigma_{i+1}^{-1}(\kappa_{b}^{\ell}). If

σi+11(σi(va))V(𝒫a),\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\notin V(\mathcal{P}_{a}^{\ell}),

then since σi(va)V(𝒦)\sigma_{i}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell}), we have that

σi+11(V(𝒦))V(𝒫a)V(𝒫b).\displaystyle\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

From studying the neighborhoods of vertices in V(𝒦)V(\mathcal{K}^{\ell}) to produce possibilities for σi+1(va)\sigma_{i+1}(v_{a}^{\ell}), Propositions 4.4 and 4.6(2,3)777Indeed, σi+1\sigma_{i+1} would violate Proposition 4.6(2) if σi+1(va){κa+2,κb+2}\sigma_{i+1}(v_{a}^{\ell})\in\{\kappa_{a}^{\ell+2},\kappa_{b}^{\ell+2}\} and Proposition 4.6(3) if σi+1(va)V(𝒦+1){κa+2,κb+2}\sigma_{i+1}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell+1})\setminus\{\kappa_{a}^{\ell+2},\kappa_{b}^{\ell+2}\}. Henceforth, we do not explicitly make such further distinctions when appealing to multiple properties from Proposition 4.6 together. imply

σi+1(va)V(𝒦){κa,κb}\displaystyle\sigma_{i+1}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell})\cup\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}

(consider the possible vertices in NYL(σi(va))N_{Y_{L}}(\sigma_{i}(v_{a}^{\ell}))), from which σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) implies

σi+1(va){κa,κb}.\displaystyle\sigma_{i+1}(v_{a}^{\ell})\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}.

This yields =1\ell=1. Indeed, if 2\ell\geq 2, then σi\sigma_{i} violates Proposition 4.6(4) on layer 1\ell-1, since with (4.2),

σi1({κa,κb})σi1(V(𝒦1))(V(𝒫a1)V(𝒫b1)),\displaystyle\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\subset\sigma_{i}^{-1}(V(\mathcal{K}^{\ell-1}))\setminus(V(\mathcal{P}_{a}^{\ell-1})\cup V(\mathcal{P}_{b}^{\ell-1})),

so that σi+1(va1)=κa1\sigma_{i+1}(v_{a}^{1})=\kappa_{a}^{1} by Proposition 4.6(1) on σi+1\sigma_{i+1}. This result, with Proposition 4.6(1) (on σi\sigma_{i}) and (4.2), yields

σi1(κb1)=σi+11(κb1)V(𝒫b1){vb1},\displaystyle\sigma_{i}^{-1}(\kappa_{b}^{1})=\sigma_{i+1}^{-1}(\kappa_{b}^{1})\in V(\mathcal{P}_{b}^{1})\setminus\{v_{b}^{1}\},

so σi+1\sigma_{i+1} satisfies Lemma 4.7(2).

Subcase 1.2: σi({va,vb})V(𝒦)\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\not\subset V(\mathcal{K}^{\ell}).

Since |V(𝒫a)V(𝒫b)|=31|V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})|=31 and |V(𝒦)|=30|V(\mathcal{K}^{\ell})|=30, and recalling our initial assumption σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), we have that

(4.3) |σi({va,vb})V(𝒦)|=1,\displaystyle|\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\cap V(\mathcal{K}^{\ell})|=1, σi((V(𝒫a){va})(V(𝒫b){vb}))V(𝒦).\displaystyle\sigma_{i}((V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\cup(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}))\subset V(\mathcal{K}^{\ell}).

Say σi(va)V(𝒦)\sigma_{i}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell}); the setting σi(vb)V(𝒦)\sigma_{i}(v_{b}^{\ell})\in V(\mathcal{K}^{\ell}) is argued analogously. By (4.3), σi(vb)V(𝒦)\sigma_{i}(v_{b}^{\ell})\notin V(\mathcal{K}^{\ell}). If σi+11(σi(va))V(𝒫a)\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\notin V(\mathcal{P}_{a}^{\ell}), then σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) and σi(vb)=σi+1(vb)V(𝒦)\sigma_{i}(v_{b}^{\ell})=\sigma_{i+1}(v_{b}^{\ell})\notin V(\mathcal{K}^{\ell}), implying

σi+11(V(𝒦))V(𝒫a)V(𝒫b) and σi+1(va)V(𝒦),\displaystyle\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\text{ and }\sigma_{i+1}(v_{a}^{\ell})\notin V(\mathcal{K}^{\ell}),

so σi+1\sigma_{i+1} satisfies Lemma 4.7(2). If σi+11(σi(va))V(𝒫a)\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\in V(\mathcal{P}_{a}^{\ell}), then σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) and σi(vb)V(𝒦)\sigma_{i}(v_{b}^{\ell})\notin V(\mathcal{K}^{\ell}) yield

(4.4) σi+11(V(𝒦))V(𝒫a)V(𝒫b).\displaystyle\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Studying the neighborhoods of vertices in V(𝒦)V(\mathcal{K}^{\ell}) and recalling that σi(vb)V(𝒦)\sigma_{i}(v_{b}^{\ell})\notin V(\mathcal{K}^{\ell}) yields that the only way we can have that σi+1(vb)V(𝒦)\sigma_{i+1}(v_{b}^{\ell})\in V(\mathcal{K}^{\ell}) (exactly when σi+1\sigma_{i+1} does not trivially satisfy Lemma 4.7(1)) without σi+1\sigma_{i+1} violating Proposition 4.4 or 4.6(2,3) is if

σi(vb){κa,κb} and σi+11(σi(vb))NXL(vb)V(𝒫b).\displaystyle\sigma_{i}(v_{b}^{\ell})\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}\text{ and }\sigma_{i+1}^{-1}(\sigma_{i}(v_{b}^{\ell}))\in N_{X_{L}}(v_{b}^{\ell})\cap V(\mathcal{P}_{b}^{\ell}).

These results, along with (4.4), |V(𝒫a)V(𝒫b)|=31|V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})|=31, |V(𝒦)|=30|V(\mathcal{K}^{\ell})|=30, and κa,κbV(𝒦)\kappa_{a}^{\ell},\kappa_{b}^{\ell}\notin V(\mathcal{K}^{\ell}), imply

|σi+11({κa,κb})((V(𝒫a){va})(V(𝒫b){vb}))|=1,\displaystyle|\sigma_{i+1}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap((V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\cup(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}))|=1,

so σi+1\sigma_{i+1} satisfies Lemma 4.7(1).

Case 2: σi1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

By Proposition 4.6(4) (on σi\sigma_{i}), there exists a unique μV(𝒦)\mu\in V(\mathcal{K}^{\ell}) such that σi1(μ)V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), so |V(𝒫a)V(𝒫b)|=31|V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})|=31 and |V(𝒦)|=30|V(\mathcal{K}^{\ell})|=30 yield

(4.5) |(V(𝒫a)V(𝒫b))σi1(V(𝒦))|=2.\displaystyle|(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))\setminus\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))|=2.

We break into subcases based on the subset of {σi(va),σi(vb)}\{\sigma_{i}(v_{a}^{\ell}),\sigma_{i}(v_{b}^{\ell})\} that is in V(𝒦)V(\mathcal{K}^{\ell}).

Subcase 2.1: σi({va,vb})V(𝒦)\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell}).

By the induction hypothesis,888The resulting observations are enough to deduce that =1\ell=1, but this is not necessary for the proceeding argument.

σi1({κa,κb})(V(𝒫a){va}),\displaystyle\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\neq\emptyset, σi1({κa,κb})(V(𝒫b){vb}).\displaystyle\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\})\neq\emptyset.

If σi(va)=σi+1(va)\sigma_{i}(v_{a}^{\ell})=\sigma_{i+1}(v_{a}^{\ell}) and σi(vb)=σi+1(vb)\sigma_{i}(v_{b}^{\ell})=\sigma_{i+1}(v_{b}^{\ell}), then σi+1\sigma_{i+1} satisfies Lemma 4.7(2). If σi(va)σi+1(va)\sigma_{i}(v_{a}^{\ell})\neq\sigma_{i+1}(v_{a}^{\ell}) (the setting σi(vb)σi+1(vb)\sigma_{i}(v_{b}^{\ell})\neq\sigma_{i+1}(v_{b}^{\ell}) is argued analogously), we must have that (exactly) one of

σi+11(σi(va))NXL(va)V(𝒫a),\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\in N_{X_{L}}(v_{a}^{\ell})\cap V(\mathcal{P}_{a}^{\ell}), σi+11(σi(va))NXL(va)V(𝒫a) and σi+1(va)=μ\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\in N_{X_{L}}(v_{a}^{\ell})\setminus V(\mathcal{P}_{a}^{\ell})\text{ and }\sigma_{i+1}(v_{a}^{\ell})=\mu

must hold, since σi+1\sigma_{i+1} would otherwise violate Proposition 4.6(4), due to

{μ,σi(va)}σi+11(V(𝒦))(V(𝒫a)V(𝒫b)).\displaystyle\{\mu,\sigma_{i}(v_{a}^{\ell})\}\subset\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})).

As before, σi+1\sigma_{i+1} satisfies Lemma 4.7(2) in both situations.

Subcase 2.2: {σi(va),σi(vb)}V(𝒦)={σi(va)}\{\sigma_{i}(v_{a}^{\ell}),\sigma_{i}(v_{b}^{\ell})\}\cap V(\mathcal{K}^{\ell})=\{\sigma_{i}(v_{a}^{\ell})\}.

The setting {σi(va),σi(vb)}V(𝒦)={σi(vb)}\{\sigma_{i}(v_{a}^{\ell}),\sigma_{i}(v_{b}^{\ell})\}\cap V(\mathcal{K}^{\ell})=\{\sigma_{i}(v_{b}^{\ell})\} is argued analogously. The induction hypothesis yields

σi1({κa,κb})(V(𝒫a){va}).\displaystyle\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\neq\emptyset.

From (4.5), we deduce that

(4.6) |σi1({κa,κb})(V(𝒫a){va})|=1\displaystyle|\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})|=1

and also that

(V(𝒫a)V(𝒫b))σi1(V(𝒦))={vb}(σi1({κa,κb})(V(𝒫a){va})).\displaystyle(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))\setminus\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))=\{v_{b}^{\ell}\}\cup(\sigma_{i}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})).

We can argue as in Subcase 2.1 to deduce that σi+1\sigma_{i+1} satisfies Lemma 4.7(2) if σi(va)=σi+1(va)\sigma_{i}(v_{a}^{\ell})=\sigma_{i+1}(v_{a}^{\ell}) and σi(vb)=σi+1(vb)\sigma_{i}(v_{b}^{\ell})=\sigma_{i+1}(v_{b}^{\ell}), or if σi(va)σi+1(va)\sigma_{i}(v_{a}^{\ell})\neq\sigma_{i+1}(v_{a}^{\ell}). If σi(vb)σi+1(vb)\sigma_{i}(v_{b}^{\ell})\neq\sigma_{i+1}(v_{b}^{\ell}),

σi1(μ)=σi+11(μ)V(𝒫a)V(𝒫b),\displaystyle\sigma_{i}^{-1}(\mu)=\sigma_{i+1}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}),

so σi+1\sigma_{i+1} satisfies Lemma 4.7(2) if σi+1(vb)V(𝒦)\sigma_{i+1}(v_{b}^{\ell})\notin V(\mathcal{K}^{\ell}). Thus, assume σi+1(vb)V(𝒦)\sigma_{i+1}(v_{b}^{\ell})\in V(\mathcal{K}^{\ell}). Studying the neighborhoods of vertices in V(𝒦)V(\mathcal{K}^{\ell}) yields that σi(vb){κa,κb}\sigma_{i}(v_{b}^{\ell})\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}, as σi\sigma_{i} would violate Proposition 4.4 or 4.6(2,3) otherwise. If

σi+11(σi(vb))NXL(vb)V(𝒫b),\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{b}^{\ell}))\in N_{X_{L}}(v_{b}^{\ell})\cap V(\mathcal{P}_{b}^{\ell}),

σi+1\sigma_{i+1} satisfies Lemma 4.7(2). If

σi+11(σi(vb))NXL(vb)V(𝒫b),\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{b}^{\ell}))\in N_{X_{L}}(v_{b}^{\ell})\setminus V(\mathcal{P}_{b}^{\ell}),

it must be that σi+1(vb)=μ\sigma_{i+1}(v_{b}^{\ell})=\mu (recall that μV(𝒦)\mu\in V(\mathcal{K}^{\ell}) is the unique such vertex for which σi1(μ)V(𝒫a)V(𝒫b)\sigma_{i}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})). By (4.6), alongside σi(vb){κa,κb}\sigma_{i}(v_{b}^{\ell})\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\} and σi+11(σi(vb))NXL(vb)V(𝒫b)\sigma_{i+1}^{-1}(\sigma_{i}(v_{b}^{\ell}))\in N_{X_{L}}(v_{b}^{\ell})\cap V(\mathcal{P}_{b}^{\ell}), σi+1\sigma_{i+1} satisfies Lemma 4.7(1).

Subcase 2.3: σi({va,vb})V(𝒦)=\sigma_{i}(\{v_{a}^{\ell},v_{b}^{\ell}\})\cap V(\mathcal{K}^{\ell})=\emptyset.

From (4.5), we have that

σi1(V(𝒦){μ})=(V(𝒫a)V(𝒫b)){va,vb}\displaystyle\sigma_{i}^{-1}(V(\mathcal{K}^{\ell})\setminus\{\mu\})=(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))\setminus\{v_{a}^{\ell},v_{b}^{\ell}\}

since the LHS is a subset of the RHS and their cardinalities are equal. If σi+11(μ){va,vb}\sigma_{i+1}^{-1}(\mu)\in\{v_{a}^{\ell},v_{b}^{\ell}\}, then

σi+11(V(𝒦))V(𝒫a)V(𝒫b) and σi+1({va,vb})V(𝒦),\displaystyle\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\text{ and }\sigma_{i+1}(\{v_{a}^{\ell},v_{b}^{\ell}\})\not\subset V(\mathcal{K}^{\ell}),

so σi+1\sigma_{i+1} satisfies Lemma 4.7(1). Now assume σi+11(μ){va,vb}\sigma_{i+1}^{-1}(\mu)\notin\{v_{a}^{\ell},v_{b}^{\ell}\}, from which it easily follows that

σi+11(V(𝒦))V(𝒫a)V(𝒫b) and |{σi+1(va),σi+1(vb)}V(𝒦)|1.\displaystyle\sigma_{i+1}^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\text{ and }|\{\sigma_{i+1}(v_{a}^{\ell}),\sigma_{i+1}(v_{b}^{\ell})\}\cap V(\mathcal{K}^{\ell})|\leq 1.

Of course, σi+1\sigma_{i+1} satisfies Lemma 4.7(2) if

{σi+1(va),σi+1(vb)}V(𝒦)=.\displaystyle\{\sigma_{i+1}(v_{a}^{\ell}),\sigma_{i+1}(v_{b}^{\ell})\}\cap V(\mathcal{K}^{\ell})=\emptyset.

If σi+1(va)V(𝒦)\sigma_{i+1}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell}) (the setting σi+1(vb)V(𝒦)\sigma_{i+1}(v_{b}^{\ell})\in V(\mathcal{K}^{\ell}) is argued analogously), then by studying the neighborhoods of vertices in V(𝒦)V(\mathcal{K}^{\ell}), it must be that σi(va){κa,κb}\sigma_{i}(v_{a}^{\ell})\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}, since σi\sigma_{i} would otherwise violate Proposition 4.4 or 4.6(2,3). Furthermore,

σi+11(σi(va))NXL(va)V(𝒫a),\displaystyle\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\in N_{X_{L}}(v_{a}^{\ell})\cap V(\mathcal{P}_{a}^{\ell}),

since if σi+11(σi(va))NXL(va)V(𝒫a)\sigma_{i+1}^{-1}(\sigma_{i}(v_{a}^{\ell}))\in N_{X_{L}}(v_{a}^{\ell})\setminus V(\mathcal{P}_{a}^{\ell}), we would have

{σi1(μ),σi1(σi+1(va))}σi1(V(𝒦))(V(𝒫a)V(𝒫b)),\displaystyle\{\sigma_{i}^{-1}(\mu),\sigma_{i}^{-1}(\sigma_{i+1}(v_{a}^{\ell}))\}\subset\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})),

implying σi\sigma_{i} violates Proposition 4.6(4); μσi+1(va)\mu\neq\sigma_{i+1}(v_{a}^{\ell}) since σi+11(μ){va,vb}\sigma_{i+1}^{-1}(\mu)\notin\{v_{a}^{\ell},v_{b}^{\ell}\}. It follows quickly that σi+1\sigma_{i+1} satisfies Lemma 4.7(2). This completes the induction. ∎

We are now ready to prove Proposition 4.6.

Proof of Proposition 4.6.

Assume (towards a contradiction) that the proposition is false, so there exists a swap sequence Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} with σ0=σs\sigma_{0}=\sigma_{s} of minimal length λ\lambda containing a vertex that violates Proposition 4.6. It is apparent from the preceding comment that λ1\lambda\geq 1. We also observe that all terms σiΣ\sigma_{i}\in\Sigma satisfy Proposition 4.4, and that σλ\sigma_{\lambda} must violate at least one of the four properties of Proposition 4.6. We break into cases based on the property that the configuration σλ\sigma_{\lambda} violates, and reach a contradiction in every case to deduce that none of these four properties can be broken by σλ\sigma_{\lambda}. This will produce the desired contradiction on our initial assumption.

Case 1: σλ1(κa1)V(Xa1)\sigma_{\lambda}^{-1}(\kappa_{a}^{1})\notin V(X_{a}^{1}) or σλ1(κb1)V(Xb1)\sigma_{\lambda}^{-1}(\kappa_{b}^{1})\notin V(X_{b}^{1}).

Assume that this statement holds. We only study the setting in which σλ1(κa1)V(Xa1)\sigma_{\lambda}^{-1}(\kappa_{a}^{1})\notin V(X_{a}^{1}); raising a contradiction when σλ1(κb1)V(Xb1)\sigma_{\lambda}^{-1}(\kappa_{b}^{1})\notin V(X_{b}^{1}) is analogous. To reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1}, we must have σλ11(κa1){v1,va1,2}\sigma_{\lambda-1}^{-1}(\kappa_{a}^{1})\in\{v^{1},v_{a}^{1,2}\} (in particular, we must have λ2\lambda\geq 2, since σs1(κa1){v1,va1,2}\sigma_{s}^{-1}(\kappa_{a}^{1})\notin\{v^{1},v_{a}^{1,2}\}). We break into subcases based on the value of σλ11(κa1)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{1}).

Subcase 1.1: σλ11(κa1)=v1\sigma_{\lambda-1}^{-1}(\kappa_{a}^{1})=v^{1}.

Here, σλ1(κa1)NXL(v1)V(𝒞b1)\sigma_{\lambda}^{-1}(\kappa_{a}^{1})\in N_{X_{L}}(v^{1})\cap V(\mathcal{C}_{b}^{1}). Recall that

NYL(κa1)=(V(𝒮a1){κa1})V(𝒦1).\displaystyle N_{Y_{L}}(\kappa_{a}^{1})=(V(\mathcal{S}_{a}^{1})\setminus\{\kappa_{a}^{1}\})\cup V(\mathcal{K}^{1}).

Since σλ1\sigma_{\lambda-1} satisfies Proposition 4.4 (on 𝒞a1\mathcal{C}_{a}^{1}), the vertex σλ1(σλ1(κa1))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{1})) that κa1\kappa_{a}^{1} swaps with to reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1} lies in V(𝒦1)V(\mathcal{K}^{1}). Since σλ1\sigma_{\lambda-1} satisfies Proposition 4.4 (on 𝒞b1\mathcal{C}_{b}^{1}), it must be that

{κa1,σλ1(σλ1(κa1))}=σλ1(V(𝒞b1))(V(𝒮b1){κb1}).\displaystyle\{\kappa_{a}^{1},\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{1}))\}=\sigma_{\lambda-1}(V(\mathcal{C}_{b}^{1}))\setminus(V(\mathcal{S}_{b}^{1})\setminus\{\kappa_{b}^{1}\}).

Combining this with σλ11(κb1)V(Xb1)\sigma_{\lambda-1}^{-1}(\kappa_{b}^{1})\in V(X_{b}^{1}) (due to σλ1\sigma_{\lambda-1} satisfying Proposition 4.6(1)), we deduce that

σλ11(κb1)V(𝒫b1){vb1}.\displaystyle\sigma_{\lambda-1}^{-1}(\kappa_{b}^{1})\in V(\mathcal{P}_{b}^{1})\setminus\{v_{b}^{1}\}.

However, by applying Propositions 4.4 and 4.6(1-3) to σλ1\sigma_{\lambda-1}, and recalling our assumption that σλ11(κa1)=v1\sigma_{\lambda-1}^{-1}(\kappa_{a}^{1})=v^{1}, we deduce that

σλ1(𝒫b1){κb1}(V(𝒮b1){κb1})V(𝒦1)=NYL(κb1),\displaystyle\sigma_{\lambda-1}(\mathcal{P}_{b}^{1})\setminus\{\kappa_{b}^{1}\}\subset(V(\mathcal{S}_{b}^{1})\setminus\{\kappa_{b}^{1}\})\cup V(\mathcal{K}^{1})=N_{Y_{L}}(\kappa_{b}^{1}),

so from σλ1\sigma_{\lambda-1}, we can swap κb1\kappa_{b}^{1} along V(𝒫b1)V(\mathcal{P}_{b}^{1}) onto vb1v_{b}^{1}, yielding a configuration τV(𝒞)\tau\in V(\mathscr{C}) satisfying

|τ(V(𝒞b1))(V(𝒮b1){κb1})|3,\displaystyle|\tau(V(\mathcal{C}_{b}^{1}))\setminus(V(\mathcal{S}_{b}^{1})\setminus\{\kappa_{b}^{1}\})|\geq 3,

contradicting Proposition 4.4. See Figure 11 for an illustration. In particular, this argument (with the analogue for the setting where σλ1(κb1)V(Xb1)\sigma_{\lambda}^{-1}(\kappa_{b}^{1})\notin V(X_{b}^{1})) concludes the study of the first three cases for L=1L=1.

Refer to caption
Figure 11. Configurations in Σ\Sigma used to raise a contradiction for Subcase 1.1, where we let μ=σλ1(σλ1(κa1))\mu=\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{1})). From σλ1\sigma_{\lambda-1}, swapping κb1\kappa_{b}^{1} left onto vb1v_{b}^{1} yields a configuration τ\tau which violates Proposition 4.4 on 𝒞b1\mathcal{C}_{b}^{1}.
Subcase 1.2: σλ11(κa1)=va1,2\sigma_{\lambda-1}^{-1}(\kappa_{a}^{1})=v_{a}^{1,2}.

This subcase only applies for L2L\geq 2. Observing that we must have

σλ1(κa1)NXL(va1,2)V(𝒞a2),\displaystyle\sigma_{\lambda}^{-1}(\kappa_{a}^{1})\in N_{X_{L}}(v_{a}^{1,2})\cap V(\mathcal{C}_{a}^{2}),

studying NYL(κa1)N_{Y_{L}}(\kappa_{a}^{1}) yields σλ(va1,2){κa2,κb2}\sigma_{\lambda}(v_{a}^{1,2})\in\{\kappa_{a}^{2},\kappa_{b}^{2}\}, since

σλ(va1,2)V(𝒮a1){κa1} and σλ(va1,2)V(𝒦1){κa2,κb2}\displaystyle\sigma_{\lambda}(v_{a}^{1,2})\in V(\mathcal{S}_{a}^{1})\setminus\{\kappa_{a}^{1}\}\text{ and }\sigma_{\lambda}(v_{a}^{1,2})\in V(\mathcal{K}^{1})\setminus\{\kappa_{a}^{2},\kappa_{b}^{2}\}

imply σλ1\sigma_{\lambda-1} violates Proposition 4.4 and Proposition 4.6(3), respectively. Since σj\sigma_{j} satisfies Propositions 4.4 and 4.6(2,3) for all 0jλ10\leq j\leq\lambda-1, a case check on the types of vertices in V(YL)V(Y_{L}) and considering which of them can be in σj(V(X1){va1,2,vb1,2})\sigma_{j}(V(X^{1})\setminus\{v_{a}^{1,2},v_{b}^{1,2}\}) implies

(4.7) σj(V(X1){va1,2,vb1,2})σs(V(X1)) for all 0jλ1.\displaystyle\sigma_{j}(V(X^{1})\setminus\{v_{a}^{1,2},v_{b}^{1,2}\})\subset\sigma_{s}(V(X^{1}))\text{ for all }0\leq j\leq\lambda-1.

The observations |σλ1(V(X1))|=|σs(V(X1))||\sigma_{\lambda-1}(V(X^{1}))|=|\sigma_{s}(V(X^{1}))| and σs1({κa2,κb2})V(X1)\sigma_{s}^{-1}(\{\kappa_{a}^{2},\kappa_{b}^{2}\})\subset V(X^{1}) together imply that, since κa1\kappa_{a}^{1} swaps with either κa2\kappa_{a}^{2} or κb2\kappa_{b}^{2} into NXL(va1,2)V(𝒞a2)N_{X_{L}}(v_{a}^{1,2})\cap V(\mathcal{C}_{a}^{2}) to reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1},

(4.8) σλ1(V(X1))σs(V(X1)),\displaystyle\sigma_{\lambda-1}(V(X^{1}))\setminus\sigma_{s}(V(X^{1}))\neq\emptyset,

while (4.7) applied to j=λ1j=\lambda-1 and σλ1(va1,2)=κa1σs(V(X1))\sigma_{\lambda-1}(v_{a}^{1,2})=\kappa_{a}^{1}\in\sigma_{s}(V(X^{1})) together imply that

(4.9) |σλ1(V(X1))σs(V(X1))|=|σs(V(X1))σλ1(V(X1))|1.\displaystyle|\sigma_{\lambda-1}(V(X^{1}))\setminus\sigma_{s}(V(X^{1}))|=|\sigma_{s}(V(X^{1}))\setminus\sigma_{\lambda-1}(V(X^{1}))|\leq 1.

If it were true that σλ1(vb1,2)σs(V(X1))\sigma_{\lambda-1}(v_{b}^{1,2})\in\sigma_{s}(V(X^{1})), recalling that σλ1(va1,2)=κa1σs(V(X1))\sigma_{\lambda-1}(v_{a}^{1,2})=\kappa_{a}^{1}\in\sigma_{s}(V(X^{1})), we get

σλ1({va1,2,vb1,2})σs(V(X1))σλ1(V(X1){va1,2,vb1,2})σs(V(X1)),\displaystyle\sigma_{\lambda-1}(\{v_{a}^{1,2},v_{b}^{1,2}\})\subset\sigma_{s}(V(X^{1}))\implies\sigma_{\lambda-1}(V(X^{1})\setminus\{v_{a}^{1,2},v_{b}^{1,2}\})\not\subset\sigma_{s}(V(X^{1})),

with the implication due to (4.8), contradicting (4.7) on j=λ1j=\lambda-1. Therefore,

σλ1(vb1,2)σs(V(X1)).\displaystyle\sigma_{\lambda-1}(v_{b}^{1,2})\notin\sigma_{s}(V(X^{1})).

This result, alongside a case check on the possible values of σλ1(vb1,2)\sigma_{\lambda-1}(v_{b}^{1,2}) (applying Propositions 4.4 and 4.6(2,3) to σλ1\sigma_{\lambda-1}), gives

(4.10) σλ1(vb1,2)(V(𝒮b2){κb2})V(𝒦2).\displaystyle\sigma_{\lambda-1}(v_{b}^{1,2})\in(V(\mathcal{S}_{b}^{2})\setminus\{\kappa_{b}^{2}\})\cup V(\mathcal{K}^{2}).

Let σξ\sigma_{\xi} be the final term of Σ\Sigma before σλ1\sigma_{\lambda-1} satisfying

σξ(vb1,2)σλ1(vb1,2);\displaystyle\sigma_{\xi}(v_{b}^{1,2})\neq\sigma_{\lambda-1}(v_{b}^{1,2});

ξ<λ1\xi<\lambda-1 is well-defined since σλ1(vb1,2)σs(V(X1))\sigma_{\lambda-1}(v_{b}^{1,2})\notin\sigma_{s}(V(X^{1})). To reach σξ+1\sigma_{\xi+1} from σξ\sigma_{\xi}, we swap σλ1(vb1,2)\sigma_{\lambda-1}(v_{b}^{1,2}) with σξ(vb1,2)\sigma_{\xi}(v_{b}^{1,2}), where

σξ1(σλ1(vb1,2))NXL(vb1,2)V(𝒞b2).\displaystyle\sigma_{\xi}^{-1}(\sigma_{\lambda-1}(v_{b}^{1,2}))\in N_{X_{L}}(v_{b}^{1,2})\cap V(\mathcal{C}_{b}^{2}).

Indeed, see (4.10); if we had that

σξ1(σλ1(vb1,2))NXL(vb1,2)V(𝒞b1),\displaystyle\sigma_{\xi}^{-1}(\sigma_{\lambda-1}(v_{b}^{1,2}))\in N_{X_{L}}(v_{b}^{1,2})\cap V(\mathcal{C}_{b}^{1}),

σξ\sigma_{\xi} would violate Proposition 4.4 on 𝒞b2\mathcal{C}_{b}^{2} if σλ1(vb1,2)V(𝒮b2){κb2}\sigma_{\lambda-1}(v_{b}^{1,2})\in V(\mathcal{S}_{b}^{2})\setminus\{\kappa_{b}^{2}\} and Proposition 4.6(2,3) if σλ1(vb1,2)V(𝒦2)\sigma_{\lambda-1}(v_{b}^{1,2})\in V(\mathcal{K}^{2}). By the definition of ξ\xi, σj(vb1,2)\sigma_{j}(v_{b}^{1,2}) remains unchanged for ξ+1jλ1\xi+1\leq j\leq\lambda-1. Furthermore, from (4.10), we observe that

(4.11) σξ(vb1,2){κa2,κb2}V(𝒦2),\displaystyle\sigma_{\xi}(v_{b}^{1,2})\in\{\kappa_{a}^{2},\kappa_{b}^{2}\}\cup V(\mathcal{K}^{2}),

since the statements

σξ(vb1,2)(V(𝒮a3){κa3})(V(𝒮b3){κb3}) and σξ(vb1,2)V(𝒦3)\displaystyle\sigma_{\xi}(v_{b}^{1,2})\in(V(\mathcal{S}_{a}^{3})\setminus\{\kappa_{a}^{3}\})\cup(V(\mathcal{S}_{b}^{3})\setminus\{\kappa_{b}^{3}\})\text{ and }\sigma_{\xi}(v_{b}^{1,2})\in V(\mathcal{K}^{3})

would result in σξ\sigma_{\xi} violating Propositions 4.4 and 4.6(2,3), respectively. If σξ(vb1,2){κa2,κb2}\sigma_{\xi}(v_{b}^{1,2})\in\{\kappa_{a}^{2},\kappa_{b}^{2}\}, then

σλ1(σξ(vb1,2))V(X1);\displaystyle\sigma_{\lambda}^{-1}(\sigma_{\xi}(v_{b}^{1,2}))\in V(X^{1});

this is immediate if σξ(vb1,2)=σλ(va1,2)\sigma_{\xi}(v_{b}^{1,2})=\sigma_{\lambda}(v_{a}^{1,2}) (recall that σλ(va1,2){κa2,κb2}\sigma_{\lambda}(v_{a}^{1,2})\in\{\kappa_{a}^{2},\kappa_{b}^{2}\}), and if σξ(vb1,2)σλ(va1,2)\sigma_{\xi}(v_{b}^{1,2})\neq\sigma_{\lambda}(v_{a}^{1,2}), the assumption σλ11(σξ(vb1,2))=σλ1(σξ(vb1,2))V(X1)\sigma_{\lambda-1}^{-1}(\sigma_{\xi}(v_{b}^{1,2}))=\sigma_{\lambda}^{-1}(\sigma_{\xi}(v_{b}^{1,2}))\notin V(X^{1}) (we swap σλ(va1,2)\sigma_{\lambda}(v_{a}^{1,2}) and κa1\kappa_{a}^{1} to reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1}), alongside σλ11(σλ(va1,2))NXL(va1,2)V(𝒞a2)\sigma_{\lambda-1}^{-1}(\sigma_{\lambda}(v_{a}^{1,2}))\in N_{X_{L}}(v_{a}^{1,2})\cap V(\mathcal{C}_{a}^{2}), would contradict (4.9), since we would have

{σξ(vb1,2),σλ(va1,2)}σs(V(X1))σλ1(V(X1)).\displaystyle\{\sigma_{\xi}(v_{b}^{1,2}),\sigma_{\lambda}(v_{a}^{1,2})\}\subseteq\sigma_{s}(V(X^{1}))\setminus\sigma_{\lambda-1}(V(X^{1})).

Thus, σξ(vb1,2)\sigma_{\xi}(v_{b}^{1,2}) traverses a path from vb1,2v_{b}^{1,2} to va1,2v_{a}^{1,2}, not involving vb1,2v_{b}^{1,2} past σξ\sigma_{\xi}, as we go from σξ\sigma_{\xi} to σλ\sigma_{\lambda}. Certainly, this traversal swaps σξ(vb1,2)\sigma_{\xi}(v_{b}^{1,2}) along both V(𝒞a2)V(\mathcal{C}_{a}^{2}) and V(𝒞b2)V(\mathcal{C}_{b}^{2}). Suppose σξ(vb1,2)=κa2\sigma_{\xi}(v_{b}^{1,2})=\kappa_{a}^{2}. Due to (4.10), σξ(vb1,2)=κa2\sigma_{\xi}(v_{b}^{1,2})=\kappa_{a}^{2} must have swapped with σλ1(vb1,2)V(𝒦2)\sigma_{\lambda-1}(v_{b}^{1,2})\in V(\mathcal{K}^{2}) to reach σξ+1\sigma_{\xi+1} from σξ\sigma_{\xi}. Let ζ>ξ+1\zeta>\xi+1 be the earliest such index satisfying

σζ(σξ+11(κa2))κa2;\displaystyle\sigma_{\zeta}(\sigma_{\xi+1}^{-1}(\kappa_{a}^{2}))\neq\kappa_{a}^{2};

ζ\zeta is well-defined since σξ(vb1,2)=κa2\sigma_{\xi}(v_{b}^{1,2})=\kappa_{a}^{2} swaps along both V(𝒞a2)V(\mathcal{C}_{a}^{2}) and V(𝒞b2)V(\mathcal{C}_{b}^{2}) to reach σλ\sigma_{\lambda}. The vertex σζ(σξ+11(κa2))\sigma_{\zeta}(\sigma_{\xi+1}^{-1}(\kappa_{a}^{2})) must have swapped with κa2\kappa_{a}^{2} to reach σζ\sigma_{\zeta} from σζ1\sigma_{\zeta-1}. Since σζ(vb1,2)=σλ1(vb1,2)V(𝒦2)\sigma_{\zeta}(v_{b}^{1,2})=\sigma_{\lambda-1}(v_{b}^{1,2})\in V(\mathcal{K}^{2}), σζ(σξ+11(κa2))NYL(κa2)\sigma_{\zeta}(\sigma_{\xi+1}^{-1}(\kappa_{a}^{2}))\in N_{Y_{L}}(\kappa_{a}^{2}), and κa2\kappa_{a}^{2} are all not in V(𝒮b2){κb2}V(\mathcal{S}_{b}^{2})\setminus\{\kappa_{b}^{2}\}, we have

|σζ(V(𝒞b2))(V(𝒮b2){κb2})|3,\displaystyle|\sigma_{\zeta}(V(\mathcal{C}_{b}^{2}))\setminus(V(\mathcal{S}_{b}^{2})\setminus\{\kappa_{b}^{2}\})|\geq 3,

contradicting Proposition 4.4. See Figure 12a for an illustration. Thus, σξ(vb1,2)=κb2\sigma_{\xi}(v_{b}^{1,2})=\kappa_{b}^{2}. Let ζ>ξ+1\zeta>\xi+1 be the earliest such index satisfying

σζ1(κb2)V(𝒞a2)NXL[bd(𝒞a2)];\displaystyle\sigma_{\zeta}^{-1}(\kappa_{b}^{2})\in V(\mathcal{C}_{a}^{2})\setminus N_{X_{L}}[\textup{bd}(\mathcal{C}_{a}^{2})];

ζ\zeta is well-defined since κb2\kappa_{b}^{2} goes from vb1,2v_{b}^{1,2} to va1,2v_{a}^{1,2} to reach σλ\sigma_{\lambda}. Here, κb2\kappa_{b}^{2} must have swapped with κa2\kappa_{a}^{2} to reach σζ\sigma_{\zeta} from σζ1\sigma_{\zeta-1}: as in the preceding case, κb2\kappa_{b}^{2} would be “stuck” otherwise, due to σζ\sigma_{\zeta} satisfying Proposition 4.4 (on 𝒞a2\mathcal{C}_{a}^{2}). But then σζ\sigma_{\zeta} would violate Proposition 4.6(4) on =1\ell=1, namely since {κa2,κb2}V(𝒦1)\{\kappa_{a}^{2},\kappa_{b}^{2}\}\subset V(\mathcal{K}^{1}), which implies

|σζ1(V(𝒦1))(V(𝒫a1)V(𝒫b1))|2.\displaystyle|\sigma_{\zeta}^{-1}(V(\mathcal{K}^{1}))\setminus(V(\mathcal{P}_{a}^{1})\cup V(\mathcal{P}_{b}^{1}))|\geq 2.

See Figure 12b for an illustration. So, by (4.11), we must have σξ(vb1,2)V(𝒦2)\sigma_{\xi}(v_{b}^{1,2})\in V(\mathcal{K}^{2}). Since we swap σλ1(vb1,2)\sigma_{\lambda-1}(v_{b}^{1,2}) with σξ(vb1,2)\sigma_{\xi}(v_{b}^{1,2}) to reach σξ+1\sigma_{\xi+1} from σξ\sigma_{\xi}, it follows from (4.10) that σλ1(vb1,2)V(𝒦2)\sigma_{\lambda-1}(v_{b}^{1,2})\in V(\mathcal{K}^{2}), since there is no element of V(𝒦2)V(\mathcal{K}^{2}) (in particular, σξ(vb1,2)\sigma_{\xi}(v_{b}^{1,2})) that can swap with an element of V(𝒮b2){κb2}V(\mathcal{S}_{b}^{2})\setminus\{\kappa_{b}^{2}\}. But then σξ\sigma_{\xi} violates Proposition 4.6(4) (on =2\ell=2), which is our final contradiction in this case. We conclude that Proposition 4.6(1) cannot have been the property violated by σλ\sigma_{\lambda}.

Refer to caption
(a) Assuming σξ(vb1,2)=κa2\sigma_{\xi}(v_{b}^{1,2})=\kappa_{a}^{2}, with μ=σζ(σξ+11(κa2))\mu^{\prime}=\sigma_{\zeta}(\sigma_{\xi+1}^{-1}(\kappa_{a}^{2})). Here, σζ\sigma_{\zeta} violates Proposition 4.4 on 𝒞b2\mathcal{C}_{b}^{2} due to κa2\kappa_{a}^{2}, μ\mu, and μ\mu^{\prime}, none of which are in V(𝒮b2){κb2}V(\mathcal{S}_{b}^{2})\setminus\{\kappa_{b}^{2}\}.
Refer to caption
(b) Assuming σξ(vb1,2)=κb2\sigma_{\xi}(v_{b}^{1,2})=\kappa_{b}^{2}. Here, σζ\sigma_{\zeta} violates Proposition 4.6(4) on =1\ell=1 due to κa2\kappa_{a}^{2}, κb2\kappa_{b}^{2}.
Figure 12. Configurations in Σ\Sigma used to raise a contradiction for Subcase 1.2 when we assume that σξ(vb1,2){κa2,κb2}\sigma_{\xi}(v_{b}^{1,2})\in\{\kappa_{a}^{2},\kappa_{b}^{2}\}. We let μ=σλ1(vb1,2)\mu=\sigma_{\lambda-1}(v_{b}^{1,2}).

Case 2: For some 2\ell\geq 2, we have σλ1(κa)V(X1)V(X)\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})\notin V(X^{\ell-1})\cup V(X^{\ell}) or σλ1(κb)V(X1)V(X)\sigma_{\lambda}^{-1}(\kappa_{b}^{\ell})\notin V(X^{\ell-1})\cup V(X^{\ell}).

This case is relevant only for L2L\geq 2. Assume this statement holds for some 2L2\leq\ell\leq L. We only study the setting in which σλ1(κa)V(X1)V(X)\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})\notin V(X^{\ell-1})\cup V(X^{\ell}). Raising a contradiction when σλ1(κb)V(X1)V(X)\sigma_{\lambda}^{-1}(\kappa_{b}^{\ell})\notin V(X^{\ell-1})\cup V(X^{\ell}) is entirely analogous. Notice that

σλ11(κa){va2,1,vb2,1,va,+1,vb,+1}\displaystyle\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\in\{v_{a}^{\ell-2,\ell-1},v_{b}^{\ell-2,\ell-1},v_{a}^{\ell,\ell+1},v_{b}^{\ell,\ell+1}\}

(precisely, the RHS above is the subset of these vertices defined for \ell). To reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1}, the vertex σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) that κa\kappa_{a}^{\ell} swaps with satisfies

(4.12) σλ1(σλ1(κa)){κa1,κb1}V(𝒦)(V(𝒮a){κa}),\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in\{\kappa_{a}^{\ell-1},\kappa_{b}^{\ell-1}\}\cup V(\mathcal{K}^{\ell})\cup(V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}),

as σλ1(σλ1(κa))V(𝒦b1)\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{K}_{b}^{\ell-1}) would cause σλ1\sigma_{\lambda-1} to violate Proposition 4.6(4), since we then get

{κa,σλ1(σλ1(κa))}σλ11(V(𝒦1))(V(𝒫a1)V(𝒫b1))\displaystyle\{\kappa_{a}^{\ell},\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\}\subset\sigma_{\lambda-1}^{-1}(V(\mathcal{K}^{\ell-1}))\setminus(V(\mathcal{P}_{a}^{\ell-1})\cup V(\mathcal{P}_{b}^{\ell-1}))

which implies that

|σλ11(V(𝒦1))(V(𝒫a1)V(𝒫b1))|2.\displaystyle|\sigma_{\lambda-1}^{-1}(V(\mathcal{K}^{\ell-1}))\setminus(V(\mathcal{P}_{a}^{\ell-1})\cup V(\mathcal{P}_{b}^{\ell-1}))|\geq 2.

We break into subcases based on the value of σλ11(κa)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}).

Subcase 2.1: σλ11(κa){va2,1,vb2,1}\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\in\{v_{a}^{\ell-2,\ell-1},v_{b}^{\ell-2,\ell-1}\}.

This subcase applies for 3\ell\geq 3. The vertex which κa\kappa_{a}^{\ell} swaps onto satisfies

σλ1(κa)(NXL(va2,1)NXL(vb2,1))V(X2).\displaystyle\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})\in(N_{X_{L}}(v_{a}^{\ell-2,\ell-1})\cup N_{X_{L}}(v_{b}^{\ell-2,\ell-1}))\cap V(X^{\ell-2}).

From (4.12), we deduce that the vertex κa\kappa_{a}^{\ell} swaps with to reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1} satisfies

σλ1(σλ1(κa)){κa1,κb1},\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in\{\kappa_{a}^{\ell-1},\kappa_{b}^{\ell-1}\},

since the statements

σλ1(σλ1(κa))V(𝒮a){κa} and σλ1(σλ1(κa))V(𝒦)\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}\text{ and }\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{K}^{\ell})

respectively imply that σλ1\sigma_{\lambda-1} violates Proposition 4.4 and Proposition 4.6(2,3). Proceeding backwards in Σ\Sigma, σλ2σλ\sigma_{\lambda-2}\neq\sigma_{\lambda} (σλ11(κa)σs1(κa)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\neq\sigma_{s}^{-1}(\kappa_{a}^{\ell}) implies σλ1σs\sigma_{\lambda-1}\neq\sigma_{s}, so σλ2\sigma_{\lambda-2} is well-defined, and λ\lambda is minimal). Now, if we had that

σλ21(σλ1(σλ1(κa)))=σλ1(κa) and σλ21(κa)=σλ11(κa),\displaystyle\sigma_{\lambda-2}^{-1}\left(\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\right)=\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})\text{ and }\sigma_{\lambda-2}^{-1}(\kappa_{a}^{\ell})=\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}),

then swapping σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) with κa\kappa_{a}^{\ell} directly from σλ2\sigma_{\lambda-2} would contradict λ\lambda being minimal. Thus, we have

σλ21(κa)NXL(σλ11(κa))V(X1),\displaystyle\sigma_{\lambda-2}^{-1}(\kappa_{a}^{\ell})\in N_{X_{L}}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\cap V(X^{\ell-1}),

since σλ2\sigma_{\lambda-2} satisfies Proposition 4.4 and neither σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) nor κa\kappa_{a}^{\ell} can swap with vertices in

(V(𝒮a2){κa2})(V(𝒮b2){κb2}).\displaystyle(V(\mathcal{S}_{a}^{\ell-2})\setminus\{\kappa_{a}^{\ell-2}\})\cup(V(\mathcal{S}_{b}^{\ell-2})\setminus\{\kappa_{b}^{\ell-2}\}).

But any vertex in NYL(κa)N_{Y_{L}}(\kappa_{a}^{\ell}) with which κa\kappa_{a}^{\ell} can swap to reach σλ2\sigma_{\lambda-2} from σλ1\sigma_{\lambda-1} raises a contradiction: a vertex of {κa1,κb1}V(𝒦1)\{\kappa_{a}^{\ell-1},\kappa_{b}^{\ell-1}\}\cup V(\mathcal{K}^{\ell-1}) implies σλ1\sigma_{\lambda-1} violates Proposition 4.6(4) (respectively, on layers 2\ell-2 and 1\ell-1, due to σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) and κa\kappa_{a}^{\ell}), a vertex of V(𝒦)V(\mathcal{K}^{\ell}) implies σλ1\sigma_{\lambda-1} violates Proposition 4.6(2,3), and a vertex in V(𝒮a){κa}V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\} implies σλ1\sigma_{\lambda-1} violates Proposition 4.4. See Figure 13 for an illustration.

Refer to caption
Figure 13. Configurations in Σ\Sigma used to raise a contradiction for Subcase 2.1, illustrated for σλ11(κa)=va2,1\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})=v_{a}^{\ell-2,\ell-1} and σλ1(σλ1(κa))=κa1\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))=\kappa_{a}^{\ell-1}. Here, κa\kappa_{a}^{\ell} must swap with a vertex μNYL(κa)\mu\in N_{Y_{L}}(\kappa_{a}^{\ell}) to reach σλ2\sigma_{\lambda-2} from σλ1\sigma_{\lambda-1}, for which all possibilities of μ\mu raise a contradiction.
Subcase 2.2: σλ11(κa){va,+1,vb,+1}\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\in\{v_{a}^{\ell,\ell+1},v_{b}^{\ell,\ell+1}\}.

This subcase applies for 2<L2\leq\ell<L. The vertex κa\kappa_{a}^{\ell} swaps onto satisfies

σλ1(κa)(NXL(va,+1)NXL(vb,+1))V(X+1).\displaystyle\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})\in(N_{X_{L}}(v_{a}^{\ell,\ell+1})\cup N_{X_{L}}(v_{b}^{\ell,\ell+1}))\cap V(X^{\ell+1}).

From (4.12), we deduce that the vertex κa\kappa_{a}^{\ell} swaps with to reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1} satisfies

(4.13) σλ1(σλ1(κa)){κa+1,κb+1},\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\},

since the statements

σλ1(σλ1(κa))V(𝒮a){κa},\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}, σλ1(σλ1(κa)){κa1,κb1}(V(𝒦){κa+1,κb+1})\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\in\{\kappa_{a}^{\ell-1},\kappa_{b}^{\ell-1}\}\cup(V(\mathcal{K}^{\ell})\setminus\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\})

imply that σλ1\sigma_{\lambda-1} violates Proposition 4.4 and Proposition 4.6(1-3), respectively. Let σξ\sigma_{\xi}, with ξ<λ1\xi<\lambda-1, be the last term in Σ\Sigma before σλ1\sigma_{\lambda-1} satisfying

σξ1(σλ1(σλ1(κa)))V(X);\displaystyle\sigma_{\xi}^{-1}\left(\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\right)\in V(X^{\ell});

ξ\xi is well-defined since (see (4.13)) σs1({κa+1,κb+1})V(X)\sigma_{s}^{-1}(\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\})\subset V(X^{\ell}), which also implies σsσλ1\sigma_{s}\neq\sigma_{\lambda-1} since

σλ11(σλ1(σλ1(κa)))=σλ1(κa)V(X).\displaystyle\sigma_{\lambda-1}^{-1}\left(\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\right)=\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})\notin V(X^{\ell}).

By the definition of ξ\xi,

(4.14) σj1(σλ1(σλ1(κa)))V(X) for ξ+1jλ1.\displaystyle\sigma_{j}^{-1}\left(\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\right)\notin V(X^{\ell})\text{ for }\xi+1\leq j\leq\lambda-1.

Since σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) traverses a path to σλ11(κa)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}) as we go from σξ+1\sigma_{\xi+1} to σλ\sigma_{\lambda}, not involving V(X)V(X^{\ell}) until σλ\sigma_{\lambda}, we further deduce that

σξ1(σλ1(σλ1(κa)))=σλ11(κa),\displaystyle\sigma_{\xi}^{-1}\left(\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\right)=\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}),

since σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) cannot traverse a path from {va,+1,vb,+1}{σλ11(κa)}\{v_{a}^{\ell,\ell+1},v_{b}^{\ell,\ell+1}\}\setminus\{\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})\} to σλ11(κa)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}) as we swap from σξ+1\sigma_{\xi+1} to σλ\sigma_{\lambda} without violating Proposition 4.4 or 4.6(2,4).999This can be proved using arguments essentially identical to those in Subcase 1.2. Thus, from σξ\sigma_{\xi} to σξ+1\sigma_{\xi+1}, σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) swaps into

NXL(σλ11(κa))V(X+1)\displaystyle N_{X_{L}}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\cap V(X^{\ell+1})

from σλ11(κa)\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}), and since σξ+1\sigma_{\xi+1} satisfies Proposition 4.6(2,3), a case check on NYL(σλ1(σλ1(κa)))N_{Y_{L}}\left(\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))\right) (see (4.13)) yields

σξ+1(σλ11(κa))(V(𝒮a+1){κa+1})(V(𝒮b+1){κb+1})V(𝒦+1).\displaystyle\sigma_{\xi+1}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\in(V(\mathcal{S}_{a}^{\ell+1})\setminus\{\kappa_{a}^{\ell+1}\})\cup(V(\mathcal{S}_{b}^{\ell+1})\setminus\{\kappa_{b}^{\ell+1}\})\cup V(\mathcal{K}^{\ell+1}).

If it were true that

(4.15) σξ+1(σλ11(κa))(V(𝒮a+1){κa+1})(V(𝒮b+1){κb+1}),\displaystyle\sigma_{\xi+1}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\in(V(\mathcal{S}_{a}^{\ell+1})\setminus\{\kappa_{a}^{\ell+1}\})\cup(V(\mathcal{S}_{b}^{\ell+1})\setminus\{\kappa_{b}^{\ell+1}\}),

then it must be that σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) is the corresponding knob vertex. So from (4.14), we deduce that σj(σλ11(κa))\sigma_{j}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})) would be fixed for ξ+1jλ1\xi+1\leq j\leq\lambda-1. Taking j=ξ+1j=\xi+1 and j=λ1j=\lambda-1 would imply

σξ+1(σλ11(κa))=κa,\displaystyle\sigma_{\xi+1}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))=\kappa_{a}^{\ell},

contradicting (4.15). Therefore, σξ+1(σλ11(κa))V(𝒦+1)\sigma_{\xi+1}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{K}^{\ell+1}). We now inductively establish that

(4.16) σj(σλ11(κa))V(𝒦+1) for ξ+1jλ1\displaystyle\sigma_{j}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\in V(\mathcal{K}^{\ell+1})\text{ for }\xi+1\leq j\leq\lambda-1

by showing that it is fixed for all such jj. Assume for some jj satisfying ξ+1j<λ1\xi+1\leq j<\lambda-1 that σj\sigma_{j} satisfies this claim. Then to reach σj+1\sigma_{j+1} from σj\sigma_{j}, σj(σλ11(κa))\sigma_{j}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})) cannot swap with either {κa+1,κb+1}\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\} (see (4.13); σj+1\sigma_{j+1} would violate either (4.14) or Proposition 4.6(4) for layer \ell, depending on whether it swaps with σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) or not, respectively), another vertex in V(𝒦+1)V(\mathcal{K}^{\ell+1}) (σj+1\sigma_{j+1} would violate Proposition 4.6(4) for layer +1\ell+1), or a vertex in

(V(𝒮a+2){κa+2})(V(𝒮b+2){κb+2})V(𝒦+2)\displaystyle(V(\mathcal{S}_{a}^{\ell+2})\setminus\{\kappa_{a}^{\ell+2}\})\cup(V(\mathcal{S}_{b}^{\ell+2})\setminus\{\kappa_{b}^{\ell+2}\})\cup V(\mathcal{K}^{\ell+2})

(for the setting σj(σλ11(κa)){κa+2,κb+2}\sigma_{j}(\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell}))\in\{\kappa_{a}^{\ell+2},\kappa_{b}^{\ell+2}\}, if it applies; σj+1\sigma_{j+1} would violate Proposition 4.4 or Proposition 4.6(2,3)). This completes the induction. Now, (4.16) on j=λ1j=\lambda-1 raises a contradiction, since κaV(𝒦+1)\kappa_{a}^{\ell}\notin V(\mathcal{K}^{\ell+1}). See Figure 14 for an illustration. This is our final contradiction in this case. We conclude that Proposition 4.6(2) cannot have been the property violated by σλ\sigma_{\lambda}.

Refer to caption
Figure 14. Configurations in Σ\Sigma used to raise a contradiction for Subcase 2.2, illustrated for σλ11(κa)=va,+1\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})=v_{a}^{\ell,\ell+1} and σλ1(σλ1(κa))=κa+1\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell}))=\kappa_{a}^{\ell+1}. We let μ=σξ+1(va,+1)\mu=\sigma_{\xi+1}(v_{a}^{\ell,\ell+1}). For ξ+1jλ1\xi+1\leq j\leq\lambda-1, we have σj(va,+1)V(𝒦+1)\sigma_{j}(v_{a}^{\ell,\ell+1})\in V(\mathcal{K}^{\ell+1}), contradicting σλ11(κa)=va,+1\sigma_{\lambda-1}^{-1}(\kappa_{a}^{\ell})=v_{a}^{\ell,\ell+1}.

Case 3: There exists [L1]\ell\in[L-1] and μV(𝒦){κa+1,κb+1}\mu\in V(\mathcal{K}^{\ell})\setminus\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\} such that σλ1(μ)V(X)\sigma_{\lambda}^{-1}(\mu)\notin V(X^{\ell}), or there exists μV(𝒦L)\mu\in V(\mathcal{K}^{L}) such that σλ1(μ)V(XL)\sigma_{\lambda}^{-1}(\mu)\notin V(X^{L}).

This case is relevant only for L2L\geq 2. The proceeding argument raises a contradiction both when assuming the existence of [L1]\ell\in[L-1] for which there exists μV(𝒦){κa+1,κb+1}\mu\in V(\mathcal{K}^{\ell})\setminus\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\} such that σλ1(μ)V(X)\sigma_{\lambda}^{-1}(\mu)\notin V(X^{\ell}), and also when assuming the existence of μV(𝒦L)\mu\in V(\mathcal{K}^{L}) such that σλ1(μ)V(XL)\sigma_{\lambda}^{-1}(\mu)\notin V(X^{L}), taking =L\ell=L.

Observe that (where, more precisely, the RHS is the subset that is well-defined for \ell)

σλ11(μ){va1,,vb1,,va,+1,vb,+1},\displaystyle\sigma_{\lambda-1}^{-1}(\mu)\in\{v_{a}^{\ell-1,\ell},v_{b}^{\ell-1,\ell},v_{a}^{\ell,\ell+1},v_{b}^{\ell,\ell+1}\},

and also that the vertex σλ1(σλ1(μ))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\mu)) that μ\mu swaps with to reach σλ\sigma_{\lambda} from σλ1\sigma_{\lambda-1} satisfies

(4.17) σλ1(σλ1(μ)){κa,κb},\displaystyle\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\mu))\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\},

since NYL(μ)V(𝒦){κa,κb}N_{Y_{L}}(\mu)\subset V(\mathcal{K}^{\ell})\cup\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}, and σλ1(σλ1(μ))V(𝒦)\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\mu))\in V(\mathcal{K}^{\ell}) would imply σλ1\sigma_{\lambda-1} violates Proposition 4.6(4) on layer \ell. If σλ11(μ){va,+1,vb,+1}\sigma_{\lambda-1}^{-1}(\mu)\in\{v_{a}^{\ell,\ell+1},v_{b}^{\ell,\ell+1}\} (valid for <L\ell<L), then we would have that

σλ1(μ)(NXL(va,+1)NXL(vb,+1))V(X+1),\displaystyle\sigma_{\lambda}^{-1}(\mu)\in(N_{X_{L}}(v_{a}^{\ell,\ell+1})\cup N_{X_{L}}(v_{b}^{\ell,\ell+1}))\cap V(X^{\ell+1}),

from which (4.17) implies that σλ1\sigma_{\lambda-1} violates Proposition 4.6(1) if =1\ell=1 and Proposition 4.6(2) if 2\ell\geq 2. Thus, it must be that 2\ell\geq 2 and σλ11(μ){va1,,vb1,}\sigma_{\lambda-1}^{-1}(\mu)\in\{v_{a}^{\ell-1,\ell},v_{b}^{\ell-1,\ell}\}, so that

(4.18) σλ1(μ)(NXL(va1,)NXL(vb1,))V(X1).\displaystyle\sigma_{\lambda}^{-1}(\mu)\in(N_{X_{L}}(v_{a}^{\ell-1,\ell})\cup N_{X_{L}}(v_{b}^{\ell-1,\ell}))\cap V(X^{\ell-1}).

Proceeding backwards in Σ\Sigma, σλ2σλ\sigma_{\lambda-2}\neq\sigma_{\lambda} (σλ11(μ)σs1(μ)\sigma_{\lambda-1}^{-1}(\mu)\neq\sigma_{s}^{-1}(\mu) implies that σλ1σs\sigma_{\lambda-1}\neq\sigma_{s}, so σλ2\sigma_{\lambda-2} is well-defined, and λ\lambda is minimal). If neither μ\mu nor σλ1(σλ1(μ))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\mu)) were swapped to reach σλ2\sigma_{\lambda-2} from σλ1\sigma_{\lambda-1}, swapping them directly from σλ2\sigma_{\lambda-2} would contradict λ\lambda being minimal. Furthermore, from (4.17) and (4.18), μ\mu swaps onto

NXL(σλ11(μ))V(X)\displaystyle N_{X_{L}}(\sigma_{\lambda-1}^{-1}(\mu))\cap V(X^{\ell})

to reach σλ2\sigma_{\lambda-2} from σλ1\sigma_{\lambda-1}, since σλ1\sigma_{\lambda-1} satisfies Proposition 4.4 and neither μ\mu nor σλ1(σλ1(κa))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\kappa_{a}^{\ell})) can swap with vertices in the set

(V(𝒮a1){κa1})(V(𝒮b1){κb1}).\displaystyle(V(\mathcal{S}_{a}^{\ell-1})\setminus\{\kappa_{a}^{\ell-1}\})\cup(V(\mathcal{S}_{b}^{\ell-1})\setminus\{\kappa_{b}^{\ell-1}\}).

But the vertex σλ2(σλ11(μ))\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\mu)) that μ\mu swaps with to reach σλ2\sigma_{\lambda-2} from σλ1\sigma_{\lambda-1} implies that σλ2\sigma_{\lambda-2} violates Proposition 4.6(4) on layer 1\ell-1 if

σλ2(σλ11(μ)){κa,κb}\displaystyle\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\mu))\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}

due to σλ1(σλ1(μ))\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\mu)) and σλ2(σλ11(μ))\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\mu)) (see (4.17)) and on layer \ell if

σλ2(σλ11(μ))V(𝒦)\displaystyle\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\mu))\in V(\mathcal{K}^{\ell})

due to μ\mu and σλ2(σλ11(μ))\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\mu)). See Figure 15 for an illustration. We conclude that Proposition 4.6(3) cannot have been the property violated by σλ\sigma_{\lambda}.

Refer to caption
Figure 15. Configurations in Σ\Sigma used to raise a contradiction in Case 3, illustrated for σλ1(σλ1(μ))=κa\sigma_{\lambda-1}(\sigma_{\lambda}^{-1}(\mu))=\kappa_{a}^{\ell} and σλ11(μ)=va1,\sigma_{\lambda-1}^{-1}(\mu)=v_{a}^{\ell-1,\ell}. We let μ=σλ2(σλ11(μ))\mu^{\prime}=\sigma_{\lambda-2}(\sigma_{\lambda-1}^{-1}(\mu)). All possibilities of μ\mu^{\prime} will cause σλ2\sigma_{\lambda-2} to violate Proposition 4.6(4).

Case 4: There exists [L]\ell\in[L] such that |σλ1(V(𝒦))(V(𝒫a)V(𝒫b))|2|\sigma_{\lambda}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\geq 2.

Assume that this statement holds for some [L]\ell\in[L]. We must have that

(4.19) |σλ11(V(𝒦))(V(𝒫a)V(𝒫b))|=1 and |σλ1(V(𝒦))(V(𝒫a)V(𝒫b))|=2,\displaystyle|\sigma_{\lambda-1}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|=1\text{ and }|\sigma_{\lambda}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|=2,

since |σλ1(V(𝒦))(V(𝒫a)V(𝒫b))|2|\sigma_{\lambda}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\geq 2, λ\lambda is minimal, and for any index 1iλ1\leq i\leq\lambda, we have

|σi1(V(𝒦))(V(𝒫a)V(𝒫b))||σi11(V(𝒦))(V(𝒫a)V(𝒫b))|1.\displaystyle|\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|-|\sigma_{i-1}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\leq 1.

By (4.19) and Proposition 4.6(4), there is a unique μV(𝒦)\mu\in V(\mathcal{K}^{\ell}) such that

σλ11(μ)V(𝒫a)V(𝒫b).\displaystyle\sigma_{\lambda-1}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Furthermore, there exists μV(𝒦)\mu^{\prime}\in V(\mathcal{K}^{\ell}) such that (exactly) one of the two following statements hold:

σλ1(va)=μ,σλ1(μ)NXL(va)V(𝒫a);\displaystyle\sigma_{\lambda-1}(v_{a}^{\ell})=\mu^{\prime},\ \sigma_{\lambda}^{-1}(\mu^{\prime})\in N_{X_{L}}(v_{a}^{\ell})\setminus V(\mathcal{P}_{a}^{\ell}); σλ1(vb)=μ,σλ1(μ)NXL(vb)V(𝒫b).\displaystyle\sigma_{\lambda-1}(v_{b}^{\ell})=\mu^{\prime},\ \sigma_{\lambda}^{-1}(\mu^{\prime})\in N_{X_{L}}(v_{b}^{\ell})\setminus V(\mathcal{P}_{b}^{\ell}).

Studying the neighborhoods of vertices in V(𝒦)V(\mathcal{K}^{\ell}) yields

σλ(σλ11(μ)){κa,κb};\displaystyle\sigma_{\lambda}(\sigma_{\lambda-1}^{-1}(\mu^{\prime}))\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\};

it can easily be checked that σλ1\sigma_{\lambda-1} would violate one of (4.19), Proposition 4.4, or Proposition 4.6(2,3) otherwise. We will assume (the other three cases are analogous)

σλ1(va)=μ,σλ(σλ11(μ))=κa.\displaystyle\sigma_{\lambda-1}(v_{a}^{\ell})=\mu^{\prime},\ \sigma_{\lambda}(\sigma_{\lambda-1}^{-1}(\mu^{\prime}))=\kappa_{a}^{\ell}.

It follows from Lemma 4.7(2) that σλ11(κb)V(𝒫a){va}\sigma_{\lambda-1}^{-1}(\kappa_{b}^{\ell})\in V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}. But if =1\ell=1, σλ1\sigma_{\lambda-1} violates Proposition 4.6(1) since

σλ11({κa1,κb1})V(Xa1){v1};\displaystyle\sigma_{\lambda-1}^{-1}(\{\kappa_{a}^{1},\kappa_{b}^{1}\})\subset V(X_{a}^{1})\setminus\{v^{1}\};

if 2\ell\geq 2, σλ1\sigma_{\lambda-1} violates Proposition 4.6(4) on layer 1\ell-1 since

σλ11({κa,κb})σλ11(V(𝒦1))(V(𝒫a1)V(𝒫b1)).\displaystyle\sigma_{\lambda-1}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\subset\sigma_{\lambda-1}^{-1}(V(\mathcal{K}^{\ell-1}))\setminus(V(\mathcal{P}_{a}^{\ell-1})\cup V(\mathcal{P}_{b}^{\ell-1})).

See Figure 16 for an illustration.

Refer to caption
Figure 16. Raising a contradiction for Case 4, illustrated under the assumptions σλ1(va)=μ\sigma_{\lambda-1}(v_{a}^{\ell})=\mu^{\prime}, σλ1(μ)NXL(va)V(𝒫a)\sigma_{\lambda}^{-1}(\mu^{\prime})\in N_{X_{L}}(v_{a}^{\ell})\setminus V(\mathcal{P}_{a}^{\ell}), and σλ(σλ11(μ))=κa\sigma_{\lambda}(\sigma_{\lambda-1}^{-1}(\mu^{\prime}))=\kappa_{a}^{\ell}. Since there exists μV(𝒦)\mu\in V(\mathcal{K}^{\ell}) for which σλ11(μ)V(𝒫a)V(𝒫b)\sigma_{\lambda-1}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) and σλ1(va)V(𝒦)\sigma_{\lambda-1}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell}), Lemma 4.7(2) yields σλ11(κb)V(𝒫a){va}\sigma_{\lambda-1}^{-1}(\kappa_{b}^{\ell})\in V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\}. This implies that σλ1\sigma_{\lambda-1} violates Proposition 4.6, regardless of what the value of \ell is.

We conclude that Proposition 4.6(4) cannot have been the property violated by σλ\sigma_{\lambda}. Together with the conclusions of the other three cases, we conclude that σλ\sigma_{\lambda} satisfies all properties of Proposition 4.6, which contradicts σλ\sigma_{\lambda} failing to satisfy at least one of the properties, completing the proof. ∎

We can understand Propositions 4.4 and 4.6 as separating elements of V(YL)V(Y_{L}) so that for any configuration σV(𝒞)\sigma\in V(\mathscr{C}), specific vertices of YLY_{L} can lie only upon specific subgraphs of XLX_{L}. In particular, for any [L]\ell\in[L], it follows from these two results that

σ(V(𝒫a)V(𝒫b){va,vb})V(𝒦){κa,κb}.\displaystyle\sigma(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\setminus\{v_{a}^{\ell},v_{b}^{\ell}\})\subseteq V(\mathcal{K}^{\ell})\cup\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\}.

Proposition 4.6 and Lemma 4.7 together now yield the following result.

Proposition 4.8.

For any σV(𝒞)\sigma\in V(\mathscr{C}), the following two statements hold.

  1. (1)

    If σ1(V(𝒦))V(𝒫a)V(𝒫b)\sigma^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), then

    σ({va,vb})V(𝒦)|σ1({κa,κb})((V(𝒫a){va})(V(𝒫b){vb}))|=1.\displaystyle\sigma(\{v_{a}^{\ell},v_{b}^{\ell}\})\subset V(\mathcal{K}^{\ell})\implies|\sigma^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap((V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\cup(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\}))|=1.
  2. (2)

    If σ1(V(𝒦))V(𝒫a)V(𝒫b)\sigma^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), then

    σ(va)V(𝒦)σ1({κa,κb})(V(𝒫a){va}),\displaystyle\sigma(v_{a}^{\ell})\in V(\mathcal{K}^{\ell})\implies\sigma^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\neq\emptyset,
    σ(vb)V(𝒦)σ1({κa,κb})(V(𝒫b){vb}).\displaystyle\sigma(v_{b}^{\ell})\in V(\mathcal{K}^{\ell})\implies\sigma^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\})\neq\emptyset.

We now prove a third invariant of any configuration in 𝒞\mathscr{C}. Toward this, we begin by introducing the following notion of ordering for elements of V(𝒦)V(\mathcal{K}^{\ell}) in the same partite set, which is illustrated in Figure 17.

Definition 4.9.

For σV(𝒞)\sigma\in V(\mathscr{C}), [L]\ell\in[L], and μ1,μ2V(𝒦)\mu_{1},\mu_{2}\in V(\mathcal{K}^{\ell}) in the same partite set, say that μ1\mu_{1} is left of μ2\mu_{2} on σ\sigma if (exactly) one of the following holds:

  1. (1)

    σ1({μ1,μ2})V(𝒫a)\sigma^{-1}(\{\mu_{1},\mu_{2}\})\subset V(\mathcal{P}_{a}^{\ell}) and d(σ1(μ2),va)<d(σ1(μ1),va)d(\sigma^{-1}(\mu_{2}),v_{a}^{\ell})<d(\sigma^{-1}(\mu_{1}),v_{a}^{\ell}),

  2. (2)

    σ1({μ1,μ2})V(𝒫b)\sigma^{-1}(\{\mu_{1},\mu_{2}\})\subset V(\mathcal{P}_{b}^{\ell}) and d(σ1(μ1),vb)<d(σ1(μ2),vb)d(\sigma^{-1}(\mu_{1}),v_{b}^{\ell})<d(\sigma^{-1}(\mu_{2}),v_{b}^{\ell}),

  3. (3)

    σ1(μ1)V(𝒫a)\sigma^{-1}(\mu_{1})\in V(\mathcal{P}_{a}^{\ell}) and σ1(μ2)V(𝒫b)\sigma^{-1}(\mu_{2})\in V(\mathcal{P}_{b}^{\ell}).

Refer to caption
(a) Definition 4.9(1).
Refer to caption
(b) Definition 4.9(2).
Refer to caption
(c) Definition 4.9(3).
Figure 17. An illustration of Definition 4.9.

Since σs1(V(𝒦))V(𝒫a)V(𝒫b)\sigma_{s}^{-1}(V(\mathcal{K}^{\ell}))\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), it follows from Definition 4.9 that for any μ1,μ2V(𝒦)\mu_{1},\mu_{2}\in V(\mathcal{K}^{\ell}) in the same partite set, either μ1\mu_{1} is left of μ2\mu_{2} on σs\sigma_{s} or μ2\mu_{2} is left of μ1\mu_{1} on σs\sigma_{s}. The following proposition asserts that the left relation established by σs\sigma_{s} cannot change for other σV(𝒞)\sigma\in V(\mathscr{C}).

Proposition 4.10.

Take [L]\ell\in[L] and μ1,μ2V(𝒦)\mu_{1},\mu_{2}\in V(\mathcal{K}^{\ell}) in the same partite set, with μ1\mu_{1} left of μ2\mu_{2} on σs\sigma_{s}. If σV(𝒞)\sigma\in V(\mathscr{C}) is such that σ1({μ1,μ2})V(𝒫a)V(𝒫b)\sigma^{-1}(\{\mu_{1},\mu_{2}\})\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), then μ1\mu_{1} is left of μ2\mu_{2} in σ\sigma.

Proof.

Let Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} with σ0=σs\sigma_{0}=\sigma_{s} and σλ=σ\sigma_{\lambda}=\sigma be a swap sequence in 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}) starting from σs\sigma_{s} and ending at σ\sigma, where σ1({μ1,μ2})V(𝒫a)V(𝒫b)\sigma^{-1}(\{\mu_{1},\mu_{2}\})\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}). By Proposition 4.6(4), any σiΣ\sigma_{i}\in\Sigma satisfies

|σi1(V(𝒦))(V(𝒫a)V(𝒫b))|1,\displaystyle|\sigma_{i}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\leq 1,

so that in particular,

|σi1({μ1,μ2})(V(𝒫a)V(𝒫b))|1.\displaystyle|\sigma_{i}^{-1}(\{\mu_{1},\mu_{2}\})\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\leq 1.

Consider the subsequence Σ={σij}j=0λΣ\Sigma^{\prime}=\{\sigma_{i_{j}}\}_{j=0}^{\lambda^{\prime}}\subseteq\Sigma, λλ\lambda^{\prime}\leq\lambda with i0=0i_{0}=0 and then consisting of all configurations σiΣ\sigma_{i}\in\Sigma for which

|σi11({μ1,μ2})(V(𝒫a)V(𝒫b))|=1 and |σi1({μ1,μ2})(V(𝒫a)V(𝒫b))|=0.\displaystyle|\sigma_{i-1}^{-1}(\{\mu_{1},\mu_{2}\})\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|=1\text{ and }|\sigma_{i}^{-1}(\{\mu_{1},\mu_{2}\})\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|=0.

If μ1\mu_{1} is left of μ2\mu_{2} on σiλ\sigma_{i_{\lambda^{\prime}}}, then μ1\mu_{1} is left of μ2\mu_{2} on σk\sigma_{k} for all kλk\geq\lambda^{\prime}. Indeed, the construction of Σ\Sigma^{\prime} and σ1({μ1,μ2})V(𝒫a)V(𝒫b)\sigma^{-1}(\{\mu_{1},\mu_{2}\})\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) imply that σk1(μ1)\sigma_{k}^{-1}(\mu_{1}) and σk1(μ2)\sigma_{k}^{-1}(\mu_{2}) remain upon the same path subgraphs in XLX_{L} for all such kk, and this claim now follows if μ1\mu_{1} is left of μ2\mu_{2} on σiλ\sigma_{i_{\lambda^{\prime}}} due to Definition 4.9(3) and from {μ1,μ2}E(YL)\{\mu_{1},\mu_{2}\}\notin E(Y_{L}) otherwise. Since λλ\lambda\geq\lambda^{\prime}, it suffices to show that μ1\mu_{1} is left of μ2\mu_{2} on σiλ\sigma_{i_{\lambda^{\prime}}}, toward which we can induct on jj to show that μ1\mu_{1} is left of μ2\mu_{2} on σij\sigma_{i_{j}} for all 0jλ0\leq j\leq\lambda^{\prime}. The statement holds for j=0j=0 by assumption, so assume μ1\mu_{1} is left of μ2\mu_{2} on σij\sigma_{i_{j}} for some 0j<λ0\leq j<\lambda^{\prime}. Take the unique vertex μ{μ1,μ2}\mu\in\{\mu_{1},\mu_{2}\} such that

σij+111(μ)V(𝒫a)V(𝒫b).\displaystyle\sigma_{i_{j+1}-1}^{-1}(\mu)\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

It is now straightforward to inductively argue, relying on the definition of Σ\Sigma^{\prime}, Proposition 4.6(4), and the fact that {μ1,μ2}E(YL)\{\mu_{1},\mu_{2}\}\notin E(Y_{L}), that the other vertex in {μ1,μ2}\{\mu_{1},\mu_{2}\} (i.e., not μ\mu) must remain upon the same path subgraph in XLX_{L} over all configurations σk\sigma_{k} for ijkij+1i_{j}\leq k\leq i_{j+1}. With this observation, it quickly follows, by breaking into cases based on which statement of Definition 4.9 yields μ1\mu_{1} left of μ2\mu_{2} on σij\sigma_{i_{j}} and relying on the fact that {μ1,μ2}E(YL)\{\mu_{1},\mu_{2}\}\notin E(Y_{L}), that μ1\mu_{1} is left of μ2\mu_{2} on σij+1\sigma_{i_{j+1}}. ∎

We are now ready to prove the main result (in conjunction with Proposition 4.10) we will need in order to derive a lower bound on the diameter of 𝒞\mathscr{C}.

Proposition 4.11.

For any configuration σV(𝒞)\sigma\in V(\mathscr{C}) and [L1]\ell\in[L-1],

  1. (1)

    σ1(κa+1)V(𝒫a)V(𝒫b)V(𝒦b)σ(V(𝒫a))\sigma^{-1}(\kappa_{a}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\implies V(\mathcal{K}_{b}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})),

  2. (2)

    σ1(κb+1)V(𝒫a)V(𝒫b)V(𝒦a)σ(V(𝒫a))\sigma^{-1}(\kappa_{b}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\implies V(\mathcal{K}_{a}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})).

Proof.

We will take σV(𝒞)\sigma\in V(\mathscr{C}) and [L1]\ell\in[L-1] such that σ1(κa+1)V(𝒫a)V(𝒫b)\sigma^{-1}(\kappa_{a}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}); proving the latter implication when assuming σ1(κb+1)V(𝒫a)V(𝒫b)\sigma^{-1}(\kappa_{b}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}) can be done analogously. By Proposition 4.6(4) and the assumption on σ1(κa+1)\sigma^{-1}(\kappa_{a}^{\ell+1}),

|σ1(V(𝒦))(V(𝒫a)V(𝒫b))|=1,\displaystyle|\sigma^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|=1,

from which it follows that V(𝒦b)σ(V(𝒫a)V(𝒫b))V(\mathcal{K}_{b}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})). To prove that V(𝒦b)σ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})), let Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} be a swap sequence from σ0=σs\sigma_{0}=\sigma_{s} to σλ=σ\sigma_{\lambda}=\sigma: note that λ1\lambda\geq 1, since

σs1(κa+1)V(𝒫a)V(𝒫b).\displaystyle\sigma_{s}^{-1}(\kappa_{a}^{\ell+1})\in V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Consider the largest ξ<λ\xi<\lambda for which

σξ1(κa+1)V(𝒫a)V(𝒫b),\displaystyle\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1})\in V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}),

noting that ξ<λ\xi<\lambda is well-defined, since σs1(κa+1)V(𝒫a)V(𝒫b)\sigma_{s}^{-1}(\kappa_{a}^{\ell+1})\in V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}). It must be that

(4.20) σξ1(κa+1){va,vb},\displaystyle\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1})\in\{v_{a}^{\ell},v_{b}^{\ell}\}, σξ+11(κa+1)V(𝒫a)V(𝒫b).\displaystyle\sigma_{\xi+1}^{-1}(\kappa_{a}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Since NYL(κa+1)V(𝒦a)=N_{Y_{L}}(\kappa_{a}^{\ell+1})\cap V(\mathcal{K}_{a}^{\ell})=\emptyset, we deduce that

(4.21) σξ1(V(𝒦a){κa+1})V(𝒫a)V(𝒫b),\displaystyle\sigma_{\xi}^{-1}(V(\mathcal{K}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell+1}\})\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}),

as otherwise, we would have that

|σξ+11(V(𝒦a))(V(𝒫a)V(𝒫b))|2,\displaystyle|\sigma_{\xi+1}^{-1}(V(\mathcal{K}_{a}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\geq 2,

contradicting Proposition 4.6(4). From (4.20), (4.21), and Proposition 4.10, we further observe that

(4.22) σξ1(V(𝒦a){κa+1})V(𝒫b),\displaystyle\sigma_{\xi}^{-1}(V(\mathcal{K}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell+1}\})\subset V(\mathcal{P}_{b}^{\ell}),

as for any μV(𝒦a)\mu\in V(\mathcal{K}_{a}^{\ell}), κa+1\kappa_{a}^{\ell+1} is left of μ\mu on σξ\sigma_{\xi} since κa+1\kappa_{a}^{\ell+1} is left of μ\mu on σs\sigma_{s}. By the definition of ξ\xi,

σk1(κa+1)V(𝒫a)V(𝒫b) for k>ξ,\displaystyle\sigma_{k}^{-1}(\kappa_{a}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})\text{ for }k>\xi,

so by Proposition 4.6(4),

(4.23) {κa+1}=σk1(V(𝒦))(V(𝒫a)V(𝒫b)) for k>ξ.\displaystyle\{\kappa_{a}^{\ell+1}\}=\sigma_{k}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))\text{ for }k>\xi.

If V(𝒦b)σξ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell})), (4.23) can be used to inductively prove that

V(𝒦b)σk(V(𝒫a)) for k>ξ,\displaystyle V(\mathcal{K}_{b}^{\ell})\subset\sigma_{k}(V(\mathcal{P}_{a}^{\ell}))\text{ for }k>\xi,

with the induction basis following from (4.20). In particular, V(𝒦b)σ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})), which is the desired statement. Thus, we now proceed under the assumption |V(𝒦b)σξ(V(𝒫a))|1|V(\mathcal{K}_{b}^{\ell})\setminus\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell}))|\geq 1. Further assume (towards a contradiction) that there exists

(4.24) μ(V(𝒦b)σξ(V(𝒫a)))σξ(V(𝒫b)).\displaystyle\mu\in\left(V(\mathcal{K}_{b}^{\ell})\setminus\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell}))\right)\cap\sigma_{\xi}(V(\mathcal{P}_{b}^{\ell})).

Then from (4.22), (4.24), and the fact that the LHS and RHS have equal cardinality,

(4.25) (V(𝒦a){κa+1}){μ}=σξ(V(𝒫b)).\displaystyle(V(\mathcal{K}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell+1}\})\cup\{\mu\}=\sigma_{\xi}(V(\mathcal{P}_{b}^{\ell})).

See (4.20); (4.25) immediately raises a contradiction if σξ1(κa+1)=vb\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1})=v_{b}^{\ell}, and if σξ1(κa+1)=va\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1})=v_{a}^{\ell}, (4.20) and Proposition 4.8(2) (the hypotheses necessary for the implication follow from (4.20) and (4.25)) imply that

σξ(V(𝒫b))=σξ+1(V(𝒫b)) and σξ+11({κa,κb})(V(𝒫b){vb}),\displaystyle\sigma_{\xi}(V(\mathcal{P}_{b}^{\ell}))=\sigma_{\xi+1}(V(\mathcal{P}_{b}^{\ell}))\text{ and }\sigma_{\xi+1}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{b}^{\ell})\setminus\{v_{b}^{\ell}\})\neq\emptyset,

respectively, raising a contradiction on (4.25). Therefore,

(4.26) (V(𝒦b)σξ(V(𝒫a)))σξ(V(𝒫b))=.\displaystyle\left(V(\mathcal{K}_{b}^{\ell})\setminus\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell}))\right)\cap\sigma_{\xi}(V(\mathcal{P}_{b}^{\ell}))=\emptyset.

If it were true that |V(𝒦b)σξ(V(𝒫a))|2|V(\mathcal{K}_{b}^{\ell})\setminus\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell}))|\geq 2, Proposition 4.6(4) would imply

|V(𝒦b)(V(𝒫a)V(𝒫b))|1,\displaystyle|V(\mathcal{K}_{b}^{\ell})\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}))|\leq 1,

so there would exist μV(𝒦b)σξ(V(𝒫a))\mu\in V(\mathcal{K}_{b}^{\ell})\setminus\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell})) such that σξ1(μ)V(𝒫a)V(𝒫b)\sigma_{\xi}^{-1}(\mu)\in V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}), contradicting (4.26). Thus,

|V(𝒦b)σξ(V(𝒫a))|=1.\displaystyle|V(\mathcal{K}_{b}^{\ell})\setminus\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell}))|=1.

Letting μ\mu denote the unique element in this set, it must be that μσξ(V(𝒫b))\mu\notin\sigma_{\xi}(V(\mathcal{P}_{b}^{\ell})) by (4.26), so that

σξ1(μ)σξ1(V(𝒦))(V(𝒫a)V(𝒫b)).\displaystyle\sigma_{\xi}^{-1}(\mu)\in\sigma_{\xi}^{-1}(V(\mathcal{K}^{\ell}))\setminus(V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell})).

It thus follows from Proposition 4.6(4) that σξ+1(σξ1(κa+1))=μ\sigma_{\xi+1}(\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1}))=\mu, so that V(𝒦b)σξ+1(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma_{\xi+1}(V(\mathcal{P}_{a}^{\ell})). Now V(𝒦b)σ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})) can be established by arguing as when we assumed V(𝒦b)σξ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma_{\xi}(V(\mathcal{P}_{a}^{\ell})). ∎

4.3. Extractions

Equipped with the results of Subsection 4.2, we are now ready to establish that there are two configurations in 𝒞\mathscr{C}, one of them being σs\sigma_{s}, with distance eΩ(n)e^{\Omega(n)}. The idea is to construct a series of swaps, layer-by-layer. For [L1]\ell\in[L-1], each iteration on layer +1\ell+1 will require a certain number of iterations in layer \ell. We formalize this in Definition 4.12 by a notion that we refer to as \ell-extractions, which we illustrate in Figure 18.

Definition 4.12.

For [L]\ell\in[L] and σ,τV(𝒞)\sigma,\tau\in V(\mathscr{C}), we say that τ\tau is an \ell-extraction of σ\sigma if either:

  1. (1)

    V(𝒦a)σ(V(𝒫a))V(\mathcal{K}_{a}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})) and V(𝒦a)τ(V(𝒫a))=V(\mathcal{K}_{a}^{\ell})\cap\tau(V(\mathcal{P}_{a}^{\ell}))=\emptyset, V(𝒦b)τ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\tau(V(\mathcal{P}_{a}^{\ell})),

  2. (2)

    V(𝒦b)σ(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\subset\sigma(V(\mathcal{P}_{a}^{\ell})) and V(𝒦b)τ(V(𝒫a))=V(\mathcal{K}_{b}^{\ell})\cap\tau(V(\mathcal{P}_{a}^{\ell}))=\emptyset, V(𝒦a)τ(V(𝒫a))V(\mathcal{K}_{a}^{\ell})\subset\tau(V(\mathcal{P}_{a}^{\ell})).

In other words, if τ\tau is an \ell-extraction of σ\sigma, one of the two partite sets of V(𝒦)V(\mathcal{K}^{\ell}) is a subset of σ(V(𝒫a))\sigma(V(\mathcal{P}_{a}^{\ell})). Then τ\tau “extracts” this partite set out of 𝒫a\mathcal{P}_{a}^{\ell} and replaces it with the other partite set of V(𝒦)V(\mathcal{K}^{\ell}), which is then a subset τ(V(𝒫a))\tau(V(\mathcal{P}_{a}^{\ell})).

Refer to caption
(a) Definition 4.12(1).
Refer to caption
(b) Definition 4.12(2).
Figure 18. An illustration of Definition 4.12. Red subgraphs/vertices corresponding to preimages of V(𝒦a)V(\mathcal{K}_{a}^{\ell}), while blue subgraphs/vertices correspond to preimages of V(𝒦b)V(\mathcal{K}_{b}^{\ell}). By Proposition 4.10, the relative ordering of the vertices in a partite set of V(𝒦)V(\mathcal{K}^{\ell}) is the same as in σs\sigma_{s}, so the appropriate knob vertex always lies upon the leftmost vertex in 𝒫a\mathcal{P}_{a}^{\ell}.

For use in the proof of Proposition 4.14, we also introduce the following definition, corresponding to knob vertices in YLY_{L} rotating about their corresponding cycle subgraphs in XLX_{L}. Recall from Subsection 4.1 that for all [L]\ell\in[L],

|V(𝒮a)|=|V(𝒮b)|=15,\displaystyle|V(\mathcal{S}_{a}^{\ell})|=|V(\mathcal{S}_{b}^{\ell})|=15, |V(𝒞a)|=|V(𝒞b)|=16.\displaystyle|V(\mathcal{C}_{a}^{\ell})|=|V(\mathcal{C}_{b}^{\ell})|=16.
Definition 4.13.

For [L]\ell\in[L], μaNYL(κa)\mu_{a}\in N_{Y_{L}}(\kappa_{a}^{\ell}) and a positive integer λ\lambda such that λ0(mod16)\lambda\equiv 0\pmod{16}, a κa\kappa_{a}^{\ell}-rotation with μa\mu_{a} is a swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} for which σi(V(𝒞a))={μa}V(𝒮a)\sigma_{i}(V(\mathcal{C}_{a}^{\ell}))=\{\mu_{a}\}\cup V(\mathcal{S}_{a}^{\ell}) for all 0iλ0\leq i\leq\lambda and there exists an enumeration V(𝒞a)={v0,v1,,v15}V(\mathcal{C}_{a}^{\ell})=\{v_{0},v_{1},\dots,v_{15}\} such that {vi1,vi}E(𝒞a)\{v_{i-1},v_{i}\}\in E(\mathcal{C}_{a}^{\ell}) for all i[15]i\in[15] and σj(vi)=κa\sigma_{j}(v_{i})=\kappa_{a}^{\ell} whenever ij(mod16)i\equiv j\pmod{16}. Similarly, for [L]\ell\in[L], μbNYL(κb)\mu_{b}\in N_{Y_{L}}(\kappa_{b}^{\ell}) and a positive integer λ\lambda such that λ0(mod16)\lambda\equiv 0\pmod{16}, a κb\kappa_{b}^{\ell}-rotation with μb\mu_{b} is a swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} for which σi(V(𝒞b))={μb}V(𝒮b)\sigma_{i}(V(\mathcal{C}_{b}^{\ell}))=\{\mu_{b}\}\cup V(\mathcal{S}_{b}^{\ell}) for all 0iλ0\leq i\leq\lambda and there exists an enumeration V(𝒞b)={v0,v1,,v15}V(\mathcal{C}_{b}^{\ell})=\{v_{0},v_{1},\dots,v_{15}\} such that {vi1,vi}E(𝒞b)\{v_{i-1},v_{i}\}\in E(\mathcal{C}_{b}^{\ell}) for all i[15]i\in[15] and σj(vi)=κb\sigma_{j}(v_{i})=\kappa_{b}^{\ell} whenever ij(mod16)i\equiv j\pmod{16}.

Note that Definition 4.13, which is illustrated in Figure 19, corresponds to a cyclic rotation of all elements in σ0(V(𝒞a)){κa}\sigma_{0}(V(\mathcal{C}_{a}^{\ell}))\setminus\{\kappa_{a}^{\ell}\} about the knob vertex κa\kappa_{a}^{\ell}, which is fixed in the same position since σ0(v0)=σλ(v0)=κa\sigma_{0}(v_{0})=\sigma_{\lambda}(v_{0})=\kappa_{a}^{\ell}. The direction and length λ\lambda of this rotation depend on the enumeration of the vertices in the relevant cycle and the value λ/16\lambda/16, respectively.

Refer to caption
Figure 19. An illustration of a κa\kappa_{a}^{\ell}-rotation with μa\mu_{a}, where λ=316=48\lambda=3\cdot 16=48 and κa\kappa_{a}^{\ell} rotates counterclockwise around 𝒞a\mathcal{C}_{a}^{\ell}. Here, μ1,μ2V(𝒮a){κa}\mu_{1},\mu_{2}\in V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\}. As κa\kappa_{a}^{\ell} rotates over 𝒞a\mathcal{C}_{a}^{\ell}, it cyclically rotates all elements of (V(𝒮a){κa}){μa}(V(\mathcal{S}_{a}^{\ell})\setminus\{\kappa_{a}^{\ell}\})\cup\{\mu_{a}\} about it. In this case, every such element moves three vertices clockwise along V(𝒞a){v0}V(\mathcal{C}_{a}^{\ell})\setminus\{v_{0}\}.

The final configuration σf\sigma_{f} in 𝒞\mathscr{C} for which we will argue that d(σs,σf)=Ω(nL1)d(\sigma_{s},\sigma_{f})=\Omega(n^{L-1}) is going to be an LL-extraction of σs\sigma_{s} (of the kind from Definition 4.12(1)). We begin by showing that for any [L]\ell\in[L], \ell-extractions of σs\sigma_{s} exist in 𝒞\mathscr{C}. This will follow as an immediate corollary of Proposition 4.14 by taking η=1\eta=1 for this value of \ell, since σλ\sigma_{\lambda} is then an \ell-extraction of σs\sigma_{s}.

Proposition 4.14.

For any positive integer η\eta and [L]\ell\in[L], there exists a swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}, σ0=σs\sigma_{0}=\sigma_{s} with a subsequence {σij}j=0η\{\sigma_{i_{j}}\}_{j=0}^{\eta}, i0=0i_{0}=0, iη=λi_{\eta}=\lambda such that

  1. (1)

    for every j[η]j\in[\eta], σij\sigma_{i_{j}} is an \ell-extraction of σij1\sigma_{i_{j-1}};

  2. (2)

    for every j[η]j\in[\eta] and μV(𝒦L)\mu\in V(\mathcal{K}^{L}), there exists a κaL\kappa_{a}^{L}-rotation with μ\mu and κbL\kappa_{b}^{L}-rotation with μ\mu that is a contiguous subsequence of {σi}i=ij1ij\{\sigma_{i}\}_{i=i_{j-1}}^{i_{j}}.

Proof.

We deviate from our usual practice in Subsections 4.2 and 4.3 of assuming that everything proceeds under the context of some fixed L1L\geq 1, and establish Proposition 4.14 via induction on LL. Specifically, we will show by induction on L1L\geq 1 that for any fixed L1L\geq 1, Proposition 4.14 holds for the graphs XLX_{L} and YLY_{L}. During the induction step, in another deviation from our usual practice, we shall be more explicit about the pairs of graphs and the starting configurations that we reference for sake of clarity.

We begin with the induction basis, L=1L=1. Here, =1\ell=1 is the only value of \ell for which Proposition 4.14 applies. Consider the following sequence of swaps from σs\sigma_{s}: Figure 20 illustrates the first three steps of this procedure.

  1. (1)

    Perform a κb1\kappa_{b}^{1}-rotation with σs(vb1)\sigma_{s}(v_{b}^{1}) to move σs(vb1)\sigma_{s}(v_{b}^{1}) to v1v^{1}.

  2. (2)

    Perform a κa1\kappa_{a}^{1}-rotation with σs(vb1)\sigma_{s}(v_{b}^{1}) to move σs(vb1)\sigma_{s}(v_{b}^{1}) to va1v_{a}^{1}.

  3. (3)

    Swap σs(vb1)\sigma_{s}(v_{b}^{1}) as far left through V(𝒫a1)V(\mathcal{P}_{a}^{1}) as possible, yielding a vertex μV(𝒦a1)\mu\in V(\mathcal{K}_{a}^{1}) upon va1v_{a}^{1}.

  4. (4)

    Perform a κa1\kappa_{a}^{1}-rotation with μ\mu to move μ\mu to v1v^{1}.

  5. (5)

    Perform a κb1\kappa_{b}^{1}-rotation with μ\mu to move μ\mu to vb1v_{b}^{1}.

  6. (6)

    Swap μ\mu as far right through V(𝒫b1)V(\mathcal{P}_{b}^{1}) as possible, producing a vertex in V(𝒦b1)V(\mathcal{K}_{b}^{1}) upon vb1v_{b}^{1}.

It is straightforward to conclude that repeating this algorithm 1515 times (since |V(𝒦a)|=|V(𝒦b)|=15|V(\mathcal{K}_{a}^{\ell})|=|V(\mathcal{K}_{b}^{\ell})|=15) from σs\sigma_{s} (adapted to the mapping upon vb1v_{b}^{1}, then the mapping upon va1v_{a}^{1}, for subsequent iterations) yields a 11-extraction σi1\sigma_{i_{1}} of σs\sigma_{s}, namely of the kind in Definition 4.12(1), since we have

σi1(V(𝒫b1))=σs(V(𝒫a1){va1})=V(𝒦a1),\displaystyle\sigma_{i_{1}}(V(\mathcal{P}_{b}^{1}))=\sigma_{s}(V(\mathcal{P}_{a}^{1})\setminus\{v_{a}^{1}\})=V(\mathcal{K}_{a}^{1}), σi1(V(𝒫a1){va1})=σs(V(𝒫b1))=V(𝒦b1),\displaystyle\sigma_{i_{1}}(V(\mathcal{P}_{a}^{1})\setminus\{v_{a}^{1}\})=\sigma_{s}(V(\mathcal{P}_{b}^{1}))=V(\mathcal{K}_{b}^{1}),

and that for every μV(𝒦1)\mu\in V(\mathcal{K}^{1}), there exists a κa1\kappa_{a}^{1}-rotation with μ\mu and κb1\kappa_{b}^{1}-rotation with μ\mu that is a contiguous subsequence of the resulting swap sequence. It is similarly straightforward to see that we can repeat this algorithm to interchange the positions of V(𝒦a1)V(\mathcal{K}_{a}^{1}) and V(𝒦b1)V(\mathcal{K}_{b}^{1}) arbitrarily many times (i.e., for any positive integer η\eta), with a κa1\kappa_{a}^{1}-rotation with μ\mu and κb1\kappa_{b}^{1}-rotation with μ\mu for every μV(𝒦1)\mu\in V(\mathcal{K}^{1}) executed as a contiguous subsequence of every such interchange. On even iterations of this interchange, we simply switch the roles of V(𝒦a1)V(\mathcal{K}_{a}^{1}) and V(𝒦b1)V(\mathcal{K}_{b}^{1}) in the above algorithm, resulting in 11-extractions of the kind in Definition 4.12(2).

Refer to caption
Figure 20. An illustration of the first half of the sequence of swaps discussed for the induction basis, L=1L=1. Preimages of V(𝒦a1)V(\mathcal{K}_{a}^{1}) are colored green, and preimages of V(𝒦b1)V(\mathcal{K}_{b}^{1}) are colored blue. This segment of the sequence of swaps involves a κb1\kappa_{b}^{1}-rotation with σs(vb1)\sigma_{s}(v_{b}^{1}), a κa1\kappa_{a}^{1}-rotation with σs(vb1)\sigma_{s}(v_{b}^{1}), and a sequence of swaps moving σs(vb1)\sigma_{s}(v_{b}^{1}) left through V(𝒫a)V(\mathcal{P}_{a}^{\ell}). A vertex μaV(𝒦a1)\mu_{a}\in V(\mathcal{K}_{a}^{1}) now lies upon vav_{a}^{\ell}; we can similarly move μa\mu_{a} to the right. Continuing until every vertex of V(𝒦1)V(\mathcal{K}^{1}) is moved in an analogous subroutine yields a 11-extraction σi1\sigma_{i_{1}} of σs\sigma_{s}. We can interchange 𝒦a1\mathcal{K}_{a}^{1} and 𝒦b1\mathcal{K}_{b}^{1} in this way arbitrarily many times.

Now assume Proposition 4.14 holds for some fixed L1L\geq 1 (i.e., for this fixed L1L\geq 1, Proposition 4.14 holds for the graphs XLX_{L} and YLY_{L}). By the induction hypothesis applied on η=31\eta=31 and [L]\ell\in[L], we can extract a swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} in V(𝒞(XL,YL))V(\mathscr{C}(X_{L},Y_{L})) with σ0=σs(XL,YL)\sigma_{0}=\sigma_{s}(X_{L},Y_{L}) and with a subsequence {σij}j=031\{\sigma_{i_{j}}\}_{j=0}^{31} satisfying Proposition 4.14. Now consider XL+1X_{L+1} and YL+1Y_{L+1}, which has corresponding starting configuration σs(XL+1,YL+1)\sigma_{s}(X_{L+1},Y_{L+1}) in the connected component 𝒞(XL+1,YL+1)\mathscr{C}(X_{L+1},Y_{L+1}), which we denote σs\sigma_{s} and 𝒞\mathscr{C}, respectively. In an abuse of notation, for the rest of the present proof we let XLX_{L} denote the first LL layers of XL+1X_{L+1} and YL=YL+1|σs(V(XL))Y_{L}=Y_{L+1}|_{\sigma_{s}(V(X_{L}))}. These subgraphs are isomorphic to the graphs XLX_{L} and YLY_{L} as they were originally defined during their construction in Subsection 4.1, and under these isomorphisms, σs\sigma_{s} restricted to XLX_{L} can be understood to be the same as σs(XL,YL)\sigma_{s}(X_{L},Y_{L}) as defined in Subsection 4.1. Furthermore, the swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} can be understood as being in 𝒞\mathscr{C}, with σ0=σs\sigma_{0}=\sigma_{s}, if we set

σi(v)=σs(v) for all vV(XL+1)V(XL),i=0,,λ.\displaystyle\sigma_{i}(v)=\sigma_{s}(v)\text{ for all }v\in V(X_{L+1})\setminus V(X_{L}),\ i=0,\dots,\lambda.

As such, it follows from the induction hypothesis that Proposition 4.14 holds for (XL+1,YL+1)(X_{L+1},Y_{L+1}) if we take [L]\ell\in[L], and all that remains is to confirm that Proposition 4.14 holds for (XL+1,YL+1)(X_{L+1},Y_{L+1}) for =L+1\ell=L+1. In the proceeding argument, we assume that the swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} we extracted above using the induction hypothesis was for =L\ell=L. In a similar vein, given some σV(𝒞)\sigma\in V(\mathscr{C}) with σ(XL)=V(YL)\sigma(X_{L})=V(Y_{L}) and σi\sigma_{i} from the swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}, define the extension of σi\sigma_{i} with respect to σ\sigma to be the configuration101010Whenever we construct such extensions in the forthcoming argument, it will be clear that they lie in V(𝒞)V(\mathscr{C}). τV(𝖥𝖲(XL+1,YL+1))\tau\in V(\mathsf{FS}(X_{L+1},Y_{L+1})) with

  • τ(v)=σ(v)\tau(v)=\sigma(v) for all vV(XL+1)V(XL)v\in V(X_{L+1})\setminus V(X_{L});

  • τ(v)=σi(v)\tau(v)=\sigma_{i}(v) for all vV(XL)v\in V(X_{L}).

We will say that we extend σi\sigma_{i} with respect to σ\sigma, and will generally apply this notion en masse to subsequences of {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} with respect to a single configuration of V(𝒞)V(\mathscr{C}).

We will now construct a swap sequence {σi}i=0λ\{\sigma^{\prime}_{i}\}_{i=0}^{\lambda^{\prime}} in V(𝒞)V(\mathscr{C}), with σ0=σs\sigma^{\prime}_{0}=\sigma_{s}, satisfying Proposition 4.14 for η=1\eta=1. This is illustrated in Figure 21. From the swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}, with subsequence {σij}j=031\{\sigma_{i_{j}}\}_{j=0}^{31} as discussed before, consider {σi}i=i0i1\{\sigma_{i}\}_{i=i_{0}}^{i_{1}}, which, by the induction hypothesis, has a contiguous subsequence {σi}i=j1k1\{\sigma_{i}\}_{i=j_{1}}^{k_{1}} that is a κbL\kappa_{b}^{L}-rotation with κbL+1\kappa_{b}^{L+1}. Let t1t_{1} be such that j1t1k1j_{1}\leq t_{1}\leq k_{1} and σt1(κbL+1)=vbL,L+1\sigma_{t_{1}}(\kappa_{b}^{L+1})=v_{b}^{L,L+1}: the observation that such a t1t_{1} exists follows quickly from the restrictions of Definition 4.13. We construct a swap sequence 𝒮1\mathscr{S}_{1} in 𝒞\mathscr{C} by merging {σi1,1}i=0t1i0,{σi1,2}i=0z1\{\sigma_{i}^{1,1}\}_{i=0}^{t_{1}-i_{0}},\{\sigma_{i}^{1,2}\}_{i=0}^{z_{1}}, and {σi1,3}i=0i1t1\{\sigma_{i}^{1,3}\}_{i=0}^{i_{1}-t_{1}}, which we now define.

  1. (1)

    Extend {σi}i=i0t1\{\sigma_{i}\}_{i=i_{0}}^{t_{1}} with respect to σs\sigma_{s}, yielding {σi1,1}i=0t1i0\{\sigma_{i}^{1,1}\}_{i=0}^{t_{1}-i_{0}}.

  2. (2)

    Let {σi1,2}i=0z1\{\sigma_{i}^{1,2}\}_{i=0}^{z_{1}}, with σ01,2=σt1i01,1\sigma_{0}^{1,2}=\sigma_{t_{1}-i_{0}}^{1,1}, be a κbL+1\kappa_{b}^{L+1}-rotation with σt1i01,1(vbL+1)\sigma_{t_{1}-i_{0}}^{1,1}(v_{b}^{L+1}), with length such that σt1i01,1(vbL+1)\sigma_{t_{1}-i_{0}}^{1,1}(v_{b}^{L+1}) is moved to vL+1v^{L+1}.

  3. (3)

    Extend {σi}i=t1i1\{\sigma_{i}\}_{i=t_{1}}^{i_{1}} with respect to σt1i01,1\sigma_{t_{1}-i_{0}}^{1,1}, yielding {σi1,3}i=0i1t1\{\sigma_{i}^{1,3}\}_{i=0}^{i_{1}-t_{1}}.

Now take the subsequence {σi}i=i1i2\{\sigma_{i}\}_{i=i_{1}}^{i_{2}} of {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}, which has contiguous subsequence {σi}i=j2k2\{\sigma_{i}\}_{i=j_{2}}^{k_{2}} that is a κaL\kappa_{a}^{L}-rotation with κaL+1\kappa_{a}^{L+1}, and t2t_{2} such that j2t2k2j_{2}\leq t_{2}\leq k_{2} and σt2(κaL+1)=vaL,L+1\sigma_{t_{2}}(\kappa_{a}^{L+1})=v_{a}^{L,L+1}. Construct 𝒮2\mathscr{S}_{2} by merging {σi2,1}i=0t2i1,{σi2,2}i=0t2i1\{\sigma_{i}^{2,1}\}_{i=0}^{t_{2}-i_{1}},\{\sigma_{i}^{2,2}\}_{i=0}^{t_{2}-i_{1}}, and {σi2,3}i=0t2i1\{\sigma_{i}^{2,3}\}_{i=0}^{t_{2}-i_{1}}, which we now define.

  1. (1)

    Extend {σi}i=i1t2\{\sigma_{i}\}_{i=i_{1}}^{t_{2}} with respect to σi1t11,3\sigma^{1,3}_{i_{1}-t_{1}}, yielding {σi2,1}i=0t2i1\{\sigma_{i}^{2,1}\}_{i=0}^{t_{2}-i_{1}}.

  2. (2)

    Let {σi2,2}i=0z2\{\sigma_{i}^{2,2}\}_{i=0}^{z_{2}}, with σ02,2=σt2i12,1\sigma_{0}^{2,2}=\sigma_{t_{2}-i_{1}}^{2,1}, be the result of performing a κaL+1\kappa_{a}^{L+1}-rotation with σs(vbL+1)\sigma_{s}(v_{b}^{L+1}) to move σs(vbL+1)\sigma_{s}(v_{b}^{L+1}) to vaL+1v_{a}^{L+1}, then swapping σs(vbL+1)\sigma_{s}(v_{b}^{L+1}) as far left as possible across V(𝒫aL+1)V(\mathcal{P}_{a}^{L+1}), then performing a κaL+1\kappa_{a}^{L+1}-rotation to swap the resulting vertex μ\mu upon vaL+1v_{a}^{L+1} onto vL+1v^{L+1}.

  3. (3)

    Extend {σi}i=t2i2\{\sigma_{i}\}_{i=t_{2}}^{i_{2}} with respect to σt2i12,1\sigma_{t_{2}-i_{1}}^{2,1}, yielding {σi2,3}i=0i2t2\{\sigma_{i}^{2,3}\}_{i=0}^{i_{2}-t_{2}}.

It is now straightforward to see how to similarly construct the sequences 𝒮1,,𝒮31\mathscr{S}_{1},\dots,\mathscr{S}_{31}, and why we took η=31\eta=31 when appealing to the induction hypothesis: each sequence corresponding to a different vertex in V(𝒦L+1)V(\mathcal{K}^{L+1}) lying on vL+1v^{L+1}, and following 𝒮1\mathscr{S}_{1}, we alternate the path that we “push” this vertex through. The only modification arises when we construct 𝒮31\mathscr{S}_{31}: during the κbL\kappa_{b}^{L}-rotation with κbL+1\kappa_{b}^{L+1} within {σi}i=iη1iη\{\sigma_{i}\}_{i=i_{\eta-1}}^{i_{\eta}}, simply include a κbL+1\kappa_{b}^{L+1}-rotation which moves the vertex upon vL+1v^{L+1} onto vbL+1v_{b}^{L+1}. Merging 𝒮1,,𝒮31\mathscr{S}_{1},\dots,\mathscr{S}_{31} yields a sequence {σi}i=0λ\{\sigma^{\prime}_{i}\}_{i=0}^{\lambda^{\prime}}, such that σλ\sigma^{\prime}_{\lambda^{\prime}} is an (L+1)(L+1)-extraction of σ0=σs\sigma^{\prime}_{0}=\sigma_{s}, since

σλ(V(𝒫bL+1))=σs(V(𝒫aL+1){vaL+1})=V(𝒦aL+1),\displaystyle\sigma^{\prime}_{\lambda^{\prime}}(V(\mathcal{P}_{b}^{L+1}))=\sigma_{s}(V(\mathcal{P}_{a}^{L+1})\setminus\{v_{a}^{L+1}\})=V(\mathcal{K}_{a}^{L+1}),
σλ(V(𝒫aL+1){vaL+1})=σs(V(𝒫bL+1))=V(𝒦bL+1).\displaystyle\sigma^{\prime}_{\lambda^{\prime}}(V(\mathcal{P}_{a}^{L+1})\setminus\{v_{a}^{L+1}\})=\sigma_{s}(V(\mathcal{P}_{b}^{L+1}))=V(\mathcal{K}_{b}^{L+1}).

It is evident by tracing the above construction that for every μV(𝒦L+1)\mu\in V(\mathcal{K}^{L+1}), there exists a κaL+1\kappa_{a}^{L+1}-rotation with μ\mu and κbL+1\kappa_{b}^{L+1}-rotation with μ\mu that is a contiguous subsequence of {σi}i=0λ\{\sigma^{\prime}_{i}\}_{i=0}^{\lambda^{\prime}}. Thus, the swap sequence {σi}i=0λ\{\sigma^{\prime}_{i}\}_{i=0}^{\lambda^{\prime}} establishes that Proposition 4.14 holds for L+1L+1 on η=1\eta=1.

Refer to caption
Figure 21. An illustration of the sequences of swaps defined during the construction of {σi}i=0λ\{\sigma_{i}^{\prime}\}_{i=0}^{\lambda^{\prime}} in the induction step, on L+1L+1 layers. Subgraphs/vertices corresponding to preimages of V(𝒦aL)V(\mathcal{K}_{a}^{L}), V(𝒦bL)V(\mathcal{K}_{b}^{L}), V(𝒦aL+1)V(\mathcal{K}_{a}^{L+1}), and V(𝒦bL+1)V(\mathcal{K}_{b}^{L+1}) are red, blue, gold, and pink, respectively. We specifically depict the construction of the sequence 𝒮2\mathscr{S}_{2}, constructed from the subsequence {σi}i=i1i2\{\sigma_{i}\}_{i=i_{1}}^{i_{2}} of the original sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}. Initially, for σ02,1\sigma_{0}^{2,1}, we have σs(vbL+1)\sigma_{s}(v_{b}^{L+1}) upon vL+1v^{L+1}. At σ02,2\sigma_{0}^{2,2}, we extend a κaL\kappa_{a}^{L}-rotation with κaL+1\kappa_{a}^{L+1} in {σi}i=i1i2\{\sigma_{i}\}_{i=i_{1}}^{i_{2}} (which is guaranteed to exist by the induction hypothesis) so that it includes the following sequence of swaps: a κaL+1\kappa_{a}^{L+1}-rotation with σs(vbL+1)\sigma_{s}(v_{b}^{L+1}), swapping σs(vbL+1)\sigma_{s}(v_{b}^{L+1}) left into 𝒫aL+1\mathcal{P}_{a}^{L+1}, and a κaL+1\kappa_{a}^{L+1}-rotation with the resulting μa\mu_{a} on vaL+1v_{a}^{L+1}. This will result in the configuration σ02,3\sigma_{0}^{2,3}. From σ02,3\sigma_{0}^{2,3} to the final configuration in 𝒮2\mathscr{S}_{2}, we execute the rest of {σi}i=i1i2\{\sigma_{i}\}_{i=i_{1}}^{i_{2}}, leading to an LL-extraction of σ02,1\sigma_{0}^{2,1}. Exhausting the original sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} by proceeding like this will yield an (L+1)(L+1)-extraction of σs\sigma_{s}, and the resulting swap sequence satisfies Proposition 4.14(2).

For general η1\eta\geq 1, we can invoke the induction hypothesis, applied to 31η31\eta, to extract a swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} in V(𝒞(XL,YL))V(\mathscr{C}(X_{L},Y_{L})) with subsequence {σij}j=031η\{\sigma_{i_{j}}\}_{j=0}^{31\eta}. Then we can proceed as in the η=1\eta=1 case for every contiguous subsequence {σk}k=31(i1)31i\{\sigma_{k}\}_{k=31(i-1)}^{31i} in {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}, for i[η]i\in[\eta], to construct a swap sequence {σi}i=0λV(𝒞)\{\sigma^{\prime}_{i}\}_{i=0}^{\lambda^{\prime}}\subset V(\mathscr{C}) which establishes Proposition 4.14 for this value of η\eta. ∎

In the proof of Proposition 4.14, during the induction basis we reached a 11-extraction of σs\sigma_{s} by performing Ω(n)\Omega(n) iterations of an algorithm which executed Ω(n2)\Omega(n^{2}) swaps.111111Note that we inducted on LL in the proof of Proposition 4.14, so the sequence of swaps we found for smaller values of LL would be executed on subgraphs of (XL,YL)(X_{L},Y_{L}) for larger values of LL. However, it is easy to verify, by tracing the construction in Subsection 4.1, that the asymptotic statements here hold regardless of the fixed value of LL that we choose. Then in the inductive step, we reached an (+1)(\ell+1)-extraction of σs\sigma_{s} by taking Ω(n)\Omega(n) \ell-extractions of σs\sigma_{s} and stringing them together by appending some other swap sequences. Altogether, it follows that we found a sequence of Ω(nL+2)\Omega(n^{L+2}) swaps to reach an LL-extraction of σs\sigma_{s} — if this were tight, taking LL to be as large as desired would be enough to answer Question 1.2 in the negative. Motivated by these ideas, we prove Proposition 4.15, which will lend itself to a lower bound on d(σs,σf)d(\sigma_{s},\sigma_{f}).

Proposition 4.15.

Fix integers L2L\geq 2 and [L1]\ell\in[L-1], and take σ,τV(𝒞)\sigma,\tau\in V(\mathscr{C}) such that τ\tau is an (+1)(\ell+1)-extraction of σ\sigma. Any swap sequence {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda} with σ0=σ\sigma_{0}=\sigma and σλ=τ\sigma_{\lambda}=\tau must have a subsequence {σij}j=025\{\sigma_{i_{j}}\}_{j=0}^{25} such that, for j[25]j\in[25], there exists a configuration σ~{σi}i=ij1ij\tilde{\sigma}\in\{\sigma_{i}\}_{i=i_{j-1}}^{i_{j}} that is an \ell-extraction of σij1\sigma_{i_{j-1}}.

Proof.

Assume τ\tau is an (+1)(\ell+1)-extraction of σ\sigma of the kind of Definition 4.12(1). Proposition 4.15 can be proved in the setting where τ\tau is an (+1)(\ell+1)-extraction of σ\sigma of the kind of Definition 4.12(2) entirely analogously, where we switch the roles of several expressions corresponding to the “left and right sides” of the subgraphs X+1X^{\ell+1} and 𝒦+1\mathcal{K}^{\ell+1} in this case.

We will say that Xa+1X_{a}^{\ell+1} is the initial subgraph of any vertex μV(𝒦a+1){κa+2}\mu\in V(\mathcal{K}_{a}^{\ell+1})\setminus\{\kappa_{a}^{\ell+2}\} (μV(𝒦aL)\mu\in V(\mathcal{K}_{a}^{L}) if =L1\ell=L-1) with σ1(μ)V(Xa+1)\sigma^{-1}(\mu)\in V(X_{a}^{\ell+1}). By Proposition 4.6(3), μ\mu leaving the initial subgraph corresponds to an (XL,YL)(X_{L},Y_{L})-friendly swap where μ\mu is upon v+1v^{\ell+1} and swaps onto some vertex in NXL(v+1)V(Xb+1)N_{X_{L}}(v^{\ell+1})\cap V(X_{b}^{\ell+1}). Similarly, Xb+1X_{b}^{\ell+1} is the initial subgraph for μV(𝒦b+1){κb+2}\mu\in V(\mathcal{K}_{b}^{\ell+1})\setminus\{\kappa_{b}^{\ell+2}\} (μV(𝒦bL)\mu\in V(\mathcal{K}_{b}^{L}) if =L1\ell=L-1) with σ1(μ)V(Xb+1)\sigma^{-1}(\mu)\in V(X_{b}^{\ell+1}). By Proposition 4.6(3), μ\mu leaving the initial subgraph corresponds to an (XL,YL)(X_{L},Y_{L})-friendly swap where μ\mu is upon v+1v^{\ell+1} and swaps onto some vertex in NXL(v+1)V(Xa+1)N_{X_{L}}(v^{\ell+1})\cap V(X_{a}^{\ell+1}).

Let Σ={σi}i=0λ\Sigma=\{\sigma_{i}\}_{i=0}^{\lambda} be a swap sequence with σ0=σ\sigma_{0}=\sigma and σλ=τ\sigma_{\lambda}=\tau. It is straightforward to show from Proposition 4.6(4) and Definition 4.12(1) that at least 2626 vertices in V(𝒦+1){κa+2,κb+2}V(\mathcal{K}^{\ell+1})\setminus\{\kappa_{a}^{\ell+2},\kappa_{b}^{\ell+2}\} (V(𝒦L)V(\mathcal{K}^{L}) for =L1\ell=L-1) switch to the “opposite” layer +1\ell+1 subgraph in XLX_{L} over the course of Σ\Sigma. Take any 2626 such vertices {μ1,,μ26}\{\mu_{1},\dots,\mu_{26}\}, indexed in the order that they first leave their initial subgraph during Σ\Sigma (it is clear that at most one such vertex can leave their initial subgraph over a given swap). Construct a subsequence {σij}j=126\{\sigma_{i_{j}}\}_{j=1}^{26} of Σ\Sigma such that, for every j[26]j\in[26], iji_{j} is the smallest index for which

  • σij1(μj)=v+1\sigma_{i_{j}}^{-1}(\mu_{j})=v^{\ell+1};

  • σij+11(μj)\sigma_{i_{j}+1}^{-1}(\mu_{j}) is not a vertex in the initial subgraph of μj\mu_{j}.

Consider any j[26]j\in[26] for which μjV(𝒦a+1)\mu_{j}\in V(\mathcal{K}_{a}^{\ell+1}). The neighborhood of μj\mu_{j} is

NYL(μj)=V(𝒦b+1){κa+1,κb+1}.\displaystyle N_{Y_{L}}(\mu_{j})=V(\mathcal{K}_{b}^{\ell+1})\cup\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\}.

The vertex σij+1(v+1)\sigma_{i_{j}+1}(v^{\ell+1}) that μj\mu_{j} swaps with to reach σij+1\sigma_{i_{j}+1} from σij\sigma_{i_{j}} satisfies

σij+1(v+1){κa+1,κb+1},\displaystyle\sigma_{i_{j}+1}(v^{\ell+1})\in\{\kappa_{a}^{\ell+1},\kappa_{b}^{\ell+1}\},

since σij\sigma_{i_{j}} would violate Proposition 4.6(4) (on layer +1\ell+1) if we had that σij+1(v+1)V(𝒦b+1)\sigma_{i_{j}+1}(v^{\ell+1})\in V(\mathcal{K}_{b}^{\ell+1}). Assume (towards a contradiction) that σij+1(v+1)=κa+1\sigma_{i_{j}+1}(v^{\ell+1})=\kappa_{a}^{\ell+1}, and let 1ξij1\leq\xi\leq i_{j} (the lower bound is since σs1(μj)v+1\sigma_{s}^{-1}(\mu_{j})\neq v^{\ell+1}) be the smallest such index satisfying

(4.27) σξ1(μj)=σij1(μj)=v+1 and σξ1(κa+1)=σij1(κa+1)NXL(v)V(Xb+1).\displaystyle\sigma_{\xi}^{-1}(\mu_{j})=\sigma_{i_{j}}^{-1}(\mu_{j})=v^{\ell+1}\text{ and }\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1})=\sigma_{i_{j}}^{-1}(\kappa_{a}^{\ell+1})\in N_{X_{L}}(v^{\ell})\cap V(X_{b}^{\ell+1}).

Exactly one of the two statements

  • σξ11(κa+1)σξ1(κa+1)\sigma_{\xi-1}^{-1}(\kappa_{a}^{\ell+1})\neq\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1});

  • σξ11(μj)σξ1(μj)\sigma_{\xi-1}^{-1}(\mu_{j})\neq\sigma_{\xi}^{-1}(\mu_{j})

is true; both being false would contradict ξ\xi being the smallest possible, while both being true would contradict iji_{j} being the smallest possible. But σξ11(κa+1)σξ1(κa+1)\sigma_{\xi-1}^{-1}(\kappa_{a}^{\ell+1})\neq\sigma_{\xi}^{-1}(\kappa_{a}^{\ell+1}) would imply that σξ1\sigma_{\xi-1} violates Proposition 4.4 (on 𝒞b+1\mathcal{C}_{b}^{\ell+1}), and σξ11(μj)σξ1(μj)\sigma_{\xi-1}^{-1}(\mu_{j})\neq\sigma_{\xi}^{-1}(\mu_{j}) would imply that σξ1\sigma_{\xi-1} violates Proposition 4.6(4) (on layer \ell if it swaps with κb+1\kappa_{b}^{\ell+1}, and on layer +1\ell+1 if it swaps with a vertex in V(𝒦b+1)V(\mathcal{K}_{b}^{\ell+1})). So for all j[26]j\in[26],

(4.28) μjV(𝒦a+1)σij+1(v+1)=κb+1 and μjV(𝒦b+1)σij+1(v+1)=κa+1,\displaystyle\mu_{j}\in V(\mathcal{K}_{a}^{\ell+1})\implies\sigma_{i_{j}+1}(v^{\ell+1})=\kappa_{b}^{\ell+1}\text{ and }\mu_{j}\in V(\mathcal{K}_{b}^{\ell+1})\implies\sigma_{i_{j}+1}(v^{\ell+1})=\kappa_{a}^{\ell+1},

where the latter claim can be deduced from an entirely analogous argument. See Figure 22 for an illustration.

Refer to caption
(a) Case where μjV(𝒦a+1)\mu_{j}\in V(\mathcal{K}_{a}^{\ell+1}).
Refer to caption
(b) Case where μjV(𝒦b+1)\mu_{j}\in V(\mathcal{K}_{b}^{\ell+1}).
Figure 22. The two possibilities for the configuration σij\sigma_{i_{j}} for any j[26]j\in[26]. Subgraphs/vertices corresponding to preimages of V(𝒦a)V(\mathcal{K}_{a}^{\ell}) and V(𝒦b)V(\mathcal{K}_{b}^{\ell}) are colored red and blue, respectively. The coloring of 𝒫a\mathcal{P}_{a}^{\ell} in both cases follows from Proposition 4.11.

Now consider 2j262\leq j\leq 26 for which μjV(𝒦a+1)\mu_{j}\in V(\mathcal{K}_{a}^{\ell+1}). For such values of jj which have that μjV(𝒦b+1)\mu_{j}\in V(\mathcal{K}_{b}^{\ell+1}), establishing the existence of a configuration σ~{σi}i=ij1ij\tilde{\sigma}\in\{\sigma_{i}\}_{i=i_{j-1}}^{i_{j}} that is an \ell-extraction of σij1\sigma_{i_{j-1}} can be done entirely analogously. By (4.28), σij+1(v+1)=κb+1\sigma_{i_{j}+1}(v^{\ell+1})=\kappa_{b}^{\ell+1}, so certainly

σij1(κb+1)V(𝒫a)V(𝒫b),\displaystyle\sigma_{i_{j}}^{-1}(\kappa_{b}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}),

and since κb+1V(𝒦)\kappa_{b}^{\ell+1}\in V(\mathcal{K}^{\ell}),

σij1(V(𝒦))V(𝒫a)V(𝒫b).\displaystyle\sigma_{i_{j}}^{-1}(V(\mathcal{K}^{\ell}))\not\subset V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

By Proposition 4.11(2),

(4.29) V(𝒦a)σij(V(𝒫a)).\displaystyle V(\mathcal{K}_{a}^{\ell})\subset\sigma_{i_{j}}(V(\mathcal{P}_{a}^{\ell})).

If it were true that V(𝒦b)σij(V(𝒫a))V(\mathcal{K}_{b}^{\ell})\cap\sigma_{i_{j}}(V(\mathcal{P}_{a}^{\ell}))\neq\emptyset, there would exist η1V(𝒦b)\eta_{1}\in V(\mathcal{K}_{b}^{\ell}) satisfying η1σij(V(𝒫a))\eta_{1}\in\sigma_{i_{j}}(V(\mathcal{P}_{a}^{\ell})). Combined with (4.29), we would have

(4.30) V(𝒦a){η1}=σij(V(𝒫a)),\displaystyle V(\mathcal{K}_{a}^{\ell})\cup\{\eta_{1}\}=\sigma_{i_{j}}(V(\mathcal{P}_{a}^{\ell})),

since the LHS is a subset of the RHS and their cardinalities are equal. In particular, it must be that σij(va)V(𝒦)\sigma_{i_{j}}(v_{a}^{\ell})\in V(\mathcal{K}^{\ell}), so Proposition 4.8(2) would imply that

σij1({κa,κb})(V(𝒫a){va}),\displaystyle\sigma_{i_{j}}^{-1}(\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\})\cap(V(\mathcal{P}_{a}^{\ell})\setminus\{v_{a}^{\ell}\})\neq\emptyset,

so there exists η2{κa,κb}\eta_{2}\in\{\kappa_{a}^{\ell},\kappa_{b}^{\ell}\} that is in σij(V(𝒫a))\sigma_{i_{j}}(V(\mathcal{P}_{a}^{\ell})), contradicting (4.30). So it must be that σij1(V(𝒦b))V(𝒫a)=\sigma_{i_{j}}^{-1}(V(\mathcal{K}_{b}^{\ell}))\cap V(\mathcal{P}_{a}^{\ell})=\emptyset, i.e., that

(4.31) V(𝒦b)σij(V(𝒫a))=.\displaystyle V(\mathcal{K}_{b}^{\ell})\cap\sigma_{i_{j}}(V(\mathcal{P}_{a}^{\ell}))=\emptyset.

If μj1V(𝒦b+1)\mu_{j-1}\in V(\mathcal{K}_{b}^{\ell+1}), then (4.28) implies σij1+1(v+1)=κa+1\sigma_{i_{j-1}+1}(v^{\ell+1})=\kappa_{a}^{\ell+1}, and so

σij11(κa+1)V(𝒫a)V(𝒫b).\displaystyle\sigma_{i_{j-1}}^{-1}(\kappa_{a}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Proposition 4.11(1) thus implies

V(𝒦b)σij1(V(𝒫a)).\displaystyle V(\mathcal{K}_{b}^{\ell})\subset\sigma_{i_{j-1}}(V(\mathcal{P}_{a}^{\ell})).

This statement, with (4.29) and (4.31), implies that σij\sigma_{i_{j}} is an \ell-extraction of σij1\sigma_{i_{j-1}}, namely of the Definition 4.12(2) kind. If μj1V(𝒦a+1)\mu_{j-1}\in V(\mathcal{K}_{a}^{\ell+1}), then (4.28) implies σij1+1(v+1)=κb+1\sigma_{i_{j-1}+1}(v^{\ell+1})=\kappa_{b}^{\ell+1}, so

σij11(κb+1)V(𝒫a)V(𝒫b).\displaystyle\sigma_{i_{j-1}}^{-1}(\kappa_{b}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}).

Proposition 4.11(2) now implies that

V(𝒦a)σij1(V(𝒫a)).\displaystyle V(\mathcal{K}_{a}^{\ell})\subset\sigma_{i_{j-1}}(V(\mathcal{P}_{a}^{\ell})).

Since iji_{j} is the smallest possible, σij11(μj)V(Xa+1)\sigma_{i_{j-1}}^{-1}(\mu_{j})\in V(X_{a}^{\ell+1}), and it follows that σij11(μj)V(𝒫a+1)\sigma_{i_{j-1}}^{-1}(\mu_{j})\in V(\mathcal{P}_{a}^{\ell+1}), as σij1\sigma_{i_{j-1}} would otherwise violate Proposition 4.6(4) on layer \ell (due to κb+1\kappa_{b}^{\ell+1} and μj\mu_{j}). It is straightforward to confirm, appealing to Proposition 4.4 on 𝒞a+1\mathcal{C}_{a}^{\ell+1}, that μj\mu_{j} moves to v+1v^{\ell+1} during {σi}i=ij1ij\{\sigma_{i}\}_{i=i_{j-1}}^{i_{j}}, and swaps with κa+1\kappa_{a}^{\ell+1} upon V(𝒞a+1)V(\mathcal{C}_{a}^{\ell+1}) at some point in this swap sequence.121212This can be proved using ideas and arguments which are essentially identical to those that were carried out in Subcase 1.2 of the proof of Proposition 4.6. Thus, there exists a configuration σ~{σi}i=ij1ij\tilde{\sigma}\in\{\sigma_{i}\}_{i=i_{j-1}}^{i_{j}} for which

σ~1({μj,κa+1})V(𝒞a+1) and σ~1(κa+1)va,\displaystyle\tilde{\sigma}^{-1}(\{\mu_{j},\kappa_{a}^{\ell+1}\})\subset V(\mathcal{C}_{a}^{\ell+1})\text{ and }\tilde{\sigma}^{-1}(\kappa_{a}^{\ell+1})\neq v_{a}^{\ell},

from which it immediately follows that

σ~1(κa+1)V(𝒫a)V(𝒫b),\displaystyle\tilde{\sigma}^{-1}(\kappa_{a}^{\ell+1})\notin V(\mathcal{P}_{a}^{\ell})\cup V(\mathcal{P}_{b}^{\ell}),

and Proposition 4.11(1) implies

V(𝒦b)σ~(V(𝒫a)).\displaystyle V(\mathcal{K}_{b}^{\ell})\subset\tilde{\sigma}(V(\mathcal{P}_{a}^{\ell})).

This, with (4.29) and (4.31), implies that σ~\tilde{\sigma} is an \ell-extraction of σij1\sigma_{i_{j-1}}, namely of the Definition 4.12(2) kind.

Therefore, taking {σij}j=126\{\sigma_{i_{j}}\}_{j=1}^{26} yields the desired subsequence of {σi}i=0λ\{\sigma_{i}\}_{i=0}^{\lambda}. ∎

4.4. Proof of Theorem 1.3

We finally derive the desired lower bound on the diameter of 𝒞\mathscr{C}.

See 1.3

Proof.

For L2L\geq 2, take XLX_{L}, YLY_{L} on 58L+258L+2 vertices (see Subsection 4.1). For [L1]\ell\in[L-1], define

(4.32) λ(L,n)():=min{d(σ,τ):σ,τV(𝒞),τ is an -extraction of σ}.\displaystyle\lambda_{(L,n)}(\ell)\mathrel{\mathop{\ordinarycolon}}=\min\left\{d(\sigma,\tau)\mathrel{\mathop{\ordinarycolon}}\sigma,\tau\in V(\mathscr{C}),\tau\text{ is an }\ell\text{-extraction of }\sigma\right\}.

It follows from Proposition 4.15 that for all [L1]\ell\in[L-1],

(4.33) λ(L,n)(+1)25λ(L,n)().\displaystyle\lambda_{(L,n)}(\ell+1)\geq 25\lambda_{(L,n)}(\ell).

Let σfV(𝒞)\sigma_{f}\in V(\mathscr{C}) be such that σf\sigma_{f} is an LL-extraction of σs\sigma_{s}, which exists by Proposition 4.14. By (4.32) and (4.33),

d(σs,σf)λ(L,n)(L)25λ(L,n)(L1)25L1λ(L,n)(1)25L1.\displaystyle d(\sigma_{s},\sigma_{f})\geq\lambda_{(L,n)}(L)\geq 25\lambda_{(L,n)}(L-1)\geq\dots\geq 25^{L-1}\lambda_{(L,n)}(1)\geq 25^{L-1}.

Now, for n60n\geq 60, fix L=(n2)/58L=\lfloor(n-2)/58\rfloor (here, L1L\geq 1), and construct nn-vertex graphs X~n\tilde{X}_{n}, Y~n\tilde{Y}_{n} by adding n=n(58L+2)n^{\prime}=n-(58L+2) isolated vertices to XLX_{L} and YLY_{L}, respectively. Let 𝒞(X~n\mathscr{C}(\tilde{X}_{n}, Y~n)\tilde{Y}_{n}) denote the connected component of 𝖥𝖲(X~n\mathsf{FS}(\tilde{X}_{n}, Y~n)\tilde{Y}_{n}) containing the configuration resulting from placing V(Y~n)V(\tilde{Y}_{n}) upon V(X~n)V(\tilde{X}_{n}) as usual (i.e., under the starting configuration as defined in Subsection 4.1), and then placing the nnn-n^{\prime} isolated vertices in Y~n\tilde{Y}_{n} upon the nnn-n^{\prime} isolated vertices of X~n\tilde{X}_{n} in some way. It easily follows from our construction that 58L+2n58L+5858L+2\leq n\leq 58L+58, so

d(σs,σf)25L1=eΩ(n).\displaystyle d(\sigma_{s},\sigma_{f})\geq 25^{L-1}=e^{\Omega(n)}.

By accounting for the values 2n592\leq n\leq 59, which may weaken the constant implicit in the Ω(n)\Omega(n) term, the desired result now follows immediately. ∎

To conclude Section 4, we mention an especially notable implication of Theorem 1.3 in the study of random walks on friends-and-strangers graphs. In the proceeding discussion, to avoid distracting from the nature of this article, we elect to be terse and do not define many of the objects we consider; we refer the reader to [Lov93] for a thorough treatment of random walks on graphs. We begin by providing the following Definition 4.16. The fact that there is a natural discrete-time Markov chain associated to a friends-and-strangers graph was observed in passing in [ADK23, Section 7], and an investigation of its mixing properties was separately proposed in [Alo21].

Definition 4.16.

Let XX and YY be nn-vertex graphs. The friends-and-strangers Markov chain of XX and YY is the discrete-time Markov chain whose state space is V(𝖥𝖲(X,Y))V(\mathsf{FS}(X,Y)) and such that at each time step, a pair of friends standing upon adjacent vertices is chosen uniformly at random amongst all such pairs and swap places with probability 1/21/2.

The friends-and-strangers Markov chain of XX and KnK_{n}, which models a lazy random walk on a connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y), is aperiodic by construction (recall from Proposition 2.3(2) that friends-and-strangers graphs are bipartite, which warrants the laziness condition in Definition 4.16 if we would like to discuss mixing to stationarity) and irreducible when restricted to a connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y). From a different perspective, Definition 4.16 may be interpreted as the generalization of a natural discrete variant of the interchange process131313The interchange process is usually posed as a continuous-time stochastic process by assigning, to the edges of XX, independent point processes on the positive half-line, and transposing the particles upon the vertices incident to a given edge at the points of its corresponding process. One can certainly adapt Definition 4.16 to accommodate for such differences in the presentation of the model. (sometimes called the random stirring process) where we include the condition that arbitrary pairs of particles may be forbidden from swapping positions. The friends-and-strangers Markov chain has received substantial attention under certain restricted settings (chiefly that in which Y=KnY=K_{n}), and classical polynomial upper bounds on the mixing time (in total variation distance) of the underlying Markov chain are known; see [Ald83, AD86, DS81, DS93, Jon12, Mat88, Wil04]. An immediate corollary of Theorem 1.3, which might be thought of as its natural stochastic analogue (especially in light of the polynomial upper bounds that we derived in Section 3), is the following.

Corollary 4.17.

For all n2n\geq 2, there exist nn-vertex graphs XX and YY for which there exists a connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) such that the friends-and-strangers Markov chain of XX and YY, when restricted to this component, has mixing time (in total variation distance) which is eΩ(n)e^{\Omega(n)}.

In other words, for this variant of the interchange process in which we may further forbid certain pairs of particles from swapping places with each other, it is possible to fix restrictions between particles in such a way that the mixing time of the underlying Markov chain is exponential in the size of the graph on which the process occurs. This is in stark contrast to the aforementioned polynomial upper bounds regarding rapidly mixing Markov chains.

5. Open Questions and Future Directions

Theorem 1.3 of this paper proves that diameters of connected components of friends-and-strangers graphs may grow exponentially in the size of their input graphs. There are many other interesting questions concerning distance and diameter that remain unresolved by this article.

5.1. Other Choices of Fixed Graphs

In Section 3, we fixed XX to be from a particular class of graphs, and derived bounds on the maximal diameter of a connected component 𝖥𝖲(X,Y)\mathsf{FS}(X,Y). Of course, we could pursue similar inquiries for other choices of XX. One natural choice would be to take X=StarnX=\textsf{{Star}}_{n}. It is known (see [Bin+23]) that the diameter of any component of 𝖥𝖲(Starn,Kn)\mathsf{FS}(\textsf{{Star}}_{n},K_{n}) is at most 32n+O(1)\frac{3}{2}n+O(1), but to our knowledge, there are no known bounds on the maximum diameter for general YY. We also remark that it may be possible to extract a bound on the maximum diameter of a component of 𝖥𝖲(Starn,Y)\mathsf{FS}(\textsf{{Star}}_{n},Y) for biconnected graphs YY by tracing the arguments in [Wil74].

5.2. Improvements

For much of our discussion in Subsection 3.3, we were primarily interested in showing that the maximum diameter of a connected component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) was polynomially bounded (in the sense of Question 1.2), rather than achieving tight asymptotic statements. It would be desirable to improve these results, toward which we pose the following conjectures. We mention that generalizing the lattice-theoretic methods of [Pro21], which are of a very different flavor than the arguments presented here, might lead to the resolution of Conjecture 5.2. We also note that if Conjecture 5.1 were settled, then tracing the proof of Corollary 3.10 would immediately lead to an O(n3)O(n^{3}) bound on the number of double-flips needed to go between double-flip equivalent acyclic orientations α\alpha and α′′\alpha^{\prime\prime} on an nn-vertex graph, sharpening Corollary 3.10.

Conjecture 5.1.

The maximum diameter of a connected component of 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y) is O(n2)O(n^{2}).

Conjecture 5.2.

For an nn-vertex graph GG and two acyclic orientations α,α′′Acyc(G)\alpha,\alpha^{\prime\prime}\in\textsf{{Acyc}}(G) that are double-flip equivalent, it is possible to go from α\alpha to α′′\alpha^{\prime\prime} in O(n2)O(n^{2}) double-flips.

In another direction, Theorem 1.3 states that for all n2n\geq 2, there exist nn-vertex XX and YY such that the maximum diameter of a connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is eΩ(n)e^{\Omega(n)}. It is unclear how close this is to the truth. As a first step, we pose the following problem.

Question 5.3.

For nn-vertex graphs XX and YY, does there exist a nontrivial upper bound (in terms of nn) on the maximum diameter of a component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y)?

We briefly clarify what we mean by a nontrivial upper bound in Question 5.3. Let 𝒟(n)\mathcal{D}(n) denote the maximum possible diameter of a connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) when XX and YY are nn-vertex graphs. By Theorem 1.3 for the lower bound and Stirling’s approximation applied to n!n! for the upper bound, we observe that

eΩ(n)=𝒟(n)e(1o(1))nlogn.\displaystyle e^{\Omega(n)}=\mathcal{D}(n)\leq e^{\left(1-o(1)\right)n\log n}.

Thus, our understanding of 𝒟(n)\mathcal{D}(n) is tight up to a logarithmic factor in the exponent. Any improvement over this naive upper bound, or confirmation that this upper bound is essentially the truth, would be highly desirable. In particular, we ask the following more precise question. Indeed, given the preceding discussion, Question 5.4 is the natural next target.

Question 5.4.

Is it true that 𝒟(n)=eO(n)\mathcal{D}(n)=e^{O(n)}?

We propose one final problem in this subsection which we would especially like to see resolved.

Problem 5.5.

Find a shorter (perhaps via non-constructive141414Certainly, a proof of Theorem 1.3 using non-constructive techniques would be a novel contribution. In another direction, recall that the central idea behind Section 4 was to construct an exponentially increasing recursive sequence of swaps in such a way that executing this sequence of swaps is necessary in order to reach one configuration from another. A constructive proof which either proceeds via a similar paradigm with a construction that is more amenable to analysis or leverages different ideas altogether would also be of interest. means) proof of Theorem 1.3.

5.3. Connected Friends-and-Strangers Graphs

The proof of Theorem 1.3 relied heavily on characterizing all vertices of 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}) in the same connected component of σs\sigma_{s}. It is thus natural to ask Question 1.2 in the setting where 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is assumed to be connected, which was separately raised by Defant and Kravitz.

Question 5.6 ([DK21, Subsection 7.3]).

Does there exist an absolute constant C>0C>0 such that for all nn-vertex graphs XX and YY with 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) connected, it holds that diam(𝖥𝖲(X,Y))\textup{diam}(\mathsf{FS}(X,Y)) is O(nC)O(n^{C})?

If Question 5.6 holds in the negative, then settling it will likely require very different techniques and paradigms than those which were developed in this article. Indeed, the proof of the negative result for Question 1.2 relies heavily on “rigging” the configurations that lie in a particular connected component of 𝖥𝖲(XL,YL)\mathsf{FS}(X_{L},Y_{L}), which allows us to argue that two particular configurations (namely, σs\sigma_{s} and σf\sigma_{f}) are necessarily very far apart. Such a strategy is not applicable if we require 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) to be connected. Additionally, by Proposition 2.3(3), we can assume (without loss of generality) that XX is biconnected under this setting, and that either XX or YY has no cut vertices. Theorem 3.7 already gives a positive result for Cyclen\textsf{{Cycle}}_{n}, the “simplest” biconnected graph (e.g., the nn-vertex cycle has the smallest Betti number amongst all nn-vertex biconnected graphs: see [Whi31, Theorem 19], which might lend itself to an inductive argument) and for KnK_{n}, the most “complicated” (KnK_{n} has the largest Betti number amongst all nn-vertex biconnected graphs). Furthermore, the constructions XLX_{L} and YLY_{L} contain cut vertices which hold central roles in the proofs of the intermediate propositions (namely, vertices on the paths 𝒫a,𝒫b\mathcal{P}_{a}^{\ell},\mathcal{P}_{b}^{\ell} for XLX_{L}, and the knob vertices κa,κb\kappa_{a}^{\ell},\kappa_{b}^{\ell} in YLY_{L}).

In another direction, a negative answer to Question 5.6 implies the existence of long paths in the connected graph 𝖥𝖲(X,Y)\mathsf{FS}(X,Y). The following result shows that the extreme end of this is not possible.

Proposition 5.7.

For n4n\geq 4, 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is not isomorphic to a tree on n!n! vertices (e.g., Pathn!\textsf{{Path}}_{n!}) or a tree on n!n! vertices with one edge appended (e.g., Cyclen!\textsf{{Cycle}}_{n!}).

Proof.

The number of edges of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is |E(X)||E(Y)|(n2)!|E(X)|\cdot|E(Y)|\cdot(n-2)!, while this is n!1n!-1 and n!n! for a tree on n!n! vertices and a tree with one edge appended on n!n! vertices, respectively. Notice that |E(X)||E(Y)|(n2)!|E(X)|\cdot|E(Y)|\cdot(n-2)! is divisible by 22 while n!1n!-1 is not, so 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) cannot be isomorphic to a tree on n!n! vertices. Assume 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is isomorphic to a tree with an edge appended to it, so |E(X)||E(Y)|(n2)!=n!|E(X)|\cdot|E(Y)|\cdot(n-2)!=n!, or |E(X)||E(Y)|=n(n1)|E(X)|\cdot|E(Y)|=n(n-1). Then XX and YY must both be connected, so that (without loss of generality) |E(X)|=n|E(X)|=n and |E(Y)|=n1|E(Y)|=n-1, so YY is a tree. Due to Proposition 2.3(3), XX is biconnected, so necessarily X=CyclenX=\textsf{{Cycle}}_{n}. But |E(Y¯)|=(n2)(n1)|E(\overline{Y})|=\binom{n}{2}-(n-1), contradicting Theorem 2.6, which gives |E(Y¯)|n1|E(\overline{Y})|\leq n-1. ∎

5.4. Probabilistic Problems

In a different direction, we may study notions of distance in friends-and-strangers graphs when we take XX and YY to be random graphs. We propose the following problem; we leave the meaning of “small diameter” up to interpretation.

Problem 5.8.

Let XX and YY be independently-chosen random graphs from 𝒢(n,p)\mathcal{G}(n,p). Find conditions on pp (in terms of nn) which guarantee that every connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) has small diameter with high probability.

We also restate a problem of this kind proposed by [ADK23].

Problem 5.9 ([ADK23, Problem 7.9]).

Obtain estimates (in terms of nn and pp) for the expectation of the maximum diameter of a connected component in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) when XX and YY are independently-chosen random graphs from 𝒢(n,p)\mathcal{G}(n,p).

In a manner analogous to how we fixed one of the two graphs XX and YY in Section 3 and studied the resulting variant of Question 1.2 before addressing the more global question, it may be insightful to first fix (without loss of generality) XX to be a particular kind of graph and study the variants of Problems 5.8 and 5.9 which only take YY to be drawn from 𝒢(n,p)\mathcal{G}(n,p). The graphs we studied in Section 3 (complete graphs, paths, and cycles) may also serve as natural starting points here.

5.5. Complexity

As the literature on the token swapping problem suggests, computing exact distances between two configurations in 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) and the maximum diameter of a component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y), under mild assumptions on XX and YY, seems to be intractable. We might thus study distances and diameters in friends-and-strangers graphs from the perspective of complexity theory. We discuss one possible direction of study along these lines here. We start by introducing a decision problem which encapsulates finding the shortest swap sequence between two configurations.

Definition 5.10.

In an instance of the distance problem, we are given graphs XX and YY on nn vertices, configurations σ,τV(𝖥𝖲(X,Y))\sigma,\tau\in V(\mathsf{FS}(X,Y)), and a positive integer KK, and want to know if d(σ,τ)Kd(\sigma,\tau)\leq K.

This problem has been studied in many restricted contexts. If we proceed under the assumption that Y=KnY=K_{n}, the distance problem is known to be PSPACE-complete [Jer85], APX-hard [Mil+16], and W[1]W[1]-hard when parametrized by the shortest number of swaps [BMR18]. Furthermore, it is NP-hard when we impose certain additional restrictions, such as when we take XX to be a tree and Y=KnY=K_{n} [Aic+22]. It might be fruitful to study the complexity of this problem at the level of generality proposed by Definition 5.10.

A natural follow-up to Definition 5.10 is to ask for worst-case distances between two configurations, which corresponds to the maximum diameter of a component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y).

Definition 5.11.

In an instance of the diameter problem, we are given graphs XX and YY on nn vertices and a positive integer KK, and want to know if the maximum diameter of a component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is at most KK.

The literature suggests that the diameter problem has not been studied as thoroughly as the distance problem, even when assuming that one of the two graphs is complete. Towards bridging this gap, we pose the following primitive question and problem.

Question 5.12.

Is the diameter problem in EXPSPACE? If so, is it EXPSPACE-complete? What changes if we fix Y=KnY=K_{n}?

Problem 5.13.

Find assumptions on XX, YY, and KK which guarantee that the diameter problem (under these assumptions) is in PSPACE.

Notably, even simpler decision problems than those proposed in Definitions 5.10 and 5.11 seem to be poorly understood. For instance, we may consider the following decision problems.

Definition 5.14.

In an instance of the component problem, we are given nn-vertex graphs XX and YY and configurations σ,τV(𝖥𝖲(X,Y))\sigma,\tau\in V(\mathsf{FS}(X,Y)), and want to know if σ\sigma and τ\tau lie in the same connected component of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y).

Definition 5.15.

In an instance of the connectivity problem, we are given nn-vertex graphs XX and YY, and want to know if 𝖥𝖲(X,Y)\mathsf{FS}(X,Y) is connected.

We may think of the component problem and the connectivity problem as, respectively, the simplest instances of the distance problem (asking whether d(σ,τ)d(\sigma,\tau) is finite) and of the diameter problem (asking whether the diameter of 𝖥𝖲(X,Y)\mathsf{FS}(X,Y), when we do not restrict to connected components, is finite). To our knowledge, an understanding of the complexity of the component problem and the connectivity problem remains open, though the results in [Ala+23] address these problems when studying friends-and-strangers graphs with multiplicities, as elaborated in [Mil23].

In a different direction, [AK89, VP95, Yam+15] independently found 22-approximation algorithms for determining the distance between two configurations in 𝖥𝖲(X,Kn)\mathsf{FS}(X,K_{n}) when XX is a tree. Recall from the proof of Proposition 3.2 that for any σ,τ𝖥𝖲(Pathn,Y)\sigma,\tau\in\mathsf{FS}(\textsf{{Path}}_{n},Y) in the same connected component, d(σ,τ)=inv(σ,τ)d(\sigma,\tau)=\textup{inv}(\sigma,\tau), and an algorithm which exactly computes the distance between any two configurations in 𝖥𝖲(Pathn,Y)\mathsf{FS}(\textsf{{Path}}_{n},Y) is one which, starting from σ\sigma, reverses a τ\tau-inversion at every step. These two observations naturally suggest the following problem, which one can also pursue by replacing Cyclen\textsf{{Cycle}}_{n} with a different fixed graph.

Problem 5.16.

Find, under the most general assumptions on YY possible, an O(1)O(1)-approximation algorithm for computing the distance between two configurations in 𝖥𝖲(Cyclen,Y)\mathsf{FS}(\textsf{{Cycle}}_{n},Y).

Acknowledgments

This research was initiated at the University of Minnesota Duluth REU and was supported, in part, by NSF-DMS grant 1949884 and NSA Grant H98230-20-1-0009. I would like to thank Professor Joseph Gallian for organizing the Duluth REU and for giving me a chance to participate in his 2021 program, and I am deeply grateful to Colin Defant and Noah Kravitz for suggesting these problems to me and for many helpful conversations over the course of that summer. In particular, I thank Colin Defant for carefully reading through a draft of this article, providing many productive comments on the manuscript. I also thank Yelena Mandelshtam for reviewing a draft of this work and providing constructive feedback. Finally, I would like to sincerely thank the two anonymous referees for thoroughly reviewing an earlier draft of this work, catching many typos and grammatical mistakes, and providing many detailed comments and suggestions which have vastly improved this article’s clarity. Notably, I am immensely grateful to the anonymous referee who proposed a simple way to sharpen the main result of this paper.

References

References

  • [AD86] David Aldous and Persi Diaconis “Shuffling cards and stopping times” In The American Mathematical Monthly 93.5 Taylor & Francis, 1986, pp. 333–348
  • [ADK23] Noga Alon, Colin Defant and Noah Kravitz “Typical and extremal aspects of friends-and-strangers graphs” In Journal of Combinatorial Theory, Series B 158.1 Elsevier, 2023, pp. 3–42
  • [AF02] David Aldous and Jim Fill “Reversible Markov chains and random walks on graphs” Berkeley, 2002 URL: https://www.stat.berkeley.edu/~aldous/RWG/book.html
  • [Aic+22] Oswin Aichholzer, Erik D Demaine, Matias Korman, Jayson Lynch, Anna Lubiw, Zuzana Masárová, Mikhail Rudoy, Virginia Vassilevska Williams and Nicole Wein “Hardness of token swapping on trees” In 30th Annual European Symposium on Algorithms, 2022, pp. 33:1–33:15
  • [AK13] Gil Alon and Gady Kozma “The probability of long cycles in interchange processes” In Duke Mathematical Journal 162.9 Duke University Press, 2013, pp. 1567–1585
  • [AK89] Sheldon B. Akers and Balakrishnan Krishnamurthy “A group-theoretic model for symmetric interconnection networks” In IEEE Transactions on Computers 38.4 IEEE, 1989, pp. 555–566
  • [Ala+23] Robert M. Alaniz, Josh Brunner, Michael Coulombe, Erik D. Demaine, Jenny Diomidova, Ryan Knobel, Timothy Gomez, Elise Grizzell, Jayson Lynch, Andrew Rodriguez, Robert Schweller and Tim Wylie “Complexity of Reconfiguration in Surface Chemical Reaction Networks”, 2023 arXiv:2303.15556 [cs.CC]
  • [Ald83] David Aldous “Random walks on finite groups and rapidly mixing Markov chains” In Séminaire de Probabilités XVII 1981/82: Proceedings, 1983, pp. 243–297 Springer
  • [Alo21] Noga Alon “Random friends walking on random graphs”, 2021 Talk given at the Oxford Discrete MathsProbability Seminar URL: https://www.youtube.com/watch?v=uheP2ctJx10
  • [Ang03] Omer Angel “Random infinite permutations and the cyclic time random walk” In Discrete Mathematics & Theoretical Computer Science Episciences. org, 2003
  • [Ban22] Kiril Bangachev “On the asymmetric generalizations of two extremal questions on friends-and-strangers graphs” In European Journal of Combinatorics 104 Elsevier, 2022, pp. 103529
  • [BD06] Nathanaël Berestycki and Rick Durrett “A phase transition in the random transposition random walk” In Probability theory and related fields 136 Springer, 2006, pp. 203–233
  • [BE78] Béla Bollobás and Stephen E Eldridge “Packings of graphs and applications to computational complexity” In Journal of Combinatorial Theory, Series B 25.2 Elsevier, 1978, pp. 105–124
  • [Bin+23] Ahmad Biniaz, Kshitij Jain, Anna Lubiw, Zuzana Masárová, Tillmann Miltzow, Debajyoti Mondal, Anurag Murty Naredla, Josef Tkadlec and Alexi Turcotte “Token Swapping on Trees” In Discrete Mathematics & Theoretical Computer Science 24.2 Episciences.org, 2023
  • [BK23] Florestan Brunck and Matthew Kwan “Books, Hallways and Social Butterflies: A Note on Sliding Block Puzzles”, 2023 arXiv:2303.09459 [math.CO]
  • [BMR18] Édouard Bonnet, Tillmann Miltzow and Paweł Rzkażewski “Complexity of token swapping and its variants” In Algorithmica 80 Springer, 2018, pp. 2656–2682
  • [BR99] Christopher L Barrett and Christian M Reidys “Elements of a theory of computer simulation I: sequential CA over random graphs” In Applied Mathematics and Computation 98.2-3 Elsevier, 1999, pp. 241–259
  • [Bra+19] Tatiana Brailovskaya, Gokul Gowri, Sean Yu and Erik Winfree “Reversible computation using swap reactions on a surface” In DNA Computing and Molecular Programming: 25th International Conference, DNA 25, Seattle, WA, USA, August 5–9, 2019, Proceedings 25, 2019, pp. 174–196 Springer
  • [Cay49] Arthur Cayley “LXXVII. Note on the theory of permutations” In The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 34.232 Taylor & Francis, 1849, pp. 527–529
  • [Che10] Beifang Chen “Orientations, lattice polytopes, and group arrangements I: Chromatic and tension polynomials of graphs” In Annals of Combinatorics 13 Springer, 2010, pp. 425–452
  • [CLR10] Pietro Caputo, Thomas Liggett and Thomas Richthammer “Proof of Aldous’ spectral gap conjecture” In Journal of the American Mathematical Society 23.3, 2010, pp. 831–851
  • [Def+22] Colin Defant, David Dong, Alan Lee and Michelle Wei “Connectedness and Cycle Spaces of Friends-and-Strangers Graphs”, 2022 arXiv:2209.01704 [math.CO]
  • [DK21] Colin Defant and Noah Kravitz “Friends and strangers walking on graphs” In Combinatorial Theory, 1 (0), 2021
  • [DMR16] Mike Develin, Matthew Macauley and Victor Reiner “Toric partial orders” In Transactions of the American Mathematical Society 368.4, 2016, pp. 2263–2287
  • [DR18] Erik D Demaine and Mikhail Rudoy “A simple proof that the (n21)(n^{2}-1)-puzzle is hard” In Theoretical Computer Science 732 Elsevier, 2018, pp. 80–84
  • [DS81] Persi Diaconis and Mehrdad Shahshahani “Generating a random permutation with random transpositions” In Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 57.2 Springer, 1981, pp. 159–179
  • [DS93] Persi Diaconis and Laurent Saloff-Coste “Comparison techniques for random walk on finite groups” In The Annals of Probability JSTOR, 1993, pp. 2131–2156
  • [ES23] Dor Elboim and Allan Sly “Infinite cycles in the interchange process in five dimensions”, 2023 arXiv:2211.17023 [math.PR]
  • [Ham15] Alan Hammond “Sharp phase transition in the random stirring model on trees” In Probability Theory and Related Fields 161 Springer, 2015, pp. 429–448
  • [HS21] Jonathan Hermon and Justin Salez “The interchange process on high-dimensional products” In Annals of Applied Probability 31.1, 2021
  • [Jeo22] Ryan Jeong “On Structural Aspects of Friends-And-Strangers Graphs” To appear in the Annals of Combinatorics, 2022 arXiv:2203.10337 [math.CO]
  • [Jeo23] Ryan Jeong “Bipartite Friends and Strangers Walking on Bipartite Graphs”, 2023 arXiv:2309.03848 [math.CO]
  • [Jer85] Mark R Jerrum “The complexity of finding minimum-length generator sequences” In Theoretical Computer Science 36 Elsevier, 1985, pp. 265–289
  • [Jon12] Johan Jonasson “Mixing times for the interchange process” In ALEA 9.2, 2012, pp. 667–683
  • [Kim16] Dohan Kim “Sorting on graphs by adjacent swaps using permutation groups” In Computer Science Review 22 Elsevier, 2016, pp. 89–105
  • [KMS84] D. Kornhauser, G. Miller and P. Spirakis “Coordinating Pebble Motion On Graphs, The Diameter Of Permutation Groups, And Applications” In 25th Annual Symposium on Foundations of Computer Science, 1984., 1984, pp. 241–250
  • [KO09] Daniela Kühn and Deryk Osthus “The minimum degree threshold for perfect graph packings” In Combinatorica 29.1 Springer, 2009, pp. 65–107
  • [KSY19] Jun Kawahara, Toshiki Saitoh and Ryo Yoshinaka “The time complexity of permutation routing via matching, token swapping and a variant” In Journal of Graph Algorithms and Applications 23.1, 2019, pp. 29–70
  • [Lee22] Alan Lee “Connectedness in Friends-and-Strangers Graphs of Spiders and Complements”, 2022 arXiv:2210.04768 [math.CO]
  • [Lov93] L Lovász “Random walks on graphs: a survey” In J. Bolyai Math. Soc. 2, 1993, pp. 1–46
  • [Mat88] Peter Matthews “A strong uniform time for random transpositions” In Journal of Theoretical Probability 1 Springer, 1988, pp. 411–423
  • [Mil+16] Tillmann Miltzow, Lothar Narins, Yoshio Okamoto, Günter Rote, Antonis Thomas and Takeaki Uno “Approximation and Hardness of Token Swapping” In 24th Annual European Symposium on Algorithms 57, 2016, pp. 66:1–66:15
  • [Mil23] Aleksa Milojevic “Connectivity of Old and New Models of Friends-and-Strangers Graphs”, 2023 arXiv:2210.03864 [math.CO]
  • [MM11] Matthew Macauley and Henning S Mortveit “Posets from Admissible Coxeter Sequences” In The Electronic Journal of Combinatorics 18.1, 2011, pp. P197
  • [Naa00] Michael Naatz “The graph of linear extensions revisited” In SIAM Journal on Discrete Mathematics 13.3 SIAM, 2000, pp. 354–369
  • [Par15] Ian Parberry “Solving the (n21)(n^{2}-1)-Puzzle with 83n\frac{8}{3}n Expected Moves” In Algorithms 8.3 MDPI, 2015, pp. 459–465
  • [Pre86] Oliver Pretzel “On reorienting graphs by pushing down maximal vertices” In Order 3.2 Springer, 1986, pp. 135–153
  • [Pro21] James Propp “Lattice structure for orientations of graphs”, 2021 arXiv:math/0209005 [math.CO]
  • [PV90] Frederick J Portier and Theresa P Vaughan “Whitney numbers of the second kind for the star poset” In European Journal of Combinatorics 11.3 Elsevier, 1990, pp. 277–288
  • [Rei98] CM Reidys “Acyclic orientations of random graphs” In Advances in Applied Mathematics 21.2 Elsevier, 1998, pp. 181–192
  • [Sch05] Oded Schramm “Compositions of random transpositions” In Israel Journal of Mathematics 147 Springer, 2005, pp. 221–243
  • [Spe09] David Speyer “Powers of Coxeter elements in infinite groups are reduced” In Proceedings of the American Mathematical Society 137.4, 2009, pp. 1295–1302
  • [SS78] Norbert Sauer and Joel Spencer “Edge disjoint placement of graphs” In Journal of Combinatorial Theory, Series B 25.3 Academic Press, 1978, pp. 295–302
  • [Sta08] Richard P Stanley “An equivalence relation on the symmetric group and multiplicity-free flag h-vectors” In Journal of Combinatorics 0.0, 2008
  • [van+16] Anke van Zuylen, James Bieron, Frans Schalekamp and Gexin Yu “A tight upper bound on the number of cyclically adjacent transpositions to sort a permutation” In Information Processing Letters 116.11, 2016, pp. 718–722
  • [VP95] Theresa P Vaughan and Frederick J Portier “An algorithm for the factorization of permutations on a tree” In Journal of Combinatorial Mathematics and Combinatorial Computing 18, 1995, pp. 11–31
  • [WC23] Lanchao Wang and Yaojun Chen “Connectivity of friends-and-strangers graphs on random pairs” In Discrete Mathematics 346.3 Elsevier, 2023, pp. 113266
  • [Whi31] Hassler Whitney “Non-separable and planar graphs” In Proceedings of the National Academy of Sciences 17.2 National Acad Sciences, 1931, pp. 125–127
  • [Wil04] David Bruce Wilson “Mixing times of lozenge tiling and card shuffling Markov chains” In The Annals of Applied Probability 14.1 Institute of Mathematical Statistics, 2004, pp. 274–325
  • [Wil74] Richard M Wilson “Graph puzzles, homotopy, and the alternating group” In Journal of Combinatorial Theory, Series B 16.1 Elsevier, 1974, pp. 86–96
  • [WLC23] Lanchao Wang, Junying Lu and Yaojun Chen “Connectedness of friends-and-strangers graphs of complete bipartite graphs and others” In Discrete Mathematics 346.8, 2023, pp. 113499
  • [Yam+15] Katsuhisa Yamanaka, Erik D Demaine, Takehiro Ito, Jun Kawahara, Masashi Kiyomi, Yoshio Okamoto, Toshiki Saitoh, Akira Suzuki, Kei Uchizawa and Takeaki Uno “Swapping labeled tokens on graphs” In Theoretical Computer Science 586 Elsevier, 2015, pp. 81–94
  • [Yan11] Chao Yang “Sliding puzzles and rotating puzzles on graphs” In Discrete Mathematics 311.14 Elsevier, 2011, pp. 1290–1294
  • [Yap88] HP Yap “Packing of graphs-a survey” In Annals of Discrete Mathematics 38 Elsevier, 1988, pp. 395–404
  • [Yus07] Raphael Yuster “Combinatorial and computational aspects of graph packing and graph decomposition” In Computer Science Review 1.1 Elsevier, 2007, pp. 12–26
  • [Zhu23] Honglin Zhu “Evacuating ”O”- and ”Y”-shaped houses on fire: the connectivity of friends-and-strangers graphs on complete multipartite graphs”, 2023 arXiv:2307.08121 [math.CO]