This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Filling the gap between Turán’s theorem and Pósa’s conjecture

Peter Allen Julia Böttcher  and  Jan Hladký
(Date: July 26, 2025)
Abstract.

Much of extremal graph theory has concentrated either on finding very small subgraphs of a large graph (Turán-type results) or on finding spanning subgraphs (Dirac-type results). In this paper we are interested in finding intermediate-sized subgraphs. We investigate minimum degree conditions under which a graph GG contains squared paths and squared cycles of arbitrary specified lengths. We determine precise thresholds, assuming that the order of GG is large. This extends results of Fan and Kierstead [J. Combin. Theory Ser. B 63 (1995), 55–64] and of Komlós, Sarközy, and Szemerédi [Random Structures Algorithms 9 (1996), 193–211] concerning the containment of a spanning squared path and a spanning squared cycle, respectively. Our results show that such minimum degree conditions constitute not merely an interpolation between the corresponding Turán-type and Dirac-type results, but exhibit other interesting phenomena.

* DIMAP and Mathematics Institute, University of Warwick, Coventry, CV4 7AL, United Kingdom. E-mail: P.D.Allen@warwick.ac.uk
†Zentrum Mathematik, Technische Universität München, Boltzmannstraße 3, D-85747 Garching bei München, Germany. E-mail: boettche@ma.tum.de
‡Department of Applied Mathematics, Faculty of Mathematics and Physics, Charles University, Malostranské náměstí 25, 118 00, Prague, Czech Republic and DIMAP and Department of Computer Science, University of Warwick, Coventry, CV4 7AL, United Kingdom. E-mail: honzahladky@gmail.com
PA was partially supported by DIMAP, EPSRC award EP/D063191/1, JB by DFG grant TA 309/2-1, JH by the Charles University grant GAUK 202-10/258009, by DAAD, by BAYHOST, and by DIMAP, EPSRC award EP/D063191/1.

1. Introduction

One of the main programmes of extremal graph theory is the study of conditions on the vertex degrees of a host graph GG under which a target graph HH appears as a subgraph of GG (which we denote by HGH\subseteq G). Turán’s theorem [21] is a prominent example for results of this type. It asserts that an average degree d(G)>r2r1nd(G)>\frac{r-2}{r-1}n forces the copy of a complete graph KrK_{r} in GG (and that this is best possible), where here and throughout nn is the number of vertices in the host graph GG. More generally, the celebrated theorem of Erdős and Stone [5] implies that for a fixed graph HH the chromatic number χ(H)\chi(H) of HH determines the average degree that is necessary to guarantee a copy of HH: If HH has chromatic number χ(H)=r\chi(H)=r and d(G)(r2r1+o(1))nd(G)\geq(\frac{r-2}{r-1}+o(1))n, then HH is a subgraph of GG. This settles the problem for fixed target graphs (with chromatic number at least 33), that is, graphs that are ‘small’ compared to the host graph.

Dirac’s theorem [4], another classical result from the area, considers target graphs that are of the same order as the host graph, i.e., so-called spanning target graphs. Clearly, any average degree condition on the host graph that enforces a connected spanning subgraph must be trivial, and hence the average degree needs a suitable replacement in this setting. Here, the minimum degree is a natural candidate, and indeed, Dirac’s theorem asserts that every graph GG with minimum degree δ(G)>12n\delta(G)>\frac{1}{2}n has a Hamilton cycle. This implies in particular that GG has a matching covering 2n/22\lfloor n/2\rfloor vertices.

A 33-chromatic version of this matching result follows from a theorem by Corrádi and Hajnal [3]: the minimum degree condition δ(G)2n/3\delta(G)\geq 2\lfloor n/3\rfloor implies the existence of a so-called spanning triangle factor in GG, that is, a collection of n/3\lfloor n/3\rfloor vertex disjoint triangles. A well-known conjecture of Pósa (see, e.g., [6]) asserts that roughly the same minimum degree actually guarantees the existence of a connected super-graph of a spanning triangle factor. It states that any graph GG with δ(G)23n\delta(G)\geq\frac{2}{3}n contains a spanning squared cycle Cn2C^{2}_{n}, where the square of a graph, F2F^{2}, is obtained from FF by adding edges between all pairs of vertices with distance 22 in FF. This can be seen as a 33-chromatic analogue of Dirac’s theorem, which turned out to be much more difficult than its 22-chromatic cousin.

Fan and Kierstead [7] proved an approximate version of Pósa’s conjecture for large nn. In addition they determined a sufficient and best possible minimum degree condition for the case that the squared cycle in Pósa’s conjecture is replaced by a squared path Pn2P^{2}_{n}, i.e., the square of a spanning path PnP_{n}.

Theorem 1 (Fan & Kierstead [8]).

If GG is a graph on nn vertices with minimum degree δ(G)(2n1)/3\delta(G)\geq(2n-1)/3, then GG contains a spanning squared path Pn2P_{n}^{2}.

The Pósa Conjecture was verified for large values of nn by Komlós, Sarközy, and Szemerédi [10]. The proof in [10] actually asserts the following stronger result, which guarantees not only spanning squared cycles but additionally squared cycles of all lengths between 33 and nn that are divisible by 33.

Theorem 2 (Komlós, Sárközy & Szemerédi [10]).

There exists an integer n0n_{0} such that for all integers n>n0n>n_{0} any graph GG of order nn and minimum degree δ(G)23n\delta(G)\geq\frac{2}{3}n contains all squared cycles C32GC^{2}_{3\ell}\subseteq G with 33n3\leq 3\ell\leq n. If furthermore K4GK_{4}\subseteq G, then C2GC^{2}_{\ell}\subseteq G for any 3n3\leq\ell\leq n with 5\ell\neq 5.

For squared cycles C2C^{2}_{\ell} with \ell not divisible by 33 the additional condition K4GK_{4}\subseteq G is necessary because these target graphs are not 33-colourable and hence a complete 33-partite graph shows that one cannot hope to force C2C^{2}_{\ell} unless δ(G)(2n+1)/3\delta(G)\geq(2n+1)/3. If δ(G)(2n+1)/3\delta(G)\geq(2n+1)/3, on the other hand, then Turán’s Theorem asserts that GG contains a copy of K4K_{4} and hence Theorem 2 implies C2GC^{2}_{\ell}\subseteq G for any 3n3\leq\ell\leq n with l5l\neq 5. The case =5\ell=5 has to be excluded because C52C^{2}_{5} is the 55-chromatic K5K_{5}.

In this paper we address the question of what happens between these two extrema of target graphs with constant order and order nn. We are interested in essentially best possible minimum degree conditions that enforce subgraphs covering a certain percentage of the host graph.

Let us start with a simple example. It is easy to see that every graph GG with minimum degree δ(G)δ\delta(G)\geq\delta for 0δ12n0\leq\delta\leq\frac{1}{2}n has a matching covering at least 2δ2\delta vertices (see Proposition 12()). This gives a linear dependence between the forced size of a matching in the host graph and its minimum degree. A more general form of the result of Corrádi and Hajnal [3] mentioned earlier is a variant of this linear dependence for triangle factors.

Theorem 3 (Corrádi & Hajnal [3]).

Let GG be a graph on nn vertices with minimum degree δ(G)=δ[12n,23n]\delta(G)=\delta\in[\frac{1}{2}n,\frac{2}{3}n]. Then GG contains 2δn2\delta-n vertex disjoint triangles.

The main theorem of this paper is a corresponding result mediating between Turán’s theorem and Pósa’s conjecture. More precisely, our aim is to provide exact minimum degree thresholds for the appearance of a squared path P2P_{\ell}^{2} and a squared cycle C2C_{\ell}^{2}.

There are at least two reasonable guesses one might make as to what minimum degree δ(G)=δ\delta(G)=\delta will guarantee which length =(n,δ)\ell=\ell(n,\delta) of squared path (or longest squared cycle). On the one hand, the degree threshold for a spanning squared path or cycle and for a spanning triangle factor are approximately the same. So perhaps this remains true for smaller \ell: in light of Theorem 3 one could expect that (n,δ)\ell(n,\delta) were roughly 3(2δ(G)n)3(2\delta(G)-n). This turns out to be far too optimistic.

On the other hand, proofs of preceding results dealing with spanning subgraphs essentially combine greedy techniques with local changes. They simply start to construct the desired subgraph in (almost) any location, and in the event of getting stuck change only a few of the vertices embedded so far; at no time do they scrap an entire half-constructed object and start anew. It would not be unreasonable to believe that this technique also leads to best possible minimum degree conditions for large but not spanning subgraphs. Clearly, in the case of (unsquared) paths such a greedy strategy provides a path of length δ(G)+1\delta(G)+1. As GG might be disconnected, however, it cannot guarantee longer paths if δ(G)<n/2\delta(G)<n/2. For squared paths the following construction shows that with an arbitrary starting location one cannot hope for squared paths on more than 32(2δ(G)n)\frac{3}{2}(2\delta(G)-n) vertices: If GG contains disjoint cliques CC and CC^{\prime} of orders 2δn2\delta-n and nδn-\delta, and an independent set II of order nδn-\delta such that all vertices of CC and CC^{\prime} are connected to all vertices of II but not to other vertices of GG, then it is not difficult to see that the longest squared path in GG starting in an edge of CC has length 32(2δ(G)n)\frac{3}{2}(2\delta(G)-n). This could lead to the idea that (n,δ)\ell(n,\delta) were approximately 32(2δ(G)n)\frac{3}{2}(2\delta(G)-n). It is true that there are squared paths of this length in GG—but this lower bound is almost always excessively pessimistic. In other words, it turns out that one has to carefully choose the ‘region’ of GG to look for the desired squared path. Since spanning squared paths use all vertices of GG this problem does not occur for these subgraphs.

For fixed nn both guesses propose a linear dependence between δ\delta and the length (n,δ)\ell(n,\delta) of a forced squared path (or cycle). As we will see below (n,δ)\ell(n,\delta) as a function of δ\delta behaves very differently: it is piece-wise linear but jumps at certain points. (These jumps can be viewed as phase transitions for the appearance of squared paths or cycles.) To make this precise we introduce the following functions. Given two positive integers nn and δ\delta with δ(12n,n1]\delta\in(\frac{1}{2}n,n-1], we define rp(n,δ)r_{p}(n,\delta) to be the largest integer rr such that nδ+δ/r>δn-\delta+\lfloor\delta/r\rfloor>\delta and rc(n,δ)r_{c}(n,\delta) to be the largest integer rr such that nδ+δ/r>δn-\delta+\lceil\delta/r\rceil>\delta. We then define

(1) sp(n,δ):=min{32δ/rp(n,δ)+12,n},andsc(n,δ):=min{32δ/rc(n,δ),n}.\begin{split}\operatorname{sp}(n,\delta)&:=\min\Big{\{}\,\Big{\lceil}\tfrac{3}{2}\lceil\delta/r_{p}(n,\delta)\rceil+\tfrac{1}{2}\Big{\rceil},\,n\,\Big{\}}\,,\quad\text{and}\\ \operatorname{sc}(n,\delta)&:=\min\Big{\{}\,\Big{\lfloor}\tfrac{3}{2}\lceil\delta/r_{c}(n,\delta)\rceil\,\Big{\rfloor},\,n\,\Big{\}}\,.\end{split}

Observe that sc(n,δ)sp(n,δ)\operatorname{sc}(n,\delta)\leq\operatorname{sp}(n,\delta) and that for almost every α(0,1)\alpha\in(0,1) we have limnsc(n,αn)/n=limnsp(n,αn)/n\lim_{n\to\infty}\operatorname{sc}(n,\alpha n)/n=\lim_{n\to\infty}\operatorname{sp}(n,\alpha n)/n. The dependence between sp(n,δ)\operatorname{sp}(n,\delta) and δ\delta is illustrated in Figure 1.

\psfrag{gamma}{\hskip 19.91692pt$\delta$}\psfrag{3gamma-1.5}{\hskip-8.53581pt\scalebox{0.9}{$\frac{3}{2}(2\delta-n)$}}\psfrag{4gamma-2}{\hskip 0.0pt\scalebox{0.9}{\ $4\delta-2n$}}\psfrag{6gamma-3}{\hskip 0.0pt\scalebox{0.9}{\ $6\delta-3n$}}\psfrag{sp(gamma)}{\hskip 0.0pt\scalebox{0.9}{$\operatorname{sp}(n,\delta)$}}\psfrag{ 1.2}{}\psfrag{sp}{}\psfrag{ 1}{\scalebox{0.9}{$n$}}\psfrag{ 0.8}{\scalebox{0.9}{$\frac{4n}{5}$}}\psfrag{ 0.6}{\scalebox{0.9}{$\frac{3n}{5}$}}\psfrag{ 0.4}{\scalebox{0.9}{$\frac{2n}{5}$}}\psfrag{ 0.2}{\scalebox{0.9}{$\frac{n}{5}$}}\psfrag{ 0}{\scalebox{0.9}{$0$}}\psfrag{ 0.5}{\scalebox{0.9}{$\frac{n}{2}$}}\psfrag{ 6/11}{\scalebox{0.9}{$\frac{6n}{11}$}}\psfrag{ 5/9}{\scalebox{0.9}{$\frac{5n}{9}$}}\psfrag{ 4/7}{\scalebox{0.9}{$\frac{4n}{7}$}}\psfrag{ 2/3}{\scalebox{0.9}{$\frac{2n}{3}$}}\includegraphics[height=227.62204pt,width=355.65944pt]{sqp_plot.eps}
Figure 1. The behaviour of sp(n,δ)\operatorname{sp}(n,\delta).

Our main theorem now states states that sp(n,δ)\operatorname{sp}(n,\delta) and sc(n,δ)\operatorname{sc}(n,\delta) are the maximal lengths of squared paths and cycles, respectively, forced in an nn-vertex graph GG with minimum degree δ\delta. More generally, and in accordance with Theorem 2, we show that GG also contains any shorter squared cycle with length divisible by 33 (see () of Theorem 4). We shall show below that these results are tight by explicitly constructing extremal graphs Gp(n,δ)G_{p}(n,\delta) and Gc(n,δ)G_{c}(n,\delta) for squared paths and cycles. While the extremal graphs of all previously discussed results are Turán graphs (complete rr-partite graphs, where r=3r=3 in the case of squared paths and cycles) the graphs Gp(n,δ)G_{p}(n,\delta) and Gc(n,δ)G_{c}(n,\delta) have a rather different structure. In fact they do contain squared cycles C2C^{2}_{\ell} for all 3sc(n,δ)3\leq\ell\leq\operatorname{sc}(n,\delta) with 5\ell\neq 5. If any one of these ‘extra’ squared cycles with chromatic number 44 is not present in the host graph GG, then () of Theorem 4 guarantees even much longer squared cycles C2C^{2}_{\ell} in GG, where \ell is a multiple of 33.

Theorem 4.

For any ν>0\nu>0 there exists an integer n0n_{0} such that for all integers n>n0n>n_{0} and δ[(12+ν)n,23n]\delta\in[(\frac{1}{2}+\nu)n,\frac{2}{3}n] the following holds for all nn-vertex graphs GG with minimum degree δ(G)δ\delta(G)\geq\delta.

  1. ()

    Psp(n,δ)2GP^{2}_{\operatorname{sp}(n,\delta)}\subseteq G and C2GC^{2}_{\ell}\subseteq G for every \ell\in\mathbb{N} with 3sc(n,δ)3\leq\ell\leq\operatorname{sc}(n,\delta) such that 33 divides \ell.

  2. ()

    Either C2GC^{2}_{\ell}\subseteq G for every \ell\in\mathbb{N} with 3sc(n,δ)3\leq\ell\leq\operatorname{sc}(n,\delta) and 5\ell\neq 5, or C2GC^{2}_{\ell}\subseteq G for every \ell\in\mathbb{N} with 36δ3nνn3\leq\ell\leq 6\delta-3n-\nu n such that 33 divides \ell.

The proof of this result relies on Szemerédi’s Regularity Lemma111We refer to [14] for a survey on applications of the Regularity Lemma on graph embedding problems. and is presented together with the main lemmas in Section 2. Theorem 4 cannot be extended to all values of δ(G)\delta(G) with δ(G)12n=o(n)\delta(G)-\frac{1}{2}n=o(n) because for infinitely many values of mm there are C4C_{4}-free graphs FF on mm vertices with δ(F)12m\delta(F)\geq\frac{1}{2}\sqrt{m} (see [18]). Then, letting GG be the nn-vertex graph obtained from FF by adding an independent set II on m12mm-\lfloor\frac{1}{2}\sqrt{m}\rfloor vertices and inserting all edges between FF and II, it is easy to see that δ(G)>12n+15n\delta(G)>\frac{1}{2}n+\frac{1}{5}\sqrt{n} but GG does not contain a copy of C62C^{2}_{6}.

The following extremal graphs show that the bounds in () and () of Theorem 4 are tight (see also Figure 2). For () consider the complete tripartite graph Knδ,nδ,2δnK_{n-\delta,n-\delta,2\delta-n}. Clearly, this graph has minimum degree δ\delta and does not contain C2C^{2}_{\ell} for any 3\ell\geq 3 not divisible by 33 or 3(2δn)\ell\geq 3(2\delta-n). For the first part of (), let Gp(n,δ)G_{p}(n,\delta) be the nn-vertex graph obtained from the disjoint union of an independent set YY on nδn-\delta vertices and r:=rp(n,δ)r:=r_{p}(n,\delta) cliques X1,,XrX_{1},\ldots,X_{r} with |X1||Xr||X1|+1|X_{1}|\leq\dots\leq|X_{r}|\leq|X_{1}|+1 on a total of δ\delta vertices, by inserting all edges between YY and XiX_{i} for each i[r]i\in[r]. It is easy to check that δ(Gp(n,δ))=δ\delta(G_{p}(n,\delta))=\delta. Moreover any squared path Pm2Gp(n,δ)P^{2}_{m}\subseteq G_{p}(n,\delta) contains vertices from at most one clique XiX_{i}. As YY is independent and Pm2P^{2}_{m} has independence number m/3\lceil m/3\rceil we have 2m/3δ/rp(n,δ)\lfloor 2m/3\rfloor\leq\lceil\delta/r_{p}(n,\delta)\rceil and thus m12(3δ/rp(n,δ)+1)=sp(n,δ)m\leq\lfloor\frac{1}{2}(3\lceil\delta/r_{p}(n,\delta)\rceil+1)\rfloor=\operatorname{sp}(n,\delta). For the second part of (), we construct the graph Gc(n,δ)G^{\prime}_{c}(n,\delta) in the same way as Gp(n,δ)G_{p}(n,\delta) but with r:=rc(n,δ)r:=r_{c}(n,\delta) and with |Xi|=δ/r|X_{i}|=\lceil\delta/r\rceil for all i[r]i\in[r]. To obtain an nn-vertex graph Gc(n,δ)G_{c}(n,\delta) from Gc(n,δ)G^{\prime}_{c}(n,\delta) choose viv_{i} in XiX_{i} arbitrarily for each i[r]i\in[r] and identify all viv_{i} with irδ/rδi\leq r\lceil\delta/r\rceil-\delta. Again Gc(n,δ)G_{c}(n,\delta) has minimum degree δ\delta, any squared cycle Cm2C^{2}_{m} in Gc(n,δ)G_{c}(n,\delta) touches only one of the XiX_{i}, and hence msc(n,δ)m\leq\operatorname{sc}(n,\delta).

\psfrag{n-d}{\scalebox{0.8}{$n-\delta$}}\psfrag{Gp}{$G_{p}(n,\delta)$}\psfrag{Gc}{$G_{c}(n,\delta)$}\psfrag{2d-n}{\scalebox{0.8}{$2\delta-n$}}\psfrag{kttt}{$K_{n-\delta,n-\delta,2\delta-n}$}\includegraphics[width=433.62pt]{extreme.eps}
Figure 2. The extremal graphs, for the case rp(n,δ)=rc(n,δ)=4r_{p}(n,\delta)=r_{c}(n,\delta)=4.

Before closing this introduction let us remark that similar phenomena to those described in Theorem 4 are observed with simple paths and cycles. Every graph with minimum degree δ\delta contains a path of length n/n/(δ+1)\lceil n/\lfloor n/(\delta+1)\rfloor\rceil, and the extremal graph is a vertex disjoint union of cliques. This follows from an easy adjustment of the proof of Dirac’s theorem. Improving on results of Nikiforov and Schelp [17] the first author proved the following theorem in [1]. The methods used for obtaining this result are quite different from those applied in this paper. In particular they do not rely on the Regularity Lemma.

Theorem 5 (Allen [1]).

Given an integer k2k\geq 2 there is n0n_{0} such that whenever nn0n\geq n_{0} and GG is an nn-vertex graph with minimum degree δn/k\delta\geq n/k, the following are true.

  1. ()

    GG contains CtC_{t} for every even 4tn/(k1)4\leq t\leq\lceil n/(k-1)\rceil,

  2. ()

    if GG does not contain a cycle of every length from 2n/δ1\lfloor 2n/\delta\rfloor-1 to n/(k1)\lceil n/(k-1)\rceil inclusive then GG does contain CtC_{t} for every even 4t2δ4\leq t\leq 2\delta.

2. Main lemmas and proof of Theorem 4

Our proof of Theorem 4 combines the Stability Method pioneered by Simonovits [19], the Regularity Method which pivots around the joint application of Szemerédi’s celebrated Regularity Lemma [20], and the so-called Blow-up Lemma by Komlós, Sárközy and Szemerédi [11]. The combination of these three methods has proved useful for a variety of exact embedding results and was applied for example in [10]. However, this well-established technique provides only a rather loose framework for proofs of this kind. For our application we will embellish this framework with a new concept, which we call the connected triangle components of a graph.

In this section we explain how we use connected triangle components, the Regularity Method, and the Stability Method. We first provide the necessary definitions, formulate our main lemmas (whose proofs are provided in the remaining sections of this paper), and sketch how they work together in the proof of Theorem 4. The details of this proof are then presented at the end of this section.

Notation.

For a graph GG we write V(G)V(G) and E(G)E(G) to denote its vertex set and edge set, respectively, and set v(G)=|V(G)|v(G)=|V(G)|, e(G)=|E(G)|e(G)=|E(G)| and e(X,Y)=|{xyE(G):xX,yY}|e(X,Y)=|\{xy\in E(G):x\in X,y\in Y\}| for sets X,YV(G)X,Y\subseteq V(G). The graph G[X]G[X] is the subgraph of GG induced by XX. The neighbourhood of a vertex vv in GG is denoted by Γ(v)\Gamma(v) and Γ(u,v)\Gamma(u,v) is the common neighbourhood of u,vV(G)u,v\in V(G). For an edge uv=eE(G)uv=e\in E(G) we also write Γ(e)=Γ(u,v)\Gamma(e)=\Gamma(u,v). The minimum degree of GG is denoted by δ(G)\delta(G) and for two sets X,YV(G)X,Y\subseteq V(G) we define δY(X)=minxX|Γ(x)Y|\delta_{Y}(X)=\min_{x\in X}|\Gamma(x)\cap Y| and δG(X)=δV(G)(X)\delta_{G}(X)=\delta_{V(G)}(X).

When we say that a statement 𝖲(ϵ,ϵ)\mathsf{S}(\epsilon,\epsilon^{\prime}) holds for positive real numbers εε\varepsilon\gg\varepsilon^{\prime}, then we mean that, given an arbitrary ε>0\varepsilon>0, we can find an ϵ′′>0\epsilon^{\prime\prime}>0 such that 𝖲(ϵ,ϵ)\mathsf{S}(\epsilon,\epsilon^{\prime}) holds for all ϵ(0,ϵ′′]\epsilon^{\prime}\in(0,\epsilon^{\prime\prime}].

Connected triangle components and triangle factors.

Connected triangle components and connected triangle factors are the main protagonists in the proof of Theorem 4. Roughly speaking, in a connected triangle component we can start in an arbitrary triangle and reach each other triangle by “walking” through a sequence of triangles, and a connected triangle factor is a collection of vertex disjoint triangles each pair of which is connected in this way.

To make this precise, let G=(V,E)G=(V,E) be a graph. A triangle walk in GG is a sequence of edges e1,,epe_{1},\dots,e_{p} in GG such that eie_{i} and ei+1e_{i+1} share a triangle of GG for all i[p1]i\in[p-1]. We say that e1e_{1} and epe_{p} are triangle connected in GG. A triangle component of GG is a maximal set of edges CEC\subseteq E such that every pair of edges in CC is triangle connected. Observe that this induces an equivalence relation on the edges of GG, but a vertex may be part of many triangle components. In addition a triangle component does not need to form an induced subgraph of GG in general. The vertices of a triangle component CiC_{i} are all vertices vv such that some edge uvuv of GG is contained in CiC_{i}. We define the size |C||C| of a triangle component CC to be the number of vertices of CC.

A triangle factor TT in a graph GG is a collection of vertex disjoint triangles in GG. TT is a connected triangle factor if all edges of TT are in the same triangle component of GG. We define the size of TT to be the number of vertices covered by TT. We let CTF(G)\operatorname{CTF}(G) denote the maximum size of a connected triangle factor in GG. It is not difficult to check for example that any connected triangle factor in Gp(n,δ)G_{p}(n,\delta) contains only vertices of at most one of the cliques XiX_{i} (cf. the definition of Gp(n,δ)G_{p}(n,\delta) below Theorem 4) and of the independent set YY. Hence

(2) CTF(Gp(n,δ))=3sp(n,δ)3.\operatorname{CTF}\big{(}G_{p}(n,\delta)\big{)}=3\left\lfloor\frac{\operatorname{sp}(n,\delta)}{3}\right\rfloor.

Suppose that a graph GG contains a square-path of length \ell. Then obviously, CTF(G)3/3\operatorname{CTF}(G)\geq 3\lfloor\ell/3\rfloor. Thus, (2) together with Theorem 4() says that Gp(n,δ)G_{p}(n,\delta) minimises CTF\operatorname{CTF} among all graphs of order nn and minimum degree δ\delta.

We will usually find that the number of vertices in a triangle component and the size of a maximum connected triangle factor in that component are quite different. As we will explain next, for the purposes of embedding squared paths and squared cycles, it is the size of a connected triangle factor that is important.

The Regularity Method.

The Regularity Lemma provides a partition of a dense graph that is suitable for an application of the Blow-up Lemma, which is an embedding result for large host graphs. In order to formulate the versions of these two lemmas that we will use, we first introduce some terminology.

Let G=(V,E)G=(V,E) be a graph and ε,d(0,1]\varepsilon,d\in(0,1]. For disjoint nonempty U,WVU,W\subseteq V the density of the pair (U,W)(U,W) is d(U,W)=e(U,W)/|U||W|d(U,W)=e(U,W)/|U||W|. A pair (U,W)(U,W) is ε\varepsilon-regular if |d(U,W)d(U,W)|ε|d(U^{\prime},W^{\prime})-d(U,W)|\leq\varepsilon for all UUU^{\prime}\subseteq U and WWW^{\prime}\subseteq W with |U|ε|U||U^{\prime}|\geq\varepsilon|U| and |W|ε|W||W^{\prime}|\geq\varepsilon|W|. An ε\varepsilon-regular partition of GG is a partition V0˙V1˙˙VkV_{0}\dot{\cup}V_{1}\dot{\cup}\dots\dot{\cup}V_{k} of VV with |V0|ε|V||V_{0}|\leq\varepsilon|V|, |Vi|=|Vj||V_{i}|=|V_{j}| for all i,j[k]i,j\in[k], and such that for all but at most εk2\varepsilon k^{2} pairs (i,j)[k]2(i,j)\in[k]^{2}, the pair (Vi,Vj)(V_{i},V_{j}) is ε\varepsilon-regular.

Given some 0<d<10<d<1 and a pair of disjoint vertex sets (Vi,Vj)(V_{i},V_{j}) in a graph GG, we say that (Vi,Vj)(V_{i},V_{j}) is (ε,d)(\varepsilon,d)-regular if it is ε\varepsilon-regular and has density at least dd. We say that an ε\varepsilon-regular partition V0˙V1˙˙VkV_{0}\dot{\cup}V_{1}\dot{\cup}\dots\dot{\cup}V_{k} of a graph GG is an (ε,d)(\varepsilon,d)-regular partition if the following is true. For every 1ik1\leq i\leq k, and every vertex vViv\in V_{i}, there are at most (ε+d)n(\varepsilon+d)n edges incident to vv which are not contained in (ε,d)(\varepsilon,d)-regular pairs of the partition.

Given an (ε,d)(\varepsilon,d)-regular partition V0˙V1˙˙VkV_{0}\dot{\cup}V_{1}\dot{\cup}\dots\dot{\cup}V_{k} of a graph GG, we define a graph RR, called the reduced graph of the partition of GG, where R=(V(R),E(R))R=(V(R),E(R)) has V(R)={V1,,Vk}V(R)=\{V_{1},\ldots,V_{k}\} and ViVjE(R)V_{i}V_{j}\in E(R) whenever (Vi,Vj)(V_{i},V_{j}) is an (ε,d)(\varepsilon,d)-regular pair. We will usually omit the partition, and simply say that GG has (ε,d)(\varepsilon,d)-reduced graph RR. We call the partition classes ViV_{i} with i[k]i\in[k] clusters of GG. Observe that our definition of the reduced graph RR implies that for TV(R)T\subseteq V(R) we can for example refer to the set T\bigcup T, which is a subset of V(G)V(G).

The celebrated Szemerédi Regularity Lemma [20] states that every large graph has an ε\varepsilon-regular partition with a bounded number of parts. Here we state the so-called degree form of this lemma (see, e.g., [13, Theorem 1.10]).

Lemma 6 (Regularity Lemma, degree form).

For every ε>0\varepsilon>0 and every integer m0m_{0}, there is m1m_{1} such that for every d[0,1]d\in[0,1] every graph G=(V,E)G=(V,E) on nk1n\geq k_{1} vertices has an (ε,d)(\varepsilon,d)-reduced graph RR on mm vertices with m0mm1m_{0}\leq m\leq m_{1}.

For our purpose it is convenient to work with even a different version of the regularity lemma, which takes into account that we are dealing with graphs of high minimum degree. This lemma is an easy corollary of Lemma 6. A proof can be found, e.g., in [16, Proposition 9].

Lemma 7 (Regularity Lemma, minimum degree form).

For all ε\varepsilon, dd, γ\gamma with 0<ε<d<γ<10<\varepsilon<d<\gamma<1 and for every m0m_{0}, there is m1m_{1} such that every graph GG on n>m1n>m_{1} vertices with δ(G)γn\delta(G)\geq\gamma n has an (ε,d)(\varepsilon,d)-reduced graph RR on mm vertices with m0mm1m_{0}\leq m\leq m_{1} and δ(R)(γdε)m\delta(R)\geq(\gamma-d-\varepsilon)m.

This lemma asserts that the reduced graph RR of GG “inherits” the high minimum degree of GG. We shall use this property in order to reduce the original problem of finding a squared path (or cycle) in an nn-vertex graph with minimum degree γn\gamma n to the problem of finding an arbitrary connected triangle factor of a certain size in an mm-vertex graph RR with minimum degree (γdε)m(\gamma-d-\varepsilon)m. The new problem is much less particular about the required subgraph than the original one and hence easier to attack (see Lemma 9).

This kind of reduction is made possible by the Blow-up Lemma. Roughly, this lemma asserts that a bounded degree graph HH can be embedded into a graph GG with reduced graph RR if there is a homomorphism from HH to a subgraph SS of RR which does not “overfill” any of the clusters in SS. In our setting we apply this lemma with S=K3S=K_{3} and conclude that for each triangle tt of a connected triangle factor TT in RR we find a squared path in GG that almost fills the clusters of GG corresponding to tt. By using the fact that TT is triangle connected it is then possible to connect these squared paths into squared paths or cycles of the desired overall length. In addition, the Blow-up Lemma allows for some control about the start- and end-vertices of the path that is constructed in this way (cf. Lemma 8()).

The following lemma summarises this embedding technique, which is also implicit, e.g., in [10]. For completeness we provide a proof of this lemma in the appendix.

Lemma 8 (Embedding Lemma).

For all d>0d>0 there exists εel>0\varepsilon_{\textsc{{el}}}>0 with the following property. Given 0<ε<εel0<\varepsilon<\varepsilon_{\textsc{{el}}}, for every melm_{\textsc{{el}}}\in\mathbb{N} there exists neln_{\textsc{{el}}}\in\mathbb{N} such that the following hold for any graph GG on nneln\geq n_{\textsc{{el}}} vertices with (ε,d)(\varepsilon,d)-reduced graph RR^{\prime} on mmelm\leq m_{\textsc{{el}}} vertices.

  1. ()

    C32GC^{2}_{3\ell}\subseteq G for every \ell\in\mathbb{N} with 3(1d)CTF(R)nm3\ell\leq(1-d)\operatorname{CTF}(R^{\prime})\frac{n}{m}.

  2. ()

    If K4CK_{4}\subseteq C for each triangle component CC of RR^{\prime}, then C2GC^{2}_{\ell}\subseteq G for every {5}\ell\in\mathbb{N}\setminus\{5\} with 3(1d)CTF(R)nm3\leq\ell\leq(1-d)\operatorname{CTF}(R^{\prime})\frac{n}{m}.

Furthermore, let TT be a connected triangle factor in a triangle component CC of RR with K4CK_{4}\subseteq C, let u1v1,u2v2E(G)u_{1}v_{1},u_{2}v_{2}\in E(G) be disjoint edges, and suppose that there are (not necessarily disjoint) edges X1Y1,X2Y2CX_{1}Y_{1},X_{2}Y_{2}\in C such that the edge uiviu_{i}v_{i} has at least 2dnm2d\frac{n}{m} common neighbours in each cluster XiX_{i} and YiY_{i} for i=1,2i=1,2. Then

  1. ()

    P2GP^{2}_{\ell}\subseteq G for every \ell\in\mathbb{N} with 6(m+2)3<<(1d)|T|nm6(m+2)^{3}<\ell<(1-d)|T|\frac{n}{m}, such that P2P^{2}_{\ell} starts in u1,v1u_{1},v_{1} and ends in u2,v2u_{2},v_{2} (in those orders) and at most (ε+d)n(\varepsilon+d)n vertices of P2P^{2}_{\ell} are not in T\bigcup T.

The copies of K4K_{4} that are required in this lemma play a crucial rôle when embedding squared cycles which are not 33-chromatic.

The Stability Method.

The strategy we just described leaves us with the task of finding a big connected triangle factor TT in the reduced graph RR of GG. However, there is one problem with this approach: The proportion τ\tau of RR covered by TT is roughly equal to the proportion of GG covered by the squared path PP that we obtain from the Embedding Lemma (Lemma 8). However, as explained above, the relative minimum degree γR=δ(R)/|V(R)|\gamma_{R}=\delta(R)/|V(R)| of RR is in general slightly smaller than γG=δ(G)/|V(G)|\gamma_{G}=\delta(G)/|V(G)|, but the extremal graphs for squared paths and connected triangle factors are the same. It follows that we cannot expect that τ\tau is larger than the proportion a maximum squared path covers in a graph with relative minimum degree γR\gamma_{R}, and hence smaller than the proportion we would like to cover for relative minimum degree γG\gamma_{G}.

Consequently we need to be more ambitious and shoot for a bigger connected triangle factor in RR than we can expect for this minimum degree (cf. Lemma 9 (S1) and (S2)). This will of course not always be possible, but it will only fail if RR (and hence GG) is ‘very close’ to the extremal graph Gp(|V(R)|,δ(R))G_{p}(|V(R)|,\delta(R)) (and hence also to Gc(|V(R)|,δ(R))G_{c}(|V(R)|,\delta(R))) in which case we will say that RR is near-extremal (cf. Lemma 9 (S3)).

This approach is called the Stability Method and the following lemma states that it is feasible for our purposes. This lemma additionally guarantees copies of K4K_{4} as required by the Embedding Lemma. We formulate this lemma for graphs GG, but use it on the reduced graph RR later. Its proof does not rely on the Regularity Lemma and is given in Section 3.

Lemma 9 (Stability Lemma).

Given μ>0\mu>0, for any sufficiently small η>0\eta>0 there exists n0n_{0} such that if GG has n>n0n>n_{0} vertices and δ(G)=δ((12+μ)n,2n13)\delta(G)=\delta\in((\frac{1}{2}+\mu)n,\frac{2n-1}{3}), then either

  1. (S1)

    CTF(G)3(2δn)\operatorname{CTF}(G)\geq 3(2\delta-n), or

  2. (S2)

    CTF(G)min(sp(n,δ+ηn),11n20)\operatorname{CTF}(G)\geq\min(\operatorname{sp}(n,\delta+\eta n),\frac{11n}{20}), or

  3. (S3)

    GG has an independent set of size at least nδ11ηnn-\delta-11\eta n whose removal disconnects GG into components, each of size at most 1910(2δn)\frac{19}{10}(2\delta-n).

Moreover, in cases (S2) and (S3) each triangle component of GG contains a K4K_{4}.

By the discussion above, it remains to handle the graphs with near-extremal reduced graph. For these graphs we have a lot of structural information which enables us to show directly that they contain the squared paths and squared cycles we desire, as the following lemma documents. The proof of this lemma is provided in Section 4. In this proof we shall again make use of the embedding lemma, Lemma 8. Accordingly Lemma 10 inherits the upper bound melm_{\textsc{{el}}} on the number of clusters from Lemma 8.

Lemma 10 (Extremal Lemma).

For every ν>0\nu>0, given 0<η,d<108ν40<\eta,d<10^{-8}\nu^{4} there exists ε0>0\varepsilon_{0}>0 such that for every 0<ε<ε00<\varepsilon<\varepsilon_{0} and every melm_{\textsc{{el}}}, there exists NN such that the following holds. Suppose that

  1. ()

    GG is an nn-vertex graph with n>Nn>N and δ(G)=δ>n2+νn\delta(G)=\delta>\frac{n}{2}+\nu n,

  2. ()

    RR is an (ε,d)(\varepsilon,d)-reduced graph of GG of order mmelm\leq m_{\textsc{{el}}},

  3. ()

    each triangle component of RR contains a copy of K4K_{4}.

  4. ()

    V(R)=I˙B1˙B2˙˙BkV(R)=I\dot{\cup}B_{1}\dot{\cup}B_{2}\dot{\cup}\cdots\dot{\cup}B_{k} with k2k\geq 2,

  5. ()

    II is an independent set with |I|(nδ11ηn)m/n|I|\geq(n-\delta-11\eta n)m/n,

  6. ()

    for each i[k]i\in[k] we have 0<|Bi|19m(2δn)/(10n)0<|B_{i}|\leq 19m(2\delta-n)/(10n), and for every j[k]{i}j\in[k]\setminus\{i\} there are no edges between BiB_{i} and BjB_{j} in RR.

Then GG contains Psp(n,δ)2P^{2}_{\operatorname{sp}(n,\delta)} and C2C^{2}_{\ell} for each [3,sc(n,δ)]{5}\ell\in[3,\operatorname{sc}(n,\delta)]\setminus\{5\}.

It is interesting to notice that, although the two functions sp(n,δ)\operatorname{sp}(n,\delta) and sc(n,δ)\operatorname{sc}(n,\delta) are different—their jumps as δ\delta increases occur at slightly different values—they are similar enough that the Stability Lemma covers them both. We will only need to distinguish between squared paths and squared cycles when we examine the near-extremal graphs.

Proof of Theorem 4.

With this we have all the ingredients for the proof of our main theorem, which uses the Regularity Lemma (in form of Lemma 7) to construct a regular partition with reduced graph RR of the host graph GG, the Stability Lemma (Lemma 9) to conclude that RR either contains a big connected triangle factor or is near-extremal, the Embedding Lemma (Lemma 8) to find long squared paths and cycles in GG in the first case, and the Extremal Lemma (Lemma 10) in the second case.

Proof of Theorem 4.

We require our constants to satisfy

νμηdε>0,\nu\gg\mu\gg\eta\gg d\gg\varepsilon>0\,,

which we choose, given ν\nu, as follows. First, we choose μ:=ν/2\mu:=\nu/2. We then choose η>0\eta>0 to be small enough for both Lemma 9 and Lemma 10. Now we set d>0d>0 to be small enough for Lemma 10 and such that dν/10d\leq\nu/10 and dη/10d\leq\eta/10. For this dd Lemma 8 then produces a constant εel\varepsilon_{\textsc{{el}}}. We choose ε>0\varepsilon>0 to be smaller than min{εel,ν/10}\min\{\varepsilon_{\textsc{{el}}},\nu/10\} and sufficiently small for Lemma 10. We choose m0m_{0} to be sufficiently large to apply Lemma 9 to any graph with at least m0m_{0} vertices. We then choose melm_{\textsc{{el}}} such that Lemma 7 guarantees the existence of an (ε,d)(\varepsilon,d)-regular partition with at least m0m_{0} and at most melm_{\textsc{{el}}} parts. Finally we choose n0>neln_{0}>n_{\textsc{{el}}} to be sufficiently large for both Lemma 8 and Lemma 10.

Let n>n0n>n_{0} and δ(n/2+νn,n1]\delta\in(n/2+\nu n,n-1]. Let GG be any nn-vertex graph with δ(G)δ\delta(G)\geq\delta. Observe that it suffices to show that Psp(n,δ)2GP^{2}_{\operatorname{sp}(n,\delta)}\subseteq G and that () of Theorem 4 holds. We first apply Lemma 7 to GG to obtain an (ε,d)(\varepsilon,d)-reduced graph RR on m0mmelm_{0}\leq m\leq m_{\textsc{{el}}} vertices. Let δ=δ(R)(δ/ndε)m>m/2+μm\delta^{\prime}=\delta(R)\geq(\delta/n-d-\varepsilon)m>m/2+\mu m. Then we apply Lemma 9 to RR. There are three possibilities.

First, we could find that CTF(R)3(2δm)\operatorname{CTF}(R)\geq 3(2\delta^{\prime}-m). In this case by Lemma 8 we are guaranteed that for every integer \ell^{\prime} with 3<(1d)CTF(R)n/m3\ell^{\prime}<(1-d)\operatorname{CTF}(R)n/m we have C32GC^{2}_{3\ell^{\prime}}\subseteq G. By choice of dd and ε\varepsilon we have (1d)3(2δm)n/m>6δ3nνn(1-d)\cdot 3(2\delta^{\prime}-m)n/m>6\delta-3n-\nu n. Noting that P2C2P^{2}_{\ell}\subseteq C^{2}_{\ell} we conclude that Psp(n,δ)2GP^{2}_{\operatorname{sp}(n,\delta)}\subseteq G and C2GC^{2}_{\ell}\subseteq G for each integer 6δ3nνn\ell\leq 6\delta-3n-\nu n such that 33 divides \ell, i.e., the second case of Theorem 4() holds.

Second, we could find that CTF(R)min(sp(m,δ+ηm),11m20)\operatorname{CTF}(R)\geq\min(\operatorname{sp}(m,\delta^{\prime}+\eta m),\frac{11m}{20}) and that every triangle component of RR contains a copy of K4K_{4}. By Lemma 8 we are guaranteed that for every [6,(1d)CTF(R)n/m]{5}\ell\in[6,(1-d)\operatorname{CTF}(R)n/m]\setminus\{5\} we have C2GC^{2}_{\ell}\subseteq G. By choice of η\eta and dd we have (1d)CTF(R)n/m>sp(n,δ)sc(n,δ)(1-d)\operatorname{CTF}(R)n/m>\operatorname{sp}(n,\delta)\geq\operatorname{sc}(n,\delta), so we have Psp(n,δ)2GP^{2}_{\operatorname{sp}(n,\delta)}\subseteq G and for each integer [3,sc(n,δ)]{5}\ell\in[3,\operatorname{sc}(n,\delta)]\setminus\{5\} we have C2GC^{2}_{\ell}\subseteq G, i.e., the first case of Theorem 4() holds.

Third, we could find that RR is near-extremal. Then RR contains an independent set on at least mδ11ηmm-\delta^{\prime}-11\eta m vertices whose removal disconnects RR into components of size at most 1910(2δm)\frac{19}{10}(2\delta^{\prime}-m), and each triangle component of RR contains a copy of K4K_{4}. But now GG satisfies the conditions of Lemma 10. It follows that GG contains Psp(n,δ)2P^{2}_{\operatorname{sp}(n,\delta)} and for each [3,sc(n,δ)]{5}\ell\in[3,\operatorname{sc}(n,\delta)]\setminus\{5\} the graph GG contains C2C^{2}_{\ell}, i.e., the first case of Theorem 4() holds. ∎

3. Triangle Components and the proof of Lemma 9

In this section we provide a proof of our stability result for connected triangle factors, Lemma 9. Distinguishing different cases, we analyse the sizes and the structure of the triangle components in the graph GG under study. Before we give more details about our strategy and a sketch of the proof, we introduce some additional definitions and provide a preparatory lemma (Lemma 11).

Let GG be a graph with triangle components C1,,CrC_{1},\ldots,C_{r}. The interior int(G)\operatorname{int}(G) of GG is the set of vertices of GG which are in more than one of the triangle components. For a component CiC_{i}, the interior of CiC_{i}, written int(Ci)\operatorname{int}(C_{i}), is the set of vertices of CiC_{i} which are in int(G)\operatorname{int}(G). The remaining vertices of CiC_{i} are called the exterior (Ci)\partial(C_{i}). That is, (Ci)\partial(C_{i}) is formed by the set of vertices of CiC_{i} which are in no other triangle component of GG. To give an example, by definition the graph Gp(n,δ)G_{p}(n,\delta) has rp(n,δ)r_{p}(n,\delta) triangle components; its interior is the independent set YY (using the notation of the construction of Gp(n,δ)G_{p}(n,\delta) on page 4 in Section 1), with the component exteriors being the cliques X1,,XrX_{1},\ldots,X_{r}.

The following lemma collects some observations about triangle components.

Lemma 11.

Let GG be an nn-vertex graph with δ(G)=δ>n/2\delta(G)=\delta>n/2. Then

  1. ()

    each triangle component CC of GG satisfies |C|>δ|C|>\delta,

  2. ()

    for distinct triangle components CC, CC^{\prime} we have e((C),(C))=0e(\partial(C),\partial(C^{\prime}))=0,

  3. ()

    for each triangle component CC, each vertex uu of CC, and U:={v:uvC}U:=\{v\colon uv\in C\}, the minimum degree in G[U]G[U] is at least 2δn2\delta-n and hence |G[U]|2δn+1|G[U]|\geq 2\delta-n+1.

Proof.

To see () let MM be the vertices of a maximal clique in CC (clearly |M|3|M|\geq 3). If uu and vv are in MM, and xx is a common neighbour of uu and vv, then xx is also in CC. Thus vertices of GCG\setminus C are adjacent to at most 11 vertex of MM and vertices of CC are adjacent to at most |M|1|M|-1 vertices of MM, by maximality of MM. This gives the inequality

|M|δmMd(m)xC(|M|1)+xC1|M|\delta\leq\sum_{m\in M}d(m)\leq\sum_{x\in C}(|M|-1)+\sum_{x\notin C}1

and hence |M|δn(|M|2)|C||M|\delta-n\leq(|M|-2)|C|. Since n<2δn<2\delta we have |C|>δ|C|>\delta as required.

Since δ>n/2\delta>n/2, we have that Γ(u,u)\Gamma(u,u^{\prime})\neq\emptyset for any two vertices uu and uu^{\prime}. Now, if u(C)u\in\partial(C), u(C)u^{\prime}\in\partial(C^{\prime}), xΓ(u,u)x\in\Gamma(u,u^{\prime}), and uuuu^{\prime} was an edge, then uuxuu^{\prime}x would form a triangle. Then uu and uu^{\prime} would be together in some triangle component C′′C^{\prime\prime}, contradicting the fact that they are in the exterior. Therefore, the assertion () follows.

Moreover, for an edge uvuv of CC we have Γ(u,v)C\Gamma(u,v)\subseteq C as CC is a triangle component. Since |Γ(u,v)|2δn|\Gamma(u,v)|\geq 2\delta-n we get (). ∎

Now let us sketch the proof of Lemma 9. Lemma 11() states that triangle components cannot be too small. However, it is not solely the size of the triangle components we are interested in: we want to find a triangle component that contains many vertex disjoint triangles. At this point, Lemma 11() comes into play. It asserts that certain spots in a triangle component induce a graph with minimum degree 2δn2\delta-n. In the proof of Lemma 9 we shall usually (i.e., for many values of δ\delta) use this fact in order to find a big matching MM in such spots (Proposition 12() below asserts that this is possible). Clearly all edges in such a matching are triangle connected and hence it will remain to extend MM to a set of vertex disjoint triangles. For this purpose we will analyse the size of the common neighbourhood Γ(u,v)\Gamma(u,v) of an edge uvuv in MM. We will usually find that Γ(u,v)\Gamma(u,v) is so big that a simple greedy strategy allows us to construct the triangles. For estimating Γ(u,v)\Gamma(u,v) we will often use the following technique: We find a large set XX such that neither uu nor vv has neighbours in XX. This implies |Γ(u,v)|2δ(n|X|)|\Gamma(u,v)|\geq 2\delta-(n-|X|). Observe that Lemma 11() implies that (C)\partial(C) can serve as XX if both uu, v(C)v\in\partial(C^{\prime}) for some triangle components CC and CC^{\prime}.

The strategy we just described works for most values of δ\delta below 35n\frac{3}{5}n (we describe the exceptions below). For δ35n\delta\geq\frac{3}{5}n however, the greedy type argument fails, the reason being that we usually bound the common neighbourhood of an edge used in the argument above by 4δ2n4\delta-2n. But for δ35n\delta\geq\frac{3}{5}n we might have sp(n,δ)>4δ2n\operatorname{sp}(n,\delta)>4\delta-2n (see Figure 1). We solve this problem by using a different strategy in this range of δ\delta. We will still start with a big connected matching MM as before, but use a Hall-type argument to extend MM to a triangle factor TT. More precisely, we find MM in the exterior of some triangle component and then consider for each edge uvuv of MM all common neighbours of uvuv in int(G)\operatorname{int}(G). The Hall-type argument then permits us to find distinct extensions for the edges of MM. To make this argument work we use the fact that in this range of δ\delta the set int(G)\operatorname{int}(G) is an independent set.

We indicated earlier that there are some exceptional values of δ\delta that require special treatment: namely δ\delta close to 35n\frac{3}{5}n and 47n\frac{4}{7}n. Observe that in both ranges the number of triangle components of Gp(n,δ)G_{p}(n,\delta) changes (from 22 to 33 for 35n\frac{3}{5}n, and from 33 to 44 for 47n\frac{4}{7}n) and thus the value sp(n,δ)\operatorname{sp}(n,\delta) as a function in δ\delta jumps (see Figure 1). Roughly speaking, the reason that these two ranges need to be treated separately is that again sp(n,δ)\operatorname{sp}(n,\delta) is not substantially smaller than 4δ2n4\delta-2n here, but we also do not know now that int(G)\operatorname{int}(G) is an independent set. For dealing with these values of δ\delta we will use a somewhat technical case analysis which we provide at the end of this section (as proof of Fact 17).

As explained above, we will apply the following simple observations about matchings in graphs of given minimum degree.

Proposition 12.


  1. ()

    Let G=(X,E)G=(X,E) be a graph with minimum degree δ\delta. Then GG has a matching covering 2min(δ,|X|/2)2\min(\delta,\lfloor|X|/2\rfloor) vertices.

  2. ()

    Let G=(A˙B,E)G=(A\dot{\cup}B,E) be a bipartite graph with parts AA and BB, such that every vertex in AA has degree at least aa and every vertex in BB has degree at least bb. Then GG has a matching covering 2min(a+b,|A|,|B|)2\min(a+b,|A|,|B|) vertices.

Proof.

We first prove (). Let MM be a maximum matching in GG, and assume that MM contains less than min(δ,|X|/2)\min(\delta,\lfloor|X|/2\rfloor) edges. In particular, there are two vertices x,yXx,y\in X not covered by MM. Clearly, all neighbours of xx and yy are covered by MM.

We claim that there is an edge uvuv in MM with xu,yvExu,yv\in E. Indeed, suppose that this is not the case. Then |eΓ(x)|+|eΓ(y)|2|e\cap\Gamma(x)|+|e\cap\Gamma(y)|\leq 2 for each eMe\in M. We therefore have

δ+δ|Γ(x)|+|Γ(y)|=eM(|eΓ(x)|+|eΓ(y)|)2|M|,\delta+\delta\leq|\Gamma(x)|+|\Gamma(y)|=\sum_{e\in M}(|e\cap\Gamma(x)|+|e\cap\Gamma(y)|)\leq 2|M|\,,

contradicting the fact that δ>|M|\delta>|M|.

Now, let uvMuv\in M be an edge as in the claim above. Since xu,yuExu,yu\in E we get that x,u,v,yx,u,v,y is an MM-augmenting path, a contradiction.

Next we prove (). Let MM be a maximum matching in GG. We are done unless there are vertices uAu\in A and vBv\in B not contained in MM. There cannot be an edge xyMxy\in M such that uyuy and xvxv are edges of GG by maximality of MM, since then u,y,x,vu,y,x,v was an MM-augmenting path. But uu has at least aa neighbours in V(M)BV(M)\cap B, and vv at least bb neighours in V(M)AV(M)\cap A, so there must be at least a+ba+b edges in MM. ∎

Before turning to the proof of Lemma 9 let us quickly collect some analytical data about sp(n,δ)\operatorname{sp}(n,\delta) and rp(n,δ)=:rr_{p}(n,\delta)=:r. It is not difficult to check that

(3) (r+1)nr2(r+1)1δ<rnr+12r1andnδ2δn+1r<δ+12δn+1.\begin{split}\frac{(r+1)n-r}{2(r+1)-1}&\leq\delta<\frac{rn-r+1}{2r-1}\quad\text{and}\quad\\ \frac{n-\delta}{2\delta-n+1}&\leq r<\frac{\delta+1}{2\delta-n+1}\,.\end{split}

For the proof of Lemma 9 it will be useful to note in addition that given μ>0\mu>0, for every 0<η<η0=η0(μ)0<\eta<\eta_{0}=\eta_{0}(\mu), there is n1=n1(η)n_{1}=n_{1}(\eta) such that the following holds for all nn1n\geq n_{1}. For all δ,δ>n2+μn\delta,\delta^{\prime}>\frac{n}{2}+\mu n, where δ\delta is such that sp(n,δ+ηn)1120n\operatorname{sp}(n,\delta+\eta n)\leq\frac{11}{20}n, and where δ\delta^{\prime} is such that we have rp(n,δ)3r_{p}(n,\delta^{\prime})\geq 3 and either rp(n,δ)5r_{p}(n,\delta^{\prime})\geq 5 or rp(n,δ)=rp(n,δ+ηn)r_{p}(n,\delta^{\prime})=r_{p}(n,\delta^{\prime}+\eta n), we have

(4) sp(n,δ+ηn)32min(δrp(n,δ+ηn)12,δ+3ηnrp(n,δ+ηn)2),\operatorname{sp}(n,\delta+\eta n)\leq\frac{3}{2}\min\Big{(}\frac{\delta}{r_{p}(n,\delta+\eta n)-1}-2,\,\frac{\delta+3\eta n}{r_{p}(n,\delta+\eta n)}-2\Big{)},
(5) sp(n,δ+ηn)19203(2δn)26δ3n100ηn,andsp(n,δ+ηn)4δ2n,\begin{split}\operatorname{sp}(n,\delta+\eta n)&\leq\tfrac{19}{20}\cdot 3(2\delta-n)-2\leq 6\delta-3n-100\eta n,\quad\text{and}\quad\\ \operatorname{sp}(n,\delta^{\prime}+\eta n)&\leq 4\delta^{\prime}-2n,\end{split}

which follows immediately from the definition of sp(n,δ)\operatorname{sp}(n,\delta) in (1) (see also Figure 1).

Proof of Lemma 9.

Given μ\mu and any 0<η<min(11000,η0(μ),2μ2/3)0<\eta<\min(\frac{1}{1000},\eta_{0}(\mu),2\mu^{2}/3), where η0(μ)\eta_{0}(\mu) is as above (4), let n0:=max(n1(η),2/η)n_{0}:=\max(n_{1}(\eta),2/\eta) with n1(η)n_{1}(\eta) as above (4). Let nn0n\geq n_{0}. This in particular means that we may assume the inequalities (4) and (5) in what follows. Define γ:=δ/n\gamma:=\delta/n, and r:=rp(n,δ)r:=r_{p}(n,\delta) and r:=rp(n,δ+ηn)r^{\prime}:=r_{p}(n,\delta+\eta n).

If GG has only one triangle component then Theorem 3 guarantees that CTF(G)6δ3n\operatorname{CTF}(G)\geq 6\delta-3n and so we are in Case (S1). Thus we may assume in the following that GG has at least two triangle components. Then Lemma 11() implies that int(C)\operatorname{int}(C)\not=\emptyset for any triangle component CC.

Suppose that CC is a triangle component of GG which does not contain a copy of K4K_{4}. Let uu be a vertex of CC, and U:={v:uvC}U:=\{v\colon uv\in C\}. By Lemma 11() we have δ(G[U])2δn\delta(G[U])\geq 2\delta-n. Because CC contains no copy of K4K_{4}, UU contains no triangle. By Turán’s theorem we have |U|2(2δn)|U|\geq 2(2\delta-n), and so by Proposition 12() the set UU contains a matching MM with 2δn2\delta-n edges. Finally we choose greedily for each eMe\in M a distinct vertex vV(G)v\in V(G) such that evev is a triangle. Since UU is triangle free all these vertices must lie outside UU, and since |Γ(e)|2δn|\Gamma(e)|\geq 2\delta-n we cannot fail to find distinct vertices for each edge. This yields a set TT of 2δn2\delta-n vertex-disjoint triangles which are all in CC. So CTF(G)6δ3n\operatorname{CTF}(G)\geq 6\delta-3n and we are in case (S1). Henceforth we assume that every triangle component of GG contains a copy of K4K_{4}.

We continue by considering the case 3n25δ<2n13\frac{3n-2}{5}\leq\delta<\frac{2n-1}{3}. The following observation readily implies the lemma in this range, as we will see in Fact 14.

Fact 13.

If δ(G)(352η)n\delta(G)\geq(\frac{3}{5}-2\eta)n, GG has exactly 22 triangle components, int(G)\operatorname{int}(G) is independent, and either |int(G)|<nδ11ηn|\operatorname{int}(G)|<n-\delta-11\eta n or the exterior XX of the triangle component with most vertices satisfies |X|1910(2δn)|X|\geq\frac{19}{10}(2\delta-n), then CTF(G)min(sp(n,δ+ηn),1120n)\operatorname{CTF}(G)\geq\min(\operatorname{sp}(n,\delta+\eta n),\frac{11}{20}n).

Proof of Fact 13.

First, by Lemma 11() a vertex xXx\in X cannot have neighbours in the exterior of the other triangle component, so Γ(x)Xint(G)\Gamma(x)\subseteq X\cup\operatorname{int}(G). Thus δ(G[X])δ|int(G)|\delta(G[X])\geq\delta-|\operatorname{int}(G)|, which by Proposition 12() means that there is a matching MM in G[X]G[X] with

(6) |M|=min(δ|int(G)|,|X|/2)|M|=\min(\delta-|\operatorname{int}(G)|,\lfloor|X|/2\rfloor)

edges.

We aim to pair off edges of MM with vertices of int(G)\operatorname{int}(G) to form a sufficiently large number of vertex-disjoint triangles. To see that a triangle factor resulting from this process will be connected, observe that all edges of MM are in XX, and since XX is a triangle component exterior, the edges of MM are triangle connected. To form triangles from edges of MM and vertices of int(G)\operatorname{int}(G), we introduce an auxiliary bipartite graph HH with vertex set M˙int(G)M\dot{\cup}\operatorname{int}(G), where uvMuv\in M is adjacent in HH to wint(G)w\in\operatorname{int}(G) iff uvwuvw is a triangle of GG. Every vertex of XX has at least δ|X|\delta-|X| neighbours in int(G)\operatorname{int}(G), and so every edge of MM has at least a:=2(δ|X|)|int(G)|a:=2(\delta-|X|)-|\operatorname{int}(G)| common neighbours in int(G)\operatorname{int}(G). At the same time, since int(G)\operatorname{int}(G) is independent, every vertex of int(G)\operatorname{int}(G) has at least δ(n|int(G)||X|)\delta-(n-|\operatorname{int}(G)|-|X|) neighbours in XX, of which all but |X|2|M||X|-2|M| must be in MM. So every vertex of int(G)\operatorname{int}(G) must have at least

b:=δ(n|int(G)||X|)(|X|2|M|)|M|=δn+|int(G)|+|M|b:=\delta-(n-|\operatorname{int}(G)|-|X|)-(|X|-2|M|)-|M|=\delta-n+|\operatorname{int}(G)|+|M|

edges of MM in its neighbourhood. By Proposition 12() there is a matching in HH on at least min(a+b,|M|,|int(G)|)\min(a+b,|M|,|\operatorname{int}(G)|) edges, and hence a connected triangle factor in GG with so many triangles. Observe that

(7) a+b=2δ2|X||int(G)|+δn+|int(G)|+|M|=3δn2|X|+|M|.\begin{split}a+b&=2\delta-2|X|-|\operatorname{int}(G)|+\delta-n+|\operatorname{int}(G)|+|M|\\ &=3\delta-n-2|X|+|M|\,.\end{split}

Since there are two triangle components in GG, there is a vertex uu in a triangle component exterior which is not XX. Therefore uu has no neighbour in XX, so |X|<nδ|X|<n-\delta. Since δ(352η)n\delta\geq(\frac{3}{5}-2\eta)n, by (7) we have

(8) a+b>|M|10ηn.a+b>|M|-10\eta n\,.

Furthermore,

(9) if|X|(253η)n,thena+b|M|.\quad\text{if}\quad|X|\leq(\tfrac{2}{5}-3\eta)n\,,\quad\text{then}\quad a+b\geq|M|\,.

By Lemma 11() we have |int(G)|2δnn54ηn|\operatorname{int}(G)|\geq 2\delta-n\geq\frac{n}{5}-4\eta n. Since η11000\eta\leq\frac{1}{1000} we have

3|int(G)|3n512ηn>11n20.3|\operatorname{int}(G)|\geq\frac{3n}{5}-12\eta n>\frac{11n}{20}\,.

Thus we have CTF(G)11n20\operatorname{CTF}(G)\geq\frac{11n}{20} if we find a matching in HH covering int(G)\operatorname{int}(G). It remains, then, to check that we have

(10) 3min(a+b,|M|)min(sp(n,δ+ηn),1120n).3\min(a+b,|M|)\geq\min(\operatorname{sp}(n,\delta+\eta n),\frac{11}{20}n).

We distinguish two cases.

Case 1: a+b<|M|a+b<|M|. By (9) this forces |X|>(253η)n|X|>(\frac{2}{5}-3\eta)n. Since we have |M|=min(δ|int(G)|,|X|/2)|M|=\min(\delta-|\operatorname{int}(G)|,\lfloor|X|/2\rfloor) by (6), there are two possibilities. If |M|=|X|/2|M|=\lfloor|X|/2\rfloor then we have

a+b(8)|X|210ηn>n512ηn>11n60,a+b\overset{\mbox{\tiny{\eqref{eq:fact1:aplusb2}}}}{\geq}\Big{\lfloor}\frac{|X|}{2}\Big{\rfloor}-10\eta n>\frac{n}{5}-12\eta n>\frac{11n}{60}\,,

which proves (10) in this subcase. If, on the other hand, |M|=δ|int(G)||M|=\delta-|\operatorname{int}(G)|, then we use that int(G)\operatorname{int}(G) is independent, which implies int(G)nδ\operatorname{int}(G)\leq n-\delta and thus

a+b(8)|M|10ηn=δ|int(G)|10ηn2δn10ηn(5)13sp(n,δ+ηn),\begin{split}a+b&\overset{\mbox{\tiny{\eqref{eq:fact1:aplusb2}}}}{\geq}|M|-10\eta n=\delta-|\operatorname{int}(G)|-10\eta n\geq 2\delta-n-10\eta n\\ &\overset{\mbox{\tiny{\eqref{eq:sqpb}}}}{\geq}\tfrac{1}{3}\operatorname{sp}(n,\delta+\eta n)\,,\end{split}

which proves (10) in this subcase.

Case 2: a+b|M|a+b\geq|M|. In this case, HH contains a matching of size |M||M|, so we have CTF(G)3|M|=3min(δ|int(G)|,|X|/2)\operatorname{CTF}(G)\geq 3|M|=3\min(\delta-|\operatorname{int}(G)|,\lfloor|X|/2\rfloor). Again there are two possibilities, depending on |M||M|. If |M|=δ|int(G)||M|=\delta-|\operatorname{int}(G)|, we are done by (5) exactly as before. If, on the other hand, |M|=|X|/2|M|=\lfloor|X|/2\rfloor, then (10) holds (and hence we are done) unless

(11) 3|X|2<min(sp(n,δ+ηn),1120n).3\lfloor\tfrac{|X|}{2}\rfloor<\min\big{(}\operatorname{sp}(n,\delta+\eta n),\tfrac{11}{20}n\big{)}\,.

We now assume (11) in order to derive a contradiction, and make a final subcase distinction.

First assume that sp(n,δ+ηn)<1120n\operatorname{sp}(n,\delta+\eta n)<\frac{11}{20}n. Then r2r^{\prime}\geq 2 and hence (11) and (1) imply

|X|<12(δ+ηn)+3<51100δ<1910(2δn),|X|<\tfrac{1}{2}(\delta+\eta n)+3<\tfrac{51}{100}\delta<\tfrac{19}{10}(2\delta-n)\,,

because δ(352η)n\delta\geq(\frac{3}{5}-2\eta)n and η11000\eta\leq\frac{1}{1000}. Furthermore, since GG has two triangle components whose exterior is of size at most XX by assumption we have |int(G)|>n2|X|=nδηn6|\operatorname{int}(G)|>n-2|X|=n-\delta-\eta n-6, a contradiction to the the conditions of Fact 13.

Now assume that sp(n,δ+ηn)1120n\operatorname{sp}(n,\delta+\eta n)\geq\frac{11}{20}n. Then we have δ>(232η)n\delta>(\frac{2}{3}-2\eta)n. By Lemma 11() we have |X|nδ<(13+2η)n|X|\leq n-\delta<(\frac{1}{3}+2\eta)n and so |X|<1910(2δn)|X|<\frac{19}{10}(2\delta-n). Further |int(G)|n2|X|2δn>n34ηn>nδ11ηn|\operatorname{int}(G)|\geq n-2|X|\geq 2\delta-n>\frac{n}{3}-4\eta n>n-\delta-11\eta n, which again contradicts the conditions of Fact 13. ∎

Fact 14.

Lemma 9 is true for 3n25δ<2n13\frac{3n-2}{5}\leq\delta<\frac{2n-1}{3}.

Proof of Fact 14.

Observe that in this range r=2r=2. Assume GG has an edge uvuv in int(G)\operatorname{int}(G), let xx be a common neighbour of uu and vv and CC be the triangle component containing uxux and vxvx. Since uvint(G)uv\in\operatorname{int}(G) there are edges uyuy and vzvz of GG outside CC. The sets Γ(u,y)\Gamma(u,y), Γ(v,z)\Gamma(v,z) and {u,v,x,y,z}\{u,v,x,y,z\} are pairwise disjoint, and xx is not adjacent to Γ(u,y)Γ(v,z){y,z}\Gamma(u,y)\cup\Gamma(v,z)\cup\{y,z\}. So δd(x)(n1)2(2δn)2\delta\leq d(x)\leq(n-1)-2(2\delta-n)-2 which is only possible when δ(3n3)/5\delta\leq(3n-3)/5, a contradiction. Thus int(G)\operatorname{int}(G) is an independent set, which implies |int(G)|nδ|\operatorname{int}(G)|\leq n-\delta. Hence, by Lemma 11(), GG cannot have more than two triangle components. In particular, all vertices in int(G)\operatorname{int}(G) lie in both triangle components of GG. So if |int(G)|nδ11ηn|\operatorname{int}(G)|\geq n-\delta-11\eta n then int(G)\operatorname{int}(G) is the desired large independent set for Case (S3). If moreover all triangle component exteriors are of size 1910(2δn)\frac{19}{10}(2\delta-n) at most we are in Case (S3). Otherwise (if int(G)\operatorname{int}(G) is small or a triangle component exterior is large) Fact 13 gives CTF(G)min(sp(n,δ+ηn),1120n)\operatorname{CTF}(G)\geq\min(\operatorname{sp}(n,\delta+\eta n),\frac{11}{20}n) which is Case (S2). ∎

For the remainder of the proof, we suppose δ<3n25\delta<\frac{3n-2}{5} and accordingly r3r\geq 3 and r2r^{\prime}\geq 2. For dealing with this case we first establish two auxiliary facts. The first one captures the greedy technique for finding a large connected triangle factor that we sketched in the beginning of this section. We will use this technique throughout the rest of the proof.

Fact 15.

If there are two sets U1,U2V(G)U_{1},U_{2}\subseteq V(G) such that no vertex in U1U_{1} has a neighbour in U2U_{2}, all edges in G[U1]G[U_{1}] are triangle connected and δ(G[U1])δ1\delta(G[U_{1}])\geq\delta_{1} then CTF(G)min(3|U1|/2,3δ1,2δn+|U2|)\operatorname{CTF}(G)\geq\min(3\lfloor|U_{1}|/2\rfloor,3\delta_{1},2\delta-n+|U_{2}|).

Proof of Fact 15.

By Proposition 12() we can find a matching MM^{\prime} in U1U_{1} covering

min(2|U1|/2,2δ1)\min(2\lfloor|U_{1}|/2\rfloor,2\delta_{1})

vertices. Let MM be a subset of MM^{\prime} covering min(2|U1|/2,2δ1,(4δ2n+2|U2|)/3)\min(2\lfloor|U_{1}|/2\rfloor,2\delta_{1},(4\delta-2n+2|U_{2}|)/3) vertices. For each edge eMe\in M in turn we pick greedily a common neighbour of ee outside both MM and the previously chosen common neighbours to obtain a set TT of disjoint triangles. For any x,yU1x,y\in U_{1} we have |Γ(x,y)|2δ(n|U2|)|\Gamma(x,y)|\geq 2\delta-(n-|U_{2}|). We claim that this implies that TT can be constructed, covering all of MM. Indeed, in each step of the greedy procedure we have strictly more than 2δ(n|U2|)3|M|02\delta-(n-|U_{2}|)-3|M|\geq 0 common neighbours of eMe\in M available. Hence TT covers at least min(3|U1|/2,3δ1,2δn+|U2|)\min(3\lfloor|U_{1}|/2\rfloor,3\delta_{1},2\delta-n+|U_{2}|) vertices. Note further that TT is a connected triangle factor because all edges in G[U1]G[U_{1}] are triangle connected. ∎

Below, our goal will be to show that int(G)\operatorname{int}(G) is an independent set. The following fact prepares us for this step.

Fact 16.

Let uvuv be an edge in int(G)\operatorname{int}(G). Unless r=2r^{\prime}=2 at least one vertex, uu or vv, is contained in at most r1r^{\prime}-1 triangle components.

Proof of Fact 16.

Let C1C_{1} be the triangle component containing uvint(G)uv\in\operatorname{int}(G) along with the (non-empty) common neighbourhood Γ(u,v)\Gamma(u,v) (and perhaps some other neighbours of uu or vv separately). Suppose that CC1C\neq C_{1}, and uu is a vertex of CC. Then by Lemma 11(), there are at least 2δn+12\delta-n+1 neighbours xx of uu such that the edge uxux is in CC. Now suppose that uu lies in at least r1r^{\prime}-1 triangle components other than C1C_{1}. It follows that there is a set UuΓ(u)U_{u}\subseteq\Gamma(u) of vertices xx such that uxux is not in C1C_{1}, with |Uu|(r1)(2δn+1)|U_{u}|\geq(r^{\prime}-1)(2\delta-n+1), since no edge lies in two distinct triangle components. Suppose furthermore that vv too lies in at least r1r^{\prime}-1 triangle components other than C1C_{1}. Then there exists an analogously defined set UvU_{v}. Since all vertices of Γ(u,v)\Gamma(u,v) form triangles of C1C_{1} with uu and vv, the three sets Γ(u,v)\Gamma(u,v), UuU_{u} and UvU_{v} are pairwise disjoint, and thus |UuUv|(2r2)(2δn+1)|U_{u}\cup U_{v}|\geq(2r^{\prime}-2)(2\delta-n+1). Now given any xΓ(u,v)x\in\Gamma(u,v), since uxux and vxvx are both in C1C_{1}, xx cannot be adjacent to any vertex of UuUvU_{u}\cup U_{v}. But then δd(x)<n(2r2)(2δn+1)\delta\leq d(x)<n-(2r^{\prime}-2)(2\delta-n+1) which is equivalent to 2r2<(nδ)/(2δn+1)2r^{\prime}-2<(n-\delta)/(2\delta-n+1). By (3) the right-hand side is at most rr and thus we get 2r2<r2r^{\prime}-2<r. Since rr+1r\leq r^{\prime}+1 however this is a contradiction unless r2r^{\prime}\leq 2. ∎

We assume from now on, that

(12) CTF(G)<sp(n,δ+ηn),\operatorname{CTF}(G)<\operatorname{sp}(n,\delta+\eta n)\,,

that is, we are not in Cases (S1) or (S2). Our aim is to conclude that then (*) int(G)\operatorname{int}(G) is an independent set and that its vertices are contained in at least rr^{\prime} triangle components. It turns out, however, that we need to consider the cases r=r+1=3r=r^{\prime}+1=3 and r=r+1=4r=r^{\prime}+1=4 (i.e., the cases when the minimum degree δ\delta is just a little bit below 35n\frac{3}{5}n and 47n\frac{4}{7}n, respectively) separately. Unfortunately these two cases, which are treated by Fact 17, require a somewhat technical case analysis, which we prefer to defer to the end of the section.

Fact 17.

If r=r+1=3r=r^{\prime}+1=3 or r=r+1=4r=r^{\prime}+1=4 then int(G)\operatorname{int}(G) is an independent set all of whose vertices are contained in at least rr^{\prime} triangle components.

Assuming this fact is true we can deduce (*) for all values r3r\geq 3 as follows.

Fact 18.

The set int(G)\operatorname{int}(G) is an independent set (and hence of size at most nδn-\delta) all of whose vertices are contained in at least rr^{\prime} triangle components.

Proof of Fact 18.

Recall that we have r3r\geq 3 at this point of the proof. Moreover, the cases r=r+1=3r=r^{\prime}+1=3 and r=r+1=4r=r^{\prime}+1=4 are handled by Fact 17. So we assume we are not in these cases; in particular, r3r^{\prime}\geq 3. We will show that then each vertex of int(G)\operatorname{int}(G) is contained in at least rr^{\prime} triangle components. Once we establish this, Fact 16 implies that there are no edges in int(G)\operatorname{int}(G) and so int(G)\operatorname{int}(G) is an independent set as desired.

To prove that each vertex of int(G)\operatorname{int}(G) is contained in at least rr^{\prime} triangle components we assume the contrary and show that then CTF(G)sp(n,δ+ηn)\operatorname{CTF}(G)\geq\operatorname{sp}(n,\delta+\eta n), a contradiction to (12). Indeed, let wint(G)w\in\operatorname{int}(G) and suppose that there are k>1k>1 triangle components C1,,CkC_{1},\dots,C_{k} containing ww. For i[k]i\in[k] let UiU_{i} be the set of neighbours uu of ww such that uwCiuw\in C_{i}. By Lemma 11() we have δ(G[Ui])2δn\delta(G[U_{i}])\geq 2\delta-n and |Ui|2δn+1|U_{i}|\geq 2\delta-n+1. Suppose that U1U_{1} is the largest of the UiU_{i}. No vertex in U1U_{1} has a neighbour in U2U_{2}, since the components are distinct. In addition, all edges in G[U1]G[U_{1}] are triangle connected, because U1Γ(w)U_{1}\subseteq\Gamma(w). Therefore Fact 15 implies that there is a connected triangle factor TT in GG covering min(3|U1|/2,3(2δn),2δn+|U2|)min(3|U1|/2,4δ2n)\min(3\lfloor|U_{1}|/2\rfloor,3(2\delta-n),2\delta-n+|U_{2}|)\geq\min(3\lfloor|U_{1}|/2\rfloor,4\delta-2n) vertices. If ww lies only in r1r^{\prime}-1 triangle components then |U1|δ/(r1)|U_{1}|\geq\delta/(r^{\prime}-1) and therefore TT covers at least min(3δ/(2r2),4δ2n)\min(3\lfloor\delta/(2r^{\prime}-2)\rfloor,4\delta-2n) vertices. Now since (4) holds, we have 32δ/(r1)2sp(n,δ+ηn)\frac{3}{2}\delta/(r^{\prime}-1)-2\geq\operatorname{sp}(n,\delta+\eta n). Since rr3r\geq r^{\prime}\geq 3 and we have excluded the case r=r+1=4r=r^{\prime}+1=4, by (5) we have 4δ2nsp(n,δ+ηn)4\delta-2n\geq\operatorname{sp}(n,\delta+\eta n). It follows that TT covers at least sp(n,δ+ηn)\operatorname{sp}(n,\delta+\eta n) vertices, in contradiction to (12). ∎

Fact 19.

We are in Case (S3).

Proof of Fact 19.

Fact 18 tells us that int(G)\operatorname{int}(G) is an independent set. By Lemma 11() and the fact that δ>nδ\delta>n-\delta we have that every triangle component in GG has an exterior, and by Lemma 11() that there are no edges between any triangle component exteriors. Hence, to show that we are in Case (S3), it is enough to prove that

(13) |int(G)|:=αnδ11ηnand|X1|1910(2δn)|\operatorname{int}(G)|:=\alpha\geq n-\delta-11\eta n\qquad\text{and}\qquad|X_{1}|\leq\frac{19}{10}(2\delta-n)

for the biggest triangle component exterior X1X_{1} in GG. Suppose for a contradiction that this is not the case. We first claim that this forces GG to have exactly rr^{\prime} triangle components.

Indeed, assume GG has kr+1k\geq r^{\prime}+1 triangle components. Each of these components CC has vertices in its exterior (C)\partial(C), and so by Lemma 11() the minimum degree of GG implies |(C)|δα+12δn+1|\partial(C)|\geq\delta-\alpha+1\geq 2\delta-n+1. We let these triangle component exteriors be X1,,XkX_{1},\ldots,X_{k}, with X1X_{1} being the biggest. Since n=|X1˙˙Xk˙int(G)|n=|X_{1}\dot{\cup}\ldots\dot{\cup}X_{k}\dot{\cup}\operatorname{int}(G)|, we have (r+1)(δα)+α<n(r^{\prime}+1)(\delta-\alpha)+\alpha<n. We distinguish two cases.

Case 1: (13) fails because α<nδ11ηn\alpha<n-\delta-11\eta n. Then we obtain

(r+1)δ<n+rα<n+r(nδ11ηn)=(r+1)n(9r1)ηnrδ(2r+1)ηn.\begin{split}(r^{\prime}+1)\delta&<n+r^{\prime}\alpha<n+r^{\prime}(n-\delta-11\eta n)\\ &=(r^{\prime}+1)n-(9r^{\prime}-1)\eta n-r^{\prime}\delta-(2r^{\prime}+1)\eta n\,.\end{split}

Straightforward manipulation gives

δ+ηn<(r+1)n(9r1)ηn2(r+1)1.\delta+\eta n<\frac{(r^{\prime}+1)n-(9r^{\prime}-1)\eta n}{2(r^{\prime}+1)-1}\,.

Since (9r1)ηn9r1r(9r^{\prime}-1)\eta n\geq 9r^{\prime}-1\geq r^{\prime} this contradicts (3) applied to r=rp(n,δ+ηn)r^{\prime}=r_{p}(n,\delta+\eta n).

Case 2: (13) fails because |X1|>1910(2δn)|X_{1}|>\frac{19}{10}(2\delta-n). Let xx be any vertex in X2X_{2}. Since xx has at least δ\delta neighbours, none of which are in X1˙X3˙˙XkX_{1}\dot{\cup}X_{3}\dot{\cup}\ldots\dot{\cup}X_{k}, we have

1+δ+1910(2δn)+(k2)(2δn+1)\displaystyle 1+\delta+\frac{19}{10}(2\delta-n)+(k-2)(2\delta-n+1) n, hence\displaystyle\leq n\,\text{, hence}
1910(2δn)+(r1)(2δn)\displaystyle\frac{19}{10}(2\delta-n)+(r^{\prime}-1)(2\delta-n) <nδ.\displaystyle<n-\delta\,.

By (3) we have r(nδηn)/(2δ+2ηnn+1)r^{\prime}\geq(n-\delta-\eta n)/(2\delta+2\eta n-n+1). Combined with the last inequality, this gives

910(2δn)+nδηn2δn+1+2ηn(2δn)<nδ\frac{9}{10}(2\delta-n)+\frac{n-\delta-\eta n}{2\delta-n+1+2\eta n}(2\delta-n)<n-\delta

Now provided that η<2μ2/3\eta<2\mu^{2}/3, and since 2δn2μn2\delta-n\geq 2\mu n, we have

(2δn+2ηn+1)(1μ)<2δn+3ηnμ(2δn)2δn+3ηn2μ2n<2δn,\begin{split}(2\delta-n+2\eta n+1)(1-\mu)&<2\delta-n+3\eta n-\mu(2\delta-n)\\ &\leq 2\delta-n+3\eta n-2\mu^{2}n<2\delta-n\,,\end{split}

and we obtain 95μn+(1μ)(nδηn)<nδ\frac{9}{5}\mu n+(1-\mu)(n-\delta-\eta n)<n-\delta which is a contradiction since nδ<n/2n-\delta<n/2 and η<μ\eta<\mu.

Hence, if (13) fails, then GG has indeed exactly rr^{\prime} triangle components.

Now we use this fact in order to derive a contradiction to (12). Observe that, if r=2r^{\prime}=2, and accordingly δ(352η)n\delta\geq(\frac{3}{5}-2\eta)n, then Fact 13 implies that (13) holds, because according to (12) we have CTF(G)<sp(n,δ+ηn)\operatorname{CTF}(G)<\operatorname{sp}(n,\delta+\eta n). In the remainder we assume r3r^{\prime}\geq 3.

Since every vertex in X1X_{1} has neighbours only in X1X_{1} and int(G)\operatorname{int}(G), and |int(G)|nδ|\operatorname{int}(G)|\leq n-\delta, we have δ(G[X1])2δn\delta(G[X_{1}])\geq 2\delta-n. Furthermore, since no vertex in X1X_{1} has neighbours in either X2X_{2} or X3X_{3}, and |X2˙X3|2(2δn+1)|X_{2}\dot{\cup}X_{3}|\geq 2(2\delta-n+1), we can apply Fact 15 to obtain

CTF(G)\displaystyle\operatorname{CTF}(G) min(3|X1|/2,3(2δn),2δn+2(2δn+1))\displaystyle\geq\min\big{(}3\lfloor|X_{1}|/2\rfloor,3(2\delta-n),2\delta-n+2(2\delta-n+1)\big{)}
=min(3|X1|/2,3(2δn)).\displaystyle=\min\big{(}3\lfloor|X_{1}|/2\rfloor,3(2\delta-n)\big{)}\,.

Now by (5), CTF(G)3(2δn)\operatorname{CTF}(G)\geq 3(2\delta-n) is a contradiction to (12), so to complete our proof it remains to show that if (13) fails, then CTF(G)3|X1|/2\operatorname{CTF}(G)\geq 3\lfloor|X_{1}|/2\rfloor is also a contradiction to (12). Again, we distinguish two cases.

Case 1: (13) fails because α<nδ11ηn\alpha<n-\delta-11\eta n. Since X1X_{1} is the largest exterior, we have |X1|(δ+11ηn)/r|X_{1}|\geq(\delta+11\eta n)/r^{\prime}. But we have by (4) that

sp(n,δ+ηn)32δ+3ηnr2<3δ+11ηn2r,\operatorname{sp}(n,\delta+\eta n)\leq\frac{3}{2}\frac{\delta+3\eta n}{r^{\prime}}-2<3\Big{\lfloor}\frac{\delta+11\eta n}{2r^{\prime}}\Big{\rfloor}\,,

so that CTF(G)3|X1|/2\operatorname{CTF}(G)\geq 3\lfloor|X_{1}|/2\rfloor is indeed a contradiction to (12).

Case 2: (13) fails because |X1|>1910(2δn)|X_{1}|>\frac{19}{10}(2\delta-n). Then CTF(G)3|X1|/25720(2δn)2\operatorname{CTF}(G)\geq 3\lfloor|X_{1}|/2\rfloor\geq\frac{57}{20}(2\delta-n)-2, which by (5) is a contradiction to (12), as desired. ∎

This completes, modulo the proof of Fact 17, the proof of Lemma 9. ∎

It remains to show Fact 17. Note that we can use all facts from the proof of Lemma 9 that precede Fact 17. We will further assume that all constants and variables are set up as in this proof.

Proof of Fact 17.

Recall that we assumed (12), i.e., CTF(G)<sp(n,δ+ηn)\operatorname{CTF}(G)<\operatorname{sp}(n,\delta+\eta n), in this part of the proof of Lemma 9. We distinguish two cases.

Case 1: r=3r=3 and r=2r^{\prime}=2. In this case δ(G)[(352η)n,(35+η)n]\delta(G)\in[(\frac{3}{5}-2\eta)n,(\frac{3}{5}+\eta)n]. Trivially each vertex of int(G)\operatorname{int}(G) is contained in at least r=2r^{\prime}=2 triangle components. Suppose for a contradiction that there is an edge uvuv in int(G)\operatorname{int}(G). Let xx be a common neighbour of uu and vv, and CC be the triangle component containing the triangle uvxuvx. Let U1:={y:uyC}U_{1}:=\{y:uy\in C\} and V1:={y:vyC}V_{1}:=\{y:vy\in C\} and let U2:=Γ(u)U1U_{2}:=\Gamma(u)\setminus U_{1} and V2:=Γ(v)V1V_{2}:=\Gamma(v)\setminus V_{1}. Observe that U2V2=U_{2}\cap V_{2}=\emptyset.

By definition xx is not in, and has no neighbour in, U2˙V2U_{2}\dot{\cup}V_{2}. It follows that |U2˙V2|<nδ(25+2η)n|U_{2}\dot{\cup}V_{2}|<n-\delta\leq(\frac{2}{5}+2\eta)n. On the other hand, by Lemma 11(), we have |U2||U_{2}|, |V2|>2δn15n4ηn|V_{2}|>2\delta-n\geq\frac{1}{5}n-4\eta n, and thus

|U2|,|V2|[(154η)n,(15+6η)n].|U_{2}|,|V_{2}|\in\big{[}(\tfrac{1}{5}-4\eta)n,(\tfrac{1}{5}+6\eta)n\big{]}\,.

Since d(u)δ(352η)nd(u)\geq\delta\geq(\tfrac{3}{5}-2\eta)n, we have |U1|δ|U2|(258η)n|U_{1}|\geq\delta-|U_{2}|\geq(\tfrac{2}{5}-8\eta)n. But no vertex in U2U_{2} is adjacent to any vertex in U1U_{1}. This implies that every vertex in U2U_{2} is adjacent to all but at most nδ|U1|10ηnn-\delta-|U_{1}|\leq 10\eta n vertices outside U1U_{1}. Since η<11000\eta<\tfrac{1}{1000} we have |U2|>20ηn|U_{2}|>20\eta n, so δ(G[U2])>|U2|/2\delta(G[U_{2}])>|U_{2}|/2, and by Proposition 12(), U2U_{2} contains a matching MuM_{u} with |U2|/2\lfloor|U_{2}|/2\rfloor edges. Since each vertex of U2U_{2} has at most 10ηn10\eta n non-neighbours outside U1U_{1}, each pair of vertices has common neighbourhood covering all but at most 20ηn20\eta n vertices of V(G)U1V(G)\setminus U_{1}. In particular, the common neighbourhood of each edge of MuM_{u} covers all but at most 20ηn20\eta n vertices of V(G)U1V(G)\setminus U_{1}. Similarly, V2V_{2} contains a matching MvM_{v} with |V2|/2\lfloor|V_{2}|/2\rfloor edges, and the common neighbourhood of each edge covers all but at most 20ηn20\eta n vertices of V(G)V1V(G)\setminus V_{1}.

Since 20ηn<|U2|/420\eta n<|U_{2}|/4 and U2V1=U_{2}\cap V_{1}=\emptyset, the common neighbourhood of each edge of MvM_{v} contains more than half of the edges of MuM_{u}. By symmetry, the reverse is also true. Thus all edges in Mu˙MvM_{u}\dot{\cup}M_{v} are in the same triangle component of GG. Finally, each edge of Mu˙MvM_{u}\dot{\cup}M_{v} has at least δ10ηn|U2˙V2|(1524η)n\delta-10\eta n-|U_{2}\dot{\cup}V_{2}|\geq(\tfrac{1}{5}-24\eta)n common neighbours outside U2˙V2U_{2}\dot{\cup}V_{2}. Choosing greedily for each edge of Mu˙MvM_{u}\dot{\cup}M_{v} in succession distinct common neighbours outside U2˙V2U_{2}\dot{\cup}V_{2}, we obtain a connected triangle factor with min(|U2|/2+|V2|/2,(1524η)n)=(1524η)n\min(\lfloor|U_{2}|/2\rfloor+\lfloor|V_{2}|/2\rfloor,(\tfrac{1}{5}-24\eta)n)=(\tfrac{1}{5}-24\eta)n triangles. But then CTF(G)(3572η)n>n/2>sp(n,δ+ηn)\operatorname{CTF}(G)\geq(\tfrac{3}{5}-72\eta)n>n/2>\operatorname{sp}(n,\delta+\eta n), a contradiction to (12). This proves Fact 17 for the case r=3r=3 and r=2r^{\prime}=2.

Case 2: r=4r=4 and r=3r^{\prime}=3. This implies that (472η)nδ(G)(47+η)n(\frac{4}{7}-2\eta)n\leq\delta(G)\leq(\frac{4}{7}+\eta)n, and consequently sp(n,δ+ηn)<(27+2η)n\operatorname{sp}(n,\delta+\eta n)<(\frac{2}{7}+2\eta)n. We first prove two statements about the structure of GG which are forced by (12).

  1. (Ψ\Psi)

    If a vertex uu has sets of neighbours UU, UU^{\prime} on edges in exactly two different triangle components with |U||U||U|\geq|U^{\prime}| then (174η)n<|U|<(17+6η)n(\frac{1}{7}-4\eta)n<|U^{\prime}|<(\frac{1}{7}+6\eta)n and (378η)n<|U|<(37+2η)n(\frac{3}{7}-8\eta)n<|U|<(\frac{3}{7}+2\eta)n.

Proof of (Ψ\Psi).

For the lower bound on |U||U^{\prime}|, observe that by () of Lemma 11 we have δ(G[U])2δn(174η)n\delta(G[U^{\prime}])\geq 2\delta-n\geq(\frac{1}{7}-4\eta)n. To obtain the upper bound, again by Lemma 11() we have δ(G[U])2δn\delta(G[U])\geq 2\delta-n, and since the sets UU and UU^{\prime} are neighbours of uu in different triangle components CC and CC^{\prime}, there are no edges from UU to UU^{\prime}. Furthermore, since any edge in G[U]G[U] forms a triangle with uu using an edge from uu to UU, all edges in G[U]G[U] are in CC. Now by Fact 15 we have

CTF(G)min(3|U|/2,3(2δn),2δn+|U|).\operatorname{CTF}(G)\geq\min(3\lfloor|U|/2\rfloor,3(2\delta-n),2\delta-n+|U^{\prime}|)\,.

Since |U|δ/2|U|\geq\delta/2 we have 3|U|/2(373η)n2>sp(n,δ+ηn)3\lfloor|U|/2\rfloor\geq(\tfrac{3}{7}-3\eta)n-2>\operatorname{sp}(n,\delta+\eta n). By (5) we have 3(2δn)>sp(n,δ+ηn)3(2\delta-n)>\operatorname{sp}(n,\delta+\eta n). Because (12) holds, we have 2δn+|U|<sp(n,δ+ηn)<(27+2η)n2\delta-n+|U^{\prime}|<\operatorname{sp}(n,\delta+\eta n)<(\frac{2}{7}+2\eta)n, and therefore |U|<(17+6η)n|U^{\prime}|<(\frac{1}{7}+6\eta)n. Now the claimed lower and upper bounds on |U||U| follow from U=Γ(u)UU=\Gamma(u)\setminus U^{\prime}, and from the fact that no vertex in UU^{\prime} has a neighbour in UU, respectively. ∎

  1. (Ξ\Xi)

    If a vertex uu has sets of neighbours U1U_{1}, U2U_{2}, U3U_{3} on edges in exactly three different triangle components then (421+2η)n>|Ui|>(4216η)n(\frac{4}{21}+2\eta)n>|U_{i}|>(\frac{4}{21}-6\eta)n for i[3]i\in[3].

Proof of (Ξ\Xi).

Assume that U1U_{1} is the largest of the three sets. By () of Lemma 11 we have δ(G[Ui])2δn(174η)n\delta(G[U_{i}])\geq 2\delta-n\geq(\frac{1}{7}-4\eta)n for each ii, so |Ui|>(174η)n|U_{i}|>(\frac{1}{7}-4\eta)n for each ii. As in the previous case, there can be no edge from U1U_{1} to U2˙U3U_{2}\dot{\cup}U_{3}, and all edges in U1U_{1} are triangle-connected. Thus by Fact 15 we have

CTF(G)min(3|U1|/2,3(2δn),2δn+|U2˙U3|).\operatorname{CTF}(G)\geq\min\big{(}3\lfloor|U_{1}|/2\rfloor,3(2\delta-n),2\delta-n+|U_{2}\dot{\cup}U_{3}|\big{)}\,.

Now since sp(n,δ+ηn)<(3710η)n\operatorname{sp}(n,\delta+\eta n)<(\tfrac{3}{7}-10\eta)n and (12) holds, we have

3|U1|/2<sp(n,δ+ηn)(27+2η)n3\lfloor|U_{1}|/2\rfloor<\operatorname{sp}(n,\delta+\eta n)\leq(\frac{2}{7}+2\eta)n\,

which implies |U1|<(421+2η)n|U_{1}|<(\frac{4}{21}+2\eta)n. Since |U2|,|U3||U1||U_{2}|,|U_{3}|\leq|U_{1}| this completes the desired upper bounds. The lower bounds follow from |U1|+|U2|+|U3|δ(472η)n|U_{1}|+|U_{2}|+|U_{3}|\geq\delta\geq(\tfrac{4}{7}-2\eta)n. ∎

Next we show that

  1. (Θ\Theta)

    int(G)\operatorname{int}(G) is an independent set.

Proof of (Θ\Theta).

Assume for a contradiction that there is an edge uvint(G)uv\in\operatorname{int}(G). By Fact 16 one of the vertices of this edge, say uu, is in only 22 triangle components. Let its neighbours be U1U_{1} and U2U_{2} in these two triangle components, and let the neighbours of vv be partitioned into sets V1,,VkV_{1},\dots,V_{k} according to the triangle component containing the edge to vv. Assume further that Γ(u,v)U1V1\Gamma(u,v)\subseteq U_{1}\cap V_{1}, so that U2,V2,,VkU_{2},V_{2},\ldots,V_{k} are pairwise disjoint. Let xΓ(u,v)x\in\Gamma(u,v). Since xx has neighbours in neither U2U_{2} nor V2V_{2}, and since by Lemma 11() we have |V2|>(174η)n|V_{2}|>(\frac{1}{7}-4\eta)n, we conclude that δd(x)n1|U2||V2|\delta\leq d(x)\leq n-1-|U_{2}|-|V_{2}|. In particular, |U2|<(378η)n|U_{2}|<(\frac{3}{7}-8\eta)n because δ(472η)n\delta\geq(\frac{4}{7}-2\eta)n, and therefore by (Ψ\Psi) we have

(174η)n<|U2|<(17+6η)n.(\frac{1}{7}-4\eta)n<|U_{2}|<(\frac{1}{7}+6\eta)n\,.

Next we want to derive analogous bounds for |V2||V_{2}|. For this purpose we first show that k=2k=2.

Indeed, if we had k=3k=3, then by (Ξ\Xi)

d(x)n1|U2||V2||V3|n1(174η)n2(4216η)n<(1021+16η)n<δ,\begin{split}d(x)&\leq n-1-|U_{2}|-|V_{2}|-|V_{3}|\\ &\leq n-1-(\tfrac{1}{7}-4\eta)n-2(\tfrac{4}{21}-6\eta)n<(\tfrac{10}{21}+16\eta)n<\delta\,,\end{split}

and this contradicts δ(G)δ\delta(G)\geq\delta. Similarly, if k4k\geq 4, then by Lemma 11() we have |Vi|(174η)n|V_{i}|\geq(\tfrac{1}{7}-4\eta)n for each ii, and hence

d(x)n1|U2||V2||V3||V4|<(37+16η)n<δ,d(x)\leq n-1-|U_{2}|-|V_{2}|-|V_{3}|-|V_{4}|<(\tfrac{3}{7}+16\eta)n<\delta\,,

which too is a contradiction. It follows that k=2k=2 as claimed.

Hence, we can argue analogously as before (for U2U_{2}) that |V2|>(378η)|V_{2}|>(\frac{3}{7}-8\eta) would contradict d(x)δd(x)\geq\delta. Consequently, by (Ψ\Psi) we have

(174η)n<|V2|<(17+6η)n.(\frac{1}{7}-4\eta)n<|V_{2}|<(\frac{1}{7}+6\eta)n\,.

We now argue that this yields a contradiction to (12) in much the same way as we argued in the r=r+1=3r=r^{\prime}+1=3 case. Every vertex of U2U_{2} is adjacent to all but at most n|U1|δ10ηnn-|U_{1}|-\delta\leq 10\eta n vertices of V(G)U1V(G)\setminus U_{1}. Since |U2|>20ηn|U_{2}|>20\eta n, by Proposition 12() there is a matching MuM_{u} in U2U_{2} covering all but at most one vertex of U2U_{2}. Each edge of MuM_{u} has at least δ10ηn(4712η)n\delta-10\eta n\geq(\tfrac{4}{7}-12\eta)n common neighbours outside U1U_{1}. Similarly, in V2V_{2} there is a matching MvM_{v} covering all but at most one vertex of V2V_{2}, each edge of which has at least (4712η)n(\tfrac{4}{7}-12\eta)n common neighbours outside V1V_{1}. Since Γ(u,v)=U1V1\Gamma(u,v)=U_{1}\cap V_{1}, we have U1V2=U_{1}\cap V_{2}=\emptyset. It follows that every edge of MvM_{v} has more than half of the edges of MuM_{u} in its common neighbourhood, and thus the edges Mu˙MvM_{u}\dot{\cup}M_{v} are triangle connected. Choosing greedily for each edge in Mu˙MvM_{u}\dot{\cup}M_{v} in succession a distinct common neighbour outside Mu˙MvM_{u}\dot{\cup}M_{v}, we obtain a connected triangle factor with as many triangles as there are edges in Mu˙MvM_{u}\dot{\cup}M_{v}. Since |U2|,|V2|>(174η)n|U_{2}|,|V_{2}|>(\frac{1}{7}-4\eta)n, we have CTF(G)>(3712η)n3>sp(n,δ+ηn)\operatorname{CTF}(G)>(\tfrac{3}{7}-12\eta)n-3>\operatorname{sp}(n,\delta+\eta n), contradicting (12). This completes the proof that int(G)\operatorname{int}(G) is an independent set. ∎

It remains to show that each vertex uint(G)u\in\operatorname{int}(G) is contained in at least r=3r^{\prime}=3 triangle components. Assume for a contradiction that this is not the case and that some vertex uu is only contained in 22 triangle components, CC and CC^{\prime}. Let UU and UU^{\prime}, respectively, be the neighbours of uu on edges in CC and CC^{\prime}. Without loss of generality |U||U||U|\geq|U^{\prime}|. Because int(G)\operatorname{int}(G) is an independent set, UU and UU^{\prime} are contained in the exteriors of CC and CC^{\prime}. By Lemma 11() there are thus no edges between UU and (C)\partial(C^{\prime}). By Lemma 11() we have δ(G[U])2δn\delta(G[U])\geq 2\delta-n, and since U(C)U\subseteq\partial(C) every edge of G[U]G[U] is in CC. It follows that we may apply Fact 15 to obtain

CTF(G)min(3|U|/2,3(2δn),2δn+|(C)|).\operatorname{CTF}(G)\geq\min\big{(}3\lfloor|U|/2\rfloor,3(2\delta-n),2\delta-n+|\partial(C^{\prime})|\big{)}\,.

Since |U|δ/2|U|\geq\delta/2 we have 3|U|/2(373η)n2>sp(n,δ+ηn)3\lfloor|U|/2\rfloor\geq(\tfrac{3}{7}-3\eta)n-2>\operatorname{sp}(n,\delta+\eta n). By (5) we have 3(2δn)>sp(n,δ+ηn)3(2\delta-n)>\operatorname{sp}(n,\delta+\eta n). Since (12) holds, we conclude that 2δn+|(C)|<sp(n,δ+ηn)<(27+2η)n2\delta-n+|\partial(C^{\prime})|<\operatorname{sp}(n,\delta+\eta n)<(\frac{2}{7}+2\eta)n, and therefore |(C)|<(17+6η)n|\partial(C^{\prime})|<(\frac{1}{7}+6\eta)n.

Now any vertex in (C)\partial(C^{\prime}) has neighbours only in (C)˙int(G)\partial(C^{\prime})\dot{\cup}\operatorname{int}(G), and therefore |int(G)|δ|(C)|(378η)n|\operatorname{int}(G)|\geq\delta-|\partial(C^{\prime})|\geq(\tfrac{3}{7}-8\eta)n. The vertex uu has neighbours only in U(C)U^{\prime}\subseteq\partial(C^{\prime}) and UU, and therefore

|U|δ|U|δ|(C)|(378η)n.|U|\geq\delta-|U^{\prime}|\geq\delta-|\partial(C^{\prime})|\geq(\tfrac{3}{7}-8\eta)n\,.

By Lemma 11() we have δ(G[U])2δn(174η)n\delta(G[U])\geq 2\delta-n\geq(\tfrac{1}{7}-4\eta)n, and since |U|>(278η)n|U|>(\tfrac{2}{7}-8\eta)n we obtain by Proposition 12() a matching MM in UU with at least (174η)n(\tfrac{1}{7}-4\eta)n edges. Now each vertex in int(G)\operatorname{int}(G) is adjacent to all but at most nδ|int(G)|10ηnn-\delta-|\operatorname{int}(G)|\leq 10\eta n vertices outside int(G)\operatorname{int}(G). In particular, each vertex in int(G)\operatorname{int}(G) is adjacent to all but at most 10ηn10\eta n vertices of MM, and is therefore a common neighbour of all but at most 10ηn10\eta n edges of MM. We now match greedily vertices of int(G)\operatorname{int}(G) with distinct edges of MM forming triangles. Since |int(G)|>|M||\operatorname{int}(G)|>|M|, we will be forced to halt only when we come to a vertex xint(G)x\in\operatorname{int}(G) which is not a common neighbour of any remaining edge of MM, i.e., when we have used all but at most 10ηn10\eta n edges of MM. It follows that we obtain a triangle factor TT with at least (1714η)n(\tfrac{1}{7}-14\eta)n triangles. Since each triangle uses an edge of MG[U]G[(C)]M\subseteq G[U]\subseteq G[\partial(C)], TT is a connected triangle factor, and we have CTF(G)(3742η)n>sp(n,δ+ηn)\operatorname{CTF}(G)\geq(\tfrac{3}{7}-42\eta)n>\operatorname{sp}(n,\delta+\eta n) in contradiction to (12). ∎

4. Near-extremal graphs

In this section we provide the proof of Lemma 10. To prepare this proof we start with two useful lemmas. The first will be used to construct squared paths and squared cycles from simple paths and cycles.

Lemma 20.

Given a graph GG, let T=(t1,t2,,t2l)T=(t_{1},t_{2},\ldots,t_{2l}) be a path in GG and WW a set of vertices disjoint from TT. Let Q1=(t1,t2)Q_{1}=(t_{1},t_{2}), Qi=(t2i3,t2i2,t2i1,t2i)Q_{i}=(t_{2i-3},t_{2i-2},t_{2i-1},t_{2i}) for all 1<il1<i\leq l, and Ql+1=(t2l1,t2l)Q_{l+1}=(t_{2l-1},t_{2l}). If there exists an ordering σ\sigma of [l+1][l+1] such that for each ii the vertices in Qσ(i)Q_{\sigma(i)} have at least ii common neighbours in WW, then there is a squared path

(q1,t1,t2,q2,t3,t4,q3,,t2,q+1)(q_{1},t_{1},t_{2},q_{2},t_{3},t_{4},q_{3},\ldots,t_{2\ell},q_{\ell+1})

in GG, with qiWq_{i}\in W for each ii, using every vertex of TT.

If TT is a cycle on 2l2l vertices we let instead Q1=(t2l1,t2l,t1,t2)Q_{1}=(t_{2l-1},t_{2l},t_{1},t_{2}), Qi=(t2i3,t2i2,t2i1,t2i)Q_{i}=(t_{2i-3},t_{2i-2},t_{2i-1},t_{2i}) for all 1<il1<i\leq l, and σ\sigma be an ordering on [l][l]. Then, under the same conditions, we obtain a squared cycle C3l2C_{3l}^{2}.

Proof.

We need only ensure that for each ii one can choose qiq_{i} such that qiq_{i} is a common neighbour of QiQ_{i} and the qiq_{i} are distinct. This is possible by choosing for each ii in succession qσ(i)q_{\sigma(i)} to be any so far unused common neighbour of Qσ(i)Q_{\sigma(i)}. ∎

The second lemma is a variant on Dirac’s theorem and permits us to construct paths and cycles of desired lengths which keep some ‘bad’ vertices far apart.

Lemma 21.

Let HH be a graph on hh vertices and BV(H)B\subseteq V(H) be of size at most h/100h/100. Suppose that every vertex in BB has at least 9|B|9|B| neighbours in HH, and every vertex outside BB has at least h/2+9|B|+10h/2+9|B|+10 neighbours in HH. Then for any given 3h3\leq\ell\leq h we can find a cycle TT_{\ell} of length \ell in HH on which no four consecutive vertices contain more than one vertex of BB. Furthermore, if xx and yy are any two vertices not in BB and 5h5\leq\ell\leq h, we can find an \ell-vertex path TT_{\ell} whose endvertices are xx and yy on which no four consecutive vertices contain more than one vertex of B{x,y}B\cup\{x,y\}.

Proof.

If we seek a path in HH from xx to yy then we create a ‘dummy edge’ between xx and yy. If we seek a cycle, let xyxy be any edge of HBH-B.

First we construct a path PP in HH covering BB with the desired property. Let B={b1,b2,,b|B|}B=\{b_{1},b_{2},\ldots,b_{|B|}\}. For each 1i|B|11\leq i\leq|B|-1, choose a vertex uiHBu_{i}\in H-B adjacent to bib_{i} and a vertex viHBv_{i}\in H-B adjacent to bi+1b_{i+1}. Because both uiu_{i} and viv_{i} have h/2+9|B|+10h/2+9|B|+10 neighbours in HH, they have at least 18|B|+2018|B|+20 common neighbours. At most 3|B|3|B| of these are either in BB or amongst the chosen uj,vju_{j},v_{j}, and so we can find a so far unused vertex wiw_{i} adjacent to uiu_{i} and viv_{i}. Since we require only |B|1|B|-1 vertices w1,,w|B|1w_{1},\ldots,w_{|B|-1} we can pick the vertices greedily.

We let v0v_{0} be yet another vertex adjacent to b1b_{1}, and u|B|u_{|B|} adjacent to b|B|b_{|B|}, and choose any further vertices w0,v0,w|B|,u|B|w_{0},v_{0},w_{|B|},u_{|B|} such that

P=(x,y,u0,w0,v0,b1,u1,w1,v1,b2,,v|B|1,b|B|,u|B|,w|B|,v|B|)P=(x,y,u_{0},w_{0},v_{0},b_{1},u_{1},w_{1},v_{1},b_{2},\ldots,v_{|B|-1},b_{|B|},u_{|B|},w_{|B|},v_{|B|})

is a path on 4|B|+54|B|+5 vertices.

Now we let PP^{\prime} be a path extending PP in HH of maximum length. We claim that PP^{\prime} is in fact spanning. Suppose not: let uu be an end-vertex of PP^{\prime} and vv a vertex not on PP^{\prime}. Since PP^{\prime} is maximal every neighbour of uu is on PP^{\prime}, so v(P)>h/2+9|B|+10v(P^{\prime})>h/2+9|B|+10. If there existed an edge uvu^{\prime}v^{\prime} of PPP^{\prime}-P with uuu^{\prime}u and vvv^{\prime}v edges of HH, with vv^{\prime} closer to uu on PP^{\prime} than uu^{\prime}, then we would have a longer path extending PP in HH. Counting the edges leaving uu and vv yields a contradiction.

Finally we let uu and vv be the end-vertices of the spanning path PP^{\prime}. If uvuv is an edge of HH, or if uvu^{\prime}v^{\prime} is an edge of PPP^{\prime}-P, with uu^{\prime} nearer to uu on PP^{\prime} than vv^{\prime}, such that uvuv^{\prime} and uvu^{\prime}v are edges of HH, then we obtain a cycle TT spanning HH and containing PP as a subpath. Again edge counting reveals that such an edge must exist.

To obtain a cycle TT_{\ell} with h|B|2<hh-|B|-2\leq\ell<h we take uu to be an end-vertex of the path TPT-P and vv its successor on TPT-P. If we can find two further vertices uu^{\prime} and vv^{\prime} on TPT-P (in that order from uu along TPT-P) with hh-\ell vertices between them and with uuuu^{\prime} and vvvv^{\prime} edges of HH then we would obtain a cycle TT_{\ell} of length \ell. Again simple edge counting reveals that such a pair of vertices exists. To obtain a cycle TT_{\ell} with 3<h|B|23\leq\ell<h-|B|-2 we note that HBH-B has minimum degree h/2+8|B|+10>(h|B|)/2+1h/2+8|B|+10>(h-|B|)/2+1 and thus contains a cycle of every possible length using the edge xyxy.

The cycle TT_{\ell} satisfies the condition that no four consecutive vertices contain more than one vertex of BB, since either it preserves PP as a subpath or it contains no vertices of BB at all. Similarly the path from xx to yy within TT_{\ell} satisfies the required conditions. ∎

Before embarking upon the proof of Lemma 10 we give an outline of the method. We recall that the Szemerédi partition supplied to the Lemma is essentially the extremal structure. We shall show that the underlying graph either also has an extremal structure, or possesses features which actually lead to longer squared paths and cycles than required for the conclusion of the Lemma. This is complicated by the fact that the Szemerédi partition is insensitive both to mis-assignment of a sublinear number of vertices and to editing of a subquadratic number of edges: we must assume, for example, that although the vertex set II in the reduced graph RR is independent, the vertex set I\bigcup I may fail to contain some vertices of GG with no neighbours in I\bigcup I, and may contain a small number of edges meeting every vertex. However, observe that by the definition of an (ε,d)(\varepsilon,d)-regular partition, there are no vertices of I\bigcup I with more than (ε+d)n(\varepsilon+d)n neighbours in I\bigcup I. Fortunately, it is possible to apply the following strategy in this case.

We start by separating those vertices with ‘few’ neighbours in I\bigcup I, which we shall collect in a set WW, and those with ‘many’. We are then able to show (as Fact 23 below) that, if there are two vertex disjoint edges in WW, then the sets B1\bigcup B_{1} and B2\bigcup B_{2} are in the same triangle component of GG (‘unexpectedly’, since B1B_{1} and B2B_{2} are in different triangle components in RR). We shall show that in this case it is possible to construct very long squared paths and cycles by making use of Lemma 8.

Hence we can assume that there are not two disjoint edges in WW, which in turn implies that WW is almost independent and will give us rather precise control about the size of WW. In addition, the minimum degree condition will guarantee that almost every edge from WW to the remainder of GG is present. We would like to then say that in V(G)WV(G)\setminus W we can find a long path, which together with vertices from WW forms a squared path (and similarly for squared cycles). Unfortunately since G[W,V(G)W]G[W,V(G)\setminus W] is not necessarily a complete bipartite graph, this statement is not obviously true: although by definition no vertex outside WW has very few neighbours in WW, it is certainly possible that two vertices outside WW could fail to have a common neighbour in WW. But the statement is true for a path possessing sufficiently nice properties—specifically, satisfying the conditions of Lemma 20—and the purpose of Lemma 21 is to provide paths and cycles with those nice properties. The remainder of our proof, then, consists of setting up conditions for the application of Lemma 21.

Proof of Lemma 10.

Given ν>0\nu>0, suppose the parameters η>0\eta>0 and d>0d>0 satisfy the following inequalities.

(14) ην4108anddν4108\eta\leq\frac{\nu^{4}}{10^{8}}\qquad\text{and}\qquad d\leq\frac{\nu^{4}}{10^{8}}

Given d>0d>0, Lemma 8 returns a constant εel>0\varepsilon_{\textsc{{el}}}>0. We set

(15) ε0=min(ν4108,εel).\varepsilon_{0}=\min\big{(}\frac{\nu^{4}}{10^{8}},\varepsilon_{\textsc{{el}}}\big{)}\,.

Given melm_{\textsc{{el}}} and 0<ε<ε00<\varepsilon<\varepsilon_{0}, Lemma 8 returns a constant neln_{\textsc{{el}}}. We set

(16) N=max(1000mel4,100η1ν1,nel).N=\max\big{(}1000m_{\textsc{{el}}}^{4},100\eta^{-1}\nu^{-1},n_{\textsc{{el}}}\big{)}\,.

Now let GG, RR, and the partition V(R)=I˙B1˙˙BkV(R)=I\dot{\cup}B_{1}\dot{\cup}\dots\dot{\cup}B_{k} satisfy conditions ()() of the lemma.

If δ(G)=δ2n13\delta(G)=\delta\geq\frac{2n-1}{3} then we can appeal to Theorem 1 to find a spanning squared path in GG; if δ2n3\delta\geq\frac{2n}{3} then we can appeal to Theorem 2 to find C2C^{2}_{\ell} for each [3,n]{5}\ell\in[3,n]\setminus\{5\}. Therefore, the definition of sp(n,δ)\operatorname{sp}(n,\delta) and sc(n,δ)\operatorname{sc}(n,\delta) imply that we may assume δ<2n/3\delta<2n/3 in the following, and that we only need to find

(17) squared paths and squared cycles of length at most 11n/2011n/20.

We now start by investigating the sizes of II and of the BiB_{i}. Define δ=(δ/ndε)m\delta^{\prime}=(\delta/n-d-\varepsilon)m. Since RR is an (ε,d)(\varepsilon,d)-reduced graph we have

(18) δ(R)δ=(δ/ndε)m.\delta(R)\geq\delta^{\prime}=(\delta/n-d-\varepsilon)m\,.

Observe that moreover

(19) |I|mδ(1δn+d+ε)m,|I|\leq m-\delta^{\prime}\leq\Big{(}1-\frac{\delta}{n}+d+\varepsilon\Big{)}m\;,

by () because clusters in II have δ\delta^{\prime} neighbours outside II in RR. For i[k]i\in[k], fix a cluster CBiC\in B_{i}. By assumption () CC has neighbours only in BiIB_{i}\cup I in RR. Since

δdeg(C)=deg(C,BiI)deg(C,Bi)+|I|deg(C,Bi)+mδ,\delta^{\prime}\leq\deg(C)=\deg(C,B_{i}\cup I)\leq\deg(C,B_{i})+|I|\leq\deg(C,B_{i})+m-\delta^{\prime}\,,

we have

|Bi|>deg(C,Bi)2δmmn(2(δdnεn)n)=mn(2δn(d+ε)n).\begin{split}|B_{i}|&>\deg(C,B_{i})\geq 2\delta^{\prime}-m\geq\frac{m}{n}\big{(}2(\delta-dn-\varepsilon n)-n\big{)}\\ &=\frac{m}{n}\big{(}2\delta-n-(d+\varepsilon)n\big{)}\,.\end{split}

Now since 2δn2νn2\delta-n\geq 2\nu n by (), we conclude from (14) and (15) that

(20) |Bi|2m(2δn)3n43νm.|B_{i}|\geq\frac{2m(2\delta-n)}{3n}\geq\frac{4}{3}\nu m\;.

We next show that each BiB_{i} is part of exactly one triangle component of RR.

Fact 22.

For each 1ik1\leq i\leq k the following holds. All edges in R[Bi]R[B_{i}] are triangle connected in RR.

Proof of Fact 22.

By assumption () we have

(21) |Bi|19m(2δn)/(10n)39(2δm)/20,|B_{i}|\leq 19m(2\delta-n)/(10n)\leq 39(2\delta^{\prime}-m)/20\,,

where the second inequality comes from (14) and (15). Since we have δR(Bi)2δm>|Bi|/2\delta_{R}(B_{i})\geq 2\delta^{\prime}-m>|B_{i}|/2, the graph R[Bi]R[B_{i}] is connected. It follows that if there are two edges in R[Bi]R[B_{i}] which are not triangle-connected, then there are two adjacent edges in R[Bi]R[B_{i}] with this property. That is, there are vertices PPQQ and QQ^{\prime} of BiB_{i} such that PQPQ is in triangle component CC and PQPQ^{\prime} is in triangle component CC^{\prime} with CCC\neq C^{\prime}.

We now show that there are at least 2δm2\delta^{\prime}-m edges leaving PP in R[Bi]R[B_{i}] which are in CC. There are two possibilities. First, suppose there are no CC-edges from PP to II. In this case, the common neighbourhood Γ(PQ)\Gamma(PQ) must lie entirely in BiB_{i}. Every vertex of Γ(PQ)\Gamma(PQ) makes a CC-edge with PP, and we have |Γ(PQ)|2δm|\Gamma(PQ)|\geq 2\delta^{\prime}-m as required. Second, suppose that there is a CC-edge PPPP^{\prime} with PIP^{\prime}\in I. Since II is an independent set in RR, the set Γ(PP)\Gamma(PP^{\prime}) lies entirely within BiB_{i}, and has size at least 2δm2\delta^{\prime}-m. Again, every edge from PP to Γ(PP)\Gamma(PP^{\prime}) is a CC-edge, as desired.

By the identical argument, there are at least 2δm2\delta^{\prime}-m edges leaving PP in R[Bi]R[B_{i}] which are in CC^{\prime}. Since no edge is in both CC and CC^{\prime}, there are at least 2(2δm)2(2\delta^{\prime}-m) edges leaving PP in R[Bi]R[B_{i}], so |Bi|2(2δm)|B_{i}|\geq 2(2\delta^{\prime}-m). This contradicts (21). It follows that all edges of BiB_{i} are triangle connected, as desired. ∎

We next define a set WW of those vertices in GG which have few neighbours in I\bigcup I. The intuition is that WW consists of I\bigcup I and only a few more vertices of GG. To simplify notation, we set ξ=ε+d+11η4\xi=\sqrt[4]{\varepsilon+d+11\eta}. By (14) and (15), we have

(22) ξν/100.\xi\leq\nu/100\,.

Let WW be the vertices of GG which do not have more than ξn\xi n neighbours in I\bigcup I. Since ξ>d+ε\xi>d+\varepsilon, by the independence of II and by the definition of an (ε,d)(\varepsilon,d)-regular partition, we have IW\bigcup I\subseteq W. By assumption () we have |I|(nδ11ηn)m/n|I|\geq(n-\delta-11\eta n)m/n. Hence every edge in WW has at least

(23) 2(δξn)(n|I|)>δ(2δn)162(\delta-\xi n)-\Big{(}n-|\bigcup I|\Big{)}>\frac{\delta-(2\delta-n)}{16}

common neighbours outside I\bigcup I, where we use assumption () that 2δn>2νn2\delta-n>2\nu n, (14) and (22).

By this observation, the next fact implies that we are done if there are two vertex disjoint edges in WW.

Fact 23.

If u1v1u_{1}v_{1} and u2v2u_{2}v_{2} are vertex disjoint edges of GG such that for i=1,2i=1,2 the edge uiviu_{i}v_{i} has at least δ(2δn)/16\delta-(2\delta-n)/16 common neighbours outside I\bigcup I, then GG contains Psp(n,δ)2P^{2}_{\operatorname{sp}(n,\delta)} and C2C^{2}_{\ell} for each [3,sc(n,δ)]{5}\ell\in[3,\operatorname{sc}(n,\delta)]\setminus\{5\}.

Proof of Fact 23.

Let DD^{\prime} be the set of clusters CV(R)IC\in V(R)\setminus I such that u1v1u_{1}v_{1} has at most 2dn/m2dn/m common neighbours in CC. By the hypothesis, u1v1u_{1}v_{1} has at least δ(2δn)/16\delta-(2\delta-n)/16 common neighbours outside I\bigcup I. Of these, at most εn\varepsilon n are in the exceptional set V0V_{0} of the regular partition, and at most 2dn|D|/m2dn|D^{\prime}|/m are in D\bigcup D^{\prime}. The remaining common neighbours must all lie in (V(R)(ID)\bigcup(V(R)\setminus(I\cup D^{\prime}), and hence we have the inequality

δ2δn16εn2dn|D|m(m|I||D|)nm()n(nδ11ηn)|D|nm.\begin{split}\delta-\frac{2\delta-n}{16}-\varepsilon n-\frac{2dn|D^{\prime}|}{m}&\leq(m-|I|-|D^{\prime}|)\frac{n}{m}\\ &\overset{\mbox{\tiny{\ref{ext:assm5}}}}{\leq}n-(n-\delta-11\eta n)-|D^{\prime}|\frac{n}{m}\,.\end{split}

Simplifying this, we obtain

n2dnm|D|11ηn+εn+2δn16,\frac{n-2dn}{m}|D^{\prime}|\leq 11\eta n+\varepsilon n+\frac{2\delta-n}{16}\,,

and by (14) and (15), we get |D|(2δn)m/(14n)|D^{\prime}|\leq(2\delta-n)m/(14n).

Now let DD be the set of clusters CV(R)IC\in V(R)\setminus I such that either u1v1u_{1}v_{1} or u2v2u_{2}v_{2} has at most 2dn/m2dn/m common neighbours in CC. The same analysis holds for u2v2u_{2}v_{2}, so we obtain

(24) |D|(2δn)m7n.|D|\leq\frac{(2\delta-n)m}{7n}\,.

Therefore, we conclude from (20) that B1DB_{1}\setminus D\neq\emptyset. Take XB1DX\in B_{1}\setminus D arbitrarily. We have

deg(X,B1)()deg(X)|I|δ|I|(19)δ(1δn+d+ε)m(18)(δndε)m(1δn+d+ε)m(14),(15)12(2δn)mn>(24)|D|.\begin{split}\deg(X,B_{1})&\overset{\mbox{\tiny{\ref{ext:assm6}}}}{\geq}\deg(X)-|I|\geq\delta^{\prime}-|I|\overset{\mbox{\tiny{\eqref{eq:IisSmall}}}}{\geq}\delta^{\prime}-\Big{(}1-\frac{\delta}{n}+d+\varepsilon\Big{)}m\\ &\overset{\mbox{\tiny{\eqref{eq:ext:delta'}}}}{\geq}\Big{(}\frac{\delta}{n}-d-\varepsilon\Big{)}m-\Big{(}1-\frac{\delta}{n}+d+\varepsilon\Big{)}m\\ &\!\!\!\!\overset{\mbox{\tiny{\eqref{ext:chooseetad},\eqref{ext:chooseeps}}}}{\geq}\frac{1}{2}(2\delta-n)\frac{m}{n}\overset{\mbox{\tiny{\eqref{eq:ext:D}}}}{>}|D|\,.\end{split}

Thus there exists a cluster YΓ(X)(B1D)Y\in\Gamma(X)\cap(B_{1}\setminus D). Hence we have clusters X,YB1DX,Y\in B_{1}\setminus D such that XYE(R)XY\in E(R). Analogously, we can find clusters X,YB2DX^{\prime},Y^{\prime}\in B_{2}\setminus D such that XYE(R)X^{\prime}Y^{\prime}\in E(R).

Since δR(B1),δR(B2)δ|I|2δm\delta_{R}(B_{1}),\delta_{R}(B_{2})\geq\delta^{\prime}-|I|\geq 2\delta^{\prime}-m, we can find greedily a matching MM in R[B1B2]R[B_{1}\cup B_{2}] with δ|I|\delta^{\prime}-|I| edges. Since every cluster in II has at most m|I|δm-|I|-\delta^{\prime} non-neighbours outside II, every cluster in II forms a triangle with at least |M|(m|I|δ)=2δm|M|-(m-|I|-\delta^{\prime})=2\delta^{\prime}-m edges of MM. In addition, by assumption (), (14), and since δ<2n/3\delta<2n/3 we have |I|>(1311η)m14m|I|>(\frac{1}{3}-11\eta)m\geq\frac{1}{4}m. Therefore we may choose greedily clusters in II to obtain a set TT of at least

min{2δm,|I|}min{2δm,14m}\min\big{\{}2\delta^{\prime}-m,|I|\big{\}}\geq\min\Big{\{}2\delta^{\prime}-m,\frac{1}{4}m\Big{\}}

vertex-disjoint triangles formed from edges of MM and clusters of II. Let T1T_{1} be the triangles of TT contained in B1IB_{1}\cup I, and T2T_{2} those contained in B2IB_{2}\cup I.

By Fact 22, since each triangle in T1T_{1} contains an edge of B1B_{1}, all triangles in T1T_{1} are in the same triangle component as the edge XYXY. Similarly all the triangles in T2T_{2} are in the same triangle component as the edge XYX^{\prime}Y^{\prime}.

Noting that ε\varepsilon satisfies (15) and n>Nn>N satisfies (16), we can apply Lemma 8 with X1=X2=XX_{1}=X_{2}=X, Y1=Y2=YY_{1}=Y_{2}=Y to find a squared path starting with u1v1u_{1}v_{1} and finishing with u2v2u_{2}v_{2} using the triangles T1T_{1}. Similarly, using Lemma 8 with X1=X2=XX_{1}=X_{2}=X^{\prime}, Y1=Y2=YY_{1}=Y_{2}=Y^{\prime} we find a squared path (intersecting the first only at u1u_{1}, v1v_{1}, u2u_{2}, and v2v_{2}) starting with u2v2u_{2}v_{2} and finishing with u1v1u_{1}v_{1} using the triangles T2T_{2}. Choosing appropriate lengths for these squared paths and concatenating them we get a squared cycle C2C^{2}_{\ell} in GG, for any 36(mel+2)33(1d)min{2δm,m/4}n/m36(m_{\textsc{{el}}}+2)^{3}\leq\ell\leq 3(1-d)\min\{2\delta^{\prime}-m,m/4\}n/m. Applying Lemma 8 to the copy of K4K_{4} in B1B_{1} directly we obtain C2C^{2}_{\ell} for each [3,3n/m]{5}\ell\in[3,3n/m]\setminus\{5\}. By (16) we have 3n/m>36(mel+2)33n/m>36(m_{\textsc{{el}}}+2)^{3}, and by (5), (14), (15), and (17) we have 3(1d)(2δm)n/m>sp(n,δ)sc(n,δ)3(1-d)(2\delta^{\prime}-m)n/m>\operatorname{sp}(n,\delta)\geq\operatorname{sc}(n,\delta) and 3(1d)n/411n/20>sp(n,δ)sc(n,δ)3(1-d)n/4\geq 11n/20>\operatorname{sp}(n,\delta)\geq\operatorname{sc}(n,\delta). It follows that GG contains both Psp(n,δ)2P^{2}_{\operatorname{sp}(n,\delta)} and C2C^{2}_{\ell} for each [3,sc(n,δ)]{5}\ell\in[3,\operatorname{sc}(n,\delta)]\setminus\{5\} as required. ∎

By (23), if there are two vertex disjoint edges in WW, then we are done by Fact 23. Thus we assume in the following that no such two edges exist. This implies that there are two vertices in WW which meet every edge in WW. Since neither of these two vertices has more than ξn\xi n neighbours in IW\bigcup I\subseteq W, while |I|>(1311η)m|I|>(\frac{1}{3}-11\eta)m by () and because δ<2n/3\delta<2n/3, there is a vertex in WW adjacent to no vertex of WW. We conclude that

(25) nδ11ηn|I||W|nδ.n-\delta-11\eta n\leq|\bigcup I|\leq|W|\leq n-\delta.

Our next goal is to extract from each set Bi\bigcup B_{i} a large set AiA_{i} of vertices which are adjacent to almost all vertices in WW and are such that G[Ai]G[A_{i}] has minimum degree somewhat above |Ai|/2|A_{i}|/2. Because at least |W|δ2|W||W|\delta-2|W| edges leave WW, the total number of non-edges between WW and V(G)WV(G)\setminus W is at most

|W||V(G)W||W|(δ2)(nδ)(δ+11ηnδ+2)(25)11ηn2+2n.|W||V(G)\setminus W|-|W|(\delta-2)\leq(n-\delta)(\delta+11\eta n-\delta+2)\overset{\mbox{\tiny{\eqref{eq:ext-like:W}}}}{\leq}11\eta n^{2}+2n\,.

In particular, by the definition of ξ\xi, by (14) and (16),

(26) |{vV(G)W:deg(v,W)<|W|ξ2n}|ξ2n.\Big{|}\big{\{}v\in V(G)\setminus W\colon\deg(v,W)<|W|-\xi^{2}n\big{\}}\Big{|}\leq\xi^{2}n\,.

In addition, by assumption () we have |Bi|19m(2δn)/(10n)|B_{i}|\leq 19m(2\delta-n)/(10n), which together with δ2n/3\delta\leq 2n/3, (14), (15) and (22) implies

(27) |Bi|1910(2δn)1920δ<δξn(d+ε)n.\Big{|}\bigcup B_{i}\Big{|}\leq\frac{19}{10}(2\delta-n)\leq\frac{19}{20}\delta<\delta-\xi n-(d+\varepsilon)n\,.

However, by assumption () and the definition of an (ε,d)(\varepsilon,d)-regular partition, vertices in Bi\bigcup B_{i} send at most (d+ε)n(d+\varepsilon)n edges to V(G)BiIV(G)-\bigcup B_{i}-\bigcup I. It follows from the definition of WW that

BiW=for all i[k].\bigcup B_{i}\cap W=\emptyset\qquad\text{for all $i\in[k]$}\,.

Furthermore, (14), (15) and (22) imply that vBiv\in\bigcup B_{i} has at least

(28) δ|W|(d+ε)n(25)2δn(d+ε)n>(27)|Bi|/2+32ξ2n\delta-|W|-(d+\varepsilon)n\overset{\mbox{\tiny{\eqref{eq:ext-like:W}}}}{\geq}2\delta-n-(d+\varepsilon)n\overset{\mbox{\tiny{\eqref{eq:ext:bigcupB}}}}{>}|\bigcup B_{i}|/2+32\xi^{2}n

neighbours in Bi\bigcup B_{i}.

Now, for each i[k]i\in[k] we let AiA_{i} be the set of vertices in Bi\bigcup B_{i} which are adjacent to at least |W|ξ2n|W|-\xi^{2}n vertices of WW. In the rest of this paragraph we determine some important properties of the sets AiA_{i}. By (26) we have

(29) |i[k](Bi)Ai|ξ2nfor all i[k].\Big{|}\bigcup_{i\in[k]}\big{(}\bigcup B_{i}\big{)}\setminus A_{i}\big{|}\leq\xi^{2}n\qquad\text{for all $i\in[k]$}\,.

But the vertices which are neither in WW nor any of the sets AiA_{i} must be either in the exceptional set V0V_{0} or in BiAi\bigcup B_{i}\setminus A_{i} for some ii. Hence we have

(30) |V0i[k](Bi)Ai|εn+ξ2n<2ξ2n.\Big{|}V_{0}\cup\bigcup_{i\in[k]}\big{(}\bigcup B_{i}\big{)}\setminus A_{i}\Big{|}\leq\varepsilon n+\xi^{2}n<2\xi^{2}n\,.

Accordingly (28) implies that

(31) δ(G[Ai])|Ai|/2+30ξ2n,\delta(G[A_{i}])\geq|A_{i}|/2+30\xi^{2}n\,,

and since |Bi|>δ|I|2δm|B_{i}|>\delta^{\prime}-|I|\geq 2\delta^{\prime}-m we have

(32) |Ai||Bi|2ξ2n(1ε)nm|Bi|2ξ2n2δn3ξ2n|A_{i}|\geq|\bigcup B_{i}|-2\xi^{2}n\geq(1-\varepsilon)\frac{n}{m}|B_{i}|-2\xi^{2}n\geq 2\delta-n-3\xi^{2}n

for each i[k]i\in[k], where we used the definition of ξ\xi, (14), 15, and (18) in the last inequality.

In the remainder of the proof we utilize the sets AiA_{i} in order to find the desired squared path and squared cycles. We start by showing that we obtain squared cycles on \ell vertices for each [3,32|A1|]{5}\ell\in[3,\frac{3}{2}|A_{1}|]\setminus\{5\}. To see this note first that by Lemma 21 (with B=B=\emptyset) we find in A1A_{1} a copy of C2C_{2\ell^{\prime}} for each 2[4,min{|A1|,2n4}]2\ell^{\prime}\in\big{[}4,\min\{|A_{1}|,2\frac{n}{4}\}\big{]}. By the definition of A1A_{1} every quadruple of consecutive vertices on such a cycle has at least |W|4ξ2n|W|-4\xi^{2}n common neighbours in WW, and by the definition of ξ\xi, (14), (15), and (25) we have |W|4ξ2nn/4|W|-4\xi^{2}n\geq n/4. Hence we can apply Lemma 20 to GG and WW to square this cycle. This gives us squared cycles of lengths \ell with

3min{32|A1|,3n4}=(17)32|A1|3\leq\ell\leq\min\Big{\{}\frac{3}{2}|A_{1}|,3\frac{n}{4}\Big{\}}\overset{\mbox{\tiny{\eqref{eq:ext:1120}}}}{=}\frac{3}{2}|A_{1}|

such that \ell is divisible by three, but not of lengths not divisible by three.

If we seek a squared cycle C3+12C^{2}_{3\ell^{\prime}+1} or C3+22C^{2}_{3\ell^{\prime}+2} (with 3+253\ell^{\prime}+2\neq 5) then we need to perform a process which we will call parity correction and which we explain in the following two paragraphs. We shall use this parity correction process also in all remaining steps of the proof to obtain squared cycles of lengths not divisible by 33.

For obtaining a squared cycle of length 3+13\ell^{\prime}+1 we proceed as follows. We pick (using Turán’s theorem) a triangle abcabc in A1A_{1} and clone the vertex bb, i.e., we insert a dummy vertex bb^{\prime} into GG with the same adjacencies as bb. Then we apply Lemma 21 to A1{b}A_{1}-\{b\} to find a path P=(a,p2,p3,,p21,c)P=(a,p_{2},p_{3},\ldots,p_{2\ell^{\prime}-1},c) on 22\ell^{\prime} vertices whose end-vertices are aa and cc. Finally we apply Lemma 20 to the path bPbbPb^{\prime}, taking Q1=(b,a)Q_{1}=(b,a), Q2=(b,a,p2,p3)Q_{2}=(b,a,p_{2},p_{3}) as the first quadruple and thereafter every other set of four consecutive vertices on PP, finishing with (p22,p21,c,b)(p_{2\ell^{\prime}-2},p_{2\ell^{\prime}-1},c,b^{\prime}). This yields a squared path (q1,b,a,,c,b)(q_{1},b,a,\ldots,c,b^{\prime}) on 3(+1)3(\ell^{\prime}+1) vertices, which gives a squared cycle (b,a,,c)(b,a,\ldots,c) in GG (without q1q_{1} and the clone vertex bb^{\prime}) on 3+13\ell^{\prime}+1 vertices as required.

If we seek a squared cycle of length 3+23\ell^{\prime}+2 with >1\ell^{\prime}>1 on the other hand, then we perform a similar process, except that we identify not one triangle in A1A_{1} but two triangles abcabc, xyzxyz connected with an edge cxcx (which we obtain by the Erdős-Stone theorem). We apply Lemma 21 to find a path P=(a,,z)P=(a,\ldots,z) in A1{b,c,y,z}A_{1}\setminus\{b,c,y,z\} on 22\ell^{\prime} vertices. We then apply Lemma 20 once to the path bPybPy and once to (b,c,x,y)(b,c,x,y). Omitting the first vertex on each of the resulting squared paths and concatenating, we get a squared cycle C3+22C^{2}_{3\ell^{\prime}+2}.

Hence we do indeed obtain squared cycles C2C^{2}_{\ell} for all [3,32|A1|]{5}\ell\in[3,\frac{3}{2}|A_{1}|]\setminus\{5\}. It remains to show that we can also find C2C^{2}_{\ell} for all \ell with 32|A1|sc(n,δ)\frac{3}{2}|A_{1}|\leq\ell\leq\operatorname{sc}(n,\delta) and that we can find Psp(n,δ)2P^{2}_{\operatorname{sp}(n,\delta)}.

For this purpose, we first re-incorporate the vertices that are neither in WW nor in any of the sets AiA_{i} by examining in which of the AiA_{i} they have many neighbours. More precisely, for each i[k]i\in[k], we let XiX_{i} be AiA_{i} together with all vertices in V(G)WV(G)\setminus W which are adjacent to at least 30ξ2n30\xi^{2}n vertices of AiA_{i}. Because every vertex in V(G)WV(G)\setminus W has at least δ|W|\delta-|W| neighbours outside WW, by (25) every vertex in GWG-W is in XiX_{i} for at least one ii. Moreover, by the definition of an (ε,d)(\varepsilon,d)-regular partition, assumption () and since AjBjA_{j}\subseteq\bigcup B_{j}, we have for all j[k]j\in[k] with jij\neq i that

(33) AjXi=.A_{j}\cap X_{i}=\emptyset\,.

Hence it follows from (30) that

(34) |Xi|<|Ai|+2ξ2nand|X1A1|2ξ2n.|X_{i}|<|A_{i}|+2\xi^{2}n\qquad\text{and}\qquad|X_{1}-A_{1}|\leq 2\xi^{2}n\,.

We finish the proof by distinguishing three cases.

Case 1: |XiXj|2|X_{i}\cap X_{j}|\geq 2 for some iji\neq j. Let v1v_{1} and v2v_{2} be distinct vertices of XiXjX_{i}\cap X_{j}. Let u1u_{1} and u2u_{2} be distinct neighbours in AiA_{i} of v1v_{1} and v2v_{2} respectively, and similarly y1y_{1} and y2y_{2} in AjA_{j}. Applying Lemma 21 to AiA_{i} we can find a path from u1u_{1} to u2u_{2} of length \ell^{\prime} for any 4|Ai|24\leq\ell^{\prime}\leq|A_{i}|-2. We can find a similar path in AjA_{j} from y1y_{1} to y2y_{2}. Concatenating these paths with v1v_{1} and v2v_{2} we can find a 22\ell^{\prime}-vertex cycle T2T_{2\ell^{\prime}} in X1X2X_{1}\cup X_{2} for any 102|Ai|+|Aj|210\leq 2\ell^{\prime}\leq|A_{i}|+|A_{j}|-2. There are no quadruples of consecutive vertices on T2T_{2\ell^{\prime}} using both v1v_{1} and v2v_{2}. The four quadruples that use either v1v_{1} or v2v_{2} each have at least (ξ3ξ2)n>100k(\xi-3\xi^{2})n>100k common neighbours in WW, where the inequality follows from (16), (22), from

(35) kν1,k\leq\nu^{-1}\,,

and from ξ3ξ2>0\xi-3\xi^{2}>0. All other quadruples have at least |W|4ξ2n|W|-4\xi^{2}n common neighbours in WW. So applying Lemma 20 we obtain a squared cycle on 33\ell^{\prime} vertices. Again it is possible to perform parity corrections (prior to applying Lemma 21) so that in this case we have C2GC^{2}_{\ell}\subseteq G for every [3,32(|Ai|+|Aj|10)]{5}\ell\in[3,\frac{3}{2}(|A_{i}|+|A_{j}|-10)]\setminus\{5\}. By (32), we have sc(n,δ)sp(n,δ)<32(|Ai|+|Aj|10)\operatorname{sc}(n,\delta)\leq\operatorname{sp}(n,\delta)<\frac{3}{2}(|A_{i}|+|A_{j}|-10).

Case 2: for some ii every vertex of AiA_{i} is adjacent to at least one vertex outside XiWX_{i}\cup W. Since

|Ai|(29)|Bi|ξ2n(20)43ν(1ε)nξ2n(22)13ξn>(22),(35)31kξ2n|A_{i}|\overset{\mbox{\tiny{\eqref{eq:ext:BminusA}}}}{\geq}\Big{|}\bigcup B_{i}\Big{|}-\xi^{2}n\overset{\mbox{\tiny{\eqref{eq:BiBig}}}}{\geq}\frac{4}{3}\nu(1-\varepsilon)n-\xi^{2}n\overset{\mbox{\tiny{\eqref{ext:smallxi}}}}{\geq}13\xi n\overset{\mbox{\tiny{\eqref{ext:smallxi},\eqref{eq:ext:knu}}}}{>}31k\xi^{2}n

we can certainly find jij\neq i such that there are 31ξ2n31\xi^{2}n vertices in AiA_{i} all adjacent to vertices of XjXiX_{j}\setminus X_{i}. Since no vertex of XjXiX_{j}\setminus X_{i} is adjacent to 30ξ2n30\xi^{2}n vertices of AiA_{i} (by definition of XiX_{i}), we find two disjoint edges u1v1u_{1}v_{1} and u2v2u_{2}v_{2} from u1,u2Aiu_{1},u_{2}\in A_{i} to v1,v2Xjv_{1},v_{2}\in X_{j}. Choosing distinct neighbours y1y_{1} of v1v_{1} and y2y_{2} of v2v_{2} in AjA_{j} and applying the same reasoning as in the previous case we are done.

Case 3: for each iji\neq j we have |XiXj|1|X_{i}\cap X_{j}|\leq 1, and for each ii some vertex in AiA_{i} is adjacent only to vertices in WXiW\cup X_{i}. Thus |Xi|δ|W|+1|X_{i}|\geq\delta-|W|+1 for each ii. We now first focus on finding a squared path on sp(n,δ)\operatorname{sp}(n,\delta) vertices in GG, and then turn to the squared cycles which will complete the proof. If for some iji\neq j we have |XiXj|=1|X_{i}\cap X_{j}|=1 then we obtain a squared path of the desired length as in Case 1. There we required two vertices in XiXjX_{i}\cap X_{j} to obtain a squared cycle (which must return to its start), but one vertex suffices for a squared path to cross from XiX_{i} to XjX_{j}.

So, assume that the sets XiX_{i} are all disjoint. It follows that k(n|W|)/(δ|W|+1)k\leq(n-|W|)/(\delta-|W|+1). Since |W|nδ|W|\leq n-\delta by (25), this implies

kn(nδ)δ(nδ)+1=δ2δn+1.k\leq\frac{n-(n-\delta)}{\delta-(n-\delta)+1}=\frac{\delta}{2\delta-n+1}\,.

Now if krp(n,δ)+1k\geq r_{p}(n,\delta)+1 then we would have

rp(n,δ)+1kδ2δn+1,r_{p}(n,\delta)+1\leq k\leq\frac{\delta}{2\delta-n+1}\,,

and so

rp(n,δ)nδ12δn+1,r_{p}(n,\delta)\leq\frac{n-\delta-1}{2\delta-n+1}\,,

but by (3) we have rp(n,δ)nδ2δn+1r_{p}(n,\delta)\geq\frac{n-\delta}{2\delta-n+1}, so

krp(n,δ).k\leq r_{p}(n,\delta)\,.

Thus the largest of the sets XiX_{i}, say X1X_{1}, has at least

(36) |X1|n|W|k(25)δkδrp(n,δ)|X_{1}|\geq\frac{n-|W|}{k}\overset{\mbox{\tiny{\eqref{eq:ext-like:W}}}}{\geq}\frac{\delta}{k}\geq\frac{\delta}{r_{p}(n,\delta)}

vertices.

We now want to apply Lemma 21 with H=G[X1]H=G[X_{1}] and ‘bad’ vertices B=X1A1B=X_{1}-A_{1}. Note that by (34) there are at most 2ξ2n2\xi^{2}n vertices in B=X1A1B=X_{1}-A_{1}, and so we have

|B|(34)2ξ2n(22)νδ100(35)δ100k|H|100.|B|\overset{\mbox{\tiny{\eqref{eq:ext:XA}}}}{\leq}2\xi^{2}n\overset{\mbox{\tiny{\eqref{ext:smallxi}}}}{\leq}\frac{\nu\delta}{100}\overset{\mbox{\tiny{\eqref{eq:ext:knu}}}}{\leq}\frac{\delta}{100k}\leq\frac{|H|}{100}\,.

Moreover, δ(H)=δ(G[X1])30ξ2n\delta(H)=\delta(G[X_{1}])\geq 30\xi^{2}n by definition of X1X_{1}, and therefore every vertex in BB has at least 30ξ2n92ξ2n9|B|30\xi^{2}n\geq 9\cdot 2\xi^{2}n\geq 9|B| neighbours in HH. In addition, vertices vv in HBA1H-B\subseteq A_{1} satisfy

deg(v,X1)(31)|A1|2+30ξ2n>(34)|X1|2+25ξ2n=|H|2+25ξ2n(16)|H|2+9|B|+10.\begin{split}\deg(v,X_{1})&\overset{\mbox{\tiny{\eqref{eq:ext:deltaAi}}}}{\geq}\frac{|A_{1}|}{2}+30\xi^{2}n\overset{\mbox{\tiny{\eqref{eq:ext:XA}}}}{>}\frac{|X_{1}|}{2}+25\xi^{2}n\\ &=\frac{|H|}{2}+25\xi^{2}n\overset{\mbox{\tiny{\eqref{ext:chooseN}}}}{\geq}\frac{|H|}{2}+9|B|+10\,.\end{split}

Hence we can indeed apply Lemma 21, to obtain a path TT covering min{X1,n/2}\min\{X_{1},n/2\} vertices on which every quadruple of consecutive vertices contains at most one ‘bad’ vertex. Finally we want to apply Lemma 20 to the graph G[X1W]G[X_{1}\cup W] and the cycle TT with the following ordering σ\sigma of the quadruples of consecutive vertices in TT: σ\sigma is such that all quadruples containing vertices from BB come first, followed (by an arbitrary ordering of) all other quadruples. There are at most 22ξ2n2\cdot 2\xi^{2}n quadruples containing vertices from B=X1A1B=X_{1}-A_{1}, and by the definition of A1A_{1} and of WW, each of them has at least (ξ3ξ2)nξ2n(\xi-3\xi^{2})n\geq\xi^{2}n common neighbours in WW. All remaining quadruples have, by the definition of A1A_{1}, by (25) and since δ 2n/3\delta\leq\ 2n/3, at least |W|4ξ2nn412min{|X1|,n2}|W|-4\xi^{2}n\geq\tfrac{n}{4}\geq\frac{1}{2}\min\{|X_{1}|,\frac{n}{2}\} common neighbours in WW. Hence, we can indeed apply Lemma 20 to obtain a squared path on at least 32min{|X1|,n/2}sp(n,δ)\frac{3}{2}\min\{|X_{1}|,n/2\}\geq\operatorname{sp}(n,\delta) vertices, where the inequality follows from the definition of sp(n,δ)\operatorname{sp}(n,\delta), from (17), and from (36).

At last, we show that we can find in GG the desired long squared cycles in Case 3. Assume first that there is a cycle of sets (relabelling the indices if necessary) X1,X2,,XsX_{1},X_{2},\ldots,X_{s} for some 3sk3\leq s\leq k such that XiXi+1mods={vi}X_{i}\cap X_{i+1\!\mod s}=\{v_{i}\} for each ii, and the viv_{i} are all distinct, then for each ii we may choose neighbours uiAiu_{i}\in A_{i} and yiy_{i} in Ai+1modsA_{i+1\!\mod s} of viv_{i}, and we may insist that all these 3s3s vertices are distinct. Similarly as before we can apply Lemma 21 to each G[Ai]G[A_{i}] in turn and concatenate the resulting paths, in order to find a cycle T2T_{2\ell^{\prime}} for every 8s2|A1|+|A2|8s\leq 2\ell^{\prime}\leq|A_{1}|+|A_{2}| on which there are no quadruples using more than one vertex outside iAi\bigcup_{i}A_{i}. Again (checking the conditions similarly as before) we may apply Lemma 20 to T2T_{2\ell^{\prime}} to obtain a squared cycle on 33\ell vertices. Finally by performing parity corrections we obtain C2C^{2}_{\ell} for every [3,32(|A1|+|A2|)]{5}\ell\in[3,\frac{3}{2}(|A_{1}|+|A_{2}|)]\setminus\{5\}.

If there exists no such cycle of sets, then i=1k|Xi|n|W|+k1\sum_{i=1}^{k}|X_{i}|\leq n-|W|+k-1. Since we have also |Xi|δ|W|+1|X_{i}|\geq\delta-|W|+1 for each ii and |W|nδ|W|\leq n-\delta, it follows from the definition of rc(n,δ)r_{c}(n,\delta) (by establishing a relation similar to (3)) that krc(n,δ)k\leq r_{c}(n,\delta), and by averaging, that the largest of the sets XiX_{i}, say X1X_{1}, contains at least 2sc(n,δ)/32\operatorname{sc}(n,\delta)/3 vertices. As before, we can apply Lemma 21 to X1X_{1} to obtain a cycle T2T_{2\ell^{\prime}} for each 42|X1|4\leq 2\ell^{\prime}\leq|X_{1}| on which the ‘bad’ vertices from B=X1A1B=X_{1}-A_{1} are separated, and apply Lemma 20 to it to obtain a squared cycle C32C^{2}_{3\ell^{\prime}} for each 63sc(n,δ)6\leq 3\ell^{\prime}\leq\operatorname{sc}(n,\delta) as required. Again the parity correction procedure is applicable, so we get C2C^{2}_{\ell} for every [3,sc(n,δ)]{5}\ell\in[3,\operatorname{sc}(n,\delta)]\setminus\{5\}. ∎

5. Concluding remarks

The proof of Theorem 4.

Our results were most difficult to prove for δ4n/7\delta\approx 4n/7. This is somewhat surprising given the experience from the partial and perfect packing results of Komlós [9] and Kühn and Osthus [15]. In the setting of these results it becomes steadily more difficult to prove packing results as the minimum degree of the graph (and hence the required size of a packing) increases, with perfect packings as the most difficult case. Yet in our setting it is relatively easy to prove our results when the minimum degree condition is large. This difference occurs because we have to embed triangle-connected graphs; as the minimum degree increases the possibilities for bad behaviour when forming triangle-connections are reduced.

This is paralleled by the behaviour of K4K_{4}-free graphs: For any minimum degree δ(G)>2v(G)/3\delta(G)>2v(G)/3 the graph GG is not K4K_{4}-free. However, if δ(G)>5v(G)/8\delta(G)>5v(G)/8 then by the Andrásfai-Erdős-Sós theorem [2] the K4K_{4}-free graph GG is forced to be tripartite, while for smaller values of δ(G)\delta(G) there exist more possibilities.

Extremal graphs.

It is straightforward to check (from our proofs) that up to some trivial modifications the graphs Gp(n,δ)G_{p}(n,\delta) and Gc(n,δ)G_{c}(n,\delta) are the only extremal graphs. We believe that the graph Gp(n,δ)G_{p}(n,\delta) remains extremal for squared paths even when δ\delta is not bounded away from n/2n/2, although as noted in Section 1 the same is not true for Gc(n,δ)G_{c}(n,\delta) and squared cycles.

However it is not the case that the only extremal graph excluding some C2C^{2}_{\ell} of chromatic number four is Knδ,nδ,2δnK_{n-\delta,n-\delta,2\delta-n} (cf. () of our main theorem, Theorem 4). Let us briefly explain this. Suppose \ell is not divisible by three. Since C2C^{2}_{\ell} has no independent set on more than /3\lfloor\ell/3\rfloor vertices, whenever we remove an independent set from C2C^{2}_{\ell} we must leave some three consecutive vertices, which form a triangle. Now suppose that we can find a graph HH on δ\delta vertices with minimum degree 2δn2\delta-n which is both triangle-free and contains no even cycle on more than 2(2δn)2(2\delta-n) vertices. Then the graph GG obtained by adding an independent set of size nδn-\delta to HH, all of whose vertices are adjacent to all of HH, contains no squared cycle of length indivisible by three and no squared cycle with more than 3(2δn)3(2\delta-n) vertices.

To mention one possible HH, take δ=6n11\delta=\frac{6n}{11} and let HH be obtained as follows. We take the disjoint union of three copies of Kn/11,n/11K_{n/11,n/11} and fix a bipartition. Now we add three vertex disjoint edges within one of the two partition classes, one between each copy of Kn/11,n/11K_{n/11,n/11}. The resulting triangle-free graph has no even cycle leaving any copy of Kn/11,n/11K_{n/11,n/11}. Hence all even cycles have at most 2n11\frac{2n}{11} vertices. However, it has odd cycles of up to 6n113\frac{6n}{11}-3 vertices.

Long squared cycles.

Theorem 5 () states that if any of various odd cycles are excluded from GG we are guaranteed even cycles of every length up to 2δ(G)2\delta(G), whereas the equivalent statement in our Theorem 4 contains an error term. We believe this error term can be removed, but at the cost of significantly more technical work, requiring both a new version of the stability lemma and new extremal results.

Higher powers of paths and cycles.

We note that Theorem 2 has a natural generalisation to higher powers of cycles, the so called Pósa-Seymour Conjecture. This conjecture was proved for all sufficiently large nn by Komlós, Sárközy and Szemerédi [12]. We conjecture a natural generalisation of Theorem 4 for higher powers of paths and cycles.

Given kk, nn and δ\delta, we construct an nn-vertex graph Gp(k)(n,δ)G_{p}^{(k)}(n,\delta) by partitioning the vertices into an ‘interior’ set of =(k1)(nδ)\ell=(k-1)(n-\delta) vertices upon which we place a complete balanced k1k-1-partite graph, and an ‘exterior’ set of nn-\ell vertices upon which we place a disjoint union of (n)/(δ+1)\lfloor(n-\ell)/(\delta-\ell+1)\rfloor almost-equal cliques. We then join every ‘interior’ vertex to every ‘exterior’ vertex. We construct Gc(k)(n,δ)G_{c}^{(k)}(n,\delta) similarly, permitting the cliques in the ‘exterior’ vertices to overlap in cut-vertices of the ‘exterior’ set if this reduces the size of the largest clique while preserving the minimum degree δ\delta.

Conjecture 24.

Given ν>0\nu>0 and kk there exists n0n_{0} such that whenever nn0n\geq n_{0} and GG is an nn-vertex graph with δ(G)=δ>k1kn+νn\delta(G)=\delta>\frac{k-1}{k}n+\nu n, the following hold.

  1. ()

    If PkGp(k)(n,δ)P^{k}_{\ell}\subseteq G_{p}^{(k)}(n,\delta) then PkGP^{k}_{\ell}\subseteq G.

  2. ()

    If C(k+1)kGc(k)(n,δ)C^{k}_{(k+1)\ell}\subseteq G_{c}^{(k)}(n,\delta) for some integer \ell, then C(k+1)kGC^{k}_{(k+1)\ell}\subseteq G.

  3. ()

    If CkGc(k)(n,δ)C^{k}_{\ell}\subseteq G_{c}^{(k)}(n,\delta) with χ(Ck)=k+2\chi(C^{k}_{\ell})=k+2 and CkGC^{k}_{\ell}\not\subseteq G for some integer \ell, then C(k+1)kGC^{k}_{(k+1)\ell}\subseteq G for each integer <kδ(k1)nνn\ell<k\delta-(k-1)n-\nu n.

It seems likely that again the νn\nu n error term in the last statement is not required, but again (at least for powers of cycles) it is required in the minimum degree condition.

Acknowledgement

This project was started at DocCourse 2008, organised by the research training group Methods for Discrete Structures, Berlin. In particular, we would like to thank Mihyun Kang and Mathias Schacht for organising this nice event.

References

  • [1] P. Allen, Minimum degree conditions for cycles, (2009), submitted.
  • [2] B. Andrásfai, P. Erdős, and V. T. Sós, On the connection between chromatic number, maximal clique and minimal degree of a graph, Discrete Math. 8 (1974), 205–218.
  • [3] K. Corrádi and A. Hajnal, On the maximal number of independent circuits in a graph, Acta Math. Acad. Sci. Hungar. 14 (1963), 423–439.
  • [4] G. A. Dirac, Some theorems on abstract graphs, Proc. London Math. Soc. (3) 2 (1952), 69–81.
  • [5] P. Erdős and A. H. Stone, On the structure of linear graphs, Bull. Amer. Math. Soc. 52 (1946), 1087–1091.
  • [6] P. Erdős, Problem 9, Theory of Graphs and its Applications (Proc. Sympos. Smolenice, 1963) (M. Fiedler, ed.), Publ. House Czechoslovak Acad. Sci., Prague, 1964, p. 159.
  • [7] G. Fan and H. A. Kierstead, The square of paths and cycles, J. Combin. Theory Ser. B 63 (1995), no. 1, 55–64.
  • [8] by same author, Hamiltonian square-paths, J. Combin. Theory Ser. B 67 (1996), no. 2, 167–182.
  • [9] J. Komlós, Tiling Turán theorems, Combinatorica 20 (2000), no. 2, 203–218.
  • [10] J. Komlós, G. N. Sárközy, and E. Szemerédi, On the square of a Hamiltonian cycle in dense graphs, Random Structures Algorithms 9 (1996), no. 1-2, 193–211.
  • [11] by same author, Blow-up lemma, Combinatorica 17 (1997), no. 1, 109–123.
  • [12] by same author, Proof of the Seymour conjecture for large graphs, Annals of Combinatorics 2 (1998), 43–60.
  • [13] J. Komlós and M. Simonovits, Szemerédi’s regularity lemma and its applications in graph theory, Combinatorics, Paul Erdős is eighty, Vol. 2 (Keszthely, 1993), Bolyai Soc. Math. Stud., vol. 2, János Bolyai Math. Soc., Budapest, 1996, pp. 295–352.
  • [14] D. Kühn and D. Osthus, Embedding large subgraphs into dense graphs, Surveys in Combinatorics (S. Huczynka, J. Mitchell, and C. Roney-Dougal, eds.), Cambridge University Press, 2009, pp. 137–167.
  • [15] D. Kühn and D. Osthus, The minimum degree threshold for perfect graph packings, Combinatorica 29 (2009), 65–107.
  • [16] D. Kühn, D. Osthus, and A. Taraz, Large planar subgraphs in dense graphs, J. Comb. Theory, Ser. B 95 (2005), no. 2, 263–282.
  • [17] V. Nikiforov and R. H. Schelp, Cycle lengths in graphs with large minimum degree, Journal of Graph Theory 52 (2006), no. 2, 157–170.
  • [18] I. Reiman, Über ein Problem von K. Zarankiewicz, Acta Math. Acad. Sci. Hungar. 9 (1958), 269–273.
  • [19] M. Simonovits, A method for solving extremal problems in graph theory, stability problems, Theory of Graphs (Proc. Colloq., Tihany, 1966), Academic Press, 1968, pp. 279–319.
  • [20] E. Szemerédi, Regular partitions of graphs, Problèmes combinatoires et théorie des graphes (Orsay, 1976), Colloques Internationaux CNRS, vol. 260, CNRS, 1978, pp. 399–401.
  • [21] P. Turán, Eine Extremalaufgabe aus der Graphentheorie, Mat. Fiz. Lapok 48 (1941), 436–452.

Appendix A Proof of Lemma 8

For the proof of Lemma 8 we apply the following version (which is a special case) of the Blow-up Lemma of Komlós, Sárközy and Szemerédi [11].

Lemma 25 (Blow-up Lemma [11]).

Given fixed c,d>0c,d>0, there exist ε0>0\varepsilon_{0}>0 and nbln_{\textsc{{bl}}} such that for any 0<ε<ε00<\varepsilon<\varepsilon_{0} the following holds. Let HH be any graph on at least nbln_{\textsc{{bl}}} vertices with V(H)=V1˙V2˙V3V(H)=V_{1}\dot{\cup}V_{2}\dot{\cup}V_{3} and |Vi|16|V(H)||V_{i}|\geq\frac{1}{6}|V(H)|, in which each bipartite graph H[Vi,Vj]H[V_{i},V_{j}] is (3ε,d)(3\varepsilon,d)-regular and furthermore δVi(Vj)12d|Vi|\delta_{V_{i}}(V_{j})\geq\frac{1}{2}d|V_{i}| for each 1i,j31\leq i,j\leq 3.

Let FF be any subgraph of the complete tripartite graph with parts V1V_{1}V2V_{2} and V3V_{3} such that the maximum degree of FF is at most four. Assume further, that at most four vertices xix_{i} (i[4]i\in[4]) of FF are endowed with sets CxiVjC_{x_{i}}\subseteq V_{j} such that xiVjx_{i}\in V_{j} and |Cxi|c|Vj||C_{x_{i}}|\geq c|V_{j}|

Then there is an embedding ψ:V(F)V(H)\psi\colon V(F)\rightarrow V(H) of FF into HH with ψ(xi)Cxi\psi(x_{i})\in C_{x_{i}} for i[4]i\in[4].

We also say that the xix_{i} in Lemma 25 are image restricted to CxiC_{x_{i}}.

Proof of Lemma 8.

Given dd, we let c=d2/4c=d^{2}/4. Now Lemma 25 gives values ε0>0\varepsilon_{0}>0 and nbln_{\textsc{{bl}}}. We choose εel=min(ε0,d2/24)\varepsilon_{\textsc{{el}}}=\min(\varepsilon_{0},d^{2}/24). Given ε<εel\varepsilon<\varepsilon_{\textsc{{el}}} and melm_{\textsc{{el}}} we choose

nel=max(2melnbl,6m4ε).n_{\textsc{{el}}}=\max\left(2m_{\textsc{{el}}}n_{\textsc{{bl}}},\frac{6m^{4}}{\varepsilon}\right)\,.

Let nneln\geq n_{\textsc{{el}}}, let GG be an nn-vertex graph, and let RR^{\prime} be an (ε,d)(\varepsilon,d)-reduced graph of GG on mmelm\leq m_{\textsc{{el}}} vertices.

Fix a set 𝒯={T1,,TCTF(R)/3}\mathcal{T^{\prime}}=\{T^{\prime}_{1},\ldots,T^{\prime}_{\operatorname{CTF}(R^{\prime})/3}\} of vertex-disjoint triangles in a triangle component of RR^{\prime} covering CTF(R)\operatorname{CTF}(R^{\prime}) vertices. For each triangle Ti=Xi,1Xi,2Xi,3T^{\prime}_{i}=X^{\prime}_{i,1}X^{\prime}_{i,2}X^{\prime}_{i,3} we may by regularity for each j[3]j\in[3] remove at most ε|Xi,j|\varepsilon|X^{\prime}_{i,j}| vertices from Xi,jX^{\prime}_{i,j} to obtain a set Xi,jX_{i,j} such that each pair (Xi,j,Xi,k)(X_{i,j},X_{i,k}) is not only (2ε,d)(2\varepsilon,d)-regular but also satisfies δXi,k(Xi,j)(d3ε)|Xi,k|\delta_{X_{i,k}}(X_{i,j})\geq(d-3\varepsilon)|X_{i,k}|. We let RR be the (2ε,d)(2\varepsilon,d)-reduced graph corresponding to the new vertex partition given by replacing each Xi,jX^{\prime}_{i,j} with Xi,jX_{i,j}; then every edge of RR^{\prime} carries over to RR, and we let 𝒯\mathcal{T} be the set of CTF(R)/3\operatorname{CTF}(R^{\prime})/3 vertex disjoint triangles in RR corresponding to 𝒯\mathcal{T^{\prime}}. We set r=CTF(R)/3r=\operatorname{CTF}(R^{\prime})/3.

Our strategy now is as follows. We shall first fix a collection of suitable triangle walks W1,,Wr1W_{1},\dots,W_{r-1} and WW^{\prime} in RR. Next, for each of these triangle walks W=(E1,E2,)W=(E_{1},E_{2},\ldots) we do the following. Let U1V1​​​{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}} be (a suitable) orientation of the first edge E1E_{1} of WW. We shall construct a sequence Q(W,U1V1​​​)Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}}) of vertices of RR whose first two vertices are U1U_{1} and V1V_{1}, in that order, and which has the property that every vertex in the sequence is adjacent to the two preceding vertices (as is the case for a squared path). Then we use this sequence Q(W,U1V1​​​)Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}}) to obtain a squared path in GG following WW, whose first two vertices are in U1U_{1} and V1V_{1}. Finally, connecting suitable paths appropriately will lead to a proof of (), (), and ().

We first construct the triangle walks W1,,Wr1W_{1},\dots,W_{r-1} and WW^{\prime}. For each 1ir11\leq i\leq r-1 let WiW_{i} be a fixed triangle walk in RR whose first edge is in TiT_{i} and whose last is in Ti+1T_{i+1}. We suppose (repeating edges in the triangle walk WiW_{i} if necessary) that each triangle walk WiW_{i} contains at least ten edges, that the first edge of Wi+1W_{i+1} is not the same as the last edge of WiW_{i}, and such that each walk with more than ten edges is of minimal length. We have |Wi|(m2)|W_{i}|\leq\binom{m}{2} for each ii. Let WW^{\prime} be the triangle walk obtained by concatenating W1,,Wr1W_{1},\ldots,W_{r-1}.

Next, we describe how to construct the sequence Q(W,A1B1​​​)Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{A_{1}B_{1}}\vphantom{a}}}) for any triangle walk W=(E1,E2,)W=(E_{1},E_{2},\ldots) in RR and any orientation A1B1​​​{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{A_{1}B_{1}}\vphantom{a}}} of its first edge E1E_{1}. We construct Q(W,A1B1​​​)Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{A_{1}B_{1}}\vphantom{a}}}) iteratively as follows. Let Q1=(A1,B1)Q_{1}=(A_{1},B_{1}). Now for each 2i|W|2\leq i\leq|W| successively, we define QiQ_{i} as follows. The last two vertices Ai1,Bi1A_{i-1},B_{i-1} of Qi1Q_{i-1} are an orientation of Ei1E_{i-1}. If Ei=Ai1BiE_{i}=A_{i-1}B_{i} we create QiQ_{i} by appending (Bi,Ai1)(B_{i},A_{i-1}) to Qi1Q_{i-1}; if Ei=Bi1BiE_{i}=B_{i-1}B_{i} we append (Bi,Ai1,Bi1,Bi)(B_{i},A_{i-1},B_{i-1},B_{i}) to Qi1Q_{i-1} to create QiQ_{i}. At each step the final two vertices of QiQ_{i} are an orientation of EiE_{i}. Furthermore every vertex of QiQ_{i} is adjacent in RR to the two vertices preceding it in QiQ_{i}. Finally, we let Q(W,A1B1​​​)=Q|W|Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{A_{1}B_{1}}\vphantom{a}}})=Q_{|W|}.

We shall need the following observations concerning the lengths of sequences constructed in this way. It is easy to check by induction that for any triangle walk WW with at least two edges whose first edge is U1V1U_{1}V_{1}, we have

(37) |Q(W,U1V1​​​)|+|Q(W,V1U1​​​)|1mod3.|Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}})|+|Q(W,{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{1}U_{1}}\vphantom{a}}})|\equiv 1\mod 3\,.

Now consider the concatenation WW^{\prime} of the walks WiW_{i}. Let U1V1U_{1}V_{1} be the first edge of W1W_{1}. If we construct Q(W,U1V1​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}}) then the first edge UiViU_{i}V_{i} and the last edge UiViU^{\prime}_{i}V^{\prime}_{i} of each WiW_{i} obtains an orientation, say UiVi​​​{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{i}V_{i}}\vphantom{a}}} and UiVi​​​{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U^{\prime}_{i}V^{\prime}_{i}}\vphantom{a}}}. Clearly, there are sequences Q~i\tilde{Q}_{i} of vertices in TiT_{i} for 1<i<r1<i<r, such that Q(W,V1U1​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{1}U_{1}}\vphantom{a}}}) is the concatenation of

Q(W1,V1U1​​​),Q~2,Q(W2,V2U2​​​),,Q~r1,Q(Wr1,Vr1Ur1​​​).Q(W_{1},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{1}U_{1}}\vphantom{a}}}),\tilde{Q}_{2},Q(W_{2},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{2}U_{2}}\vphantom{a}}}),\dots,\tilde{Q}_{r-1},Q(W_{r-1},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{r-1}U_{r-1}}\vphantom{a}}})\,.

Further we let Q~1=T1U1V1\tilde{Q}_{1}=T_{1}-U_{1}V_{1} and Q~r=TrUr1Vr1\tilde{Q}_{r}=T_{r}-U^{\prime}_{r-1}V^{\prime}_{r-1}. We define fi=|Q~i|mod3f_{i}=|\tilde{Q}_{i}|\!\mod 3 for i[r]i\in[r]. Together with (37) we obtain

|Q(W,U1V1​​​)|+|Q(W1,V1U1​​​)|+1<i<r(|Q(Wi,ViUi​​​)|+fi)1mod3|Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}})|+|Q(W_{1},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{1}U_{1}}\vphantom{a}}})|+\sum_{1<i<r}\big{(}|Q(W_{i},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{i}U_{i}}\vphantom{a}}})|+f_{i}\big{)}\equiv 1\mod 3

and hence

(38) |Q(W,U1V1​​​)|+i[r1](|Q(Wi,ViUi​​​)|+fi)+fr0mod3.|Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{U_{1}V_{1}}\vphantom{a}}})|+\sum_{i\in[r-1]}\big{(}|Q(W_{i},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{V_{i}U_{i}}\vphantom{a}}})|+f_{i}\big{)}+f_{r}\equiv 0\mod 3\,.

This will enable us to construct cycles of lengths divisible by three later.

In order to construct squared paths in GG from short vertex sequences in RR we use the following fact.

Fact 26.

Let X1,X2,X3X_{1},X_{2},X_{3} be vertices of RR (not necessarily distinct), and ZZ be any set of at most 2ε|X1|2\varepsilon|X_{1}| vertices of GG. Suppose that X1X2X_{1}X_{2} and X1X3X_{1}X_{3} are edges of RR. Suppose furthermore that we have two vertices uu and vv of GG such that uu and vv have at least (d2ε)2|X1|(d-2\varepsilon)^{2}|X_{1}| common neighbours in X1X_{1}, and vv has at least (d2ε)|X2|(d-2\varepsilon)|X_{2}| neighbours in X2X_{2}.

Then there is a vertex wX1Zw\in X_{1}-Z adjacent to uu and vv such that vv and ww have at least (d2ε)2|X2|(d-2\varepsilon)^{2}|X_{2}| common neighbours in X2X_{2} and ww has at least (d2ε)|X3|(d-2\varepsilon)|X_{3}| neighbours in X3X_{3}.

Proof of Fact 26.

Let WW be the set of common neighbours of uu and vv in X1X_{1}. Since X1X2E(R)X_{1}X_{2}\in E(R), at most 2ε|X1|2\varepsilon|X_{1}| vertices of WW have fewer than (d2ε)|Γ(v)X2|(d2ε)2|X2|(d-2\varepsilon)|\Gamma(v)\cap X_{2}|\geq(d-2\varepsilon)^{2}|X_{2}| common neighbours with vv in X2X_{2}. Since X1X3E(R)X_{1}X_{3}\in E(R) at most 2ε|X1|2\varepsilon|X_{1}| vertices of WW have fewer than (d2ε)(d-2\varepsilon) neighbours in X3X_{3}. Finally since 6ε|X1|<(d2ε)2|X1|6\varepsilon|X_{1}|<(d-2\varepsilon)^{2}|X_{1}| we can find a vertex of WZW\setminus Z satisfying the desired properties. ∎

With these buiding bricks at hand we can now turn to the proofs of (), (), and ().

Proof of (), i.e., GG contains C32C^{2}_{3\ell} for each 3(1d)CTF(R)n/m3\ell\leq(1-d)\operatorname{CTF}(R)n/m: When (1d)n/m\ell\leq(1-d)n/m we have C32K(1d)n/m,(1d)n/m,(1d)n/mC^{2}_{3\ell}\subseteq K_{(1-d)n/m,(1-d)n/m,(1-d)n/m} and thus by Lemma 25 we can find C32C^{2}_{3\ell} as a subgraph of GG (whose vertices are in T1T_{1}, with no restrictions required). Otherwise we use the following strategy. Let UVUV be the first edge of the triangle walk W1W_{1}.

Our first goal will be to construct a squared path PP^{\prime} in GG which ‘connects’ T1T_{1} to T2T_{2}, T2T_{2} to T3T_{3}, and so on. For this purpose we choose two adjacent vertices uu and vv of GG in UU and VV respectively, such that uu and vv have (d2ε)2n/m(d-2\varepsilon)^{2}n/m common neighbours in both the third vertex of T1T_{1} and the third vertex of Q(W,UV​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}), such that vv has (d2ε)n/m(d-2\varepsilon)n/m neighbours in the fourth vertex of Q(W,UV​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}), and such that uu has (d2ε)n/m(d-2\varepsilon)n/m neighbours in VV. This is possible by the regularity of the various pairs. (Observe that the required sizes for the neighbourhoods and joint neighbourhoods are chosen large enough for an application of Lemma 25 in the triangle T1T_{1}.) Now we apply Fact 26 with the vertices uu and vv and the third, fourth and fifth vertices of Q(W,UV​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}) to obtain a third vertex vv^{\prime} in the third vertex of Q(W,UV​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}) such that uu and vv are adjacent to vv^{\prime}. By repeatedly applying Fact 26 we construct a sequence of vertices PP^{\prime} (starting with u,v,vu,v,v^{\prime}), where the iith vertex of PP^{\prime} is in the iith set of Q(W,UV​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}) and is adjacent to its two predecessors, and where the vertices are all distinct (noting that 3|W|<εn/m3|W^{\prime}|<\varepsilon n/m). Thus PP^{\prime} is a squared path running from T1T_{1} to Tr1T_{r-1} following all the triangle walks WiW_{i}.

In addition we construct similarly (and without re-using vertices) for each 1ir11\leq i\leq r-1 a squared path PiP_{i} following the triangle walk WiW_{i}. However, this time we use the opposite orientation for the first edge: that is, instead of constructing P1P_{1} from Q(W1,UV​​​)Q(W_{1},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}) we use Q(W1,VU​​​)Q(W_{1},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{VU}\vphantom{a}}}), and similarly for each PiP_{i} we use the opposite orientation of the first edge of WiW_{i} to that used in PP^{\prime}. Again, for each PiP_{i} we insist that the first two vertices have suitable neighbourhoods in TiT_{i}, and the last two in Ti+1T_{i+1}, for an application of Lemma 25 in these triangles. Again, this is possible by regularity.

We note that the total number of vertices on all of these squared paths is not more than 6m(m2)<εn/m6m\binom{m}{2}<\varepsilon n/m. Finally, we remove from T1T_{1} all vertices of P=PP1Pr1P=P^{\prime}\cup P_{1}\cup\cdots\cup P_{r-1}. Since at most εn/m\varepsilon n/m vertices are removed, and each cluster of T1T_{1} has size at least (13ε)n/m(1-3\varepsilon)n/m, even after removal all three pairs remain (3ε,d)(3\varepsilon,d)-regular and each cluster still has size at least (14ε)n/m(1-4\varepsilon)n/m.

Thus the conditions of Lemma 25 are satisfied, and hence we may embed a squared path S1S_{1} into T1T_{1}, with the four restrictions that its first vertex is a common neighbour of the first two vertices of PP^{\prime}, its second a neighbour of the first vertex of PP^{\prime}, its penultimate vertex a neighbour of the first vertex of P1P_{1} and its final vertex a common neighbour of the first two vertices of P1P_{1} (noting that by choice of the first two vertices of PP^{\prime} and of P1P_{1} the sets to which these vertices are restricted are indeed of size cn/mcn/m because c=d2/4c=d^{2}/4). In this way we can construct a squared path on 31+f13\ell_{1}+f_{1} vertices for any integer 1[10,(1d)n/m]\ell_{1}\in[10,(1-d)n/m] (since 34ε<d3\cdot 4\varepsilon<d), where f1{0,1,2}f_{1}\in\{0,1,2\} is as defined above (38). Similarly we may apply Lemma 25 to each TiT_{i} (2ir2\leq i\leq r), after removing PP from TiT_{i}, to obtain squared paths SiS_{i} of length 3i+fi3\ell_{i}+f_{i} for any integer i[10,(1d)n/m]\ell_{i}\in[10,(1-d)n/m].

Finally S=PS1P1Pr1SrS=P^{\prime}\cup S_{1}\cup P_{1}\cup\ldots\cup P_{r-1}\cup S_{r} forms a squared cycle in GG. It follows from (38) that the length of SS is divisible by three. We conclude that indeed S=C3k2S=C^{2}_{3k}, where we may choose any integer kk with 3k[6m3,(1d)CTF(R)n/m]3k\in[6m^{3},(1-d)\operatorname{CTF}(R)n/m], as required.

Proof of () When every triangle component of RR contains K4K_{4} we also want to obtain squared cycles whose lengths are not divisible by three. Observe that if ABCDABCD is a copy of K4K_{4} in RR, then the vertex sequences ABCABC, ABCDABCABCDABC and ABCDABCDABCABCDABCDABC each start and end with the same pair. Hence, with the help of Fact 26, these sequences can be used to construct squared paths in GG of length 33 (which is 0mod30\!\mod 3), length 77 (1mod31\!\!\mod 3), and length 1111 (2mod32\!\mod 3).

We construct C2C^{2}_{\ell} for [3,20]{5}\ell\in[3,20]\setminus\{5\} within a copy of K4K_{4} in RR directly (by the above methods). To obtain C2C^{2}_{\ell} with 213(1d)n/m21\leq\ell\leq 3(1-d)n/m we remove at most 2εn/m2\varepsilon n/m vertices from each of AA, BB and CC to obtain a triangle satisfying the conditions of Lemma 25, construct a short path in A,B,C,DA,B,C,D following the appropriate vertex sequence for mod3\ell\!\mod 3 and apply Lemma 25 to obtain C2C^{2}_{\ell}. Finally, to obtain longer squared cycles we perform the same construction as above, with the exception that WW^{\prime} is any triangle walk to and from a copy of K4K_{4}, and so Q(W,UV​​​)Q(W^{\prime},{\overset{\scalebox{1.8}[0.7]{\!\!\!\raisebox{-0.5pt}{$\rightharpoonup$}}}{\smash{UV}\vphantom{a}}}) may be taken (using one of the three vertex sequences above) to have any desired number of vertices modulo three (and not more than 2m22m^{2} in total).

Proof of () Lastly, when we are required to construct a squared path between two specified edges u1v1u_{1}v_{1} (with 2dn/m2dn/m common neighbours in both X1X_{1} and Y1Y_{1}) and u2v2u_{2}v_{2} (with 2dn/m2dn/m common neighbours in both X2X_{2} and Y2Y_{2}) using triangles TT in RR, we apply the identical strategy, noting that the conditions on u1v1u_{1}v_{1} and u2v2u_{2}v_{2} are already suitable for an application of Fact 26. ∎