This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Halin’s Infinite Ray Theorems: Complexity and Reverse Mathematics: Version Ethanks: All the authors were partially supported by NSF Grant DMS-1161175.

James S. Barnes
Department of Mathematics
Yale University
New Haven CT 06520
   Jun Le Goh
Department of Mathematics
National University of Singapore
Singapore 119076
   Richard A. Shore
Department of Mathematics
Cornell University
Ithaca NY 14853
Abstract

Halin [1965] proved that if a graph has nn many pairwise disjoint rays for each nn then it has infinitely many pairwise disjoint rays. We analyze the complexity of this and other similar results in terms of computable and proof theoretic complexity. The statement of Halin’s theorem and the construction proving it seem very much like standard versions of compactness arguments such as König’s Lemma. Those results, while not computable, are relatively simple. They only use arithmetic procedures or, equivalently, finitely many iterations of the Turing jump. We show that several Halin type theorems are much more complicated. They are among the theorems of hyperarithmetic analysis. Such theorems imply the ability to iterate the Turing jump along any computable well ordering. Several important logical principles in this class have been extensively studied beginning with work of Kreisel, H. Friedman, Steel and others in the 1960s and 1970s. Until now, only one purely mathematical example was known. Our work provides many more and so answers Question 30 of Montalbán’s Open Questions in Reverse Mathematics [2011]. Some of these theorems including ones in Halin [1965] are also shown to have unusual proof theoretic strength as well.

Mathematics Subject Classification 2020: Primary 05C63, 03D55, 03B30; Secondary 03D80, 03F35, 05C38, 05C69, 05C70

1 Introduction

In this paper we analyze the complexity of several results in infinite graph theory. These theorems are said to be ones of Halin type or, more generally, of ubiquity theory. The classical example is a theorem of Halin [11]: If a countable graph GG contains, for each nn, a sequence R0,,Rn1\left\langle R_{0},\ldots,R_{n-1}\right\rangle of disjoint rays (a ray is a sequence xiiN\left\langle x_{i}\mid i\in N\right\rangle of distinct vertices such that there is an edge between each xix_{i} and xi+1x_{i+1}) then it contains an infinite such sequence of rays. (Note: As will be described in Definition 3.2, when we talk about disjoint rays we always mean pairwise disjoint.) Halin actually deals with arbitrary graphs and formulates the result differently. The uncountable cases, however, are essentially just counting arguments. We deal only with countable structures but discuss his formulation in §6. This standard formulation of his theorem seems like a typical compactness theorem going from arbitrarily large finite collections of objects to an infinite collection. The archetypical example here is König’s Lemma: If a finitely branching tree has paths of length nn for every nn then it has a branch, i.e. an infinite path. In outline, a modern proof of Halin’s theorem for countable graphs (due to Andreae, see [5, Theorem 8.2.5(i)]) seems much like that of König’s Lemma (and many others in infinite graph theory). The construction of the desired sequence of rays proceeds by a recursion through the natural numbers in which each step is a simple procedure. While the procedure is much more delicate than for König’s Lemma, it is basically of the same complexity. It uses Menger’s theorem for finite graphs at each step but this represents a computable procedure (for finite graphs). The other parts of the step depend on the same type of information as in König’s Lemma. They ask, for example, if various sets (computable in the given graph) are nonempty or infinite. Nonetheless, we prove that the complexity of this construction and theorem are much higher than that for König’s Lemma or other applications of compactness. The concepts from graph theory, computability theory and proof theory/reverse mathematics that we need for our analysis are discussed in §3. Basic references for terminology, background and standard results not explicitly stated or otherwise attributed are Diestel [5] for graph theory; Rogers [22] and Sacks [24] for computability theory; and for reverse mathematics Simpson [28] with an approach which is primarily in terms of formal systems and Hirschfeldt [13] with one primarily emphasizing computability.

We follow two well established procedures for measuring the complexity of constructions and theorems. The first is basically computability theoretic. It has its formal beginnings in the 1950s but has much earlier roots in constructive or computable mathematics reaching back to antiquity. (See Ershov et al. [6] for history and surveys of the approach in several areas of combinatorics, algebra and analysis.) The measuring rod here is relative computability. We say a set AA of natural numbers is (Turing) computable from a set BB, ATBA\leq_{T}B, if there is an algorithm (say on a Turing machine or any other reasonable model of general computation) that, when given access to all membership facts about BB (an oracle for BB) computes membership in AA. The standard hierarchies of complexity here are based on iterations of the Turing jump. This operator takes BB to BB^{\prime}, the halting problem relativized to BB, i.e. the set of programs with oracle for BB, ΦeB\Phi_{e}^{B}, such that ΦeB\Phi_{e}^{B} halts on input ee. For example, if the tree of König’s Lemma is computable in BB then there is a branch computable in the double jump B′′B^{\prime\prime} of BB.

The second approach is proof theoretic. It measures the complexity of a theorem by the logical strength of the axioms needed to prove it. This approach also has a long history but the formal subject, now called reverse mathematics, starts with H. Friedman’s work in the 1970s (e.g. [7, 8]). One compares axiomatic systems SS and TT by saying that TT is stronger than SS, TST\vdash S (TT proves SS) if one can prove every sentence ΘS\Theta\in S from the axioms of TT. Of course, we know what it means for Θ\Theta to be provable in SS. The goal here is to characterize to the extent possible the axioms needed to prove a given mathematical theorem Θ\Theta. To this end, one begins with a weak base theory. Then one wants to find a system SS such that not only does SΘS\vdash\Theta but also Θ\Theta (with the weak base theory) proves all the axioms of SS. Hence the name reverse mathematics as we seek to prove the “axioms” of SS from the theorem Θ\Theta. Typically, the systems here are formalized in arithmetic with quantification over sets as well as numbers. The standard base theory (𝖱𝖢𝖠0\mathsf{RCA}_{0}) corresponds to the axioms needed to do computable constructions. Stronger systems are then usually generated by adding comprehension axioms which assert the existence of specific families of sets. For example, a very important system is 𝖠𝖢𝖠0\mathsf{ACA}_{0}. It is equivalent in the sense of reverse mathematics just described to König’s Lemma. Formally, it asserts that every subset of the natural numbers defined by a formula that quantifies only over numbers (and not sets) exists. This is also equivalent to asserting that for every set BB, the set BB^{\prime} exists.

The early decades of reverse mathematics were marked by a large variety of results characterizing a wide array of theorems and constructions as being one of five specific levels of complexity including 𝖱𝖢𝖠0\mathsf{RCA}_{0} and 𝖠𝖢𝖠0\mathsf{ACA}_{0}. Each of these systems (Simpson’s “big five”) have corresponding specific recursion theoretic construction principles. In more recent decades, there has been a proliferation of results placing theorems and constructions outside the big five. Sometimes these are inserted linearly and sometimes with incomparabilities. They are now collectively often called the “zoo” of reverse mathematics. (See https://rmzoo.math.uconn.edu/diagrams/ for pictures.)

Theorems and constructions in combinatorics in general, and graph theory in particular, have been a rich source of such denizens of this zoo. Almost all of them have fallen below 𝖠𝖢𝖠0\mathsf{ACA}_{0} (König’s Lemma) and so have the objects they seek constructible computably in finitely many iterations of the Turing jump. Ramsey theory, in particular, has provided a very large class of constructions and theorems of distinct complexity. One example of the infinite version of a classical theorem of finite graph theory that is computationally and reverse mathematically strictly stronger than 𝖠𝖢𝖠0\mathsf{ACA}_{0} is König’s Duality Theorem (𝖪𝖣𝖳\mathsf{KDT}) for countable graphs. (Every bipartite graph has a matching and a cover consisting of one vertex from each edge of the matching.) The proofs of this theorem for infinite graphs (Podewski and Steffens [21] for countable and Aharoni [1] for arbitrary ones) are not just technically difficult but explicitly used both transfinite recursions and well orderings of all subsets of the given graph. These techniques lie far beyond 𝖠𝖢𝖠0\mathsf{ACA}_{0}. Aharoni, Magidor and Shore [2] proved that this theorem is of great computational strength in that there are computable graphs for which the required matching and cover compute all the iterations of the Turing jump through all computable well-orderings. They also showed that it was strong reverse mathematically as it implied 𝖠𝖳𝖱0\mathsf{ATR}_{0}, the standard system above 𝖠𝖢𝖠0\mathsf{ACA}_{0} used to deal with such transfinite recursions. Some of the lemmas used in each of the then known proofs were shown to be equivalent to the next and final of the big five systems, Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} and of corresponding computational strength. Simpson [27] later provided a new proof of the theorem using logical methods that avoided these lemmas and showed that the theorem itself is equivalent to 𝖠𝖳𝖱0\mathsf{ATR}_{0} and so strictly weaker than the lemmas both computationally and in terms of reverse mathematics.

The situation for the theorems of Halin type that we study here is quite different. The standard proofs do not seem to use such strong methods. Nonetheless, as we mentioned above, the theorems are much stronger than 𝖠𝖢𝖠0\mathsf{ACA}_{0} with some versions not even provable in 𝖠𝖳𝖱0\mathsf{ATR}_{0}. We prove that these theorems occupy a few houses in the area of the reverse mathematics zoo devoted to what are called theorems (or theories) of hyperarithmetic analysis, THAs (Definition 3.13). Computationally, for each computable well ordering α\alpha, there is a computable instance of any THA which has all of its required objects Turing above 0(α)0^{(\alpha)}, the α\alphath iteration of the Turing jump. On the other hand, they are computationally and proof theoretically much weaker than 𝖠𝖳𝖱0\mathsf{ATR}_{0} and so 𝖪𝖣𝖳\mathsf{KDT}. The point here is that there is a single computable graph such that the matching and cover required by 𝖪𝖣𝖳\mathsf{KDT} lies above 0(α)0^{(\alpha)} for all the computable well-orderings α\alpha, while for each computable instance of a THA there is a computable well-ordering α\alpha such that 0(α)0^{(\alpha)} computes the desired object. In our cases, the instances are graphs with arbitrarily many disjoint rays and the desired object is an infinite sequence of disjoint rays. (The general usage of terms like instances and solutions of a theorem or principle is described at the end of §3.)

Beginning with work of Kreisel [15], H. Friedman [9], Steel [29] and others in the 1960s and 1970s and continuing into the last decade (by Montalbán [16, 17], Neeman [19, 20] and others), several axiomatic systems and logical theorems were found to be THAs and proven to lie in a number of distinct classes in terms of proof theoretic complexity. Until now, however, there has been only one mathematical but not logical example, i.e. one not mentioning classes of first order formulas or their syntactic complexity. This was a result (INDEC) about indecomposability of linear orderings in Jullien’s thesis [14] (see Rosenstein [23, Lemma 10.3]). It was shown to be a THA by Montalbán [16].

The natural quest then became to find out if there are any other THAs in the standard mathematical literature. The issue was raised explicitly in Montalbán’s “Open Questions in Reverse Mathematics” [18, Q30]. As our answer, we provide many examples. Most of them are provable in a well known system above 𝖠𝖢𝖠0\mathsf{ACA}_{0} gotten by adding on a weak form of the axiom of choice (Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}).

Several of the basic Halin type theorems (the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} defined after Definition 3.4) have versions (the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} of Definition 6.1) like those appearing in the original papers that show that there are always families of disjoint rays of maximal cardinality which are of the same computational strength as the basic versions (Proposition 6.3 and Corollary 6.4). On the other hand, the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} are strictly stronger proof theoretically than the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} because they imply more induction than is available in Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Theorem 6.8 and Corollary 6.9). Two of the variations we consider are as yet open problems of graph theory ([3] and Bowler, personal communication). We show that if we restrict the class of graphs to directed forests the principles are not only provable but reverse mathematically equivalent to Σ11-𝖠𝖢0+𝖨Σ11\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}+\mathsf{I}\Sigma_{1}^{1}. Note that as 𝖠𝖳𝖱0𝖨Σ11\mathsf{ATR}_{0}\nvdash\mathsf{I}\Sigma_{1}^{1} [28, IX.4.7], these theorems are not provable even in 𝖠𝖳𝖱0\mathsf{ATR}_{0} or from 𝖪𝖣𝖳\mathsf{KDT} (Corollary 6.14). We do not know of other mathematical but nonlogical theorems of this strength. Other versions that require maximal sets of rays (Definition 6.17) are much stronger and, in fact, equivalent to Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} (Theorem 6.18).

2 Outline of Paper

Section 3 discusses basic concepts and background information. The first subsection (3.1) provides what we need from graph theory. Almost all the definitions are standard. At times we give slight variations that are equivalent to the standard ones but make dealing with the computability and proof theoretic analysis easier. We also state the theorems of Halin and some variants that are the main targets of our analysis.

The second subsection (3.2), assumes an intuitive view of computability of functions f:NNf:N\to N such as having an algorithm given by a program in any standard computer language (possibly with access to an “oracle” providing information about a given set or function). It then gives the standard notions and theorems that can be found in basic texts on computability theory needed to follow our analysis of the computational complexity of the graph theoretic theorems we study. In particular, it notes the Turing jump operator and its iterations along countable well orderings. These are our primary computational measuring rods. The final subsection (3.3) provides the syntax and semantics for the formal systems of arithmetic that are used to measure proof theoretic complexity. It also describes the standard basic axiomatic systems and their connections to the computational measures of the previous subsection. It includes the formal definition of the class of theorems which includes most of our graph theoretic examples, the THAs, Theorems (or Theories) of Hyperarithmetic Analysis. These are defined in terms of the transfinite iterations of the Turing jump and the hyperarithmetic sets of the previous subsection. In addition it defines Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} a weak version of the axiom of choice that is an early well known example of such theories and plays a crucial role in our analysis.

Section 4 provides the proof that Halin’s original theorem 𝖨𝖱𝖳\mathsf{IRT} (Definition 3.4) is computationally very complicated. For example, given any iteration 0(α)0^{(\alpha)} of the Turing jump, there is a computable graph satisfying the hypotheses of 𝖨𝖱𝖳\mathsf{IRT} such that any instance of its conclusion computes 0(α)0^{(\alpha)}. Indeed, 𝖨𝖱𝖳\mathsf{IRT} is a THA. At times, theorems or lemmas are stated in terms of the formal systems of §3.3, but the proofs rely only on the computational notions of §3.2.

Section 5 studies several variations 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} of Halin’s 𝖨𝖱𝖳\mathsf{IRT} where we consider directed as well as undirected graphs, edge rather than vertex disjointness for the rays and double as well as single rays. (See Definitions 3.1 and 3.2 and the discussion after Definition 3.4.) We provide reductions over 𝖱𝖢𝖠0\mathsf{RCA}_{0} between many of the pairs of the eight possible variants. The proofs of these reductions proceed purely combinatorially by providing one computational process that takes an instance of some 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}, i.e. a graph satisfying its hypotheses, and produces a graph satisfying the hypotheses of another 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}} and another computable process that takes any solution to the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}} instance, i.e. any sequence of rays satisfying the conclusion of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}}, and produces a solution to the original instance of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}. (See Propositions 5.3, 5.5 and 5.7 and the associated Lemmas. An additional reduction using a stronger base theory is given in the next section (Theorem 6.15).)

We then show that five of the eight possible variants of 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} are THAs (Theorem 5.1). As mentioned in §1, of the remaining three, two are still open problems in graph theory. We do, however, have an analysis of their restrictions to special classes of graphs in Theorem 5.16 and §6.1. The last of the variations, 𝖨𝖱𝖳UED\mathsf{IRT}_{\mathrm{UED}}, has been proven more recently by Bowler, Carmesin, Pott [3] using more sophisticated methods than the other results. We have some lower bounds (Theorem 5.9) but we have yet to fully analyze the complexity of their construction.

In the next section (§6) we study some variations of 𝖨𝖱𝖳\mathsf{IRT} that ask for different types of maximality for the solutions. The first sort actually follow the original formulation of 𝖨𝖱𝖳\mathsf{IRT} in Halin [11]: In any graph there is a set of disjoint rays of maximum cardinality. For uncountable graphs this amounts to a basic counting argument on uncountable cardinals as all rays are countable. When restricted to countable graphs these variations, 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast}, are easily seen to be equivalent to our more modern formulation by induction. Technically, the induction used is for Σ11\Sigma_{1}^{1} formulas (𝖨Σ11\mathsf{I}\Sigma_{1}^{1}) which is not available in 𝖱𝖢𝖠0\mathsf{RCA}_{0}. More specifically we show (Proposition 6.3) that 𝖨𝖱𝖳XYZ+𝖨Σ11\mathsf{IRT}_{\mathrm{XYZ}}+\mathsf{I}\Sigma_{1}^{1} and 𝖨𝖱𝖳XYZ+𝖨Σ11\mathsf{IRT}_{\mathrm{XYZ}}^{\ast}+\mathsf{I}\Sigma_{1}^{1} are equivalent (over 𝖱𝖢𝖠0\mathsf{RCA}_{0}). As the definition of THAs only depends on standard models where full induction holds, if 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} is a THA then so is 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast}.

We then prove that these maximal cardinality variants 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} are strictly stronger proof theoretically than the basic 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} (when they are known to be provable in Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}). This is done by showing (Theorem 6.8 and the Remark that follows it) that the relevant 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} all imply weaker versions of 𝖨Σ11\mathsf{I}\Sigma_{1}^{1} that are analogous to the restrictions of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} embodied in weak (or unique)-Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} and finite-Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} (Definitions 7.1 and 7.2). In all the cases, it is enough induction to prove (with the apparatus of the basic 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}) the consistency of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} and so by Gödel’s second incompleteness theorem they cannot be proved in Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} (Corollary 6.9).

As for proving full Σ11\Sigma_{1}^{1} induction from an 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} we are in much the same situation mentioned above for Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} and 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}. In particular, 𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}}^{\ast} and 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}}^{\ast} for directed forests each proves 𝖨Σ11\mathsf{I}\Sigma_{1}^{1} as well as Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Theorems 6.12 and 6.13) and so are equivalent to 𝖨Σ11+Σ11-𝖠𝖢0\mathsf{I}\Sigma_{1}^{1}+\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}. As before, this shows that they are strictly stronger than Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Corollary 6.14). Indeed, as mentioned at the end of §1, they are not even provable in 𝖠𝖳𝖱0\mathsf{ATR}_{0}. We do not know of any other mathematical theorem with this level of reverse mathematical strength.

The second variation of maximality, 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}}, studied in §6.2 is also mentioned in the original Halin paper [11]. It asks for a set of disjoint rays which is maximal in the sense of set containment. Of course, this follows immediately from Zorn’s Lemma for all graphs. For countable graphs we provide a reverse mathematical analysis, showing that each of the 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} is equivalent to Π11\Pi^{1}_{1}-𝖢𝖠0\mathsf{CA}_{0} (Theorem 6.18).

In §7, we discuss the reverse mathematical relationships between the THAs associated with variations of Halin’s theorem and previously studied THAs as well as one new logical THA (finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} of Definition 7.1). Basically, all the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} (and so 𝖨𝖱𝖳XYZ+𝖨Σ11\mathsf{IRT}_{\mathrm{XYZ}}+\mathsf{I}\Sigma_{1}^{1}) imply H. Friedman’s 𝖠𝖡𝖶0\mathsf{ABW}_{0} (Definition 7.4) by Theorem 7.7 and finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Theorem 7.3). On the other hand, none of them are implied by it (Theorem 7.10) or by Δ11\Delta_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} (Definition 7.8 and Theorem 7.9). 𝖠𝖡𝖶0+𝖨Σ11\mathsf{ABW}_{0}+\mathsf{I}\Sigma_{1}^{1} does, however, imply finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} which is not implied by weak (unique)-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Goh [10]). Figure 4 summarizes many of the known relations with references.

In the penultimate section (§8) we study the only use of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} in each of our proofs of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}. It consists of 𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} which says we can go from the hypothesis that there are arbitrarily many disjoint rays to a sequence Xkk\langle X_{k}\rangle_{k} in which each XkX_{k} is a sequence of kk many disjoint rays. We analyze the strength of the 𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} and the weakenings 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} which each take the existence of such a sequence Xkk\langle X_{k}\rangle_{k} as its hypothesis in place of there being arbitrarily many disjoint rays. For example, for all the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} which are consequences of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} and so are THAs, 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} is equivalent to 𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} over 𝖱𝖢𝖠0\mathsf{RCA}_{0} (Corollary 8.5) and so all of them are also THAs. For the same choices of XYZXYZ, 𝖠𝖢𝖠0\mathsf{ACA}_{0} proves 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. While a natural strengthening of 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} does imply 𝖠𝖢𝖠0\mathsf{ACA}_{0} and indeed is equivalent to it (Theorem 8.9), we do not know if 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} itself implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}. All we can prove is that it is not a consequence of 𝖱𝖢𝖠0\mathsf{RCA}_{0} (Theorem 8.10).

In the last section (§9), we mention some open problems.

3 Basic Notions and Background

We begin with basic notions and terminology from graph theory. At times we use formalizations that are clearly equivalent to more standard ones but are easier to work with computationally or proof-theoretically. The following two subsections supply background and basic information about the standard computational and logical/proof theoretic notions that we use here to measure the complexity of the graph theorems and constructions that we analyze in the rest of this paper. Note that we denote the set of natural numbers by NN when we may be thinking of them in a model of arithmetic as in §3.3 and by \mathbb{N} when we emphasize that we specifically want the standard natural numbers.

3.1 Graph Theoretic Notions

Definition 3.1.

A graph HH is a pair V,E\left\langle V,E\right\rangle consisting of a set VV (of vertices) and a set EE of unordered pairs {u,v}\{u,v\} with uvu\neq v from VV (called edges). These structures are also called undirected graphs (or here U-graphs). A structure HH of the form V,E\left\langle V,E\right\rangle as above is a directed graph (or here D-graph) if EE consists of ordered pairs u,v\left\langle u,v\right\rangle of vertices with uvu\neq v. To handle both cases simultaneously, we often use XX to stand for undirected (U) or directed (D). We then use (u,v)(u,v) to stand for the appropriate kind of edge, i.e. {u,v}\{u,v\} or u,v\langle u,v\rangle.

An  XX-subgraph of the XX-graph HH is an XX-graph H=V,EH^{\prime}=\left\langle V^{\prime},E^{\prime}\right\rangle such that VVV^{\prime}\subseteq V and EEE^{\prime}\subseteq E. It is an induced XX-subgraph if E={(u,v)u,vV(u,v)E}E^{\prime}=\{(u,v)\mid u,v\in V^{\prime}\land(u,v)\in E\}.

Definition 3.2.

An XX-ray in H is a pair consisting of an XX-subgraph H=V,EH^{\prime}=\left\langle V^{\prime},E^{\prime}\right\rangle of HH and an isomorphism ff from NN with edges (n,n+1)(n,n+1) for nNn\in N to HH^{\prime}. (Note that this implies that the range of ff is the set VV^{\prime}.) We say that the ray begins at f(0)f(0). We also describe this situation by saying that HH contains the XX-ray H,f\left\langle H^{\prime},f\right\rangle. We sometimes abuse notation by saying that the sequence f(n)\left\langle f(n)\right\rangle of vertices is an XX-ray in HH. A tail of an XX-ray is a final segment of said XX-ray. Similarly we consider double X-rays where the isomorphism ff is from the integers 𝒵={n,nnN}\mathcal{Z}=\{-n,n\mid n\in N\} with edges (z,z+1)(z,z+1) for z𝒵z\in\mathcal{Z}. A tail of a double XX-ray is a final segment of said XX-ray, or an initial segment of said XX-ray considered in reverse order.

We use ZZ-ray to stand for either a (single) ray (Z=Z={}S) or double ray (Z=Z={}D) and so we have, in general, ZZ-XX-rays or just ZZ-rays if the type of graph (U or D) is already established. For brevity, when we describe rays we will often only list their vertices in order instead of defining HH^{\prime} and ff explicitly. However the reader should be aware that we always have HH^{\prime} and ff in the background.

HH contains kk many ZZ-XX-rays for kNk\in N if there is a sequence Hi,fii<k\left\langle H_{i},f_{i}\right\rangle_{i<k} such that each Hi,fi\left\langle H_{i},f_{i}\right\rangle is a ZZ-XX-ray in HH (with Hi=Vi,EiH_{i}=\left\langle V_{i},E_{i}\right\rangle). HH contains kk many disjoint (or vertex-disjoint) ZZ-XX-rays if the ViV_{i} are pairwise disjoint. HH contains kk many edge-disjoint ZZ-XX-rays if the EiE_{i} are pairwise disjoint. We often use YY to stand for either vertex (V) or edge (E) as in the following definitions.

An XX-graph HH contains arbitrarily many Y-disjoint Z-X-rays if it contains kk many such rays for every kNk\in N.

An XX-graph HH contains infinitely many Y-disjoint Z-X-rays if there is an XX-subgraph H=V,EH^{\prime}=\left\langle V^{\prime},E^{\prime}\right\rangle of HH and a sequence Hi,fiiN\left\langle H_{i},f_{i}\right\rangle_{i\in N} such that each Hi,fi\left\langle H_{i},f_{i}\right\rangle is a ZZ-XX-ray in HH (with Hi=Vi,EiH_{i}=\left\langle V_{i},E_{i}\right\rangle) such that the ViV_{i} or EiE_{i}, respectively for Y=V,EY=V,E, are pairwise disjoint and V=ViV^{\prime}=\bigcup V_{i} and E=EiE^{\prime}=\bigcup E_{i}.

Definition 3.3.

An XX-path PP in an XX-graph HH is defined similarly to single rays except that the domain of ff is a proper initial segment of NN instead of NN itself. Thus they are finite sequences of distinct vertices with edges between successive vertices in the sequence. If P=x0,,xnP=\left\langle x_{0},\ldots,x_{n}\right\rangle is a path, we say it is a path of length nn between x0x_{0} and xnx_{n}. Our notation for truncating and combining paths P=x0,,xnP=\left\langle x_{0},\ldots,x_{n}\right\rangle, Q=y0,,ymQ=\left\langle y_{0},\ldots,y_{m}\right\rangle and R=z0,,zlR=\left\langle z_{0},\ldots,z_{l}\right\rangle is as follows: xiP=xi,,xnx_{i}P=\left\langle x_{i},\ldots,x_{n}\right\rangle, Pxi=x0,,xiPx_{i}=\left\langle x_{0},\ldots,x_{i}\right\rangle, and we use concatenation in the natural way, e.g., if the union of PxPx, xQyxQy and yRyR is a path, we denote it by PxQyRPxQyR. We treat rays as we do paths in this notation, as long as it makes sense, writing, for example, xiRx_{i}R for the ray which is gotten by starting RR at an element xix_{i} of RR; RxiRx_{i} is the path which is the initial segment of RR ending in xix_{i} and we use concatenation as for paths as well.

The starting point of the work in this paper is a theorem of Halin [11] that we call the infinite ray theorem as expressed in [5, Theorem 8.2.5(i)].

Definition 3.4 (Halin’s Theorem).

𝖨𝖱𝖳\mathsf{IRT}, the infinite ray theorem, is the principle that every graph HH which contains arbitrarily many disjoint rays contains infinitely many disjoint rays.

The versions of Halin’s theorem which we consider in this paper allow for HH to be an undirected or a directed graph and for the disjointness requirement to be vertex or edge. We also allow the rays to be single or double. The corresponding versions of Halin’s Theorem are labeled as 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} for appropriate values of X,X, YY and ZZ to indicate whether the graphs are undirected or directed (X=X={}U or D); whether the disjointness refers to the vertices or edges (Y=Y={}V or E) and whether the rays are single or double (Z=Z={}S or D), respectively, in the obvious way. We often state a theorem for several or all XYZXYZ and then in the proof use “graph”, “edge” and “disjoint” unmodified with the intention that the proof can be read for any of the cases. This is convenient for minimizing repetition in some of our arguments.

We will also consider restrictions of these theorems to specific families of graphs. We need a few more notions to define them.

Definition 3.5.

A tree is a graph TT with a designated element rr called its root such that for each vertex vrv\neq r there is a unique path from rr to vv. A branch on (or in) TT is a ray that begins at its root. We denote the set of its branches by [T][T] and say that TT is well-founded if [T]=[T]=\emptyset and otherwise it is ill-founded. A forest is an effective disjoint union of trees, or more formally, a graph with a designated set RR (of vertices called roots) such that for each vertex vv there is a unique rRr\in R such that there is a path from rr to vv and, moreover, there is only one such path. In general, the effectiveness we assume when we take effective disjoint unions of graphs means that we can effectively (i.e. computably) uniquely identify each vertex in the union with the original vertex (and the graph to which it belongs) which it represents in the disjoint union.

A directed tree is a directed graph T=V,ET=\left\langle V,E\right\rangle such that its underlying graph T^=V,E^\hat{T}=\langle V,\hat{E}\rangle where E^={{u,v}u,vEv,uE}\hat{E}=\{\{u,v\}\mid\langle u,v\rangle\in E\,\vee\,\langle v,u\rangle\in E\} is a tree. A directed forest is a directed graph whose underlying graph is a forest.

Definition 3.6.

An XX-graph HH is locally finite if, for each uVu\in V, the set {vE(u,v)E(v,u)E}\{v\in E\mid(u,v)\in E\lor(v,u)\in E\} of neighbors of uu is finite. A locally finite XX-tree is also called finitely branching. (Note this does not mean there are finitely many branches in the tree.)

Of course, there are many well known equivalent definitions of trees and associated notions. We have given one possible set of graph-theoretic definitions. In the case of undirected graphs our definition is equivalent to all the standard ones. Readers are welcome to think in terms of their favorite definition. Note, however, we are restricting ourselves to what would (in set theory) be called countable trees with all nodes of finite rank. Thus, we typically think of trees as subtrees of N<NN^{<N}, i.e. the sets of finite strings of numbers (as vertices) with an edge between σ\sigma and τ\tau if and only if they differ by one being an extension of the other by one element, e.g. σk=τ\sigma\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}k=\tau.

It does not seem as if there is a single standard definition for directed graphs being directed trees. We have picked one that seems to be at least fairly common and works for the only situations for which we consider them in Theorems 5.16, 6.12, and 6.13 and Corollary 6.14.

3.2 Computability Hierarchies

While we may cite results about uncountable graphs, all sets and structures actually studied in this paper will be countable. Thus for purposes of defining their complexity, we can think of all of them as being subsets of, or relations or functions on, \mathbb{N}.

We do not give a formal stand alone definition of computability for sets or functions but assume an at least intuitive grasp of some model of computation such as by a Turing or Register machine that has unbounded memory and is allowed to run for unboundedly many steps. (We do provide in §3.3 a definition via definability in arithmetic that is equivalent to the formal versions of machine model definitions.) Thus we say a function f:f:\mathbb{N}\rightarrow\mathbb{N} is computable if there is a program for one of these machines that computes f(n)f(n) as output when given input nn. A set XX is computable if its characteristic function {0,1}\mathbb{N}\rightarrow\{0,1\} is computable. Note that as the alphabets or our languages are finite, there are only countably many programs and as our formation rules are effective, we have a computable list of the programs and hence one, Φe\Phi_{e}, of the partial functions they compute. (They are only partial as, of course, some programs fail to halt on some inputs.)

Fundamental to measuring the relative computational complexity of sets or functions is the notion of machines with oracles and Turing reduction. Given a set XX or function ff we consider machines augmented by the ability to produce X(n)X(n) or f(n)f(n) if it has already produced nn. We say that such a machine is one with an oracle for XX or ff. We then say that XX is computable from (or Turing reducible to) YY if there is a machine with oracle YY which computes XX via some reduction ΦeX\Phi_{e}^{X}. We write this as XTYX\leq_{T}Y. We say XX is of the same (Turing) degree as YY, XTYX\equiv_{T}Y, if XTYX\leq_{T}Y and YTXY\leq_{T}X. We use all the same terminology and notations for functions.

The first level beyond the computable in our basic hierarchy of computable complexity is given by the halting problem H={e|Φe(e)H=\{e|\Phi_{e}(e) converges}\} that is HH is set of ee such that the computation of the eeth machine Φe\Phi_{e} on input ee eventually halts. We then define an operator on sets XX={e|ΦeX(e)X\longmapsto X^{\prime}=\{e|\Phi_{e}^{X}(e) converges}\} that is XX^{\prime} is the set of ee such that the computation of the eeth machine with oracle XX, ΦeX\Phi_{e}^{X} on input ee eventually halts. (It is easy to see that HTH\equiv_{T}\emptyset^{\prime}.) The crucial fact here is the undecidability of the halting problem (for every oracle), i.e. for every XX, XX^{\prime} is strictly above XX in terms of Turing computability. The other basic fact that we need about \emptyset^{\prime} is that it is computably enumerable, i.e. there is a computable function ff whose range is \emptyset^{\prime}. If f(s)=xf(s)=x we say that xx is enumerated in, or enters, 00^{\prime} at (stage) ss. If we view HH as defined by using the empty oracle \emptyset, the procedure that takes us from the halting problem to the Turing jump by replacing \emptyset as oracle by XX is an instance of a general procedure called relativization. It takes any computable function or proof about computable functions or degrees (i.e. ones with oracle \emptyset) to the same function, or proof about functions, computable in XX (or degrees above that of XX). Almost always this procedure trivially transforms correct proofs with oracle \emptyset to ones with arbitrary oracle XX. Typically, this transformation keeps the same programs doing the required work with any oracle. For example, XX^{\prime} is computably enumerable in XX (or relative to XX), i.e. there is a function ΦeX\Phi_{e}^{X} whose range is XX^{\prime} and this can be taken to be the same ee such that Φe\Phi_{e}^{\emptyset} enumerates \emptyset^{\prime}. We also use XsX_{s}^{\prime} to denote the set of numbers enumerated in (or that have entered) XX^{\prime} by stage ss. This phenomena of the procedure or result not depending on the particular oracle or depending in a fixed computable way on some other parameters is described as its being uniform in the oracle or other parameters. We describe an important example of uniformity in Remark 3.9.

We can now generate a hierarchy of computational complexity by iterating the jump operator beginning with any set XX: X(0)=XX^{(0)}=X; X(n+1)=(Xn)X^{(n+1)}=(X^{n})^{\prime}. While the finite iterations of the jump capture most construction techniques and theorems in graph theory (and most other areas of classical countable/separable mathematics), we will be interested in ones that go beyond such techniques and proofs. The basic idea is that we continue the hierarchy by iteration into the transfinite while still tying the iteration to computable procedures.

Definition 3.7.

We represent well-orderings or ordinals α\alpha as well-ordered relations on \mathbb{N}. Typically such ordinal notations are endowed with various additional structure such as identifying 0, successor and limit ordinals and specifying cofinal ω\omega-sequences for the limit ordinals. If we have a representation of α\alpha then restricting the well-ordering to numbers in its domain provides representations of each ordinal β<α\beta<\alpha. We generally simply work with ordinals and omit concerns about translating standard relations and procedures to the representation. An ordinal is recursive (in a set XX) if it has a recursive (in XX) representation. For a set XX and ordinal (notation) α\alpha computable from XX, we define the transfinite iterations X(β)X^{(\beta)} of the Turing jump of XX by transfinite induction on βα\beta\leq\alpha: X(0)=XX^{(0)}=X; X(β+1)=(Xβ)X^{(\beta+1)}=(X^{\beta})^{\prime} and for a limit ordinal λ\lambda, X(λ)={X(β)|β<λ}={β×X(β)|β<λ}X^{(\lambda)}=\oplus\{X^{(\beta)}|\beta<\lambda\}=\cup\{\beta\times X^{(\beta)}|\beta<\lambda\} (or as the effective disjoint sum over the X(β)X^{(\beta)} in the specified cofinal sequence in λ\lambda).

Definition 3.8.

HYP(X)\mathrm{HYP}(X), the collection of all sets hyperarithmetic in XX consists of those sets recursive in some X(α)X^{(\alpha)} for α\alpha an ordinal recursive in XX. We say that YY is hyperarithmetic in XX or hyperarithmetically reducible to XX , YhXY\leq_{h}X if YHYP(X)Y\in\mathrm{HYP}(X).

These sets too, will be characterized by a definability class in arithmetic in §3.3. For now we just note that they clearly go far beyond the sets computable from the finite iterations of the jump.

The computational strength of our graph theoretic theorems such as 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} is measured by this hierarchy as we will show that, for every set XX and every set YY hyperarithmetic in XX, there is a graph GG computable from XX which satisfies the hypotheses of 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} but for which any collection of rays satisfying its conclusion computes YY. On the other hand, placing an upper bound on the strength of 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} requires analyzing its proof and the principles used in it. The relevant one is a form of the axiom of choice. We define it in the next subsection along with a general class of such principles, the theorems/theories of hyperarithmetic analysis which are, computationally, the primary objects of our analysis in this paper.

We note one important well known basic fact relating the jumps of XX to trees computable from XX. We will need it for our proofs that 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} and its variants are computationally complex enough to compute all the sets hyperarithmetic in any given set XX (as the instances of the graphs range over graphs computable from XX).

Remark 3.9.

For any set XX and any ordinal α\alpha computable from XX, there is a sequence Tβ|β<α\left\langle T_{\beta}|\beta<\alpha\right\rangle computable from XX of trees (necessarily) computable from XX such that each tree has exactly one branch PβP_{\beta} and PβP_{\beta} is of the same complexity as X(β)X^{(\beta)}, i.e. PβTX(β)P_{\beta}\equiv_{T}X^{(\beta)}. The procedure for computing this sequence is uniform in XX and the index for the program computing the well ordering α\alpha from XX, i.e. there is one computable function that when given an oracle for XX, an index for α\alpha (i.e. the ii such that ΦiX\Phi_{i}^{X} is the well ordering α\alpha) and a β\beta in the ordering, computes the whole sequence Tβ|β<α\left\langle T_{\beta}|\beta<\alpha\right\rangle and the indices for the reductions between PβP_{\beta} and X(β)X^{(\beta)}. (See, e.g. [25, Theorem 2.3]). We may also easily assure that the TβT_{\beta} are effectively disjoint so that their union is a forest.

Some versions of the variations on 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} (see §6.2) that call for types of maximality for the infinite set of disjoint rays are stronger both computationally and proof theoretically than the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} described above. Their computational strength is captured by a kind of jump operator that goes beyond all the hyperarithmetic ones. It captures the ability to tell if a computable ordering is a well-ordering.

Definition 3.10.

The hyperjump of XX, 𝒪X\mathcal{O}^{X}, is the set {e|ΦeX\{e|\Phi_{e}^{X} is (the characteristic function of) a subtree of <\mathbb{N}^{<\mathbb{N}} which is well-founded}\}.

This operator also corresponds to a syntactically defined level of comprehension as we note in §3.3.

3.3 Logical and Axiomatic Hierarchies

The basic notions from logic that we need here are those of languages, structures and axiomatic systems and proofs. As we will deal only with countable sets and structures, we can assume that we are dealing just with the natural numbers with a way to define and use sets and functions on them. Thus, at the beginning, we have in mind the natural numbers \mathbb{N} along with the usual apparatus of the language of (first order) arithmetic, say +,×,<,0+,\times,<,0 and 11 along with the syntax of standard first order logic (the Boolean connectives ,\vee,\wedge and ¬\lnot; the variables such as xx and yy ranging over the numbers with the usual quantifiers x\forall x and y\exists y as well as the standard equality relation ==). A structure for this language is a set NN along with elements for 0 and 11, binary functions for ++ and ×\times and a binary relation for <<. We also need a way of talking about subsets of (or functions on) the numbers. We follow the standard practice in reverse mathematics of using sets and defining functions in terms of their graphs. So we expand our language by adding on new classes of (second order) variables such as XX and YY and the associated quantifiers X\forall X and Y\exists Y along with a new relation symbol \in between numbers and sets.

A structure for this language is one of the form 𝒩=N,S,+,×,<,0,1,\mathcal{N}=\left\langle N,S,+,\times,<,0,1,\in\right\rangle where its restriction N,+,×,<,0,1\left\langle N,+,\times,<,0,1\right\rangle is a structure for first order arithmetic and S2NS\subseteq 2^{N} is a specified nonempty collection of subsets of NN disjoint from NN, the set of “numbers” of 𝒩\mathcal{N}, over which the second order quantifiers and variables of our language range. It is called the second order part of 𝒩\mathcal{N}. The usual membership symbol \in always denotes the standard membership relation between elements of NN and subsets of NN that are in SS and the language only allows atomic formulas using \in which are of the form tXt\in X for tt a term of the first order language and XX is a second order variable. So a sentence Θ\Theta is true in NN, 𝒩Θ\mathcal{N}\vDash\Theta, if first order quantification is interpreted as ranging over NN, second order quantification ranges over SS and the relations and functions of the language are as described. This specifies the semantics for second order arithmetic. Note that, following [28], we do not take equality for sets to be a primitive relation on this structure. The notation for it is viewed as being defined by A=Bn(nAnB)A=B\leftrightarrow\forall n(n\in A\leftrightarrow n\in B).

Proof theoretic notions deal with all possible structures for the language and axiom systems to specify what we need in any particular argument. For most of our purposes and all of the computational ones, one can restrict attention to standard models of arithmetic, i.e. ones 𝒩\mathcal{N} with N=N=\mathbb{N} and some S2S\subseteq 2^{\mathbb{N}} with the usual interpretations of the functions and relations. We generally abbreviate these structures as ,S\left\langle\mathbb{N},S\right\rangle with S2S\subseteq 2^{\mathbb{N}}, or simply SS, as all the functions and relations are then fixed.

We view the syntax as one for a two sorted first order logic. So the (first order) variables x,y,x,y,\ldots range over the first sort (NN) and the second order ones X,Y,X,Y,\ldots over the second sort (SS). We assume any standard proof theoretic system with the caveat that == is interpreted as true equality with the equality axioms included only for NN. For SS it is a relation defined as above. This generates the provability notion \vdash used above to define our notion of logical strength and equivalences of theories (sets of sentences often called axioms) as above. We now define the standard weak base theory 𝖱𝖢𝖠0\mathsf{RCA}_{0} used to define the logical strength of mathematical theorems as described above. We then define a few other common systems that will be used later. The formal details can be found in [28].

Each axiomatic subsystem of second order arithmetic that we consider contains the standard basic axioms for ++, ×\times, and << (which say that NN is a discrete ordered semiring) and an Induction Axiom:

(𝖨0\mathsf{I}_{0}) (X)((0Xn(nXn+1X))n(nX))(\forall X)((0\in X\land\forall n\,(n\in X\rightarrow n+1\in X))\rightarrow\forall n\,(n\in X)).

Typically axiom systems for second order arithmetic are defined by adding various types of set existence axioms although at times additional induction axioms are used as well. In order to define them we need to specify various standard syntactic classes of formulas determined by quantifier complexity. As usual, we add to our language bounded quantifiers x<t\forall x<t and x<t\exists x<t for first order (i.e. arithmetic) terms tt defined in the standard way. We typically denote formulas by capital Greek letters except that the indexed Φe\Phi_{e} and ΦeX\Phi_{e}^{X} refer, as above, to our fixed enumeration of the Turing machines and associated partial functions.

Definition 3.11.

The Σ00\Sigma_{0}^{0} and Π00\Pi_{0}^{0} formulas of second order arithmetic are just the ones with only bounded quantifiers but we allow parameters for elements of either NN or S(𝒩)S(\mathcal{N}) when working with a structure 𝒩\mathcal{N}. Proceeding inductively, a formula Φ\Phi is Σn+10\Sigma_{n+1}^{0} (Πn+10\Pi_{n+1}^{0}) if it is of the form xΨ\exists x\Psi (xΨ\forall x\Psi) where Ψ\Psi is Πn0\Pi_{n}^{0} (Σn0\Sigma_{n}^{0}). We assume some computable coding of all these formulas (viewed as strings of symbols from our language) by natural numbers. We say Φ\Phi is arithmetic if it is Σn0\Sigma_{n}^{0} or Πn0\Pi_{n}^{0} for some nn\in\mathbb{N}. It is Σ11\Sigma_{1}^{1} (Π11\Pi_{1}^{1}) if it is of the form XΨ\exists X\Psi (XΨ\forall X\Psi) where Ψ\Psi is arithmetic. (One can continue to define Σn1\Sigma_{n}^{1} and Πn1\Pi_{n}^{1} in the natural way but we will not need to consider such formulas here.) We say a set XX is in one of these classes Γ\Gamma relative to AA (i.e. with AA as a parameter) if there is a formula Ψ(n,A)Γ\Psi(n,A)\in\Gamma such that nXΨ(n,A)n\in X\Leftrightarrow\Psi(n,A). If XX is both Σni\Sigma_{n}^{i} in AA and Πni\Pi_{n}^{i} in AA it is called Δni\Delta_{n}^{i} in AA.

We mention a few additional standard connections between the syntactic complexity of the definition of a set XX and XX’s properties in terms of computability and graph theoretic notions. They can all be found in [22].

Proposition 3.12.

The sets A(n)A^{(n)} are Σn0\Sigma_{n}^{0} in AA. A set XX is computable in AA if and only if it is Δ10\Delta_{1}^{0} in AA. More generally, it is computable in A(n)A^{(n)} if and only if it is Δn+10\Delta_{n+1}^{0} in AA. It is hyperarithmetic in AA if and only if it is Δ11\Delta_{1}^{1} in AA. There is a computable function f(e,n)f(e,n) such that if XX is Σ11\Sigma_{1}^{1} in AA via the Σ11\Sigma_{1}^{1} formula with code ee then for every nn, Φf(e,n)A\Phi_{f(e,n)}^{A} is (the characteristic function of) a tree TT such that nXTn\in X\Leftrightarrow T has a branch.

The first system for analyzing the proof theoretic strength of theorems and theories in reverse mathematics is just strong enough to prove the existence of the computable sets and so supplies us with all the usual computable functions such as pairing n,m\left\langle n,m\right\rangle or more generally those coding finite sequences as numbers. In particular, it provides the predicates defining the (codes ee of) the partial computable functions ΦeX\Phi_{e}^{X} and the relations saying the computation ΦeX(n)\Phi_{e}^{X}(n) halts in ss many steps with output yy. Thus we have the basic tools to define and discuss Turing reducibility and the Turing jump. It is our weak base theory and is assumed to be included in every system we consider.

(𝖱𝖢𝖠0\mathsf{RCA}_{0}) Recursive Comprehension Axioms: In addition to the ones mentioned above, its axioms include the schemes of recursive (generally called Δ10\Delta_{1}^{0}) comprehension and Σ10\Sigma_{1}^{0} induction:

(Δ10\Delta_{1}^{0}-𝖢𝖠\mathsf{CA}) n(Φ(n)Ψ(n))Xn(nXΦ(n))\forall n\,(\Phi(n)\leftrightarrow\Psi(n))\rightarrow\exists X\,\forall n\,(n\in X\leftrightarrow\Phi(n)) for all
Σ10\Sigma_{1}^{0} formulas Φ\Phi and Π10\Pi_{1}^{0} formulas Ψ\Psi in which XX is not free.
(𝖨Σ10\mathsf{I}\Sigma_{1}^{0}) (Φ(0)n(Φ(n)Φ(n+1)))nΦ(n)(\Phi(0)\land\forall n\,(\Phi(n)\rightarrow\Phi(n+1)))\rightarrow\forall n\,\Phi(n) for all Σ10\Sigma_{1}^{0} formulas Φ\Phi.

Note that these formulas may have free set or number variables. As usual, the existence assertion X.\exists X.... of the axiom is taken to mean that for each instantiation of the free variables (by numbers or sets, as appropriate, called parameters) there is an XX as described. We take this for granted as well as the restriction that the XX is not free in the rest of the formula in all of the set existence axioms of any of our systems.

𝖱𝖢𝖠0\mathsf{RCA}_{0} suffices to define and manipulate the basic notions of computability listed at the beginning of Section 3.2. The standard models of 𝖱𝖢𝖠0\mathsf{RCA}_{0} are just those whose second order part is closed under Turing reduction and disjoint union (XY={0,xxX}{1,yyY}X\oplus Y=\{\left\langle 0,x\right\rangle\mid x\in X\}\cup\{\left\langle 1,y\right\rangle\mid y\in Y\}). As suggested above what are now often called the computable in AA sets which are, as mentioned above, the Δ10\Delta_{1}^{0} in AA sets, were originally called the sets recursive in AA. Hence the terminology in RCA0.

Any axiom system we consider from now on will be assumed to include 𝖱𝖢𝖠0\mathsf{RCA}_{0}. If we have some axiom scheme or principle 𝖠𝖡𝖢\mathsf{ABC} we typically denote the system formed by adding it to 𝖱𝖢𝖠0\mathsf{RCA}_{0} by 𝖠𝖡𝖢0\mathsf{ABC}_{0}. We next move up to the arithmetic comprehension axiom and its system.

(𝖠𝖢𝖠\mathsf{ACA}) Xn(nXΦ(n))\exists X\,\forall n\,(n\in X\leftrightarrow\Phi(n)) for every arithmetic formula Φ\Phi.

As mentioned above the X(n)X^{(n)} are defined by a Σn0\Sigma_{n}^{0} formula with XX as a parameter. So one can show that this system is equivalent (over RCA0) to the totality of the Turing jump operator, i.e. for every XX, XX^{\prime} exists. Its standard models are those of RCA0 whose second order part is also closed under Turing jump. It is also equivalent (in the sense of reverse mathematics) to König’s Lemma, which asserts that every finitely branching tree with paths of arbitrarily long length has a branch.

In general, we say one system of axioms SS is logically or reverse mathematically reducible to another TT over one RR if RTψR\cup T\vdash\psi for every sentence ψS\psi\in S. Note that SS and/or TT may be a single sentence or theorem. We say that SS and TT are equivalent over RR if each is reducible to the other. If no system RR is specified we assume that RCA0 is intended.

As we will not deal with it, we have omitted the formal definition of the usual system WKL0 which falls strictly between RCA0 and ACA0. It is characterized by the restriction of König’s Lemma to trees that are subsets of 2<N2^{<N}, the tree of finite binary strings under extension.

The next system of the five basic ones after ACA0 is ATR0. Its defining axiom says that arithmetic comprehension can be iterated along any countable well-order and so implies the existence of the sets hyperarithmetic in XX for each XX but is computationally stronger than this assumption. As usual the formal definition can be found in [28].

Instead, we formally describe the computationally defined class of theorems/theories that are the main focus of this paper and include several variations of 𝖨𝖱𝖳\mathsf{IRT}. The definition is semantic, not axiomatic and involves only standard models. (Indeed by Van Wesep [30, 2.2.2], there can be no axiomatic characterization of this class in second order arithmetic.)

Definition 3.13.

A sentence (theory) TT is a theorem (theory) of hyperarithmetic analysis (THA) if

  1. 1.

    For every XX\subseteq\mathbb{N}, ,HYP(X)T\langle\mathbb{N},\mathrm{HYP}(X)\rangle\vDash T and

  2. 2.

    For every S2S\subseteq 2^{\mathbb{N}}, if ,ST\langle\mathbb{N},S\rangle\vDash T and XSX\in S then HYP(X)S\mathrm{HYP}(X)\subseteq S.

It is worth pointing out some of the relations between THAs and 𝖠𝖳𝖱0\mathsf{ATR}_{0}. THAs are defined by only using standard models and iterations of the jump over true well orderings. 𝖠𝖳𝖱0\mathsf{ATR}_{0} talks about all models of 𝖱𝖢𝖠0\mathsf{RCA}_{0} and asserts the existence of iterates of the jump over all orderings that appear well-founded in the model. Thus for standard models it implies the second clause of the definition of THAs (Definition 3.13). On the other hand, there is a recursive linear order with no hyperarithmetic infinite descending sequence and so it seems well-founded in HYP\mathrm{HYP} but it has a well-founded part longer than every recursive ordinal. Thus iterating the jump along this ordering would yield a set strictly Turing above every hyperarithmetic set. In particular, HYP\mathrm{HYP} is not a model of 𝖠𝖳𝖱0\mathsf{ATR}_{0} which therefore is not a THA. (See e.g. [28, V.2.6].)

The last of the standard axiomatic systems, Π11\Pi_{1}^{1}-CA0, is characterized by the comprehension axiom for Π11\Pi_{1}^{1} formulas:

(Π11\Pi_{1}^{1}-CA) Xk(kXΦ(k))\exists X\,\forall k\,(k\in X\leftrightarrow\Phi(k)) for every Π11\Pi_{1}^{1} formula Φ(k)\Phi(k).

Remark 3.14.

The hyperjump, TXT^{X}, is clearly a Π11\Pi_{1}^{1} set with parameter XX. In fact, every Π11\Pi_{1}^{1} set with parameter XX is reducible to TXT^{X}. Indeed, there is a computable function f(e,n)f(e,n) such that for every index ee for a Π11\Pi_{1}^{1} formula Ψ(n)\Psi(n) with parameter XX and every nn, Ψ(n)f(e,n)TX\Psi(n)\Leftrightarrow f(e,n)\in T^{X} [22, Corollary 16.XX(b)]. Thus Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} corresponds to closure under the hyperjump. We will see it appear as equivalent to a version of 𝖨𝖱𝖳\mathsf{IRT} where we ask for a maximal set of disjoint rays in Theorem 6.18.

For this paper, the most important other existence axiom is a restricted form of the axiom of choice.

(Σ11\Sigma_{1}^{1}-AC) nXΦ(n,X)XnΦ(n,X[n])\forall n\exists X\Phi(n,X)\rightarrow\exists X\forall n\Phi(n,X^{[n]}) where Φ\Phi is arithmetic and X[n]={mn,mX}X^{[n]}=\{m\mid\langle n,m\rangle\in X\} is the nthnth column of XX.

A more common but clearly equivalent version of this axiom allows Φ\Phi to be Σ11\Sigma_{1}^{1}. A variant commonly called weak-Σ11\Sigma_{1}^{1}-AC (introduced in Definition 7.1 as unique-Σ11\Sigma_{1}^{1}-AC) requires the corresponding Φ\Phi to be arithmetic. To make these and other choice axioms uniform we have adopted the format required elsewhere and equivalent here to be used for all the variations. The system Σ11\Sigma_{1}^{1}-AC0 is well known to be a THA (essentially in Kreisel [15]). Thus it is strictly stronger than 𝖠𝖢𝖠0\mathsf{ACA}_{0}. On the other hand, it is strictly weaker than 𝖠𝖳𝖱0\mathsf{ATR}_{0}. (It is known that 𝖠𝖳𝖱0Σ11\mathsf{ATR}_{0}\vdash\Sigma_{1}^{1}-AC0 [28, V.8.3] but the converse fails as Σ11\Sigma_{1}^{1}-AC is true in HYP while 𝖠𝖳𝖱0\mathsf{ATR}_{0}, as we have pointed out, is not.) This choice axiom plays a crucial role in our analysis because we provide the upper bound on the strength of most of our theorems by showing that they follow from Σ11\Sigma_{1}^{1}-AC0. This provides the computational upper bound for being a THA as any consequence of a THA must also satisfy Definition 3.13(1). Thus the bulk of our proofs for the computational complexity of the theorems we study consist of showing that they imply Definition 3.13(2), i.e. closure under “hyperarithmetic in”.

Over the past fifty years, several other logical axioms have been shown to be THA. We will discuss some of them in §7. However, as we discussed in §1, only one somewhat obscure purely mathematical theorem was previously known to be a THA. We provide several more in this paper (Theorem 5.1, Corollary 6.4 and Theorem 6.13). We also introduce a new logical axiom, finite-Σ11-𝖠𝖢\Sigma_{1}^{1}\text{-}\mathsf{AC} (Definition 7.2) which is a THA as well.

For those interested in the proof theory and so nonstandard models, we also at times explicitly consider the induction axiom at the same Σ11\Sigma_{1}^{1} level.

(𝖨Σ11\mathsf{I}\Sigma_{1}^{1}) (Φ(0)n(Φ(n)Φ(n+1)))nΦ(n)(\Phi(0)\land\forall n(\Phi(n)\rightarrow\Phi(n+1)))\rightarrow\forall n\Phi(n) for every Σ11\Sigma_{1}^{1} formula Φ\Phi.

This axiom does not imply the existence of any infinite sets and is, of course, true in every standard model. Thus the readers interested only in the computational complexity of the Halin type theorems can safely ignore these considerations.

It is in the nature of reverse mathematics that sentences and sets of sentences of second order arithmetic are often viewed in several different ways. In different contexts they may be seen as mathematical or logical principles, axioms, axiom schemes, theories, theorems or the like. We point out what may be a less familiar terminology that is currently popular. What might be seen as a typical axiom or theorem asserting that for every XX of some sort there is a YY with some relation to XX, i.e. a sentence of the form X(Φ(X)YΨ(X,Y))\forall X(\Phi(X)\rightarrow\exists Y\Psi(X,Y)) may be called a principle. With this terminology come the notions of an instance of the principle, i.e. an XX satisfying Φ\Phi and a solution for XX, i.e. a YY such that Ψ(X,Y)\Psi(X,Y) holds.

4 𝖨𝖱𝖳\mathsf{IRT} and Hyperarithmetic Analysis

We devote this section to the proof of

Theorem 4.1.

𝖨𝖱𝖳\mathsf{IRT} is a theorem of hyperarithmetic analysis.

In this section we consider only vertex-disjoint single rays in undirected graphs, as in the statement of 𝖨𝖱𝖳\mathsf{IRT}. The proof of Theorem 4.1 will be split into two parts. The first part verifies that 𝖨𝖱𝖳\mathsf{IRT} satisfies the second clause of Definition 3.13.

Theorem 4.2.

Every standard model of 𝖱𝖢𝖠0+𝖨𝖱𝖳\mathsf{RCA}_{0}+\mathsf{IRT} is closed under hyperarithmetic reduction.

Proof.

Fix a standard model \mathcal{M} of 𝖱𝖢𝖠0+𝖨𝖱𝖳\mathsf{RCA}_{0}+\mathsf{IRT}. First, we show that \mathcal{M} contains \emptyset^{\prime}. By relativizing the proof, it follows that \mathcal{M} is closed under Turing jump.

For each nn, consider the tree TnN<NT_{n}\subseteq N^{<N} consisting of all strings of the form s0ts^{\smallfrown}0^{t} such that some number below nn is enumerated into \emptyset^{\prime} at stage ss, and either tst\leq s or sn=tn\emptyset_{s}^{\prime}\upharpoonright n=\emptyset_{t}^{\prime}\upharpoonright n. Observe that TnT_{n} has a unique computable branch {s0ttN}\{s^{\smallfrown}0^{t}\mid t\in N\mathbb{\}}, where ss is the smallest number such that n=sn\emptyset^{\prime}\upharpoonright n=\emptyset_{s}^{\prime}\upharpoonright n.

Consider the disjoint union nTn\bigsqcup_{n}T_{n}. Observe that nTn\bigsqcup_{n}T_{n} satisfies the premise of 𝖨𝖱𝖳\mathsf{IRT} (in \mathcal{M}), because each TnT_{n} has a computable branch. Apply 𝖨𝖱𝖳\mathsf{IRT} to nTn\bigsqcup_{n}T_{n} to obtain a sequence Rii\langle R_{i}\rangle_{i} of disjoint rays in nTn\bigsqcup_{n}T_{n}. Each RiR_{i} is contained in some TnT_{n}. We can, uniformly in ii, extend or truncate RiR_{i} to the unique branch PnP_{n} of TnT_{n}. Hence Rii\langle R_{i}\rangle_{i} computes a sequence of infinitely many distinct branches PnP_{n}, which in turn computes longer and longer initial segments of \emptyset^{\prime}. This proves that \mathcal{M} contains \emptyset^{\prime}.

Next we show that, for each computable limit ordinal λ\lambda, if \mathcal{M} contains (α)\emptyset^{(\alpha)} for every α<λ\alpha<\lambda then \mathcal{M} contains (λ)\emptyset^{(\lambda)}. (Again the desired result follows by relativization.) By Proposition 3.9, there is a computable sequence Tββ<λ\langle T_{\beta}\rangle_{\beta<\lambda} of trees such that each tree has exactly one branch PβT(β)P_{\beta}\equiv_{T}\emptyset^{(\beta)} with these reductions computed uniformly. Fix an increasing computable sequence αnn\langle\alpha_{n}\rangle_{n} which is cofinal in λ\lambda and consider the disjoint union nTαn\bigsqcup_{n}T_{\alpha_{n}}. Observe that nTαn\bigsqcup_{n}T_{\alpha_{n}} satisfies the premise of 𝖨𝖱𝖳\mathsf{IRT} (in \mathcal{M}): for each nn, (αn)\emptyset^{(\alpha_{n})} computes the branches PαmP_{\alpha_{m}} for mnm\leq n. Apply 𝖨𝖱𝖳\mathsf{IRT} to nTαn\bigsqcup_{n}T_{\alpha_{n}} to obtain a sequence Rii\langle R_{i}\rangle_{i} of disjoint rays in nTαn\bigsqcup_{n}T_{\alpha_{n}}. As before, Rii\langle R_{i}\rangle_{i} computes a sequence of infinitely many distinct branches PαnP_{\alpha_{n}}, and hence a sequence of infinitely many distinct (αn)\emptyset^{(\alpha_{n})}. Each (αn)\emptyset^{(\alpha_{n})} uniformly computes (αm)\emptyset^{(\alpha_{m})} for mnm\leq n, so we conclude that Rii\langle R_{i}\rangle_{i} computes m(αm)\bigoplus_{m}\emptyset^{(\alpha_{m})} as desired. ∎

It follows that 𝖨𝖱𝖳\mathsf{IRT} is not provable in 𝖠𝖢𝖠0\mathsf{ACA}_{0}, despite the apparent similarity between 𝖨𝖱𝖳\mathsf{IRT} and a compactness result. (Indeed, 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} is not even provable in Δ11\Delta_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} (Theorem 7.9).)

Next, we present essentially the proof of 𝖨𝖱𝖳\mathsf{IRT} attributed to Andreae (see [5, Theorem 8.2.5 and bottom of pg. 275]) and then analyze it with an eye to the axioms which can be used to formalize it. We then use this analysis to complete the proof of Theorem 4.1.

The key combinatorial lemma implicit in Andreae’s proof is:

Lemma 4.3.

Given disjoint rays Rii<n\left\langle R_{i}\right\rangle_{i<n} and disjoint rays Sjj<n2+1\left\langle S_{j}\right\rangle_{j<n^{2}+1} there are n+1n+1 disjoint rays R0,,RnR_{0}^{\prime},\dots,R_{n}^{\prime} such that for each i<ni<n, RiR_{i} and RiR_{i}^{\prime} start at the same vertex.

Before proving Lemma 4.3, let us use it to prove 𝖨𝖱𝖳\mathsf{IRT}.

Proof of 𝖨𝖱𝖳\mathsf{IRT} assuming Lemma 4.3.

Given a graph which has arbitrarily many disjoint rays, we build by recursion on n1n\geq 1 sequences Rini<n\left\langle R_{i}^{n}\right\rangle_{i<n} of disjoint rays with initial segments PinP_{i}^{n} of length nn such that Pin+1P_{i}^{n+1} is PinP_{i}^{n} followed by one more vertex for i<ni<n. The required infinite sequence of disjoint rays will then be given by Ri={Pinn>0}R_{i}=\bigcup\{P_{i}^{n}\mid n>0\}.

Suppose that we have Rini<n\left\langle R_{i}^{n}\right\rangle_{i<n} and Pini<n\left\langle P_{i}^{n}\right\rangle_{i<n}. By assumption, let S0,,S2n2S_{0},\dots,S_{2n^{2}} be a sequence of disjoint rays. Discard all rays SjS_{j} which contain a vertex of some PinP_{i}^{n}. There are at most n2n^{2} many of them, so by discarding and renumbering if necessary we are left with S0,,Sn2S_{0},\dots,S_{n^{2}}.

For each i<ni<n, let xix_{i} denote the first vertex on RinR_{i}^{n} after PinP_{i}^{n}. Apply Lemma 4.3 to xiRini<n\left\langle x_{i}R_{i}^{n}\right\rangle_{i<n} and S0,,Sn2S_{0},\dots,S_{n^{2}}. We obtain n+1n+1 disjoint rays R0,,RnR_{0}^{\prime},\dots,R_{n}^{\prime} such that for each i<ni<n, RiR_{i}^{\prime} begins at vertex xix_{i}. Now let Rin+1=PinxiRiR_{i}^{n+1}=P_{i}^{n}x_{i}R_{i}^{\prime} for i<ni<n and Rnn+1=RnR_{n}^{n+1}=R_{n}^{\prime}. These are disjoint by construction. This completes the inductive step of the construction of the R0n,,Rn1nR_{0}^{n},\dots,R_{n-1}^{n} and so provides the required witnesses for 𝖨𝖱𝖳\mathsf{IRT}. ∎

It remains to prove Lemma 4.3. The key ingredient is Menger’s theorem for finite graphs. If AA and BB are disjoint sets of vertices in a graph, we say that PP is an AA-BB path if PP starts with some vertex in AA and ends with some vertex in BB. A set of vertices SS separates AA and BB if any AA-BB path contains at least one vertex in SS.

Theorem 4.4 (Menger, see [5, Theorem 3.3.1]).

Let GG be a finite graph. If AA and BB are disjoint sets of vertices in GG, then the minimum size of a set of vertices which separate AA and BB is equal to the maximum size of a set of disjoint AA-BB paths.

We now present the proof of Lemma 4.3.

Proof of Lemma 4.3.

Suppose we are given nn disjoint rays R0,,Rn1R_{0},\dots,R_{n-1} and n2+1n^{2}+1 disjoint rays S0,,Sn2S_{0},\dots,S_{n^{2}}. First, define the set

{i,qRi intersects Sq}.\{\langle i,q\rangle\mid R_{i}\text{ intersects }S_{q}\}.

Then we perform the following recursive procedure. At each step, check if there is some i<ni<n such that RiR^{\prime}_{i} has not been defined and RiR_{i} intersects at most nn many rays SqS_{q} which have not been discarded. If there is no such ii, we end the procedure. Otherwise, find the least such ii and do the following:

  1. 1.

    discard all rays SqS_{q} which intersect RiR_{i};

  2. 2.

    define Ri=RiR^{\prime}_{i}=R_{i}.

After the procedure is complete, let II be the set of i<ni<n for which RiR_{i}^{\prime} has not been defined. Let 𝒮\mathcal{S} be the set of rays SqS_{q} which have not been discarded. Let m=|I|m=|I|. We observe that |𝒮|m2+1|\mathcal{S}|\geq m^{2}+1, because

(n2+1)(nm)n=mn+1m2+1.(n^{2}+1)-(n-m)n=mn+1\geq m^{2}+1.

Next, for each iIi\in I, let ziz_{i} be the first vertex on RiR_{i} such that RiziR_{i}z_{i} meets exactly mm many rays in 𝒮\mathcal{S}. (Each ziz_{i} exists by construction of II.)

Observe that the finite set iIRizi\bigcup_{i\in I}R_{i}z_{i} meets at most m2m^{2} many rays in 𝒮\mathcal{S}. Since |𝒮|m2+1|\mathcal{S}|\geq m^{2}+1, we may pick some ray in 𝒮\mathcal{S} which does not meet iIRizi\bigcup_{i\in I}R_{i}z_{i}. We define RnR_{n}^{\prime} to be said ray. Then, discard all rays in 𝒮\mathcal{S} which do not meet iIRizi\bigcup_{i\in I}R_{i}z_{i}.

Finally, we use Menger’s theorem to define RiR_{i}^{\prime} for each iIi\in I. For each iIi\in I, let xix_{i} denote the first vertex of RiR_{i}. For each qq such that SqS_{q} remains in 𝒮\mathcal{S}, let yqy_{q} be the first vertex on SqS_{q} such that yqSqy_{q}S_{q} and iIRizi\bigcup_{i\in I}R_{i}z_{i} are disjoint. Then consider the following finite sets of vertices:

X\displaystyle X ={xiiI}\displaystyle=\{x_{i}\mid i\in I\}
Y\displaystyle Y ={yqSq𝒮}\displaystyle=\{y_{q}\mid S_{q}\in\mathcal{S}\}
H\displaystyle H =iIRiziSq𝒮Sqyq.\displaystyle=\bigcup_{i\in I}R_{i}z_{i}\cup\bigcup_{S_{q}\in\mathcal{S}}S_{q}y_{q}.

We want to apply Menger’s theorem to X,YHX,Y\subseteq H. Towards that end, we claim that XX cannot be separated from YY in HH by fewer than mm vertices.

Suppose that AHA\subseteq H and |A|<m|A|<m. Since |I|=m|I|=m and {RiiI}\{R_{i}\mid i\in I\} is disjoint, there is some iIi\in I such that RiR_{i} does not meet AA. Next, since RiziR_{i}z_{i} meets mm many disjoint rays in 𝒮\mathcal{S}, there is some qq such that Sq𝒮S_{q}\in\mathcal{S} and RiziR_{i}z_{i} meets SqS_{q}, but SqS_{q} does not meet AA. Let zz be any vertex in both RiziR_{i}z_{i} and SqS_{q}. Then RizSqyqR_{i}zS_{q}y_{q} is a path in HH from xix_{i} to yqy_{q} which does not meet AA. This proves our claim.

By Menger’s theorem, there are mm many disjoint XX-YY paths in HH. Then, for each iIi\in I, define RiR_{i}^{\prime} by starting from xix_{i}, then following the XX-YY path given by Menger’s theorem to some yqy_{q}, and finally following SqS_{q}.

We have constructed a collection R0,,RnR^{\prime}_{0},\dots,R^{\prime}_{n} of rays. It is straightforward to check that they are disjoint, and that for each i<ni<n, RiR_{i} and RiR^{\prime}_{i} start at the same vertex. ∎

We now analyze these proofs from a reverse mathematical perspective to show that 𝖨𝖱𝖳\mathsf{IRT} follows from the THA Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} and so also satisfies the first clause of Definition 3.13.

Theorem 4.5.

i) 𝖨𝖱𝖳\mathsf{IRT} and Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} each implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}.

ii) Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} implies 𝖨𝖱𝖳\mathsf{IRT}. Hence for every YY\subseteq\mathbb{N}, HYP(Y)\mathrm{HYP}(Y) satisfies 𝖨𝖱𝖳\mathsf{IRT}.

Proof.

i) The proof of the first part of Theorem 4.2 essentially shows that 𝖨𝖱𝖳\mathsf{IRT} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}. One point to note is that for each nn, 𝖱𝖢𝖠0\mathsf{RCA}_{0} proves there is some ss such that n=sn\emptyset^{\prime}\upharpoonright n=\emptyset_{s}^{\prime}\upharpoonright n (using e.g. [28, II.3.9]). Now note that the same argument also proves the fact (essentially in [15]) that Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0} as it directly supplies the sequence of branches PnP_{n} in TnT_{n}.

ii) The proof of 𝖨𝖱𝖳\mathsf{IRT} presented above is easily seen to be one in 𝖠𝖢𝖠0\mathsf{ACA}_{0}, except for two points. First, each step nn of the induction assumed we had available a sequence S0,,Sn2S_{0},\dots,S_{n^{2}} of disjoint rays in our graph. However, all we know is that for each nn, there is some collection of disjoint rays of size nn. To access this information for each nn in a recursive construction, we require that there is a single sequence RnR_{n} such that the nnth entry of the sequence is a collection of disjoint rays of size nn. Such a sequence can be obtained using the axiom of choice. In this case, since the predicate “there exists nn many disjoint rays” is Σ11\Sigma_{1}^{1}, such a sequence can be obtained using Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}. Therefore we may assume we have a sequence S=Sjnj<nn>0S=\langle\langle S_{j}^{n}\rangle_{j<n}\rangle_{n>0} with each Sjnj<n\langle S_{j}^{n}\rangle_{j<n} a sequence of disjoint rays and the graph GG consisting of all the vertices and edges occurring in any SjnS_{j}^{n}. We begin our construction with R01=S01R_{0}^{1}=S_{0}^{1}.

The second point is the iterated application of Lemma 4.3. The proof of that Lemma can be done in 𝖠𝖢𝖠0\mathsf{ACA}_{0}. (In particular, one can check that Menger’s theorem is provable in 𝖱𝖢𝖠0\mathsf{RCA}_{0} by following the first proof for it given in [5, Theorem 3.3.1].) However, here we need a bit more information as generally we cannot, in 𝖠𝖢𝖠0\mathsf{ACA}_{0}, carry out recursive constructions that increase complexity (even by some fixed number of jumps) at each step. We can only carry out recursions where each step is done computably in some set of the model. So it suffices to show that the constructions for all the instances of Lemma 4.3 needed in its iterations in the proof can be done computably in (GS)(G\oplus S)^{\prime}.

The crucial observation is that, by an induction starting with R01=S01R_{0}^{1}=S_{0}^{1}, we may take each RinR_{i}^{n} to be of the form PQPQ where PP is a finite path in GG and QQ is a tail of one of the SjS_{j}. To see this simply note that each ray RkRkR_{k}^{\prime}\neq R_{k} constructed in the Lemma starts with an initial segment of some RiR_{i}; it continues with a finite path in GG and ends with a tail of some Sj2n2+1S_{j}^{2n^{2}+1}. Thus computably in (GS)(G\oplus S)^{\prime} we can at every stage find one of these descriptions that provides the disjoint rays needed for the next stage of the recursion. Thus the sequences Rini<nn>0\left\langle\left\langle R_{i}^{n}\right\rangle_{i<n}\right\rangle_{n>0} and Pini<nn>0\left\langle\left\langle P_{i}^{n}\right\rangle_{i<n}\right\rangle_{n>0} are computable from (GS)(G\oplus S)^{\prime}. The sequence {Pinn>0}iN\left\langle\bigcup\{P_{i}^{n}\mid n>0\}\right\rangle_{i\in N} is then also computable from (GS)(G\oplus S)^{\prime} and computing the formal presentation of infinitely many disjoint rays then takes only one more jump. This completes the analysis of our proof of 𝖨𝖱𝖳\mathsf{IRT} in Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}. ∎

Theorem 4.1 follows from Theorems 4.2 and 4.5.

We will establish some implications and nonimplications between 𝖨𝖱𝖳\mathsf{IRT} and other THAs in §7.

5 Variants of 𝖨𝖱𝖳\mathsf{IRT} and Hyperarithmetic Analysis

In this section, we show that at least five of the eight principles 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} are THAs:

Theorem 5.1.

All single-ray variants of 𝖨𝖱𝖳\mathsf{IRT} (i.e., 𝖨𝖱𝖳XYS\mathsf{IRT}_{\mathrm{XYS}}) and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} are theorems of hyperarithmetic analysis.

𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} were proved by Halin [11, 12]. 𝖨𝖱𝖳UES\mathsf{IRT}_{\mathrm{UES}} is an exercise in [5, 8.2.5(ii)]. 𝖨𝖱𝖳DVS\mathsf{IRT}_{\mathrm{DVS}} and 𝖨𝖱𝖳DES\mathsf{IRT}_{\mathrm{DES}} may be folklore.

Of the other three variants, 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} and 𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}} are open problems of graph theory ([3] and Bowler, personal communication). We do, however, have interesting results about these principles when restricted to directed forests (Theorem 6.13, Corollary 6.14). The other one, 𝖨𝖱𝖳UED\mathsf{IRT}_{\mathrm{UED}}, was proved by Bowler, Carmesin, Pott [3] using structural results about ends. We hope to analyze its strength in future work.

The proof of Theorem 5.1 consists of several variations of the proof of Theorem 4.1. One of which (𝖨𝖱𝖳DES\mathsf{IRT}_{\mathrm{DES}}) requires some additional ideas.

In order to minimize repetition, we establish some implications between some variants of 𝖨𝖱𝖳\mathsf{IRT} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. The proofs of each of these reductions follow the same basic plan. To deduce 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} from 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}} we provide computable maps gg, hh and kk which, provably in 𝖱𝖢𝖠0\mathsf{RCA}_{0}, take XX-graphs GG to XX^{\prime}-graphs GG^{\prime}, YY-disjoint ZZ-rays or sets of YY-disjoint ZZ-rays in GG to YY^{\prime}-disjoint ZZ^{\prime}-rays or sets of YY^{\prime}-disjoint ZZ^{\prime}-rays in GG^{\prime}, and YY^{\prime}-disjoint ZZ^{\prime}-rays or sets of YY^{\prime}-disjoint ZZ^{\prime}-rays in GG^{\prime} to YY-disjoint ZZ-rays or sets of YY-disjoint ZZ-rays in GG, respectively. These functions are designed to take witnesses of the hypothesis of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} in GG to witnesses of the hypothesis of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}} in GG^{\prime} and witnesses to the conclusion of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}} in GG^{\prime} to witnesses to the conclusion of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} in GG. Clearly it suffices to provide such computable maps to establish the desired reduction in 𝖱𝖢𝖠0\mathsf{RCA}_{0}. (Those familiar with Weihrauch reducibility will recognize that these arguments establish Weihrauch reductions between certain problems corresponding to variants of 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}}.)

Unless otherwise noted all definitions and proofs in this section are in 𝖱𝖢𝖠0\mathsf{RCA}_{0}.

Lemma 5.2.

Given an undirected graph GG, we can uniformly compute a directed graph GG^{\prime} and mappings between Z-rays in GG and Z-rays in GG^{\prime} which preserve Y-disjointness.

Proof.

We define a computable map gg from undirected graphs GG to directed graphs GG^{\prime} as follows. The set of vertices of GG^{\prime} consists of the vertices of GG, together with two new vertices x=x(u,v)x=x(u,v) and y=y(u,v)y=y(u,v) for each edge {u,v}\{u,v\} in GG. The set of edges of GG^{\prime} consists of five edges u,x\langle u,x\rangle, v,x\langle v,x\rangle, x,y\langle x,y\rangle, y,u\langle y,u\rangle, y,v\langle y,v\rangle for each edge {u,v}\{u,v\} in GG.

Next we define a computable map hSh_{\mathrm{S}}: given a ray u0,u1,u_{0},u_{1},\dots in GG, hSh_{\mathrm{S}} maps it to the ray u0,x(u0,u1),y(u0,u1),u1,u_{0},x(u_{0},u_{1}),y(u_{0},u_{1}),u_{1},\dots in GG^{\prime}. Conversely, we define a computable map kSk_{\mathrm{S}} from rays RR^{\prime} in GG^{\prime} into rays RR in GG as follows. Observe that exactly one of the first three vertices in RR^{\prime} is a vertex in GG, because the only outgoing edges from a vertex y(u,v)y(u,v) lead to uu or to vv, and the only outgoing edge from a vertex x(u,v)x(u,v) leads to y(u,v)y(u,v). We take this vertex (say u0u_{0}) to be the first vertex of RR. Every outgoing edge from u0u_{0} leads to some x(u0,v)x(u_{0},v). Combining the above observations, we deduce that the tail u0Ru_{0}R^{\prime} has the form u0,x(u0,u1),y(u0,u1),u1,u_{0},x(u_{0},u_{1}),y(u_{0},u_{1}),u_{1},\dots. Then kSk_{\mathrm{S}} maps RR^{\prime} to the ray R=u0,u1,R=u_{0},u_{1},\dots in GG.

Similarly, given a double ray ,u1,u0,u1,\dots,u_{-1},u_{0},u_{1},\dots in GG, hDh_{\mathrm{D}} maps it to the double ray ,u1,x(u1,u0),y(u1,u0),u0,x(u0,u1),y(u0,u1),u1,\dots,u_{-1},x(u_{-1},u_{0}),y(u_{-1},u_{0}),u_{0},x(u_{0},u_{1}),y(u_{0},u_{1}),u_{1},\dots in GG^{\prime}. We can show that every double ray in GG^{\prime} has this form by considering the incoming edges to each vertex in GG^{\prime}. Therefore we can define a computable map from double rays in GG^{\prime} to double rays in GG by kD=hD1k_{\mathrm{D}}=h_{\mathrm{D}}^{-1}.

It is straightforward to check that hSh_{\mathrm{S}}, kSk_{\mathrm{S}}, hDh_{\mathrm{D}} and kDk_{\mathrm{D}} preserve YY-disjointness. ∎

Therefore we have

Proposition 5.3.

The directed variants of 𝖨𝖱𝖳\mathsf{IRT} imply their corresponding undirected variants, i.e., 𝖨𝖱𝖳DYZ\mathsf{IRT}_{\mathrm{DYZ}} implies 𝖨𝖱𝖳UYZ\mathsf{IRT}_{\mathrm{UYZ}} for each value of YY and ZZ.

Lemma 5.4.

Given a directed graph GG, we can uniformly compute a directed graph GG^{\prime} and mappings between ZZ-rays in GG and ZZ-rays in GG^{\prime} which satisfy the following properties: if two ZZ-rays in GG are vertex-disjoint, then the corresponding ZZ-rays in GG^{\prime} are edge-disjoint, and if two ZZ-rays in GG^{\prime} are edge-disjoint, then the corresponding ZZ-rays in GG are vertex-disjoint.

Proof.

We define a computable map gg from directed graphs G=V,EG=\langle V,E\rangle to directed graphs GG^{\prime} as follows. The set of vertices of GG^{\prime} is {xi,xoxV}\{x_{i},x_{o}\mid x\in V\}, where ii and oo stand for incoming and outgoing respectively. The set of edges of GG^{\prime} consists of uo,vi\langle u_{o},v_{i}\rangle for each u,vE\langle u,v\rangle\in E, and xi,xo\langle x_{i},x_{o}\rangle for each xVx\in V.

Next we define a computable map hSh_{\mathrm{S}}: given a ray x0,x1,x^{0},x^{1},\dots in GG, hSh_{\mathrm{S}} maps it to the ray xi0,xo0,xi1,xo1,x_{i}^{0},x_{o}^{0},x_{i}^{1},x_{o}^{1},\dots in GG^{\prime}. Conversely, we define a computable map kSk_{\mathrm{S}} from rays RR^{\prime} in GG^{\prime} to rays RR in GG as follows. Given RR^{\prime}, the ray RR visits the vertex xx in GG whenever xi,xo\langle x_{i},x_{o}\rangle appears in RR^{\prime}. (For example, we map xi0,xo0,xi1,xo1,x_{i}^{0},x_{o}^{0},x_{i}^{1},x_{o}^{1},\dots to x0,x1,x^{0},x^{1},\dots and we map xo0,xi1,xo1,x_{o}^{0},x_{i}^{1},x_{o}^{1},\dots to x1,x2,x^{1},x^{2},\dots.)

Similarly, hDh_{\mathrm{D}} maps a given double ray ,x1,x0,x1,\dots,x^{-1},x^{0},x^{1},\dots in GG to the double ray

,xi1,xo1,xi0,xo0,xi1,xo1,\dots,x^{-1}_{i},x^{-1}_{o},x^{0}_{i},x^{0}_{o},x^{1}_{i},x^{1}_{o},\dots

in GG^{\prime}. Every double ray in GG^{\prime} has this form, so we may define kD=hD1k_{\mathrm{D}}=h_{\mathrm{D}}^{-1}.

It is straightforward to check that the above mappings have the desired properties. ∎

Therefore we have

Proposition 5.5.

The directed edge-disjoint variants of 𝖨𝖱𝖳\mathsf{IRT} imply their corresponding directed vertex-disjoint variants, i.e., 𝖨𝖱𝖳DEZ\mathsf{IRT}_{\mathrm{DEZ}} implies 𝖨𝖱𝖳DVZ\mathsf{IRT}_{\mathrm{DVZ}} for each value of ZZ.

Lemma 5.6.

Given a directed graph GG, we can uniformly compute a directed graph GG^{\prime} and mappings between sets of YY-disjoint rays in GG and sets of YY-disjoint double rays in GG^{\prime} which preserve cardinality.

Proof.

We define a computable map gg from directed graphs GG to directed graphs GG^{\prime} containing GG as follows. For each vertex xx of GG we add new vertices xnx_{n} for each n<0n<0 and edges x1,x\left\langle x_{-1},x\right\rangle and xn1,xn\langle x_{n-1},x_{n}\rangle for all n<0n<0.

Next we define a computable map hh from sets of YY-disjoint rays in GG to sets of YY-disjoint double rays in GG^{\prime} as follows. Given a set of YY-disjoint rays in GG, we first ensure that each ray begins at a different vertex, by replacing it with a tail if necessary. (This is only relevant if the rays are edge-disjoint rather than vertex-disjoint.) Then for each ray x0,x1,x^{0},x^{1},\dots, we consider the double ray ,x20,x10,x0,x1,\dots,x_{-2}^{0},x_{-1}^{0},x^{0},x^{1},\dots in GG^{\prime}. This yields a set of YY-disjoint double rays in GG^{\prime} of the same cardinality.

Finally we define a computable map kk from sets of YY-disjoint double rays in GG^{\prime} to sets of YY-disjoint rays in GG. Given a double ray ,x1,x0,x1,\dots,x^{-1},x^{0},x^{1},\dots in GG^{\prime}, we search for the least n0n\geq 0 such that xnx^{n} is a vertex in GG. (If none of these vertices were in GG then as there are edges between them they would all have to be of the form yny_{-n} for a single yy in GG. The only edges between these yny_{-n} make them into a copy of the reverse order on NN. This order cannot have any subsequence of order type NN.) Now we map the given double ray to the ray xn,xn+1,x^{n},x^{n+1},\dots in GG. It is straight forward to check that this map induces a cardinality-preserving map kk from sets of YY-disjoint double rays in GG^{\prime} to sets of YY-disjoint rays in GG. ∎

Therefore we have

Proposition 5.7.

The directed double ray variants of 𝖨𝖱𝖳\mathsf{IRT} imply their corresponding directed single ray variants, i.e., 𝖨𝖱𝖳DYD\mathsf{IRT}_{\mathrm{DYD}} implies 𝖨𝖱𝖳DYS\mathsf{IRT}_{\mathrm{DYS}} for each value of YY.

UVSUVDUESUEDDVSDVDDESDEDIΣ11I\Sigma^{1}_{1}
Figure 1: Known implications between variants of 𝖨𝖱𝖳\mathsf{IRT}. All implications are over 𝖱𝖢𝖠0\mathsf{RCA}_{0}, except for the implication from UVD to UVS (Theorem 6.15).

Figure 1 summarizes the known implications between variants of 𝖨𝖱𝖳\mathsf{IRT} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. We will show that 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} implies 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1} (Theorem 6.15).

Remark 5.8.

Bowler, Carmesin, Pott [3, pg. 2 l. 3–7] describe an implication from 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} to 𝖨𝖱𝖳UES\mathsf{IRT}_{\mathrm{UES}} which appears to use Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}. It turns out that the graph-theoretic principle used to carry out the implication does not imply even 𝖠𝖢𝖠0\mathsf{ACA}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0} (and is hence much weaker than Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}), but when combined with 𝖠𝖢𝖠0\mathsf{ACA}_{0}, yields a THA. It and several other principles with the same property (almost theorems/theories of hyperarithmetic analysis) are analyzed in Shore [26].

We return to the goal of proving Theorem 5.1.

Theorem 5.9.

For each choice of XYZ, 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}. Furthermore, every standard model of 𝖱𝖢𝖠0\mathsf{RCA}_{0} and 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} is closed under hyperarithmetic reduction.

Proof.

By Proposition 5.3, it suffices to prove the desired result for the undirected variants of 𝖨𝖱𝖳\mathsf{IRT}. Theorems 4.2 and 4.5(i) together assert the desired result for 𝖨𝖱𝖳\mathsf{IRT}. We describe how to modify the proofs of Theorems 4.2 and 4.5(i) to prove the desired result for the other variants of 𝖨𝖱𝖳\mathsf{IRT}.

Observe that in the aforementioned proofs, we only applied 𝖨𝖱𝖳\mathsf{IRT} to forests such that each of the constituent trees has a unique branch. In such graphs, none of the constituent trees can contain two rays which are edge-disjoint. Hence the aforementioned proofs establish the desired result for 𝖨𝖱𝖳UES\mathsf{IRT}_{\mathrm{UES}} as well.

In order to prove the desired result for 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} and 𝖨𝖱𝖳UED\mathsf{IRT}_{\mathrm{UED}}, we modify the aforementioned proofs as follows. For each tree, we relabel as needed to then add one computable branch of new vertices (other than the root of the tree) such that the new branches are also disjoint. The resulting trees satisfy the following properties:

  • Each tree contains some double ray which is Turing equivalent to the branch on the original tree.

  • No two double rays in the tree can be vertex-disjoint or edge-disjoint.

  • Given any double ray in the tree, we can uniformly compute the branch on the original tree.

It is straightforward to check that the modified proofs establish the desired result for 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} and 𝖨𝖱𝖳UED\mathsf{IRT}_{\mathrm{UED}}. ∎

Henceforth we will not explicitly mention uses of 𝖠𝖢𝖠0\mathsf{ACA}_{0} whenever we are assuming any 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} or Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}.

Next, we show that 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} and 𝖨𝖱𝖳DES\mathsf{IRT}_{\mathrm{DES}} are provable in Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} (Theorems 5.10, 5.15). It then follows from Propositions 5.3 and 5.5 that 𝖨𝖱𝖳UES\mathsf{IRT}_{\mathrm{UES}}, 𝖨𝖱𝖳DVS\mathsf{IRT}_{\mathrm{DVS}} and 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} are also provable in Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}, completing the proof of Theorem 5.1.

Theorem 5.10.

Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} implies 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}}.

Proof.

The mathematical result is due to [11] and is Exercise 42 of Chapter 8 in [5]. Our proof is very similar to that of Theorem 4.5(ii) which follows [5, Theorem 8.2.5(i)] except we need to grow our family in “two directions”.

Our plan is to construct, by recursion on nn, sequences Rini<n\left\langle R_{i}^{n}\right\rangle_{i<n} and Pini<n\left\langle P_{i}^{n}\right\rangle_{i<n} such that the RinR_{i}^{n} are disjoint double rays with subpaths PinP_{i}^{n} of length 2n2n such that, for i<ni<n, Pin+1P_{i}^{n+1} is PinP_{i}^{n} extended by a new vertex at each end. Our required sequence of disjoint double rays is then {PinnN}iN\left\langle\bigcup\left\{P_{i}^{n}\mid n\in N\right\}\right\rangle_{i\in N}.

As we want to reuse the proof of Lemma 4.3, we want to decompose double rays into single rays. To that end we introduce some notation. If RR and RiR_{i} are double rays we let Rf=RR_{f}=R and Ri,f=RiR_{i,f}=R_{i} while RbR_{b} and Ri,bR_{i,b} are RR and RiR_{i}, respectively, but with order reversed. We use dd or dd^{\prime} to stand for one of ff or bb. For single rays RR we use R{}^{\ast}\!R to denote the reverse sequence of vertices. So, for example, if (c,d)(c,d) is an edge in a double ray RR, we have (cRb)dRf=R{}^{\ast}(cR_{b})dR_{f}=R.

We note the changes needed in the proof of Theorem 4.5. Now our sequence S=Sjnj<nn>0S=\langle\langle S_{j}^{n}\rangle_{j<n}\rangle_{n>0} given by Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} consists of double rays and GG is the corresponding graph. Our construction of the desired Rini<n\left\langle R_{i}^{n}\right\rangle_{i<n} and Pini<n\left\langle P_{i}^{n}\right\rangle_{i<n} again proceeds recursively in (GS)(G\oplus S)^{\prime} and at the end uses one more jump to get the required sequence of double rays. We begin our recursion by setting R01=S01R_{0}^{1}=S_{0}^{1} and P01P_{0}^{1}\ as any subpath of length 22. Here we note that by construction every RinR_{i}^{n} will be of the form QaPbRQaPbR where PP is a finite path in GG while QQ and RR are each SfS_{f} or SbS_{b} for SS some SjnS_{j}^{n}. Again, finding each Rin+1i<n\langle R_{i}^{n+1}\rangle_{i<n} and Pin+1i<n\langle P_{i}^{n+1}\rangle_{i<n} in turn is recursive in (GS)(G\oplus S)^{\prime} once we prove in 𝖠𝖢𝖠0\mathsf{ACA}_{0} that it exists given Rini<n\langle R_{i}^{n}\rangle_{i<n} and Pini<n\langle P_{i}^{n}\rangle_{i<n}.

Thus it suffices to prove an analog of Lemma 4.3 in 𝖠𝖢𝖠0\mathsf{ACA}_{0}: Given Rini<n\langle R_{i}^{n}\rangle_{i<n} and Pini<n\left\langle P_{i}^{n}\right\rangle_{i<n} and Siki<k\langle S_{i}^{k}\rangle_{i<k} for k=11n2+1k=11n^{2}+1 we can construct Rin+1i<n+1\langle R_{i}^{n+1}\rangle_{i<n+1} and Pin+1i<n+1\langle P_{i}^{n+1}\rangle_{i<n+1} as required. To simplify the notation we omit the superscripts nn and kk. For i<ni<n, let xi,bx_{i,b} be the first vertex in RiR_{i} before PiP_{i} and xi,fx_{i,f} be the first vertex in RiR_{i} after PiP_{i}. We will arrange for Pin+1P_{i}^{n+1} to be PiP_{i} preceded by xi,bx_{i,b} and followed by xi,fx_{i,f} which will then also be xi,bRin+1xi,fx_{i,b}R_{i}^{n+1}x_{i,f}.

We first discard from {Sjj<11n2+1}\{S_{j}\mid j<11n^{2}+1\} any SjS_{j} sharing a vertex with any PiP_{i}. As there are n(2n+1)3n2n(2n+1)\leq 3n^{2} such vertices we have at least 8n2+18n^{2}+1 many SjS_{j} remaining. We relabel these as SjS_{j} for j<8n2+1j<8n^{2}+1 and choose an edge (cj,b,cj,f)(c_{j,b},c_{j,f}) in each SjS_{j}. We now essentially follow the proof of Lemma 4.3 but for the sets of single rays ={xi,fRi,f,xi,bRi,bi<n}\mathcal{R}=\{x_{i,f}R_{i,f},x_{i,b}R_{i,b}\mid i<n\} and 𝒮={cj,fSj,f,cj,bSj,bj<8n2+1}\mathcal{S}=\{c_{j,f}S_{j,f},c_{j,b}S_{j,b}\mid j<8n^{2}+1\}. Our goal for each xi,dRi,dx_{i,d}R_{i,d}\in\mathcal{R}, i<ni<n and d{b,f}d\in\{b,f\}, is to find a suitable replacement (beginning with the same vertex while maintaining the required disjointness) so that we can assemble the Rin+1R_{i}^{n+1} from them for i<ni<n. We also want an ShS_{h} disjoint from all the double rays Rin+1R^{n+1}_{i} with i<ni<n which will be Rnn+1R_{n}^{n+1}.

We follow the procedure described in the proof of Lemma 4.3 (using 2n2n for nn as \mathcal{R} has 2n2n elements) to define single ray replacements Ri,dR_{\left\langle i,d\right\rangle}^{\prime} for the xi,dRi,dx_{i,d}R_{i,d} from which we will construct the Rin+1R_{i}^{n+1}. When in that procedure we would keep the old ray we do so here as well and let Ri,f=R_{\left\langle i,f\right\rangle}^{\prime}= xi,fRi,fx_{i,f}R_{i,f}. When the procedure is completed we let I={i,di<nI=\{\left\langle i,d\right\rangle\mid i<n, d{b,f}d\in\{b,f\} and we have not defined Ri,d}R_{\left\langle i,d\right\rangle}^{\prime}\} and let m=|I|m=|I|. Clearly we have discarded at most 4n24n^{2} many more of the rays in 𝒮\mathcal{S}.

Now continue the construction for the i,dI\left\langle i,d\right\rangle\in I and the first vertices zi,dz_{i,d} on xi,dRi,dx_{i,d}R_{i,d} such that xi,dRi,dzi,dx_{i,d}R_{i,d}z_{i,d} meets exactly mm many rays in 𝒮\mathcal{S}. The union FF of these finite paths meets at most m2n4n2m\cdot 2n\leq 4n^{2} many rays in 𝒮\mathcal{S} and so at most 4n24n^{2} many double rays SjS_{j}, j<8n2+1j<8n^{2}+1. Thus there must be at least one SjS_{j} such that it does not meet FF and no cj,dSj,dc_{j,d}S_{j,d} was discarded during the process. We let one such be Rnn+1R_{n}^{n+1} and take any subpath of length 2n+22n+2 as Pnn+1P_{n}^{n+1}.

Now discard all single rays SS in 𝒮\mathcal{S} not meeting FF. For each cj,dSj,dc_{j,d}S_{j,d} remaining in 𝒮\mathcal{S} let yj,dy_{j,d} be the first vertex on cj,dSj,dc_{j,d}S_{j,d} such that yj,dSj,dy_{j,d}S_{j,d} is disjoint from FF. As in the proof of Lemma 4.3, we apply Menger’s theorem to X,YX,Y and HH where

X\displaystyle X ={xi,di,dI}\displaystyle=\{x_{i,d}\mid\left\langle i,d\right\rangle\in I\}
Y\displaystyle Y ={yj,dcj,dSj,d𝒮}\displaystyle=\{y_{j,d}\mid c_{j,d}S_{j,d}\in\mathcal{S}\}
H\displaystyle H =Fcj,dSj,d𝒮cj,dSj,dyj,d.\displaystyle=F\cup\bigcup_{c_{j,d}S_{j,d}\in\mathcal{S}}c_{j,d}S_{j,d}y_{j,d}.

This produces a set of mm disjoint paths Pi,dP_{i,d} in HH from each xi,dx_{i,d}, i,dI\left\langle i,d\right\rangle\in I, to a yj,dy_{j,d^{\prime}} in cj,dSj,d𝒮c_{j,d^{\prime}}S_{j,d^{\prime}}\in\mathcal{S}. We can now define Ri,dR_{\left\langle i,d\right\rangle}^{\prime} for i,dI\left\langle i,d\right\rangle\in I as the single ray beginning with Pi,dP_{i,d} and then continuing with Sj,dS_{j,d^{\prime}} after yj,dy_{j,d^{\prime}}.

We can now define the required double rays Rin+1R_{i}^{n+1} for i<ni<n as Ri,bPinRi,f{}^{\ast}\!R_{\left\langle i,b\right\rangle}^{\prime}P_{i}^{n}R_{\left\langle i,f\right\rangle}^{\prime} and check that these have all the desired properties. ∎

As for 𝖨𝖱𝖳DES\mathsf{IRT}_{\mathrm{DES}}, instead of following the proof of Theorem 4.5, we will reduce 𝖨𝖱𝖳DES\mathsf{IRT}_{\mathrm{DES}} to the problem of finding an infinite sequence of vertex-disjoint rays in a certain locally finite graph (see [3, pg. 2 l. 3–7]). To carry out this reduction, we define the line graph:

Definition 5.11.

The line graph L(G)L(G) of an XX-graph GG is the XX-graph whose vertices are the edges of GG and whose edges are the ((u,v),(v,w))((u,v),(v,w)), where (u,v)(u,v) and (v,w)(v,w) are edges in GG.

Lemma 5.12.

Let GG be an X-graph. There is a computable mapping from rays in GG to rays in L(G)L(G) such that if two rays in GG are edge-disjoint, then their images are vertex-disjoint.

Proof.

Map x0,x1,x2,x_{0},x_{1},x_{2},\dots to (x0,x1),(x1,x2),(x_{0},x_{1}),(x_{1},x_{2}),\dots. ∎

However, vertex-disjoint rays in L(G)L(G) do not always yield edge-disjoint rays in GG. An extreme counterexample is what is called the (undirected) star graph which consists of a single vertex with infinitely many neighbors: It does not contain any rays yet its line graph is isomorphic to the complete graph on NN which contains infinitely many vertex-disjoint rays. Nonetheless, if GG is locally finite, then vertex-disjoint rays in L(G)L(G) do correspond to edge-disjoint rays in GG:

Lemma 5.13 (𝖠𝖢𝖠0\mathsf{ACA}_{0}).

Let GG be a locally finite X-graph. There is a mapping from rays in L(G)L(G) to rays in GG such that if two rays in L(G)L(G) are vertex-disjoint, then their images are edge-disjoint rays in GG.

Proof.

Given a ray R=e0,e1,R=e_{0},e_{1},\dots in L(G)L(G), we construct a ray S=y0,y1,S=y_{0},y_{1},\dots in GG by recursion. Say that ei=(ui,vi)e_{i}=(u_{i},v_{i}). Start by defining y0y_{0} to be u0u_{0}. Having defined yny_{n}, we define yn+1y_{n+1} as follows. Let knk_{n} be the largest kk such that yny_{n} is an endpoint of eke_{k}. Such kk exists because GG is locally finite and RR is a ray. We can find knk_{n} by 𝖠𝖢𝖠0\mathsf{ACA}_{0}. Then define yn+1y_{n+1} to be the endpoint of ekne_{k_{n}} other than yny_{n}. This completes the recursion. Note that the knk_{n} are strictly increasing: ekn=(yn,yn+1)e_{k_{n}}=(y_{n},y_{n+1}) as yny_{n} is not an endpoint of ekn+1e_{k_{n}+1} by the maximality of knk_{n}. The next edge then includes yn+1y_{n+1} and so kn+1kn+1k_{n+1}\geq k_{n}+1. Also note that if the graph is directed the eie_{i} must be of the form xi,xi+1\left\langle x_{i},x_{i+1}\right\rangle and the last occurrence of any xx in an eie_{i} must be as its first element. By construction SS is infinite and contains no repeated vertices because of the maximality requirement, hence it is a ray. Observe that every edge in SS is a vertex in RR, so the above mapping maps vertex-disjoint rays in L(G)L(G) to edge-disjoint rays in GG. ∎

It remains to show that we can restrict our attention to locally finite graphs. We accomplish this with the help of Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}. Given a directed graph GG with arbitrarily many edge-disjoint rays, we can use Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} to choose a family Rjkj<kk>0\langle\langle R_{j}^{k}\rangle_{j<k}\rangle_{k>0} where, for each k>0k>0, the rays Rjkj<k\langle R_{j}^{k}\rangle_{j<k} are edge-disjoint. From this family, we may construct an appropriate locally finite subgraph of GG:

Lemma 5.14.

Suppose that GG is an X-graph and there is some family Rjkj<kk>0\langle\langle R_{j}^{k}\rangle_{j<k}\rangle_{k>0} of rays in GG such that for each k>0k>0, the rays Rjkj<k\langle R_{j}^{k}\rangle_{j<k} are edge-disjoint. Then there is some locally finite XX-subgraph GG^{\prime} of GG and some family Sjkj<kk>0\langle\langle S_{j}^{k}\rangle_{j<k}\rangle_{k>0} of rays in GG^{\prime} such that for each k>0k>0, the rays Sjkj<k\langle S_{j}^{k}\rangle_{j<k} are edge-disjoint.

Proof.

Define the vertices of GG^{\prime} to be the vertices of GG, say {viiN}\{v_{i}\mid i\in N\}. We specify the set of edges EE^{\prime} of GG^{\prime} by providing a recursive construction of sets EiE_{i} of edges putting in a set of edges at each step. We guarantee that each EiE_{i} is a union of finitely many finite sets of edge-disjoint rays in GG and that after stage kk no edge with a vertex viv_{i} for i<ki<k as an endpoint is ever put into EE^{\prime}.

Begin at stage 0 by putting all the edges in R01R_{0}^{1} into E1E_{1}. Proceeding recursively at stage k>0k>0 we have EkE_{k} and consider the edge-disjoint rays RjkR_{j}^{k}, j<kj<k. For each j<kj<k, say Rjk=xj,0k,xj,1k,R_{j}^{k}=x_{j,0}^{k},x_{j,1}^{k},\dots. Each viv_{i} for i<ki<k appears at most once in RjkR_{j}^{k} as RjkR_{j}^{k} is a ray. For each j<kj<k, since we have access to the set of vertices of RjkR_{j}^{k}, we can decide whether RjkR_{j}^{k} contains viv_{i} and, if so, find the index nn such that vi=xj,nkv_{i}=x_{j,n}^{k}. Call it ni,jkn_{i,j}^{k}. If there is no such nn, set ni,jk=0n_{i,j}^{k}=0. Define SjkS_{j}^{k} to be the tail of RjkR_{j}^{k} after xj,maxi<kni,jkkx_{j,\max_{i<k}n_{i,j}^{k}}^{k}. We put all the edges in SjkS_{j}^{k}, j<kj<k, into Ek+1E_{k+1}. Let E=kEkE^{\prime}=\bigcup_{k}E_{k}.

It remains to show that GG^{\prime} is locally finite. Consider any vertex vkv_{k}. No edge containing vkv_{k} as an endpoint is put in after stage kk. On the other hand, EkE_{k} is the union of finitely many finite sets of edge-disjoint rays (all of which have been computed uniformly). Each set of edge-disjoint rays in this union has vkv_{k} appearing at most once in each of its rays. Thus at most two edges containing vkv_{k} appear in each of the finitely many rays in this set. Therefore there are only finitely many edges containing vkv_{k} in each of the finite sets of edge-disjoint rays making up EkE_{k}. All in all, only finitely many edges in GG^{\prime} contain vkv_{k}. ∎

We are ready to prove

Theorem 5.15.

Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0} implies 𝖨𝖱𝖳XES\mathsf{IRT}_{\mathrm{XES}} for each value of X.

Proof.

Given an XX-graph GG with arbitrarily many edge-disjoint rays, we can use Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} to choose a family Qjkj<kk>0\langle\langle Q_{j}^{k}\rangle_{j<k}\rangle_{k>0} such that for each kk, the rays QjkQ_{j}^{k}, j<kj<k, are edge-disjoint. By Lemma 5.14, there is a locally finite subgraph HH of GG and a family Rjkj<kk>0\langle\langle R_{j}^{k}\rangle_{j<k}\rangle_{k>0} such that for each kk, the RjkR_{j}^{k}, j<kj<k, are edge-disjoint rays in HH. By Lemma 5.12, there is a family Sjkj<kk>0\langle\langle S_{j}^{k}\rangle_{j<k}\rangle_{k>0} such that for each k>0k>0, the SjkS_{j}^{k}, j<kj<k, are vertex-disjoint rays in L(H)L(H). By the second part of the proof of Theorem 4.5(ii) (which can be carried out in 𝖠𝖢𝖠0\mathsf{ACA}_{0}), L(H)L(H) has infinitely many vertex-disjoint rays. Finally by Lemma 5.13, HH has infinitely many edge-disjoint rays. Hence GG has infinitely many edge-disjoint rays. ∎

By the discussion before Theorem 5.10, this completes the proof of Theorem 5.1.

Finally, we give a proof of 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} for directed forests using Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (recall that 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} remains open). We will see that Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} and 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} for directed forests are equivalent over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma_{1}^{1} but note that Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} does not imply 𝖨Σ11\mathsf{I}\Sigma_{1}^{1}. (See Theorem 6.13 and the comment following it).

Theorem 5.16.

Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} implies 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} for directed forests.

We first prove two lemmas.

Lemma 5.17 (𝖠𝖢𝖠0\mathsf{ACA}_{0}).

Let GG be a directed forest and let R0=x0,ii𝒵R_{0}=\left\langle x_{0,i}\mid i\in\mathcal{Z}\right\rangle, R1=x1,ii𝒵R_{1}=\left\langle x_{1,i}\mid i\in\mathcal{Z}\right\rangle be directed double rays in GG. Suppose R0R_{0} and R1R_{1} have an edge u,v\left\langle u,v\right\rangle in common. Then there are vertices tt and ww such tR0w=tR1wtR_{0}w=tR_{1}w and R0R_{0}, R1R_{1} have no vertices in common outside of those in tR0w=tR1wtR_{0}w=tR_{1}w. Note that we allow for the possibility that t=t=-\infty and/or w=+w=+\infty in the sense that ()R=R=R(+)(-\infty)R=R=R(+\infty) for any double ray RR. We call tR0w=tR1wtR_{0}w=tR_{1}w the intersection of R0R_{0} and R1R_{1}.

Proof.

Suppose R0R_{0}, R1R_{1} provide a counterexample. As they have an edge u,v\left\langle u,v\right\rangle in common, they lie in the same directed tree TT in GG and can be viewed as (undirected) double rays in T^\hat{T} (the underlying graph for TT). As they form a counterexample to the lemma, there must be either a first tR0t\in R_{0} (i.e. earliest in the double ray R0R_{0}) such that tR0v=tR1vtR_{0}v=tR_{1}v or a last wR0w\in R_{0} such that uR0w=uR1wuR_{0}w=uR_{1}w but R0R_{0} and R1R_{1} have a vertex zz in common outside the common interval. The situations are symmetric and we consider the second. The immediate successors xx and yy of ww in R0R_{0} and R1R_{1}, respectively, must be different by our choice of ww. Consider now the location of zz in R0R_{0}. If it is after xx then the paths from ww to zz in R0R_{0} and ww to zz in R1R_{1} (both considered now as undirected graphs within T^\hat{T}) are different as the immediate successor of ww in R0R_{0} is xx while in R1R_{1} it is either yy or a vertex in vR1wvR_{1}w. Thus there are two different paths in T^\hat{T} from ww to zz contradicting T^\hat{T}’s being a tree. If, on the other hand zz is before uu in R0R_{0}, it must be before tR0t\in R_{0} and a similar argument provides different paths from tt to zz in R0R_{0} and R1R_{1}. ∎

Lemma 5.18 (𝖠𝖢𝖠0\mathsf{ACA}_{0}).

There is a computable function ff such that given any sequence Sii<n\left\langle S_{i}\right\rangle_{i<n} of DED rays with subpaths Pii<n\left\langle P_{i}\right\rangle_{i<n} of length 2n2n in a directed tree TT and sequence Rjj<f(n)\left\langle R_{j}\right\rangle_{j<f(n)} of DED rays in TT, we can construct a sequence Siin\left\langle S_{i}^{\prime}\right\rangle_{i\leq n} of DED rays with subpaths Piin\left\langle P_{i}^{\prime}\right\rangle_{i\leq n} of length 2n+22n+2 in TT such that PiP_{i}^{\prime} extends PiP_{i} at each end for i<ni<n. Indeed, we may take f(n)=2n2+22nn!+1f(n)=2n^{2}+2^{2n}n!+1.

Proof.

First we remove all the RjR_{j} that contain an edge in any PiP_{i}\ at cost of at most 2n22n^{2} many jj. Consider any remaining RjR_{j} in the second sequence. By Lemma 5.17, its intersections with the SiS_{i} are intervals Qj,iQ_{j,i} of edges in RjR_{j} which are disjoint as the SiS_{i} are. By our first thinning of the RjR_{j} list, none of the Qj,iQ_{j,i} intersect any of the PiP_{i} so each Qj,iQ_{j,i} must lie entirely above or entirely belowPi\;P_{i}. We associate to each RjR_{j} a label consisting of the set Cj={i<nQj,i}C_{j}=\{i<n\mid Q_{j,i}\neq\emptyset\}; the elements ii of CjC_{j} in the order in which the Qj,iQ_{j,i} (for iCji\in C_{j}) appear in RjR_{j} (in terms of the ordering of 𝒵\mathcal{Z}) along with a ++ or - depending on which side of PiP_{i} it falls in SiS_{i}. We write Qj,isQ_{j,i}^{s} for the starting vertex of Qj,iQ_{j,i} and Qj,ieQ_{j,i}^{e} for the ending one. As above, we allow the values ±\pm\infty for these endpoints if the intervals are infinite. Now there are, of course, at most finitely many such labels. In particular, there are at most 2nn!2n2^{n}n!2^{n} such labels. Thus if we have 22nn!+12^{2n}n!+1 many RjR_{j} left at least two of them, say RaR_{a} and ReR_{e} have the same label say with set CC.
Claim: |C|<2|C|<2.
For the sake of a contradiction, assume we have klk\neq l in CC with kk preceding ll in the ordering of CC in the label. Say RaR_{a} is the ray such that Qa,kQ_{a,k} is before Qe,kQ_{e,k} in SkS_{k}. Note that Qa,kQ_{a,k} and Qe,kQ_{e,k} are edge disjoint as RaR_{a} and ReR_{e} are. We consider two cases: (1) Qa,lQ_{a,l} is before Qe,lQ_{e,l} in SlS_{l} and (2) Qe,lQ_{e,l} is before Qa,lQ_{a,l} in SlS_{l}. We now produce, for each case, two vertices with two distinct sequences (i) and (ii) of adjacent edges in TT connecting these two vertices. These sequences are illustrated in Figures 2 and 3. Note that by our assumptions on the orderings of the intervals Qc,dQ_{c,d} (for c{a,e}c\in\{a,e\} and d{k,l}d\in\{k,l\}) as displayed, all of the starting or ending points of Qc,dQ_{c,d} that appear in our sequences are vertices in one of the rays (i.e. none are ±\pm\infty):

  • (1i)

    Start at Qa,keQ_{a,k}^{e} in RaR_{a} and go to Qa,leQ_{a,l}^{e} then in SlS_{l} go to Qe,lsQ_{e,l}^{s}.

  • (1ii)

    Start at Qa,keQ_{a,k}^{e} in SkS_{k} and go to Qe,ksQ_{e,k}^{s} then go in ReR_{e} to Qe,lsQ_{e,l}^{s}.

  • (2i)

    Start at Qa,ksQ_{a,k}^{s} in SkS_{k} and go to Qe,keQ_{e,k}^{e} then in ReR_{e} go to Qe,leQ_{e,l}^{e} then in SlS_{l} go to Qa,leQ_{a,l}^{e}.

  • (2ii)

    Start at Qa,ksQ_{a,k}^{s} in RaR_{a} and go to Qa,leQ_{a,l}^{e}.

SkS_{k}\vdotsQa,kQ_{a,k}Qe,kQ_{e,k}\vdotsQa,keQ^{e}_{a,k}SlS_{l}\vdotsQa,lQ_{a,l}Qe,lQ_{e,l}\vdotsQe,lsQ^{s}_{e,l}RaR_{a}\vdotsQa,kQ_{a,k}Qa,lQ_{a,l}\vdotsQa,keQ^{e}_{a,k}ReR_{e}\vdotsQe,kQ_{e,k}Qe,lQ_{e,l}\vdotsQe,lsQ^{s}_{e,l}
Figure 2: (1i) follows the thick arrows in RaR_{a} and SlS_{l}. (1ii) follows the thick arrows in SkS_{k} and ReR_{e}.
SkS_{k}\vdotsQa,kQ_{a,k}Qe,kQ_{e,k}\vdotsQa,ksQ^{s}_{a,k}SlS_{l}\vdotsQe,lQ_{e,l}Qa,lQ_{a,l}\vdotsQa,leQ^{e}_{a,l}RaR_{a}\vdotsQa,kQ_{a,k}Qa,lQ_{a,l}\vdotsQa,ksQ^{s}_{a,k}Qa,leQ^{e}_{a,l}ReR_{e}\vdotsQe,kQ_{e,k}Qe,lQ_{e,l}\vdots
Figure 3: (2i) follows the thick arrows in SkS_{k}, ReR_{e} and SlS_{l}. (2ii) follows the thick arrow in RaR_{a}.

To see that the two sequences of vertices are different, note for (1) that (1i) contains an edge in Qa,lQ_{a,l} but (1ii) does not. For (2) note that (2i) contains an edge in Qe,kQ_{e,k} but (2ii) does not. We now, in each case, view the associated two distinct sequences of vertices with the same endpoints in the underlying (undirected) tree T^\hat{T}. The only way one can have such sequences in a tree is for one of the sequences to contain some vertices uvuuvu in order. However, any three successive vertices in any of these sequences lie within one of the RjR_{j} or SiS_{i} or both and so cannot have two instances of the same vertex. The crucial point is that each Qj,iQ_{j,i} is in both RjR_{j} and SiS_{i} and has at least two vertices. Any transition along the sequence between an RjR_{j} and an SiS_{i} (in either order) goes through Qj,iQ_{j,i} and so any three consecutive vertices are all contained in one RjR_{j} or one SiS_{i} (or both).

Knowing now that |C||C| is 0 or 11, we complete the proof of the Lemma. If |C|=0|C|=0, then both RaR_{a} and ReR_{e} are disjoint from all the SiS_{i} and so we may add on either one of them as SnS_{n}^{\prime} with PnP_{n}^{\prime} an arbitrary subpath of length 2n+22n+2 while keeping Si=SiS_{i}=S_{i}^{\prime} and extending the PiP_{i}^{\prime} appropriately for all i<ni<n. Otherwise say C={i}C=\{i\}. Let cc be the one of aa or ee such that Qc,iQ_{c,i} is closer to PiP_{i}. (Remember that they are both on the same side of this interval in SiS_{i} by our fixing the label.) Now replace the tail of SiS_{i} starting with Qc,iQ_{c,i} and going away from PiP_{i} by the tail of RcR_{c} starting with Qc,iQ_{c,i} and going in the same direction. Let this ray be SiS_{i}^{\prime}. Note that it is disjoint from all the SjS_{j}, jij\neq i as it contains only edges that are in SiS_{i} or RcR_{c} neither of which share any edges with such SjS_{j}. It is also disjoint from RdR_{d} where dd is the one of aa, ee which is not cc since all the edges of SiS_{i}^{\prime} are either in RcR_{c} or in SiS_{i} outside of Qd,iQ_{d,i} by our choice of Qc,iQ_{c,i} as closer to PiP_{i}. As RdR_{d} is also disjoint from all the SjS_{j} for jij\neq i by our fixing the label, we may define Sj=SjS_{j}^{{}^{\prime}}=S_{j} and PjP_{j} appropriately for j<nj<n, jij\neq i and Sn=RdS_{n}^{\prime}=R_{d} and choosing PnP_{n}^{\prime} of length 2n+22n+2 arbitrarily so as to get the sequence required in the Lemma. ∎

Lemma 5.18 provides the inductive step for the following proof:

Proof of Theorem 5.16.

Assume we are given a directed forest GG with arbitrarily many DED rays. By Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} we may take a sequence Rk,ii<kkN\langle\langle R_{k,i}\rangle_{i<k}\rangle_{k\in N} such that, for each kk, Rk,ii<k\langle R_{k,i}\rangle_{i<k} is a sequence of kk many DED rays in GG. If there are infinitely many of the trees making up GG each of which contains some Rk,iR_{k,i} then we are done. So we may assume that all of them are in one directed tree TT. We now wish to define Sk,sk<ssN\langle\langle S_{k,s}\rangle_{k<s}\rangle_{s\in N} by recursion such that, for each ss, Sk,sk<s\langle S_{k,s}\rangle_{k<s} is a sequence of ss many DED rays with subpaths Pk,sP_{k,s} of length 2s+12s+1 such that for each sNs\in N, Pk,s+1P_{k,s+1} extends Pk,sP_{k,s}\ at each end so that the limsPk,s\lim_{s}P_{k,s} form an infinite sequence of DED rays in TT as required. Lemma 5.18 provides precisely the required inductive step for the construction since we have the required sequences of DED rays Rf(n),ii<f(n)\left\langle R_{f(n),i}\mid i<f(n)\right\rangle at each step nn of the construction. Once again we just have to note that Lemma 5.18 provides witnesses for the double rays that are composed of finite paths in TT and final segments in one direction or the other of some of the Rk,iR_{k,i} and so they can be found recursively in (TRk,ii<kkN)(T\oplus\langle\langle R_{k,i}\rangle_{i<k}\rangle_{k\in N})^{\prime}. ∎

6 Variations on Maximality

In this section, we consider variants of 𝖨𝖱𝖳\mathsf{IRT} whose solutions are required to be maximal in terms of cardinality or maximal in terms of set inclusion.

6.1 Maximum Cardinality Variants of 𝖨𝖱𝖳\mathsf{IRT}

Definition 6.1.

Let 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} be the statement that every XX-graph GG has a set of YY-disjoint ZZ-rays of maximum cardinality, or more formally, the statement that for every XX-graph GG:

  • there is no ZZ-ray in GG, or

  • there is some n>0n>0 and some RR such that R[i]i<n\langle R^{[i]}\mid i<n\rangle is a sequence of YY-disjoint ZZ-rays in GG, and there is no RR such that R[i]i<n+1\langle R^{[i]}\mid i<n+1\rangle is a sequence of YY-disjoint ZZ-rays in GG, or

  • there is some RR such that R[i]iN\langle R^{[i]}\mid i\in N\rangle is a sequence of YY-disjoint ZZ-rays in GG.

When we talk about the cardinality of a (possibly empty) finite sequence R[i]i<n\langle R^{[i]}\mid i<n\rangle we mean the number nn (which may be 0). Of course a sequence R[i]iN\langle R^{[i]}\mid i\in N\rangle is said to have infinite cardinality.

𝖨𝖱𝖳UVS\mathsf{IRT}^{\ast}_{\mathrm{UVS}} was proved by Halin [11], who also proved the corresponding statement for uncountable graphs.

Remark 6.2.

The notation in Definition 6.1 is inspired by the well known version 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast} of 𝖠𝖢𝖠0\mathsf{ACA}_{0} which, for every AA, asserts the existence of A(n)A^{(n)}, the nnth jump of AA, for every nn:

𝖠𝖢𝖠0:(A)(n)(W)(W[0]=A(i<n)(W[i+1]=W[i])).\mathsf{ACA}_{0}^{\ast}:\quad(\forall A)(\forall n)(\exists W)(W^{[0]}=A\wedge(\forall i<n)(W^{[i+1]}=W^{[i]^{\prime}})).

This asserts (in addition to 𝖠𝖢𝖠0\mathsf{ACA}_{0}) particular instances of 𝖨Σ11\mathsf{I}\Sigma_{1}^{1}. So too (in addition to 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}) do the 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} as we are about to see.

Proposition 6.3.

For each choice of XYZ, 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} implies 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} over 𝖱𝖢𝖠0\mathsf{RCA}_{0} and 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} implies 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1}. Therefore 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} and 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} are equivalent over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1}. In particular, they have the same standard models.

Proof.

The first implication holds because if an XX-graph has arbitrarily many YY-disjoint ZZ-rays, then any sequence of YY-disjoint ZZ-rays in the graph of maximum cardinality must be infinite. To prove the second implication, let GG be an XX-graph and Φ(n)\Phi(n) be the Σ11\Sigma_{1}^{1} formula which says that there is a sequence of length nn of YY-disjoint ZZ-rays in GG. If n(Φ(n)Φ(n+1))\forall n(\Phi(n)\rightarrow\Phi(n+1)), then by 𝖨Σ11\mathsf{I}\Sigma_{1}^{1}, nΦ(n)\forall n\Phi(n) holds and so by 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} there is a sequence RiiN\langle R_{i}\rangle_{i\in N} of YY-disjoint ZZ-rays in GG as required. On the other hand, if there is an nn such that Φ(n)\Phi(n) holds but Φ(n+1)\Phi(n+1) fails, then this nn witnesses 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast}. ∎

It follows from Proposition 6.3 and Theorem 5.1 that

Corollary 6.4.

𝖨𝖱𝖳XYS\mathsf{IRT}_{\mathrm{XYS}}^{\ast} and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}}^{\ast} are theorems of hyperarithmetic analysis.

It follows from Proposition 6.3 and Theorem 5.9 that 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}, so we will not explicitly mention uses of 𝖠𝖢𝖠0\mathsf{ACA}_{0} whenever we are assuming any 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}.

Using Lemmas 5.2, 5.4 and 5.6, we can prove

Proposition 6.5.

𝖨𝖱𝖳DYZ\mathsf{IRT}^{\ast}_{\mathrm{DYZ}} implies 𝖨𝖱𝖳UYZ\mathsf{IRT}^{\ast}_{\mathrm{UYZ}}, 𝖨𝖱𝖳DEZ\mathsf{IRT}^{\ast}_{\mathrm{DEZ}} implies 𝖨𝖱𝖳DVZ\mathsf{IRT}^{\ast}_{\mathrm{DVZ}} and 𝖨𝖱𝖳DYD\mathsf{IRT}^{\ast}_{\mathrm{DYD}} implies 𝖨𝖱𝖳DYS\mathsf{IRT}^{\ast}_{\mathrm{DYS}}.

Next, we show that 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} proves sufficient induction in order to transcend Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0}. This implies that 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} is strictly stronger than 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} for certain choices of XYZXYZ (Corollary 6.10). The connection between Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} and graphs is obtained by viewing the set of solutions of an arithmetic predicate as the set of (projections of) branches on a subtree of N<NN^{<N}. In detail:

Lemma 6.6 ([28, V.5.4]).

If A(X)A(X) is an arithmetic formula, 𝖠𝖢𝖠0\mathsf{ACA}_{0} proves that there is a tree TN<NT\subseteq N^{<N} such that

X(A(X)f(X,f[T]))\displaystyle\forall X(A(X)\leftrightarrow\exists f(\langle X,f\rangle\in[T]))
and X( at most one f)(X,f[T]).\displaystyle\forall X(\exists\text{ at most one }f)(\langle X,f\rangle\in[T])\text{.}

In fact, as the proof in [28, V.5.4] shows, the required functions ff are what are called the minimal Skolem functions and are arithmetically defined uniformly in XX and the formula AA.

The following easy corollary will be useful.

Lemma 6.7.

If A(n,X)A(n,X) is an arithmetic formula, 𝖠𝖢𝖠0\mathsf{ACA}_{0} proves that there is a sequence of subtrees Tnn\langle T_{n}\rangle_{n} of N<NN^{<N} such that for each nNn\in N,

X(A(n,X)f(X,f[Tn]))\displaystyle\forall X(A(n,X)\leftrightarrow\exists f(\langle X,f\rangle\in[T_{n}]))
and X( at most one f)(X,f[Tn]).\displaystyle\forall X(\exists\text{ at most one }f)(\langle X,f\rangle\in[T_{n}]).
Proof.

Say that B(Y)B(Y) holds if and only if A(Y(0),X)A(Y(0),X) holds, where XX is such that Y=Y(0)XY=Y(0)\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}X. Apply Lemma 6.6 to the arithmetic formula B(Y)B(Y) to obtain a tree TN<NT\subseteq N^{<N}. For each nNn\in N, define TnT_{n} to be the set of all σ\sigma such that nσTn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}\sigma\in T. It is straightforward to check that Tnn\langle T_{n}\rangle_{n} satisfies the desired properties. ∎

Theorem 6.8.

𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} proves 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast}.

Before proving the above theorem, we derive some corollaries:

Corollary 6.9.

𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} proves the consistency of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}. Therefore it is not provable in Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}.

Proof.

Simpson [28, IX.4.6] proves that 𝖠𝖢𝖠0+𝖨Σ11\mathsf{ACA}_{0}+\mathsf{I}\Sigma^{1}_{1} implies the consistency of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}. The only use of 𝖨Σ11\mathsf{I}\Sigma^{1}_{1} in Simpson’s proof is to establish 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast}, so Simpson’s proof shows that 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast} implies the consistency of Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}. The desired result then follows from Theorem 6.8 and Gödel’s second incompleteness theorem. ∎

Corollary 6.10.

𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} is strictly stronger than 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} for the following choices of XYZ: XYS and UVD.

Proof.

We showed in §5 that the specified variants of 𝖨𝖱𝖳\mathsf{IRT} are provable in Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}. On the other hand, none of the 𝖨𝖱𝖳\mathsf{IRT}^{\ast} are provable in Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} (Corollary 6.9). ∎

We now prove Theorem 6.8:

Proof that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast}.

By Proposition 6.5, it suffices to prove the desired result for 𝖨𝖱𝖳UYZ\mathsf{IRT}_{\mathrm{UYZ}}^{\ast}. To prove 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast} from 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}}^{\ast}, begin by using Lemma 6.7 to define for each AA a sequence of trees Tnn\langle T_{n}\rangle_{n} such that for each nn and WW there is at most one ff such that W,f[Tn]\langle W,f\rangle\in[T_{n}] and

(f)(W,f[Tn])\displaystyle(\exists f)(\langle W,f\rangle\in[T_{n}])
\displaystyle\leftrightarrow\; W[0]=A(in)((W[i])=W[i+1])(i>n)(W[i]=).\displaystyle W^{[0]}=A\;\land\;(\forall i\leq n)((W^{[i]})^{\prime}=W^{[i+1]})\;\land\;(\forall i>n)(W^{[i]}=\emptyset).

We want to show that each TnT_{n} is ill-founded. Note that if m<nm<n and TnT_{n} is ill-founded, then so is TmT_{m}. Therefore it suffices to show that for cofinally many nn, TnT_{n} is ill-founded.

Apply 𝖨𝖱𝖳UYS\mathsf{IRT}^{\ast}_{\mathrm{UYS}} to the disjoint union nTn\bigsqcup_{n}T_{n} to obtain a collection CC of YY-disjoint rays of maximum cardinality. We prove that CC is infinite. Suppose not. Then there is some maximum mm such that CC contains a ray in TmT_{m}. A ray in TmT_{m} can be computably truncated or extended to a branch on TmT_{m}, so TmT_{m} is ill-founded. Hence Tm+1T_{m+1} is ill-founded as well (by 𝖠𝖢𝖠0\mathsf{ACA}_{0}). But then there is a collection of YY-disjoint rays in nTn\bigsqcup_{n}T_{n} which has cardinality greater than that of CC, contradiction.

We have proved that CC is infinite. Next we prove that each TnT_{n} has at most one branch. That would imply that each TnT_{n} contains at most one ray in CC, so CC contains rays in cofinally many TnT_{n}, as desired.

If TnT_{n} has two distinct branches W0,f0\langle W_{0},f_{0}\rangle and W1,f1\langle W_{1},f_{1}\rangle, then W0W1W_{0}\neq W_{1} by the “at most one” condition in the definition of the TnT_{n}. Consider the least ii such that W0[i]W1[i]W_{0}^{[i]}\neq W_{1}^{[i]}. Such ii exists by 𝖠𝖢𝖠0\mathsf{ACA}_{0}. Note that 0<in0<i\leq n because W0[0]=A=W1[0]W_{0}^{[0]}=A=W_{1}^{[0]} and W0[i]==W1[i]W_{0}^{[i]}=\emptyset=W_{1}^{[i]} for i>ni>n. But then W0[i1]=W1[i1]W_{0}^{[i-1]}=W_{1}^{[i-1]} and (W0[i1])(W1[i1])(W_{0}^{[i-1]})^{\prime}\neq(W_{1}^{[i-1]})^{\prime}, contradiction.

This proves that 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}}^{\ast} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast}. In order to prove that 𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}}^{\ast} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}^{\ast}, we modify the above proof by adding to each TnT_{n} a computable branch consisting of new vertices to form a tree SnS_{n}. Apply 𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}}^{\ast} to nSn\bigsqcup_{n}S_{n} to obtain a collection CC of YY-disjoint double rays of maximum cardinality. Following the above proof, we may prove that CC is infinite and each SnS_{n} contains at most one double ray in CC. So CC contains double rays in cofinally many SnS_{n}, as desired. ∎

Remark 6.11.

The same proof shows that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} implies the following induction scheme: Suppose Tnn\langle T_{n}\rangle_{n} is a sequence of trees such that

  1. 1.

    T0T_{0} has a unique branch;

  2. 2.

    for all nn, the number of branches on Tn+1T_{n+1} is the same as the number of branches on TnT_{n}.

Then for all nn, there is a sequence Pmm<n\langle P_{m}\rangle_{m<n} such that for each m<nm<n, PmP_{m} is the unique branch on TmT_{m}. It also shows that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} implies 𝖠𝖢𝖠0+\mathsf{ACA}_{0}^{+} (i.e. closure under the ω\omega-jump) and much more. Indeed, similar ideas prove in Theorem 7.3 that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} implies unique-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Definition 7.1). 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} also implies a similar induction scheme analogous to finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (Definition 7.2).

We can prove that even fragments of 𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}}^{\ast} give more induction than the specific instances derived in Theorem 6.8.

Theorem 6.12.

𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}}^{\ast} (even for directed forests) implies 𝖨Σ11\mathsf{I}\Sigma_{1}^{1} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}.

Proof.

Suppose Ψ(n)\Psi(n) is a Σ11\Sigma_{1}^{1} formula such that Ψ(0)\Psi(0) and n(Ψ(n)Ψ(n+1))\forall n(\Psi(n)\rightarrow\Psi(n+1)) hold. Let SiiN\langle S_{i}\rangle_{i\in N} be a sequence of subtrees of N<NN^{<N} such that SiS_{i} is ill-founded if and only if Ψ(i)\Psi(i) holds. Let TnnN\langle T_{n}\rangle_{n\in N} be a sequence of subtrees of N<NN^{<N} such that for each nn, TnT_{n} consists of all sequences σi:in\langle\sigma_{i}:i\leq n\rangle where for each ini\leq n, σi\sigma_{i} is (a code for) a string in SiS_{i}. We order these sequences by component-wise extension. It is clear that TnT_{n} is ill-founded if and only if (in)Ψ(i)(\forall i\leq n)\Psi(i) holds.

For each nn, orient each edge in TnT_{n} towards its root and add a computable DD-ray of new vertices which starts at its root. This forms a directed tree GnG_{n}. If GnG_{n} contains a double ray, TnT_{n} is ill-founded and (in)Ψ(i)(\forall i\leq n)\Psi(i) holds. Furthermore, no two disjoint double rays can lie in the same GnG_{n}.

Let GG be the directed forest nGn\bigsqcup_{n}G_{n}. By 𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}}^{\ast}, there is a sequence Rii\langle R_{i}\rangle_{i} of disjoint double rays in GG of maximum cardinality, so the sequence may be for i<ki<k for some kk or iNi\in N. Since Ψ(0)\Psi(0) holds, Rii\langle R_{i}\rangle_{i} is nonempty. If Rii\langle R_{i}\rangle_{i} is finite, let nn be maximal such that GnG_{n} contains some RiR_{i}. Then Ψ(n)\Psi(n) holds, so Ψ(n+1)\Psi(n+1) holds as well. It follows that Gn+1G_{n+1} contains some double ray, which we can then add to Rii\langle R_{i}\rangle_{i} to obtain a larger sequence of disjoint double rays in GG for the desired contradiction. Therefore Rii\langle R_{i}\rangle_{i} is infinite. Since each GnG_{n} contains at most one RiR_{i}, infinitely many GnG_{n} contain some RiR_{i}. Therefore Ψ(n)\Psi(n) holds for all nn. ∎

In fact, we have the following equivalences:

Theorem 6.13.

The following are equivalent (over 𝖱𝖢𝖠0\mathsf{RCA}_{0}):

  1. 1.

    Σ11-𝖠𝖢0+𝖨Σ11\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}+\mathsf{I}\Sigma^{1}_{1};

  2. 2.

    𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} for directed forests+𝖨Σ11\;+\;\mathsf{I}\Sigma^{1}_{1};

  3. 3.

    𝖨𝖱𝖳DED\mathsf{IRT}^{\ast}_{\mathrm{DED}} for directed forests;

  4. 4.

    𝖨𝖱𝖳DVD\mathsf{IRT}^{\ast}_{\mathrm{DVD}} for directed forests;

  5. 5.

    𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}} for directed forests+𝖨Σ11\;+\;\mathsf{I}\Sigma^{1}_{1}.

Proof.

(1) \to (2) follows from Theorem 5.16. (2) \to (3) follows from the proof of Proposition 6.3. (3) \to (4) follows from the observation that the mapping of graphs defined in Lemma 5.4 sends a directed forest to a directed forest. (4) \to (5) follows from Theorem 6.12 and the proof of Proposition 6.3.

To prove (5) \rightarrow (1), suppose A(n,X)A(n,X) is an arithmetic formula such that nXA(n,X)\forall n\exists XA(n,X). By Lemma 6.7, there is a sequence Tnn\langle T_{n}\rangle_{n} of subtrees of N<NN^{<N} such that

nX(A(n,X)f(X,f[Tn]))\displaystyle\forall n\forall X(A(n,X)\leftrightarrow\exists f(\langle X,f\rangle\in[T_{n}]))
and X( at most one f)(X,f[Tn]).\displaystyle\forall X(\exists\text{ at most one }f)(\langle X,f\rangle\in[T_{n}]).

By assumption on A(n,X)A(n,X), each TnT_{n} is ill-founded. We use Tnn\langle T_{n}\rangle_{n} to construct a sequence Gnn\langle G_{n}\rangle_{n} of directed trees as we did in the proof of Theorem 6.12 to construct TnT_{n} from the SiS_{i}.

By IΣ11I\Sigma_{1}^{1}, the directed forest nGn\bigsqcup_{n}G_{n} contains arbitrarily many disjoint double rays. Therefore nGn\bigsqcup_{n}G_{n} contains infinitely many disjoint double rays Rkk\langle R_{k}\rangle_{k}, by 𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}}. Note that any double ray in any GnG_{n} must contain the computable ray we added, so any two double rays in the same GnG_{n} must intersect. This implies that each RkR_{k} belongs to some distinct GnG_{n}. Therefore for every mm, there is some kk and some n>mn>m such that RkR_{k} is a double ray in GnG_{n}. When we remove the added computable ray from RkR_{k} we are left with a branch in TnT_{n} which is of the form X,f\langle X,f\rangle where XX consists of witnesses XiX_{i} for i<ni<n. ∎

Since Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} (𝖠𝖳𝖱0\mathsf{ATR}_{0}, even) does not prove 𝖨Σ11\mathsf{I}\Sigma^{1}_{1} [28, IX.4.7], it follows that

Corollary 6.14.

𝖨𝖱𝖳DYD\mathsf{IRT}^{\ast}_{\mathrm{DYD}} (even for directed forests) is not provable in 𝖠𝖳𝖱0\mathsf{ATR}_{0}, and strictly implies Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}.

Next, we show that 𝖨𝖱𝖳UVD\mathsf{IRT}^{\ast}_{\mathrm{UVD}} implies 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} over 𝖱𝖢𝖠0\mathsf{RCA}_{0} (see Figure 1).

Theorem 6.15.

𝖨𝖱𝖳UVD\mathsf{IRT}^{\ast}_{\mathrm{UVD}} implies 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. Therefore (1) 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} implies 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1}; (2) if any standard model of 𝖱𝖢𝖠0\mathsf{RCA}_{0} satisfies 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}}, then it satisfies 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}} as well.

Proof.

Let GG be a graph which contains arbitrarily many disjoint single rays. By 𝖨𝖱𝖳UVD\mathsf{IRT}^{\ast}_{\mathrm{UVD}}, there is a sequence of disjoint double rays in GG of maximum cardinality. If this sequence is infinite, then there are infinitely many disjoint single rays in GG as desired. Otherwise, suppose that Rii<j\langle R_{i}\rangle_{i<j} is a sequence of disjoint double rays in GG of maximum cardinality jj. Let \mathcal{R} be the subgraph of GG consisting of the union of all RiR_{i}. Let HH be the induced subgraph of GG consisting of all vertices which do not lie in \mathcal{R}. Note that HH does not contain any double ray, otherwise GG would contain j+1j+1 many disjoint double rays. Next, we expand HH to the graph HH^{\prime}, defined below.

Decompose HH into its connected components Hii\langle H_{i}\rangle_{i} (there may only be finitely many). Any two single rays in the same HiH_{i} must intersect, because if S0S_{0} and S1S_{1} are disjoint single rays in the same HiH_{i}, then we can construct a double ray in HiH_{i} by connecting them (start with a path between S0S_{0} and S1S_{1} of minimum length, then connect it to the tails of S0S_{0} and S1S_{1} which begin at the endpoints of the path).

For each ii, define HiH_{i}^{\prime} by adding a computable ray of new vertices to HiH_{i}, which begins at the <N<_{N}-least vertex in HiH_{i}. Define HH^{\prime} to be the disjoint union iHi\bigsqcup_{i}H_{i}^{\prime}.

By 𝖨𝖱𝖳UVD\mathsf{IRT}^{\ast}_{\mathrm{UVD}}, there is a sequence of disjoint double rays in HH^{\prime} of maximum cardinality.

Case 1. If this sequence is infinite, then GG contains infinitely many disjoint single rays because each double ray in the sequence has a tail which lies in HH. In this case we are done.

Case 2. Otherwise, HH^{\prime} does not contain arbitrarily many disjoint double rays. Since any two single rays in the same HiH_{i} must intersect, we can transform any collection of disjoint single rays in HH into a collection of disjoint double rays in HH^{\prime} of equal cardinality by connecting each single ray to the <N<_{N}-least vertex in its connected component HiH_{i} and then following the computable ray we added. It follows that HH does not contain arbitrarily many disjoint single rays. Fix ll such that HH does not contain l+1l+1 many disjoint single rays.

Towards a contradiction, we construct a collection of (j+1)(j+1)-many disjoint double rays in GG as follows. Fix a collection 𝒮\mathcal{S} of l+2j+4j(j+1)l+2j+4j(j+1) many disjoint single rays in GG. First, at most 2j2j of these single rays lie in \mathcal{R}. In fact at most 2j2j of these single rays can have finite intersection with HH, because given a collection of disjoint single rays each of which has finite intersection with HH, we can obtain a collection of disjoint single rays in \mathcal{R} of the same cardinality by replacing each ray with an appropriate tail. Second, by reasoning analogous to the above, at most ll of these single rays can have finite intersection with \mathcal{R}. Therefore, there are at least 4j(j+1)4j(j+1) many disjoint single rays in 𝒮\mathcal{S} each of which have infinite intersection with both \mathcal{R} and HH.

Next, choose an edge (ui,vi)(u_{i},v_{i}) in each RiR_{i} and split RiR_{i} into two single rays uiRi,bu_{i}R_{i,b} and viRi,fv_{i}R_{i,f}. By the pigeonhole principle, there is some single ray RR of the form uiRi,bu_{i}R_{i,b} or viRi,fv_{i}R_{i,f}, and at least 2(j+1)2(j+1) many disjoint single rays in 𝒮\mathcal{S}, each of which have infinite intersection with both RR and HH. Call these rays S0,S1,,S2(j+1)1S_{0},S_{1},\dots,S_{2(j+1)-1}. Discard all the other rays in 𝒮\mathcal{S}. Below we describe how to connect pairs of single rays SkS_{k} using segments of RR to form a collection of (j+1)(j+1)-many disjoint double rays in GG.

Let x0,x1,x_{0},x_{1},\dots denote the vertices of RR. Since each single ray SkS_{k} has infinite intersection with RR, by the pigeonhole principle, there is a pair of disjoint rays Sk0S_{k_{0}} and Sl0S_{l_{0}} such that for each tail RR^{\prime} of RR, there is a vertex in Sk0RS_{k_{0}}\cap R^{\prime} and a vertex in Sl0RS_{l_{0}}\cap R^{\prime} such that no SkS_{k} intersects RR between these two vertices. (Formally, we justify this by defining the following coloring recursively. Start from the first vertex in RR which is also in some SkS_{k}. Search for the next vertex on RR which intersects some SlS_{l}, lkl\neq k. Then we color 0 with the unordered pair {k,l}\{k,l\}. Then we search for the next vertex on RR which intersects some SmS_{m}, mlm\neq l and color 11 with {l,m}\{l,m\}, and so on. Some color {k0,l0}\{k_{0},l_{0}\} must appear infinitely often.) Then we commit to connecting Sk0S_{k_{0}} and Sl0S_{l_{0}} (but we do not do so just yet). Applying the pigeonhole principle again, there is a pair of disjoint rays Sk1S_{k_{1}} and Sl1S_{l_{1}} (with k1,l1,k0,l0k_{1},l_{1},k_{0},l_{0} all distinct) such that for each tail RR^{\prime} of RR, there is a vertex xx in Sk1RS_{k_{1}}\cap R^{\prime} and a first vertex yy in Sl1RS_{l_{1}}\cap R^{\prime} (after xx in RR) such that no SkS_{k}, except perhaps Sk1S_{k_{1}}, Sk0S_{k_{0}} or Sl0S_{l_{0}}, intersects RR between these two vertices. We may eliminate any elements of Sk1S_{k_{1}} by changing xx (if necessary) to the last element of RyRy in Sk1S_{k_{1}}. Again we commit to connecting Sk1S_{k_{1}} and Sl1S_{l_{1}}. Repeat this process until we have obtained j+1j+1 pairs of single rays. That is, when we have SkiS_{k_{i}} and SliS_{l_{i}} for an i<ji<j, we find Ski+1S_{k_{i+1}} and Sli+1S_{l_{i+1}} with ki+1k_{i+1} and li+1l_{i+1} distinct from all previous kmk_{m} and lml_{m} such that for each tail RR^{\prime} of RR there is a vertex xSki+1Rx\in S_{k_{i+1}}\cap R and a first ySli+1Ry\in S_{l_{i+1}}\cap R after xx in RR such that no SkS_{k}, except perhaps SkmS_{k_{m}} or SlmS_{l_{m}} for mim\leq i, intersects RR between these two vertices. This process stops when we define kjk_{j} and ljl_{j}.

Finally, we connect these pairs of single rays in the opposite order in which we defined them: Start by picking some xjSkjRx^{j}\in S_{k_{j}}\cap R and some yjSljRy^{j}\in S_{l_{j}}\cap R. Then we define a double ray DjD_{j} by following Skj{}^{\ast}\!S_{k_{j}} until xjx^{j}, then following RR until yjy^{j}, and finally following SljS_{l_{j}}, i.e., Dj:=(xjSkj)RyjSljD_{j}:=~{}^{\ast}\!(x^{j}S_{k_{j}})Ry^{j}S_{l_{j}}. Having defined Dj,Dj1,,Di+1D_{j},D_{j-1},\dots,D_{i+1}, define Di:=(xiSki)RyiSliD_{i}:=~{}^{\ast}\!(x^{i}S_{k_{i}})Ry^{i}S_{l_{i}}, where xiSkiRx^{i}\in S_{k_{i}}\cap R and yiSliRy^{i}\in S_{l_{i}}\cap R are chosen as follows: Consider a tail RR^{\prime} of RR such that the union of xjRyj,,xi+1Ryi+1x^{j}Ry^{j},\dots,x^{i+1}Ry^{i+1} is disjoint from (1) RR^{\prime}; (2) xSki,xS_{k_{i,}} for each xSkiRx\in S_{k_{i}}\cap R^{\prime}; (3) ySliyS_{l_{i}} for each ySliRy\in S_{l_{i}}\cap R^{\prime}. By choice of kik_{i} and lil_{i}, there are vertices xiSkiRx^{i}\in S_{k_{i}}\cap R^{\prime} and yiSliRy^{i}\in S_{l_{i}}\cap R^{\prime} such that none of Skj,,Ski+1S_{k_{j}},\dots,S_{k_{i+1}} or Slj,,Sli+1S_{l_{j}},\dots,S_{l_{i+1}} intersect xiRyix^{i}Ry^{i}.

It is straightforward to check that each of Dj,Dj1,,Di+1D_{j},D_{j-1},\dots,D_{i+1} is disjoint from DiD_{i}. This process yields disjoint double rays Dj,Dj1,,D0D_{j},D_{j-1},\dots,D_{0} in GG, contradicting the maximality of jj. ∎

Using some of the ideas in the previous proof, we can prove

Theorem 6.16.

𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}}^{\ast} for forests implies 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}}^{\ast} for forests over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. Therefore 𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}} for forests implies 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}} for forests over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma_{1}^{1}.

This result will be used in the proofs of Theorems 7.3, 7.7 and 7.10.

Proof.

Let GG be a forest. If GG happens to have arbitrarily many disjoint double rays, then by 𝖨𝖱𝖳UYD\mathsf{IRT}^{\ast}_{\mathrm{UYD}}, GG has infinitely many disjoint double rays. Therefore there is an infinite sequence of disjoint single rays in GG. Such a sequence has maximum cardinality, so we are done in this case.

Suppose GG does not have arbitrarily many disjoint double rays. By 𝖨𝖱𝖳UYD\mathsf{IRT}^{\ast}_{\mathrm{UYD}} for forests, there is a sequence Rii<j\langle R_{i}\rangle_{i<j} of disjoint double rays in GG of maximum cardinality. Following the proof of Theorem 6.15, define the forests \mathcal{R}, HH, and HH^{\prime}. There, we proved that no two single rays in the same connected component HiH_{i} of HH can be disjoint.

By 𝖨𝖱𝖳UYD\mathsf{IRT}^{\ast}_{\mathrm{UYD}} for forests, there is a sequence of disjoint double rays in HH^{\prime} of maximum cardinality. If this sequence is infinite, then there is an infinite sequence of disjoint single rays in HH because each double ray in the sequence has a tail which lies in HH. This is a sequence of disjoint single rays of maximum cardinality in GG, so we are done in this case.

Otherwise, suppose Skk<l\langle S_{k}\rangle_{k<l} is a disjoint sequence of double rays in HH^{\prime} of maximum cardinality. Consider the following disjoint sequence of single rays in GG. First, for each k<lk<l, consider the single ray formed by intersecting HH and the double ray SkS_{k}. Second, for each i<ji<j, we can split the double ray RiR_{i} into a pair of disjoint single rays in GG. This yields a finite sequence Qmm<n\langle Q_{m}\rangle_{m<n} of disjoint single rays in GG.

We claim that Qmm<n\langle Q_{m}\rangle_{m<n} is a sequence of disjoint single rays in GG of maximum cardinality. Suppose there is a larger sequence of disjoint single rays in GG. Since GG is a forest, any two single rays in GG which share infinitely many edges or vertices must share a tail. Therefore there is a single ray QQ in this larger sequence which only shares finitely many edges and vertices with each QmQ_{m}. Then some tail of QQ, say xQxQ, is vertex-disjoint from each QmQ_{m}. In particular, xQxQ is vertex-disjoint from each RiR_{i}, i.e. xQxQ lies in HH. Extend xQxQ to a double ray in HH^{\prime} by first connecting xx to the <N<_{N}-least vertex in its connected component HiH_{i}, then following the computable ray which we added. The resulting double ray is disjoint from every SkS_{k}, because no SkS_{k} can lie in the same HiH_{i}^{\prime} as xQxQ (for xQxQ is vertex-disjoint from SkHS_{k}\cap H by construction). This contradicts the maximality of ll. ∎

6.2 Maximal Variants of 𝖨𝖱𝖳\mathsf{IRT}

Instead of sets of disjoint rays of maximum cardinality, we could consider sets of disjoint rays which are maximal with respect to set inclusion. For uncountable graphs, Halin [11] observed that any uncountable maximal set of disjoint rays is in fact of maximum cardinality (because rays are countable). This suggests another variant of 𝖨𝖱𝖳\mathsf{IRT}, which we call maximal 𝖨𝖱𝖳\mathsf{IRT}:

Definition 6.17.

Let 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} be the statement that every XX-graph GG has a (possibly finite) sequence (Ri)i(R_{i})_{i} of YY-disjoint ZZ-rays which is maximal, i.e., for any ZZ-ray RR in GG, there is some ii such that RR and RiR_{i} are not YY-disjoint.

𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} immediately follows from Zorn’s Lemma. It is straightforward to show that 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} implies Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} (see the proof of Theorem 6.18 below), hence 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} is much stronger than 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} or even 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast}. We show below that 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} is equivalent to Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}. This situation is reminiscent of König’s duality theorem for countable graphs. Aharoni, Magidor, Shore [2] proved that the theorem implies 𝖠𝖳𝖱0\mathsf{ATR}_{0} and that Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} suffices to prove the required existence of a König cover. Simpson [27] later proved that 𝖠𝖳𝖱0\mathsf{ATR}_{0} actually suffices. The covers produced in [2], and indeed in all then known proofs of this duality theorem actually had various maximality properties. Aharoni, Magidor, Shore proved that the existence of covers with any of a variety of maximality properties actually implies Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}.

Theorem 6.18.

Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} is equivalent to 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}}.

Proof that 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} implies Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}.

We first prove that 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0} by adapting the proof of Theorem 5.9: If we apply 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} instead of 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} to any of the forests constructed in that proof, we obtain a sequence containing a ZZ-ray in each tree which constitutes the forest. This is more than sufficient for carrying out the remainder of the proof of Theorem 5.9.

To prove that 𝖬𝖨𝖱𝖳UVS\mathsf{MIRT}_{\mathrm{UVS}} implies Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}, suppose we are given a set AA. Consider the disjoint union of all AA-computable trees (this exists, by 𝖠𝖢𝖠0\mathsf{ACA}_{0}). Any maximal sequence of YY-disjoint rays in this forest must contain a ray in each ill-founded AA-computable tree. Hence its jump computes the hyperjump TAT^{A}. This shows that 𝖬𝖨𝖱𝖳UYS\mathsf{MIRT}_{\mathrm{UYS}} implies Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}. To prove that the other 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}} imply Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}, it suffices to exhibit a computable procedure which takes trees TN<NT\subseteq N^{<N} to XX-graphs TT^{\prime} such that TT is ill-founded if and only if TT^{\prime} contains a ZZ-ray. For 𝖬𝖨𝖱𝖳UYD\mathsf{MIRT}_{\mathrm{UYD}}, it suffices to modify each tree by adding a computable branch which is not already on the tree (as we did in the proof of Theorem 5.9). For 𝖬𝖨𝖱𝖳DYZ\mathsf{MIRT}_{\mathrm{DYZ}}, it suffices to orient each of the graphs we constructed above in the obvious way. ∎

Proof that Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} implies 𝖬𝖨𝖱𝖳XYZ\mathsf{MIRT}_{\mathrm{XYZ}}.

First, we give a mathematical proof for 𝖬𝖨𝖱𝖳XVZ\mathsf{MIRT}_{\mathrm{XVZ}} that is a direct construction not relying on Zorn’s Lemma or the like. We will then explain how to modify it to apply to the other cases and then how to get it to work in Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}.

Suppose we are given an XX-graph GG whose vertices are elements of NN. We build a sequence of disjoint ZZ-rays in GG by recursion. If there are none we are done. Otherwise start with R0R_{0} as any ZZ-ray in GG. Suppose at stage nn we have constructed disjoint ZZ-rays Rii<m\langle R_{i}\rangle_{i<m} for some mnm\leq n such that for each ii, RiR_{i} begins at xix_{i}. If there is a ZZ-ray beginning at nn which is disjoint from the RiR_{i} for i<mi<m, choose one as Rm+1R_{m+1}, if not move on to stage n+1n+1. This construction produces a (possibly finite) sequence Ri,Ri,R_{i},R_{i},\dots of disjoint ZZ-rays in GG. We show that this sequence is maximal. If RR is a ZZ-ray which is disjoint from every RiR_{i}, then go to stage nn of the construction, where nn is the first vertex of RR. If we did insert some RmR_{m} during stage nn, then RmR_{m} would not be disjoint from RR. Hence we did not insert any ZZ-ray during stage nn. But RR is a ZZ-ray that begins at nn and is disjoint from Rijij<n\langle R_{i_{j}}\rangle_{i_{j}<n}, contradiction.

To prove 𝖬𝖨𝖱𝖳XEZ\mathsf{MIRT}_{\mathrm{XEZ}}, we modify the above construction as follows. At stage n=(u,v)n=(u,v), we search for a ZZ-ray RR which is disjoint from the previous rays and has (u,v)(u,v) as its first edge. The rest of the proof proceeds as above.

The only real obstacle in formalizing the above proofs even in 𝖠𝖢𝖠0\mathsf{ACA}_{0} is being able to find out at stage nn if there is an RR as requested and, if so, choosing one. The question is Σ11\Sigma_{1}^{1} and so in Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} we can answer it and then perhaps use some construction or choice principle to produce it. As we only get a yes answer some of the time, Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} does not seem sufficient. Also later choices depend on previous ones. A computability argument using the Gandy basis theorem and its uniformities works but requires more background development. A choice principle that returns an element if there is one satisfying a Σ11\Sigma_{1}^{1} property (but may act arbitrarily otherwise) and that can be iterated in a recursion is strong Σ11\Sigma_{1}^{1}-𝖣𝖢0\mathsf{DC}_{0} which consists of the scheme

(W)(n)(Y)(Φ(n,i<nW[i],Y)Φ(n,i<nW[i],W[n])),(\exists W)(\forall n)(\forall Y)\left(\Phi\left(n,\bigoplus_{i<n}W^{[i]},Y\right)\rightarrow\Phi\left(n,\bigoplus_{i<n}W^{[i]},W^{[n]}\right)\right),

for any Σ11\Sigma_{1}^{1} formula Φ(n,X,Y)\Phi(n,X,Y). It is known that strong Σ11\Sigma_{1}^{1}-𝖣𝖢0\mathsf{DC}_{0} and Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} are equivalent [28, VII.6.9]. This clearly has the right flavor and the only issue is defining the required Φ(n,X,Y)\Phi(n,X,Y) with parameter GG. This is slightly fussy but not problematic. We provide the details: To define Φ\Phi, first recursively define a finite sequence i0,,ik<ni_{0},\dots,i_{k}<n. If i0,,ij1i_{0},\dots,i_{j-1} have been defined, define iji_{j} to be the least number (if any) above ij1i_{j-1} and below nn such that X[ij]X^{[i_{j}]} is a ZZ-ray in GG which is disjoint from X[i0],,X[ij1]X^{[i_{0}]},\dots,X^{[i_{j-1}]}. It is clear that there is an arithmetic formula with parameter GG which defines i0,,iki_{0},\dots,i_{k} from nn and XX. Next, we say that Φ(n,X,Y)\Phi(n,X,Y) holds if YY is a ZZ-ray in GG which begins with nn, and YY is disjoint from X[i0],,X[ik]X^{[i_{0}]},\dots,X^{[i_{k}]}.

Apply strong Σ11\Sigma_{1}^{1}-𝖣𝖢0\mathsf{DC}_{0} for the formula Φ\Phi to obtain some set WW. By Σ10\Sigma_{1}^{0}-comprehension with parameter GWG\oplus W, we may inductively define a (possibly finite) sequence i0,i1,i_{0},i_{1},\dots, just as we did in the definition of Φ\Phi. Clearly W[ij]j\langle W^{[i_{j}]}\rangle_{j} is a sequence of disjoint ZZ-rays in GG. We claim that it is maximal.

Suppose that RR is a ZZ-ray in GG which is disjoint from every Z[ij]Z^{[i_{j}]}. Suppose that RR begins with vertex nn. Then RR is disjoint from W[i0],,W[ik]W^{[i_{0}]},\dots,W^{[i_{k}]}, where iki_{k} is the largest iji_{j} below nn. It follows that Φ(n,i<nW[i],R)\Phi(n,\bigoplus_{i<n}W^{[i]},R) holds. So Φ(n,i<nW[i],W[n])\Phi(n,\bigoplus_{i<n}W^{[i]},W^{[n]}) holds, i.e., W[n]W^{[n]} is a Z-ray which begins with nn and W[n]W^{[n]} is disjoint from W[i0],,W[ik]W^{[i_{0}]},\dots,W^{[i_{k}]}. By definition of ik+1i_{k+1}, that means that n=ik+1n=i_{k+1}. But then RR and W[ik+1]W^{[i_{k+1}]} are not disjoint, contradiction.

A slicker proof suggested by the referee requires perhaps more background in the metamathematics of Reverse Mathematics. It uses a “countable coded β\beta-model” with the given graph GG as an element. By Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}, for any set GG there is a set XX such that X[0]=GX^{[0]}=G and B(X)=N,{X[i]iN}B(X)=\langle N,\{X^{[i]}\mid i\in N\}\rangle is a β\beta-model, i.e. any Σ11\Sigma_{1}^{1} formula Φ\Phi with parameters from among the X[i]X^{[i]} is true if and only if it is true in B(X)B(X) [28, VII.2.10]. Now one carries out the mathematical proof above but whenever one asks if there is a ray with some property one asks if there is one such among the X[i]X^{[i]}. This converts the Σ11\Sigma_{1}^{1} questions to ones of fixed arithmetic complexity in XX. Then, if the answer is yes, finding an appropriate ii is also arithmetic in XX with fixed complexity. This converts the entire construction to one arithmetic in XX (and so certainly in Π11\Pi_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}). Consider now the claim that the sequence of rays given by this construction over B(X)B(X) is actually maximal. The existence of a counter-example RR with first element nn gives the same contradiction as before. The point is that, as B(X)B(X) is a β\beta-model, the answer to the question asked at stage nn of the construction of whether there is a ray in B(X)B(X) with first vertex nn disjoint from the finite sequence is the same as the question as to whether there exists one at all. The only other point to note is that even though the sequence of rays RiR_{i} is constructed outside of B(X)B(X) each finite initial segment is in B(X)B(X) by an external induction and the fact that B(X)B(X) is obviously a model of 𝖠𝖢𝖠0\mathsf{ACA}_{0}. The argument with the adjustments for 𝖬𝖨𝖱𝖳XEZ\mathsf{MIRT}_{\mathrm{XEZ}} is then the same. ∎

7 Relationships Between 𝖨𝖱𝖳\mathsf{IRT} and Other Theories of Hyperarithmetic Analysis

In this section, we establish implications and nonimplications between variants of 𝖨𝖱𝖳\mathsf{IRT} and THAs other than Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}. One such standard theory is as follows:

Definition 7.1.

The theory unique-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} consists of 𝖱𝖢𝖠0\mathsf{RCA}_{0} and the principle

(n)(!X)A(n,X)(Y)(n)A(n,Y[n])(\forall n)(\exists!X)A(n,X)\rightarrow(\exists Y)(\forall n)A(n,Y^{[n]})

for each arithmetic formula A(n,X)A(n,X).

The above theory is typically known as weak-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (e.g., [28, VIII.4.12]). We deviate from this terminology to introduce a new choice principle where the requirement for unique solutions is replaced by one for finitely many solutions.

Definition 7.2.

The theory finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} consists of 𝖱𝖢𝖠0\mathsf{RCA}_{0} and the principle

(n)( nonzero finitely many X)A(n,X)(Y)(n)A(n,Y[n])(\forall n)(\exists\text{ nonzero finitely many }X)A(n,X)\rightarrow(\exists Y)(\forall n)A(n,Y^{[n]})

for each arithmetic formula A(n,X)A(n,X). Formally, “((\exists nonzero finitely many X)A(n,X)X)A(n,X)” means that there is a nonempty sequence Xii<j\langle X_{i}\rangle_{i<j} such that for each XX, A(n,X)A(n,X) holds if and only if X=XiX=X_{i} for some i<ji<j.

Similarly to Σ11\Sigma_{1}^{1}-𝖠𝖢\mathsf{AC}, each of these two choice principles are equivalent to ones where AA is allowed to be of the form (!Y)B(n,X,Y)(\exists!Y)B(n,X,Y) or ((\exists nonzero finitely many Y)B(n,X,Y)Y)B(n,X,Y), respectively. However, unlike Σ11\Sigma_{1}^{1}-𝖠𝖢\mathsf{AC} neither of these two principle is equivalent to the version where AA is allowed to be Σ11\Sigma_{1}^{1}. Not only would those versions fail to capture the idea that we are dealing with unique or finitely many witnesses and paths through trees but they should be stronger than the stated principles. It is easy to see, for example, that even the unique version with AA Σ11\Sigma_{1}^{1} implies Δ11\Delta_{1}^{1}-𝖢𝖠\mathsf{CA} (Definition 7.8) which is stronger than unique-Σ11\Sigma_{1}^{1}-𝖠𝖢\mathsf{AC} by Van Wesep [30].

Since the THA Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} implies finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} which in turn implies unique-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} whose models are closed under hyperarithmetic reducibility by Proposition 3.9, it follows that finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} is a THA (as is unique-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}). Goh [10] shows that finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} is strictly stronger than unique-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}. We were led to study this version of choice by realizing that a variant of our original proof that 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}}^{\ast} implies unique-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} worked for the finite version.

Theorem 7.3.

𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}} implies finite-Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. (It follows that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} implies finite-Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1}, but this is superseded by Theorem 7.7 below.)

Proof.

We first prove that 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}}^{\ast} for forests implies finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}. By Lemma 6.7, it suffices to prove that for any sequence Tnn\langle T_{n}\rangle_{n} of subtrees of N<NN^{<N} such that each TnT_{n} has finitely many branches, a sequence Pnn\langle P_{n}\rangle_{n} exists with each Pn[Tn]P_{n}\in[T_{n}]. As in the proof of Theorem 6.12, we construct a sequence of trees Snn\langle S_{n}\rangle_{n} such that for each nn, the branches on SnS_{n} are precisely those of the form P0PnP_{0}\oplus\dots\oplus P_{n} where PiP_{i} is a branch on TiT_{i} for ini\leq n.

By 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}}^{\ast} (for forests) there is a sequence Rkk\langle R_{k}\rangle_{k} of YY-disjoint rays in nSn\bigsqcup_{n}S_{n} of maximum cardinality. We claim that Rkk\langle R_{k}\rangle_{k} is infinite. If not, let mm be least such that there is no RkR_{k} in SmS_{m}. Then we can increase the cardinality of Rkk\langle R_{k}\rangle_{k} by adding any ray RR from SmS_{m} while maintaining disjointness by our choice of nn. The point here is that if RRkR\cap R_{k}\neq\emptyset for any kk then they are both in SmS_{m} as the trees SnS_{n} are disjoint and there are no edges between them. Therefore Rkk\langle R_{k}\rangle_{k} has a ray in infinitely many SnS_{n}. Thus we may construct the desired sequence Pnn\langle P_{n}\rangle_{n} recursively by searching at stage nn for an RkR_{k} in SmS_{m} for some m>nm>n and take PnP_{n} to be the branch in TnT_{n} which shares a tail with the nnth coordinate of RkR_{k}.

Now, by Theorem 6.16, it follows that 𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}}^{\ast} for forests implies finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}. By Proposition 6.5, it follows that 𝖨𝖱𝖳DYZ\mathsf{IRT}_{\mathrm{DYZ}}^{\ast} implies finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} as well and so we are done. ∎

Another theory of hyperarithmetic analysis which follows from 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} is arithmetic Bolzano-Weierstrass (𝖠𝖡𝖶0\mathsf{ABW}_{0}):

Definition 7.4.

The theory 𝖠𝖡𝖶0\mathsf{ABW}_{0} consists of 𝖱𝖢𝖠0\mathsf{RCA}_{0} and the following principle: If A(X)A(X) is an arithmetic predicate on 2N2^{N}, either there is a finite sequence Xii\langle X_{i}\rangle_{i} which contains every XX such that A(X)A(X) holds or there is an XX such that every one of its neighborhoods has two YY such that A(Y)A(Y) holds. Such an XX is called an accumulation point of the class {XA(X)}\{X\mid A(X)\}.

Friedman [8] introduced 𝖠𝖡𝖶0\mathsf{ABW}_{0} and asserted that it follows from Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (with unrestricted induction). Conidis [4] proved Friedman’s assertion and established relationships between 𝖠𝖡𝖶0\mathsf{ABW}_{0} and most then known theories of hyperarithmetic analysis. Goh [10] shows that 𝖠𝖡𝖶0+𝖨Σ11\mathsf{ABW}_{0}+\mathsf{I}\Sigma_{1}^{1} implies finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}. We do not know if 𝖠𝖡𝖶0\mathsf{ABW}_{0} is strictly stronger than finite-Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0}.

The following two lemmas will be useful in deriving 𝖠𝖡𝖶0\mathsf{ABW}_{0} from 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}. The first lemma describes a connection between sets of solutions of arithmetic predicates and disjoint rays in trees.

Lemma 7.5 (𝖠𝖢𝖠0\mathsf{ACA}_{0}).

Suppose A(X)A(X) is an arithmetic predicate. Then there is a tree TN<NT\subseteq N^{<N} such that if there is a sequence of distinct solutions of A(X)A(X), then there is a sequence of Y-disjoint single rays in TT of the same cardinality, and vice versa.

Proof.

By Lemma 6.6, there is a tree TN<NT\subseteq N^{<N} such that

X(A(X)f(X,f[T])\displaystyle\forall X(A(X)\leftrightarrow\exists f(\langle X,f\rangle\in[T])
and X( at most one f)(X,f[T]).\displaystyle\forall X(\exists\text{ at most one }f)(\langle X,f\rangle\in[T])\text{.}

If Xii\langle X_{i}\rangle_{i} is a sequence of distinct solutions of A(X)A(X), then, as the required fif_{i} are arithmetic uniformly in the XiX_{i}, there is a sequence of distinct branches Xi,fii\langle\langle X_{i},f_{i}\rangle\rangle_{i} on TT of the same cardinality.

By taking an appropriate tail of each branch, we obtain a sequence Rii\langle R_{i}\rangle_{i} of vertex-disjoint (hence edge-disjoint) single rays in TT of the same cardinality with each one being a tail of Xi,fi\langle X_{i},f_{i}\rangle: As no two distinct branches in a tree can have infinitely many vertices in common, simply take RnR_{n} to be the tail of Xn,fn\left\langle X_{n},f_{n}\right\rangle starting after all vertices it has in common with any Xi,fi\langle X_{i},f_{i}\rangle, i<ni<n.

Conversely, suppose there is a sequence Rii\langle R_{i}\rangle_{i} of YY-disjoint single rays in TT. For each RiR_{i}, we define a branch on TT which corresponds to it as follows. Let xx be the vertex in RiR_{i} which is closest to the root of TT. Then we can extend xRixR_{i} to the root to obtain a branch Xi,fi\langle X_{i},f_{i}\rangle on TT. We claim that Xii\langle X_{i}\rangle_{i} is a sequence of distinct solutions of A(X)A(X). For each iji\neq j, since RiR_{i} and RjR_{j} are YY-disjoint, they cannot share a tail. So Xi,fi\langle X_{i},f_{i}\rangle and Xj,fj\langle X_{j},f_{j}\rangle must be distinct. Since for each XX, there is at most one ff such that X,f\langle X,f\rangle is a branch on TT, it follows that XiXjX_{i}\neq X_{j} as desired. ∎

The second lemma is essentially the well-known fact that the Bolzano-Weierstrass theorem is provable in 𝖠𝖢𝖠0\mathsf{ACA}_{0}:

Lemma 7.6 ([28, III.2.7]).

𝖠𝖢𝖠0\mathsf{ACA}_{0} proves that if Xnn\langle X_{n}\rangle_{n} is a sequence of distinct elements of 2N2^{N}, then there is some ZZ which is an accumulation point of {XnnN}\{X_{n}\mid n\in N\}.

Theorem 7.7.

𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} implies 𝖠𝖡𝖶0\mathsf{ABW}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. Therefore 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} implies 𝖠𝖡𝖶0\mathsf{ABW}_{0} over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma_{1}^{1}.

Proof.

By Proposition 6.5, it suffices to show that the undirected variants of 𝖨𝖱𝖳\mathsf{IRT}^{\ast} imply 𝖠𝖡𝖶0\mathsf{ABW}_{0}.

Suppose A(X)A(X) is an arithmetic predicate on 2N2^{N} such that no finite sequence Xii\langle X_{i}\rangle_{i} contains every XX such that A(X)A(X) holds. By Lemma 7.5, there is a tree TN<NT\subseteq N^{<N} such that for any sequence of distinct solutions of A(X)A(X), there is a sequence of YY-disjoint single rays in TT of the same cardinality, and vice versa.

By 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}}^{\ast}, or by 𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}}^{\ast} and Theorem 6.16, there is a sequence of YY-disjoint single rays in TT of maximum cardinality. This yields a sequence of distinct solutions of A(X)A(X) of the same cardinality.

If this sequence is finite, then there is a solution YY of A(X)A(X) not in the sequence by our assumption. Hence there is a sequence of distinct solutions of A(X)A(X) of larger cardinality, which yields a sequence of YY-disjoint single rays in TT of larger cardinality for the desired contradiction.

Thus there is an infinite sequence Xnn\langle X_{n}\rangle_{n} of distinct solutions of AA. By Lemma 7.6, there is an accumulation point of {XnnN}\{X_{n}\mid n\in N\}, which is of course an accumulation point of {XA(X)}\{X\mid A(X)\}, as desired. ∎

We now turn our attention to nonimplications. One prominent theory of hyperarithmetic analysis is the scheme of Δ11\Delta_{1}^{1}-comprehension (studied by Kreisel [15]):

Definition 7.8.

The theory Δ11\Delta_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0} consists of 𝖱𝖢𝖠0\mathsf{RCA}_{0} and the principle

(n)(Φ(n)¬Ψ(n))X(nXΦ(n))(\forall n)(\Phi(n)\leftrightarrow\lnot\Psi(n))\rightarrow\exists X(n\in X\leftrightarrow\Phi(n))

for all Σ11\Sigma_{1}^{1} formulas Φ\Phi and Ψ\Psi.

Theorem 7.9.

Δ11\Delta^{1}_{1}-𝖢𝖠0𝖨𝖱𝖳XYZ,𝖨𝖱𝖳XYZ\mathsf{CA}_{0}\nvdash\mathsf{IRT}_{\mathrm{XYZ}},\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}.

Proof.

Conidis [4, Theorem 3.1] constructed a standard model which satisfies Δ11\Delta^{1}_{1}-𝖢𝖠0\mathsf{CA}_{0} but not 𝖠𝖡𝖶0\mathsf{ABW}_{0}. By Theorem 7.7, this model does not satisfy 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}. Since standard models satisfy full induction, this model does not satisfy 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} either (by Proposition 6.3). ∎

Theorem 7.10.

𝖠𝖡𝖶0𝖨𝖱𝖳XYZ,𝖨𝖱𝖳XYZ\mathsf{ABW}_{0}\nvdash\mathsf{IRT}_{\mathrm{XYZ}},\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}.

Proof.

By Propositions 5.3 and 6.3, it suffices to show that 𝖠𝖡𝖶0𝖨𝖱𝖳UYZ\mathsf{ABW}_{0}\nvdash\mathsf{IRT}_{\mathrm{UYZ}}. Van Wesep [30, I.1] constructed a standard model 𝒩\mathcal{N} which satisfies unique-Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} but not Δ11\Delta_{1}^{1}-𝖢𝖠0\mathsf{CA}_{0}. Conidis [4, Theorem 4.1], using the approach of [19], showed that 𝒩\mathcal{N} satisfies 𝖠𝖡𝖶0\mathsf{ABW}_{0}. We show below that 𝒩\mathcal{N} does not satisfy 𝖨𝖱𝖳UYZ\mathsf{IRT}_{\mathrm{UYZ}}.

In order to define 𝒩\mathcal{N}, van Wesep constructed a tree TGT^{G} and branches fiGi\langle f_{i}^{G}\rangle_{i\in\mathbb{N}} of TGT^{G} such that (1) 𝒩\mathcal{N} contains TGT^{G} and infinitely many (distinct) fiGf_{i}^{G} (see [30, pg. 13 l. 1–11]); (2) 𝒩\mathcal{N} does not contain any infinite sequence of distinct branches of TGT^{G} (see [30, pg. 12 l. 7–9] and Steel [29, Lemma 7].) Then TGT^{G} is an instance of 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}} in 𝒩\mathcal{N} which has no solution in 𝒩\mathcal{N}. This shows that 𝒩\mathcal{N} does not satisfy 𝖨𝖱𝖳UYS\mathsf{IRT}_{\mathrm{UYS}} for trees. (The reader who wants to follow the details of the proofs in [4] and [30, I.1] should look at the presentation of the basic methods in [19].)

Since 𝒩\mathcal{N} is a standard model, it satisfies full induction. By Theorem 6.16, it follows that 𝒩\mathcal{N} does not satisfy 𝖨𝖱𝖳UYD\mathsf{IRT}_{\mathrm{UYD}} for forests. ∎

Figure 4 illustrates some of our results. In order to simplify the diagram, we have omitted all variants of 𝖨𝖱𝖳\mathsf{IRT} except 𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}}.

Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0}Π11\Pi^{1}_{1}-𝖲𝖤𝖯0\mathsf{SEP}_{0}Δ11\Delta^{1}_{1}-𝖢𝖠0\mathsf{CA}_{0}𝖨𝖭𝖣𝖤𝖢0\mathsf{INDEC}_{0}unique-Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0}𝖨𝖱𝖳UVS\mathsf{IRT}_{\mathrm{UVS}}𝖠𝖡𝖶0\mathsf{ABW}_{0}finite-Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0}(1)(2)(3)𝖨Σ11\mathsf{I}\Sigma^{1}_{1}(4)(5)(6)𝖨Σ11\mathsf{I}\Sigma^{1}_{1}𝖨Σ11\mathsf{I}\Sigma^{1}_{1}||(7)||
Figure 4: Partial zoo of theories of hyperarithmetic analysis. Single arrows indicate implication while double arrows indicate strict implication. The references for the above results are as follows: (1, 2) Montalbán [17, Theorems 2.1, 3.1]; (3, 4) Montalbán [16, Theorem 2.2], Neeman [19, Theorems 1.2, 1.3, 1.4], see also Neeman [20, Theorem 1.1]; (5) Theorem 4.5; (6) Theorems 7.7, 7.10; (7) Conidis [4, Theorem 4.1]. All results concerning finite-Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} are in Goh [10].

8 Isolating the Use of Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0} in Proving 𝖨𝖱𝖳\mathsf{IRT}

We isolate the use of Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0} in our proofs of 𝖨𝖱𝖳XYS\mathsf{IRT}_{\mathrm{XYS}} and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} (Theorems 4.5, 5.10, 5.15) by identifying the following principles:

Definition 8.1.

Let 𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} be the assertion that if GG is an XX-graph with arbitrarily many YY-disjoint ZZ-rays, then there is a sequence of sets Xkk\langle X_{k}\rangle_{k} such that for each kNk\in N, XkX_{k} is a set of kk YY-disjoint ZZ-rays in GG.

Let 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} be the assertion that if GG is an XX-graph and there is a sequence of sets Xkk\langle X_{k}\rangle_{k} such that for each kNk\in N, XkX_{k} is a set of kk YY-disjoint ZZ-rays in GG, then GG has infinitely many YY-disjoint ZZ-rays.

𝖲𝖢𝖱\mathsf{SCR} stands for Strongly Collecting Rays. 𝖶𝖨𝖱𝖳\mathsf{WIRT} stands for Weak Infinite Ray Theorem.

It is clear that Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0} implies 𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} and 𝖲𝖢𝖱XYZ+𝖶𝖨𝖱𝖳XYZ\mathsf{SCR}_{\mathrm{XYZ}}+\mathsf{WIRT}_{\mathrm{XYZ}} implies 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}. The only use of Σ11\Sigma^{1}_{1}-𝖠𝖢0\mathsf{AC}_{0} in our proofs of 𝖨𝖱𝖳XYS\mathsf{IRT}_{\mathrm{XYS}} and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} is to prove 𝖲𝖢𝖱XYS\mathsf{SCR}_{\mathrm{XYS}} and 𝖲𝖢𝖱UVD\mathsf{SCR}_{\mathrm{UVD}} respectively:

Theorem 8.2.

𝖠𝖢𝖠0\mathsf{ACA}_{0} proves 𝖶𝖨𝖱𝖳XYS\mathsf{WIRT}_{\mathrm{XYS}} and 𝖶𝖨𝖱𝖳UVD\mathsf{WIRT}_{\mathrm{UVD}}.

Proof.

For 𝖶𝖨𝖱𝖳UVS\mathsf{WIRT}_{\mathrm{UVS}}, see the proof of Theorem 4.5(ii). In particular, note that the hypothesis of 𝖶𝖨𝖱𝖳\mathsf{WIRT} is exactly the instance of Σ11\Sigma_{1}^{1}-𝖠𝖢\mathsf{AC} needed in that proof (and the one referenced there). From then on, the argument proceeds in 𝖠𝖢𝖠0\mathsf{ACA}_{0} to give the conclusion of 𝖨𝖱𝖳\mathsf{IRT}_{\mathrm{}} and so of 𝖶𝖨𝖱𝖳\mathsf{WIRT}_{\mathrm{}} as desired. Similarly, for 𝖶𝖨𝖱𝖳UVD\mathsf{WIRT}_{\mathrm{UVD}}, see the proof of Theorem 5.10 and for 𝖶𝖨𝖱𝖳XES\mathsf{WIRT}_{\mathrm{XES}}, see the proof of Theorem 5.15. The desired result for 𝖶𝖨𝖱𝖳DVS\mathsf{WIRT}_{\mathrm{DVS}} then follows from Lemma 5.4. ∎

Next we will use the above result to show that 𝖲𝖢𝖱XYS\mathsf{SCR}_{\mathrm{XYS}} and 𝖲𝖢𝖱UVD\mathsf{SCR}_{\mathrm{UVD}} are equivalent over 𝖱𝖢𝖠0\mathsf{RCA}_{0} to 𝖨𝖱𝖳XYS\mathsf{IRT}_{\mathrm{XYS}} and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}} respectively. First, observe that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} implies 𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} for each choice of XYZXYZ. Second, Lemmas 5.2, 5.4 and 5.6 imply

Proposition 8.3.

𝖲𝖢𝖱DYZ\mathsf{SCR}_{\mathrm{DYZ}} implies 𝖲𝖢𝖱UYZ\mathsf{SCR}_{\mathrm{UYZ}}, 𝖲𝖢𝖱DEZ\mathsf{SCR}_{\mathrm{DEZ}} implies 𝖲𝖢𝖱DVZ\mathsf{SCR}_{\mathrm{DVZ}} and 𝖲𝖢𝖱DYD\mathsf{SCR}_{\mathrm{DYD}} implies 𝖲𝖢𝖱DYS\mathsf{SCR}_{\mathrm{DYS}}.

Proposition 8.4.

𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}.

Proof.

By Proposition 8.3, it suffices to establish the desired result for the undirected variants of 𝖲𝖢𝖱\mathsf{SCR}. The proofs are almost identical to those of Theorems 4.5(i) and 5.9. There, we applied 𝖨𝖱𝖳UYZ\mathsf{IRT}_{\mathrm{UYZ}} to forests G=nTnG=\bigsqcup_{n}T_{n}, where each TnT_{n} contains a ZZ-ray, and no two ZZ-rays in TnT_{n} can be YY-disjoint. Any infinite sequence of YY-disjoint ZZ-rays in GG must contain a ZZ-ray in cofinally many graphs TnT_{n}. Therefore from such a sequence we can uniformly compute ZZ-rays in cofinally many graphs TnT_{n}, which establishes 𝖠𝖢𝖠0\mathsf{ACA}_{0} by the construction of nTn\bigsqcup_{n}T_{n}. If we assume 𝖲𝖢𝖱UYZ\mathsf{SCR}_{\mathrm{UYZ}} instead of 𝖨𝖱𝖳UYZ\mathsf{IRT}_{\mathrm{UYZ}}, we only have access to a sequence XkkN\langle X_{k}\rangle_{k\in N} such that for each kk, XkX_{k} is a set of kk YY-disjoint ZZ-rays in GG. From such a sequence we can still uniformly compute ZZ-rays in cofinally many graphs TnT_{n}, because for any kk, Xk+1X_{k+1} must contain a ZZ-ray in some TnT_{n}, nkn\geq k. ∎

By Theorem 8.2, Proposition 8.4, and the observation that 𝖲𝖢𝖱XYZ+𝖶𝖨𝖱𝖳XYZ𝖨𝖱𝖳XYZ\mathsf{SCR}_{\mathrm{XYZ}}+\mathsf{WIRT}_{\mathrm{\mathrm{XYZ}}}\vdash\mathsf{IRT}_{\mathrm{XYZ}}, we obtain

Corollary 8.5.

𝖲𝖢𝖱XYZ\mathsf{SCR}_{\mathrm{XYZ}} and 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} are equivalent over 𝖱𝖢𝖠0\mathsf{RCA}_{0} for the following choices of XYZ: XYS and UVD.

We now turn our attention to 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}. As usual, Lemmas 5.2, 5.4 and 5.6 imply

Proposition 8.6.

𝖶𝖨𝖱𝖳DYZ\mathsf{WIRT}_{\mathrm{DYZ}} implies 𝖶𝖨𝖱𝖳UYZ\mathsf{WIRT}_{\mathrm{UYZ}}, 𝖶𝖨𝖱𝖳DEZ\mathsf{WIRT}_{\mathrm{DEZ}} implies 𝖶𝖨𝖱𝖳DVZ\mathsf{WIRT}_{\mathrm{DVZ}} and 𝖶𝖨𝖱𝖳DYD\mathsf{WIRT}_{\mathrm{DYD}} implies 𝖶𝖨𝖱𝖳DYS\mathsf{WIRT}_{\mathrm{DYS}}.

Recall that 𝖶𝖨𝖱𝖳XYS\mathsf{WIRT}_{\mathrm{XYS}} and 𝖶𝖨𝖱𝖳UVD\mathsf{WIRT}_{\mathrm{UVD}} are provable in 𝖠𝖢𝖠0\mathsf{ACA}_{0} (Theorem 8.2). 𝖶𝖨𝖱𝖳DVD\mathsf{WIRT}_{\mathrm{DVD}} and 𝖶𝖨𝖱𝖳DED\mathsf{WIRT}_{\mathrm{DED}} are open, because Σ11-𝖠𝖢0+𝖶𝖨𝖱𝖳XYZ\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0}+\mathsf{WIRT}_{\mathrm{XYZ}} implies 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}, and 𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}} and 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}} are open (see comments after Theorem 5.1). We do not have an upper bound on the proof-theoretic strength of 𝖶𝖨𝖱𝖳UED\mathsf{WIRT}_{\mathrm{UED}} (an upper bound on 𝖶𝖨𝖱𝖳UED\mathsf{WIRT}_{\mathrm{UED}} would yield an upper bound on 𝖨𝖱𝖳UED\mathsf{IRT}_{\mathrm{UED}}, which we do not currently have).

We do not know if any 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} is equivalent to 𝖠𝖢𝖠0\mathsf{ACA}_{0}. In an effort to clarify the situation, we define an apparent strengthening of 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} and show that it implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}:

Definition 8.7.

Let nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} be the assertion that if GG is an XX-graph and there is a sequence of ZZ-rays R0,R1,R_{0},R_{1},\dots in GG such that for each kk, there are i0,,iki_{0},\dots,i_{k} such that Ri0,,RikR_{i_{0}},\dots,R_{i_{k}} are YY-disjoint, then GG has infinitely many YY-disjoint ZZ-rays.

Every instance of 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} is also an instance of nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}, so nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} implies 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}. Conversely, we have

Proposition 8.8.

𝖠𝖢𝖠0+𝖶𝖨𝖱𝖳XYZ\mathsf{ACA}_{0}+\mathsf{WIRT}_{\mathrm{XYZ}} implies nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}.

Proof.

Suppose GG is an instance of nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}, i.e., GG is an XX-graph and RnnN\langle R_{n}\rangle_{n\in N} is a sequence of ZZ-rays in GG such that for each kk, there are i0,,iki_{0},\dots,i_{k} such that Ri0,,RikR_{i_{0}},\dots,R_{i_{k}} are YY-disjoint. Then 𝖠𝖢𝖠0\mathsf{ACA}_{0} can find such i0,,iki_{0},\dots,i_{k} uniformly in kk. Therefore by 𝖠𝖢𝖠0\mathsf{ACA}_{0}, GG is an instance of 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}. By 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}, GG has infinitely many YY-disjoint ZZ-rays as desired. ∎

Theorem 8.9.

Nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}. It follows that nonuniform-𝖶𝖨𝖱𝖳XYS\mathsf{WIRT}_{\mathrm{XYS}} and nonuniform-𝖶𝖨𝖱𝖳UVD\mathsf{WIRT}_{\mathrm{UVD}} are both equivalent to 𝖠𝖢𝖠0\mathsf{ACA}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}.

Proof.

By Proposition 8.8 and Theorem 8.2, 𝖠𝖢𝖠0\mathsf{ACA}_{0} implies nonuniform-𝖶𝖨𝖱𝖳XYS\mathsf{WIRT}_{\mathrm{XYS}} and nonuniform-𝖶𝖨𝖱𝖳UVD\mathsf{WIRT}_{\mathrm{UVD}}.

Next, we show that nonuniform-𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}. By Lemma 5.2, it suffices to consider the undirected versions of nonuniform-𝖶𝖨𝖱𝖳\mathsf{WIRT}. First, we prove that nonuniform-𝖶𝖨𝖱𝖳UYS\mathsf{WIRT}_{\mathrm{UYS}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0} by constructing a computable instance of nonuniform-𝖶𝖨𝖱𝖳UVS\mathsf{WIRT}_{\mathrm{UVS}} such that every nonuniform-𝖶𝖨𝖱𝖳UES\mathsf{WIRT}_{\mathrm{UES}} solution computes \emptyset^{\prime}. (The desired result follows by relativization.) We use a variation of the graph used in the analogous result in Theorem 4.2.

Construction of G=(V,E)G=(V,E): V={0nn>0}{ns0tn>0V=\{0^{n}\mid n>0\}\cup\{n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t}\mid n>0 and some number below nn is enumerated into \emptyset^{\prime} at stage ss, and either tst\leq s or sn=tn}\emptyset_{s}^{\prime}\upharpoonright n=\emptyset_{t}^{\prime}\upharpoonright n\}. E={(0n,0n+1)n>0}{(ns0t,ns0t+1)ns0t,ns0t+1V}{(ns0t,0)ns0tVE=\{(0^{n},0^{n+1})\mid n>0\}\cup\{(n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t},n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t+1})\mid n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t},n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t+1}\in V\}\cup\allowbreak\{(n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t},0)\mid n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t}\in V and ns0t+1V}n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t+1}\notin V\}. GG is clearly computable.

Verification: It is clear that there is exactly one ray Rn,sR_{\left\langle n,s\right\rangle} in GG beginning with nsn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s for nsVn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\in V and the sequence Rn,snsV\langle R_{\langle n,s\rangle}\rangle_{n\mathord{\mathchoice{\raisebox{3.01389pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01389pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{2.1097pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{2.1097pt}{\scalebox{0.5}{$\frown$}}}}s\in V} is also computable. Note that if n=sn\emptyset^{\prime}\upharpoonright n=\emptyset_{s}^{\prime}\upharpoonright n then this ray is ns0ttN\left\langle n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t}\right\rangle_{t\in N}. Otherwise, it is ns0tns0tV0nn>0\left\langle n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t}\right\rangle_{n\mathord{\mathchoice{\raisebox{3.01389pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01389pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{2.1097pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{2.1097pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{3.01389pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01389pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{2.1097pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{2.1097pt}{\scalebox{0.5}{$\frown$}}}}0^{t}\in V}\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}\left\langle 0^{n}\right\rangle_{n>0}. Next observe that for each kNk\in N, {n,ii<nk\{\left\langle n,i\right\rangle\mid i<n\leq k and i}i\in\emptyset^{\prime}\} is Σ10\Sigma_{1}^{0} and contained in k×kk\times k and so is a set by bounded Σ10\Sigma_{1}^{0} comprehension [28, II.3.9]. Thus the finite function taking n>ln>l (the least number in \emptyset^{\prime}) to the last stage sns_{n} at which an i<ni<n is enumerated in \emptyset^{\prime} is also (coded by) a finite set. So we have, for each kk, a sequence of V-disjoint rays Rn,snl<nl+k\langle R_{n,s_{n}}\rangle_{l<n\leq l+k} of length kk as required for the hypothesis of nonuniform-𝖶𝖨𝖱𝖳UVS\mathsf{WIRT}_{\mathrm{UVS}}.

Suppose then that SiS_{i} is the sequence of rays in a solution for nonuniform-𝖶𝖨𝖱𝖳UES\mathsf{WIRT}_{\mathrm{UES}}. We wish to compute \emptyset^{\prime} from this solution. As the SiS_{i} are E-disjoint at most one of them contains the edge (0,00)(0,00). So by eliminating that one, we can assume none of the SiS_{i} contain (0,00)(0,00). If any of the remaining rays contain some edge of the form (0j,0j+1)(0^{j},0^{j+1}) for j>0j>0 then (as it does not contain (0,00)(0,00)) it must contain (0k,0k+1)(0^{k},0^{k+1}) for every kjk\geq j. Thus there can be at most one such ray among the remaining SiS_{i} and so we can discard it and assume there are no such rays in our list. No remaining ray can have 0 as its first vertex as if it did its second vertex would have to be of the form ns0tn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t} with ns0t+1Vn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t+1}\notin V. Any continuation of this sequence would have to follow the ns0rn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{r} with rr descending from tt and so would have to terminate at nsn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s and not be a ray. Thus all the remaining SiS_{i} are of the form nisi0t+jjN\left\langle n_{i}\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s_{i}\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}0^{t+j}\right\rangle_{j\in N} for some tt with ninkn_{i}\neq n_{k} for iki\neq k. So the remaining SiS_{i} witness the conclusion of nonuniform-𝖶𝖨𝖱𝖳UVS\mathsf{WIRT}_{\mathrm{UVS}} as desired.

As we can replace the first vertex of SiS_{i} by the sequence beginning with nsn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s and ending with its second vertex, we know that ni=sni\emptyset^{\prime}\upharpoonright n_{i}=\emptyset_{s}^{\prime}\upharpoonright n_{i}. Since the sequence SiS_{i} and so that of the nin_{i} is infinite, given any mm we can find an ni>mn_{i}>m and so compute m\emptyset^{\prime}\upharpoonright m as nim\emptyset_{n_{i}}^{\prime}\upharpoonright m as required.

To show that nonuniform-𝖶𝖨𝖱𝖳UYD\mathsf{WIRT}_{\mathrm{UYD}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}, define GG as above. Consider the graph GG^{\prime} gotten by adding on for each nsVn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s\in V new vertices xn,s,kx_{n,s,k} for k>0k>0 and edges (ns,xn,s,1)(n\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s,x_{n,s,1}) and (xn,s,k,xn,s,k+1)(x_{n,s,k},x_{n,s,k+1}) for k>0k>0. The witnesses for the hypothesis of nonuniform-𝖶𝖨𝖱𝖳UVS\mathsf{WIRT}_{\mathrm{UVS}} in GG supply ones for nonuniform-𝖶𝖨𝖱𝖳UVD\mathsf{WIRT}_{\mathrm{UVD}} by tacking on the xn,s,kx_{n,s,k} before nsn\mathord{\mathchoice{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{4.30554pt}{\scalebox{0.7}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}{\raisebox{3.01385pt}{\scalebox{0.5}{$\frown$}}}}s in reverse order. The witnesses for the conclusion of nonuniform-𝖶𝖨𝖱𝖳UED\mathsf{WIRT}_{\mathrm{UED}} can be converted into ones for the conclusion of nonuniform-𝖶𝖨𝖱𝖳UES\mathsf{WIRT}_{\mathrm{UES}} in GG by removing the new vertices. So once again we can compute \emptyset^{\prime}. ∎

We are unable to show that 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} implies 𝖠𝖢𝖠0\mathsf{ACA}_{0}, but we can prove the following:

Theorem 8.10.

𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} is not provable in 𝖱𝖢𝖠0\mathsf{RCA}_{0}.

Proof.

By Proposition 8.6, it suffices to consider the undirected variants of 𝖶𝖨𝖱𝖳\mathsf{WIRT}. For all these variants it suffices to construct a computable graph GG on \mathbb{N} and a computable sequence Xiki<kk\left\langle\left\langle X_{i}^{k}\right\rangle_{i<k}\right\rangle_{k\in\mathbb{N}} such that (1) for each kk\in\mathbb{N}, the XikX_{i}^{k} for i<ki<k are pairwise vertex-disjoint double rays in GG and (2) there is no computable sequence Rjj\left\langle R_{j}\right\rangle_{j\in\mathbb{N}} of edge-disjoint single rays in GG. It is clear that the GG constructed for this CC is a counterexample to each 𝖶𝖨𝖱𝖳UYZ\mathsf{WIRT}_{\mathrm{UYZ}} in the standard model of 𝖱𝖢𝖠0\mathsf{RCA}_{0} with second order part the recursive sets. Of course, as this model is standard, 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} is not provable in 𝖱𝖢𝖠\mathsf{RCA}, 𝖱𝖢𝖠0\mathsf{RCA}_{0} plus induction for all formulas.

The computable construction will be a finite injury priority argument. At the end of stage ss of our construction, for each i<ksi<k\leq s, we will have defined a path Pi,s+1kP_{i,s+1}^{k} with lengths strictly increasing with ss which is intended to be a segment of the double ray XikX_{i}^{k}. We think of these paths PsP_{s} as having domain a segment [u,v][u,v] of \mathbb{Z} containing [s,s][-s,s]. Its endpoints are Ps(u)P_{s}(u) and Ps(v)P_{s}(v). The intention is that the Xik=sPi,skX_{i}^{k}=\bigcup_{s}P_{i,s}^{k} will be the desired double rays such that, for each kk, the XikX_{i}^{k} for i<ki<k will be vertex-disjoint. We will also have put all numbers less than ss in as vertices in at least one of these Pi,skP_{i,s}^{k}. In future stages, we will not add any edges between vertices which are currently in any Pi,skP_{i,s}^{k}\ for i<ki<k. Thus GG will be a computable graph given by the union of the double rays Xik=sPi,skX_{i}^{k}=\bigcup_{s}P_{i,s}^{k}. We let GsG_{s} be the graph defined so far, i.e. {Pi,ski<k<s}.\bigcup\{P_{i,s}^{k}\mid i<k<s\}. We let Gs>tG_{s}^{>t} be its subgraph defined as the union of the P0,sk,,Pk1,skP_{0,s}^{k},\dots,P_{k-1,s}^{k} for k>tk>t and similarly for k<tk<t and other interval notations.

We say that the disjointness condition, d.c., holds at stage ss if for every m<sm<s and distinct nn and n<mn^{\prime}<m, Pn,smP_{n,s}^{m} and Pn,smP_{n^{\prime},s}^{m} are vertex-disjoint. Otherwise we say we have violated the d.c. Clearly, if we never violate the d.c. the XnmX_{n}^{m} (for fixed mm) are pairwise vertex-disjoint. We arrange the construction so that we obviously never violate the d.c.

So it suffices to also meet the following requirements:

Qe:\displaystyle Q_{e}: If R0,R1, is a computable sequence of single rays in G defined by Φe,\displaystyle\text{ If }R_{0},R_{1},\dots\text{ is a computable sequence of single rays in }G\text{ defined by $\Phi_{e}$, }
i.e. Φe(i,n)\displaystyle\text{i.e. }\Phi_{e}(i,n) =Ri(n) then the Ri are not edge-disjoint.\displaystyle=R_{i}(n)\text{ then the }R_{i}\text{ are not edge-disjoint.}

The requirements QeQ_{e} are listed in order of priority. During our construction, if all else fails, we will attempt to satisfy each QeQ_{e} at some stage ss by merging certain rays XikX_{i}^{k} and XjlX_{j}^{l} using vertices xx and yxy\neq x which are endpoints of Pi,skP_{i,s}^{k} and Pj,slP_{j,s}^{l}, respectively. We do this by adding the least new number rr (the merge point of this merger) as a vertex of GG as well as edges (x,r)(x,r) and (y,r)(y,r) which are appended to each Pr,sqP_{r,s}^{q} with xx or yy, respectively, as an endpoint. We also ensure that Pi,tkP_{i,t}^{k} and Pj,tlP_{j,t}^{l} henceforth agree after the vertex rr as they grow in the corresponding directions.

Without loss of generality, and to simplify notation later, we make the assumption that if Φe,s(i,u)\Phi_{e,s}(i,u) is convergent for any ee, ii and uu then so is Φe,s(i,u)\Phi_{e,s}(i,u^{\prime}) for every u<uu^{\prime}<u.

Construction. At stage ss of the construction, we are given a finite graph GsG_{s} consisting of, for each k<sk<s, finite vertex-disjoint paths P0,sk,,Pk1,skP_{0,s}^{k},\dots,P_{k-1,s}^{k} as described above. We let fs(e)f_{s}(e) be the final stage before ss at which QeQ_{e} was initialized. For notational convenience when ss and ee are specified we simply write ff for fs(e)f_{s}(e).

First, we act for the requirement QeQ_{e} of highest priority with e<se<s which requires attention as described below. All requirements QeQ_{e} are initialized and unsatisfied at stage 0 and are initialized and declared to be unsatisfied whenever we act for a QeQ_{e^{\prime}} with e<ee^{\prime}<e.

We say that QeQ_{e} requires attention at stage ss if QeQ_{e} is not satisfied and there are a,u,x,b,v,y<sa,u,x,b,v,y<s such that u>0u>0, Φe,s(a,u)\Phi_{e,s}(a,u)\downarrow, Φe,s(a,n)nu\left\langle\Phi_{e,s}(a,n)\right\rangle_{n\leq u} is a path in GsG_{s} disjoint from Gs<fG_{s}^{<f} which can be extended in only one way to a maximal path in GsG_{s} and this extension eventually reaches an xGs<fx\notin G_{s}^{<f} which is an endpoint of some Pi,skP_{i,s}^{k} for kfk\geq f and similarly for b,vb,v and yy for some Pj,slP_{j,s}^{l} such that xx and yy are not both endpoints of the same PrqP_{r}^{q}. We also require that the merger using xx and yy would not violate the d.c. We then let a(e,s)a(e,s) etc. be the associated witnesses for the least such computation. In this case, the actions for QeQ_{e} is to perform the merger using x(e,s)x(e,s) and y(e,s)y(e,s) as defined above and declare QeQ_{e} to be satisfied.

Finally, for each ww, in turn, which is an endpoint of any the paths PikP_{i}^{k} as now defined we extend those paths by taking the least new number zz which we append after ww in each of these paths (and so add (w,z)(w,z) as a new edge). For each i<si<s, in turn, we also take the next 2s+12s+1 least new numbers and let the Pis(n)P_{i}^{s}(n)\ be these numbers in order for n[s,s]n\in[-s,s]. This defines the Pi,s+1kP_{i,s+1}^{k} for i<k<s+1i<k<s+1 and completes stage ss of the construction. As promised we let Xik=sPi,skX_{i}^{k}=\bigcup_{s}P_{i,s}^{k}.

Verification. It is clear that, for each i<ki<k\in\mathbb{N}, XikX_{i}^{k} is a double ray and that GG is a computable graph consisting of the union of these rays. It is also clear that by construction we never violate the d.c. and so for each kk the XikX_{i}^{k} for i<ki<k are pairwise vertex disjoint as required. Thus we only need to prove that we meet each QeQ_{e}.

We now state a series of facts about GG each of which follows immediately (or by simple inductions) from the construction and previous facts on the list.

Lemma 8.11.

Every vertex rr which is a merge point has exactly three neighbors and they are the xx and yy used in the merger and the zz added on after rr in the final part of the action at the merger stage. In addition, no endpoint of any Pi,skP_{i,s}^{k} is a merge point and every vertex which is not a merge point has exactly two neighbors.

If uGsu\in G_{s} and so uPi,sku\in P_{i,s}^{k} for some i<k<si<k<s, then for any j<lj<l, uXjluPj,slu\in X_{j}^{l}\Leftrightarrow u\in P_{j,s}^{l} and if so l<sl<s.

If (u,v)Gs(u,v)\in G_{s} then not both uu and vv are merge points. If neither are merge points then ki<k(vPi,skuPi,sk)\forall k\forall i<k(v\in P_{i,s}^{k}\Leftrightarrow u\in P_{i,s}^{k}). If one, say uu, is a merge point rr for a merger at some stage tt necessarily less than ss using some xx and yy with (r,z)(r,z) the edge added on at the end of stage tt, then the other (vv) is xx, yy or zz; ki<k(rPi,t+1kxPi,tkyPi,tk)\forall k\forall i<k(r\in P_{i,t+1}^{k}\Leftrightarrow x\in P_{i,t}^{k}\vee y\in P_{i,t}^{k}); ki<k(rPi,t+1krPi,skzPi,t+1kzPi,sk)\forall k\forall i<k(r\in P_{i,t+1}^{k}\Leftrightarrow r\in P_{i,s}^{k}\Leftrightarrow z\in P_{i,t+1}^{k}\Leftrightarrow z\in P_{i,s}^{k}).

Any ray in GG which begins in GsG_{s} remains in GsG_{s} until it reaches an endpoint of some Pi,skP_{i,s}^{k}.

Each requirement acts at most once after the last time it is initialized. So, by induction, each requirement acts and is initialized only finitely often. Thus there is an infinite sequence wiw_{i} such that at each wiw_{i} we act for some QeQ_{e} and we never act for any QeQ_{e^{\prime}} with eee^{\prime}\leq e afterwards. The observation to make here is that if there were a last stage ww at which we act for any QeQ_{e} then there would be no mergers of any XikX_{i}^{k} and XjlX_{j}^{l} for k,l>wk,l>w. In this case all the XikX_{i}^{k}, i<ki<k for k>wk>w would be disjoint. It is then easy to see from the observations above that at some stage after ww we would act for a QeQ_{e} with e>we>w such that Φe(0,n)=Xik(n)\Phi_{e}(0,n)=X_{i}^{k}(n) and Φe(1,n)=Xjl(n)\Phi_{e}(1,n)=X_{j}^{l}(n) for k,l>wk,l>w.

If ww is one of the wiw_{i} just defined, zG<wz\in G^{<w} and (z,z)G(z,z^{\prime})\in G then zGwz^{\prime}\notin G^{\geq w}. The point to notice here is that by construction no merger at a stage s>ws>w can use any xG<wx\in G^{<w} as every QeQ_{e} which can act after ww is initialized at ww by the choice of the wiw_{i}. Thus any path in GG which starts with a zG<wz\in G^{<w} never enters GwG^{\geq w}. \square

Suppose now that Φe\Phi_{e} is total and defines a sequence of edge-disjoint single rays Rii\left\langle R_{i}\right\rangle_{i\in\mathbb{N}} in GG. By Lemma 8.11, we may choose a w=wcw=w_{c} for some cc after which no QeQ_{e^{\prime}} for eee^{\prime}\leq e ever acts again.

If QeQ_{e} is not satisfied at the end of stage ww, we argue that we act for it later to get a contradiction. Once a ray begins in G<wG^{<w}, it must stay there by Lemma 8.11 and once beyond all the (finitely many) merge points in G<wG^{<w} (none can be put in after stage ww and no ray has repeated vertices) it remains in some XikX_{i}^{k} for k<wk<w with which it shares a tail (as each vertex which is not a merge point has exactly two neighbors). So by the edge-disjointness of the RiR_{i} there is an aa such that Φe(a,0)=Ra(0)\Phi_{e}(a,0)=R_{a}(0) is not in G<wG^{<w}. It starts in G<wG^{<w^{\prime}} with w=wcw^{\prime}=w_{c^{\prime}} for some c>cc^{\prime}>c and so shares a tail with some PjkP_{j}^{k} with wck<wcw_{c}\leq k<w_{c^{\prime}}. Similarly there is bb such that RbR_{b} begins in some GwdG^{\geq w_{d}} with d>cd>c^{\prime} and shares a tail with some XjlX_{j}^{l} with wdlw_{d}\leq l. So eventually we have a stage ss such that Φe,s(a,u)\Phi_{e,s}(a,u)\downarrow and Φe,s(b,v)\Phi_{e,s}(b,v)\downarrow define paths in GsG_{s} which go beyond the points by which RaR_{a} and RbR_{b} share tails with XikX_{i}^{k} and XjlX_{j}^{l}, respectively, and after which neither tail contains a merge point and so these paths have unique extensions to paths in GsG_{s} (determined by the appropriate tails of XikX_{i}^{k} and XjlX_{j}^{l}) ending with the endpoints of Pi,skP_{i,s}^{k} and Pj,slP_{j,s}^{l}, respectively. Finally, note that the merger of XikX_{i}^{k} and XjlX_{j}^{l} at ss would not violate d.c. as by Lemma 8.11 Pi,skP_{i,s}^{k} can share vertices only with Pn,smP_{n,s}^{m} with wcm<wcw_{c}\leq m<w_{c^{\prime}} and Pj,slP_{j,s}^{l} can share vertices only with Pn,smP_{n^{\prime},s}^{m} with mwd>wcm\geq w_{d}>w_{c^{\prime}}. Thus at stage ss we would act to satisfy QeQ_{e} for the desired contradiction.

So QeQ_{e} is satisfied at ww and was satisfied at some sws\leq w. With the notations as at ss, Φe(a,u)\Phi_{e}(a,u) and Φe(b,v)\Phi_{e}(b,v) define initial segments of RaR_{a} and RbR_{b} which can each be extended in only one way to maximal paths in GsG_{s} eventually reaching the vertices xx and yy, respectively, as described at ss. The merger performed at ss puts the merge point rr in both Pi,s+1kP_{i,s+1}^{k} and Pl,s+1jP_{l,s+1}^{j} and so in RaR_{a} and RbR_{b} (as the successor of xx and yy, respectively, as by Lemma 8.11 rr is the only neighbor in GG of xx other than its predecessor in RaR_{a} and similarly for yy and RbR_{b}). We then add in (r,z)(r,z) to GG at stage ss. By Lemma 8.11, the only neighbors of rr are xx, yy and zz so any continuation of RaR_{a} and RbR_{b} after rr must produce a shared edge, as RaR_{a} can continue only with either (y,r)(y,r) which is an edge of RbR_{b} or with (r,z)(r,z) and RbR_{b} can continue only with (x,r)(x,r) which is an edge of RaR_{a} or (r,z)(r,z). This yields the final contradiction. ∎

9 Open Questions

In addition to the variations of the Halin type theorems investigated here that remain open problems of graph theory (𝖨𝖱𝖳DVD\mathsf{IRT}_{\mathrm{DVD}} and 𝖨𝖱𝖳DED\mathsf{IRT}_{\mathrm{DED}}) the most intriguing computational and reverse mathematical questions are about either separating the variants or providing additional reductions or equivalences among the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} and Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}. Clearly the most important issue is deciding if any (or even all) the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} which are known to be provable in Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0} are actually equivalent to it. We extend this problem to include the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} and 𝖨Σ11\mathsf{I}\Sigma_{1}^{1}.

Question 9.1.

Can one show that any of the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} which are provable in Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} (𝖨𝖱𝖳XYS\mathsf{IRT}_{\mathrm{XYS}} and 𝖨𝖱𝖳UVD\mathsf{IRT}_{\mathrm{UVD}}) do not imply Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0} or even over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma_{1}^{1}? An intermediate result might be that 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} (for one of these versions) does not imply Σ11-𝖠𝖢0\Sigma_{1}^{1}\text{-}\mathsf{AC}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0}.

Should any of these 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} be strictly weaker than Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}, the question would then be to determine the relations among the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} and analogously the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast}.

Question 9.2.

Can any additional arrows be added to Figure 1 over 𝖱𝖢𝖠0\mathsf{RCA}_{0} or 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma_{1}^{1}? (This includes the question of whether 𝖱𝖢𝖠0𝖨𝖱𝖳UVD𝖨𝖱𝖳UVS\mathsf{RCA}_{0}\vdash\mathsf{IRT}_{\mathrm{UVD}}\rightarrow\mathsf{IRT}_{\mathrm{UVS}}.)

As we noted in Remark 5.8 there is an apparent additional reduction in Bowler, Carmesin, Pott [3, pg. 2 l. 3–7]. They use an intermediate reduction to locally finite graphs in the sense of relying on the fact that if a graph has arbitrarily many disjoint rays it has a locally finite subgraph with arbitrarily many disjoint rays. This is the principle to which that Remark refers. It plus 𝖠𝖢𝖠0\mathsf{ACA}_{0} is a THA but over 𝖱𝖢𝖠0\mathsf{RCA}_{0} it does not imply 𝖠𝖢𝖠0\mathsf{ACA}_{0} and is provably very weak (in the sense of being highly conservative over 𝖱𝖢𝖠0\mathsf{RCA}_{0}). Shore [26] proves these results and further analyzes this and many similar principles some related to the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} and others to an array of classical logical principles.

Any reductions in 𝖱𝖢𝖠0\mathsf{RCA}_{0} as requested in the Question above would, of course, provide the analogous ones for the 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}. However, it is possible that other implications can be proven for the 𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}:

Question 9.3.

Can any implications of the form 𝖨𝖱𝖳XYZ𝖨𝖱𝖳XYZ\mathsf{IRT}^{\ast}_{\mathrm{XYZ}}\rightarrow~{}\mathsf{IRT}^{\ast}_{\mathrm{X^{\prime}Y^{\prime}Z^{\prime}}} be proven in 𝖱𝖢𝖠0\mathsf{RCA}_{0} other than the ones known to hold for the 𝖨𝖱𝖳\mathsf{IRT} versions?

Probably more challenging is the problem of separating the principles.

Question 9.4.

Can one prove any nonimplication over 𝖱𝖢𝖠0\mathsf{RCA}_{0} or over 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1} for any pair of the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}?

Of course, any such separation for the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} of Question 9.1 would answer a case of that question by proving that at least one of these principles is strictly weaker than Σ11\Sigma_{1}^{1}-𝖠𝖢0\mathsf{AC}_{0}. In addition, a separation by standard models or even ones over 𝖨Σ11\mathsf{I}\Sigma_{1}^{1} for the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}} would give nonimplication for the corresponding 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}^{\ast} but it might be that nonstandard models could be used to separate one pair of versions but not the other.

The next natural question looks below 𝖠𝖡𝖶0\mathsf{ABW}_{0} in Figure 4.

Question 9.5.

Can one prove that finite-Σ11-𝖠𝖢0\Sigma^{1}_{1}\text{-}\mathsf{AC}_{0} does not imply 𝖠𝖡𝖶0\mathsf{ABW}_{0} over 𝖱𝖢𝖠0\mathsf{RCA}_{0} or 𝖱𝖢𝖠0+𝖨Σ11\mathsf{RCA}_{0}+\mathsf{I}\Sigma^{1}_{1}?

The weaker versions, 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}}, of the 𝖨𝖱𝖳XYZ\mathsf{IRT}_{\mathrm{XYZ}}, prompt a question about 𝖠𝖢𝖠0\mathsf{ACA}_{0}.

Question 9.6.

Do any of the 𝖶𝖨𝖱𝖳XYZ\mathsf{WIRT}_{\mathrm{XYZ}} (especially the ones provable from 𝖠𝖢𝖠0\mathsf{ACA}_{0}) imply 𝖠𝖢𝖠0\mathsf{ACA}_{0}? An easier question might be whether they imply 𝖶𝖪𝖫0\mathsf{WKL}_{0}?

References

  • [1] Ron Aharoni. König’s duality theorem for infinite bipartite graphs. J. London Math. Soc. (2), 29(1):1–12, 1984.
  • [2] Ron Aharoni, Menachem Magidor, and Richard A. Shore. On the strength of König’s duality theorem for infinite bipartite graphs. J. Combin. Theory Ser. B, 54(2):257–290, 1992.
  • [3] Nathan Bowler, Johannes Carmesin, and Julian Pott. Edge-disjoint double rays in infinite graphs: a Halin type result. J. Combin. Theory Ser. B, 111:1–16, 2015.
  • [4] Chris J. Conidis. Comparing theorems of hyperarithmetic analysis with the arithmetic Bolzano-Weierstrass theorem. Trans. Amer. Math. Soc., 364(9):4465–4494, 2012.
  • [5] Reinhard Diestel. Graph theory, volume 173 of Graduate Texts in Mathematics. Springer, Berlin, fifth edition, 2017.
  • [6] Yu. L. Ershov, S. S. Goncharov, A. Nerode, J. B. Remmel, and V. W. Marek, editors. Handbook of recursive mathematics. Vol. 2, volume 139 of Studies in Logic and the Foundations of Mathematics. North-Holland, Amsterdam, 1998. Recursive algebra, analysis and combinatorics.
  • [7] Harvey Friedman. Algorithmic procedures, generalized Turing algorithms, and elementary recursion theory. In Logic Colloquium ’69 (Proc. Summer School and Colloq., Manchester, 1969), pages 361–389, 1971.
  • [8] Harvey Friedman. Some systems of second order arithmetic and their use. Proceedings of the International Congress of Mathematicians (Vancouver, B. C., 1974), Vol. 1, pages 235–242, 1975.
  • [9] Harvey M. Friedman. Subsystems of set theory and analysis. 1967. Thesis (Ph. D.)–Massachusetts Institute of Technology, Dept. of Mathematics.
  • [10] Jun Le Goh. The strength of an axiom of finite choice for branches in trees. To appear in the Journal of Symbolic Logic.
  • [11] Rudolf Halin. Über die Maximalzahl fremder unendlicher Wege in Graphen. Math. Nachr., 30:63–85, 1965.
  • [12] Rudolf Halin. Die Maximalzahl fremder zweiseitig unendlicher Wege in Graphen. Mathematische Nachrichten, 44(1‐6):119–127, 1970.
  • [13] Denis R. Hirschfeldt. Slicing the truth, volume 28 of Lecture Notes Series. Institute for Mathematical Sciences. National University of Singapore. World Scientific Publishing Co. Pte. Ltd., Hackensack, NJ, 2015. On the computable and reverse mathematics of combinatorial principles, Edited and with a foreword by Chitat Chong, Qi Feng, Theodore A. Slaman, W. Hugh Woodin and Yue Yang.
  • [14] Pierre Jullien. Contribution à l’étude des types d’ordres dispersés. 1968. Thesis (Ph.D.)–Marseille.
  • [15] Georg Kreisel. The axiom of choice and the class of hyperarithmetic functions. Nederl. Akad. Wetensch. Proc. Ser. A 65 = Indag. Math., 24:307–319, 1962.
  • [16] Antonio Montalbán. Indecomposable linear orderings and hyperarithmetic analysis. J. Math. Log., 6(1):89–120, 2006.
  • [17] Antonio Montalbán. On the Π11\Pi_{1}^{1}-separation principle. MLQ Math. Log. Q., 54(6):563–578, 2008.
  • [18] Antonio Montalbán. Open questions in reverse mathematics. Bull. Symbolic Logic, 17(3):431–454, 2011.
  • [19] Itay Neeman. The strength of Jullien’s indecomposability theorem. J. Math. Log., 8(1):93–119, 2008.
  • [20] Itay Neeman. Necessary use of Σ11\Sigma^{1}_{1} induction in a reversal. J. Symbolic Logic, 76(2):561–574, 2011.
  • [21] Klaus-Peter Podewski and Karsten Steffens. Injective choice functions for countable families. J. Combinatorial Theory Ser. B, 21(1):40–46, 1976.
  • [22] Hartley Rogers, Jr. Theory of recursive functions and effective computability. MIT Press, Cambridge, MA, second edition, 1987.
  • [23] Joseph G. Rosenstein. Linear orderings, volume 98 of Pure and Applied Mathematics. Academic Press, Inc. [Harcourt Brace Jovanovich, Publishers], New York-London, 1982.
  • [24] Gerald E. Sacks. Higher recursion theory. Perspectives in Mathematical Logic. Springer-Verlag, Berlin, 1990.
  • [25] Richard A. Shore. On the strength of Fraïssé’s conjecture. In Logical methods (Ithaca, NY, 1992), volume 12 of Progr. Comput. Sci. Appl. Logic, pages 782–813. Birkhäuser Boston, Boston, MA, 1993.
  • [26] Richard A. Shore. Almost theorems of hyperarithmetic analysis. J. Symbolic Logic, 88(2):664–696, 2023.
  • [27] Stephen G. Simpson. On the strength of König’s duality theorem for countable bipartite graphs. J. Symbolic Logic, 59(1):113–123, 1994.
  • [28] Stephen G. Simpson. Subsystems of second order arithmetic. Perspectives in Logic. Cambridge University Press, Cambridge; Association for Symbolic Logic, Poughkeepsie, NY, second edition, 2009.
  • [29] John R. Steel. Forcing with tagged trees. Ann. Math. Logic, 15(1):55–74, 1978.
  • [30] Robert Alan Van Wesep. Subsystems of Second-order Arithmetic, and Descriptive Set Theory under the Axiom of Determinateness. ProQuest LLC, Ann Arbor, MI, 1977. Thesis (Ph.D.)–University of California, Berkeley.