This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\gridframe

N \pyear2010 \pmonthMonth \doinu10.1017/S1755020300000000 \leftrunningheadLucius T. Schoenbaum \rightrunningheadOn the Syntax of Logic and Set Theory

On the Syntax of Logic and Set Theory

LUCIUS T. SCHOENBAUM Louisiana State University111Received: August 25, 2008 DEPARTMENT OF MATHEMATICS
  LOUISIANA STATE UNIVERSITY
   BATON ROUGE, LA 70803
E-mail: lschoe2@lsu.edu
URL: http://www.math.lsu.edu/~lschoe2
Abstract

We introduce an extension of the propositional calculus to include abstracts of predicates and quantifiers, employing a single rule along with a novel comprehension schema and a principle of extensionality, which are substituted for the Bernays postulates for quantifiers and the comprehension schemata of ZF and other set theories. We prove that it is consistent in any finite Boolean subset lattice. We investigate the antinomies of Russell, Cantor, Burali-Forti, and others, and discuss the relationship of the system to other set theoretic systems ZF, NBG, and NF. We discuss two methods of axiomatizing higher order quantification and abstraction, and then very briefly discuss the application of one of these methods to areas of mathematics outside of logic.

volume: 0issue: 0

1 Introduction

This work is an extended investigation of the idea of regarding the set {a,b}\{a,b\} not as the extension of a class (in this case, the class whose existence is guaranteed in ZF by the Pairing axiom), but as a mathematical product aba\star b, an approach which, as we will see, naturally gives rise to a hierarchy of products 1,2,3\star_{1},\star_{2},\star_{3}\ldots in some ways comparable to the recursive hierarchy of arithmetical operations +,,+,\cdot, etc. A major goal is to obtain a well-defined notion of the abstract {xϕ(x)}\{x\mid\phi(x)\} of the predicate ϕ\phi such that, informally stated, one has {xϕ(x)ψ(x)}={xϕ(x)}{xψ(x)}\{x\mid\phi(x)\lor\psi(x)\}=\{x\mid\phi(x)\}\star\{x\mid\psi(x)\}, and other theorems of the same kind. There are two facts to underline:

  1. [(ii).]

  2. (i).

    In order to form such abstract classes, one is forced to introduce some kind of notion of \star-primality. Havoc would result if \star-divisible objects could be ``included" or ``added" to the class by satisfying the condition ϕ\phi, because this property might not necessarily be inherited by its \star-divisors.

  3. (ii).

    In order to form such abstract classes, the empty set must acquire a status unlike that of ordinary sets. Were such a class to be empty, {xϕ(x)}=Ø\{x\mid\phi(x)\}=\text{{\O }}, it must be well defined whether it is meant that ϕ(Ø),¬ϕ(Ø)\phi(\text{{\O }}),\lnot\phi(\text{{\O }}), or perhaps neither.

It is not surprising that difficulties arise once, so to speak, the animals have all been released from their cages—once, that is, braces are eliminated and totalities give way to trickier (and more fluid) multiplicities. What is remarkable is that it seems that surpassing these obstacles, as a straightforward consequence, yields a naive comprehension principle, since two critical paradoxes, those of Russell and of Cantor, are correlated with the two points above. Cantor's paradox, which arises due to the unwieldy growth of power sets, can be avoided if the power set is not allowed to be \star-prime. Russell's paradox is avoided if we do not grant \star-primality to the empty set.

There is a second idea, closely related to the first, involved in our investigation. By building set theory not as the semantics of first order logic, but (through a kind of ``radical Henkinization") syntactically of a piece with it, we demonstrate that the basic philosophical principles of formalism can be applied in a setting further removed from logicist foundations than that setting established by Hilbert and his circle during the formative years of mathematical logic. Through a slight modification of the model-theoretic boundary between syntactic and semantic spaces, we invest formulas with location among model-theoretic individuals, allowing a number of simplifying reductions of the main artifacts of logic to be implemented. This appears to carry interesting effects through to more advanced levels of analysis in proof theory and model theory (§3), set theory (§4), and in other areas of mathematics (§5). From a broader vantage, it gives rise to an organization or Weltanschauung for mathematics which may be of interest to scientists and philosophers in areas related to mathematical foundations.

Our presentation is essentially self-contained, in order to address a broad audience inclusive of researchers (philosophers and mathematicians alike) in areas closely related to logic who may not have a background in logic comparable to those of specialists in the field. We would be most content if our work were considered interdisciplinary, and sincerely hope that our presentation contains something of interest to much-admired target readers in a plurality of disciplines.

2 Elements

2.1 Preliminaries

Our treatment of propositional calculus in this section will be brief. We study the domain of objects, or (synonymously) expressions. Anything that ``exists", we say, is an object—for example, propositions, desks, and numbers are objects. A handful of objects are designated as signs, these are:

Ø˙^\text{{\O }}\quad{}^{\prime}\mspace{1.0mu}\quad^{\dot{}}\quad\mspace{2.0mu}\hat{}\quad\rightarrow\quad\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\quad\cap\quad

The symbol Ø is the system's basic constant. The next three symbols of the list generate new objects. An object of the form α\alpha{}^{\prime}\mspace{1.0mu} for some object α\alpha is a variable, an object of the form α˙\alpha^{\dot{}} is an index, and a constant is, in addition to Ø, any object of the form α^\alpha\mspace{2.0mu}\hat{}. We call the first handful of variables Ø,Ø,Ø,\text{{\O }}{}^{\prime}\mspace{1.0mu},\text{{\O }}{}^{\prime}\mspace{1.0mu}{}^{\prime}\mspace{1.0mu},\text{{\O }}{}^{\prime}\mspace{1.0mu}{}^{\prime}\mspace{1.0mu}{}^{\prime}\mspace{1.0mu}, etc. with the standard abbreviations a,b,c,x,y,z,A,B,C,a,b,c,x,y,z,A,B,C, etc.; this assignment is taken to be precise, but need not be specified here. In our presentation, when we have a need for metamathematical variables, we use Greek letters α,β,γ,\alpha,\beta,\gamma, etc. We call the three unary connectives ,˙,^{}^{\prime}\mspace{1.0mu},^{\dot{}},\mspace{2.0mu}\hat{} stops or logical stops. This name is chosen since they are meant to ``stop" the progress of a logical substitution (defined below). The index stop can be put aside for now; we will not see it again until §3.

The symbol \rightarrow is the fundamental relation, referred to as containment or universal containment. The symbols \mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu} and \cap are (binary) operations. Table 1 displays the system's basic formation rules. These rules allow, for any objects α\alpha, β\beta, γ\gamma, the formation of a new object, having α\alpha, β\beta, γ\gamma as subobjects or subexpressions; this language is heritable in the obvious sense. The object (αβ)(\alpha\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\beta) is the conjunction, union, or federation of α\alpha and β\beta.222We have searched for an apt name and symbol for the product (αβ\alpha\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\beta): a universal expression of objects-in-multiplicity, akin to ordinary addition but idempotent and, in practice, more basic. The symbol employed is a formal comma modeled after the commas in the notation {a,b}\{a,b\} for the pair and the Gentzen-Kleene-Rosser notation Γ,AB\Gamma,A\vdash B for the carriage of assumptions. The use of an ordinary comma is tempting but troublesome, so one considers using one or the other of \cup and \land. However, because the product should carry neither logical nor set-theoretical connotation in every instance, stultifying and plainly odd-looking expressions arise as a result. The symbol ()(\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\,) smooths over these distinctions, and thus helps us achieve the desired conceptual view. Our approach is to use the terminology federation, union, and conjunction (and three distinct symbols) to refer to instances of the same operation appearing at three distinct positions in the order of operations (see footnote, p. 26). It is hoped that a new symbol and a new term will aid the reader and avoid confusion to the greatest possible extent. The object (αβ)(\alpha\cap\beta) is the intersection of α\alpha and β\beta. We shall suppress these parentheses when they are unnecessary.

We call any object generated from Ø using only the given formation rules (and those that may be added later) formal objects, or well-formed objects. Objects which are not well-formed are informal objects, and include chairs, desks, letters of the alphabet (when not intended as abbreviations of formal objects), and strings of signs which are not well-formed. In this paper, we shall loosely allow consideration of informal or ``ordinary" objects as objects of the system, while giving precedence to the view in which all but the formal objects are laid aside or ignored. Thus the terms object and expression will hereafter refer to formal objects.

Table 1: Formation Rules (objects α\alpha, β\beta)
{oldtabular}

c {oldtabular}l ØØ\text{{\O }}\longmapsto\text{{\O }}

α(α)^\hskip 0.5pt\alpha\hskip 0.6pt\longmapsto(\alpha)\mspace{2.0mu}\hat{}

α(α)\hskip 0.5pt\alpha\hskip 0.6pt\longmapsto(\alpha){}^{\prime}\mspace{1.0mu}

{oldtabular}

l αβ(αβ)\alpha\,\beta\longmapsto(\alpha\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\beta)

αβ(αβ)\alpha\,\beta\longmapsto(\alpha\rightarrow\beta)

αβ(αβ)\alpha\,\beta\longmapsto(\alpha\cap\beta)

We make the following definitions. For Ø^\text{{\O }}\mspace{2.0mu}\hat{} we write \bot. For aa\rightarrow\bot we write ¬a\lnot a, the negation of aa. For Øa\text{{\O }}\rightarrow a we write a¯\underline{a}, the trivialization of aa.333This notation mirrors the notation a¯\bar{a} for the negation of aa in, e.g., Hilbert & Ackermann (1928). We write aba\lor b for a¯b¯¯\underline{\underline{a}\cap\underline{b}}, the disjunction of aa and bb. This expression is classically equivalent to the expression ¬(¬a¬b)\lnot(\lnot a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\lnot b); however, the former expression remains serviceable in the intuitionistic case.444Thus \cap can replace Heyting’s connective \lor. For bab\rightarrow a we may write aba\leftarrow b, and for (ab)(ab)(a\rightarrow b)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(a\leftarrow b) we write a=ba=b.

We call expressions which may be put in the form aba\rightarrow b555Given an object ξ\xi, that is, we may find objects α\alpha and β\beta such that ξ=(αβ)\xi=(\alpha\rightarrow\beta). formulas; an object that is not a formula is said to be concrete. Expressions of the form a=ba=b may also be called equations or equalities in addition to being called formulas, as usual. In order to improve readability and respect normal usage, we will usually use the synonymous symbol \leftrightarrow instead of == when we denote an equality between formulas. We may also similarly write aba\land b in place of aba\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b in the case when aa and bb are both formulas. In its role, this technique is comparable to the practice (common, for example, in physics) of using modified parentheses (brackets, braces) in lengthy expressions to highlight syntactic units. It is also comparable in its role to the system of dots devised by Peano and featured in the Principia Mathematica. Here, because our work is not too involved, and in order to gain familiarity with the machine language (so to speak), we shall use \land sparingly.

We must pause to clarify the aforementioned notion of logical substitution. First, an instance of an object β\beta in the syntax of an object α\alpha may be one of two kinds: if β\beta is, in this instance, a subobject of α\alpha, it is said to appear in α\alpha, and otherwise to merely occur in α\alpha. We denote the object α\alpha modified so that every appearance (note) of the variable xx is replaced by the object β\beta by α[β/x]\alpha[\beta/x]. We refer to xx as the target variable, and to β\beta as the substituend of the substitution. The reader can verify that this is a well-defined object for its inputs α\alpha, β\beta, and xx, and that no expression which occurs within (``is guarded by") an index, variable, or constant stop is modified by any substitution. If there is no risk of confusion, in place of α[β/x]\alpha[\beta/x], we may sometimes write simply α(β)\alpha(\beta).

Table 2: Postulates of Section 2 (objects α\alpha, β\beta, variables a,b,c,d,xa,b,c,d,x).
{oldtabular}

c {oldtabular}cc

R1. Exchange.         R2. Substitution.

ααββ\begin{array}[b]{l}\,\alpha\\ \,\alpha\rightarrow\beta\\ \,\beta\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxx}\\ \hline\cr\end{array}$}\end{array}         αα[β/x]\begin{array}[b]{l}\\ \,\alpha\\ \,\alpha[\beta/x]\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxx\;}\\ \hline\cr\end{array}$}\end{array}

{oldtabular}

l A1. ((ca)(c(ab)))(cb).((c\rightarrow a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(c\rightarrow(a\rightarrow b)))\rightarrow(c\rightarrow b).

A2. ((d(ab))(d(bc)))(d(ac)).((d\rightarrow(a\rightarrow b))\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(d\rightarrow(b\rightarrow c)))\rightarrow(d\rightarrow(a\rightarrow c)).

A3. ((ca)(cb))(c(ab)).((c\rightarrow a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(c\rightarrow b))\rightarrow(c\rightarrow(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)).

{oldtabular}

l A4a. (ab)a.(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)\rightarrow a.

A4b. (ab)b.(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)\rightarrow b.       {oldtabular}l A5a. a(ab).a\rightarrow(a\cap b).

A5b. b(ab).b\rightarrow(a\cap b).

A6. ((ac)(bc))((ab)c).((a\rightarrow c)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(b\rightarrow c))\rightarrow((a\cap b)\rightarrow c).

{oldtabular}

[t]l A7. aa.a\rightarrow a.

A8. aØ.a\rightarrow\text{{\O }}.

A9. a.\bot\rightarrow a.      {oldtabular}[t]l A10. (Øa)((Øb)(ab)))(\text{{\O }}\rightarrow a)\rightarrow((\text{{\O }}\rightarrow b)\rightarrow(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)))

A11. (ab)(Ø(ab)).(a\rightarrow b)\rightarrow(\text{{\O }}\rightarrow(a\rightarrow b)).

A12. (((ab)))(ab).(((a\rightarrow b)\rightarrow\bot)\rightarrow\bot)\rightarrow(a\rightarrow b).

A formal proof Π=(α1,,αn)\Pi=(\alpha_{1},\ldots,\alpha_{n}) is a finite numbered list of substitutions, exchanges, and axioms. These steps of the proof are usually written in descending order down the page along with a citation or attribution column (as in the formal proof of Lemma 2.1 below). If it is possible for an expression to appear in a formal proof, we say that the expression may be obtained or deduced, and we call (α1,,αn)(\alpha_{1},\ldots,\alpha_{n}) a formal proof of αn\alpha_{n}. Rules R1-2 and axioms A1-12 are presented in Table 2. They are styled postulates; a postulate is a rule or an axiom, according to Kleene (1952). They are based primarily on the systems of Heyting (1930) (see Mancosu (1998, pp. 311-27)) and Kleene, op. cit., but see also Hilbert (1927), and Hilbert & Ackermann (1928, pp. 55ff.). The system is what is known in proof theory as a substitution Frege system (see, e.g., Pudlák (1998)). The rule of Combination:

αβαβ\begin{array}[]{c}\alpha\\ \beta\\ \alpha\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\beta\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xx\,}\\ \hline\cr\end{array}$}\end{array}

is immediately derived from A10 and A11, for formulas α\alpha and β\beta.

2.2 Basics

Formal proofs are greatly simplified through the use of assumptions. An arbitrary object (normally a formula) α\alpha is taken or assumed at a given step, and subsequent rules may combine relying on the assumption as if it were proven, until the assumption is discharged. At the discharge step the conclusion of the immediately preceding step is rewritten under the hypothesis of the assumption. A proof with assumptions is not complete until all assumptions have been discharged. That the use of assumptions does not allow any new formulas to be proven is a result known as the deduction theorem, which we will now prove. We first require a lemma.

Lemma 1.

a¯(ba).\underline{a}\rightarrow(b\rightarrow a).

{proof*}
1.aØ1.\;a\rightarrow\text{{\O }} A8
2.(Øa)Ø2.\;(\text{{\O }}\rightarrow a)\rightarrow\text{{\O }} (1)[(Øa)/a](1)[(\text{{\O }}\rightarrow a)/a]
3.aØ3.\;a\rightarrow\text{{\O }} A8
4.bØ4.\;b\rightarrow\text{{\O }} (3)[b/a](3)[b/a]
5.(ab)(Ø(ab))5.\;(a\rightarrow b)\rightarrow(\text{{\O }}\rightarrow(a\rightarrow b)) A11
6.(aØ)(Ø(aØ))6.\;(a\rightarrow\text{{\O }})\rightarrow(\text{{\O }}\rightarrow(a\rightarrow\text{{\O }})) (5)[Ø/b](5)[\text{{\O }}/b]
7.(bØ)(Ø(bØ))7.\;(b\rightarrow\text{{\O }})\rightarrow(\text{{\O }}\rightarrow(b\rightarrow\text{{\O }})) (6)[b/a](6)[b/a]
8.Ø(bØ)8.\;\text{{\O }}\rightarrow(b\rightarrow\text{{\O }}) Exch. 4, 7
9.(ab)((bc)(ac))9.\;(a\rightarrow b)\rightarrow((b\rightarrow c)\rightarrow(a\rightarrow c)) Lemma (A4a, A4b,
Comb., A2, Exch.)
10.((Øa)b)((bc)((Øa)c))10.\;((\text{{\O }}\rightarrow a)\rightarrow b)\rightarrow((b\rightarrow c)\rightarrow((\text{{\O }}\rightarrow a)\rightarrow c)) (9)[Øa/a](9)[\text{{\O }}\rightarrow a/a]
11.((Øa)Ø)((Øc)((Øa)c))11.\;((\text{{\O }}\rightarrow a)\rightarrow\text{{\O }})\rightarrow((\text{{\O }}\rightarrow c)\rightarrow((\text{{\O }}\rightarrow a)\rightarrow c)) (10)[Ø/b](10)[\text{{\O }}/b]
12.((Øa)Ø)12.\;((\text{{\O }}\rightarrow a)\rightarrow\text{{\O }})
    ((Ø(bØ))((Øa)(bØ)))\rightarrow((\text{{\O }}\rightarrow(b\rightarrow\text{{\O }}))\rightarrow((\text{{\O }}\rightarrow a)\rightarrow(b\rightarrow\text{{\O }}))) (11)[(bØ)/c](11)[(b\rightarrow\text{{\O }})/c]
13.(Ø(bØ))((Øa)(bØ))13.\;(\text{{\O }}\rightarrow(b\rightarrow\text{{\O }}))\rightarrow((\text{{\O }}\rightarrow a)\rightarrow(b\rightarrow\text{{\O }})) Exch. 2, 12
14.(Øa)(bØ)14.\;(\text{{\O }}\rightarrow a)\rightarrow(b\rightarrow\text{{\O }}) Exch. 8, 13
15.aa15.\;a\rightarrow a A7
16.(Øa)(Øa)16.\;(\text{{\O }}\rightarrow a)\rightarrow(\text{{\O }}\rightarrow a) (15)[(Øa)/a](15)[(\text{{\O }}\rightarrow a)/a]
17.((d(ab)(d(bc)))(d(ac))17.\;((d\rightarrow(a\rightarrow b)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(d\rightarrow(b\rightarrow c)))\rightarrow(d\rightarrow(a\rightarrow c)) A2
18.18.\; (17)[(Øa)/c](17)[(\text{{\O }}\rightarrow a)/c]
19.19.\; (18)[Ø/b](18)[\text{{\O }}/b]
20.20.\; (19)[b/a](19)[b/a]
21.21.\; (20)[a/c](20)[a/c]
22.((Øa)(bØ))((Øa)(Øa))22.\;((\text{{\O }}\rightarrow a)\rightarrow(b\rightarrow\text{{\O }}))\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}((\text{{\O }}\rightarrow a)\rightarrow(\text{{\O }}\rightarrow a)) Comb. 14, 16
23.(Øa)(ba)23.\;(\text{{\O }}\rightarrow a)\rightarrow(b\rightarrow a) Exch. 22, 21

Because of A11, and because A1 implies that (Øa)a(\text{{\O }}\rightarrow a)\rightarrow a, the equation

a¯=a\underline{a}=a (1)

is characteristic of formulas. Thus, Lemma 2.1 becomes, for all formulas aa and bb, formula (1) of Frege (1879),

a(ba).a\rightarrow(b\rightarrow a). (2)

The connection between (2) and assumption mechanisms was noticed early on (see van Heijenoort (1967, p. 465)). However, it does not obtain when aa and bb range over all the objects of our system, as can easily be checked by letting aa in (2) be concrete.666Provided that one has already crossed over to the intended conceptual view, this point is obvious. We are considering logic and set theory as they might be conceived in a single common space or worldview in which a commitment exists to a general notion of objecthood encompassing both language and its referents. Consider, then, the content of (2) set-theoretically, e.g., in the case when aa and bb are ordinary objects (desks, chairs, etc.) and the relation is one of ordinary containment. In our system concrete objects live in the midrange or midst of the lattice. Viewed logically, the lattice encompasses an infinite array of truth values; a concrete expression—a desk, a number, a geometric figure, etc.—is an object whose truth value is neither true, nor false, but rather a thing unique to the object itself. As the formalism is developed in the sequel, the load on the symbol \rightarrow will be decreased by the introduction of the symbols ,,\supset,\subset,\in etc.—though the distinction between these symbols and the family ,,\rightarrow,\leftarrow, etc. shall solely involve properties of their arguments. The formal unity between the concepts of implication and containment, whose naturality we propose in this section and whose fundamental importance to the system should become clear as we proceed, rests undisturbed throughout (echoing in certain respects an early system of Quine, see footnote, p. 18).

Theorem 2 (Deduction Theorem).

A proof with assumptions in which all variables appearing in assumptions are held constant, and in which all assumptions are assumed and discharged in last-in-first-out order, may be converted to a formal proof.

{proof*}

Let (α1,,αn)(\alpha_{1},\ldots,\alpha_{n}) be such a proof with assumptions. Consider a sequence of steps (ai,,(a_{i},\ldots, ai+j)a_{i+j}), 1i<i+jn1\leq i<i+j\leq n, from an assumption step, αi\alpha_{i}, to the first subsequent discharge step, αi+j\alpha_{i+j}, inclusive, containing no intervening assumption steps. Replacing the entire string with the new string (αiαi+1,αiαi+2,,αiαi+j1)(\alpha_{i}\rightarrow\alpha_{i+1},\alpha_{i}\rightarrow\alpha_{i+2},\ldots,\alpha_{i}\rightarrow\alpha_{i+j-1}), it is clear that every prior use of the rule of Exchange can be carried out under the lingering hypothesis αi\alpha_{i} by inserting the Axiom A1, instantiating it as desired,777As a minor additional hypothesis, one must let variables appearing in assumptions be distinct from those appearing in the axioms. and applying Combination and Exchange. Every Substitution can be executed under the assumption since, by hypothesis, all variables are held constant. All of the axioms, finally, can themselves be written under the assumption of αi\alpha_{i} by using Lemma 2.1 and Exchange, and we obtain a new formal proof. If it has assumptions, we iterate the process described. If we continue until we have treated all the assumptions in turn, we obtain a formal proof of αn\alpha_{n}.

The stipulation that assumed variables be held constant is perfunctory at this level; there is no motivation to vary assumptions until quantifiers (and/or classes) are introduced. All naturally arising assumptions may in fact be discharged in any order, but one must exercise a bit of caution, since

a(bc)(ab)ca¯(bc),a\rightarrow(b\rightarrow c)\quad\rightarrow\quad(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)\rightarrow c\quad\rightarrow\quad\underline{a}\rightarrow(b\rightarrow c), (3)

and these expressions are not equivalent in general,888If your wallet is empty, I will give you my coat. But I am not inclined, otherwise, to give you my coat for your wallet. but are seen to be equivalent when aa is a formula. A few details remain which we omit showing that we may generalize these principles, and therefore make and manipulate assumptions naturally, provided that they are formulas.

The following relations easily obtain: conjunctions and intersections of formulas are formulas. The converses of axioms 3, 6, 11, and 12. Of union/conjunction/ federation ()(\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\,): associativity, commutativity, idempotence, identity Ø, zero element \bot, as well as the relations

(ab)\displaystyle(a\rightarrow b) (acbc)(isotonicity)\displaystyle\rightarrow(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c\rightarrow b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c)\quad\text{(isotonicity)} (4)
(a=b)\displaystyle(a=b) (ac=bc)(substitutivity)\displaystyle\rightarrow(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c=b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c)\quad\text{(substitutivity)} (5)

Of intersection \cap: associativity, commutativity, idempotence, identity \bot, zero Ø, and isotonicity (implying substitutivity999The term isotone (and the dual antitone) is found, e.g., in Birkhoff (1940); substitutivity appears, e.g., in Quine (1951); Bernays (1958). One may show that antecedents are antitone, that double antecedents are again isotone, etc.). The same laws therefore hold also for disjunction \lor. Finally, both of the distributive laws hold, yielding a distributive lattice of objects under federation and intersection, and a Boolean ring of formulas (under conjunction and disjunction, or equivalently, federation/union and intersection) isomorphic to 2\mathbb{Z}_{2}.

2.3 Remarks

All students of modern mathematics are well accustomed to the assignment of formulas and equations to some value or thing that is in some sense the storing ground of true formulas, and another which is the home and identifier of false formulas. It is a truth which is all the more remarkable given the advanced technology, both industrial and intellectual, available to scientists and philosophers who investigate these instruments in our own age, that whether they are taken to be abstract forms, letters, or empirical entities, they have up to now resisted all efforts to make them rigorously understood and canonized. We do not contest these efforts, which lie well beyond our ken. We follow formalists and intuitionists in distinguishing mathematical discourse from ordinary linguistic discourse, and place before ourselves only the very limited task of calibrating and arranging adequate grounds for the former, laying aside questions surrounding the latter. We make no claims here concerning the concepts of truth and falsehood as they arise in ordinary language.

Table 3 illustrates the syntactic medium in which we are developing logic and set theory. This arrangement may surprise some readers, many of whom may never have paused before over the evocative relations which naturally arise in proof theory (such as ØΔ\text{{\O }}\vdash\Delta, denoting that the set of formulas Δ\Delta follows under no assumptions). The author readily agrees that the arrangement is conceptually surprising (it stirs the imagination even after long familiarity); however, having weighed the matter carefully, he finds arguments which speak for the system as it has been defined, and to condemn it, neither any falsifier, nor such a drastic departure from the norm in foundations. Logic itself is surprising and elusive—it receives these qualities naturally and instantly from the concepts it traffics in. Assigning those formulas which are formally true to the formal constant Ø identifying empty and trivial structures is no more remarkable than assigning them to a letter \top, ensconced in a detached logical space. The formulas, in either case, remain what they are; we encounter them, and they inspire us, in the same way. The author believes that it is rare indeed to find an a priori basis for criticism of variations in formal orientation which is invulnerable to a challenge on a priori grounds. One must be practical and, as Hilbert implored us to do, measure theories by their fruits. Given the inconclusive suggestion of patterns in the structure of formal and informal thought, we should be neglecting the due diligence of science to leave either possibility unexamined, in case one orientation should offer distinct utilitarian advantages over its alternative. The author cautions the reader to carefully entertain a wide-ranging field of evidence before adopting the Einsteinian critique that the system is simple, yes—and too simple. So far, we have seen a number of ways in which the present system recommends itself: a compact set of basic symbols, accessible definitions, the felicitous distinction between the formula and the concrete term, as well as the convenience of a unified intuitive presentation. What remains to be discovered is, of course, unknown to the author. In the next section, the reader shall have the opportunity to consider the effects of the novel approach upon the fundamental devices of set theory and model theory.

Table 3: Propositional calculus in a set-theoretical lattice.
{oldtabular}

cccc   Truth Table   Name Generalization Formula

TTTT TRUE empty/trivial set Ø

FTTT NAND {a,b}\{a,b\} is a covering of \bot ¬(ab)\lnot(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)

TFTT containment aba\rightarrow b

TTFT containment bab\rightarrow a

TTTF OR at least one set is empty/trivial a¯b¯¯\underline{\underline{a}\cap\underline{b}}

FFTT {a}\{a\} covers \bot ¬a\lnot a

FTFT {b}\{b\} covers \bot ¬b\lnot b

FTTF XOR disjointness ab¯(Øab)\underline{a\cap b}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(\text{{\O }}\neq a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)

TFTF bb itself bb

TTFF aa itself aa

FFFT NOR  all sets are nonempty/nontrivial    ¬a¯b¯¯\lnot\underline{\underline{a}\cap\underline{b}}

FFTF strict containment ba¬(ab)b\rightarrow a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\lnot(a\rightarrow b)

FTFF strict containment ab¬(ba)a\rightarrow b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\lnot(b\rightarrow a)

TFFF AND union aba\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b

FFFF    FALSE      absurd/undefined \bot

{oldtabular}

c \begin{array}[]{c}\phantom{xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx}\\ \hline\cr\end{array}

Aside from these remarks (and with respectful apologies to those in the philosophical domain), the author does not wish to suggest here a single philosophical vantage from which the formal system might best be viewed. According to strict formalism, the system is the article found on the page, dissociated from all semantic interpretations, and whatever metaphysics it might inspire is irrelevant to its use in practice. The formalist, as Carnap once said, is ``among the most pacifistic of mankind," able to partake in the methods of all others, and the author has, in principle, no disagreements with him. We believe that one could incorporate the system into intuitionism by introducing, to begin with, the symbol Ø as the empty hand—the moment before the unfolding construction of proof—while introducing the symbol \bot to represent an impossible object of reflection, or what is called in Gödel (1944) ``the notion of `something' in an unrestricted sense". One may then proceed to define the system of Heyting.101010The intuitionistic case is interesting in its details, but requires additional work; we intend to discuss it elsewhere. The opposite assignment conflicts with intuitionism, for consider: truth is obviously constructive, while the domain in which construction occurs cannot be entirely so. With the early work of Wittgenstein, there is a shared goal of a comprehensive Weltanschauung for analytic thought, and with sustained effort along these lines there might be developed a framework bearing out some of his views on the limitations of language and the world of coherent ideas. The relationship of these ideas to Platonism and its many later manifestations is also quite striking. The contradiction-as-unity (Ø=\text{{\O }}=\bot) has some curious and remarkable parallels to concepts explored by several ancient Greek philosophers including Plato, later figures such as, for example, Plotinus, Cusanus, and Pascal, and modern figures such as Cantor, Brouwer, Whitehead, and many others. We believe considerable opportunity remains to discuss the system in light of other leading schools. The relationship with Cantor's thought is further touched upon in §4.2, where we shall see that the novel approach adds a note of prescience to one of his more rarely-cited ideas.

3 Predicate Calculus and Set Theoretical Abstraction

To first summarily recount our progress in §2: to an unformalized intuitive space we have added the presence of objects, allowed that these objects may be collected into multiplicities (also considered objects), defined the intersections of such multiplicities (again, objects), and developed a system generating formulas (also objects) describing the containment relationship, if any, holding between two given objects. To this world of things, we wish to apply set-theoretical abstraction. This demands that we make a few incomplete remarks, before we begin, on the general question of how best to introduce sets of elements satisfying a stated condition—abstracts, or what most authors call classes—to a system like ours.

3.1 Individuals

In an early attempt to develop the class rigorously for analytic purposes, Peano (1889) refers to the formation of classes as inversion, apparently on the notion that if a formula were taken as a mapping into the truth values 0 and 1, the class derived from the formula would be its kernel. Prima facie, this seems quite natural, perhaps even obvious, but it runs into difficulties having to do with the fact that the domain of definition of such a function remains unclarified. In the particular case of such a system as the one at hand, there is no reason, at present, to expect that any given multiplicity should have a well-defined content. Since it is far from the case that all properties are heritable, this presents quite a grave problem.

Recall that we have chosen to build out from the notion of containment—the principle that objects are formed by fixing an interior. Let an object aa have ``content" if there is bØ,ab\neq\text{{\O }},\neq a such that aba\rightarrow b. What do we find ``inside" the set of objects xx such that xx has content? If the set can be formed, the transitivity of containment becomes destabilizing should there exist objects without content. Is such a predicate formally unstatable? It would appear not. In all nontrivial cases some objects indeed have content; if they all have content, the set is fine. With that allowance, however, we seem pushed into the boundless possibilities of a hopelessly ill-founded world. One way to deal with all of this trouble is to suppose that set-theoretical abstraction doesn't care about objects with content in this sense. If it cares only about objects that lack content, we begin to arrive at the notion of a discrete atomic point as a specialized multiplicity, and a theory of classes based upon it.

This approach, a retreat into the safety of wellfoundedness, bears a certain resonance, even if one were hypothetically prepared to dismiss the general tendency of modern set theoretical systems as an historic trend. It remains today an obvious and unavoidable fact (one that no one has ever been quite sure what to make of) that the world of mathematical models is very different, qualitatively speaking, from the world of ``real" objects. The many-pronged efforts during the early twentieth century to clarify the foundations of mathematics (Hilbert, whose school was perhaps preeminent among these, constantly emphasized the need to divest symbols of their ``meaning" in order to establish perfect rigor) has since been met with the runaway success of modern set theory, logic, and computer science, while the outcome of the movement to clear away the ambiguities of language (led by the logical positivists, significists, and others) remains unclear, having been frustrated by a wave of objections which crested during the postwar period and from which, it seems, it is not likely to recover. This suggests that the venture to clarify ordinary language and the venture to clarify mathematics are (at least in some significant respects) distinguishable. Instead of confronting or attempting to repair this dichotomy—and by more tenuous extension, that between mathematical models and empirically observed systems—we would, by enhancing the theoretical significance of the above-mentioned distinction, accept it into our system. In the endeavor to characterize things which are ``one" (robustly ``single" objects, or individuals) contradictions that have beleaguered the study of classes—the paradoxes named after Russell, Curry, Berry, Mirimanoff, Grelling, and so on—shall serve as so many tests to objectively measure candidate definitions. Should they be resolved, we shall have evidence that a satisfactory abstract definition of the intuitive notion of a point or individual is in reach.

Our approach will be based loosely upon the notion of primality in a unique factorization domain.111111This structure is discussed in most algebra textbooks, e.g. Hungerford (1974, §3.3).

Definition 3.1.

We say that an object is single, 𝔰(a)\operatorname{\mathfrak{s}}(a), if (a¯a)((ab)(b¯a=b)).(\underline{a}\neq a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}((a\rightarrow b)\rightarrow(\underline{b}\lor a=b)). That is, aa is single if it is a concrete object with no proper concrete content. A single object is an individual.

Proposition 3.

a=b𝔰(a)=𝔰(b)a=b\rightarrow\operatorname{\mathfrak{s}}(a)=\operatorname{\mathfrak{s}}(b). ¬𝔰(Ø)\lnot\operatorname{\mathfrak{s}}(\text{{\O }}). ¬𝔰()\lnot\operatorname{\mathfrak{s}}(\bot).

Proposition 4.

Individuals are irreducible and prime with respect to ()(\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\,). In symbols:

𝔰(ab)(a¯b¯a=b),\displaystyle\operatorname{\mathfrak{s}}(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)\rightarrow(\underline{a}\lor\underline{b}\lor a=b), (6)
𝔰(a)(((bc)a)((ba)(ca))).\displaystyle\operatorname{\mathfrak{s}}(a)\rightarrow(((b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c)\rightarrow a)\rightarrow((b\rightarrow a)\lor(c\rightarrow a))). (7)
{proof*}

Let 𝔰(ab)\operatorname{\mathfrak{s}}(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b). (ab)a(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)\rightarrow a, so a=Øa=(ab)a=\text{{\O }}\lor a=(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b). If aØa\neq\text{{\O }}, then a=(ab)a=(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b); use (ab)b(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)\rightarrow b. Now let 𝔰(a)\operatorname{\mathfrak{s}}(a) and bcab\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c\rightarrow a. Suppose ¬(ba)\lnot(b\rightarrow a). aaba\rightarrow a\cap b. So ab=aab=Øa\cap b=a\lor a\cap b=\text{{\O }}. Since ¬(ba)\lnot(b\rightarrow a), ab=Øa\cap b=\text{{\O }}. So (bc)a=(ba)(ca)=ca(b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c)\cap a=(b\cap a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(c\cap a)=c\cap a. Since aØa\neq\text{{\O }} and (bc)a(b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c)\rightarrow a, (bc)aØ(b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c)\cap a\neq\text{{\O }}, so caØc\cap a\neq\text{{\O }}, so ca=ac\cap a=a. So c(ca)ac\rightarrow(c\cap a)\rightarrow a.

Definition 3.2.

We say that aa is an element or member of an object bb, aba\in b, if 𝔰(a)\operatorname{\mathfrak{s}}(a) and bab\rightarrow a.

Proposition 5.

abca\in b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}c implies aba\in b or aca\in c.

Proposition 6.

\bot is not an element of any object, and nor is Ø.

Let the reader note, we do not see in \bot the set-theoretical universe—it is a still higher maximum, but it serves only as a kind of universal wastepaper basket. We will later encounter an object, the ``universe" VV; it, too, will fail to be an individual in any nontrivial system. Here, the system echoes the notion, advanced in systems like NBG and MK, that the universe is a proper class, and therefore not an element of any set.

3.2 Classes

Because we could ask meaningful questions about classes that have not been constructed—for example, whether (under some formal criteria) they may be constructed or not—we could introduce these objects initially as expressions (say, of the symbolic form {xϕ}\{x\mid\phi\}) which dwell in the lattice of objects but have ab initio no containment relationships with any previously defined concrete expression, nor with one another. Then we might ensure, proceeding abstractly, that the classes so understood behave formally as though they were objects already in some sense defined—possibly by construction. Where any kind of construction is possible, all conceivable ways of access must converge, and conclude in perfect agreement. In other words, while the introduction of classes may succeed in generating new descriptions or formulations, it must fail to generate new constructions. If the class represents a kind of bridge across a sea of complexity, we wish to ensure that it begins and ends somewhere. We would not build bridges on a planet with no land, nor if it meant giving up the land for bridges.

These ideas will resurface several times as we proceed, particularly in §4.2. Before turning to the formal development, let us consider one more rough analogy. Consider the sequence 1,1+1,1+(1+1),1,1+1,1+(1+1),\ldots and the sequence 2,22,222,2,2^{2},2^{2^{2}},\ldots The syntax of both expressions is very basic, and one should agree that, whatever these symbols mean relative to one another or to other things, if one of the two sequences could be constructed, then the other could as well. Most people would also understand the latter expressions to be defined in terms of the former expressions—however, they need not be. If they were not, then in order to establish equivalence with the ordinary definitions, one would demand that a proof could be supplied showing that we may understand one to be a subsequence of the other, or that a member of the latter sequence takes on a unique location somewhere in the former sequence. Even if one were not exactly sure where in the former sequence, say, the twentieth element of the latter sequence could be correctly placed, one could easily stand about discussing the twentieth element of the latter sequence, more or less as we are doing right now, and proof in hand, refer to it, think of it, regard it, etc., as an element of the former sequence simpliciter.

Now in place of the natural numbers of this example, consider the signs and the formation rules introduced in §2. We add classes to the system by adding the sign \mid to the list of basic signs, and to the formation rules of Table 1 the formation rule

αβ(α˙β[α/˙α])\alpha\,\beta\longmapsto(\alpha{}^{\prime}\mspace{1.0mu}^{\dot{}}\mid\beta[\alpha{}^{\prime}\mspace{1.0mu}^{\dot{}}/\alpha{}^{\prime}\mspace{1.0mu}])

We shall axiomatize these new objects in such a way that in every respect they act precisely as though they were in fact members of the original group of well-formed objects—precisely those members that we think of when we imagine the expression factored with respect to ()(\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\,).

We define the class or abstract of variable xx and object α\alpha (sometimes called the extension of α\alpha with respect to xx), denoted (x˙α[x˙/x])(x^{\dot{}}\mid\alpha[x^{\dot{}}/x]) or usually more simply with the abbreviation {αx}\{{}^{x}\,\alpha\}, to be a product of xx, the class index, and α\alpha, the condition of the class. (Though α\alpha may range freely over objects, there is little interest in the class if α\alpha is not a formula.) The modified—N.B.—condition α[x˙/x]\alpha[x^{\dot{}}/x] and the modified class index x˙x^{\dot{}} are subobjects of the class. We refer to an appearance of an index x˙x^{\dot{}} as a bound occurence of xx, as usual, or say that xx appears as an index. Thus when xx simply appears, we sometimes say that xx appears free. This allows us in general to do all our thinking in terms of variables, some of which are ``guarded" from substitution processes.

3.3 Sets

Having discussed the individual and the class, we can introduce the remaining rules and axioms of this section, and begin to develop the rudiments of a theory of sets. We begin by introducing the simplest class of all:

Definition 3.3.

V={Øx}V=\{{}^{x}\,\text{{\O }}\}.

This class may be referred to as the single-element or type-0 universe, or the universal class. The formula Ø which appears as the condition can be thought of as expressing that there are no conditions upon members of the class.

If we select a multiplicity from the object just defined, then we have a set.

Definition 3.4.

An object aa is a set if VaV\rightarrow a. If aa is a set and bab\rightarrow a, we say that aa is contained in bb, and write aba\subset b. If bb is also a set, we say that aa is a subset of bb.

Unless specified otherwise, the letters ϕ\phi and ψ\psi shall always denote formulas, and we shall use the variables A,B,CA,B,C, etc. to denote sets.121212As was stated in §2, all variables are syntactically generic: α\alpha{}^{\prime}\mspace{1.0mu} for some object α\alpha. We shall not make use of typed variables anywhere in the sequel. We define all of the close relatives of \subset and \in in the expected way.131313Because the relation ¬(aA)\lnot(a\in A) is rarely used, it might be best to let aAa\notin A denote that 𝔰(a)\operatorname{\mathfrak{s}}(a) and aAa\not\subset A. However, we shall preserve the usual convention regarding slashed relations here. Schematically, this gives:

\displaystyle\text{\raisebox{-1.1pt}{$\varsubsetneq$}}\quad\ni\quad\subset\quad =\displaystyle=\quad\supset\quad\in\quad\text{\raisebox{-1.1pt}{$\varsupsetneq$}}
∌\displaystyle\not\ni\quad\not\subset\quad \displaystyle\neq\quad\not\supset\quad\notin\quad

In expressions these relations will always bind more tightly than the ones introduced in §2, with the exception of ==, unless it is denoted by the character \leftrightarrow.

Passing for a moment to the context of the integers under ordinary multiplication (the model of primality that everyone knows best): we do not regard the integer 1 as a prime, since it is the multiplicative identity, and since it is what mathematicians call a unit. In a certain sense, it permeates multiplicative space—it is like the solvent in which prime number combinations are dissolved. Similarly, the basic idea of axiom schema A13 (a version of Cantor's comprehension principle, Frege's rule V, Quine's 202\ast 202, and Zermelo's principle of Aussonderong) is to regard Ø as a different kind of object, distinct from ordinary single objects. It falls outside the domain of things which are targeted for membership in classes—a subset of any set, yet not a member of any set. Granting Ø such special status provides a kind of valve for the release of pressure, which checks the advance of Russell's paradox.

Axiom. (Schema) For every well-formed formula ϕ\phi and variable xx,

a{ϕx}(𝔰(a)ϕ(a)).a\in\{{}^{x}\,\phi\}\leftrightarrow(\operatorname{\mathfrak{s}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\phi(a)). (8)

Comprehension schema A13 is presented in a convenient deployable form in (8), while the formal axiom schema is displayed in Table 4. Note, as mentioned above, that since variables, when they are bound, are encased within an index stop, there is no way that a variable appearing in aa can be inadvertently bound in deriving ϕ(a)\phi(a) from aa's membership in (x˙ϕ(x˙))(x^{\dot{}}\mid\phi(x^{\dot{}})).

There is one rule, R3, governing the classes introduced in this section. It states that under a group of assumptions γ\gamma which do not depend on a given variable xx (i.e., in which the variable xx does not appear), if an object α\alpha contains an object β\beta, then the class {αx}\{{}^{x}\,\alpha\} is contained in the class {βx}\{{}^{x}\,\beta\}. We point out the immediate derived rule

αβ{αx}{βx}.\begin{array}[]{c}\alpha\rightarrow\beta\\ \,\{{}^{x}\,\alpha\}\subset\{{}^{x}\,\beta\}.\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxx.}\\ \hline\cr\end{array}$}\end{array}\vskip-6.0pt (9)

The use of \subset here is justified, since for any object ϕ\phi,

{ϕx}V\{{}^{x}\,\phi\}\subset V

and R3 itself may be written

γ(αβ)γ{αx}{βx}.\begin{array}[]{c}\gamma\rightarrow(\alpha\rightarrow\beta)\\ \,\gamma\rightarrow\{{}^{x}\,\alpha\}\subset\{{}^{x}\,\beta\}.\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxxxxxx}\\ \hline\cr\end{array}$}\end{array} (10)
Table 4: Postulates of Section 3 (variables xx, aa, bb, objects α,β,γ\alpha,\beta,\gamma)
{oldtabular}

c {oldtabular}c R3. xx does not appear in γ\gamma.

γ(αβ)γ((x˙β[x˙/x])(x˙α[x˙/x]))\begin{array}[]{c}\gamma\rightarrow(\alpha\rightarrow\beta)\\ \gamma\rightarrow((x^{\dot{}}\mid\beta[x^{\dot{}}/x])\rightarrow(x^{\dot{}}\mid\alpha[x^{\dot{}}/x]))\rule{0.0pt}{10.0pt}\\ \raisebox{24.0pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxxxxxxxxxxxxxxxxxxx.}\\ \hline\cr\end{array}$}\end{array}

{oldtabular}

l A13 (Comprehension). objects α\alpha, variables xx.

a(x˙α[x˙/x])(𝔰(a)α[a/x])a\in(x^{\dot{}}\mid\alpha[x^{\dot{}}/x])\leftrightarrow(\operatorname{\mathfrak{s}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\alpha[a/x])

A14. variables xx.

A14a.  a(x˙x˙a)a\rightarrow(x^{\dot{}}\mid x^{\dot{}}\in a)

A14b.  (Va)((x˙x˙a)a)(V\rightarrow a)\rightarrow((x^{\dot{}}\mid x^{\dot{}}\in a)\rightarrow a)

Since these are the last of our rules, we may now note that the deduction theorem continues to hold, permitting the normal introduction and discharge of assumptions. We refer to a variable which appears in an open assumption as a named variable. This vocabulary is chosen naturally, because such a variable is thought of as the ``name" of something, and this name is retained as the steps of the proof flow along. We say that a proof is mindful of names if assumptions open during the occurrence of a substitution which has a named variable as target variable appear, upon discharge, with the named variable modified appropriately, and if the rule R3 is used only when the variable to be converted to an index does not appear in any open assumption. We will assume, from now on, that assumptions are formulas unless stated otherwise.

Theorem 7 (Deduction theorem).

Every formal proof with assumptions mindful of names can be converted into a formal proof.

{proof*}

We present the verifications to be inserted in Theorem 2.2; the setup is identical. Clearly, the rule R2 can be used to target named variables, provided the proof is mindful of names. We have to show that instances of R3 can be carried out under the lingering hypothesis αi\alpha_{i}. Let an occurence of R3 (using variable xx) act on step αk\alpha_{k}, i<k<i+ji<k<i+j; then αk\alpha_{k} is of the form γ(δϵ)\gamma\rightarrow(\delta\rightarrow\epsilon), where γ,δ,ϵ\gamma,\delta,\epsilon are objects, and xx does not appear in γ\gamma. Hence (αiαk)=(αi(γ(δϵ)))(\alpha_{i}\rightarrow\alpha_{k})=(\alpha_{i}\rightarrow(\gamma\rightarrow(\delta\rightarrow\epsilon))). We use this expression to derive a proof (requiring only the postulates of §2, for which the deduction theorem has been verified) of (αiγ)(δϵ)(\alpha_{i}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\gamma)\rightarrow(\delta\rightarrow\epsilon), apply R3 (noting that the proof is mindful of names), then insert a proof of (αi(γ{δx}{ϵx}))(\alpha_{i}\rightarrow(\gamma\rightarrow\{{}^{x}\,\delta\}\subset\{{}^{x}\,\epsilon\})), allowing the proof to proceed under the assumption αi\alpha_{i}. A simple induction argument shows that an arbitrary number of assumptions can be folded up in the same way, to make way for an occurence of R3.141414In fact, because our system is not a resource logic, the open assumptions could simply be treated as an abiding unordered multiplicity. This completes the proof.

In certain respects, we expect all sets to behave like deliberately constructed sets. Hence, one thinks of the set as having a unique decomposition, or of being able to break every set up into its single elements—to ``spread them out" on a universal table top. Since we have defined sets in terms of a class, VV, this will be an acceptable notion for us, since all sets will be classifiable, well-founded objects. At the present stage of development, however, there might exist sets which have subsets, but which do not have any elements. Suppose, that is, that there exists an object ξ\xi such that

𝔰(x)xξ,\operatorname{\mathfrak{s}}(x)\rightarrow x\notin\xi,

and now suppose that ξ\xi is a set. With such an object present, {xx(12)}\{{}^{x}\,x\in(1\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}2)\} could be put equal to 12ξ1\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}2\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\xi without contradiction. We exclude the possibility of such objects being defined with a version of the principle of extensionality: that every set ``is determined by its elements" (Zermelo, 1908). The important property is

a(b˙b˙a),a\rightarrow(b^{\dot{}}\mid b^{\dot{}}\in a), (11)

which asserts that for any given set AA, the class {xxA}\{{}^{x}\,x\in A\} is at worst a proper subset of AA. For definable sets, the two expressions are equivalent, and any finite union of single objects a1ana_{1}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\ldots\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}a_{n} is definable (Theorem 3.14). For undefinable sets, however, we must enforce a rather imponderable discipline, and thus we include axiom A14b of Table 4.

Axiom. (Schema) For variables xx, {xxa}a,\{{}^{x}\,x\in a\}\subset a, and aVa{xxa}a\subset V\rightarrow a\subset\{{}^{x}\,x\in a\}.

We can now begin our development in earnest.

3.4 Extensionality

The following theorem is easy to prove in the adopted line of development, and is of central importance to what follows.

Theorem 8 (Extensionality).

aV({xxa}{xxb}ab).a\subset V\rightarrow(\{{}^{x}\,x\in a\}\subset\{{}^{x}\,x\in b\}\rightarrow a\subset b).

{proof*}

A straightforward application of \rightarrow-transitivity, relying on A14.

We can now examine some well-known antinomies. The Russell set is s={xxx}s=\{{}^{x}\,x\notin x\}. A single object xx will belong to ss if and only if ¬(xx)\lnot(x\in x) by A13, but xxx\in x for all such xx. Therefore ss contains no individuals. Therefore (by extensionality) ss is empty.151515Though we avoid Russell’s paradox, Cantor’s theorem on the size of the power set may still be proven (see §4.1),and the theories of computability and decidability appear unaffected, as one might expect. A discussion of several similar paradoxically-self-containing sets is found in Beth (1965, pp. 483-4). Extensions of formulas

y1ynxy1y1y2yn1ynynx\mathop{\forall y_{1}\ldots y_{n}\,}\quad x\notin y_{1}\lor y_{1}\notin y_{2}\lor\ldots\lor y_{n-1}\notin y_{n}\lor y_{n}\notin x

(which Quine calls ``epsilon cycles") are under our definitions all equal to Ø. It is not possible under our definitions for any paradox to arise from the relation ss(𝔰(s)ϕ(s))s\in s\leftrightarrow(\operatorname{\mathfrak{s}}(s)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\phi(s)), where ϕ(s)\phi(s) implies sss\notin s. More generally, we have:

Theorem 9.

{x}=Ø.\{{}^{x}\,\bot\}=\text{{\O }}.

Curry's paradox arises from considering the extension (with respect to xx) of the formula xxϕx\in x\rightarrow\phi, where ϕ\phi may be any formula or object. In naive set theory there is a paradox: let the set be called cc. By naive comprehension, cc(ccϕ(c))c\in c\leftrightarrow(c\in c\rightarrow\phi(c)). Therefore it is true that ccϕ(c)c\in c\rightarrow\phi(c). So ccc\in c. So ϕ(c)\phi(c), which might be any statement at all, is true—all objects, in other words, are trivial. The A13 comprehension schema also implies that ccϕ(c)c\in c\rightarrow\phi(c). However, it does not then follow from A13 that ccc\in c, only that ccc\in c if cc is single. It then falls to ϕ\phi to determine whether in fact 𝔰(c)\operatorname{\mathfrak{s}}(c) or not.161616In either case, the class is essentially a description which does not affect the underlying ontology. If cc is the sole individual xx such that ϕ(x)\phi(x)—or in other words, if there is but one individual aa such that ϕ(a)\phi(a)—then cc is single and ccc\in c, whereas if there are at least two such individuals, or no such individual, it is formally proven that cc is not single, and therefore that ccc\notin c.

The recurring theme of these solutions is that paradoxes are avoided by countenancing multiplicities which are not individuals. We will see this theme again when we examine other paradoxes in §4. Next, we verify several basic theorems.

Theorem 10.

The class of objects equal to aa is aa itself if aa is single. The class of all elements of a class is equal to the class itself. Alphabetic variants are equal; reindexing is permitted. Briefly:

𝔰(a)({xxa}=a),\displaystyle\operatorname{\mathfrak{s}}(a)\rightarrow(\{{}^{x}\,x\in a\}=a), (12)
{xx{ϕy}}={ϕy},\displaystyle\{{}^{x}\,x\in\{{}^{y}\,\phi\}\}=\{{}^{y}\,\phi\}, (13)
{ϕx}={ϕy(y)}.\displaystyle\{{}^{x}\,\phi\}=\{{}^{y}\,\phi(y)\}. (14)

The proofs of these formulas offer no difficulty. One simply applies the general method of Theorem 3.12, as in, e.g., the following proof:

Theorem 11.

For all formulas ϕ\phi and ψ\psi,

{ϕxψ}={ϕx}{ψx},\displaystyle\{{}^{x}\,\phi\lor\psi\}=\{{}^{x}\,\phi\}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\{{}^{x}\,\psi\}, (15)
{ϕxψ}={ϕx}{ψx}.\displaystyle\{{}^{x}\,\phi\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\psi\}=\{{}^{x}\,\phi\}\cap\{{}^{x}\,\psi\}. (16)
{proof*}

Let y{ϕxψ}y\in\{{}^{x}\,\phi\lor\psi\} (name yy, and the variables appearing in ϕ\phi and ψ\psi). Then by comprehension (ϕψ)[y/x](\phi\lor\psi)[y/x], this becomes ϕ[y/x]ψ[y/x]\phi[y/x]\lor\psi[y/x]. Dividing into cases (axiom A6), one concludes y({ϕx}{ψx})y\in(\{{}^{x}\,\phi\}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\{{}^{x}\,\psi\}).171717The parentheses required here invite the use of \cup (defined below). Discharge the assumption (there are no longer any assumptions, thus no longer any named variables) and apply extensionality. The other direction is similar. For (16), use A6.

Theorem 12.

If AA and BB are sets, AB={xxAxB}A\cap B=\{{}^{x}\,x\in A\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}x\in B\}.

{proof*}

As in Theorem 3.15. We note that this theorem, like many mathematical theorems, requires a formal stipulation on the involved variables that is often omitted; in this instance, the variable xx cannot appear free in the expressions substituted for AA and BB. For us, however, this restriction is made automatically due to the index-variable distinction drawn at the syntactic level, and the formal statement of the theorem is here given in its entirety.

Since we still lack the ability to form sets of sets, we are still unable to define many familiar devices of set theory, such as the sum set and power set. However, we may define AB={xxAxB}A\setminus B=\{{}^{x}\,x\in A\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}x\notin B\}, the difference of AA and BB, AΔB={(xAxB)x(xBxA)}A\mspace{-2.0mu}\mathbin{{}_{\scriptscriptstyle{\Delta}}}\mspace{-2.0mu}B=\{{}^{x}\,(x\in A\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}x\notin B)\lor(x\in B\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}x\notin A)\}, the symmetric difference or distinction of AA and BB, and A={xxA}\mathop{{}^{\scriptscriptstyle{\sim}}\mspace{-3.0mu}}\!A=\{{}^{x}\,x\notin A\}, the complement of AA. Under our definitions, the union ABA\cup B of two sets AA and BB is once again their federation, the fundamental product ABA\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}B defined for any two objects AA and BB. Elementary formulas involving these can now be proven in the usual way, for example: {ϕx}={¬xϕ}\mathop{{}^{\scriptscriptstyle{\sim}}\mspace{-3.0mu}}\{{}^{x}\,\phi\}=\{{}^{x}\,\lnot\phi\}, etc.

3.5 Quantification

The principle of Extensionality, first proposed by Frege in 1893, and later by Zermelo in 1908, has been subsequently used to provide a rigorous and intuitively natural basis for theorems throughout mathematics. We will use it to ground quantification.181818 During the period just prior to the appearance of his work on NF, Quine (1937) experimented with a similar approach, and proved its equivalence to a system of Tarski. In the system, to the study of which Quine devoted two papers, the simple theory of types is developed using only the connectives of inclusion (containment) and abstraction. For example, the condional ϕψ\phi\rightarrow\psi is an abbreviation of the formula {xϕ}{xψ}\{x\mid\phi\}\subset\{x\mid\psi\}, where xx does not appear in ϕ\phi or in ψ\psi. Since comprehension is one of the basic assumptions in topos theory and related areas, quantification by means of comprehension is commonly used in category-theoretical developments of deductive systems; see, e.g., Bell (1988, p. 70ff). The author is indebted to Andreas Blass for pointing out this and other references in topos theory.

Definition 3.5.

Given a variable xx and formula ϕ\phi, we write x(ϕ)\mathop{\forall x\,}(\phi) if V={ϕx}V=\{{}^{x}\,\phi\}, and x(ϕ)\mathop{\exists x\,}(\phi) if Ø{ϕx}\text{{\O }}\neq\{{}^{x}\,\phi\}.191919We will suppress parentheses whenever they are unnecessary. Our existential quantifier is the double negation of the intuitionistic one. We might also define symbols for the expressions 𝔰({ϕx})\operatorname{\mathfrak{s}}(\{{}^{x}\,\phi\}): there exists a unique individual xx such that ϕ\phi. Since ϕ(Ø)\phi(\text{{\O }}) does not imply that xϕ\mathop{\exists x\,}\phi, we might write, say, Øxϕ\mathop{\exists^{\scriptscriptstyle{\text{{\O }}}}x\;}\phi to mean (xϕ)ϕ(Ø)(\mathop{\exists x\,}\phi)\lor\phi(\text{{\O }}). This quantifier is surprisingly rare, however; in mathematical practice, discovery of the existence of structure elements or nonempty sets is the overwhelming norm. We make note of the presence, in the ordinary language of logic and mathematics, of the words such that coupled with the words there exists and the class of, which suggests—if weakly—a connection between the grammar of the quantifier and the class construction. But what then, about the incantation “for all xx, ϕ(x)\phi(x)”? If that hypothesis were true, shouldn’t such that appear here as well? Let {αxϕ}\{{}^{x}\,\alpha\mid\phi\} denote the class {xy(yαϕ)}\{{}^{y}\,\mathop{\exists x\,}(y\in\alpha\land\phi)\} (where we insert the needed substitutions should yy appear in α\alpha or in ϕ\phi). Then the universal and existential quantifiers over the scope ϕ\phi may be written: Ø={ϕxØ},\displaystyle\text{{\O }}=\{{}^{x}\,\phi\mid\text{{\O }}\}, Ø{ϕx}.\displaystyle\text{{\O }}\neq\{{}^{x}\,\phi\}. In this form, the scope appears in the condition only in the case of the existential quantifier, prompting such that ϕ\phi, while the condition of the universal quantifier, being empty, is subject to elision; one need not remark the constraint such that…well, nothing. Some might argue that since their scope is confined to VV, these quantifiers establish a limited or even artificial system of quantification; others, however, might take the view that quantification over objects at the general level has already been defined to satisfaction by the universal rule of substitution R2.

Before continuing any further, we need to extend R3 to the case when xx may appear in γ\gamma:

Theorem 13.

({γx}{αxβ})(V{γx}{αx}{βx}).(\{{}^{x}\,\gamma\}\subset\{{}^{x}\,\alpha\rightarrow\beta\})\rightarrow(V\subset\{{}^{x}\,\gamma\}\rightarrow\{{}^{x}\,\alpha\}\subset\{{}^{x}\,\beta\}).

{proof*}

Let {γx}{αxβ}\{{}^{x}\,\gamma\}\subset\{{}^{x}\,\alpha\rightarrow\beta\}, V{γx}V\subset\{{}^{x}\,\gamma\}. Then V={αxβ}V=\{{}^{x}\,\alpha\rightarrow\beta\}. Let a{αx}a\in\{{}^{x}\,\alpha\}. Then α(a)β(a)\alpha(a)\rightarrow\beta(a), and since α(a)\alpha(a), β(a)\beta(a) as well, so a{βx}a\in\{{}^{x}\,\beta\}. So {αx}{βx}\{{}^{x}\,\alpha\}\subset\{{}^{x}\,\beta\} by extensionality.

Once objects may have classes as subobjects, the management of assumptions becomes something of a puzzle. Theorem 3.18 presents a way around this obstacle: it directly follows from it that, given any objects γ,α,β\gamma,\alpha,\beta, if γ(αβ)\gamma\rightarrow(\alpha\rightarrow\beta) has been proven, then one immediately has (xγ){αx}{βx}(\mathop{\forall x\,}\gamma)\rightarrow\{{}^{x}\,\alpha\}\subset\{{}^{x}\,\beta\}. Thus it becomes allowed to convert a named variable to an index, provided the assumptions in which the variable is named are discharged with the introduction of a universal quantifier. In practice this all falls naturally into place most of the time, but it can incite minor technical fallacies.

For example, given formulas ϕ\phi and ψ\psi, the statement ϕψ\phi\rightarrow\psi should imply that {ϕx}{ψx}\{{}^{x}\,\phi\}\subset\{{}^{x}\,\psi\}, but as writing (ϕψ){ϕx}{ψx}(\phi\rightarrow\psi)\rightarrow\{{}^{x}\,\phi\}\subset\{{}^{x}\,\psi\} produces absurdities by substituting for xx, this will not be a provable formula unless xx does not appear in ϕ\phi or in ψ\psi. On the other hand, if for every individual xx, ϕψ\phi\rightarrow\psi (a weaker assumption in general), the expected class relation obtains, as has just been shown.

Kleene (1952) investigates this phenomenon under the rubric variation (p. 102ff). There, one works with a system in which proofs with open assumptions are considered finished formal proofs. In order to keep his assumptions in order, he stipulates that variables which appear in assumptions are ``held constant" until they are substituted for. One is free to do so; however, they are then added to a list appended to the proof-theoretical symbol \vdash which indicates that they have been varied, and that further substitutions are no longer possible. His approach may been influenced by the Peanian pasigraphy (by way of the Principia, perhaps), in which subscripts of the quantified variables are attached to certain logical connectives in place of quantifier operators. Having obtained a structural analog to Kleene's approach, we are prepared to proceed.

We next show how to pass from the universal to the particular, and vice versa.

Theorem 14 (Generalization).

a(𝔰(a)ϕ(a))xϕ.\mathop{\forall a\,}(\operatorname{\mathfrak{s}}(a)\rightarrow\phi(a))\rightarrow\mathop{\forall x\,}{\phi}.

{proof*}

Let 𝔰(a)ϕ(a)\operatorname{\mathfrak{s}}(a)\rightarrow\phi(a). Suppose aVa\in V; then ϕ(a)\phi(a), so a{ϕx}a\in\{{}^{x}\,\phi\} by comprehension, and therefore aVa{ϕx}a\in V\rightarrow a\in\{{}^{x}\,\phi\}. So by extensionality V{ϕx}V\subset\{{}^{x}\,\phi\}.

Theorem 15 (Instantiation).

(𝔰(a)xϕ)ϕ(a).(\operatorname{\mathfrak{s}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\mathop{\forall x\,}\phi)\rightarrow\phi(a).

{proof*}

1. Let 𝔰(a)xϕ.\operatorname{\mathfrak{s}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\mathop{\forall x\,}\phi. 2. 𝔰(a).\operatorname{\mathfrak{s}}(a). 3. 𝔰(a)Ø.\operatorname{\mathfrak{s}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\text{{\O }}. 4. 𝔰(a)Ø[a/x].\operatorname{\mathfrak{s}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\text{{\O }}[a/x]. 5. aV.a\in V. 6. a{ϕx}.a\in\{{}^{x}\,\phi\}. 7. ϕ(a).\phi(a).

In the universal case, we have given a backbone to the quasi-reasoning which echoes in the corridors of mathematics departments, where proofs of universal statements proceed by ``choosing" elements ``arbitrarily", ``freely", sometimes even ``at random". Existential statements prompt arguments of a similar style: the mathematician argues that he may ``pick one" and insert it into his algebra; if he is subsequently able to derive an expression that does not depend upon which element he picked, he concludes that the statement is true, usually without accounting for the fate of the assumption naming his choice. This form of reasoning, too, can be filled in with the tools we have defined. We first pause momentarily to verify the inverse direction:

Theorem 16 (\exists-Introduction).

a{ϕx}xϕ.a\in\{{}^{x}\,\phi\}\rightarrow\mathop{\exists x\,}\phi.

{proof*}

Argue by contradiction, by concluding from a=Øa=\text{{\O }} that 𝔰(Ø)\operatorname{\mathfrak{s}}(\text{{\O }}).

Theorem 17 (\exists-Elimination).

Let aa appear nowhere in formulas ϕ\phi and ψ\psi, and let xx appear nowhere in ψ\psi. Then if, a\mathop{\forall a\,},

((xϕ)a{ϕx})ψ,((\mathop{\exists x\,}\phi)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}a\in\{{}^{x}\,\phi\})\rightarrow\psi,

then

xϕψ.\mathop{\exists x\,}\phi\rightarrow\psi.

We might state the sense of this statement by saying that ``if it has already been proven that xϕϕ(a)ψ\mathop{\exists x\,}\phi\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\phi(a)\rightarrow\psi"—here, aa is simply an independent variable—``then by using R3 and R1 with the theorem as stated, it is proven that xϕψ\mathop{\exists x\,}\phi\rightarrow\psi."202020In Kleene’s system, one has xϕ,ϕ(a)axϕψ\mathop{\exists x\,}\phi,\phi(a)\vdash^{a}\mathop{\exists x\,}\phi\rightarrow\psi. Notice that (xϕϕ(a)ψ)(xϕψ)(\mathop{\exists x\,}\phi\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\phi(a)\rightarrow\psi)\rightarrow(\mathop{\exists x\,}\phi\rightarrow\psi) is false; let 1V1\in V, a=1a=1, ϕ=(x=1)\phi=(x=1), ψ=(11)\psi=(1\neq 1).

{proof*}

We need the following.

Lemma 18.

Let aa appear nowhere in ϕ\phi. Then ({ϕx}Ø)ϕ(\{{}^{x}\,\phi\}\neq\text{{\O }})\rightarrow\phi.

To prove the lemma, let {ϕx}Ø\{{}^{x}\,\phi\}\neq\text{{\O }}, and let ¬ϕ\lnot\phi. Suppose (choosing a new variable) that b{ϕx}b\in\{{}^{x}\,\phi\}. Then ϕ[b/x]\phi[b/x], that is, ϕ\phi. Therefore \bot. Therefore [b/x]\bot[b/x], and b{x}b\in\{{}^{x}\,\bot\}. So {ϕx}{x}\{{}^{x}\,\phi\}\subset\{{}^{x}\,\bot\} by extensionality. So {ϕx}=Ø\{{}^{x}\,\phi\}=\text{{\O }}, contrary to hypothesis. Discharging gives ¬¬ϕ\lnot\lnot\phi, or ϕ\phi by the law of excluded middle.

Let ϕ\phi and ψ\psi be defined as stated above.

1.Let ((xϕ)a{ϕx})ψ.1.\,\text{Let }((\mathop{\exists x\,}\phi)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}a\in\{{}^{x}\,\phi\})\rightarrow\psi.
2.Let xϕ.2.\,\text{Let }\mathop{\exists x\,}\phi.
3.Let a{ϕx}.3.\,\text{Let }a\in\{{}^{x}\,\phi\}.
4.ψ4.\,\psi
5.ψ[a/x]5.\,\psi[a/x]
6.a{ψx}6.\,a\in\{{}^{x}\,\psi\}
7.(3) a{ϕx}a{ψx}7.\,\text{(3) }a\in\{{}^{x}\,\phi\}\rightarrow a\in\{{}^{x}\,\psi\}
8.{ϕx}{ψx}8.\,\{{}^{x}\,\phi\}\subset\{{}^{x}\,\psi\} (bind aa)
9.{ψx}=Ø{ϕx}=Ø9.\,\{{}^{x}\,\psi\}=\text{{\O }}\rightarrow\{{}^{x}\,\phi\}=\text{{\O }}
10.({ψx}=Ø)10.\,(\{{}^{x}\,\psi\}=\text{{\O }})\rightarrow\bot
11.ψ11.\,\psi (using Lemma 3.23)
12.12.\, (2), (1) Q.E.D.

Many readers ought to suspect that a full system of first-order quantification over individuals has been established by these theorems. We can proceed to develop the identities of classical predicate calculus directly from them, or we can rely instead on the following theorem:

Theorem 19.

Any theorem provable in classical predicate calculus using the Bernays postulates212121According to Hilbert & Ackermann (1928, ch. 3), the postulates defining the universal and existential quantifier in Hilbert-type systems are due to Bernays. is provable under our postulates A1-14 and R1-3.

{proof*}

Although the reader may know the four postulates in question well, we should probably review them. In Kleene's notation, they consist of two axioms: xP(x)P(t)\mathop{\forall x\,}P(x)\rightarrow P(t), and P(t)xP(x)P(t)\rightarrow\mathop{\exists x\,}P(x), where tt is an individual free for xx in P(x)P(x), along with two rules:

CP(t)CxP(x)P(t)CxP(x)C\begin{array}[]{c}C\rightarrow P(t)\\ C\rightarrow\mathop{\forall x\,}P(x)\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxx}\\ \hline\cr\end{array}$}\end{array}\qquad\qquad\begin{array}[]{c}P(t)\rightarrow C\\ \mathop{\exists x\,}P(x)\rightarrow C\rule{0.0pt}{10.0pt}\\ \raisebox{24.5pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxx}\\ \hline\cr\end{array}$}\end{array}\vskip-12.0pt

where the variable xx does not appear free in the formula CC.

Theorems 3.20 and 3.21 are analogues of Bernays' axioms, with the proviso that tt be an individual converted to one that tt be single.222222If we refer to single objects as individuals, then the vocabulary is coincidentally a match. The rules may be translated into our system as the formula

(x(ϕψ(x))(ϕxψ))(x(ψ(x)ϕ)(x(ψ)ϕ))(\mathop{\forall x\,}(\phi\rightarrow\psi(x))\rightarrow(\phi\rightarrow\mathop{\forall x\,}\psi))\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(\mathop{\forall x\,}(\psi(x)\rightarrow\phi)\rightarrow(\mathop{\exists x\,}(\psi)\rightarrow\phi))

where ϕ\phi and ψ\psi are formulas and xx does not appear in ϕ\phi. This formula may be formally proven using the methods already presented.

3.6 Elementary Model Theory, Consistency

We will conclude this section with a short investigation of the metamathematics of the system we have introduced, following Kleene (1952); Simmons (2000). Because the new approach to the core syntax of logic adopted in this paper demands some revision of the details of an ordinary consistency proof (while allowing much to stand unchanged), we will proceed in a thorough style, assuming that the reader familiar with logic will easily pick out the relevant changes.

We begin by introducing revisions to the system that are convenient for the work of this section, since we are only interested in the classical case. All our previous work goes through essentially unchanged.232323Though in order to undertake this step we must assume primality (Proposition 3.7), a principle which as an axiom is unnatural in the author’s opinion. Trying to smooth the wrinkle out causes it to pop up elsewhere; in the balance it seems A5-6 are wanted after all.

  1. 1.

    Cross out axioms A5, A6, and formation rule αβ(αβ)\alpha\,\beta\longmapsto(\alpha\cap\beta).

  2. 2.

    Add abbreviations: aba\lor b for ¬(¬a¬b)\lnot(\lnot a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\lnot b), and aba\cap b for {xxaxb}\{{}^{x}\,x\in a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}x\in b\}.

We (momentarily) call an object Π\Pi a string if Π\Pi may be generated from the following formation rules:

Ai\displaystyle A_{i} Ai\displaystyle\longmapsto A_{i} for axiom Ai\displaystyle\text{ for axiom }A_{i}
Παx\displaystyle\Pi\,\alpha\,x Π(α/x)\displaystyle\longmapsto\Pi(\alpha/x) for strings Π\Pi, objects α\alpha, and variables xx
ΠΣ\displaystyle\Pi\,\Sigma (ΠΣ)\displaystyle\longmapsto(\Pi\cdot\Sigma) for strings Π\Pi and Σ\Sigma
Πx\displaystyle\Pi\,x Π[x]\displaystyle\longmapsto\Pi[x] for strings Π\Pi and variables xx

A string Π\Pi is subject to the following substitutive one-step reductions:

α(β/x)\displaystyle\alpha(\beta/x) α[β/x]\displaystyle\vdash\alpha[\beta/x]
(α(αβ))\displaystyle(\alpha\cdot(\alpha\rightarrow\beta)) β\displaystyle\vdash\beta
(γ(αβ))[x]\displaystyle(\gamma\rightarrow(\alpha\rightarrow\beta))[x] (γ({αx}{βx}))\displaystyle\vdash(\gamma\rightarrow(\{{}^{x}\,\alpha\}\subset\{{}^{x}\,\beta\}))

provided that xx does not appear in γ\gamma. If a string may be reduced to a well-formed object, we call the string a well-formed proof string or simply a proof string. We say that an object α\alpha is provable if there is a well-formed proof string Π\Pi and a many-step reduction reducing Π\Pi to α\alpha. In this case, we may sometimes call α\alpha the theorem of Π\Pi, and Π\Pi a proof of α\alpha, and denote this relation Πα\Pi\vdash\alpha. The length of proof string Π\Pi is defined as usual.

Let kk be a nonnegative integer. A kk-structure k\mathscr{M}_{k} is a metamathematically collected group (hereafter: a set) (M,P)(M,P), where MM, the constants of k\mathscr{M}_{k}, is a set Ø,1,2,,2k1,\text{{\O }},1,2,\ldots,2^{k}-1,\bot. The tables PP of k\mathscr{M}_{k} are sets of ordered triplets, often summarized (informally) in the form of written tables. The tables are a generalization of the notion of a truth table. It is convenient to correlate the values found in the tables of a kk-structure to the decomposition of each concrete constant into its base 2 expansion. For example, in any kk-structure with k4k\geq 4, (27,18,Ø)(27,18,\text{{\O }}) will be a member of the \rightarrow-table, indicating that 271827\rightarrow 18 is valid (in the sense defined shortly). This is because 27=24+23+2+127=2^{4}+2^{3}+2+1, and 18=24+218=2^{4}+2 (see Table 5). The individuals of each kk-structure are thus the kk constants 1,2,4,,2k11,2,4,\ldots,2^{k-1}. We also note that the value of an object αβ\alpha\rightarrow\beta should be, according to PP, an object having weight 0 (as defined below).

Valuation of closed objects. Let β\beta be a subobject of α\alpha. The class degree of an instance of β\beta with respect to α\alpha is the number of classes in α\alpha having β\beta in its scope, i.e., the number of classes escaped on a walk out of the object beginning at β\beta. If an object (formula or concrete) contains no variables, it is said to be closed. The weight of a closed object α\alpha, Wt(α)\operatorname{Wt}(\alpha), is defined recursively as follows, letting α,β\alpha,\beta be formal objects: Wt(Ø)=0,Wt(α^)=Wt({αx})=Wt(α)\operatorname{Wt}(\text{{\O }})=0,\operatorname{Wt}(\alpha\mspace{2.0mu}\hat{}\,)=\operatorname{Wt}(\{{}^{x}\,\alpha\})=\operatorname{Wt}(\alpha), Wt(αβ)=max(Wt(α)Wt(β))\operatorname{Wt}(\alpha\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\beta)=\max(\operatorname{Wt}(\alpha)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\operatorname{Wt}(\beta)), Wt(αβ)=max(Wt(α)Wt(β))+1\operatorname{Wt}(\alpha\rightarrow\beta)=\max(\operatorname{Wt}(\alpha)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\operatorname{Wt}(\beta))+1. This is a ``metamathematical" function, since its argument is not substitutive. The procedure we shall adopt for the evaluation or valuation of a closed object α\alpha with respect to a kk-structure k\mathscr{M}_{k} proceeds in two stages.

Table 5: kk-Structures

k=0:ØØØ{oldtabular}cØØØØØ{oldtabular}ck=2:Ø123ØØ123111332232333333{oldtabular}cØ123ØØ1ØØ2ØØ3ØØØØØØØØØ{oldtabular}ck=3:Ø1234567ØØ1234567111335677223236767333337777445674567555775577667676767777777777{oldtabular}cEtc.\begin{array}[]{c}\begin{array}[t]{c}k=0:\\ \begin{array}[]{c|cc}\,\;\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}&\text{{\O }}&\bot\\ \text{{\O }}&\text{{\O }}&\bot\\ \bot&\bot&\bot\end{array}\\ \oldtabular{c}\raisebox{33.4pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxx..}\\ \hline\cr\end{array}$}\\ \begin{array}[]{c|cc}\rightarrow&\text{{\O }}&\bot\\ \text{{\O }}&\text{{\O }}&\bot\\ \bot&\text{{\O }}&\text{{\O }}\end{array}\\ \oldtabular{c}\raisebox{33.4pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxx..}\\ \hline\cr\end{array}$}\end{array}\hskip-6.0pt\begin{array}[t]{c}k=2:\\ \begin{array}[]{c|ccccc}\,\;\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}&\text{{\O }}&1&2&3&\bot\\ \text{{\O }}&\text{{\O }}&1&2&3&\bot\\ 1&1&1&3&3&\bot\\ 2&2&3&2&3&\bot\\ 3&3&3&3&3&\bot\\ \bot&\bot&\bot&\bot&\bot&\bot\end{array}\\ \oldtabular{c}\raisebox{66.4pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxxxxxxxxx.}\\ \hline\cr\end{array}$}\\ \begin{array}[]{c|ccccc}\rightarrow&\text{{\O }}&1&2&3&\bot\\ \text{{\O }}&\text{{\O }}&\bot&\bot&\bot&\bot\\ 1&\text{{\O }}&\text{{\O }}&\bot&\bot&\bot\\ 2&\text{{\O }}&\bot&\text{{\O }}&\bot&\bot\\ 3&\text{{\O }}&\text{{\O }}&\text{{\O }}&\text{{\O }}&\bot\\ \bot&\text{{\O }}&\text{{\O }}&\text{{\O }}&\text{{\O }}&\text{{\O }}\end{array}\\ \oldtabular{c}\raisebox{66.4pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxxxxxxxxx.}\\ \hline\cr\end{array}$}\end{array}\hskip-6.0pt\begin{array}[t]{c}k=3:\\ \begin{array}[]{c|ccccccccc}\,\;\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}&\text{{\O }}&1&2&3&4&5&6&7&\bot\\ \text{{\O }}&\text{{\O }}&1&2&3&4&5&6&7&\bot\\ 1&1&1&3&3&5&6&7&7&\bot\\ 2&2&3&2&3&6&7&6&7&\bot\\ 3&3&3&3&3&7&7&7&7&\bot\\ 4&4&5&6&7&4&5&6&7&\bot\\ 5&5&5&7&7&5&5&7&7&\bot\\ 6&6&7&6&7&6&7&6&7&\bot\\ 7&7&7&7&7&7&7&7&7&\bot\\ \bot&\bot&\bot&\bot&\bot&\bot&\bot&\bot&\bot&\bot\end{array}\\ \oldtabular{c}\raisebox{110.4pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxxxxxxxxxxxxxxxxxxxxxx.}\\ \hline\cr\end{array}$}\\ \\ \text{Etc.}\end{array}\end{array}

Direct evaluation: If every subobject in α\alpha has class degree 0, then the object is class-free. Class-free objects are evaluated as follows: replace all subobjects having weight 1 with objects having weight 0 by using the tables of k\mathscr{M}_{k}. This shifts the weights of all weighted subobjects by 1. If necessary, evaluate all multiplicities of weight 0. Iteration of this procedure eventually yields an object CC of weight 0.

Declassification: If not, then all classes π\pi with maximum class degree appearing in α\alpha are replaced systematically with a subset of the constants of MM which happen to be single, namely 1,2,4,,2k11,2,4,\ldots,2^{k-1}. For each ii, 0i<k0\leq i<k, 2i2^{i} is tested in order with the condition απ\alpha_{\pi} of π\pi (which may be directly evaluated). If evaluating απ(2i)\alpha_{\pi}(2^{i}) yields Ø, the constant is included in the multiplicity. If απ(2i)\alpha_{\pi}(2^{i}) takes any other value, then Ø is included in the multiplicity instead. When this is finished, the multiplicity is itself evaluated, and the class π\pi is replaced. After treating all such classes π\pi, this step yields an object with strictly smaller class degree. The entire algorithm is then repeated until α\alpha itself is class-free and may be evaluated. In the case that this procedure yields the product CC given α\alpha, we say that α\alpha takes the value CC in k\mathscr{M}_{k}, or that the object is CC-valid in kk, and write α𝑘C\smash{\alpha\underset{k}{\Rightarrow}C}. If the object is Ø-valid, we may say simply that it is valid.

Objects that are not closed. If an object α\alpha in which variables appear may be closed by an assignment of its variables to the constants of MM (including Ø and \bot) such that it takes the value CC, then we say that the object can take the value CC in k\mathscr{M}_{k}, or that it is CC-satisfiable in kk. If the object is Ø-satisfiable, we may say simply that it is satisfiable. If under every such assignment α\alpha can take no other value except CC, we again say that α\alpha is CC-valid.

Statability. An object α\alpha is statable in k\mathscr{M}_{k} or statable in kk if no constants except those of k\mathscr{M}_{k} appear in α\alpha. α\alpha is provable in kk, or k\mathscr{M}_{k}-provable, if there is a proof string Π\Pi that is statable in kk (in the obvious extension of the just-stated definition of statability) such that Πα\Pi\vdash\alpha. Note that if an object is provable in k\mathscr{M}_{k}, then it is necessarily statable in k\mathscr{M}_{k}.

We refer to 0\mathscr{M}_{0} as the empty structure. In accordance with ordinary usage, we refer to class-free formulas statable in the empty structure as propositions. The system restricted to class free formulas (including propositions) statable in the empty structure we call the extended propositional calculus. The system of objects (including arbitrary formulas and concrete objects) statable in the empty structure we call the extended predicate calculus.242424Predicates, or predicate letters, can be modeled by variables. They are functions which take the value Ø or \bot under their inputs, normally taken to be individuals.

Theorem 20.

Any formal object provable in kk is valid in k+lk+l, for every l0l\geq 0. In particular, all theorems of the extended predicate calculus are valid in any kk-structure.

{proof*}

We only sketch the proof. Fix kk, and let Π\Pi be statable in kk. We wish to show that l0,\mathop{\forall l\,}\geq 0, Πk+lØ\smash{\Pi\underset{k+l}{\Rightarrow}\text{{\O }}}, that is,

Πredasgk+ldeck+levk+l=Ø,\Pi^{\operatorname{red}\,\operatorname{asg}_{k+l}\,\operatorname{dec}_{k+l}\,\operatorname{ev}_{k+l}}=\text{{\O }},

where the superscripted operators represent (in order) the reduction of Π\Pi to its theorem (say, α\alpha), the division of α\alpha into a set αi\alpha_{i} distinguished by different possible assignments of the variables of α\alpha to constants of k+l\mathscr{M}_{k+l}, the declassification of the αi\alpha_{i} in k+lk+l, and finally, the direct evaluation of the declassified αi\alpha_{i} using the tables of k+l\mathscr{M}_{k+l}. We proceed by induction on the length ss of Π\Pi.

First, if s=0s=0, then Π=Ai\Pi=A_{i} is an axiom, statable in the empty structure and thus statable in kk. Choosing ll arbitrarily, we can verify that in this case Π\Pi is valid in ll. For example, to verify A1, ((ca)(c(ab)))(cb)((c\rightarrow a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(c\rightarrow(a\rightarrow b)))\rightarrow(c\rightarrow b), note that if the cc is assigned to \bot the resulting formula is valid. Suppose c=2c1++2cnc=2^{c_{1}}+\ldots+2^{c_{n}}, with 0ci<l0\leq c_{i}<l, and check that in all possible cases, the formula is valid. This type of inference can be applied to produce verifications of axioms 1, 2, 3, 4, 7, 8, 9, 10, 11, 12, 13, and 14. We omit the remaining details of this step.

Next, assume the claim holds for every proof of length <s<s, and let Π\Pi statable in kk have length ss. There are three cases to consider. Let l0l\geq 0.

Case 1. Let Π=Σ[x]\Pi=\Sigma[x]. First, assume that the theorem of Π\Pi, Πred\Pi^{\operatorname{red}}, is closed with the possible exception of the variable xx. We know that the theorem of Σ\Sigma, Σred\Sigma^{\operatorname{red}}, is of the form γ(δϵ)\gamma\rightarrow(\delta\rightarrow\epsilon), for objects γ,δ,ϵ\gamma,\delta,\epsilon, and we know that (γ(δϵ))(\gamma\rightarrow(\delta\rightarrow\epsilon)) is valid in k+lk+l (hereafter: (γ(δϵ))Ø(\gamma\rightarrow(\delta\rightarrow\epsilon))\Rightarrow\text{{\O }}). Since (δϵ)Ø(\delta\rightarrow\epsilon)\Rightarrow\text{{\O }} (for we may assume that γØ\gamma\Rightarrow\text{{\O }}, since otherwise the valuation of Πred\Pi^{\operatorname{red}} is immediately Ø), δ(2ci)Ø\delta(2^{c_{i}})\Rightarrow\text{{\O }} implies that ϵ(2ci)Ø\epsilon(2^{c_{i}})\Rightarrow\text{{\O }} as well, for any individual 2ci2^{c_{i}} in k+l\mathscr{M}_{k+l}. So if {δx}2c1++2cn\{{}^{x}\,\delta\}\Rightarrow 2^{c_{1}}+\ldots+2^{c_{n}} (as in the previous step), then for every ii, ϵ(2ci)Ø\epsilon(2^{c_{i}})\Rightarrow\text{{\O }}, so {δx}{ϵx}Ø\{{}^{x}\,\delta\}\subset\{{}^{x}\,\epsilon\}\Rightarrow\text{{\O }}, showing that Π\Pi is valid. If Πred\Pi^{\operatorname{red}} is not closed, the same argument can be repeated on each variable assignment.

Case 2. Let Π=Σ(β/x)\Pi=\Sigma(\beta/x). The validity of Σred\Sigma^{\operatorname{red}} implies that assigning any value to the variable xx, if it appears in Σred\Sigma^{\operatorname{red}}, the resulting object is valid. So for any possible value CC of β\beta, Σred[x/C]\Sigma^{\operatorname{red}}[x/C] will clearly also be valid. Since declassification and valuation both proceed from the leaves to the root of the syntax tree, this proves that Πred\Pi^{\operatorname{red}} is valid also in this case.

Case 3. Let Π=Σ1Σ2\Pi=\Sigma_{1}\cdot\Sigma_{2}. If Σ1red=α\Sigma_{1}^{\operatorname{red}}=\alpha, then Σ2red\Sigma_{2}^{\operatorname{red}} is of the form αβ\alpha\rightarrow\beta, for object β\beta. Since αØ\alpha\Rightarrow\text{{\O }}, β\beta must necessarily also be valid; otherwise Σ2red\Sigma_{2}^{\operatorname{red}} will not be valid, contrary to hypothesis.

Corollary 21.

No proof string statable in kk proves any constant of k\mathscr{M}_{k} except Ø. In particular, \bot is not provable in kk, for every kk.

In the author's opinion, our approach to the distinction between the syntax and semantics of model theory—treating it as a distinction in method instead of a distinction in kind—enhances the model theoretic approach in many respects, simplifying proofs and allowing ideas to more easily commingle with those areas of mathematics where a syntactic/semantic distinction is not implemented. Other routes to consistency, e.g., via the subformula property, cut elimination, the extended Hauptsatz, and the reduction of the Hilbert-type system to the Gentzen-type systems to which these theorems apply, requires a great deal of technical verification to carry out. Consistency is more easily provable in natural deduction systems, by using the inversion principle and normalization lemma of Prawitz (1965), but more effort is required to relate these systems to Hilbert-type systems.

4 Second and Higher Order Logic

The quantifiers that were introduced in §3.5 are of the first order, insofar as their range is constrained to single objects, or in essence those bodies we usually term individuals. As yet there is no way, given only the tools defined so far, to form a collection of subsets distinct from their set-theoretic sum, nor is there a way to quantify over such a collection. The first-order quantifiers of ZF, in stark contrast, range over any set in the cumulative hierarchy. At this stage, further development requires that we select one of two very broad options. First, we can bestow upon sets a route to obtaining the property of singlehood, thereby filling the universe VV in the way normally done in ZF and similar theories. Second, following the paradigm of putting eggs into cartons, cartons onto racks, racks onto pallets, pallets into containers, and so on, we can build, atop of VV, a larger universe of collections. We present versions of these two approaches in this section, beginning with the latter, and discuss Cantor's theorem and the Burali-Forti paradox.

4.1 Genera

In certain respects the approach of Russell, Ramsey, Quine and other typed systems is quite similar to what follows in this subsection—so much so that the present system can be considered a descendent of theirs.

We carry out a construction of the collection of sets, which we style the genus. Let a pair of signs \mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu} and 1\mid_{\scriptscriptstyle{1}} be introduced, together with the formation rules

αβ\displaystyle\alpha\,\beta (αβ)\displaystyle\longmapsto(\alpha\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}\beta)
αβ\displaystyle\alpha\,\beta (α1˙β[α/˙α])\displaystyle\longmapsto(\alpha{}^{\prime}\mspace{1.0mu}^{\dot{}}\mid_{\scriptscriptstyle{1}}\beta[\alpha{}^{\prime}\mspace{1.0mu}^{\dot{}}/\alpha{}^{\prime}\mspace{1.0mu}])

Thus (\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}) is a binary operation on objects, called type-1 federation. Objects aa and bb are subobjects of the object aba\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b, as usual. In our axiomatization of (\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}), we shall rely upon equalities between objects to formally define a product that is identical to union/conjunction/federation (\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\,), except that it does not play the role that federation plays in logic.

Axiom. Type-1 federation is associative, commutative, idempotent, substitutive, and has zero element Ø (axioms B1-6 of Table 6).

Definition 4.1.

Let aa and bb be objects. We write a1ba\subset_{\scriptscriptstyle{1}}b, or simply aba\subset b if context allows, if b=ab.b=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b. The relation 1\subset_{\scriptscriptstyle{1}} is called type-1 containment or first containment. When a1ba\subset_{\scriptscriptstyle{1}}b, we say that aa is contained in bb, or type-1 contained in, or first-contained in bb.

We can eventually define type-nn containment where nn is any integer or even any ordinal.

Theorem 22.

The type-1 containment relation is reflexive, transitive, and antisymmetric. In addition,

(a1bb1a)a=b.\displaystyle(a\subset_{\scriptscriptstyle{1}}b\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b\subset_{\scriptscriptstyle{1}}a)\rightarrow a=b. (17)
Ø1a1ab.\displaystyle\text{{\O }}\subset_{\scriptscriptstyle{1}}a\subset_{\scriptscriptstyle{1}}a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b. (18)
(a1b)(a1bc).\displaystyle(a\subset_{\scriptscriptstyle{1}}b)\rightarrow(a\subset_{\scriptscriptstyle{1}}b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c). (19)
(a1cb1c)(ab1c).\displaystyle(a\subset_{\scriptscriptstyle{1}}c\,\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\,b\subset_{\scriptscriptstyle{1}}c)\rightarrow(a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b\subset_{\scriptscriptstyle{1}}c). (20)
{proof*}

To prove that 1\subset_{\scriptscriptstyle{1}} is transitive, let a1ba\subset_{\scriptscriptstyle{1}}b and b1cb\subset_{\scriptscriptstyle{1}}c. Then b=abb=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b and c=bcc=b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c. Therefore

c=(ab)c=a(bc)=ac.c=(a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b)\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}(b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c)=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c.
Table 6: Postulates of Section 4 (variables a,b,ca,b,c, objects α,β,γ\alpha,\beta,\gamma)
{oldtabular}

c {oldtabular}[t]lc B1. (ab)c=a(bc)(a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b)\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}(b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c)

B2. aØ=aa\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}\text{{\O }}=a

B3. a=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}\bot=\bot

B4. ab=baa\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b=b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}a

B5. aa=aa\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}a=a

B6. (a=b)(ac=bc)(a=b)\rightarrow(a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c=b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c)

B7. 𝔰(a)𝔰1(a)\operatorname{\mathfrak{s}}(a)\rightarrow\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a)

B8. 𝔰1(a)aV\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a)\rightarrow a\subset V

B9. (¬𝔰1(a)bØba)(\lnot\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a)\land b\neq\text{{\O }}\land b\neq a)\rightarrow

(ab=¬(ab)¬(ba))(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b=\bot\land\lnot(a\rightarrow b)\land\lnot(b\rightarrow a))

B10. a1bc(a1b)(a1c).a\in_{\scriptscriptstyle{1}}b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c\rightarrow(a\in_{\scriptscriptstyle{1}}b)\lor(a\in_{\scriptscriptstyle{1}}c).

B11. a1(x˙1ϕ[x˙/x])𝔰1(a)ϕ[a/x]a\in_{\scriptscriptstyle{1}}(x^{\dot{}}\mid_{\scriptscriptstyle{1}}\phi[x^{\dot{}}/x])\leftrightarrow\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\phi[a/x]

(objects ϕ\phi, variables xx)

B12a. (x˙1x˙1a)1a(x^{\dot{}}\mid_{\scriptscriptstyle{1}}x^{\dot{}}\in_{\scriptscriptstyle{1}}a)\subset_{\scriptscriptstyle{1}}a

B12b. (a1V1)(a1(x˙1x˙1a))(a\subset_{\scriptscriptstyle{1}}V_{1})\rightarrow(a\subset_{\scriptscriptstyle{1}}(x^{\dot{}}\mid_{\scriptscriptstyle{1}}x^{\dot{}}\in_{\scriptscriptstyle{1}}a))

(variables xx)

{oldtabular}

[t]c

R4. xx does not appear in γ\gamma

γ(αβ)γ(x˙1α[x˙/x])1(x˙1β[x˙/x])\begin{array}[]{c}\gamma\rightarrow(\alpha\rightarrow\beta)\\ \gamma\rightarrow(x^{\dot{}}\mid_{\scriptscriptstyle{1}}\alpha[x^{\dot{}}/x])\subset_{\scriptscriptstyle{1}}(x^{\dot{}}\mid_{\scriptscriptstyle{1}}\beta[x^{\dot{}}/x])\rule{0.0pt}{10.0pt}\\ \raisebox{24.0pt}[0.0pt][0.0pt]{$\begin{array}[]{c}\phantom{xxxxxxxxxxxxxxxxxxxxxxxxxxx}\\ \hline\cr\end{array}$}\end{array}

{oldtabular}

lc C1. 𝔰(a)\operatorname{\mathfrak{s}}(a{}^{\text{\textendash}\mspace{2.0mu}})

C2. a=aa{}^{\text{\textendash}\mspace{2.0mu}}{}^{\sim\mspace{2.0mu}}=a

C3.   a=b(a=ba=b)a=b\rightarrow(a{}^{\text{\textendash}\mspace{2.0mu}}=b{}^{\text{\textendash}\mspace{2.0mu}}\land a{}^{\sim\mspace{2.0mu}}=b{}^{\sim\mspace{2.0mu}})

Definition 4.2.

An object is type-1 single or first-single, or a generic individual, 𝔰1(a)\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a), if it has no generic content. That is, 𝔰1(a)(b1a(b¯b=a)).\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a)\,\leftrightarrow\,(b\subset_{\scriptscriptstyle{1}}a\rightarrow(\underline{b}\lor b=a)). We write aba\in b, or a1ba\in_{\scriptscriptstyle{1}}b, if a1ba\subset_{\scriptscriptstyle{1}}b and 𝔰1(a)\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a). A genus that is not first-single is a proper genus. A nonsingle, first-single object is a plurality.

Axioms B7-10 further characterize generic individuals. Axiom B7 ensures that no proper genus can be single (this principle is exigent in light of Theorem 4.33 below). Axiom B8 introduces the simplifying assumption that there are no generic individuals beyond the sets. Thus the generic individuals are the sets: all individuals together with all pluralities. Axiom B9 grants meaning to the expressions aba\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b and aba\rightarrow b when either aa or bb is a proper genus. Axiom B10, perhaps the most important of this group, asserts the primality of first-single objects with respect to ()(\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}).

Proposition 23.

𝔰1(ab)(a¯b¯a=b).\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b)\rightarrow(\underline{a}\lor\underline{b}\lor a=b).

{proof*}

As in §3.1.

Proposition 24.

aV𝔰1(a).a\subset V\rightarrow\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a).

{proof*}

Suppose that ¬𝔰1(a)\lnot\operatorname{\mathfrak{s}}_{\scriptscriptstyle{1}}(a). Then aa is a proper genus by Definition 4.29. Axiom B9 therefore implies that if aVa\subset V, then a=(aV)a=(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}V), and hence a=a=\bot.

The rest of the apparatus is quite analogous to previous definitions and postulates. We define, for every variable XX and proposition ϕ\phi, the type-1 class of XX and ϕ\phi or the class of sets XX such that ϕ\phi, formally as (X˙1ϕ[X˙/X])(X^{\dot{}}\mid_{\scriptscriptstyle{1}}\phi[X^{\dot{}}/X]\,). We shall usually denote this {ϕX}\{{}^{X}\,\phi\}, or {ϕX}1{}_{\scriptscriptstyle{1}}\{{}^{X}\,\phi\}, if it is necessary to distinguish the type. The modified condition ϕ[X˙/X]\phi[X^{\dot{}}/X] and the modified class index X˙X^{\dot{}} are subobjects of the class {ϕX}\{{}^{X}\,\phi\}. These objects are subject to postulates B11, B12, and R4.

Definition 4.3.

V1={ØX}V_{1}=\{{}^{X}\,\text{{\O }}\}, the class of sets with no condition. If the object a1V1a\subset_{\scriptscriptstyle{1}}V_{1}, aa is said to be a genus. If bb is a genus and a1ba\subset_{\scriptscriptstyle{1}}b, we say that aa is a subgenus of bb.

V1V_{1} is the type-1 universe, or the universal genus. Script letters 𝒜,,𝒞,\mathcal{A},\mathcal{B},\mathcal{C}, etc. are used to denote genera. By letting 1X(ϕ)=(V1={ϕX})\mathop{\forall_{\scriptscriptstyle{1}}X\;}(\phi)=(V_{1}=\{{}^{X}\,\phi\}), 1X(ϕ)=(Ø{ϕX})\mathop{\exists_{\scriptscriptstyle{1}}X\;}(\phi)=(\text{{\O }}\neq\{{}^{X}\,\phi\}), we define type-1 quantifiers, which range over sets as well as individuals. We usually denote these Xϕ\mathop{\forall X\,}\phi and Xϕ\mathop{\exists X\,}\phi.

The curious reader may verify that letting

a1b={XX1aX1b}a\cap_{\scriptscriptstyle{1}}b=\{{}^{X}\,X\in_{\scriptscriptstyle{1}}a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}X\in_{\scriptscriptstyle{1}}b\}

everything remains the same as in §2 and §3 at the generic level, and flows from the principle of extensionality much as before. For example, to verify A6, let c1ac\subset_{\scriptscriptstyle{1}}a and c1bc\subset_{\scriptscriptstyle{1}}b. Then a1b={XX1acX1bc}=a\cap_{\scriptscriptstyle{1}}b=\{{}^{X}\,X\in_{\scriptscriptstyle{1}}a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}X\in_{\scriptscriptstyle{1}}b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}c\}= (by primality) {XX1c(X1aX1b)}=c(a1b).\{{}^{X}\,X\in_{\scriptscriptstyle{1}}c\lor(X\in_{\scriptscriptstyle{1}}a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}X\in_{\scriptscriptstyle{1}}b)\}=c\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}(a\cap_{\scriptscriptstyle{1}}b). Here, as elsewhere, much depends upon the formula {ϕXψ}={ϕX}{ψX}\{{}^{X}\,\phi\lor\psi\}=\{{}^{X}\,\phi\}\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}\{{}^{X}\,\psi\}.

We will not do much with genera here (though see §5), but we can define a few familiar objects. For every genus 𝒜\mathcal{A}, the sum set of 𝒜\mathcal{A} is 𝔖𝒜={Xx(xXX𝒜)}.\mathfrak{S}\mspace{-2.0mu}\mathcal{A}=\{{}^{x}\,\mathop{\exists X\,}(x\in X\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}X\in\mathcal{A})\}. We can also define, for all sets AA, its set power or subset genus, the genus of all subsets of AA: 𝒫A={XXA}.\mathscr{P}\!A=\{{}^{X}\,X\subset A\}. Thus we have, for single a,b,c,a,b,c,\ldots

𝒫(Ø)=Ø,\displaystyle\mathscr{P}(\text{{\O }})=\text{{\O }},
𝒫(a)=a,\displaystyle\mathscr{P}(a)=a,
𝒫(ab)=ab(ab),\displaystyle\mathscr{P}(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b)=a\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}b\mspace{-5.0mu}\mathbin{{}_{\scriptstyle{\And}}}\mspace{-6.0mu}(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b),
\displaystyle\vdots

In general, the power of a finite set of nn individuals has 2n12^{n}-1 elements. If this is something less than the reader expected, he or she will note that the cardinality of infinite sets is unaffected by the missing null (as noted above, Ø is not an element of any set); in fact, the power is itself of less theoretical importance without the motivation of Zermelo's principle of Aussonderong and the Power Set axiom. The usual value 2n2^{n} can be regained with a simple adjustment of the definitions above: instead of having as the type-1 identity Ø, one introduces a new identity which might be denoted Ø1\text{{\O }}_{1}, or even {Ø}\{\text{{\O }}\}. However, since these constants quickly proliferate, the user of the system is then forced to define scores of symbols all intuitively signifying a trivial structure, a ``set of nothing". The author's approach is therefore to pursue the simpler of the two options, and to stand prepared should a clear and distinct reason for introducing typed identities arise. This may well occur, even though it has yet to so far; we should recall the nearly two millenia that passed in the West before 0 was finally distinguished from Ø.252525The infinite reduplication of each logically definable class at each type in the simple theory of types (which, as noted in Quine (1938, p. 137), is “particularly strange in the case of the null class”) was foremost among the concerns that gave rise to NF.

We assume (without formal development) the notion of a relation RR as a set of ordered pairs, that is, of substitutive products (a,b)(a,b), for objects aa and bb. In place of {(x,y)yR}\{{}^{y}\,(x,y)\subset R\}, we write R(a)R(a). We call a relation ff a function if a𝔰(f(a))\mathop{\forall a\,}\operatorname{\mathfrak{s}}(f(a)), etc.

Theorem 25 (Cantor).

Let the set AA be a plurality. Then there is no injective function from AA onto 𝒫A\mathscr{P}\!A.

{proof*}

Let AA be a plurality, let ff be an injective mapping of AA onto 𝒫A\mathscr{P}\!A, and consider the set Y={xxf(x)}Y=\{{}^{x}\,x\notin f(x)\}. On pain of contradiction, YY must be empty. In that case (encountered when the Russell set was examined above), an element of AA can be mapped to no other value but itself. But since AA is a plurality, this contradicts the hypothesis that ff is onto.

Because Cantor's theorem is unavoidable, genera cannot be members of VV. If genera were allowed to be single, Cantor's theorem would lead directly to Cantor's paradox, because the power of VV, a subgenus of V1V_{1}, would be mapped in its entirety into VV itself, leading to contradictory results on the cardinality of VV. This means, effectively, that it is not possible to form a set of proper genera. Per axiom B9, such sets are left undefined.

4.2 Totalities

The reader is no doubt familiar with the construction introduced by von Neumann in 1923 defining sets that may be conveniently identified with order types. This construction is carried out within our system in this section. The original intent and style of thought is preserved: rather than sets (as given above) we formally generate a class of individuals sharing characteristic properties (which we label totalities) and apply abstraction to study them as an unrestricted whole. We begin by introducing two new symbols, {}^{\text{\textendash}\mspace{2.0mu}} and {}^{\sim\mspace{2.0mu}}, with formation rules

α(α)\displaystyle\alpha\longmapsto(\alpha){}^{\text{\textendash}\mspace{2.0mu}}
α((α))\displaystyle\alpha\longmapsto((\alpha){}^{\text{\textendash}\mspace{2.0mu}}){}^{\sim\mspace{2.0mu}}

(we will suppress these parentheses when they are unnecessary). With these come axioms C1-3:262626As noted above, the product \land, equivalent to \mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu} except for a lower place in the order of operations, may be used at times to suppress unwanted parentheses. This gives rise to the order of operations as follows (where, for added context, ring operations are added): ,,+,,,{=,,},,,{,,}\cdot,\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu},+,\cap,\cup,\{=,\subset,\supset\},\land,\lor,\{\leftrightarrow,\rightarrow,\leftarrow\}

𝔰(a)\displaystyle\operatorname{\mathfrak{s}}(a{}^{\text{\textendash}\mspace{2.0mu}})
a=a\displaystyle a{}^{\text{\textendash}\mspace{2.0mu}}{}^{\sim\mspace{2.0mu}}=a
a=b(a=ba=b)\displaystyle a=b\rightarrow(a{}^{\text{\textendash}\mspace{2.0mu}}=b{}^{\text{\textendash}\mspace{2.0mu}}\land a{}^{\sim\mspace{2.0mu}}=b{}^{\sim\mspace{2.0mu}})

The operation ()(\cdot){}^{\text{\textendash}\mspace{2.0mu}} is analogue to the braces of normal set notation. Following D. R. Finkelstein, we call it the bracing operation. The antibracing operation {}^{\sim\mspace{2.0mu}} strips away the shell introduced by bracing. We call objects aa such that a=ba=b{}^{\text{\textendash}\mspace{2.0mu}} for some bb totalities. Let the ordered pair (a,b)(a,b) of two objects be the totality (a(ab))(a{}^{\text{\textendash}\mspace{2.0mu}}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}(a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b){}^{\text{\textendash}\mspace{2.0mu}}){}^{\text{\textendash}\mspace{2.0mu}}; relations may now be defined as usual (as in, for example, Suppes (1960), or Kunen (1980)). In particular, let a<ba<b if aba\in b{}^{\sim\mspace{2.0mu}}. We say that a set AA is transitive if b,bAbA\mathop{\forall b\,},b\in A\rightarrow b{}^{\sim\mspace{2.0mu}}\subset A, as usual; however, a totality aa is subtransitive if b,baba\mathop{\forall b\,},b\in a{}^{\sim\mspace{2.0mu}}\rightarrow b{}^{\sim\mspace{2.0mu}}\subset a{}^{\sim\mspace{2.0mu}}.

Definition 4.4.

An object aa is an ordinal if it is a subtransitive totality and aa{}^{\sim\mspace{2.0mu}} is well-ordered by <<.

The first few ordinals are then Ø,Ø,(ØØ),\text{{\O }}{}^{\text{\textendash}\mspace{2.0mu}},\text{{\O }}{}^{\text{\textendash}\mspace{2.0mu}}{}^{\text{\textendash}\mspace{2.0mu}},(\text{{\O }}{}^{\text{\textendash}\mspace{2.0mu}}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\text{{\O }}{}^{\text{\textendash}\mspace{2.0mu}}{}^{\text{\textendash}\mspace{2.0mu}}){}^{\text{\textendash}\mspace{2.0mu}}, etc., or (introducing abbreviations) 0,1,2,(012)=3,0,1,2,(0\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}1\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}2){}^{\text{\textendash}\mspace{2.0mu}}=3, etc.

Forming a totality out of the set of all ordinals leads to the oldest modern set-theoretical paradox.

Theorem 26.

Let Ω={xx is an ordinal}.\Omega=\{{}^{x}\,x\textup{ is an ordinal}\}. Then if ΩV\Omega{}^{\text{\textendash}\mspace{2.0mu}}\in V, then Ø=\text{{\O }}=\bot.

{proof*}

Ω\Omega, the set of all ordinals, is transitive and well-ordered by \in. Therefore Ω\Omega{}^{\text{\textendash}\mspace{2.0mu}} is itself an ordinal, that is, ΩΩ\Omega{}^{\text{\textendash}\mspace{2.0mu}}\in\Omega. But for any xΩx\in\Omega, ¬(x<x)\lnot(x<x), that is, xxx\notin x{}^{\sim\mspace{2.0mu}}. Therefore ΩΩ\Omega{}^{\text{\textendash}\mspace{2.0mu}}\notin\Omega, a contradiction.

This is certainly not the first time that this paradoxical result has come to light in this fashion. It was found by Cantor while he was developing the theory of the transfinite, and it was formally derived by Burali-Forti in a paper which appeared around the turn of the century. Many years later, it impinged upon Quine when he extended his system NF with the class-existence axioms of Quine (1951). After the first edition of the book appeared in 1940, J.B. Rosser discovered that the system it contained (known as ML) was inconsistent. Quine had posited that all classes having stratified conditions and whose quantifiers could range freely over classes could be considered elements, and made members of his universe VV. Having worked out the difficulties related to Cantor's paradox of the largest cardinal, he had been convinced, wrongly as it turned out, that the system would not be troubled by the paradox of Burali-Forti.272727A repair of ML due to Hao Wang was used for the 1951 edition cited in the bibliography.

After what happened to no less creative and exact a logician than Quine, we must count ourselves fortunate that we noticed the paradox before hastening to press ourselves. This may be due in some part to the simplicity of the derivation here compared to the rather lengthy derivation required in ML. The problem with the naive procedure seems to be that the formation rule

αα\alpha\longmapsto\alpha{}^{\text{\textendash}\mspace{2.0mu}}

and the axiom

𝔰(a)\operatorname{\mathfrak{s}}(a{}^{\text{\textendash}\mspace{2.0mu}})

when combined give rise to an ungovernable feedback when operating in tandem with the class structure introduced in §3. A class can be chosen and traded in for a new individual; the creation of an individual sparks a revision of the classes. Given such a potential source of instability, it shouldn't be surprising that a paradox was discovered lurking.

Let us see if we can distill the logic of Theorem 4.35. Since the crucial property is ¬(x<x)\lnot(x<x), we could consider the class

{xxx}\{{}^{x}\,x\notin x{}^{\sim\mspace{2.0mu}}\}

which contains Ω\Omega, and falls to the Russell antinomy.282828Call this set ss. ss{}^{\text{\textendash}\mspace{2.0mu}} is an object; ss{}^{\text{\textendash}\mspace{2.0mu}} is single. If ss,s{}^{\text{\textendash}\mspace{2.0mu}}\in s, then ss,s{}^{\text{\textendash}\mspace{2.0mu}}\notin s{}^{\text{\textendash}\mspace{2.0mu}}{}^{\sim\mspace{2.0mu}}, i.e., sss{}^{\text{\textendash}\mspace{2.0mu}}\notin s. And vice versa: if ss,ss,s{}^{\text{\textendash}\mspace{2.0mu}}\notin s,s{}^{\text{\textendash}\mspace{2.0mu}}\notin s{}^{\text{\textendash}\mspace{2.0mu}}{}^{\sim\mspace{2.0mu}}, i.e., sss{}^{\text{\textendash}\mspace{2.0mu}}\in s. It seems that classes can now be definitions of the non-predicative kind rejected by Poincaré. Having betrayed our intention (of §3.2) never to use any class as a vehicle for the formation of new objects in addition to those already in place before the class has been formed, we have found ourselves again bushwhacked by Russell's paradox.

The problem may be as old as set theory itself, but we have new notions to employ—the distinct concepts of multiplicity and totality—which allow us hope that the paradox might yet be dispelled. It must be noted, however, that these notions were introduced already for this purpose by Cantor. First to arrive at the difficulty, he concluded that Ω\Omega is a multiplicity for which there is no corresponding totality. He wrote to Dedekind in 1899: ``If we start from the notion of a definite multiplicity (a system, a totality) of things, it is necessary … to distinguish two kinds of multiplicities… For a multiplicity can be such that the assumption that all of its elements `are together' leads to a contradiction, so that it is impossible to conceive of the multiplicity as a unity, as `one finished thing'."292929See van Heijenoort (1967, pp. 113-7). He called such multiplicities absolutely infinite or inconsistent multiplicities.

Cantor's interpretation of the difficulty, that the the combustible totalities cannot be formed, or that ¬𝔰(Ω)\lnot\operatorname{\mathfrak{s}}(\Omega{}^{\text{\textendash}\mspace{2.0mu}}), ¬𝔰(s)\lnot\operatorname{\mathfrak{s}}(s{}^{\text{\textendash}\mspace{2.0mu}}), etc., is motivated by the intuitive view (held by Cantor) that this construction and all such constructions stand on the cusp of the natural boundaries of the classifiable. For Ω\Omega, in particular, this account is suggestive, given the commonplace associations (from a range of sources upholding varying standards of rigor) between the incomplete infinite (the Cantorian ``absolute" infinite) and the inconceivable or impossible. In one form or another, this vision of a great divide in the ontology of set theory is shared by almost all of Cantor's descendents, but our work has at least one profound contrast with others: it envisions the boundary brought down, so to speak, from a great height, and made into something more familiar: the distinction between one and many.

By taking what we have here denoted << instead of what we have here denoted \in as set-theoretical membership, the individual is to this system essentially what the set is to NBG and MK. Since set theorists have a long experience with the use of proper classes in these systems (whose relationship with ZF has long been well understood), accepting the existence of proper, ``unfinishable", or ``untotalitizable" classes as Cantor recommended will likely seal the present system from further duress.303030For instance, Mirimanoff’s paradox is thus resolved. Mirimanoff considers the set WW of all well-founded sets—or rather, the totality WW{}^{\text{\textendash}\mspace{2.0mu}} of well-founded totalities. It must be well-founded, since an infinitely descending chain <wn<w1<W\ldots<w_{n}<\ldots w_{1}<W{}^{\text{\textendash}\mspace{2.0mu}} from WW{}^{\text{\textendash}\mspace{2.0mu}} implies an infinite descending chain from w1w_{1}, but w1<Ww_{1}<W{}^{\text{\textendash}\mspace{2.0mu}} is well-founded. If WW{}^{\text{\textendash}\mspace{2.0mu}} is well-founded, however, then W<WW{}^{\text{\textendash}\mspace{2.0mu}}<W{}^{\text{\textendash}\mspace{2.0mu}}, contradiction. In NBG, it is deduced that WW is a proper class—or in Cantor’s idiom, WW is an inconsistent (we would almost prefer to say “perfect”) multiplicity—one which cannot be gathered into a whole. As Fraenkel points out, however, in the preface of Bernays (1958, §7), a ``vast field of uncertainty still remains" between those totalities which trigger an alarm and those which can be constructively imagined. In most axiomatizations of set theory (including ZFC, NBG, and MK), this field of uncertainty is pushed back with seemingly innocuous reassurances—𝔰((ab))\operatorname{\mathfrak{s}}((a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b){}^{\text{\textendash}\mspace{2.0mu}}), 𝔰(𝒫(x))\operatorname{\mathfrak{s}}(\mathscr{P}(x){}^{\text{\textendash}\mspace{2.0mu}}), 𝔰((x))\operatorname{\mathfrak{s}}(\bigcup(x){}^{\text{\textendash}\mspace{2.0mu}}), 𝔰((f(A)\operatorname{\mathfrak{s}}((f(A){}^{\text{\textendash}\mspace{2.0mu}})—and other axioms impacting the structure of the transfinite.

There is, in any case, a guiding assurance: the paradox of the largest ordinal number has arisen for us in precisely the way it arose for Quine in the 1940s—because computationally tricky totalities (or elements, in the case of ML) have been introduced to an underlying system which, as Quine put it, ``is a completely safe basal logic…to which more daring structures may be added at the constructor's peril" (Quine, 1941, p. 135). As an independent form, the system of §3 remains a consistent and adequate setting for basic mathematical intuition—the intuition of aggregates of objects, finite in number, invested with location in a common space. This lends support to the view that the problem of the Burali-Forti paradox should be diagnosed as an attribute of the construction of naïvely defined totalities, and does not reflect doubt upon the fundamental principles governing the multiplicity and the class outlined above.

There likely exists a number of modifications of C1-3 (perhaps along with the formation rules for totalities) by which the difficulties we have observed could be compellingly resolved. Set theorists have already focused a great and impressive effort upon this pursuit. One might also pursue an orthogonal route, laying axioms C1-3 aside and carrying out further constructions and experiments leading into different mathematical and logical domains, employing the B axioms, or something similar, when need arises for collections of sets and second order quantifiers. If we use the system of §3 to stage the integers (beginning along the lines of 𝔰(1),𝔰(a)𝔰(a+1),\operatorname{\mathfrak{s}}(1),\operatorname{\mathfrak{s}}(a)\rightarrow\operatorname{\mathfrak{s}}(a+1), etc., or by defining Church numerals or combinatory terms), the real number system, an elementary topos, or an abstract group, things might all proceed as one might hope and expect, and we could soon find ourselves having access to a language quite indiscernible from that of researchers whose work is formally grounded by other means. However, because practical attainment of these diaphanous connections might pose unexpected challenges (and these may be instructive and fruitful), we await the findings of further investigations.

5 Lower Order Logic, Modulation, Mathematics

Having completed the task set out in the introduction, we are prepared to conclude. However, we hesitate, since readers might now still wish to ask: why go to all the trouble of laying set-theoretical and logical foundations when, to be sure, set-theoretical and logical foundations have already been laid? There are several ways to answer this objection based on our previous work. There is the need in science for endless exploration and inquiry; there is the simplified form of our basic definitions and the pull of Occam's razor; there is the manner in which the elements of the system are well-correlated to natural human mathematical intuition, providing unity and explanatory power; there is the convenient reduction in the complexity of metamathematics provided, in particular, by using syntactically defined classes, syntactically defined indices and variables, and the synoptic rule R3; the manner in which the great nineteenth-century set-theoretical paradoxes have been, we believe, freshly illuminated, and perhaps better understood. In sum, the construction has so far proven to possess a number of compelling features. Our purpose in this section is to add a final observation to this reply felt to be worthy of note, by briefly discussing the manners in which our prior definitions offer practical utility to a working mathematician, and thus gain ground towards what we consider an essential motive for research in foundations. We ask the reader's forgiveness if in this section we do not pause to define all of our terms perfectly; the previous work ought to be a satisfactory guide to those seeking a more formal treatment. We begin by generalizing axiom group B slightly.

Axiom. For nonzero integers nn and m<nm<n and objects aa, bb, and cc, the type-n federal product, or type-n federation, is defined, subject to the following axioms:

B1. (ab)c=a(bc).(a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}b)\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c=a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}(b\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c).
B2. aØ=a.a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}\text{{\O }}=a.
B3. a=a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}\bot=\bot
B4. ab=ba.a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}b=b\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}a.
B5. aa=a.a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}a=a.
B6. (a=b)(ac=bc).(a=b)\rightarrow(a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c=b\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c).
B7. 𝔰n1(a)𝔰n(a).\operatorname{\mathfrak{s}}_{\scriptscriptstyle{n-1}}(a)\rightarrow\operatorname{\mathfrak{s}}_{\scriptscriptstyle{n}}(a).
B8. 𝔰n(a)an1Vn1.\operatorname{\mathfrak{s}}_{\scriptscriptstyle{n}}(a)\rightarrow a\subset_{\scriptscriptstyle{n-1}}V_{n-1}.
B9. ¬𝔰n(a)((bØba)(ab=¬(amb)¬(bma))).\lnot\operatorname{\mathfrak{s}}_{\scriptscriptstyle{n}}(a)\rightarrow((b\neq\text{{\O }}\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b\neq a)\rightarrow(a\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{m}}}b=\bot\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\lnot(a\subset_{\scriptscriptstyle{m}}b)\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\lnot(b\subset_{\scriptscriptstyle{m}}a))).
B10. anbcanbanc.a\in_{\scriptscriptstyle{n}}b\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c\rightarrow a\in_{\scriptscriptstyle{n}}b\lor a\in_{\scriptscriptstyle{n}}c.

The analogous generalization of axioms B11, B12, and R4 is straightforward. Type-nn abstraction is denoted {ϕx}n{}_{\scriptscriptstyle{n}}\{{}^{x}\,\phi\}, or {ϕx}\{{}^{x}\,\phi\} if context allows; elements aa of the class must be type-nn single, 𝔰n(a)\operatorname{\mathfrak{s}}_{\scriptscriptstyle{n}}(a). We write n\in_{\scriptscriptstyle{n}}, n\subset_{\scriptscriptstyle{n}}, etc.; type subscripts may often be suppressed if confusion does not arise as a result. The type-nn universe {Øx}n{}_{\scriptscriptstyle{n}}\{{}^{x}\,\text{{\O }}\} is denoted VnV_{n}. We set ab=aba\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{0}}}b=a\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}b, etc.

Names tend to lend concreteness to abstract ideas, so we shall say that if…then we call the object a…

xV1,x\subset V_{-1}, residue.
xV0,x\subset V_{0}, set.
xV1,x\subset V_{1}, genus.
xV2,x\subset V_{2}, family.
xV3,x\subset V_{3}, phylum.
xV4,x\subset V_{4}, regnum.

This taxonomic hierarchy is of course taken over from the ranks of the eighteenth century Linnaean system. The terms order and class have been removed, since they are frequently used for other purposes in mathematics.313131Nor are they alone: the terms form, variety, type, and domain all have currency in taxonomy. The terms set and residue are added since there is an admirable correlation between these and existing mathematical notions. Since the term family is often used for a set of collections (as for example in Rudin (1987)), there is another agreement with standard terminology at the V2V_{2} rank. The term species might be suggested over set, since Aristotle's original binomial taxonomy can then be recognized in the generalized system, and since that term is generally used by Brouwer in his papers on intuitionistic set theory (see, e.g., Brouwer (1923)). It also suggests itself since it is useful to have the term set available at the metalevel (as we saw in §3.5). Nevertheless, we shall continue to refer to generic individuals as sets.

Definition 5.1.

Let A={xxnA}n={ϕx}nA={}_{\scriptscriptstyle{n}}\{{}^{x}\,x\in_{\scriptscriptstyle{n}}A\}={}_{\scriptscriptstyle{n}}\{{}^{x}\,\phi\}. The modulation of AA is defined to be the expression

{xxnA}n1={ϕx}n1,{}_{\scriptscriptstyle{n-1}}\{{}^{x}\,x\in_{\scriptscriptstyle{n}}A\}={}_{\scriptscriptstyle{n-1}}\{{}^{x}\,\phi\},

conveniently denoted ˘A¯\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{A}.

The inverse operation, or antimodulation of AA may also be defined in the natural way. Thus from a given set AA, one obtains a residue through modulation, and a genus through antimodulation. Among the two, the modulation (which may be pronounced modulo AA, or simply mod AA) serves particularly well as a basic tool in algebra. Let RR be a set, and let (+),()(+),(\cdot) be products defined on RR, with a+ba+b, ab=abRab=a\cdot b\in R for a,bRa,b\in R. Let

a+(bc)=(a+b)(a+c),\displaystyle a+(b\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c)=(a+b)\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}(a+c), (21)
a(bc)=abac,\displaystyle a(b\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}c)=ab\mathbin{{}_{{\scriptscriptstyle{\setminus}}_{n}}}ac, (22)

so that (+)(+) and ()(\cdot) are distributive over the entire federal hierarchy. Let (R,+,,0,1)(R,+,\cdot,0,1) be a ring with unity, and let SRS\subset R. The question of when the set R+˘S¯R+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S} forms a ring may be answered by examining the elementary calculations

(a+˘S¯)+(b+˘S¯)=a+b+2˘S¯,\displaystyle(a+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})+(b+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})=a+b+2\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}, (23)
(a+˘S¯)(b+˘S¯)=ab+a˘S¯+b˘S¯+˘S¯˘S¯.\displaystyle(a+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})(b+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})=ab+a\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}+b\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}. (24)

We wish to obtain a set of conditions under which these expressions become

(a+˘S¯)+(b+˘S¯)=(a+b)+˘S¯,\displaystyle(a+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})+(b+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})=(a+b)+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}, (25)
(a+˘S¯)(b+˘S¯)=ab+˘S¯,\displaystyle(a+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})(b+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S})=ab+\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{S}, (26)

which correlate multiplication and addition with and without the modulated factor SS. (25) will follow from (23) when SS is additively closed and contains 0. A solution to the problem of satisfying the constraint of (26) is to allow that for all aS,aS=Sa\in S,aS=S, and that SS be multiplicatively closed, so that S2SS^{2}\subset S (remembering 0S0\in S). Thus if aSa\in S, 2a,3a,2a,3a,\ldots and a2,a3,Sa^{2},a^{3},\ldots\in S as well. Moreover aS-a\in S since a(1)S-a\in(-1)S, giving an additive group, and the multiplicative closure requirement can similarly be dropped. We call such a structure an ideal inside the ring.

The reader should note that the ideal concept has been defined neither dogmatically nor informally. Rather, it has been directly detected and recovered, by means of a brief, real-time examination of algebraic structure. The style of calculation can be continued to yield elegant direct proofs of the isomorphism theorems, and generally to facilitate investigation within the theory of homomorphisms and beyond.

Consider the case R=R=\mathbb{Z}. Each ideal has the form m=2mm0m2mm\mathbb{Z}=\ldots\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}-2m\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}-m\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}0\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}m\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}2m\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\ldots\; for some mm\in\mathbb{Z}. The study of equations in \mathbb{Z} carrying the factor m˘¯m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\,\mathbb{Z}} was first undertaken systematically by Gauss (1986), who wrote

ab(mod.m)a\equiv b\;\,(\text{mod.}\;m)

for the relation

a+m˘¯=b+m˘¯,a+m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\,\mathbb{Z}}=b+m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\,\mathbb{Z}},

or equivalently,

ab+m.a\in b+m\mathbb{Z}.

Thus the formal expressions we obtain using the algebraic set bear a striking resemblance to the classical notation coined by Gauss.

We present a proof of a well-known elementary theorem in the calculus of residues, a proof that will in fact be similar to the one given by Gauss (op. cit., pp. 10-13). Because the symbol \mathbb{Z} recurs frequently, it will be suppressed under modulation, thus m˘¯=m˘¯m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}=m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\,\mathbb{Z}}. We will work with equalities and containments between residues, i.e., federal products a+m˘¯a+m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\,\mathbb{Z}}.

Consider the problem of finding the integer solutions xx of the equation ax=bax=b modulo an integer mm, that is, finding the complete solution xx\subset\mathbb{Z} of the expression

ax+m˘¯=b+m˘¯ax+m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}=b+m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;} (27)

where a,b,ma,b,m\in\mathbb{Z}. In order to bring the expression into a shorter and more standard form, we write (27) as

ax=b+m˘¯{ax=b}^{\,\>+\>m\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}} (28)

to indicate that the superscripted factor is applied on both sides of the equation. Let d>0d>0 be the greatest common divisor of aa and mm. Adding323232This operation is permitted; however, ideals do not have additive inverses, so one may not subtract them out. the residue d˘¯d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;} to both sides of (28) gives

ax=b+m˘¯+d˘¯{ax=b}^{\,\>+\>m\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}\>+\>d\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}} (29)

but since ax+d˘¯=d˘¯ax+d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}=d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;} and m˘¯+d˘¯=d˘¯m\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}+d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}=d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}, this becomes

0=b+d˘¯{0=b}^{\,\>+\>d\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}} (30)

or

d˘¯=b+d˘¯.d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}=b+d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}. (31)

Selecting a member from the right hand side gives

d˘¯b.d\mbox{\raisebox{1.2pt}{\u{}}}\mspace{-2.3mu}\overline{\phantom{Z}\;}\ni b. (32)

Antimodulation gives bdb\in d\mathbb{Z}, i.e., dd divides bb. Dividing by dd gives

ax=b+m˘¯{a^{\prime}x=b^{\prime}}^{\,\>+\>m^{\prime}\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}} (33)

where aa^{\prime} is relatively prime to mm^{\prime}. Since a+m=da\mathbb{Z}+m\mathbb{Z}=d\mathbb{Z} (which we do not pause to prove), b=a+mb^{\prime}\in\mathbb{Z}=a^{\prime}\mathbb{Z}+m^{\prime}\mathbb{Z}, and therefore

ba+m˘¯{b^{\prime}\in a^{\prime}\mathbb{Z}}^{\,\>+\>m^{\prime}\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}} (34)

that is,

ba(012m1)+m˘¯.{b^{\prime}\in a^{\prime}(0\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}1\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}2\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\ldots\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}m^{\prime}-1)}^{\,\>+\>m^{\prime}\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}.} (35)

By multiplying (35) through by dd, we see that the solution of (28) is nonempty. Since a0+m˘¯{a^{\prime}\neq 0}^{\,\>+\>m^{\prime}\mbox{\raisebox{-0.52pt}{\u{}}}\mspace{-2.42mu}\overline{\phantom{Z}\;}}, this solution is single modulo mm^{\prime}.333333We do not intend to flout tradition: a solution exists, and is unique modulo mm^{\prime}. Our intent here is to point out a certain agreement between the shared notions of mathematics and notions we have developed in the preceding sections while working in foundations. We may thus write the solution set in the form x0+mx_{0}+m^{\prime}\mathbb{Z}, for some single solution x0x_{0}. It remains only to determine the form of the solutions in the modulus mm^{\prime}. This is easy: we proceed by extracting the coarser modulus:

x\displaystyle x =x0+m\displaystyle=x_{0}+m^{\prime}\mathbb{Z}
=x0+m+m\displaystyle=x_{0}+m^{\prime}\mathbb{Z}+m\mathbb{Z}
=x0+m(01d1)+m.\displaystyle=x_{0}+m^{\prime}(0\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}1\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}\ldots\mspace{-1.0mu}\mathbin{{}_{\scriptscriptstyle{\setminus}}}\mspace{-3.0mu}d-1)+m\mathbb{Z}. (36)

We therefore conclude that, when dd divides bb, there are infinitely many solutions of (28) in the set of integers, dd solutions modulo mm, and a single solution modulo mm^{\prime}. The argument has a more elementary character than a standard proof, since it relies upon natural extensions of familiar algebraic operations and has a very basic quantificational structure. We let this suffice as a simple demonstration of the computational methods organized by instruments developed in the foregoing sections, since we intend to continue this discussion in work now in progress.

6 Acknowledgements

This study began while the author was a student at the University of Georgia advised by Brad Bassler, Robert Varley, David Edwards, and Edward Halper. The author would like to thank them, as well as David Finkelstein and the anonymous referees, for all their help in editing and improving earlier versions of this paper.

References

  • Bell (1988) Bell, J. (1988). Toposes and Local Set Theories: An Interpretation. Oxford: Clarendon.
  • Bernays (1958) Bernays, P. (1958). Axiomatic Set Theory. Amsterdam: North Holland.
  • Beth (1965) Beth, E. W. (1965). The Foundations of Mathematics (2nd ed.). Amsterdam: North-Holland.
  • Birkhoff (1940) Birkhoff, G. (1940). Lattice Theory (3rd ed.). Providence, RI: American Mathematical Society.
  • Brouwer (1923) Brouwer, L. E. J. (1923). Über die Bedeutung des Satzes vom ausgeschlossenen Dritten in der Mathematik, insbesondere in der Funktionentheorie. Journal fur die reine und angewandte Mathematik 154, 1–7. English translation in van Heijenoort (1967, 334-341).
  • Frege (1879) Frege, G. (1879). Begriffsschrift, eine der arithmetischen nachgebildete Formelsprache des reinen Denkens. Halle: Louis Nebert. English translation in van Heijenoort (1967, 1-82).
  • Gauss (1986) Gauss, K. F. (1986). Disquisitiones Arithmeticae. Yale University Press. English translation by A. Clarke, W. Waterhouse. Originally published in 1801.
  • Gödel (1944) Gödel, K. (1944). Russell's Mathematical Logic. In Schilpp, P., editor, The Philosophy of Bertrand Russell, pp.  125–53. Evanston & Chicago: Northwestern University Press.
  • Heyting (1930) Heyting, A. (1930). Die formalen Regeln der intuitionistischen Logik. Sigzungsberichte der Preussischen Akademie der Wissenschaften, 42–56. English translation in Mancosu (1998, pp. 311-27).
  • Hilbert (1927) Hilbert, D. (1927). Die Grundlagen der Mathematik. Abhandlungen aus dem mathematischen Seminar der Hamburgischen Universität 6, 65–85. English translation in van Heijenoort (1967, pp. 464-479).
  • Hilbert & Ackermann (1928) Hilbert, D., & Ackermann, W. (1928). Grundzüge der theoretischen Logik (2nd ed.). Berlin: Springer.
  • Hungerford (1974) Hungerford, T. W. (1974). Algebra. New York: Springer.
  • Kleene (1952) Kleene, S. C. (1952). Introduction to Metamathematics. Amsterdam: North-Holland.
  • Kunen (1980) Kunen, K. (1980). Set Theory: An Introduction to Independence Proofs. Amsterdam: Elsevier B.V.
  • Mancosu (1998) Mancosu, P. (1998). From Brouwer to Hilbert: The Debate on the Foundations of Mathematics in the 1920s. Oxford University Press.
  • Peano (1889) Peano, G. (1889). Arithmetices principia, nova methodo exposita. Turin. English translation in van Heijenoort (1967, pp. 83-97).
  • Prawitz (1965) Prawitz, D. (1965). Natural Deduction: A Proof-Theoretical Study. Stockholm: Almqvist & Wiksell.
  • Pudlák (1998) Pudlák, P. (1998). The lengths of proofs. In Buss, S., editor, Handbook of Proof Theory, Chapter 8. Elsevier Science B.V.
  • Quine (1937) Quine, W. V. O. (1937). Logic Based On Inclusion and Abstraction. Journal of Symbolic Logic 2(4), 145–152.
  • Quine (1938) Quine, W. V. O. (1938). On the Theory of Types. Journal of Symbolic Logic 3(4), 125–139.
  • Quine (1941) Quine, W. V. O. (1941). Element and Number. Journal of Symbolic Logic 6(4), 135–149.
  • Quine (1951) Quine, W. V. O. (1951). Mathematical Logic (revised ed.). Cambridge: Harvard University Press.
  • Rudin (1987) Rudin, W. (1987). Real and Complex Analysis (3rd ed.). WCB/McGraw-Hill.
  • Simmons (2000) Simmons, H. (2000). Derivation and Computation. Cambridge University Press.
  • Suppes (1960) Suppes, P. (1960). Axiomatic Set Theory. D. Van Nostrand Company.
  • van Heijenoort (1967) van Heijenoort, J., editor (1967). From Frege to Gödel: A Source Book in Mathematical Logic. New York: toExcel Press.
  • Zermelo (1908) Zermelo, E. (1908). Untersuchungen über die Grundlagen der Mengenlehre I. Mathematische Annalen 59, 261–281. English translation in van Heijenoort (1967, pp. 199-215).