2 Construction of Nonstandard Extensions
The notation we use here should be consistent with some standard
textbooks. Consult, for example, [1, 4, 7]
for more details. If is a function, then denotes
the image of as an element in
and for some .
§2.1 Superstructure Let be the set of all standard non-negative integers used
at the meta-level and be an infinite set of urelements,
i.e., elements without members. The superstructure on
(cf. [1, page 263]), denoted by
, is composed of the base set and the membership relation
on where
|
|
|
and
is defined recursively by letting
|
|
|
for every where is the powerset operator.
For notational convenience we often write also for its base set .
Hopefully, this will not cause confusion. One can define a rank function for
every recursively. Set iff (the abbreviation of
“if and only if”) and
|
|
|
(1) |
for any .
Notice that iff .
Let be the set of all standard positive
integers and be the set of all standard real numbers.
By the standard universe we mean the superstructure
on .
Notice that all standard mathematical objects mentioned in this paper
have ranks below, say, . For example, an ordered pair of
standard real numbers can be viewed as the set
and a function can be viewed as a set of ordered
pairs in . Hence .
§2.2 Logic and model theory Before introducing nonstandard universe we should mention
briefly, without rigor, some concepts in model theory.
For simplicity we consider only model theory
on finite relational languages. We call a set of
finitely many symbols with arity
for each a (relational) language.
An –model is a structure composed of
a nonempty base set and an -nary relation
, called the interpretation of
in , for
each symbol . For notational convenience
we sometimes write for
the base set of .
We can define (first-order)
–formulas recursively starting from atomic formulas. If
and are variables, then
is an atomic formula.
If and is
an -tuple of variables, then is an atomic formula.
The word “first-order” means that these variables are intended to
take only elements of some –model as their values. All formulas mentioned in this paper are first-order.
We will use the symbol to represent
an -tuple of elements with some suitable generic number .
When is intended to be substituted by ,
we assume implicitly that they have the same length.
If
and are –formulas and is a variable,
then , , ,
, , , and
are also –formulas. The symbols
, , , , and are called
logic connectives, and and are called universal and
existential quantifiers. An occurrence of
in is called bounded if it occurs in a sub-formula of the form
or in .
An occurrence of in is called free if it is not bounded.
A formula without free variable is called a sentence.
Let be an –model. For any –formula
, where is an -tuple of variables
containing all free variables in ,
and any , called parameters, we can define
,
meaning is true in , recursively by (a)
iff and are identical elements
in and iff ; (b) for any –formulas and
with all free variables being substituted
by in ,
iff
( means “not”),
iff
or ( means “or”),
iff
and , ( means “and”),
iff
( means “imply”),
and
iff (
means “if and only if”);
(c) for any –formulas ,
iff
for every ( means
“for all”) and iff
for some
( means “there exist”). From the truth definition above, for
every –formula
there is another –formula
using only logic connectives and , and only quantifier ,
such that iff
for any –model
and any .
We sometimes call a formula with all free variables being substituted by
parameters a sentence. Clearly, the truth value of a sentence with parameters
from is determined in .
When we write , we mean implicitly
that all free variables in the formula are among
and all parameters from model are among .
Suppose and are two –models.
A function
is called an elementary embedding from
to if for any –formula
and any -tuple we have
|
|
|
(2) |
where .
An elementary embedding is necessarily injective.
If there exists an elementary embedding ,
we can view as an elementary submodel of
and call an elementary
extension of , denoted by .
We sometimes write
to emphasize that is not surjective.
If is an –model,
, and for , the –model
by adding the relations to is denoted by
. We
call a model expansion of .
§2.3 Ultrapower construction Next we construct an elementary extension of a model using ultrapower
construction (cf. [1, §4]).
Definition 2.1
Let be an infinite set. A set is a
non-principal ultrafilter on if it satisfies the following:
-
1.
and for any
where is the collection of all finite subsets of ,
-
2.
,
-
3.
,
-
4.
.
It is well known that the existence of
non-principal ultrafilters on follows from .
Definition 2.2
Let be an –model and
be a non-principal ultrafilter on an infinite set .
Let be the set of all functions from to .
For any define
|
|
|
It is easy to check that is an equivalence relation. Denote
for the equivalence class containing . The ultrapower
of modulo , denoted by ,
is an –model which is composed of the base set
|
|
|
and the interpretation of by
|
|
|
for every . Notice that
|
|
|
The right side above is the ultrapower of modulo .
For an element denote
for the constant function with value .
Let be the natural embedding,
i.e., . The following is often called Łoś’s theorem.
(cf. [1, Theorem 4.1.9, Corollary 4.1.13].)
Proposition 2.3
For any –formula and
any it is true that
|
|
|
The proof of Proposition 2.3 is done by induction on the complexity
of .
Corollary 2.4
The natural embedding with is
an elementary embedding. Furthermore, if
is a model expansion of , then is also an elementary embedding
from to .
So, the model can be viewed
as an elementary submodel of via the natural embedding
and is an elementary extension of .
§2.4 Construction of Fix a non-principal ultrafilter on .
: From now on let . The ultrafilters
which will be used are , the tensor product of ’s, and
nonstandard versions of them.
Recall that represents the standard universe,
which is an –model. Let
be the ultrapower of modulo
and be the natural embedding.
Denote for the interpretation of in .
Notice that iff for any .
One can define the rank function for every
according to (1) with being replaced by .
Notice that some elements in may not have
a finite -rank. For example, if and
for every , then does not have a
finite -rank. Let
|
|
|
The set is just the ultrapower of modulo truncated
at -rank .
Notice that
for every and
.
Let and .
Assume that every element in
is an urelement and identify by for every .
Then .
Since the natural order on ,
addition and multiplication on
can be viewed as elements in ,
we have that is a linear order on extending ,
is the addition on extending , and is the
multiplication on extending . For notational convenience
we write , , and for , , and ,
respectively. By the elementality of
the structure
is an ordered field containing the standard
real field as its subfield.
Notice that if for every , then
and for every , i.e., contains
natural numbers such as which are infinitely large
relative to real numbers in .
Let be the Mostowski collapsing map on , i.e.,
for every and
|
|
|
for every . Then
is an injection and iff .
If one identifies with the image of under ,
one can pretend that is the true membership relation
and consider as a subset of the superstructure .
Hence, we can pretend that is the true membership relation
and drop ∗ for notational convenience.
The purpose of the truncation of at -rank
is to make sure is well defined in the standard sense.
Let . We call a nonstandard universe
extending . We will extend further later. Notice that
due to the truncation, is no longer an elementary extension of
from the model theoretic point of view. However,
is a so-called bounded elementary extension of .
An –formula
has bounded quantifiers if every occurrence of quantifiers
and in has the form
and . Notice that
is the abbreviation of and
is the abbreviation of
. Similar to (2),
it is easy to show that for any –formula
with bounded quantifiers
and any we have
|
|
|
(3) |
So, the map is called a bounded elementary embedding
from to . It is a common abuse of notation to write
to indicate the existence of the bounded elementary embedding (instead of
just elementary embedding) and to
emphasize that is not surjective. The property (3)
is sometimes called the transfer principle
in nonstandard analysis. Notice that if , then
can be viewed as the ultrapower of modulo , i.e.,
.
Each for is called –internal (or “standard” in
some literature) and
each is called –internal. Hence –internal set
is also a –internal set.
For example, and
are –internal sets.
Some –internal sets are not –internal. For example,
the set
is –internal subset of but not –internal.
Some subsets of a –internal set are not
–internal. For example, as a subset of is
not –internal because it is bounded above in and has no
largest element. Notice that and .
The following proposition says that a subset of
defined by an –formula with parameters
from is –internal. The proposition is
an easy consequence of Proposition 2.3.
Proposition 2.5
Let be an –formula
with bounded quantifiers, and parameters and
being –internal. Then
is a –internal set. (cf. [1, Theorem 4.4.14].)
The following is called the overspill principle
in nonstandard analysis. Notice that is an infinite
initial segment of .
Proposition 2.6
Let be an infinite proper
initial segment of and not –internal.
Let be an –internal subset of .
If is upper unbounded in ,
then .
The proof of Proposition 2.6 is easy.
If , then
can be defined by a formula with bounded quantifiers and
parameter . Hence is –internal.
§2.5 Construction of and We now extend further to and to form a nonstandard
extension chain
|
|
|
using ultrapower construction
and show the existence of bounded elementary embeddings ,
, , and besides the natural embeddings
and . The chain and embeddings
will satisfy the following properties. Let and
be the set of all positive
integers and the set of all real numbers, respectively, in for .
Property 2.7
-
1.
For , is an end–extension of , i.e.,
every number in is greater than each number
in .
-
2.
There is a bounded elementary embedding from the
–model
to
the –model , where
for
two new unary predicate symbols and not in .
Furthermore, the map is a bounded elementary embedding
from to . Notice that
for each .
-
3.
There is a bounded elementary embedding from to
such that is an identity map and
for each .
We are now going to work towards establishing this property in this section.
: From now on, let
always represent, for notational convenience,
the set of all functions from to such that
is a bounded set in .
Notice that
satisfies Part 1–4 of Definition 2.1
with , , and
being replaced by ,
, and
, respectively.
We call a –internal
non-principal ultrafilter on .
Definition 2.8
Let . Denote for the model such that
|
|
|
|
|
|
for all . Let
be the natural embedding that for every
.
We call a –internal ultrapower of modulo
the –internal ultrafilter .
Proposition 2.9
If is an –formula with
bounded quantifier and
, then
|
|
|
Corollary 2.10
The natural embedding is a bounded
elementary embedding from to .
The proof of Proposition 2.9 is almost the same as the
proof of Proposition 2.3 except one step that shows
implies . Let with
such that .
By the axiom of choice there is a well-order on . So,
every nonempty set has a -least element
by the transfer principle. Suppose that
. For each , if , let and if
, let be the -least element in
the nonempty set .
Since are in , so does the function
by Proposition 2.5.
Hence . By the induction hypothesis we have that
|
|
|
which implies
.
Let
|
|
|
(4) |
Then is a bounded elementary embedding from to ,
which will be used in Theorem 5.4.
Same as for we can assume by Mostowski collapsing that
is a subset of the superstructure and
is the true membership relation .
Notice that is an identity map.
The element is called –internal,
is called –internal for any , and
is called –internal for every .
Notice that and as subsets of
are not –internal.
If is a
function from to for some , then
for some by (3) with
being replaced by .
Hence is a proper end-extension of .
By the same way, we can define as a –internal ultrapower
of modulo .
Definition 2.11
Let be the –internal ultrafilter on .
Let be the model such that
|
|
|
|
|
|
for all , and define
the natural mebedding by
for every .
Generalizing the arguments above, we have that the map is a bounded
elementary embedding from to . We say that is a
nonstandard extension of . It is also easy to see that
is an end-extension of .
We have completed the construction of
and verified that Part 1 of Property 2.7 is true.
§2.6 Bounded elementary embeddings , , and To verify Part 2–3 of Property 2.7 we have to view the construction
of for from a different angle. For a set
and , let .
Let and be two non-principal ultrafilters on .
The tensor product of and is defined by
|
|
|
It is easy to check that is a non-principal
ultrafilter on . For simplicity we assume that
and are the same ultrafilter.
Lemma 2.12
Let where
iff
. Then
|
|
|
Proof Let range over the first copy of
and over the second copy of in .
Given , there is an
such that . Since the ranks of all image of is bounded, the range
of is in a –internal set .
So, by identifying with its graph, we have .
Since , there is a such that
where
is the graph of a function for all .
Now let be such that
. Then .
We view as a relation between
and . Notice that iff
.
Notice also that for any , we have
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hence, the relation
is an injective function from to .
On the other hand, given
, let be such that .
Let be such that . Then .
So, is such that
. This shows that
is surjective.
Notice also that for any ,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
This completes the proof of .
If we identify each of and
with its image of Mostowski collapsing, then and can be viewed
as the same model.
Lemma 2.13
Let where
iff
. Then
|
|
|
for any .
The proof of Lemma 2.13 can be found in [1, Proposition 6.5.2].
We call the external ultrapower of modulo . By Lemma 2.12
and Lemma 2.13 we can view and as the same model and write
for . To summarize, we have that
|
|
|
(5) |
The term on the left side is the two-step iteration when we take the ultrapowers by using
first and second. The term on the right side is when we use
first so that becomes , becomes ,
and becomes .
However, the bounded elementary embedding from to
induced by the –internal ultrapower modulo and the
bounded elementary embedding from to
induced by the external ultrapower of modulo are different.
Let be such that
for each .
If , then clearly we have .
If with and
for some , then for every .
Hence .
This shows that . The map will
be for a bounded elementary embedding defined below.
Notice that above arguments still work if the ultrafilters
and are different.
Generalizing the construction further and with the help of Mostowski
collapsing, we have that
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Let be the bounded elementary embedding
determined by (2) with for every
. Since and we conclude that the map
is also a bounded elementary embedding from
model expansion to .
Since , we have that coincides with
mentioned above when is constructed, and
is a bounded elementary embedding from to .
Hence Part 2 of Property 2.7 is verified.
Let be the bounded elementary embedding
determined by (2) with for every
. Since every –internal function from to for some
is equivalent to a constant function modulo we conclude
that is an identity map. Similar to the argument for , we have
for every .
This verifies Part 3 of Property 2.7.
The following propositions are –versions of Proposition 2.5
and Proposition 2.6. Let .
Proposition 2.14
Let be an –formula with
bounded quantifiers, and parameters and be –internal. Then
is a –internal set.
Proposition 2.15
Let be an infinite proper
initial segment of and not –internal.
Let be –internal.
If is upper unbounded in ,
then .
We would like to mention that , , and are ultrapowers of modulo
the ultrafilters , , and ,
respectively. Hence, they are all countably saturated (cf. [1, Corollary 4.4.24])
although the countable saturation is not used in this paper.
The proofs of Proposition 2.14 and Proposition 2.15
are the same as Proposition 2.5 and Proposition 2.6.
Notice also that and are –internal ultrapowers of
modulo the –internal ultrafilters and ,
respectively.
The ultrafilters , , and do not have to be on the
countable set and do not have to be the same. The only restriction
is that they have to be in .
Iterated nonstandard extensions were
used in combinatorial number theory before, e.g. in [2, 3, 6].
But we will use them in a new way by exploring the advantages of
various bounded elementary embeddings between and (see
Property 2.7). For most of the time we
work within . The nonstandard universe is only used for
one step in the proof of Lemma 5.1.
3 Nonstandard Versions of Some Facts
The Greek letters will represent
“standard” reals unless specified otherwise. Let
in this paragraph.
An is a –infinitesimal, denoted by
, if for every .
By an infinitesimal we mean a –infinitesimal. Denote
for the –standard part map, i.e., is the unique real number
such that when
for some . Notice that and are definable
by a formula with bounded quantifiers and parameters in .
Sometimes, the subscript will be dropped. For example,
means and means .
For any two positive integers we denote for
and .
Hence means .
Notice that if for some and ,
then where is in Part 2 of Property 2.7.
This is true because ,
, and iff
for each .
Similarly, we have
for in the domain of .
Capital letters , , , represent sets of integers
except , , , which are
reserved for integers in .
The letter represents exclusively the
length of the arithmetic progression in
Szemerédi’s theorem and represents an integer between and .
All unspecified sets mentioned in this paper will be either standard subsets of
or –internal sets for some .
For any let .
For any bounded set and
denote for the quantity in
where means the internal cardinality of in . Denote
for .
Notice that is an internal function while
are often external functions but definable in
for , i.e.,
|
|
|
We often write
for . If and ,
then coincides with the Loeb measure of
in . The term is often used for
an internal argument.
: The abbreviation a.p. stands for “arithmetic progression”
and –a.p. stands for “-term arithmetic progression.”
The length of an a.p. is the number of terms in which
can be written as . The letters are reserved exclusively for
a.p.’s of length , and for a.p.’s of
length or other standard length. When we run out of letters,
we may also use for –a.p.’s. If , then
represents the -th term of .
We denote for for all .
We allow the common difference of an a.p. to be any
integer including, occasionally, the trivial case for .
If and are two a.p.’s of the same length, then
represents the –a.p. . If is an a.p. and
is an element or a set, then represents the
sequence . By
we mean for every .
If and let be
the supremum of in the sense of , i.e., the unique least upper
bound of in , or if is unbounded
above in .
Let . For any and
any collection of a.p.’s ,
there exists with
(because every subset of is
in ) such that iff there exists a
with and .
By the elementality of in Part 2 of Property 2.7 we have that
for any and
any collection of a.p.’s ,
there exists with such that iff there exists a
with and .
Therefore, the operator and hence below are well defined.
Similarly, below is also well defined.
Definition 3.1
For and with the strong upper
Banach density of in is defined by
|
|
|
(8) |
The letter above always represents an a.p.
If has and , the
strong upper Banach density of relative to
is defined by
|
|
|
(9) |
When , defined in (9), is used in this paper,
the set is often a subset of although there is no such restriction
in the definition.
Definition 3.2
If , then the strong upper Banach density
of is defined by where is defined
by (8) and is defined by (4).
We would like to point out that for standard sets ,
|
|
|
This equality will not be used. The purpose here is to give the reader
some intuition because the right side is a standard expression.
The superscript in will be omitted.
Notice that the upper density of a set is
less than or equal to the upper Banach density of , which is less than
or equal to the strong upper Banach density of .
The strong upper Banach density of is the nonstandard version of
the density of along a collection of arbitrarily long
arithmetic progressions satisfying the double counting property in [9].
Of course, if we know that Szemerédi’s theorem is true, then
implies that . Also and imply that .
Suppose that the strong upper Banach density of is a positive
real number . Instead of looking for –a.p.’s in we will look for
–a.p.’s in for some infinitely long a.p. such that
the distribution of in is very uniform, i.e., the measure
and strong upper Banach density of in are the same value .
The uniformity allows the use of an argument similar to
the so called density increment argument in the standard literature.
The next lemma is the beginning of this effort.
Lemma 3.3
For let
with
and with .
Then the following are true:
-
1.
iff there exists a
with and ;
-
2.
If , then there exists a with
such that ;
-
3.
Suppose . Then iff
there exists a with ,
, and ;
-
4.
Suppose . If , then
there exists a with
such that and
.
Proof Part 1: If , then there is a with
such that for
every . Let
|
|
|
Then is –internal
and is unbounded above in . By Proposition 2.15,
there is a . Hence there is an a.p. such that
and .
Therefore, .
On the other hand, if , then
by the definition of in (8).
Part 2: If , we can find with
such that by Part 1. Clearly,
by the definition of . Hence .
Part 3: If , then there is a with
such that and
for every .
By Proposition 2.15 as in the proof of Part 1
there is a for some
with such that
and ,
which implies and .
On the other hand, if and
, then by
the definition of in (9).
Part 4: If , then and
for
some with
by Part 3. Clearly, by the definition of .
Hence .
The following lemma is the internal version of
an argument similar to so-called the double counting property in
the standard literature. Let .
Roughly speaking, if is very uniformly distributed
in with measure , then for almost all
the measure of inside is . Since
the measure is not an internal function we use instead
and require for some infinite
instead of . The lemma is stated in a
more general case with being viewed as the “standard” universe
in .
Lemma 3.4
Let , ,
and with
for . For each let
|
|
|
(10) |
Then there exists a
such that .
Notice that if .
Proof Fix , , and . The
subscripts and in will be omitted in the proof.
If , then for every we have
by the supremality of . Hence the maximal
with such that
for every is in . Now works.
Assume that . So, and
coincide. If for every , then the maximal
satisfying must be in
by Proposition 2.15. Hence .
So we can assume that for some
and derive a contradiction.
Notice that for each , it is impossible to have
by the definition of .
Let . Then .
Notice that implies
.
By the following double counting argument, by ignoring some
–infinitesimal amount inside , we have
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
which is absurd. This completes the proof.
Suppose , in ,
, ,
, and . For each let
be the following internal statement:
|
|
|
(11) |
The statement infers
that the densities of in the interval go to
, respectively, as in . The statement
will be referred a few times in Lemma 5.1 and its proof.
The following lemma is the application of Lemma 3.4 to the
sets simultaneously.
Lemma 3.5
Let ,
, and be such that
, , and
for some
and . For any let
|
|
|
(12) |
-
(a)
For each with
there exists a such that ;
-
(b)
For each , there is an with
such that .
Proof Part (a): Applying Lemma 3.4 for
and we can find
such that and
where is defined in (10) and is replaced by
for and for . Let .
For each let
|
|
|
|
|
|
Notice that both and are –internal.
If for some ,
then .
Let . Then we have and
, which contradicts . Hence
for every . By
Proposition 2.15 we can
find such that for any
. If for some ,
then
for some by the fact that .
Hence for every .
By Proposition 2.15 again we can find such that
for any .
The proof is complete by setting and
|
|
|
Part (b): Suppose Part (b) is not true. Then there exists an such that
for any in . By Proposition 2.15
there is an
such that . By Part (a) there is a
such that . We have a contradiction because and hence
.
Notice that for a given one can choose to be the least
such that in Lemma 3.5 (b).
So we can assume that is an internal function of .
Hence we can assume that is also an internal function
of .
5 Proof of Szemerédi’s Theorem
We work within in this section except in
the proof of Claim 1 in Lemma 5.1 where is needed.
Szemerédi’s theorem is an easy consequence of
Lemma 5.1, denoted by for all .
For an integer define an interval by
|
|
|
(17) |
The set is the subinterval of
in the middle of with the length
for or . If , then .
For notational convenience we denote
|
|
|
(18) |
: Fix a . The number is the length
of an interval which will play an important role in Lemma 5.1.
Keeping unchanged is one of the advantages from nonstandard analysis,
which is unavailable in the standard setting.
There is a summary of ideas used in the proof of Lemma 5.1 right after
the proof. It explains some motivation of the steps taken in the proof.
Lemma 5.1 ()
Given any , ,
any , and any
and with
|
|
|
(19) |
the following are true:
-
L
:
There exists a –a.p. with satisfying the
statement for
, where is defined in (11), and there exist
with where is defined
in (17) and with for every
, and
collections of –a.p.’s
|
|
|
|
|
|
(20) |
satisfying for all and , and
|
|
|
(21) |
satisfying
for all and .
-
L
): There exist a set
of –consecutive integers
where is defined in (18)
and a collection of –a.p.’s
such that for each we have
for , for , and .
Lemma 5.3
implies
for any
satisfying the conditions of Lemma 5.1.
Proof Assume we have obtained the –a.p. with , sets
and with
and , and
collections of –a.p.’s and
as in .
Call a –a.p.
good if
for and bad otherwise. Let be
the collection of all good –a.p.’s and
be the collection of all bad –a.p.’s.
Let .
Then .
We show that .
Let .
Notice that
|
|
|
and for each ,
iff and .
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Notice that . Hence we have
and
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
which implies .
Recall that . Hence
contains a set of consecutive integers.
So, is proven if we let where
is one of the –a.p.’s in such that .
The idea of the proof of Lemma 5.3 is due to Szemerédi. See
[9].
Proof of Lemma 5.1 We prove
by induction on . By Lemmas 5.3
it suffices to prove .
For , given any , ,
, , , and satisfying
(19), by Lemma 3.5 (b) we can find a –a.p. such that
is true
for ,
where is defined in (11).
For each let and .
For each , , and let
|
|
|
Clearly, we have . By some pruning
we can assume that .
It is trivial that and
iff for each
. This completes the proof of
. follows from
Lemma 5.3.
Assume is true for some .
We now prove . Given any and ,
fix , , and
satisfying (19).
For each , by Lemma 3.5 (b),
there is an in
and defined in
(12) such that .
Notice that
because and .
Let and fix an .
Claim 1 The following internal statement is true:
is a collection of –a.p.’s such that
|
|
|
|
|
|
Proof of Claim 1 Working in by considering as the standard universe, we can find
with by Lemma 3.3
and Part 2 of Property 2.7 such that
|
|
|
For each
let . Since there are at most
different ’s and ,
we can find one, say, such that the set
|
|
|
satisfies .
Notice that could be .
Let with be
such that and
|
|
|
by Part 4 of Lemma 3.3 and Part 2 of Property 2.7.
Let be the common difference of the a.p.
and be the order-preserving bijection, i.e.,
|
|
|
Let and
.
We have that and in the place of
and satisfy the –version of (19)
with , and being replaced by , , and .
Let , , and where is in Part
3 of Property 2.7. Recall that is an identity map.
Since , we have
.
Notice also that
and .
By the induction hypothesis that is true we have
|
|
|
|
|
|
|
|
|
|
|
|
|
Since and are
elementarily equivalent by Part 2 of Property 2.7 via , we have, by
universal instantiation, that
|
|
|
(23) |
Notice that the right side above no longer depends on or .
So, we have
|
|
|
(24) |
because and .
Since is a bounded elementary embedding, we have
|
|
|
which means that there is a set of
–consecutive integers
and a collection of –a.p.’s such that
for every we have for , ,
and for .
Notice that . Let
and , where
,
such that for each we have
for ,
, and
for . If , then . If
, then . Hence
. (Claim 1)
The following claim follows from Claim 1 by Proposition 2.15.
Claim 2 There exists a such that the
is true, i.e.,
is a collection of –a.p.’s such that
,
, and .
For notational convenience let and be obtained in Claim 2
and rename , ,
for some (or any) and . Let
be the increasing enumeration of .
Notice that .
We now go back to consider as our standard universe. Notice that
, , and
is true for every where is defined in (11).
Claim 3 For each we can find an internal
with such that for each
and each , and is true.
Proof of Claim 3 For each we have
is true because .
By Lemma 3.5 (a), we can find a set
with such that
is true
for every . Set
|
|
|
Then we have and . (Claim 3)
Notice that .
By Proposition 2.15 we can find and
|
|
|
such that . Hence .
Applying the induction hypothesis for
, we obtain
a –a.p. with ,
with and
with for each ,
and collections of –a.p.’s
|
|
|
such that (i) for each and
we have
and for each we have ,
for , , and (ii)
for each and we have
, and for each
we have iff
for every and .
For each , , and
let
|
|
|
Then , , and .
Since we have that for each ,
is true.
Applying Part (iii) of Lemma 4.2 with
and being replaced by
we can find ,
with and with
for each such that for each and we have
|
|
|
(25) |
|
|
|
(26) |
Let .
Clearly, we have .
We also have that ,
,
and
because and
. For each , , and let
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Then by (25).
If , then there is a such that . If
for and
for some such that , then
for , , and ,
which imply by induction hypothesis.
Hence we have . Clearly,
implies .
Thus we have .
Clearly, by (26).
Summarizing the argument above we have that for
each
-
•
for
because ,
-
•
,
-
•
for because of ,
-
•
.
For each ,
iff there is a with
such that
-
•
for
because ,
-
•
which is equivalent to
,
-
•
.
This completes the proof of
as well as by Lemma 5.3.
Summary of the ideas used in the proof of Lemma 5.1:
We want to use to create a sequence of blocks
of size
that are in arithmetic progression such that (a) the set
in the first blocks for
are identical copies of in , (b) the initial points
of all -st blocks form an a.p. of infinite length which should
be used when applying Part (iii) of Lemma 4.2,
(c) the initial points of the rest of the blocks satisfy an appropriate
version of (11) for some infinite , i.e.,
. Then we work
inside , using
to create collections of –a.p.’s and in instead of .
For applying Part (iii) of Lemma 4.2
we want to make sure, if we can, that
the -st terms of all form a set of positive measure
and the -st terms of all form
a set of positive measure. Then mixing one for some
with and at
-st terms yields and validating .
Unfortunately, using to create collections and of
–a.p.’s in
cannot guarantee that the set of the -st terms of all
and the set of the -st terms of
all have positive measures in (the positive measures
can be guaranteed
when but not for ). So, instead of getting
the sets and to have positive measures
in we make sure that the set
and have positive measures in some
subinterval of length in ,
where is fixed and could be much smaller than .
This is achieved by requiring . When we
use the mixing lemma we want to make sure that in all of these relevant
intervals
of length has measure . This requirement
is again achieved by requiring and
after shrinking to its initial
segment .
Since we want to be significantly smaller than
we assume that and is at least one universe apart. Hence must be
at least in . If we want to apply
for instead of , then must also be at least in
. If we want to have a –standard
positive measure, must be in in order to use the
–version of with being considered
as the “standard” universe. But can only be guaranteed
one universe apart from by Definition 3.1 even though
is assumed to be at least two universes apart from .
Therefore, we use (instead of ) which leads to
and use to lift to while keeping .
This allows the use of –version of with being
considered as the “standard” universe. Then spill over
to and apply with
being replaced by to obtain
desired collections
and of –a.p.’s. All these steps rely on the fact that
is an internal statement with internal parameters.
Theorem 5.4 (E. Szemerédi, 1975)
Let . If has positive upper density,
then contains nontrivial -term arithmetic progressions.
Proof It suffices to find a nontrivial
–a.p. in . Let be an a.p. such that and
. Then
because is greater than or equal to the upper density of .
Let .
Without loss of generality, we can assume for some .
We can also assume that because otherwise replace
by and by .
Then we have . Set . Trivially,
, , and
. To start with
instead of , we have many nontrivial –a.p.’s
such that for
in . So there must be many
nontrivial –a.p.’s in . By ,
there must be nontrivial –a.p.’s in .