Documente Academic
Documente Profesional
Documente Cultură
J. Donald Monk
Set theory
1. First-order logic
Here we describe the rigorous logical framework for set theory, indicating some central
notions and facts, without proofs.
Except for proofs, this treatment is self-contained, but a more thorough study of firstorder logic, going into the proofs of the statements in this chapter, is a good idea for anyone
studying set theory carefully. See the references at the end of the chapter.
We assume some elementary mathematics: notions and facts concerning finite sequences, the intuitive notion of a function, elementary properties of natural numbers, and
induction. The exact development of these things within set theory comes later. A finite sequence of objects a0 , . . . , an1 is denoted by ha0 , . . . , an1 i. We always assume that a finite
sequence has positive length n. The case n = 1 is important here. The finite sequence hai
of a single object a is considered as different from a itself. The concatenation of two finite
sequences ha0 , . . . , an1 i and hb0 , . . . , bm1 i is the sequence ha0 , . . . , an1 , b0 , . . . , bm1 i.
First order languages all have the following symbols in common:
n, a, e, x. (Symbols for negation, conjunction, equality, and there exists.)
v0 , v1 , . . . (Variables, intuitively ranging over some nonempty domain; see below for a more
exact description. In practice we use many other symbols in place of these.)
Our set-theoretic language has one more symbol m (membership).
An expression is a finite sequence of symbols. Expr is the set of all expressions. Now we
define functions , , =, , :
maps Expr into Expr, and = hni .
def
iff
ai = aj .
iff
ai Raj .
(A, R) |= [a0 , . . .]
iff
(A, R) |= ( )[a0 , . . .]
iff
(A, R) |= vi [a0 , . . .]
iff
A formula is universally valid iff for every structure (A, R) and every selection a0 , . . . of
elements of A we have (A, R) |= [a0 , . . .].
We now give the logical axioms. Here , , . . . are arbitrary formulas unless otherwise
restricted, and i, j, . . . are arbitrary members of .
( ) ( ).
( ) ( ).
( ).
[ ( )] [( ) ( )].
( ) ( ).
vi vi .
3
vi vi .
vi ( ) (vi vi ).
vi , if vi does not occur in .
vi = vi .
vi = vj vk (vk = vi vk = vj ) for k 6= i, j.
vi = vj (vi = vk vj = vk ).
vi = vj (vk = vi vk = vj ).
vi = vj (vi vk vj vk ).
vi = vj (vk vi vk vj ).
It is easy to check that every logical axiom is universally valid. Now let be a set of
formulas. A -proof is a finite sequence h0 , . . . , m of formulas such that for each i m
one of the following conditions holds:
i is a logical axiom.
i is a member of .
There exist j, k < i such that k is j i .
There is a j < i and a natural number k such that i is vk j .
A -theorem is a formula which appears in some -proof. We write to abbreviate
this.
The completeness theorem says that iff for every structure (A, R) and every
sequence a0 , a1 , . . . of members of A, if (A, R) |= [a0 , a1 , . . .] for every in , then also
(A, R) |= [a0 , a1 , . . .]. This theorem is rather hard to prove, and will not be used in these
notes.
Now much of these notes consists in demonstrating that ZFC for various formulas
, where ZFC is the set of axioms for set theory introduced in the next chapter.
We need a few more logical notions.
An occurrence of a variable vi in a formula is bound iff it occurs in a segment of
of the form vi for some formula ; otherwise the occurrence is free. A sentence is a
formula in which no variable occurs free. For any formula , the universal closure of
is the formula vi0 . . . vim1 , where vi0 , . . . , vim1 are the variables occurring free in ,
with i0 < < im1 .
Here are some examples illustrating these notions:
The first occurrence of v0 in the formula v0 = v1 v0 (v0 = v0 ) is free; the other
occurrences are bound.
v0 occurs free in the formula v0 = v2 , but not in the bigger formula v0 (v0 = v2 ).
v0 = v0 is a not a sentence, but v0 (v0 = v0 ) is.
4
There are some important universally valid formulas involving the notions of free and
bound variables. These can be derived from our logical axioms; i.e., for each of the following
formulas , one can show that .
vi , if vi does not occur free in .
vi , where is obtained from by replacing each free occurrence of vi by vj ,
provided that no free occurrence of vi in becomes a bound occurrence of vj .
If is obtained from by replacing each bound occurrence of vi by vj , where vj does not
occur in , then is universally valid.
EXERCISES
E1.1. Prove Theorem 1.1.
E1.2. We change the basic logical set-up as follows. Two new symbols are added, ( and
). An atomic formula has one of the following forms: hvi , e, vj i, hvi , m, vj i. If and
are formulas, so are h(i h)i and h(i hai h)i. If is a formula and i a natural
number, then hx, vi i is a formula.
Formulate and prove a theorem analogous to Theorem 1.1 for this new set-up.
E1.3. Prove that the following formulas are universally valid, for any formulas , :
(i) ( ). (Hence we could have taken and as primitives instead
of , .)
(ii) ( ). (Hence we could have taken and as primitives instead
of , .)
E1.4. Define | to be . Prove that the following are universally valid:
(i) |.
(ii) ()|(). (Hence we could have taken just | as a primitive instead of
both and .)
E1.5. Show that vi vi is universally valid. (Hence we could have taken as
primitive instead of .)
E1.6. Prove the following:
(A, R) |= ( )[a0 , . . .] iff (A, R) |= [a0 , . . .] or (A, R) |= [a0 , . . .] (or both).
(A, R) |= ( )[a0 , . . .] iff from (A, R) |= [a0 , . . .] it follows that (A, R) |= [a0 , . . .].
(A, R) |= ( )[a0 , . . .] iff ( (A, R) |= [a0 , . . .] iff (A, R) |= [a0 , . . .]).
(A, R) |= vi [a0 , . . .] iff for every sequence b0 , . . . of elements of A such that aj = bj for
all j 6= i we have (A, R) |= [b0 , . . .].
E1.7. Check that every logical axiom is universally valid.
E1.8. Show that for any formula we have . Hint: start with
{ [( ) ]} {[ ( )] ( )}.
5
E1.9. A sentential proof is a proof in which only the first five axioms are used, and the
rule concerning is not used. We write s iff there is a sentential proof of from .
Show that if {} s , then s .
E1.10. Show that s ( ) [ ) ( )].
E1.11. Show that s .
E1.12. Show that vi = vi .
E1.13. Suppose that v0 = v1 ( ). Show that v0 = v1 v2 v2 ).
E1.14. Show that the following formula is universally valid, if vi does not occur free in :
vi .
E1.15. Show that vi is universally valid, where is obtained from by replacing
each free occurrence of vi by vj , provided that no free occurrence of vi in becomes a
bound occurrence of vj .
E1.16. Show that if is obtained from by replacing each bound occurrence of vi by vj ,
where vj does not occur in , then is universally valid.
2. ZFC axioms
Before introducing any set-theoretic axioms at all, we can introduce some important abbreviations.
x y abbreviates z(z x z y).
x y abbreviates x y x 6= y.
x
/ y abbreviates (x y).
x y abbreviates x(x y ).
x y abbreviates x(x y ).
In principle, any statement involving these abbreviations could be reformulated as a statement not using them; we take this as intuitively obvious.
Now we introduce the axioms of ZFC set theory. We give both a formal and informal
description of them. The informal versions will suffice for much of these notes.
Axiom 1. (Extensionality) If two sets have the same members, then they are equal.
Formally:
xy[z(z x z y) x = y].
Note that the other implication here holds on the basis of logic.
Axiom 2. (Comprehension) Given any set z and any property , there is a subset of z
consisting of those elements of z with the property .
Formally, for any formula with free variables among x, z, w1 , . . . , wn we have an
axiom
zw1 . . . wn yx(x y x z ).
Note that the variable y is not free in .
From these first two axioms the existence of a set with no members, the empty set ,
follows:
Proposition 2.1. There is a unique set with no members.
Proof. On the basis of logic, there is at least one set z. By the comprehension axiom,
let y be a set such that x(x y x z x 6= x). Thus y does not have any elements.
By the extensionality axiom, such a set y is unique.
In general, the set asserted to exist in the comprehension axiom is unique; we denote it by
{x z : }. We sometimes write {x : } if a suitable z is evident.
Axiom 3. (Pairing) For any sets x, y there is a set which has them as members (possibly
along with other sets). Formally:
xyz(x z y z).
The unordered pair {x, y} is by definition the set {u z : u = x or u = y}, where z is
as in the pairing axiom. The definition does not depend on the particular such z that is
chosen. This same remark can be made for several other definitions below. We define the
singleton {x} to be {x, x}.
7
Axiom 4. (Union) For any family A of sets, we can form a new set A which has as
elements all elements which are in at least one member of A (maybe A has even more
elements). Formally:
A AY x(x Y Y A x A).
With A Sas in this axiom, we define
x y = {x, y}.
Axiom 5. (Power set) For any set x, there is a set which has as elements all subsets of x,
and again possibly has more elements. Formally:
xyz(z x z y).
Axiom 6. (Infinity) There is a set which intuitively has infinitely many elements:
x[ x y x(y {y} x)].
If we take the smallest set x with these properties we get the natural numbers, as we will
see later.
Axiom 7. (Replacement) If a function has domain a set, then its range is also a set.
Here we use the intuitive notion of a function. Later we define the rigorous notion of a
function. The present intuitive notion is more general, however; it is expressed rigorously
as a formula with a function-like property. The rigorous version of this axiom runs as
follows.
For each formula with free variables among x, y, A, w1, . . . , wn , the following is an
axiom.
Aw1 . . . wn [x A!y Y x Ay Y ].
For the next axiom, we need another definition. For any sets x, y, let xy = {z x : z y}.
Axiom 8. (Foundation) Every nonempty set x has a member y which has no elements in
common with x. This is a somewhat mysterious axiom which rules out such anti-intuitive
situations as a a or a b a.
x[x 6= y x(x y = )].
Axiom 9. (Choice) This axiom will be discussed carefully later; it allows one to pick out
elements from each of an infinite family of sets. A convenient form of the axiom to start
with is as follows:
A [x A (x 6= )x A y A (x 6= y xy = ) Bx A !y(y xy B).
2nd (a, b) = b.
These are the first and second coordinates of the ordered pair.
The notion of intersection similar to that of union, but there is a minor problem
concerning what to define the intersection of the empty set to be. We have decided to let
it be the empty set.
Theorem 3.4. For any set F there is a set y such that if F 6= then x[x y
z F [x z]], while y = if F = .
9
where (x, y) is some set determined by x and y. That is, there is a formula (w, x, y) in
our set theoretic language such that x, y!w(x, y); (x, y) is this w. For example (x, y)
might be x y, or hx, yi. Then () really means
{w A : x, y[(w, x, y) (x, y)]}.
()
SS
R.
SS
R. Similarly y
SS
R.
This lemma justifies the following definitions of the domain and range of a set R (we think
of R as a relation, but the definitions apply to any set):
dmn(R) = {x : y(hx, yi R)};
rng(R) = {y : x(hx, yi R)}.
Now we define, using the notation above,
R1 = {hx, yi rng(R) dmn(R) : hy, xi R}.
This is the inverse or converse of R. Note that R1 is a relation, even if R is not. Clearly
hx, yi R1 iff hy, xi R, for any x, y, R. Usually we use this notation only when R is
a function (defined shortly as a special kind of relation), and even then it is more general
than one might expect, since the function in question does not have to be 1-1 (another
notion defined shortly).
A function is a relation f such that
x dmn(f )!y rng(f )[hx, yi f ].
Some common notation and terminology is as follows. f : A B means that f is a
function, dmn(f ) = A, and rng(f ) B. We say then that f maps A into B. If f : A B
and x A, then f (x) is the unique y such that hx, yi f . This is the value of f with
the argument x. We may write other things like fx , f x in place of f (x). Note that if
f, g : A B, then f = g iff a A[f (a) = g(a)]. If f : A B and C A, the
restriction of f to C is f (C B); it is denoted by f C. The image of a subset C of A
def
is f [C] = rng(f C). Note that f [C] = {f (c) : c C}. If D B then the preimage of D
def
iI
EXERCISES
In the exercises that ask for counterexamples, it is reasonable to use any prior knowledge
rather that restricting to the material in these notes.
E3.1.
S Prove
that
S if f : A B and hCi : i Ii is a system of subsets of A, then
f
C
=
iI i
iI f [Ci ].
E3.2. Prove that if f : A B and C, D A, then f [C D] f [C] f [D]. Give an
example showing that equality does not hold in general.
E3.3. Given f : A B and C, D A, compare f [C\D] and f [C]\f [D]: prove the
inclusions (if any) which hold, and give counterexamples for the inclusions that fail to
hold.
E3.4.S Prove that
: A B and hCi : i Ii is a system of subsets of B, then
S if f 1
1
f
[Ci ].
iI Ci =
iI f
E3.5.T Prove that
: A B and hCi : i Ii is a system of subsets of B, then
T if f 1
1
f
[Ci ].
iI Ci =
iI f
E3.6. Prove that if f : A B and C, D B, then f 1 [C\D] = f 1 [C]\f 1 [D].
E3.7. Prove that if f : A B and C A, then
{b B : f 1 [{b}] C} = B\f [A\C].
12
E3.8. For any sets A, B define AB = (A\B) (B\A); this is called the symmetric
difference of A and B. Prove that if A, B, C are given sets, then A(BC) = (AB)C.
E3.9. For any set A let
IdA = {hx, xi : x A}.
This is the identity function on the set A. Justify this definition on the basis of the axioms.
E3.10. Suppose that f : A B. Prove that f is surjective iff there is a g : B A such
that f g = IdB . Note: the axiom of choice might be needed.
E3.11. Let A be a nonempty set. Suppose that f : A B. Prove that f is injective iff
there is a g : B A such that g f = IdA .
E3.12. Suppose that f : A B. Prove that f is a bijection iff there is a g : B A such
that f g = IdB and g f = IdA . Prove this without using the axiom of choice.
E3.13. For any sets R, S define
R|S = {hx, zi : y(hx, yi R hy, zi S)}.
This is the relative product of R and S. Justify this definition on the basis of the axioms.
E3.14. Suppose that f, g : A A. Prove that
(A A)\[((A A)\f )|((A A)\g)]
is a function.
E3.15. Suppose that f : A B is a surjection, g : A C, and x, y A[f (x) = f (y)
g(x) = g(y)]. Prove that there is a function h : B C such that h f = g. Define h as a
set of ordered pairs.
E3.16. The statement
A A B B(A B) implies that
13
A.
B.
4. Ordinals
July 26, 2011
In this chapter we introduce the ordinals, prove a general recursion theorem, and develop
some elementary ordinal arithmetic.
A set A is transitive iff x Ay x(y A); in other words, iff every element of
A is a subset of A. This is a very important notion in the foundations of set theory, and
it is essential in our definition of ordinals. An ordinal number, or simply an ordinal, is a
transitive set of transitive sets. We use the first few Greek letters to denote ordinals. Note
that for any set x, x
/ x by the foundation axiom. If , , are ordinals and ,
then since is transitive. These two facts justify writing < instead of
when and are ordinals. This helps the intuition in thinking of the ordinals as kinds of
numbers. We also define iff < or = .
It is useful in what follows to introduce some of the terminology of ordered sets and
prove a lemma before proceeding with our development of ordinals. A poset or partially
ordered set is an ordered pair (A, <) such that < is a binary relation contained in A A,
it is irreflexive (there is no x such that x < x), and it is transitive (x < y and y < z
imply that x < y). Frequently we will simply write A for the poset, with the relation <
understood. Then we define x y iff x < y or x = y. Given a subset X of A, a minimal
element of X is an element x X such that there is no y X such that y < x. Elements
x, y of A are comparable iff x < y, y < x, or x = y. A poset is a simple ordered set or
linearly ordered set iff any two of its elements are comparable.
Lemma 4.1. Suppose that (A, <) is a poset satisfying the following conditions:
(i) Any nonempty subset of A has a minimal element.
(ii) If x, y A, x 6= y, and for all z A, z < y implies that z < x, then there is a
z < x such that z 6< y.
Then (A, <) is linearly ordered.
Proof. Suppose not; so there are incomparable elements in A. Let
X = {x A : there is a b A which is not comparable with x}.
Thus X is nonempty by supposition. By (i), let x be a minimal element of X. Then let
Y = {y A : x and y are not comparable}.
Hence Y is nonempty by the fact that x X. By (i), let y be a minimal element of Y .
Suppose that z < y. Then z
/ Y , and so x and z are comparable. If x z, then
x < y, contradicting the fact that y Y . It follows that z < x.
Thus we have shown that z < y implies that z < x. Hence by (ii) there is a z < x
such that z 6< y. By the choice of x we thus have that z
/ X, and so in particular z is
comparable with y. So y z, hence y < x, contradiction.
This lemma will be used shortly. Now we return to the discussion of ordinals; The following
properties of ordinals are easy to prove:
14
is an ordinal.
Because of this fact, the empty set is a number; it will turn out to be the first nonnegative
integer, and the first cardinal number. For this reason, we will use 0 and interchangably,
trying to use 0 when numbers are involved, and when they are not.
If is an ordinal, then so is {}.
We denote {} by + 1. After introducing addition of ordinals, it will turn out that
+ 1 = + 1 for every ordinal , so that the prime can be dropped. This ordinal + 1
is the successor of . We define 1 = 0 + 1, 2 = 1 + 1, etc. (up through 9; no further since
we do not want to try to justify decimal notation).
S
If A is a set of ordinals, then A is an ordinal.
S
S
We sometimes write sup(A) for A. In fact, A is the least ordinal each member of
A. We prove this shortly.
Every member of an ordinal is an ordinal.
There does not exist a set which has every ordinal as a member. In fact, suppose to
the contrary that A is such a set. Let B = {x A : x is an ordinal}. Then B is a set
of transitive sets and B itself is transitive. Hence B is an ordinal. So B A, and even
B B, contradicting the foundation axiom.
This fact is what happens in our axiomatic framework to the Burali-Forti paradox.
Now we can prove that the ordinals are simply ordered. More precisely, any set of ordinals
is linearly ordered.
Theorem 4.2. If and are ordinals, then = , , or .
Proof. Let and be ordinals. Let A = ( + 1) ( + 1). So (A, <) is a poset. We
are going to verify the conditions of Lemma 4.1. Suppose that X is a nonempty subset of
A. By the regularity axiom, choose X such that X = . Thus for any < we
have
/ X, since < is the same as among ordinals. So is a minimal element of X, as
desired in 4.1(i). For 4.1(ii), suppose that , A, 6= , and < implies that < .
Then since < is . Since 6= , choose \. Thus < but 6< , as desired in
4.1(ii).
So we apply 4.1 and infer that (A, <) is a linear order. Since , A, they are
comparable.
We give some more simple properties of ordinals. For some of them, it is convenient to use
Theorem 4.2 to check them.
iff .
< iff .
< iff + 1 .
S
These facts can be used to check the implicit statement above that SA is the leastSupper
bound of A, for any set A of ordinals. In fact, if S
A, then A,
S so A. If
for all A, then for each A, hence A , hence A .
15
T
T
Theorem
4.3.
If
is
a
nonempty
set
of
ordinals,
then
is
an
ordinal,
,
T
and for every .
T
Proof. The
members
of
are clearly ordinals,T
so for the first statement it suffices
T
to show that is transitive. Suppose that ; and suppose that T
. Then
, and hence since
T is transitive. This argument shows that . Since
is arbitrary, it follows that Tis transitive, and hence
is an ordinal.
T
For every T we have , and hence T by a previous
fact.
T
T
Suppose that
/ . For any T we have , hence , hence
<
T
since T but we are assuming that
/ . But this means that [ ]. So
T
, contradiction.
A linearly ordered set (A, <) is a well-ordered set iff every nonempty subset X of A has
a least element, i.e., an element x X such that x y for all y X. So 4.3 says that
for any ordinal , (, <) is a well-ordered set. Later in this chapter we will show that
every well-ordered set is isomorphic to an ordinal; this is the fundamental theorem about
ordinals.
Ordinals are divided into three classes as follows. First there is 0, the empty set. An
ordinal is a successor ordinal if = + 1 for some . Finally, is a limit ordinal if it
is nonzero and is not a successor ordinal. Thus 1, 2, etc. are successor ordinals.
To prove the existence of limit ordinals, we need the infinity axiom. Let x be as in
the statement of the infinity axiom. Thus 0 x, and y {y} x for all y x. We define
=
\
{z x : 0 z and y {y} z for all y z}.
This definition does not depend on the choice of x. The members of are natural numbers,
and the usual induction principle is built into the definition. It is easy to check that is
an ordinal, and in fact is the least limit ordinal.
Proposition 4.4. The following conditions are equivalent:
(i) is a limit ordinal;
(ii) 6= 0,Sand for every < there is a such that < < .
(iii) = 6= 0.
Proof. (i)(ii): suppose that is a limit ordinal. So 6= 0, by definition. Suppose
that < . Then + 1 < , since is not a successor ordinal. Thus = + 1 works
as indicated.
S
(ii)(iii): if , choose such that
is an ordinal.
S . Then since S
If , choose with < < . Thus . This proves that = , and 6= 0
is given.
S
(iii)(i): suppose that = + 1. Then = , so choose such that
. Thus < , so < , contradiction.
S
Also note that if = + 1, then = .
16
contradiction.
17
Proposition 4.7. Suppose that F is a normal ordinal class function with domain A,
and A is a limit ordinal. Then F() is a limit ordinal too.
S
Proof. Suppose that < F(). Thus < F(), so there is a < such that
< F(). Now F() < F(). Hence F() is a limit ordinal.
Proposition 4.8. Suppose that F and G are normal ordinal class functions, with
domains A, B respectively, and the range of F is contained in B. Then also G F is also
normal.
Proof. Clearly G F is strictly increasing. Now suppose that A is a limit ordinal.
Then F() is a limit ordinal by 4.7.
S
Suppose that < . Then F() < F(), so G(F()) <F() G() = G(F()).
Thus
[
G(F()) G(F()).
()
<
S
Now if < F(), then by the continuity of F, < < F(), and hence there is a
S < such that < F(); so G() < G(F()). So for any < F() we have G()
< G(F()). Hence
G(F()) =
G()
G(F());
<
<F()
(*) f is a function with domain , and for every < we have f () = G(f ).
First we show
(1) If f, satisfy (*) and g, satisfy (*) and , then f = g .
To prove this, we prove by transfinite induction on that if < then f () = g().
Suppose that this is true for all < , where < . Then f = g , so f () = G(f
) = G(g ) = g(), finishing the inductive proof.
(2) For every ordinal there is a function f such that (*) holds.
We prove this by transfinite induction. It trivially holds for = 0. Assume that it holds
for , and f is a function such that (*) holds for . Let h = f {(, G(f ))}. Clearly (*)
holds for h and + 1. Finally, suppose that is a limit ordinal, and (*) holds for every
< . By (1), for each < there is a unique
f satisfying (1); we denote it by f (the
S
replacement axiomSis being used). Let g = < f . Then g is a function by (1), and its
domain is clearly .
(3) For any < we have f = g , and g() = G(g ).
In fact, the first condition is clear. For the second,
g() = f+ 1 () = G(f+ 1 ) = G(g ).
So, (3) holds; hence (*) holds for .
This finishes the inductive proof of (2).
Now for any ordinal we let F() = f (), where f is chosen so that (*) holds for
Proof. Suppose not. Then then set B = {x A : f (x) < x} is nonempty. Let b be
the least element of B. Thus f (b) < b. Hence by the choice of b, we have f (b) f (f (b)).
Hence by 4.14, b f (b), contradiction.
Proposition 4.16. If (A, <) is a well-ordered set, then there does not exist a strictly
increasing function from A onto a proper initial segment of A.
Proof. Suppose that f is such a function. By 4.13, the proper initial segment is
determined by some element x. By 4.15, x f (x), contradicting the assumption that f
maps into the given proper initial segment.
Let (A, <) and (B, ) be linearly ordered sets. We say that they are similar if there is a
a strictly increasing function f mapping A onto B. We call f a similarity mapping, or an
order-isomorphism.
Proposition 4.17. If (A, <) and (B, ) are similar well-ordered sets, then there is
a unique strictly increasing function f mapping A onto B.
Proof. The existence of f follows from the definition. Suppose that both f and g
are strictly increasing functions mapping A onto B. Then f 1 g is a strictly increasing
function from A into A, so by 4.15 we get x (f 1 g)(x) for every x A; so f (x) g(x)
for every x A. Similarly, g(x) f (x) for every x A, so f = g.
Corollary 4.18. If 6= , then and are not similar.
Proof. Suppose to the contrary that f is a similarity mapping from onto , with
< . This contradicts 4.16.
21
Now by the recursion theorem let F be a ordinal class function with domain On such that
F() = G(F ) for each ordinal .
(1) If < and F() = A, then F() = A.
For, A\rng(F ) A\rng(F ), so (1) is clear.
(2) if < and F() 6= A, then F() 6= A and F() F().
The first assertion follows from (1). For the second assertion, note that A\rng(F )
A\rng(F ), hence F() A\rng(F ), so F() F() by definition. Also F()
rng(F ), and F()
/ rng(F ), so F() F(), as desired in (2).
By (2), F() cannot be 6= A for all , because then F would be a one-one function
mapping On into A, contradicting the replacement axiom. Choose minimum such that
F() = A. (Note that F(0) 6= A, since A is nonempty and so has a least element.) By (2),
F is one-one and maps onto A. So is similar to A.
By 4.17 and 4.20, every well-ordered set (A, ) is similar to a unique ordinal. This ordinal
is called the order type of (A, ), and is denoted by o.t.(A, ).
Ordinal addition
We define ordinal addition by transfinite recursion:
+ 0 = ;
+ ( + 1) = ( + ) + 1;
[
+ =
( + ) for limit.
<
We state the simplest properties of ordinal addition in the following theorem, but only
prove a couple of representative parts of it. Weakly increasing means that a < b implies
that F(a) F(b). An ordinal is infinite iff .
Theorem 4.22. (i) If m, n , then m + n .
(ii) For any ordinal , the class function F which takes each ordinal to + is a
normal function.
(iii) For any ordinal , the class function F which takes each ordinal to + is
continuous and weakly increasing.
(iv) + ( + ) = ( + ) + .
(v) + .
(vi) 0 + = .
(vii) iff there is a such that + = .
(viii) < iff there is a > 0 such that + = .
(ix) is infinite iff 1 + = .
Proof. (i) is easily proved by ordinary induction on n; this is the induction given by
the very definition of .
We prove (iv) by fixing and and proceeding by induction on . The case = 0 is
obvious. Assume that + ( + ) = ( + ) + . Then
+ ( + ( + 1)) = + (( + ) + 1)
= ( + ( + )) + 1
= (( + ) + ) + 1
= ( + ) + ( + 1).
Finally, suppose that is a limit ordinal and we know our result for all < . Let F, G, H
be the ordinal class functions such that, for any ordinal ,
F() = + ;
G() = ( + ) + ;
H() = + .
Thus according to (i), all three of these functions are normal. Hence, using 4.8,
+ ( + ) = F(H())
[
=
F(H())
<
( + ( + ))
<
(( + ) +
<
G()
<
= G()
= ( + ) + .
23
Note, with reference to 4.22(i), that addition, restricted to natural numbers, is ordinary
addition familiar to the reader. Also note that ordinal addition is not commutative in
general. For example, = 1 + 6= + 1.
There is an equivalent definition of ordinal addition which is more intuitive and direct:
Theorem 4.23. For any ordinals , let
= ( {0}) ( {1}).
We define a relation on as follows. For any x, y , x y iff one of the
following three conditions holds:
(i) There are , < such that x = (, 0), y = (, 0), and < .
(ii) There are , < such that x = (, 1), y = (, 1), and < .
(ii) There are < and < such that x = (, 0) and y = (, 1).
Then ( , ) is a well-order which is order-isomorphic to + .
A simple picture helps to explain the construction in this theorem:
Thus a copy of is put to the right of a copy of . The purpose of the definition of
is to make the copies of and disjoint.
Proof. Clearly is a well-order. We show by transfinite induction on , with
fixed, that ( , ) is order isomorphic to + . For = 0 we have + = + 0 = ,
while = 0 = {0}. Clearly 7 (, 0) defines an order-isomorphism from
onto ( {0}, ). So our result holds for = 0. Assume it for , and suppose that f is
an order-isomorphism from + onto ( , ). Now the last element of ( + 1) is
(, 1), and the last element of + ( + 1) is + , so the function
f {( + , (, 1))}
is an order-isomorphism from + ( + 1) onto ( + 1).
Now assume that is a limit ordinal, and for each < , the ordinal + is
isomorphic to . For each such let f be the unique isomorphism from + onto
. Note that if < < , then f is an isomorphism from + onto ; hence
f = f . It follows that
[
f
<
24
Ordinal multiplication
We define ordinal multiplication by recursion:
0 = 0;
( + 1) = + ;
[
( ) for limit.
=
<
Finally, suppose that is a limit ordinal and we know (v) for all < . Then
( + ) = G(F())
= (G F)()
[
=
(G F)()
<
( ( + ))
<
(( ) + ( ))
<
F (G())
<
(F G)()
<
= (F G)()
= ( ) + ( ),
as desired. This completes the proof of (v).
The following is a generalization of the division algorithm for natural numbers. That
algorithm is very useful for the arithmetic of natural numbers, and the ordinal version is
a basic result for more advanced arithmetic of ordinals.
Theorem 4.25. (Division algorithm) Suppose that and are ordinals, with 6= 0.
Then there are unique ordinals , such that = + with < .
Proof. First we prove the existence. Note that < + 1 ( + 1). Thus there
is an ordinal number such that <S ; take the least such . Obviously 6= 0. If is
a limit ordinal, then because = < ( ), it follows that there is a < such that
< , contradicting the minimality of . Thus is a successor ordinal + 1. By the
definition of we have . Hence there is an ordinal such that + = . We
claim that < . Otherwise, = + + = ( + 1) = , contradicting
the definition of . This finishes the proof of existence.
For uniqueness, suppose that also = + with < . Suppose that 6= .
By symmetry, say < . Then
= + < + = ( + 1) + = ,
contradiction. Hence = . Hence also = .
We now give, similarly to the case of ordinal addition, an equivalent definition of ordinal
multiplication which is somewhat more intuitive than the definition above. Given ordinals
, , we define the following relation on :
(, ) ( , )
iff
for limit.
<
(iii) 0+1 = 0;
(iv) 0 = 1 for a limit ordinal;
(v) 1 = 1;
(vi) If 6= 0, then 6= 0;
(vii) If > 1 then < +1 ;
(viii) If > 1, then the ordinal class function assigning to each ordinal the value
is normal;
(ix) If > 1, then ;
(x) If 0 < < , then ;
(xi) For 6= 0, + = ;
(xii) For 6= 0, ( ) = .
The following is another kind of division algorithm for ordinals.
Theorem 4.28. (Extended division algorithm) Let and be ordinals, with 6= 0
and 1 < . Then there exist unique ordinals , , such that the following conditions hold:
(i) = + .
(ii) .
(iii) 0 < < ,
(iv) < .
Proof. We have < + 1 +1 ; so there is an ordinal such that < . We
take the least such . Clearly is a successor ordinal + 1. So we have < +1 .
Now 6= 0, since > 1. Hence by the division algorithm there are ordinals , such that
= + , with < . Now < ; for if , then
= + = +1 > ,
contradiction. We have 6= 0, as otherwise = < , contradiction.. Also, , since
= + .
This proves the existence of , , as called for in the theorem.
Suppose that , , also satisfy the indicated conditions; thus
(1) = + ,
(2) ,
(3) 0 < < ,
(4) < .
Suppose that 6= ; by symmetry, say that < . Then
= + < + = ( + 1) = +1 ,
contradiction. Hence = . Hence by the ordinary division algorithm we also have =
and = .
28
We can obtain an interesting normal form for ordinals by re-applying 4.28 to the remainder over and over again. That is the purpose of the following definitions and results.
This generalizes the ordinary decimal and binary systems of notation, by taking = 10 or
= 2 and restricting to natural numbers. For infinite ordinals it is useful to take = ;
this gives the Cantor normal form.
To abbreviate some long expressions, we let N (, m, , ) stand for the following statement:
is an ordinal > 1, m is a positive integer, and are sequences of ordinals each of
length m, and:
(1) (0) > (1) > > (m 1);
(2) 0 < (i) < for each i < m.
If N (, m, , ), then we define
k(, m, , ) = (0) (0) + (1) (1) + + (m1) (m 1).
Lemma 4.29. Assume that N (, m, , ) and N (, n, , ). Then
(i) k(, m, , ) (0).
(ii) k(, m, , ) < (0) ((0) + 1) (0)+1 .
(iii) If (0) 6= (0), then k(, m, , ) < k(, n, , ) iff (0) < (0).
(iv) If (0) = (0) and (0) 6= (0), then k(, m, , ) < k(, n, , ) iff (0) < (0).
(v) If (j) = (j) and (j) = (j) for all j < i, while (i) 6= (i), then k(, m, , ) <
k(, n, , ) iff (i) < (i).
(vi) If (j) = (j) and (j) = (j) for all j < i, while (i) = (i) and (i) 6= (i),
then k(, m, , ) < k(, n, , ) iff (i) < (i).
(vii) If , , and m < n, then k(, m, , ) < k(, n, , )
Proof. (i): k(, m, , ) (0) (0).
(ii): We prove this by induction on m. It is clear for m = 1. Now assume that it holds
for m 1, where m > 1. Then
(0) (0) + (1) (1) + + (m1) (m 1) < (0) (0) + (1)+1
(0) (0) + (0)
= (0) ((0) + 1)
(0)
= (0)+1 ,
finishing the inductive proof.
For (iii), assume the hypothesis, and suppose that (0) < (0). Then
k(, m, , ) < (0) ((0) + 1) (0)+1
(0)
by (ii)
k(, n, , ) by (i).
29
(0)
((0) + 1) by (ii)
(0) (0)
k(, n, , )
By symmetry (iv) now follows.
(v) is clear from (iii), by deleting the first i summands of the sums.
(vi) is clear from (iv), by deleting the first i summands of the sums.
(vii) is clear.
Theorem 4.30. Let and be ordinals, with 6= 0 and 1 < . Then there exist a
unique m and finite sequences h(i) : i < mi and h(i) : i < mi of ordinals such that
the following conditions hold:
(i) = (0) (0) + (1) (1) + + (m1) (m 1).
(ii) (0) > (1) > > (m 1).
(iii) 0 < (i) < for each i < m.
Proof. For the existence, with > 1 fixed we proceed by induction on . Assume
that the theorem holds for every < such that 6= 0, and suppose that 6= 0. By
Theorem 4.28, let , , be such that
(1) = + ,
(2) ,
(3) 0 < < ,
(4) < .
If = 0, then we can take our sequences to be h(0)i and h(0)i, with (0) = and
(0) = . Now assume that > 0. Then
< + = ;
so < . Hence by the inductive assumption we can write
= (0) (0) + (1) (1) + + (m1) (m 1)
with
(5) (0) > (1) > > (m 1).
(6) 0 < (i) < for each i < m.
Then our desired sequences for are
h, (0), (1), . . . , (m 1)i and h, (0), (1), . . . , (m 1)i.
30
To prove this, we just need to show that > (0). If (0), then
(0) ,
contradiction.
This finishes the existence part of the proof.
For the uniqueness, we use the notation introduced above, and proceed by induction
on . Suppose the uniqueness statement holds for all nonzero < , and now we have
N (, m, , ), N (, n, , ), and
= k(, m, , ) = k(, n, , ).
We suppose that the uniqueness fails. Say m n. Then there is an i < m such that
(i) 6= (i) or (i) 6= (i); we take the least such i. Then we have a contradiction of
Lemma 4.29.
Now we give an equivalent definition of exponentiation similar to those given above for
addition and multiplication. A set X is finite iff there is a bijection from some natural
number m onto X.
Theorem 4.31. Suppose that and are ordinals, with 6= 0. We define
w
= {f : { < : f () 6= 0} is finite}.
For f, g w we write f g iff f 6= g and f () < g() for the greatest < for which
f () 6= g().
Then ( w , ) is a well-order which is order-isomorphic to the ordinal exponent .
Proof. If = 0, then = 1, and w also has only one element, the empty function
(= the emptyset). So, assume that 6= 0. If = 1, then w has only one member, namely
the function with domain whose value is always 0. This is clearly order-isomorphic to 1,
as desired. So, suppose that > 1.
Now we define a function f mapping into w . Let f (0) be the member of w
which takes only the value 0. Now suppose that 0 < < . By 4.30 write
= (0) (0) + (1) (1) + + (m1) (m 1),
where (0) > (1) > > (m 1) and 0 < (i) < for each i < m. Note that
(0) < , so (0) < . Then we define, for any < ,
(f ())() =
0
if
/ {(0), . . . , (m 1)},
(i) if = (i) with i < m.
where (0) > (1) > > (m 1). Let (i) = x((i)) for each i < m, and let
= (0) (0) + (1) (1) + + (m1) (m 1).
Clearly then f () = x.
Now we complete the proof by showing that for any , < , < iff f () < f ().
This equivalence is clear if one of , is 0, so suppose that both are nonzero. Write
= (0) (0) + (1) (1) + + (m1) (m 1),
where (0) > (1) > > (m 1) and 0 < (i) < for each i < m, and
=
(0)
(0) +
(1)
(1) + +
(n1)
(n 1),
where (0) > (1) > > (n 1) and 0 < (i) < for each i < n.
By symmetry we may suppose that m n. Note that N (, m, , ), k(, m, , ) = ,
N (, n, , ), and k(, n, , ) = . We now consider several possibilities.
Case 1. = . Then clearly f () = f ().
Case 4. , , and m < n. Thus < . Also, (m) is the largest < such
that (f ())() 6= (f ())(), and (f ())() = 0 < (m) = (f ())( (m)), so f () < f ().
Case 3. There is an i < m such that (j) = (j) and (j) = (j) for all j < i, while
(i) 6= (i). By symmetry, say that (i) < (i). Then we have < . Since (i) is the
largest < such that (f ())() 6= (f ())(), and (f ())( (i)) = 0 < (i) = (f ())( (i)),
we also have f () < f ().
Case 4. There is an i < m such that (j) = (j) and (j) = (j) for all j < i, while
(i) = (i) and (i) 6= (i). By symmetry, say that (i) < (i). Then we have < .
Since (i) is the largest < such that (f ())() 6= (f ())(), and (f ())( (i)) = (i) <
(i) = (f ())( (i)), we also have f () < f ().
We finish this chapter with two important characterizations of absorption properties of
ordinals.
Lemma 4.32. If < , then + = .
Proof. First we prove
(1) If < , then + = .
In fact, suppose that < . Then there is a nonzero such that + = . Then
+ = + + = + = (1 + ) = = .
By an easy ordinary induction, we obtain from (1)
(2) If < and m , then m + = .
Now we turn to the general case. If = 0 or < , the desired conclusion is clear. So
suppose that and > 0. Then we can write = m + with m and < .
Then
+ = m + + (m + 1) + =
32
(iii)(i): Assume (iii). Clearly 0, 1, 2 satisfy (i), so assume that = ( ) . Take any
< with 6= 0. If < , clearly = . So assume that . Write = m +
with m and < . Then < , and so
= (
= ( m + ) (
( m + ) (
= (m + 1) (
+1 (
= +1+
= ( )
=
33
EXERCISES
E4.1. Prove that the following conditions are equivalent:
(i) x is an ordinal;
(ii) x is transitive, and for all y, z x, either y z, y = z, or z y;
(iii) x is transitive, and for all y, if y x and y is transitive, then y x;
S
S(iv) for all y x, either y {y} = x or y {y} x; and for all y x, either y = x
or y x;
(v) x is transitive and {(y, z) x x : y z} well-orders x.
Hint: prove this in the following order: (i)(v) [easy]; (v)(ii) [obvious]; (ii)(iii)
[Assume that y x, y transitive; apply the foundation axiom to x\y]; (iii)(i) [Let
y = {z x : z is an ordinal}, and get a contradiction from assuming that y x]; (i)(iv)
[easy]; (iv)(i) [Let be an ordinal not in x, and take the least ordinal {} such
that
/ x. Work with to show that x is an ordinal.
E4.2. Show that (2 2) 6= 2 2 .
E4.3. Use the transfinite recursion principle to justify the definition of ordinal addition.
E4.4. Use the transfinite recursion principle to justify the definition of ordinal multiplication.
E4.5. Use the transfinite recursion principle to justify the definition of ordinal exponentiation.
E4.6. Formulate and prove a set version of the transfinite recursion principle.
E4.7. Suppose that , , are ordinal numbers with < . Prove that + + =
+ .
E4.8. Show that for every nonzero ordinal there are only finitely many ordinals such
that = for some .
E4.9. Prove that n(
= (
(ii) There exist an ordinal and natural numbers k, l such that = k and = l.
References
Sierpi
nski, W. Cardinal and ordinal numbers. Pa
n. Wyd. Naukowe, 1958, 487 pp.
35
<D =
<.
(B,<)Q
Clearly (D, <D ) is a linear order with D A. Suppose that X is a nonempty subset of
D. Fix x X, and choose (B, <) Q such that x B. Then X B is a nonempty
subset of B; let x be its least element under <. Suppose that y X and y <D x. Choose
(C, ) Q such that x, y C and y x. Since Q is simply ordered by , we have two
cases.
Case 1. (C, ) (B, <). Then y C B and y X. so y < x, contradicting the
choice of x.
Case 2. (B, <) (C, ). If y B, then y < x, contradicting the choice of x. So
y C\B. But then x y, contradiction.
Thus we have shown that x is the <D -least element of X. So (D, <D ) is the desired
upper bound for Q.
Having verified the hypotheses of Zorns lemma, we get a maximal element (B, <) of
P . Suppose that B 6= A. Choose any a A\B, and let
C = B {a},
<C =< {(b, a) : b B}.
Clearly this gives an element (C, <C ) of P such that (B, <) (C, <C ), contradiction.
Well-ordering principle Axiom of choice. Assume the well-ordering
principle,
S
and let A be a family of pairwise disjoint nonempty sets. Let C = A , and let be
a well-order of C. Define B = {c C : c is the -least element of the P A for which
c P }. Clearly B has exactly one element in common with each member of A.
There are many statements which are equivalent to the axiom of choice on the basis of ZF.
We list some striking ones. A fairly complete list is in
37
without going into any side issues. We follow Wagon, The Banach-Tarski paradox,
where variations and connections to measure theory are explained in full. The proof
involves very little set theory, only the axiom of choice. Some third semester calculus and
some linear algebra and simple group theory are used. Altogether the proof should be
accessible to a first-year graduate student who has seen some applications of the axiom of
choice.
We start with some preliminaries on geometry and linear algebra. The reassembling
mentioned in the Banach-Tarski paradox is entirely done by rotations and translations.
Given a line in 3-space and an angle , we imagine the rotation about the given line
through the angle . Mainly we will be interested in rotations about lines that go through
the origin. We indicate how to obtain the matrix representations of such rotations. First
suppose that is the rotation about the z-axis counterclockwise through the angle .
Then, using polar coordinates,
x
r cos
y = r sin
z
z
r cos( + )
= r sin( + )
z
x cos y sin
= x sin + y cos ,
z
which gives the matrix representation of :
cos
sin
0
sin
cos
0
0
0.
1
1
0
0
0
cos
sin
0
sin
cos
and
cos
0
sin
0 sin
1
0 .
0 cos
Next, note that any rotation with respect to a line through the origin can be obtained
as a composition of rotations about the three axes. This is easy to see using spherical
coordinates. If l is a line through the origin and a point P different from the origin with
spherical coordinates , , , a rotation about l through an angle can be obtained as
39
follows: rotate about the z axis through the angle , then about the y-axis through the
angle (thereby transforming l into the z-axis), then about the z-axis through the angle
, then back through about the y axis and through about the z-axis.
We want to connect this to linear algebra. Recall that a 3 3 matrix A is orthogonal
provided that it is invertible and AT = A1 . Thus the matrices above are orthogonal. A
matrix is orthogonal iff its columns form a basis for 3 R consisting of mutually orthogonal
unit vectors; this is easy to see. It is easy to check that a product of orthogonal matrices
is orthogonal. Hence all of the rotations about lines through the origin are represented by
orthogonal matrices.
Lemma 5.2. If A is an orthogonal 3 3 real matrix and X and Y are 3 1 column
vectors, then (AX) (AY ) = X Y , where is scalar multiplication.
Proof. This is a simple computation:
(AX) (AY ) = (AX)T (AY ) = X T AT AY = X T A1 AY = X T Y = X Y.
It follows p
that any rotation about a line through the origin preserves distance, because
|P Q| = (P Q) (P Q) for any vectors P and Q. Such rotations have an additional
property: their matrix representations have determinant 1. This is clear from the discussion
above. It turns out that this additional property characterizes the rotations about lines
through the origin (see M. Artin, Algebra), but we do not need to prove that. The
following property of such matrices is very useful, however.
Lemma 5.3. Suppose that A is an orthogonal 3 3 real matrix with determinant 1,
A not the identity. Then there is a non-zero 3 1 matrix X such that for any 3 1 matrix
Y,
AY = Y iff a R[Y = aX].
Proof. Note that AT (A I) = I AT = (I A)T . Hence
|A I| = |I A| = |(I A)T | = |AT (A I)| = |AT | |A I| = |A I|.
It follows that |A I| = 0. Hence the system of equations (A I)X = 0 has a nontrivial
solution, which gives the X we want. Namely, we then have AX = X, of course. Then
A(aX) = aAX = aX. This proves in the equivalence of the lemma. It remains to do
the converse. We may assume that X has length 1. Now we apply the Gram-Schmidt
process to get a basis for 3 R consisting of mutually orthogonal unit vectors, the first of
which is X. We put them into a matrix B as column vectors, X the first column. Note
that the first column of AB is X, since AX = X, and hence the first column of B 1 AB
is (1 0 0)T . Since B 1 AB is an orthogonal matrix, it follows because its columns are
mutually orthogonal that it has the form
1
0
0
0 0
a b.
c d
40
or
(a 1)e + bf = 0
ce + (d 1)f = 0
a 1
b
is 0. Thus adad+1bc =
Since (1) fails, it follows that the determinant
c
d 1
0. Now B 1 AB has determinant 1, and its determinant is adbc, so we infer that a+d = 2.
But a2 + c2 = 1 and b2 + d2 = 1 since the columns of B 1 AB are unit vectors, so |a| 1
and |d| 1. Hence a = d = 1 and b = c = 0. So B 1 AB is the identity matrix, so A is
also, contradiction. Hence (1) holds after all.
From (1) we get Y = B(u 0 0)T = uX, as desired.
One more remark on geometry: any rotation preserves distance. We already said this for
rotations about lines through the origin. If l does not go through the origin, one can use
a translation to transform it into a line through the origin, do the rotation then, and then
translate back. Since translations clearly preserve distance, so arbitrary rotations preserve
distance.
The first concrete step in the proof of the Banach, Tarski theorem is to describe a very
special group of permutations of 3 R. Let be the counterclockwise rotation about the
z-axis through the angle cos1 ( 31 ), and let the similar rotation about the x-axis. The
matrix representation of these rotations and their inverses is, by the above,
(1)
1 =
1
3
2 2
3
232
1
3
0
0
1
1
0
1
0
=
3
0 232
232 .
1
3
(1) For every terminal segment w of w of nonzero even length the vector w ( 1 0 0 )
T
has the form (1/3k ) ( a b 2 c ) , with a divisible by 3 and b not divisible by 5.
2
0
2
0
3
6
3
6
1
1
1
6 2 ; 1 = 2 2
1
6 2 ;
= 2 2
9
9
3
3
8
2 2
8
2 2
3
6
3
6
2
0
2
0
1
1
1
6 2 ; 1 1 = 2 2
1
6 2.
1 = 2 2
9
9
8 2 2
3
8
2 2
3
Now we proceed by induction. For w of length 2 we have
3
3
1
1
1
1
0 = 2 2 ; 1 0 = 2 2 ;
9
9
8
8
0
0
1
1
3
3
1
1
1 0 = 2 2 ; 1 1 0 = 2 2 ;
9
9
0
8
0
8
hence (1) holds in this case. The induction step:
a
3a
12b
1
1
k b 2 = k+2 2 2a + b 2 6 2c ;
3
3
c
8a + 4b + 3c
a
3a +12b
1
1
1 k b 2 = k+2 2 2a + b 2 6 2c ;
3
3
c
8a + 4b + 3c
a
3a
12b
1
1
1 k b 2 = k+2 2 2a + b 2 + 6 2c ;
3
3
c
8a 4b + 3c
a
3a +12b
1
1
1 1 k+2 k b 2 = 2 2a + b 2 + 6 2c .
3
3
c
8a 4b + 3c
So, our assertion (1) is true. If w itself is of even length, then a contradiction has been
reached, since b is not divisible by 5. If w is of odd length, then the following shows that
T
1
a
a 4b
1
1
k b 2 = k+1 2 2a + b 2 ;
3
3
c
3c
42
1
1
b 2 = k+1
k
3
3
c
a
+
4b
2 2a + b 2 .
3c
= w = 0 . . . n1 ;
since i = i for all i < m, we obtain I = m . . . n1 , I the identity. But hm , . . . , n1 i
is reduced, contradicting 5.4.
Thus neither w nor w is a proper initial segment of the other. Hence there is an
i < min(m, n) such that i 6= i , while j = j for all j < i (maybe i = 0). But then we
1
have by cancellation i . . . m1 = i . . . n1 , so n1
. . . i1 i . . . m1 = I.
1
But since i 6= i , the word hn1
, . . . , i1 , i , . . . , m1 i is reduced, again contradicting
5.4.
If G is a group and X is a set, we say that G acts on X if there is a homomorphism from
G into the group of all permutations of X. Usually this homomorphism will be denoted
by , so that g is the permutation of X corresponding to g G. (Most mathematicians
dont even use , using the same symbol for elements of the group and for the image
under the homomorphism.) An important example is: any group G acts on itself by left
multiplication. Thus for any g G, g : G G is defined by g(h) = g h, for all h G.
Let G act on a set X, and let E X. Then we say that E is G-paradoxical if
there are positive integers m, n and pairwise disjoint subsets A0 , . . . , Am1
S , B0 , . . . , Bn1
of E, and elements hgi : i < mi and hhi : i < ni of G such that E = i<m gi [Ai ] and
S
E = j<n hj [Bj ]. Note that this comes close to the Banach-Tarski formulation, except
that the sets X and E are unspecified.
Lemma 5.6. G0 , acting on itself by left multiplication, is G0 -paradoxical.
Proof. If is one of 1 , 1 , we denote by W () the set of all reduced words
beginning on the left with , and W () = {w : w W ()}. Thus, obviously,
G0 = {I} W () W (1 ) W () W (1 ),
where I is the identity element of G0 . These five sets are pairwise disjoint by 5.5. Thus
the lemma will be proved, with m = n = 2, by proving the following two statements:
(1 )].
(1) G0 = W () [W
43
x M , and h(z)
= y, z M . Then clearly x and z are in the same G-orbit, so x = z since
they are both in M . Then (g 1 h)(z) = z and g 1 h is not the identity, contradicting
the no non-trivial fixed
S point assumption.
SSo, (1) holds.
Now let Ai = gAi g[M ] and Bj = gBj g[M ], for all i < m and j < n.
(2) Ai Ak = 0 if i < k < m.
In fact, suppose that x Ai Ak . Then we can choose g Ai and h Ak such that
]. But g 6= h since Ai Ak = 0, so this contradicts (1). Similarly the
x g[M ] h[M
following two conditios hold:
(3) Bi Bk = 0 if i < k < n.
(4) Ai Bj = 0 if i < m and j < n.
S
(5) i<m gi [Ai ] = X.
For, let x X. Say by (1) that x g[M ]. Then by the choice of the Ai s there is an i < m
such that g gi [Ai ]. Say h Ai and g = gi (h) = gi h. Since x g[M ], say x = g(m)
44
] A , so x gi [A ],
with m M . Then x = (gi h)(m) = gi (h(m)).
Now h(m)
h[M
i
i
as desired in (5).
S
(6)
hi [B ] = X.
i<n
j<n
And
because E andS E are finitely G=equidecomposable we can find partitions E =
S
k<p Ck and E =
k<p Dk with elements fi G such that fi [Ck ] = Dk for all k < p.
1
Then the following sets are pairwise disjoint: Ai gi [Ck ] for i < m and k < p, and
1
Bj hj [Ck ] for j < n and k < p. And
[
[
E =
Dk =
fk [Ck ]
k<p
k<p
fk [Ck
gi [Ai ]]
i<m
k<p
fk [Ck gi [Ai ]]
k<p,i<m
k<p,i<m
and similarly
E =
1
(fk hj )[Bi hj [Ck ]].
k<p,j<n
through the angle / 2. Note that by the considerations at the beginning of this section,
T
T
consists of the translation ( x y z ) 7 ( x y z 12 ) , followed by the rotation
T
T
through 2 about the x-axis, followed by the translation ( x y z ) 7 ( x y z + 12 ) .
Lemma 5.13. For any positive integer k,
0
0
k
1
sin
k
2
0 =
2
k
1
0
+
2 cos
2
Hence p ( 0
0 ) 6= ( 0
1
2
Proof. The first equation is easily proved by induction on k. Then the second
p
is never equal to m
inequality follows since for any positive integer p, the argument
2
In fact, let y B\{0}. Let x = y/|y|. Then x S 2 , so there is an i < m such that
x gi [Ai ]. Say that x = gi (z) with z Ai . Then |y|z Ai , and gi (|y|z) = |y|gi(z) =
|y|x = y. So y gi [Ai ]. This proves the first equality in (2), and the second equality is
proved similarly.
So far, we have shown that B\{0} is G2 -paradoxical. Now we show that B and B\{0}
are finitely G3 -equidecomposable, which will finish the proof. By lemma 5.13 we have
B = D (B\D) G3 [D] (B\D) = B\{0}.
This proves the desired equidecomposablity.
As in the case of S 2 , a translation can be made if one wants one copy of B to be disjoint
from the other.
EXERCISES
In the first four exercises, we assume elementary background and ask for the proofs of
some standard mathematical facts that require the axiom of choice.
E5.1. Show that any vector space over a field has a basis (possibly infinite).
E5.2. A subset C of R is closed iff the following condition holds:
For every sequence f C, if f converges to a real number x, then x C.
Here to say that f converges to x means that
> 0M m M [|fm x| < ].
Prove that if hCm : m i is a sequence of nonempty closed
T subsets of R, m x, y
Cm [|x y| < 1/(m + 1)], and Cm Cn for m < n, then m Cm is nonempty. Hint: use
the Cauchy convergence criterion.
E5.3. Prove that every nontrivial commutative ring with identity has a maximal ideal.
Nontrivial means that 0 6= 1. Only very elementary definitions and facts are needed here;
they can be found in most abstract algebra books. Hint: use Zorns lemma.
E5.4. A function g : R R is continuous at a R iff for every sequence f R which
converges to a, the sequence g f converges to g(a). (See Exercise E5.2.) Show that g is
continuous at a iff the following condition holds:
> 0 > 0x R[|x a| < |g(x) g(a)| < ].
Hint: for , argue by contradiction.
E5.5 Show by induction on m, without using the axiom of choice, that if m and
hAi : i mi is a system of nonempty sets, then there is a function f with domain m such
that f (i) Ai for all i m.
48
E5.6 Using AC, prove the following, which is called the Principle of Dependent Choice
(which is also weaker than the axiom of choice, but cannot be proved in ZF). If A is a
nonempty set, R is a relation, R A A, and for every a A there is a b A such that
aRb, then there is a function f : A such that f (i)Rf (i + 1) for all i .
The remaining exercises outline proofs of some equivalents to the axiom of choice; so each
exercise states something provable in ZF. We are interested in the following statements.
(1) If < is a partial ordering and is a simple ordering which is a subset of <, then there
is a maximal (under ) simple ordering such that is a subset of , which in turn is
a subset of <.
(2) For any two sets A and B, either there is a one-one function mapping A into B or
there is a one-one function mapping B into A.
(3) For any two nonempty sets A and B, either there is a function mapping A onto B or
there is a function mapping B onto A.
(4) A family F of subsets of a set A has finite character if for all X A, X F iff every
finite subset of X is in F . Principle (4) says that every family of finite character has a
maximal element under .
(5) For any relation R there is a function f R such that dmn R = dmn f .
E5.7. Show that the axiom of choice implies (1). [Use Zorns lemma]
E5.8. Prove that (1) implies (2). [Given sets A and B, define f < g iff f and g are one-one
functions which are subsets of A B, and f g. Apply (1) to < and the empty simple
ordering.]
E5.9. Prove that (2) implies (3). [Easy]
E5.10. Show in ZF that for any set A there is an ordinal such that there is no one-one
function mapping into A. Hint: consider all well-orderings contained in A A.
E5.11. Prove that (3) implies the axiom of choice. [Show that any set A can be wellordered, as follows. Use exercise 4 to find an ordinal which cannot be mapped one-one
into P(A). Show that if f : A maps onto , then hf 1 [{}] : < i is a one-one
function from into P(A).
E5.12. Show that the axiom of choice implies (4). [Use Zorns lemma.]
E5.13. Show that (4) implies (5). [Given a relation R, let F consist of all functions
contained in R.]
E5.14. Show that (5) implies the axiom of choice. [Given a family hAi : i Ii of nonempty
sets, let R = {(i, x) : i I and x Ai }.]
49
6. Cardinals
This chapter is concerned with the basics of cardinal arithmetic.
Definition and basic properties
A cardinal, or cardinal number, is an ordinal such that there is no smaller ordinal
which can be put in one-one correspondence with . We generally use Greek letters , ,
for cardinals. Obviously if and are distinct cardinals, then they cannot be put in
one-one correspondence with each other.
Proposition 6.1. For any set X there is an ordinal which can be put in one-one
correspondence with X.
Proof. By the well-ordering principle, let < be a well-ordering of X. Then X under
< is isomorphic to an ordinal.
By this proposition, any set can be put in one-one correspondence with a cardinalnamely
the least ordinal that is in one-one correspondence with the set. This justifies the following
definition. For any set X, the cardinality, or size, or magnitude, etc. of X is the unique
cardinal |X| which can be put in one-one correspondence with X. The basic property of
this definition is given in the following theorem.
Theorem 6.2. For any sets X and Y , the following conditions are equivalent:
(i) |X| = |Y |.
(ii) There is a one-one function mapping X onto Y .
The following proposition gives obvious facts about the particular way that we have defined
the notion of cardinality.
Proposition 6.3. (i) || .
(ii) || = iff is a cardinal.
Proposition 6.4. Every natural number is a cardinal.
Proof. We prove by ordinary induction on n that for every natural number n and
for every natural number m, if m < n then there is no bijection from n to m. This is
vacuously true for n = 0. Now assume it for n, but suppose that m is a natural number
less than n + 1 and f is a bijection from n + 1 onto m. Since n + 1 6= 0, obviously also
m 6= 0. So m = m + 1 for some natural number m . [An easy induction shows that every
nonzero natural number is a successor ordinal.] Let g be the bijection from m onto m
which interchanges m and f (n) and leaves fixed all other elements of m. Then g f is a
bijection from n + 1 onto m which takes n to m . Hence (g f ) n is a bijection from n
onto m , and m < n, contradicting the inductive hypothesis.
Thus the natural numbers are the first cardinals, in the ordering of cardinals determined
by the fact that they are special kinds of ordinals. A set is finite iff it can be put in one-one
correspondence with some natural number; otherwise it is infinite. The following general
lemma helps to prove that is the next cardinal. It is easily proved by induction.
50
Lemma 6.5. If (A, <) is a simple ordering, then every finite nonempty subset of A
has a greatest element.
Theorem 6.6. is a cardinal.
It is harder to find larger cardinals, but they exist; in fact the collection of cardinals is so
big that, like the collection of ordinals, it does not exist as a set. We will see this a little
bit later.
Note that a cardinal is infinite iff it is greater or equal . The following fact will be
useful later.
Proposition 6.7. Every infinite cardinal is a limit ordinal.
Proof. Suppose not: is an infinite cardinal, and = + 1. We define f : as
follows: f (0) = , f (m + 1) = m for all m , and f () = for all \. Clearly f
is one-one and maps onto , contradiction.
Lemma 6.8. If and are cardinals and f : is one-one, then .
Proof. We define iff , and f () < f (). Clearly well-orders . Let g
be an isomorphism from (, ) onto an ordinal . Thus by the definition of cardinals.
If < < , then g 1 () g 1 (), hence by definition of , f (g 1 ()) < f (g 1 ()).
Thus f g 1 : is strictly increasing. Hence . We already know that , so
.
The purpose of this lemma is to prove the following basic theorem.
Theorem 6.9. If A B, then |A| |B|.
Proof. Let = |A|, = |B|, and let f and g be one-one functions from onto A and
of onto B, respectively. Then g f 1 is a one-one function from into , so .
Corollary 6.10. For any sets A and B the following conditions are equivalent:
(i) |A| |B|.
(ii) There is a one-one function mapping A into B.
(iii) A = 0, or there is a function mapping B onto A.
Corollary 6.11. If there is a one-one function from A into B and a one-one function
from B into A, then there is a one-one function from A onto B.
This corollary is called the Cantor-Bernstein, or Schroder-Bernstein theorem. Our proof,
if traced back, involves the axiom of choice. It can be proved without the axiom of choice,
and this is sometimes desirable when describing a small portion of set theory to students.
Some exercises outline such a proof.
The following simple theorem is very important and basic for the theory of cardinals.
It embodies in perhaps its simplest form the Cantor diagonal argument.
Theorem 6.12. For any set A we have |A| < |P(A)|.
Proof. The function given by a 7 {a} is a one-one function from A into P(A), and
so |A| |P(A)|. [Saying that a 7 {a} is giving the value of the function at the argument
a.] Suppose equality holds. Then there is a one-one function f mapping A onto P(A). Let
51
X = {a A : a
/ f (a)}. Since f maps onto P(A), choose a0 A such that f (a0 ) = X.
Then a0 X iff a0
/ X, contradiction.
By this theorem, for every ordinal there is a larger cardinal, namely |P()|. Hence we
can define + to be the least cardinal > . Cardinals of the form + are called successor
cardinals; other infinite cardinals are called limit cardinals. Is + = |P()|? The statement
that this is true for every infinite cardinal is the famous generalized continuum hypothesis
(GCH). The weaker statement that + = |P()| is the continuum hypothesis (CH).
It can be shown that the generalized continuum hypothesis is consistent with our
axioms. But also its negation is consistent; in fact, the negation of the weaker continuum
hypothesis is consistent. All of this under the assumption that our axioms are consistent.
(It is not possible to prove this consistency.)
S
Theorem 6.13. If is a set of cardinals, then is also a cardinal.
S
S
def S
Proof. We
S know already that is an ordinal. Suppose that = | | < . By
definition
of , there isSa such that < . (Membership is the same as <.) Now
S
. So = || | | = , contradiction.
We can now define the standard sequence of infinite cardinal numbers, by transfinite recursion:
0 = ;
+1 = +
;
[
=
for a limit ordinal.
<
For historical reasons, one sometimes writes in place of . Now we get the following
two results.
Lemma 6.14. If < , then < .
Lemma 6.15. for every ordinal .
Theorem 6.16. For every infinite cardinal there is an ordinal such that = .
Proof. Let be any infinite cardinal. Then < +1 . Here + 1 refers to
ordinal addition. This shows that there is an ordinal such that < ; choose the least
such . Clearly 6= 0 and is not a limit ordinal. Say = + 1. Then < +1 ,
so = .
We can now say a little more about the continuum hypothesis. Not only is it consistent that
it fails, but it is even consistent that |P()| = 2 , or |P()| = 17 , or |P()| = +1 ;
the possibilities have been spelled out in great detail. Some impossible situations are
|P()| = and |P()| = + ; we will establish this later in this chapter.
Addition of cardinals
Let and be cardinals. We define
+ = |{(, 0) : } {(, 1) : }|.
52
The idea is to take disjoint copies {0} and {1} of and and count the number
of elements in their union.
Two immediate remarks should be made about this definition. First of all, this is not,
in general, the same as the ordinal sum + . We depend on the context to distinguish
the two notions of addition. For example, + 1 = in the cardinal sense, but not in
the ordinal sense. In fact, we know that < + 1 in the ordinal sense. To show that
+ 1 = in the cardinal sense, it suffices to define a one-one function from onto the set
{(m, 0) : m } {(0, 1)}.
Let f (0) = (0, 1) and f (m + 1) = (m, 0) for any m .
Secondly, the definition is consistent with our definition of addition for natural numbers (as a special case of ordinal addition), and thus it does coincide with ordinal addition
when restricted to :
Proposition 6.17. If m and n are natural numbers, then addition in the sense of
chapter 2 and in the cardinal number sense are the same.
Proof. By induction on n.
Aside from simple facts about addition, there is the remarkable fact that + = for
every infinite cardinal . We shall prove this as a consequence of the similar result for
multiplication.
The definition of cardinal addition can be extended to infinite sums, and very elementary properties of the binary sum are then special cases of more general results; so we
proceed with the general definition. Let hi : i Ii be a system of cardinals (this just
means that is a function with domain I whose values are always cardinals). Then we
define
[
X
i = (i {i}) .
iI
iI
S
P
Proposition 6.21. If hAi : i Ii is any system of sets, then iI Ai iI |Ai |.
Corollary 6.22. If hi : i Ii is a system of cardinals, then
53
iI
iI
i .
iI
Proof. Most of these facts are very easy. We give the proof for (i). Q
According to
the
Q definition of product, it suffices to find a one-one function g mapping iI Ai onto
mapping Ai onto |Ai . (We are using
iI |Ai |. For each i I, let fi be a one-one function
Q
the axiom of choice here.) Then for each x iI Ai and each i I let g(x)i = fi (xi ). It
is easily checked that g is as desired.
General commutative, associative, and distributive laws hold also:
Proposition 6.33. (Commutative law) If hi : i Ii is a system of cardinals and
f : I I is one-one and onto, then
Y
Y
i =
f (i) .
iI
then
iI
Y Y
Y
ij =
ij .
iI
jJ
(i,j)IJ
i =
iI
( i ).
iI
iI
(iii) 1 = .
(iv) 1 = 1.
(v) 2 = .
(vi) = + .
(vii) ( ) = .
(viii) ( ) = .
(ix) Q
If 6= 0 and , then .
|I|
(x) P
iI = .
Q
(xi) iI i = iI i .
Q
Q
(xii)
= iI i .
i
iI
Proof. The proofs are straightforward. We give the proof of (xi) as an illustration.
By the definitions of arbitrary sums and products it suffices to find a bijection f from the
set
S
{i {i}:iI}
()
Q
to the set iI i , where the product here is just the product of sets. So take any
Q
member x of (). We define f (x) in iI i by defining its value (f (x))i for each i I;
and we define (f (x))i in i by defining its value ((f (x))i)() for each i . Since
(, i) i {i}, the pair (, i) is in the domain of x. We define ((f (x))i)() = x(, i).
f is one-one: suppose that x, y () and f (x) = f (y). Take any u dmn(x); say
u = (, i) with i I and i . Then x(u) = x(, i) = ((f (x))i )() = ((f (y))i)() =
y(, i) = y(u). Q
Q
f maps onto iI i : take any g iI i . Define x () by setting, for any i I
and i , x(, i) = (g(i))(). Then ((f (x))i )() = x(, i) = (g(i))(), and since i and
are arbitrary we get f (x) = g.
Proposition 6.38. If m, n , then mn
Proposition 6.39. |P(A)| = 2|A| .
For each X A define X A 2 by setting
n
1
X (a) =
0
if a X,
otherwise.
[This is the characteristic function of X.] It is easy to check that is a bijection from
P(A) onto A 2.
The calculation of exponentiation is not as simple as that for addition and multiplication.
The following result gives one of the most useful facts about exponentiation, however.
Theorem 6.40. If 2 , then = 2 .
Proof. Note that each function f : is a subset of . Hence P( ),
and so |P( ). Therefore,
2 |P( )| = 2 = 2 ;
57
so all the entries in this string of inequalities are equal, and this gives = 2 .
Cofinality, and
regular and singular cardinals
Further cardinal arithmetic depends on the notion of cofinality. For later purposes we
define a rather general version of this notion. Let (P, <) be a partially ordered set. A
subset X of P is dominating iff for every p P there is an x X such that p x. The
cofinality of P is the smallest cardinality of a dominating subset of P . We denote this
cardinal by cf(P ).
A subset X of P is unbounded iff there does not exist a p P such that x p for all
x X. If P is simply ordered without largest element, then these notionsdominating
and unboundedcoincide. In fact, suppose that X is dominating but not unbounded.
Since X is not unbounded, choose p P such that x p for all x X. Since P does
not have a largest element, choose q P such that p < q. Then because X is dominating,
choose x X such that q x. Then q x p < q, contradiction. Thus X dominating
implies that X is unbounded. Now suppose that Y is unbounded but not dominating.
Since Y is not dominating, there is a p P such that p 6 x, for all x Y . Since P is a
simple order, it follows that x < p for all x Y . This contradicts Y being unbounded.
A cardinal is regular iff is infinite and cf() = . An infinite cardinal that is not
regular is called singular.
Theorem 6.41. For every infinite cardinal , the cardinal + is regular.
Proof. Suppose that + , is unbounded in + , and || < + . Hence
X
[
X
+
||
= = ,
=
Proof. Let X be a dominating subset of A of size cf(A), and let g be a bijection from
cf(A) onto X. We define a function f : cf(A) X by recursion, as follows. If f () X
has been defined for all < , where < cf(A), then {f () : < } has size less than
cf(A), and hence it is not dominating. Hence there is an a A such that f () < a for all
< . We let f () be an element of X such that a, g() f ().
Clearly f is strictly increasing. If a A, choose < cf(A) such that a g(). Then
a f ().
Proposition 6.43. Suppose that (A, <) is a simple ordering with no largest element.
Then cf(cf(A)) = cf(A).
Proof. Clearly cf() for any ordinal ; in particular, cf(cf(A)) cf(A). Now by
Theorem 6.42, let f : cf(A) A be strictly increasing with rng(f ) unbounded in A, and
let g : cf(cf(A)) cf(A) be strictly increasing with rng(g) unbounded in cf(A). Clearly
f g : cf(cf(A)) A is strictly increasing. We claim that rng(f g) is unbounded in A.
For, given a A, choose < cf(A) such that a f (), and then choose < cf(cf(A))
such that g(). Then a f () f (g()), proving the claim. It follows that
cf(A) cf(cf(A)).
S
Proposition 6.44. If is a regular cardinal, , and || < , then < .
Proof. Since cf() = , from the definition of cf it follows
that is bounded in .
S
Hence there is an < such that for all . So < .
Proposition 6.45. If A is a linearly ordered set with no greatest element, is a
regular cardinal, and f : A is strictly increasing with rng(f ) unbounded in A, then
= cf(A).
Proof. By the definition of cf we have cf(A) . Suppose that cf(A) < . By
Theorem 6.42 let g : cf(A) A be strictly increasing with rng(g) unbounded in A. For
each < cf() choose < such S
that g() f ( ). Then { : < cf(A)} and
|{ : < cf(A)}| < , so by 6.44, <cf(A) < . Let < be such that < for
all < cf(A). Then f () is a bound for rng(g), contradiction.
Proposition 6.46. A cardinal isP
regular iff for every system hi : i Ii of cardinals
less than , with |I| < , one also has iI i < .
P
S
S
Proof.
:
|I|
<
.
:
if
and
||
<
,
then
|
|
i
i
iI
iI
P
S
||
<
,
so
also
<
.
Thus
is
regular.
+
[
max ,
< ,
<
59
and we define P
to be this cardinal.
Now <cf() for each < cf(), so
=
cf() = .
<cf()
<
<cf()
= cf() .
<cf()
[
<
a cardinal
cf()
f () if f () < ,
0
otherwise.
60
= | |
<cf()
<cf() a <
cardinal
cf()
[
=
<
a cardinal
( )cf() = cf() = ,
and the lemma follows.
The following theorem is not needed for the main result, but it is a classical result about
exponentiation.
Theorem 6.51. (Hausdorff) If and are infinite cardinals, then (+ ) = + .
Proof. If + , then both sides are equal to 2 . Suppose that < + . Then
[
+
+
( ) = | ( )| =
<+
X
|| + (+ ) ,
<+
as desired.
Here is the promised theorem giving computation rules for exponentiation. It essentially
reduces the computation of to two special cases: 2 , and cf () . Generalizations of the
results mentioned about the continuum hypothesis give a pretty good picture of what can
happen to 2 . The case of cf( is more complicated, and there is still work being done on
what the possibilities here are. Recent deep work of Shelah on pcf theory has shed some
light on this. For example, he showed that 0 20 + 4 . The role of 4 here is still
unclear.
Theorem 6.52. (main theorem of cardinal arithmetic) Let and be cardinals with
2 and . Then
(i) If , then = 2 .
(ii) If is infinite and there is a < such that , then = .
(iii) Assume that is infinite and < for all < . Then < , and:
61
<
[
<
a cardinal
cf()
cf() ,
62
EXERCISES
The first four exercises outline a proof of the Cantor-Schroder-Bernstein theorem without
using the axiom of choice. This theorem says that if there is an injection from A into B
and one from B into A, then there is a bijection from A to B. In the development in the
text, using the axiom of choice, the hypothesis implies that |A| |B| |A|, and hence
|A| = |B|. But it is of some interest that it can be proved in an elementary way, without
using the axiom of choice or anything about ordinals and cardinals. Of course, the axiom
of choice should not be used in these four exercises.
E6.1. Let F : P(A) P(A), and assume that for all X, Y A, ifS X Y , then
F (X) F (Y ). Let A = {X : X A and X F (X)}, and set X0 = XA X. Then
X0 F (X0 ).
E6.2. Under the assumptions of exercise 1 we actually have X0 = F (X0 ).
E6.3. Suppose that f : A B is one-one and g : B A is also one-one. For every X A
let F (X) = A\g[B\f [X]]. Show that for all X, Y A, if X Y then F (X) F (Y ).
E6.4. Prove the Cantor-Schroder-Bernstein theorem as follows. Assume that f and g are
as in exercise E6.3, and choose F as in that exercise. Let X0 be as in exercise E6.1. Show
that A\X0 rng(g). Then define h : A B by setting, for any a A,
h(a) =
f (a)
if a X0 ,
1
g (a) if a A\X0 .
where P =
iI
ij =
XY
i,f (i) ,
f P iI
Ji .
E6.12. For every infinite cardinal there is a cardinal > such that > .
E6.13. Prove that for every n , and every infinite cardinal , n = 2 n .
E6.14. Prove that 1 = 21 0 .
Q
E6.15. Prove that 0 = n n .
E6.16. Prove that for any infinite cardinal , (+ ) = 2 .
E6.17. Show that if is an infinite cardinal and C is the collection of all cardinals less
than , then |C| .
E6.18. Show that if is an infinite cardinal and C is the collection of all cardinals less
than , then
!cf()
X
.
2 =
2
C
<
2 = 2 .
E6.20.
Show that if is an infinite cardinal and C is the set of all cardinals , then
P
= 2 .
C
E6.21.PShow that if is an infinite cardinal, = + , and C is the set of all cardinals < ,
then C = 2 .
E6.22. Assume that is an infinite cardinal, and 2 < for every cardinal < . Show
that 2 = cf() .
E6.23. Suppose that is a singular cardinal, cf() = , and 2 < for every < . Prove
that 2 = .
References
Abraham, U.; Magidor, M. Cardinal arithmetic. Chapter in Handbook of Set Theory.
Springer 2010, 2197pp.
Holz, M.; Steffens, K.; Weitz, E. Introduction to Cardinal Arithmetic. Birkh
auser
1999, 304pp.
64
Now suppose that c2m and d2m have been defined so that the following condition hold:
(*) For all i, j 2m, ci < cj iff di < dj .
(Note that then a similar equivalence holds for = and for >.) We let c2m+1 = am+1 . Now
we consider several cases.
Case 1. am+1 = ci for some i 2m. Take the least such i, and let d2m+1 = di .
Case 2. am+1 < ci for all i 2m. Let d2m+1 be any element of B less than each di ,
i 2m.
Case 3. ci < am+1 for all i 2m. Let d2m+1 be any element of B greater than each
di , i 2m.
Case 4. Case 1 fails, and there exist i, j 2m such that ci < am+1 < cj . Let d2m+1
be any element b of B such that di < b < dj whenever ci < am+1 < cj ; such an element b
exists by (*).
This finishes the definition of d2m+1 . d2m+2 and c2m+2 are defined similarly. Namely,
we let d2m+2 = bm+1 and then define c2m+2 similarly to the above, with a, b interchanged
and c, d interchanged.
Note that each ai appears in the sequence of ci s, namely c0 = a0 and c2i+1 = ai+1 ,
and similarly each bi appears in the sequence of di s. Hence it is clear that {(ci , di ) : i }
is the desired order-isomorphism.
Theorem 7.3. If L is an infinite linear order, then there is a subset M of L which
is order isomorphic to (, <), or to (, >).
Proof. Suppose that L does not have a subset order isomorphic to (, >). We claim
then that L is well-ordered, and therefore is isomorphic to an infinite ordinal and hence
has a subset isomorphic to (, <). To prove this claim, suppose it is not true. So L has
some nonempty subset P with no least element. We now define a sequence hai : i i of
elements of P by recursion. Let a0 be any element of P . If ai P has been defined, then
it is not the least element of P and so there is an ai+1 P with ai+1 < ai . This finishes
the construction. Thus we have essentially produced a subset of L order isomorphic to
(, >), contradiction.
It would be natural to conjecture that Theorem 7.3 generalizes in the following way: for
any infinite cardinal and any linear order L of size , there is a subset M of L order
isomorphic to (, <) or to (, >). This is clearly false, as the real numbers under their
usual order form a counterexample. (Given a set of real numbers order isomorphic to
2 , one could choose rationals between successive members of the set, and produce 2
rationals, contradiction.) We want to give an example that works for many cardinals. The
construction we use is very important for later purposes too.
The following definitions apply to any infinite ordinal .
If f and g are distinct elements of 2, we define
(f, g) = min{ < : f () 6= g()}.
Let f < g iff f and g are distinct elements of 2 and f ((f, g)) < g((f, g)). (Thus
f ((f, g)) = 0 and g((f, g)) = 1.) Clearly ( 2, <) is a linear order; this is called the
lexicographic order.
66
We also need some general set-theoretic notation. If A is any set and any cardinal, then
[A] = {X A : |X| = };
[A]< = {X A : |X| < };
[A] = {X A : |X| }.
Theorem 7.4. For any infinite cardinal , the linear order 2 does not contain a
subset order isomorphic to + or to (+ , >).
Proof. The two assertions are proved in a very similar way, so we give details only for
the first assertion. In fact, we assume that hf : < + i is a strictly increasing sequence
of members of 2, and try to get a contradiction. The contradiction will follow rather
easily from the following statement:
+
(1) If , [+ ] , and f < f for any , such that < , then there
+
exist < and [] such that f < f for any , such that < .
To prove this, assume the hypothesis. For each let f = f . Clearly does not
have a largest element. For each let be the least member of which is greater
than . Then
[
=
{ : (f , f ) = }.
<
+
Since || = + , it follows that there are < and [] such that (f , f ) = for
all . We claim now that f < f for any two , such that < , as
desired in (1). For, take any such , . Suppose that f = f . (Note that we must
have f f .) Now from (f , f ) = we get f () = 1, and from (f , f ) =
we get f () = 0. Now f = f = f , so we get f < f f , contradiction.
This proves (1).
Clearly from (1) we can construct an infinite decreasing sequence > 1 > 2 >
of ordinals, contradiction.
Now we give some more definitions, leading to a kind of generalization of Theorem 7.2.
If (L, <) is a linear order and A, B L, we write A < B iff x Ay B[x < y]. If
A = {a} here, we write a < B; similarly for A < b.
Intervals in linear orders are defined in the usual way. For example, [a, b) = {c : a c <
b}.
An -set is a linear order (L, <) such that if A, B L, A < B, and |A|, |B| < , then
there is a c L such that A < c < B. Taking A = and B = {a} for some a L, we
see that -sets do not have first elements; similarly they do not have last elements. Note
that an 0 -set is just a densely ordered set without first or last elements.
For any ordinal , we define
H = {f 2 : there is a < such that f () = 1 and f () = 0 for all (, )}.
67
Theorem 7.5. Let be an ordinal, and let cf( ) = . Then the following conditions hold:
(i) H is an -set.
(ii) cf(H , <) = .
(iii) cf(H , >) = .
P
(iv) |H0 | = 0 , and for > 0, |H | = < 2 .
Proof. For each f H let f < be such that f (f ) = 1 and f () = 0 for all
(f , ).
For (i), suppose that A, B H with A < B and |A|, |B| < . Obviously we may
assume that one of A, B is nonempty. Then there are three possibilities:
Case 1. A 6= =
6 B. Let
= sup{f : f A};
= max( + 1, sup{f : f B}).
Thus , < since |A|, |B| < = cf( ). We now define g 2 by setting, for each
< ,
1 if = + 1;
0 if + 1 < < .
Clearly g H . We claim that A < g < B. Note that g
/ A B since g( + 1) = 1 while
f ( + 1) = 0 for any f A B.
To prove the claim first suppose that f A. Assume that g < f ; we will get a
contradiction. Let = (g, f ). Then g() = 0 and f () = 1. It follows that and
g = f , contradicting the definition of g().
Second, suppose that f B. Assume that f < g; we will get a contradiction. Let
= (f, g). Thus f () = 0 and g() = 1. We claim that = + 1. For, otherwise since
g() = 1 we must have , and then there is an h A such that h = g and
h() = 1. So f = g = h , f () = 0, and h() = 1, so f < h. But f B and
h A, contradiction. This proves our claim that = + 1.
Now clearly f . Since
g ( + 1) = g = f = f ( + 1),
it follows that g(f ) = 1. So from f we infer that f . Thus since g(f ) = 1, it
follows that there is a k A such that k f = g f and k(f ) = 1. But now we have
k (f + 1) = g (f + 1) = f (f + 1) and f () = 0 for all (f , ). Hence f k,
which contradicts f B and k A.
This finishes the proof of (i) in Case 1.
68
Case 2. A = =
6 B. Let
= sup{f : f B}.
Define g H by setting, for each < ,
g() =
0 if ,
1 if = + 1,
0 if + 1 < .
1 if + 1,
0 if + 1 < .
1
0
if ,
if < .
0 if < ,
f () = 1 if = ,
0 if < .
Clearly hf : < is a strictly decreasing sequence of members of H and {f : < }
is cofinal in (H , >). So (iii) holds.
Finally, for (iv), for each < let
L = {f H : f () = 1 and f () = 0 for all (, )}.
Clearly these sets are pairwise disjoint, and their union is H . For = 0,
|H | =
|L | =
<
<
For > 0,
|H | =
|L |
<
69
2|| = 0 .
|L | +
<
= 0 +
|L |
<
2||
<
= 0 +
2 |{ < : || = }|
<
= 0 +
=
2 +1
<
2 .
<
is any upper bound for X. Clearly a least upper bound for X is unique if it exists. If
a is the least upper bound of the empty set, then a is the smallest element of L. We use
lub or sup to abbreviate least upper bound. Also supremum is synonymous with least
upper bound.
Similarly one defines lower bound and greatest lower bound. Any element is a lower
bound of the empty set, and if a is the greatest lower bound of the empty set, then a is
the largest element of L. We use glb or inf to abbreviate greatest lower bound. infimum
is synonymous with greatest lower bound.
A linear order L is complete iff every subset of L has a greatest lower bound and a
least upper bound.
Proposition 7.11. For any linear order L the following conditions are equivalent:
(i) L is complete.
(ii) Every subset of L has a least upper bound.
(iii) Every subset of L has a greatest lower bound.
Proof. (i)(ii): obvious. (ii)(iii). Assume that every subset of L has a least upper
bound, and let X L; we want to show that X has a greatest lower bound. Let Y be the
set of all lower bounds of X. Then let a be a least upper bound for Y . Take any x X.
Then y Y [y x], so a x since a is the lub of Y . This shows that a is a lower bound
for X. Suppose that y is any lower bound for X. Then y Y , and hence y a since a is
an upper bound for Y .
(iii)(i) is treated similarly.
Let (L, <) be a linear order. We say that a linear order (M, ) is a completion of L iff the
following conditions hold:
(C1) L M , and for any a, b L, a < b iff a b.
(C2) M is complete.
(C3) Every element of M is the lub of a set of elements of L.
(C4) If a L is the lub in L of a subset X of L, then a is the lub of X in M .
Theorem 7.12. Any linear order has a completion.
Proof. Let (L, <) be a linear order. We let M be the collection of all X L such
that the following conditions hold:
(1) For all a, b L, if a < b X then a X.
(2) If X has a lub a in L, then a X.
We consider the structure (M , ). It is clearly a partial order; we claim that it is a linear
order. (Up to isomorphism it is the completion that we are after.) Suppose that X, Y M
and X 6= Y ; we want to show that X Y or Y X. By symmetry take a X\Y . Then
we claim that Y X (hence Y X). For, take any b Y . If a < b, then a Y by (1),
contradiction. Hence b a, and so b X by (1), as desired. This proves the claim.
71
as M \f [L], and it is disjoint from L by the foundation axiom. Let g be a bijection from
A onto M \f [L]. Now let N = L A, and define h : N M by setting, for any x N ,
h(x) =
f (x) if x L,
g(x) if x A.
(ii)(i): Assume (ii). Then (C1) and (C2) are clear. For (C3), let x M , and let
X = {a L : a < x}. Then x is an upper bound for X, and (ii)(d) clearly implies that
it is the lub of X. For (C4), suppose that a L is the lub in L of a set X of elements of
L. Suppose that x M is an upper bound for X and x < a. Then by (ii)(d) there is an
element b L such that x < b < a. Then there is an element c X such that b < c a.
It follows that c x, contradiction.
Note from this corollary that the completion of a dense linear order is also dense.
We now take up a special topic, Suslin lines.
A subset U of a linear order L is open iff U is a union of open intervals (a, b) or (, a)
or (a, ). Here (, a) = {b L : b < a} and (a, ) = {b L : a < b}. L itself is also
counted as open. (If L has at least two elements, this follows from the other parts of this
definition.) Note that if L has a largest element a, then (a, ) = ; similarly for smallest
elements.
An antichain in a linear order L is a collection of pairwise disjoint open sets.
A linear order L has the countable chain condition, abbreviated ccc, iff every antichain
in L is countable.
A subset D of a linear order L is topologically dense in L iff D U 6= for every
nonempty open subset U of L. Then dense in the sense at the beginning of the chapter
implies topologically dense. In fact, if D is dense in the original sense and U is a nonempty
open set, take some non-empty open interval (a, b) contained in U . There is a d D with
a < d < b, so D U 6= . If =
6 (a, ) U for some a, choose b (a, ), and then choose
d D such that a < d < b. Then again D U 6= . Similarly if (, a) U for some a.
Conversely, if L itself is dense, then topological denseness implies dense in the order
sense; this is clear. On the other hand, take for example the ordered set ; itself is
topologically dense in , but is not dense in in the order sense.
A linear order L is separable iff there is a countable subset C of L which is topologically
dense in L. Note that if L is separable and (a, b) is a nonempty open interval of L, then
(a, b), with the order induced by L (x < y for x, y (a, b) iff x < y in L) is separable.
In fact, if C is countable and topologically dense in L clearly C (a, b) is countable and
topologically dense in (a, b). Similarly, [a, b] is separable, taking (C [a, b]) {a, b}. This
remark will be used shortly.
A Suslin line is a linear ordered set (S, <) satisfying the following conditions:
(i) S has ccc.
(ii) S is not separable.
Suslins Hypothesis (SH) is the statement that there do not exist Suslin lines.
Later in these notes we will prove that MA + CH implies SH. Here MA is Martins axiom,
which we will define and discuss later. The consistency of MA + CH requires iterated
forcing, and will be proven much later in these notes. Also later in these notes we will
prove that implies SH, and still later we will prove that is consistent with ZFC,
namely it follows from V = L. Both and L are defined later.
74
For now we want to connect our notion of Suslin line with more familiar mathematics,
and with the original conjecture of Suslin. The following is a theorem of elementary set
theory.
Theorem 7.15. For any linear order (L, ) the following conditions are equivalent:
(i) (L, ) is isomorphic to (R, <).
(ii) The following conditions hold:
(a) L has no first or last elements.
(b) L is dense.
(c) Every nonempty subset of L which is bounded above has a least upper bound.
(d) L is separable.
Proof. (i)(ii): standard facts about real numbers.
(ii)(i): By (d), let C be a countable subset of L such that (a, b) C 6= whenever
a < b in L. Clearly C is infinite, is dense, and has no first or last element. By Theorem
7.2, let f be an isomorphism from (C, <) onto (Q, <). We now apply the procedure used
at the end of the proof of 7.12. Let P be a set disjoint from Q such that |L\C| = |P |,
and let R = Q P . Let g be a bijection from L\C onto P , and define h = f g. Define
x y iff x, y R and h1 (x) <L h1 (y). This makes R into a linearly ordered set with
h an isomorphism from L onto R. Now we adjoin first and last elements aR , bR to R and
similarly aR , bR for R; call the resulting linearly ordered sets R and R . Then R and
R are both completions of Q according to Corollary 7.14. Hence (i) holds by Theorem
7.13.
Originally, Suslin made the conjecture that separability in Theorem 7.15 can be replaced
by the condition that every family of pairwise disjoint open intervals is countable. The
following theorem shows that this conjecture and our statement of Suslins hypothesis are
equivalent.
Theorem 7.16. The following conditions are equivalent:
(i) There is a Suslin line.
(ii) There is a linearly ordered set (L, <) satisfying the following conditions:
(a) L has no first or last elements.
(b) L is dense.
(c) Every nonempty subset of L which is bounded above has a least upper bound.
(d) No nonempty open subset of L is separable.
(e) L is ccc.
Proof. Obviously (ii) implies (i). Now suppose that (i) holds, and let S be a Suslin
line. We obtain (ii) in two steps: first taking care of denseness, and then taking the
completion to finish up.
We define a relation on S as follows: for any a, b S,
ab
iff
a = b,
or a < b and [a, b] is separable,
or b < a and [b, a] is separable.
75
that x is not the first element of L and does not have an immediate predecessor. Then
the left character of x is the smallest cardinal such that there is a strictly increasing
sequence of elements of L with supremum x. This cardinal is then regular. A symmetric
definition for right character is clear. The character of x is the pair (, ) where is the
left character and is the right character. The point-character set of L is the collection of
all characters of points of L; we denote it by Pchar(L). Note that Pchar(L) 6= .
A gap of L is an ordered pair (M, N ) such that M 6= =
6 N , L = M N , M has no
largest element, N has no smallest element, and x M y N (x < y). The definitions of
left and right characters of a gap are similar to the above definitions for points; but they
are always infinite regular cardinals. Again, the character of (M, N ) is the pair (, )
where is the left character and is the right character. The gap-character set of L is the
collection of all characters of gaps of L; we denote it by Gchar(L). If Gchar(L) = then
we say that L is inner-complete; L may or may not have first and last elements.
The full character set of L is the pair (Pchar(L), Gchar(L)).
If L does not have a first element, then the coinitiality of L is the least cardinal
such that there is a strictly decreasing sequence ha : < i of elements of L such that
x L < [a < x]; we denote this cardinal by ci(L). Similarly for the right end, if L
does not have a greatest element then we define the cofinality of L, denoted by cf(L).
L is irreducible iff it has no first or last elements, and the full character set of (x, y)
is the same as the full character set of L for any two elements x, y L with x < y.
Now a complete character system is a set R RegReg with the following properties:
(C1) dmn(R) = [, left (R))reg .
(C2) rng(R) = [, right (R))reg .
(C3) There is a such that (, ) R.
(C4) There is a linear order L of size |R| 1 with no strictly increasing sequence of regular
order type left (R) and no strictly decreasing sequence of regular order type right (R).
Note that conditions (1)(3) do not mention orderings. There are many sets which satisfy
the condition (4) and whose descriptions do not mention orderings. For example, any finite
set satisfies (4). If neither left (R) nor right (R) is regular limit, then (4) holds (this is the
condition of Hausdorff).
Proposition 7.17. If L is an irreducible infinite inner-complete dense linear order,
then Pchar(L) is a complete character system. Moreover, ci(L) right (Pchar(L)) and
cf(L) left (Pchar(L)).
Proof. Let R = Pchar(L). (C1): the inclusion is obvious. Now suppose that
[, left(R))reg . Then there is a point x of L with character (, ) such that .
Let ha : < i be a strictly increasing sequence of elements of L with supremum x. Let
y = sup< a . Clearly y has left character , as desired.
(C2): symmetric to (C1).
(C3): By a straightforward transfinite construction one gets (for some ordinal ) a
strictly increasing sequence hx : < i and a strictly decreasing sequence hy : < i such
that x < y for all , < , and such that there is exactly one point z with x < z < y
for all , < . Then is a limit ordinal, and z has character (cf(), cf()), as desired.
78
(C4): Suppose that cf(L) > left (R). Then by the argument for (C1), L has a point
with left character left (R), contradiction. A similar argument works for ci.
We shall use the sum construction for linear P
orders. If hLi : i Ii is a system of linear
orders, and I itself is an ordered set, then by iI Li we mean the set
{(i, a) : i I, a Li }
ordered lexicographically.
The following lemma is probably well-known.
Lemma 7.18.PIf hLi : i Ii is a system of complete linear orders, and I is a complete
linear order, then iI Li is also complete.
P
Proof. Suppose that C is a nonempty subset of iI Li . Let i0 = sup{i I : (i, a)
C for some a Li }. We consider two cases.
Case 1. There is an a Li0 such that (i0 , a) C. Then we let a0 = sup{a Li0 :
(i0 , a) C}. Clearly (i0 , a0 ) is the supremum of C.
Case 2. There is no a Li0 such that (i0 , a) C. Then the supremum of C is (i0 , a),
where a is the first element of Li0 .
Another construction we shall use is the infinite product. Suppose
Q that I is a well-ordered
set and hLi : i Ii is aQsystem of linear orders. Then we make iI Li into a linear order
by defining, for f, g iI Li ,
f <g
iff
where i = f.d.(f, g), and f.d.(f, g) is the first i I such that f (i) 6= g(i).
Given such an infinite product, and given a strictly increasing sequence x = hx : <
i of members of it, with a limit ordinal, we call x of argument type if the following two
conditions hold:
(A1) hf.d.(x , x+1 ) : < i is strictly increasing
(A2) For each < , the sequence hf.d.(x , x ) : < < i is a constant sequence.
On the other hand, x is of basis type iff there is an i I such that f.d.(x , x ) = i for all
distinct , < .
Lemma Q
7.19. Let hMi : i Ii be a system of ordered sets, with I well-ordered. If
x < y < z in iI Mi , then f.d.(x, z) = min{f.d.(x, y), f.d.(y, z)}.
Proof. Let i = min{f.d.(x, y), f.d.(y, z)}.
Case 1. i = f.d.(x, y) = f.d.(y, z). Then x i = y i = z i and x(i) < y(i) < z(i), so
f.d.(x, z) = i.
Case 2. i = f.d.(x, y) < f.d.(y, z). Then x i = y i = z i and x(i) < y(i) = z(i), so
f.d.(x, z) = i.
Case 3. i = f.d.(y, z) < f.d.(x, y). Then x i = y i = z i and x(i) = y(i) < z(i), so
f.d.(x, z) = i.
79
A variant of the product construction will be useful. Let be an infinite regular cardinal.
A V-system is a pair (T, M ) with the following properties:
(V1) For each , T is a collection of functions with domain .
(V2) For each < and each x T , Mx is a linear order.
(V3) For each < we have
T+1 = {x {(, a)} : x T , a Mx }.
(V4) If is a limit ordinal and x is a sequence with domain such that x T
for every < , then x T .
(V5) If < and x T , then x T .
We define a linear order on T by setting, for any x, y T , x < y iff x 6= y, and
x() < y(), where is minimum such that x() 6= y(). Here the second < relation is
that of M(x) .
The idea is that this is a variable product: not all functions in a cartesian product are
allowed. If x T , then for each < the value x() lies in an ordered set M(x) which
depends on x . Thus the linear order has a tree-like property.
Theorem 7.21. Assume the above notation. For each < let M = {(x, y) : x
T , y Mx }. Let be any linear ordering of T , for each <Q, and let M have the
lexicographic ordering. Then there is an isomorphism of T into < M .
Namely, for each x T define f (x) by (f (x)) = (x , x()) for any < .
Then f is the indicated isomorphism. Moreover, for all x, y T we have f.d.(x, y) =
f.d.(f (x), f (y)).
Q
Proof. Clearly f maps T into < M . Suppose that x, y T and x < y.
Choose minimum such that x() 6= y(); so x() < y(). Hence (x , x()) < (x
, y()) = (y , y()). If < , then (x , x()) = (y , y()). Hence f (x) < f (y)
and f.d.(x, y) = f.d.(f (x), f (y)). On the other hand, suppose that f (x) < f (y). Let
= f.d.(f (x), f (y)). If < , then (f (x)) = (f (y)) , i.e., (x , x()) = (y , y()).
Hence x = y . Since (f (x)) < (f (y)) , we have (x , x()) < (y , y()).
Hence x() < y(). It follows that x < y.
Theorem 7.22. If (T, M ) is a V -system on a regular cardinal and each linear order
Mx is complete, then T is complete.
Proof. It suffices to take any regular cardinal , suppose that x = hx : < i
is a strictly increasing sequence in T , and show that it has a supremum. By Theorems
7.20 and 7.21 we may assume that x is of argument type or of basis type. Now if x is of
basis type, then it is a sequence in some Mx , and so it has a supremum by assumption.
Suppose that it is of argument type. Thus . Let = sup{f.d.(x , x+1 ) : < }.
Thus . Let y = x f.d.(x , x+1 ) for each < . Hence y Tf.d.(x ,x+1 ) by (V5).
Now
(1) y y if < < .
81
In fact, suppose that this is not true; say that < < but y 6 y . So there is an
< f.d.(x , x+1 ) such that y () 6= y (). Thus < f.d.(x , x +1 ), and x () 6= x ().
So f.d.(x , x ) , contradiction (see the definition of argument type, (A2)).
From (1), clearly
(2) y y if < < .
def S
Now consider the function z = < y . We consider two cases.
Case 1. = . Then z T and = . We claim that z is the supremum of
x in this case. If < , let = f.d.(x , x+1 ). Then z = y = x . Now
= f.d.(x , x+1 ) < f.d.(x+1 , x+2 ) by (A1). So x () < x+1 () = y+1 () z().
Thus x < z. Now suppose that w < z. Let = f.d.(w, z). Since = , choose <
such that < f.d.(x , x+1 ). Then w = z = y , and w() < z() = y (). So
w < y , as desired.
Case 2. < . Then also < . Now z T by (V4). We define an extension
v T of z by recursion. Let w0 = z. If w has been defined as a member of T+ , with
+ < , let a() be the least member of Mw , and set w+1 = w {(, a())}. So
w+1 T++1 by (V3). If is limitSand w has been defined as
S a member of T+ for
all < , and if + < , let w = < w . Finally, let v = < w . So v T and
it is an extension of z. We claim that it is the l.u.b. of hx : < i. First suppose that
< . Then x f.d.(x , x+1 ) = y = x+1 f.d.(x , x+1 ), and
x (f.d.(x , x+1 )) < x+1 (f.d.(x , x+1 )) = y+1 (f.d.(x , x+1 )) = z(f.d.(x , x+1 )).
def
Thus x < v. Now suppose that t < v. Then = f.d.(t, v) is less than by construction,
so t = z and t() < z(). By the definition of z this gives a < such that
t = x and t() < x (). So t < x , as desired.
Our main theorem is as follows. It generalizes Satz XVII of Hausdorff [1908], and its proof
is just a modification of his proof.
Theorem 7.23. Suppose that R is a complete character system, and , are regular
cardinals with right (R) and left (R). Then there is an irreducible inner-complete
dense order L such that Pchar(L) = R, with ci(L) = and cf(L) = .
Proof. By assumption, there is a linear order U of size |R| 1 with no increasing
sequence of regular order type left (R) and no decreasing sequence of regular order type
right (R). Let U be the completion of U . Then |U | |R|, and also U has no increasing
sequence of regular order type left (R) and no decreasing sequence of regular order type
right (R). Let g be a function mapping U onto R.
Now we define some important orders which are components of the final order L. Let
and be regular cardinals.
= + 1 + ;
X
g(a) ;
=
aU
(, ) = 1 + + + + 1.
82
The symmetry of this definition will enable us to shorten several proofs below. Since we are
using the standard notation for sums of order types, and some order types are repeated,
it is good to have an exact notation for the indicated orders. m, f, l are new elements
standing for middle, first, and last respectively. We suppose that with each ordinal
we associate a new element , used in forming things like . Thus more precisely,
= {m} { : < };
the ordering here is: has its natural order; for , < , we define < iff < . The
ordering between the indicated parts of are as suggested in the left to right order.
is as in the discussion of sums above.
(, ) = {f } { : < } {l};
the ordering should be obvious on the basis of the above remarks. We implicitly assume
the distinctness of the various objects making up (, ).
(1) (, ) is a complete linear order, for any regular cardinals , .
This is clear on the basis of Lemma 7.18.
(2) If right (R) is regular and left (R) is regular, then:
(a) the right character of the left end point of (, ) is ;
(b) the left character of the right end point of (, ) is ;
(c) if a (, ) is not an end point, and its character is (, ), then < left (R)
and < right (R).
In fact, (a) and (b) are clear. Now suppose that a (, ) is not an end point, and its
character is (, ). If a is in the or portion, the conclusion of (c) is clear. So, suppose
that a is in the portion. Thus it is within some g(b) with b U . Thus g(b) = (, ) for
some (, ) R, so the conclusion of (c) is clear. Hence (2) holds.
Let p be a new element, not appearing in any of the above orders. Let be the least
regular cardinal such that (, ) R; it exists by condition (C3) in the definition of a
complete character system. For each regular < left (R), let be the least cardinal such
that (, ) R; it exists by (C1) in the definition of complete character set. Similarly,
for each regular right (R) let be the least cardinal such that ( , ) R.
Now we define by recursion a V -system (T, M ) associated with . We do this so that
the following condition holds:
(*) If y Mx is not an endpoint of Mx , then the left character of y is less than left (R)
and the right character of y is less than right (R). Moreover, except for = 0, these
conditions on y also hold for the endpoints of Mx .
To start with, we let T0 = {} and M0 = (, ). Note by (2) that the first element of
M0 has right character , its last element has left character , all other right characters
are less than right (R), and all other left characters are less than left (R). So (*) holds for
= 0.
83
Now suppose that is a limit ordinal. We let T be the set of all x with domain
such that x T for all < . Now suppose that < , still with a limit ordinal.
Now if x T and |M(x) | > 1 for all < , we set
Mx = (cf() , cf() ).
On the other hand, if |M(x) | = 1 for some < , we set Mx = {p}. Clearly (*) holds
for .
Now suppose that = + 1. Then we set
T = {x hbi : x T and b Mx }.
Now we define Mx for each x T .
(3) If x() = p, then Mx = {p}.
(4) If x() is an endpoint of M(x) or has no immediate neighbors, then Mx = {p}.
(5) If x() has a right neighbor but no left neighbor, and the left character of x() in
M(x) is , then Mx = ( , ). Note that < left (R) by (*).
(6) If x() has a left neighbor but no right neighbor, and the right character of x() in
M(x) is , then Mx = (, ). Note that < right (R) by (*).
(7) If x() has both a left and a right neighbor, then Mx = (, ).
Clearly (*) holds for . This finishes the definition of (T, M ). The linear order T is close
to the order we are after.
The following two facts are clear from the construction:
(8) If x T , < < , and x() = p, then x() = p.
(9) If x T and < , then either M(x) = {p} or M(x) = (, ) for some , ;
except for M0 we have < left (R) and < right (R).
From Theorem 7.22 we know that T is complete. Now we find the characters of the
elements of T .
(10) The smallest element of T has character (0, ).
def
To prove this, note that the smallest element of T is x = hf, p, p, . . .i, where f is the first
element of M = (, ). For each < let y = h( + 1) , f, p, p, . . .i. Clearly x < y for
each < , and y < y if < < . Now suppose that x < w. Then there is a <
such that x(0) < w( ); hence y+1 < w, as desired. This proves (10).
By symmetry we get
(11) The largest element of T has character (, 0).
Let a be the first element of T , and b the last element.
(12) If a < x < b and |Mx | > 1 for every < , then x has character (, ).
84
For, by symmetry it suffices to show that x has left character . For each < let
y = (x ) hf, p, p, . . .i.
Clearly y < x for all < , and hy : < i is strictly increasing. Now suppose that
z N and z < x. Let = f.d.(z, x). Then clearly z < y+1 , as desired. So (12) holds.
Now suppose that a < x < b and x() = p for some < , and let be minimum
with this property. Then by construction, is a successor ordinal + 1, and x() is an
endpoint of M(x) or is an element of M(x) with no neighbor.
Case 1. x() is an element of M(x) with no neighbor. Then by definition, there is a
c U such that x() = (c, m), i.e., x() is the middle element of g(c) . Write g(c) = (, ).
So (, ) R. We claim that x has character (, ). To see this, for each < let
y = (x ) h(c, + 1), f, p, p, . . .i.
Then y < x, the sequence hy < < i is strictly increasing, and x is its supremum. So
the left character of x is , and similarly the right character of x is .
Case 2. x() is an endpoint of M(x) ; by symmetry, say that x() is f , the first
element. We now consider three subcases.
Subcase 2.1. = 0. This would imply that x = a, contradiction.
Subcase 2.2. is a limit ordinal. Then cf() < . So by construction, M(x) is
(cf() , cf() ). Clearly the character of x is (cf(), cf() ) R.
Subcase 2.3. = + 1 for some . Then
x = (x ) hx(), x(), p, p, . . .i.
Clearly then one of (5)(7) holds for x().
Subsubcase 2.3.1. (5) holds for x(). So x() has a right neighbor, but no left
neighbor. Say the left character of x() is . Then by (5), Mx is ( , ). We claim that
x has character (, ). To see this, let h : < i be strictly increasing with supremum
x(). Then for each < let y be any element of N such that x = y and
y () = . So clearly y < x and hy : < i is strictly increasing. Now suppose that
z N and z < x. If f.d.(z, x) < , then z < y0 . Suppose that f.d.(z, x) = . Then
z() < x(), so z() < for some < , and hence z < y . Clearly by the form of x,
one of these possibilities for z must hold. Hence the left character of x is . Clearly its
right character is .
Subsubcase 2.3.2. (6) holds for x(). So x() has a left neighbor , but no right
neighbor. Hence has a right neighbor, and hence (5) or (7) holds for (x ) hi in place
of x and in place of . Hence M(x) hi is ( , ) for some , or (, ). Now
def
From the form of M(x) hi it is clear that the left character of y is . Now Mx is
(, ) for some , and so it is clear that the right character of x is also .
Subsubcase 2.3.3. (7) holds for x(). So x has a left neighbor and a right
neighbor. This case is actually almost the same as Subsubcase 2.3.2, except that Mx is
(, ).
Summarizing our investigation of characters of elements of T , we have:
(14) If a < x < b, then one of the following holds:
(a) x has no neighbors, and its character is in R.
(b) x has an immediate predecessor y, and the characters of x, y are (1, ) and
(, 1) respectively.
(c) x has an immediate successor y, and the characters of x, y are (, 1) and (1, )
respectively.
Now let L be obtained from T by deleting the second element of any pair (x, y) of elements
of T such that y is the immediate successor of x. Then L is a complete linear order which
has left endpoint of character (0, ), right endpoint of character (, 0), and the characters
of each of its members different from a and b are in R.
It remains to show that every member of R is the character of some element of L, and
in fact this is true in any interval. So, suppose that x and y are elements of T , x < y, y
not the immediate successor of x. Let = f.d.(x, y).
Case 1. y() is not the immediate successor of x(). Then
(15) There is an element u such that x() < u < y() and u has an immediate predecessor
or an immediate successor.
To see this, note that x() 6= p 6= y(a), and Mx = My has the form (, ) for some
(, ) R. Then (15) follows by considering the position of x(a) in (, ).
By (15), the interval (x, y) contains all members of T that begin with (x ) hui.
Now M(x) hui is a set (, ). So, given (, ) W , let v be the element of (, ), in
the part , which is the middle point of . Then
(x ) hu, v, p, p, . . .i
is a member of N with character (, ).
Case 2. y() is the immediate successor of x(). Let z = (x ( + 1)) hl, p, p, . . .i
and w = (y ( + 1)) hf, p, p, . . .i. Thus x z < w y. So w is the immediate successor
of z. Since y is not the immediate successor of x, we have x 6= z or y 6= w. Say x < z.
Now
x = (x ) hx(), x( + 1), . . .i
z = (x ) hx(), l, p, p, . . .i.
and
Now l is not the immediate successor of x( + 1), so we can find our desired element
between x and z as in Case 1.
Theorem 7.24. Suppose that R is a complete character system and and are
regular cardinals with right (R) and left (R). Suppose also that S T = R,
86
with S nonempty. Then there is an irreducible inner-complete dense order L such that
Pchar(L) = S, Gchar(L) = T , with ci(L) = and cf(L) = .
Proof. Take L as in Theorem 7.23 Let
M = {a L : the character of a is (, ) for some (, ) S T }.
Possibly S T is empty, so that M is also empty. But if S T 6= , then M is nonempty
and dense in L. So by Theorem 7.1, in this case we can write M = M0 M1 with M0 , M1
disjoint and dense in M , hence also dense in L. Let
N = {a L : the character of a is (, ) for some (, ) S\T }.
Then let P = N M0 , or simply P = N if M = . Then P is as desired in the theorem.
EXERCISES
E7.1. Show that Theorem 7.2 does not extend to 1 . Hint: consider 1 Q and 1 Q,
both with the lexicographic order, where 1 is 1 under the reverse order ( < iff
< ).
E7.2. For any infinite cardinal , consider 2 under the lexicographic order, as for H .
Show that it is a complete linear order.
E7.3. Suppose that and are cardinals, with . Let be minimum such that
< . Take the lexicographic order on , as for H . Show that this gives a dense linear
order of size with a dense subset of size .
E7.4. Show that P() under contains a chain of size 2 . Hint: remember that || = |Q|.
E7.5. A subset S of a linear order L is weakly dense iff for all a, b L, if a < b then there
is an s S such that a s b. Show that the following conditions are equivalent for any
cardinals , such that :
(i) There is a linear order of size with a weakly dense subset of size .
(ii) P() has a chain of size .
E7.6. Suppose that Li is a linear order with at least two elements, for each i . Let
Q
i Li have the lexicographic order. Show that it is not a well-order.
E7.7. Suppose that L is a ccc dense linear order. Show that L has a dense subset of size
1 . Hint: let be a well-order of L, and let
N = {p L : there is an open set U in L such that p is the -first element of U },
and show that N is dense in L and has size at most 1 .
E7.8. Let hLi : i Ii be a system of linear orders, with I itself
P an ordered set. Show that
if each Li is dense without first or last elements, then also iI Li is dense without first
or last elements.
87
E7.9. Let be any infinite cardinal number. Let L0 be a linear order similar to + + 1;
specifically, let it consist of a copy of Z followed by one element a greater than every
integer, and let L1 be a linear order similar to + + 2; say it consists of a copy of Z
followed by two elements a < b greater than every integer. For any f 2 let
Mf =
Lf () .
<
Lf () .
<
Show that each Mf is a dense linear order without first or last elements, and if f, g 2
and f 6= g, then Mf and Mg are not isomorphic.
Conclude that for uncountable there are exactly 2 dense linear orders without first
or last elements, of size , up to isomorphism.
References
Harzheim, E. Ordered sets. Springer 2005, 386pp.
Hausdorff, F. Grundz
uge einer Theorie der geordneten Mengen, Math. Ann. 65 (1908),
435507. (Contains interesting theorems, relatively modern terminology.)
Rosenstein, J. Linear orderings., Academic Press 1982, 487pp.
88
8. Trees
In this chapter we study infinite trees. The main things we look at are Konigs tree
theorem, Aronszajn trees, and Suslin trees.
A tree is a partially ordered set (T, <) such that for each t T , the set {s T : s < t}
is well-ordered by the relation <. Thus every ordinal is a tree, but that is not so interesting
in the present context. We introduce some standard terminology concerning trees.
For each t T , the order type of {s T : s < t} is called the height of t, and is denoted
by ht(t, T ) or simply ht(t) if T is understood.
A root of a tree T is an element of T of height 0, i.e., it is an element of T with no
elements of T below it. Frequently we will assume that there is only one root.
For each ordinal , the -th level of T , denoted by Lev (T ), is the set of all elements of
T of height .
The height of T itself is the least ordinal greater than the height of each element of T ; it
is denoted by ht(T ).
A chain in T is a subset of T linearly ordered by <.
A branch of T is a maximal chain of T .
Note that chains and branches of T are actually well-ordered, and so we may talk about
their lengths.
Some further terminology concerning trees will be introduced later. A typical tree is
<
2, which is by definition the set of all finite sequences of 0s and 1s, with as the partial
order. More generally, one can consider < 2 for any ordinal .
Theorem 8.1. (Konig) Every tree of height in which every level is finite has an
infinite branch.
Proof. Let T be a tree of height in which every level S
is finite. We define a sequence
htm : m i of elements of T by recursion. Clearly T = r a root {s T : r s}, and
the index set is finite, so we can choose a root t0 such that {s T : t0 s} is infinite.
Suppose now that we have defined an element tm of height m such that {s T : tm s}
is infinite. Let S = {u T : tm < u and u has height ht(tm ) + 1}. Clearly
[
{s T : tm s} = {tm }
{s T : u s}
uS
and the index set of the big union is finite, so we can choose tm+1 of height ht(tm ) + 1
such that {s T : tm+1 s} is infinite.
This finishes the construction. Clearly {tm : m } is an infinite branch of T .
In attempting to generalize Konigs theorem, one is naturally led to Aronszajn trees and
Suslin trees. For the following definitions, let be any infinite cardinal.
A tree (T, <) is a -tree iff it has height and every level has size less than .
A -Aronszajn tree is a -tree which has no chain of size .
89
A subset X of a tree T is an antichain iff any two distinct members of X are incomparable.
Note that each set Lev (T ) is an antichain.
A -Suslin tree is a tree of height which has no chains or antichains of size .
An Aronszajn tree is an 1 -Aronszajn tree, and a Suslin tree is an 1 -Suslin tree.
It is natural to guess that Aronszajn trees and Suslin trees are the same thing, since the
definition of -tree implies that all levels have size less than , and a guess is that this
implies that all antichains are of size less than . This guess is not right though. Even our
simplest example of a tree, < 2, forms a counterexample. This tree has all levels finite,
but it has infinite antichains, for example
{h0i, h1, 0i, h1, 1, 0i, h1, 1, 1, 0i, . . .}.
In the rest of this chapter we investigate these notions, and state some consistency results,
some of which will be proved later. There is also one difficult natural open problem which
we will formulate.
First we consider Aronszajn trees. Note that Theorem 8.1 can be rephrased as saying
that there does not exist an -Aronszajn tree. As far as existence of Aronszajn trees is
concerned, the following theorem takes care of the case of singular :
Theorem 8.2. If is singular, then there is a -Aronszajn tree.
Proof. Let h : < cf()i be a strictly increasing sequence of infinite cardinals with
supremum . Consider the tree which has a single root, and above the root has disjoint
chains which are copies of the s. Clearly this tree is a -Aronszajn tree. We picture
this tree here:
+1
Very rigorously, we could define T to be the set {0} {(, ) : < cf() and < },
with the ordering 0 < (, ) for all < cf() and < , and (, ) < ( , ) iff =
and < .
90
91
Now we want to check that (1 )(6 ) hold. Conditions (1 ) and (3 ) are very clear.
For (2 ), suppose that w S . Then w T and there is an s U such that
{ < : w() 6= ts ()} is finite. Since \rng(ts ) is infinite, clearly \rng(w) is infinite. For
(4), note that U is countable by the assumption that (4) holds for every < , while
for each s U the set
{w T : { < : w() 6= ts ()} is finite}
is also countable. So (4) holds. For (5), suppose that w S , x T , and { < :
w() 6= x()} is finite. Choose s U such that { < : w() 6= ts ()} is finite. Then of
course also { < : x() 6= ts ()} is finite. So x S , and (5) holds. Finally, for (6 ),
suppose that w S and < ; we want to show that w S . Choose s U such
that { < : w() 6= ts ()} is finite. Assume the notation introduced above when defining
ts (n, , u, v). Choose i such that n+i . Then
{ < n+i : w() 6= ui ()} = { < n+i : w() 6= v()}
{ < n+i : w() 6= ts ()} {n+j : j < i},
and the last union is clearly finite. It followsSfrom (5n+1 that w S . So (6 ) holds.
This finishes the construction. Clearly <1 S is the desired Aronszajn tree.
We defer a discussion of possible generalizations of Theorem 8.3 until we discuss the closely
related notion of a Suslin tree.
The proof of Theorem 8.2 gives
Theorem 8.4. If is singular, then there is a -Suslin tree.
Note also that Theorem 8.1 implies that there are no -Suslin trees. There do not exist ZFC
results about existence or non-existence of -Suslin trees for uncountable and regular.
We limit ourselves at this point to some simple facts about Suslin trees.
Proposition 8.5. If T is a -Suslin tree with uncountable and regular, then T is a
-tree.
Proposition 8.6. For any infinite cardinal , every -Suslin tree is a -Aronszajn
tree.
This is a good place to notice that the construction of an 1 -Aronszajn tree given in the
proof of Theorem 8.3 does not give an 1 -Suslin tree. In fact, assume the notation of that
proof, and for each n let
[
An =
{s S+1 : s() = n}.
<1
S
S
S
S
Clearly An is an antichain in <1 S , and n AnS= <1 S+1 . Hence n An =
1 . It follows that some An is uncountable, so that <1 S is not a Suslin tree.
We now introduce some notions that are useful in talking about -trees; these conditions
were implicit in part of the proof of Theorem 8.3.
92
A well-pruned -tree is a -tree T with exactly one root such that for all < < ht(T )
and for all x Lev (T ) there is a y Lev (T ) such that x < y.
A normal subtree of a tree (T, <) is a tree (S, ) satisfying the following conditions:
(i) S T .
(ii) For any s1 , s2 S, s1 s2 iff s1 < s2 .
(iii) For any s, t T , if s < t and t S, then s S.
Note that each level of a normal subtree is a subset of the corresponding level of T . Clearly
a normal subtree of height of a -Aronszajn tree is a -Aronszajn tree; similarly for Suslin trees.
A tree T is eventually branching iff for all t T , the set {s T : t s} is not a chain.
Clearly a well-pruned -Aronszajn tree is eventually branching; similarly for -Suslin trees.
Theorem 8.7. If is regular, then any -tree T has a normal subtree T which is a
well-pruned -tree. Moreover, if x T and |{y T : x y}| = then we may assume
that x T .
Proof. Let be regular, and let T be a -tree. We define
S = {t T : |{s T : t s}| = }.
Clearly S is a normal subtree of T , although it may contain more than one root of T . Now
we claim
(1) Some root of T is in S.
In fact, Lev0 (T ) has size less than , and
T =
{t T : s t},
sLev0 (T )
{t Lev (T ) : u t}
<
[
vLev (T )
u<v
93
{t T : v t},
and the first big union here is the union of fewer than sets, each of size less than .
Hence there is a v Lev (T ) such that u < v and |{t T : v t}| = . So v S and
u < v, as desired.
Proposition 8.8. Let be an uncountable regular cardinal. If T is an eventually
branching -tree in which every antichain has size less than , then T is a Suslin tree.
Proof. Suppose to the contrary that C is a chain of length . We may assume that
C is maximal, so that it has elements of each level less than . For each t T choose
f (t) T such that t < f (t)
/ C; this is possible by the eventually branching hypothesis.
Now we define hs : < i by recursion, choosing
(
)
s
iff
B2
B1
B3
B1
B1
B2
B3
B2
B3
,C )
Hence
(2) If < d(B, E(B,C)), then < (B,C) , and so bB () = bE(B,C) () = bE(B ,C ) ().
Now recall that d(B, E(B,C)) < (B,C) . Hence
bB (d(B, E(B,C) )) bE(B,C) (d(B, E(B,C))) = bE(B ,C ) (d(B, E(B,C) )),
and so B < E(B ,C ) . Similarly, E(B ,C ) < C, as follows:
(3) If < d(C, E(B,C)), then < (B,C) , and so bC () = bE(B,C) () = bE(B ,C ) ().
Now recall that d(C, E(B,C)) < (B,C) . Hence
bC (d(C, E(B,C) )) bE(B,C) (d(C, E(B,C) )) = bE(B ,C ) (d(C, E(B,C))),
and so C > E(B ,C ) . Hence E(B ,C ) (B, C). But also E(B ,C ) (B , C ), contradiction.
To show that B(T, ) is not separable, it suffices to show that for each < 1 the
set {B B(T, ) : len(B) < } is not dense in B(T, ). Take any x T of height .
Since {y : y > x} has elements of every level greater than , it cannot be a chain, as
this would give a chain of size 1 . So there exist incomparable y, z > x. Similarly, there
exist incomparable u, v > y. Let B, C, D be branches containing u, v, z respectively. By
symmetry say B < C.
(4) ht(y) < d(B, C)
This holds since y B C.
(5) d(B, D) ht(y) and d(C, D) ht(y); hence d(B, D) < d(B, C) and d(C, D) < d(B, C).
In fact, y B\D, so d(B, D) ht(y) follows. Similarly d(C, D) ht(y). Now the rest
follows by (4).
(6) d(B, D) = d(C, D).
For, if d(B, D) < d(C, D), then bC (d(B, D)) = bD (d(B, D)) 6= bB (d(B, D)), contradicting
d(B, D) < d(B, C), part of (5). If d(C, D) < d(B, D), then bB (d(C, D)) = bD (d(C, D)) 6=
bC (d(C, D)), contradicting d(C, D) < d(B, C), part of (5).
(7) B < C < D or D < B < C.
In fact, otherwise we have B < D < C, and Lemma 8.9 gives the possibilities d(B, C) =
d(B, D) < d(D, C), d(B, C) = d(B, D) = d(C, D) or d(B, C) = d(D, C) < d(B, D); each
of these contradicts (5) or (6).
Case 1. B < C < D. Thus (B, D) is a nonempty open interval. Suppose that
there is some branch E with len(E) < and B < E < D. Then d(B, E), d(E, D) < . By
Lemma 8.9 one of the following holds: d(B, D) = d(B, E) < d(E, D); d(B, D) = d(B, E) =
96
d(E, D); d(B, D) = d(E, D) < d(B, E). Hence d(B, D) < . Since x B D, we have
bB = bD , contradiction.
Case 2. D < B < C. Thus (D, C) is a nonempty open interval. Suppose that there
is some branch E with len(E) < and D < E < C. Then d(D, E), d(E, C) < . By
Lemma 8.9 one of the following holds: d(D.C) = d(D, E) < d(E, C); d(D.C) = d(D, E) =
d(E, C); d(D.C) = d(E, D) < d(D, E); hence d(D, C) < . Since x C D, we have
bB = bD , contradiction.
In the other direction, we prove:
Theorem 8.11. If there is a Suslin line, then there is a Suslin tree.
Proof. Assume that there is a Suslin line. Then by Theorem 7.16 we may assume
that we have a linear order L satisfying the following conditions:
(1) L is dense, with no first or last elements.
(2) No nonempty open subset of L is separable.
(3) L is ccc.
(We do not need the other condition given in Theorem 7.16.) Let I be the collection of
all open intervals (a, b) with a < b in L. So by denseness, each such interval is nonempty.
We are now going to define a sequence hJ : < 1 i of subsets of I. Let J0 be a maximal
disjoint subset of I. Now suppose that 0 < < 1 and we have defined J for all <
so that the following conditions hold:
(4 ) The elements of J are pairwise disjoint.
S
(5 ) J is dense in L.
(6 ) If < , I J , and J J , then either I J = , or else J I.
(7 ) If < and I J , then there are at least two J J such that J I.
Note that (40 )(70 ) hold: (60 ) and (70 ) trivially hold, (40 ) holds by definition, and (50 )
holds by the maximality of J0 .
First suppose that is a successor ordinal + 1. For each M J , choose disjoint
members I1 , I2 of I such that I1 I2 M , and let KM be a maximal disjoint subset of
{K I : K M }
such
S that I1 , I2 KM . The existence of I1 and I2 is clear by denseness. Then let J =
M J KM . Clearly (4 ) holds.
S
For (5 ), suppose that a, b L and a < b. By (5 ), choose c
J such that
a < c < b. Say c (d, e) J . Thus max(a, d) < c < min(b, e). We claim:
(8) There is a K K(d,e) such that (max(a, d), min(b, e)) K 6= .
For, suppose that (8) fails. Choose u, v with max(a, d) < u < v < min(b, e). Then
(u, v) K = for all K K(d,e) and (u, v) (d, e). This contradicts the maximality of
K(d,e) . So (8) holds.
97
S
Choose K as in (8). It follows that J (a, b) 6= , as desired for (5 ).
For (6 ), suppose that , I J , and J J . Choose M J such that J KM .
Now we consider two cases.
Case 1. = . Then by (4 ), either I M = or I = M . If I M = , then I J =
since J M , as desired in (6 ). If I = M , then J I by definition.
Case 2. < . In this case, by (6 ) we have two further possibilities. If I M = ,
then also I J = , as desired in (6 ). Otherwise we have M I and clearly also J I.
(7 ) is clear by construction.
Second, suppose that is a limit ordinal. Let
K = {K I : for all < and all I J [I K = or K I}.
Before defining J , we need to know that K is nonempty. This follows from the following
stronger statement.
(9) If a, b I and a < b, then there is a K K such that K (a, b).
To prove this, suppose
S that a, b I and a < b. Let E be the collection of all endpoints
of the intervals in < J . Since is countable and each J is countable by virtue of
(4 ) and the ccc for L, it follows that E is countable. Since (a, b) is not separable,
there
S
are c, d L such that a < c < d < b and E (c, d) = . For every I < J , the
interval (c, d) does not contain either of the endpoints of I, so it follows that I (c, d) =
or (c, d) I. Hence (c, d) K and (c, d) (a, b), as desired in (9)
Now let J be a maximal pairwise disjoint subset of K. So (4 ) holds, and also (6 )
is clear.
Now to prove (5 ), take any a, b L with a < b, and choose K K such that
K (a, b), as given in (9).
S By the maximality of J , there is an L J such that
K L 6= . Hence (a, b) J 6= , as desired in (5 ).
Finally, for (7 ), let I J where < . By (7+1 ), there are two distinct J, K J+1
such that J, K I, and then the construction gives J J and K K with J , K J ,
as desired.
S
This finishes the construction. Let T = <1 J , with the ordering . So this gives
a partial order.
(10) If I T , then there is a unique < 1 such that I J .
For, by definition there is some < 1 such that I J . Suppose that also I J , with
6= . By symmetry, say that < . This contradicts (7 ).
We denote the given by (10) by I .
(11) If I T and < I , then there is a unique J J such that I J.
In fact, by (5 ) there is a J J such that I J 6= . Then by (6I ), I J. Hence by
(4I ), also J is unique.
Let I T . For each < I , let f () be the unique J given by (11). Then f is an
order-isomorphism from I onto {J T : I J} under . In fact, if < < I , then
I f () f (), and so (6 ) and 6( ) imply that f () f (). The function f maps onto,
since if I J with J J , then < by (6I ), and so f () = J.
98
Lev (T ) Lev (T );
<
E8.11. Show that if is an uncountable regular cardinal and T is a -Aronszajn tree, then
T has a subset S such that under the order induced by T , S is a well-pruned -Aronszajn
tree in which every element has at least two immediate successors.
E8.12.Let (T, <) be a -Aronszajn tree, and let be any linear order of T . For incomparable elements s, t of T let (s, t), (s, t) be the elements of T such that (s, t) s, (s, t) t,
(s, t) and (s, t) are different elements of the same height, and w < (s, t)[w < t].
We define s t iff s, t T and one of the following conditions holds:
(i) s < t.
(ii) s 6< t and t 6< s and (s, t) (s, t).
Show that (T, ) is a linear order in which every element has character (, ) with , < .
Reference
Todorcevic, S. Trees and linearly ordered sets. In Handbook of set-theoretic topology.
North-Holland 1984, 235293.
100
if = 0,
0
max(f
(),
g()
+
1)
if = + 1 for some ,
g() =
sup
if is a limit ordinal.
< g()
Clearly then g is a normal function from cf() into , with rng(g) unbounded in . By
Theorem 9.2(ii), the existence of the desired set C follows.
If cf() = , then Corollary 9.4 yields a strictly increasing function f : with rng(f )
unbounded in . Then rng(f ) is club in . The condition on limit ordinals in the definition
of club is trivial in this case. Most of our results concern limit ordinals of uncountable
cofinality.
If is any limit ordinal and < , then the interval [, ) is a club of . Another
simple fact about clubs is that if C is club in a limit ordinal of uncountable cofinality, then
the set D of all limit ordinals which are in C is also club in . (We need of uncountable
cofinality in order to have D unbounded.) Also, if C is club in with cf() > , then
the set E of all limit points of members of C is also club in . This set E is defined to be
{ < : is a limit ordinal and C is unbounded in }; clearly E C.
Now we give the first major fact about clubs.
Theorem 9.5. If is a limit ordinal with cf() > , then the intersection of fewer
than cf() clubs of is again a club.
Proof.
Suppose that < cf() and hC : < i is a system of clubs of . Let
T
D = < C . First we show that D is closed. To this end, suppose that < is a limit
ordinal, and D is unbounded in . Then for each < , the set C is unbounded in ,
and hence C since C is closed in . Therefore D.
To show that D is unbounded in , take any < ; we want to find > such that
D. We make a simple recursive construction of a sequence hn : n i of ordinals
less than . Let 0 = . Suppose that n has been defined. Using the fact that each C is
unbounded in , for each < choose n, C such that n < n, . Then let
n+1 = sup n, ;
<
102
we have n+1 < since < cf(). This finishes the recursive construction. Let =
supn n . Then < since cf() > . Clearly C is unbounded in for each < ,
and hence C . So D, as desired.
Again let be any limit ordinal, and suppose that hC : < i is a system of subsets of
. We define the diagonal intersection of this system:
< C = { : < ( C )}.
This construction is used often in discussion of clubs, in particular in the definition of
some of the large cardinals.
Theorem 9.6. Suppose that cf() > . Assume that hC : < i is a system of
clubs of .T
(i) If < C is unbounded in for each < , then < C is club in .
(ii) If is regular, then < C is club in .
Proof. Clearly (ii) follows from (i) (using Theorem 9.5 to verify the hypothesis of
(i)), so it suffices to prove (i). Assume the hypothesis of (i).
For brevity set D = < C First we show that D is closed in . So, assume that
is a limit ordinal less than , and D is unbounded in . To show that D, take any
< ; we show that C . Let E = { D : < }. Then E is unbounded in ,
and for each E we have C , by the definition of D. So C since C is closed.
Second we show that D is unbounded in . So, take any < . We define a sequence
hi : i < i of ordinals less than by recursion.
T Let 0 = . If i has been defined, by
the hypothesis of (i) let i+1 be a member of <i C which is greater than i . Finally,
let = supi i . So < since cf() > . We claim that D. To see this, take any
< . Choose i such that < i . Then j C for all j i, and hence C is
unbounded in , so C . This argument shows that D.
We give one more general fact about closed and unbounded sets; this one is frequently
useful in showing that specific sets are closed and unbounded.
A finitary partial operation on a set A is a nonempty function whose domain is a
subset of mA for some positive integer m and whose range is a subset of A. We say that a
subset B of A is closed under such an operation iff for every a (m B) dmn(f ) we have
f (a) B.
Theorem 9.7. Suppose that is an uncountable regular cardinal, X []< , and F
is a collection of finitary partial operations on , with |F | < . Then { < : X and
is closed under each f F } is club in .
Proof. Denote the indicated set by C. To show that it is closed, suppose that is a
limit ordinal less than , and C is unbounded in . To show that is closed under any
partial operation f F , suppose that dmn(f ) m and a (m ) dmn(f ). For each
def S
i < m choose i < such that ai i . Since is a limit ordinal, the ordinal = i<m i
is still less than . Since C is unbounded in , choose C such that < . Then
a m so, since C, we have f (a) . Thus is closed under f . Hence C; so
C is closed in .
103
\
D( ()+1)
104
C .
(1)
Hence we can apply Theorem 9.6(i) to infer that F = < E is club in . Hence also
the set G of all limit ordinals which are in F is club in . Choose G S. Now f () < ;
since is a limit ordinal, choose < such that f () < . But G F , so it follows by
the definition of diagonal intersection that E . From (1) we then see that
/ f 1 [].
This contradicts f () < .
For the second part of S
the theorem, assume that is regular. Note that, with as
1
in the first part, f [] = < f 1 [{}]. Hence the second part follows from the fact
mentioned above that a union of fewer than nonstationary sets is nonstationary.
To illustrate the use of Fodors theorem we give the following result about Aronszajn trees
which answers a natural question.
Theorem 9.10. Suppose that is an uncountable regular cardinal, T is a -Aronszajn
tree, and is an infinite cardinal less than . Further, suppose that x T and |{y T :
x < y}| = . Then there is an > ht(x) such that
|{y Lev (T ) : x < y}| .
Proof. By Theorem 8.7 we may assume that T is well-pruned, and by taking {y
T : x y} we may assume that x is the root of T . So now we want to find a level such
that |Lev (T )| . We assume that this is not the case. So |Lev (T )| < for all < .
Suppose that is singular. Then
[
{ < : |Lev (T )| < + },
=
<
a cardinal
def
so there is a < such that = { < : |Lev (T )| < + } has power . Because T is
well-pruned, we have |Lev (T )| |Lev (T ) whenever < . It follows that |Lev (T )| <
+ for all < , since is clearly unbounded in . Thus we may assume that is regular.
For each s T and each < ht(s) let s be the unique element of height less than
s.
Let = { < : cf() = }. So is stationary in . Now we claim
(1) For every and every s Lev (T ) there is a < such that the set {t T :
s t, ht(t) < } is a chain.
To prove this, suppose not. Thus we can choose and s Lev (T ) such that
(2) For all < there is a [, ) and a t Lev (T ) such that s < t 6= s and s+1 6 t.
Now we use (2) to construct by recursion two sequences h : < i and ht : < i.
Suppose that these have been defined for all < , where < , so that each < . Let
105
S
= < . So < since cf() = . By (2), choose [ + 1, ) and t Lev (T )
such that s < t 6= s and s +1 6 t . Since Lev (T ) has size less than , there exist ,
with < and t = t . Then s +1 s < t = t , contradiction. Hence (1) holds.
(3) For every there is a < such that for each s Lev (T ) the set {t T : s
t, ht(t) < } is a chain.
To prove this, let . By (1), for each s Lev (T ) choose s < such tha the set
{t T : ss t, s ht(t) < } is a chain. Let = supht(s)= s . Clearly is as desired
in (3).
Now for each choose f () to be a as in (3). So f is a regressive function
defined on the stationary set . Hence there is a < such that f 1 [{}] is stationary,
and hence of size . So T does not branch beyond , and hence has a branch of size
because it is well-pruned, contradiction.
For the next result we need another important construction. Suppose that is an infinite
cardinal, f = hf : < + i is a family of injections f : , and S is a cofinal subset of
+ . The (, f, S)-Ulam matrix is the function A : + P() defined for any <
and < + by
A = { S\( + 1) : f () = }.
Theorem 9.11. (Ulam) Let be an infinite cardinal, S is a stationary subset of + ,
and I a collection of subsets of + having the following properties:
(i) I.
S
(ii) If X [I] , then X I.
(iii) If Y X I, then Y I.
(iv) If < + , then {} I.
(v) S
/ I.
Then there is a system hX : < + i of subsets of S such that X X = for distinct
, < + , and X
/ I for all < + .
Proof. Let f = hf : < + i be a family of injections f : , and let A be the
(, f, S)-Ulam matrix. If < , then for distinct , < + we have A A = , since
the functions f are one-one. Moreover, for any < + we have
S\
A S ( + 1) I
<
by (ii)(iv). By conditions (ii) and (v) it then follows that for each < + there is
h()
an h() < such that A
Proof. (i): Let I be the collection of all nonstationary subsets of + . The conditions
of Theorem 9.11 are all clear, and so by it we get a system hX : < + i of subsets of S
+
such
/ I for all < + . We can union
S that X X = for distinct , < , and X
S\ <+ X with X0 to get the desired partition of S.
(ii) For each regular cardinal < , let S = { < : cf() = }. Thus S is
stationary by Proposition 9.8. By induction it is clear that if < , then +1 < .
Hence there are regular cardinals less than . Thus we have many pairwise disjoint
stationary subsets of , and these can be extended to a partition of as in the proof of
(i).
The first part of Theorem 9.12 can actually be extended to weak inaccessibles too, but the
proof is longer.
While the above results are classical and have found wide use, the following related result
has mainly been used in PCF theory, although it should find a much wider use in the
future.
Theorem 9.13. (Club guessing) Suppose that is a regular cardinal, is a cardinal
such that cf() ++ , and S = { : cf() = }. Then there is a sequence hC :
S i such that:
(i) For every S the set C is club, of order type .
(ii) For every club D there is a D S such that C D.
The sequence hC : S i is called a club guessing sequence for S .
Proof. First we take the case of uncountable . Fix a sequence C = hC : S i
such that C is club in of order type , for every S . For any club E of , let
C E = hC E : S E i,
where E = { E : E is unbounded in }. Clearly E is also club in . Also note that
C E is club in for each S E . We claim:
(1) There is a club E of such that for every club D of there is a D E S such
that C E D.
Note that if we prove (1), then the theorem follows by defining C = C E for all
E S , and C = C for S \E .
Assume that (1) is false. Hence for every club E there is a club DE such
that for every DE E S we have
C E 6 DE .
We now define a sequence hE : < + i of clubs of decreasing under inclusion, by
induction on :
(2) E 0 = .
(3) If < + is a limit ordinal and E has been defined for all < , we set E =
Since < + < cf(), E is club in .
107
<
E .
(4) If E has been defined, let E +1 be the set of all limit points of E DE , i.e., the
set of all < such that E DE is unbounded in .
T
This defines the sequence. Let E = <+ E . Then E is club in . Take any S E.
Since |C | = and the sequence hE : < + i is decreasing, there is an < + such that
C E = C E . So C E = C E +1 . Hence C E DE , contradiction.
Thus the case uncountable has been finished.
Now we take the case = . For S = S0 fix C = hC : Si so that C is club in
with order type . We denote the n-th element of C by C (n). For any club E and
any S E we define
CE = {max(E (C (n) + 1)) : n },
where again E is the set of limit points of members of E. This set is cofinal in . In fact,
given < , there is a E such that < since E , and there is an n such
that < C (n). Then < max(E (C (n) + 1)), as desired. There may be repetitions
in the description of CE , but max(E (C (n) + 1)) max(E (C (m) + 1)) if n < m, so
CE has order type . We claim
(5) There is a closed unbounded E such that for every club D there is a
D S E such that CE D. [This proves the club guessing property.]
Suppose that (5) fails. Thus for every closed unbounded E there exist a club DE
such that for every DE S E we have CE 6 D. Then we construct a descending
sequence E of clubs in as in the case > , for < T
1 . Thus for each < 1 and
E
each DE S (E ) we have C 6 DE . Let E = <1 E . Take any S E.
For n and < we have
E (C (n) + 1) E (C (n) + 1),
and so max(E (C (n) + 1)) max(E (C (n) + 1)); it follows that there is an n < 1
such that max(E (C (n) + 1)) = max(E n (C (n) + 1)) for all > n . Choose
greater than all n . Thus
(6) For all > and all n we have max(E (C (n) + 1)) = max(E (C (n) + 1)).
But there is a CE \DE ; say that = max(E (C (n) + 1)). Then = max(E +1
(C (n) + 1)) E +1 = (E DE ) DE , contradiction.
Next we introduce an important combinatorial principle and show that it implies the
existence of Suslin trees. is the following statement:
There exists a sequence hA : < 1 i of sets with the following properties:
(i) A for each < 1 .
(ii) For every subset A of 1 , the set { < 1 : A = A } is stationary in 1 .
A sequence as in is called a -sequence. Such a sequence in a sense captures all subsets
of 1 in a sequence of length 1 . Later in these notes we will show that follows from
V = L.
108
E9.5. Suppose that is regular and uncountable, and S is stationary. Also, suppose
that every S is an uncountable regular cardinal. Show that
def
T = { S : S is non-stationary in }
is stationary in . Hint: given a club C in , let C be the set of all limit points of C and
let be the least element of C S; show that T C.
For the next few exercises we use the following definition. Suppose that is an uncountable
regular cardinal and A is a set such that |A| . Then a subset X of [A]< is closed iff for
every system haS
: < i of elements of X, with < and with a a for all < < ,
also the union < a is in X. And we say that X is unbounded in [A]< iff for every
x [A]< there is a y X such that x y.
E9.6. Suppose that is an uncountable regular cardinal, |A| , and a [A]< . Show
that {x [A]< : a x} is club in [A]< .
E9.7. Suppose that is an regular cardinal > 1 and |A| . Show that {x [A]< :
|x| 1 } is club in [A]< .
E9.8. Suppose that is an uncountable regular cardinal and is a cardinal > . Show
that {x []< : x } is club in []< .
E9.9. Suppose that is an uncountable regular cardinal and |A| . Show that the
intersection of two clubs of [A]< is a club.
E9.10. Suppose that is an uncountable regular cardinal and |A| . Show that the
intersection of fewer than clubs of [A]< is a club.
If is an uncountable regular cardinal, |A| , and hXa : a Ai is a system of subsets of
[A]< , then the diagonal intersection of this system is the set
def
aA Xa =
x [A]< : x
Xa
ax
113
2n = .
(X\Z)
6= .
ZC
BK
114
root
The existence theorem for -systems that is most often used is as follows.
Theorem 10.3. If is an uncountable regular cardinal and A is a collection of finite
sets with |A | , then there is a B [A ] such that B is a -system.
Proof. First we prove the following special case of the theorem.
() If A is a collection of finite sets each of size m , with |A | = , then there is a
B [A ] such that B is a -system.
We prove this by induction on m. The hypothesis implies that m > 0. If m = 1, then
each member of A is a singleton, and so A is a collection of pairwise disjoint sets; hence
it is a -system with root . Now assume that () holds for m, and suppose that A is a
collection of finite sets each of size m + 1, with |A | = , and with m > 0. We consider
two cases.
def
Case 1. There is an element x such that C = {A A : x A} has size . Let
D = {A\{x} : A C }. Then D is a collection of finite sets each of size m, and |D| = .
Hence by the inductive assumption there is an E [D] which is a -system, say with
kernel r. Then {A {x} : A E} [A ] and it is a -system with kernel r {x}.
Case 2. Case 1 does not hold. Let hA : < i be a one-one enumeration of A .
Then from the assumption that Case 1 does not hold we get:
115
{ < : x A }.
{A A : |A| = m}.
116
Theorem 10.5. Suppose that and are cardinals, < , is regular, and
for all < , |[]< | < . Suppose that A is a collection of sets, with each A A of size
less than , and with |A | . Then there is a B [A ] which is a -system.
Proof.
(1) There is a regular cardinal such that < .
In fact, if is regular, we may take = . If is singular, then + |[]< | < , so we
may take = + .
We take as in (1). Let S = { < : is a limit ordinal and cf() = }. Then S is
a stationary subset of .
S
Let A0 S
be a subset of A of size . Now AA0 A since < . Let a be an
injection of AA0 A into , and let A be a bijection of onto A0 . Set b = a[A ] for
each < . Now if S, then |b | |b | = |A | < = cf(), so there is an
ordinal g() such that sup(b ) < g() < . Thus g is a regressive function on S. By
Fodors theorem, there exist a stationary S S and a < such that g[S ] = {}. For
each S let F () = b . Thus F () []< , and |[]< | < , so there exist an
S [S ] and a B []< such that b = B for all S .
Now we define h : < i by recursion. For any < , is a member of S such
that
(2) < for all <
S, and
(3) < for all < b .
S
Since < b < , this is possible by the regularity of .
Now let A1 = A[{ : < }] and r = a1 [B]. We claim that C D = r for distinct
C, D A1 . For, write C = A and D = A . Without loss of generality, < . Suppose
that x r. Thus a(x) B b , so by the definition of b we have x A = C.
Similarly x D. Conversely, suppose that x C D. Thus x A A , and hence
a(x) b b . By the definition of , since a(x) b we have a(x) < . So
a(x) b = B, and hence x r.
Clearly |A1 | = .
By the proof of Theorem 10.4 we also get
Theorem 10.6. Suppose that and are cardinals, < , is regular, and for all
< , |[]< | < . Suppose that hAi : i Ii is a system of sets, with each Ai of size less
than , and with |I| . Then there is a J [I] such that hAi : i Ji is an indexed
-system.
Now we give some important results of the partition calculus, which is infinitary Ramsey
theory. The basic definition is as follows:
Suppose that is a nonzero cardinal number, h : < i is a sequence of cardinals,
and , are cardinals. We also assume that 1 for all < . Then we write
(h : < i)
117
i<
and so there is some i < such that |{ < : f ({}) = i}| = i , as desired.
For the first few theorems we assume that is finite, and use the letter r instead of .
The general infinite Ramsey theorem is as follows.
Theorem 10.7. (Ramsey) If n and r are positive integers, then
( , . . . , )n .
| {z }
r times
and
Let pi < r be the constant value of gmi [{mj : j > i}]n , for each i . Hence
=
{i : pi = j};
j<r
def
{k Si : fk [i + 1]n = s},
s:[i+1]n r
def
119
{ n \n : f ({, }) = 1}.
In fact, suppose that i < n. Then i i and i i+1 = . Since n i+1 , it follows
that i
/ n and so i 6= n . So (5) holds.
Now with i < n we have n n i+1 , and hence f ({i , n }) = 1. Thus
f [{n : n }]2 {1}, as desired.
To formulate another generalization of Ramseys theorem it is convenient to introduce a
notation for a special form of the arrow notation. We write
() iff
(h : < i)
In direct terms, then, () means that for every f : [] there is a [] such
that |f []| = 1.
The following cardinal notation is also needed for our next result: for any infinite
cardinal we define
20 = ;
2n+1 = 2(2n )
for all n .
Theorem 10.12. (Erdos-Rado) For every infinite cardinal and every positive integer n, (2n1 )+ (+ )n .
Proof. Induction on n. For n = 1 we want to show that + (+ )1 , and this is
obvious. Now assume the statement for n 1, and suppose that f : [(2n )+ ]n+1 . For
each (2n )+ define F : [(2n )+ \{}]n by setting F (x) = f (x {}).
(1) There is an A [(2n )+ ]2n such that for all C [A]2n1 and all u (2n )+ \C there is a
v A\C such that Fu [C]n = Fv [C]n .
To prove this, we define a sequence hA : <S(2n1 )+ i of subsets of (2n )+ , each of size
2n . Let A0 = 2n , and for limit let A = < A . Now suppose that A has been
element in common with each equivalence class. Let A+1 = A {KC : C [A ]2n1 }.
Since (2n )2n1 = 2n , we still have |A+1 | = 2n . This finishes the construction. Clearly
def S
A = (2 )+ A is as desired in (1).
n1
Take A as in (1), and fix a (2n )+ \A. We now define a sequence hx : < (2n1 )+ i
def
Now by the inductive hypothesis there is an H [X] such that G is constant on [H]n .
By the above, f is constant on [H]n+1 .
Corollary 10.13. (2 )+ (+ )2 for any infinite cardinal .
We close this chapter with some more non-arrow results.
Theorem 10.14. For any infinite cardinal we have 2 6 (3)2 .
Proof. Define F : [ 2]2 by setting F ({f, g}) = (f, g) for any two distinct
f, g 2. If {f, g, h} is homogeneous for F with f, g, h distinct, let = (f, g). Then
f (), g(), h() are distinct members of 2, contradiction.
Corollary 10.15. For any infinite cardinal we have 2 6 (+ )2 .
Our final result in the partition calculus indicates that infinite exponents are in general
hopeless.
Theorem 10.16. 6 (, ) .
Proof. Let < well-order [] . We define for any X []
F (X) =
Suppose that H [] is homogeneous for F . Let X be the <-least element of [H] . Thus
F (X) = 1. So we must have F (Y ) = 1 for all Y [H] . Write H = {mi : i } without
repetitions. For each k let
Ik = {m0 , m2 , . . . , m2k } {m2i+1 : i }.
Thus these are infinite subsets of H. Choose k0 so that Ik0 is minimum among all of the
Ik s. Then Ik0 Ik0 +1 and Ik0 < Ik0 +1 , so F (Ik0 +1 ) = 0, contradiction.
We close this chapter with the following theorem of Comfort and Negrepontis.
Theorem 10.17. Suppose that = < . Then there is a system hf : < 2 i of
members of such that
<
M 2
g M < M [f () = g()].
Proof. Let
F = (F, G, s) : F []< , G [P(F )]< , and s G .
Now if F []< , say |F | = , then
[P(F )]< (2 )< | ( )< < = ,
122
s(k() F )
0
if k() F G,
otherwise.
Now to prove that hf : < 2 i is as desired, suppose that M [2 ]< and g M . For
distinct members , of M choose (, ) k()k(). Then let
F = {(, ) : , M, 6= }
and G = {k() F : M }.
2 such that F = {f F : g f } is
F are compatible. Thus the original set F
P has ccc. This argument is rather typical of
MA().
So, by MA(2 ) let G be a filter on P which intersects each of the sets Dn and Eh . Let
S
k = G.
(*) k : .
In fact, k is obviously a relation. Suppose that (m, ), (m, ) k. Choose f, g G such
that (m, ) f and (m, ) g. Then choose s G such that s f, g. So f, g s, and s
is a function. It follows that = . Thus k is a function.
If n , choose f G Dn . So n dmn(f ), and so n dmn(k). So we have proved
().
Now take any f G Ek . Choose n dmn(f ) such that f (n) 6= k(n). But f k,
contradiction.
There is one more fact concerning the definition of MA which should be mentioned.
Namely, for > the assumption of ccc is essential in the statement of MA(). (Recall our comment above that ccc is not needed in order to prove that MA() holds.) To
see this, define
P = {f : f is a finite function, dmn(f ) , and rng(f ) 1 };
f g iff f, g P and f g;
P = (P, ).
This example is similar to two of the partial orders above. Note that P does not have ccc,
since for example {{(0, )} : < 1 } is an uncountable antichain. Defining Dn as in the
proof of Theorem 11.3, we clearly get dense subsets of P. Also, for each < 1 let
F = {f P : rng(f )}.
Then F is dense in P. For, suppose that g P . If rng(g), then g itself is in F , so
suppose that
/ rng(g). Choose n \dmn(g). Let f = g {(n, )}. Then f F and
f g, as desired. Now if MA(1 ) holds without the assumption of ccc, then we can apply
it to our present partial order. Suppose that G is a filter on P which intersects each of
def S
these sets Dn and F . As in the proof of Theorem 11.3, k = G is a function mapping
into 1 . For any < 1 choose f G F . Thus rng(f ), and so rng(k). Thus
k has range 1 . This is impossible.
Now we proceed beyond the discussion of the definition of MA in order to give several
typical applications of it. First we consider again almost disjoint sets of natural numbers.
Our result here will be used to derive some important implications of MA for cardinal
arithmetic. We proved in Theorem 8.1 that there is a family of size 2 of almost disjoint
sets of natural numbers. Considering this further, we may ask what the size of maximal
almost disjoint families can be; and we may consider the least such size. This is one
of many min-max questions concerning the natural numbers which have been considered
recently. There are many consistency results saying that numbers of this sort can be less
than 2 ; in particular, it is consistent that there is a maximal family of almost disjoint
subsets of which has size less than 2 . MA, however, implies that this size, and most of
the similarly defined min-max functions, is 2 .
127
Let A P(). The almost disjoint partial order for A is defined as follows:
PA = {(s, F ) : s []< and F [A ]< };
(s , F ) (s, F ) iff s s , F F , and x s s for all x F ;
PA = (PA , ).
We give some useful properties of this construction.
Lemma 11.4. Let A P().
(i) PA is a partial order.
(ii) Let (s, F ), (s, F ) PA . Then the following conditions are equivalent:
(a) (s, F ) and (s , F ) are compatible.
(b) x F (x s s) and x F (x s s ).
(c) (s s , F F ) (s, F ), (s, F ).
(iii) Suppose that x A , and let Dx = {(s, F ) PA : x F }. Then Dx is dense in
PA .
(iv) PA has ccc.
Proof. (i): Clearly is reflexive on PA and it is antisymmetric, i.e. (s, F )
(s , F ) (s, F ) implies that (s, F ) = (s , F ). Now suppose that (s , F ) (s , F )
(s, F ). Thus s s s , so s s . Similarly, F F . Now take any x F . Then
x F , so x s s because (s , F ) (s , F ). Hence x s x s . And x s s
because (s , F ) (s, F ). So x s s, as desired.
(ii): For (a)(b), assume (a). Choose (s , F ) (s, F ), (s, F ). Now take any x F .
Then x s x s since s s , and x s s since (s , F ) (s, F ); so x s s .
The other part of (b) follows by symmetry.
(b)(c): By symmetry it suffices to show that (s s , F F ) (s, F ), and for this
we only need to check the last condition in the definition of . So, suppose that x F .
Then x (s s ) = (x s) (x s ) s by (b).
(c)(a): Obvious.
(iii): For any (s, F ) PA , clearly (s, F {x}) (s, F ).
(iv) Suppose that h(s , F ) : < 1 i is a pairwise incompatible system of elements of
PA . Clearly then s 6= s for distinct , < 1 , contradiction.
Theorem 11.5. Let be an infinite cardinal, and assume MA(). Suppose that
A , B P(), and |A |, |B| . Also assume that
S
(i) For all y B and all F [A ]< we have |y\ F | = .
Then there is a d such that |d x| < for all x A and |d y| = for all y B.
Proof. For each y B and each n let
Eny = {(s, F ) PA : s y 6 n}.
We S
claim that each such set is dense. For,
S suppose that (s, F ) PA . Then by assumption,
|y\ F | = , so we can pick m y\ F such that m > n. Then (s {m}, F ) (s, F ),
128
A
\F
;
then
a
F =
T
S
(a
b)
is
finite.
Thus
\
F
is
infinite.
Hence
we
can
apply
Theorem
11.5
to
A
and
bF
def
Proof. Assume MA, and suppose that < 2 . Then 2 = 2 by Theorem 11.9,
and so cf(2 ) = cf(2 ) > by Corollary 4.411.
Another important application of Martins axiom is to the existence of Suslin trees; in fact,
Martins axiom arose out of the proof of this theorem:
Theorem 11.11. MA(1 ) implies that there are no Suslin trees.
Proof. Suppose that (T, ) is a Suslin tree. By 8.7 and the remarks before it, we may
assume that T is well-pruned. We are going to apply MA(1 ) to the partial order (T, ),
i.e., to T turned upside down. Because T has no uncountable antichains in the tree sense,
(T, ) has no uncountable antichains in the incompatibility sense. Now for each < 1
let
D = {t T : ht(t, T ) > }.
Then each D is dense in (T, ). For, suppose that s T . By well-prunedness, choose
t T such that s < t and ht(t, T ) > . Thus t D and t > s, as desired.
Now we let G be a filter on (T, ) which intersects each D . Any two elements of
G are compatible in (T, ), so they are comparable in (T, ). Since G D 6= for all
< 1 , G has a member of T of height greater than , for each < 1 . Hence G is an
uncountable chain, contradiction.
Our last application of Martins axiom involves Lebesgue measure. In order not to assume
too much about measures, we give some results of measure theory that will be used in our
application but may have been omitted in your standard study of measure theory.
Lemma 11.12. Suppose that is a measure and E, F, G are -measurable. Then
(EF ) (EG) + (GF ).
Proof.
(EF ) = (E\F ) + (F \E)
= ((E\F ) G) + ((E\F )\G) + (F \E) G) + ((F \E)\G)
(G\F ) + (E\G) + (G\E) + (F \G)
= (EG) + (GF ).
Lemma 11.13. If E is Lebesgue measurable with finite measure, then for any > 0
there is an open set U E such that (E) S(U ) (E)+. Moreover,
there is a system
P
hKj : j < i of open intervals such that U = j< Kj and (U ) j< (Kj ) (E)+.
Proof. By the basic definition of Lebesgue measure,
(
X
(E) = inf
(Ij ) : hIj : j i is a sequence of half-open intervals
j
such that E
Ij .
130
X
[
(Ij ) (E) + .
Ij
2
j
j
j Ij
Kj = aj j+2 , bj ;
2
[
Kj and
E
then
Kj
(Kj )
X
+
(I
)
j
2j+2
j
X
X
=
(Ij )
+
j+2
2
j
j
=
+ (E) + = (E) + .
2
2
Proof. By Lemma 11.13 let U E be open such that
S (E) (U ) P(E) + 2 .
Then choose C and B as in Lemma 11.14(ii). Let W = B.
P So (W ) =
P IB (I).
Then choose m Pand hIi : i < mi elements
of
B
such
that
(I)
IB
S
P
S i<m (Ii )
. Now (W ) =
IB (I) and ( i<m Ii ) =
i<m (Ii ). Let V =
i<m Ii . Thus
2
(W ) (V ) 2 . Hence V W U , and
(EV ) (EU ) + (U W ) + (W V )
= (U \E) + (C) + (W \V )
= (U ) (E) + (W ) (V )
+ = .
2 2
Now we are ready for an application of Martins axiom to Lebesgue measure.
Theorem 11.16. Suppose that is an infinite cardinal and MA() holds.
If hM :
S
< i is a system of subsets of R each of Lebesgue measure 0, then also < M has
Lebesgue measure 0.
S
Proof. Let > 0. We are going to find an open set U such that < M U and
(U ) ; this will prove our result. Let
P = {p R : p is open and (p) < }.
The ordering, as usual, is .
(1) Elements p, q P are compatible iff (p q) < .
In fact, the direction is clear, while if p and q are compatible, then there is an r P
with r p, q, hence p q r and (r) < , hence (p q) < .
Next we check that P has ccc. Suppose that hp : < 1 i is a system of pairwise
incompatible elements of P. Now
1 =
[
n
1
< 1 : (p )
n+1
1
so there exist an uncountable 1 and a positive integer m such that (p ) m
for all . Let C be the collection of all finite unions of open intervals with rational
coefficients. Note that C is countable. By Lemma 11.15, for each let C be a
1
. Now take any two distinct members , .
member of C such that (p C ) 3m
Then
1
(p p ) = (p p ) + (p p )
+ (p p ),
m
and hence (p p )
1
m.
1
1
1
(p p ) (p C ) + (C C ) + (C p )
+ (C C ) +
;
m
3m
3m
132
1
. It follows that C 6= C . But this means that hC : i is a
Hence (C C ) 3m
one-one system of members of C , contradiction. So P has ccc.
Now for each < let
D = {p P : M p}.
To show that D is dense, take any p P. Thus (p) < . By Lemma 11.13, let U be an
open set such that M U and (U ) < (p). Then (p U ) (p) + (U ) < ; so
p U D and p U p, as desired.
S
Now let G be a filter on P which intersects each D . Set V = G. So V is an open
set. For each < , choose p G D . Then M p V . It remains only to show
that (V ) S
. Let B be the set of all open intervals with rational endpoints. We claim
that V = (G B). In fact, is clear, so suppose that x V . Then x p for some
p G, hence there is a U B such that x U p, since p is open. Then U G since
G isS
a filter and the partial order is . So we found a U G B such thatSx U ; hence
x (G B). This provesSour claim. Now if F is a finite subset of G, then F G since
G is a filter. In particular, F P, so its measure
is less than . Now G B is countable;
S
let hpi : i S
i enumerate
it. Define qi =S
pi \ j<i pj for
S
S all i . Then by induction one
sees that i<m pi = i<m qi , and hence (G B) = i< qi . So
[
(V ) =
(G B) =
qi
i<
X
i<
(qi ) = lim
X
i<m
(qi ) = lim
m
[
i<m
qi
= lim
m
pi
i<m
EXERCISES
11.1. Assume MA(). Suppose that X is a compact Hausdorff space, and any pairwise
disjoint collection of open
T sets in X is countable. Suppose that U is dense open in X for
each < . Show that < U 6= .
11.2. A partial order P is said to have 1 as a precaliber iff for every system hp : < 1 i
of elements of P there is an X [1 ]1 such that for every finite subset F of X there is a
q P such that q p for all F .
Show that MA(1 ) implies that every ccc partial order P has 1 as a precaliber.
Hint: for each < 1 let
W = {q P : > (q and p are compatible)}.
Show that there is an < 1 such that W = W for all > , and apply MA(1 ) to
W .
11.3. Call a topological space
ofQpairwise disjoint open sets in
Q X ccc iff every collection
<
X is countable. Show that iI Xi is ccc iff F [I] [ iF Xi is ccc]. Hint: use the
-system theorem.
133
11.4 Assuming MA(1 ), show that any product of ccc spaces is ccc.
11.5. Assume MA(1 ). Suppose that P and Q are ccc partially ordered sets. Define on
P Q by setting (a, b) (c, d) iff a c and b d. Show that < is a ccc partial order on
P Q. Hint: use exercise 11.1.
11.6. We define < on by setting f < g iff f, g and nm > n(f (m) < g(m).
Suppose that MA() holds and F [ ] . Show that there is a g such that f < g
for all f F . Hint: let P be the set of all pairs (p, F ) such that p is a finite function
mapping a subset of into and F is a finite subset of F . Define (p, F ) (q, G) iff
q p, G F , and
f Gn dmn(p)\dmn(q)[p(n) > f (n)].
11.7. Let B [] be almost disjoint of size , with < 2 . Let A B with A
countable. Assume MA(). Show that there is a d such that |d x| < for all x A ,
and |x\d| < for all x B\A . Hint: Let hai : i i enumerate A . Let
P = {(s, F, m) : s []< , F [B\A ]< , and m };
(s , F , m ) (s, F, m) iff s s , F F , m m , and
"
!
#
[
x F
x\
ai s s .
im
Show that P satisfies ccc. To apply MA(), one needs various dense sets. The most
complicated is defined as follows. Let D = {(s, F, m, i, n) : (s, F, m) P, i < m, and
n ai \s}. Clearly |D| = . For each (s, F, m, i, n) D let
E(s,F,m,i,n) = {(s , F , m ) P : (s, F, m) and (s , F , m ) are incompatible
or (s , F , m ) (s, F, m) and n s }.
11.8. [The condition that A is countable is needed in exercise 7.] Show that there exist
A , B such that B is an almost disjoint family of infinite subsets of , A B, |A | =
|B\A | = 1 , and there does not exist a d such that |x\d| < for all x A , and
|x d| < for all x B\A . Hint: construct A = {a : < 1 } and B\A = {b :
< 1 } by constructing a , b inductively, making sure that the elements are infinite and
pairwise almost disjoint, and also a b = , while for 6= we have a b 6= .
T
11.9. Suppose that A is a family of infinite subsets of such that F is infinite for every
finite subset F of A . Suppose that |A | . Assuming MA(), show that there is an
infinite X such that X\A is finite for every A A . Hint: use Theorem 11.5.
11.10. Show that MA() is equivalent to MA() restricted to ccc partial orders of cardinality . Hint: Assume the indicated special form of MA(), and assume given a ccc
partially ordered set P and a family D of at most dense sets in P ; we want to find a
filter on P intersecting each member of D. We introduce some operations on P . For each
D D define fD : P P by setting, for each p P , fD (p) to be some element of D
which is p. Also we define g : P P P by setting, for all p, q P ,
p if p and q are incompatible,
g(p, q) =
r with r p, q if there is such an r.
134
Here, as in the definition of fD , we are implicitly using the axiom of choice; for g, we
choose any r of the indicated form.
We may assume that D 6= . Choose D D, and choose s D. Now let Q be the
intersection of all subsets of P which have s as a member and are closed under all of the
operations fD and g. We take the order on Q to be the order induced from P . Apply the
special form to Q.
11.11. Define x y iff x, y , x\y is finite, and y\x is infinite. Assume MA(), and
suppose that L, <) is a linear ordering of size at most . Show that there is a system
hax : x Li of infinite subsets of such that for all x, y L, x < y iff ax ay . Hint: let
P consist of all pairs (p, n) such that n , p is a function whose domain is a finite subset
of L, and x dmn(p)[p(x) n]. Define (p, n) (q, m) iff m n, dmn(q) dmn(p),
x dmn(q)[p(x) m = q(x)], and x, y dmn(q)[x < y p(x)\p(y) m].
For the remaining exercises we use the following definitions.
a b
a b
iff
a\b is finite;
iff
incomparable elements of T , then let (x, y) be the smallest ordinal min(ht(x), ht(y))
such that x 6= y . Let < be a well-order of T . Then we define, for any distinct x, y T ,
x < y
iff
We claim that this gives a linear order of T . To prove transitivity, suppose that x < y <
z. Then there are several possibilities. These are illustrated in diagrams below.
Case 1. x < y < z. Then x < z, so x < z.
Case 2. x < y, while y and z are incomparable, with y (y,z) < z (y,z) .
Subcase 2.1. ht(x) < (y, z). Then x = xht(x) = y ht(x) = z ht(x) so that x < z,
hence x < z.
Subcase 2.2. (y, z) ht(x). Then x and z are incomparable. In fact, if z < x
then z < y, contradicting the assumption that y and z are incomparable; if x z, then
y ht(x) = x = xht(x) = z ht(x) , contradiction. Now if < (x, z) then y = x = z ; it
follows that (x, z) (y, z). If < (y, z) then ht(x), and hence x = y = z ; this
shows that (y, z) (x, z). So (y, z) = (x, z). Hence x(x,z) = y (x,z) = y (y,z) <
z (y,z) = z (x,z) , and hence x < z.
Case 3. x and y are incomparable, and y < z. Then x and z are incomparable. Now
if < (x, y), then x = y = z ; this shows that (x, y) (x, z). Also, x(x,y) <
y (x,y) = z (x,y) , and this implies that (x, z) (x, y). So (x, y) = (x, z). It follows
that x(x,z) = x(x,y) < y (x,y) = z (x,z) , and hence x < z.
Case 4. x and y are incomparable, and also y and z are incomparable. We consider
subcases.
Subcase 4.1. (y, z) < (x, y). Now if < (y, z), then x = y = z ; so
(y, z) (x, z). Also, x(y,z) = y (y,z) < z (y,z) , so that (x, z) (y, z). Hence
(x, z) = (y, z), and x(x,z) = y (y,z) < z (y,z) , and hence x < z.
Subcase 4.2. (y, z) = (x, y). Now x(x,y) < y (x,y) = y (y,z) < z (y,z) =
z (x,y) . It follows that (x, z) (x, y). For any < (x, y) we have x = y = z since
(y, z) = (x, y). So (x, y) = (x, z). Hence x(x,z) = x(x,y) < y (x,y) = y (y,z) <
z (y,z) = z (x,z) , so x < z.
Subcase 4.3. (x, y) < (y, z). Then x(x,y) < y (x,y) = z (x,y) , and if < (x, y)
138
x
Case 1
x
Subcase 2.1
y
Subcase 4.1
Subcase 2.2
x
Case 3
x
Subcase 4.2
Subcase 4.3
Vt ;
ht(t)=
since there are fewer than elements of height less than , this set has size , and so there
is a t such that ht(t) = and |Vt | = . We claim that t B. To prove this, take any
x Vt such that t < x. Suppose that a L and x a. Choose y Vt with a < y and
t < y. Then t < x, t < y, and x a < y. If x = a, then t a, as desired. If x < a,
then t < a by (*).
This finishes the case in which L has a subset of order type . The case of order type
is similar, but we give it. So, suppose that L has order type . Define
B = {t T : x La L[a x t a]}.
We claim that B is a chain in T of size . Suppose that t0 , t1 B with t0 6= t1 , and choose
x0 , x1 L correspondingly. Say wlog x0 < x1 . Now t0 B and x0 x0 , so t0 x0 . and
t1 B and x0 x1 , so t1 x0 . So t0 and t1 are comparable.
139
Now let < ; we show that B has an element of height . For each t of height let
Vt = {x L : t x}. Then
[
{x L : ht(x) } =
Vt ;
ht(t)=
since there are fewer than elements of height less than , this set has size , and so there
is a t such that ht(t) = and |Vt | = . We claim that t B. To prove this, take any
x Vt such that t < x. Suppose that a L and a x. Choose y Vt with y < a and
t < y. Then t < x, t < y, and y < a x. If a = x, then t < a, as desired. If a < x,
then t < a by (*).
Theorem 12.7. For any uncountable cardinal the following conditions are equivalent:
(i) is weakly compact.
(ii) is inaccessible, and it has the linear order property.
(iii) is inaccessible, and it has the tree property.
(iv) For any cardinal such that 1 < < we have ()2 .
Proof. (i)(ii): Assume that is weakly compact. First we need to show that is
inaccessible.
P
To show that is regular, suppose to the contrary that = < , where <
and < for each
S < . By the definition of infinite sum of cardinals, it follows that
we can write = < M , where |M | = for each < and the M s are pairwise
disjoint. Define f : []2 2 by setting, for any distinct , < ,
n
0 if , M for some < ,
f ({, }) =
1 otherwise.
Let H be homogeneous for f of size . First suppose that f [[H]2 ] = {0}. Fix 0 H, and
say 0 M . For any H we then have M also, by the homogeneity of H. So
H M , which is impossible since |M | < . Second, suppose that f [[H]2 ] = {1}. Then
any two distinct members of H lie in distinct M s. Hence if we define g() to be the
< such that M for each H, we get a one-one function from H into , which
is impossible since < .
To show that is strong limit, suppose that < but 2 . Now by Theorem
10.9 we have 2 6 (+ , + )2 . So choose f : [2 ]2 2 such that there does not exist an
+
X [2 ] with f [X]2 constant. Define g : []2 2 by setting g(A) = f (A) for any
+
A []2 . Choose Y [] such that g [Y ]2 is constant. Take any Z [Y ] . Then
f [Z]2 is constant, contradiction.
So, is inaccessible. Now let (L, <) be a linear order of size . Let be a well order
of L. Now we define f : [L]2 2; suppose that a, b L with a b. Then
n
0 if a < b,
f ({a, b}) =
1 if b > a.
Let H be homogeneous for f and of size . If f [[H]2 ] = {0}, then H is well-ordered by <.
If f [[H]2 ] = {1}, then H is well-ordered by >.
140
if there is no such .
Now eventually the second condition here must hold, as otherwise hs : < i would
be a one-one function from into {t : < }, which is impossible. Take the least
such that s() = , and let t = s . This finishes the construction of the t s. Let
T = {t : < }, with the partial order . Clearly this gives a tree.
By construction, if < and < dmn(t ), then t T . Thus the height of an
element t is dmn(t ).
(2) The sequence ht : < i is one-one.
In fact, suppose that < and t = t . Say that dmn(t ) = . Then t = t = t ,
and the construction of t gives something with domain greater than , contradiction.
Thus (2) holds, and hence |T | = .
(3) The set of all elements of T of level < has size less than .
In fact, let U be this set. Then
|U |
= <
<
since is inaccessible. So (3) holds, and hence, since |T | = , T has height and is a
-tree.
(4) If t t , then < and F ({, }) = t (dmn(t )).
This is clear from the definition.
Now by the tree property, there is a branch B of size . For each < let
H = { < : t B and t
hi B}.
We claim that each H is homogeneous for F . In fact, take any distinct , H . Then
t , t B. Say t t . Then < , and by construction t (dmn(t )) = F ({, }). So
F ({, }) = by the definition of H , as desired. Now
{ < : t B} =
[
<
141
{ < : t H },
_ _
(d = c ) .
{d 6= c : < }
X <
If []< , let A be the set of all < such that d = c is in . So |A| < . Take any
\A, and consider the structure M = (, , )<. There is a X with < , and
this shows that M is a model of .
Thus every subset of of size less than has a model, so has a model; but this is
clearly impossible.
(2) is strong limit.
In fact, suppose not; let < with 2 . We consider the language with distinct
individual constants c , di for all < and i < 2. Let be the following set of sentences
in this language:
(
) (
)
^
_
[(c = d0 c = d1 ) d0 6= d1 ]
(c 6= df() ) : f 2 .
<
<
142
<
g()
Clearly every subset of of size less than has a model. Hence has a model N =
(A, < , S , at , b)<,tT . For each < choose e S with e < b. Now the following
sentence holds in M and hence in N :
_
x P x
(x = cs ) .
sLev (T )
Hence for each < we can choose t() T such that ea = at() . Now the sentence
x, y, z[x < z y < z x and y are comparable]
holds in M , and hence in N . Now fix < < . Now e , e < b, so it follows that e and
e are comparable under . Hence at() and at() are comparable under . It follows
that t() and t() are comparable under . So t() < t(). Thus we have a branch of
size .
Now suppose that is weakly compact. Let L be an L -language with at most
symbols, and suppose that is a set of sentences in L such that every subset of of
size less than has a model M . We will construct a model of by modifying Henkins
proof of the completeness theorem for first-order logic.
First we note that there are at most formulas of L. This is easily seen by the
following recursive construction of all formulas:
F0 = all atomic formulas;
F+1 = F { : F }
[
F =
F for limit.
n_
o
: [F ]< {x : F , x of length < };
<
143
By induction, |F | for all , and F is the set of all formulas. (One uses that is
inaccessible.)
Expand L to L by adjoining a set C of new individual constants, with |C| = . Let
be the set of all subformulas of the sentences in . Let h : < i list all sentences
of L which are of the form x (x) and are obtained from a member of by replacing
variables by members of C. Here x is a one-one sequence of variables of length less than
; say that x has length . Now we define a sequence hd : < i; each d will be a
sequence of members of C of length less than . If d has been defined for all < , then
[
<
has size less than . We then let d be a one-one sequence of members of C not in this
set; d should have length . Now for each let
= {x (x) (d ) : < }.
Note that if < . Now we define for each []< and each a
model N of . Since 0 = , we can let N0 = M . Having defined N , since
the range of d consists of new constants, we can choose denotations of those constants,
expanding N to N+1
, so that the sentence
x (x) (d )
S
holds in N+1
. For limit we let N = < N .
It follows that N is a model of . So each subset of of size less than
has a model.
It suffices now to find a model of in the language L . Let h : < i be
an enumeration of all sentences obtained from members of by replacing variables by
members of C, each such sentence appearing times. Let T consist of all f satisfying the
following conditions:
(3) f is a function with domain < .
(4) < [( f () = ) and
/ f () = )].
(5) rng(f ) has a model.
Thus T forms a tree .
(6) T has an element of height , for each < .
def
144
if P |= ,
if P |= .
iff
R(0 , . . . , m1 ) .
Lemma 12.9. Let A be a set of infinite cardinals such that for every regular cardinal
, the set A is non-stationary in . Then there is a one-one regressive function with
domain A.
def S
Proof. We proceed by induction on =
A. Note that is a cardinal; it is 0 if
A = . The cases = 0 and = are trivial, since then A = or A = {} respectively.
Next, suppose that is a successor cardinal
+ . Then A = A {+ } for some set A
S
+
of infinite cardinals less than . Then A < + , so by the inductive hypothesis there
is a one-one regressive function f on A . We can extend f to A by setting f (+ ) = , and
so we get a one-one regressive function defined on A.
Suppose that is singular. Let h : < cf()i be a strictly increasing continuous
sequence of infinite cardinals with supremum , with cf() < 0 . Note then that for every
cardinal < , either < 0 or else there is a unique < cf() such that < +1 .
For every < cf() we can apply the inductive hypothesis to A to get a one-one
regressive function g with domain A . We now define f with domain A. In case
cf() = we define, for each A,
g0 () + 2
+ g+1 () + 1
f () =
1
0
if
if
if
if
if
< 0 ,
< < +1 ,
= +1 ,
= 0 ,
= A.
Here the addition is ordinal addition. Clearly f is as desired in this case. If cf() > , let
h : < cf()i be a strictly increasing sequence of limit ordinals with supremum cf().
Then we define, for each A,
g () + 1
if < 0 ,
0
+ g+1 () + 1 if < < +1 ,
f () =
if = ,
0
if = A.
Clearly f works in this case too.
Finally, suppose that is a regular limit cardinal. By assumption, there is a club C
in such that C A = . We may assume that C = . Let h : < i be the
strictly increasing enumeration of C. Then we define, for each A,
if < 0 ,
g0 () + 1
f () = + g+1 () + 1 if < < +1 ,
0
if = A.
Clearly f works in this case too.
Lemma 12.10. Suppose that is weakly compact, and S is a stationary subset of .
Then there is a regular < such that S is stationary in .
Proof. Suppose not. Thus for all regular < , the set S is non-stationary in
. Let C be the collection of all infinite cardinals less than . Clearly C is club in , so
146
Y
T = s : < s
and s is one-one .
<
For every < the set S is non-stationary in every regular cardinal, and hence by
Lemma 12.9 there is a one-one regressive function s with domain S . Now S =
{ : < }. Hence s T .
Clearly T forms a tree of height under . Now for any < ,
Y
<
sup
!||
< .
<
B is a one-one
Theorem 12.11. Every weakly compact cardinal is Mahlo, hyper-Mahlo, hyper-hyperMahlo, etc.
Proof. Let be weakly compact. Let S = { < : is regular}. Suppose that C
is club in . Then C is stationary in , so by Lemma 12.10 there is a regular < such
that C is stationary in ; in particular, C is unbounded in , so C since C is
closed in . Thus we have shown that S C 6= . So is Mahlo.
Let S = { < : is a Mahlo cardinal}. Suppose that C is club in . Let
To prove the second assertion, let A be a collection of subsets of X with fip, and let
C be the collection of all proper filters on X which contain A . Clearly the filter generated
by A is proper, so C 6= . We consider C as a partially ordered set S
under inclusion.
Any subset D of C which is a chain has an upper bound in C , namely D, as is easily
checked. So by Zorns lemma C has a maximal member F . By Proposition 12.13, F is an
ultrafilter.
Let X be an infinite set, and let be T
an infinite cardinal. An ultrafilter F on X is <
complete iff for any A [F ] we have A F . We also say -complete synonomously
with 1 -complete.
This notion is clearly a generalization of one of the properties of ultrafilters. In fact, every
ultrafilter is -complete, and every principal ultrafilter is -complete for every infinite
cardinal .
Lemma 12.15. Suppose that X is an infinite set, F is an ultrafilter
on X, and is
T
the least infinite cardinal such that there is an A [F ] such that A
/ F . Then there
is a partition P of X such that |P| = and X\Y F for all Y P.
T Proof. Let hY : < i enumerate A . Let Z0 = X\Y0 , and for > 0 let Z =
( < Y )\Y
T . Note that Y X\Z , and so X\Z F . Clearly Z Z = for 6= .
Let W = < Y . Clearly W Z = for all < . Let
P = ({Z : < } {W })\{}.
So P is a partition of X and X\Z F for all Z P. Clearly |P| . If |P| < , then
=
(X\Z) F,
ZP
contradiction. So |P| = .
Theorem 12.16. Suppose that is the least infinite cardinal such that there is a
nonprincipal -complete ultrafilter F on . Then F is -complete.
Proof. Assume the
T hypothesis, but suppose that F is not -complete. So there is a
A [F ]< such that A
/ F . Hence by Lemma 12.15 there is a partition P of such
that |P| < and X\P F for every P P. Let hP : < i be a one-one enumeration
of P, an infinite cardinal. We are now going to construct a nonprincipal -complete
ultrafilter G on , which will contradict the minimality of .
Define f : by letting f () be the unique < such that P . Then we
define
G = {D : f 1 [D] F }.
We check the desired conditions for G.
/ G, since f 1 [] =
/ F . If D G and D E,
1
1
1
1
then f [D] F and f [D] f [E], so f [E] F and hence E G. Similarly, G
is closed under . Given D , either f 1 [D] F or f 1 [\D] = \f 1 [D] F , hence
D G or \D G. So G is an ultrafilter on . It is nonprincipal, since for any <
149
we have f 1 [{}] = P
/ F and hence {}
/ G. Finally, G is -complete, since if D is a
countable subset of G, then
h\ i
\
1
f
D =
f 1 [P ] F,
P D
and hence
D G.
contradiction.
Next, is strong limit. For, suppose that < and 2 . Let S [ 2] . Let
hf : < i be a one-one enumeration of S. Now for each < , one of the sets
{ < : f () = 0} and { < : f () = 1} is in U , so we can let () 2 be such that
{ < : f () = ()} U . Then
\
{ < : f () = ()} U ;
<
Now by regularity of we have |{t T : ht(t, T ) < }| < , and so the complement of
this set is in U , and then (1) yields
[
(2)
{s T : t s} U.
tLev (T )
150
Xi ) =
Let be an infinite cardinal. Then is -additive iff for every system hX : < i of
nonempty pairwise disjoint sets, wich < , we have
!
[
X
X =
(X ).
<
<
Here this sum (where the index set might be uncountable), is understood to be
sup
F ,
F finite
(X ).
E12.5. Suppose that is a measure on a set S. A subset A of S is a -atom iff (A) > 0
and for every X A, either (X) = 0 or (X) = (A).
Show that if is a realvalued measurable cardinal, is a -additive measure on , and A is a -atom, then
{X A : (X) = (A)} is a -complete nonprincipal ultrafilter on A. Conclude that is
a measurable cardinal if there exist such and A.
E12.6. Prove that if is real-valued measurable then either is measurable or 2 .
Hint: if there do not exist any -atoms, construct a binary tree of height at most 1 .
E12.7. Let be a regular uncountable cardinal. Show that the diagonal intersection of
the system h( + 1, ) : < i is the set of all limit ordinals less than .
E12.8. Let F be a filter on a regular uncountable cardinal . We say that F is normal
iff it is closed under diagonal intersections. Suppose that F is normal, and (, ) F for
every < . Show that every club of is in F . Hint: use exercise E12.7.
E12.9. Let F be a proper filter on a regular uncountable cardinal . Show that the
following conditions are equivalent.
(i) F is normal
(ii) For any S0 , if \S0
/ F and f is a regressive function defined on S0 , then
there is an S S0 with \S
/ F and f is constant on S.
E12.10. A probability measure on a set S is a real-valued function with domain P(S)
having the following properties:
(i) () = 0 and (S) = 1.
(ii) If X Y , then (X) (Y ).
(iii) ({a}) = 0 for all a S.
S
P (iv) If hXn : n i is a system of pairwise disjoint sets, then ( n Xn ) =
n (Xn ). (Some of the sets Xn might be empty.)
Prove that there does not exist a probability measure on 1 . Hint: consider an Ulam
matrix.
E12.11. Show that if is a measurable cardinal, then there is a normal -complete nonprincipal ultrafilter on . Hint: Let D be a -complete nonprincipal ultrafilter on . Define
f g iff f, g and { < : f () = g()} D. Show that is an equivalence relation
on . Show that there is a relation on the collection of all -classes such that for all
f, g , [f ] [g] iff { < : f () < g()} D. Here for any function h we use [h]
for the equivalence class of h under . Show that makes the collection of all equivalence
classes into a well-order. Show that there is a smallest equivalence class x such that
f x < [{ < : < f ()} D. Let E = {X : f 1 [X] D}. Show that E
satisfies the requirements of the exercise.
Reference
Kanamori, A. The higher infinite. Springer 2005, 536pp.
152
dm for every m .
0
m+1
m
S
Let b = m dm . ThenS a = d0 b. Suppose that x y b. Choose m such
that y dm . Then x dm dm+1 b. Thus b is transitive. Now suppose that c is
a transitive set such that a c. We show by induction that dm c for every m .
First, d0 = a S
c, so thisSis true for m = 0. Now assume that it is true for m. Then
dm+1 = dm dm c c = c, completing the inductive proof.
153
Hence b =
dm c.
The set shown to exist in Theorem 13.2 is called the transitive closure of a, and is denoted
by trcl(a).
Theorem 13.3. Every set is a member of some V .
Proof. Suppose that this is not true, and let a be a set which is not a member of
any V . Let A = {x trcl(a {a}) : x is not in any of the sets V }. Then a A, so A
is nonempty. By the foundation axiom, choose x A such that x A = 0. Suppose that
y x. Then
S y trcl(a {a}), so y is a member of some V . Let y be the least such .
Let = yx y . Then by 13.1(ii), x V . So x V+1 , contradiction.
S
Thus by Theorem 13.3 we have V = On V . An important technical consequence of
Theorem 13.3 is the following definition, known as Scotts trick:
Let R be a class equivalence relation on a class A. For each a A, let be the smallest
ordinal such that there is a b V with (a, b) R, and define
typeR (a) = {b V : (a, b) R}.
This is the reduced equivalence class of a. It could be that the collection of b such that
(a, b) R is a proper class, but typeR (a) is always a set. An important use of this trick is
in defining order types, where we modify this procedure slightly. Define the class R to be
the collection of all ordered pairs (L, M ) such that L and M are order-isomorphic linear
orders. For any linear order L, we define its order type to be
o.t.(L) =
On the basis of this definition one can introduce the usual terminology for order types,
e.g., for the order type of the rationals, for the order type of (, >), etc.
On the basis of our hierarchy we can define the important notion of rank of sets:
For any set x, the rank of x, denoted by rank(x), is the smallest ordinal such that
x V+1 .
We take + 1 here instead of just for technical reasons. Some of the most important
properties of ranks are given in the following theorem.
Theorem 13.4. Let x be a set and an ordinal. Then
(i) V = {y : rank(y) < }.
(ii) For all y x we have rank(y) < rank(x).
(iii) rank(y) rank(x) for every y x.
(iv) rank(x) = supyx (rank(y) + 1).
(v) rank() = .
(vi) V On = .
154
<
for limit.
<
Thus under GCH, = i for every ordinal ; in fact, this is just a reformulation of
GCH.
Theorem 13.5. (i) n |Vn | for any n .
(ii) For any ordinal , |V+ | = i .
Proof. (i) is clear by ordinary induction on n. We prove (ii) by the three-step
transfinite induction (where is a limit ordinal below):
[
Vn = = i0 by (i);
|V | =
n
|V++1 = |P(V+ )|
= 2|V+ |
= 2i (inductive hypothesis)
= i+1 ;
[
V+
|V+ | =
<
155
|V+ |
<
(inductive hypothesis)
<
<
= || i
= i
to finish this last inductive step, note that for each < we have i = |V+ | |V+ |,
and hence i |V+ |.
Well-founded class relations
We now introduce a kind of generalization of set membership.
If A is a class, a class relation R is well-founded on A iff for every nonempty subset X
of A there is an x X such that for all y X it is not the case that (y, x) R. Such a
set x is called R-minimal.
This notion is important even if A and R are mere sets. The protypical example of a well
founded relation is itself; it is well founded on V as one sees by the foundation axiom.
Recall that our intuitive notion of class is made rigorous by using formulas instead.
Thus we would talk about a formula (x, y) being well-founded on another formula (x).
In the case of , we are really looking at the formula x y being well-founded on the
formula x = x. So, rigorously we are associating with two formulas (x, y) and (x)
another formula (x, y) is well-founded on (x), namely the following formula:
X[x X(x) X 6= x Xy X(y, x)].
We are going to formulate and prove a recursion principle for well-founded relations. But
first we need some technical preparation.
A class relation R is set-like on a class A iff for every a A the class {b A : (b, a) R}
is a set.
Again we give a rigorous version. Given two formulas (x, y) and (x), the formula (x, y)
is set-like on (x) is the following formula:
x[(x) zy(y z (y) (y, x))].
Let the class relation R be set-like on the class A, and suppose that x A. We define
predn (A, x, R) for every natural number n by recursion.
pred0 (A, x, R) = {y A : (y, x) R};
[
predn+1 (A, x, R) = {pred0 (A, y, R) : y predn (A, x, R)}.
156
Then we define
cl(A, x, R) =
To prove (1) suppose that x d e and f (x) 6= g(x). Thus the set X = {y (d e) :
f (y) 6= g(y)} is a nonempty subset of A, so by well-foundedness we can take an Rminimal element z of X. Then f (w) = g(w) for all w d e such that wRz. Now clearly
pred(A, z, R) (d e), so f pred(A, z, R) = g pred(A, z, R). Hence
f (z) = F(z, f pred(A, z, R)) = F(z, g pred(A, z, R)) = g(z),
contradiction.
(2) For every x A, let d(x) = {x} cl(A, x, R). Then for every x A there is a f such
that f and d(x) satisfy (*).
Suppose that (2) is not true, and let x A be R-minimal such that there is no such
f . For each y such that (y, x) R there is a g so that g and d(y) satisfy (*). By (1)
this g is unique, and so by the replacement axiom we can associate with each such y the
corresponding function gy . Let
h=
gy
(y,x)R
It is straightforward to check that f and d(x) satisfy (*), contradiction. So (2) holds.
158
Now for any x A, let G(x) = f (x), where f and d(x) satisfy (*). This definition is
unambiguous by (1) and (2). It is easy to see that G is what is needed in the theorem.
The uniqueness of G follows by an easy argument by contradiction.
As a first application of this theorem we can define rank for well-founded relations.
If R is well-founded and set-like on A, then for any x A,
rank(x, A, R) = sup{rank(y, A, R) + 1 : y A and (y, x) R}.
By induction, this is always an ordinal.
Proposition 13.9. If A is a transitive class, then for any x A, rank(x, A, ) =
rank(x). In particular, rank(x, V, ) = rank(x) for any set x.
Proof. Otherwise, let x be R-minimal such that x A and rank(x, A, ) 6= rank(x).
Then
rank(x, A, ) = sup{rank(y, A, ) + 1 : y A and y x}
= sup{rank(y) + 1 : y A and y x} (R-minimality of x)
= sup{rank(y) + 1 : y x} (transitivity of A)
= rank(x) by Theorem 13.4(iii)).
This is a contradiction.
The Mostowski collapse
We give here an important technical result about well-founded relations.
Suppose that R is well-founed and set-like on A. The Mostowski collapsing function is
a class function G : A V defined by recursion as follows: for any x A,
G(x) = {G(y) : y A and (y, x) R}.
The Mostowski collapse of A, R is defined as the range of this function G.
Proposition 13.10. Suppose that R is well-founded and set-like on A, G is the
Mostowski collapsing function for A, R, and M is the Mostowski collapse. Then
(i) For all x, y A, if (x, y) R then G(x) G(y).
(ii) M is transitive.
(iii) For any x A we have rank(x, A, R) = rank(G(x)).
Proof. (i) is obvious from the definition. If a b M, choose y A such that
b = G(y). Since a b, by the definition we have a rng(G) = M. So (ii) holds. For
(iii), suppose it is not true, and let x be R-minimal such that x A and rank(x, A, R) 6=
rank(G(x)). Then
rank(x, A, R) = sup{rank(y, A, R) + 1 : y A and (y, x) R}
= sup{rank(G(y)) + 1 : y A and (y, x) R} (minimality of x)
= sup{rank(u) + 1 : u G(x)} (definition of G)
= rank(G(x)) (by 13.4(iii)).
159
is nonempty, and we take an R-minimal element of this set. Also, let y A with x 6= y
and G(x) = G(y). Since both x and y are in A, and x 6= y, the extensionality condition
gives two cases.
Case 1. There is a z A such that (z, x) R and (z, y)
/ R. Since (z, x) R, it
follows that z is not in the set (). Now G(z) G(x) by Proposition 13.10(i), so the fact
that G(x) = G(y) implies that G(z) G(y). Hence by definition of G we can choose
w A such that (w, y) R and G(z) = G(w). Then from z not in () we infer that
z = w, hence (z, y) R, contradiction.
Case 2. There is a z A such that (z, y) R and (z, x)
/ R. Since (z, y) R,
by Proposition 13.10(i) we get G(z) G(y) = G(x), and so there is a v A such that
(v, x) R and G(z) = G(v). Now v is not in () by the minimality of x, so z = v and
(z, x) R, contradiction.
Therefore, G is one-one. Now the implication in the second part of (ii) holds by
Proposition 13.10(i). Suppose now that G(x) G(y). Choose w A such that (w, y) R
and G(x) = G(w). Then x = w since G is one-one, so (x, y) R, as desired.
(ii)(i): Assume (ii), and suppose that x, y A, and z A[(z, x) R iff (z, y)
R]. Take any u G(x). By the definition of G, choose z A such that (z, x) R and
u = G(z). Then also (z, y) R, so u = G(z) G(y). This shows that G(x) G(y).
Similarly G(y) G(x), so G(x) = G(y). Since G is one-one, it follows that x = y.
Theorem 13.12. Suppose that R is a well-founded class relation that is set-like and
extensional on a class A. Then there are unique G, M such that M is a transitive class
and G is an isomorphism from (A, R) onto (M, ).
Note that we have formulated this in the usual fashion for isomorphism of structures,
but of course we cannot form the ordered pairs (A, R) and (M, ) if A, R, M are proper
classes. So we understand the above as an abbreviation for a longer statement, that G is
a bijection from A onto M, etc.
160
P(S())
<
162
we want to show that (A, E) |= (a). To do this, we define another structure (B, F ) for
our language. Let
B = {b A : (A, E) |= (b, a)}
163
and F = E (B B).
iff
iff
c0 = c1
(B, F ) |= c0 = c1 ;
(A, E) |= (c0 c1 )
iff
iff
iff
(c0 , c1 ) E
(c0 , c1 ) F
(B, F ) |= c0 c1 ;
iff
iff
We do the quantifier step in each direction separately. First suppose that (A, E) |=
(x) (x, c, a). Thus
(3)
Choose b A such that (A, E) |= [(b, a) (b, c, a)]. Now (A, E) |= (b, a) implies
that b B. Hence (A, E) |= (b, c, a)] implies by the inductive hypothesis that (B, F ) |=
(b, c). Hence (B, F ) |= x(x, c).
Conversely, suppose that (B, F ) |= x(x, c); we want to prove (3). Choose b B
such that (B, F ) |= (b, c). Hence (A, E) |= (b, a), and by the inductive hypothesis,
(A, E) |= (b, c, a). So (A, E) |= (b, a) (b, c, a). Now (3) follows.
This finishes the proof of (2).
Now by (1) and (2), (B, F ) |= . By an assumption of the theorem, |= , so
(B, F ) |= . Now an application of (2) yields (A, E) |= .
This shows that |= x(x, y) .
This theorem gives the basic idea of consistency proofs in set theory; we express this as
follows. By consistent we mean has a model.
Corollary 14.2. Suppose that and are collections of sentences in our language
of set theory. Suppose that M is a class, and |= [M 6= and M is a model of ].
Then consistent implies that is consistent.
Although this is the form we use in practice, for the proof we give the more rigorous
version:
164
Suppose that and are collections of sentences in our language of set theory. Suppose
that (x, y) is a formula, and for every sentence , |= x(x, y) . Also assume
that has a model. Then has a model.
Proof. Suppose to the contrary that does not have a model. Then trivially
|= (x = x). By Theorem 14.1, |= x(x, y) (x = x). Hence by hypothesis we
get |= (x = x), contradiction.
Our main examples of the use of this corollary are with = ZFC and = ZFC+there
are no inaccessibles; and with = ZF and = ZFC + AC + GCH. The class M is more
complicated to describe, and we defer that until actually ready to give the applications of
the corollary.
Now we give some simple facts which will be useful in checking the axioms of ZFC in
the transitive classes which we will define. So we need to refer back to the beginning of
the notes, where the ZFC axioms were given.
Theorem 14.3. The extensionality axiom holds in any nonempty transitive class.
Proof. The relativized version of the extensionality axiom is
x My M[z M(z x z y) x = y].
To prove this, assume that x, y M, and suppose that for all z M, z x iff z y. Take
any z x. Because M is transitive, we get z M. Hence z y. Thus z x implies that
z y. The converse is similar. So x = y.
The following theorem reduces checking the comprehension axioms to checking a closure
property.
Theorem 14.4. Suppose that M is a nonempty class, and for each formula with
with free variables among x, z, w1 , . . . , wn ,
z, w1 , . . . , wn M[{x z : M (x, z, w1 , . . . , wn )} M].
Then the comprehension axioms hold in M.
Proof. The straightforward relativization of an instance of the comprehension axioms
is
z Mw1 M . . . wn My Mx M(x y x z M ).
So, we take z, w1 , . . . , wn M. Let
y = {x z : M (x, z, w1 , . . . , wn )};
by hypothesis, we have y M. Then for any x M,
xy
iff
x z and M (x, z, w1 , . . . , wn ).
The following theorems are obvious from the forms of the pairing axiom and union axioms:
165
{y M : x AM (x, y, A, w1, . . . , wn )} Y.
^
x1 , . . . , xn
(xi ) [ (x1 , . . . , xn ) (x1 , . . . , xn )] .
1in
In full generality, very few formulas are absolute; for example, see exercise E14.5. Usually
we need to assume that the sets are transitive. Then there is an important set of formulas
all of which are absolute; this class is defined as follows.
The set of 0 -formulas is the smallest set of formulas satisfying the following conditions:
(a) Each atomic formula is in .
(b) If and are in , then so are and .
(c) If is in , then so are x y and x y.
Recall here that x y and x y are abbreviations for x(x y ) and x(x
y ) respectively.
Theorem 14.10. If M is transitive and is 0 , then is absolute for M.
Proof. We show that the collection of formulas absolute for M satisfies the conditions defining the set 0 . Absoluteness is clear for atomic formulas. It is also clear
that if and are absolute for M, then so are and . Now suppose that
is absolute for M; we show that x y is absolute for M. Implicitly, can involve additional parameters w1 , . . . , wn . Assume that y, w1 , . . . , wn M. First suppose that x y(x, y, w1, . . . , wn ). Choose x y so that (x, y, w1 , . . . , wn ). Since
M is transitive, x M. Hence by the inductive assumption, M (x, y, w1, . . . , wn )
holds. This shows that (x y(x, y, w1 , . . . , wn ))M . Conversely suppose that (x
y(x, y, w1, . . . , wn ))M . Thus x M[x y M (x, y, w1 , . . . , wn ). By the inductive
assumption, (x, y, w1 , . . . , wn ). So this shows that x y(x, y, w1 , . . . , wn ). The case
x y is treated similarly.
Ordinals and special kinds of ordinals are absolute since they could have been defined using
0 formulas:
Theorem 14.11. The following are absolute for any transitive class:
(i) x is an ordinal
(ii) x is a limit ordinal
Proof.
x is an ordinal y xz y[z x] y xz yw z[w y];
x is a limit ordinal y x[y = y] x is an ordinal y xz x(y z);
x is a successor ordinal x is an ordinal x 6= x is not a limit ordinal;
x is a finite ordinal y[y
/ x] (x is a successor ordinal
y x(z[z
/ y] y is a successor ordinal));
x = x is a limit ordinal y x(y is a finite ordinal);
168
FM (x0 , . . . , xn1 ) = y
N
(definition of F)
The following theorem gives many explicit absoluteness results, and will be used frequently
along with some similar results below. Note that we do not need to be explicit about how
the relations and functions were really defined in Chapter 1; in fact, we were not very
explicit about that in the first place.
Theorem 14.14. The following relations and functions were defined by formulas
equivalent to 0 -formulas on the basis of ZF Inf, and hence are absolute for all transitive
class models of ZF Inf:
(i) x y
(ii) x = y
(iii) x y
(iv) {x, y}
(v) {x}
(xi) x {x}
(xii) xSis transitive
(xiii) T x
T
(xiv) x (with = )
(vi) (x, y)
(vii)
(viii) x y
(ix) x y
(x) x\y
Note here, for example, that in (iv) we really mean the 2-place function assigning to sets
x, y the unordered pair {x, y}.
Proof. (i) and (ii) are already 0 formulas. (iii):
x y z x(z y).
(iv):
z = {x, y} w z(w = x w = y) x z y z.
(v): Similarly. (vi):
z = (x, y) w z[w = {x, y} w = {x}] w z[w = {x, y}] w z[w = {x}].
(vii):
x = y x(y 6= y).
(viii):
z = x y w z(w x w y) w x(w z) w y(w z).
(ix):
z = x y w z(w x w y) w x(w y w z).
(x):
z = x\y w z(w x w
/ y) w x(x
/ y w z).
(xi):
y = x {x} w y(w x w = x) w x(w y) x y.
(xii):
x is transitive y x(y x).
(xiii):
y=
(xiv):
y=
x [x 6= w yz x(w z)
w xt w[z x(t z) t y] [x = y = ].
A stronger form of Theorem 14.14. For each of the indicated relations and functions,
we do not need M to be a model of all of ZF Inf. In fact, we need only finitely many
of the axioms of ZF Inf: enough to prove the uniqueness condition for any functions
involved, and enough to prove the equivalence of the formula with a 0 -formula, since
0 formulas are absolute for any transitive class model. To be absolutely rigorous here,
one would need an explicit definition for each relation and function symbol involved, and
then an explicit proof of equivalence to a 0 formula; given these, a finite set of axioms
becomes clear. And since any of the relations and functions of Theorem 14.14 require only
finitely many basic relations and functcions, this can always be done. For Theorem 14.14
it is easy enough to work this all out in detail. We will be interested, however, in using
this fact for more complicated absoluteness results to come.
As an illustration, however, we do some details for the function {x, y}. The definition
involved is naturally taken to be the following:
x, y, z[z = {x, y} w[w z w = x x = y]].
The axioms involved are the pairing axiom and one instance of the comprehension axiom:
x, yw[x w y w];
x, y, wzu(u z u w (u = x u = y)).
{x, y} is then absolute for any transitive class model of these three sentences, by the proof
of (iv) in Theorem 14.14, for which they are sufficient.
For further absoluteness results we will not reduce to 0 formulas. We need the
following extensions of the absoluteness notion.
Suppose that M N are classes, and (w1 , . . . , wn ) is a formula. Then we say
that is absolute upwards for M, N iff for all w1 , . . . , wn M, if M (w1 , . . . , wn ),
then N (w1 , . . . , wn ). It is absolute downwards for M, N iff for all w1 , . . . , wn M, if
N (w1 , . . . , wn ), then M (w1 , . . . , wn ).
Theorem 14.15. Suppose that (x1 , . . . , xn , w1 , . . . , wm ) is absolute for M, N. Then
(i) x1 , . . . xn (x1 , . . . , xn , w1 , . . . , wm ) is absolute upwards for M, N.
(ii) x1 , . . . xn (x1 , . . . , xn , w1 , . . . , wm ) is absolute downwards for M, N.
Theorem 14.16. Absoluteness is preserved under composition. In detail: suppose
that M N are classes, and the following are absolute:
(x1 , . . . , xn );
F, an n-ary function ;
For each i = 1, . . . , n, an m-ary function Gi .
171
n
^
(zi = Gi (x1 , . . . , xm )) ;
i=1
^
n
i=1
(zi = Gi (x1 , . . . , xm ))
(z1 , . . . , zn ) ;
y = F(G1 (x1 , . . . , xm ), . . . , Gn (x1 , . . . , xm )) z1 , . . . zn (y = F(z1 , . . . , zn ))
n
^
(zi = Gi (x1 , . . . , xm )) ;
i=1
^
n
(zi = Gi (x1 , . . . , xm ))
i=1
(y = F(z1 , . . . , zn )) .
Theorem 14.17. Suppose that M N are classes, (y, x1 , . . . , xm , w1 , . . . , wn ) is
absolute for M, N, and F and G are n-ary functions absolute for M, N. Then the following
are also absolute for M, N:
(i) z F(x1 , . . . , xm ).
(ii) F(x1 , . . . , xm ) z.
(iii) y F(x1 , . . . , xm )(y, x1 , . . . , xm , w1 , . . . , wn ).
(iv) y F(x1 , . . . , xm )(y, x1 , . . . , xm , w1 , . . . , wn ).
(v) F(x1 , . . . , xm ) = G(x1 , . . . , xm ).
(vi) F(x1 , . . . , xm ) G(x1 , . . . , xm ).
Proof.
z F(x1 , . . . , xm ) w[z w w = F(x1 , . . . , xm )];
w[w = F(x1 , . . . , xm ) z w];
F(x1 , . . . , xm ) z w z[w = F(x1 , . . . , xm )];
172
y F(x1 , . . . , xm )(y, x1 , . . . , xm , w1 , . . . , wn )
wy w[w = F(x1 , . . . , xm ) (y, x1 , . . . , xm , w1 , . . . , wn )];
w[w = F(x1 , . . . , xm ) y w(y, x1 , . . . , xm , w1 , . . . , wn )];
(iv)(vi) are proved similarly.
We now give some more specific absoluteness results.
Theorem 14.18. The following relations and functions are absolute for all transitive
class models of ZF Inf:
(i) x is an ordered pair
(ii) A B
(iii) R is a relation
(iv) dmn(R)
(v) rng(R)
(vi) R is a function
(vii) R(x)
(viii) R is a one-one function
(ix) x is an ordinal
Note concerning (vii): This is supposed to have its natural meaning if R is a function and
x is in its domain; otherwise, R(x) = .
Proof.
[
[
x is an ordered pair y
x z
x [x = (y, z)];
y = A B (a A)(b B)[(a, b) y]
(z y)(a A)(b B)[z = (a, b)];
R is a relation x R[x is an ordered pair];
[[
x = dmn(R) (y x) z
R [(x, z) R]
[[
[[
y
R z
R [(y, z) R y x];
[[
x = rng(R) (y x) z
R [(z, x) R]
[[
[[
y
R z
R [(y, z) R z x];
[[
[[
R is a function R is a relation x
R y
R
[[
z
R [(x, y) R (x, z) R y = z];
y = R(x) [R is a function (x, y) R]
Proof. We have
x M[ x y x(y {y} x)],
so by the absoluteness of the notions here we get
[x[ x y x(y {y} x)]]M ,
which means that the infinity axiom holds in M.
Theorem 14.20. If M is a transitive class model of ZF, then , M and M is
closed under the following set-theoretic operations:
S
(i)
(iv) (a, b) 7 {a, b}
(vii) T
(ii)
(v) (a, b) 7 (a, b)
(viii)
(iii) (a, b) 7 a\b
(vi) x 7 x {x}
Moreover, + 1 M and [M]< M.
Proof. M has elements x, y such that (x = )M and (y = )M . So x = and y =
by absoluteness. (See Theorem 14.13(vii) and Theorem 14.19(iv).) (i)(viii) are all very
similar, so we only treat (i). Let a, b M. Then because M |= ZF, there is a c M such
that (c = a b)M . By absoluteness, c = a b.
Each i is in M by transitivity. Hence + 1 M. Finally, we show by induction
on n that if x M and |x| = n then x M. This is clear for n = 0. Now suppose
inductively that x M and |x| = n + 1. Let a x and set y = x\{a}. So |y| = n. Hence
y M by the inductive hypothesis. Hence x = y {a} M by previous parts of this
theorem.
Our final abstract absoluteness result concerns recursive definitions.
Theorem 14.21. Suppose that R is a class relation which is well-founded and set-like
on A, and F : A V V. Let G be given by Theorem 11.8: for all x A,
G(x) = F(x, G pred(A, x, R)).
Let M be a transitive class model of ZF, and assume the following additional conditions
hold:
(i) F, R, and A are absolute for M.
(ii) (R is set-like on A)M .
(iii) x M A[pred(A, x, R) M].
Conclusion: G is absolute for M.
Proof. First we claim
(1) (R is well-founded on A)M .
In fact, by absolutenss RM = R (M M) and AM = A M, so it follows that in
M every nonempty subset of AM has an RM -minimal element. Hence we can apply the
recursion theorem within M to define a function H : AM M such that for all x AM ,
H(x) = FM (x, H predM (AM , x, RM )).
174
We claim that H = G AM , which will prove the theorem. In fact, suppose that x is
R-minimal such that x AM and G(x) 6= H(x). Then using absoluteness again,
H(x) = FM (x, H predM (AM , x, RM )) = F(x, H pred(A, x, R)) = G(x),
contradiction.
Theorem 14.21 is needed for many deeper applications of absoluteness. We illustrate its
use by the following result.
Theorem 14.22. The following are absolute for transitive class models of ZF.
(i) + (ordinal addition)
(iv) rank(x)
(ii) (ordinal multiplication)
(v) trcl(x)
(iii) (ordinal exponentiation)
Proof. In each case it is mainly a matter of identifying A, R, F to which to apply
Theorem 14.21; checking the conditions of that theorem are straightforward.
(i): A = On, R = {(, ) : , On, and }, and F : On V V is defined
as follows:
if f = ,
otherwise.
(ii) and (iii) are treated similarly. For (iv), take R = {(x, y) : x y}, A = V, and define
F : V V V by setting, for all x, f V,
S
yx (f (y) {f (y)}) if f is a function with domain x,
F(x, f ) =
otherwise.
For (v), let R = {(i, j) : i, j and i < j}, A = , and define F : V V by setting,
for all m and f V,
(
x S
if m = 0,
S S
F(m, f ) = f ( m) f ( m) if m > 0 and f is a function with domain m,
otherwise
Then the function GSobtained from Theorem 14.20 is absolute for transitive class models
of ZF, and trcl(x) = m G(m).
Theorem 14.23. If M is a transitive model of ZF, then:
(i) P M (x) = P(x) M for any x M;
(ii) VM = V M for any M.
Proof. (i): Assume that x M. Then for any set y,
y P M (x)
iff
y M and (y x)M
iff
iff
y M and y x
y P(x) M.
175
iff
iff
Now we show that V is as desired. First, we need to check all the ZFC axioms. Here
we use Theorems 14.314.9. Now V is transitive, so by Theorem 14.3, the extensionality
axiom holds in V .
For the comprehension axioms, we are going to apply Theorem 14.4. Suppose that is
a formula with free variables among x, z, w1 , . . . , wn , and we are given z, w1 , . . . , wn V .
Let A = {x z : M (x, z, w1 , . . . , wn )}. Then A z. Say z V with < . Then
A z V , so A P(V ) = V+1 V . It follows from Theorem 14.4 that the
comprehension axioms hold in V .
Suppose that x, y V . Say x, y V with < . Then {x, y} V , so {x, y}
V+1 V . By Theorem 14.5, the pairing axiom holds in V
S .
S
Suppose that x V . Say x V with < . Then x V , so x V+1 V .
By Theorem 14.6, the union axiom holds in V .
Suppose that x V . Say x V with < . Then x V . Hence y V for
each y x, so y P(V ) = V+1 for each y P(x). This means that P(x) V+1 , so
P(x) V+2 . By Theorem 14.7, the power set axiom holds in V .
For the replacement axioms, we will apply Theorem 14.8. So, suppose that is a
formula with free variables among x, y, A, w1, . . . , wn , any A, w1 , . . . , wn V , and
x A!y[y V V (x, y, A, w1, . . . , wn )].
For each x A, let yx be the unique set such that yx V and V (x, yx , A, w1 , . . . , wn ),
and let x < be such that yx Vx . Choose < such that A V . Then A V ,
def S
and hence |A| |V | < by Theorem 11.5(ii). It follows that = {x : x A} < .
Let
Y = {z V : x AV (x, z, A, w1 , . . . , wn )}.
Thus Y V , so Y V+1 V . Suppose that x A and z V is such that
V (x, z, A, w1 , . . . , wn ). Then z = yx by the above, and so z Y , as desired.
By Theorem 14.9, the foundation axiom holds in V .
We have now shown that V is a model of ZF Inf.
For the infinity axiom, note that by Theorem 11.4(v) we have V+1 V . Hence
the infinity axiom holds in V by Theorem 14.18.
For the axiomSof choice, suppose that A V is a family of pairwise disjoint nonempty
sets, and let B A have exactly
S one element in common with each member of A . Say
A V with < . Then B A V , so B V+1 V , and the axiom of choice
thus holds in V .
So V is a model of ZFC.
Finally, suppose that x V and (x is an inaccessible cardinal)V ; we want to get
a contradiction. In particular, (x is an ordinal)V , so by absoluteness, x is an ordinal.
Absoluteness clearly implies that x is infinite. We claim that x is a cardinal. For, if
f : y x is a bijection with y < x, then clearly f V , and hence by absoluteness
(f : y x is a bijection and y < x)V , contradiction. Similarly, x is regular; otherwise
there is an injection f : y x with rng(f ) unbounded in x, so clearly f V , and
absolutenss again yields a contradiction. Thus x is a regular cardinal. Hence, since is
the smallest inaccessible, there is a y x such that there is a one-one function g from x
177
Note the seemingly minor respects in which (ii) differs from the definition of absoluteness:
the implication goes only one direction, and on the right side of the implication we relativize
j to N rather than M.
Proof. (i)(ii): Assume (i) and the hypothesis of (ii). Suppose that y1 , . . . , yt M
M
and x NN
j (x, y1 , . . . , yt ). Thus by absoluteness x Mj (x, y1 , . . . , yt ); choose
N
x M such that M
j (x, y1 , . . . , yt ). Hence by absoluteness again, j (x, y1 , . . . , yt )). Hence
x MN
j (x, y1 , . . . , yt ), as desired.
(ii)(i): Assume (ii). We prove that i is absolute for M, N by induction on the
length of i . This is clear if i is atomic, and it easily follows inductively if i has the
form j or j k . Now suppose that i is xj (x, y1 , . . . , yt ), and y1 , . . . , yt M. then
M
M
i (y1 , . . . , yt ) x Mj (x, y1 , . . . , yt ) (definition of relativization)
x MN
j (x, y1 , . . . , yt ) (induction hypothesis)
x NN
j (x, y1 , . . . , yt ) (by (ii)
N
i (y1 , . . . , yt ) (definition of relativization)
Theorem 14.27. Suppose that Z() is a set for every ordinal , and the following
conditions hold:
(i) If < , then Z() Z().
S
(ii) If is a limit ordinal, then Z() = < Z().
S
Let Z = On Z(). Then for any formulas 0 , . . . , n1 ,
> [0 , . . . , n1 are absolute for Z(), Z].
Proof. Assume the hypothesis, and let an ordinal be given. We are going to apply
Lemma 14.26 with N = Z, and we need to find an appropriate > so that we can take
M = Z() in 14.26.
178
if there is such,
otherwise.
Let be a well-order of Z(). Let B be the set of all i < n such that i begins with an
existential quantifier. Suppose that i B and i is the formula xj (x, y1 , . . . , yt ), where
179
x, y1 , . . . , yt are exactly all the free variables of j . We now define a function Hi for each
i B as follows. For any sets y1 , . . . , yt Z(),
Hi (y1 , . . . , yt ) =
Z()
Let A Z() be closed under each function Hi , with X A. We claim that A is as desired.
To prove the absoluteness, it suffices by Lemma 14.26 to take any formula i with i A,
with notation as above, assume that y1 , . . . , yt A and x ZZ
j (x, y1 , . . . , yt ), and find
Z
x A such that j (x, y1 , . . . , yt ). By (1), there is an x Z() such that Z
j (x, y1 , . . . , yt ).
Z
Hence Hi (y1 , . . . , yt ) is an element of A such that j (x, y1 , . . . , yt ), as desired.
It remains only to check the cardinality estimate. This is elementary.
Lemma 14.30. Suppose that G is a bijection from A onto M, and for any a, b A we
have a b iff G(a) G(b). Then for any formula (x1 , . . . , xn ) and any x1 , . . . , xn A,
A (x1 , . . . , xn ) M (G(x1 ), . . . , G(xn )).
Proof. An easy induction on .
Theorem 14.31. Suppose that Z is a transitive class and 0 , . . . , m1 are sentences.
Suppose that X is a transitive subset of Z. Then there is a transitive set M such that
Z
X M , |M | max(, |X|), and for every i < m, M
i i .
Proof. We may assume that the extensionality axiom is one of the i s. Now we
apply Theorem 14.29 to get a set A as indicated there. By Proposition 13.11, there is
a transitive set M and a bijection G from A onto M such that for any a, b A, a b
iff G(a) G(b). Hence all of the desired conditions are clear, except possibly X M .
We show that G[X] = X by proving that G(x) = x for all x X. In fact, suppose that
G(x) 6= x for some x X, and by the foundation axiom choose y such that G(y) 6= y while
G(z) = z for all z y. Then if z y we have z, y X A, and hence z = G(z) G(y).
So y G(y). If w G(y), then w M = rng(G), so we can choose z A such that
w = G(z). Then G(z) G(y), so z y. Hence w = G(z) = z and so w y. This gives
G(y) y, and finishes the proof.
Corollary 14.32. Suppose that S is a set of sentences containing ZFC. Suppose also
that 0 , . . . , n1 S. Then
S |= M
M is transitive, |M | = , and
M
i
i<n
it follows that
S |= M M is transitive, |M| = , and
M
i
i<n
"
(vn+1 vn )
isomorphism from (M, E, W, kx, y)xV onto N = (N, E , G[W ], x, G(y))xV . Of course
N is still well-founded. It is also extensional, since the extensionality axiom holds in (V , )
and hence in (M, E) and (N, E ). Let H, P be the Mostowski collapse of (N, E ). Thus P
is a transitive set, and
(1) H is an isomorphism from (N, E ) onto (P, ).
(2) a, b N [aE b V a b].
In fact, suppose that a, b N and aE b V . Let the individual constants used in the
expansion of (V , , U ) to (V , , U, x)aV be hcx : x V i. Then
"
#
_
(V , , U, x)aV |= z z kb
(z = kw ) ,
wb
and hence this sentence holds in (N, E , G[W ], x, G(y))xV as well, and so there is a w b
such that a = w, i.e., a b. So (2) holds.
(3) a, b V [a b aE b]
In fact, suppose that a, b V and a b. Then the sentence ka kb holds in (V ,
, U, x)xV , so it also holds in (N, E , G[W ], x, G(y))xV , so that aE b.
We have now verified the hypotheses of Lemma 14.35. It follows that H V is
the identity. In particular, V P . Now take any sentence in the language of (V ,
, U, x)xV . Then
(V , , U, x)xV |=
iff
(M, E, W, kx)xV |=
iff
iff
Now an infinite cardinal is 11 -indescribable iff for every U V and every secondorder sentence of the form S, with no quantifiers on S within , if (V , , U ) |= ,
then there is an < such that (V , , U V ) |= .
Theorem 14.37. An infinite cardinal is weakly compact iff it is 11 -indescribable.
Proof. First suppose that is 11 -indescribable. By Theorem 14.34 it is inaccessible.
So it suffices to show that it has the tree property. By the proof of Theorem 12.7(iii)(iv)
it suffices to check the tree property for a tree T < . Note that < V . Let be
the following sentence in the second-order language of (V , , T ):
S[T is a tree under , and
S T and S is a branch of T of unbounded length].
Thus for each < the sentence holds in (V , , T V ). Hence it holds in (V , , T ),
as desired.
Now suppose that is weakly compact. Let U V , and let be a 11 -sentence
holding in (V , , U ). By Lemma 14.36, let (M, , U ) be a transitive elementary extension
of (V , , U ) such that M . Say that is S, with having no quantifiers on S. Now
X V [(V , , U ) |= (X)].
(1)
Now since M and (M, ) is a model of ZFC, VM exists, and by absoluteness it is equal
to V . Hence by (1) we get
(M, , U ) |= X V V (U V ).
Hence
(M, , U ) |= X V V (U V ),
so by the elementary extension property we get
(V , , U ) |= X V V (U V ).
We choose such an . Since V On = , it follows that < . Hence (V , , U V ) |= ,
as desired.
A diagram of large cardinals
We define some more large cardinals, and then indicate relationships between them by a
diagram.
All cardinals are assumed to be uncountable.
1. regular limit cardinals.
2. inaccessible.
3. Mahlo.
184
4. weakly compact.
5. indescribable. The -order language is an extension of first order logic in which one
has variables of each type n . For n positive, a variable of type n ranges over P n (A)
for a given structure A. In addition to first-order atomic formulas, one has formulas P Q
with P n-th order and Q (n + 1)-order. Quantification is allowed over the higher order
variables.
is indescribable iff for all U V and every higher order sentence , if (V , , U ) |=
then there is an < such that (V , , U V ) |= .
6. ()<
2 . Here in general
()<
m
S
means that for every function f : n []n m there is a subset H of order type
such that for each n , f [H]n is constant.
7. 0 exists. This means that there is a non-identity elementary embedding of L into L.
Thus no actual cardinal is referred to. But 0 implies the existence of some large cardinals,
and the existence of some large cardinals implies that 0 exists.
8. J
onsson is a Jonsson cardinal iff every model of size has a proper elementary
substructure of size .
9. Rowbottom is a Rowbottom cardinal iff for every uncountable < , every model
of type (, ) has an elementary submodel of type (, ).
10. Ramsey ()<
2 .
11. measurable
12. strong is a strong cardinal iff for every set X there exists a nontrivial elementary
embedding from V to M with the first ordinal moved and with M.
13. Woodin is a Woodin cardinal iff
A V < (, ) < j[j is a nontrivial elementary embedding of V
into some set M, with the first ordinal moved, such that
j() > , V M, A V = j(A) V ]
14. superstrong is superstrong iff there is a nontrivial elementary embedding j : V
M with the first ordinal moved, such that Vj() M.
15. strongly compact is strongly compact iff for any L -language, if is a set of
sentences and every subset of of size less than has a model, then itself has a model.
16. supercompact is supercompact iff for every A with |A| there is normal measure
on P (A).
17. extendible For an ordinal , we say that k is -extendible iff there exist and a
nontrivial elementary embedding j : V+ V with first ordinal moved, with < j().
is extendible iff it is -extendible for every > 0.
185
18. Vop
enkas principle If C is a proper class of models in a given first-order language,
then there exist two distinct members A, B C such that A can be elementarily embedded
in B.
19. huge A cardinal is huge iff there is a nontrivial elementary embedding j : V M
with the first ordinal moved, such that Mj() M.
20. I0. There is an ordinal and a proper elementary embedding j of L(V+1 ) into L(V+1 )
such that the first ordinal moved is less than .
In the diagram on the next page, a line indicates that (the consistency of the) existence of
the cardinal above implies (the consisteny of the) existence of the one below.
EXERCISES
E14.1. For any infinite cardinal , let H() be the set of all x such that |trcl(x)| < .
Prove that V = H(). (H() is the collection of all hereditarily finite sets.) Hint:
V H() is easy. For the other direction, suppose that x H(), let t = trcl(x), and
let S = {rank(y) : y t}. Show that S is an ordinal.
E14.2. Which axioms of ZFC are true in On?
E14.3. Show that the power set operation is absolute for V for limit.
E14.4. Let M be a countable transitive model of ZFC. Show that the power set operation
is not absolute for M .
E14.5. Show that V is a model of ZFC Inf.
E14.6. Show that the formula x(x y) is not absolute for all nonempty sets, but it is
absolute for all nonempty transitive sets.
E14.7. Show that the formula z(x z) is not absolute for every nonempty transitive set.
E14.8. A formula is 1 iff it has the form x with a 0 formula; it is 1 iff it has the
form x with a 0 formula.
(i) Show that X is countable is equivalent on the basis of ZF to a 1 formula.
(ii) Show that is a cardinal is equivalent on the basis of ZF to a 1 formula.
E14.9. Prove that if is an infinite cardinal, then H() V .
E14.10. Prove that for regular, H() = V iff = or is inaccessible.
186
I0
huge
Vopenka
extendible
supercompact
super strong
strongly compact
Woodin
strong
measurable
Ramsey
Rowbottom
(1 )<
2
Jonsson
0 exists
()<
2
indescribable
weakly compact
Mahlo
strongly inaccessible
regular limit
187
188
The following rather trivial fact will be technically useful in what follows.
Lemma 15.1. Df(A, n) for any set A and any natural number n.
Proof. Let A be arbitrary. Note that = Diag (A, 0, 0, 1) Df (0, A, 1), and
hence = Proj(A, , 0) Df (1, A, 0), so that Df(A, 0). For n > 0 we have =
Diag (A, 0, 0, n) Df (0, A, n), so that Df(A, n).
The following proposition is really just an elementary consequence of the definition.
Proposition 15.2. Let A be a set and n a natural number. Then
(i) {Diag (A, n, i, j) : i, j < n} Df(A, n).
(ii) {Diag= (A, n, i, j) : i, j < n} Df(A, n).
(iii) For all R, if R Df(A, n), then n A\R Df(A, n).
(iv) For all R, S, if R, S Df(A, n), then R S Df(A, n).
(v) For all R, if R Df(A, n + 1), then Proj(A, R, n) Df(A, n).
189
Diag (A, n, i, j)
if r = 1, k = 0, and i, j < n,
Diag= (A, n, i, j)
if r = 1, k = 1, and i, j < n,
n
A\En(i, A, n)
if r = 1, k = 2,
En(m, A, n) =
En(i, A, n) En(j, A, n)
if r = 1, k = 3,
otherwise.
Lemma 15.3. For any m and any set A, Df(A, n) = {En(m, A, n) : m }.
Proof. By an easy induction using Lemma 15.2 we have En(m, A, n) Df(A, n) for
all m; Lemma 15.1 is needed too. This gives the inclusion . If we apply Lemma 15.2 to
the system hEn : n i given by En = {En(m, A, n) : m } for each n , we get the
other inclusion.
Lemma 15.4. If (x0 , . . . , xn1 ) is a formula with free variables among x0 , . . . , xn1 ,
then there is an m such that for every set A,
{s n A : A (s(0), . . . , s(n 1))} = En(m, A, n).
Proof. We proceed by induction on the number of quantifiers in , and within that,
by induction on formulas in the usual sense. For brevity let S() be the set {s n A :
A (s(0), . . . , s(n 1))}. Then
S(xi xj ) = Diag (A, n, i, j) = En(2i 3j , A, n);
S(xi = xj ) = Diag= (A, n, i, j) = En(2i 3j 5, A, n);
if S() = En(i, A, n), then
S() = n A\S() = En(2i 52 , A, n);
if S() = En(i, A, n) and S() = En(j, A, n), then
S( ) = S() S() = En(2l 3j 53 , A, n).
The inductive step from to y requires more care. By a change of bound variable we
obtain a formula such that y is logically equivalent to xn , with the free variables
of among x0 , . . . , xn . Hence with S() = En(i, A, n + 1),
S(y) = {s n A : xn (s(0), . . . , s(n 1), xn )}
= Proj(A, S(), n + 1) = En(2i 54 , A, n).
190
Corollary 15.5. Let (x0 , . . . , xn1 ) be a formula in our set-theoretic language with
free variables among x0 , . . . , xn1 . Then the following is provable in ZF:
A[{s n A : A (s(0), . . . , s(n 1))} Df(A, n)].
Proof. By Lemma 15.4, choose m such that
{s n A : A (s(0), . . . , s(n 1))} = En(m, A, n).
Thus our corollary follows by Lemma 15.3.
For any nonempty sets A, B, we say that A is elementarily contained in B, written A B,
iff A B and for all m, n ,
En(m, A, n) = nA En(m, B, n).
Lemma 15.6. Suppose that (x0 , . . . , xn1 ) is a formula whose free variables are
among x0 , . . . , xn1 . If A B then is absolute for A, B.
Proof. Choose m by Lemma 5.4. Suppose that a n A. Then
A (a0 , . . . , an1 ) iff
iff
iff
for all m by induction on m. For each step we assume that s n A is arbitrary. If m does
not have the special forms given in the first parts of the definition of En(m, B, n), then
both sides of () are empty; so we assume that it does have the special forms appropriate
for the several parts of the definition. So with obvious assumptions on i, j we have
s En(2i 3j , A, n) iff
s Diag (A, n, i, j)
iff
iff
s(i) s(j)
s Diag (B, n, i, j)
iff
i
j
s En(2 3 5, A, n) iff
s En(2i 3j , B, n);
s Diag= (A, n, i, j)
iff
iff
iff
s(i) = s(j)
s Diag= (B, n, i, j)
s En(2i 3j 5, B, n);
s En(2i 3j 52 , A, n) iff
iff
s n A\En(i, A, n)
s n A\En(i, B, n)
iff
i
j
3
s En(2 3 5 , A, n) iff
s En(2i 3j 52 , B, n);
s En(i, A, n) En(j, A, n)
iff
iff
s En(i, B, n) En(j, B, n)
s En(2i 3j 53 , B, n).
For the remaining case, first suppose that s En(2i 3j 54 , A, n). It follows that s
Proj(A, En(i, A, n + 1), n). Hence we can choose t En(i, A, n + 1) such that s = t n. By
the inductive hypothesis, t En(i, B, n + 1), so s Proj(B, En(i, B, n + 1), n), and hence
s En(2i 3j 54 , B, n).
Conversely, suppose that s En(2i 3j 54 , B, n). Let m = 2i 3j 54 . Now s
Proj(, En(i, B, n + 1), n). Hence we can choose t En(i, B, n + 1) such that s = t n.
Thus there is an x, namely x = t(n), such that s hxi En(i, B, n + 1) (since s hxi = t).
This means that Case 1 in the definition of fmn (s) holds, and so, since A is closed under
fmn , it follows that fmn (s) A. Let u = s hfmn (s)i. Then u En(i, B, n + 1), so by the
inductive hypothesis, u En(i, A, n + 1). It follows that s Proj(, En(i, A, n + 1), n), and
hence s En(2i 3j 54 , B, n).
Lemma 15.8. For any set A and any n , |Df(A, n) .
Proof. Immediate from Lemma 15.3.
Now we prove a sequence of lemmas leading up to the fact that Df is absolute for transitive
models of ZF. To do this, we have to extend the definitions of our functions above so that
they are defined for all sets, since absoluteness was developed only for such functions. We
do this by just letting the values be 0 for arguments not in the domain of the original
functions.
Lemma 15.9. The function Proj is absolute for transitive models of ZF.
192
Proof.
y = T3 (X) z yR, S X[z = R S] R, S X[R S y].
Lemma 15.15. For any sets A, X and any natural number n, let T4 (A, n, X) =
{Proj(A, R, n) : R X}. Then T4 is absolute for transitive models of ZF.
Proof. x = T4 (A, n, X) iff not(n ) and x = 0, or n and the following condition
holds:
R X[Proj(A, R, n) x] z[R X(z = Proj(A, R, n)) z x].
This is shown to be absolute as in the proof of Lemma 15.8.
Lemma 15.16. Let B = V , and define S B B as follows:
(k, A, n)S(k , A , n )
iff
if k = 0,
T1 (A, n)
y xk [y Df (k, A, n)]
k y Df (k, A, n)[y x].
It should be reasonably clear that Df has been defined in a very explicit fashion. We need
to express this rigorously; it will lead, eventually, to the conclusion that the axiom of choice
holds in the constructible universe. Just as for absoluteness, we give a sequence of lemmas
leading up to a result saying that any member of the range of Df has a natural well-order.
We will deal frequently with some obvious lexicographic orders, which we uniformly denote
by <lex , leaving to the reader exactly which lexicographic order is referred to.
Let A be any set, and n any natural number. For each R {Diag (A, n, i, j) :
i, j < n}, let (Ch(0, A, n, R), Ch(1, A, n, R)) be the smallest pair (i, j), in the lexicographic
order of such that i, j < n and R = Diag (A, n, i, j). Note, for example, that
Diag (A, 2, 0, 0) = Diag (A, 2, 1, 1) = . Now we define
R <0An S
iff
iff
For the next few constructions, suppose that X and A are sets, n , and we are given a
well-ordering < of X. Then we well-order {n A\R : R X} by setting
S 0,A,n,<,X T
iff
Next, for any set A and any k, n we define a well-order <3kAn of Df (k, A, n) by
induction on k. Let <30An be <2An . Assume that <3kAn has been defined for all n ,
and let R, S Df (k + 1, A, n). Then we define R <3(k+1)An S iff one of the following
conditions holds:
(1) R, S Df (k, A, n) and R <3kAn S
(2) R Df (k, A, n) and S
/ Df (k, A, n).
(3) R, S
/ Df (k, A, n), R, S {n A\T : T Df (k, A, n)}, and
R 0,A,n,<3kAn ,Df (k,A,n) S.
(4) R, S
/ Df (k, A, n), R {n A\T : T Df (k, A, n)}, and S
/ {n A\T : T
Df (k, A, n)}.
(5) R, S
/ Df (k, A, n), R, S
/ {n A\T : T Df (k, A, n)}, R, S {T U : T, U
Df (k, A, n)}, and
R 1,A,n,<3kAn ,Df (k,A,n) S.
(6) R, S
/ Df (k, A, n), R, S
/ {n A\T : T Df (k, A, n)}, R {T U : T, U
Df (k, A, n)}, and S
/ {T U : T, U Df (k, A, n)}.
(7) R, S
/ Df (k, A, n), R, S
/ {n A\T : T Df (k, A, n)}, R, S
/ {T U : T, U
iff
Constructible sets
The following is the definable power set operation: For any set A,
D(A) = {X A : n s n AR Df(A, n + 1)[X = {x A : s hxi R}]}.
Here s hxi is the member t of
n+1
Lemma 15.20. Let (v0 , . . . , vn1 , x) be a formula with the indicated free variables.
Then
Av0 , . . . , vn1 A[{x A : A (v0 , . . . , vn1 , x)} D(A)].
Proof. Let v0 , . . . , vn1 A and R = {s
Lemma 15.5, R Df(A, n + 1). Clearly
n+1
i<n
n
sn A
On the other hand, {a} D(A) for each a A by (iii), so |A| |D(A). So (iv) holds.
Now we define the hierarchy of constructible sets:
L0 = ;
L+1 = D(L );
[
L =
L for limit;
<
L=
L .
On
197
{x z : L (x, z, w1 , . . . , wn )} L
199
(since L is transitive)
D(L ) = L+1 ,
as desired.
Power set: See Theorem 14.7. Suppose that x L. For each z L such that z x
choose z such that z Lz . Let = supzx Lz . Then, using Theorem 15.23(v),
P(x) L = {z L : z x} L L+1 L,
as desired.
Replacement: See Theorem 14.8. Suppose that is a formula with free variables
among x, y, A, w1, . . . , wn , we are given A, w1 , . . . , wn L, and
(1)
On
On
Hence L = LM M.
Theorem 15.30. V = L implies AC. In fact, under V = L there is a class wellordering of the universe.
Proof. We define a well-ordering <5 of L by recursion (continuing our numbering
of well-orders from earlier in this chapter). First of all, <50 = . If is a limit ordinal,
then for any x, y L we define
x <5 y
iff
iff
Clearly this is a well-order of n L . Now for any X L+1 = D(L ), let n(X) be the least
natural number n such that
s n L R Df(L , n + 1)[X = {x L : s hxi R}].
Then let s(X) be the least member of
n(X)
EXERCISES
E15.1. Describe En(m, , n) for m 12 and n .
E15.2. Calculate En(625, V3 , 1) and En(20, 000, A, 2).
E15.3. In the ordering <L determine the first four sets and their order. Hint: use Corollary
15.5.
E15.4. Suppose that M is a nonempty transitive class satisfying the comprehension axioms,
and also x My M[x y]. Show that M is a model of ZF.
E15.5. Show that if M is a transitive proper class model of ZF, then x My M[x
y].
E15.6. Show that for every ordinal > , |L | = |V | iff = i .
E15.7. Assume V = L and > . Then L = V iff = i .
E15.8. Assume V = L and prove that L = H() for every infinite cardinal .
In the remaining exercises we develop the theory of ordinal definable sets. OD is the class
of all sets a such that:
> rank(a)n s n R Df(V , n + 1)
x V [s hxi R x = a].
E15.9. Show that if (y1 , . . . , yn , x) is a formula with at most the indicated variables free,
then
1 , . . . , n a[x[(1 , . . . , n , x) x = a] a OD].
Also show that OD.
E15.10. We define s t iff s, t < ON and one of the following holds:
(i) s = and t 6= ;
(ii) s, t 6= and max(rng(s)) < max(rng(t));
(iii) s, t 6= and max(rng(s)) = max(rng(t)) and dmn(s) < dmn(t);
(iv) s, t 6= and max(rng(s)) = max(rng(t)) and dmn(s) = dmn(t) and k
dmn(s)[s k = t k and s(k) < t(k)].
Prove the following:
(v) well-orders < ON.
(vi) t < ON[{s : s t} is a set].
(vii) For every infinite ordinal we have |< ( + 1)| = ||.
(viii) For every uncountable cardinal , the set < is well-ordered by in order type
and is an initial segment of < ON.
(ix) < is well-ordered by in order type 2 .
E15.11. By exercise E15.10, for each uncountable cardinal there is an isomorphism f
from (, ) onto (< , ). Then f f for < . It follows that there is a function
Enon mapping ON onto < ON such that < iff Enon() Enon().
205
Now we define a class function Enod with domain ON, as follows. For any ordinal ,
x V [s x En(m, V , n + 1) x = a],
0 otherwise.
Prove that OD = {Enod() : ON}.
E15.12. Now we define HOD = {x OD : trcl(x) OD}.
Prove that ON HOD and HOD is transitive.
E15.13. Show that (V HOD) HOD for every ordinal .
E15.14. Prove without using the axiom of choice that HOD is a model of ZFC.
References
Devlin, K. Constructibility, 425pp.
Jech, T. Set Theory, 769pp.
Kunen, K. Set Theory, 313pp.
206
x + (y + z) = (x + y) + z;
x + y = y + x;
x + (x y) = x;
x (y + z) = (x y) + (x z);
x + (x) = 1;
(A )
(C )
(L )
(D )
(K )
x (y z) = (x y) z;
x y = y x;
x (x + y) = x;
x + (y z) = (x + y) (x + z);
x (x) = 0.
The main example of a Boolean algebra is a field of sets: a set A of subsets of some set X,
closed under union, intersection, and complementation with respect to X. The associated
Boolean algebra is hA, , , \, 0, Xi. Here \ is treated as a one-place operation, producing
X\a for any a A. This example is really all-encompassingevery BA is isomorphic to
one of these. We will not prove this, or use it.
As is usual in algebra, we usually denote a whole algebra hA, +, , , 0, 1i just by
mentioning its universe A, everything else being implicit.
Some notations used in some treatments of Boolean algebras are: or for +; or
for ; for . These notations might be confusing if discussing logic, or elementary set
theory. Our notation might be confusing if discussing ordinary algebra.
Now we give the elementary arithmetic of Boolean algebras. We recommend that the
reader go through them, but then approach any arithmetic statement in the future from
the point of view of seeing if it works in fields of sets; if so, it should be easy to derive from
the axioms.
First we have the duality principle, which we shall not formulate carefully; our particular uses of it will be clear. Namely, notice that the axioms come in pairs, obtained from
each other by interchanging + and and 0 and 1. This means that also if we prove some
arithmetic statement, the dual statement, obtained by this interchanging process, is also
valid.
Proposition 16.1. x + x = x and x x = x.
Proof.
x + x = x + x (x + x) by (L )
= x by (L);
the second statement follows by duality.
Proposition 16.2. x + y = y iff x y = x.
Proof. Assume that x + y = y. Then, by (L ),
x y = x (x + y) = x.
The converse follows by duality.
207
(x + y) + x y = x (y + y) + y + x y
= x y + x y + y + x y
= y + x y + x y
= y + y = 1,
210
X
iI
ai =
(b ai ).
iI
P
Proof. Let s = iI ai ; we shall show that b s is the least upper bound of {b ai :
i I}. If i I, then ai s and so b ai b s; so b s is an upper bound for the indicated
set. Now suppose that x is any upper bound for this set. Then for any i I we have
b ai x, hence b ai x = 0 and so ai (b x) = b + x; so b + x is an upper
bound for {ai : i I}. It follows that s b + x, and hence s b x, as desired.
Quasi-orders
A quasi-order is a triple P = (P, , 1) such that is a reflexive and transitive relation
on the nonempty set P , and p P (p 1). Note that we do not assume that is
antisymmetric. Partial orders are special cases of quasi-orders in which this is assumed.
Note that we assume that every quasi-order has a largest element; this is non-standard in
the theory of quasi-orders, but standard in treatments of forcing. Many set-theorists use
partial order instead of quasi-order.
Frequently we use just P for a quasi-order; and 1 are assumed.
We say that elements p, q P are compatible iff there is an r p, q. We write p q
to indicate that p and q are incompatible. A set A of elements of P is an antichain iff any
two distinct members of A are incompatible. WARNING: sometimes antichain is used
to mean pairwise incomparable, a much different notion. A subset Q of P is dense iff for
every p P there is a q Q such that q p.
211
Now we are going to describe how to embed a quasi-order into a complete BA. We
take the regular open algebra of a certain topological space. We assume a very little
bit of topology. To avoid assuming any knowledge of topology we now give a minimalist
introduction to topology.
A topology on a set X is a collection O of subsets of X satisfying the following conditions:
(1) X, O.
(2) O is closed under arbitrary unions.
(3) O is closed under finite intersections.
The members of O are said to be open. The interior of a subset Y X is the union of all
open sets contained in Y ; we denote it by int(Y ).
Proposition 16.15. (i) int() = .
(ii) int(X) = X.
(iii) int(Y ) Y .
(iv) int(Y Z) = int(Y ) int(Z).
(v) int(int(Y )) = int(Y ).
(vi) int(Y ) = {x X : x U Y for some open set U }.
Proof. (i)(iii), (v), and (vi) are obvious. For (iv), if U is an open set contained in
Y Z, then it is contained in Y ; so int(Y Z) int(Y ). Similarly for Z, so holds. For
, note that the right side is an open set contained in Y Z. (v) holds since int(Y ) is
open.
A subset C of X is closed iff X\C is open.
Proposition 16.16. (i) and X are closed.
(ii) The collection of all closed sets is closed under finite unions and intersections of
any nonempty subcollection.
For any Y X, the closure of Y , denoted by cl(Y ), is the intersection of all closed sets
containing Y .
Proposition 16.17. (i) cl(Y ) = X\int(X\Y ).
(ii) int(Y ) = X\cl(X\Y ).
(iii) cl() = .
(iv) cl(X) = X.
(v) Y cl(Y ).
(vi) cl(Y Z) = cl(Y ) cl(Z).
(vii) cl(cl(Y )) = cl(Y ).
(viii) cl(Y ) = {x X :for every open set U , if x U then U Y 6= }.
Proof. (i): int(X\Y ) is an open set contained in X\Y , so Y is a subset of the closed
set X\int(X\Y ). Hence cl(Y ) X\int(X\Y ). Also. cl(Y ) is a closed set containing
Y , so X\cl(Y ) is an open set contained in X\Y . Hence X\cl(Y ) int(X\Y ). Hence
X\int(X\Y cl(Y ). This proves (i).
212
hence
hence
the axioms. The following are completely obvious: (A ), (C ), (C). Now let unexplained
variables range over RO(X). For (A), note by 16.18(i) that U U + V (U + V ) + W ;
and similarly V (U + V ) + W and W U + V (U + V ) + W . If U, V, W Z, then by
16.18(iv), U + V Z and hence (U + V ) + W Z. Thus (U + V ) + W is the least upper
bound in RO(X) of U, V, W . This is true for all U, V, W . So U + (V + W ) = (V + W ) + U
is also the least upper bound of them; so (A) holds. For (L):
U + U V = int(cl(U (U V ))) = int(cl(U )) = U.
(L ) holds by 16.18(i). For (D), first note that
Y (Z + W ) = Y int(cl(Z W ))
int(cl(Y (Z W ))) by 16.18(vi)
= int(cl((Y Z) (Y W )))
= Y Z + Y W.
On the other hand, (Y Z) (Y W ) = Y (Z W ) Y, Z W , and hence easily
Y Z + Y W = int(cl((Y Z) (Y W )))
int(cl(Y ) = Y and
Y Z + Y W = int(cl((Y Z) (Y W )))
int(cl(Z W ) = Z + W ;
so the other inclusion follows, and (D) holds.
(K): For any regular open Y , from Proposition 16.17(ii) we get Y = int(X\Y ) =
X\cl(X\(X\Y )) = X\cl(Y ). Hence
X = cl(Y ) (X\cl(Y )) cl(Y ) cl((X\cl(Y )) = cl(Y (X\cl(Y ))),
and hence X = Y + Y .
(K ): Clearly = Y int(X\Y ) = Y Y .
Thus we have now proved that hRO(X), +, , , , Xi is a BA. Since is the same as ,
is the same as . Hence by Proposition 16.18(ix), hRO(X), +, , , , Xi is a complete
BA.
Now we return to our task of embedding a quasi-order into a complete Boolean algebra.
Let P be a given quasi-order. For each p P let P p = {q : q p}. Now we define
OP = {X P : (P p) X for every p X}.
We check that this gives a topology on P . Clearly P, O. To
S show that O is closed
under arbitrary unions, suppose that X S
O. Take any p X . Choose X X
such that p X. Then (P p) X X , as desired. If X, Y OP , suppose that
215
p X Y . Then p X, so (P p) X. Similarly (P p) Y , so (P p) X Y .
Thus X Y OP , finishing the proof that OP is a topology on P .
We denote the complete BA of regular open sets in this topology by RO(P ).
Now for any p P we define
e(p) = int(cl(P p)).
Thus e maps P into RO(P ).
This is our desired embedding. Actually it is not really an embedding in general, but
it has several useful properties, and for many quasi-orders it really is an embedding.
The useful properties mentioned are as follows. We say that a subset X of P is dense
below p iff for every r p there is a q r such that q X.
Theorem 16.20. Let P be a quasi-order. Suppose that p, q P , F is a finite subset
of P , a, b RO(P ), and N is a subset of RO(P )
(i) e[P ] is dense in RO(P ), i.e., for any nonzero Y RO(P ) there is a p P such
that e(p) Y .
(ii) If p q then e(p) e(q).
(iii) p q iff e(p) e(q) = .
(iv) If e(p) e(q), then p and q are compatible.
(v) The following conditions are equivalent:
(a) e(p) e(q).
(b) {r : r p, q} is dense below p.
(vi) The following
Q conditions are equivalent, for F nonempty:
(a) e(p) qF e(q).
(b) {r : r q for all q F } is dense below p.
(vii) The following
Q conditions
P are equivalent:
(a) e(p) ( qF e(q)) N .
(b) {r : r q for all q F and e(r) s for some s N } is dense below p.
(viii) e(p) a iff there is no q p such that e(q) a.
(ix) e(p) a + b iff for all q p, if e(q) a then e(q) b.
Proof. (i): Assume the hypothesis. By the definition of the topology and since Y is
nonempty and open, there is a p P such that P p Y . Hence e(p) = int(cl(P p))
int(cl(Y )) = Y .
(ii): If p q, then P p P q, and so e(p) = int(cl(P p)) int(cl(P q) = e(q)).
(iii): Assume that p q. Then (P p) (P q) = , and hence by Proposition
16.18(viii), e(p) e(q) = .
Conversely, suppose that e(p) e(q) = . Then (P p) (P q) e(p) e(q) = ,
and so p q.
(iv): If e(p) e(q), then e(p) e(q) = e(p) 6= , so p and q are compatible by (iii).
(v): For (a)(b), suppose that e(p) e(q) and s p. Then e(s) e(p) e(q), so s
and q are compatible by (iv); say r s, q. Then r s p, hence r p, q, as desired.
For (b)(a), suppose that e(p) 6 e(q). Thus e(p) e(q) 6= 0. Hence there is an s
such that e(s) e(p) e(q). Hence e(s) e(q) = , so s q by (iii). Now e(s) e(p), so s
216
and p are compatible by (iv); say t s, p. For any r t we have r s, and hence r q.
So (b) fails.
(vi): We proceed by induction on |F |. The case |F | = 1 is given by (v).
Q Now assume
the result for F , and suppose that tQ P \F . First suppose that e(p) qF e(q) e(t).
Suppose that s p. Now e(p) qF e(q), so by the inductive hypothesis there is a
u s such that u q for all q F . Thus e(u) e(s) e(p) e(t), so by (iv), u and t
are compatible. Take any v u, t. then v q for any q F {t}, as desired.
Second, suppose that (b) holds
Q for F {t}. In particular, {r : r q for all q F }
is dense below p, and so e(p) qF e(q) by the inductive hypothesis. But also clearly
{r : r t} is dense below p, so e(p) e(t) too, as desired.
Q
P
(vii): First assume that e(p) ( qF e(q)) N , and suppose that u p. P
By (vi),
there is a v uPsuch that
v
q
for
each
q
F
.
Now
e(v)
e(u)
e(p)
N , so
P
0 6= e(v) = e(v) N = sN (e(v)e(s)). Hence there is an s N such that e(v)e(s) 6= 0.
Hence by (iii), v and s are compatible; say r v, s. Clearly r is in the set described in (b).
Second, suppose
that (b) holds. Clearly then {r : r q for
Q
P all q F } is densePbelow p,
and so e(p) qF e(q) by (vi). Now P
suppose that e(p) 6
N . Then e(p) N 6= 0,
so there is a q such that e(q) e(p) N . By (iv), q and p are compatible; say s p, q.
Then
P by (b)
P choose r s and t N such that e(r) t. Thus e(r) e(s) t e(p) t
( N ) N = 0, contradiction.
(viii): Assume that e(p) a. Suppose that q p and e(q) a. Then e(q)
a a = 0, contradiction.
(viii): Assume that e(p) 6 a. Then e(p) a 6= 0, so there is a q such that
e(q) e(p) a. By (vii) there is an r p, q with e(r) a, as desired.
(ix): Assume that e(p) a + b, q p, and e(q) a. Then e(q) a (a + b) b,
as desired.
(ix): Assume the indicated condition, but suppose that e(p) 6 a + b. Then
e(p) a b 6= 0, so there is a q such that e(q) e(p) a b. By (vii) with F = {p} and
N = {a b} we get q such that q p and e(q) a b. So q p and e(q) a, so by our
condition, e(q) b. But also e(q) b, contradiction.
We now expand on the remarks above concerning when e really is an embedding. Note
that if P is a simple ordering, then the closure of P p is P itself, and hence P has only
two regular open subsets, namely the empty set and P itself. If the ordering on P is trivial,
meaning that no two elements are comparable, then every subset of P is regular open.
An important condition satisfied by many quasi-orders is defined as follows. We say
that P is separative iff it is a partial order (thus is an antisymmetric quasi-order), and for
any p, q P , if p 6 q then there is an r p such that r q.
Proposition 16.21. Let P be a quasi-order.
(i) cl(P p) = {q : p and q are compatible}.
(ii) e(p) = {q : for all r q, r and p are compatible}.
(iii) The following conditions are equivalent:
(a) P is separative.
(b) e is one-one, and for all p, q P , p q iff e(p) e(q).
217
Proof. (i) and (ii) are clear. For (iii), (a)(b), assume that P is separative. Take
any p, q P . If p q, then e(p) e(q) by 16.20(ii). Suppose that p 6 q. Choose r p
such that r q. Then r e(p), while r
/ e(q) by (ii). Thus e(p) 6 e(q).
Now suppose that e(p) = e(q). Then p q p by what was just shown, so p = q
since P is a partial order.
For (iii), (b)(a), suppose that p q p. Then e(p) e(q) e(p), so e(p) = e(q),
and hence p = q. So P is a partial order. Suppose that p 6 q. Then e(p) 6 e(q). Choose
s e(p)\e(q). Since s
/ e(q), by (ii) we can choose t s such that t q. Since s e(p),
it follows that t and p are compatible; choose r t, p. Clearly r q.
Now we prove a theorem which says that the regular open algebra of a quasi-order is unique
up to isomorphism.
Theorem 16.22. Let P be a quasi-order, A a complete BA, and j a function mapping
P into A with the following properties:
(i) j[P ] is dense in A, i.e., for any nonzero a A there is a p P such that j(p) a.
(ii) For all p, q P , if p q then j(p) j(q).
(iii) For any p, q P , p q iff j(p) j(q) = 0.
Then there is a unique isomorphism f from RO(P ) onto A such that f e = j. That is,
f is a bijection from RO(P ) onto A, and for any x, y RO(P ), x y iff f (x) f (y).
Note that since the Boolean operations are easily expressible in terms of (as least upper
bounds, etc.), the condition here implies that f preserves all of the Boolean operations
too; this includes the infinite sums and products.
Proof. Before beginning the proof, we introduce some notation in order to make the
situation more symmetric. Let B0 = RO(P ), B1 = A, k0 = e, and k1 = j. Then for each
m < 2 the following conditions hold:
(1) km [P ] is dense in Bm .
(2) For all p, q P , if p q then km (p) km (q).
(3) For all p, q P , p q iff km (p) km (q) = 0.
(4) For all p, q P , if km (p) km (q), then p and q are compatible.
In fact, (1)(3) follow from 16.20 and the assumptions of the theorem. Condition (4) for
m = 0, so that km = e, follows from 16.20(iv). For m = 1, so that km = j, it follows easily
from (iii).
Now we begin the proof. For each m < 2 we define, for any x Bm ,
gm (x) =
The proof of the theorem now consists in checking the following, for each m 2:
(5) If x, y Bm and x y, then gm (x) gm (y).
(6) g1m gm is the identity on Bm .
218
In fact, suppose that (5) and (6) have been proved. If x, y RO(P ), then
x y implies that g0 (x) g0 (y) by (5);
g0 (x) g0 (y) implies that x = g1 (g0 (x)) g1 (g0 (y)) = y by (5) and (6).
Also, (6) holding for both m = 0 and m = 1 implies that g0 is a bijection from RO(P )
onto A. So g0 is the desired function f of the theorem.
Now (5) is obvious from the definition. To prove (6), assume that m 2. We first
prove
(7) For any p P and any b Bm , km (p) b iff k1m (p) gm (b).
To prove (7), first suppose that km (p) b. Then obviously k1m (p) gm (b). Second,
suppose that k1m (p) gm (b) but km (p) 6 b. Thus km (p) b 6= 0, so by the denseness
of km [P ] in Bm , choose q P such that km (q) km (p) b. Then p and q are compatible
by (4), so let r P be such that r p, q. Hence
k1m (r) k1m (p) gm (b) =
P
Hence k1m (r) = {k1m (s) k1m (r) : s P, km (s) b}, so there is an s P such that
km (s) b and k1m (s) k1m (r) 6= 0. Hence s and r are compatible; say t s, r. Hence
km (t) km (r) km (q) b, but also km (t) km (s) b, contradiction. This proves (7).
Now take any b Bm . Then
g1m (gm (b)) =
=
X
X
= b.
Thus (6) holds.
This proves the existence of f . Now suppose that g is also an isomorphism from
RO(P ) onto A such that g e = j, but suppose that f 6= g. Then there is an X RO(P )
such that f (X) 6= g(X). By symmetry, say that f (X) g(X) 6= 0. By (ii), choose p P
such that j(p) f (X) g(X). So f (e(p)) = j(p) f (X), so e(p) X, and hence
j(p) = g(e(p)) g(X). This contradicts j(p) g(X).
Corollary 16.23. Suppose that A is a complete BA with at least two elements, and
consider the partial order P = (A\{0}, , 1). Then A is isomorphic to RO(P).
Proof. Let P = A\{0}, and let j be the identity on P . Then of course j[P ] is dense
in A. If p, q P and p q, then j(p) j(q). Condition (iii) of 16.22 is also clear. Hence
there is an isomorphism f of RO(P) onto A such that f (e(p)) = p for every p P .
EXERCISES
E16.1. Let (A, +, , , 0, 1) be a Boolean algebra. Show that (A, , , 0, 1) is a ring with
identity in which every element is idempotent. This means that x x = x for all x.
219
E16.16. Let (A, +, , 0, 1) be a ring with identity in which every element is idempotent.
Show that A is a commutative ring, and (A, , , , 0, 1) is a Boolean algebra, where for
any x, y A, x y = x + y + xy and for any x A, x = 1 + x. Hint: expand (x + y)2 .
E16.3. Show that the processes described in exercises E16.1 and E16.2 are inverses of one
another.
E16.4. A filter in a BA A is a subset F of A with the following properties:
(1) 1 F .
(2) If a F and a b, then b F .
(3) If a, b F , then a b F .
An ultrafilter in A is a filter F such that 0
/ F , and for any a A, a F or a F .
Prove that a filter F is an ultrafilter iff F is maximal among the set of all filters G
such that 0
/ G.
E16.5. (Continuing exercise E16.4) Prove that for any nonzero a A there is an ultrafilter
F such that a F .
E16.6. (Continuing exercise E16.4) Prove that any BA is isomorphic to a field of sets.
(Stones representation theorem) Hint: given a BA A, let X be the set of all ultrafilters
on A and define f (a) = {F X : a F }.
E16.7 (Continuing exercise E16.4) Suppose that F is an ultrafilter on a BA A. Let 2 be
the two-element BA. (This is, up to isomorphism, the BA of all subsets of 1.) For any
a A let
n
1 if a F ,
f (a) =
0 if a
/ F.
Show that f is a homomorphism of A into 2. This means that for any a, b A, the
following conditions hold:
f (a + b) = f (a) + f (b);
f (a b) = f (a) f (b);
f (a) = f (a);
f (0) = 0;
f (1) = 1.
E16.8. (Lindenbaum-Tarski algebras; A knowledge of logic is assumed.) Suppose that L
is a first-order language and T is a set of sentences of L . Define T iff and are
sentences of L and T |= . Show that this is an equivalence relation on the set S of
all sentences of L . Let A be the collection of all equivalence classes under this equivalence
relation. Show that there are operations +, , on A such that for any sentences , ,
[] + [] = [ ];
[] [] = [ ];
[] = [].
220
Finally, show that (A, +, , , [v0((v0 = v0 ))], [v0 (v0 = v0 )]) is a Boolean algebra.
E16.9. (A knowledge of logic is assumed.) Show that every Boolean algebra is isomorphic
to one obtained as in exercise E16.8. Hint: Let A be a Boolean algebra. Let L be the
first-order language which has a unary relation symbol Ra for each a A. Let T be the
following set of sentences of L :
xy(x = y);
x[Ra (x) Ra (x)] for each a A;
x[Rab (x) Ra (x) Rb (x)] for all a, b A;
xR1 (x).
def
I 2 iI
2=
(xi + xi )
while
X Y
2
(i)
xi
221
= ,
iff
p, q P, p q, and q p.
Show that is an equivalence relation, and if Q is the collection of all -classes, then
there is a relation on Q such that for all p, q P , [p] [q] iff p q. Finally, show
that (Q, , [1]) is a partial order.
E16.15. We say that (P, <, 1) is a partial order in the second sense iff < is transitive and
irreflexive, and x < 1 for all x 6= 1. (Irreflexive means that for all p P , p 6< p.) Show that
if (P, <, 1) is a partial order in the second sense and if we define by p q iff (p, q P
def
and p < q or p = q), then A (P, <, 1) = (P, ) is a partial order, and p 1 for all p P .
Furthermore, show that if (P, ) is a partial order in which p 1 for all p, and we define
def
222
Then G
/ M.
Proof. Suppose to the contrary that G M . Then also P \G M , since M is a
model of ZFC and by absoluteness. We claim that P \G is dense. In fact, given p P ,
choose q, r as in (1). Then q, r cannot both be in G, by the definition of filter. So one
at least is in P \G, as desired. Since P \G is dense and in M , we contradict G being
generic.
Most quasi-orders used in forcing arguments satisfy the condition of Lemma 17.2; for more
details on this lemma, see the exercises.
The following elementary proposition gives six equivalent ways to define generic filters.
Proposition 17.3. Suppose that M is a c.t.m. of ZFC and P is a quasi-order in M .
Suppose that G P satisfies condition (2), i.e., if p G and p q, then q G. Then the
following conditions are equivalent:
(i) G D 6= whenever D M and D is dense in P.
(ii) G A 6= whenever A M and A is a maximal antichain of P.
(iii) G E 6= whenever E M and for every p P there is a q E such that p
and q are compatible.
Moreover, suppose that G satisfies (2) and one, hence all, of the conditions (i)(iii). Then
G is P-generic over M iff the following condition holds:
(iv) For all p, q G, p and q are compatible.
Proof. (i)(ii): Assume (i), and suppose that A M is a maximal antichain of P.
Let D = {p P : p q for some q A}. We claim that D is dense. Suppose that r is
arbitrary. Choose q A such that r and q are compatible. Say p r, q. Thus p D.
So, indeed, D is dense. Clearly D M , since A M . By (i), choose p D G. Say
p q A. Then q G A, as desired.
(ii)(iii): Assume (ii), and suppose that E is as in (iii). By Zorns lemma, let A be
a maximal member of
(1) {B P : B is an antichain, and for every p B there is a q E such that p q}.
We claim that A is a maximal antichain. For, suppose that p q for all q A. Choose
s E such that p and s are compatible. Say r p, s. Hence r q for all q A, so r
/ A.
Thus A {r} is a member of (1), contradiction.
Clearly A M , since E M . So, since A is a maximal antichain, choose p A G.
Then choose q E such that p q. So q E G, as desired.
(iii)(i): Obvious.
Now we assume (2) in the definition, and (i)(iii).
If G is P-generic over M , clearly (iv) holds.
Now asume that (i)(iv) hold, and suppose that p, q G; we want to find r G such
that r p, q. Let
D = {r : r p or r q or r p, q}.
We claim that D is dense in P. For, let s P be arbitrary. If s p, then s s and s D,
as desired. So suppose that s and p are compatible; say t s, p. If t q, then t s and
224
1
0
iff
Note that M P M , and hence M P is countable. Hence by the replacement axiom, M [G]
is also countable.
Lemma 17.4. If M is a c.t.m. of ZFC, P M is a quasi-order, and G is a filter on
P, then M [G] is transitive.
Proof. Suppose that x y M [G]. Then there is a M P such that y = G . Since
x G , there is a M P such that x = G . So x M [G].
The following Lemma says that M [G] is the smallest c.t.m. of ZFC which contains M as
a subset and G as a member, once we show that it really is a model of ZFC. This lemma
will be extremely useful in what follows.
Lemma 17.5. Suppose that M is a c.t.m. of ZFC, P M is a quasi-order, G is a
filter on P, N is a c.t.m. of ZFC, M N , and G N . Then M [G] N .
Proof. Take any x M [G]. Say x = val(, G) with M P . Then , G N , so by
absoluteness, x = (val(, G))N N .
To show that M is a subset of M [G], we need a function mapping M into the collection
of all P -names. We now assume that P = (P, , 1) is a quasi-order. Again the definition
is by recursion.
x
= {(
y , 1) : y x}.
Note that this depends on P; we could denote it by check(P, x) to bring this out, if necessary.
Again this function is absolute for transitive models of ZFC.
Lemma 17.6. Suppose that M is a transitive model of ZFC, P M is a quasi-order,
and G is a non-empty filter on P . Then
(i) For all x M , x
M P and val(
x, G) = x.
(ii) M M [G].
Proof. Absoluteness implies that x
M P for all x M . We prove val(
x, G) = x by
-induction. If it is true for all y x, then
val(
x, G) = {val(, G) : (, 1) x
}
= {val(
y , G) : y x}
= {y : y x}
= x.
Finally (ii) is immediate from (i).
Next, for any partial order P we define a P -name . It depends on P and could be defined
as P to bring this out.
= {(
p, p) : p P }.
Lemma 17.7. Suppose that M is a transitive model of ZFC, P M is a quasi-order,
and G is a non-empty filter on P . Then G = G. Hence G M [G].
Proof. G = {
pG : p G} = {p : p G} = G.
226
Note by absoluteness of the rank function that rank( ) is the same within M or M [G].
Lemma 17.9. Suppose that M is a transitive model of ZFC, P M is a quasi-order,
and G is a non-empty filter on P . Then M and M [G] have the same ordinals.
Proof. Since M M [G], every ordinal of M is an ordinal of M [G]. Now suppose that
is any ordinal of M [G]. Write = G , where M P . Now rank( ) = rankM ( ) M .
So by Lemma 17.8, = rank() = rank(G ) rank( ) M , so M .
The following lemma will be used often.
Lemma 17.10. Suppose that M is a transitive model of ZFC, P M is a quasi-order,
E P , E M , and G is a P-generic filter over M . Then:
(i) Either G E 6= , or there is a q G such that r q for all r E.
(ii) If p G and E is dense below p, then G E 6= .
Proof. Let
D = {p : p r for some r E} {q : q r for all r E}.
We claim that D is dense. For, suppose that q P . We may assume that q
/ D. So q
is not in the second set defining D, and so there is an r E which is compatible with q.
Take p with p q, r. then p D and p q, as desired.
Since D is dense, we can choose s G D. Now to prove (i), suppose that G E = .
Then s is not in the first set defining D, so it is in the second set, as desired.
For (ii), suppose that G E = , and by (i) choose q G such that q r for every
r E. By the definition of filter, there is a t G with t p, q. Since E is dense below
p, there is then a u E with u t. Thus u q, so it is not the case that u q,
contradiction.
Proposition 17.11. Suppose that M is a transitive model of ZFC, P M is a quasiorder, and G is a P-generic filter over M . Suppose that p P and p is compatible with
each member of G. Then p G.
Proof. The set {q P : q p or q p} is clearly dense in P.
Now we introduce the idea of forcing. Recall that the logical primitive notions are , ,
, and =.
With each formula (v0 , . . . , vm1 ) of the language of set theory we define another
formula
p
P,M (0 , . . . , m1 ),
227
Y
X
e(p) +
[[ = ]] =
(e(q) [[ = ]])
(,p)
(,q)
(,q)
e(q) +
(,p)
(e(p) [[ = ]]) .
Now we associate with each formula of the language of set theory and with each seqeuence
of members of VP a Boolean value, by metalanguage recursion, as follows:
X
[[ ]] =
(e(p) [[ = ]]);
(,p)
Note that the last big sum has index set which is a proper class in general. But the values
are all in the Boolean algebra A, so this makes sense.
Lemma 17.12.
[[(0 , . . . , m1 ) (0 , . . . , m1 )]] = [[(0 , . . . , m1 )]] + [[(0 , . . . , m1 )]];
[[(0 , . . . , m1 ) (0 , . . . , m1 )]] = [[(0 , . . . , m1 )]] + [[(0 , . . . , m1 )]];
Y
[[x(0 , . . . , m1 , x)]] =
[[(0 , . . . , m1 , )]].
VP
Note that the proof of Lemma 17.12 depends on the standard way of defining the indicated
connectives in terms of the ones we have selected as primitive: is ( ),
is , and x is x.
228
[[ = ]] =
(,p)
(e(p) + [[ ]])
(,q)
(e(q) + [[ ]]) .
(iii) [[ = ]] = 1.
(iv) If (, r) , then e(r) [[ ]], and hence r
.
(v) p
iff the set
{q : (, s) (q s and q
= )}
is dense below p.
(vi) p
= iff the following two conditions hold:
(a) For all (, s) , the set
{q p : if q s, then there is a (, u) such that q u and q
= }
is dense below p;
(b) For all (, u) , the set
{q p : if q u, then there is a (, s) such that q s and q
= }
is dense below p.
(vii)
p
(0 , . . . , n1 ) (0 , . . . , n1 )
iff
p
(0 , . . . , n1 ) and p
(0 , . . . , n1 ).
(viii) p
(0 , . . . , n1 ) (0 , . . . , n1 ) iff for all q p there is an r q such
that r
(0 , . . . , n1 ) or r
(0 , . . . , n1 ).
(ix) p
(0 , . . . , n1 ) iff for all q p, q 6
(0 , . . . , n1 ).
(x) {p : p
(0 , . . . , n1 ) or p
(0 , . . . , n1 )} is dense.
(xi) p
x(x, 0 , . . . , n1 ) iff the set
{r p : there is a VP such that r
(, 0 , . . . , n1 )}
229
is dense below p.
(xii) p
x(x, 0 , . . . , m1 ) iff for all VP , p
(, 0 , . . . , m1 ).
(xiiii) The following are equivalent:
(a) p
(0 , . . . , m1 ).
(b) For every r p, r
(0 , . . . , m1 ).
(c) {r : r
(0 , . . . , m1 )} is dense below p.
(xiv) p
(0 , . . . , m1 ) (0 , . . . , m1 ) iff the set
{q : q
(0 , . . . , m1 ) or q
(0 , . . . , m1 )}
is dense below p.
(xv) If p
x(x, 0 , . . . , m1 ), then the set
{q : there is a VP such that q
(, 0 , . . . , m1 )}
is dense below p.
(xvi) If p
(0 , . . . , m1 ) and p
(0 , . . . , m1 ) (0 , . . . , m1 ), then
p
(0 , . . . , m1 ).
Proof.
(i): By induction:
[[ = ]] =
Y
(,p)
(,q)
e(q) +
Y
(,q)
(,p)
(,q)
(,p)
(,p)
230
(,p)
(,q)
(e(p) [[ = ]])
(e(q) [[ = ]])
X
X
= [[ = ]].
(,q)
(e(q) [[ = ]])
(e(p) [[ = ]])
e(q) +
e(p) +
(,q)
(,p)
e(q) +
e(p) +
X
X
Y
Y
e(p) +
(e(p) [[ = ]])
(e(q) [[ = ]])
(ii):
[[ = ]] =
Y
(,p)
(,q)
e(q) +
Y
(,p)
(,q)
e(q) +
(,p)
(,q)
X
(,p)
e(p) +
e(p) +
(,q)
(,p)
(e(p) [[ = ]])
X
X
(e(q) [[ = ]])
(e(q) [[ = ]])
(e(p) [[ = ]])
(e(p) + [[ ]])
(,q)
(e(q) + [[ ]]) .
We prove (iii) and (iv) simultaneously by induction on the rank of ; so suppose that they
hold for all of rank less than . Assume that (, r) . Then by the definition of
[[ ]],
X
[[ ]] =
(e(s) [[ = ]]) e(r) [[ = ]] = e(r),
(,s)
as desired in (iii).
We now use Theorem 16.20(vii) in several of our arguments.
(v):
p
iff
iff
e(p) [[ ]]
X
e(p)
(e(s) [[ = ]])
(,s)
iff
Take any r p, and choose q r and (, x) such that e(q) e(s) [[ = ]]. Then q
and s are compatible; say t q, s. Then t q r and e(t) e(q) e(s) [[ = ]]. Thus
() holds.
Now () is clearly equivalent to
{q : (, s) [q s and s
= ]} is dense below p.
(vi): Assume that p
= .
For (a), suppose that (, s) and r p. If r 6 s, then r itself is in the desired set;
so suppose that r s. Then
X
(,u)
(e(u) [[ = ]]).
(,u)
Hence there is a (, u) such that e(r) e(s) e(u) [[ = ]] 6= 0. Hence there exists a
v r, s such that e(v) e(u) [[ = ]]. (See the argument for (v)). It follows that there
is a q v, u with e(q) [[ = ]]. So q
= , and q is in the desired set.
(b) is treated similarly.
Now assume that (a) and (b) hold. We want to show that p
= , i.e., that
e(p) [[ = ]]. To show that e(p) is below the first big product in the definition of
[[ = ]], take any (, q) ; we want to show that
X
e(p) e(q) +
(e(r) [[ = ]]),
(,r)
i.e., that
e(p) e(q)
(e(r) [[ = ]]).
(,r)
Suppose that this is not the case. Then there is an s such that
e(s) e(p) e(q)
(,r)
(e(r) + [[ = ]]).
(,r)
e(q)
[[(, 0 , . . . , n1 )]].
M P
Now suppose that q p. Then e(q) is the sum here, so we easily get r q and M P
such that e(r) [[(, 0 , . . . , m1 )]]. Hence r
(, 0 , . . . , m1 ), as desired.
(xvi): The hypotheses yield
e(p) [[(0 , . . . , m1 )]] and
e(p) [[(0 , . . . , m1 )]] + [[(0 , . . . , m1 )]],
so e(p) [[(0 , . . . , m1 )]] and hence p
(0 , . . . , m1 )]].
Note that the discussion of Boolean values and of
has taken place in our usual framework
of set theory. The complete BA RO(P ) is in general uncountable. Given a c.t.m. M of
ZFC, the definitions can take place within M , and while M may be a model of RO(P )
is uncountable, actually RO(P )M is countable. Thus even if 0 , . . . , m1 are members
of M P , the statements p
(0 , . . . , m1 ) and (p
(0 , . . . , m1 ))M are much
different, since the products and sums involved in the definition of the former range over
a possibly uncountable complete BA, while those in the latter range only over a countable
BA (which is actually incomplete if it is infinite). It can be shown, however, that the
statements are equivalent; see exercise 17.20. We do not need this, as we will really be
concerned only with the forcing relativized to M .
Now we prove the fundamental theorem connecting the notion
in a c.t.m. M with
the notion
, whose definition takes place outside M .
Theorem 17.14. (The Forcing Theorem) Suppose that M is a c.t.m. of ZFC, P M
is a quasi-order, and G is P-generic over M . Then the following conditions are equivalent:
(i) There is a p G such that (p
(0 , . . . , m1 ))M .
(ii) (0G , . . . , (m1)G ) holds in M [G].
Proof. First we prove the equivalence for the formula x = y, by induction on the pair
(0 , 1 ).
For (i)(ii), suppose that p G and (p
0 = 1 )M . We want to show that
0G = 1G . First we prove that 0G 1G . So, suppose that a 0G . Then there is
a (, t) 0 such that t G and a = G . Since p, t G, we can choose r G such
that r p, t. Then (r
0 = 1 )M since r p. Now we work within M . We have
e(r) [[0 = 1 ]]. Hence by definition we have
e(r) e(t) +
X
(,u)1
234
(e(u) [[ = ]]);
(,u)1 (e(u)
[[ = ]]. Now
{s P : (, u) 1 [e(s) [[ = ]]]}
P
is dense below r. In fact, suppose that t r. Then e(t) (,u)1 (e(u) [[ = ]], and
so there is a (, u) 1 such that e(t) e(u) [[ = ]] 6= 0; hence there is a v t, u such
that e(v) h = ]], as desired.
By this denseness, using Lemma 17.10 we get s G and (, u) 1 such that e(s)
[[ = ]]. So s
= . By the inductive hypothesis we get G = G . Hence a = G =
G 1G , as desired. The proof that 1G 0G is similar.
For (ii)(i), suppose that 0G = 1G holds in M [G]. By Lemma 17.13(x), choose
p G such that (p
0 = 1 )M or (p
(0 = 1 ))M . It suffices now to get a
contradiction from the assumption that (p
(0 = 1 ))M . First we claim:
(1) The set
{r : there is a (, s) 1 such that
Y
e(r) e(s)
(e(t) + [[ = ]]),
(,t)0
is dense below p.
In fact, suppose that q p. Then
e(q) [[0 = 1 ]]
X
=
(e(s)
(,s)1
(,t)0
(e(t)
(,t)0
(e(t) + [[ = ]]))
(e(s) + [[ = ]])).
(,s)1
Now if e(q) times the first big sum here is nonzero, then we easily get r q and (, s) 1
such that
Y
e(r) e(s)
(e(t) + [[ = ]]).
(,t)0
Thus r is in the set (1), as desired. If e(q) times the first big sum is zero, then we easily
get r q in (1) again. So (1) is dense below p.
Hence there is an r G such that r is in the set (1).
Case 1. There is a (, s) 1 such that
e(r) e(s)
(e(t) + [[ = ]]).
(,t)0
235
Now it is easily seen that the set {u : u r, s} is dense below r, so we can choose u G
such that u r, s. It follows that s G, and hence G 1G . Hence G 0G , and so
there is a (, t) 0 such that t G and G = G . By the induction hypothesis, there is
a v G such that (v
= )M . Now choose w G with w u, t, v. Now w u r,
so e(w) e(r), and hence e(w) e(t) + [[ = ]]. Also w t, so e(w) e(t), and
hence e(w) [[ = ]]. But also w v, so (w
= )M , and hence e(w) [[ = ]],
contradiction.
Case 2. There is a (, t) 0 such that
e(r) e(t)
(e(s) + [[ = ]]).
(,s)1
If the equivalence holds for and then it clearly also holds for .
Finally, we deal with the formula x(x, 0 , . . . , n1 ). For (i)(ii), assume that
p G and (p
x(x, 0 , . . . , n1 ))M . By Lemma 17.13(xi), choose q p with q G
and a name such that (q
(, 0 , . . . , n1 ))M . Then by the inductive hypothesis,
M [G] (0G , 0G , . . . , (m1)G ), as desired.
For (ii)(i), suppose that (x(x, 0G , . . . , (m1)G )M [G] . Let be a name such
that (0G , 0G , . . . , (m1)G ). Then by the inductive hypothesis there is a p G such
that (p
(, 0 , . . . , n1 ))M . By Lemma 17.13(xi), (p
x(x, 0 , . . . , n1 ))M , as
desired.
Theorem 17.15. Let M be a c.t.m. of ZFC and P M a quasi-order.
(i) For all p P , p
(0 , . . . , m1 ) iff (p
(0 , . . . , m1 ))M .
(ii) For every G which is P-generic over M ,
(0G , . . . , (m1)G ) holds in M [G] iff
there is a p G such that p
(0 , . . . , m1 ).
Proof. (i): Assume that p
(0 , . . . , m1 ). Working in M , suppose that
p 6
(0 , . . . , m1 ). Then
and hence
is dense below p.
(ii) If p
x (x, 0 , . . . , m1 ), then there exist a q p and a dmn( ) such
that q
(, 0 , . . . , m1 ).
(iii) The following are equivalent:
(a) p
x (x, 0 , . . . , m1 ).
(b) For all (, r) and all q p, r, q
(, 0 , . . . , m1 ).
Proof. (i)(a)(b): Assume (a), and suppose that r p. Let G be generic with
r G. Thus by definition of
, x G (x, 0G , . . . , (m1)G ) holds in M [G], so there is
a member a of M [G] such that a G and (a, 0G , . . . , (m1)G ) holds in M [G]. Then
there is a (, r) such that r G and a = G . By Theorem 17.15, there is an s G
such that s
(, 0 , . . . , m1 ). Choose q such that q p, r, s. Then q is in the set of
(b), as desired.
(i)(b)(a): Assume (b), and suppose that G is generic and p G. Then by (b) we
can choose q p with q G, and q in the set of (b). So, choose (, r) such that q r
and q
(, 0 , . . . , m1 ). Hence p
(, 0 , . . . , m1 ), and (a) holds.
(ii): immediate from (i).
(iii)(a)(b): assume (a) and suppose that (, r) and q p, r. Suppose that
G is generic and q G. Then also p G, so by (a) and the definition of
, the statement x G (x, 0G , . . . , (m1)G ) holds in M [G]. Also r G, so G G . Hence
(G , 0G , . . . , (m1)G ) holds in M [G]. This proves (b).
(iii)(b)(a): Assume (b), and suppose that p G with G generic. To prove that
x G (x, 0G , . . . , (m1)G ) holds in M [G], let a be any member of G . Write a = G
with (, r) and r G. Choose q G with q p, r. Then by (b) we see that
(G , 0G , . . . , (m1)G ) holds in M [G]. This proves (a).
Proposition 17.18.
(i)
[[x (0 , . . . , m1 , x)]] =
(,p)
(ii)
[[x (0 , . . . , m1 , x)]] =
(,p)
Proof. For (i), we first show that [[x (0 , . . . , m1 , x)]] is an upper bound for
def
(e(p) + (0 , . . . , m1 , x)]]).
(,p)
Now we are going to show that M [G] is always a model of ZFC. To treat the pairing axiom,
relations and functions, the following definitions are useful.
Let P be any set, and let and be P -names. Then we define
up(, ) = {(, 1), (, 1)};
op(, ) = up(up(, ), up(, )).
Lemma 17.19. Suppose that M is a c.t.m. of ZFC, P M is a quasi-order, and G
is P-generic over M . Suppose that , M P . Then
(i) up(, ) M P and up(, )G = {G , G }.
(ii) op(, ) M P and op(, )G = (G , G ).
Theorem 17.20. Let M be a c.t.m. of ZFC, P M a quasiorder, and G P-generic
over M . Then the relativization of each axiom of ZFC to M [G] holds.
Proof. We are going to use the simplified procedure developed in Chapter 14 to check
axioms.
Since M [G] is transitive by Lemma 17.4, extensionality and foundation hold by Theorems 14.3 and 14.9.
Next we take comprehension, using Theorem 14.4. Let (x, z, w1 , . . . , wn ) be a formula
with free variables among those mentioned, and let , 1 , . . . , n M P . Then by the
comprehension axiom, the set
A = {a G : (a, G , 0G , . . . , nG )M [G] }
239
exists, and it clearly suffices to show that this set is in M [G]. Let
= {(, p) dmn( ) P : p
( (, , 1, . . . , n ))}.
Thus M P . We claim that G = A, as desired. First suppose that a G . Then
there is a (, p) as in the definition of such that p G and G = a. Now p
(
(, , 1, . . . , n )), so by the definition of
, a = G G and (G , G , 1G , . . . , nG )
holds in M [G]. Thus a A. Conversely, suppose that a A. Then there is a (, p)
such that p G and a = G . So G G (G , G , 0G , . . . , nG ) holds in M [G]. By
Theorem 17.15, choose q G such that q
(, , 1, . . . , n )). Thus (, q) ,
so a = G G , as desired.
For pairing, we apply Theorem 14.5. Suppose that x, y M [G]. Say x = G and
y = G . Clearly op(, )G = {x, y}.
For union, we apply Theorem
14.6. We needSto take any element G of M [G] and
S
find G in M [G] such that G G . Let = dmn(). Clearly M P . Suppose
that x y G . Then we can choose (, p) such that p G and y = G ; and we
can choose (, q) such that q G and G = x. Thus dmn() and (, q) , so
(, q) . It follows that x = G G , as desired.
Next comes the power set axiom. By Theorem 14.7 it suffices to take any G M [G]
and find a M P such that P(G ) M [G] G . Let
= ({(, 1) : dmn( ) dmn()}.
Suppose that x P(G ) M [G]. So we can write x = G . In M , let
= {(, p) : dmn() and p
}.
Since dmn( ) dmn(), we have (, 1) and so G G . Hence it suffices to show that
G = G .
Suppose that y G . Thus y x G . Say y = G with (, q) for some q G.
Now G G , so by Theorem 17.15 there is a p G such that (p
)M . So we have
(, p) , and hence y = G G . This shows that G G .
Conversely, suppose that y G . Hence we can write y = G with (, p) for some
p G. By virtue of (, p) we have (p
)M , and so by Theorem 17.15 we have
y = G G . Hence we have now fully shown that G = G .
Next we take care of the replacement axioms, using Theorem 14.8. Thus suppose that
is a formula with free variables among x, y, A, w1, . . . , wn , we are given A, w1 , . . . , wn
M [G] such that
(1)
(,p)B
def
This ends our work within M . We claim that Y = G is as desired in (2). For, suppose
that y M [G] and x A with M [G] (x, y, A, w1, . . . , wn ). Write y = G and x = G
with dmn( ). So M [G] (G , G , G , 1G , . . . , nG ). Hence by Theorem 17.15 there
is a p G such that (p
(, , , 1, . . . n ))M . It follows that (, p) B. Choose
V M P such that p
(, , , 1, . . . n ). Thus M [G] (G , G , G , 1G , . . . , nG )
by Theorem 17.15. So G G = Y . Since we also have M [G] (G , G , G , 1G , . . . , nG ),
by (1) we get y = G = G Y , as desired in (2).
So now we have checked ZF Inf. By Lemma 17.6, =
G , so M [G]. So by
Theorem 14.19, infinity holds in M [G].
Thus the axioms of ZF hold. To prove that AC holds, we can take any equivalent of
it. It is convenient to take the one given in the following statement:
(*) In ZF, AC is equivalent to the statement that for any set X there exist an ordinal
and a function f with domain such that X rng(f ).
In fact, obviously AC implies the indicated statement. Now suppose that the statement
holds, and we want to find a choice function for all of the nonempty subsets of some set
A. Let be an ordinal and f a function with domain such that A rng(f ). Then for
any nonempty subset B of A we define h(B) = f (), where is the smallest ordinal less
than such that f () B. Thus (*) holds.
Now assume that x M [G]; say x = G . Then there is a function M with domain
some ordinal such that dmn() = rng(). Now define
= {op(
, ) : < } {1}.
Then M , and G = {(, ( )G ) : < }. Hence G is a function with domain . If
y x, then there is a ( , p) such that p G and y = ( )G . Hence y = G (). Thus
x rng(G ).
Theorem 17.21. Let M be a c.t.m. of ZFC, P M a quasiorder, and G P-generic
over M . Suppose that G
/ M . (See Lemma 17.2.) Then M [G] models (V = L).
L
Proof. By Lemma 17.9, M and M [G] have the same ordinals. By absoluteness then,
= LM M , so M [G] has elements not in LM [G] .
M [G]
(iv) Let e be the two-place class function whose domain is the set of all pairs (P, p)
such that P is a quasi-order and p P , with e(P, p) = int(cl(P p)). Show that e is
absolute for c.t.m. of ZFC.
(v) Let M be a c.t.m. of ZFC, and suppose that P M is a quasi-order. Show that
(RO(P ))M is a subalgebra of RO(P ).
(vi) Let M be a c.t.m. of ZFC, and suppose that P M is a quasi-order. Suppose
that X (RO(P ))M with X M . Then
X(RO(P ))M
X=
XRO(P )
X,
244
p
: A B.
Now for each a A let
F (a) = {b B : there is a q p such that q
op(
a, b) }.
To prove (i), suppose that a A. Let b = f (a). Thus (a, b) f , so by Theorem 3.15
there is an r G such that r
op(
a, b) . Let q G with q p, r. Then q shows that
b F (a).
To prove (ii), again suppose that a A. By the axiom of choice in M , there is a
function Q : F (a) P such that for any b F (a), Q(b) p and Q(b)
op(
a, b) .
(1) If b, b F (a) and b 6= b , then Q(b) Q(b ).
238
(2)
hence
but also r Q(b) p, so r
: A B,
r
x, y, z[op(x, y) op(x, z) y = z]
and hence
r
op(
a, b) op(
a, b ) b = b .
It follows from (2) and Theorem 3.13(xvi) that r
b = b . Thus if H is P-generic over M
and r H, then b = bH = b H = b by Lemma 3.6, contradiction. Thus (1) holds.
By (1), hQ(b) : b F (a)i is a one-one function onto an antichain of P . Hence
(|F (a)| < )M by the -cc.
Proposition 18.4. If M is a c.t.m. of ZFC, is a cardinal of M , and P M
satisfies -cc in M , then P preserves regular cardinals , and also preserves cofinalities
. If also is regular in M , then P preserves cardinals .
Proof. First we want to show that if is regular in M then also is regular in
M [G] (and hence is a cardinal of M [G]). Suppose that this is not the case. Hence in M [G]
there is an < and a function f : such that the range of f is cofinal in . Recall
from Theorem 3.9 that M and M [G] have the same ordinals. Thus M . By Theorem
18.3, S
let F : P() be such that f () F () and (|F ()| < )M for all < . Let
S = < F (). Then S is a subset of which is cofinal in and has size less than ,
contradiction.
The rest of the proposition follows from Proposition 18.2.
Next, for any sets I, J let
fin(I, J) = {f : f is a finite function and f I J}.
We consider this as a partial order under ; it has a largest element .
By the countable chain condition, abbreviated ccc, we mean the 1 -chain condition.
Lemma 18.5. If I is arbitrary and J is countable, then fin(I, J) has the ccc.
Proof. Suppose that F fin(I, J) is uncountable. Since for each finite F I there
are only countably many members of F with domain F , it is clear that {dmn(f ) : f F }
is an uncountable collection of finite sets. By the -system lemma, let G be an uncountable
subset of this collection which forms a -system, say with root R. Then
[
G =
{f G : f R = k};
kR J
H = {f G : f R = k}
239
1
= .
240
(ii) If G is P-generic over M and a G in M [G], then a = G for some nice name
for a subset of .
Proof. Assume the hypotheses of the proposition.
(i): Assume also that M P . For each dmn() let A P be such that
(1) p
( ) for all p A .
(2) A is an antichain of P.
(3) A is maximal with respect to (1) and (2).
Moreover, we do this definition inside M , so that hA : dmn()i M . Now let
=
({} A ).
dmn()
if < ,
M [G]
(2 )
=
+ if .
Proof. By GCH we have = . Hence the hypothesis of Theorem 18.11 holds, and
the conclusion follows using GCH in M .
We give several more specific corollaries.
Corollary 18.12. If ZFC is consistent, then so is each of the following:
(i) ZF C + 20 = 2 .
(ii) ZF C + 20 = 203 .
(iii) ZF C + 20 = 1 .
(iv) ZF C + 20 = 4 .
Corollary 18.13. If ZFC+there is an uncountable regular limit cardinal is consistent, so is ZFC+ 2 is a regular limit cardinal.
Corollary 18.14. Suppose that M is a c.t.m. of ZFC. Then there is a generic
extension M [G] such that in it, 2 = ((2 )+ )M .
Since clearly ((2 )+ ) = (2 )+ in M , this is immediate from Theorem 18.11.
Now we turn to powers of regular uncountable cardinals, where similar results hold. But
to preserve cardinals, we need a new idea, given in the following definition.
242
Let be an infinite cardinal. A quasi-order P = (P, , 1) is -closed iff for all <
and every system hp : < i of elements of P such that p p whenever < < ,
there is a q P such that q p for all < .
The importance of this notion for generic extensions comes about because of the
following theorem, which is similar to Theorem 18.3.
Theorem 18.15. Suppose that M is a c.t.m. of ZFC, P M is a quasi-order, is
a cardinal of M , P is -closed, A, B M , and |A| < . Suppose that G is P-generic over
M and f M [G] with f : A B. Then f M .
Proof. It suffices to prove this when A is an ordinal. For, suppose that this special
case has been shown, and now suppose that A is arbitrary. In M , let j be a bijection from
def
= |A|M onto A. Then f j : B, so f j M by the special case. Hence f M .
So now we assume that A = , an ordinal less than . Let K = ( B)M . Let f = G .
G , so by
We want to show that f K, for then f M . Suppose not. Thus G
/ K
Theorem 3.15 there is a p G such that
p
:
B
/ K.
(1)
p
:
B,
hence
() = x],
p
x
B[
To proceed we need some elementary facts about cardinals. For cardinals , , we define
< = sup | |.
<
Note here that the supremum is over all ordinals less than , not only cardinals.
Proposition 18.17. Let and be cardinals with 2 and infinite and regular.
Then (< )< = < .
Proof. Clearly holds. For , by the fact that = it suffices to find an injection
from
[ [
(1)
<
into
<
(2)
( + 1).
,<
Let x be a member of (1), and choose < accordingly. Then for each < there is a
x, < such that x() x, . Let x = sup< x, . Then x < by the regularity of
. We now define f (x) with domain x by setting, for any < and < x
n
(f (x))(, ) = (x())() if < x, ,
otherwise.
Then f is one-one. In fact, suppose that f (x) = f (y). Let the domain of f (x) be x
as above. Suppose that < . If x, 6= y, , say x, < y, . Then x = y y, > x, ,
and (f (x))(, x, ) = while (f (y))(, x, ) < , contradiction. Hence x, = y, . Finally,
take any < x . Then
(x())() = (f (x))(, ) = (f (y))(, ) = (y())();
it follows that x = y.
Now the direction follows.
Proposition 18.18. For any cardinals , , |[]< | < .
Proof. For each cardinal < define f : [] \{} by setting f (x) = rng(x)
for any x . Clearly f is an onto map. It follows that |[] | | | < . Hence
[
<
[]
|[] | =
<,
a cardinal
X
|[] |
<,
a cardinal
<,
a cardinal
<
= < .
244
<
We now define a new partial order for the remaining forcing results of this section. For
any sets I, J and infinite cardinal ,
Fn(I, J, ) = {f : f is a function, f I J, and |f | < }.
We consider this as a partial order under ; the greatest element is again . We claim
that this partial order has the (|J|< )+ -chain condition.
Lemma 18.19. If I, J are sets and is an infinite cardinal, then Fn(I, J, ) has the
(|J| ) -cc.
< +
{ < : |p | < },
<cf()
def
so there is an < cf() such that = { < : |p | < } has size . Let h : < i be a
one-one enumeration of . Then we can apply the first case to the set {p : < } with
replaced by to obtain the desired result.
Lemma 18.20. If I, J are sets and is a regular cardinal, then Fn(I, J, ) is -closed.
Proof. Suppose that < and hp : < i S
is a system of elements of Fn(I, J, )
such that p p whenever < < . Let q = < p . Clearly q Fn(I, J, ) and
q p for each < .
We now need another little fact about cardinal arithmetic.
Lemma 18.21. If is regular, then < = 2< .
245
||||
| max(, )|| max(,)|
2| max(,)| 2< < ;
| | =
< <
<
<
hence the lemma follows.
Lemma 18.22. Suppose that M is a c.t.m. of ZFC, I, J, M , and in M , is
a regular cardinal, 2< = and |J| . Then Fn(I, J, )M preserves cofinalities and
cardinalities.
Proof. By Lemma 18.26, the set Fn(I, J, ) is -closed, and so by Proposition 18.17,
Fn(I, J, ) preserves cofinalities and cardinalities . Now |J|< < = 2< = by
Lemma 18.27. Hence by Lemma 18.25, Fn(I, J, ) has the + -cc. By Proposition 18.4,
Fn(I, J, ) preserves cofinalities and cardinals + .
Now we can give our main theorem concerning making 2 as large as we want, for any
regular given in advance.
Theorem 18.23. Suppose that M is a c.t.m. of ZFC and in M we have cardinals
, such that < , is regular, 2< = , and = . Let P = Fn(, 2, ) ordered by
. Then P preserves cofinalities and cardinalities. Let G be P-generic over M . Then
(i) (2 = )M [G] .
(ii) If and are cardinals of M and < , then ( )M = ( )M [G] .
(iii) For any cardinal of M , if then (2 )M [G] = ( )M .
Proof. Preservation of cofinalities and cardinalities follows from Lemma 18.28. Now
we turn S
to (i). To show that (2 )M [G] , we proceed as in the proof of Theorem 18.7.
Let g = G. So g is a function mapping a subset of into 2.
(1) For each , the set {f Fn(, 2, ) : dmn(f )} is dense in P (and it is a
member of M ).
In fact, given f Fn(, 2, ), either f is already in the above set, or else
/ dmn(f ) and
then f {(, 0)} is an extension of f which is in that set. So (1) holds.
Since G intersects each set (1), it follows that g maps into 2. Let (in M ) h :
be a bijection. For each < let a = { : g(h(, )) = 1}. We claim that a 6= a
for distinct , ; this will give (2 )M [G] . The set
{f Fn(, 2, ) : there is a such that
h(, ), h(, ) dmn(f ) and f (h(, )) 6= f (h(, ))}
is dense in P (and it is in M ). In fact, let distinct and be given, and suppose that
f Fn(, 2, ). Now { : h(, ) f or h(, ) f } has size less than , so choose
not in this set. Thus h(, ), h(, )
/ f . Let h = f {(h(, ), 0), (h(, ), 1)}. Then h
extends f and is in the above set, as desired.
It follows that G contains a member of this set. Hence a 6= a . Thus we have now
shown that (2 )M [G] .
246
For the other inequality, note by Lemma 18.25 that P has the (2< )+ -cc, and by
hypothesis (2< )+ = + . By the assumption that = we also have |P | = . Hence by
Proposition 18.10 the other inequality follows. Thus we have finished the proof of (i).
For (ii), assume the hypothesis. If f M [G] and f : , then f M by Theorem
18.16. Hence (ii) follows.
Finally, for (iii), suppose that is a cardinal of M such that . By Proposition
18.10 with replaced by + we have (2 )M [G] ( )M . Now ( )M ( )M [G] =
((2 ) )M [G] = (2 )M [G] , so (iii) holds.
Corollary 18.24. Suppose that M is a c.t.m. of ZFC + GCH, and in M we have
cardinals , , both regular, with < . Let P = Fn(, 2, ) ordered by . Then P
preserves cofinalities and cardinalities. Let G be P-generic over M . Then for any infinite
cardinal ,
+ if < ,
(2 )M [G] =
if < ,
+
if .
Proof. Immediate from Theorem 18.29.
Theorem 18.29 gives quite a bit of control over what can happen to powers 2 for regular.
We can apply this theorem to obtain a considerable generalization of it.
Theorem 18.25. Suppose that n and M is a c.t.m. of ZFC. Also assume the
following:
(i) 1 < < n are regular cardinals in M .
(ii) 1 n are cardinals in M .
(iii) (cf(i ) > i )M for each i = 1, . . . , n.
(iv) (2<i = i )M for each i = 1, . . . , n.
(v) (i i )M = i for each i = 1, . . . , n
Then there is a c.t.m. N M with the same cofinalities and cardinals such that:
(vi) (2i = i )N for each i = 1, . . . , n.
(vii) (2 )N = (n )M for all > n .
Proof. The statement vacuously holds for n = 0. Suppose that it holds for n 1, and
the hypothesis holds for n, where n is a positive integer. Let Pn = Fn(n , 2, n). Then by
Lemma 18.25, Pn has the (2<n )+ -cc, i.e., by (iv) it has the +
n -cc. By Lemma 18.26 it is
n -closed. So by Propositions 18.4 and 18.17, Pn preserves all cofinalities and cardinalities.
Let G be Pn -generic over M . By Theorem 18.29, (2n = n )M [G] , (2 )M [G] = (n )M for
all > n , and also conditions (i)-(v) hold for M [G] for i = 1, . . . , n 1. Hence by the
inductive hypothesis, there is a c.t.m. N with M [G] N such that
(1) (2i = i )N for each i = 1, . . . , n 1.
(2) (2 )N = (n1 )M [G] for all > n1 .
In particular,
n
(2n )N = (n1
)M [G] (nn )M [G] = ((2n )n )M [G] = (2n )M [G]
249
The following rather trivial fact will be technically useful in what follows.
Lemma 19.1. Df(A, C, n) for any sets A, C and any natural number n.
Proof. Clearly Df (k, A, C, 0), and for n > 0, = Diag (A, n, 0, 0), so the result
follows.
Let L be the first order language for set theory augmented by a unary relation symbol R.
Given two sets A, C, a formula (v0 , . . . , vn1 ) and elements a0 , . . . , an1 of A, we denote
by A,C (a0 , . . . , an1 ) the statment that holds with quantifiers relativized to A and a
subformula Rvi interpreted as saying that ai C. It is easy to give a recursive definition
250
of A,C (a0 , . . . , an1 ); we can conceive it as associating with a formula another formula
A,C (a0 , . . . , an1 ).
Lemma 19.2. Let (v0 , . . . , vn1 ) be a formula of L , and let A, C be any sets. Then
{s n A : A,C (s(0), . . . , s(n 1))} Df(A, C, n).
Proof. Induction on :
If is Rvi , then
{s n A : A,C (s(0), . . . , s(n 1))} = {x n A : s(i) C}
= Rel(A, C, n, i)
Df (0, A, C, n)
Df(A, C, n).
If is vi vj , then
{s n A : A,C (s(0), . . . , s(n 1))} = {x n A : s(i) s(j)}
= Diag (A, n, i, j)
Df (0, A, C, n)
Df(A, C, n).
If is vi = vj , then
{s n A : A,C (s(0), . . . , s(n 1))} = {x n A : s(i) = s(j)}
= Diag= (A, n, i, j)
Df (0, A, C, n)
Df(A, C, n).
Suppose that is , where
{s n A : A,C (s(0), . . . , s(n 1))} Df(A, C, n).
Then
{s n A : A,C (s(0), . . . , s(n 1))} = n A\{s n A : A,C (s(0), . . . , s(n 1))}
Df(A, C, n).
Suppose that is , where
{s n A : A,C (s(0), . . . , s(n 1))} Df(A, C, n)
and {s n A : A,C (s(0), . . . , s(n 1))} Df(A, C, n).
251
Then
{s n A : A,C (s(0), . . . , s(n 1))}
= {s n A : A,C (s(0), . . . , s(n 1))} {s n A : A,C (s(0), . . . , s(n 1))}
Df(A, C, n).
Finally, suppose that is vk . Then there is a formula with free variables among
v0 , . . . , vn such that is logically equivalent to vn . We assume inductively that
{s n+1 A : A,C (s(0), . . . , s(n))} Df(A, C, n + 1).
Then
{s n A : A,C (s(0), . . . , s(n 1))}
= {s n A : (vn )A,C (s(0), . . . , s(n 1))} (as is easily seen)
= Proj(A, {s n+1 A : A,C (s(0), . . . , s(n))}, n)
Df(A, C, n)
Now we prove a sequence of lemmas leading up to the fact that Df is absolute for transitive
models of ZF. To do this, we have to extend the definitions of our functions above so that
they are defined for all sets, since absoluteness was developed only for such functions. We
do this by just letting the values be 0 for arguments not in the domain of the original
functions.
Lemma 19.3. The function Rel is absolute for transitive models of Zf.
Proof. x = Rel(A, C, n, i) iff (n
/ and x = 0), or (n and i
/ n) or i < n
and the following condition holds:
s x[s n A and s(i) C] and s n A[s(i) C s x].
Lemma 19.4. The function Proj is absolute for transitive models of ZF.
Proof. x = Proj(A, R, n) iff n
/ and x = 0, or n and the following condition
holds:
s x[s n A t R(t n = s)]
s n A[t R(t n = s) s x];
Lemma 19.5. The function Diag is absolute for transitive models of ZF.
Proof. x = Diag (A, n, i, j) iff not(n and i, j < n) and x = 0, or n , i, j < n,
and the following condition holds:
s x[s n A s(i) s(j)] s n A[s(i) s(j) s x]
252
Lemma 19.6. The function Diag= is absolute for transitive models of ZF.
Proof. Similar to that of Lemma 19.19.
Lemma 19.7. For any sets A and C and any natural number n let
T1 (A, C, n) ={Rel(A, C, n, i) : i < n} {Diag (A, n, i, j) : i, j < n}
{Diag= (A, n, i, j) : i, j < n}.
Then T1 is absolute for transitive models of ZF.
Proof. x = T1 (A, C, n) iff not(n ) and x = 0, or n and the following condition
holds:
y xi, j < n[y = Rel(A, C, n, i) y = Diag (A, n, i, j) y = Diag= (A, n, i, j)]
i < ny {Rel(A, C, n, i)}[y x]
i, j < ny {Diag (A, n, i, j)}[y x]
i, j < ny {Diag= (A, n, i, j)}[y x].
Lemma 19.8. For any sets A, L and any natural number n let
T2 (A, n, L) = {n A\R : R L}.
Then T2 is absolute for transitive models of ZF.
Proof. x = T2 (A, n, L) iff not(n ) and x = 0, or n and the following condition
holds:
y xR L[y = n A\R] z[R L(z = n A\R) z x].
Here we need a little argument. Let M be a transitive model of ZF. Suppose that A, n, L
M , n , z is a set, R L, and z = n A\R; we would like to show that z M . There is a
w M such that w = (n A\R)M , since M is a model of ZF. By absoluteness, z = w M ,
as desired.
Lemma 19.9. For any set X, let T3 (X) = {R S : R, S X}. Then T3 is absolute
for transitive models of ZF.
Proof.
y = T3 (X) z yR, S X[z = R S] R, S X[R S y].
Lemma 19.10. For any sets A, X and any natural number n, let T4 (A, n, X) be the
set {Proj(A, R, n) : R X}. Then T4 is absolute for transitive models of ZF.
Proof. x = T4 (A, n, X) iff not(n ) and x = 0, or n and the following condition
holds:
R X[Proj(A, R, n) x] z[R X(z = Proj(A, R, n)) z x].
253
iff
T1 (A, C, n)
if k = 0,
f (k , A, C, n) T2 (A, n, f (k , A, C, n))
F ((k, A, C, n), f ) =
T3 (A, n, f (k , A, C, n))
T4 (A, n, f (k , A, C, n + 1))
if k = k + 1.
Then F is absolute for transitive models of ZF.
Lemma 19.13. Df is absolute for transitive models of ZF.
Proof. It suffices to check that Df is obtained from the function F of Lemma 19.12
by the recursion theorem applied to B and S. Let G be the function so obtained. Then
for any set A and any natural number n,
G(0, A, C, n) = F ((0, A, C, n), G pred(B, (0, A, C, n), S))
= F ((0, A, C, n), 0) = T1 (A, C, n)
= {Rel(A, C, n, i) : i < n} {Diag (A, n, i, j) : i, j < n}
{Diag= (A, n, i, j) : i, j < n},
as desired. Now take any k . Then, with f = G pred(B, (k + 1, A, C, n), S)),
G(k + 1, A, C, n) = F ((k + 1, A, C, n), G pred(B, (k + 1, A, C, n), S))
= F ((k + 1, A, C, n), f )
= f (k, A, C, , n) T2 (A, n, f (k, A, C, n))
T3 (A, n, f (k, A, C, n)) T4 (A, n, f (k, A, C, n + 1))
= G(k, A, C, n) {n A\R : R G(k, A, C, n)}
{R S : R, S G(k, A, C, n)}
{Proj(A, R, n) : R G(k, A, C, n + 1)}
Lemma 19.14. Df is absolute for all transitive models of ZF.
Proof.
x = Df(A, C, n) iff
y xk [y Df (k, A, C, n)]
k y Df (k, A, C, n)[y x].
254
The following is the definable power set operation: For any sets A, C,
D(A, C) = {X A : n s n AR Df(A, C, n + 1)[X = {x A : s hxi R}]}.
Here s hxi is the member t of
n+1
X Dn s n AR Df(A, C, n + 1)
[x X(x A s hxi R)
x A(s hxi R x X)]
X[n s n AR Df(A, C, n + 1)
[x X(x A s hxi R)
x A s n A(s hxi R x X)] X D]
Here there is an unbounded quantifier X, and one must check that if the statement holds
in a transitive class model of ZF, then it really holds. This is clear.
Lemma 19.16. Let (v0 , . . . , vn1 , x) be a formula of L with the indicated free
variables. Then
ACv0 , . . . , vn1 A[{x A : A,C (v0 , . . . , vn1 , x)} D(A, C)].
Proof. Let v0 , . . . , vn1 A and R = {s
Lemma 2, R Df(A, n + 1). Clearly
n+1
Hence
def
i<n
Hence
X = {x A : s hxi R} D(A, C),
as desired.
For (iv), note that
D(A, C) =
n
sn A
Hence
|D(A, C)|
On the other hand, {a} D(A, C) for each a A by (iii), so |A| |D(A, C)|. So (iv)
holds.
Finally, if x CA, then AC = {x A : x C} D(A, C) by Lemma 19.119.
Now we define the constructible hierarchy. Recall that for any set B, trcl(B) is the transitive closure of B, i.e., it is the smallest transitive
S
Sset
S containing B. It can be defined
recursively using the intuition trcl(B) = B B
B . . ..
LBC
= trcl(B);
0
BC
LBC
+1 = D(L , C);
[
LBC
for limit;
LBC
=
<
{B}
L (B) = L
;
L [C] = LC
;
[
LBC =
LBC
;
On
L=
L
;
On
L(B) =
L (B);
On
L[C] =
L [C].
On
(ii) LBC
LBC
for all < .
256
+1
BC
also, so (y) < . This proves .
BC
def
BC
For , suppose that = BC (y) < . Then y LBC
+1 L , as desired.
BC
BC
(ii): Assume that y x. Let BC (x) = . Then x LBC
+1 = D(L , C) P(L ),
BC
(y) < BC (x).
and so y LBC
. Hence
(iv): We prove this by induction on . It is obvious for = 0, and the inductive step
when is limit is clear. So, suppose that we know that LC
On = , and = + 1. If
C
C
C
C
L On, then D(L , C) P(L ), so L On = ; hence . This
C
C
shows that LC
On . If < , then L On L On. Thus it remains only
to show that LC
. Now there is a natural 0 formula (x) which expresses that x is
an ordinal:
y xz y(z x) y xz yw z(w y);
this just says that x is transitive and every member of x is transitive. Since LC
is
C
Hence D(LC
, C) = L , as desired.
257
C
(iii): By (iv) we have LC
+1 On = + 1, and hence + 1 L+1 , so that
C
LC and C () . By (iv) again, we cannot have LC
() = .
, so
BC
BC
BC
L
BC
BC
(v): L = {x L : (x = x)
} D(L , C) = L+1 .
(vi): An easy induction on .
<
BC
(vii): Clearly [LBC
D(LBC
]
, C) = L+1 .
(viii): By induction on n. It is clear for n = 0. Assume that Ln = Vn . Thus Ln is
finite. Hence by (vii) and (vi),
, C) = L+1 L
In order to show that LBC is a model of ZF, we need another fact about absoluteness
which was covered in the first semester.
Lemma 19.21. Suppose that M and N are classes with M N . Let 0 , . . . , n be a
list of formulas such that if i n and is a subformula of i , then there is a j n such
that j is . Then the following conditions are equivalent:
(i) Each i is absolute for M, N .
(ii) If i n and i has the form xj (x, y1 , . . . , yt ) with x, y1 , . . . , yt exactly all the
free variables of j , then
N
y1 , . . . , yt M [x N N
j (x, y1 , . . . , yt ) x M j (x, y1 , . . . , yt )].
Note the seemingly minor respects in which (ii) differs from the definition of absoluteness:
the implication goes only one direction, and on the right side of the implication we relativize
j to N rather than M .
Proof. (i)(ii): Assume (i) and the hypothesis of (ii). Suppose that y1 , . . . , yt M
M
and x N N
j (x, y1 , . . . , yt ). Thus by absoluteness x M j (x, y1 , . . . , yt ); choose
N
x M such that M
j (x, y1 , . . . , yt ). Hence by absoluteness again, j (x, y1 , . . . , yt )). Hence
x M N
j (x, y1 , . . . , yt ), as desired.
(ii)(i): Assume (ii). We prove that i is absolute for M, N by induction on the
length of i , where we consider formulas to be built up using , , . This is clear if i is
atomic, and it easily follows inductively if i has the form j or j k . Now suppose
that i is xj (x, y1 , . . . , yt ), and y1 , . . . , yt M . then
M
M
i (y1 , . . . , yt ) x M j (x, y1 , . . . , yt ) (definition of relativization)
x M N
j (x, y1 , . . . , yt ) (induction hypothesis)
x N N
j (x, y1 , . . . , yt ) (by (ii)
N
i (y1 , . . . , yt )
(definition of relativization)
258
Theorem 19.22. Suppose that Z() is a set for every ordinal , and the following
conditions hold:
(i) If < , then Z() Z().
S
(ii) If is a limit ordinal, then Z() = < Z().
S
Let Z = On Z(). Then for any formulas 0 , . . . , n1 ,
> [0 , . . . , n1 are absolute for Z(), Z].
Proof. Assume the hypothesis, and let an ordinal be given. We are going to apply
Lemma 19.21 with N = Z, and we need to find an appropriate > so that we can take
M = Z() in 19.21.
We may assume that 0 , . . . , n1 is subformula-closed; i.e., if i < n, then every
subformula of i is in the list. Let A be the set of all i < n such that i begins with an
existential quantifier. Suppose that i A and i is the formula xj (x, y1 , . . . , yt ), where
x, y1 , . . . , yt are exactly all the free variables of j . We now define a class function Gi as
follows. For any sets y1 , . . . , yt ,
Z
Gi (y1 , . . . , yt ) = the least such that x Z()j (x, y1 , . . . , yt ) if there is such,
0
otherwise.
Then for each ordinal we define
Fi () = sup{Gi (y1 , . . . , yt ) : y1 , . . . , yt Z()};
note that this supremum exists by the replacement axiom.
Now we define a sequence 0 , . . . , p , . . . of ordinals by induction on n . Let
0 = + 1. Having defined p , let
p+1 = max(p+1 , sup{Fi () : i A, p } + 1).
Finally, let = supp p . Clearly < and is a limit ordinal.
(1) If i A, y1 , . . . , yt Z(), and x ZZ
i (x, y1 , . . . , yt ), then there is an x Z() such
Z
that i (x, y1 , . . . , yt ).
In fact, choose p such that y1 , . . . , yt Z(p ). Then Gi (y1 , . . . , yt ) F (p ) < p+1 . Hence
an x as in (1) exists, with x Z(p+1 ).
Now the theorem follows from Lemma 19.21.
Theorem 19.23. LBC is a model of ZF.
Proof. We take the axioms one by one, using the results of Chapter 1. Extensionality
holds since LBC is transitive (by Lemma 19.19); see Theorem 1.6.
According to Theorem 1.7 to verify that the comprehension axioms hold in LBC it
suffices to take any formula with free variables among x, z, w1 , . . . , wn , assume that
z, w1 , . . . , wn LBC , and prove that
(1)
{x z : L
BC
(x, z, w1 , . . . , wn )} LBC
259
BC
(x, z, w1 , . . . , wn )}
LBC
: (x z L
BC
(x, z, w1 , . . . , wn ))}
LBC
= {x LBC
: (x z
D(LBC
, C) (by Lemma 19.16)
= LBC
+1 ,
and (1) holds.
Pairing: See Theorem 1.8. Suppose that x, y LBC . Choose so that x, y LBC
.
Then by Lemma 19.16,
BC
BC
{x, y} = {z LBC
: (z = x z = y)L } D(LBC
, C) = L+1 ,
as desired.
Union: See Theorem 1.9. Suppose that x LBC . Choose so that x LBC
. Then
[
x = {z : u(z u u x)}
BC
= {z LBC
: (u(z u u x))L }
BC
D(LBC
, C) = L+1 ,
(since LBC
is transitive)
as desired.
Power set: See Theorem 1.10. Suppose that x LBC . For each z L such that z x
BC
choose z such that z LBC
z . Let = supzx Lz . Then, using Theorem 19(v),
BC
,
LBC
P(x) LBC = {z LBC : z x} LBC
+1 L
as desired.
Replacement: See Theorem 1.11. Suppose that is a formula with free variables
among x, y, A, w1, . . . , wn , we are given A, w1 , . . . , wn L, and
(1)
For each x A let zx be such that zx LBC and L (x, zx , A, w1 , . . . , wn ); we are using
the replacement axiom here. Then for each x A choose x so that zx LBC
x . Let
BC
= supxA x . Suppose now that y LBC and L (x, y, A, w1, . . . , wn ) for some x A.
BC
Then by (1), y = zx , and hence y LBC
x L . This proves that
{y LBC : x AL
BC
BC
Since LBC
LBC
, this is as desired.
+1 L
260
.
Now
rank
(x) = (rankBC )M (x) by absoluteness, so
On
On
( < )
or ( = and < )
or ( = and = and < ).
261
(, + 1)
(0, + 1)
( + 1) =
( + 1, )
(, 0)
() = -least(, )
Prove:
(1)
(2)
(3)
(4)
if () = (, ) and < ,
if () = (, ) and = ,
if () = (, ) and + 1 < ,
if () = (, ) and + 1 = ;
such that < [() (, )] if is limit.
E19.3. Suppose that M is a transitive class model of ZFC, and every set of ordinals is in M .
Show that M = V . Hint: take any set X. Let = |trcl({X})|, and let f : trcl({X})
be a bijection. Define E iff , < and f () f (). Use exercise E19.2 to show that
E M . Take the Mostowski collapse of (, E) in M , and infer that X M .
E19.4. Show that if X 1 then CH holds in L(X). Hint: show that if A P() in
L(X), then there are , < 1 such that X L (X ).
E19.5. Show that if X 1 then GCH holds in L(X).
262
Y
X
e(p) +
f ([[ = ]]) = f
(e(q) [[ = ]])
(,p)
(,q)
!
Y
X
e(q) +
(e(p) [[ = ]])
(,q)
Y
(,p)
(,q)
f (e(q)) +
Y
(,p)
(,q)
e(f (p)) +
f (e(p)) +
(,p)
e(f (q)) +
Y
(,p)f ( )
Y
(,q)f ()
e(q) +
X
(,q)
X
(,p)
(,q)
(,p)
X
X
e(p) +
(e(q) [[ = ]])
(,q)f ()
(e(p) [[ = ]])
(,p)f ( )
= [[f () = f ( )]].
264
Now we prove the lemma itself by induction on , thus officially outside our usual mathematical language. The atomic equality case has already been treated. Here is the remaining
argument, with obvious inductive assumptions:
X
f ([[ ]]) = f
(e(p) [[ = ]])
=
(,p)
(,p)
(,p)
(,q)f ( )
= [[f () = f ( )]];
f ([[(0 , . . . , n1 )]]) = f ([[(0 , . . . , n1 )]])
= f ([[(0 , . . . , n1 )]])
= [[(f (0 ), . . . , f (n1 ))]]
= [[(f (0 ), . . . , f (n1 ))]];
f ([[(0 , . . . , n1 )(0 , . . . , n1 )]])
V P
f ([[(0 , . . . , n1 , )]])
V Q
V Q
Proof.
p
(0 , . . . , n1 ) iff
iff
iff
iff
f (p)
(
x0 , . . . , x
n1 ).
q
(
x0 , . . . , x
n1 ). Let f be an automorphism such that f (p) and q are compatible. By
Lemma 20.7, f (p)
(
x0 , . . . , x
n1 ). If r f (p), q we then have e(r) [[(
x0 , . . . , x
n1 )]]
[[(
x0 , . . . , x
n1 )]], contradiction.
(ii): Suppose that 1 6
(
x0 , . . . , x
n1 ). By (i), p 6
(
x0 , . . . , x
n1 ) for all p. Hence
by Lemma 3.13(ix), 1
(
x0 , . . . , x
n1 ).
Lemma 20.11. Suppose that I and J are sets, is an infinite cardinal, and =
hi : i Ii is a system of permutations of J. For each f Fn(I, J, ) define (f ) to be
the function given as follows:
dmn( (f )) = dmn(f );
( (f ))(i) = i (f (i)) for each i dmn(f ).
Then is an automorphism of Fn(I, J, ).
Proof. Clearly (f ) Fn(I, J, ) for any f Fn(I, J, ). Now is one-one:
suppose that (f ) = (g). Then
dmn(f ) = dmn( (f )) = dmn( (g)) = dmn(g),
and for any i dmn(f ),
f (i) = i1 (i (f (i))) = i1 (( (f ))(i) = i1 (( (g))(i) = i1 (i (g(i))) = g(i).
So f = g.
Next, maps onto Fn(I, J, ). For, let g Fn(I, J, ). Let f (i) = i1 (g(i)) for all
i dmn(g), with dmn(f ) = dmn(g). Clearly (f ) = g.
Now suppose that f, g Fn(I, J, ) and f g. Then for i dmn(f ) we have
( (f ))(i) = i (f (i)) = i (g(i)) = ( (g))(i). So (f ) (g). The other implication is
proved similarly.
Lemma 20.12. Fn(I, J, ) is almost homogeneous.
Proof. Suppose that f, g Fn(I, J, ). For each i I let i be the following
permutation of J: if i
/ dmn(f ) dmn(g), then i is the identity on J. If i dmn(f )
dmn(g), then i is the transposition (f (i), g(i)) (which may be the identity). Then we
claim that (f ) and g are compatible. For, suppose that i dmn( (f )) dmn(g) =
dmn(f ) dmn(g). Then ( (f ))(i) = i (f (i)) = g(i), as desired.
We now need a basic result about product forcing; this result will be useful also when
discussing iterated forcing later.
Let P and Q be quasi-orders. Their product P Q is defined to be (P Q, , 1 ),
where
(p1 , q1 ) (p2 , q2 ) iff p1 p2 and q1 q2 ;
1 = (1P , 1Q ).
267
p is dense in Q.
Define
E = {(p , q) : p p and p
q }.
Thus E is a subset of P Q; we claim that it is dense below (p, 1). To prove this, take
any (p , q ) (p, 1). Now p p, so by (1),
Q(y
and y x),
p
x Qy
and hence
and y q ).
p
y Q(y
Hence by Proposition 3.17(i), there is a p p and a q Q such that
p
q and q q .
Then by Lemma 3.23, q q . Hence p p p and p
q , so (p , q ) E. Also
(p , q ) (p , q ). So this proves our claim that E is dense below (p, 1).
Since p G0 , we have (p, 1) G0 G1 , so by the genericity of G0 G1 we get
(G0 G1 ) E 6= 0; say that (r, s) (G0 G1 ) E. Thus r G0 , s G1 , r p, and
r
s . Hence s G = D, so D G1 6= . So we have proved (ii).
(ii)(i): Assume (ii). First we check that G0 G1 is a filter. Suppose that (p, q)
G0 G1 and (p, q) (p , q ). Then p p , hence p G0 ; similarly, q G1 . So
(p , q ) G0 G1 . Now suppose that (p, q), (p , q ) G0 G1 . Thus p, p G0 , so there
is an r G0 such that r p, p . Similarly we get an s G1 such that s q, q . So
(r, s) G0 G1 and (r, s) (p, q), (p , q ). So G0 G1 is a filter.
268
iff
1Q
Q ((
Q )L(P()) .
In particular we have
R[
() = s],
q
< s
and so by Proposition 3.17, for each < we can choose r G and s R such that
r q and r
()
= s . Then
(4) If < < , then s s .
In fact, choose t r , r ; this is possible since r , r G. Then
t
(
) ()
()
= s (
) = s ,
hence t
s s , hence by Lemma 3.23, s s , so that (4) holds.
Thus in M we have a decreasing sequence s = hs : < i, and so there is a t R
such that t s for all < . It follows that r
t ()
for all < , and hence
t p for all < , as desired for (3).
(5) (P())M [G] = (P())M [G][H] .
For, if f 2 and f M [G][H], then by (3) and Theorem 4.16 we have f M [G]. So (5)
holds.
(6)
1P
P,M ((
P )L(P())
iff
1P
P,M [H] ((
P )L(P()) .
1Q
Q,M ((
Q )L(P())
iff
1Q
Q,M [H] ((
Q )L(P()) .
Now in M [H] we have |I| = |J|, as noted above. Hence in M [H], the partial orders P
and Q are isomomorphic. Hence the conclusion of the lemma follows from (6), (7), and
Corollary 20.7.
Theorem 20.15. If ZF is consistent, then so is ZF + AC.
Proof. Assume that ZF is consistent. By the theory of constructibility we know that
also ZFC is consistent, so we take a c.t.m. M of ZFC. Let P = Fin(1 , 2), and let G be
P-generic over M , and let N = L(P())M [G] . By Theorem 5.23, N is a model of ZF. We
claim that AC fails in N , as desired.
270
The remaining exercises are concerned with generalizations of the main theorem, Theorem
20.15, of this chapter. Proofs of the consistency of CH date from the 1930slong before
forcing, and shortly before constructibility. The proofs were not relative to ZFC; one had
to admit many Urelementeelements with no members. But the basic idea of those
proofs can be adapted to ZFC, and Theorem 20.15 is of this sort. We give some exercises
which form an introduction to the rather extensive work that has been done in this area.
271
Most of this work is to show that such-and-such a statement, while a consequence of ZFC,
cannot be proved in ZF, but does not imply AC either.
E20.9. We expand the language of set theory by adding an individual constant . An
Urelement is an object a such that a 6= but a does not have any elements. (Plural is
Urelemente.) A set is an object x which is either or has an element. Both of these are
just definitions, formally like this:
U r(a) a 6= x(x
/ a);
Set(x) x = y(y x).
Now we let ZFU be the following set of axioms in this language:
All the axioms of ZF except extensionality and foundation.
x[(x )].
x, y[Set(x) Set(y) z(z x z y) x = y].
x[Set(x) x 6= y xz(z x z
/ y)].
We also reformulate the axiom of choice for ZFU; it is the following statement:
A {Set(A ) x A [Set(x) x 6= ]
x A y A [x 6= y z[(z x z y)]]
Bx A !y(y x y B)}.
We let ZF CU be all of these axioms.
One can adapt most of elementary set theory to use these axioms; browsing through a
rigorous introduction to set theory (like the first few chapter of my m6730 notes) should
convince one of this. In this exercise, give a new definition of ordinal.
Also, show that if we add the axiom a[U r(a)] we get a theory equivalent to ZF.
E20.10. Let be an infinite cardinal. Let be an infinite ordinal such that V Since
|V+1 | = 2|V | , we also have |V+1 \V |. Let U be a subset of V+1 \V of size . Let
Z be any element of U , fixed for what follows. We define hW : Ordi by recursion:
W0 = U ;
W+1 = W (P(W )\{});
[
W =
W for limit;
<
W =
W .
Ord
^
^
v0 . . . vm
U r(vi )
[(vi = vj )] .
im
0i<jm
274
p P q and p
P .
275
Proposition 21.5. Again assume the above notation. Suppose that G is P-generic
0
over M , is a P-name, and G G
. then there is a (r, ) P such that r G and
r
= 0.
Proof. Choose q G such that q
0 . Then by Lemma 3.13(v), choose
r q with r G such that for some (, s) 0 we have r s and r
= . Since
r q, we have r
0 , so by the external definition of forcing, r
0 . Clearly
(r, ) P .
0
Next, suppose that G is P-generic over M and H G
. Then we define
that (s, ) (p, 1), (q, ), (r, 1). Since (s, ) (q, ), we have s
. Also, s r, so
s
= . Hence s
. Hence (s, ) (p, ), and so (p, ) K, as desired.
The last part of the theorem follows from Theorem 21.6.
Lemma 21.8. Suppose that M is a c.t.m. of ZFC, is a regular cardinal of M , P is
a quasi-order in M , and P satisifies -cc in M . Suppose that is a P-name, and
1
|| <
.
(1) {p : E} is an antichain in P.
iff
iff
iff
iff
r() if < ,
q() if supp(q)\supp(p),
1
otherwise.
Now it suffices to prove the following statement:
(*) For all we have s P , s p , and s q .
We prove (*) by induction on . If , then s = r P by (K4), and
s p , q since r p , q , by (vi).
Now assume inductively that < . First suppose that is a successor ordinal
+ 1. Then s P by the inductive hypothesis. Now we consider several cases.
282
iff
Fix q A and fix a generic G for P over M such that q G; we want to show that
G = (q )G .
First suppose that x G . Choose (, r) such that r G and x = G . By the
definition of , there is a q A such that r q , r
q , and dmn(q ). Since
284
is dense below p, and hence there is an r G which is also in (2). By Proposition 3.3(iii),
if G A = , then there is an element q G incompatible with each member of A; in this
case, choose s G with s r, q. Then s is in (2) and s is incompatible with each element
of A, contradicting the maximality of A. So G A 6= .
Say q G A. Choose r G such that r p, q. Since q
(, 1, . . . , n ), also
r
(, 1 , . . . , n ), and hence (G , (1 )G , . . . , (n )G ) holds in M [G], as desired.
Now we give a fact about Martins axiom which is used below; it is an exercise in Chapter
9.
Lemma 21.16. MA() is equivalent to MA() restricted to ccc quasi-orders of cardinality .
Proof. We assume the indicated special form of MA(), and assume given a ccc
quasi-order P and a family D of at most dense sets in P; we want to find a filter on P
intersecting each member of D. We introduce some operations on P . For each D D
define fD : P P by setting, for each p P , fD (p) to be some element of D which is
p. Also we define g : P P P by setting, for all p, q P ,
p if p and q are incompatible,
g(p, q) =
r with r p, q if there is such an r.
Here, as in the definition of fD , we are implicitly using the axiom of choice; for g, we
choose any r of the indicated form.
285
We begin by defining the coding which will be used. Some elementary notation: For
any ordered pair (a, b), 1st (a, b) = a and 2nd (a, b) = b.
Claim. There is a function f in M with the following properties:
(1) f : .
(2) For all , , < there is an > such that f () = (, ).
(3) 1st (f ()) for all < .
Proof of Claim. Let g : be a bijection. For each < let g() = (, , ),
and set
(, ) if ,
f () =
(0, 0) otherwise.
So (1) and (3) obviously hold. For (2), suppose that , , < . Now g 1 [{(, , ) :
< }] has size , so there is an g 1 [{(, , ) : < }] such that , < . Say
g() = (, , ). Then < , so f () = (, ) and < , as desired.
Another preliminary
is a cardinality bound. Note that if < , then = , since, using
P
regularity and < 2 = , we have
[
X
2 = .
= | | =
<
<
(4) If Q is a ccc quasi-order in M of size less than , then there are at most pairs (, )
such that < and is a nice Q-name for a subset of ( ).
To prove (4), recall that a nice Q-name for a subset of ( )is a set of the form
[
{{
a} Aa : a }
where for each a , Aa is an antichain in Q. Now by ccc the number of antichains
in Q is at most |Q| . So for a fixed < the number sets of the indicated form is at
most = . Hence (4) holds.
For brevity, we let pord(, W ) abbreviate the statement that W is the order relation of a
ccc quasi-order on the set , with largest element 0.
Now we are going to define by recursion functions P, , , and with domain . Let I
be the collection of all finite subsets of .
Let P0 be the trivial partial order ({0}, 0, 0).
Now suppose that P has been defined, so that it is a ccc quasi-order in M . We now
define , , , and P+1 . By (4), the set of all pairs (, ) such that < and
EXERCISES
E21.1. Let f be a complete embedding of P into Q. Show that there is an isomorphism
PRO(P)
PRO(Q)
g of RO(P) into RO(Q) such that for any X RO(P), g(
X) = xX g(x).
Furthermore, show that the following diagram commutes:
f
P
eP
eQ
RO(P )
RO(Q)
and [ :
implies that = ].
1 = {(op(op(
p,
p ,
H ), op(
H )), q) : p, p fin(, ),
E21.16. (Continuing E21.15) Next, we let 2 = {(op(0, 0), 1P )}. Then for any generic
G, (2 )G = (0, 0). Finally, let = op(op(0 , 1 ), 2 ). This finishes the definition of .
Prove:
(vi) 1P
P is
1 cc.
E21.17. (Continuing E21.16) Now P+1 is determined. The limit stages are clear. So the
construction is finished, and P is ccc.
Let G be P -generic over M . Prove
(vii) In M [G], if F and |F | < 2 , then there is a g such that f < g for all
f F.
E21.18. (Continuing E21.17) Show that if ZFC is consistent, then there is a c.t.m. of ZFC
with the following properties:
(i) 2 = 2 .
(ii) 21 = 3 .
(iii) Every almost unbounded set of functions from to has size 2 .
(iv) MA(1 ) fails.
291
292
Proof. Remember that subsets of and their characteristic functions are both considered as reals. Implicitly, one is a Cohen real iff the other is, by denition. So we will
def
show that the function l = hg(h(, m)) : m i is a Cohen real.
Fix < , and let J = { < : h1 () has the form (, m) for some m }. Let
k(m) = h(, m) for all m . Then k is a bijection from onto J. By 22.1, G Fin(J, 2)
is Fin(J, 2)-generic over M . Dene k : Fin(J, 2) Fin(, 2) by setting k (p) = p k for
any p Fin(J, 2). So k is an isomorphism from Fin(J, 2) onto Fin(, 2). Clearly then
k [G Fin(J, 2)] is Fin(, 2)-generic over M . So the proof is completed by checking that
S
k [G Fin(J, 2)] = l. Take any m . Then
[
(m, )
k [G Fin(J, 2)] iff there is a p k [G Fin(J, 2)]
such that (m, ) p
iff
there is a q G Fin(J, 2)
such that (m, ) k (q)
iff
iff
there is a q G Fin(J, 2)
such that (m, ) q k
g(k(m)) =
iff
iff
g(h(, m)) =
(m, ) l
Theorem
S 22.3. Suppose that M is a c.t.m. of ZFC and G is Fin(, 2)-generic over
M . Let g = G (so that g is a Cohen real). Then for any f which is in M , the set
{m : f (m) < g(m)} is infinite.
Proof. For each n let in M
Dn = {h Fin(, 2) : there is an m > n such that m dmn(h) and f (m) < h(m)}.
Clearly Dn is dense. Hence the desired result follows.
Cohen reals are widely used in set theory. A couple of interesting facts are: adding a
Cohen real adds a Suslin tree. If one starts with a model of CH and adds Cohen reals,
then in the extension there is a maximal chain of elements dierent from 1 in the Boolean
algebra P()/n.
Roitman, J. [79] Adding a random or a Cohen real. . . Fund. Math. 103 (1979), 4760.
Random forcing
The general idea of random forcing is to take a -algebra of measurable sets with respect
to some measure, divide by the ideal of sets of measure zero, obtaining a complete Boolean
algebra, and use it as the forcing algebra; the partially ordered set of nonzero elements is
the forcing partial order.
We give more details in the simplest case: we are going to dene a measure on certain
subsets of 2 and form a partial order in a natural way, without giving complete details
293
on how this ts into the above framework. We refer to Halmos, Measure Theory, for
standard facts about measures.
Let nseq be the set of all nite sequences of 0s and 1s; thus
nseq =
2.
(iv): assume the hypothesis, and suppose that f F . Take any h Uf . Then there
is a g G such that h Ug . Then g = h m = f , so f G . Thus F G .
(v): immediate from (iv).
(vi): Obviously F is closed under unions. For complementation, write a F as in (iii).
Then the element
[
Uf
f m 2\F
(1)
(2)
(3)
(4)
(5)
0
/ G.
1 G.
If a G and a b, then b G.
If a, b G, then a b G.
For any a A, either a A or a A.
A subset X of a Boolean algebra A satises the finite intersection property (p) iff
for every nite F X.
F 6= 0
/
G
for
all
a
F
.
Thus
a
G
for
all
a
F
,
so
aF a
P
Q
G. Then 0 = ( F ) aF a G, contradiction.
The following theorem actually follows from th topological duality theory for Boolean
algebras. To avoid going into this theory, we give a direct proof.
def S
Lemma 22.7. If hai : i Ii is a system of S
elements of F and b = iI ai is in F,
then there is a finite subset F of I such that b = iF ai .
Proof. Suppose this is not true. Then the set
(1)
{b} {ai : i I}
T
has S
p. In fact,
S otherwise there is a nite
S F I such that b iI ai = 0, and hence
b iF ai iI ai = b, so that b = iF ai , contrary to our supposition.
Hence by 22.5
the set (1). Now for each
S let D be an ultralter on F which contains
n
n we have gn 2 Ug = 1 D, so by 22.6, there is a g 2 such that Ug D. Since
Uk Uh for distinct k, h n 2, this g is unique; call it g (n) . If m n, then Ug (m) Ug (n) D,
S
hence Ug (m) Ug (n) 6= , hence g (m) g (n) . Let f = m g (m) . So f 2.
295
S
Now write b = hH Uh with H p 2 for some p , by 22.4(iii). Since b D,
by 22.6, Uh SD for some h H . Since also g (p) D, clearly h = g (p) . It follows that
f Uh b = iI ai , so
S there is an i I such that f ai .
Now write ai = kK Uk with K q 2 for some q , by 22.4(iii). As above, we
then get g (q) K , and so f ai . This is a contradiction.
Now we turn to the actual measure.
Lemma 22.8. There is a measure on F such that if m and F m 2, then
[
|F |
Uf = m .
2
f F
Proof. Note rst that the indicated condition on is not quite a denition, since a
given element can be written as a union in many ways. So we rst show
(1)
If m, n , F m 2, G n 2 and
Uf =
f F
Ug , then
gG
|G |
|F |
= n.
m
2
2
f F hHf
Uf
and
f F
|K | =
|Hf |
f F
2nm
f F
[
kK
Uk =
Uf =
f F
[
gG
Ug ,
as desired in (1).
Thus the condition on in the lemma can be taken as a denition, and 22.4(iii) implies
that is dened on all of F. Obviously takes on only nonnegative values, and (0) = 0.
Note that (a) 6= 0 if a 6= 0, although we will not use this. Clearly 0 (a) 1 for all
a F. Also, ( 2) = 1, since
[
( 2) =
Uf = 1.
f {0} 2
Now suppose that a and b are disjoint elements of F; we want to show that
S (a b) =
m
(a)S+ (b). By 22.4(ii) there exist m and F , G 2 such that a = f F Uf and
b = gG Ug . Since a b = , we also have F G = . Hence
[
[
[
Uf
Uf
Ug =
(a b) =
f F
f F G
gG
|F G |
|F | + |G |
|F | |G |
=
= m + m = (a) + (b).
m
m
2
2
2
2
Finally, countable additivity follows trivially from 22.7, since no innite union of nonempty
pairwise disjoint members of F is in F.
=
We can extend the measure given in Lemma 22.7 to the -eld F generated by F, and
the extended measure is still denoted by . (See, e.g., Halmos, Theorem A in section 22.)
Lemma 22.9. If a F , (a) > 0, and > 0, then there is a b F such that b a
and 0 < (b) < .
Proof. We have (a) < 1, so by standard
there is a system
S measure theory
S results,
hbi : i i of elements of F such that a i bi and
S i bi <S1. (See, e.g.,
S the
top of page 57 of Halmos.) ForSeach i let ci = bi \ j<i bj . Then i ci = i bi .
In fact, is clear, and if f i bi , choose i minimum such that f S
bi ; then f ci .
1
m
Now by 22.4(iii) we can choose an m such that 2 > and c0 = gG Ug for some
G m 2.
!
[
1
(1)
There is a g m 2\G such that Ug
ci < m .
2
0<i
In fact, otherwise we have
[
1=
Ug
gm 2
gG
Ug +
ci
gm 2\G
297
Ug
[
0<i
ci
contradiction.
Now we choose g as in (1), and let
[
d=
hm 2\{g}
Then clearly a
i ci
Uh
ci .
0<i
d, hence d a, and
0 6= (d) = Ug \
ci
0<i
1
< .
2m
Lemma 22.10. Suppose that a F with (a) > 0 and n . Then there is a
b F and an h nseq such that dmn(h) n, b a, 0 < (b), and b Uh .
Proof. By 22.8 choose c F such that c a and 0 < (c) < 21nS
. Then by
measure
theory,
there
is
a
sequence
hd
:
i
i
of
elements
of
F
such
that
c
i
i di and
P
1
i (di ) < 2n . By 22.4(ii) we may assume that each di has the form Uhi . Now
[
(c Uhi ), hence
c=
i
(c)
(c Uhi ),
so we can choose i such that 0 < (c Uhi , giving the desired result.
Now we dene Pr to be the collection of all a F of positive measure, ordered by inclusion.
Theorem 22.11. Let M is a c.t.m. of ZFC, and suppose that F, F , Pr are taken
in
T the sense of M . Let G be Pr -generic over M . Then there is an f 2 such that
G = {f }.
Proof. By Lemma 22.10, the set
Dn = {b Pr : there is an h nseq such that b Uh and dmn(h) n}
is dense in Pr for each m . Choose h(n) Dn G for each n , and choose
h(n) nseq such that b(n) Uh(n) . So Uh(n) G. Clearly h(n) and h(m) are compatible
T
def S
(n)
for all
m,
n
,
so
f
=
h
is
a
function
mapping
into
2.
Clearly
f
G. If
n
T
g G, then g Uh(n) for each n, and hence g = f .
In the situation of Theorem 22.11, f , or {i : f (i) = 1}, is called a random real.
Lemma 22.12. Pr has ccc.
Proof. Suppose to the contrary that X Pr is an uncountable family of pairwise
incompatible elements. Then a b = for all distinct a, b X. Now
[
1
X=
x X : (x) >
,
n+1
n
298
def
1
so there is an n such that Y = {x
S X :P(x) > n+1 } is uncountable. If Z
is S
a denumerable subset of Y , we get ( Z) =
xZ (x), which is impossible since
1
( Z) 1 and (x) > n+1 for all x Z.
We now give a little more of the general theory of Boolean algebras. If A is a BA, a subset
I of A is an ideal iff the following conditions hold:
(1) 0 I.
(2) if x I and y x, then y I.
(3) if x, y I, then x + y I.
Given an ideal I, we dene a relation I A A by:
a I b iff
ab I.
hence a b I a b .
(iv): Clearly a a = aa , so (iv) holds.
This lemma enables us to dene the quotient A/I. Its elements are the equivalence classes
under I , and the operations satisfy the following conditions; the lemma says that this is
possible:
[a] + [b] = [a + b];
[a] [b] = [a b];
[a] = [a].
0 = [0].
1 = [1].
The algebraic structure A/I is clearly a BA, since the axioms are easily checked.
We apply this in our case as follows. Let I be the set of all a F of measure 0. It is
easily checked that I is an ideal in F . We dene M = F /I.
A BA A is -complete iff any countable subset of A has a sum.
Lemma 22.14. If A is a -complete BA satisfying ccc, then A is complete.
Proof. Let X be any subset of A; we want to show that it has a sum. By Zorns
lemma, let Y be a maximal set subject to the following conditions: Y consists of pairwise
disjoint elements, P
and for any y Y there isP
an x X such that y x. By ccc, Y is
countable, and so
Y exists. WeP
claim that
YP
is the least upper boundPof X.
Suppose that x X and x 6
Y . Then x Y 6= 0, and Y {x Y } properly
P
contains
Y.
P Y and satises both of the conditions dening Y , contradiction. Hence x
So
Y is an upper bound for X.
P
P
Suppose that z is any upper bound for X, but suppose that Y 6 z. Thus Y z 6=
0, so by 2.2 there is a y Y such that y z 6= 0. Choose x X such that y x. Now
x z, so y z z, hence y z = 0, contradiction.
Lemma 22.15. M satisfies ccc and is complete.
Proof. Suppose that X is an uncountable set of nonzero pairwise disjoint elements
of M. We can write X = {[y] : y Y } for some Y F such that [y] [z] = 0 for
y 6= z. Let
S hy : < 1 i be a sequence of distinct elements of Y . For each < 1 let
zS= y \ < y . For < < 1 we have [y ] [y ] = 0, hence (y y ) = 0, so
( < (y y ) = 0. Hence (z ) = (y ) 6= 0. So hz : < 1 i is a system of pairwise
disjoint elements of Pr , contradicting 22.12. Hence M satises ccc.
To show that M is complete, by Lemma 22.14 it suces to show that it is -complete.
So, suppose that X is a countable subset of S
M . We can write X = {[y] : y Y } for some
Y \z
(y\z) = 0;
so [
yY
Y ] [z], as desired.
300
def
[[f(
n) < m]],
[r]
k([[f(
n) < m]]).
m
f (
n) < p, so [am ] [ap ]. Let bm = pm ap for each m . Then [am ] = [bm ] for
each m.
S
P
(2) m [bm ] =
m bm .
S
In fact,
for {[bm ] : m }. If [c] is any upper bound,
m bm is clearly an upper bound
S
S
then (bm \c) = 0 for each m, and hence m bm \c = 0, so that
m bm [c]. So
(2) holds.
S
S
Note that [r] =
m bm ; so (r) =
m bm . By Theorem D on page 38
of Halmos we get (r) = sup{(bm ) : m }. So we can choose m such that
1
(r). Let h(n) be the least such m. Thus
(bm ) (r) 2n+2
(3)
1
2n+2
(r).
Now
r\
bh(n)
=
It follows that
(r\bh(n) )
1
2n+2
(r)
1
(r).
2
bh(n)
> 0,
Theorem 22.19. Suppose that M is a c.t.m. of ZFC. Consider Q within M , and let
G be Q-generic over M . Then the set
{s Seq : s p for all p G}
is a function from into 2.
Proof. For each n let
Dn = {p Q : there is an s Seq such that dmn(s) = n and s t or t s for all t p}.
Then Dn is dense: if q Q, choose any s q such that dmn(s) = n, and let p = {t q :
s t or t s}. Clearly p Dn and p q.
Now for each n let p(n) be a member of G Dn , and choose s(n) accordingly.
(1) If m < n, then s(m) s(n) .
In fact, choose r G such that r p(m) p(n) . Then s(m) t and s(n) t for all t r
with dmn(t) n, so s(m) s(n) .
(2) s(m) q for all q G.
In fact, let q G, and choose r G such that r q and r p(m) . Take t r with
dmn(t) = m. then t = s(m) since r p(m) . Thus s(m) q since r q.
(3) If t q for all q G, then t = s(m) for some m.
For, let dmn(t) = m. Since t p(m) , we have t = s(m) .
From (1)(3) the conclusion of the theorem follows.
The function described in Theorem 22.19 is called a Sacks real.
If p Q, a member f of p is a branching point iff f h0i, f h1i p.
Sacks forcing does not satisfy ccc:
Proposition 22.20. There is a family of 2 pairwise incompatible members of Q.
Proof. Let A be a family of 2 innite pairwise almost disjoint subsets of . With
each A A we dene a sequence hPA,n : n i of subsets of Seq, by recursion:
PA,0 = {};
{f h0i : f PA,n }
PA,n+1 =
{f h0i : f PA,n } {f h1i : f PA,n }
if n
/ A,
if n A.
S
Note that all members of PA,n have domain n. We set pA = n PA,n . We claim that
pA is a perfect tree. Condition (1) is clear. For (2), suppose that f pA ; say f PA,n .
Let m be the least member of A greater than n. If g extends f by adjoining 0s from n to
m 1, then g h0i, g h1i pA , as desired in (2).
We claim that if A, B A and A 6= B, then pA and pB are incompatible. For,
suppose that q is a perfect tree and q pA , pB . Now A B is nite. Let m be an integer
greater than each member of A B. Let f be a branching point of q with dmn(f ) m;
303
it exists by (2) in the denition of perfect tree. Let dmn(f ) = n. Then f PA,n and
f h0i, f h1i PA,n+1 , so n A by construction. Similarly, n B, contradiction.
Proposition 22.21. Q is not 1 -closed.
Proof. For each n let
pn = {f Seq : f (i) = 0 for all i < n}.
T
Clearly pn is perfect, pn pm if n > m, and n Pn is {f } with f (i) = 0 for all i, so
that the descending sequence hpn : n i does not have any member of Q below it.
By 22.20 and 22.21, the methods of section 11 cannot be used to show that forcing with Q
preserves cardinals, even if we assume CH in the ground model. Nevertheless, we will show
that it does preserve cardinals. To do this we will prove a modied version of 1 -closure.
If p is a perfect tree, an n-th branching point of p is a branching point f of p such that
there are exactly n branching points g such that g f . Thus n > 0. For perfect trees p, q
and n a positive integer, we write p n q iff p q and every n-th branching point of q is
a branching point of p. Also we write p 0 q iff p q.
Lemma 22.22. Suppose that p q are perfect trees, and n . Then:
(i) If p n q, then p i q for every i < n.
(ii) If p n q and f is an n-th branching point of q, then f is an n-th branching point
of p.
(iii) For each positive integer n there is an f p such that f is an n-th branching
point of q.
(iv) The following conditions are equivalent:
(a) p n q.
(b) For every f Seq, if f is an n-th branching point of q, then f h0i, f h1i p.
(v) For each positive integer n there are exactly 2n1 n-th branching points of a perfect
tree p.
(vi) If p and q are perfect trees, then so is p q.
(vii) If p and q are perfect trees, then {r : r is a perfect tree and r p or r q} is
dense below p q.
Proof. (i): Assume that p n q, i < n, and f is an i-th branching point of q.
Then since q is perfect there are n-th branching points g, h of q such that f h0i g and
f h1i h. So g, h p, hence f p. This shows that p i q.
(ii): Suppose that p n q and f is an n-th branching point of q. Let r0 , . . . , rn1 be all
of the branching points g of q such that g f . Then by (i), r0 , . . . , rn1 are all branching
points of p. Hence f is an n-th branching point of p.
(iii): Let f be an n-th branching point of p. Then it is an m-th branching point of q
for some m n. Let r be an n-th branching point of q below f . Then r p, as desired.
[But r might not be a branching point of p.]
(iv), (v), (vi): Immediate from the denitions.
(vii): Suppose that p, q, t are perfect trees and t p q; we want to nd a perfect
tree r t such that r p or r q. If t p q, then r = t works. Otherwise, there is
304
def
qn
x B (
n) = x). Let t0 , . . . , t2n 1 list all of the functions f h0i and f h1i such
that f is an n-th branching point of qn . (Recall 3.21(v).) Then for each i < 2n we have
(
qn ti qn , and so qn ti
x B
n) = x). Hence there exist an ri qn ti and a
305
bi B such that ri
(
n) = bi . Let qn+1 be the amalgamation of {ri : i < 2n } into qn ,
n
and let Fn = {bi : i < 2 }. Thus qn+1 n qn by 22.24. Moreover:
(1)
qn+1
(
n) Fn .
In fact, let G be Q-generic over M with qn+1 G. By 22.22(vii), there is an i such that
ri G. Since ri
(
n) = bi , it follows that G (n) Fn , as desired in (1).
Now with (1) the construction is complete.
By the fusion lemma 22.23 we get s n qn for each n. Hence the conclusion of the
lemma follows.
Theorem 22.26. If M is a c.t.m. of ZF C + CH and Q M is the Sacks forcing
partial order, and if G is Q-generic over M , then cofinalities and cardinals are preserved
in M [G].
Proof. Since |Q| 2 = 1 by CH, the poset Q satises the 2 -chain condition,
and so preserves conalities and cardinals 2 by 9.4. Hence by 9.2 it suces to show
that 1M remains regular in M [G]. Suppose not: then there is a function f : 1M in
M [G] such that rng(f ) is conal in 1M . Hence there is a name such that f = G , and
hence there is a p G such that p
:
1M . By Lemma 22.25, choose q p and
F : [1M ]< in M such that q
(
n) Fn for every n . Take < 1M such that
S
r
< (
n). So we have:
(2) r
(
n) Fn ;
S
(3) n Fn < ;
(4) r
< (
n).
These three conditions give the contradiction r
(
n) < (
n).
Baumgartner, J.; Laver, R. [79] Iterated perfect-set forcing. Ann. Math. Logic 17 (1979),
271288.
Hechler MAD forcing
Innite subsets a, b of are almost disjoint iff a b is nite.
Theorem 22.27. There is a collection A of infinite pairwise almost disjoint subsets
of such that |A | = 2 .
Proof. For each f let af = {f n : n }. If f, g and f 6= g, then af ag
is nite. In fact, choose i such that f (i) 6= g(i). Then af ag {f j : j i}. Now
note that the collection A of all nite sequences of members of has size . So there is a
bijection F from A to . Let
A = {F [af ] : f }.
Clearly A is as desired.
306
A family A of innite subsets of is maximal almost disjoint (MAD) iff any two members
of A are almost disjoint, and A is maximal with this property. By Theorem 22.27, there
is a MAD family of size 2 . (Apply Zorns lemma.)
Theorem 22.28. Every infinite MAD family of infinite subsets of is uncountable.
Proof. Suppose that A is a denumerable pairwise almost disjoint family of innite
subsets of ; we want to extend it. Write A = {An : n }, the An s distinct. We
dene
han : n i by recursion. Suppose that am has been dened for all m < n. Now
S
m<n (Am An ) is nite, so we can choose
an An \ {am : m < n}
(Am An ) .
m<n
say {, } dmn(q1 ). Let q1 ({, }) = m, and suppose that m i < np3 . Then
q3 ({, }) = q1 ({, }) = m, so (p3 ())(i) = 0 or (p3 ())(i) = 0. So (3) holds. The nal
inequalities are clear.
Lemma 22.30. H satisfies ccc.
Proof. Suppose that N is an uncountable subset of H; we want to nd two compatible
members of N . Now hdmn(p) : (p, q) N i is an uncountable system of nite sets, so there
exist an uncountable N N and a nite subset H of 1 such that hdmn(p) : (p, q) N i
is a -system with root H. Next,
N =
where
(f,g)J
J = {(f, g) : f : H , g is a function,
dmn(g) [H]2 , and rng(g) }.
def
To check that (p3 , q1 q2 ) H, rst note that (1) is clear. To show that q1 q2 is a function,
suppose that {, } dmn(q1 ) dmn(q2 ). Then dmn(q1 ) dmn(q2 ) [dmn(p1 )]2
[dmn(p2 )]2 = [H]2 , and it follows that q1 ({, }) = q2 ({, }). Thus q1 q2 is a function.
Furthermore,
dmn(q1 q2 ) = dmn(q1 ) dmn(q2 )
[dmn(p1 )]2 [dmn(p2 )]2
[dmn(p1 ) dmn(p2 )]2
= [dmn(p3 )]2 .
The range of q1 q2 is clearly contained in np2 . So we have checked (2). For (3), suppose
that {, } dmn(q1 q2 ), (q1 q2 )({, }) = m, and m i < np2 . We consider some
cases:
Case 1. {, } dmn(q2 ). Then , dmn(p2 ), so p3 () = p2 () and p3 () = p2 ().
Hence (p3 ())(i) = 0 or (p3 ())(i) = 0, as desired.
Case 2. {, } dmn(q1 )\dmn(q2 ) and i < np1 . Thus , dmn(p1 ). If
dmn(p2 ), then p1 () = p2 (), and so (p3 ())(i) = (p1 ())(i). If
/ dmn(p2 ), still
(p3 ())(i) = (p1 ())(i). Similarly for , so the desired conclusion follows.
308
Proof. Conalities and cardinals are preserved by 22.30 and 9.4. For each < 1 ,
[
x = {p() : (p, q) G for some q, and dmn(p)}.
We claim that x is a function. For, suppose that (a, b), (a, c) x . By the denition,
choose (p1 , q1 ), (p2 , q2 ) G such that dmn(p1 ), dmn(p2 ), (a, b) p1 (), and
(a, c) p2 (). Then choose (p3 , q3 ) G such that (p3 , q3 ) (p1 , q1 ), (p2 , q2 ). By (4) in the
denition of H we have dmn(p3 ), and by (5) we have (a, b), (a, c) p3 (), so a = c.
Next we claim that in fact x has domain . (Its domain is clearly a subset of .)
For, take any m . It suces to show that the set
def
(t())(i) if i < nt ,
(p())(i) = 0
if nt i and 6= ,
1
if nt i and = .
309
It is easy to check that (p, u) H, in fact (p, u) Em , and (p, u) (r, s), as desired. So
each a is innite.
Next we show that distinct a , a are almost disjoint. Suppose that , < 1 with
6= . Since D0 and D0 are dense, there are (p1 , q1 ), (p2 , q2 ) G with dmn(p1 )
and dmn(p2 ). Choose (p3 , q3 ) G such that (p3 , q3 ) (p1 , q1 ), (p2 , q2 ). Thus ,
dmn(p3 ). Next we claim:
def
F = {(r, s) : {, } dmn(s)}
is dense below (p3 , q3 ). In fact, suppose that (t, u) (p3 , q3 ). We may assume that {, }
/
dmn(u). Let dmn(r) = dmn(t), and for any dmn(r) let r() be the function with
domain nt + 1 such that t() r() and (r())(nt) = 0. Let dmn(s) = dmn(u) {{, }},
with u s and s({, }) = nt . It is easily checked that (r, s) H, in fact (r, s) F , and
(r, s) (t, u). So, as claimed, F is dense below (p3 , q3 ). Choose (p4 , q4 ) F G.
We claim that a a q4 ({, }). To prove this, assume that m a a , but
suppose that q4 ({, }) m. Thus x (m) = 1 = x (m), so there are (e, b), (c, d) G such
that dmn(e), m dmn(e()), (e())(m) = 1, and dmn(c), m dmn(c()),
and (c())(m) = 1. Choose (p5 , q5 ) G with (p5 , q5 ) (p4 , q4 ), (a, b), (c, d). Then
{, } dmn(q5 ), q5 ({, }) = q4 ({, }) m < np5 , (p5 ())(m) = (e())(m) = 1,
and (p5 ())(m) = (c())(m) = 1, contradiction. So we have shown that ha : < 1 i is
an almost disjoint family.
To show that ha : < 1 i is MAD, suppose to the contrary that b is an innite
subset of such that b a is nite for all < 1 . Let be a name such that G = b.
For each < 1 let
= {(i, (p, q)) : i , (p, q) H, dmn(p),
i dmn(p()), and (p())(i) = 1}.
Cleary G = a . For each n let An be maximal subject to the following conditions:
(1) An is a collection of pairwise incompatible members of H.
(2) For each (p, q) An , (p, q)
n
or (p, q)
n
/ .
Then
(3) An is maximal pairwise incompatible.
In fact, suppose that (r, s) (p, q) for all (p, q) An . Now (r, s)
n
n
/ , so
there is a (t, u) (r, s) such that (t, u)
n
or (t, u)
n
Using D0 , we may assume that dmn(p1 ). Choose n b with n > m and n np1 .
310
(p ())(i)
3
0
(p4 ())(i) =
0
if
if
if
if
i < np3 ,
np3 i and 6= ,
np3 i, = , and i 6= n,
= and i = n.
We check that (p4 , q1 q2 ) H. Conditions (1) and (2) are clear. For (3), suppose that
{, } dmn(q1 q2 ), and (q1 q2 )({, }) i < np4 . Remember that np4 is k. If
i < np3 , then the desired conclusion follows since (p3 , q1 q2 ) H. If np3 i, then the
desired conclusion follows since at least one of , is dierent from . Hence, indeed,
(p4 , q1 q2 ) H.
Clearly (p4 , q1 q2 ) (p2 , q2 ), and so (p4 , q1 q2 )
n
. It is also clear that
(p4 , q1 q2 ) (p1 , q1 ), so (p4 , q1 q2 )
m.
Since m < n, it follows that
(p4 , q1 q2 )
n
/ . But (
n, (p4 , q1 q2 )) is clearly a member of , and hence (p4 , q1 q2 )
n
, contradiction.
Miller, A. [03] A MAD Q-set. Fund. Math. 178 (2003), 271281.
Hechler real forcing
Let (n, f ) P iff n and f , and order these pairs by (n, f ) (m,
S g) iff n m,
f m = g m, and f (k) g(k) for all k. If G is P -generic over M , then (m,f )P f m
is a Hechler real.
Labedzki, G.; Repick
y, M. Hechler reals. J. Symb. Logic 60 (1995), 444-458.
Prikry-Silver forcing
Let P be the collection of all functions f with domain a subset D of such that \D is
innite, and with range contained
Sin 2, ordered by . This is called Prikry-Silver forcing.
If G is P -generic over M , then pG p is a function from to 2, called a Prikry-Silver
real. The treatment of this forcing is similar to that of Sacks forcing.
Jech, T. Multiple forcing. Cambridge Univ. Press 1986, 136pp.
311
Mathias forcing
Let P consist of all pairs (s, A) such that s is a nite subset of , A is an innite subset
of , and each element of s is less than min(A). Dene (s, A) (s , A ) iff sS s , A A ,
and s\s A . This is the Mathias forcing poset. If G is generic, then (s,A)G s is a
Mathias real.
Bartoszy
nski, T.; Judah, H. Set theory. On the structure of the real line. A. K.
Peters 1995. 546pp.
Laver forcing
A Laver tree is a subset T of
properties:
n
n
We now want to do the same thing for regular limit cardinals . We introduce the Levy
collapsing order:
Lv = {p : p is a nite function, dmn(p) , and
for all (, n) dmn(p), p(, n) }.
Again this set is ordered by .
Lemma 22.33. For regular uncountable, Lv has the -cc.
Proof. Very similar to part of the proof of 22.32.
Theorem 22.34. Let M be a c.t.m. of ZFC, and suppose that in M is regular and
uncountable. Let G be Lv -generic over M . Then cardinals are preserved in M [G],
M [G]
and 1
= .
Proof. Cardinals are preserved by 9.4 and 22.33.SSuppose that 0 < < ; we
will nd a function mapping onto in M [G]. Let g = G. Clearly G is a function.
Now for each < , and m let
Dm = {p Lv : (, m) dmn(p)}.
Clearly Dm is dense, so (, m) dmn(g). Thus dmn(g) = . Now suppose that
< and < . Let
E = {p Lv : there is an m such that (, m) dmn(p) and p(, m) = }.
We claim that E is dense. For, suppose that < and < . Take any q Lv .
Choose m such that (, m)
/ dmn(q), and let p = q {((, m), )}. Clearly p E ,
as desired.
It follows that hg(, m) : m i maps onto .
313
is predense below q. Finally, we say that P is proper iff II has a winning strategy in this
game.
We give a rigorous formulation of these ideas, not relying on informal notions of games.
A play of the game (P ) is an infinite sequence
hp0 , A0 , C0 , A1 , C1 , . . . , An , Cn . . .i
satisfying the following conditions for each n :
(1) p0 P .
(2) An is a maximal antichain of P .
(3) Cn = hBin : i ni, where each Bin is a countable subset of Ai .
Given such a play, we say that II wins iff there is a q p0 such that for every i , the
set
[
Bin
in
is predense below q.
A partial play of length m of (P ) is a sequence
hp0 , A0 , C0 , A1 , C1 , . . . , Am1 , Cm1 , Am i
satisfying the above conditions. Note that the partial play ends with one of the maximal antichains Am . A strategy for II is a function S whose domain is the set of all
partial plays of (P ), such that if P is a partial play as above, then S(P) is a set
Cm satisfying the condition (3). A play is said to be according to S iff for every m,
Cm = S(hp0 , A0 , C0 , A1 , C1 , . . . , Am1 , Cm1 , Am i). The strategy S is winning iff II wins
every play which is played according to S.
Thus this explains the notion of properness rigorously.
314
Am and an element
def
Under PFA, any uncountable Boolean algebra has an uncountable set of pairwise incomparable elements.
Under PFA, any uncountable family of subsets of contains an uncountable simply
ordered subset or an uncountable family of pairwise incomparable elements.
PFA implies that every tree of height 2 with all levels of size less than 2 has a linearly
ordered subset of size 2 .
Now we begin to develop a detailed application of PFA. The application concerns closed
unbounded subsets of 1 , and the proof involves the notion of an indecomposable ordinal.
Recall that an ordinal is called indecomposable iff it is 0 or has the form for some
. First we state two elementary facts. A normal function on an infinite cardinal is a
strictly increasing continuous
function from into . Recall that a function f : is
S
continuous iff f () = < f () for every limit ordinal < .
Proposition 23.5. Let be an uncountable regular cardinal, and C . Then C is
club in iff there is a normal function f on such that rng(f ) = C.
Proof. : Let f be the strictly increasing enumeration of C. Thus f : and f
is strictly increasing. If is a limit ordinal
less than , then f ()
S
S < f () for every <
since f is strictly increasing. Hence < f () S
f (). But < f () C since C is
closed, so there is some < such that f () = < f (). Then f () < f () for all
< . Hence . If < , then f () < f (), hence f () < f () for some < ,
contradiction. So = , as desired, showing that f is continuous.
: Suppose that f is a normal function on and rng(f ) = C. Then obviously C
is unbounded. To show that it is closed, suppose that is a limit ordinal and C is
unbounded
S in . Thus for every < there is a < such that f ( ) < .
Let = < . If < , then f ( ) f (). Thus f (). Now is a limit
ordinal. For, if = + 1, then < , and hence there is a < such that < ,
so S
and hence f () f ( ) < , contradiction. Now the continuity of f gives
f () = < f ( ) . Hence f () = , as desired.
Proposition 23.6. If < 1 , then < 1 .
Proof. By induction on .
Corollary 23.7. The set of all indecomposable ordinals less than 1 is club in 1 .
In particular, the union of a sequence of indecomposable ordinals is indecomposable.
Now we define a partial order which we discuss for the rest of this section. Let PC be
the set of all f fin(1 , 1 ) such that f g for some normal function g on 1 . The order
is .
Lemma 23.8. If p PC , < < 1 for all rng(p), and is indecomposable,
then p {(, )} PC .
Proof. We may assume that p is nonempty. Let f be a normal function on 1 such
that p f . Let be the largest member of rng(p). Then ( rng(f )) (1 \) is club
318
nal less than 1 . Since p0 0 0 < , it follows from 23.8 that q = p0 {(, )} PC .
We claim that q is what
to show that II has won. To prove this, suppose that
S is needed
n
i . To show that in Bi is predense below q, take any r q. Let r = r ( ).
Obviously r PC . Choose n with n i such that r n n . Since n+1 C(Ai )
and n < n+1 , by definition of C(Ai ) we have g(Ai , n ) < n+1 , and it follows that there
is an s Ai such that s and r are compatible and s g(Ai , n ) g(Ai, n ). We claim
that s and r are compatible (as desired). For, choose normal functions h and k on 1 such
that s r h and r k. Now the set (rng(h) n+1 ) (\n+1 ) (rng(k)\) is club;
let l be its strictly increasing enumeration. Choose < 1 such that h[] = rng(h) n+1 .
Now since r q we have dmn(r) and r() = ; hence k() = . It follows that
l() = h() for all < ,
l( + ) = n+1 + for all < , and l[[, )] = \n+1 ,
l( + ) = k( + ) for all < 1 .
Now suppose that dmn(s). Then (, s()) g(Ai , n ) g(Ai , n ) n+1 n+1 , and
so s() = h() = l(). Next, suppose that dmn(r) and < . Since r() = , it
follows that r() < . So (, r()) and hence r() = r () and (, r()) n n .
So r() = r () = h() = l(). Finally, suppose that dmn(r) and . Then
r() = k() = l().
Proposition 23.10. PC does not satisfy Axiom A.
Proof. Suppose it does. We define three sequences hAn : n i, hpn : n i and
hCn : n i by recursion. Let p0 be any nonempty member of PC , and let A0 be a maximal
antichain such that each member q of A0 is nonempty with max(rng(q)) > max(rng(p0 )).
Having defined An , pn , and all Cm with m < n we now define An+1 , pn+1 and Cn . By
condition (A3) in the definition of Axiom A, let pn+1 n pn and Cn An be such that Cn
is a countable subset of An and is predense below pn+1 . Let An+1 be a maximal antichain
such that each member q of An+1 is nonempty with max(rng(q)) > max(rng(pn+1 )).
So, our sequences have been defined. By (A4), choose q PC such that q pn for
all n . Since q is finite, it follows that there is an m such that q = pm = pn for
all n m. Now let be the largestSmember of rng(pm ), say with (, ) pm . Choose
greater than and each member of qCm+1 rng(q), and let r = pm {( + 1, )}. Clearly
r PC . Now r pm = pm+1 , so r is compatible with some member q of Cm+1 Am+1 .
Say r q f with f a normal function. Since r() = and r( + 1) = , we have
f () = and f ( + 1) = . Let = max(rng(q)); say (, ) q. Since q Am+1 , we
have > max(rng(pm+1 )) = max(rng(pm )) = . Thus f () = q() = > = f (), so
+ 1. Hence = f ( + 1) f () = , contradicting the choice of .
Now we are ready for our one application of PFA.
Theorem 23.11. Assume PFA, and suppose that hA : < 1 i is a collection of
infinite subsets of 1 . Then there is a club C in 1 such that for all < 1 , A 6 C.
320
Proof. We are going to apply PFA to the partial order PC which we have been
discussing. By 23.9, PC is proper. We now define three families, each of size at most 1 ,
of dense subsets of PC . For each < 1 , let
D = {p PC : there is a A such that one of the following holds:
(1) 0 dmn(p) and < p(0);
(2) there is a such that , + 1 dmn(p) and p() < < p( + 1)}.
To see that D is dense in PC , let q PC . Say q f with f a normal function on 1 . Let
r = q {(0, f (0))}. Clearly r PC and r q. Let be the largest member of rng(r), and
choose so that r() = .
If < f (0) for some A , then r D , as desired. Suppose that f (0) for
every A .
Next, suppose that A 6 rng(f ). Choose A \rng(f ), and let be minimum such
that < f (). By the continuity of f , is a successor ordinal + 1. Then f () < <
f ( + 1), and so r {(, f ()), ( + 1, f ( + 1))} is the desired member of D extending
q. So suppose that A rng(f ).
Suppose that < for some A . Define g : 1 1 by setting
f ()
if ,
g() =
+ if = + with 6= 0.
def
iff
This is clearly a partial order on . We say that F is almost unbounded iff there is
no g such that f < g for all f F . Clearly itself is almost unbounded; it has
size 2 .
Theorem 24.1. Let be a regular cardinal. Then any almost unbounded subset of
has size at least + .
Proof. Let F have size ; we want to find an almost bound for it. We may
assume that F 6= . Write F = {f : < }, possibly with repetitions. (Since maybe
|F | < .) Define g by setting, for each < ,
!
g() =
sup f ()
+ 1.
Now we apply MA() to get a filter G on P intersecting all of these dense sets. Since G
def S
is a filter, the relation g = (p,F )G p is a function. Since GEn 6= for each n , g has
domain . Let f F . Choose (p, F ) G Df . Let m be greater than each member
of dmn(p). We claim that f (n) < g(n) for all n m. For, suppose that n m. Choose
(q, H) G such that n dmn(q), and choose (r, K) G such that (r, K) (p, F ), (q, H).
Then f K since F K. Also, n dmn(r) since q r. So n dmn(r)\dmn(p). Hence
from (r, K) (p, F ) we get g(n) = r(n) > f (n).
As another illustration of iterated forcing, we now show that it is relatively consistent that
every almost unbounded subset of has size 2 , while MA holds. This follows from
the following theorem, using the fact, which we assume, that MA implies that 2 = 2 for
every infinite cardinal < 2 .
Theorem 24.3. There is a c.t.m. of ZFC with the following properties:
(i) 2 = 2 .
(ii) 21 = 3 .
(iii) Every almost unbounded set of functions from to has size 2 .
Proof. Applying 11.9 to a model N of GCH, with = 1 and = 3 , we get a c.t.m.
M of ZFC such that in M , 2 = 1 and 21 = 3 . We are going to iterate within M , and
iterate 2 times. At each successor step we will introduce a function almost greater than
each member of at that stage. In the end, any subset of of size less than 2 appears
at an earlier stage, and is almost bounded.
(1) If Q is a ccc quasi-order in M of size less 1 , then there are at most 1 nice Q-names
for subsets of ( ).
To prove (1), recall that a nice Q-name for a subset of ( )is a set of the form
[
{{
a} A : a }
and [ :
implies that = ].
In fact, clearly
1P
P W [W :
and [ :
In fact, first suppose that x (0 )G . Then there exist p fin(, ) and H [1 ]< such
that x = (p, (
H )G ). Now (H )G = {( )G : H}, and ( )G for each by (2).
Thus x is in the right side of (3).
Second, suppose that p fin(, ) and K [ ]< . For each f K there is a
1 = {(op(op(
p,
p ,
H ), op(
H )), q) : p, p fin(, ),
H, H [1 ]< , p p, H H, q P , and for all H
To prove this, first suppose that x (1 )G . Then there are q G, p, p fin(, ) and
1
that ((p, K), (p , K )) (p )G , as desired.
Next, we let 2 = {(op(0, 0), 1P )}. Then for any generic G, (2 )G = (0, 0). Finally,
let = op(op(0 , 1 ), 2 ). This finishes the definition of .
By the argument in the proof of 24.2 we have
(5) 1P
P is
1 cc.
325
Then byS
15.18, G is P-generic over N , H N [G+1 ], and H is Q -generic over N [G ].
Let g = (p,F )H p. Clearly g is a function. For each m , let
Em = {(p, K) : Q : m dmn(p)}.
Then Em is dense. (See the proof of 24.2.) It follows that g .
def
<
<
P = {(p, F ) : p Fn(, , ), F
} (see before 11.5)
(p, F ) (q, G) iff q p, G F, and f G dmn(p)\dmn(g)(p() > f ());
1P = (0, 0).
Then the following conditions hold.
(i) |P | + .
(ii) P is -closed.
(iii) P has the + -cc.
(iv) P preserves cofinalities and cardinals.
(v) If G is P-generic over M , then there is a function g in M [G] such that f < g
for all f ( )M .
Proof. Clearly (i) holds.
P satisfies the + -c.c.: Suppose that B P with |B| + . Then since |Fn(, , )| =
, wlog there is a q such that p = q for all (p, F ) B, and so + -c.c. is clear.
S
P is -closed:
Suppose
that
h(p
,
F
)
:
<
i
is
decreasing,
with
<
.
Let
q
=
< p
S
and G = < F . Suppose that < ; we claim that (q, G) (p , F ). Suppose that
f F and dmn(q)\dmn(p ). Then there is a < such that dmn(p ). We may
assume that < . Hence (p , F ) (p , F ), so q() = p () > f (), as desired.
Now it follows from sections 9, 11 that (iv) holds.
Now suppose that G is P-generic over M . Define
g=
p.
(p,F )G
Clearly g is a function with domain and range included in . To show that g has domain
, take any < . Let D = {(p, F ) : dmn(p)}. Then D is dense. In fact, suppose
that (q, H) P. Wlog
/ dmn(q). Let p be the extension of g by adding to its domain
and defining p() to be any ordinal less than which is greater than each f () for f H.
Clearly (p, H) (q, H) and (p, H) D. So g has domain .
def
If is a P-name for a p.o., then we say that is full for -sequences iff the following
conditions (a)(d) imply condition (e):
(a) p P.
(b) < .
(c) dmn( 0 ) for each < .
(d) for all , < , if < , then p
( 0 ) ( 0 ) ( ).
(e) There is a dmn( 0 ) such that p
and p
for each < .
Lemma 24.5. Let M be a c.t.m. of ZFC, and an infinite cardinal in M . Let I be
the ideal in P() consisting of all sets of size less than . In M , let (P, ) be an -stage
iterated forcing construction with supports in I (Kunens sense). Suppose that for each
< , the P -name is full for -sequences. Then P is -closed.
Proof. Let hp : < i be a sequence of elements of P such that p p if
< < , and < . We will define p = hp : < i by recursion so that the following
condition holds:
For all < , p = hp : < i P and < (p p ) and
[
supp(p ) =
supp(p ).
<
The induction step to a limit ordinal is clear, as is the case = 0. Now we define p ,
given p . By fullness we get dmn( 0 ) such that
p
and p
for each < .
Clearly p is as desired.
Here is our generalization of 24.3.
Theorem 24.6. Let M be a c.t.m. of GCH, and let be an uncountable regular
cardinal in M . Then there is a generic extension N of M preserving cofinalities and
cardinals such that in N the following hold:
(i) 2 = ++ .
+
(ii) 2( ) = +++ .
(iii) Every subset of of size less than 2 is almost unbounded.
Proof. First we apply 11.9 with = + and = +++ to get a generic extension M
+
of M preserving cofinalities and cardinals in which 2< = , 2 = + , and 2 = +++ .
We are going to iterate within M , and iterate ++ times. At each successor step we
will introduce a function almost greater than each member of at that stage. In the end,
any subset of of size less than ++ appears at an earlier stage, and is almost bounded.
(1) If Q is a + -cc quasi-order in M of size less + , then there are at most + nice
Q-names for subsets of ( ).
328
To prove (1), recall that a nice Q-name for a subset of ( )is a set of the form
[
{{
a} A : a }
where for each aP , Aa is an antichain in Q. Now by + -cc, the number of antichains
in Q is at most <+ |Q| + by 2 = + . So the number of sets of the indicated form
is at most ( + ) = + . Hence (1) holds.
Now we are going to define by recursion functions P, , and with domain ++ .
Let P0 be the trivial partial order ({0}, 0, 0).
Now suppose that P has been defined, so that it is a + -cc quasi-order in M of size
at most + , it is -closed, and every element has support of size less than . Also we
assume that has been defined for every < so that is a P -name for a quasi-order,
and it is full for -sequences. We now define , , and P+1 . By (1), the set of all nice
P -names for subsets of ( ) has size at most + . We let { : < + } enumerate all
of them.
(2) For every < 1 there is a P -name such that
1P
P : and [ : implies that = ].
In fact, clearly
1P
P W [W : and [ : implies that W = ]],
and so (2) follows from the maximal principle.
This defines .
([ + ]< )M = ([ + ]< )M
[G]
In fact, is clear. Now suppose that L ([ + ]< )M [G] . Then there exist an ordinal <
and a bijection f from onto L. Since P is -closed, by 11.1 we have f M , and hence
L M , as desired in (3). Similarly,
(4)
(5)
[G]
In fact, first suppose that x (0 )G . Then there exist p Fn(, , ) and H [ + ]< such
that x = (p, (
H )G ). Now (H )G = {( )G : H}, and ( )G for each by (2).
Thus x is in the right side of (3).
329
+
(f ) < + such that f = ((f
of
) )G . Let H = {(f ) : f K}. So H is a subset of
size less than . By (2) we have f = ((f ) )G for each f K. Now (H )G = K, and so
(p, K) (0 )G , as desired. So (5) holds.
Next, we define
1 = {(op(op(
p,
p ,
H ), op(
H )), q) : p, p Fn(, , ),
H, H [ + ]< , p p, H H, q P , and for all H
< (p())}.
and all dmn(p)\dmn(p ), q
P ()
H and all n dmn(p)\dmn(p ), q
() < (p()). Then with K = (
H )G and
op(
q , H ). Now if < < , then p
; hence q q . Let r = <
S
and K = < H . Thus r Fn(, , ) and K [ + ]< . Let = op(
r ,
K ). Clearly
0
dmn( ). Suppose that < . To show that p
q , suppose that p G with
330
G P -generic over M . Then G = (r, (
K )G ), and clearly (K )G ) = K. Suppose that
H and dmn(r)\dmn(q ). Say dmn(q ) with < . Clearly < . Since
( )G ( )G by (8), we have r() = q () > ( )G (). This proves that G ( )G ,
and so (9) holds.
Now P+1 is defined by (I7) and (I8) in the definition of iteration. We now want to
show that P+1 is + -cc, and for this we will apply 15.10. we are assuming that P is
+ -cc, so it suffices to prove that 1
P cc. So, let G be P -generic over M . As
above, 2< = in M [G]. Now |P | + by assumption. Hence 2 = + in M [G] by 9.6
(with , , replaced by + , + , respectively). Hence G is + -cc by (5) and (6). So
P+1 is + -cc by 15.10.
P+1 is -closed by 24.5, since we have proved that is full for -sequences. This
finishes the recursion step from to + 1.
Now suppose that is a limit ordinal ++ . We let
332
iff
{i A : f (i) R g(i)} F.
The most important cases of this notion that we will deal with are f <F g, f F g, and
and f =F g. Thus
f <F g iff {i A : f (i) < g(i)} F ;
f F g
f =F g
iff
iff
{i A : f (i) g(i)} F ;
{i A : f (i) = g(i)} F.
Sometimes we use this notation for ideals rather than filters, using the duality between
ideals and filters, which we now describe. An ideal on a set A is a collection I of subsets
of A such that the following conditions hold:
(4) I
(5) If X Y I then X I.
(6) If X, Y I then X Y I.
An ideal I is proper iff I 6= P(A).
If F is a filter on A, let F = {X A : A\X F }. Then F is an ideal on A. If I is
an ideal on A, let I = {X A : A\X I}. Then I is a filter on A. If F is a filter on
A, then F = F . If I is an ideal on A, then I = I.
Now if I is an ideal on A, then
f RI g
iff
{i A : (f (i) RI g(i))} I;
f <I g
f I g
f =I g
iff
iff
iff
{i A : f (i) g(i)} I;
{i A : f (i) > g(i)} I;
{i A : f (i) 6= g(i)} I.
n
n+1
if n is even,
if n is odd.
(2) An example where f =F g but neither f <F g nor f = g: Let A = and let F consist
of all subsets of that contain all even natural numbers. Define f and g by
n
n if n is even,
n if n is even,
f (n) =
g(n) =
1 if n is odd;
0 if n is odd.
Products and reduced products
In the preceding section we were considering ordering-type relations on the proper classes
A
Ord. Now we restrict ourselves
that h A Ord. We specialize the
Q to sets. Suppose
A
general notion by considering aA h(a) Ord. To eliminate trivialities, we usually
assume that h(a) is a limit ordinal for every a A; then we call h non-trivial.
Proposition 25.3. If Q
F is a proper filter on A, g, h
g <F h, then there is a k aA h(a) such that g =F k.
aA
h(a). Moreover,
{a A : g(a) = k(a) {a A : g(a) < h(a)} F,
so g =F k.
Q
We will frequently consider the structure ( aA h(a), <F , F ) in what follows. For most
considerations it is equivalent to consider the associated reducedQproduct, which we define
as follows. Note that =F is an equivalence relation on the set aA h(a). We define the
334
iff
f, g
x F y
iff
f, g
Y
Y
A under =F .
aA
h(a). Then
Proof.
Now suppose that [f ] <F [g]. Then there are
Q (i): The direction is obvious.
also convenient to change the notion of partial order slightly. A (strict) partial ordering is
a pair (P, <P ) consisting of a nonempty class P and an irreflexive transitive relation <P
which is a subset of P P . Many times we omit the subscript P if it is clear from the
context which class is meant. And we may speak of a partial ordering P if the relation
<P is clear from the context. The essential equivalence of this notion of a partial ordering
with the version is expressed in the easy exercise 2.15.
A double ordering is a system (P, P , <P ) such that (P, P ) is a quasi-ordering,
(P, <P ) is a partial ordering, and the following connections between the two hold:
(1) If p <P q or p = q, then p P q.
(2) If p <P q P r or p P q <P r, then p <P r.
(3) For every p P there is a q P such that p <P q.
Proposition 25.6. For any set A and proper filter F on A, the system (A Ord, F
, <F ) is a double ordering.
Proposition 25.7. Let h A Ord, with h taking only limit ordinal values, and let F
be a proper
Q filter on A. Then
(i) ( QaA h(a), F , <F ) is a double ordering.
(ii) ( aA h(a)/F, F , <F ) is a double ordering.
We now give some general definitions, applying to any double ordering (P, P , <P ) unless
otherwise indicated.
A subclass X P is cofinal in P iff p P q X(p P q). By the condition (3) above,
this is equivalent to saying that X is cofinal in P iff p P q X(p <P q).
Since clearly P itself is cofinal in P , we can make the basic definition of the cofinality
cf(P ) of P , for a set P :
cf(P ) = min{|X| : X is cofinal in P }.
We note that cf(P ) can be singular. For example, if we take P to be the disjoint union of
copies of n for n , we get cofinality .
A sequence hp : < i of elements of P is <P -increasing iff , < ( < p <P
p ). Similarly for P -increasing.
Suppose that P is a double order and is a set. We say that P has true cofinality iff P
has a linearly ordered subset which is cofinal.
Proposition 25.8. Suppose that a set P is a double order, and hp : < i is strictly
increasing in the sense of P , is cofinal in P , and is regular. Then P has true cofinality,
and its cofinality is .
Proof. Obviously P has true cofinality. If X is a subset of P of size less than , for
each q X choose q < such that q < pq . Let = supqX q . Then < since is
regular. For any q X we have q < p . This argument shows that cf(P ) = .
336
Proposition 25.9. Suppose that P is a double ordering, P a set, and P has true
cofnality. Then:
(i) cf(P ) is regular.
(ii) cf(P ) is the least size of a linearly ordered subset which is cofinal in P .
(iii) There is a <P -increasing, cofinal sequence in P of length cf(P ).
Proof. Let X be a linearly ordered subset of P which is cofinal in P , and let {y :
< cf(P )} be a subset of P which is cofinal in P ; we do not assume that hy : < cf(P )i
is <P - or P -increasing.
(iii): We define a sequence hx : < cf(P )i by recursion. Let x0 be any element of
X. If x has been defined, let x+1 X be such that x , y < x+1 ; it exists since X is
cofinal, using condition (3). Now suppose that < cf(P ) is limit and x has been defined
for all < . Then {x : < } is not cofinal in P , so there is a z P such that z 6 x
for all < . Choose x X so that z < x . Since X is linearly ordered, we must
then have x < x for all < . This finishes the construction. Since y < x+1 for all
< cf(P ), it follows that {x : < cf(P )} is cofinal in P . So (iii) holds.
(i): Suppose that cf(P ) is singular, and let h : < cf(cf(P ))i be a strictly increasing
sequence cofinal in cf(P ). With hx : < cf(P )i as in (iii), it is then clear that {x :
< cf(cf(P ))} is cofinal in P , contradiction (since cf(cf(P )) < cf(P ) because cf(P ) is
singular).
(ii): By (iii), there is a linearly ordered subset of P of size cf(P ) which is cofinal in
P ; by the definition of cofinality, there cannot be one of smaller size.
For P with true cofinality, the cardinal cf(P ) is called the true cofinality of P , and is
denoted by tcf(P ). It does not always exist; the example given before Proposition 25.8
shows this. When it does exist, it coincides with the cofinality. We write
tcf(P ) =
to mean that P has true cofinality, and it is equal to .
P is -directed iff for any subset Q of P such that |Q| < there is a p P such that
q P p for all q Q; equivalently, there is a p P such that q <P p for all q Q.
Proposition 25.10. (Pouzet) Assume that P is a double poset. For any infinite
cardinal , we have tcf(P ) = iff the following two conditions hold:
(i) P has a cofinal subset of size .
(ii) P is -directed.
Proof. is clear, remembering that is regular. Now assume that (i) and (ii) hold,
and let X be a cofinal subset of P of size .
S
First we show that is regular. Suppose that it is singular. Write X = <cf() Y
with |Y | < for each < cf. Let p be an upper bound for Y for each < cf, and
let q be an upper bound for {p : < cf}. Choose r > q. Then choose s X with r s.
Say s Y . Then s p q < r s, contradiction.
So, is regular. Let X = {r : < }. Now we define a sequence hp : < i by
recursion. Having defined p for all < , by (ii) let p be such that p < p for all
< , and r < p for all < . Clearly this sequence shows that tcf(P, <P ) = .
337
dominance. (That is, f < g iff f (a) < g(a) for all a A.) Then sup f is an exact upper
bound for f , and cf((sup f )(a)) = for every a A.
Proof. For brevity let h = sup f . Then clearly h is an upper bound for f . Now suppose that f g A Ord for all < . Then for any a A we have h(a) = sup< f (a)
g(a), so h g. Thus h is a least upper bound for f . Now suppose that k A Ord and
k < h. Then for every a A we have k(a) < h(a), and hence there is a a < such
that k(a) < fa (a). Let = supaA a . So < since is regular and greater than |A|.
Clearly k < f , as desired.
The next proposition gives equivalent definitions of least upper bounds for our special
partial order.
Proposition 25.17. Suppose that I is a proper ideal on A, F A Ord, and f A Ord.
Then the following conditions are equivalent.
(i) f is a least upper bound of F under I .
(ii) f is an upper bound of F under I , and for any f A Ord, if f is an upper
bound of F under I and f I f , then f =I f .
(iii) f is a minimal upper bound of F under I .
Proof. (i)(ii): Assume (i) and the hypotheses of (ii). Hence f I f , so f =I f by
Proposition 25.2(vii).
(ii)(iii): Assume (ii), and suppose that g A Ord is an upper bound for F and
g I f . Then g =I f by (ii), so f I g.
(iii)(i): Assume (iii). Let g A Ord be any upper bound for F . Define h(a) =
min(f (a), g(a)) for all a A. Then h is an upper bound for F , since if k F , then
{a A : k(a) > f (a)} I and also {a A : k(a) > g(a)} I, and
{a A : k(a) > min(f (a), g(a))} {a A : k(a) > f (a)} {a A : k(a) > g(a)} I,
so k I h. Also, clearly h I f . So by (iii), f I h, and hence f I g, as desired.
In the next proposition we see that in the definition of exact upper bound we can weaken
the condition (1), under a mild restriction on the set in question.
Proposition 25.18. Suppose that F is a nonempty set of functions in A Ord and
f F f F [f <I f ]. Suppose that h is an upper bound of F , and g A Ord, if
g <I h then there is an f F such that g <I f . Then h is an exact upper bound for F .
Proof. First note that {a A : h(a) = 0} I. In fact, choose f F . Then f <I h,
and so {a A : h(a) = 0} {a A : f (a) h(a)} I, as desired.
Now we show that h is a least upper bound for F . Let k be any upper bound. Let
n
l(a) = k(a) if k(a) < h(a),
0
otherwise.
Since {a A : l(a) h(a)} {a A : h(a) = 0}, it follows by the above that {a A :
l(a) h(a)} I, and so l <I h. So by assumption, choose f F such that l <I f . Now
f I k, so l <I k and hence
{a A : k(a) < h(a)} {a A : l(a) k(a)} I,
340
so h I k, as desired.
For the other property in the definition of exact upper bound, suppose that g <I h.
Then by assumption there is an f F such that g <I f , as desired.
Q
Corollary 25.19. If h A Ord is non trivial and F aA h(a),
Q then h is an exact
upper bound of F with respect to an ideal I on A iff F is cofinal in aA h(a).
In the next proposition we use the standard notation I + for A\I. The proposition shows
that exact upper bounds restrict to smaller sets A.
Proposition 25.20. Suppose that F is a nonempty subset of A Ord, I is a proper ideal
on A, h is an exact upper bound for F with respect to I, and f F f F (f <I f ).
Also, suppose that A0 I + . Then:
def
(i) J = I P(A0 ) is a proper ideal on A0 .
(ii) For any f, f A Ord, if f <I f then f A0 <J f A0 .
(iii) h A0 is an exact upper bound for {f A0 : f F }.
(i) is clear. Assume the hypotheses of (ii). Then
{a A0 : f (a) f (a)} {a A : f (a) f (a)} I,
and so f A0 <J f A0 .
For (iii), by (ii) we see that h A0 is an upper bound for {f A0 : f F }. To
see that it is an exact upper bound, we will apply Proposition 25.125. So, suppose that
k <J h A0 . Fix f F . Now define g A Ord by setting
f (a) if a A\A0 ,
g(a) =
k(a) if a A0 .
Then
{a A : g(a) h(a)} {a A : f (a) h(a)} {a A0 : k(a) h(a)} I,
so g <I h. Hence there is an l F such that g <I l. Hence
{a A0 : k(a) l(a)} {a A : g(a) l(a)} I,
so k <J l, as desired.
Next, increasing the ideal maintains exact upper bounds:
Proposition 25.21. Suppose that F is a nonempty subset of A Ord, I is a proper ideal
on A, h is an exact upper bound for F with respect to I, and f F f F (f <I f ).
Let J be a proper ideal on A such that I J. Then h is an exact upper bound for F
with respect to J.
Proof. We will apply Proposition 25.125. Note that h is clearly an upper bound for
F with respect to J. Now suppose that g <J h. Let f F . Define g by
g(a) if g(a) < h(a),
g (a) =
f (a) otherwise.
341
I
aA
aA S(a). Thus
Q
k h, so there is an f X such that
Qk <I f . Since f gf , it follows
Q that k <I gf , as
desired. Conversely, suppose that
Q Y aA S(a) and Y is cofinal in ( aA S(a),
Q <I ); we
show that also Y is cofinal in h, and this will prove of the claim. Let f h. Then
f gf , and there is a k Y such that gf <I k; so f <I k, as desired.
This finishes the proof of (i). Q
For (ii), first suppose that tcf( h, <I ) exists; call it . Thus
Q is an infinite regular
cardinal. Let hfi : i < i be a <I -increasing cofinal sequence in h. We claim that gfi
gfj if i < j < . In fact, if a A and fi (a) < fj (a), then fi (a) < fj (a) gfj (a) S(a),
andQso by the definition of gfi we get gfi (a) gfj (a). This implies that gfi I gfj . Now
cf( h, <I ) = , so for any B []< there is a j < such that gfi <I fj gfj . It follows
that we can take a subsequence of hgfiQ
: i < i which is strictly increasing modulo I; it is
also clearly cofinal, and hence = Q
tcf( aA S(a), <I ) by Proposition 25.25.
Conversely, suppose that tcf( Q
aA S(a), <I ) exists; call it . Let hfi : i < i be
a <Q
I -increasing cofinal sequence in
Q aA S(a). Then it is also a sequence showing that
tcf( h, <I ) exists and equals tcf( aA S(a), <I ).
Proposition 25.23. Suppose that hLa : a Ai and hMa : a Ai are systems of
linearly ordered sets such that each La and Ma has no last element. Suppose that La is
isomorphic to Ma for all a A. Let I be any ideal on A. Then
!
!
Y
Y
La , <I , I
Ma , <I , I .
=
aA
aA
342
iff
X B and c1 [X] I.
Q
Q
Then J is an ideal
on
B,
and
there
is
an
isomorphism
h
of
B/J
into
aA c(a)/I such
Q
that for any e B we have h(e/J) = he(c(a)) : a Ai/I.
Q
If |A| <
Q min(B), then
Q the range of h is cofinal in aA c(a)/I, and we have
(i) cf( Q
B/J) = cf( aA
Q c(a)/I and
(ii) tcf( B/J) = tcf( aA c(a)/I).
Q
Proof. Clearly
Q J is an ideal. Next, for any e B let e = he(c(a)) : a Ai. Then
for any e1 , e2 B we have
e1 =J e2
iff
{b B : e1 (b) 6= e2 (b)} J
iff
iff
iff
{a A : e1 (c(a)) 6= e2 (c(a))} I
e1 =I e2 .
This shows that h exists as indicated and is one-one. Similarly, h preserves <I in each
direction. So the first part of the lemma holds.
Now suppose that |A| < min(B). Let
Q G be the range of h.
Q By Proposition 25.11,
Q(i)
and (ii) follow from G being cofinal in aA c(a)/I. Let g aA c(a). Define e B
by setting, for any b B,
e(b) = sup{g(a) : a A and c(a) = b}.
Q
The additional supposition implies that e
B. Now note that {a A : g(a) >
e(c(a))} = I, so that g/I h(e/J), as desired.
According toQthese last propositions, the calculation of true cofinalities for partial orders
of the form ( aA h(a), <I ), with h A Ord and h(a) a limit ordinal for every a A, and
with |A| <Qmin(cf(h(a)), reduces to the calculation of true cofinalities of partial orders of
the form ( B, <J ) with B a set of regular cardinals with |B| < min(B).
For the next proposition,
Q note that an ultraproduct of linear orders is a linear order.
In fact, given [f ] 6= [g] in iI Li /F with each Li a linear order, we have
{i I : f (i) 6= g(i)} = {i I : f (i) < g(i)} {i I : g(i) < f (i)},
343
and since the set on the left is in F , it follows that one of the sets on the right is in F ;
hence [f ] < [g] or [g] < [f ].
Lemma 25.25. If (Pi , <i ) isQa partial order with
Q true cofinality i for each i I and
D is an ultrafilter on I, then tcf( iI i /D) = tcf( iI Pi /D).
Q
Proof. Note that iI i /D is a linear order, and so its true cofinality
exists and
Q
equals its cofinality. So the lemma is asserting that the ultraproduct iI Pi /D has as
true cofinality.
Q
Let hg : < i be a sequence
Q of members of iI i such that hg /D : < i is
strictly increasing and cofinal in iI i /D. For each i I letQhf,i : < i i be strictly
increasing and cofinal in (Pi , <i ). For each < define h iI Pi by setting
h (i) =
Q
fg (i),i . We claim that hh /D : < i is strictly increasing and cofinal in iI Pi /D (as
desired).
To prove this, first suppose that < < . Then
{i I : h (i) < h (i)} = {i I : fg (i),i <i fg (i),i } = {i I : g (i) < g (i)} D;
so h /D < h /D.
Q
Now suppose
that k iI Pi ; we want to find < such that k/D < h /D.
Q
Define l iI i by letting l(i) be the least < such that k(i) < f,i . Choose <
such that l/D < g /D. Now if l(i) < g (i), then k(i) < fl(i),i <i fg (i),i = h (i). So
k/D < h /D.
Existence of exact upper bounds
We introduce several notions leading up to an existence theorem for exact upper bounds:
projections, strongly increasing sequences, a partition property, and the bounding projection property.
We start with the important notion of projections. By a projection framework we
mean a triple (A, I, S) consisting of a nonempty set A, an ideal I on A, and a sequence
hSa : a Ai of nonempty sets of ordinals. Suppose that we are given such a framework. We
define sup S in the natural way: it is a function with domain A, and (sup S)(a) = sup(Sa )
A
for every a A. Thus sup S A Ord. Now
Q suppose also that we +have a function f Ord.
Then we define the projection of f onto aA Sa , denoted by f = proj(f, S), by setting,
for any a A,
min(Sa \f (a)) if f (a) < sup (Sa ),
+
f (a) =
min(Sa )
otherwise.
Thus
f + (a) =
f (a)
min(Sa )
344
if f (a)
/ Sa and f (a) < sup(Sa ),
if sup(Sa ) f (a).
sup{f + 1 : L } I f .
the ordinal = supaA\M a is less than . We claim that d <I f() . In fact, suppose
that a A\(M Z ). Then a A\(Za Z ), and so d(a) < f(a ) (a) f() (a). Thus
d <I f() , as claimed. Now it follows easily from Proposition 25.18 that h is an exact
upper bound for f .
For the final portion of (ii), it suffices to show
(1) There is a W I such that cf(h(a)) = for all a A\W .
In fact, let
W = {a A : a < [a , )[a Z ]}.
def
h (a) =
h(a0 ) if a W .
Then h =I h , and it follows that h satisfies the properties needed.
(iii)(iv): For each a A, let ha : < i be a strictly increasing sequence of ordinals
with supremum h(a). Define g (a) = a for all a A and < . Clearly g < g if < .
Now suppose that < . Then f <I h. For each a A such that f (a) < h(a) choose
a < such that f (a) < aa . Since |A| < , choose < such that a < for all a A.
Then for any a A such that f (a) < h(a) we have f (a) < a = g (a). Hence f <I g ,
which is half of what is desired in (iv).
Now suppose that < . Then g < h, so by the exactness of h, there is an <
such that g <I f , as desired.
(iv)(i): Assume (iv). Define strictly increasing continuous sequences h() : < i
and h() : < i of ordinals less than as follows. Let (0) = 0, and choose (0) so
that g0 <I f(0) . If () and () have been defined, choose ( + 1) > () such that
f() I g(+1) , and choose ( + 1) > () such that g(+1) <I f(+1) . Thus for every
< we have
g() <I f() I g(+1) .
since obviously hg() : < i is strongly increasing under I, Lemma 25.28 gives (i).
The notion of a strongly increasing sequence is clarified by giving an example of a sequence
such that no subsequence is strongly increasing. This example depends on the following
well-known lemma.
Lemma 25.30. If is a regular cardinal and I is the ideal []< on , then there is
def
a sequence f = hf : < + i of members of such that f <I f whenever < < .
Proof. We construct the sequence by recursion. Let f0 () = 0 for all < . If f
has been defined, let f+1 () = f () + 1 for all < . Now suppose that < is a
limit ordinal, and f has been defined for every < . Let h() : < i be a strictly
increasing sequence of ordinals with supremum , where = cf(). Thus . Define
f () = (sup f() ()) + 1.
The sequence constructed this way is as desired. For example, if is a limit ordinal as
above, then for each < we have { < : f() () f ()} , and so f() <I f .
Now let A = and let I and f be as in the lemma. Suppose that f has a strongly
increasing subsequence of length + under I. Then by proposition 25.29, f has an exact
347
upper bound h such that cf(h()) = + for all < . Now the function k with domain
taking the constant value is clearly an upper bound for f . Hence h I k. Hence there
is an < such that h() k() = , contradiction.
A further fact along these lines is as follows.
def
and yet it is easy to see that there is a sequence f = hf : < 1 i which is <I -increasing.
Hence by Lemma 25.31, this sequence does not have an exact upper bound.
Another fact which helps the intuition on exact upper bounds is as follows.
Lemma 25.32. Let be a regular cardinal, and let I = []< . For each < let
def
f be defined by f () = for all < . Thus f = hf : < i is increasing
everywhere. Claim: f does not have a least upper bound under <I . (Hence it does not
have an exact upper bound.)
Proof. Suppose that h is an upper bound for f under <I . We find another upper
bound k for f under <I such that h is not I k. First we claim
(1) < < ( h()).
In fact, otherwise we get a < such that for all < there is a > such that
> h(). But then |{ < : f () > h()}| = , contradiction.
348
By (1) there is a strictly increasing sequence h : < i of ordinals less than such
that for all < and all we have < h(). Now we define k by setting, for
each < ,
if +1 < +2 ,
k() =
h() otherwise.
To see that k is an upper bound for f under <I , take any < . If +1 , then
h() + 1, and hence k() = f (), as desired. For each < we have k(+1 ) =
< h(+1 ), so h is not I k.
Now we define a partition property. Suppose that I is an ideal over a set A, is an
uncountable regular cardinal > |A|, f = hf : < i is a <I -increasing sequence of
members of A Ord, and is a regular cardinal such that |A| < . The following
property of these things is denoted by () :
()
set
{ < :cf() = and hf : X0 i is strongly increasing under I
for some unbounded X0 }
is stationary in .
Proof. Let S be the indicated set of ordinals .
: Assume () and suppose that C is a club. Choose C0 C of order type
such that hf : C0 i is strongly increasing under I. Let = sup(C0 ). Clearly C S.
: Assume that S is stationary in , and suppose that X is unbounded. Define
C = { : is a limit ordinal and X is unbounded in }.
We check that C is club in . For closure, suppose that < is a limit ordinal and C
is unbounded in ; we want to show that C. So, we need to show that X is
unbounded in . To this end, take any < ; we want to find X such that < .
Since C is unbounded in , choose C such that < . By the definition of C
we have that X is unbounded in . So we can choose X such that < . Since
< < , is as desired. So, indeed, C is closed.
To show that C is unbounded in , take any < ; we want to find an C such
that < . Since X is unbounded in , we can choose a sequence 0 < 1 < of
elements of X with < 0 . Now is uncountable and regular, so supn n < , and it
is the member of C we need.
Now choose C S. This gives us an unbounded set X0 in such that hf : X0 i
is strongly increasing under I. Now also X is unbounded, since C. Hence we can
define by induction two increasing sequences h() : < i and h() : < i such that
349
Note that f < sup(S ) everywhere. Hence by (ii), there is a < such that the function
proj(f , hS (a) : a Ai) <I -bounds f . Now let < . If f (a) < sup(S(a)) and f (a) <
(proj(f , hS (a) : a Ai))(a), then
(proj(f , hS (a) : a Ai))(a) = min(S (a)\f (a))
= min(S(a)\f (a))
= (proj(f , hS(a) : a Ai))(a).
Hence f <I proj(f , hS(a) : a Ai), as desired.
Lemma 25.36. (Bounding projection lemma) Suppose that I is an ideal over A,
> |A| is a regular cardinal, f = hf : < i is a <I -increasing sequence satisfying
() for a regular cardinal such that |A| < . Then f has the bounding projection
property for .
Proof. Assume the hypothesis of the lemma and of the bounding projection property
for . For every < let
f+ = proj(f , S).
Suppose that the conclusion of the bounding projection property fails. Then for every
< , the function f+ is not a bound for f , and so there is a < such that f 6I f+ .
Since f I f+ , we must have < . Clearly for any we have f 6I f+ . Thus
for every we have < (f+ , f ) I + . Now we define a sequence h() : < i of
elements of by recursion. Let (0) = 0. Suppose that () has been defined. Choose
+
, f ) I + for every ( + 1). If is limit and ()
( + 1) > () so that < (f()
has been defined for all < , let () = sup< (). Then let X be the range of this
sequence. Thus
if , X and < , then < (f+ , f ) I + .
Since () holds, there is a subset X0 X of order type such that hf : X0 i is
strongly increasing under I. Let hZ : X0 i be as in the definition of strongly increasing
under I.
For every X0 , let be the successor of in X0 . Note that
< (f+ , f )\(Z Z {a A : f (a) sup(S(a))}) I + ,
and hence it is nonempty. So, choose
a < (f+ , f )\(Z Z {a A : f (a) sup(S(a))}).
Note that this implies that f+ (a ) S(a ). Since > |A|, we can find a single a A such
that a = a for all in a subset X1 of X0 of size . Now for 1 < 2 with both in X1 , we
have
f+1 (a) < f1 (a) f2 (a) f+2 (a).
351
[The first inequality is a consequence of a = a1 < (f+1 , f1 ), the second follows from
1 2 and the fact that
a = a1 = a2 A\(Z1 Z2 ),
and the third is true by the definition of f+2 .]
Thus hf+ (a) : X1 i is a strictly increasing sequence of members of S(a). This
contradicts our assumption that |S(a)| < .
The next lemma reduces the problem of finding an exact upper bound to that of finding a
least upper bound.
Lemma
cardinal, and
the bounding
Then h is an
Proof. Assume the hypotheses, and suppose that g <I h; we want to find <
such that g <I f . By increasing h on a subset of A in the ideal, we may assume that
g < h everywhere. Define Sa = {g(a), h(a)} for every a A. By the bounding projection
def
property we get a < such that f+ = proj(f , hSa : a Ai) is an upper bound for f .
We shall prove that g <I f , as required.
def
h(a) > f+ (a)} I. Also, the set N = {a A : f (a) sup Sa } is in I. Suppose that
a A\(M N ). Then g(a) < h(a) f+ (a) = min(Sa \f (a)), and since g(a) Sa , this
implies that g(a) < f (a). So g <I f , as desired.
Here is our first existence theorem for exact upper bounds.
Theorem 25.38. (Existence of exact upper bounds) Suppose that I is a proper ideal
over A, > |A|+ is a regular cardinal, and f = hf : i is a <I -increasing sequence of
functions in A Ord that satisfies the bounding projection property for |A|+ . Then f has an
exact upper bound.
Proof. Assume the hypotheses. By Lemma 25.37 it suffices to show that f has a
least upper bound, and to do this we will apply Proposition 25.17. Suppose that f does
not have a least upper bound. Since it obviously has an upper bound, this means, by
Proposition 25.17:
(1) For every upper bound h A Ord for f there is another upper bound h for f such that
h I h and {a A : h (a) < h(a)} I + .
In fact, Proposition 25.17(ii) says that there is another upper bound h for f such that
h I h and it is not true that h =I h . Hence {a A : h(a) < h (a)} I and
{a A : h(a) 6= h (a)} I + . So
{a A : h(a) 6= h (a)}\{a A : h(a) < h (a)} I + and
{a A : h(a) 6= h (a)}\{a A : h(a) < h (a)} = {a A : h (a) < h(a)},
352
so (1) follows.
Now we shall define by induction on < |A|+ a sequence S = hS (a) : a Ai of
sets of ordinals satisfying the following conditions:
(2) 0 < |S (a)| |A| for each a A;
(3) f (a) < sup S (a) for all and a A;
(4) If < , then S (a) S (a), and if is a limit ordinal, then S (a) =
<
S (a).
We also define sequences hh : < |A|+ i and hh : < |A|+ i of functions and h() : <
|A|+ i of ordinals.
The definition of S for limit is fixed by (4), and the conditions (2)(4) continue to
hold. To define S 0 , pick any function k that bounds f (everywhere) and define S 0 (a) =
{k(a)} for all a A; so (2)(4) hold.
Suppose that S = hS (a) : a Ai has been defined, satisfying (2)(4); we define
+1
S
. By the bounding projection property for |A|+ , there is a () < such that
def
h = proj(f() , S ) is an upper bound for f under <I . Then
(5) if () < , then h =I proj(f , S ).
In fact, recall that h (a) = min(S (a)\f() (a)) for every a A, using (3). Now suppose
that () < < . Let M = {a A : f() (a) f (a)}. So M I. For any a A\M
we have f() (a) < f (a), and hence
min(S (a)\f() (a)) min(S (a)\f (a));
it follows that h I proj(f , S ). For the other direction, recall that h is an upper
bound for f under <I . So f I h . If a is any element of A such that f (a) h (a)
then, since h (a) S (a), we get min(S (a)\f (a)) h (a). Thus proj(f , S ) I h .
This checks (5).
Now we apply (1) to get an upper bound h for f such that h I h and < (h , h )
I + . We now define S +1 (a) = S (a) {h (a)} for any a A.
(6) If () < , then proj(f , S +1 ) =I h .
For, we have f I h and, by (5), h =I proj(f , S ). If a A is such that f (a) h (a),
h (a) h (a), and h (a) = proj(f , S )(a), then min(S (a)\f (a)) = h (a) h (a)
f (a), and hence
proj(f , S +1 )(a) = min(S +1 (a)\f (a)) = h (a).
It follows that proj(f , S +1 ) =I h , as desired in (6).
Now since |A|+ < , let < be greater than each () for < |A|+ . Define
H = proj(f , S ) for each < |A|+ . Since > (), we have H =I h by (5). Note that
H+1 = proj(f , S +1 ) =I h ; so < (H+1 , H ) I + . Now clearly by the construction we
have S 1 (a) S 2 (a) for all a A when 1 < 2 < |A|+ . Hence we get
(7) if 1 < 2 < |A|+ , then H2 H1 , and < (H2 , H1 ) I + .
353
Now for every < |A|+ pick a A such that H+1 (a ) < H (a ). We have a = a for
all , in some subset of |A|+ of size |A|+ , and this gives an infinite decreasing sequence
of ordinals, contradiction.
The following lemma gives a slight extension of Theorem 25.325.
Lemma 25.39. Suppose that I is a proper ideal over A, |A|+ is a regular
cardinal, f = hf : < i is a <I -increasing sequence of functions in A Ord, |A|+ ,
f satisfies the bounding projection property for , and g is an exact upper bound for f .
Then
{a A : g(a) is non-limit, or cf(g(a)) < } I.
Proof. Let P = {a A : g(a) is non-limit, or cf(g(a)) < }. If a P and g(a) is
a limit ordinal, choose S(a) g(a) cofinal in g(a) and of order type < . If g(a) = 0 let
S(a) = {0}, and if g(a) = + 1 for some let S(a) = {}. Finally, if g(a) is limit but is
not in P , let S(a) = {g(a)}.
Now for any < let
N = {a A : f (a) f+1 (a)} and
Q = {a A : f+1 (a) g(a)}.
Then clearly
() If a A\(N Q ), then f (a) < sup(S(a)).
It follows that {a A : f (a) sup S(a)} N Q I. Hence the hypothesis of
def
the bounding projection property holds. Applying it, we get < such that f+ =
proj(f , hS(a) : a Ai) <I -bounds f . Since g is a least upper bound for f , we get
def
(iii)(i): Assume (iii). By modifying g on a set in the ideal we may assume that g(a)
is a limit ordinal and cf(g(a)) for all a A. Choose a club S(a) g(a) of order type
cf(g(a)). Thus the order type of S(a) is . We prove that () holds. So, assume that
X is unbounded; we want to find X0 X of order type over which f is strongly
increasing
Q under I. To do this, we intend to define by induction on < a function
h S and an index () X such that
(1) h <I f() I h+1 .
(2) The sequence hh : < i is <-increasing (increasing everywhere; and hence it certainly
is strongly increasing under I).
(3) h() : < i is strictly increasing.
After we have done this, the sandwich argument (Lemma 25.28) shows that hf() : < i
is strongly increasing under I and of order type , giving the desired result.
The functions h are defined as follows.
Q
h0 S is arbitrary.
For a limit ordinal < let h = sup< h .
Having defined h , we define h+1 as follows. Since g is an exact upper bound and h < g,
choose () greater than all () for < such that h <I f() . Also, since f <I g for
all < , the projections f+ = proj(f, S) are defined. We define
h+1 (a) =
+
(a)) + 1
max(h (a), f()
h (a) + 1
Thus we have
h <I f() I h+1 , for every .
So conditions (1)(3) hold.
Now we apply some infinite combinatorics to get information about () .
Lemma 25.41. Suppose that:
(i) I is an ideal over A.
(ii) and are regular cardinals such that |A| < and ++ < .
(iii) f = hf : < i is a sequence of length of functions in A Ord that is <I increasing and satisfies the following condition:
For every < with cf() = ++ there is a club E such that for some
with < ,
()
sup{f : E } I f .
if there is such a ,
otherwise.
Now we let (i + 1) be the least member of U which is greater than sup{ : S}. It
follows that
(3) If S and the first case in the definition of holds, then h <I f(i+1) .
def
Now the set F = {(k) : k ++ } is closed, and has order type ++ . Let = sup F .
Then F is a club of , and cf() = ++ . Hence by the hypothesis (iii) of the lemma, there
is a club E and a [, ) such that () in the lemma holds. Note that F E is
club in .
Let D = 1 [F E ]. Since is strictly increasing and continuous, it follows that D
is club in ++ . Hence by (2) there is an D S such that C D. Hence
def
C = [C ] F E
is club in () of order type . Then by () we have
sup{f : C } I f .
Now
(5) For every < both in C , we have sup{f : C ( + 1)} <I f .
To prove this, note that there is an i < ++ such that = (i). Now follow the definition of
(i + 1). There C was considered (among all other closed unbounded sets in the guessing
sequence), and h was formed at that stage. Now
h = sup{f : [C (i + 1)]} sup{f : [C ]} = sup{f : C } I f ,
so the first case in the definition of holds. Thus by (3), h <I f(i+1) . Clearly
(i + 1) , so (5) follows.
Now let h() : < i be the strictly increasing enumeration of C , and set
X0 = {( + 2m) : < , 0 < m }.
356
Y
C (+) , <J bd ,
(+)
{a C0
define f aC0 cf(h(a)) as follows. Since f <J bd h, the set {a C0 : f (a) h(a)} is
bounded, so choose a0 C0 such that for all b C0 with a0 b we have f (b) < h(b). For
EXERCISES
E25.1. Let and be regular cardinals, with < . We define two statements CG(, )
and M AJ(, ):
CG(, ) iff
M AJ(, ) iff
Thus Theorem 25.1 implies that CG(, ++ ) holds for any regular cardinal .
Prove that CG(, ) implies M AJ(, ).
E25.2. Recall the condition :
There are sets A for each < 1 such that for every A 1 the set { < 1 :
A = A } is stationary.
We do not need any properties of , but it is of interest that it follows from V = L and
implies CH.
Prove that implies CG(, 1 ). (It is consistent that CG(, 1 ) fails, but this is not
part of this exercise.)
E25.3. Suppose that A is infinite, h A Ord is a function having only limit ordinals as
values, and F is a nonprincipal ultrafilter on A. [This means
that each cofinite subset of
Q
A is in F ; cofinite = complement of finite.] Prove that aA h(a)/F is infinite.
E25.4. An ultrafilter F on a set A is countably incomplete iff there is a countably infinite
partition P of A Q
such that A\a F for every a A. Show that if F is countably
incomplete, then aA h(a)/F
has size at least 2 . Hint: Let hpi : i i enumerate P ,
S
and for each i let bi = ij< pj ; thus bi F . For each a A, let ca = {i : a bi };
hence ca is a finite set. Use these sets to solve the problem.
E25.5. (Continuing
E25.3) Suppose that F is a countably incomplete ultrafilter on A.
Q
Show that aA h(a)/F is not well-ordered.
E25.6. Let F = {, \1}, a filter on . Define a subset X of which has two distinct
least upper bounds under F .
E25.7. Give an example of a set A, a collection F A Ord, and an ideal I on A, such that
there is a subset X of F which has a unique least upper bound under <I , but no exact
upper bound. (See the example on page 121.)
E25.25. Suppose that P and Q are partially ordered sets. Show that the following two
conditions are equivalent:
(i) There is a function f : P Q such that q Qp P r P [p r implies that
q f (r)].
361
p P p[f (p ) Q q];
q Q q[g (q ) P p].
Then R is a quasiorder on R , and one can let R be the associated partial order.
E25.11. Suppose that , , are cardinals such that (1) = 1 or is infinite; (2) ;
(3) and are infinite. For f, g we define f g iff |{ < : f () > g()}| < .
Let
b.. = min{|B| : B is a -unbounded subset of };
d,, = min{|B| : B is a -cofinal subset of }.
Prove that b,, is regular, and b,, cf(d,, ).
E25.12. Suppose that > |A|+ is a regular cardinal and f = hf : < i is a <I increasing sequence. Consider the following property of f and a regular cardinal such
that |A| < :
Bad : There exist:
(a) nonempty sets Sa of ordinals for a A, each of size less than , such that f <I
hsup(Sa ) : a Ai for all < , and
(b) an ultrafilter D over A extending the dual of I
such that for every < there is a < such that proj(f , S) <D f .
Show that the bounding projection property for implies Bad .
362
E25.13. (Continuing E25.12) Suppose that > |A|+ is a regular cardinal and f = hf : <
i is a <I -increasing sequence. Also suppose that is a regular cardinal and |A| < .
Let Ugly be the following statement:
There exists a function g A Ord such that, defining t = {a A : g(a) < f (a)}, the
sequence ht : < i does not stabilize modulo I. That is, for every there is a > in
such that t \t I + .
Show that if Ugly holds, then ht : < i is I -increasing, i.e., if < < then
t \t I.
Also show that if the bounding projection property holds for , then Ugly.
E25.14. (Continuing E25.13) Suppose that > |A|+ is a regular cardinal and f = hf : <
i is a <I -increasing sequence. Also suppose that is a regular cardinal, with |A| < .
Suppose that Bad and Ugly. Show that the bounding projection property holds for
.
Hint: Suppose that it does not hold. For brevity write f+ for proj(f , S). For all
, < let t = {a A : f+ (a) < f (a)}. Prove:
(1) For every < there is a > such that t I + and for all > we have
t \t I.
Now by (1) define strictly increasing sequences h() : < i and h() : < i such that
()
() ()
for all < , t() I + , () < (), () < () if < < , and t \t() I for all
> (). Prove:
(2) If < < , then
o
n
() ()
()
()
()
+
+
t() t() t() t() \t() a A : f() (a) < f() (a) .
Next, prove
( )
( )
I +.
(3) If 1 < < m < , then t(11 ) . . . t(m
m)
()
By (3), the set I {t() : < } has fip, and hence is contained in an ultrafilter D. This
easily leads to a contradiction.
E25.15 (Continuing E25.14; The Trichotomy theorem) Suppose that > |A|+ is a regular
cardinal and f = hf : < i is a <I -increasing sequence. Also let be a regular cardinal
such that |A| < . Let Good be the statement that there exists an exact upper
bound g for f such that cf(g(a)) for every a A.
Prove that Bad , Ugly, or Good .
References
Abraham, U. and Magidor, M. Cardinal arithmetic. To appear in the Handbook of Set
Theory. 86pp.
Holz, M., Steffens, K., Weitz, E. Introduction to cardinal arithmetic. Birkh
auser
1999, 304pp.
Shelah, S. Cardinal arithmetic. Oxford Univ. Press 1994. 481pp.
363
by (v)
(vii): Let a = min(A). Thus a pcf(A) by (i). Suppose that pcf(A) with < a;
we want to get a contradiction.
Say h[g ] : < i is strictly increasing and cofinal in
Q
Q
A/D. Now define h A as follows: for any b A, h(b) = sup{g (b) + 1 : < }.
Thus [g ] < [h] for all < , contradiction.
(viii): Suppose that pcf(A)\A. Suppose that A is finite, and let a = min(A\).
So a, and if b A a then b < . Thus A = A a. Hence pcf(A) =
pcf(A\a) (A ) by (vi), and so a by (vii). So = a, contradiction. Thus A is
infinite, and this proves (viii).
The following result gives a connection with earlier material; of course there will be more
connections shortly.
364
Proposition
26.2. If A is a collection of regular cardinals, F is a proper filter on A,
Q
and = tcf( A/F ), then pcf(A).
Q
Proof. Let hf : < i be a <F -increasing cofinal sequence in A/F . Let D be any
ultrafilter
containing F . Then clearly hf : < i is a <D -increasing cofinal sequence in
Q
A/D.
Definitions. A set A is progressive iff A is an infinite set of regular cardinals and |A| <
min(A).
If < are ordinals, then (, )reg is the set of all regular cardinals such that
< < . Similarly for [, )reg , etc. All such sets are called intervals of regular
cardinals.
Proposition 26.3. Assume that A is a progressive set, then
(i) Every infinite subset of A is progressive.
(ii) If is an ordinal and A is unbounded in , then is a singular cardinal.
(iii) If A is an interval of regular cardinals, then A does not have any weak inaccessible
as a member, except possibly its first element. Moreover, there is a singular cardinal such
that A is unbounded in and A\ is finite.
Proof. (i): Obvious.
(ii): Obviously is a cardinal. Now A is cofinal in and |A| |A| < min(A) <
. Hence is singular.
(iii): If A, then by (ii), A cannot be unbounded in ; hence is a successor
cardinal, or is the first element of A. For the second assertion of (iii), let sup(A) = +n
with a limit ordinal. Since A is an infinite interval of regular cardinals, it follows that
A is unbounded in , and hence by (ii), is singular. Hence the desired conclusion
follows.
Theorem 26.4. (Directed set theorem) Suppose that A is a progressive set, and is
a regular cardinal such that sup(A) < . SupposeQthat I is a proper ideal over A containing
all proper initial segments of A and such that ( A, <I ) is -directed. Then there exist a
set A of regular cardinals and a proper ideal J over A such that the following conditions
hold:
(i) A [min(A), sup(A)) and A is cofinal in sup(A).
(ii) |A | |A|.
(iii) J contains
bounded subsets of A .
Q all
(iv) = tcf( A , <J ).
Proof. First we note:
() A does not have a largest element.
For, suppose that Q
a is the largest element of A. Note that then I = P(A\{a}). For each
< a define f A by setting
f (b) =
365
if b 6= a,
if b = a.
Q
Since a < , choose g A such that f <I g for all a. Thus {b A : f (b) g(b)}
I, so f (a) < g(a) for all < a. This is clearly impossible. So () holds.
Q
Now by Lemma 8.43 there is a <I -increasing sequence f = hf : < i in A which
satisfies () for every A. Hence by 8.388.40, f has an exact upper bound h A Ord
such that
(1)
for every A. Now the identity function k on A is clearly an upper bound for f , so
h I k; and by (1), {a A : h(a) is non-limit or cf(h(a)) < min(A)} I. Hence by
changing h on a set in the ideal we may assume that
(2)
Q
Now f shows that ( h, <I ) has true cofinality .QLet A = {cf(h(a)) : a A}. By Lemma
8.24, there is a proper ideal J on A such that ( A , <J ) has true cofinality ; namely,
XJ
iff
Clearly (ii) and (iv) hold. By (2) we have A [min(A), sup(A)). Now to show that
A is cofinal in sup(A), suppose that A; we find A such that . In fact,
{a A : cf(h(a)) < } I by (1). Let X = {b A : b < }. Then
h1 [cf 1 [X]] = {a A : cf(h(a)) < } I,
and so X J. Taking any A \X we get . Thus (i) holds. Finally, for (iii),
def
is a regular cardinal > sup(A). Assume that I is a proper ideal over A such that
Q
( A, <I ) is -directed. Then pcf(A).
Proof. We may assume that I contains all proper initial segments of A. For, suppose
that this is not true. Then there is a proper initial segment B of A such that B
/ I. With
a A\B we then have B A a, and so A a
/ I. Let a be the smallest element of A
def
such that A a
/ I. Then J = I P(A Q
a) is a proper ideal that contains all proper
initialQsegments of A a. we claim that ( (A a), J) isQ-directed. For, suppose that
X (A a) with |X| < . For eachQg X let g +
A be such that g+ g and
+
g + (b) = 0 for all b A\a. Choose f A such that g I f for all g X. So if g X
we have
{b A a : g(b) > f (b)} = {b A : g + (b) > f (b)} I P(A a),
and so g J (f (A a) for all g X, as desired.
366
J<
[A] = {D : D is an ultrafilter and cf( A/D) }.
The intersection
is to be understood as being equal to P(A) if there is no ultrafilter D such
Q
that cf( A/D) .
Now suppose that X J< [A] and D is an ultrafilter such that cf( A/D) . If
X
/ D, then A\X D and hence pcf(A\X) 6 , contradiction. Thus X is in the
indicated intersection.
If X is in the indicated intersection, we want to show that A\X . To this end,
supposeQ
that D is an ultrafilter such that A\X D, and to get a contradiction suppose
that cf( A/D) . Then X D by assumption, contradiction.
Note that the argument gives the desired result in case there are no ultrafilters D
as indicated in the intersection; in this case, pcf(A\X) for every X A, and so
J<
[A] = P(A).
367
We claim that f g mod J< [A] for all f F . For, if f (a) > g(a), then > |F | a; thus
{a : f (a) > g(a)} A. Now pcf( A) = A , so {a : f (a) > g(a)} J< [A].
def
hf : < 0 i which satisfies () , with = |A|+ , and such that fi fi for all i < 0 . To
0
do this choose, for every S++
a club E of order type ++ . Now for such a we
define
f = sup({fj : j E } {f }).
For ordinals < 0 of cofinality 6= ++ we apply the inductive hypothesis to get f such
that f <J< f for every < and also f <J< f .
This finishes the construction. By Lemma 8.41, ()|A|+ holds for f , and hence by
Theorem 8.40, f has an exact upper bound g A Ord with respect to <J< . The identity
function on A is an upper bound for f , so we may assume that g(a) a for all a A.
def
Now we shall prove that B = {a A : g(a) = a} J< [A], so a further modification of g
yields the desired upper bound for f .
To get a contradiction, suppose that B
/ J<
Q [A]. Hence pcf(B) 6 , and so there is
an ultrafilter Q
D over A such that B D and cf( A/D) . Clearly D J<
Q[A] = , as
otherwise cf( A/D)
<
.
Now
f
has
length
<
,
and
so
it
is
bounded
in
A/D; say
0
Q
that Q
fi <D h A for all i < 0 . Thus h(a) < a = g(a) for all a B. Now we define
h A by
n
h (a) = h(a) if a B,
0
otherwise.
368
<+
J< [A].
pcf(A)
Proof. The inclusion is clear. Now suppose that X J< [A]. Thus pcf(X) .
Let be the largest element of pcf(X). Then , and pcf(X) + , so X J<+ , and
the latter is a subset of the right side.
Theorem 26.13. (The interval theorem) If A is a progressive interval of regular
cardinals, then pcf(A) is an interval of regular cardinals.
369
Proof. Let = sup(A). By 26.3(iii) and 26.1(vi) we may assume that is singular.
By Theorem 26.11 let 0 = max(pcf(A)). Thus we want to show that
Q every regular cardinal
in (, 0 ) is in pcf(A). By Theorem 26.8, the partial order ( A, <J< ) is -directed.
Clearly J< is a proper ideal, so pcf(A) by Corollary 26.5.
Definition. If is a cardinal |A|, then we define
pcf (A) =
[
{pcf(X) : X A and |X| = }.
Y Y
bB
370
A/Db /E
Q
Hence it suffices by Proposition 8.11 to show that A/F is isomorphic to a cofinal subset
of this iterated ultraproduct. To do this, we consider the Cartesian product B A and
define
HP
H B A and {b B : {a A : (b, a) H} Db } E.
iff
(b,a)BA
hb,aiBA
a we define f
bB (
A/Db ) by setting
hb,aiBA
f =P g
a we have
iff
iff
iff
iff
f =E g .
(
A/Db ). For each b B write h(b) = hb /Db with
bB
Q
hb A. Then define f (b, a) = hb (a). Then
f (b) = hf (b, a) : a Ai/Db = hhb (a) : a Ai/Db = hb /Db = h(b),
as desired. Finally, k preserves order, since
f /P < g/P
iff
iff
iff
iff
So () holds.
Now we apply 7.9, with r, B A, A, P in place of c, A, B, I respectively. Then F is
the Rudin-Keisler projection on A, since for any X A,
XF
iff
iff
{b B : X Db } E
{b B : {a A : r(b, a) X} Db } E
iff
{b B : {a A : (b, a) r 1 [X]} Db } E
iff
r 1 [X] P.
371
Q
Q
Thus by Lemma 8.24 we get an isomorphism h of A/F into (b,a)BA a/P such that
Q
h(e/F ) = he(r(b, a)) : (b, a) B
So now it suffices now to show
Q Ai/P for any e A. Q
that the range of h is cofinal in (b,a)BA a/P . Let g (b,a)BA a. For every b B
Q
define gb
A by gb (a) = g(b, a). Let
Q = min(B). Since B is progressive, we have
|B| <
A/J< [A] (Theorem 26.8), there is a function
Q . Hence by the -directness of
k A such that gb <J< k for each b B. Now b for all b B, so J< Db = , and
so gb <Db k. It follows that g/P <P h(k/D). In fact, let H = {(b, a) : g(b, a) < k(r(b, a))}.
Then
{b B : {a A : (b, a) H} Db } = {b B : {a A : gb (a) < k(a)} Db } = B E,
as desired.
Generators for J<
If I is an ideal on a set A and B A, then I + B is the ideal generated by I {B}; that
is, it is the intersection of all ideals J on A such that I {B} J.
Proposition 26.16. Suppose that I is an ideal on A and B, X A. Then the
following conditions are equivalent:
(i) X I + B.
(ii) There is a Y I such that X Y B.
(iii) X\B I.
Proof. Clearly (ii)(i). The set
{Z A : Y I[Z Y B]}
is clearly an ideal containing I {B}, so (i)(ii). If Y is as in (ii), then X\B Y , and
hence X\B I; so (ii)(iii). If X\B I, then X (X\B) B, so X satisfies the
condition of (ii). So (iii)(ii).
The following easy lemma will be useful later.
Lemma 26.17. Suppose that A is progressive and B A.
(i) P(B) JQ
< [A] = J< [B].
(ii) If f, g A and f <J< [A] g, then (f B) <J< [B] (g B).
Proof. (i): Suppose that X QP(B)JQ
< [A] and X D,
Q an ultrafilter on B. Extend
D to an ultrafilter E on A. Then B/D = A/E, and cf( A/E) < . So X J< [B].
The converse is proved similarly.
Q
(ii): Assume that f, g A and f <J< [A] g. Then
{a B : g(b) f (b)} P(B) J< [A] = J< [B]
by (i), as desired.
Definitions. If there is a set X such that J [A] = J< + X, then we say that is
normal.
372
ultrafilter
Q D over A such that = cf( A /D), f is cofinal in A /D. Extend each f
to g A by setting g (a0 ) = 0. If < < , then
{a A : g (a) g (a)} {a A : f (a) f (a)} {a0 },
and {a A : f (a) f (a)} J< [A ] J< [A] and also {a0 } JQ
< [A] since a0 < ,
so g <J< g . Now let D be an ultrafilterQover A such that = cf( A/D). AsQabove,
A D; let DQ
= D P(A ). Then = cf( A /D ). To show that g is cofinal in A/D,
take any h A. Choose < such that (h A )/D < f /D . Then
{a A : h(a) g (a)} {a A : h(a) f (a)},
so h/D < g /D, as desired.
Thus we can make the assumption as in (1). Suppose that there is no universal
sequence for . Thus
(2) For everyQ<J< -increasing sequence f = hfQ
: < i there is an ultrafilter D over A
such that cf( A/D) = but f is bounded in A/D.
We are now going to construct a <J< -increasing sequence f = hf : < i for each
Q
< |A|+ . We use the fact that A/J< is -directed (Theorem 26.8).
Using this directedness, we start with any <J< -increasing sequence f 0 = hf0 : < i.
373
For limit < |A|+ we are going to define f by induction on so that the following
conditions hold:
(3) fi <J< f for i < ,
(4) sup{f : < } f .
Suppose that fi has been defined for all i < . By -directedness, choose g such that
fi <J< g for all i < . Now for any a A we have sup{f (a) : < } < a, since
< |A|+ < min A a. Hence we can define
f (a) = max{g(a), sup{f (a) : < }}.
Clearly the conditions (3), (4) hold.
Now suppose that f has been defined and is <J< -increasing; we define f +1 . By
(2), choose an ultrafilter D over A such that
Q
(5) cf( A/D ) = ;
(6) The sequence f is bounded in <D .
A/D ;
i < such that i < i for all < |A|+ . Now define
A = {a A : h(a) f (a)}.
By (9) we have A A for < < |A|+ . We are going to get a contradiction by showing
that A A+1 for every < |A|+ .
In fact, this follows from the following two statements.
(12) A
/ D .
374
J<
{{a A : g(a) f (a)} : < }
has fip. For, suppose that it does not have fip. Then there is a finite nonempty subset F
of such that
[
(2)
{a A : g(a) < f (a)} : < } J<
.
F
Proof. Q
First we prove . Let = max(pcf(A)),
and let D be an ultrafilter on A such
Q
that =
cf(
A/D).
Now
for
any
f,
g
A,
if
f
<
g then
Q
Q
Q f <D g. Hence
Q any cofinal
set in ( A, <) is also cofinal in ( A, <D ), and Q
so = cf( A, <D ) cf( A, <).
To prove , we exhibit a cofinal subset of ( A, <) of size . For every pcf(A)
fix a universal sequence f = hfi : i < i for , by Theorem 26.18. Let F be the set of all
functions of the form
sup{fi11 , fi22 , . . . , finn },
where 1 , 2 , . . . , n is a finite sequence of members of pcf(A), possiblyQwith repetitions,
and ik < k for each k = 1, . . . , n. We claim that F is cofinal in ( A, <); this will
complete the proof.
Q
To prove this claim, let g A. Let
I = {> (f, g) : f F }.
(Recall that > (f, g) = {a A : f (a) > g(a)}.) Now I is closed under unions, since
> (f1 , g) > (f2 , g) = > (sup(f1 , f2 ), g).
375
Q
For, let D be an ultrafilter such thatQ = cf( A/D). Then A ( + 1) D, as otherwise
{a A : < a} D, and so cf( A/D) > by 26.1(vii), contradiction. Thus
pcf(A ( + 1)), and hence pcf((A ( + 1)) 6 , proving (1).
(2) 6= .
For, |A| < min(A) , so A is bounded in because is regular. Hence 6= by an
initial remark of this proof.
Now we can complete the proof for the case in which is + 1 for some A. In
this case, actually = . For, we have A J< [A] while
/ J< [A]. Let D
Q A ( + 1)
be an ultrafilter on A such that A ( + 1) D and cf( A/D) . Then A
/ D,
since A J< [A], so {} D, and so . By (1) we then have = .
Now define, for < and a A,
f (a) =
376
if a < ,
if a.
Q
Thus f A. The sequence hf : < i is <J< [A] -increasing, since if < < then
{a A : f (a) f (a)} A Q
J< [A]. It is also universal forQ. For, suppose that D is
an ultrafilter on A such that cf( A/D) = . Suppose that g A. Now |A| < min(A)
def
, so = (supaA g(a)) + 1 is less than . Now {a A : g(a) < f (a)} = A\ D, so
g <D f , as desired. Finally, hf : < i satisfies () , since it is itself strongly increasing
under J< [A]. In fact, if < < and a A\, then f (a) = < = f (a), and
A J< [A].
Hence the case remains in which < and A is unbounded in . Let hf : < i be
any universal
sequence for . We now apply Lemma 8.43 with I replaced by J< [A]. (Recall
Q
that ( A, I< [A] is -directed by Theorem 26.8.) This gives us a <J< [A] -increasing
sequence f = hf : < i such that f < f+1 for every < and () holds for f , for
every regular cardinal such that ++ < and {a A : a ++ } J< [A]. Clearly
then f is universal for . If is a regular cardinal less than , then ++ < < , and
{a A : a ++ } J< [A] by the minimality of , so the conclusion of the lemma
holds.
Lemma 26.23. Suppose that A is a progressive set of regular cardinals, B A, and
is a regular cardinal. Then the following conditions are equivalent:
(i) J [A] = J< [A] + B.
Q
(ii) B J [A], and for every ultrafilter D on A, if cf( A/D) = , then B D.
Proof. (i)(ii): Assume
Q (i). Obviously, then, B J [A]. Now suppose that D is
an ultrafilter on A and cf( A/D) = . By Corollary 26.9(ii) we have J [A] D 6= =
J< [A] D. Choose X J [A] D. Then by Proposition 26.16, X\B J< [A], so since
J< [A] D = , we get B D.
(ii)(i): is clear. Now suppose that X J [A]. If X B, then obviously
X J<Q
[A] + B. Suppose that X 6 B, and let D Q
be any ultrafilter such that X\B D.
Then cf( A/D) since pcf(X) + , and so cf( A/D) < by the second assumption
in (ii). This shows that pcf(X\B) , so X\B J< [A], and hence X J< [A] + B by
Proposition 26.16.
Theorem 26.24. If A is progressive, then every member of pcf(A) has a generator.
Proof. First suppose that we have shown the theorem if |A|+ < min(A). We show
how it follows when |A|+ = min(A). The least member of pcf(A) is |A|+ by 26.1(vii).
We have J<|A|+ [A] = {} and J|A|+ [A] = {, {|A|+}} = J<|A|+ [A] + |A|+ , so |A|+ is a
|A|+ -generator. Now suppose that pcf(A) with > |A|+ . Let A = A\{|A|+}. By
26.1(vi) we also have pcf(A ). By the supposed result there is a b A such that
J [A ] = J< [A ] + b. Hence, applying Lemma 26.6 to + and {|A|+ },
J [A] = J [A ] + {|A|+ }
= J< [A ] + b + {|A|+ }
= J< [A] + b,
as desired.
377
Thus we assume henceforth that |A|+ < min(A). Suppose that pcf(A). First we
take the case = |A|++ . Hence by Lemma 26.1(vii) we have A. Clearly
J [A] = {, {}} = {} + {} = J< [A] + {},
so has a generator in this case. So henceforth we assume that |A|++ < .
By Lemma 26.22, there is a universal sequence f = hf : < i for such that ()|A|+
holds. Hence by Lemma 8.40, f has an exact upper bound h with respect to <J< . Since
h is a least upper bound for f and the identity function on A is an upper bound for f , we
may assume that h(a) a for all a A. We now define
B = {a A : h(a) = a}.
Thus we can finish the proof by showing that
()
that
Q g <J< f . Hence g <D f , and clearly g =D g , so g <D f . This proves that
cf( A/D) = . So we have proved in ().
For , we argue by contradiction and suppose that there is an X J such that
X
/ J< [A] + B. Hence (by Proposition 26.16), X\B
/ J< . Hence J
Q< {X\B} has
fip, so we extend it to an ultrafilter D. Since D QJ< = , we have cf( A/D) . But
also X D since X\B D, and
Q X J , so cf( A/D) = . By the universality of f it
follows that f is cofinal in cf( A/D). But A\B D, so {a A : h(a) < a} D, and so
there is a < such that h <D f . This contradicts the fact that h is an upper bound of
f under <J< .
Now we state some important properties of generators.
Lemma 26.25. Suppose that A is progressive, pcf(A),Qand B A.
(i) If B is a -generator, D is an ultrafilter on A, and cf( A/D) = , then B D.
(ii) If B is a -generator, then
/ pcf(A\B).
(iii) If B J and
/ pcf(A\B), then B is a -generator.
(iv) If = max(pcf(A)), then A is a -generator on A.
(v) If B isQa -generator, then the restrictions to B of any universal sequence for
are cofinal in ( B, <J< [B] ).
Q
(vi) If B is a -generator, then tcf( B, <J< [B] ) = .
378
( (f B, h))\
def
(f B, h) (B\C) = ;
(ii) Q
pcf(A), F has a -generator on A as an element, and J<
F.
(iii) cf( A/D) = for every ultrafilter D extending F .
Proof. (i)(iii): obvious.
(iii)(ii): Obviously pcf(A). Let B be a -generator on A. Suppose that
B
/
QF . Then there is an ultrafilter D on A such that A\B D and D extends F . Then
cf( A/D) = by (iii), and this contradicts Lemma 26.25(i).
Q
(ii)(i): LetQ
B F be a -generator. By Lemma 26.25(vi) we have tcf( B/J< ) =
X B
B ,
and {} N pcf(X).
Corollary 26.30. Suppose that A is progressive, hB : pcf(A)i is a generating
sequence
S for A, and X A. Suppose that is any infinite cardinal. Then X J< [A] iff
X N B for some finite subset N of pcf(A).
Proof. : Assume that X J< [A]. Thus pcf(X) , and Theorem 26.29 gives
the desired conclusion.
: QAssume that a set N is given as indicated. Suppose that pcf(X). Say
= cf( A/D) with X D. Then B D for some N . By the definition of
generator, B J [A], and hence < . Thus we have shown that pcf(X) , so
X J< [A].
Lemma 26.31. Suppose that A is progressive and hB : pcf(A)i is a generating
sequence for A. Suppose that D is an ultrafilter on A. Then thereQis a pcf(A) such
that B D, and if is minimum with this property, then = cf( A/D).
Q
Proof. Let = cf( A/D). Then pcf(A) and B D by Lemma 26.25(i).
Suppose that B D with < . Now B J J< , contradicting Lemma 26.25(viii),
applied to .
Lemma 26.32. If A is progressive and also pcf(A) is progressive, and if pcf(A)
and B is a -generator for A, then pcf(B) is a -generator for pcf(A).
Proof. Note by Theorem 26.15 that pcf(pcf(B)) = pcf(B) and pcf(pcf(A\B)) =
pcf(A\B). Since B J [A], we have pcf(B) + , and hence pcf(pcf(B)) + and
381
Then:
(i) max(pcf
Q F (A)) exists.
(ii) cf( A/F ) = max(pcf F (A)).
(iii) If B pcf F (A) is progressive, then pcf(B) pcf F (A).
(iv) If A is a progressive interval of regular cardinals with no largest element, and
F = {X A : A\X is bounded}
is the filter of co-bounded subsets of A, then pcf F (A) is an interval of regular cardinals.
Proof. (i): Clearly pcf F (A) pcf(A), and so if = max(pcf(A)), then A F
J<+ [A]. Hence we can choose minimum such that F J< [A] 6= . By Corollary 26.12,
S
has fip. For, suppose not. Then there is a finite subset G of G such that gG {a A :
g(a) < f (a)} F . Let h = supf G f . Then g < h mod F and h G,Qcontradiction.
Thus () has fip, and we let D be an ultrafilter containing it. Let = cf( A/D). Then
pcf F (A), and f g mod D for all f G. Since the members of a universal sequence
for are in G, this is a contradiction. This completes the proof of (ii).
For (iii), we look at the proof of Theorem 26.15. Let F be the ultrafilter named F at
the beginning of that proof. Since B pcf F (A), each b B is in pcf F (A), and hence the
ultrafilters Db can be taken to extend F . Hence F F , and so pcf F (A), as desired
in (iii).
Finally, we prove (iv). Let 0 = min(pcf F (A)) and 1 = max(pcf F (A)), and suppose
that is a regular
Q cardinal such that 0 < < 1 . Let D be an ultrafilter such that
F D and cf( A/D) = 1 . Then by Corollary 26.9(ii), D J<1 = , so J1 D. Thus
Q
+
F J<
F J<
generates a proper filter G. Since ( A, <J< ) is
D, so F J<
1
383
Q
-directed by Theorem 26.8, so is ( A, <G ). Note that if a A, then {b A : a < b} F .
It follows that sup(A) 0 < . Hence we can apply Theorem 26.4 and get a subset A
of A (since A is an interval of regular cardinals) and a proper ideal K over
A such that
Q
A is cofinal in A, K contains all proper initial segments of A , and tcf( A, <K ) = .
Let hf : < i be strictly increasing and cofinal mod K. Extend K to a filter L on A,
and extend each function f to a function f+ on A. Then clearly hf+ : < i is strictly
increasing and cofinal mod L, and L contains F . This shows that pcf F (A).
EXERCISES
E26.1. A set A of regular cardinals is almost progressive iff A is infinite, and A |A| is
finite. Prove the following:
(i) Every progressive set is almost progressive.
(ii) If A is an infinite set of regular cardinals and |A| < , then A is almost progressive.
(iii) Every infinite subset of an almost progressive set is almost progressive.
(iv) If A is almost progressive, then A\|A|+ is progressive, A |A|+ is finite, and
A = (A\|A|+) (A |A|+ ).
(v) If is an ordinal, A is almost progressive, and A is unbounded in , then
is a singular cardinal.
(vi) If A is an almost progressive interval of regular cardinals, then A does not have
any weak inaccessible as a member, except possibly its first element. If in addition A is
infinite, then there is a singular cardinal such that A is unbounded in and A\ is
finite.
E26.2. Show that Theorem 26.4 and Corollary 26.5 also hold if A is almost progressive.
E26.3. Suppose that A is progressive and |A| is regular. Show that sup(pcf (A))
sup(A) .
E26.4. Suppose that A is progressive, and is a cardinal such that A is unbounded in
. Show that + pcf(A).
E26.5. Assume GCH and suppose that A is progressive. Show that
pcf(A) = A { + : a cardinal, A unbounded in }.
E26.6. Suppose that is a limit ordinal and A is an infinite set such that |A| < cf().
Determine all regular cardinals such that l = cf(A /D) for some ultrafilter D on A.
E26.7. Suppose that is a regular cardinal and D is an ultrafilter on such that \ D
for every < . Show that cf( /D) > .
E26.8. Let A be a progressive set of regular cardinals and an infinite cardinal. Suppose
that J is an ideal on A. Show that the following conditions are equivalent:
(i) J = I< [A].
(ii) J is the intersection of all ideals K on A which satisfy the following condition:
ForQeach X A with X
/ K there is an ultrafilter D on X such that D K = and
cf( X/D) .
384
385
Y
Q
A
B | 2max(|tr cl(A),|tr cl( B))
2max(|tr cl(A),2
22
|tr cl(B)
by (viii)
S
Finally, for (xi), note that tr cl(pcf(A)) = pcf(A) pcf(A).
Now |pcf(A)| 2|A|
Q
2|tr cl(A)| by Theorem 9.27. If pcf(A), then clearly | A|, so the desired conclusion
now follows from (ix).
We also need the fact that some rather complicated formulas and functions are absolute
for sets H . Note that H is transitive. Many of the indicated formulas are not absolute
for H in general, but only under the assumptions given that is much larger than the
sets in question.
Lemma 27.4. Suppose that is an uncountable regular cardinal. Then the following
formulas (as detailed in the proof ) are absolute for H .
(i) B = P(A).
387
(ii) D is an ultrafilter on A.
(iii) is a cardinal.
(iv) is a regular cardinal.
(v) and are cardinals, and = + .
(vi) = |A|.
Q
(vii) B = A.
(viii) A = B C.
(ix) A is infinite, if is uncountable.
(x) A is an infinite Q
set of regular cardinals and D is an ultrafilter on A and is a
regular cardinal and f A and f is strictly increasing and cofinal modulo D, provided
that 2|tr cl(A)| < .
(xi) A is an infinite set of regular cardinals, and B = pcf(A), if 2|tr cl(A)| < .
(xii) A is an infinite set of regular cardinals and f = hJ< [A] : pcf(A)i, provided
that 2|tr cl(A)| < .
(xiii) A is an infinite set of regular cardinals and B = hB : pcf(A)i and
|tr cl(A)|
pcf(A)(B is a -generator), if 22
< .
Proof. Absoluteness follows by easy arguments upon producing suitable formulas, as
follows.
(i): Suppose that A, B H . We may take the formula B = P(A) to be
x B[y x(y A)] x[y x(y A) x B].
The first part is obviously absolute for H . If the second part holds in V it clearly holds in
H . Now suppose that the second part holds in H . Suppose that x A. Hence x H
and it follows that x B.
(ii): Assume that A, D H . We can take the statement D is an ultrafilter on A
to be the following statement:
X D(X A) A D X, Y D(X Y D)
/D
Y X D[X Y Y A Y D] Y [Y A Y D (A\Y ) D].
Again this is absolute because Y A implies that Y H .
(iii): Suppose that H . Then
is a cardinal
iff
Note here that if f is a function with dmn(f ) = and rng(f ) , then f , and
hence f H .
(iv): Assume that H . Then
is a regular cardinal iff
iff
A,
(xi): Assume that 2|tr cl(A)|) < , and A, B H . Let (A, D, , f ) be the statement
of (x). Note:
(1) If (A, D, , f ), then D, , f H , and max(, |tr cl(A)|) 2|tr cl(A)| .
In fact, D P(A), so tr cl(D) tr cl(P(A)) = P(A) tr cl(A), and so |tr cl(D)|
Q <
by Lemma
27.3(iii); so D H . Now f is a one-one function from into
A,
Q
Q so
|tr cl(A)|
| A| < , and hence H and max(, |tr cl(A)|) 2
. Finally, f A,
so it follows that f H .
Thus (1) holds. Hence the following equivalence shows the absoluteness of the statement in (xi):
A is an infinite set of regular cardinals and B = pcf(A)
iff
A is infinite, and A( is a regular cardinal) BDf (A, D, , f )
Df [(A, D, , f ) B].
(xii): Assume that 2|tr cl(A)|) < . By Lemma 27.3(xi) we have pcf(A) H . Hence
A is an infinite set of regular cardinals f = hJ< [A] : pcf(A)i
iff
A is infinite and A( is a regular cardinal and
f is a function and B[B = pcf(A) B = dmn(f )]
dmn(f )X A[A f () iff C[C = pcf(X) C ]]
|tr cl(A)|
Proof.
ChN (a) = sup(N a)
[
= (N a)
=
[ [
<
(N a)
<
ChN (a).
<
Hence for every a A for which f (a) < ChN (a), there is an a < such that f (a) <
ChNa (a). Hence the existence of as indicated follows.
395
Note that (2) says that B\{a A : ChN (a) f (a)} J< [A].
Now
/ pcf(A\B) by
Q Lemma 9.25(ii), and hence J< [A\B] = J [A\B]. So by
Theorem 9.8 we see Q
that (A\B)/J< [A\B] is + -directed, so hf (A\B) : < i has
an upper bound h (A\B). We may assume that h N , by the usual argument. Hence
f (A\B) <J< [A\B] h < ChN (A\B);
hence {a A\B : ChN (a) f (a)} J< [A], and together with (2) and using Lemma
9.25(ix) this finishes the proof.
Now supposeQthat A is progressive, is a limit ordinal, f = hf : < i is a sequence of
members of A, |A|+ cf() < min(A), and E is a club of of order type cf(). Then
we define
hE = sup{f : E}.
396
aA
def
Hence there is an a A such that M = { < |A|+ : hE (a) > hE+1 (a)} has size |A|+ .
Now hE (a) hE (a) whenever < < |A|+ , so this gives an infinite decreasing sequence
of ordinals, contradiction.
The function g of this lemma is called the minimal club-obedient bound of f .
Corollary 27.15. Suppose
Q that A+is progressive, is a limit ordinal, f = hf : < i
is a sequence of members of
A, |A| cf() < min(A), J is an ideal on A, and f is
<J -increasing. Let g be the minimal club-obedient bound of f . Then g is a J -bound for
f.
Now suppose that A is progressive, pcf(A), and is a regular cardinal such that
|A| < < min(A). We say that f = hf : < i is -minimally obedient for iff f is a
universal sequence for and for every < of cofinality , f is the minimal club-obedient
bound of f .
A sequence f is minimally obedient for iff |A|+ < min(A) and f is minimally
obedient for every regular such that |A| < < min(A).
Lemma 27.16. Suppose that |A|+ < min(A) and pcf(A). Then there is a
minimally obedient sequence for .
Proof. By Theorem 9.18, let hf0 : < i be a universal sequence for . Now
by induction we define functions f for < . Let f0 = f00 , and choose f+1 so that
max(f , f0 ) < f+1 .
For limit < such that |A| < cf() < min(A), let f be the minimally club-obedient
bound of hf : < i.
For other limit < , use the -directedness (Theorem 9.8) to get f as a <J< -bound
of hf : < i.
Thus we have assured the minimally obedient property, and it is clear that hf : < i
is universal.
397
Lemma 27.17. Suppose that A is progressive, and is a regular cardinal such that
|A| < < min(A). Also assume the following:
(i) pcf(A).
(ii) f = hf : < i is a -minimally obedient sequence for .
(iii) N is a -presentable elementary substructure of H , with large, such that
, f, A N .
Then the following conditions hold:
(iv) For every N \N we have:
(a) cf() = .
(b) There is a club C of of order type such that f = sup{f : C} and
C N.
(c) f (a) N a for every a A.
(v) If = ChN (), then:
(a) N \N ; hence we let C be as in (iv)(b), with f = sup{f : C}.
(b) f N for each C.
(c) f (ChN A).
(vi) = ChN () and C is asQin (iv)(b), with f = sup{f : C}, and B is a
generator, then for every h N A there is a C such that (h B) <J< (f B).
Proof. Assume (i)(iii). Note that A N , by Lemma 27.5(ix).
For (iv), suppose also that N \N . Then by Lemma 27.7 we have cf() = ,
and there is a club E in of order type such that E N . By (ii), we have f = fC for
some club C of of order type . By the minimally obedient property we have fC = fCE ,
and thus we may assume that C E. For any C and a A we have f (a) N by
Lemma 27.5(viii). So (iv) holds.
For (v), suppose that = ChN (). Then N \N because |N | = < min(A) .
For each C we have f N by Lemma 27.5(viii). For (c), if a A, then f (a) =
supC f (a) ChN (a), since f (a) N a for all C.
Next, assume the hypotheses of (vi). By Lemma 27.11, f is persistently cofinal in ,
so by Lemma 27.13, B is a -generator. By Lemma 9.25(v) there is a C such that
h B <J< f B . Now B =J< [A] B by Lemma 9.25(xi), so
{a B : h(a) f (b)} (B\B ) {a B : h(a) f (b)} J< [A].
We now define some abbreviations.
H1 (A, , N, ) abbreviates
A is a progressive set of regular cardinals, is a regular cardinal such that |A| < <
min(A), and N is a -presentable elementary substructure of H , with big and A N .
H2 (A, , N, , , f, ) abbreviates
H1 (A, , N, ), pcf(A), f = hf : < i is a sequence of members of
and = ChN ().
P1 (A, , N, , , f, ) abbreviates
H2 (A, , N, , , f, ) and {a A : ChN (a) f (a)} is a -generator.
398
A, f N ,
P2 (A, , N, , , f, ) abbreviates
H2 (A, , N, , , f, ) and the following hold:
(i) f (ChN A). Q
Q
(ii) For every h N A there is a d N A such that for any -generator B,
(h B) <J< (d B) and d f .
Thus H1 (A, , N, ) is part of the hypothesis of Lemma 27.17, and H2 (A, , N, , , f, )
is a part of the hypotheses of Lemma 27.17(v).
Lemma 27.18. If H2 (A, , N, , , f, ) holds and f is persistently cofinal for , then
P1 (A, , N, , , f, ) holds.
Proof. This follows immediately from Lemma 27.13.
Lemma 27.19. If H2 (A, , N, , , f, ) holds and f is -minimally obedient for ,
then both P1 (A, , N, , , f, ) and P2 (A, , N, , , f, ) hold.
Proof. Since f is -minimally obedient for , it is a universal sequence for , by
definition. Hence by Lemma 27.11 f is persistently cofinal for , and so property P1
follows from Lemma 27.18.
For P2 , note that , A N since f N , by Lemma 27.5(vii),(ix). Hence the hypotheses of Lemma 27.17(v)Qhold. So (i) in P2 holds by Lemma 27.17(v)(c). For condition (ii),
suppose that h N A. Take B and C as in Lemma 27.17(vi), and choose C such
that h B <J< f B. Let d = f . Clearly this proves condition (ii).
The following obvious extension of Lemma 27.19 will be useful below.
Lemma 27.20. Assume H1 (A, , N, ), and also assume that = ChN () and
def
Then
(i) b is a -generator.
(ii) There is a set b b such that:
(a) b N ;
(b) b\b J< [A];
(c) b is a -generator.
Proof. (i) holds by Lemma 27.21(i). For (ii), by Lemma 27.12 choose < such
that, for every a A,
f (a) < ChN (a) iff
(1)
Now by (i) of P2 (A, , N, , , f, ) we have f (ChN A). Hence by (1) we see that for
every a A,
ab
(2)
iff
A such
0
i
(ChN (A\Ai+1 )) = sup{(f(
(A\Ai+1 )), . . . , (f(
(A\Ai+1 ))}.
0)
i)
400
Since the cardinals are decreasing, there is a first i such that Ai+1 = , and then the lemma
is proved. To start, A0 = A and 0 = max(pcf(A)). Clearly 0 N . Now suppose that
i and Ai are defined, with Ai 6= 0. By Lemma 27.22(i) and Lemma 9.25(x), the set
i
{a A (i + 1) : ChN (a) = f(
(a)}
i)
j
(5) f(
(ChN A) for all j i.
j)
Now suppose that a A\Ai+1 . If a Ai , then by (4), a b1 , and so by (3), ChN (a) =
i
f(
(a), and (1) holds for a. If a
/ Ai , then A 6= Ai , so i 6= 0. Hence by the inductive
1)
hypothesis for (1),
i1
0
(a)},
(a), . . . , f(
ChN (a) = sup{f(
i1 )
0)
and (1) for a follows by (5).
The cofinality of ([] , )
First we give some simple properties of the sets [] , not involving pcf theory.
Proposition 27.24. If are infinite cardinals, then
()
|[] | = cf([] , ) 2 .
Proof. Let = cf([] , ), and let hYi : i < i be an enumeration of a cofinal subset
of cf([] , ). For each i < let fi be a bijection from Yi to . Now the inequality in ()
is clear. For the other direction, we define an injection g of [] into P(), as follows.
Given E [] , let i < be minimum such that E Yi , and define g(E) = (i, fi [E]).
Clearly g is one-one.
Proposition 27.25. (i) If 1 < 2 , then
cf([]1 , ) cf([]2 , ) cf([2 ]1 , ).
(ii) cf([+ ] , ) = + .
+
(iii) If + , then cf([] , ) cf([] , ) + .
(iv) If 1 < 2 , then cf([1 ] , ) cf([2 ] , ).
401
403
0
if < 0 , otherwise.
f0 ()
Note here that the superscript
Lemma 27.30. Assume the above notation. Then f00 f00 ,e for all 0 pcf(A)
and all 0 0 .
Proof. Take any 0 0 and any A. If 0 , then f00 ,e () = f00 (). Suppose
that < 0 . If the above maximum does not exist, then again f00 ,e () = f00 (). Suppose
the maximum exists. Now (0 , ) F 0 , (A), so
f00 () = El0 , (0 ) max({El0 ,...,n (0 ) : (0 , . . . , n ) F 0 , }) = f00 ,e ().
Lemma 27.31. Suppose that A is progressive, is a regular cardinal such that
def
|A| < < min(A), and f = hf : pcf(A)i is a sequence of sequences f such that f
is -minimally obedient for . Assume also H1 (A, , N, ) and f N .
Then also f e N .
Proof. The proof is a more complicated instance of our standard procedure for going
from V to H to N and then back. We sketch the details.
404
Assume the hypotheses. In particular, A N . Hence also pcf(A) N . Also, |A| < ,
so A N . Now clearly F N . Also, El N . (Note that El depends upon A.) Then by
absoluteness,
H |= g g is a function, dmn(g) = pcf(A) 0
f00 ()
max({El0 ,...,n (0 ) : (0 , . . . , n ) F 0 , })
g() =
0
f0 ()
pcf(A)0 0 A
if 0 ,
if < 0 ,
and this maximum exists,
if < 0 , otherwise.
and
It follows that
El0 ,...,n ,1 ,...,a (0 ) = ChN (a),
and (iii) then gives f00 ,e (a) = ChN (a).
Definition. Suppose that A is progressive and A P pcf(A). A system hb : P i
of subsets of A is transitive iff for all P and all b we have b b .
Theorem 27.34. Suppose that H1 (A, , N, ), and f = hf : pcf(A)i is a system
of functions, and each f is -minimally obedient for . Let f e be the derived elevated
array. For every 0 pcf(A) N put 0 = ChN (0 ) and define
b0 = {a A : ChN (a) = f00 ,e (a)}.
Then the following hold for each 0 pcf(A) N :
(i) b0 is a 0 -generator.
(ii) There is a b0 b0 such that
(a) b0 \b0 J<0 [A].
(b) b0 N (each one individually, not the sequence).
(c) b0 is a 0 -generator.
(iii) The system hb : pcf(A) N i is transitive.
Proof. Note that H2 (A, , N, , 0 , f 0 ,e , 0 ) holds by Lemma 27.31. By definition,
minimally obedient implies universal, so f 0 is persistently cofinal by Lemma 27.11. Hence
by Lemma 27.24, f 0 ,e is persistently cofinal, and so P1 (A, , N, , 0 , f 0 ,e , 0 ) holds by
Lemma 27.18. Also, by Lemma 27.19 P2 (A, , N, , 0, f 0 , 0 ) holds, so the condition
P2 (A, , N, , 0, f 0 ,e , 0 ) holds by Lemmas 27.30 and 27.33(ii). Now (i) and (ii) hold by
Lemma 27.22.
Now suppose that 0 pcf(A) N and b0 . Thus
ChN () = f00 ,e (),
where 0 = ChN (0 ). Write = ChN (). We want to show that b b0 . Take any
a b . So ChN (a) = f,e (a). By Lemma 27.33(iv) we get f00 ,e (a) = ChN (a), so a b0 ,
as desired.
Localization
Theorem 27.35. Suppose that A is a progressive set. Then there is no subset B
pcf(A) such that |B| = |A|+ and, for every b B, b > max(pcf(B b)).
Proof. Assume the contrary. We may assume that |A|+ < min(A). In fact, if we
know the result under this assumption, and now |A|+ = min(A), suppose that B pcf(A)
with |B| = |A|+ and b B[b > max(pcf(B b))]. Let A = A\{|A|+}. Then let
B = B\{|A|+ }. So by Proposition 9.1(vi) we have B pcf(A ). Clearly |B | = |A |+
and b B [b > max(pcf(B b))], contradiction.
Also, clearly we may assume that B has order type |A|+ .
406
Let E = A B. Then |E| < min(E). Let = |E|. By Lemma 27.16, we get an
array hf : pcf(E)i, with each f -minimally obedient for . Choose N and so
that H1 (A, , N, ), with N containing A, B, E, hf : pcf(E)i. Now let hb :
pcf(E) N i be the set of transitive generators as guaranteed by Theorem 27.34. Let
b N be such that b b and b \b J< .
Now let F be the function with domain {a A : B(a b )} such that for each
such a, F (a) is the least B such that a b . Define B0 = { B : a dmn(F )(
F (a)}. Thus B0 is an initial segment of B of size at most |A|. Clearly B0 N . We let
0 = min(B\B0 ); so B0 = B 0 .
Now we claim
(1) There exists a finite descending sequence 0 > > n of cardinals in N pcf(B0 )
such that B0 b0 . . . bn .
We prove more: we find a finite descending sequence 0 > > n of cardinals in
N pcf(B0 ) such that B0 b0 . . . bn . Let 0 = max(pcf(B0 )). Since B0 N ,
def
Let be the ordinal such that = sup(A). Now pcf(A) is an interval of regular cardinals
by Theorem 9.13. So pcf(A) contains all regular cardinals in the set {+ : < +4 }.
Now we are going to define a strictly increasing continuous sequence hi : i < +3 i of
ordinals less than +4 .
1. Let 0 = +3 .
S
2. For i limit let i = j<i j .
3. Now suppose that j has been defined for all j i; we define i+1 . For each k S
(+)
(+)
let ek = {+j : j Ck (i + 1)}. Thus ek is a subset of pcf(A). If max(pcf(ek )) <
(+)
++4 , let k be an ordinal such that max(pcf(ek )) < +k and k < +4 ; otherwise
let k = 0. Let i+1 be greater than i and all k for k S, with i+1 < +4 . This is
possible because |S| = +3 . Thus
(+)
(+)
(3) For every k S, if max(pcf(ek )) < ++4 , then max(pcf(ek )) < +i+1 .
This finishes the definition of the sequence hi : i < +3 i. Let D = {i : i < +3 }, and
let = sup(D). Then D is club in . Let = + . Thus has cofinality +3 , and
it is singular since > 0 = +3 . Now we apply Corollary 9.35: there is a club C0 in
(+)
such that + = max(pcf(C0 )). We may assume that C0 [ , ). so we can write
C0 = {+i : i D0 } for some club D0 in . Let D1 = D0 D. So D1 is a club of . Let
E = {i +3 : i D1 }. It is clear that E is a club in +3 . So by (2) choose k E S
such that Ck E. Let Ck = { Ck : there is a largest Ck such that < }. Set
+
B = {+
+i : i Ck }. We claim that B is as desired. Clearly |B| = .
()
Let i Ck be largest such that i < j. So i + 1 j. We consider the definition given above
of i+1 . We defined ek = {+l : l Ck (i + 1)}. Now
(+)
(4) B +
+j ek .
+
For, if b B +
+j , we can write b = +l with l Ck and l < j. Hence l i and so
(+)
b = +
+l ek . So (4) holds.
Now if l Ck (i + 1), then l E, and so l D1 D0 . Hence +l C0 . This
(+)
(+)
(+)
(+)
shows that ek C0 . So max(pcf(ek )) max(pcf(C0 )) = + < ++4 . Hence by
(+)
(3) we get max(pcf(ek )) < +i+1 . So
(+)
max(pcf(B +
+j )) max(pcf(ek )) by (4)
< +
+i+1
+
+j ,
which proves ().
Theorem 27.38. If is a singular cardinal such that < , then
cf([ ]|| , ) < ||+4 .
409
Proof. Let = ||+ and A = (, )reg . By Lemma 27.25(iii) and Lemma 27.28,
+
< max
||
cf()
+
, ||+4 .
(1)
In fact, let B [ ]|| be cofinal and of size cf([ ]|| , ). Now cf() ||, so
[ ]cf() =
[Y ]cf() ,
Y B
410