Upload
duongquynh
View
214
Download
1
Embed Size (px)
Citation preview
Interpreting Frege's Grundgesetze in an Adaptation of Quine's New Foundations
Ryan Beaton
Department of Mathematics and Statistics,
McGill University, Montréal
Québec, Canada
August, 2004
A thesis submitted to the Faculty of Graduate Studies and Research
in partial fulfillment of the requirements of the degree of
Master of Science
Copyright © Ryan Beaton, 2004
1+1 Library and Archives Canada
Bibliothèque et Archives Canada
Published Heritage Branch
Direction du Patrimoine de l'édition
395 Wellington Street Ottawa ON K1A ON4 Canada
395, rue Wellington Ottawa ON K1A ON4 Canada
NOTICE: The author has granted a nonexclusive license allowing Library and Archives Canada to reproduce, publish, archive, preserve, conserve, communicate to the public by telecommunication or on the Internet, loan, distribute and sell th es es worldwide, for commercial or noncommercial purposes, in microform, paper, electronic and/or any other formats.
The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.
ln compliance with the Canadian Privacy Act some supporting forms may have been removed from this thesis.
While these forms may be included in the document page count, their removal does not represent any loss of content from the thesis.
• •• Canada
AVIS:
Your file Votre référence ISBN: 0-494-12395-8 Our file Notre référence ISBN: 0-494-12395-8
L'auteur a accordé une licence non exclusive permettant à la Bibliothèque et Archives Canada de reproduire, publier, archiver, sauvegarder, conserver, transmettre au public par télécommunication ou par l'Internet, prêter, distribuer et vendre des thèses partout dans le monde, à des fins commerciales ou autres, sur support microforme, papier, électronique et/ou autres formats.
L'auteur conserve la propriété du droit d'auteur et des droits moraux qui protège cette thèse. Ni la thèse ni des extraits substantiels de celle-ci ne doivent être imprimés ou autrement reproduits sans son autorisation.
Conformément à la loi canadienne sur la protection de la vie privée, quelques formulaires secondaires ont été enlevés de cette thèse.
Bien que ces formulaires aient inclus dans la pagination, il n'y aura aucun contenu manquant.
Abstract
We first give a modern presentation of the formai language of Frege's Grundgesetze.
There follows a comparison of the motivations for Frege's "Cumulative Type The
ory" and for Russell's Type Theory and of the basic arithmetical definitions in each.
Quine's New Foundations and, in particular, extensions of Jensen's modification,
NFU, are introduced and consistency results are discussed. Finally, an interpretation
is given in an NFU framework of a modified form of the Grundgesetze theory. It
is shown that an "Axiom of Counting" necessary for arithmetic in NFU is needed
in an analogous way for arithmetic in our interpretation; it is further demonstrated
that from the statement of this axiom in NFU, the appropriate analogue is provable
for our interpretation. The development of arithmetic in an NFU framework is seen
essentially to be that intended by Frege in the Grundgesetze.
Résumé
En premier lieu, nous donnons une présentation moderne du langage du texte Grundge
setze de Frege. Nous procédons à une comparaison entre la "théorie cumulative des
types" de Frege et la "théorie des types" de Russell et des définitions arithmétiques
de chacune. Nous discutons, par la suite, des "Nouvelles Fondations" de Quine
et de certaines extensions de la modification, NFU, de Jensen. Finalement, nous
interprétons, dans le cadre de NFU, une forme modifiée de la théorie des Grundge
setze. Nous démontrons qu'un "axiome de dénombrement," essentiel à l'arithmétique
dans NFU, est d'une manière analogue nécessaire à l'arithmétique dans notre in
terprétation. Nous prouvons qu'à partir de l'énoncé de cet axiome dans NFU, l'énoncé
analogue est démontrable pour notre interprétation. Il est clair que le développement
de l'arithmétique dans le cadre de NFU est essentiellement celui visé par Frege dans
les Grundgesetze.
III
Acknowledgments
l thank Professor Michael Makkai for his many hours of help and am deeply appre
ciative of his enthusiastic support for this project. l am grateful for the financial
support l have received from the National Sciences and Engineering Research Coun
cil (NSERC) of Canada and from Dr. Richard H. Tomlinson, benefactor of McGill
University. The generous contributions of NSERC and Dr. Tomlinson have funded
my two years as a Master's student at McGill. Thank you finally to aU the professors
of the department from whom l've had the opportunity to learn and to the friendly
staff of the McGill Mathematics Department.
v
Table of Contents
Abstract
Résumé
Acknowiedgments
Introduction
1 The FormaI and Meta-Language of the Grundgesetze
1.1 The "Cumulative Type Theory" Behind the Grundgeseize
1.2 The Theory GG .
1.2.1 Types ..
1.2.2
1.2.3
Basic definitions: primitives, terms and sentences
Axioms and Rules of Inference .
1.3 Interpretation of Primitives ....
2 Type Theory and the Grundgesetze: Basic Arithmetic and the Para-
i
iii
v
1
5
6
11
11
12
17
20
doxes 25
2.1 Frege's Levels and Russell's Types. 26
2.1.1 The Theory of Types . . . . 26
2.1.2 Frege's Theory of Levels and The Definition of Number
2.2 Formally Defining Cardinality .............. .
VIl
27
30
2.2.1 The Application Operator and "Reduction"
2.2.2 Bijection and Equinumerosity . . . . . . . .
2.2.3 Successor, Ancestral and the Natural Numbers .
2.3 Russell's Paradox . . . . . . . . . . . . ........ .
3 New Foundations, Modifications and Consistency Results
3.1 Quine's Original Theory . . . . . . . . . . . . . . . . . .
3.1.1 A Concise Statement of Quine's New Foundations
3.2 The Consistency of NF and Variants ...
3.2.1 Specker Models and Models of NF
3.2.2 Jensen's New Foundations with Urelements .
3.3 Constructing Models of NFU .......... .
3.3.1 Weak Zermelo Set Theory and Extensions
3.4 Arithmetic in NFU . . . . . . .
3.4.1 The Axiom of Counting
4 Interpreting GGree in NFUT
4.1 The Language of GGree . ..
4.2 NF UT, Interpreting GGree Primitives, and F-stratijication
4.2.1 Extending the Language of NFU: NFUT .
4.2.2 Translating GGree into NFUT
4.2.3 F-Stratification ....
4.3 Recovering Frege's Arithmetic
4.3.1 Basic Laws of GGree
4.3.2 Arithmetic in GGree
4.3.3 The Axiom of Counting
Conclusion
Vlll
30
35
38
41
43
43
46
47
47
49
51
52
55
56
59
59
61
62
67
71
77
77
80
84
93
Introduction
Largely ignored in his own lifetime, recognition came posthumously to Gottlob Frege.
It is now generally acknowledged that if anyone can rightfully be called the father
of modern logic, then surely it is Frege. In 1879, his first publication, Begriffss
chrift ("Concept Writing"), introduced, among other things, the modern notions of
quantification, formaI language and formaI derivation, and provided precise logical
definitions of "sequence," "following in a sequence," etc., paving the way for his log
ical reconstruction of arithmetic.
Begriffsschrift was followed by several publications of a more philosophical tone
supporting Frege's "logicist program" of reducing basic mathematics to a branch
of pure logic. This line of thinking culminated in the Grundgesetze der Arithmetik
("The Basic Laws of Arithmetic"), the first volume of which appeared in 1893 and was
written by Frege at the height of his intellectual powers. Satisfied with his lengthy
philosophical reflection, he wrote the Grundgesetze as an ultimate vindication for
logicism, a mathematical book which develops arithmetic from purely logical founda
tions. As the second volume, treating rational and real numbers, was being published
in 1902, Frege received a letter from the young Bertrand Russell, pointing out the
inconsistency in the formaI system of the Grundgesetze.
The Grundgesetze came to be seen as a brilliant failure, including by Frege himself,
who published litt le after 1902, certainly nothing of the same scope or ambition. The
extent to which his work does indeed accomplish the founding of mathematics in logic
1
and sets the stage for modern mathematicallogic has slowly come to be appreciated.
There are now countless books discussing Frege's influence on a variety of fields,
including, of course, logic.
However, it remains an interesting question to what extent Frege's actual devel
opment of arithmetic in the Grundgesetze is consistent and viable. SpecificaUy, is it
possible to interpret a sufficient part of the Grundgesetze theory, hereafter denoted
by "GG," in a consistent set theoryl in such a way that Frege's basic definitions and
proofs regarding cardinals, natural numbers, arithmetic, etc., can be reproduced?2
Frege can hardly be faulted for having implicitly assumed a naive set theory in 1893;
(un)fortunately, he had the brilliance to capture in a precise, formaI system the in
consistency that had been lurking. There was nowhere to hide the inconsistency once
spotted and Frege's system coUapsed; however, this alone does not mean the model
of arithmetic he describes and attempts unsuccessfuUy to axiomatize is incoherent.
After aU, Frege's definitions of cardinal, successor, natural number, etc., are very
natural, more so perhaps than the usual ZF definitions.
The main purpose of this essay is twofold. For one, we aim to give a set-theoretic
interpretation of GGree , a modified form of the theory GG, that will aUow, as much
as possible, the development of arithmetic as it is carried out in the Grundgesetze.
A ZF -style set theory is completely unsuited to this. For instance, Frege takes the
natural number n to be the class of aU classes having n elements.3 In particular,
IThat is, one we are reasonably confident is consistent.
2Boolos [4], Wright [33] and others have written extensively on set theories and theories of second
order logic equivalent to fragments of GG; however we are interested here especially in interpreting
Frege's own language and giving definitions of natural number, etc., in this language, so that we may,
for one thing, bring out conceptual similarities between the Grundgesetze, Russell's Type Theory and
Quine's New Foundations.
3Roughly speaking. This is indeed the conception of natural number Frege arrived at in Grundla-
gen der Arithmetik ("Foundations of Arithmetic") (1884) and still held at the time of the Grundge
setze. However, technical features of the for mal language he presents in the Grundgesetze force Frege
2
the cardinal "One" is "the class of all unit classes." No such thing exists in ZFC,
and while there is such a class in, e.g., Morse-Kelley set theory, we still cannot form
the class of all natural numbers, so defined, in MK. The iterative conception of set
motivating these set theories is completely incompatible with GG.
On the other hand, there are deep conceptual similarities between Frege's system
and Type Theory, as first introduced by Whitehead and Russell. This is especially
clear from Frege's discussion of "concept levels," from which 1 extract a "Cumulative
Type Theory," denoted "CTT" (see Appendix). Russell's theory of types, even as
simplified by Ramsey, can be rather awkward in mathematical practice, as it main
tains a strict separation of types and strong restrictions on expressions considered
grammatical or meaningful. There is also a proliferation of "copies of sets," e.g.,
there exists an empty set and a universal set for every type 2': 1. A further sim
plification was proposed by Quine in his New Foundations article. The system of
New Foundations, NF, rejects the separation of types to the extent that it formally
contains a single type; however, the key notion of "stratification" is drawn from Type
Theory. Roughly speaking, a formula is stratified if it contains no free variable in
stantiating different types, i.e., within this single formula. For instance, x E x is an
unstratified formula. The details are given in Chapter 3. The main feature of NF
is that it restricts comprehension to stratified formulas. Unlike Type Theory, NF
recognizes x E x as a meaningful formula; however, no comprehension principle is
available from which to form a set of objects satisfying it (even if these objects are
restricted to the elements of sorne set).
This stratified comprehension principle creates a set theory very much different
from ZFC, and ideal for interpreting GGree . The consistency of NF is still very much
in question, but modifications have yielded interesting consistency results. It is in
an extension, our NF UT, of the best known such modification, Jensen's NFU (New
to modify the definition slightly; this is discussed fully in Chapter 2.
3
Foundations with Urelements) , that we give our interpretation of GGree . It should
be mentioned that, as the modern day doyen of NF studies, Randall Holmes, puts it,
NFU is not being promoted as a replacement for ZFC, but "those who are interested
in studying the foundations [of mathematics] should be familiar with alternatives." [16]
ZFC itself was developed in the aftermath of Frege's great "failure," which led to the
first deep studies of the role of the paradoxes in mathematicallogic. An understanding
of alternative set theories and logical foundations of arithmetic, su ch as Frege's, can
only deepen our understanding of set theory in general. The second of the principle
aims of this essay is therefore to draw the conceptual similarities between GG, TT,
NF and NFU4, in particular, to show that arithmetic in NFU is arithmetic as Frege
intended.
The main text of this essay is divided roughly as follows. Chapter 1 presents the
formaI language GG. The presentation is preceded by an informaI discussion of CTT,
which is itself given a formaI presentation in the Appendix. Chapter 2 discusses the
relation between CTT and simple Type Theory, TT, and basic arithmetical definitions
therein. Russell's paradox is discussed and derived for GG. Chapter 3 is a discussion
of NF and its modifications and extensions. Chapter 4 presents our main results. The
primitives of GGree are defined in NFUT. The fragment of the GG axioms recovered in
GGree is discussed. The development of arithmetic in the reconstruction is presented,
with basic results relating this development to the usual development in NFU proper.
Finally, we give some concluding remarks.
4Cocchiarella [5] is a fairly in-depth philosophical and logical analysis of these similarities and
a nice complement to some parts of the present essay. In some instances, we develop and make
mathematically precise certain statements in [5]; see 2.2.1 and the Appendix. Indeed, this present
essay may be seen as exploring a middle ground between the two paths of research in logicjFrege's
studies as found in Boolos [4] and as found in Cocchiarella [5].
4
Chapter 1
The FormaI and Meta-Language of
the Grundgesetze
The main purpose of this chapter is to introduce the formaI language of the Grundge
setze and to explain some of its more prominent features. First, however, some
discussion is needed of the background logical theory to which Frege's study has
brought him at the time of the Grundgesetze. This theory, which Frege outlines in
the opening sections of the Grundgesetze is extremely unwieldy for the purposes of
even the most basic arithmetic and Frege greatly simplifies things before presenting
the formaI language of his system. However, an understanding of Frege's "Cumula
tive Type Theory" will be important for discussions in Chapter 2 and subsequent.
To this, the first section of this chapter is therefore devoted.
5
1.1 The "Cumulative Type Theory" Behind the
Grundgesetze
Much of the brilliance and originality in his work, Frege derives ultimately from a
rigourous analysis of language. In logic, Frege takes an analysis of statements as his
starting point. A major advance in logic came with his separation of statement into
argument and function rather than subject and predicate. While it may be hard
to appreciate the depth of such a change now, this break with thousands of years
of tradition paved the way for the first satisfactory presentation of quantification in
logic, in Frege's 1879 publication, Begriffsschrift. By the time of the Grundgesetze,
this linguistic analysis led Frege to a theory of "concept levels," which he discusses at
the beginning of the Grundgesetze (cf. §§1-5, 21-33), and which l will call "Cumulative
Type Theory," or "CTT."! For instance, let us consider the statement, "Hydrogen is
lighter than oxygen." If we remove the object name "Hydrogen" from this statement,
replacing it with the variable symbol "x" , we obtain the name of a function, Frege says,
namely "x is lighter than oxygen" , whose value is true or false for every appropriate
argument. As only an object name in the place of "x" is appropriate (to obtain
a meaningful statement), Frege calls the function named above a unary first-order
function. And clearly then "x is lighter than y" is the name of a binary first-order
function. The types of these two functions are to be sharply distinguished from each
other as well as from the types of second-order functions. One example of the latter is
"There exists an x such that f(x)", where x is an object variable and f is a (variable)
function of x. Now, only the name of a unary first-order function is appropriate
1 l have not come across the term "Cumulative Type Theory" anywhere, but it seems very natural
and may weIl have been coined previously. In the Appendix, l give a presentation of the formaI
language CTT, based on my interpretation of Frege's informaI discussion. Terminology and notation
in the Appendix rest to some extent on definitions and explanations in the main text and it is assumed
the reader has read the main chapters before the Appendix.
6
to take the place of "f" in the above. Clearly we can continue on with third-Ievel
functions, etc., (though examples may become less natural). Note that Frege also
allows "mixed level" functions: e.g., "x is such that f(x)" is a name for a function
requiring two arguments, the first of object type and the second of unary first-order
function type. Again, the type of such a function is to be sharply distinguished from
the types of the functions discussed above.
Three points should be mentioned. First, in relation to statements, Frege uses
the word "function" in a literaI sense. That is, his analysis of statements led him
to the conclusion that concepts, such as "is lighter than oxygen", really are special
cases of functions, of the same nature as those familiar in mathematics. Just as "x
+ 2" is a name for a function taking arguments of object type to values of object
type2 , so Frege views the function named by "x is lighter than oxygen". The latter
is a special case of first-order unary function. For such a first-order function, that
is, for one whose value for every argument is a truth value, Frege reserves the name
concept. A first-order binary function whose value for every pair of arguments is a
truth value Frege calls a relation. In particular, then, Frege is led to the conclusion
that truth values, the Ttue and the False, or T and -.l, are objects of the same nature
as, e.g., numbers. This has, for one thing, the effect that logical connectives (officially:
material implication, negation and univers al quantification in the Grundgesetze) are
functions, and each statement is a name for T or -.l. In fact, Frege includes a further
truth-functional connective, the truth function --x, a first-order concept such that,
for each object a, --a = T if a = T and --a = -.l if a i= T 3. Since the logical
2Frege insists that every such function must be defined for all objects "that there really are,"
including, e.g., Julius Caesar. Though this causes Frege philosophical spasms, it is never an issue in
mathematical practice, and we may keep in mind a universe of purely mathematical entities when
working with GG, as Frege himself inevitably does. 3 Aczel [1] gives an interesting analysis of --x as an "internaI truth definition" that is the
ultimate source of inconsistency in GG.
7
connectives are functions, and GG will contain only functions as primitives4, there
are no formulas in the language, only terms. As we will see, any term, cp, can occur
in the argument position of --x. Therefore, for any term cp, --cp equals T or
1- (for aIl values of the free variables in CP); we say that --cp is a GG statement.
An axiom is interpreted as saying that a certain statement equals T and rules of
inference as saying that, given a (finite) set of statements, one of which is identified
as the conclusion and the remaining as premises, if, for a given set of values of aIl free
variables in the set of statements, we have that each of the premises equals T, then
so does the conclusion. Derivations are defined as usual; see 1.2.2.
The second point is that the number of fun ct ion types will multiply very rapidly,
as any two functions are of the same type iff they have the same arity, n, and, for
each i ~ n, the ith variable of each is of the same type. By the definition of function
in GG, the value of any function for each appropriate set of arguments is an object.
So, for example, the differential operator, d~~) Iy is of mixed level, needing a unary
first-order function, F(x) and an object, a, as arguments (for the variables f(x) and
y, respectively) in order to return an object. This framework is very cluttered and
completely impractical for "actually doing" mathematics, even simple arithmetic.
The situation is resolved in large part through the introduction of an abstraction
operator. Frege takes it as a law of logic that to every first-order concept there
corresponds a collection, or class, itself of object type, who se members are exactly
those objects which fall under (satisfy) the given concept. Since his analysis has
brought Frege to the conclusion that concepts are a special case of unary first-order
functions, he extends this basic law of logic to aIl unary first-order functions, with
4Including equality. We will denote Frege's "functional" equality symbol by "~,, and keep "=" for
the usual equality predicate. Note that the above discussion of --x concerns the meta-theoretical
interpretation of --x and the symbol "=" has its usual meaning. Note also that there are no
constant names for T and ..L in GG; they are used here, as by Frege, to discuss the meta-theoretical
interpretation of the GG primitive symbols.
8
the slight modification that to each of these functions there corresponds a graph,
or "course-of-values" ("Werthverlauf"), as Frege puts it. In the special case of a
concept, we therefore have the graph of its characteristic function rather than sim ply
the collection of objects falling under it. The abstraction operator is a second-order
unary function, taking first-order unary functions to their corresponding graphs, the
graphs themselves being objects. But in this way, any second-order unary function has
a corresponding first-order function, in which the variable ranges over the graphs of
first-order functions, and this first-order function has a corresponding graph (object).
Thus we can "reduce" all higher order functions to their corresponding graphs5 . This
is explained in detail in Chapter 2, where an explicit example of such a reduction
is also given in discussing Frege's definition of "natural number". Iteration of the
abstraction operator, as explained in 1.3, is used for functions of higher arity. (Note
the difference between higher arity and higher order). Frege also adopts a strict law
of extensionality: two courses-of-values are identical iff their corresponding functions
have the same value for every argument. The picture of the logical universe as given
by CTT is thus nicely tidied up (or at least contained), and Frege formally retains
only three types: object, first-order unary function and first-order binary function.
However, with an unlimited comprehension principle and a strict law of extensionality
in hand, Frege is clearly on the edge of inconsistency. The final push cornes in the
form of a unit class operator, from which Frege can define an application operator;
the resulting inconsistency is demonstrated formally at the end of Chapter 26.
5Cocchiarella [5] also mentions this "collapse" of aU function types to object type, or "level 0";
in Chapter 2 we prove a theorem making precise this idea of collapse, or reduction.
6In the appendix to the Grundgesetze, Vol. II [10] that deals with Russell's paradox, Frege
returns to the initial framework of CTT and asks whether it ultimately is necessary to keep ail
levels separate, with a different "type" of equality for every level. Of course, he quickly despairs
at placing mathematics on such a hopelessly convoluted foundation. To his credit, even before the
discovery of any paradox, he called attention the his Basic Law V, dealing with abstraction and
9
The third and final point l wish to mention is Frege's concern with "securing
a reference" for aIl the terms of his language. There are two reasons a reader of
the Grundgesetze der Arithmetik might come to believe that Frege do es not want to
commit fully to a rigourously formaI language. One reason is the scathing attack Frege
launches on "formalists."7 Another is Frege's insistence that each term of his language
pick out a single, specifie mathematical entity, its reference, existing in a logical realm
independent of human thought. Frege clearly intends that these references are fixed
and provide the only correct interpretation of his language. In point of fact, however,
Frege is entirely committed to formaI rigour and seems to have been the first to
present a formaI language, with a fully explained grammar and rules of inference
based solely on syntactical features. Yet, as a diehard realist, he is not satisfied that
some interpretation of his language can be given; rather, he requires that the correct
interpretation of his language can be given, e.g. of the term in GG for the cardinal one
as the cardinal one, of the functional equality symbol as the equality function, and so
on. Frege is opposed to trends he sees in certain formalists of ignoring the question of
mathematical meaning and, ultimately, truth, and instead simply looking for clever
rules for shifting symbols around. For Frege, the focus has to be on logical and
mathematical truth, from which consistency would follow8 . While he made the point
against "pure formalists", Frege's own mathematical realism seems rather far-fetched
from the modern point of view and l have no intention of discussing it further. l simply
wish to point out that Frege's insistence on a specifie semantical interpretation in no
way compromises his formaI rigour, and while l have "modernised" the presentation of
the language GG below, this is basically a notational modification. The essentials of
extensionality, saying that if there \vere any weakness in or point of attack against his project of
reducing mathematics to logic, it must surely be this law. ([11] Introduction, p.VII) 7See especially [10] §§86-l37.
8See [8], pp.4-24, for an exchange between Frege and Hilbert, in which Frege questions Hilbert's
emphasis on consistency over truth.
10
the following definitions are all found in Frege's own presentation in the Grundgesetze,
and his notions of formaI language, formaI deduction, etc. are the modern ones. (Or,
rather, the modern notions are Frege's.)
1.2 The Theory GG
1.2.1 Types
The language of the Grundgesetze is a multi-sorted language and we introduce the
following variable types:
• object variables: xo, Xl, '" , X n , .... We say these variables are of object
type or level O.
• unary function variables: fl, il, ... , f~,
• binary function variables: f5, if , '" , f~, Both unary and binary
function variables are said to be of (first-order) function type or level 1.
We will follow Frege in distinguishing bound variables from free variables.9 This
distinction isn't necessary, but it allows a clearer statement of substitution rules
below. The variable symbolslo listed above will therefore stand for free variables and
we introduce the following bound variable types:
9Frege in fact further distinguishes between variables bound by quantifiers and those bound by
the abstraction operator (see 1.2.2 for definition of the abstraction operator). Our free variable
symbols correspond to Frege's Latin letters, our bound variable symbols correspond to both his
lower-case German letters (bound-by-quantifier variable symbols) and his lower-case Greek letters
(bound-by-abstraction variable symbols).
lOWe are going to adopt the simplifying convention that all symbols of the formaI language,
and expressions built up from these symbols, name themselves. E.g., --(x ~ x) both equals
T (since indeed --(x ~ x) = T) and names the expression --(x ~ x). This allows us to
avoid complications such as Quine's "quasi-quotes" for expressions like 1/J ---> cjJ or VxcjJ, which mix
11
• bound object variables:
object type or level O.
... ,
• bound unary function variables: fo 1 , il,
• bound binary function variables: R, li,
. . .. We say these variables are of
o •• ,
o •• , j;, .... Both unary and binary
function variables are said to be of (first-order) function type or level 1.
In practice, we will generally drop the arity superscript on function symbols when
writing terms, as the arity of any function will be clear from the context.
1.2.2 Basic definitions: primitives, terms and sentences
We now introduce nine primitive operation symbols, which Frege calls the "pri
mary function names" ("ursprüngliche N amen von Functionen") of the Grundgesetze.
These are the primitives of GG, and include the functional analogues of the usuallog
ical connectives. The interpretation which Frege intends for these function symbols
is discussed in 1. 3 below.
primitive unary operations:
• truth function: --x,
• negation: -.:., x, 11
• unit class function: \x,
• universalization over objects: \jif(i) ,
meta-theoretical names of expressions ('Ij;, qy) and parts of expressions themselves (-->, V, x). 1 thank
Professor Makkai for pointing this simplification out to me.
n A dot over a logical connective indicates that Frege's "functional analogue" to the usual con-
nective is meant, and serves to distinguish the two.
12
• universalization over unary functions: VP'iJ!(p) ,
• universalization over binary functions: V j2'iJ! (j2),
• abstraction: 5:f(x).
primitive binary operations:
• equality: x ~ y,
• implication: x -----+ y.
In the Grundgesetze formalism, a second-order function, such as 'iJ!(fl) or 'iJ!(j2)
ab ove , is given by a term <p and specified free function variable( s) (e.g. JI or j2),
as explained below. Frege does not state things quite like this, as he does not define
"term" in the way we will. His discussion is somewhat informaI, but it is clear that, for
him, the type (or linguistic role) of a meaningful expression (which we caU a term) <p
in his language is determined by specified Latin letters (i.e., free variables) occurring
in <p (these correspond to the variables in our Xq,; see Definition 1.1 below)12. In -1
connection with this, we need to explain notations such as <p[j1i~~l which occur below.
In what foUows, given expressions "(, Œi, f3i built from the primitive symbols of
GG (i.e., GG expressions), "(['K"'" tl is the expression obtained from "( as foUows:
For each i ::; n:
(i) If f3i is an ob ject variable (free or bound), then for each occurrence of f3i in "(
we substitute Œi.
12We take the slightly more liberal view that a function may also be determined by a term, cP,
and specified variable(s) not necessarily occurring in cP.
13
(Note for use in the following two clauses that, for expressions a, 'ljJ, X and object
variable z, w, the expressions a[~l and a[~, ~l are defined by (i). E.g., for a unary
(resp., binary) function variable f, we have that f(z)[~l (resp., f(z, w)[~,~]) is the
expression f('ljJ) (resp., f('ljJ,x))·)
(ii) If (Ji is f(w), with f a unary function variable (free or bound) and w a free
object variable, and 'ljJ is any GG expression13, then for each occurrence of the form
f('ljJ) in '1 we substitute ad~]' this last expression being defined by (i). For example,
(f(a) ~ f(a))[(;(~~)l is the expression (a ~ b) ~ (a ~ b);
(iii) If (Ji is f(u, v) with f a binary function variable (free or bound) and u, v
free object variables, and 'ljJ, X are GG expressions14 , then for each occurrence of the
form f('ljJ, X) in '1 we substitute ai[~, ~], this last expression being defined by (i). For
example, (f(a, b) ~ f(a, b))[j(~~~l stands for (a ~ b) ~ (a ~ b).
In every use we make of expressions of the form '1 [~: ' ... , ~ l, the (Ji fall under one
of the above three cases, such that 'I[~, ... ,~l is al ways well-defined by the above
three clauses. Note that if '1 and each ai are assumed to be terms in the ab ove , then
one can prove by an entirely straightforward and tedious induction, that 'I[~, ... , ~l
is also a term, according to the following definition of "term."
We give an inductive definition of (well-formed) term, with a simultaneous induc
tive definition of the set of free variables (sfv)) X qn of a term cP·
Definition 1.1. A GG term and the set of free variables of a GG term are defined
as follows.
13 As we have yet to define "term," 'l/J, like "(, here stands for any GG expression; however, in
practice, 'l/J and "( will always be terms, except in cases of binding variables, such as \ixr,b[~], where
we have the bound variable x in the role of 'l/J above.
14Same note for 'l/J and X as for 'l/J above.
14
1. Each free object variable, Xi, is a term with sfv {Xi}'
2. if cP is a term with sfv XrjJ, then each of -- cP, ~ cP, and \cP is a term with sfv
XrjJ;
3. if cP and 'ljJ are terms with sfv XrjJ and X'Ij;, respectively, then each of cP ~ 'ljJ
and cP ~ 'ljJ is term with sfv XrjJ U X'Ij;;
4. if cP and 'ljJ are terms with sfv XrjJ and X'Ij;, respectively, then each of P (cP) and
r (cP, 'ljJ) is a term, with sfv XrjJ U {fI} and sfv XrjJ U X'Ij; U {f2}, respectively,
where p is a free unary function symbol and r a free binary function symbol;
5. if cP is a term with sfv XrjJ, X is a free object variable and i; is a bound object
variable not occurring in cP, then each of Vi;cP[~l and icP[~l is a term with sfv
X4>-{x};
6. if cP is a term with sfv XrjJ, f is a free unary (resp., binary) function variable, f is a bound unary (resp., binary) function variable not occurring in cP, and x, y
are free object variables, then vjlcP[{~~~}l (resp., vj2cP[{~~~:~m is a term with
sfv XrjJ - {f};
7. nothing else is a term.
Note that these rules do not allow a term to have a quantifier that falls within the
scope of another quantifier binding the same variable. Frege himself does allow such
"nesting" of quantifiers; we have simply found it easier to state substitution rules
below with such nesting of quantifiers ruled out. Frege adopts a standard set of rules
for scope and binding, as do we, without repeating them here. We will call a term cP
closed if it has no free variables, i.e., if XrjJ is empty.
Every term cP is an object name in the sense that cP may be meaningfully substi
tuted for any free object variable in a meaningful GG expression. Specifically, cP may
15
be meaningfully substituted for any free object variable occurring in a GG term. Sub
stitution rules are given formally below. A unary first-order function is given in GG
by a term cp together with a specified free object variable x (which may or may not be
in X</». A binary first-order function is given in GG by a term cp together with two
specified free object variables x, y (which may or may not be in X</». A unary second
order function is given in GG by a term cp together with a specified free function
variable f (which may or may not be in X</». EtC. 15 To this extent, we find a trace of
Cumulative Type Theory in GG and this way in which terms represent objects and
functions in GG determines the exact statement of the rules of substitution; see 1.2.3.
In meta-theoretical discussions, we will generally use the lower case letters a, b, c, ...
to indicate objects, capitals F(x), F(x, y), G(x), G(x, y), H(x), H(x, y), ... to indicate
first-order functions, and capital Greek letters 1>(f), 1>(f, g), 'IJ(f), 'IJ(f, g), ... to in
dicate second-order functions.
Now, on the intended interpretation of --x, for any term, cp, -- cp is equal
(for aIl values of the free variables in cp) to T or -L and, as mentioned, we call -- cp a
G G statement. Frege reserves the term sentence ("Begriffsschriftsatz") for a sequence
of symbols of the form f- cp, with cp a term. The symbol f- consists of two parts for
Frege: the "judgment stroke" ("Urtheilstrich"), l, and the "horizontal stroke," --.
The latter is a function symbol as already described, while the former indicates the
assertion that the term to the right is (a name for) T. l believe this is how the
now ubiquitous turnstile was born. Frege calls an expression of the form -- cp a
"thought" ("Gedanke"), rather than a sentence, in the sense that we may contemplate
a "thought" without asserting it. In Frege's notation, an expression of the form f- cp
15Frege does not state or see things quite this way. Using our own terminology, we may say that
he cons id ers closed terms to be object names, terms <p such that Xq, contains a single object variable
to be unary first-order function names, terms <p such that Xq, contains exactly two object variables
to be binary first-order function names, terms <p such that Xq, contains a single function variable to
be unary second-order function names, etc ..
16
appears only if cp can be formally deduced in GG. This judgment stroke is a convenient
notation for marking theorems of GG, and we use it as such.
Officially, we adopt the following definitions. Any expression of the form --cp, where cp is a GG-term, is called a GG-statement or simply a statement. A formal
deduction, L:, is a finite sequence, CPl, CP2, .. , CPn, of statements, such that each CPi is
an axiom of GG, or is the conclusion of an instance of a rule of inference of GG all of
whose premises precede CPi in L:. (The axioms and rules are given in 1.2.3 immediately
below.) L: is a formal deduction of length n if L: is a formaI deduction and L: is a
sequence of length n. L: is a formal deduction of cp if L: = (CP1, CP2, ... ,CPn) is a formaI
deduction of length n and cp is CPn. cp is a GG-theorem, or simply theorem, if there
exists a formaI deduction of cp. From the following axioms and rules, it is easy to
see that if a theorem, cp, contains free variables, it is equivalent to its (functional)
universal closure, in the usual sense that the two statements are inter-derivable16 . We
will sometimes write f- cp to indicate that cp is a theorem.
1.2.3 Axioms and Rules of Inference
We come now to the axioms and rules of inference.
Axioms
Frege states six "Basic Laws" ("Grundgesetze") for his system. They are:
16This rais es a rather inelegant point. Consider the statement --(x ~ x). Technically, its
universal closure is Vx(--(x ~ x)) and the statement of its universal closure is
--Vx(--(x ~ x)). But the latter is equal to Vx(x ~ x), provably in GG (of course, technically
everything is provable in GG, but we mean "provably" in a more straightforward sense here), and . .
by definition in the meta-theoretical interpretation ofV. There is thus no harm in calling Vx(x ~ x)
the universal clos ure of --(x ~ x), and referring to it as a statement. Exactly the same point
that is made here for V can be made for ~, similarly for ~.
17
• Basic Law 1.
a) f- x ~ (y ~ x)
b)f-x~x.
Frege shows in Begriffsschrift that b) is in fact derivable from a) in his logic;
however he states b) here for economy of derivations.
• Basic Law II.
a) f- vyf(y) ~ f(x)
b) f- VgM{3(g((3)) ~ M{3(J((3)).
b) is actually an axiom schema: f and gare (first-order) unary function variables
and M{3 is a second-order unary function (of a first-order unary function), which
is equivalent formally to saying it is a GG term, </>, together with a specified
unary function variable, f. Given such </> and f, an instance of Basic Law II
b) is obtained by substituting Vg</>[~~~~l for VgM{3(g((3)) and </> for M(3(J((3)).
Since Frege doesn't allow quantification over second-order functions, b) must
be stated as a schema 17.
• Basic Law III. Substitutability of equal terms.
f- g(x ~ y) ~ g(vj(](y) ~ ](x))).
Note that f- (x ~ y) ~ vj(](y) ~ ](x)) is a special case.
• Basic Law IV.
f- (~ (-x ~~ y)) ~ (-x ~ -y).
This law is intended to formalize the notion that there are two truth values,
i.e., two possible values of the function --x. It certainly captures the notion
that there are ai mosi two; indeed a revision of GG proposed by Frege in light
17The f3 subscript in expressions of the form M(3(g(f3)) indicates simply that M(3(g(f3)) is a function
of the (first-order) unary function 9 only, and that if the argument in any occurrence of g(x) in
M(3(g(f3)) is changed, then the resulting second-order function M(3*(g(f3*)) is different.
18
of the paradoxes fails because it cannot prove that f--':" (--x ==-.:., x), i.e., that
T -=/: ~18.
• Basic Law V. Extensionality.
f- [if (x) == ig(x)] == [vx(J(x) == g(x))].
The interpretation is that the LHS (of the main equality symbol) is equal to T
iff the RHS is also, which is to say the two courses-of-values are equal iff the
corresponding functions are equal for every possible argument.
• Basic Law VI.
f- x == \ Y (x == y).
This law governs the unit class operator, \x, explained below in 1.3.
Rules of Inference
As Frege showed in Begriffsschrift, he requires as rules only modus ponens, univers al
generalization and "substitution rules," together with obvious allowance for renam
ing of bound variables in a term to obtain an alphabetic variant. However, for the
sake of economy of derivations, he officially sanctions several derivable rules in the
Grundgesetze. We won't present the latter here as we don 't intend on carrying out
many derivations in GG, and there is nothing exceptional about any of the derived
rules Frege uses. Here are the non-derived rules:
• Modus Ponens. If <p and 'ljJ are any two terms of GG such that <p and <p ~ 'ljJ
are theorems of GG, then 'ljJ is also a theorem of GG.
• Universal Generalization. If <p ~ 'ljJ is a theorem and the free object variable
x do es not occur in <p, then <p ~ (vx'ljJ[~]) is a theorem. Similarly, if <p ~ 'ljJ
18See Boolos [4] for a detailed study of the consistency of fragments of GG.
19
is a theorem and the free (unary or binary) function variable f does not occur
in <p, then <p ~ (',fj'lj;[;~~jD is a theorem19 .
• Universal Specification (s). a) If <p is theorem of the form vx'lj;[~J2° for sorne term
'lj;, then, for any term X, 'lj;[~l is a theorem.
b) If <p is theorem of the form V jl'lj; [~~ ~~~ 1 for sorne term 'lj;, then for any term
X21 and object variable y, 'lj;[~l is a theorem.
c) If <p is theorem of the form V j2'lj; [~~~~:~~ 1 for sorne term 'lj; , then for any term
X and object variables z, w, 'lj;[j2(;,w)l is a theorem.
1.3 Interpretation of Primitives
The intended interpretation of sorne of Frege's primitives has already been touched
upon. As mentioned, the truth function, --x, takes every object other than T to..i,
while taking T to T. By definition, each functional analogue (negation, implication,
universalization and equality here) of a usuallogical connective returns a truth value
for each (pair of) argument(s); keeping this in mind, we may complete their definitions
as follows:
• negation: For any object, a, -.:, a = T {::}def --a = ..i.
19We must add the clause that each ofvx7,b[~l and vj7,b[~l is a term, which, given our definition
of term (and in particular our disallowance of nesting quantifiers binding the same variable), is
equivalent to saying that x do es not occur in 7,b, and that j does not occur in 7,b, respectively.
20Technically --vx7,b[~]' see the final foot note in 1.2.2.
21X may or may not contain y. Note the different substitution rules for 7,b[~1 and 7,b[~]; see
1.2.2. Similar remarks apply for clause c).
20
• implication: For objects a, b, (a ~ b) = ~ <;::}def (--a = T and --b = ~).
• quantification: (i) For any first-order unary function F(x),
\jxF(x) = T <;::}def for aIl objects a, --F(a) = T (equivalently, F(a) = T);
(ii) for any second-order unary function WU) of a unary (resp. binary) first
order function variable j, vjW(]) = T <;::}def for aIl unary (resp. binary) first
order functions F, --W(F) = T (equivalently, W(F) = T).
• equality: For any objects a, b, (a ~ b) = T <;::}def a = b.
The remaining primitives are the abstraction and unit class operators. We describe
abstraction first. Frege's notion of abstraction applies to junctions, specificaIly, unary
first-order functions. In the formaI system GG such a function is given by a term <fJ
and a specified object variable x. Abstraction is the Grundgesetze analogue of set
formation; however because <fJ is a term and not a formula, we cannot take i<fJ[~l to
be the set of x such that <fJ(x). Rather, i<fJ[~l is to be thought of as the graph or
"course-of-values" of y = <fJ(x) (where y is a variable not free in <fJ), as mentioned
earlier. Frege takes i<fJ[~l to be primitive in the sense that he does not specify any
structure for courses-of-values (e.g., as sets of ordered pairs), saying only that we find
a visual representation ([11], §1) of the course-of-values of a (unary) function as a plot
in the xy-plane. In any case,. Frege takes it as a logical truth that for every unary first
order function, F(x), there exists a corresponding course-of-values, iF(x), satisfying
Basic Law V. The notion of a "double course-of-values" ("Doppelwerthverlauf') is
already contained in the above notion of course-of-values. For if we consider a binary
first-order function, F(x, y), then for any fixed object a, iF(x, a) is a (simple) course
of-values corresponding to the unary function F(x, a). Therefore, iF(x, y) is itself a
unary function of the object variable y, who se value for any argument is a course
of-values. The meaning of the term y(iF(x, if)) is thus completely determined from
the original definition of course-of-values. Frege himself discusses and makes use of
21
only simple and double courses-of-values (and thus technically only simple courses
of-values - the point here being that he never iterates abstraction more than once);
however the further generalization to courses-of-values corresponding to functions of
n object variables is straightforward from the above22.
Now, just as modern set the ory reduces all mathematical entities to sets, Frege
would like to construe all mathematical objects as courses-of-values. (Note, however,
that functions for Frege remain fundamentally different from objects and cannot be
construed as courses-of-values.) In his interpretation of GG, the only objects to which
Frege is committed are T, ..i and those which can be named by terms of GG. Since
it is clear by inspection of the primitives used in term-formation that all objects
which can be named by GG-terms must be courses-of-values or names of Tor ..i, this
amounts to finding a way to construe T and ..i as courses-of-values. It is not clear to
me why this is an appealing path for Frege to take, and his "permutation argument,"
given in §10 of [11], supporting such a move is perhaps less than convincing. We will
only mention here for the sake of completeness that Frege cornes to the conclusion
that we should take T to be the course-of-values corresponding to the unary function
taking T to T and every other object to ..i, while we should take ..i to be the course
of-values corresponding to the unary function taking ..i to T and every other object
to ..i23 . Strange objects to say the least, and it is all the more strange that Frege
says the reason these identifications are permissible is that he has shown they do
22 Interestingly, when we later reconstruct our modified form of the Grundgesetze in NF UT, this
restriction to only simple and double courses-of-values plays a more significant role.
23Formally, we could give the definitions as T =def i(--x) and ~ =def i(x ~ (~\;y(ij ~ fj))). There is nothing especially peculiar about these definitions as they stand. What is peculiar is rather
Frege's interpretation of T and ~ bath as the objects used to define (meta-theoretically) functions
such as --x and x ~ y and as the two (formally definable) courses-of-values given above. If
we were to interpret courses-of-values as sets of ordered pairs in the usual way, we would have, in
particular, that (T, T) E T and (~, T) E ~.
22
not contradict (which can only mean "are consistent with") Basic Law V, governing
courses-of-values. This is certainly peculiar for someone who bitterly and sarcastically
argues against mathematicians who think they can "magically bestow" properties
on objects through definitions, without contemplating the (independently existing)
objects themselves to see if they possess these properties (cf. [10], §139 and §143).
It is interesting that the identification within GG of T and --L with the courses-of
values described ab ove , is entirely analogous to the suggestion made by Quine in
Mathematical Logic that, in his set theory, we should identify concrete objects, i.e.
non-sets, with their unit sets. E.g., if we admit the existence of T as a non-set in
ML, then we have T = {T}. This seemingly trivial point we will return to in our
discussion of NF, NFU and variants, as it is at the centre of work on the consistency
of these systems.
Given an abstraction operator on functions as ab ove , it is natural to expect an
application operator. Such an application operator is more naturally the complement
of abstraction than the unit class operator and the main use Frege makes of the unit
class operator is indeed to define an application operator. The function \x, formally
a primitive of GG, is to be interpreted as the function which Frege defines as follows.
Definition 1.2. \x = y {:}def (there is a unary first-order function G(z) such that
x = iG(z) and G(y) = T and, for all u, G(u) = T =} u = y) OR (there is no such
function and y = x).
The idea is that if there is some concept24 , G(z), such that x = iG(z) and such
that exactly one object falls under the concept G(z), then the unit class operator
returns this object; otherwise it simply returns the argument. (As can be seen by
24The function G(z) need not, strictly speaking, be a concept; it need only be a function whose
value is T for exactly one argument; in practice, however, we will generally not make use of any
function whose range includes both truth values and other objects (e.g. numbers).
23
the fact that Basic Law VI governs only the case where there is such a function, the
actual value returned by the "garbage clause" is irrelevant.)
The application operator, x ~ y, is then formaUy defined as:
Definition 1.3. x ~ y =def \t(-.:, \;g(y == ig(i) ---.:.....-':' (il, == g(x)))).
By "formally defined," we mean that the left-hand sicle of the definition is to be
understood simply as a meta-theoretical short-hand notation for the right-hand side.
Given the obvious meta-theoretical definitions of 3 and Â25, this definition can be
expressed more readably as:
x ~ y =def \t(3g(y == ig(i) Â il, == g(x))).
80 if Y is a course-of-values, iG(i), for sorne G(z), then x ~ y = G(x); otherwise,
for every u, 3g(y == ig(i) Â u == g(x)) = .1, and
x ~ y = t(3g(y == ig(i) Â il, == g(x))), that IS, x ~ y is the course-of-values
corresponding to a (every) function whose value is uniformly .1:
x ~ y = i(-':' (x == x)). However, since Frege intends for aU objects to be courses-of
values, the latter case is not supposed to arise.
As a final note in this section, we point out that ((.), (.) ~ (.)) constitutes a
À-system for any model of GG. (Let us ignore for a moment the fact that there are
no such models and the last statement is vacuously true.) Peter Aczel develops an
analysis of the language of the Grundgesetze as a À-structure in his article [1].
25For definiteness, ~x4> abbreviates ~ ('-ix(~ 4») and 4> A 'lj! abbreviates ~ (4) .-:..... (~ 'lj!)), while,
for future reference, 4> V 'lj! abbreviates (~ 4» .-:..... 'lj!.
24
Chapter 2
Type Theory and the
Grundgesetze: Basic Arithmetic
and the Paradoxes
In the decade after uncovering the paradoxes plaguing Frege's Grundgesetze, Russell,
together with Alfred North Whitehead, developed his own logical analysis of mathe
matics. The monumental Principia Mathematica presented the Theory of Types as a
logical framework for mathematics which steered clear of the paradoxes. This chap
ter focuses on the relation of simple Type Theoryl, denoted "TT" below, to Frege's
Grundgesetze, especially the similarities of TT to CTT and the careful separation of
levels in both. This will help to clarify the basic GG arithmetical definitions which
we present and to draw a conceptual link between the Grundgesetze and Quine's
New Foundations, which is in a sense derived from TT. Of course, the system of the
IThis is Russell's type theory as simplified by Ramsey: simple or "unramified" type theory. "In
spirit," simple type theory is still Russell's type theory, to state things vaguely. More specifically,
the handling of the paradoxes in simple type theory remains essentially that which motivated Russell
to develop his original type theory; see 2.1.1.
25
Grundgesetze is inconsistent. We discuss how the addition of the abstraction and ap
plication operators creates the inconsistency and, finally, we derive this inconsistency
in GG.
2.1 Frege's Levels and Russell's Types
2.1.1 The Theory of Types
As Russell notes in reflecting on the Principia, "It will be found that in aIl the logical
paradoxes there is a kind of reflexive self-reference [ ... ]" ([24], p.83). In the most
common statement of Russell's paradox, we ask whether the (set-theoretic) property
of not being a member of oneself is applicable to the collection of sets/objects which
are not members of themselves. In deriving this paradox, we assume that, given the
property "not being a member of oneself," we may inquire whether it is applicable to
individual elements of sorne domain, which may be left indefinite in every way, except
that it must "already" contain the collection ofthose objects/sets/things which satisfy
"not being a member of oneself." Put this way, perhaps the intuitive, unrestricted
comprehension of naive set theory is not so obvious, in the sense that we must at
once, or "on the same level," be able to state a property and ask whether it applies
to the collection of aIl things satisfying it.
In any case, Russell says that it was by reflecting on this self-referential nature of
the paradoxes that he was led to formulate his Theory of Types. Here we will formally
present and discuss TT, the version of type theory that is now generally considered,
that is, Russell's theory as simplified by Ramsey, a.k.a. "simple Type Theory."
TT is a theory in first-order predicate logic with distinct variable types (or sorts)
for each natural number i,2 that is, we have a countable list of variables, xb, xl, ... ,
2We could of course interpret TT in a "pure" (unsorted) first-order set theory by adding, for
each i, a unary predicate symbol, Si, to the signature of TT and using these to "sort" the variables,
26
x~, ... ,for each i E N. We will not introduce separate symbols for free and bound
variables in TT. The signature of TT contains the single primitive symbol E, a binary
predicate symbol.
The atomic formulas of TT are all the expressions of the form Xi = yi and Xi E yi+1,
for i E N. Finally, the axioms of TT consist of two schemas:
Axiom Schema of Extensionality:
VXi+1V y i+1(VZi(Zi E XH1 f--+ Zi E yHl) -+ XH1 = yHl)
and
Axiom Schema of Comprehension: 3xH 1Vyi(yi E XH1 f--+ CP),
where the variable i ranges over the natural numbers in both schemas, and cp is a TT
formula not containing XH1 free.
This is the complete statement of TT as it is usually given. Note that for the
purposes of arithmetic, one must add a further axiom of infinity, ensuring an infinite
number of "urelements," i.e., entities of type O.
For any variable, X, of any type, the expression X E X is ungrammatical in TT;
we cannot even make statements of self-membership, let alone form the collection of
all non-self-membered entities, and Russell's paradox never gets off the ground.
2.1.2 Frege's Theory of Levels and The Definition of Number
Frege's concept levels were discussed in some detail in Chapter 1. The parallels
with TT should be clear. In particular, CTT, which can fairly be called the logical
system of the Grundgesetze prior to the introduction of the abstraction and unit
class operators, avoids the "self-referencing" that Russell points to as the source of
the paradoxes.
Certainly, it is Frege's definition of natural number that Russell reproduces in
while also appropriately restricting quantification. E.g., \lxi would be interpreted as \Ix E Si (or,
more precisely, \lX(SiX ---+ •.. )).
27
TT. Frege arrived at his notion of cardinal number after much reflection and critical
analysis of previously suggested definitions of number. It is a testament to Frege's
insight and clarity of thought that his definition of cardinal number strikes us now
as very natural and, to a certain extent, obvious3 . We sketch very roughly the line of
thinking leading to his definition.
Frege points out that we run into difficulty when we try to assign a number
directly to objects. For instance, if we spot a soccer match, sayon field A, we may
say that there are two teams playing or that there are 22 players playing, and if these
numbers are properties of objects, i.e. of the people on the field, then we have that
they are both two and 22. Frege concludes that the property we assign in assigning a
cardinality must be a property assigned to a concept. That is, we assign the property
of "being two" to the concept "team on field A," and of "being 22" to "player on
field A," because there are two and 22 objects, respectively, faUing under these two
concepts. Thus, we have the conclusion that, for n a natural number, "being n" is a
second-order concept, i.e., a property that can be meaningfuUy predicated only of a
first-order concept.
Up to this point, RusseU's analysis of "natural number" is in essential agreement
with Frege's and the natural numbers (though not the set of natural numbers!) "exist
at level two" in a model of TT: the natural number n is the set of aU sets of type 1
that contain n elements (of type 0). (Note, however, that we will also have a "copy"
of the natural numbers for each type k greater than two: the natural number n "of
type k" will be the set of aU sets of type k - 1 that contain n elements of type k - 2.)
However, Frege does not hold the view that the natural number n is a second-order
3 Although it need not concern us here, we may note that the von Neumann definition of cardinal
- i.e. a special case of the von Neumann ordinal - would also have been rejected by Frege. For,
though these might serve nicely for the purposes of arithmetic, the idea that the number one is an
element of the number two would certainly have seemed preposterous to Frege.
28
concept; it is "being n," or, more clearly put, "being true of n objects" which is a
second-order concept. The natural number n is, for Frege, an object, as indicated by
the definite article. This same line of reasoning Frege follows in paradoxically and
(somewhat) famously stating "the concept 'horse' is not a concept" ([12], pp.67-69).
The point Frege wishes to make is naturally that the concept "horse" is the extension
of "is a horse," and, while "is a horse" can be (either truly or falsely) predicated
of any object, it is not grammatical to ask whether "the concept 'horse'" is true
of an object. We may ask if "fans under the concept 'horse'" is true of an object,
but now we have again named a concept, to be sharply distinguished, Frege says,
from the concept "horse" (and from the concept "falls under the concept 'horse'"
!). In 2.2.2, we show how Frege defines the natural number n as the course-of-values
corresponding (via a first-order concept) to the second-order concept "being n". Now,
we might at first think that "being n" could also be a third-order concept for Frege,
so that, in analogy with the situation in TT, "being n" would apply to second-order
concepts that are satisfied by n first-order concepts. Frege disagrees on the grounds
that counting is always do ne of abjects falling under some specified first-order concept.
So that even if we "count concepts," e.g. we count the number of "distinct species of
four-Iegged animaIs," we are really counting classes (which are, for Frege, objects),
i.e. the extensions of the concepts, "tiger," "zebra," "German shepherd," etc., and
not the concepts themselves, which are of a fundamentally different nature than
their extensions. For Frege, these extensions, or courses-of-values, are of fundamental
importance in logic. He believes that (first-order) concepts have extensions, i.e.,
collections or classes consisting of those objects falling under the given concept, and
that these classes genuinely exist independently of human thought. Russell on the
other hand, believes that "classes are merely a convenience in discourse," that a class
."is only an expression" ([24], p.81-82)4. Accordingly, in a model of TT, there is no
4And numbers are thus also "merely linguistic conveniences" ([24], p.71). It's somewhat amusing
29
object (entity of type 0) corresponding to a property - or formula - qy(x) of one free
object variable; the set of objects x such that qy(x) is an entity of type 1 ("merely a
linguistic convenience" for Russell) and its existence resides essentially in the formula
qy(x).
We have here a fundamental difference between Frege's and Russell's philosophical
outlooks on logic. Ultimately, his more liberal view on the existence of classes is what
brings inconsistency into Frege's system, for it translates formally into the abstraction
operator discussed in Chapter 1. Before demonstrating this inconsistency, however,
let us give the basic arithmetical definitions of the Grundgesetze.
2.2 Formally Defining Cardinality
We have earlier alluded to the fact that the abstraction operator, though strictly
speaking it is applicable only to first-order functions, can be used to "reduce the
level" of higher-order functions as weIl. We will now see this for the second-order
concept "being true of n objects" discussed above, as the natural number n will be
defined as the course-of-values corresponding to this second-order concept. We first
need sorne preliminary definitions, however.
2.2.1 The Application Operator and "Reduction"
In §34 of the Grundgesetze, Vol.! ([11]), Frege introduces the application operator,
as in Definition 1.3:
x ,-., y =def \fi(~ Vg(y = ig(i) ~~ (fi = g(x)))).
to note Russel! claims that with the definition of the natural number n as the class of al! classes with
n members (which he credits to Frege and, independently, to himself ([24], p.70)), we "get rid of
numbers as metaphysical entities." ([24], p.71) It's hard to imagine that Frege would have accepted
this conclusion.
30
As mentioned in 1.3, the meaning of the unit class operator is such that we then
have, for any unary fun ct ion G(x) and any object a, that a ~ iG(i) = G(a). For
any binary function F(x, y) and any objects a, b, we similarly have that
a ~ (b ~ y(iF(i, fj))) = a ~ iF(i, b) = F(a, b). (Recall that iF(i, y) is a unary
function of y, and y(iF(i, fj)) is the corresponding course-of-values.)
In §35, Frege gives sorne examples of second-order functions and their correspond
ing ("entsprechende") first-order functions. For instance, con si der the second-order
concept of "existence":
Definition 2.1. "Existence operator":
E2(f) =def':' Vi(':' f(i)), abbreviated E2(f) =def 3if(i).
80 for any unary function, F, E2 (F) = T iff there is an object a such that
F(a) = T, otherwise, E2(F) =~. Corresponding to this second-order function, E2'
we have the first-order function El:
Definition 2.2. "Existence operator, reduced":
El(Y) =def':' Vi(':' (i ~ y)), or El(Y) =def 3i (i ~ y).
Now we have that for any unary function, F, E1(iF(i)) = T iff there is an object
a such that a ~ iF(i) = F(a) = T, otherwise E1(iF(i)) =~. And so, for any
unary first-order function F(x), E2(F) = E1(iF(i)).
Clearly, in this correspondence between second- and first-order functions, there is
nothing particular about the Existence operator; in any unary second-order function,
\If (f), of a unary first-order function, we rnay replace each occurrencé of f (x) with
x ~ z to obtain a first-order function G1lt (z) such that, for any function F,
\If(F) = G1lt (iF(i)). Frege says this much in the Grundgesetze ([11], §§35-37) without
giving a precise general staternent of this "reduction of second-order functions". It
will prove quite useful for us to do so and we state the following (meta-)lernrna.
5Recall that, in GG, functions are given by a term and specified variable(s).
31
Lemma 2.1. Let cp be any GG-term and 9 any unary function variable. Then there
is a well-defined GG-term CPg-red and an abject variable Yg su ch that: (i) Xq, - {g} =
Xq,g_red - {Yg} and (ii) for the functions cp(g) and CPg-red(Yg) , we have, provably in
GG,6 that cp(G) = CPg-red(iG(x)) for every unary first-order function G(x) (and set
of values for variables in Xq, - {g} ).
Pro of. Let f2 be a free binary function variable and let f be a free unary function
variable different frorn g. Given cp, we define CPg-red inductively as follows.
1. If cp is x for sorne object variable x, then CPg-red is x;
2. if, for sorne terrns 'l/J and X, cp is (-'l/J) or (.:. 'l/J) or (\'l/J) or ('l/J ----t X)
or ('l/J ~ X) or f ('l/J) or P ('l/J, X) then CPg-red is, respectively, (--'l/Jg-red) or
(.:. 'l/Jg-red) or (\ 'l/Jg-red) or ('l/Jg-red ~ Xg-red) or ('l/Jg-red ~ Xg-red) or f( 'l/Jg-red)
or p ('l/Jg-red, Xg-red);
3. if, for sorne terrn 'l/J and free object variables x, y, cp is i'l/J[~] or vx'l/J[~] or
Vf'l/J[;~~j] or Vf2'l/J[t~~~:~j], then CPg-red is, respectively, i'l/Jg-red[~] or VX~g-red[~l ~f-nl, [Ï(x)] ~f-2nl, [i2 (x,y)]. or v If'g-red j(x) or v If'g-red j2(x,y) ,
4. if, for sorne terrn 'l/J, cp is g('l/J) , then CPg-red is 'l/Jg-red r-., Yg, where Yg is a free
object variable not occurring in 'l/J;
5. finally, if, for sorne terrn 'l/J, cp is Vg'l/J[;i~i], then CPg-red is Vyg'l/J~-red[~:], where ni/ . ni, [i(z~yg)] If' g-red lS If' g-red yg .
6Note that the proof does not contain any "trick" relating to the inconsistency of GG. Specifically,
if we placed restrictions on the existence of courses-of-values, for instance, to obtain a consistent
fragment of GG, then the following proof would still be valid, with clause (ii) modified to read, " ...
for every unary first-order function G (x) such that i:G (x) exists ... ."
32
It's obvious from this definition that X'" - {g} = X"'g-red - {Yg}, and we assume
this without proof. The proof of (ii) is by induction on the structure of cp, specifically,
on the number of GG primitives in cp. Note that the variables 9 and Yg, as in the
statement of the lemma, are fixed in the proof of the inductive step. We need to
consider each of the above cases:
1. In case 1, we have that both cp and CPg-red are x and, so we have triviaUy that,
for every value a of x and every unary first-order function G (z), cp (G) = a =
CPg-red(iG(z)).
2. For each instance of case 2, it foHows directly from the induction hypothesis that,
for any G(z), cp(G) = cp(iG(z)) (for all values of the variables in X", - {g}).
3. Suppose cp is i?jJ[~l. By the induction hypothesis, the fun ct ions ?jJ(g) and
?jJg-red(Yg) are such that ?jJ(G) = ?jJg-red(iG(z)) for every unary first-order G(z)
and aU values of the variables in X'Ij; - {g}. In particular, then, for all values of
the variables in X'Ij;-{g, x} = X",-{g}, and every G(z), we have that, for all val
ues a of the variable x, ?jJ(G) = ?jJg-red(iG(z)). That is, for the functions ?jJ(x, g)
and ?jJg-red(X, Yg), we have \;x(?jJ(x, G) = ?jJg-red(X, iG(z))) for aH G(z) and aH
values of variables in X", - {g}, which is to say \;x(?jJ[~](G) = ?jJg-red[~](iG(z))).
FinaUy, by Extensionality (Basic Law V), this is equivalent to saying that
i?jJ[~](G) = i?jJg-red[~](iG(z)) for aU G(z) and aU values of variables in X",-{g}.
I.e., cp(G) = cpg-red(iG(z)) for aU G(z) and all values of variables in X", - {g},
as required. The proofs of the inductive steps of the remaining possibilities of
case 3 are similar. (Note, in particular, that the proof of the inductive step for
the case where cp is \;x?jJ[~l is essentially contained in the pro of just given.)
4. Suppose cp is g(?jJ) , so cjJg-red is ?jJg-red '" Yg' Note that (*) CPg-red(Yg)
?jJg-red(Yg) r--- Yg and cp(g) = g(?jJ(g)),7, and by the IH, the functions ?jJ(g) and
70f course, it is certainly possible that g not appear in 'ljJ (and therefore also that Yg not appear
33
'ljJg-red(Yg) are such that 'ljJ(G) = 'ljJg-red(iG(i)) for every unary first-order G(z).
Now, for any G(z) (and values of the variables in Xrj; - {g}), we have <p(G) =
G('ljJ), and <pg-red(iG(i)) = 'ljJg-red(iG(i)) ~ iG(i) = G('ljJg-red(iG(i))), by
definition of the application operator. Now, G( 'ljJg-red(iG(i))) = G( 'ljJ( G)) by
the IR, and G('ljJ(G)) = <p(G) from (*). So agàin, <p(G) = <Pg-red(iG(i)) for aU
G(z) and aU values of variables in Xrj; - {g}, as required.
5. FinaUy, suppose <p is Vg'ljJ[~], so <Pg-red is Vyg'ljJ~-red[~l. Now, for aU values
of the variables in Xrj; (note that 9 is not an element of Xrj;), we have that
each of <p and <Pg-red is equal to a truth value. Therefore, it suffices to show
that <p(G) = --L iff <pg-red(iG(i)) = --L8, for any G(z). Since 9 is not free in cp
and Yg is not free in <Pg-red, <p(G) has the same value for every G(z), as does
<Pg-red(iG(i)). As the values of these functions do not vary with G(z), we will
suppress the argument of <p(g) and that of <Pg-red(Yg) in what foUows. Suppose
<p = --L. Then for sorne H(z), 'ljJ(H) =1= T, so by the IR, 'ljJg-red(iH(i)) =1= T.
Now note that by the definition of 'ljJg-red, every occurrence of Yg in 'ljJg-red is
within an expression of the form X ~ Yg, where X is a term (or a term with
bound variables substituted for free variables). Since a ~ b = a ~ i(i ~ b) for
aIl a, b, we have that 'ljJ~-red(b) = 'ljJg-red(i(i ~ b)) = 'ljJg-red(b) for aIl values b
of the variable Yg in 'ljJ'(Yg). So from the above, 'ljJ~_reAiH(i)) =1= T also, which
is to say <Pg-red = --L.
Conversely, suppose <Pg-red = --L, i.e. there is an object b such that 'ljJ~-red(b) =1=
T. Therefore, 'ljJg-red(i(i ~ b)) =1= T, and by the IR, 'ljJg-red(i(i ~ b)) = 'ljJ(z ~
b).9 So 'ljJ(z ~ b) =1= T, and therefore <p = --L, as required.
in 1jJg-red). 8Note that since T and -1 are not symbols of GG, an expression such as cP = -1 should be taken,
within GG, as an abbreviation for, e.g., cP ~..:., (x ~ x):
9It's to obtain this equality that we replace 1jJg-red by 1jJg_red[i(z;:yg)] in the definition of cPg-red
34
So, once again expressing the arguments, we have cjJ( G) = cjJg-red(iG(i)) for
every G(z) and aU values of the variables in X</>( -{g}), completing the proof.
o
ActuaUy, our main interest in this result is something that Frege does not mention
at aU, namely that quantification over function variables can, in the sense given by
above the above lemma, be replaced by quantification over object variables. This
is important for our interpretation of GG in NF UT, because the latter, as a theory
of sets recognizes only one type of first-class existent, that is, there exists a single
variable type (corresponding to Frege's object type) over which we can quantify.
For now, however, we turn to Frege's definition of cardinal number.
2.2.2 Bijection and Equinumerosity
The basic notion at the heart of Frege's definition of cardinal number is that of
"equinumerosity" ("gleichzahligkeit"). Two sets are said to be equinumerous if there
is a bijection between them. We therefore need a definition of bijection; to this end,
Frege gives first a definition of injection. Note that in the context of GG this apphes
to a binary function.
Definition 2.3. A binary function F(x, y) is said to be injective ("eindeutig") if
\;x\;i)(F(x, i)) -.:..... (\;i(F(x, i) -.:..... i) ~ i))) = T, i.e. if, for each x, there is at
most one y such that F(x, y) = T. "Being injective" is most naturaUy a second
order concept, as an injection is a first-order function. However, in practice Frege
always makes use of the corresponding first-order concept, for which he introduces
the foUowing short-hand notation:
I(p) =def VxVi)[(x ,---., (i)'---" p)) -.:..... (Vi(x ,---., (i ,---., p) -.:..... i) ~ i))].
above.
35
Thus, if p is the extension y(&:F(i, fj)) of sorne binary function, F(x, y), then
I(p) = T iff F is injective as defined above. Frege then gives the following two
definitionslO .
Definition 2.4. )p =def y(&:(I(p) À Vû(û ~ i ~ :3v(v ~ fj À û ~ (v ~ p))))).
Thus, for each object p, )p is a relation such that, given objects a, b, a ~ (b ~
)p) = T iff [I(p) = T and for each u such that u ~ a = T, there is a v such that
v ~ b = T and u ~ (v ~ p) = T]. In such a case, Frege says "the p-relation maps
the a-concept into the b-concept." 11 This is the GG equivalent of an injection from
the set {ulu ~ a = T} into the set {vlv ~ b = T}.
Definition 2.5. Inverse relation. U(p) =def y(&:(fj ~ (i ,--.., p))).
This gives, for any objects a, b,p, that (a ~ (b ~ U(p))) = (b ~ (a ~ p)), so
that, in particular, --(a ~ (b ~ U(p))) = --(b ~ (a ~ p)). It follows, for any
a, b,p, that a ~ (b ~)U(p)) = T iff b ~ (a '--"')p) = T. We can now capture the
notion of a bijective correspondence, or equinumerosity, as follows.
Definition 2.6. Given objects a, b, the concepts --(x ~ a) and --(x ,--.., b) are
said to be equinumerous if there exists an object p such that
[a,--.., (b '--"')p) 1\ b ~ (a ~)U(p))] = T, i.e., p injects the a-concept into the b-concept
and U(p) injects the b-concept into the a-concept.
With the above definitions in hand, Frege can now define, for any object a, the
cardinal of the a-concept ("Anzahl des a-Begriffes"). Recall that "being n in number"
or "having cardinality n"12 is for Frege a second-order concept. However, to this
lOFrege himself never uses ~, Â or V in his definitions, but for the sake of readability we will use
them freely.
Il "Die p-Beziehung bildet den a-Begriff in den b-Begriff ab." ([11], §38) Note that Frege is implicitly
defining, for an object a, the a-concept as --(x r-- a).
12n is not intended to indicate a restriction to finite numbers here.
36
there corresponds a first-order concept by the association of first-order functions with
their courses-of-value and the use of application, as discussed above. This first-order
function may be stated as "The x-concept has cardinality n," where x is an object
variable. Now if we apply the abstraction operator to this function, we obtain the
cardinal n. That is, n is the course-of-values corresponding to the function that takes
an object a to T iff the a-concept has cardinality n, and takes a to 1.. otherwise. If we
were to replace concepts with the sets for which they are characteristic functions, then
we would have that n is the set of all sets with n elements. The number associated
with an object a is the extension (course-of-values) of the concept "equinumerous
with the a-concept." Formally:
Definition 2.7. The cardinal, CardF(u), of the u-concept.
CardF(u) =def i[:3Y(u r--- (x r---)U(Y)) A x r--- (u r---)Y))].
We can then define the cardinals zero and one as follows.
Definition 2.8. Zero. OF =def CardF(i(':' (x ~ x))). (OF for "Frege 0".)
That is, OF is the cardinality of the concept "not equal to itself." Note that to
every function F(x) is associated a cardinality CardF(iF(x)) and many extensionally
different 13 functions have cardinality OF, e.g. any function whose range is the natural
numbers, assuming T is not equal to any natural number. But the cardinality operator
defined in 2.7 is intended primarily as an operator on extensions of concepts and any
function F(x) can be converted into a concept --F(x) by an application of the
truth function. (For F(x) a concept, --F(x) = F(x) for all x.) And modulo such
an application, all functions with cardinality OF are extensionally equivalent, i.e. take
the same value (uniformly 1.. in this case) for every argument.
13Two functions being "extensionally different" meaning that there is at least one (set of) argu
ment(s) for which the functions return different values, which, given Basic Law V, is equivalent to
the courses-of-values corresponding to the two functions being unequal.
37
Definition 2.9. One. IF =def CardF(i(i ~ OF )).
Any concept, e.g. "equal to OF," , under which a unique object falls has cardinality
IF. Note that, having defined OF and IF as such, a proof (though an easy one) is
needed in GG to show that IF is the successor of OF, with successor as defined below.
2.2.3 Successor, Ancestral and the Natural Numbers
We come now to Frege's definitions of successor, ancestral and, finaIly, natural number.
The first two definitions are of relations, and the third of a concept.
Definition 2.10. Successor.
SF =def y(i(~il~v(i ~ CardF(i(z ~ il  -:., (z ~ v)))  v ~ il  fJ ~ CardF(ù)))).
We'Il write XSFY for x ~ (y ~ SF).
80 bis a successor of a, aSFb = T, iff there are objects c,d suchthat (i) a is the
cardinality of the concept "falling under the c-concept but not equal to d," (ii) d falls
under the c-concept and (iii) b is the cardinality of the c-concept. Roughly, b is a
successor of a if b is the cardinal of a concept under which exactly one more object
falls than under a concept for which a is the cardinal.
The following definition of the "ancestral of a relation" is intended mainly to be
applied to the extension (double course-of-values) q of sorne given relation, but can of
course be rneaningfully applied to any object. Given a relation F(x, y), the ancestral
of F, denoted <~(iF(i,:iï))14 is intended to capture the relation of "precedes by a finite
(and non-zero) nurnber of places in the F-relation."
Definition 2.11. Ancestral. Given an object q, we define the ancestral of the q-
14Essentially, Frege's own notation; see [11] §45.
38
relation as
<q=dej y(i(Vg[(\ffi\fîi(g(fi) A fi ~ (îi ~ q)) ~ g(îi)) ~
\fiiJ(i ~ (iiJ ~ q) ~ g(iiJ)) ~ g(Y)])).
Let us calI a concept, G(x), hereditary in the q-relation if . . .
[\ffi\fîi(G(fi) 1\ fi ~ (îi ~ q)) ~ G(îi)] = T. Then the above definition states that
a <q b = T ifIfor every concept, G(x), which is hereditary in the q-relation, it follows
from the fact that G(c) = T for every "q-successor" c of a (i.e. \fiiJ(a ~ (iiJ ~ q) ~
G(iiJ)) = T), that G(b) = T.
The following definition is straightforward.
Definition 2.12. Equal ta or finitely preceding in the q-relation.
~q=dej y(i(i <q fJ V i == fJ))·
We come finally to Frege's definition of natural number as equal to zero or finitely
preceded by zero in the SF-relation.
Definition 2.13. An object nF is a (Frege) natural number if OF ~SF nF = T.
lmmediately after laying down this definition in §46 of the Grundgesetze ([11]),
Frege says that he has indicated in §82 of his Grundlagen ([18]) that, for any natural
number nF, the cardinality of the concept "belonging to the series of natural numbers
ending with nF" ("der mit n endenden Anzahlenreihe angehorend" 15) is the successor
of nF. As a theorem of GG, this may be stated as:
f- (OF ~SF nF) ~ (nFSFCardF(nF ~~SF)).
Note that a ~ (nF ~~SF) = T ifI a = nF or a precedes nF in the SF-sequence, i.e.
15Frege, of course, did not subscript his natural numbers with "F."
39
nF ~::;SF is the extension of the concept "belonging to the series of natural numbers
ending with nF." (In other words, nF ~::;SF= i(i ::;SF nF).)
It's interesting that Frege calls attention to this theorem as central to his devel
opment of arithmetic, as we must discuss this exact theorem in sorne detail when
reconstructing a modified form of GG in NFUT. What is clear is that an inductive
proof will be required. That the principle of induction holds for natural numbers
in GG is clear from the definition of Ancestral, 2.11 above. For suppose G(x) is a
function such that G(OF) = T and G(x) is hereditary in the SF-relation, i.e., that
G(a) = T implies G(b) for ban SF-successor of a, formally:
(*) f- \lu\lv[(G(u) À USFV) ~ G(v)].
If nF is a natural number, then by definition nF = OF or OF <SF nF, i.e. nF = OF or
f- yg(\lu\lv((g(u) À USFV) ~ g(v)) ~ [YW(OFSFW ~ g(w)) ~ g(nF)])
from which it follows by an instance of Universal Specification, that . .
f- [\lu\lv((G(u) 1\ USFV) ~ G(v))] ~ [\lW(OFSFW ~ G(w)) ~ G(nF)].
Then by (*) above and an instance of Modus Panens, we have
(t) f- \lW(OFSFW ~ G(w)) ~ G(nF).
It's quite easy to demonstrate that cn f- OFSFa ~ a ::::: IF. (In §87, Frege proves
sentence 71: f- I(SF), i.e. that there is at most one SF-successor to any object16,
and it's easy to show that f- OFSFIF, so (t) follows.) From the assumption f- G(OF)
together with (*) and (t), we may deduce f- G(IF). From f- G(IF) and (t), it now
follows that
f- \lW(OFSFW ~ G(w)).
Finally, then (t) and Modus Panens give us that
f- G(nF).
16This does not require induction, and it is not limited to objects, a, whose corresponding concepts,
-(x ".--, a), are finite.
40
2.3 Russell's Paradox
The inconsistency in Frege's painstakingly constructed logical theory, the project
to which he had dedicated his career, was brought to light in his 1902 exchange
with Russell ([8], pp.59-85). Frege did not immediately recognize the extent of the
disaster, quickly appending an analysis of Russell's paradox to the second volume
of the Grundgesetze, an analysis that, although quite detailed and insightful, ends
optimistically with the statement that should the problem17 not be so fully resolved
as he had believed, he "does not doubt that the way to a solution has been found"
([10] Nachwort, p.265). He soon came to realize that no simple solution presented
itself and, in print at least, abandoned the project of reducing of mathematics to pure
logic.
Once it is pointed out, the paradox is easy to derive. Frege gives the following
derivation in his Appendix. In Volume 1 ([11]), Frege has proved
f- f(x) ~ x ~ yf(fj) (special case of sentence 77, §91). We may substitute
-:. (x ~ x) for f(x) and i( -:. (x ~ x)) for x to obtain
f- [-:' (i(-:' (x ~ x)) ~ i(-:' (x ~ x)))] ~ i(-:' (x ~ x)) ~ i(-:' (x ~ x)).
It then follows immediately (reductio ad absurdum: sentence la, §49) that
(t) f- i(-:' (x ~ x)) ~ i(-:' (x ~ x)).
Similarly, starting from f--:' f (x) ~-:. (x ~ y f (fj)) (also a special case of sentence
77, §91), we arrive at
(:n f--:' (i(-:' (x ~ x)) ~ i(-:' (x ~ x))).
(t) and (:n give the (un)desired contradiction. Sentence la, mentioned ab ove , is
f- x ~ (-:' x ~ y), from which, using (t) and (:j:), we can now deduce any sentence
we wish.
Different interpretations have been given of the exact cause of the contradiction in
17In particular, Frege says, the problem of how we can grasp and recognize numbers as logical
objects.
41
Frege's Grundgesetze. The most conspicuous source of inconsistency is the unlimited
abstraction principle, which correlat es one-to-one unary first-order functions (and, ul
timately m-ary nth-order functions for any fixed m, n) with courses-of-values18 , which
are themselves arguments of these first-order functions. This correlation, together
with the application operator which gives us the means to apply a course-of-values
to itself, would seem, at least at first thought, to be the source of problems, and this
is indeed the "majority opinion"; see, for instance, Boolos [4]. However, there have
been other interpretations, as weIl. For instance, Dummett [7] seems to primarily
blame the second-order quantification in Frege's logic; Boolos offers a rebuttal of this
opinion in [4]. Aczel [1] offers yet another perspective in his analysis of the underly
ing structure of the formaI language of GG as a À-structure, and in particular as a
construction Aczel caUs a "Frege structure." Aczel shows that, in a Frege structure,
the introduction of any function playing the role of the truth function --x results
in contradiction, and so fingers --x as the real culprit. Unfortunately, any serious
discussion of these various conclusions would take us too far astray from the stated
goal of our essay. We turn our attention now to NF and NFU, the set-theoretic
framework we will use in Chapter 4 to interpret GGree .
18The correlation is one-to-one if we take an extensional view of functions, Le., that two given
functions are identical exactly if they take the same value for each (set of) argument(s).
42
Chapter 3
New Foundations, Modifications
and Consistency Results
This chapter introduces Quine's original theory, as well as Jensen's modification,
New Foundations with Urelements. A celebrated consistency result of Specker's is
presented for NF, and its application to NFU is shown. The consistency of NFU
relative to a weakened version of Zermelo set theory then follows by a theorem of
Jensen's. Finally, extensions of NFU, including the addition of an "Axiom of Count
ing" important for Chapter 4, are discussed and recent results are mentioned. As the
results in this chapter are not original and are presented mainly for use in Chapter 4,
we state lesser results without proof and sketch proofs for more important theorems,
giving references to the original sources.
3.1 Quine's Original Theory
W. v. Quine's article [30], "New Foundations for Mathematical Logic," appeared in
the February 1937 edit ion of The American Mathematical Monthly. In the article,
Quine st arts by stating his view that in Whitehead and Russell's Principia Math-
43
ematica we have "good evidence that aIl mathematics is translatable into logic."
Quine then notes that there is possible a further translation of the logic of the Prin
cipia into a logic whose language contains expressions built only from an infinite list,
Xo, Xl,· .. ,xn ,· .. , of variable symbols and the three primitives 1 (alternative denial),
V and E, together with parentheses. This is the language Quine adopts for his New
Foundations l . NaturaIly, the atomic formulas of the language are expressions of the
form X E y, for variable symbols X and y, and formulas are defined as usual. It is the
stated purpose of the remainder of Quine's article to outline this further translation
of the Principia. To this end, Quine provides a standard list of definitions for truth
functional connectives, the existential quantifier, "x being a subset of y," etc.; ordered
pairs are defined in the usual (Kuratowski) set-theoretic fashion2 , relations are defined
as sets of ordered pairs, and so on. Quine does not take equality to be a primitive,
giving the following "Leibnizian" definition: x = Y <r?def Vz(x E z +--+ Y E z).
Quine also adopts as an axiom the following "princip le of extensionality."
Extensionality. VxVy(Vz(z E x +--+ Z E y) ---+ x = y).
To this "principle," Quine adds five "rules" (actuaIly, two axioms and three rules of
inference). These include one theorem of propositional logic, the rule of inference
modus ponens and standard rules of universal generalization and universal specifi
cation, together sufficient for deriving aIl theorems of standard first-order predicate
logic. Thus far, there is nothing particular about Quine's logic. Rounding out his
rules, however, is an axiom of set formation. Quine points out that unrestricted set
formation, in the form of an axiom schema such as
Unrestricted Comprehension. ~xVy(y E x +--+ cp),
where cp is a formula not containing x free, leads to RusseIl's paradox, and that Rus-
lQuine does not actually use the symbol 't:/; he symbolises the universal quantification of an
expression such as x = x by (x)(x = x) rather than by 't:/x(x = x). We shall make use of the latter
notation, with the same meaning, of course.
2I.e. (x,y) =def {{x}, {x, y}}.
44
sell's Theory of Types overcomes this difficulty by assigning a "type" (i.e. a natural
number) to each variable symbol and barring all expressions of the form x E y, except
in the case where x is of type one less than is y. However, as Quine points out, "in
all contexts [i.e., formulas] the types appropriate to the several variables are actually
left unspecified; the context remains systematically ambiguous, in the sense that the
types of its variables may be construed in any fashion conformable to the require
ment that E connect variables only of consecutively ascending types." Any formula,
cp, of untyped set theory, that has a meaningful interpretation in type theory, i.e., for
which it is possible to assign types (natural numbers) to its variables such that the
resulting expression is a formula of TT, will in fact have infinitely many meaningful
interpretations in TT, obtained by repeatedly "shifting" the type of each variable
up by one. In type theory, there are endless "cleavages and reduplications" of sets,
relations and even arithmetic, which Quine says are not only "intuitively repugnant,
but they continually call for more or less elaborate technical manoeuvres [ ... ]."3
To avoid this, and since there is no apparent difficulty with considering expressions
of the form x tJ. x as meaningful, only with considering the existence of a set composed
of exactly the members satisfying it, Quine proposes that we reject the strict Principia
separation of types, and sim ply restrict the axiom of set formation in an appropriate
way. To this end, we may define Quine's notion of stratification as follows.
Definition 3.1. A formula, cp, (of NF) is said to be stratified if there exists an in
dexing, (J', (by natural numbers) of the variable symbols in cp such that the resulting
formula, cp*, is a formula of TT. The indexing, (J', will be called a stratification as
signment for cp and cp* will be called the TT-formula obtained from cp by (J', or simply
a TT-formula obtained from cp.
3Interestingly, Russell rejects Quine's solution (if indeed it is one) of the paradoxes as "created ad
hoc and not to be such that even the cleverest logician would have thought of if he had not known
of the contradictions." ([24], p.SO)
45
As a final axiom, then, Quine adopts the following revised axiom schema of com
prehension.
Stratified Comprehension. :JxVy(y Ex+-> cP), where cP is a stratified formula not
containing x freé.
Note that the definition of stratified formula applies equally well if we adopt the
equality symbol, = , as a primitive. That is, it follows from Quine's "definition" of
equality that if cP is a formula such that the expression x = y occurs in cP, then any
stratification assignment for (the formula abbreviated by) cP assigns the same index
to x and to y, as required for the result to be a legitimate formula of TT. Below we
adopt equality as a primitive.
3.1.1 A Concise Statement of Quine's New Foundations
To recap the above introduction to New Foundations we give a more concise presen
tation of Quine's theory. We will denote the set theory described below by "NF."
NF is a theory in first-order predicate logic (with identity). Its signature contains
the single binary predicate symbol E, and its only axioms are Extensionality and the
Schema of Stratified Comprehension, stated ab ove , and repeated here for convenience:
Axiom of Extensionality. VxVy(Vz(z Ex+-> Z E y) ---+ x = y).
(Note that the right-Ieft implication is here assumed an axiom of predicate logic.)
Axiom Schema of Stratified Comprehension. :JxVy(y Ex+-> cP), where cP is a stratified
formula not containing x free.
4The Axiom Schema of Stratified Comprehension can be replaced by an equivalent finite list of
comprehension axioms. See [14] and [15].
46
3.2 The Consistency of NF and Variants
The consistency of NF relative ta any more thoroughly studied or accepted set theory
remains essentially an open question. We will present and sketch a pro of of a famous
consistency result of Specker's ([28]) ta the effect that a model of NF exists iff there
exists a model of TT with a "shifting automorphism," a.k.a. a "Specker model." The
importance of this result for our purposes is that it can be easily modified ta prove
that the set theory NFU (defined below) is consistent if there is a model of TTU
(also defined below), which in turn can be constructed from a model of weak Zermelo
set theory. NFU and weak extensions thereof will then provide the framework for an
interpretation of GG.
3.2.1 Specker Models and Models of NF
In this section we will indicate the free variables Xl, X2,' .. ,Xn , of a formula cp as in
CP(Xl' X2, . .. ,xn ). If CP(Xl, X2, . .. ,xn ) is a stratified NF-formula, we will denote by
cp*(X~l, X~2, . .. ,x~n) the TT-formula obtained from cp by a stratification assignment,
(J, such that O'(Xj) = ij for 1 ~ j ~ n.
We now show that the existence of a model of NF is equivalent to the existence
of a model of TT for which there exists a "shifting automorphism." In this, we follow
essentially Jensen's presentation in [20].
If M = (Ui , Ei)iEN is a model of TT, i.e., M 1= TT, then we will denote by
M+ the model (Ui , Ei)iEN\{O} 5 of TT. That is, M+ 1= TT is the model obtained
by "dropping" the bottom level of M. Now, a shijting automorphism6 of M is an
isomorphism 0' : M --+ M+ such that, for each i, 0' restricted ta Ui is a bijection ta
Ui+l . In other words, 0' is a bijection on the underlying sets, with O'(Ui ) = Ui+1 and
5 Really, M+ should be defined as (Vi, EniEN with Vi = Ui+l and Ei=Ei+l for i E N, but l take
it the meaning is clear.
6Technically not an automorphism, but the terminology is standard.
47
Definition 3.2. A Specker model, M, is a model of TT such that there exists a
shifting automorphism of M.
With these definitions in place, we can prove the foIlowing.
Theorem 3.3. (Specker) NF is consistent iff there exists a Specker model.
Proof. Let M = (Ui , Ei)iEN be a Specker model, and let (J be a shifting automorphism
for M. Define N = (U, EN) by taking U = Ua and EN such that, for u, v E Ua,
U EN V {::}def U Ea dv). We show that N 1= NF.
We first note that for any stratified NF-formula cfy(XI, X2, . .. ,xn), any TT-formula
cfy* (X~l , X~2, ... , x~n) obtained from cfy and any UI, U2, ... , Un in Ua, the foIlowing equiv
alence holds: (*) N 1= cfy[UI, U2, ... ,un] {::} M 1= cfy* [(Jil (ud, (Ji2 (U2), ... , (Jin (Un)].
(*) is shown by induction: If cfy is x E y, then cfy* is necessarily Xi E yi+l
for some natural number i. Now if u, v E Ua, then by definition of EN, we have
U EN v {::} U Ea (J( v). It foIlows from the fact that (J is a shifting automorphism that
U Ea (J(v) {::} (Ji(U) Ei (Ji+1(v) for every i. So we have
NI= cfy[u,v] {::} MI= cfy*[(Ji(U),(Ji+I(V)],
as required. If cfy is x = y, there is nothing to prove. Induction on the struc
ture of cfy is straightforward, although the case of quantifiers requires some care.
Let us consider the case where cfy(XI' X2, . .. ,xn) is Vy1jJ(y, Xl, X2,' .. ,xn) for some
1jJ. Then cfy* is Vyj 1jJ* (yj , X~l , X~2 , ... , x~) for some appropriate indexing of the vari
ables. Let UI, U2,· .. ,Un E Ua· Now, if N 1= cfy[UI, U2, . .. ,un] then, for aIl v E Ua,
N 1= 1jJ[v, UI, U2, ... ,un]' By the induction hypothesis,
N 1= 1jJ [v, UI, U2, ... ,un] {::} M 1= 1jJ* [(Jj ( V ), (Jil ( UI), (Ji2 ( U2), ... , (Jin ( Un)],
SO we have that (t) (for aIl v E Ua, M 1= 1jJ* [(Jj (v), (Jil (UI), (Ji2 (U2), ... , (Jin (un)]).
But (Jj is a bijection from Ua to Uj , so every w E Uj is w = (Jj (v) for some v E Ua
and therefore (t) implies that
48
for aH w E Uj , MF 'l,Ii* [w, (ji! (Ul), (ji2 (U2), . .. ,(jin (un)],
i.e. that MF q'>*[(ji!(ud,(ji2(U2)"" ,(jin(Un)].
The reverse implication follows similarly.
So (*) holds. But now we need only note that the axiom of Extensionality and aH
instances of the Axiom Schema of Stratified Comprehension are stratified formulas
of NF and any TT-formula obtained from one of these axioms is an axiom of TT.
Since M F TT, M of course models all such axioms, and by (*), N therefore models
Extensionality and Stratified Comprehension. I.e. NF NF.
The converse is much simpler. For suppose N = (U, EN) F NF. Define M =
(Ui , Ei)iEf\! by letting Ui = U and Ei=EN for all i. It's easy to verify that MF TT
and that the identity function on U de fines a shifting automorphism of M. 0
3.2.2 Jensen's New Foundations with Urelements
Just over thirty years after Quine introduced NF, Ronald Bjorn Jensen wrote an
article titled "On the Consistency of a Slight(?) Modification of Quine's New Foun
dations" ([20]). The "slight" modification Jensen considers is a weakening of the
axiom of extensionality to allow "Urelements," i.e. non-sets. Thus, in addition to an
empty set, the resulting theory, denoted by "NFU," would allow for many different
objects that have no members. Formally, NFU is the theory obtained from NF by
replacing the Axiom of Extensionality with the following:
Axiom of Weak Extensionality.
\fx\fy[3z(z E x) ~ ((\fw(w E x <-t W E y) ~ x = y))].
Note that any object which is a non-empty set, i.e., an object which has at least one
member, is subject to the fulllaw of extensionality. On the face of it, this modification
indeed seems slight; this was certainly Quine's own opinion. For in his 1937 article,
49
he suggests that for non-sets, a, we may interpret expressions of the form x E a as
stating the equality of x and a. In a footnote, he points out that "this interpretation,
along with the subsequent postulate Pl [(Full) Extensionality], results in the fusion
of every individual with its unit class; but this is harmless." ([30], p. 71) He repeats
this opinion in his 1940 book, Mathematical Logic, where he also mentions that this
"fusion" of an individual with its unit class is warned against by Frege, but in the
case of non-sets or "individuals," the "retention of the pre-Fregean attitude leads to
no trouble." ([31], p.136). But, as we have seen, Frege himself was taken by the same
impulse for uniformity among the objects in his theory. Quine wanted all objects
in his theory to be interpretable as sets and Frege wanted all objects in his to be
interpretable as courses-of-values. But both felt that their theories should, at least in
principle, be applicable to aU objects. That being the case, it seems hard to provide
a reasonable justification for their interpretations of non-sets as sets and non-courses
of-values as courses-of-values, respectively, other than the desire for uniformity, to
"tidy up the picture.,,7 In any case, the modification proposed by Jensen is quite
natural.
Here we are primarily concerned with the effect this modification has on the
consistency of the theory. We will now show that NFU is in fact consistent relative
to a weak form of Zermelo set theory. This result indicates that the fusion of a
non-class with its unit class is, in terms of consistency at least, far from "harmless."
Before defining weak Zermelo set theory, denoted "WZ," the first step in our pro of
is to record an immediate corollary of Theorem 3.3. Let us denote by "TTU" the
modified theory of types obtained from TT by replacing each instance of Extension
ality with the corresponding instance of Weak Extensionality. We define a shifting
automorphism of a model of TTU exactly as for models of TT, and a Specker model
7 If, on the other hand, we wish our Iogic to apply exclusively to mathematicai objects, perhaps
one couid make the case for these interpretations.
50
of TTU as a model of TTU for which there is a shifting automorphism. Then, simply
noting that the Axiom of Weak Extensionality is also a stratified NFU -formula, the
pro of of Theorem 3.3 can be carried out exactly as above to give
Corollary 3.1. NFU is consistent if there exists a Specker model of TTU.
In fact, since formulations of the axioms of Infinity and Choice are also stratified,
we may give a refined version of this corollary:
Corollary 3.2. NFU (+ Infinity, + Infinity + Choice) is consistent if there exists a
Specker model of TTU (+ Infinity, + Infinity + ChoicejB.
In the next section, it is actually a Specker model of TTU that we construct from
a model of WZ. We can then define a model of NFU in the same way that we defined
a model of NF from a Specker model (of TT).
3.3 Constructing Models of NFU
Below, we define the theory WZ, some equiconsistent extensions, and construct a
Specker model of TTU from a model of one such extension. The construction also
shows the consistency of NFU + Infinity (+ Choice) relative to WZ + Infinity (+
Choice). Finally, we will consider the addition to NFU of an "Axiom of Counting,"
introduced by Rosser and discussed by Randall Holmes in his work on mathematics in
NFU-style set theories. Holmes is primarily interested in much st ronger extensions of
NFU and gives a detailed account of related consistency results in [17]. As our ai ms
8The same statement could technically have been made with NF and TT replacing NFU and
TTU, respectively; however Specker [27] has shown the NF disproves Choice, and therefore proves
Infinity. So every Specker model of TT models Infinity + ...., Choice.
51
are more modest here, we will discuss specifically only the addition of "Counting"
to NFU, or rather to NFUT (see 4.2.1), which will be the theory we finally use to
interpret a modification of GG in the next chapter. NFUT with Counting appears
to be sufficient for Fregean arithmetic.
3.3.1 Weak Zermelo Set Theory and Extensions
Below, a formula cp is said to be bounded if aIl occurrences of V in cp are within expres
sions of the form Vx(x E y ---+ ..• ) and aIl occurrences of :3 are within expressions of
the form :3x(x E y 1\ ... ). WZ is the first-order set theory axiomatized by (universal
closures of) the following.
1. Extensionality: V Z (z E x <--t Z E y) ---+ x = y.
2. Empty set, 0: :3xVy(y ~ x).
3. Pair set, {x, y}: :3zVw(w E z <--t (w = X V W = y)).
4. Union, Ux: :3yVz(z E y <--t :3w(w E x 1\ Z E w)).
5. Power set, Px: :3yVz(z E y <--t Vw(w E Z ---+ W EX)).
6. Schema of Bounded Comprehension: :3xVy(y E x <--t cp), where cp is a bounded
formula not containing x free.
In his article, Jensen also outlines a proof of
Lemma 3.1. TT( + Infinity, + Infinity + Choice) is consistent if! WZ( + Infinity, +
Infinity + Chaice) is consistent.
We give here a pro of of the following theorem of Jensen's.
Theorem 3.4. (Jensen) There is a Specker model of TTU (+ Infinity, + Infinity +
Choice) if WZ (+ Infinity, + Infinity + Choice) is consistent.
52
It follows then, by Corollary 3.2, that
Corollary 3.3. NFU (+ Infinity, + Infinity + Choice) is consistent if WZ (+ Infin
ity, + Infinity + Choice) is consistent, equivalently, by Lemma 3.1, if TT( + Infinity,
+ Infinity + Choice) is consistent.
Pmof. We sketch a proof of Theorem 3.4 to give the reader a picture of an NFU
model. For further details ofthe proof, we refer the reader to Jensen's original article
[20].
Assume WZ is consistent. We add Skolem functions to WZ to obtain the theory
WZ*. That is, WZ* is the theory WZ plus the axioms
Skolem axioms
\Ix 1 , X2,···, xn[:Jy(<p(y, Xl, X2,···, Xn) ~ <P(J</J(X1, X2,···, Xn), Xl, X2,···, Xn))],
where f<P is a new function symbol for each WZ* formula <p. WZ* is consistent if WZ
lS.
N ext, consider the following three axioms for a set of constant names {Ci hEM.
2. <p(CilJ Ci2'.··' Cin) ~ <p(CjlJ ch,···, Cjn) for il < i 2 < ... < in, j1 < j2 < ... < jn,
and any WZ* formula <p.
3. Ci is infinite.
Let W Zl denote the theory obtained from WZ* by adding the constant symbols
{Ci hEM and the axioms 1 and 2 above. Let W Z2 denote the theory obtained from
W Zl by further adding axiom 3.
Let M = (U, EM, L;) 1= WZ* (where L; interprets the Skolem functions). We show
that any finite subset of the W Zl axioms is interpretable in M. Let Ua E U. Define
Ui, i E M, by M 1= Ui+1 = Ui U PUi. Clearly, M 1= Ui U PUi ç Uj for i < j. Let
53
(h, tP2' .. . ,tPn be any finite sequence of WZ* formulas. Byan application of Ramsey's
theorem, we obtain an infinite subset U* = {Uij hEN of {UdiEN such that, for each
<pi,i S; n,
M ~ <Pi ( Ujl' Uh, ... ,Ujmi) +-+ <Pi ( Uh 1 , Uh2' ... ,Uhmi ) for Ujk' Uhk E U*, k S; mi, and
j1 < j2 < ... < jmil hl < h2 < ... < hmi · Taking Cj to be Uij for j E M, we therefore
have that M interprets any given finite subset of the W Zl axioms. By Compactness,
WZ1 is therefore consistent if WZ* is. Furthermore, if M ~ lnfinity, then we may
take Uo to be an infinite set and axiom 3 will be satisfied by each Ci. If WZ + Choice
is consistent, we may also choose M such that M ~ Choice. So if WZ (+ lnfinity,
+ lnfinity + Choice) is consistent, then so is WZ1(WZ2, WZ2 + Choice).
Now, if there exists a model of WZ1 , then there exists a term model. Suppose
M = (U, EM, l:) ~ WZ1 is a term model. Then an automorphism, CY, of (U, EM) is
uniquely determined by CY(Ci) = CH1, i E M. Let us define, for i E M:
Ui =def {x E UI M ~ x E Ci} and x Ei y {:}def M ~ (y ç Ci 1\ x E y). Setting
N = (Ui , Ei)iEN, we wish to show that N ~ TTU; it's clear that CY restricted to
UiEN Ui is then a shifting automorphism for N.
N ~ Weak Extensionality by definition of Ei. To show Comprehension, one
proves by induction that for any TTU formula tP( X~l , X~2 , ... , x~n), there is a bounded
WZ1 formula <P*(X1, X2, ... ,xn) such that
N ~ tP[U1, U2,···, un] {:} M ~ tP*[U1, U2, ... , Un]. Comprehension for N then follows
from Bounded Comprehension for M. So N is a Specker model of TTU. See [20] for
details. If M models Infinity (Choice), then N models Infinity (Choice). 0
We then obtain the desired model, (P, EP ), of NFU from a Specker model of TTU
as described previously, namely P =def Uo and for any u, v E Uo,
U EP V {:}def U Eo cy(v).
54
3.4 Arithmetic in NFU
We take a brief look now at arithmetical concepts in NFU. The basic definitions are
essentially those originally given by Frege, transferred to a set-theoretic context.
Let us assume that ordered pairs are defined and satisfy (*) (x, y) = (z, w) +-t
(x = z 1\ y = w). (x, y) may be defined as usual as {{x}, {x, y}}, but in the next
chapter we discuss and adopt an alternative definition; for now we assume only that
ordered pairs exist and satisfy (*). Then relation, function, bijection, etc., are defined
as usual and we introduce also the following definitions.
Definition 3.5. Two objects a, b are said to be equinumerous if there exists a bijection
between a and b (i.e., a bijective correspondence between the elements of a and the
elements of b; this is well-defined for urelements also). We write a rv b for "a and b
are equinumerous."
Note, in particular, that all urelements are equinumerous to the empty set.
Definition 3.6. For any object a, the cardinality of a is defined as
Card(a) =def {xix rv a}.
It's easy to verify that an NFU formula defining x rv y is stratified (assuming, for
instance, that (x, y) is defined as {{x}, {x, y} }, or as we define it in Chapter 4).
Definition 3.7. Zero is defined as 0 =def Card({xlx =1= x}}.
Definition 3.8. One is defined as 1 =def Card( {O}).
We define now the three important notions of successor, ancestral and natural
number.
Definition 3.9. Y is the successor of x, written xSy, iff
:Jz:Jw(w E z 1\ y == Card(z) 1\ x = Card(z - {w})).
55
Note that this formula is stratified and therefore the successor relation, S, exists
as a set in NFU.
Definition 3.10. Given any binary relation (set of ordered pairs) X, we define the
ancestral, ~ x, of X as follows:
x ~x y {=}def Vz[(VwVu((w E z 1\ wXu) -t u E z)) -t (x E z -t y E z)J9.
Definition 3.11. A natural number is any object x such that 0 ~s x, i.e., such that
"0 is an ancestor of x in the S-relation."
o ~s x is (an abbreviation for) a stratified formula of NFU and so the set of
natural numbers exists in NFU.
3.4.1 The Axiom of Counting
The problem with the above NFU definition of natural number is that we do not know,
for a general formula cjJ( x), that Mathematical Induction holds with respect to cjJ( x).
If cjJ(x) is a stratified formula, then the set {xlcjJ(x)} exists and it is straightforward to
deduce from the definition of natural number that cjJ(O) and cjJ(n) -t cjJ(n + 1) jointly
imply Vn(n EN -t cjJ(n)). (See 2.2.3; the proof that induction holds in NF/NFU for
stratified formulas parallels the proof given there for GG.) However, if cjJ(x) is unstrat
ified, we cannot (in general) know that {xlcjJ(x)} exists, and so we cannot conclude
from cjJ(O) and cjJ(n) -t cjJ(n + 1), that Vn(n E N -t cjJ(n)). For instance, NFU cannot
prove the following: 1o
9Note that this definition doesn't quite mirror Frege's definition of ancestral: for that we would
have to replace x E z --> y E z in the above formula with (Vv(xXv --> VEz)) --> y E z. The
difference is that, using Definition 3.10 ab ove , which is equivalent to Quine's ([31], p.216), we have
y :s;X y for any binary relation X and any y, whereas on Frege's definition this will generally not be
the case. Of course, we could define <x here to properly mirror Frege's definition, but the above
definition is slightly more efficient for our purposes.
lOIn fact, Orey [21J has shown that the Axiom of Counting is independent of NF.
56
Definition 3.12. Axiom of Counting. For aIl n E N, {O, 1, ... , n} En + 1.
Extending NFU by the addition of this axiom was suggested by Rosser ([22])11
This is in fact a very weak extension. Jensen [20] has shown that in ZF we can
construct w-standard models of NFU, in which Mathematical Induction holds for the
natural numbers with respect to any formula, stratified or unstratified. In particular,
Counting is true in such models. We consider this axiom here because its analogue in
GG is an important theorem in Frege's Grundgesetze, and our interpretation in the
next chapter of a modified form of GG will be unable to prove this theorem of Frege's
for basically the same reason that NFU cannot prove Counting. This will provide us a
chance to bring out the parallels between arithmetic in GG and in NFU, culminating
in a proof that adding Counting to NFU makes the analogous GG theorem true in
our interpretation of GGree , the modified form of GG that we present in the next
chapter.
llIn the form n E N -+ {1, ... ,n} E n. Holmes suggests stating this axiom as "AU finite sets
are strongly Cantorian." For a definition of "Cantorian" and "Strongly Cantorian," see [17], p.7.
Holmes also discusses several considerably st ronger extensions, ultimately settling on a theory he
caUs "NFUM," which he proves to be equiconsistent with Morse-KeUey set theory plus the axiom
"there is a K-complete nonprincipal ultrafilter on the proper class ordinal K"; see [17], p.30 for
definitions of these terms.
57
Chapter 4
Interpreting GGrec in NFUT
We give in this chapter an interpretation in NFUT of a modified form, denoted by
"GGree" ("GG reeonsirueied"), of GG; NFUT is defined below in 4.2.1. The most
noticeable modification to GG will be the elimination of function variables. We first
give a presentation of the language of GGree and define (contextually) the GGree
primitives in NF UT. We then define a notion of F-siraiijicaiion for terms of GGree
and prove that courses-of-values corresponding to F-stratified terms exist. We can
then examine to what extent Frege's original axioms are satisfied by GGree and how
far his developinent of arithmetic can be pushed. We will see that basic arithmetic in
GGree (and GG) is interpreted in essentially the same way as in NFU frameworks.
4.1 The Language of GGree
Variables
Variables in GGree are untyped and correspond to those of object type in GG. We
will again distinguish between free and bound variables, so we have the following lists
of variables.
59
• Free variables: xo, Xl, X2,· .. ,xn ,· ...
• Bound variables: xo, Xl, X2,· .. ,Xn ,· ...
GGree has nine primitive functions. Application, X ~ y, must be taken as primi
tive because its definition in GG makes use of a quantification over a function variable.
In Chapter 2, we showed that given an application operator as defined in GG, we can
replace quantification over function variables with quantification over object variables,
but this result clearly cannot be used to eliminate the quantifier in the definition of
X ~ y. The second major difference between the primitives of GG and those of GGree
is the presence, among the latter, of two abstraction operators (unary and binary)
and two application operators (unary and binary). The reason for this is discussed
in 4.2.2.
The primitives of GGree
The nine primitive functions of GGree are
l. truth function: --x,
2. negation: -, x,
3. implication: x ~ y,
4. equality: x ~ y,
5. universalization over objects: VXf(X)I,
6. application: x ~ y,
7. binary application: xy~z,
1 A unary (first-order) function, such as indicated here by f(x), is represented in GGrec by a term
and a specified free variable, binary functions by a term and two specified free variables.
60
8. abstraction: if(x),
9. binaryabstraction: fYf(x, fj).
Terms
GGree term and the set of free variables, sfv, of a GGree term are defined inductively
as follows.
1. Each free object variable, Xi, is a term with sfv {Xi};
2. if rp is a term with sfv X,p, then each of -- rp and ~ rp, is a term with sfv X,p;
3. if rp and 1j; are terms with sfv X,p and Xlp, respectively, then each of rp ~ 1j;,
rp ~ 1j; and rp ~ 1j; is term with sfv X,p U XI/;;
4. if rp, 1j; and X are terms with sfv X,p, X'Ij; and Xx, respectively, then rp1j;;:::"'X is a
term with sfv X,p U X'Ij; U Xx;
5. if rp is a term with sfv X,p, X, y are free object variables and X, fj are bound
object variables not occurring in rp, then each of \';"xrp[~l and irp[~l is a term
with sfv X,p - {x}, and fYrp[~, ~l is a term with sfv X,p - {x, y};
6. nothing else is a term.
Statement, formal deduction, theorem, etc. are defined for GGree exactly as for
GG.
4.2 NF UT, Interpreting GGrec Primitives, and F-
stratification
Before we can examine the basic arithmetical definitions of GG as given in GGree ,
we need to interpret the primitives above in NF UT. For, as might be expected, this
61
translation will place a restriction on which courses-of-values we can assume to exist.
This restricted form of abstraction will be the biggest difference between the GG
axioms as recovered in our interpretation of GGree and the original axioms (see 1.2.3).
Note that it will not be possible, in general, to translate a term of GGree into the
primitive set-theoretic notation of NFUT, for the GGree functions of universalization
and abstraction are interpreted as second-order functions. That is, since these second
order functions take first-order functions as arguments, and the latter are not first
class existents in NF UT, we cannot give explicit definitions of universalization and
abstraction. \Vhat is possible, for any GGree-term 1, is to translate the expression
y = 1 into the language of NF UT, where y is a variable not occurring in 1.2 As we
shall see, this will be sufficient to translate any expression of the form 1 = Tinto the
language of NFUT and therefore to determine which GGree statements are true on
our interpretation. First, we need to fix the objects T and L
4.2.1 Extending the Language of NFU: NFUT
If we are to define the functions --x, -.:, x, x ~ y and x ~ y (and interpret
y = Vx1[~]), then clearly we need to fix two distinct objects as T and _L Intuitively,
these truth values are "concrete" objects, in that they seem to be neither sets nor
courses-of-values. For reasons of philosophical purity, Quine would have us construe
these objects so that T = {T} and .-L = {.-L}, while Frege takes them to be courses
of-values as described in Chapter 2. However, these interpretations seem unnatural,
especially in the present NFU context in which we are in any case allowing urele
ments. Therefore, the obvious interpretation to give of T and .-L is that they are
2Note that y will be a variable of the language of NF UT, while all variables in cp are of the
language of GGree, so y is a priori different from all variables in cp. However, in practice, we will
use the same symbols for variables of NFUTand free variables of GGree , so the statement above
indicates simply that we choose a variable symbol y not appearing in cp.
62
two distinct urelements. Do we know that such elements exist? In fact, Holmes has
shawn (NFU + Infinity + Choice) that "there are many cardinals between IVI [the
cardinality of the universel, and IP(V)I [the cardinality of the set of all sets]" ([17]),
i.e., that the number of urelements in any model of the theory is quite enormous. In
arder ta select and fix two of them, we will add the names T and ..l to the language
of NFU, together with the axiom
Axiom A. T =1= ..l.
Instead of an axiom stating that nothing is a member of either T or ..l, we will
also add the unary predicate symbol 'Set' to the language, with the axioms
Axiom B. ::Jx(x E y) -+ Set(y),
Axiom C. -,Set(T) 1\ -,Set(..l).
Stating the axioms of extensionality and stratified comprehension in the following
forms then allows us to distinguish the empty set from other urelements, including
T and 1...
Extensionality. (Set(x) 1\ Set(y)) -+ ('v'z(z E x ~ Z E y) -+ x = y).
Schema of Stratified Comprehension. ::Jx(Set(x) 1\ 'v'y(y E x ~ CP)), where cP is a
stratified formula not containing x free.
Note, however, that the notion of stratification now requires sorne clarification in
the case of formulas containing T and/or ..l. In this regard, we take T and 1.. quite
63
naturally to be constant terms, or O-ary functions, as we would a name for the empty
set. For suppose we wish to introduce the name 0 for "the set x such that nothing is
an element of x", i.e., we add 0 to the language and the axiom Set(0) /\ Vy(y ~ 0).
What stratification restrictions should we place on formulas containing 07 Note
that any formula containing 0 is of course equivalent to one not containing 0. For
instance, the atomic formula 0 E x is equivalent to (*) 3y(Set(y)/\ Vz(z ~ y)/\y EX).
Similarly for x E 0, x = 0 and 0 = x, and from the translations of these we can
easily define, for any formula cp containing 0, an equivalent formula cp* not containing
0. Note that if cp is a formula containing the atomic formula 0 E x (and no other
atomic formulas containing 0) and cp* is a formula obtained from cp by replacing each
occurrence of 0 E x by (alphabetic variants of) (*), then we may of course choose the
bounded variables in (*) so that these do not occur in cp. Suppose now that there is
an indexing, a, of the variables of cp such that a is a stratification assignment for an
atomic formulas other than 0 E x. Then it should be clear that a can be extended
to a stratification assignment, a* for CP*. 3 So we see that if we add 0 as a name for
the empty set, the atomic formulas 0 E x, x E 0, 0 = x and x = 0 are free from
stratification restrictions. More precisely, we define a stratification assignment for a
formula, cp, which may contain occurrences of 0, as an indexing, a, of the variables
of cp such that the restriction of a to the variables of any atomic formula, 'Ij;, of
cp, not containing an occurrence of 0, is a stratification assignment, in the original
sense, for 'Ij;. This definition of an extended notion of stratification is motivated by
our discussion ab ove , which shows that if cp, possibly containing occurrences of 0, is
stratified in this extended sense, then there is a formula cp*, not containing occurrences
3Specifically, if, in the formula fjJ, x also occurs outside of occurrences of the atomic formula
o Ex, then we take a* = aU {(y,a(x) - 1), (z,a(x) - 2)} (where we might need to replace each
a( u) by a( u) + 2 to insure that a* (u) ::::: 0 for ail variables u in fjJ). If, in fjJ, x occurs only in the
occurrences of the atomic formula 0 Ex, then we take a* = aU {(x,n +2), (y,n+ 1), (z,n)}, for
some arbitrary n E N.
64
of 0, such that </J* is equivalent to </J and </J* is stratified in the original sense.
This then also motivates our definition of stratification assignment for a formula
of NFUT. Namely, to repeat, for T and -.l, the conditions stated above for 0: a
stratification assignment for a formula, </J, of NFUT is an indexing, cr, of the variables
of </J such that the restriction of cr to the variables of any atomic formula, 'ljJ, of </J,
not containing an occurrence of T or -.l, is a stratification assignment, in the original
sense, for 'ljJ.
The discussion above concerning the addition of names to the language of NFU
is closely related to the use of definite description terms, which we will now briefly
describe. As with 0 for the empty set, we may wish to use the name Au B for the
x such that Vy(y E x ~ (y E A V Y E B)). We can handle all such cases in which a
formula (possibly with parameters) defines a unique object by introducing an analogue
to Frege's unit class operator \x, such as Russell's unit class operator iX</J for definite
descriptions </J. Suppose, for a formula </J, we know that :Jx(</J /\ Vy(</J[~l --+ y = x)).
We can introduce the term iX</J and define its use in any context from the following
definitions.
• U = iX</J {::}dej :Jx( </J /\ Vy( </J[~l --+ y = x) /\ U = x);
• same definition for iX</J = u;
• u E iX</J {::}dej :Jy(y = iX</J /\ u E y);
• iX</J E U {::}dej :Jy(y = iX</J /\ yEU).
If we allow a formula, </J, to contain such definite description terms, we could
adopt the following stratification conventions: We assign types not only to variables
of </J, but also to the terms, iX'ljJ, and require, in addition to the usual stratification
requirements, that type(ix'ljJ) = type(x). To see that, e.g. x E (iy(Set(y)/\ Vz(z ~ y)))
is stratified in this extended sense, we may consider the stratification assignment:
65
type(x) = 0, type(ty(Set(y) 1\ 'iz(z ~ y))) = 1 = type(y) , type(z) = O. It's not hard
to verify that if a formula containing definite descriptions is stratified in this new
sense, then the NFU formula which it abbreviates is stratified in the original sense
(although we may have to take alphabetic variants of the definite description terms).
We don't bother proving this here as we will make no technical use of the notion of
stratification extended to formulas possibly containing definite descriptions.
Ordered Pairs and Infinity
The axioms we officially adopt for this chapter are the five above (A, B, C, Weak
Extensionality and the Schema of Stratified Comprehension), together with an ax
iom of infinity. Choosing a form for the statement of in finit y actually raises a point
concerning ordered pairs, and we digress briefly to discuss this point. If we adopt
the usual definitions of the ordered pair, (x, y), as { {x}, {x, y}} and functions as sets
of ordered pairs, then we will have a type difference of 3 between a function and its
arguments and values. This is not unmanageable, but it is awkward, especially wh en
we consider that the ordered triple, (x, y, z), cannot be defined as a special case of
ordered pair if we want x, y and z to be assigned the same type. Quine suggests
(x,y,z) =def ({{x}}, (y,z)) = {{{{x}}},{{{x}}, {{y},{y,z}}}} (!),
so as to recover uniformity between x, y and z. We would then have a level difference
of 5 between a binary function and its arguments and values. Holmes suggests ([17],
p.4) rather that we introduce as primitives the projections 1fl and 1f2 with the same
stratification requirements as equality, e.g. X1flY is to be read "y is the first projection
of x", and any stratification assignment for a formula containing X1fiY must assign
the same level to x and y. Definitions of n-tuple, relation, function, etc. are given
as usual in terms of ordered pairs and we have the more natural level difference of 1
between a function and its arguments and values, for functions of all arities. So we
add the binary predicate symbols 1fl and 1f2 to the language of NFUT and state the
66
axiom of infinity as follows.
In relation to the above we quote the following (NFU) results of Holmes: "If
the Axiom of Infinity were given in the more usual form asserting the existence of
an infinite set, and the Axiom of Choice were assumed as well, it would be possible
to prove the existence of a type-Ievel ordered pair operation [1['1 and 7f2 ab ove] as a
theorem [ ... ]; proving that there is an infinite set, given a type-Ievel pair, is quite easy."
And: "In the absence of Choice, the existence of a type-Ievel ordered pair implies but
is not equivalent to the existence of an infinite set, but NFU with the axiom 'there is
an infinite set' interprets NFU with a type-Ievel pair in a straightforward manner."
([17], pp.4-5)
As it is not necessary for the purposes of this essay, we do not explicitly adopt
the Axiom of Choice, but we note that if the reader assumes it as an acceptable
(or necessary) axiom to add to Zermelo set theory, then from Chapter 3, it is also
acceptable in the context of NFU. We adopt the Axioms A, B, C, Extensionality,
Infinity and the Schema of Stratified Comprehension, as listed ab ove , as the axioms
of our set theory, which we label "NFUT" ("New Foundaiions with Urelements and
Truth Values"). Later, we will consider the addition of Counting.
4.2.2 Translating GGree into NFUT
Given any GGree term rp, we now define inductively the translation of y = rp into a
formula rp~FUT of NFUT. In what follows, rp, 'ljJ, X and pare GGree-terms, x, Z, u are
free variables of GGree or variables of NF UT, and i, z are bound variables of GGree .
1. rp is x: y = X Bde! Y = x;
67
2. cP is -'ljJ:
(y = -'ljJ) Bdef [('ljJ - T 1\ Y = T) V ('ljJ -::1 T 1\ Y = ..l)];
3. cP is -.:, 'ljJ:
(y = -.:, 'ljJ) Bdef [('ljJ -::1 T 1\ Y = T) V ('ljJ = T 1\ Y - ..l)];
4. cP is 'ljJ ~ x:
(y = ('ljJ ~ X)) Bdef
[('ljJ = T 1\ X -::1 T 1\ Y = ..l) V (('ljJ -::1 T V X = T) 1\ Y = T)];
5. cP is 'ljJ ~ x:
(y = (1/J ~ X)) Bdef
:3X:3Z[(X = 'ljJ 1\ z = X) 1\ ((x = z 1\ Y = T) V (x -::1 z 1\ y = ..l))];
(y = (1/J ~ X)) Bdef
:3x:3Z[(X = 'ljJ 1\ z = X) 1\ (( "z is a unary F - function" 1\ (x, y) E z)
V (--,("zisaunaryF - function") 1\ y = 0))];
7. cP is 'ljJx~p:
(y = (1/Jx ~ p)) Bdef
:3x:3z:3w[(x = 'ljJl\z = Xl\w = p) 1\(("wisabinaryF- function" 1\ (x,z,y) E w)
V (--,("wisabinaryF - function") 1\ y = 0))];
4Expressions in quotation marks are defined below, and y = 0 abbreviates
:3u(Set(u) Il 'tfv(v ~ u) Il y = u).
68
8. </J is Vx1P[~]:
. x (y = VX1P[-]) <=?def [(Vx(1P
X T) !\ Y T) V (3x( 1P =1= T) !\ Y .l)];
~ x (y = x1P[ -]) <=?def
X
[(3w("wisaunaryF - function"!\ VxVz((x,z) E w <--t Z = 1P)!\y = w))
V (-,3w( "w is a unary F - function" !\ VxVz((x, z) E w <--t Z = 1P))!\ y = 0)];
10. </J is fi1P[~, ~]:
-- x z (y = xz1P[-, -]) <=?def
X Z
[(3w( "w is a binary F - function" !\ VxVzVu((x, z, u) E w <--t Z = 1P) !\ y = w))
V(-'3w("wisabinaryF-function"!\VxVzVu((x,z,u) E w <--t Z = 1P))!\y = 0)].
AIl occurrences of GGree terms 1P (or X or p) in the RHS of the definitions above
are within expressions of the form x = 1P,5 for sorne variable x, such that the number
of occurrences of GGree primitives in 1P is strictly less than in the term </J for which
y = </J is being defined. So it is clear that iterative use of the definitions above will
translate y = </J into a formula, </J:FUT, of NFUT for any GGree term </J. It should also
be clear that the free variables in the final NFUT formula </J:FUT are exactly those
in the original expression y = </J, i.e., x is a free variable of </J:FUT iff x E Xq, U {y}.
We explain now the expressions in quotation marks above. As mentioned, once
ordered pairs, and n-tuples generally, are defined (see 4.2.1), we define relation and
function as usual. That is,
50r T = 1/; / ..l = 1/;, which could of course be replaced by =:ix(x = T Ax = 1/;) / =:ix(x = ..lAx = 1/;).
69
Definition 4.1.
"x is an n - ary relation" {:}dej
Vy(y E x ---+ :3zo, Zl,···, Zn-l(y = (zo, Zl,···, Zn-l))).
"x is an n - ary function" {:}dej
"x is an n + 1 - ary relation" A VyVz[Vwo, Wl, ... ,Wn-l(((WO, Wl,· .. ,Wn-l, y) Ex
A (wo, Wl,···, Wn-l, z) EX) ---+ y = z)].
An n-ary F-function is simply a n-ary function whose domain is vn, where V =dej
{xix = x}, i.e., V is the universe (which exists as a set in NFU(T)).
Definition 4.2.
"x is an n - ary F - function" {:}dej
"x is an n - ary function" A Vwo, Wl, ... ,Wn-l (:3y(( wo, Wl, ... ,Wn-l, y) Ex)).
Actually, the notion of F-function is hardly more restrictive than that of function,
in an NFU context. For, taking the case of unary functions for simplicity, given any
function f in NFU (i.e., a set of ordered pairs satisfying the conditions of being a
function, as defined above) , the domain, Dj, of f, is a set, as is its complement,
DCj. The identi ty function restricted to Dy, Id Dy' is then also a set and, finally, so
is the union fUI d DG, which is an F -function that agrees with f on D j, and is the f
identity function for aIl other arguments. Thus, any function is easily extended to an
F -function in this way.
In the arithmetical definitions we use below, it will be necessary that certain
formulas be stratified and, in sorne cases, this will require that variables in the different
argument places of binary functions be assigned the same type. For this reason,
70
we cannot do with only simple (unary) abstraction and application operators. For,
given an F -stratified (defined below) GGree term cp and free variables x, y, the term
y(x1>[~,~]) indeed defines a function in NFUT6. However, it does not define a binary
function, in the sense of the above definitions. Rather, y(xcp[~, Q]) de fines a unary x y .
function F(y) such for any a, F(a) = Fa(x), where Fa(x) is itself a unary function,
namely the function defined by (xcp[~, ;:])1. This means that the arguments of the
function F(y) are one type higher than the arguments of the functions Fa(x), so that
a stratification assignment for a term of the form z ,,-... (w ,,-... y( xcp[~, ;]) will need to
assign a type one higher to w than to z. Of course to state this clearly, we need to
formulate a precise notion of stratification for GGree terms, and this should be done
without resorting in each case to a full translation into NFUT primitives. Therefore
we introduce now the notion of F-stratification.
4.2.3 F -Stratification
Note that defining the interpretations of GGree terms in NFUT as above amounts
to defining a definite description term corresponding to each GGree term, e.g., the
definition for y = (x == z) is equivalent to defining
(x == z) =def ~y((x = z 1\ y = T) V (x =1 z 1\ y = ~)).
Similarly for the other definitions; to complex terms will generally correspond nested
definite descriptions. E.g., we may translate the GGree term (x == y) ~ z as, in the
first instance, ~w[((x == y) = T 1\ z =1 T 1\ w = ~) V (((x == y) =1 TV z = T) 1\ w = T)],
60f course, in NFUT we are adopting the usual set-theoretic identification of a function with its
graph.
7Note the different meanings for [~l and [~l: the former indicates a substitution of symbols and
corresponds to the meaning we have given to expressions such as &:<p[~l throughout this essay; the
latter indicates that a is a parameter replacing the variable y. x<p[~, ~l does not in fact stand for
any term of GGrec , and perhaps the notation &:<p[~l(a) is more appropriate, as long as we keep in
mind that a is here an argument of &:<p[~l(y), a function of the variable y.
71
and finally as
LW[(LU[(X = y 1\ U = T) V (x =1- y 1\ U = l..)] = T 1\ z =1- T 1\ W = l..)
V ((LU[(X = y 1\ U = T) V (x =1- y 1\ U = l..)] =1- T V z = T) 1\ W = T)].
From what was said above concerning stratification and definite descriptions, it's
essentially automatic to read off the requirements that the notion of F-straiijication
must place on a GGree term 1> to ensure that the translation 1>:FUT of y = 1> is
stratified. Below, we state these requirements precisely and verify that, for any GGree
term 1> satisfying our definition of F-stratification, 1>:FUT, the translation into NFUT
of y = 1>, is indeed stratified. (Stratified in the original sense (as modified for NFUT) ,
so that, as mentioned in 4.2.1, we do not ultimately rely on any notion of stratification
extended to formulas with definite descriptions.)
So we place the following conditions on a stratification assignment for a GGree
term. We may assign an arbitrary type to the term as a whole. Now, assume that 1>
is a subterm of the term in question and that 1> has been assigned type n. Then
1. if 1> is --1/;: We may assign an arbitrary type to 1/;;
2. if 1> is ~ 1/;: We may assign an arbitrary type to 1/;;
3. if 1> is 1/; ~ x: We may assign arbitrary types to 1/; and X;8
4. if 1> is 1/; ~ x: We may assign an arbitrary type to 1/;, but must assign this same
type to x;
5. if 1> is 1/; ,-.., x: We must assign type n to 1/; and type n + 1 to x;
6. if 1> is 1/;X;::::'P: We must assign type n to 1/; and to X, and type n + 1 to p;
8Note that this freedom in assigning types to the immediate subterm(s) of a term whose main
connective is --, ..:" -:.... or \; is analogous to the fact that an NFU formula ,cp or cp ---> 7jJ or \:/xcp is stratified iff cp (and 7jJ) is (are).
72
7. if qy is vx7jJ[~l: We may assign an arbitrary type to 7jJ[~1;9
8. if qy is i7jJ[~l: We must assign type n - 1 to x and to 7jJ[~1;
9. if qy is ii7jJ[~, ~l: We must assign type n - 1 to x, to fi and to 7jJ[~, ~l.
Let us call a GGrec term qy F-stratijied if there is an assignment of types (natural
numbers) to all subterms lO of qy such that the above conditions are satisfied. Such
an assignment will be called an F-stratijication assignment for <p. We now prove a
lemma to the effect that "F -stratification implies stratification." The proof will make
use of the following obvious fact: If 7jJ is any subterm of a GGrec term <p, and T is
an F-stratification assignment for <p, then T restricted to the subterms of 7jJ is an
F-stratification assignment for 7jJ. In the statement and pro of of the lemma, "NFUT
formula" is a formula written purely in the primitives of NFUT (in particular, not
containing definite description terms), so that "stratification assignment" for such a
formula me ans an assignment of types to variables only.
Lemma 4.1. Let <p be an F-stratijied GGrec term and T be any F-stratijication as
signment for qy. Let <p:FUT be the translation dejined above (see 4.2.2) of y = qy into
the primitives of NF UT. Then there exists a stratijication assignment, Œ, for qy:FUT
such that (i) Œ(Y) = T(<p), and (ii) Œ(U) = T(U) for each free variable u of <p,u
Proof. The proof is by induction on the structure of GGrec terms, specifically, on
the number of occurrences of GGrec primitives in a term <p. If the number is zero,
then qy is x for some free variable x and T(X) = n trivially defines an F-stratification
9Note that, technically, 1j;[~l is not a GGree term; however, for stratification assignments, expres
sions such as 1j;[~], which would form GGree terms on substitution of free variables for the bound
variables that are "free" in them (e.g. i; in 1j;[~]), will be considered terms and assigned types in the
same way as other subterms. lOSee previous footnote. llRecall that the free variables of cp:FUT are exactly those of cp together with y.
73
assignment for </J for any natural number n. </J:FUT is then y = x and a-(y) = a(x) = n
is clearly a stratification for </J:FUT such that a(y) = T( </J).
Induction Hypothesis: For all GGree terms 'ljJ containing n or fewer occurrences of
GGree primitives, if T is any F-stratification assignment for 'ljJ, then there is a strati
fication assignment, a, for the NFUT formula 'ljJf:FUT such that (i) a(x) = T('ljJ) and
(ii) for every free variable, u, of 'ljJ, a(u) = T(U).
Assume the GGree term </J contains n + 1 occurrences of primitives. From the
definition of "GGree term" (see 4.1), we have the following cases:
1. </J is --'ljJ for sorne term 'ljJ. Suppose T is an F-stratification assignment for
</J, with T(</J) = m. Now, </J:FUT is the translation of y = </J, equivalently, of
3x( 'ljJf:FUT 1\ y = --x), which is to say </J:FUT is
(*) 3X('ljJf:FUT 1\ ((x = T 1\ Y = T) V (x # T 1\ Y = .-L))),
where 'ljJf:FUT is the translation of x = 'ljJ. 'ljJ contains n occurrences of GGree
primitives and T restricted to the subterms of'ljJ is an F -stratification assignment
for 'ljJ, so by the IH, there is a stratification assignment, a*, for 'ljJf:FUT such that
a*(x) = T('ljJ) and a*(u) = T(U) for every free variable U in 'ljJ. It's easy to verify,
from (*), that a =def a* U {(y, m)} is a stratification assignment for </J:FUT.
By definition, a(y) = m = T(</J) and from the IH, a(u) = T(U) for every free
variable U in </J.
2. The proof of the inductive step is similar for the cases where </J is ~ 'ljJ, 'ljJ ~ X
or \;fx'ljJ[~l.
3. </J is 'ljJ ~ X for sorne terms 'ljJ and X. Suppose T is an F-stratification assignment
for </J and T( </J) = m. By the definition of F -stratification, there is sorne k such
that T('ljJ) = T(X) = k. Now, </J:FUT is
(*) 3x3z('ljJf:FUT 1\ XljFUT 1\ ((x = z 1\ Y = T) V (x # z 1\ Y = .-L))).
The restrictions of T to subterms of'ljJ and to subterms of X are F-stratification
74
assignments for 1jJ and X, respectively. Thus, by the IH, there are stratifi
cation assignments, 0"1 and 0"2, for 1jJ!:FUT and X1jFUT, respectively, such that
O"l(X) = r(1jJ) = k = r(x) = 0"2(Z) and O"l(U) = r(u) for every free variable u in
1jJ!:FUT and 0"2(U) = r(u) for every free variable u in X1jFUT. Note in particular,
that 0"1 (u) = 0"2 (u) for every free variable common to both 1jJ!:FUT and X1jFUT.
Alphabetic variants of 1jJ!:FUT and X1jFUT may be chosen so that no bound vari
ables are common to both these formulas, and therefore the intersection of the
domains of 0"1 and 0"2 contains only free variables common to both formulas.
Then it is easy to verify from (*) that 0" =def 0"1 U 0"2 U {(y, m)} is well-defined
and is a stratification assignment for </J:FUT such that the two conditions of the
lemma are satisfied.
4. </J is 1jJ ~ X for sorne terms 1jJ and X. Suppose r is an F-stratification assignment
such that r(</J) = m. Then, by definition of F-stratification, r(x) = m + 1 and
r(1jJ) = m. Now, </J:FUT is
(* ):3x:3z(1jJ:FUT /\ X:FUT /\ (( "z is a unary F - function" /\ (x, y) E z)
V ("zis NOT a unary F - function" /\ y = 0))).
The restrictions of r to subterms of 1jJ and to subterms of X are F-stratification
assignments for 1jJ and X, respectively. By the IH, there are stratification as
signments, 0"1 and 0"2, for 1jJ!:FUT and X1jFUT, respectively, such that O"l(X) =
r(1/J) = m, 0"2(Z) = r(x) = m + 1 and O"l(U) = r(u) for every free variable u in
1/J!:FUT and 0"2(U) = r(u) for every free variable u in X1jFUT. Note, in particular,
that O"l(U) = 0"2(U) for every free variable common to both 1jJ!:FUT and X1jFUT.
Again, we may choose alphabetic variants of 1/J!:FUT and X1jFUT such that they
have no baund variables in common. So 0" =def 0"1 U 0"2 U {(y, m)} is well-defined
and, from (*), we can verify that 0" is a stratification assignment for </J:FUT such
that the two conditions of the lemma are satisfied. (Remember that, on our
75
definition of ordered pair, a stratification assignment for y = (x, z) will assign
the same type to each of x, y and z.)
5. Similarly for the case where </> is 1/Jx~p.
6. </> is i1/J[~l for some term 1/J, free variable x and bound variable x. Suppose r
is an F -stratification assignment such that r( </» = m. Then, by definition of
F-stratification, r(x) = r(1/J[~]) = m - 1. Now, </>~FUT is
(* K:Jw( "w is a unary F - function" 1\ VxVz( (x, z) E w f--+ 1/J:FUT) 1\ y = w))
V (-,:::Jw( "w is a unary F - function" 1\ VxV z( (x, z) E w f--+ 1/J:FUT)) 1\ Y = 0).
The restriction of r to subterms of 1/J[~l is an F-stratification assignment for 1/J[~l.
(Recall that, by convention for the purposes of F-stratification assignments, x in 1/J[~l is considered a free variable symbol and therefore 1/J[~l is considered
to be a GGree term.) By the IH, there is a stratification assignment, cr*, for
1/J1jFUT, such that cr*(z) = r(1/J[~]) = r(5:) = m - 1, and cr*(u) = r(u), for u a
free variable of 1/J[~l. It's easy to verify, from (*), that cr =def cr* U {(y, m)} is a
stratification assignment for </>~FUT. By definition, cr(y) = m = r( </» and from
the IH, cr( u) = r( u) for every free variable u in </>.
7. Similarly for the case where </> is :§1/J[~, ~l.
D
The importance of this lemma is that it establishes a precise sufficient condition on
the structure of a GGree term </> which guarantees the existence of i</>[~l and iY</>[~,;l
as courses-of-values. We state this as a theorem.
Theorem 4.3. Let </> be an F-stmtified GGree term. Then for any free variables
x, y, the courses-of-values i</>[~l and iY</>[~,;l exist as unary and binary F-funciions,
respectively.
76
Pro of. Note that, by definition, our interpretation of GGree terms guarantees the
existence of sets corresponding to the terms i4>[~l and iy4>[~,;l regardless of strat
ification restrictions. However, if the NFUT formula translating z = 4>, namely
4>~FUT, is unstratified, then both i4>[~l and iy4>[~,;l may simply equal 0. On the
other hand, if 4>~FUT is stratified, then i4>[~l is the unary F-function F such that
VxVz((x, z) E F +--> 4>~FUT), and iy4>[~,;l is the binary F-function G su ch that
VxVyVz((x, y, z) E G +--> 4>~FUT). But sin ce 4>~FUT is stratified if 4> is F-stratified,
F-stratification of 4> is a sufficient condition for the existence of i4>[~l and iy4>[~,;l
as the desired functions, or courses-of-values. D
We may now examine the extent to which we recover Frege's arithmetical devel
opment in GGree-
4.3 Recovering Frege's Arithmetic
Theorem 4.3 above relating F-stratification and the existence of courses-of-values is
needed to state in a reasonable and self-contained way the axioms of GG as recovered
in our interpretation of GGree . In this section, we first state these modified basic laws
and proceed to develop Fregean arithmetic in GGree , drawing parallels to arithmetic
in NFU(T).
4.3.1 Basic Laws of GGree
We state six "Basic Laws" of GGree , statements that equal T on the interpretation
given and that refiect Frege's six laws, stated in 1.2.3. OfficiaIly, we define the theory
GGree as the set of aIl GGree statements 4> such that 4> = T on our interpretation, so
the following is not meant as an axiomatization of GGree .
77
Axioms
• Basic Law 1.
a) f- x ~ (y ~ x)
b)f-x~x.
These are clearly true on our interpretation.
• Basic Law II.
f- Vi:(i: r--- y) ~ X r--- y.
This is again clearly true on our interpretation for any term cp. This law corre
sponds to BL II a) in 1.2.3. As we no longer have quantification over function
variables, we have no axiom here corresponding to BL II b). Recall that quan
tification over function variables is "replaceable" by quantification over object
variables, in a sense made precise in 2.2.1.
Note, however, we may also state a schema, true on our interpretation of GGree ,
corresponding to BL II a) in 1.2.3. This schema would follow from the above
in the presence of Unrestricted Comprehension (Abstraction); however, given
the restriction to F-stratified Comprehension (see Basic Law VI below), this
schema is more general:
f- Vi: ( cp[~]) ~ cp, for any GGree term cp.
• Basic Law III. Substitutability of equal terms.
L_ rI-.[(y~z)l . rI-.[('iw(z~W~y~w»l J: GG t ri-. ,- 'f' x ----+ 'f' x ' lor any ree erm 'f"
Note that f- (y ~ z) ~ Vw(z r--- W ~ y r--- w) is the special case of the above
taking cp to be x. As for Basic Law II ab ove , we could have stated a single
axiom 12, from which the above would follow in the presence of U nrestricted
Comprehension, but which is weaker than the above given only F-stratified
Comprehension. The axiom would be stated as:
12We stated the schema first for Basic Law III because we think the meaning is (slightly) clearer.
78
\- (y ~ z) ~ x ~ [(\>'w(z ~ W ~ y ~ w)) ~ x], from which the special
case above could still be recovered by substituting for x the term i(x), i.e. the
course-of-values for the identity function (which exists as a unary F-function by
Basic Law VI below).
• Basic Law IV
\- (~ (-x ~~ y)) ~ (-x ~ -y).
This law is clearly true on our interpretation.
• Basic Law V Extensionality.
a) \- (icp[~l ~ i~[~]) ~ vx(cp[~l ~ ~[~]), for any F-stratified GGree terms cp and
~ and variable x.
b) \- (fYcp[~, ~l ~ fY~[~,~]) ~ VxVy(cp[~, ~l - 'l/J[~, ~]), for any F-stratified
GGree terms cp and ~ and variables x, y.
Extensionality applies to courses-of-values corresponding to F-stratified GGree
terms. This law is true (in this form) on our interpretation as we showed that
courses-of-values corresponding to F-stratified GGree terms exist as graphs of
(unary or binary) F-functions and two such graphs are clearly equal iff the
functions to which they correspond are equal for all arguments.
• Basic Law VI. F-stratified Comprehension.
\- :Jy\>'x(x ~ y ~ cp[~]), for each F-stratified GGree term cp and variable x.
If cp is F-stratified, then i(cp[~]) exists as the intended unary F-function and
and satisfies Vx(x ~ i( cp[~]) ~ cp[~]) = T, and Basic Law VI, as stated here,
therefore holds in GGree .
The most significant difference between the Basic Laws as stated here and as
stated in 1.2.3 is the restriction to F-stratified terms in the axioms of Extensionality
and Comprehension. Clearly, this is (at least roughly) equivalent to the restrictions, in
79
NFU(T), of Extensionality to sets (as opposed to urelements) and of Comprehension
to stratified formulas. We must examine to what extent this affects the development
of Arithmetic in GGree .
4.3.2 Arithmetic in GGree
In this section, we give basic arithmetical definitions in GGree , an appropriately mod
ified version of the presentation given in 2.2 for GG.
Bijection and Equinumerosity
Definition 2.3, of "p is (the graph of) an injective function", is changed slightly as
the "iterated application" of GG is replaced by the "binary application" of GGree .
Definition 4.4. I(p) =def \ix\iY[(xy~p) ~ (\iz(xz~p) ~ (Y ~ z))].
Note that the RHS is F-stratified. Note also that any F-stratification assignment
for a term containing l (p) will assign the same type to p and to l (p), while the bound
variables in I(p) are unimportant in F-stratification assignments, as they can always
be changed to obtain an alphabetic variant.
The following three definitions involve "binary courses-of-values" and are accord
ingly slightly different from their GG counterparts.
Definition 4.5. For any p, we define
)p =def f[;(I(p)  Vu(u ~ x ~ :3v(v ~ y  uv~p))).
Note that I(p) 1\ \iu(u ~ x ~ :3v(v ~ y  uv~p)) is F-stratified, and therefore
)p exists as a binary course-of-values, i.e., a binary F-function, for any p. Any F
stratification assignment, Œ, for a term containing )p will assign types to subterms
su ch that ŒOp) = Œ(p) + 1.
80
For objects a, b, we read ab::::-')p = T as "the p-relation maps the a-concept into
the b-concept", i.e., as the GGree equivalent of a ~ (b ~)p) = T in GC. (See 2.2 for
the CG originals of an definitions given here and a more complete discussion of their
meanings.)
Definition 4.6. Inverse relation. U(p) =def fY(f;x::::-,p))).
Note that yx::::-'p is clearly F-stratified.
Definition 4.7. Given objects a, b, the concepts --(x ~ a) and --(x ~ b) are
said to be equinumerous if there exists a p such that (ab::::-')p 1\ ba::::-')U(p)) = T,
i.e., p maps the a-concept into the b-concept and U(p) maps the b-concept into the
a-concept.
We now arrive at the definition of the cardinal of the a-concept, without so far
having to make any major changes to the definitions as given in 2.2.
Definition 4.8. The cardinal of the u-concept.
Given any u, we define the cardinal of the u-concept as
CardF(u) =def i[~fj(ux::::-')U(f;) À (xu::::-')fj))].
Note that ~fj(ux::::-')U(fj) À (xu::::-')fj)) , which we will abbreviate x rvF U, is F
stratified, and that any F-stratification assignment, Œ, for a term containing CardF(u)
will be su ch that Œ(CardF(u)) = Œ(U) + l. We define the cardinals zero and one as before.
Definition 4.9. Zero. OF =def CardF(i(~ (x == x))).
Clearly, ~ (x == x) is F -stratified.
Definition 4.10. One. IF =def CardF(i(x == OF )).
Again, x == OF is clearly F-stratified (for an alphabetic variant of the definition of
OF given ab ove ).
81
Successor, Ancestral and the N atural N umbers
We come now, as before, to the definitions of successor, ancestral and natural number.
We have thus far had to make only minor changes to Frege's core definitions, replacing
iterated abstraction and application in GG with binary abstraction and application
in GGrec . But the definitions are otherwise the same, indicating, for one thing, the
extent to which Frege himself did without quantification over function variables. We
can also define successor essentially as in GG, but then we come to the definition of
ancestral, which will require more attention.
Definition 4.11. Successor.
Sp =dej iY(~û~v(x ~ Cardp(i(z ~ Û A ~ (z ~ v))) A v ~ Û A f) ~ Cardp(û))).
We write zSpw for zw;::::"Sp.
As before, m is the successor of n if m is the cardinal of a concept under which
exactly one more object falls than under a concept for which n is the cardinal.
We come now to the definition of ancestral, the first definition from 2.2 containing
a function variable. In the context of GG, Frege could have given an equivalent defi
nition in which the quantification over a unary variable is replaced by a quantification
over an object variable, as we have discussed before. This is, in fact, the definition we
now give; of course, given the restricted Comprehension axiom of GGrec , this modified
definition no longer carries the same weight as that of GG. This last statement will
be made precise below in discussing Induction in GGree .
Definition 4.12. Ancestral, modified.
<q=dej iY('ifz([(\iÛ\iv(û ~ z) A (ûv;::::"q)) ---.:..... (v ~ z)]
---.:..... [(\iw(xw;::::"q) ---.:..... (w ~ z)) ---.:..... (f) ~ z)])).
82
Given a GGree term 4> and variable x, let us caU the concept, --4>(x), hereditary
in the q-relation if
(*) ('iuVv((4>[~l A uv:::-'q) ~ 4>[~])) = T.
If 4> is F-stratified, then x(4)[~]) is a unary F-function (course-of-values) and, setting
a =def x( 4>[~]), (*) above is equivalent to . . .
(t) (VuVv(u ~ a !\ uv:::-'q) ~ v ~ a) = T.
If 4> is not F-stratified, then we cannot know that x(4)[~]) is the desired course-of
values, in particular, we cannot know that there is any object a such that (*) is
equivalent to (t). Clearly, this will have an effect on Induction, as we discuss after
giving the two final definitions of this section.
The foUowing definition is still straightforward.
Definition 4.13. Equal to or finitely preceding in the q-relation.
~q=def fY(x <q fJ V x ~ fJ)·
We write x ~q y for xy:::-' ~q.
Note that any F-stratification assignment, T, for a term containing x ~q y is such
that T(X) = T(Y).
The definition of natural number as equal to zero or finitely preceded by zero in
the SF-relation is essentially unchanged.
Definition 4.14. An object nF is a natural number if OF ~SF nF = T.
Given a GGree term 4> and variable x as before, suppose now that 4>(OF) = T and
that --4>(x) is hereditary in the SF-relation. If 4> is F-stratified, then (t) above
holds for q = SF and x ~ a = 4>(x) for all x (again setting a =def x(4)[~])). Using the
modified definition of Ancestral above and an argument entirely analogous to that
given at the end of 2.2.3, we see that Induction holds for 4>(x), i.e., that nF ~ a = T,
and therefore 4>( n F) = T, for all natural numbers n F.
83
However, if </J is not stratified, we cannot, in general, prove that Induction holds for
</J( x). This is the sense in which the modified definition of ancestral "does not carry
the same weight" as the original. To determine whether Frege's theorems and proofs
regarding the natural numbers are still possible requires a case-by-case inspection.
4.3.3 The Axiom of Counting
As it turns out, there is no difficulty in proving most of Frege's main results. In par
ticular, Frege's proofs of I(SF) and I(U(SF)) (i.e., there is at most one SF-successor
and at most one SF-predecessor for any object, respectively) do not make use of
courses-of-values corresponding to un-F-stratified terms and can be reproduced in
GGree . We seem to run into no difficulty until we reach the following important the
orem, mentioned earlier in 2.2.3, and here slightly modified for the GGree context:
(*) f- (OF ::;SF nF) ~ (nFSFCardF(i(OF ::;SF i ::;SF nF))),
where OF ::;SF i ::;SF nF abbreviates OF ::;SF i A i ::;SF nF.
The above term is not F-stratified. For suppose T is an F-stratification assignment
for (*). Then r(i(OF ::;SF i ::;SF nF)) = r(i)+l = r(nF)+l and T(CardF(i(OF ::;SF
i ::;SF nF))) = r(i(OF ::;SF i ::;SF nF)) + 1 = r(nF) + 2. But the above formula (*)
also requires r(CardF(i(OF ::;SF i ::;SF nF))) = T(nF), so we have the contradiction
r(nF) = r(nF) + 2.
That the development of arithmetic runs into the same obstacle in GGree as in
NFU is perhaps not surprising, given that we are interpreting GGree in NFUT and
that the notion of F-stratification is built on that of stratification. However, given
that our GGree definitions are basically those of the Grundgesetze, this do es bring out
quite clearly the extent to which arithmetic in NFU is essentially arithmetic as Frege
envisioned it, though transferred to a set-theoretic context. Our aim in the remainder
of this chapter is to prove that the truth of the above GGree statement (*) follows if
84
we add the Axiom of Counting to NFUT. The steps we take to demonstrate this will
also make more explicit the parallels between GGrec and NFU.
Unravelling the GGrec definition of natural number
For the following discussion, it will be useful to define two unary functions. As it
happens, these functions are defined for all x E V, i.e. they are unary F-functions, as
is clear from their stratified defining formulas. However, we introduce them mainly as
notational abbreviations and the fact that they are F-functions will not be essential.
Definition 4.15. Y = x* {::}def Vw(w E y <--t W ~ X = T).
(I.e., x* = {wlw ~ x = T}.)
Definition 4.16. 13
y = Char(x) {::}def ["y is a unary F - function" /\
VzVw((z, w) E Y <--t ((z E x /\ w = T) V (z ~ x /\ w = ..1)))J.
Note that Char(x) is itself a unary function, and that any stratification assignment
for a term containing x* will assign the same type to x and x*. The same is true of
any stratification assignment for a term containing Char(x). It's immediate from the
definitions of these two functions that y E x* <--t Y ~ x = T and y E x <--t
Y ~ Char(x) = T for any x and y. It's also easy to verify that for any x,
Set(x) -+ (Char(x))* = x. (Note that it's not true that Set(x) -+ (Char(x*)) = x.
We have rather "x is an F-concept" -+ (Char(x*)) = x, where an F-concept is a
unary F-function equal to T or ..1 for every argument.)
13It's important here that ((z E x 1\ w = T) V (z ~ x 1\ w = -.1)) is stratified, for otherwise we
cannot know that y exists as described (in particular, as a unary F-function) in the above formula.
That Char(x) exists for each x and that CHAR =def {(x,y)ly = Char(x)} exists are two different
(though related) facts.
85
In discussing the natural numbers, we will have to be careful to distinguish between
the objects which are natural numbers on the GGree definition and those which satisfy
theNFU(T) definition. The former we will call GGree , or Frege, natural numbers, and
label the individuals nF and the set NF, and the latter we call NFU natural numbers,
n for individuals and N for the set. To every object a is associated a Frege cardinal,
as defined above: CardF(a) =def i(i l'VF a).
CardF(a) is a unary F-function such that, for each object x, (x, T) E CardF(a) iff
there is a y such that ax;:::~·-)U(y) = T and xa;:::::")y = T, which is to say that y* is a
function such that the restriction of y* to x* is a bijection between x* and a*. And
if there exists any bijection z between x* and a*, then setting y = Char(z), we see
that ax;:::::")U(y) = T and xa;:::::")y = T, so that x l'VF a. So we have the following
"NFUT definition" of the Frege cardinal CardF(a), equivalent (in NFUT) to the
original definition given in GGree notation.
Definition 4.17. For any object a, CardF(a) =def {xlx* l'V a*}.
Note, e.g., that OF = {xlx* = 0} and IF = {xlx* l'V {OF} n. Next, we define a function f on the NFU(T) natural numbers, in the aim of
eventually showing that f is a bijection between N and NF.
Definition 4.18.
(x,y) E f {=}def [x E NI\\t'z\t'u((z,u) E y <-+ ((z* E xl\u = T)V(z* ~ xl\u = ..l)))].
So, for n E N, (J(n))* = {zlz* En}.
We must be careful that the formula on the RHS is in fact stratified, to ensure the
existence of f. Inspection of the definitions involved shows that it is indeed stratified.
Now, it is clear from the above that, for any n E N, f(n) is indeed sorne Frege
cardinal (which is not to say that it is clearly a Frege natural nurnber!). We wish to
86
show that, in fact, f(N) = NF' We state first the following lemma and give a proof,
making explicit use of some of the simple properties of Char(x) and x*. We will not
always go into such detail in subsequent proofs.
Lemma 4.2. \ix\in[n E N --> (x E n <-+ (Char(x) ~ f(n) = T))].
Pro of. Note that Char(x) .---.. f(n) = T is equivalent to Char(x) E (f(n))* and that
(*) x E n <-+ (Char(x) ~ f(n) = T) is a stratified formula, so Induction holds in
NFUT relative to (*).
Suppose n = O. Now, \iy(y E 0 <-+ \iz(z ~ y)), so x E 0 implies x is in bijection
with the empty set. By definition, then, Char(x) is such that
\iy(y.---.. Char(x) =.1) and therefore \iy(y ~ (Char(x))*), which implies
(Char(x))* E 0 and therefore Char(x) ~ f(O) = T.
So x E 0 --> (Char(x) ~ f(O) = T). The reverse implication is similar.
Assume that (*) holds for k ~ n and suppose x E n + 1. By definition of successor
(in NFUT), :3y:3z(y E zl\n+l = Card(z)l\n = Card(z-{y}))14. Let y and
z be such that y E z 1\ n + 1 = Card(z) 1\ n = Card(z - {y}). Then x rv z.
Let G : x"'::""z be a bijection. Then G(y) E x. But :3w(w E x) --> Set(x) and
Set(x) --> (Char(x))* = x. Therefore (Char(x))* E n + 1, so, by the definition
of f, Char(x) ~ f(n + 1) = T. Conversely, if Char(x) .---.. f(n + 1) = T, then
(Char(x))* E n+l and, just as for x above, we may deduce that :3w(w E (Char(x))*),
which means :3w(w .---.. Char(x) = T), which in turn means :3w(w E x) and therefore
x = (Char(x))* En + 1.
So x E n + 1 <-+ (Char(x) .---.. f(n + 1) = T).
By Induction, (*) holds for aU n; clearly no property of x was used in the proof
and the proof is complete. D
14Note that Card(x) = '" +--> xE", is (almost) immediate from the definition of cardinal, as is
(CardF(x) ~ a ~ x '" a) = T from the definition of Frege cardinal.
87
We next prove
Lemma 4.3. Vn(n E N --+ (J(n) SF f(n + 1) = T)).
Praof. Let n E N. Now, (J(n))* = {ulu* E n} and (J(n + 1))* = {ulu* E n + 1}.
Let y and z be such that y E z /\ (n + 1 = Card(z)) /\ (n = Card(z - {y})).
Then Char(z) E (J(n + 1))* and Char(z - {y}) E (J(n))*. We have that, for any
w, w ~ Char(z - {y}) = T f-+ (w ~ Char(z) A':"" (w ~ y)) = T. Therefore,
Char(z - {y}) = 1ÎJ(w ~ Char(z) A':"" (w ~ y))15. So Char(z) and y are such that
[y ~ Char(z) A f(n) ~ CardF(1ÎJ(w ~ Char(z) A':"" (w ~ y)))
A f(n + 1) ~ CardF(Char(z))] = T,
so :3u:3v(u ~ v A f(n) ~ CardF(1ÎJ(w ~ v A':"" (w ~ u))) A f(n + 1) ~ CardF(v)),
i.e., (J(n) SF f(n + 1) = T). 0
We consider now the characteristic function, Char(J(N)), of f(N). Char(J(N))
is itself a unary F-function, and we may write Charf(N)(x) for x ~ Char(J(N)), its
value for any object x. Thus, we have Charf(N)(x) = T iff xE f(N).
Lemma 4.4. VX(OF ~SF x = T --+ x ~ Char(J(N)) = T). I.e.) NF ç f(N).
Proof. RecaU from the GGree definitions of ancestral and natural number that, for
any object w, OF ~ W = T and --(x ~ w) being hereditary (taking --(x ~ w)
as a function of x) in SF jointly imply that nF ~ w = T for aU nF (i.e. for aU x such
15We use here that (i) w '"""' Char(z-{y}) equals T or ..1 for all w and (ii) w '"""' Char(z) A:' (w ~ y)
is F-stratified. (ii) is actually cheating a little, since we haven't defined Char(z) in GGree primitives
and the lemma on F-stratification does not technically apply. But it is clear from that lemma and the
definition of Char(z) that the translation of u = (w '"""' Char(z) A:' (w ~ y)) into NFUT primitives
is stratified, which is what we really need for the existence of 1Î!(w '"""' Char(z) A:' (w ~ y)) as a
course-of-values.
88
that OF ::;SF X = T). We show that Char(f(N)) satisfies the two conditions given for
wabove. First, since OF = f(O), OF ,----.. Char(f(N)) = T is clear. Now, suppose that
X ,----.. Char(f(N)) = T, i.e. that X E f(N). Then x = f(n) for some nE N. Now, from
Lemma 4.3, the SF-successor of f(n) is f(n+1)16. So for any y, x SF Y --) Y = f(n+1),
and, of course, f(n + 1) ,----.. Char(f(N)) = T. So x ,----.. Char(f(N)) is hereditary in
the SF-relation. So, finally,
\fX[(OF ::;SF x = T) --) (x ,----.. Char(f(N))) = T] and NF ç f(N), as desired. D
Lemma 4.5. \fx(x E N --) (OF ::;SF f(x) = T)). I.e., f(N) ç NF.
Proof. This proof, like the last one, is by Induction, only this time the induction is on
the NFUT natural numbers rather than the GGree natural numbers. Note that (the
translation into NFUT primitives of) (*) OF ::;SF f(x) = T is stratified. So Induction
holds, in NF UT, relative to (*). Clearly, OF ::;SF f(O) = T, since f(O) = OF. If
n E N and OF ::;SF f(n) = T, then, since f(n)SFf(n + 1) = T, it follows that
OF ::;SF f(n + 1) = T. So by Induction, \fx(x E N --) (OF ::;SF f(x) = T)) and we
have f(N) ç NF. D
The previous two Lemmas show that f(N) = NF. In fact, recalling Lemma 4.3,
we may state the following.
Theorem 4.19. f is an isomorphism (N, S)""::'(NF, SF).
The following lemma is straightforward.
Lemma 4.6.
\fx\fn[n E N --) (Char(x) E (f(n + 1))* --) (f(n) SF CardF(Char(x)) = T))].
16We assume here that I(SF) = T, i.e., that any object has at most one SF-successor. As
mentioned ab ove , there seems to be no difficulty in reproducing Frege's pro of of this in GGree .
89
Pro of. Assume Char(x) E (J(n + 1))*. Now, for any GGree cardinal "'F and any
x, we have that x E ("'F)* <-+ CardF(x) = "'F. Since f(n + 1) is a GGree natural
number (and therefore a GGree cardinal) for any n E N, Char(x) E (J(n + 1))*
implies CardF(Char(x)) = f(n + 1). From Lemma 4.3, f(n) SF f(n + 1) = T, so
f(n) SF CardF(Char(x)) = T. 0
We can now state the main result of this section.
Theorem 4.20. NFUT + Counting proves:
f- (OF ~SF nF) ~ (nF SF CardF(i(OF ~SF X ~SF nF ))).
Proof. We need to show that in NF UT, given the axiom of Counting, we can deduce
that OF ~SF nF = T -+ (nF SF CardF(i(OF ~SF X ~SF nF ))) = T. From results
above, we have that OF ~SF nF = T is equivalent to nF E f(N), which in turn
is equivalent to n E N (with n = f-l(nF )). Also, since i(OF ~SF x ~SF nF) =
Char( {OF, IF, . .. , nF}), we have that (nF SF CardF(i(OF ~SF X ~SF nF)) = T) is
equivalent to (nF SF CardF(Char( {OF, IF, ... , nF})) = T).
So we need to show that n E N -+ (nFSFCardF(Char({OF,tF, ... ,nF}))
T). Assume that n E N. Then nSF CardF(Char({OF, IF, . .. ,nF})) = T follows
from Lemma 4.6 if Char( {OF, IF,· .. , nF}) E (J(n + 1))*. But, from Lemma 4.2,
{O,l, ... ,n} E n+ 1 <-+ (Char({O,l, ... ,n}) ~ f(n+ 1) = T) for all nE N. So,
given the axiom of Counting, Char( {O, 1, ... , n}) ~ f(n + 1) = T for n E N, and by
definition of X*,
(Char({O,l, ... ,n}) ~ f(n+ 1) = T) <-+ Char({O,l, ... ,n}) E (J(n+ 1))*. Fi-
nally, sin ce f restricted to {O, 1, ... , n} is an obvious bijection between this set and
{OF, IF, ... , nF} (and therefore Char(J) is a bijection in the sense of GGree between
Char( {O, 1, ... , n}) and Char( {OF, IF, ... , nF} )), we have that
Char({O, 1, ... ,n}) E (J(n+ 1))* <-+ Char({OF, IF, ... ,nF}) E (J(n+ 1))*. As men-
tioned, it now follows from Lemma 4.6 that n SF CardF(Char( {OF, IF, . .. , nF})) =
90
Conclusion
In this essay, we have attempted to shed light on sorne of the connections between
various logical foundations of arithmetic. Our princip le goal in this vein was to
interpret a modified form of Frege's Grundgesetze theory in an NFU framework and
to sketch the parallel development of arithmetic in this interpretation and in NFU
proper. This we have done in the last chapter, with Theorem 4.20 as our main
result in this direction, establishing the derivation of a central arithmetical theorem
of the Grundgesetze 17 from its corresponding statement, as the Axiom of Counting,
in NF UT. In proving 4.20, we make use of connections between the GGree natural
numbers and the NFU natural numbers, and between mathematical induction as
applied to each of these two. The connections clearly run deep and it may prove
interesting to pursue them for more general results. A study of the GG definitions of
the rationals and reals (see [10]) in the context of our interpretation may also prove
fruitful.
We have also sought in this essay to trace historical reasons for similarities between
GG and NFU. Clearly, the conceptual ties between GG, TT, NF and NFU have been
our main focus, but these cannot be entirely separated from the historical progress
of research in mathematical logic. Of course, we had to restrict our focus to a very
limited number of historical developments, indicating in certain places important
points that have been left aside or only briefly touched upon in this essay. As with
17To the importance of which Frege himself called attention [11], §46.
93
historical analysis in any field, there remains much more to be said and many more
connections to be traced; perhaps this is especially true in the fields of mathematics
and logic, where historical understanding is not generally considered a priority.
Ultimately, one can say, in the case of the Grundgesetze, that while Frege's formaI
system was inconsistent, his construction of arithmetic was not. In fact, it is this con
struction of arithmetic that is developed in NFU-style set theories. The paradoxes
don't block Frege's arithmetical definitions, but they clearly complicate the develop
ment of an arithmetic that uses these definitions. As compared to ZFC and variants,
technical complications may indeed make NFU-style set theory a less natural setting
for interpreting mathematics. However, it is certainly not an uninteresting setting,
nor one without a history and motivation, for one can trace its roots, as with so much
of modern logic, to Frege's own groundbreaking work.
94
Appendix A
Frege's Cumulative Type Theory
In this appendix, we present a theory based on Frege's informaI discussion of concept,
or function, levels at the beginning of the Grundgesetze (esp. §§1-4 and §§21-25). We
believe this theory is an extremely plausible interpolation from what Frege actually
says. Frege himself does not give any formaI presentation of the "Cumulative Type
Theory" (CTT) below and we don't claim the details we furnish are specifically
"what Frege had in mind." For instance, the use in presentation of both "levels"
and "types," which seems to us necessary to describe CTT, Frege does not explicitly
mention. However, it seems likely that had Frege described a formaI theory for his
concept levels, it would be equivalent to CTT as we present it below.
As mentioned in Chapter 1, CTT is rather messy and is used neither in the
Grundgesetze nor here for the purposes of actually developing arithmetic. Frege's
discussion of CTT serves to motivate the formaI language GG that he ultimately
adopts. Our main purpose in giving a formaI presentation of CTT is to show that,
given two simplifying assumptions, CTT reduces to TT.
Types of variables in CTT will be given by a pair, (n, m), of natural numbers and
to each pair (n, m) E [(N - {O}) x N] U{(O, On corresponds a distinct CTT type. Each
type, other than type (0,0), is a function type with range (a subset of) the entities
95
of type (0,0) (Le. "objects"); the type of a function is determined by the types of its
variables. Type (0,0) is Frege's object type and may be thought of as a 0 - ary or
constant function type. n is the ZeveZ of the type (n, m) and for each n > 0 there are
infinitely (though countably!) many distinct types of level n. For each n > 0, we will
have to fix an ordering, (Jn, of the types of level n. Let us first describe the levels
informally.
• LeveZ O. These are the objects, or constant functions. There is only one type of
level 0, that is, type (0,0).
• LeveZ 1. These are all the functions taking objects as arguments. These func
tions may be of any finite arity, so there is one type for each arity n EN, and
we may sim ply take the ordering of level 1 functions to be given by their arities.
That is, "f is a function of type (1, n)" is then equivalent to "f is an n + 1-ary
level 1 function." (Note that no level 1 type corresponds to O-ary functions, so
type (1,0) corresponds to unary functions, type (1,1) to binary functions, etc.)
• LeveZ 2. These are all the functions taking as arguments functionsjobjects of
level ::; 1, such that at Zeast one argument is of ZeveZ 1. Since again these
functions may be of any finite arity, we have one type of level 2 for every finite
sequence (al, a2,· .. ,am) of types ai = (ail, ai2) of level ::; 1 (i.e. such that
ail::; 1), such that for some i ::; m, ai is of ZeveZ1 (i.e. ail = 1). Let 52 be the
set of all such sequences and let (J2 : 52 --> N be a bijection ordering 52. Given
any kEN, we write (aik, a~k, ... ,a;!:J for (J:;l(k). Then "f is a function of type
(2, k)" is equivalent to "f is a level 2 function of the variables (h, 12,···, fmk)'"
where fi is of type ark for i ::; mk.
96
Level n. These are all the functions taking as arguments functionsjobjects of
level ::::; n - 1, su ch that at least one argument is of level n - 1. These
functions may be of any finite arity and we have one type of level n for every
finite sequence (al, a2, ... ,am) of types ai = (ail, ai2) of level ::::; n - 1 (i.e. such
that ail ::::; n - 1), su ch that for some i ::::; m, ai is of level n-l (i.e.
ail = n - 1). Let Sn be the set of all such sequences and let crn : Sn ----t N be a
bijection ordering Sn- Given any kEN, we write (a'lk, a'2k, ... , a~k) for
cr;;l(k). Then "f is a function of type (n, k)" is equivalent to "f is a level n
function of the variables (h, h, ... ,fmk)'" where fi is of type ark for i ::::; mk.
We now give a more formaI presentation of CTT. We will not write the full details
of term formation and substitution rules, as we don't intend on actually working
within CTT; it should be clear by analogy with GG how such details may be fiUed
lll.
We use the foUowing notations and definitions. Types of level 1 are ordered by
their arity as indicated above: (1, n) is the type of an n + l-ary level 1 function. We
define inductively, for n E N - {O}, (i) the set, Sn, as the set of aU finite sequences,
for sorne (at least one) i ::::; m, as weU as (ii) the bijection crn : Sn ----t N, ordering Sn
(In fact, we don't actuaUy define this bijection here, but it is clear that sorne such
ordering of Sn exists, for each n E N, and we assume that such a crn is fixed for each
n EN.) Given any n > 0, kEN, we have cr;;l(k) E Sn and we denote cr;;l(k) by
( nk nk nk ) . d· t d b al ,a2 , ... , amk ' as ln 1ca e a ove.
N ow, the presentation of CTT. CTT has the foUowing variable symbols: An
. fi ·t 1· t (0,0) (0,0) (0,0) f· bl f t (0 0) d f h ( k) III m e 1S X o , Xl , ... , X n , . .. 0 vana es 0 ype , ,an, or eac n, E
97
(N - {O}) x N, an infinite list f~n,k), fin,k) , ... , f;::,k) , . .. of variables of type (n, k).
As for G G, we assume corresponding lists of bound variable symbols.
CTT has the following primitive functions:
• truth function: --x,
• negation: -, x,
• implication: x ~ y.
• for each CTT type (n, k), equality of type (n, k) entities: f(n,k) ~(n,k) g(n,k) ,
• for each CTT type (n, k), universalization over f(n,k): v}<n,k)G(j(n,k») 1.
A CTT term and its set of free variables, sfv are defined inductively as follows.
1. Each free object variable, x~o,ü), is a term with sfv {x~O,O)};
2. if f(n,k) and g(n,k) are variables of type (n, k), then f(n,k) ~(n,k) g(n,k) is a term
with sfv {f(n,k), g(n,k)};
3. if mEN, 4>i is a term with sfv Xq,i for i :::; m + 1, and f is a variable of type
(1, m), then f(4)l, 4>2,' .. ,4>m+l) is a term with sfv Ui::om+l Xq,i U {f};
4. if n, mEN, ai is a type of level ail:::; n - 1 for each i :::; m, ail = n - 1 for
sorne i :::; m, f is a variable of type (n, k) with k = O'n(al, a2, . .. ,am), and fi is
a variable of type ai for each i :::; m, then f(h, f2,' .. ,fm) is a term with sfv
{f, h, 12,· .. ,fn};
5. if 4> is a term with sfv Xq" then each of -- 4> and":" 4>, is a term with sfv Xq,;
1 A function of a variable of type (n, k) is given in CTT by specifying a term, cp, and a variable,
j(n,kl, of type (n,k).
98
6. if <p and 'ljJ are terms with sfv Xr/> and X1j;, respectively, then each of <p -.:...... 'ljJ
and <p ~(ü,ü) 'ljJ is term with sfv Xr/> U X1j;;
7. if <p is a term with sfv Xr/>, j(n,k) is a free variable of type (n, k) and j(n,k) is a
bound variable of type (n,k) not occurring in <p, then vj(n,k)<p[~~:::~l is a term
with sfv Xr/> - {j(n,k)};
8. nothing else is a term.
With the exception of Basic Law VI, we may adopt a set ofaxioms which are very
close to those of GG. We state these as follows.
• Basic Law 1.
a)f-x-':"""(y-':"""x)
b)f-x-':"""x,
where x, y are variables of type (0,0).
• Basic Law II. For each type (n, k),
f- 'ij(n,k)g(j(n,k») -.:...... gu(n,k»),
where j(n,k) is a variable of type (n, k) and 9 is a variable of level n + 1 and of
type corresponding to the sequence ((n, k)) E Sn+1, i.e. 9 is a (variable for a)
function of the single variable j(n,k).2
• Basic Law III. Substitutability of equal terms. For each type (n, k),
f- h(l,ü)Uin,k) ~(n,k) jJn,k») -.:...... h(l'Ü)(Vg(guJn,k») -.:...... gUin,k»))),
where 9 is a function of the same type as in the statement of Basic Law II
above. Note that f- Uin,k) ~(n,k) jJn,k») -.:...... Vg(guJn,k») -.:...... gUin,k»)) is a
special casé.
2Note that the application of this law to functions given in CTT by a specified term and variable
j(n,k) is determined by the substitution rules.
3Note also that, e.g. h(l,O)Uin,k) ~(n,k) jJn,k)) is in fact a term by clause 3 in the definition of
"term," since jin,k) ~(n,k) jJn,k) is a term by clause 2.
99
• Basic Law IV
f- (~ (-x ~(O,O)~ y)) ~ (--x ~(O,O) -y) .
• Basic Law V Extensionality. For each type (n, k),
L (f(n,k) --.:.. j(n,k») --.:.. ,- 1 -(n,k) 2 -(0,0)
. _ark . _a~k (n,k) _ark _a~k --.:.. (n,k) _ark _a~k [\191 ... \l9mk (f1 (91' ... ,9mk ) -(0,0) f2 (91 , ... , 9mk ))],
h k - (nk nk) w ere - (Jn al , ... , amk .
At first glance, it seems that Frege might object to the statement of Extensionality
given here as Basic Law V For Frege thinks that we cannot talk about equality of
functions - unsaturated entities - in the same sense that we talk about equality of
objects. Frege indeed never talks about the equality of functions, but rather of the
courses-of-values corresponding to functions. But clearly the identity of courses-of
values corresponding to two functions is sorne kind of statement of equality relating
to the two functions, and it is exactly this notion of equality, namely extensional
equality, that we capture in our statement of Basic Law V ab ove , and that we denote
by the typed equality symbol, ~(n,k), for each CTT type (n, k). If someone were to
really insist that this notion of "extensional equality" cannot properly be called an
equality, then we would gladly calI it an extensional equivalence instead.
Nonetheless, whether Frege fully realized it or not, he may have had a very good
technical reasons for keeping expressions of the form fin,k) ~(n,k) fJn,k) out of his
formaI language, or at least not giving them the status of terrn. The reason is that
Frege takes a function of any specifie type to be formally given by a term and speci
fied variable(s) of the appropriate type(s), and he therefore allows as an instance of
universal specification any substitution of a function, given in this way as term and
specified variable(s), for a universally quantified function variable (of the correspond
ing type). But it is essential for Frege's notion of substitution (both as he spelled it
100
out and as we reconstructed it in Chapter 1) that with any occurrence of a function
variable, f, the variables of f be expressed, e.g. as f(x, y) if f is a first-order binary
function variable. This is not the case in an expression such as fin,k) ~(n,k) fJn,k)
and substitution, as Frege and we have defined it, of a function given by a term and
specified variable(s) for a function variable in this expression do es not make sense
(unless n = k = 0). Because we don't have courses-of-values in CTT, we no longer
have any means of stating the (typed) equality of two "extensionally equivalent" func
tions, as given by terms and specified variables. If, for instance, c/J(!I, ... ,fm) is the
function of type (n, k) given in CTT by the term c/J and the variables !I, . .. ,fm and
'ljJ(ft, ... , fm) is the function (of type (n, k)) given by the term 'ljJ and the variables
!I, ... , fm, and we have that vll ... Vlm(c/J[~~, ... , ~:] ~ 'ljJ[~~, ... , ~:]) = T, then we
have no way of expressing the equality of c/J(ft,· .. ,fm) and 'ljJ(!I, . .. ,fm) in CTT,
since, e.g., c/J(fl, ... ,fm) ~(n,k) 'ljJ(!I, . .. ,fm) is not a legitimate term. This situation
we remedy by the addition of the following axiom schema:
Axiom Schema of CTT-Comprehension. For every CTT term c/J and any variables,
!I, ... ,fm, of types al, . .. ,am, respectively,
f- ~-(n,k)~f- ~f- (-(n,k)(f- f- ) ~ -+.[iI lm]) ::J9 VI· .. V m 9 l, ... , m -(0,0) 'f' fI"'" fm '
where k = O"n(al,' .. ,am)'
Now, if c/J(ft, . .. ,fm) and 'ljJ(!I, . .. ,fm) are two functions given in CTT as men
tioned above, their equality (of type (n, k)) may be formally stated as follows:
( ) ~-(n,k)~-(n,k)['f- ~f- (-(n,k)(f- f- ) ~ -+.[11 lm] * ::J9l ::J92 VI· .. V m 91 l,· .. , m -(0,0) 'f' !l''''' fm - -
. -(n,k)(f- f- ) ~ OI,[!I fm]/\' -(n,k) ~ -(n,k)) 1\ 92 l, ... , m -(0,0) 'f' fI"'" fm 91 -(n,k) 92 ,
(assuming again k = O"n(al,"" am))'
By two instances of the Axiom Schema of CTT-Comprehension, we can be sure
that the (*) is in fact equivalent to the statement of "(n, k )-equality" between
101
The need for a comprehension principle in the absence of courses-of-values is in
teresting. As we have mentioned often, a function in GG and in CTT is represented
by a term and specified variable(s). (Note that this of course includes the case where
the term is simply the specified variable.) In GG, we can meaningfully substitute
a function so given for a function variable (of the appropriate type) in any context
in which the latter meaningfully occurs. Given Basic Law V of GG, we state the
extensional equivalence of two functions by stating the equality of the correspond
ing courses-of-values. The existence of a function represented by a term and specified
variable(s) is essentially given by the fact that the term can be formed in GG; however
the existence of courses-of-values for functions is required in order to make statements
of extensional equivalence between these functions, or, rather, of the uniqueness (in
terms of courses-of-values) of extensionally equivalent functions. The positing of a
corresponding course-of-values for every function is built into the grammar of GG. In
CTT, given the absence of courses-of-values, we must state the uniqueness of exten
sionally equivalent functions directly as in the Basic Law V of CTT ab ove. But this
means that there are contexts in which we cannot meaningfully substitute a function
as given by a term and specified variable(s) for a function variablé. In particular, if
we wish generally to make statements of extension al equivalence between functions
represented by terms and specified variables, this forces us to directly posit the ex
istence, for any term and specified variable(s), of the corresponding function, as we
do in the Axiom Schema of CTT-Comprehension above. In both GG and CTT,
statements of equality, or equivalence, between functions requires an assumption of
existence, an explicit postulate on the existence of functions in CTT and an implicit
assumption of the existence of courses-of-values in GG. Of course, Russell's paradox
4That is, even when the function variable is of the appropriate type, i.e., the type of the function
represented by the term and specified variable( s).
102
shows us that if we assume the existence of courses-of-values as in GG, that is, en
tities all of the same type (object type in GG), corresponding to functions that take
entities of this type as arguments, then we must place some restrictions(s) on our
existence assumption. E.g., we may adopt a "limitation of size" doctrine as in ZF,
or we may adopt a stratification requirement as in NF. On the other hand, if we
postulate a comprehension principle on the existence of functions as in CTT, while
maintaining a strict separation of types, then no paradox seems to arise from unlim
ited comprehension. This is analogous to the unlimited comprehension principle for
sets in TT.
We come finally to the rules of CTT. As for GG, we may adopt modus ponens, uni
versal generalization and universal specifications (see Chapter 1). For CTT, however,
we need a more elaborate notion of substitution. For the statement of universal speci
fication in GG, we defined substitution of terms for object variables, and of first-order
unary and binary functions, represented in GG by a specified term and one or two
specified object variables, for unary and binary function variables, respectively. The
statement of universal specification in CTT requires a general notion of substitution
of a function of type (n, k), as given by a specified CTT term and mk variables of
types a~k, a'2k, ... , a~k' respectively, for a variable of type (n, k). For cases in which
the variable f(n,k) occurs with its variables expressed, this generalization is straight
forward from the notion of substitution in GG, if rather tedious to spell out. For
both these reasons, we skip the details here. For occurrences of function variables
with their own variables unexpressed, we do not allow substitutions. CTT as we
have presented it to this point is our formaI interpretation of Frege's discussion in the
Grundgesetze. We turn our attention now to simplifying assumptions for CTT.
We consider first simplifying CTT by restricting the range of each function to
a set of exactly two distinct objects. Specifically, we would like these objects to be
T and 1-, which we may represent in CTT as Vx(x ~(O,O) x) and -.:, Vx(x ~(O,O) x),
103
respectively, where x is a variable of type (0,0). This restriction can be achieved with
the following axiom schema.
Axiom Schema of Restriction to Concepts. For every CTT type (n, k) and func
tion f of type (n, k),
. . . f- '::/91 ... '::/9m(J(91 , ... ,9m) ~(O,O) '::/x(x ~(O,O) x)
V f(91,"" 9m) ~(O,O)":" vx(x ~(O,O) x)),
where k = (Jn(a~k, ... , a;:k) and gi is of type ark for i ~ mk·
Let us abbreviate f(gl,' .. ,gm) ~(O,O) '::/x(x ~(O,O) x) by (gl,"" gm) E f, and
f(gl,'" ,gm) ~(O,O)":" '::/x(x ~(O,O) x) by (gl,'" ,gm) tI. j. Then we may restate the
above axiom as: ., .
f- '::/91 ... '::/9m((91 , ... ,9m) E f V (91, ... ,9m) tI. 1).
In this way, the functions of CTT are restricted essentially to predicates. (Note
that each of the primitive functions of CTT is assumed to satisfy this axiom.) Clearly,
if we adopt the above axiom, then every function may be interpreted as a set or n
ary relation, for sorne n E N. Let us denote CTTR the theory obtained from CTT
by the addition of this axiom. Roughly speaking, CTTR is TT extended to allow
not only sets at each level n, but k-ary relations for kEN, and in such a way that
these relations may be of mixed levels, so long as they are restricted to sets and
relations of levels ~ n - 1 and any tuple (gl, ... ,gm) contained in such a relation, f,
((gl, ... ,gm) E 1) contains sorne gi of level n - 15 .
Therefore, if we further restrict CTTR by allowing only unary functions, we re
cover essentially the theory TT. That is, we would have a single type for each level n,
5To state this last condition more precisely, we should say that any tuple (91, ... ,9m) for which
the expression (91, ... ,9m) E f is grammatical (i.e. a CTT term) is such that 9i is of level n - 1 for
sorne i :::; m.
104
corresponding, for n > 0, to functions of a single variable of level (or type, since we
may then identify types with their levels) n - 1 and to objects for n = O. Any such
function of level n > 0 is in fact the characteristic function for a set of entities of type
n - 1. By Extensionality (Basic Law V) ab ove , any two functions of the same level
are equal iff they take the same value for every argument of level n - 1, which means
they are equal iff they are characteristic functions for the same set (if we assume also
the usual set-theoretic axiom of extensionality). Basic Law V reduces to:
fn+ 1 ~(n,k) gn+l ~ ';/hn(jn+l(hn) ~(O,O) gn+l(hn)), for n E N.
In CTTR, this may also be written as:
r+l ~(n,k) gn+l ~ Vhn(hn E r+1 ~ hn E gn+l).
We also have that the following holds in CTTR: for any CTT term 4> and variable
f of type n EN,
3gn+lvj(gn+l(j) ~(O,O) Vx(x ~(O,O) x) ~ 4>[tl ~(O,O) vx(x ~(O,O) X)).6
This might also be written as:
3gn+lVj(j E gn+l ~ 4>[tl ~(O,O) T),
where we have abbreviated 'ï/x(x ~(O,O) x) by T.
But now it should be quite dear that we can interpret TT in CTTR restricted
to unary functions and vice versa. This brings precision to our daim that, in the
Grundgesetze, Frege starts from a kind of Cumulative Type Theory, to which he adds
abstraction and application (via the unit dass operator). This result also demon
strates our further daim that CTT reduces to TT with the above simplifying as-
sumptions.
6With the substitution c;b[tl appropriately defined.
105
Bibliography
[1] Peter Aczel. 'Frege structures and the notions of proposition, truth and set'. In
Jon Barwise et al., editor, The Kleene Symposium, volume 101 of Studies in Logic
and the Foundations of Mathematics, pages 31-59. North-Holland Publishing
Company, 1980.
[2] Jon Barwise, edit or. Handbook of Mathematical Logic. Elsevier Science B.V.,
1977.
[3] George Boole. Collected Logical Works, volume II: Laws of Thought. The Open
Court Publishing Company, 1940.
[4] George Boolos. Logic, Logic, and Logic. Harvard University Press, 1998.
[5] Nino B. Cocchiarella. 'Frege's double correlation thesis and Quine's set theories
NF and ML'. Journal of Philosophical Logic, 14:1-39, 1985.
[6] Marcel Crabbe. 'On NFU'. Notre Dame Journal of Formal Logic, 33(1):112-119,
Winter 1992.
[7] Michael Dummett. Frege: Philosophy of Mathematics. Harvard University Press,
1991.
107
[8] Gottfried Gabriel et al., edit or. Gottlob Freges Briefwechsel, mit D.Hilbert,
E.Husserl, B.Russell, sowie ausgewaehlte Einzelbriefe Freges. Felix Meiner Ver
lag, 1980.
[9] Mathieu Marion et Alain Voizard, editor. Frege Logique et Philosophie. Harmat
tan Ine., 1998.
[10] Gottlob Frege. Grundgesetze der A rithmetik, volume II. Verlag von Hermann
Pohle, 1893.
[11] Gottlob Frege. Grundgesetze der Arithmetik, volume 1. Verlag von Hermann
Pohle, 1893.
[12] Gottlob Frege. Funktion, BegrijJ, Bedeutung. Fuenf Logischen Studien. Vanden
hoeek und Rupreeht, 1962.
[13] Gottlob Frege. BegrijJsschrift und andere aufsaetze. Wissensehaftliehe Buehge
sellsehaft Darmstadt, 1964.
[14] T. Hailperin. 'A set ofaxioms for logie'. Journal of Symbolic Logic, 9:1-19,1944.
[15] M. Randall Holmes. Elementary Set Theory with a Universal Set, volume 10 of
Cahiers du Centre de logique. Aeademia, 1998.
[16] M. Randall Holmes. 'Foundations in NFU'. FOM mailing list posting, Mareh
29 1999.
[17] M. Randall Holmes. 'Strong axioms of infinity in NFU'. The Journal of Symbolic
Logic, 66(1):87-116, 2001.
[18] Gottlob Frege; Translated into English by J.L.Austin. Foundations of Arithmetic.
Northwestern University Press, 2nd edition, 1980.
108
[19] Gottlob Frege; Translated into English by P.T Geach and R.H.Stoothoff. Logical
Investigations. Basil Blackwell, 1977.
[20] Ronald Bjorn Jensen. 'On the Consistency of a Slight(?) Modification of Quine's
New Foundations'. Synthese, 19:250-263, 1968-1969.
[21] Steven Orey. 'New Foundations and the Axiom of Counting'. Duke Mathematical
Journal, 31:655-660, 1964.
[22] J. Barkley Rosser. Logic for Mathematicians. Chelsea, 2nd edition, 1973.
[23] Bertrand Russell. The Principles of Mathematics. George Allen and Unwin Ltd.,
2nd edition, 1937.
[24] Bertrand Russell. My Philosophical Development. George Allen and Unwin Ltd.,
1959.
[25] Ernst Schroeder. Vorlesungen ueber die Algerbra der Logik, volume 1.
B.G.Teubner, 1890.
[26] Hans Sluga. Gottlob Frege. Routledge and Kegan Paul, 1980.
[27] Ernst Specker. 'The Axiom of Choice in Quine's New Foundations for Mathemat
ical Logic'. Proceedings of the National Academy of Sciences, USA, 39:972-975,
1953.
[28] Ernst Specker. 'Typical Ambiguity'. Logic, M ethodology and Philosophy of Sci
ence {Proc. 1960 Internat. Congr.}, pages 116-124, 1962.
[29] Willard van Orman Quine. A System of Logistic. Harvard University Press,
1934.
[30] Willard van Orman Quine. 'New Foundations for Mathematical Logic'. The
American Mathematical Monthly, 44(2):70-80, February 1937.
109
[31] Willard van Orman Quine. Mathematical Logic. Harvard University Press, 1951.
[32] Alfred North Whitehead and Bertrand Russell. Principia Mathematica, volume
I-Ill. Cambridge University Press, 1910-1913.
[33] Crispin Wright. Frege 's Conception of Numbers as Objects. Number Two in
Scots Philosophical Monographs. Aberdeen University Press, 1983.
110