Documente Academic
Documente Profesional
Documente Cultură
Ana ysis
lb f 1
(t )dt =f b) - f (a)
n
A Course in
Analysis
-----VoLI----
lntroductory calculus
Analysls of Functions of One Real Variable
Vol. II
Part 3 Differentiation of Functions of Several Variables
Part 4 Integration of Functions of Several Variables
Part 5 Vector Calculus
Vol. III
Part 6 Measure and Integration Theory
Part 7 Complex-valued Functions of a Complex Variable
Part 8 Fourier Analysis
Vol. IV
Part 9 Ordinary Differential Equations
Part 10 Partial Differential Equations
Part 11 Calculus of Variations
Vol. V
Part 12 Functional Analysis
Part 13 Operator Theory
Part 14 Theory of Distributions
Vol. VI
Part 15 Differential Geometry of Curves and Surfaces
Part 16 Differentiable Manifolds and Riemannian Geometry
Part 17 Lie Groups
Vol. VII
Part 18 History of Analysis
Introductory Calculus
Analysis of Functions of One Real Variable
Niels Jacob
Kristian P Evans
Swansea University, UK
A COURSE IN ANALYSIS
Volume I: Introductory Calculus, Analysis of Functions of One Real Variable
Copyright © 2016 by World Scientific Publishing Co. Pte. Ltd.
All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or
mechanical, including photocopying, recording or any information storage and retrieval system now known or to
be invented, without written permission from the publisher.
For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center,
Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from
the publisher.
ISBN 978-981-4689-08-3
ISBN 978-981-4689-09-0 (pbk)
Printed in Singapore
Preface
We are currently living in times where many undergraduates consider the
internet as their main, if not their only source for supporting their academic
studies. Furthermore, many publishers prefer short textbooks directly related
to modules as the best solution for mathematics textbooks. This project,
namely to write and publish a whole course on analysis consisting of up to 6
volumes, therefore, may appear to be going against the grain, perhaps even
a Don Quixote’s style fight against modernity. However the motivation for
developing these volumes has slowly emerged over the last few years by our
observations while teaching analysis to undergraduates.
Five years ago, after long discussions and preparations we changed the un-
dergraduate mathematics provision at Swansea University. We now think
more in terms of courses than modules. Our analysis course runs over five
terms as does our algebra course, and both are compulsory for all students.
Clearly there are still choices and in particular in the final year students can
v
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 2
A COURSE IN ANALYSIS
choose out of quite a few advanced modules. A further, rather important new
feature of the new provision is that we leave (whenever possible) each course
for each cohort in the hands of one lecturer. The students seem to favour
this type of continuity in terms of both the presentation of material and the
lecturer, and more importantly they are performing much better than they
have done in previous years.
Our aim is to provide students and lecturers with a coherent text which can
and should serve entire undergraduate studies in Analysis. The Course can
also be used as a standard reference work. It might be worth mentioning
that for graduate students in analysis such a lack of a modern course was
also felt at no other place but Princeton University. E.M. Stein’s four-volume
course “Princeton Lectures in Analysis” published jointly with R. Shakarchi
between 2003 - 2011 is a response to such a real need, i.e. multiple-volume
courses are by no means out of date, maybe they are needed more than ever
to give students a foundation and a lasting reference for their mathematical
education and beyond.
The first named author has taught mathematics, mainly analysis related
topics, but also geometry and probability theory, for over 38 years at 7 uni-
versities in 2 countries. The material in this course is based on ca. 40
different modules he has taught over the years. For these volumes the mate-
vi
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 3
PREFACE
rial was of course rearranged and amended, but nonetheless to a large extent
they reflect still the provision. This first volume covers first year analysis
as taught by the first named author with the support of the second named
author in Swansea in the academic year 2010/11, an introduction to calculus
and analysis of functions of one variable.
Finally we want to thank all who have supported us in writing this volume,
in particular the World Scientific Press team.
Niels Jacob
Kristian P. Evans
Swansea, January 2015
vii
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 5
Contents
Preface v
3 Mathematical Induction 39
6 Derivatives 91
ix
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 6
A COURSE IN ANALYSIS
21 Differentiation 293
x
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 7
CONTENTS
Appendices 471
References 733
xi
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 9
Endl, K., und Luh, W., Analysis I, 3. Aufl. Analysis II, 2. Aufl. Akade-
mische Verlagsgesellschaft, Wiesbaden 1975 und 1974.
xiii
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 10
A COURSE IN ANALYSIS
Heuser, H., Lehrbuch der Analysis. Teil 1 und 2. B.G. Teubner Verlag,
Stuttgart 1980 und 1981.
For compiling the lists of formulae in some of the appendices we used often
Solved problems are important for students and we used some existing col-
lections of solved problems to supplement our selection. Sometimes these
collections served only to get some ideas, on some occasions we picked prob-
lems but provided different or modified solutions, but here and there we used
complete solutions. The main sources which are very valuable for students
are
Spiegel, M.R., Real Variables. Schaum’s Outline Series Theory and Prob-
xiv
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 11
Niels Jacob
xv
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 13
List of Symbols
N natural numbers
kN := {n ∈ N | n = km for m ∈ N}
N0 := N {0}
Z integers
Q rational numbers
R real numbers
R+ non-negative real numbers
Rn = R × · · · × R set of ordered n-tuples of real numbers
x−1 := x1
xn := x · x · . . . · x (n factors)
1 √
a n or n a nth root of a
n √
x m = m xn
x > 0 x is strictly greater than 0
x < 0 x is strictly less than 0
x ≥ 0 x is non-negative
x ≤ 0 x is non-positive
|x| absolute value of x
∞ infinity
−∞ negative infinity
n! n factorial
n
k
binomial coefficient
max{a1 , . . . , an } maximum of a1 , . . . , an
min{a1 , . . . , an } minimum of a1 , . . . , an
n
aj finite sum of aj
j=1
∞
ak infinite series
k=1
k
aj = am + am+1 + · · · + ak
j=m
xvii
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 14
A COURSE IN ANALYSIS
n
aj finite product of aj
j=1
n
= al · al+1 · . . . · an
j=l
∞
j=1 aj infinite product of aj
X ×Y Cartesian product
∅ empty set
P(X) power set of the set X
∈ belongs to
∈
/ does not belong to
x • y binary operation
⊂ set subset
M1 \ M2 set subtraction
set intersection
set union
A complement of A
=⇒ implies
xRy relation
∼ equivalence relation
[a] equivalence class
∨ or
∧ and
⇐⇒ equivalence (statements)
∀ for all
∃ there exists
¬p negation of p
N
Aj finite union of sets Aj
j=1
N
xviii
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 15
LIST OF SYMBOLS
xix
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 16
A COURSE IN ANALYSIS
xx
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 17
LIST OF SYMBOLS
xxi
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 18
A COURSE IN ANALYSIS
xxii
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 19
xxiii
Part 1
Introductory
Calculus
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 23
1 Numbers - Revision
Before we start with calculus we need to know how to manipulate complicated
expressions of real numbers and above all we must become familiar in doing
this. We urge students to avoid using calculators in this course. The intention
here is to ensure that we understand the basics; much of what is introduced
may seem obvious but the concepts will become very useful later in the book.
In particular, we will need a lot of familiarity in manipulating expressions
where numbers are replaced by functions or later on even by operators. We
start to systematically introduce set theory as a common language in modern
mathematics. Basic notions from logic on which we rely are taught in other
courses, however these are collected in Appendix I.
The natural numbers or positive integers are the numbers
1, 2, 3, 4, 5, . . . (1.1)
n ∈ N, (1.2)
for example
12 ∈ N. (1.3)
The set of all integers is denoted by Z and consists of the numbers
/ N.
−15 ∈ (1.7)
n ∈ N implies n ∈ Z. (1.8)
3
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 24
A COURSE IN ANALYSIS
N ⊂ Z. (1.9)
Clearly, there are other subsets of Z, for example the set of all negative
integers, or the set of all even integers, etc. The rational numbers are
denoted by Q and this is the set of all fractions
k
q= , where k ∈ Z and n ∈ N. (1.10)
n
Examples of fractions are
3 7 −1 −12
, , , , etc. (1.11)
7 7 8 3
We also write − 18 for −1
8
, etc. Note that we face a problem: − 12 3
and − 41
are different formal expressions which represent the same number, and in
addition we want to consider − 41 and −4 to be equal. For now we use a naı̈ve
k
approach where we consider two rational numbers q = m and r = nl as equal
if kn = lm, and further, for 1 we write k. The last identification of k1 with
k
Z ⊂ Q. (1.12)
N ⊂ Q. (1.13)
It is helpful to introduce at this stage the few notions and notations from set
theory that we have used so far in a more systematic way. Unfortunately,
there is no simple and unproblematic way to introduce the general notion of
a set. For our purposes the original definition of G. Cantor is sufficient:
4
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 25
1 NUMBERS - REVISION
m ∈ M, (1.14)
k∈
/ M. (1.15)
Before we can do anything with sets we need to define when two sets M1 and
M2 are equal:
Two sets are equal if and only if they have the same elements.
So far we have introduced the natural numbers, the integers and the rational
numbers. We already know that there are numbers which are not √ rational,
i.e. have no representation as a fraction. Take for example π or 2. We call
these numbers irrational numbers. The real numbers, denoted by R, is
the set consisting of all rational and irrational numbers. Of course, a second
thought shows that this is not a proper definition. However, up until now we
have had a naı̈ve idea of what the real numbers are, for example the points
on a straight line. We will operate with this naiv̈e approach for some time
until we can eventually give a proper definition and characterisation of R.
5
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 26
A COURSE IN ANALYSIS
R \ Q = {x ∈ R | x ∈
/ Q}, (1.21)
for which we do not introduce an extra symbol. Note that (1.20) suggests a
way to characterise sets, for example
is the set of all even natural numbers. Again, it is easier to slowly get used
to this notation than to give a formal definition. The idea is to consider all
those elements of a given set M which share a certain property A, i.e.
Another way to characterise a set is by listing all of its elements, for example
(x + y) + z = x + (y + z), (1.26)
and
x + y = y + x. (1.27)
6
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 27
1 NUMBERS - REVISION
From equality (1.26) we deduce that it does not make any difference whether
we first add x and y together and then add z, or whether we first add y
and z together and then add x. We say that addition of real numbers is
associative and (1.26) is called the associative law of addition. Equality
(1.27) tells us that the order does not matter when adding real numbers, i.e.
addition in R is commutative.
There is one (and only one) real number which is very special with respect to
addition: we may add this number to any other number x ∈ R and the result
is again x. This number is 0 and we consider 0 as the neutral element
with respect to addition, i.e.
Given a real number x, there is always exactly one real number −x such that
x + (−x) = 0. (1.29)
x − y := x + (−y). (1.31)
Note that we have used the symbol “:=” here for the first time. In general
A := B means that A is defined by B, for example we write
2N := {n ∈ N | n = 2m for m ∈ N} (1.32)
(x · y) · z = x · (y · z) (1.33)
and
x·y = y·x (1.34)
hold for all x, y, z, ∈ R. Hence multiplication is associative and com-
mutative. For the time being we write x · y for the product of x and y
but later on we will adopt the usual notation and will just write xy. As in
7
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 28
A COURSE IN ANALYSIS
1 · x = x. (1.35)
x · x−1 = 1. (1.36)
Shortly we will investigate why 0 does not have an inverse element with
respect to multiplication. A further notation for x−1 is
1
:= x−1 , x = 0, (1.37)
x
and for x · y −1 = y −1 · x we write
x
:= x · y −1, y = 0. (1.38)
y
Finally we want to link addition and multiplication. This is done by the law
of distributivity which states that for all x, y, z, ∈ R
x · (y + z) = (x · y) + (x · z). (1.39)
x · (y + z) = x · y + x · z (1.40)
or
x(y + z) = xy + xz. (1.41)
Now we can address the problem why 0 cannot have an inverse with respect
to multiplication. Since 1 − 1 = 0 for any x ∈ R it follows that
0 · x = (1 − 1)x = x − x = 0.
8
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 29
1 NUMBERS - REVISION
z = x • y. (1.42)
The condition that a set A is non-empty will occur quite often. We formally
introduce the empty set ∅ as the set which has no elements and A being
non-empty means
A = ∅. (1.43)
The set R \ {0} is of course non-empty and multiplication on R \ {0} gives
a further binary operation. We write (R, +) and (R \ {0}, ·) to indicate that
we want to consider R with the binary operation “+” and R \ {0} with the
binary operation “·”.
Let us return to R with the algebraic operation addition satisfying (1.26) -
(1.29), the algebraic operation multiplication satisfying (1.33) - (1.36) and
the law of distributivity (1.39). We want to derive some simple rules for
doing calculations. For a, b, c, d ∈ R, b = 0 and d = 0 we have
a c ad + cb
+ = . (1.44)
b d bd
Indeed:
a c
+ = ab−1 + cd−1
b d
bd
= (ab−1 + cd−1 )
bd
1
= ((ab−1 + cd−1 ) · (bd) ·
bd
1
= (ab−1 bd + cd−1 bd) ·
bd
ad + cb
= ,
bd
9
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 30
A COURSE IN ANALYSIS
where we used that for every real number x, x = 0, that xx = 1. (Recall that
1
1
= 1 and xx = 11 if and only if 1 · x = x · 1 which is of course true.) Let us
do an example. For a = 34 , b = 15
8
, c = 92 and d = 23 we find
3 9
a c 4
+ = 8 + 22 ,
b d 15 3
3 15 9 3
· = + ·
4 8 2 2
45 27
= +
32 4
45 · 4 + 27 · 32 1044 261
= = = .
32 · 4 128 32
Here we have already used the general rule that for a = 0 and b = 0 we have
that a −1 b
= , (1.45)
b a
as we know the rule
1 a
a· = for a ∈ R and b ∈ R \ {0}. (1.46)
b b
In addition we know that
a c ac
· = , b = 0, d = 0. (1.47)
b d bd
The rule (1.45) is of some more interest, so let us spend some time on it.
Recall that ab = ab−1 , hence (1.45) claims that (ab−1 )−1 = ba−1 . We can prove
this easily by assuming that the inverse element is uniquely determined:
10
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 31
1 NUMBERS - REVISION
xn := x · x · x · . . . · x (n factors). (1.48)
xn
xn · (xn )−1 = = 1,
xn
but in addition we have
x ·...· x 1
xn · (xn )−1 = = (x · . . . · x) ,
x ·...· x x · ...·x
thus we find
1
(xn )−1 = (1.52)
xn
and we write
x−n := (xn )−1 . (1.53)
The rules (1.49) and (1.50) now extend to all n, m ∈ Z provided that x = 0
and y = 0. If we agree to define
x0 = 1, (1.54)
xk · xl = xk+l (1.55)
and
(x · y)k = xk · y k (1.56)
11
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 32
A COURSE IN ANALYSIS
Indeed, we can extend (1.55) and (1.56) to fractional powers. For x > 0 and
y > 0 and p, q ∈ Q it follows that
xp · z q = xp+q (1.59)
and
(x · y)p = xp · y p . (1.60)
Further, for p = n
m
, n, m ∈ N, and x ≥ 0 we write
n √
x m = m xn . (1.61)
12
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 33
1 NUMBERS - REVISION
− 92 − 72 − 52 − 32 − 12 1
2
3
2
5
2
7
2
9
2
−5 −4 −3 −2 −1 0 1 2 3 4 5
Figure 1.1
At the moment we pretend that there is a one-to-one correspondence between
the points on the real line and the real numbers. If x > 0 we say that x is
positive, we call x negative if x < 0. We write x ≥ 0 if x > 0 or x = 0 and
we write x ≤ 0 if x < 0 or x = 0. It is convenient to add the notation R+ for
all non-negative real numbers, i.e. R := {x ∈ R | x ≥ 0}. If x ≥ 0 we call
x non-negative, if x ≤ 0 we call x non-positive. The following rules hold
for x, y ∈ R:
13
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 34
A COURSE IN ANALYSIS
Here are some simple rules for handling inequalities. For a, b ∈ R and x, y ∈ R
we have:
x > y implies x + a > y + a; (1.76)
x ≥ y implies x + a ≥ y + a; (1.77)
x < y implies x + a < y + a; (1.78)
x ≤ y implies x + a ≤ y + a; (1.79)
x > y and a > b implies x + a > y + b. (1.80)
If x, y ∈ R and a ∈ R, a > 0, then we have:
x ≥ y implies a · x ≥ a · y; (1.82)
x < y implies a · x < a · y; (1.83)
x ≤ y implies a · x ≤ a · y. (1.84)
We also know that
x ≥ y implies a · x ≤ a · y; (1.87)
x < y implies a · x > a · y; (1.88)
x ≤ y implies a · x ≥ a · y. (1.89)
In the next section we will often make use of these rules. Here are some
simple examples:
i)
3 7 3 7 7
≤ , hence 4 · = 3 ≤ = 4 · ,
4 8 4 2 8
however
3 7 7
(−4) · = −3 ≥ − = (−4) · .
4 2 8
ii)
3 + x > 2 + y implies 1 + x > y or y − x < 1.
14
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 35
1 NUMBERS - REVISION
iii) Consider 7x−5 > 21x+30. This inequality is equivalent to 7x > 21x+35,
which is again equivalent to x > 3x + 5, or −5 > 2x, implying x < − 52 . In
fact all these manipulations are reversible. Thus the problem: find all x ∈ R
such that
7x − 5 > 2x + 30
has the solution x ∈ R such that x < − 52 . More formally, the set of solutions
of the inequality
7x − 5 > 2x + 30
is given by
5
x∈R|x<− .
2
In this chapter we have summarised what we may have already learned else-
where about real numbers. We might have even slightly extended these ideas.
Some ideas from set theory have been introduced, further, we occasionally
pointed out that some of the statements and rules we take for granted need
a proper justification, and we indicated some of the more formal aspects,
such as relations to binary operations. In Part 1 of our course we will conse-
quently use the following approach: starting from a basic knowledge we will
gradually move to more and more precision, indicating any gaps in our work
along the way. Eventually we will be prepared for a more mature approach
to mathematics in particular in analysis when entering Part 2.
Problems
1. Is the set {φ} empty?
2. Decide which of the following sets is empty
a) {x ∈ R | x2 = 16 and 2x+3 = 12}, b) {x ∈ Q | x2 = 9 and 3x−6 = 3},
c) {x ∈ R | x = x}, d) {x ∈ Z | x2 = 14 }, e) {x ∈ Q | x2 = 14 }.
3. Given the 3 sets
A = {3, 5, 7, 9, 11}, B = {z ∈ Z | z is odd} and C = {z ∈ Z \ {2} | z is prime}.
15
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 36
A COURSE IN ANALYSIS
8. a) Simplify:
3a + 4(a + b)2 − 6a( 12 + b) − 2b(a + 2b)
1 , a + b = 0.
2
(a + b)
c) Simplify:
a−b 4ab a+b
+ 2
−
a + b (a + b) a−b
(a = b and a = −b).
d) Simplify:
x3 − y 3 1 x y
− y 4x2 3
− +
y−x y x y x
(x = y, x = 0, y = 0).
9. Simplify: 8 12 6
1
9 11
− 29 5
−7
8 3 7
.
3 4
−2
10. Simplify:
2 3 1 4 8 3
( 25 ) −( 38 )
2
a) 3
− 2
+5 9
; b) 19 .
40
16
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 37
1 NUMBERS - REVISION
11. Simplify:
a)
(a + b)3 − (b − a)2 (b + a)
, ab = 0;
4ab
b)
a 3 4
b
− ab
, ab = 0.
a2 b3
12. Find:
√
225 a4 b6
a) 625; b) 49
; c) (a+b)2
, a ≥ 0, b ≥ 0 and a + b = 0.
15. Prove by using the stated rules for addition and multiplication that
1 1 d+b
(a) b
+ d
= d·b
; b = 0, d = 0.
a
a
(b) b
c = b
· dc , b = 0, c = 0, d = 0.
d
(b) Use the fact that for y ≥ 0 there exists exactly one real number
√ √
y ≥ 0 such that ( y)2 = y to find all solutions to the quadratic
equation
ax2 + bx + c = 0.
17
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 39
19
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 40
A COURSE IN ANALYSIS
a2 + b2
|ab| ≤ for all a, b ∈ R. (2.6)
2
We already know
a2 + b2
ab ≤ , (2.7)
2
therefore all we need to show is that
a2 + b2
−ab ≤ .
2
To do this consider (a + b)2 . As before we find
0 ≤ (a + b)2 = a2 + 2ab + b2 ,
a2 + b2
−ab ≤ . (2.8)
2
Thus, (2.7) and (2.8) imply
a2 + b2
|ab| ≤ , (2.9)
2
since |ab| can only take the value ab or −ab.
Here are some rules for handling the absolute value: For x, y ∈ R we find
20
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 41
21
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 42
A COURSE IN ANALYSIS
−|x| 0 |x|
Figure 2.1
Let us change our point of view. Consider on R the set
This set only consists of two points: x and −x. Thus we may use the
absolute value to define subsets of R. We may extend this procedure by
allowing inequalities:
Let ε > 0 and a ∈ R be fixed. Define on R the subset
We want to find all points in R belonging to the set Bε (a). Using the definition
of the absolute value we find
or
|x − a| < ε if and only if − ε + a < x < a + ε.
As the simplest case take a = 0. This means that in Bε (0) we find all points
with absolute value less than ε, or equivalently those whose distance to 0 is
less than ε :
B (0)
0
−
Figure 2.2
22
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 43
But now we see the general interpretation: in Bε (a) we find all points which
have a distance less than ε to a
B (a)
a 0
− + a a+
Figure 2.3
1
B 1 (5) = {x ∈ R| |x − 5| < }
2 2
1 1
= {x ∈ R| − + 5 < x < 5 + }
2 2
9 11
= {x ∈ R| < x < }
2 2
B 1 (5)
2
0 9
5 11
2 2
Figure 2.4
B. Next we look at
2 2
B2 − = x ∈ R| x − − < 2
3 3
2
= {x ∈ R| |x + | < 2}
3
2 2
= {x ∈ R| − 2 − < x < 2 − }
3 3
8 4
= {x ∈ R| − < x < }
3 3
23
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 44
A COURSE IN ANALYSIS
B2 (− 23 )
− 23 0
− 83 4
3
Figure 2.5
0 a b
Figure 2.6
Thus in (a, b) we find all real numbers x which are larger than a and less
than b. For example
0
−3 8
Figure 2.7
With this notation we have
(−ε, ε) = Bε (0)
or more generally
(−ε + a, a + ε) = Bε (a)
for ε > 0 and a ∈ R. Note that the numbers a and b do not belong to (a, b).
Again we can extend our procedure of defining sets. For a, b ∈ R, a < b we
set
[a, b) := {x ∈ R| a ≤ x < b}, (2.16)
which corresponds to
24
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 45
a b
Figure 2.8
Also, we may consider
which corresponds to
a b
Figure 2.9
Finally we introduce
which corresponds to
a b
Figure 2.10
A COURSE IN ANALYSIS
0
−3 2
3
Figure 2.11
or
1 3 1 3
( , ] = {x ∈ R| < x ≤ }
5 4 5 4
0 1 3
1
5 4
Figure 2.12
Often we will encounter the following type of problem: given ε1 > 0 and
ε2 > 0 as well as a1 , a2 ∈ R, find all points x ∈ R such that x ∈ Bε1 (a1 )
and x ∈ Bε2 (a2 ). We have an easy geometric solution to the problem: it may
happen that
a1 a2
a1 − 1 a1 + 1 a2 − 2 a2 + 2
Figure 2.13
or
a2 − 2 a2 + 2
a1 a2
a1 − 1 a1 + 1
Figure 2.14
In the first case Bε1 (a1 ) and Bε2 (a2 ) have no points in common i.e. they are
disjoint. In the second case there are points in the intersection of B2 (a1 )
and B2 (a2 ), i.e. these points belong to both sets. In order to find the points
in the intersection, we must solve simultaneously the inequalities
26
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 47
and
x < a1 + ε1 and x < a2 + ε2
therefore
max{−ε1 + a1 , −ε2 + a2 } < x
and
x < min{a1 + ε1 , a2 + ε2 }.
Thus the solution to (2.20) is
B2 (4)
2 4 6
0 3
1 5
B2 (3) Figure 2.15
but (6,5) is not an interval since 6 > 5, i.e. there are no points belonging to
both sets.
B2 (8)
8
0 1 3 5 6 10
B2 (3) Figure 2.16
27
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 48
A COURSE IN ANALYSIS
The set of all points belonging to Bε1 (a1 ) and to Bε2 (a2 ) is denoted by
and this set is called the intersection of Bε1 (a1 ) and Bε2 (a2 ). In the case where
there are no points in the intersection, i.e. in the case where the intersection
is empty, we write
Bε1 (a1 ) ∩ Bε2 (a2 ) = ∅. (2.22)
We define the intersection of two general sets A and B by
A ∩ B = {x ∈ A | x ∈ B} = {x ∈ B | x ∈ A} = {x | x ∈ A and x ∈ B},
A ∪ B = {x | x ∈ A or x ∈ B}. (2.23)
A ∪ B = {x ∈ X | x ∈ A or x ∈ B}. (2.25)
For example with X = N, A = {1, 2, 3, 5, 7} and B = {3, 4, 5, 8, 9} we find
and
A ∪ B = {1, 2, 3, 5, 7} ∪ {3, 4, 5, 8, 9} = {1, 2, 3, 4, 5, 7, 8, 9}.
Given a set X and a subset A ⊂ X we may form a new set, the complement
of A in X for which we write A and is defined by
A := X \ A = {x ∈ X | x ∈
/ A}. (2.26)
28
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 49
Z N = {z ∈ Z | z ∈
/ N} = {z ∈ Z | z ≤ 0} (2.27)
R N = {x ∈ R | x ∈
/ N} = R \ N (2.28)
and clearly
Z N = R N.
We will use the notation A when it is clear from the context which set X
is meant, i.e. for which X we consider A to be a subset, otherwise we write
X \ A instead of A .
In Appendix II we have collected many results about operations on sets. Here
we summarise some rules and give an outline of some of the proofs. Further
proofs are given in Appendix II. The empty set is a special set, basic rules
for the empty set which are all discussed in Appendix II are: For any set X
the following hold:
X ∪ ∅ = X and X ∩ ∅ = ∅. (2.29)
Further, ∅ ⊂ X for every set X and when considering ∅ as a subset of X we
have ∅ = X. For every set X we have the obvious relations
X ∪ X = X and X ∩ X = X, (2.30)
X ∪ Y = Y ∪ X and X ∩ Y = Y ∩ X. (2.31)
X ∪ Y ⊂ Y ∪ X and Y ∪ X ⊂ X ∪ Y. (2.32)
The next rule for proving such statements is to transform these statements
into a formal logical statement: for example X ∪ Y ⊂ Y ∪ X corresponds to
(x ∈ X ∪ Y ) implies (x ∈ Y ∪ X) (2.33)
29
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 50
A COURSE IN ANALYSIS
or equivalently
(X ∈ X ∪ Y ) =⇒ (X ∈ Y ∪ X). (2.34)
Now let us have a closer look at the statement x ∈ X ∪ Y :
or equivalently
(x ∈ X ∪ Y ) ⇐⇒ (x ∈ X) ∨ (x ∈ Y ). (2.36)
(x ∈ X) ∨ (x ∈ Y ) ⇐⇒ (x ∈ Y ) ∨ (x ∈ X). (2.37)
X ∪ (Y ∪ Z) = (X ∪ Y ) ∪ Z (2.38)
X ∩ (Y ∩ Z) = (X ∩ Y ) ∩ Z (2.39)
X ∪ (Y ∩ Z) = (X ∪ Y ) ∩ (X ∩ Z) (2.40)
and
X ∩ (Y ∪ Z) = (X ∩ Y ) ∪ (X ∩ Z). (2.41)
Let us prove (2.41): we need to prove
X ∩ (Y ∪ Z) ⊂ (X ∩ Y ) ∪ (X ∩ Z) (2.42)
and
(X ∩ Y ) ∪ (X ∩ Z) ⊂ X ∩ (Y ∪ Z). (2.43)
Note that
x ∈ X ∩ (Y ∪ Z) ⇐⇒ (x ∈ X) ∧ (x ∈ Y ∪ Z)
30
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 51
⇐⇒ (x ∈ X) ∧ ((x ∈ Y ) ∨ (x ∈ Z))
⇐⇒ ((x ∈ X) ∨ (x ∈ Y )) ∧ ((x ∈ X) ∨ (x ∈ Z)),
where we used (A.I.10) from Appendix I. However,
(A ) = A, (2.45)
x ∈ (A ) ⇐⇒ x ∈
/ A ⇐⇒ x ∈ A.
(A ∩ B) = A ∪ B (2.46)
and
(A ∪ B) = A ∩ B . (2.47)
We prove (2.46). The fact that x ∈ (A ∩ B) means
x∈
/ A∩B ⇐⇒ (x ∈
/ A) ∨ (x ∈
/ B)
⇐⇒ (x ∈ A ) ∨ (x ∈ B )
⇐⇒ x ∈ (A ∪ B ),
Aj = A1 ∩ · · · ∩ AN . (2.49)
j=1
31
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 52
A COURSE IN ANALYSIS
N
Thus, x ∈ Aj if for at least one j0 ∈ {1, . . . , N} we have x ∈ Aj0 , whereas
j=1
N
x∈ j=1 Aj means that x ∈ Aj for all j ∈ {1, . . . , N}.
i.e.
max{a, c} < x < min{b, d},
or
x ∈ [a, b) ∩ [c, d] if and only if a ≤ x < b and c ≤ x ≤ d,
i.e.
max{a, c} ≤ x < b if b ≤ d
or
max{a, c} ≤ x ≤ d if d < b.
Here max{a, c} stands for the larger number, i.e. the maximum of a and c,
whereas min{b, d} stands for the smaller number, i.e. the minimum of b and
d.
0
-2 3 5 6
Figure 2.17
Note that (2.50) is an equality of sets, namely
32
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 53
a b c d
Figure 2.18
If (a, b) ∩ (c, d) = ∅, then (a, b) ∪ (c, d) is either one of these intervals, namely
(a, b) if (c, d) ⊂ (a, b) or (c, d) if (a, b) ⊂ (c, d)
c a b d
Figure 2.19
or (a, b) ∪ (c, d) = (min(a, c), max(b, d))
(c, d)
(a, b)
a c b d
Figure 2.20
Note that in the case of closed or half-open intervals we may meet some
new possibilities(compared with open intervals). The two intervals (a, b] and
(b, c), for example, do not intersect
however
(a, b] ∪ (b, c) = {x ∈ R | a < x ≤ b or b < x < c}
33
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 54
A COURSE IN ANALYSIS
and
(−∞, ∞) := R. (2.56)
We call “∞” infinity and “−∞” minus infinity and at the moment it is just
a useful name and notation.
We have already used max and min without stating the formal definitions:
a, a ≥ b;
max{a, b} := (2.57)
b, b ≥ a,
and
a, a ≤ b;
min{a, b} := (2.58)
b, b ≤ a.
It is interesting to note that we can express max and min using the absolute
value.
Lemma 2.7. For a, b ∈ R we have
1
max{a, b} = (a + b + |a − b|) (2.59)
2
and
1
min{a, b} = (a + b − |a − b|). (2.60)
2
Proof. We prove (2.59) and leave (2.60) as an exercise. If a ≥ b then
max{a, b} = a. In this case a − b ≥ 0, hence |a − b| = a − b and
1 1
(a + b + a − b) = 2a = a.
2 2
If however b ≥ a then max{a, b} = b. In this case a−b ≤ 0 hence |a−b| = b−a
and we find
1 1
(a + b + b − a) = 2b = b.
2 2
34
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 55
The notations of maximum and minimum easily extend to finite sets of real
numbers. If a1 , · · · , an ∈ R then
and
Definition (2.61) tells us that ak is larger or equal than all other elements
a1 , · · · , an in the set {a1 , · · · , an } and (2.62) says that al is less or equal to
all other elements of the set {a1 , · · · , an }.
|a + b| ≤ |a| + |b|.
−|a| − |b| ≤ |a + b|
and
||a| − |b|| ≤ |a + b|. (2.64)
35
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 56
A COURSE IN ANALYSIS
Proof. First note that (2.64) follows from (2.63) and vice versa. In fact we
may take the real number −b instead of b in (2.63) to find
The proof that (2.64) implies (2.63) follows the same idea.
Now we prove (2.63). By the triangle inequality we know that
|a| = |a − b + b| ≤ |a − b| + |b|
implying
|a| − |b| ≤ |a − b|. (2.65)
On the other hand we have
implying
−(|a| − |b|) ≤ |a − b|, (2.66)
thus together with (2.65) we have
Problems
1. Let X = {a, b, c, d, e, f, g, h, i} and consider the subsets
A = {a, b, c, d}, B = {b, d, f, h} and C = {c, d, e, f }. Find A ,
(A ∩ C) , B \ C, and (A ∪ B) .
36
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 57
6. Calculate
the following
values:
√
a) − 58 ; b) 11
3
− 3 ; c) 7 − 12 ; d) | | − 3| − | − 5| |; e) a2 ,
9 5
a ∈ R.
7. Prove that for every ε > 0 and all a, b ∈ R the following hold
1 2
|ab| ≤ εa2 + b,
4ε
and
1
min{a, b} = (a + b − |a − b|).
2
Furthermore, for a > 0 prove that
1
a+ ≥ 2.
a
|a − c| ≤ |a − b| + |b − c|
and
| |a − b| − |c| | ≤ | |a − b| − c| ≤ |a| + |b| + |c|.
−3 ≤ 7x − 2 < 6x + 5.
|x − 3| ≤ |x + 3|.
37
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 58
A COURSE IN ANALYSIS
2x + 6(2 − x) ≥ 8 − 2x
hold?
b) Find all values of x ∈ R such that
x2 + 2x − 10 < 3x + 2.
38
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 59
3 Mathematical Induction
Mathematics derives new statements from given ones. The underlying pro-
cedure is of course called a proof. It is by no means easy to define what a
(correct) proof is, and there is no need to do this here. For a working math-
ematician a proof reduces to the following: you start with some statements
either being taken for granted to be true (axioms) or already proven (theo-
rems, propositions, lemmata), and then you apply the usual rules of (math-
ematical) logic which we have collected in Appendix I in order to arrive at
new statements. Very often we have to handle statements A(n) depending
on n ∈ N or n ∈ {k ∈ Z | k ≥ m for some m ∈ Z}. For example the
statement A(n) could be
n(n + 1)
A(n) : 1 + 2 + · · · + n = , n ∈ N. (3.1)
2
To prove that such a statement is true for N we cannot just check one-by-
one that it is true for every natural number however we may use a method
called mathematical induction. It is possible to show that this method
is sufficient for proving statements like A(n), n ∈ N, however this involves
looking at the actual construction of N and Peano’s axioms which goes be-
yond the scope of this introductory section. For more about Peano’s axioms
and mathematical induction, see Appendix III. The method of mathemati-
cal induction follows from the axiom of mathematical induction (one of
Peano’s axioms): Suppose that for each n ≥ m, m, n ∈ Z, a mathematical
statement A(n) is given. If A(m) is true and if for all n ≥ m the statement
A(n) implies that the statement A(n + 1) is true, then A(n) is true for all
n ≥ m.
At this stage we will just assume this axiom. An alternative version of the
axiom of mathematical induction is:
Suppose for each n ≥ m, m, n ∈ Z, a statement A(n) is given. If A(m) is
true and if for all n ≥ m the statements A(m), . . . , A(n) imply the statement
A(n + 1), then A(n) holds for all n ≥ m.
In simple terms this means that the method of mathematical induction is
as follows: we begin by showing that A(m) is true for some m ∈ N, usually
m = 0 or m = 1 (base case). Next we assume that A(n) is true for arbitrary
n ≥ m (induction hypothesis) and then prove that A(n+1) is true (induction
step).
39
July 28, 2015 9:26 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 60
A COURSE IN ANALYSIS
Let us start with a simple example to see how we can apply the axiom of
mathematical induction.
40
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 61
3 MATHEMATICAL INDUCTION
= 133(11k + 122n+1 ).
Since 11k + 122n+1 is a natural number, say m, it follows that 11(n+1)+2 +
122(n+1)+1 = 133m, i.e. A(n+1) is correct. Now the principle of mathematical
induction yields that A(n) holds for all n ≥ 0.
This example already gives an insight that mathematical induction as a
method of proving a statement A(n) for all n ≥ m, n, m ∈ Z, is a way
forward. However, depending on the statement A(n) we may need addi-
tional knowledge for the proof that A(n) implies A(n + 1). Indeed this is
already of course the case when proving A(m).
There is a reason why we have not started with proving (3.1). Although the
notation 1 + 2 + · · · + n is intuitively clear, we will introduce a better one.
Suppose that a1 , . . . ak ∈ R, which is shorthand for: suppose that for every
j ∈ {k ∈ N | k ≤ n} we have aj ∈ R. The sum A of these n real numbers is
denoted by
n
A := aj , (3.6)
j=1
41
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 62
A COURSE IN ANALYSIS
42
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 63
3 MATHEMATICAL INDUCTION
i.e. A(1) holds. Now suppose that A(n) holds for arbitrary but fixed n ∈ N.
We want to show that then A(n + 1) holds too. Indeed we have
n+1
n
j= j + (n + 1),
j=1 j=1
and this is already the crucial step since it allows us to use statement A(n),
namely
n+1
n
j= j + (n + 1)
j=1 j=1
43
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 64
A COURSE IN ANALYSIS
n
xn+1 − 1
Now if xj = then
j=0
x−1
n+1
n
xn+1 − 1
xj = xj + xn+1 = + xn+1
j=0 j=0
x−1
n+1
x − 1 xn+1 (x − 1)
= +
x−1 x−1
xn+1 + xn+2 − xn−1 − 1 xn+2 − 1
= = ,
x−1 x−1
Now suppose that (3.18) holds for arbitrary but fixed n ∈ N. We find using
the triangle inequality (2.12) that
n+1 n n
al = al + an+1 ≤ al + |an+1 |
l=1 l=1 l=1
n
≤ |al | + |an+1 |,
l=1
44
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 65
3 MATHEMATICAL INDUCTION
where in the last step we used (3.18) for n. Now the rest is straightforward
since
n n+1
|al | + |an+1 | = |al |,
l=1 l=1
and the first estimate is proved for n + 1 provided it holds for n, hence by
mathematical induction the first estimate holds for all n ∈ N. The second
estimate in (3.18) is proved without induction. Let max {|a
1 |, . . . , |an |} = |ak |
for some 1 ≤ k ≤ n. Replacing each number |al | in nl=1 |al | by |ak | will
increase the sum, i.e.
n
|al | ≤ |ak | + · · · + |ak | ≤ n · max {|a1 |, . . . , |ak |}.
l=1
As in the case for finite sums we can introduce a notation for finite products.
Let a1 , . . . , an ∈ R be given. We denote their product by
n
aj = a1 · a2 · . . . · an . (3.19)
j=1
Clearly, using the associative law for multiplication we have for m < n that
n
m n
aj = aj · aj . (3.20)
j=1 j=1 j=m+1
Note that the second term on the right hand side of (3.20) is an obvious
generalisation of (3.19), compare with the analogous notation for sums, see
(3.15).
Hence, for l < n, l, n ∈ Z, and real numbers al , al+1 , . . . , an we write for their
product
n
aj = al · al+1 · . . . · an . (3.21)
j=l
45
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 66
A COURSE IN ANALYSIS
Further we define
0! = 1. (3.24)
46
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 67
3 MATHEMATICAL INDUCTION
or
1 = 1 + 0.
We can now prove our first non-trival result. The following formulae should
be familiar:
n
n n
(x + y) = xn−k y k . (3.28)
k=0
k
0
0 0 0−k k 0 0 0
(x + y) = 1 and since x y = x y =1
k=0
k 0
the statement A(0) holds. Now we prove that A(n) implies A(n + 1):
47
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 68
A COURSE IN ANALYSIS
n
n n−k k
n+1 n
(x + y) = (x + y) (x + y) = x y (x + y)
k
k=0
n n
n n+1−k k n n−k k+1
= x y + x y
k k
k=0 k=0
n n−1
n n+1−k k n n−k k+1
= xn+1 + x y + x y + y n+1
k k
k=1 k=0
n n
n n+1−k k n
= xn+1 + x y + xn−(k−1) y k + y n+1 (3.29)
k k−1
k=1 k=1
n
n n
= xn+1 + + xn+1−k y k + y n+1
k k−1
k=1
n
n + 1 n+1 0 n n n+1−k k n + 1 0 n+1
= x y + + x y + x y
0 k k−1 n+1
k=1
n + 1
n+1
= xn+1−k y k ,
k
k=0
(x + y)0 = 1,
(x + y)1 = x + y,
(x + y)2 = x2 + 2xy + y 2 ,
48
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 69
3 MATHEMATICAL INDUCTION
(x − y)2 = x2 − 2xy + y 2,
(x + y)3 = x3 + 3x2 y + 3xy 2 + y 3 ,
(x + y)4 = x4 + 4x3 y + 6x2 y 2 + 4xy 3 + y 4 .
Remark 3.12. The binomial coefficients will play an important part in prob-
ability theory and combinatorics.
To obtain this result we argue as follows: in the first sum put the running
index k equal to l = k + 1. Thus, whenever we see k we replace it by l − 1
to get
n−1
n−1
n n
xn−(l−1) y l−1+1 = xn−(l−1) y l
l−1=0
l − 1 l−1=0
l − 1
n
n
= xn−(l−1) y l ,
l=1
l − 1
and now put l = k.
There is still a need to improve our formal definition of the sum of n real
numbers as given in (3.6), the same applies to the definition of their product,
see (3.21). We have to introduce the concept of a recursive definition.
Suppose for m ≤ j ≤ n, m, n ∈ Z, mathematical objects C(j) are defined.
For example C(j) = jl=1 al for 1 ≤ j ≤ n and al ∈ R. It might happen that
we can extend the definition to get a new object C(n + 1). In our example
we may define
n+1
n
C(n + 1) := al := an+1 + al = an+1 + C(n). (3.33)
l=1 l=1
Thus we use the already defined objects C(m), . . . , C(n) to define the new
object C(n+1). If we can extend this to all n ≥ m, i.e. for all n ≥ m, m, n ∈
Z, we can define C(n+1) given C(m), . . . C(n), then we say that the sequence
49
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 70
A COURSE IN ANALYSIS
an+1 := a · an , n ≥ m, a = 0. (3.35)
We can put this into a more formal scheme which indicates that we may prove
by mathematical induction that when defining objects C(j) by recursion we
indeed have defined all the elements of the sequence C(j), j ≥ m. The formal
proof however we omit. If C(j), j ≥ m, j, m ∈ Z, are the objects we want to
define we start with
for example
1
A(1) : al := a1 .
l=1
for example
n+1
n
A(n + 1) : C(n + 1) := al := an+1 + al = an+1 + C(n).
l=1 l=1
Note that we will not always need C(m), . . . , C(n) to define C(n + 1); in our
example C(n) is sufficient. We can interpret A(n) as the statement: given
A(m), . . . , A(n−1), then it is formally possible to define A(n). The proof that
a definition by recursion gives all objects C(n), n ≥ m, must now show that
for all n ≥ m the following holds: if we can formally define A(m), . . . A(n),
then we can also formally define A(n + 1).
Next comes an observation which will force us to be a bit cautious. So far
mathematical statements are objects which we have not really defined, how-
ever we have a naı̈ve but often correct idea of what statements are. Math-
ematical induction was introduced to prove such (naı̈ve) statements. The
50
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 71
3 MATHEMATICAL INDUCTION
Problems
1. a) Use mathematical induction to prove that for k ∈ N ∪ {0}
k 3 + (k + 1)3 + (k + 2)3
is divisible by 9.
b) Prove by mathematical induction that for every integer n ≥ 0
the number
n5 n4 n3 n
+ + −
5 2 3 30
is an integer.
xn − y n = (x − y)Qn (x, y)
(n − 1)xn + y n ≥ nxn−1 y.
2 5
6
1 k k−2 l+1
a) j
; b) (a − a ), a = 1; c) (−1)l .
j=−2
2 k=2 l=1
l
51
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 72
A COURSE IN ANALYSIS
and
N
N
N
aj + bj = (aj + bj ).
j=1 j=1 j=1
b) For x, y ∈ R simplify
5
(x − y) xk y 5−k .
k=0
m
1
c) (a + (j − 1)d) = m(2a + (m − 1)d).
j=1
2
52
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 73
3 MATHEMATICAL INDUCTION
A COURSE IN ANALYSIS
or k1
k
k
1
aj ≤ aj .
j=1
k j=1
k −k
consider a1 · . . . · ak · a2 .
15. Define
1 c
xn := xn−1 + , n ∈ N,
2 xn−1
with c > 0 and x0 := 1. Further set
c
an := , n ∈ N ∪ {0}.
xn
Prove an ≤ an+1 ≤ xn+1 ≤ xn for n ≥ 1.
54
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 75
Example 4.1. A. Suppose that a shop offers n ∈ N items for sale and we
enumerate these items by 1, · · · , n, we may then assign a price to each. Thus
D = {1, · · · , n} and for x ∈ D the new number f (x) denotes the price.
B. For x ∈ D = R we can consider its absolute value |x|, i.e. f (x) = |x|.
C. With D = {x ∈ R|x ≥ 0} we may consider
√
x −→ f (x) = x
Note that when thinking more carefully about the foundations of mathemat-
ics this definition causes some problems. However for now it is absolutely
sufficient for our purposes. We call D the domain of the function f , some-
times we write D(f ) instead of D. Often, if no confusion arises (just as
above) we call f a function and omit the domain and the target set or
co-domain R. Two functions fj : Dj → R, j = 1, 2, are equal if and only if
D1 = D2 and if for all x ∈ D1 = D2 we have f1 (x) = f2 (x). Sometimes it is
useful to write f (·) instead of f .
55
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 76
A COURSE IN ANALYSIS
|.| :R−→R
x −→|x|.
Both are functions but they are not equal since D(f1 ) = D(f2 ).
D. Let k ∈ N0 := N ∪ {0} and a0 , a1 , . . . , ak ∈ R. Then for every x ∈ R we
can construct a new real number by
k
p(x) := aj xj = a0 + a1 x + . . . + ak xk . (4.1)
j=0
{ (x, f (x))| x ∈ D} .
Let us first formalise this idea and then we will use it to give a geometric
interpretation of a function. For x ∈ R and y ∈ R we can form the pairs
(x, y) and (y, x) where it matters whether x or y is in the first position. The
set of all ordered pairs of real numbers is called the Cartesian product
of R with itself and is denoted by R × R or simply by R2 . Thus a ∈ R2 if a
is a pair (x, y) of real numbers, x, y ∈ R. Two pairs (x1 , y1 ) and (x2 , y2) are
56
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 77
√ √
equal if and only if x1 = x2 and y1 = y2 . For example 4, 1 = 2, 1 but
(2, 1) = (1, 2). If D ⊂ R we can define a subset of R by
2
D × R := (x, y) ∈ R2 x ∈ D and y ∈ R , (4.2)
{ (x, f (x))| x ∈ D} ⊂ D × R ⊂ R2 .
For a function f : D −→ R the value at x is the real number f (x) and the
graph Γ(f ) is a subset of the Cartesian product D × R.
Consider the function |.| : R −→ R, x −→ |x|. It is defined for all x ∈ R but
only non-negative real numbers may occur as a value of the function, since
|x| ≥ 0 for x ∈ R. We introduce the range of a function f : D −→ R as the
set
57
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 78
A COURSE IN ANALYSIS
y
3
(−2, 32 ) 2 + (3, 2)
+
1
x
−5 −4 −3 −2 −1 1 2 3 4
−1
−2
−3
+ +
(−4, − 72 ) −4 (1, − 72 )
Figure 4.1
−5
Here are some examples with y = f (x) (or y = g(x), y = h(x)). In the
following figure the function f is the identity on R, g is a parabola, again
defined on R, and h is the square root function which is of course only defined
on R+ = {x ∈ R | x ≥ 0}.
y
g(x) = x2
5 f (x) = x
4 √
h(x) = x
3
2
1
x
−5 −4 −3 −2 −1 1 2 3 4 5 6 7 8
−1
−2
−3 Figure 4.2
58
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 79
In Figure 4.3 the function f is the absolute value with domain R, the function
g is a hyperbola defined on R \ {0}, and h is a cubic polynomial with domain
R. y
1
g(x) = x
6
5 h(x) = x3
4 f (x) = |x|
x
−7 −6 −5 −4 −3 −2 −1 1 2 3 4 5 6
g(x) = x1 −1
−2
−3
−4
−5 Figure 4.3
It is likely you have already seen these graphs before, but there is a non-
trivial question: how do we know that they are correct? A typical domain
D(f ) contains infinitely many points. We cannot calculate all values f (x).
Thus before we can draw the graph we need to understand and discuss the
function f : D −→ R and its behaviour. The following are natural questions:
• are there lower and upper bounds?
• are there local or global extreme values, i.e. maxima or minima?
• is the function monotone?
• is the graph connected?
..
.
59
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 80
A COURSE IN ANALYSIS
The last question arises when looking at f : R\ {0}−→ R,x −→ x1 . The
graph
Γ(f ) has the two components Γ+ (f ) = x, x1 x > 0 and Γ− (f ) =
x, x1 x < 0 , i.e.
Γ (f ) = Γ+ (f ) ∪ Γ− (f ) (4.6)
and in addition
Γ+ (f ) ∩ Γ− (f ) = ∅. (4.7)
y = χA (x)
−1− 1 1 1 2 x
2 2
Figure 4.4
60
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 81
with [x] being the largest integer less or equal than x. Thus [1] = 1, in
general [k] = k for k ∈ Z, but [ 12 ] = 0, [− 12 ] = −1 etc. Note that we always
have x − [x] ∈ [0, 1). Here is the graph of [x] and x − [x]:
[x]
4
2
x − [x]
1
−7 −6 −5 −4 −3 −2 −1 1 2 3 4 5 6
−1
−2
−3
−4
−5 Figure 4.5
where [ indicates that the left end point is included and ) indicates that the
right point is not included. In addition let us consider the new function
f : R −→ R, x −→ x − [x]. Its graph looks periodic with period 1. This
means that f (x + 1) = f (x) for all x ∈ R. Thus for a general function we
may ask whether it is periodic.
There are some simple procedures to construct new functions from given
ones. Let f1 , f2 : D −→ R be given functions. Note that they have the same
domain. We define
i) their sum by
f1 + f2 :D−→R
x −→(f1 + f2 )(x) := f1 (x) + f2 (x) (4.10)
61
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 82
A COURSE IN ANALYSIS
f1 − f2 :D−→R
x −→(f1 − f2 )(x) := f1 (x) − f2 (x) (4.11)
f1 · f2 :D−→R
x −→(f1 · f2 )(x) := f1 (x) · f2 (x) . (4.12)
A problem, in fact a more serious one than one may think at the beginning,
is to define the quotient of two functions f1 , f2 : D −→ R. The idea is to
define
f1 f1 (x)
(x) = for x ∈ D. (4.14)
f2 f2 (x)
However this does not make sense for f2 (x) = 0. We either have to assume
f2 (x) = 0 for all x ∈ D or we can only define ff12 on Dq = {x ∈ D|f2 (x) = 0} .
In fact the situation is more delicate if we look at the simple case where
f1 , f2 : R −→ R, f1 (x) = x and f2 (x) = x for all x ∈ R. Of course ff12 (x)
(x)
=1
for all x = 0 but we would like to extend this so that it also holds for
x = 0. Thus a further problem to study is: when does a given function
f : D −→ R have an extension to a larger domain D1 , D ⊂ D1 ⊂ R? We
62
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 83
is always defined. The same type of argument holds for the difference and
the product of two rational functions, and with the obvious extension in each
case for finitely many ones.
Now look at p(x) = (x − 1)2 and q(x) = (x − 1). Both are polynomials,
63
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 84
A COURSE IN ANALYSIS
f : X → Y
x → f (x)
For example we may take X as the set of all bounded open intervals, i.e.
X := {(a, b) | a < b and a, b ∈ R}, and for Y we may take the non-negative
real numbers, i.e. Y = R+ . Now we may define λ : X → R+ , λ((a, b)) = b−a.
Thus the mapping λ maps every bounded open interval (a, b) ⊂ R onto its
length b − a ∈ R. Another example is the following: Take X = R+ and Y to
be the set of all closed intervals [0, a], a > 0, i.e. Y := {[0, a] | a > 0}. Then
f : X → Y, a → [0, a] is a mapping.
However, as we have already pointed out in the case of functions, the term
“rule” is not well defined. Thus we try something else taking into account
our experience with functions.
Consider the Cartesian product X × Y , i.e.
64
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 85
−4 −3 −2 −1 1 2 3
−1 Figure 4.6
−4 −3 −2 −1 1 2 3
−1
−2
−3
−4 Figure 4.7
65
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 86
A COURSE IN ANALYSIS
R = {(x, x) | x ∈ X} ⊂ X × X
is an equivalence relation.
In other words
Rf = {(x, f (x)) | x ∈ X} ⊂ X × Y
is a generalisation of the graph of a function. Once again X is called the
domain of f , Y is sometimes called the co-domain of f or the target set.
In this sense functions are mappings f : D → R, D ⊂ R. Making this dis-
tinction between mappings and functions may seem artificial, but it might
be helpful in the beginning. As a rough guide, when speaking about func-
tions we mean mappings from some set to the real numbers (or the complex
numbers in later parts).
The range of f or the image of X under f is
66
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 87
f ([−2, −1] ∪ [1, 2]) = f ([−2, −1]) = f ([1, 2]) = [1, 4].
P(X) := {Y | Y ⊂ X}
67
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 88
A COURSE IN ANALYSIS
Problems
1. a) Find the product sets A × B and B × A for A = {3, 4, 5, 6} and
B = {1, 2, 3} and sketch the set in the plane.
b) Prove that N × Z ⊂ R × Q.
c) Let X = {1, 2, 3}, Y = {3, 4, 5}, and Z = {6, 7}.
Find (X ∪ Y ) × Z, X × (Y ∪ Z) and (X × Z) ∩ (Y × Z).
7. Find the power set of: a) the empty set φ; b) the set {1, 2, 3}.
68
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 89
8. Let X be a set with N elements, Prove that the power set P(X) of X
has 2N elements. Use the fact
that
the number of subsets of k elements
of a set with N elements is Nk .
12. For each of the following rational expressions q(x) find the largest set
D ⊂ R such that q : D −→ R is a well defined function. Where appro-
priate try to extend q : D −→ R to a larger domain in a meaningful
way by modifying q.
69
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 90
A COURSE IN ANALYSIS
(ii) In each of the following cases find the image of the indicated set:
√ !
a) f : [0, ∞) −→ R, x → x, find f 14 , 9 ;
x2 −1
b) g : R −→ R, x → x2 +2
, find g({1, 2, 3, 4});
c) h : R −→ R, x → 2 , find h(N).
x
70
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 91
71
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 92
A COURSE IN ANALYSIS
i.e. bijective. This example shows the importance of the domain and the
range of a function when deciding about its injectivity and its surjectivity,
respectively.
B. For a ∈ R consider fa : R −→ R, x −→ x + a. We claim that fa is always
bijective. First, for x = y it follows that fa (x) = x + a = y + a = fa (y), and
secondly, given z ∈ R the equation fa (x) = z, i.e. x + a = z, has the (unique)
solution x = z − a and it follows that fa (z − a) = (z − a) + a = z, i.e. fa is
surjective. The last calculation shows what “determine whether f : D −→ F
is surjective” or “find the range R(f )” really means: we have to solve the
equation f (x) = y for all y ∈ F such that the solution belongs to D.
C. For a = 0 the function ga : R −→ R, x −→ ax is bijective. First note
that for x, z ∈ R, x = z, it follows that ax = az, i.e. ga is injective. To show
that ga is surjective we have to solve for all y ∈ R the equation ga (x) = y,i.e.
ax = y. Clearly the solution is x = ya provided a = 0. Thus ga , a = 0, is
bijective.
Note that in the case where a = 0 the function g0 is the constant function
g0 : R −→ R, x −→ 0. This function is neither injective nor surjective.
In fact for every c ∈ R the constant function hc : R −→ R, x −→ c, i.e.
hc (x) = c for all x ∈ R, is neither injective nor surjective.
D. Let A ⊂ R be a set and consider χA : R −→ [0, 1], the characteristic
function of the set A. For all x ∈ A this function is equal to 1, and for all
x ∈ A it has the value 0. Thus it is neither injective nor surjective: it is not
injective since either A or A has at least two elements and they are mapped
by χA onto the same value. In addition for 12 ∈ [0, 1] there is no x ∈ R such
that χa (x) = 12 , therefore it is not surjective.
E. The absolute value |.| : R −→ R+ is surjective but not injective. Indeed,
for x = −x, i.e. x = 0, we know that |x| = | − x|, i.e. | · | is not injective.
On the other hand, for y ≥ 0 we may take x = y to find |x| = y, showing
surjectivity.
Next we meet some examples considering general mappings.
Example 5.4. A. Let X = N and Y = R. We consider mappings f : N → R,
n → f (n). Such mappings are called sequences of real numbers and it
is convenient to write (f (n))n∈N for such a sequence. Later on we will just
start with a sequence (an )n∈N , an ∈ N, suppressing often that we are working
with a mapping, i.e. that an = f (n) for some f : N → R. Now the question
arises whether a mapping f : N → R can be surjective. The answer is no, a
proof will be given later, see Theorem 18.35.
72
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 93
f2 ◦ f1
f1
f2
f2 (f1 (x))
x
f1 (x)
R(f1 ) = D2
D1 F1 F2
Figure 5.1
73
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 94
A COURSE IN ANALYSIS
Exercise 5.11. Prove that the composition of two injective mappings is in-
jective and that of two surjective mappings is surjective. Deduce that the
composition of two bijective mappings is bijective.
74
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 95
f3 ◦ (f2 ◦ f1 ) : D1 −→ F3 (5.1)
(f3 ◦ f2 ) ◦ f1 : D1 −→ F3 . (5.2)
From (5.1) we find for all x ∈ D1
By Lemma 5.12 we may just write f3 ◦ f2 ◦ f1 for both expressions (5.1) and
(5.2). This clearly extends to finitely many functions.
and then we may form f˜2 |{x|x≥1} ◦ f1 = f2 ◦ f1 . Everyone will agree that
the latter approach is simpler and no confusion will arise when we just write
f˜2 ◦ f1 , which is however an abuse of notation.
75
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 96
A COURSE IN ANALYSIS
f −1 ◦ f = idD (5.7)
and
f ◦ f −1 = idF . (5.8)
76
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 97
77
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 98
A COURSE IN ANALYSIS
and
f1−1 ◦ f2−1 ◦ (f2 ◦ f1 ) = f1−1 ◦ f2−1 ◦ f2 ◦ f1
= f1−1 ◦ idD2 ◦ f1 = f1−1 ◦ idF1 ◦ f1
= f1−1 ◦ f1 = idD1
y-axis
F
f (xj ) = y
y
x1 x2 x3 x4 x-axis
D
Figure 5.2
78
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 99
y-axis
y0
F
f
y1
x1 x-axis
D
Figure 5.3
y-axis
F f
y
x D x-axis
Figure 5.4
79
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 100
A COURSE IN ANALYSIS
y
y=x
b a x
Figure 5.5
y=x
Γ(f )
Γ(f −1 )
F x
D
Figure 5.6
√
For example for . : R+ −→ R+ we find:
80
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 101
f −1 (x) = x2
y=x
5
3 √
f (x) = x
2
x
−1 1 2 3 4 5 6
−1 Figure 5.7
(f ± g) ◦ h := f ◦ h ± g ◦ h, (5.12)
(f · g) ◦ h := (f ◦ h) · (g ◦ h) , (5.13)
and if g(y) = 0 for all y ∈ D2
f f ◦h
◦ h := . (5.14)
g g◦h
For example
√ we may consider h : √ R −→ R+ , x −→ |x|, f : R+ −→ R,
x −→ x and g : R+ −→ R, x −→ 1 + x where we get
((f ± g) ◦ h) (x) = |x| ± 1 + |x|
((f · g) ◦ h) (x) = |x| 1 + |x|
f |x|
◦ h (x) = .
g 1 + |x|
81
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 102
A COURSE IN ANALYSIS
R is symmetric:
ARB and BRA : if ARB then there exists a bijective
f : A → B, but f −1 : B → A is bijective too.
R is transitive:
ARB and BRC means that there exists f : A → B and
g : B → C both bijective, then g ◦ f : A → C is bijective
too.
This is one of the most important equivalence relations which had an enor-
mous influence on the historical development of set theory. We will return
to it later.
To proceed further we need the following considerations. Let X = ∅ be a set
and “∼” an equivalence relation on X. Let a ∈ X. We denote by [a] the set
of all x ∈ X with x ∼ a, i.e.
[a] := {x ∈ X | x ∼ a} (5.15)
and we call [a] the equivalence class of a or generated by a. A partition of
a set X is a set of subsets of X such that every element of X belongs to only
one of these subsets, for example {1, 2}, {3, 4}, {5} would be a partition of
{1, 2, 3, 4, 5}, however neither {1}, {3, 4} nor {1, 2}, {2, 3, 4}, {5} would be.
Formally, we call a family of sets Z ⊂ P(X) a partition of X if
1. every x ∈ X belongs to some Z ∈ Z, i.e. for x ∈ X there exists Z ∈ Z
such that x ∈ Z or
X= Z,
Z∈Z
with
Z := {x ∈ X | x ∈ Z for some Z ∈ Z}
Z∈Z
82
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 103
Z1
X
Z2
Z4
Z3
Figure 5.8
83
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 104
A COURSE IN ANALYSIS
and
2. X = [r] = {x ∈ X | x ∼ r}.
r∈R r∈R
84
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 105
Clearly fZN is injective, k = k implies fZN (k) = fZN (k ). However fZN is also
surjective: given n ∈ N, if n is even, i.e. n = 2k, k ∈ N, then fZN (k) = n. If
n is odd, i.e. n = 2k + 1, k ∈ N, then fZN (−k) = n.
The case of Q is more involved and we only indicate the idea of showing how
to prove that all non-negative fractions can be mapped bijectively onto N.
Note that there is a lot of multiple counting in the following scheme, i.e. we
need to refine the counting process. This enumeration scheme is due to G.
Cantor who is together with R. Dedekind the founder of set theory; one of
the greatest intellectual achievements of mankind.
0 1 2 3 4 5
1 1 1 1 1 1
0 1 2 3 4 5
2 2 2 2 2 2
0 1 2 3 4 5
3 3 3 3 3 3
0 1 2 3 4 5
4 4 4 4 4 4
0 1 2 3 4 5
5 5 5 5 5 5
0 1 2 3 4 5
6 6 6 6 6 6
Figure 5.9
85
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 106
A COURSE IN ANALYSIS
famous continuum hypothesis (CH) states that such a set does not exist.
So far a proof does not exist, however K. Gödel proved that in the standard
model of set theory which is denoted by ZFC, where Z stands for E. Zermelo,
F for A. Fraenkel and C for the Axiom of Choice, CH cannot be disproved.
Some thirty years later P. Cohen proved that CH is independent of ZFC.
Problems
1. Decide whether or not the following functions are injective, surjective
or bijective. Sketch the graph in each case.
a)
f1 : R −→ R+
x −→ |x − 3| + 2;
b)
f2 : [1, ∞) −→ (0, 2]
2
x −→
x
where for a ∈ R we write [a, ∞) = {x ∈ R | x ≥ a};
c)
f3 : [−2, 7] −→ [0, 3]
√
x −→ x + 2
p p
2. a) Consider the mapping g : Q −→ Z, q
−→ g q
= p + q. Is g
injective, surjective or bijective?
b) Let r : R × R −→ R × R, (x, y) −→ r(x, y) := (y, x). Test r for
injectivity, surjectivity and bijectivity.
3. Given f : R −→ R, x −→ 5x2 − 2x + 1, and g : [−5, ∞) −→ R,
a) √
x −→ 5 + x. Find f ◦ g : [−5, ∞) −→ R.
Consider f : R −→ R, x −→ |x + 3| − 2 and h : R −→ R,
b) √
x −→ x4 + 2. Find the largest sets D1 ⊂ R and D2 ⊂ R such that
we can form f ◦ h : D1 −→ R and h ◦ f : D2 −→ R. In each case give
a formula for the function, i.e. for f ◦ h and h ◦ f .
set D ⊂ R where we can define f ◦ h where
c) Find the largest √
f : [0, ∞) −→ R, x −→ x and h : R −→ R, x −→ |x + 2| − 1.
86
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 107
6. Let f : X −→ Y be a mapping.
a) Prove that f is injective if and only if there exists a mapping
g : Y −→ X such that g ◦ f = idX .
b) Prove that f is surjective if and only if there exists a mapping
h : Y −→ X such that f ◦ h = idY .
|f (x)| + f (x)
f + : X −→ R, x −→ f + (x) :=
2
and
|f (x)| − f (x)
f − : X −→ R, x −→ f − (x) := ,
2
87
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 108
A COURSE IN ANALYSIS
which are called the positive part of f and the negative part of f
respectively. Prove that f + (x) ≥ 0 and f − (x) ≥ 0 for all x ∈ X and
f = f + − f − , |f | = f + + f − .
88
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 109
X × Y × Z = {(x, y, z) | x ∈ X ∧ y ∈ Y ∧ z ∈ Z}.
Prove that
J : (X × Y ) × Z −→ X × Y × Z
((x, y), z) −→ (x, y, z)
is a bijective mapping. (Note that by definition (x, y, z) = (x , y , z ) if
and only if x = x , y = y and z = z .)
Remark: for finitely many sets A1 , . . . , AN we can define their carte-
sian product by
A1 × . . . × AN := {(a1 , . . . , aN ) | a1 ∈ A1 , . . . , aN ∈ AN },
or more formally
Rn := R × . . . × R (n terms);
Zm := Z × . . . × Z (m terms)
and more generally
Ak := A × . . . × A (k terms).
89
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 111
6 Derivatives
We want to study real-valued functions f : D → R, D ⊂ R, more closely.
For example we would like to know whether f is monotone increasing or
decreasing, attains local extreme values, has zeroes, etc. For all this and for
many more problems the concept of the derivative is very helpful. We will
spend some time on the construction of the derivative which we will formally
define in Definition 6.2. The central idea is to substitute locally, i.e. in a
neighbourhood of a point x0 ∈ D, the graph Γ(f ) of a function f by a straight
line, more precisely by the graph Γ(g) of a function g : R → R, x → ax + b.
y
Γ(g1 )
Γ(g2 )
Γ(g3 )
Γ(f )
y0 = f (x0 )
D = [a, b]
Γ(g4 )
a x0 b x
Figure 6.1
As Figure 6.1 shows, many straight lines are possible. We have already
indicated one condition that we want to impose: if x0 ∈ D is the point of
interest i.e. if we want to replace Γ(f ) in a neighbourhood of x0 by Γ(g),
then (x0 , f (x0 )) should lie on the straight line being selected.
The equation of a straight line passing through (x0 , f (x0 )) can be obtained as
follows. A straight line should be interpreted as the graph Γ(g) of a function
g : R → R, x → g(x) = ax + b. The condition that (x0 , f (x0 )) ∈ Γ(g)
means
g(x0 ) = ax0 + b = f (x0 ) (6.1)
91
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 112
A COURSE IN ANALYSIS
which is one equation for the two unknown a and b. Thus we need a further
condition to determine g, i.e. Γ(g). Since our aim is to substitute locally, i.e.
in a neighbourhood of x0 ∈ D, Γ(f ) by Γ(g), we may argue as follows. For
|x − x0 | small we should have
f (x) ≈ g(x) = ax + b, (6.2)
where “ ≈ ” stands at the moment for “f (x) being close to g(x)”. Of course,
in addition to (6.2) we assume (6.1).
Thus for |x − x0 | small we should have
f (x) − f (x0 ) ≈ a(x − x0 ), (6.3)
which we obtain by subtracting (6.1) from (6.2). For x = x0 this yields
f (x) − f (x0 )
= a + error. (6.4)
x − x0
Now if |x − x0 | tends to 0 then the error should also go to 0. This would
determine a and from (6.1) we can now calculate b to be
b = f (x0 ) − ax0 . (6.5)
We need to be precise by what “the error goes to 0 as |x − x0 | goes to 0”
means. Before this we give a geometric interpretation for our considerations.
We have the intuitive idea of a tangent to a given curve, in the case of a
circle we can even give a precise definition:
(x0 , y0 )
(0, 0)
Figure 6.2
92
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 113
6 DERIVATIVES
A straight line is a tangent to a circle at the point (x0 , y0) if the point
(x0 , y0 ) belongs to (the graph of) this straight line and this straight line is
perpendicular (later we will also say orthogonal) to the straight line through
the centre of the circle and the point (x0 , y0 ).
For a general curve we cannot use this definition, but we may do the following:
consider the graph Γ(f ) of f : D → R, x0 ∈ D.
y − axis
Γ(f )
Γ(g)
y0 = f (x0 )
a x0 b x − axis
Figure 6.3
93
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 114
A COURSE IN ANALYSIS
y − axis
Γ(f )
Γ(g̃)
y1 = f (x1 )
y0 = f (x0 )
a x0 b x − axis
x1
Figure 6.4
This straight line is completely determined by the two conditions:
g̃(x0 ) = f (x0 ) = ãx0 + b̃; (6.6)
and
g̃(x1 ) = f (x1 ) = ãx1 + b̃. (6.7)
This leads to
f (x0 ) − f (x1 )
ã = (6.8)
x0 − x1
and
f (x1 )x0 − f (x0 )x1
b̃ = . (6.9)
x0 − x1
Thus the error term in (6.4) should be given by |a − ã|. Intuitively we now
take a sequence of points (xν , f (xν )), ν ∈ N, on Γ(f ), xν = x0 for all ν ∈ N,
tending to (x0 , f (x0 )) and consider the corresponding straight lines gν (x) =
aν x + b with
f (x0 ) − f (xν )
aν = (6.10)
x0 − xν
and
f (xν )x0 − f (x0 )xν
bν = . (6.11)
x0 − xν
94
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 115
6 DERIVATIVES
y − axis
Γ(f )
Γ(g3 )
Γ(g2 )
Γ(g1 )
y = f (x0 )
a x1 x2 x3 x0 b x − axis
Figure 6.5
We may think that the tangent is just the “limit line”. However, here we
encounter one of the main problems in analysis. Do we know that the “limit”
exists?
Having these preliminaries in mind we now do the preparations needed for
correct and precise statements. We need to understand the concept of a limit
of a function:
lim f (y) = a. (6.12)
y→x
or
lim |f (y) − a| = 0. (6.14)
y→x
The latter means: given a small error bound ε > 0, if y is close to x then
|f (y) − a| < ε.
95
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 116
A COURSE IN ANALYSIS
if for every ε > 0 there exists δ > 0 such that 0 < |x − y| < δ implies
|f (y) − a| < ε.
We will see later in Part 2 that this definition yields the following simple
rules for limits:
Let f, g : D → R be two functions and assume that
and
lim g(y) = b (6.17)
y→x
then we have
as well as
lim (f · g)(y) = lim f (y) · lim g(y) = a · b. (6.19)
y→x y→x y→x
lim f (y)
f (y) y→x a
lim = = . (6.20)
y→x g(y) lim g(y) b
y→x
(Note that we will improve (6.20), we will need only the assumption that
b = 0.)
Example 6.1. A. Consider the constant function hc : R → R, x → c, i.e.
hc (x) = c for all x ∈ R. Then |hc (y) − hc (x)| = |c − c| = 0 and therefore
whatever the value of |x − y| is, |hc (y) − hc (x)| < ε for every ε > 0. Hence
Indeed, consider
96
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 117
6 DERIVATIVES
i.e.
|fa (y) − fa (x)| < ε.
Thus p is the finite sum of finite products of functions for which we know the
limits, hence
lim p(y) = p(x). (6.23)
y→x
x
Figure 6.6
Suppose that limy→0 χ(0,∞) (y) = a for some a ∈ R. Then for every > 0 there
exists δ > 0 such that |y| < δ implies |χ(0,∞) (y) − a| < , i.e. −δ < y < δ
implies |χ(0,∞) (y) − a| < . Now for −δ < y < 0 we have χ(0,∞) (y) = 0,
implying that |a| < for all > 0, i.e. a must be equal to 0. However,
for 0 < y < δ, if we have χ(0,∞) (y) = 1 and with a = 0 we would have
|1 − 0| = 1 < for every > 0. This is of course a contradiction. Therefore
lim χ(0,∞) (y) does not exist.
y→0
97
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 118
A COURSE IN ANALYSIS
We now return to our original problem and study for a given function f :
D → R the limit
f (y) − f (x) f (x) − f (y)
lim = lim . (6.24)
y→x y−x y→x x−y
f (y) − f (x0 )
f (x0 ) = y→x
lim . (6.26)
0
y=x0
y − x0
df (x0 ) df
= f (x0 ) or (x0 ) = f (x0 ). (6.27)
dx dx
i.e.
hc (x0 ) = 0 for all x0 ∈ R. (6.28)
98
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 119
6 DERIVATIVES
i.e.
f (x0 ) = a for all x0 ∈ R. (6.29)
C. Let g : R → R, x → ax , a ∈ R. Using the formula
2
y 2 − x20 = (y + x0 )(y − x0 )
we get for x0 ∈ R
g(y) − g(x0 ) ay 2 − ax20 a(y 2 − x20 )
lim = y→x
lim = y→x
lim
y→x 0
y=x0
y − x0 0
y=x0
y − x0 0
y=x0
y − x0
a(y + x0 )(y − x0 )
= y→x
lim = y→x
lim a(y + x0 ) = 2ax0
0
y=x0
y − x0 0
y=x0
i.e.
g (x0 ) = 2ax0 . (6.30)
D. We want to differentiate the function h : R \ {0} → R, x → 1
x
. For x0 = 0
we find
1 1 x0 −y
y
− x0 y·x0
lim = y→x
lim
y→x0
y=x0
y − x0 0
y=x0
y − x0
−1 1
= y→x
lim = − 2,
0
y=x0
y · x0 x0
i.e.
1
h (x0 ) = − , x0 = 0. (6.31)
x20
Recall that by assumption y ∈ D(h) = R \ {0}, i.e. y = 0.
Note that in all these examples we can find the derivative for all points in
the domain. Thus in each case we can define a new function. Therefore we
give
99
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 120
A COURSE IN ANALYSIS
where we write lim a for lim ha (y) and ha (y) = a for all y ∈ R. Now we
y→x0 y→x0
prove (6.37)
(f ± g)(y) − (f ± g)(x0 ) f (y) − f (x0 ) g(y) − g(x0 )
lim = y→x
lim ±
y→x0
y=x0
y − x0 y=x0
0 y − x0 y − x0
f (y) − f (x0 ) g(y) − g(x0 )
= y→xlim ± y→x
lim
y=x
0 y − x0 y=x
0 y − x0
0 0
= f (x0 ) ± g (x0 ).
100
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 121
6 DERIVATIVES
To proceed further, we need the following simple but far reaching observation.
g(y) − g(x0 )
g(y) − g(x0 ) = (y − x0 ).
y − x0
g(y) − g(x0 )
Now y→x
lim = g (x0 ) and y→x
lim (y − x0 ) = 0. Consequently we have
0
y=x0
y − x0 0
y=x0
g(y) − g(x0 )
lim (g(y) − g(x0 )) = y→x
lim lim (y − x0 ) = 0,
y→x0
y=x0
0
y=x0
y − x0 y→x0
y=x0
or
lim g(y) = g(x0 ).
y→x0
y=x0
Now we can prove Leibniz’s rule, which is also known as the product rule.
101
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 122
A COURSE IN ANALYSIS
(f g)(y) − (f g)(x0 )
lim
y→x0
y=x0
y − x0
f (y) − f (x0 ) g(y) − g(x0 )
= y→x
lim lim g(y) + y→x
lim f (x0 ) y→x
lim
0
y=x0
y − x0 y→x0
y=x0
0
y=x0
0
y=x0
y − x0
= f (x0 )g(x0 ) + f (x0 )g (x0 ).
With Lemma 6.7 in mind we add a new, central concept to our considerations.
The class of continuous functions is much larger than the class of differen-
tiable functions and we will discuss these functions in greater detail later on.
We will also give an example of a continuous function which is not differen-
tiable.
102
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 123
6 DERIVATIVES
N
N
j−1
p (x) = aj jx = aj jxj−1 . (6.41)
j=0 j=1
(aj xj ) = aj jxj−1 ,
103
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 124
A COURSE IN ANALYSIS
D. For n ∈ N we claim
1
(x−m−1 ) = ( (x−m ))
x
1 1
= − 2 (x−m ) + (x−m )
x x
−m−2 1
= −x − mx−m−1
x
−m−2
= −(m + 1)x ,
proving (6.42).
In the next chapter we will discuss more examples after having investigated
the derivatives of composed functions.
104
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 125
6 DERIVATIVES
Problems
1. Using the rules (6.18) − (6.20) for limits prove:
5 1 1 − x2
a) lim3 3
x2 − 7
12
x = ; b) lim = 2;
x→ 4 2 x→1 1 − x
x3 − 4x2 + 7x − 13 2
c) lim = .
x→3 − 75 x2 + 1+x
1
2 25
x2 − 2x + 5 x2 − 9
a) lim ; b) lim .
x→4 x−2 x→−3 (x + 5)(x + 3)
f : R −→ R,
x sin x1 , x = 0
x →
0, x=0
is continuous at x = 0.
105
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 126
A COURSE IN ANALYSIS
8. Using rules (6.36) − (6.38) as well as Example 6.12 find the derivatives
of the following functions:
a) f : (1, 5) −→ R, f (x) = 75 x2 − 2
x3
;
t7 +12t3 −2
b) g : (1, 2) −→ R, g(t) = t5
;
M
c) h : (2, 7) −→ R, h(s) = js−j , M ≥ 2.
j=1
106
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 127
7 Derivatives Continued
In this chapter we want to extend the number of rules for calculating deriva-
tives. Before doing this, let us agree to a slight simplification in our notation.
In the following we will often write
f (y) − f (x0 ) f (y) − f (x0 )
lim instead of lim ,
y→x0 y − x0 y→x0
y=x0
y − x0
hence
f (x) = 4x3 + 4x.
If we instead consider the function f˜k : R −→ R, x −→ (x2 + 1)k , k ∈ N,
the calculation becomes more involved, we first calculate (x2 + 1)k = . . . and
then take the derivative. Note that f and f˜k are composed functions. With
g̃ : R −→ R, x −→ x2 + 1, we find f (x) = (g̃(x))2 and f˜k (x) = (g̃(x))k . Thus
with hk (y) = y k we have f = h2 ◦ g̃ and f˜k = hk ◦ g̃. We aim to express for
an arbitrary composed function f = h ◦ g its derivative by using those of h
and g. Note that hk (x) = kxk−1 is simple to calculate as is g̃ (x) = 2x.
√ √
Example 7.2. Consider . : R+ −→ R, x −→ x. We want to calculate
√
the derivative of . at x0 ∈ R+ . Thus we have to look at
√ √ √ √ √ √
x − x0 ( x − x0 )( x + x0 )
= √ √
x − x0 (x − x0 )( x + x0 )
x − x0
= √ √
(x − x0 )( x + x0 )
1
= √ √ .
x + x0
√ √
Assuming lim x = x0 we find for x0 = 0
x→x0
√ 1
( x) x=x0 = √ ;
2 x0
107
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 128
A COURSE IN ANALYSIS
or
1 1
(x1/2 ) = 1/2
= x−1/2 , x > 0. (7.1)
2x 2
Now we want to calculate dxd
p(x), where p : R −→ R is a differentiable
function with R(p) ⊂ {x ∈ R|x > 0}.
Consider
p(x) − p(x0 ) ( p(x) − p(x0 ))( p(x) + p(x0 ))
=
x − x0 (x − x0 )( p(x) + p(x0 ))
p(x) − p(x0 ) 1
= · ,
x − x0 p(x) + p(x0 )
and for x −→ x0 we find assuming lim p(x) = p(x0 ), that
x−→x0
p(x) − p(x0 ) 1
lim = p (x0 ).
x−→x0 x − x0 2 p(x0 )
√
If we write for a moment g(x) = x the above result reads as
g(p(x)) − g(p(x0 ))
(g ◦ p) (x0 ) = lim
x−→x0 x − x0
p(x) − p(x0 )
= lim
x−→x0 x − x0
1
= p (x0 ) = g (p(x0 )) · p (x0 ),
2 p(x0 )
√
where we used that g (x) = ( x) = 2√1 x , which we still need to prove.
The previous example suggests the following general result:
108
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 129
7 DERIVATIVES CONTINUED
Proof. First recall that by Corollary 6.10 both functions h and f are contin-
uous. In particular we have
However, there is a problem: h(x) − h(x0 ) = 0 need not be true. Indeed the
term h(x) − h(x0 ) could be zero for infinitely many values. Thus we have to
modify the proof. Define the function
f (y)−f (y0 )
∗ y−y0
for y = y0
f (y) := . (7.3)
f (y0) for y = y0
Then we have
lim f ∗ (y) = f ∗ (y0 ) = f (y0 ) (7.4)
y−→y0
109
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 130
A COURSE IN ANALYSIS
and further
f (y) − f (y0 ) = f ∗ (y)(y − y0 ). (7.5)
Now it follows that
f (h(x)) − f (h(x0 ))
(f ◦ h) (x0 ) = lim
x−→x0 x − x0
∗
f (h(x))(h(x) − h(x0 ))
= lim
x−→x0 x − x0
h(x) − h(x0 )
= lim f ∗ (h(x)) lim
x−→x0 x−→x0 x − x0
= f (h(x0 ))h (x0 ),
where we used that lim h(x) = h(x0 ) and therefore lim f ∗ (y) = f (y0 )
x−→x0 y−→y0
implies lim f ∗ (h(x)) = f (h(x0 )).
x−→x0
B. Let g : R
−→ R, g(x) = 0 for all x ∈ R, be differentiable. We want
to find g(·) (x). With f (x) = x1 for x = 0 the function x −→ g(x)
1 1
is the
composed function x −→ (f ◦ g)(x).
Therefore we find
1
(x) = (f ◦ g)(x) = f (g(x)) · g (x)
g(·)
1 g (x)
= − 2 · g (x) = − ,
g (x) g(x)2
i.e.
1 g
= − 2, g = 0. (7.6)
g g
110
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 131
7 DERIVATIVES CONTINUED
111
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 132
A COURSE IN ANALYSIS
We will provide a complete proof of this theorem later in our course but for
the moment we take this result for granted.
Example 7.6. Let f : R+ −→ R+ , x −→ x2 . For x = 0 we have f (x) √=
2x = 0. The inverse function f −1 is of course f −1 : R+ −→ R+ , x −→ x.
√
From (7.7) we derive with y0 = x0 , i.e. y0 = x20 that
√ 1 1
( y 0 ) = = √
2x0 2 y0
|x|
Suppose that lim = a for some a ∈ R. Then for = 12 there exists
x→0
x=0
x
δ > 0 such that for all x ∈ R with |x| < δ, i.e. −δ < x < δ, it follows that
112
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 133
7 DERIVATIVES CONTINUED
|x|
x − a < 12 . In particular, for −δ < x < 0 we have | − 1 − a| < 1
2
and for
1
0 < x < δ we have |1 − a| < 2
, i.e.
1 1 1 1
− < −1 − a < and − < 1 − a <
2 2 2 2
implying that − 32 < a < 12 and 12 < a < 32 which is a contradiction. Therefore
x → |x| is not differentiable at x0 = 0.
The continuity of x → |x| at x0 = 0 is trivial. We just need to prove that
lim |x| = 0, i.e. given > 0 we need to find δ > 0 such that |x| < δ implies
x→0
||x| − |0|| = |x| < . Thus δ = will do.
Problems
To solve these problems knowledge√ of derivatives of rational functions and
the square root function x → x, x > 0 may be used. Moreover, while
solving these problems results from previous questions may be used without
proof or justification.
√
1. Consider the function hk : (0, ∞) −→ R, k ∈ N, hk (x) = xk . Find
d
h (x).
dx k
113
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 134
A COURSE IN ANALYSIS
1
5. In
√ the following denote the inverse function of x → xk by x → x k =
k
x, x > 0 for k ∈ N. Find the derivatives of:
1
i) f : R −→ R, f (s) = (1 + s2 ) k ;
√
1+t4
ii) g : R −→ R, g(t) = √
5
1+t6 +t8
;
1
iii) h : (0, ∞) −→ R, h(u) = u 7 .
1+u2
1+u4
l 1 √
6. For x > 0, k ∈ N and l ∈ N0 we set x k = xl · x k = xl k x. Find the
derivatives of: 3
l 2 )− 2
i) f : (0, ∞) −→ R, f (x) = x k ; ii) g : R −→ R, g(s) = (1+s
(1+s )4 5 .
1
8. Find the derivative of g : (−1, 1) −→ R, where g(t) = (t2 − 1)(2t + 3) 2 .
114
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 135
x0 x1 x-axis
Figure 8.1
It looks like the function is unbounded but at x0 the function has a local
maximum and at x1 it has a local minimum.
We want to find criteria for these properties to hold. For this we first need
some definitions.
Definition 8.2. A function f : D → R is said to be bounded if there exists
M ≥ 0 such that |f (x)| ≤ M for all x ∈ D.
Example 8.3. A. The function χA : R → R is for every set A ⊂ R bounded
since |χA (x)| ≤ 1 for all x ∈ R.
B. The function | · | : R → R, x → |x| is unbounded. Indeed suppose there
exists M ≥ 0 such that |x| ≤ M for all x ∈ R. Then for x = M + 1 we would
find
|M + 1| = M + 1 ≤ M
115
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 136
A COURSE IN ANALYSIS
which is a contradiction.
Definition 8.4. Let f : (a, b) → R be a function, a < b. We say that f has
a local maximum at x0 ∈ (a, b) (a local minimum at x1 ∈ (a, b)) if there
exists > 0 such that f (x0 ) ≥ f (y) for all y ∈ (a, b) satisfying |x0 − y| <
(f (x1 ) ≤ f (y) for all y ∈ (a, b) such that |x1 − y| < ).
In the case that f has either a local maximum or a local minimum at x2 ∈
(a, b) we just speak of a local extreme value or a local extremum at x2 .
Of central importance is:
Theorem 8.5. Suppose that f : (a, b) → R has a local extremum at x0 ∈
(a, b). If f is differentiable at x0 then f (x0 ) = 0.
This result fits well to our imagination, look at the graph
y-axis
x0 x1 x-axis
Figure 8.2
116
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 137
is a (local) minimum at x0 = α.
y-axis
f (x) = x3
x-axis
Figure 8.3
This example shows that Theorem 8.5 is a necessary but not a sufficient
condition for a local extreme value.
C. The function | · | : R → R, x → |x| is for all x > 0 or x < 0 strictly
positive whereas |0| = 0. Thus at x0 = 0 it has a local minimum. However
the absolute value is not differentiable at x0 = 0, compare Example 7.7. Thus
we cannot apply Theorem 8.5.
Theorem 8.5 only gives a necessary condition for local extreme values to
exist. We want to find now sufficient criteria for local maxima and minima.
It turns out that for this we need higher order derivatives.
Let f : D → R be a function such that f (x) exist for all x ∈ D. Then we
can consider f as a new function f : D → R, x → f (x). Next we may
ask whether f has at x0 ∈ D a derivative, i.e. whether
f (x) − f (x0 )
lim = f (x0 ) (8.1)
x→x0
x=x0
x − x0
117
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 138
A COURSE IN ANALYSIS
dk f
k
(x0 ) = f (k) (x0 ) = (f k−1 ) (x0 )
dx
f k−1 (x) − f k−1(x0 )
= x→x
lim
0
x=x0
x − x0
0
as the k th derivative of f at x0 . By definition: f (0) = ddxf0 = f.
Note that the definition of higher order derivatives is a definition by recursion:
k−1
dk d d
k
f (x) := f (x).
dx dx dxk−1
Clearly we may extend our rules for taking derivatives to higher order deriva-
tives. Here are some of the simple ones:
dk dk f dk g
(f ± g) = ± (8.2)
dxk dxk dxk
and
dk dk f
(cf ) = c . (8.3)
dxk dxk
However the following rule is not so simple:
k k−l
dk k d f dl g
(f · g) = , (8.4)
dxk l=0
l dxk−l dxl
where kl denote the binomial coefficients. We will return to this formula in
Part 2, see Problem 2 in Chapter 21. Here is the above rule in its simplest
form, i.e. when k = 2:
118
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 139
$ %
d2 d d
(f · g) = (f · g)
dx2 dx dx
d
= (f g + f g )
dx
= f g + f g + f g + f g
= f g + 2f g + f g
2 2
2 d 2 df dg 2 dg
= 2
f· g+ + f .
0 dx 1 dx dx 2 dx2
The following result, called the mean value theorem is useful to study
functions in more detail.
119
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 140
A COURSE IN ANALYSIS
y-axis
f (ξ)
f
f (b) − f (a)
b−a
a ξ b x-axis
Figure 8.4
Remark 8.11. When proving the mean value theorem later in Part 2 of our
course, compare with Corollary 22.6, we will carefully discuss the importance
of each of the assumptions in the above theorem.
The mean value theorem has important consequences:
Corollary 8.12. Suppose that f : [a, b] → R fulfils the assumptions of the
mean value theorem. Further suppose that m ≤ f (η) ≤ M for all η ∈ (a, b).
Then we have the estimates
m(x − y) ≤ f (x) − f (y) ≤ M(x − y) (8.6)
for all x, y ∈ (a, b), y ≤ x.
Proof. We may apply the mean value theorem to f |[y,x] to find first
f (x) − f (y) = f (ξ)(x − y)
for some ξ ∈ (y, x), ξ = ξ(x, y). Since f (ξ) ≥ m and x − y ≥ 0 this implies
m(x − y) ≤ f (x) − f (y).
Further, since f (ξ) ≤ M and x − y ≥ 0 we find in addition
f (x) − f (y) ≤ M(x − y).
120
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 141
Proof. Using (8.6) with m = M = 0 we find f (x) = f (y) for all x, y ∈ (a, b),
i.e. f (x) = c := f (x0 ) for x ∈ [a, b] and some fixed x0 ∈ [a, b].
x
Figure 8.5
121
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 142
A COURSE IN ANALYSIS
Problems
1. a) Let f : D1 −→ R and g : D2 −→ R be two functions such
that f (D1 ) ⊂ D2 . Suppose that g is bounded with bound M ≥ 0, i.e.
|g(x)| ≤ M for all x ∈ D2 . Prove that g ◦ f : D1 −→ R is bounded and
find a bound for g ◦ f .
b) Consider f : (1, 2) −→ R, x → (x − 1)2 and g : (0, ∞) −→ R,
y → y1 . Show that f is bounded and that f ((1, 2)) ⊂ (0, ∞). Is the
function g ◦ f : (1, 2) −→ R bounded?
c) Give an example of a continuous function f : (a, b) −→ R, a < b,
with the property that for all a1 and b1 such that a < a1 < b1 < b the
function f |[a1 ,b1 ] is bounded but f is unbounded.
122
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 143
s = 0?
d2 2 2 −1
(u (x) + 1)(v (x) + 1) .
dx2
6. Find
d2 1 x2
where u : R −→ R, u(x) = √ .
dx2 (u (x) + 2)2
2
1 + x2
123
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 144
A COURSE IN ANALYSIS
124
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 145
f (x)f (−x) = 1,
1
i.e. f (x)
= f (−x) or with f = exp
1
exp(−x) = . (9.3)
exp(x)
125
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 146
A COURSE IN ANALYSIS
Before we can proceed further we state without proof (which we will provide
later, in Part 2, Theorem 20.17) the intermediate value theorem:
The intermediate value theorem applied to exp implies that exp(x) > 0 for
all x ∈ R. Indeed, suppose that there is x1 ∈ R such that exp(x1 ) < 0. Since
x0 = 0, we conclude that there must be x0 ∈ (x1 , 0) if x1 < 0 or x0 ∈ (0, x1 )
if x1 > 0, such that exp(x0 ) = 0 which is impossible by Lemma 9.3 Hence
exp(x) > 0 for all x ∈ R.
Lemma 9.6. The exponential function is strictly positive and strictly in-
creasing.
Proof. It remains to prove that exp is strictly increasing. But exp (x) =
exp(x) > 0, implying the result.
Proof. For y ∈ R fixed we consider the function x −→ g(x) := exp(y + x). It
follows that
g (x) = (exp(y + x)) = exp(y + x) = g(x),
126
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 147
hence g(x) = K exp(x) for some K ∈ R, compare with the proof of Lemma
9.4. Now, with x = 0 we find
or
exp(x + y) = g(x) = exp(x) exp(y)
proving the lemma.
e := exp(1). (9.5)
exp(n) = en . (9.6)
exp(x) = ex . (9.8)
In particular we have
ex+y = ex ey and e0 = 1. (9.9)
127
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 148
A COURSE IN ANALYSIS
We know that exp is strictly increasing and exp(x) > 0 for all x ∈ R. Assume
for a moment that R(exp) = {x ∈ R|x > 0}. Then we know that exp : R −→
{x ∈ R|x > 0} is bijective and has a differentiable inverse, i.e. there exists a
function ln :{x ∈ R|x > 0}−→R with the properties
x−→ln x
and
exp(ln y) = y for y > 0. (9.11)
We call ln the (natural) logarithm. For its derivative we find using (7.7)
that
d 1 1 1
ln y =
= = ,
dy exp (ln y) exp(ln y) y
i.e.
1
(ln y) = , y > 0, (9.12)
y
which also implies that ln is strictly increasing on {y ∈ R|y > 0}. Further-
more we have
ln(1) = 0 (9.13)
since 1 = exp(0), and we claim for x, y > 0 that
ln(x · y) = ln x + ln y. (9.14)
ln y = g(1) = ln(yx) − ln x
or
ln yx = ln y + ln x,
proving (9.14). Finally we note that for x > 0
x 1
0 = ln 1 = ln = ln x + ln = ln x + ln x−1 ,
x x
128
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 149
or
1
ln x−1 = ln
= − ln x. (9.15)
x
Now let a > 0 be given. We define on R the function x −→ ax by
ax := ex ln a . (9.16)
as well as
(ax )y = axy . (9.18)
Further, x −→ ax is bijective with range {y ∈ R|y > 0} and has an inverse
function which is denoted by x −→ loga x. The value of loga x is called the
logarithm of x with respect to the basis a.
(Note that since a−1 = a1 , it is often convenient to define x −→ ax and
y −→ loga y only for a > 1.)
For the derivative of x −→ ax we find
d x d x ln a
a = e = (ln a)ex ln a = (ln a)ax , (9.19)
dx dx
and this implies for x −→ loga x
d 1 1 1
loga x = x = log x
= . (9.20)
dx (a ) (loga x) (ln a)a a (ln a)x
Here are some examples of derivatives
d 1
(x ln x) = ln x + x · = 1 + ln x, (9.21)
dx x
d x d x ln x d
(x ) = e = (x ln x) ex ln x = (1 + ln x)xx . (9.22)
dx dx dx
For differentiable functions u : R −→ R, v : R −→ R+ \ {0} we find
d u(x)
e = u (x)eu(x) (9.23)
dx
and
d 1 v (x)
ln v(x) = v (x) = . (9.24)
dx v(x) v(x)
129
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 150
A COURSE IN ANALYSIS
The term vv is often called the logarithmic derivative of v.
Before we can draw the graph of exp and ln, we need to study the asymptotic
behaviour of functions.
Let f : R −→ R be a function. We want to study the behaviour of f (x) for
x becoming larger and larger, i.e. for x tending to infinity. It may happen
that for x tending to infinity f (x) tends to some number a or to infinity, but
other cases are possible.
We write
lim f (x) = a (9.25)
x−→∞
130
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 151
lim xn = ∞. (9.27)
x−→∞
(1 + a)n+1 = (1 + a)n (1 + a)
n(n − 1) 2
≥ 1 + na + a (1 + a)
2
n(n − 1) 2 n(n − 1) 3
= 1 + na + a + a + na2 + a
2 2
n(n − 1) + 2n 2
≥ 1 + (n + 1)a + a
2
(n + 1)n 2
= 1 + (n + 1)a + a
2
and the result follows by the principle of mathematical induction.
Lemma 9.12. We have
lim ex = ∞. (9.30)
x−→∞
131
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 152
A COURSE IN ANALYSIS
Remark 9.13. Note that we have assumed that for M > 0 we find N ∈ N
such that 1 + bN > M. If M ≤ 1 then every N ∈ N will do, but this case is
of course not interesting. If M > 1 then
M −1
MM
−1
− 1 > 0 and we may take N such
that N > b , for example N = 1 + b .
Lemma 9.14. We have
x
lim = lim xe−x = 0. (9.31)
x−→∞ exp(x) x−→∞
Proof. We claim that φ(x) := xe−x is for x > 1 strictly decreasing. This
follows from
φ (x) = e−x − xe−x = e−x (1 − x) < 0,
provided x > 1. Hence for x > N > 1 it follows that
132
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 153
if for every > 0 there exists N = N() ∈ N such that x < −N implies
|f (x) − a| < .
Proof. We have to prove that for every > 0 there exists N ∈ N such that
x < −N implies |ex − 0| = ex < . With y := −x > 0 this is equivalent to
N < y implies e−y < or N < y implies e1y < , i.e.
We now prove (9.34). The function y −→ g(y) = e−y is strictly decreasing
since g (y) = −e−y < 0. By Bernoulli’s inequality we find therefore for N < y
and using e = (1 + b) that
1 1
e−y ≤ e−N ≤ < .
1 + Nb Nb
1 1
N < y implies e−y < e−N ≤ < < .
1 + Nb Nb
Note that Lemma 9.15 together with Lemma 9.12 finally proves that the
range of exp is equal to {x ∈ R|x > 0}.
Now we can sketch the graph of x −→ exp(x). It must be strictly positive,
strictly increasing, for x −→ −∞ it tends to 0, at x = 0 it has the value 1,
and for x −→ ∞ it tends to ∞:
133
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 154
A COURSE IN ANALYSIS
4
y = exp(x)
3
−3 −2 −1 1 2 3 x
−1
−2 Figure 9.1
By our general considerations we can now also sketch the graph of x −→ ln x.
We only have to reflect the graph of exp at the principal diagonal:
y
5 y = exp(x)
3
y=x
2
1 y = ln(x)
−3 −2 −1 1 2 3 x
−1
−2 Figure 9.2
134
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 155
lim ln x = ∞. (9.35)
x−→∞
Given x > 0 set x = exp y. Now x tending to infinity implies that exp y
tends to infinity which is only possible when y tends to infinity, but y = ln x.
Next consider ln x for x tending to 0. This is equivalent to considering ln x1
for x tending to ∞. But ln x1 = − ln x and lim ln x = ∞. Thus we find
x−→∞
ln x −→ −∞ for x −→ 0. For this we write
Problems
1. a) Using the definition of lim f (x) = ∞ prove that lim (x2 − 5) =
x→∞ x→∞
∞.
k
b) Let p : R −→ R, p(x) = al xl , be a polynomial of degree k
l=0
with ak > 0. Prove that lim p(x) = ∞.
x→∞
c) For a ∈ R prove that
1 + a + ax2
lim = a.
x→∞ 1 + x2
135
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 156
A COURSE IN ANALYSIS
n
e = pn (x)e−x .
dt
6. Find the following derivatives:
√ d2
d
a) ds d
ln( s4 + 1−s2 ); b) dx (ln(ax )), a > 1; c) dy 2
ln((y 2 +1)−k ), k ∈ N.
lim (exp(p(x))).
x→−∞
136
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 157
n
9. Let p(x) = ak xk be a polynomial and an > 0. Prove that there
k=0
exists R > 0 such that p(x) > 0 for x ≥ R. Hence for x ≥ R the
ln(p(x))
function x → ln(p(x)) is defined. Now show that lim = 0.
x→∞ x
10. a) For x, y > 0 prove under the assumption that for a > 0 it follows
1
that lna 2 = 12 lna the estimate
137
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 159
f = g, g = −f (10.1)
φ
A = (0, 0) x0 = cos φ 1
B = (1, 0)
Figure 10.1
139
July 28, 2015 9:26 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 160
A COURSE IN ANALYSIS
Figures 10.2 and 10.3 below give a further insight into the values of sin and
cos for 0 ≤ φ < 2π. First we look at Figure 10.2:
y-axis
π
2
+φ
φ
− cos φ cos φ 1 x-axis
π+φ
3π
2
+φ
− sin φ
(− cos φ, − sin φ) (cos φ, − sin φ)
Figure 10.2
We find for example that cos(π − φ) = − cos φ and sin(π + φ) = − sin φ, etc.
Next we consider Figure 10.3:
140
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 161
y-axis
(− sin φ, cos φ)
cos φ
− sin φ φ
− cos φ π+φ sin φ cos φ 1 x-axis
− sin φ
(− cos φ, − sin φ)
3π
2
+φ
− cos φ
(sin φ, − cos φ)
Figure 10.3
141
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 162
A COURSE IN ANALYSIS
sin 0 = 0 cos 0 = 1
sin π2 = 1 cos π2 = 0
sin π = 0 cos π = −1
sin 3π
2
= −1 cos 3π2
=0
sin 2π = 0 cos 2π = 1.
Moreover by Pythagoras’ theorem, see Appendix IV, we know
x20 + y02 = 1
or
cos2 φ + sin2 φ = 1. (10.6)
We also note the following results:
142
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 163
Indeed, given > 0 choose δ = to find for |φ−0| = |φ| < δ that | sin φ−0| =
| sin φ| ≤|φ| < δ = . Thus we have proved that sin is continuous at 0. Since
cos φ = 1 − sin2 φ we find that
lim cos φ = lim 1 − sin2 φ = 1, (10.16)
φ−→0 φ−→0
143
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 164
A COURSE IN ANALYSIS
From elementary geometry we know that a sector OAB with an angle φ,
0 ≤ φ < 2π, of a circle with radius r has area 12 r 2 φ, see the following figure
for an explanation.
Length of AB: rφ
0 r A
Figure 10.4
D = (cos φ, sin φ)
B
Length of AB: rφ
φ
·
0=(0,0) A = (cos φ, 0) C = (1, 0)
Figure 10.5
It is obvious that the area of the sector OAB is less or equal to that of the
triangle OCD. Since r = cos φ and the area of the triangle OCD is given by
1
2
sin φ we find for 0 ≤ φ < π2
1 1
φ cos2 φ ≤ sin φ,
2 2
144
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 165
π
and for 0 < φ < 2
sin φ
cos2 φ ≤ . (10.17)
φ
sin φ
Since cos2 φ as well as φ
are even functions we can extend (10.17) to all φ,
0 < |φ| < π2 .
We now claim:
π
Proof. From (10.17) and (10.14) we deduce for 0 < φ < 2
that
sin φ
1 − sin2 φ = cos2 φ ≤ ≤ 1.
φ
145
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 166
A COURSE IN ANALYSIS
Proof. Note that sin 0 = 0, and therefore Theorem 10.4 states that sin is
differentiable at 0 with derivative 1 which is equal to cos 0. Now, using
(10.9) we find
sin φ − sin φ0 2 sin φ−φ
2
0
cos φ+φ
2
0
sin(φ − φ0 )/2 φ + φ0
= = · cos ,
φ − φ0 φ − φ0 (φ − φ0 )/2 2
which implies
sin φ − sin φ0 sin φ−φ
2
0
φ − φ0
lim = lim φ−φ0
lim cos
φ−→φ0 φ − φ0 φ−→φ0
2
φ−→φ0 2
sin h φ + φ0
= lim lim cos = cos φ0 ,
h−→0 h φ−→φ0 2
where we used the continuity of cos, compare with Corollary 10.3.
Knowing that sin is differentiable and sin = cos allows us to calculate the
derivative of cos by using the chain rule:
d d 1
cos x = (1 − sin2 x) 2
dx dx
1 1
= (−2 sin x cos x) · (1 − sin2 )x)− 2
2
sin x cos x
= − = − sin x.
cos x
Corollary 10.6. The function sin has for φ = (2k + 12 )π, k ∈ Z, a (local)
maximum and for φ = (2k − 12 )π, k ∈ Z, a (local) minimum. The function
cos has for 2kπ, k ∈ Z, a (local) maximum and for (2k + 1)π, k ∈ Z, a (local)
minimum.
Proof. We know (sin φ) = cos φ = 0 for φ = (k + 12 )π, k ∈ Z. Now (sin φ) =
− sin φ. Hence for φ = (2k + 12 )π, we find
π 1 1
− sin = − sin(2k + )π = (sin) (2k + )π = −1 < 0,
2 2 2
thus sin has a local maximum for φ = (2k + 2 )π. For φ = (2k − 12 )π we find
1
π π 1 1
sin = − sin − = − sin(2k − )π = (sin )(2k − )π = 1 > 0
2 2 2 2
1
implying that sin has a local minimum for φ = (2k − 2 )π. The result for cos
is proved in an analogous way.
146
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 167
From our definition of sin and cos it is clear that φ = π is the smallest zero
of sin larger than 0, as is π2 the smallest zero of cos larger than 0. We also
note the formula:
π
cos φ = sin(φ + ). (10.21)
2
The graphs Γ(sin) and Γ(cos) look like:
1 Γ(sin)
−π − π2 π
2
π 3π
2 2π
−1 Γ(cos)
Figure 10.6
d 1
arcsin(x) = √ . (10.22)
dx 1 − x2
147
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 168
A COURSE IN ANALYSIS
Using sin and cos we may introduce some further functions of importance.
Consider first the tangent function
sin x
tan x := . (10.24)
cos x
Of course we must assure that cos x = 0, thus we define the function tan on
the set R \ {(k + 12 )π|k ∈ Z}. It is obvious that tan is an odd function since
sin(−x) sin x
tan(−x) = =− = − tan x,
cos(−x) cos x
148
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 169
d d sin x
tan (x) = tan(x) =
dx dx cos x
cos x cos x + sin x sin x 1
= 2
= ,
cos x cos2 x
i.e.
1
tan x =
. (10.25)
cos2 x
Further we may introduce the cotangent function
cos x
cot x := , (10.26)
sin x
which is defined on R \ {kπ|k ∈ Z}. Once again we find that cot is an odd
function and we have
d cos x − sin x sin x − cos x cos x
cot (x) = =
dx sin x sin2 x
1
= − 2 ,
sin x
i.e.
1
cot (x) = − . (10.27)
sin2 x
From (10.25) it follows that on (− π2 , π2 ) the function tan is strictly increasing,
hence it has an inverse, the arcus-tangent function arctan : R −→ (− π2 , π2 ).
Note however that we have not yet proved that R(tan |(− π2 , π2 ) ) = R.
For arctan we find by Theorem 7.5 that
1
arctan (x) = = cos2 (arctan x).
tan (arctan x)
1
Now, cos2 y = 1+tan2 y
as follows from
1 1 cos2 y
= = = cos2 y,
1+ sin2 y cos2 y
+ sin2 y sin2 y + cos2 y
cos2 y cos2 y cos2 y
which yields
1 1
arctan (x) = 2 = ,
1 + tan (arctan x) 1 + x2
149
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 170
A COURSE IN ANALYSIS
i.e.
1
arctan (x) = . (10.28)
1 + x2
From (10.27) we find that for x ∈ (0, π) the function cot is strictly decreasing
and hence it has an inverse function arccot, arcus-cotangent. For arccot
we find
1
arccot (x) =
= − sin2 (arccot x).
cot (arccot x)
Since sin2 y = 1+cot
1
2 y we find
1 1
arccot x = − 2
=− ,
1 + cot (arccot x) 1 + x2
i.e.
1
arccot x = − . (10.29)
1 + x2
We postpone the proof of R(tan |(− π2 , π2 ) ) = R(cot |(0,π) ) = R, until Remark
20.18.B. and we refer to Appendix V where one can find a lot of formulae
connecting sin, cos, tan, cot, arcsin, arccos, arctan, arccot. We mention that
often a new name is introduced for x −→ sin1 x and x −→ cos1 x , namely
1 1
csc x = and sec x = (10.30)
sin x cos x
called co-secant and secant function. We finally consider the following
graphs:
arcsin(x) arccos(x)
π
2
−1 1
−1 0 1
150
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 171
tan(x) cot(x)
−π − π2 π
2
π − π2 π π − 3π
2 2
arctan(x) arccot(x)
π
π
2
− π2
151
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 172
A COURSE IN ANALYSIS
Problems
1. a) Let f : R+ −→ R be any function and let g : R −→ R+ be an
even function. Prove that f ◦ g : R −→ R is an even function.
b) Let f : R −→ R and g : R −→ R be odd functions. Is f ◦ g an
odd function too?
c) Given an even function f : R −→ R and (a, b) ⊂ R, a < 0 < b.
Prove that f |(a,b) cannot have an inverse function.
2. a) Let f : R −→ R be a differentiable function. Prove that if f is
even then f is odd and if f is odd then f is even. Deduce that if f
is a k times continuously differentiable function and l ≤ k is an even
number then f (l) is even.
b) Let f : R+ −→ R be a function. Show that f has an even
extension g : R −→ R and f |(0,∞) an odd extension h : R −→ R.
3. a) Does the limit lim (sin x) exist?
x→∞
b) Prove for k ∈ N that
(sin x)k
lim = 0.
x→∞ x
4. Using the definitions of sin, cos, tan and cot, and the addition theorems
find the values of
a) sin π8 , b) cos π6 , c) tan π3 , d) cot 12
π
.
5. Find the values of
√ √ √
a) arcsin 23 , b) arccos − 12 2 , c) arctan √13 , d) (− 3).
152
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 173
8. Find the derivatives (on the natural domains) of the given functions:
√
d
a) dx cos(ln(1 + x2 )); b) dtd √sin(tan t)
4
1−cos t
; c) d
ds
arcsin( 1 + cos s);
d 2
d) du
arctan(e−u cot u).
Prove that
1
Dn (t) = Cn (2t),
2
where n
1
Cn (t) =
+ cos jt,
2 j=1
and deduce that Dn is on − π2 , π2 arbitrarily often differentiable.
n
Hint: first find cos jt and consider cos jt · sin 2t .
j=1
153
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 175
11 Investigating Functions
In this chapter we want to develop a scheme for investigating a given func-
tion in a systematic way. The first problem we have to address is that of the
domain. Clearly, if a function is given as f : D −→ R we know D. However,
often we have to handle functions which are obtained from given ones or con-
structed “indirectly”: the exponential function was introduced as a solution
of a certain differential equation; the tangent function is the quotient of two
x2
functions both having many zeroes; the function x −→ |x| is not defined for
x = 0 but easily extended to the function x −→ |x| which is defined for all
x ∈ R.
Thus our starting point should be an expression f (x) defined originally for
some subset D̃ ⊂ R such that x −→ f (x) is a function on D̃. The first step
is to determine the maximal domain D of the expression f (x), i.e. the
largest set D ⊂ R such that D̃ ⊂ D and f (x) is defined on D. We distinguish
the maximal domain D of the expression f (x) as the domain of the maximal
extension of f : D̃ −→ R.
155
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 176
A COURSE IN ANALYSIS
x0
a x0 − x0 + b
Figure 11.1
1
Consider := min(x0 −a, b−x0 ) > 0. Then we claim (−+x0 , x0 +) ⊂ (a, b).
2
The proof is simple x ∈ (− + x0 , x0 + ) means − + x0 < x < x0 + and
with = 12 min(x0 − a, b − x0 ) we find in the case where = 12 (x0 − a) that
1 1
− (x0 − a) + x0 < x < x0 + (x0 − a)
2 2
which yields
1 1 1 1 1
a < a + x0 < x < (x0 − a) + x0 < b + x0 < b,
2 2 2 2 2
hence x ∈ (− + x0 , x0 + ), = 12 (x0 − a), implies x ∈ (a, b). The case where
= 12 (b − x0 ) is proved in the same way and is left as an exercise.
We call x0 ∈ R a boundary point of D ⊂ R if for every > 0 the interval
(−+x0 , x0 +) contains at least a point belonging to D and a point belonging
to D , recall D = {x ∈ R|x ∈ / D}. It may happen that a boundary point
belongs to D but it need not belong to D. Consider the set
By definition a ∈
/ D but b ∈ D. We claim that both a and b are boundary
points.
a− a a+ b
Figure 11.2
We start with a and choose any > 0. The set (− + a, a + ) consists of all
points x ∈ R such that − + a < x < a + , hence all points − + a < x ≤ a
belong to (a, b] and all points a < x < a+ belong to (a, b] provided ≤ b−a.
Thus a is a boundary point not belonging to (a, b]. Now, to see that b is a
156
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 177
11 INVESTIGATING FUNCTIONS
boundary point take > 0 and consider (− + b, b + ). These are all points
x satisfying − + b < x < b + . Those x satisfying − + b < x ≤ b belong to
(a, b], provided < b − a and those satisfying b < x < b + belong to (a, b] .
Hence b is also a boundary point and it belongs to D = (a, b].
Note, in both cases we have to modify our argument if becomes too large,
< 12 (b − a) will always be sufficient.
By definition we call −∞ and +∞ the boundary points (at infinity) of the
intervals (−∞, a) or (−∞, a] and (b, +∞) or [b, +∞), respectively, as well as
of R = (−∞, ∞). This is a slight abuse of the definition but helpful.
Typically the domains D we will have to work with will consist of a finite
union of finite or infinite intervals which could be open, closed or half-open.
However, countable unions of finite intervals may also occur, think of the
tangent function. The set ∂D of all boundary points of D (excluding −∞
and +∞) is called the boundary of D. The first task is to find all boundary
points of D.
In the following we will only investigate functions which are continuous on
D, in fact we will assume the functions to be a few times differentiable. Here
is a fact which we will prove in Part 2: if f : D −→ R is continuous and D
a finite union of bounded and closed intervals then f is bounded, i.e. there
exists M ≥ 0 such that |f (x)| ≤ M for all x ∈ D.
As the example f : (0, 1] −→ R, x −→ x1 , shows this does not hold for non-
closed intervals, and g : R −→ R, x −→ x, shows that this does not hold for
unbounded intervals.
We want to study the continuous function f : D −→ R at boundary points of
D. First consider the case where D is a bounded interval. In the case where
D = [a, b] is closed (and bounded) we know that f is bounded and f (a) as
well as f (b) are finite values. Suppose that D is not closed, i.e. D = (a, b]
or D = [a, b) or D = (a, b). Of course f could still be bounded, but it need
not be. If a boundary point does not belong to D everything may happen.
However if a boundary point belongs to D, f remains “locally” bounded,
i.e. bounded at this boundary point (and in a small neighbourhood of it
belonging to D), but no information is known a priori for all of D. Indeed, if
a ∈ D (the case b ∈ D goes analogously) we find that f |[a, b−a ] is continuous,
2
hence bounded. The simple proof that f : D −→ R being continuous implies
the continuity of f |D̃ , D̃ ⊂ D, is left to the reader.
Let f : (a, b) −→ R be a continuous function. Here are some examples of
what may happen at the boundary:
157
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 178
A COURSE IN ANALYSIS
f (x)
lim = 1. (11.1)
x−→∞ g(x)
f (x)
Now, lim = 1 means that given > 0 there exists N ∈ N such that
g(x)
x−→∞
for x > N it follows that
f (x)
g(x) − 1 < or |f (x) − g(x)| < g(x),
i.e.
−g(x) < f (x) − g(x) < g(x)
or
(1 − )g(x) < f (x) < (1 + )g(x) for x > N, (11.2)
recall g(x) = x which is positive for x > N. This means that for > 0 given
and x sufficiently large, the behaviour of f is controlled by g.
C. Consider g : (0, 1) −→ R, x −→ sin x1 . This function is bounded but it
does not have a limit or specific asymptotic behaviour as x −→ 0. Indeed,
1
for the sequence xn = nπ we have sin x1n = 0, for the sequence yn = 2n+1 1 π it
2
follows that sin y1n = 1, and in fact for every value z ∈ [−1, 1] we can find a
sequence zn , zn −→ 0, such that sin z1n −→ z.
158
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 179
11 INVESTIGATING FUNCTIONS
f (x)
lim =1 .
x−→b g(x)
Since
N N
j=0 aj x
j aj j−N
= x
aN xN a
j=0 N
N
−1
aj j−N
= 1+ x
j=0
aN
it remains to prove
N
−1
aj j−N
lim x = 0.
x−→∞
j=0
aN
But we know that lim xj−N = 0 for j < N. Note that the same argument
x−→∞
yields that g(x) = aN xN is also an asymptote of p(x) as x −→ −∞. Further,
this example shows that an asymptote is not uniquely determined. Take for
simplicity p(x) = x2 + 1, then x −→ x2 is an asymptote, but by a trivial
calculation it is easy to see that x −→ x2 + c, c ∈ R, is a further one.
159
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 180
A COURSE IN ANALYSIS
lim
x→a
f (x) = lim
x→a
g(x) = 0 (11.4)
x=a x=a
or
lim
x→a
g(x) = +∞ or − ∞. (11.5)
x=a
Then
f (x) f (x)
lim = lim (11.6)
x→a
x=a
g(x) x→a
x=a
g (x)
provided the limit on the right hand side exists. An analogous statement holds
for the boundary point b.
Example 11.6. A. For α > 0 we have
eαx αeαx
lim = lim = +∞. (11.7)
x−→∞ x x−→∞ 1
N j
B. For every polynomial p(x) = j=0 aj x , aN = 0, and α > 0 we have
N
lim aj xj e−αx = 0. (11.8)
x−→∞
j=0
160
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 181
11 INVESTIGATING FUNCTIONS
Indeed
N N j
j=0 aj x
j
j=0 aj x
lim = lim
x−→+∞ eαx x−→+∞ αeαx
dN
N j
dxN j=0 aj x
= . . . = lim
x−→+∞ αN eαx
N! aN
= lim = 0.
x−→+∞ αN eαx
(We are allowed of course to iterate applications of de l’Hospital’s rule.)
C. We claim
lim
x→0
xx = 1. (11.9)
x>0
lim
x→0
xx = lim
x→0
exp(x ln x)
x>0 x>0
= exp lim
x→0
x ln x .
x>0
Now
ln x
lim
x→0
(x ln x) = lim
x→0 1
x>0 x>0 x
1
x
= lim = lim (−x) = 0,
x→0
x>0
− x12 x→0
x>0
hence
x
lim x = exp lim (x ln x) = exp(0) = 1.
x→0 x−→0
x>0
161
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 182
A COURSE IN ANALYSIS
ex − e−x
sinh x := ; (11.10)
2
ex + e−x
cosh x := ; (11.11)
2
sinh x ex − e−x
tanh x := = x ; (11.12)
cosh x e + e−x
and
cosh x ex + e−x
coth x := = x . (11.13)
sinh x e − e−x
Other hyperbolic functions are:
1
cosech x := ; (11.14)
sinh x
and
1
sech x := . (11.15)
cosh x
162
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 183
11 INVESTIGATING FUNCTIONS
and
sinh x ex − e−x
lim = lim − −x
= lim 1 − e2x = 1.
x−→−∞ g2 (x) x−→−∞ e x−→−∞
Further we find
ex + e−x
sinh (x) = = cosh x > 0, (11.17)
2
implying that sinh is strictly monotone increasing. The graph of sinh looks
like:
−2 −1 1 2
−1
−2
−3
−4 Figure 11.3
163
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 184
A COURSE IN ANALYSIS
and
cosh x ex + e−x
lim = lim −x
= lim 1 + e2x = 1.
x−→∞ g3 (x) x−→−∞ e x−→−∞
Since
ex − e−x
cosh (x) = = sinh(x) (11.18)
2
we find that x0 = 0 is the only zero of cosh . Further
cosh (x) = sinh (x) = cosh(x) > 0
e0 +e0
for all x. Hence cosh has a minimum at x0 = 0 with value cosh 0 = 2
= 1.
The graph of cosh is given by
−3 −2 −1 0 1 2 3
Figure 11.4
Next we discuss tanh. Since cosh x = 0 for all x ∈ R we find that the domain
of tanh is again R. Further, tanh is the product of an even and an odd
164
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 185
11 INVESTIGATING FUNCTIONS
−4 −3 −2 −1 1 2 3
−1
Figure 11.5
165
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 186
A COURSE IN ANALYSIS
which is for x = 0 always strictly negative, hence coth |{x∈R|x>0} and coth |{x∈R|x<0}
are strictly decreasing functions.
For x −→ ∞ we find
ex + e−x
lim coth x = lim =1
x−→∞ x−→∞ ex − e−x
−3 −2 −1 1 2
−1
−2
−3
−4 Figure 11.6
Note that Figure 11.6 suggests that R(coth) has a gap, namely the interval
[−1, 1]. It is the discontinuity of coth at x = 0 which tolerates such a
behaviour.
166
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 187
11 INVESTIGATING FUNCTIONS
1
= .
cosh sinh−1 (y)
Now, cosh x = 1 + sinh2 x, recall (11.20), implying that
1 1
sinh−1 (y) = = . (11.23)
1 + sinh 2
sinh−1 (y) 1 + y2
We claim √
arsinh x = ln x + x2 + 1 . (11.24)
Note that
√
√ d
dx
x + x2 + 1 1
ln x + x2 + 1 = √ =√ ,
x + x2 + 1 x2 + 1
i.e. √
ln x + x2 + 1 = arsinh (x),
and therefore they differ only by a constant:
√
arsinh x = c + ln x + x2 + 1 .
167
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 188
A COURSE IN ANALYSIS
1 cosh2 x − sinh2 x
tanh (x) = 2 = 2 = 1 − tanh2 (x).
cosh x cosh x
Now we get by (7.7)
1
artanh (y) = tanh−1 (y) =
(tanh )(tanh−1 (y))
1 1
= 2 −1 = ,
(1 − tanh )(tanh (y)) 1 − y2
i.e. we have
1
artanh (x) = for − 1 < x < 1. (11.26)
1 − x2
As in the case of arsinh we can prove
1 1+x
artanh(x) = ln . (11.27)
2 1−x
In the exercises there will be questions related to the inverse function of
cosh |[0,∞) . This function is denoted by arcosh and is defined on [1, ∞). Its
derivative is given by
1
arcosh (x) = √ , x>1 (11.28)
x2 −1
and we have √
arcosh x = ln(x + x2 − 1), x > 1. (11.29)
For coth we restrict our attention first to values x > 1. Thus coth |(0,∞) is
considered as a strictly decreasing function with range (1, ∞). This function
has an inverse function arcoth and we have
1
arcoth (x) = , x>1 (11.30)
1 − x2
and
1 x+1
arcoth(x) = ln , x > 1. (11.31)
2 x−1
Using the symmetry of coth we can extend (11.29) and (11.30) to x < −1.
168
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 189
11 INVESTIGATING FUNCTIONS
Problems
1. Consider the set D := [−1, 2) ∪ {3, 4} ∪ [5, 6]. Find every interior point
of D and the boundary ∂D of D.
2. For each of the following expressions find the maximal set D ⊂ R such
that on D the expressions define functions.
a) (x2 − 1) (x2 + 4x).
cos(ln(arctan x))
b) x3 +4x2 −5x
.
1
c) ((sinh x) (1 − x4 )) 2 .
d) cot(arcsin x).
ln(cos 3t)
b) lim ;
t→0 ln(cos 2t)
t>0
3y 2 − y + 5
c) lim ;
y→∞ 5y 2 − 6y − 3
1 1
d) lim − 2 .
u→0 sin2 u u
1 1 u2 −sin 2u u2 sin x
Hint: rewrite sin2 u
− u2
as u4
· sin2 u
, note that lim = 1,
x→0 x
u2 −sin 2u
and when applying l’Hospital rules to u4
make use of the addition
theorems for trigonometric functions.
169
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 190
A COURSE IN ANALYSIS
170
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 191
12 Integrating Functions
Let us start to analyse a natural problem in mathematics. Given a continuous
function g : [a, b] → R, a, b ∈ R, a < b, can we find a function f : [a, b] → R
such that on (a, b)
f (t) = g(t)? (12.1)
Let us assume that we know the value of f (a). A very rough approximation
of f (t), a < t < b, is
f (t) − f (a)
.
t−a
Hence (12.1) would give
or
f (t) ≈ f (a) + g(t)(t − a), (12.3)
where g(t) ≈ h(t) means that g is close to h. There is a simple geometric
interpretation of the right hand side of (12.2)
y = g(x)
g(t)
a t b x
Figure 12.1
The area of the rectangle with vertices (a, 0), (t, 0), (t, g(t)) and (a, g(t)) is
given by g(t)(t − a). Of course, when t varies in [a, b] we obtain a function
But only for very small values of t − a, t > a, do we expect the function
(12.4) to be a reasonable approximation of a function f satisfying (12.1).
171
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 192
A COURSE IN ANALYSIS
g(t)
g(t1 )
a t b x
Figure 12.2
Iterating this process n-times we find with a < t1 < t2 < · · · < t < b,
f (t) − f (a) = f (t) − f (tn ) + f (tn ) − f (tn−1 ) + · · · + f (t1 ) − f (a)
≈ g(t)(t − tn ) + g(tn )(tn − tn−1 ) + · · · + g(t1 )(t1 − a)
n+1
= g(tj )(tj − tj−1 ), (12.5)
j=1
g(t3 )
g(t2 )
g(t)
g(t1 )
g(a)
···
a t1 t2 t3 · · · tn t = tn+1 b x
Figure 12.3
172
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 193
12 INTEGRATING FUNCTIONS
t1 t2 t3 t4 t5 t6
a = t0 t7 = b
Figure 12.4
We call such a finite sequence a partition of [a, b] into sub-intervals [tj , tj−1 ],
j = 1, ...n + 1 and we sometimes write Z(t1 , · · · tn ) or just Zn for such a
partition. The number
n+1
Sr (g, Zn) := g(tj )(tj − tj−1 ). (12.7)
j=1
173
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 194
A COURSE IN ANALYSIS
y = g(x)
a = t0 t1 t2 t3 t4 t5 t6 t7 = b x
Figure 12.5
In fact we may generalise Sr (g, Zn) slightly to the general Riemann sum of
g with respect to Zn and points ξj ∈ [tj−1 , tj ], which is defined by:
n+1
S(g, Zn, ξ) := g(ξj )(tj − tj−1 ) (12.8)
j=1
ξ1 ξ2 ξ3 ξ4 ξ5 ξ6 ξ7
a = t0 t1 t2 t3 t4 t5 t6 t7 = b x
Figure 12.6
Definition 12.1. Let g : [a, b] → R, a, b ∈ R, a < b, be a continuous
function. Suppose there exists a number Ia,b (g) ∈ R such that for every ε > 0
there exists δ > 0 with the property that if Zn is any finite partition of [a,b]
with mesh size m(Zn ) < δ then
|Ia,b (g) − Sr (g, Zn )| < ε.
174
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 195
12 INTEGRATING FUNCTIONS
In this case we call Ia,b (g) the (Riemann) integral of g over the interval
[a,b] and denote it by
b
g(t)dt := Ia,b (g). (12.9)
a
This definition is justified by the idea that the interval [c, c] has length zero,
hence the rectangle with one side of length g(t) and the other of length 0
should have area 0.
Let g : [a, b] → R be a continuous function. We define a new function
f : [a, b] → R by
x
x → f (x) := g(t)dt. (12.12)
a
Since [a, x] is a closed and bounded interval and g|[a,x] is continuous f (x) is
well defined.
The following theorem is important:
Theorem 12.5. Let g : [a, b] → R be a continuous function. Then f :
[a, b] → R defined by (12.12) is differentiable and we have
f (x) = g(x), (12.13)
175
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 196
A COURSE IN ANALYSIS
i.e. we have
x
d d
f (x) = g(t)dt = g(x). (12.14)
dx dx a
We will prove this result in Part 2 of our course in a similar way as to how we
motivated the introduction of the integral. Note that Theorem 12.5 allows
us to calculate integrals. First we give
(f − h) = g − g = 0,
which yields f − h = c.
176
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 197
12 INTEGRATING FUNCTIONS
177
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 198
A COURSE IN ANALYSIS
xα := eα ln x . (12.22)
For α = n ∈ N we find
en ln x = eln x · · · eln x = x · · · x = xn ,
and
b
cos xdx = sin x|ba . (12.27)
a
178
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 199
12 INTEGRATING FUNCTIONS
Hence there are functions not identical to zero whose integral over a certain
interval might be zero.
Example 12.14. For exp we find
b
ex dx = ex |ba = eb − ea . (12.28)
a
179
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 200
A COURSE IN ANALYSIS
Problems
1. a) Find the Riemann sum of the function f : [1, 2] −→ R with
f (t) = 2t2 − t with respect to the partition tk = 1 + nk , k = 0, 1, . . . , n,
and ξk being the midpoint of the interval [tk−1 , tk ], k = 1, . . . , n.
b) Let a < b and h : [a, b] −→ R be the function h(t) = 1
1+t2
.
a(m2 −l2 )+l2 b
For the partition tl = m2
, l = 0, 1, . . . , m, and ξl ∈ [tl , tl+1 ],
l = 0, . . . , m−1, such that ξl −tl = 13 (tl+1 −tl ) and tl+1 −ξl = 23 (tl+1 −tl )
find the corresponding Riemann sum.
(After calculating tk − tk−1 , k = 1, . . . , m, and ξk , k = 1, . . . , m, and
m
forming the sum g(ξk )(tk − tk−1 ), it will not be possible to simplify
k=1
much in this expression.)
n
S(g, Zn, ξ) = g(ξj )(tj − tj−1 ).
j=1
Here Z|[a,tk ] is the partition a = t0 < t1 < · · · < tk , Z|[tk ,b] is the
partition tk < tk+1 < · · · < tn = b, and ξ|[a,tk ] as well as ξ|[tk ,b] denote
the points ξj belonging to [a, tk ] and [tk , b] respectively.
by calculating the area of the triangles ABC and BDE in Figure 12.7
where A = (−2, 0), B = (0, 0), C = (−2, 2), D = (1, 0), E = (1, 1).
180
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 201
12 INTEGRATING FUNCTIONS
C
−2
E
−1
|
A B D
Figure 12.7
b) The upper semicircle with radius R√in Figure 12.8 is the graph
of the function g : [−R, R] −→ R, g(r) = R2 − r 2 .
−R
g(r)−
R
−R r R
Figure 12.8
Again using the interpretation
R that
R integration represents area, see Def-
√
inition 12.3, find g(r)dr = R2 − r 2 dr.
−R −R
181
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 203
provided the mesh size m(Zn ) is small enough. Thus we have proved
b
b
b
(λg1 (t) + μg2 (t)) dt = λ g1 (t) dt + μ g2 (t) dt, (13.1)
a a a
183
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 204
A COURSE IN ANALYSIS
In all of our considerations we have so far assumed that a < b and in the
case where a = b we defined that
b
g(t) dt = 0.
a
184
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 205
For this we take f (x) = ln x and g(x) = x in (13.4). Since g (x) = 1 and
(ln x) = x1 we find
b b
b 1
(ln x) 1 dx = (ln x)x − x dx
a a a x
b
b b
= (ln x)x − 1 dx = ((x ln x) − x) .
a a a
Example 13.4. For a < b and with f (x) = x and g (x) = sin x, i.e. we may
take g(x) = − cos x, to find
b b
b
x sin x dx = −x cos x − 1 (− cos x) dx
a a a
b
b
= −x cos x + cos x dx
a a
b
= (−x cos x + sin x) .
a
185
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 206
A COURSE IN ANALYSIS
or
b
b
x x
2 (cos x) e dx = (cos x + sin x) e
a a
implying b
b
x (cos x + sin x) ex
(cos x) e dx = .
a 2 a
Our next rule for integration is derived from the chain rule.
Theorem 13.7 (Change of variables, Part 1). Let g : [a, b] −→ R be a
continuous function and let φ : [α, β] −→ [a, b] be a differentiable function
with continuous derivative φ . Then
β
φ(β)
g(φ(t))φ(t) dt = g(x) dx. (13.6)
α φ(α)
186
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 207
Remark 13.10. In Examples 13.8 and 13.9 the function g does not have to
be defined on all of R. It would be sufficient to consider functions defined on
[α + c, β + c] and [αc, βc], respectively.
187
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 208
A COURSE IN ANALYSIS
d
For this note first that dt ln φ(t) = φφ(t)
(t)
provided ln φ(t) is defined, i.e. φ(t) >
0. Now we use in the change of variable formula g(x) = x1 and it follows that
b
φ(b) φ(b) b
φ (t) 1
dt = dx = ln x = ln φ(t) .
a φ(t) φ(a) x φ(a) a
The case where φ(t) < 0 is treated by switching from φ(t) to −φ(t). As an
immediate consequence of (13.9) we find
b
b
x 1 b
2x 1 2
dx = dx = ln(1 + x ) (13.10)
a 1 + x2 2 a 1 + x2 2 a
or
b b
cos t b
cot t dt = dt = ln | sin t| , (13.11)
a a sin t a
Proof. Of course (13.13) follows from (13.6) using that φ−1 exists.
Let us compare (13.6) with (13.13). In (13.6) we have to identify the function
we want to integrate as a term g(φ(t))φ(t), whereas in (13.13) we start with
b
the integral a g(x) dx and modify it. But we have to pay a price: we have to
find an invertible (bijective) smooth change of variable, i.e. we need to find
188
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 209
b
t = t(x) = φ−1 (x) to transform a g(x) dx to the right hand side in (13.13).
Note that
dt d φ−1 1 1
= (x) = −1 = .
dx dx φ (φ (x)) φ (t)
b
The transformation of a g(x) dx could be done in a formal way
g(x) g(φ(t))
dx φ (t) dt
a φ−1 (a)
b φ−1 (b).
The second step looks a bit more demanding. In principle we can easily
introduce t = φ−1 (x). But now we need φ (t), i.e. we have to invert φ−1 . In
certain examples this is often not needed.
b
Example 13.13. Consider the integral a (x + 2) sin(x2 + 4x − 6) dx. We
dt
choose t = φ−1 (x) = x2 + 4x − 6, i.e. dx = 2x + 4 = 2(x + 2).
Now we use
sin(x2 + 4x − 6) sin t
but instead of
dx φ (t) dt
we observe that
1
(x + 2) dx = dt
2
which yields
b
−1
2 1 φ (b)
(x + 2) sin(x + 4x − 6) dx = sin t dt
a 2 φ−1 (a)
φ−1 (b)
1 1 1
= − cos t = cos(φ−1 (a)) − cos(φ−1 (b)).
2 φ−1 (a) 2 2
t = φ−1 (x) = x2 + 4x − 6 = x2 + 4x + 4 − 10
= (x + 2)2 − 10
189
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 210
A COURSE IN ANALYSIS
or √
x = φ(t) = −2 + t + 10, t ≥ −10 and x ≥ −2
and √
x = φ(t) = −2 − t + 10, t ≥ −10 and x ≤ −2.
Hence for b > a ≥ −2 or a < b ≤ −2 we may use our calculation. In each
case we eventually get
b
1 1
(x + 2) sin(x2 + 4x − 6) dx = cos(a2 + 4a − 6) − cos(b2 + 4b − 6).
a 2 2
for the primitives of g, i.e. with this notation we can write for a primitive f
of g
f (x) = g(t) dt + c (13.15)
where c is a constant. (This is not a very well defined notation, but very
useful.)
Using in (13.14) a change of variables t = φ−1 (x) we find that
g(φ(t))φ(t) dt + c̃ (13.16)
b
To eventually find a
g(x) dx we observe that
b φ−1 (b)
g(x) dx = h(t)
a φ−1 (a)
190
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 211
B. Consider
1
dx
.
−1 (x + 2)(3 − x)
Observe first that
dx dx dx
= = .
(x + 2)(3 − x) 2
6 − (x − x) 25 1 2
4
− (x − 2
)
2t
By a further change of variables s = 5
, i.e. ds = 25 dt we find
2 dt ds
2 = √ = arcsin s + c
5 1 − s2
1 − 2t5
2t
= arcsin + c.
5
Therefore we have
dx 2t 2x − 1
= arcsin + c = arcsin +c
(x + 2)(3 − x) 5 5
191
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 212
A COURSE IN ANALYSIS
and finally
1
1
dx 2x − 1
= arcsin
−1 (x + 2)(3 − x) 5 −1
1 3
= arcsin − arcsin − .
5 5
1 3
= arcsin( ) + arcsin( ).
5 5
C. Consider
2−x tanh 21−x dx.
Take t = 21−x which yields dt = −(ln 2)21−x dx, i.e. 2−x dx = − 2 ln1 2 dt and
therefore
−x 1−x 1
2 tanh 2 dx = (tanh t) − dt
2 ln 2
1 1
=− ln cosh t + c = − ln cosh 21−x + c
2 ln 2 2 ln 2
where we used that (ln cosh t) = tanh t.
D. Consider
1/√2
x arcsin x2
√ dx.
0 1 − x4
1 √2x dx
Take t = arcsin x2 to find dt = √ 2x dx, i.e. dt = 1−x4
and hence
1−(x2 )2
x arcsin x2 1 1
√ dx = t dt = t2 + c.
1 − x4 2 4
It follows that
√ 1/√2
1/ 2
x arcsin x2 1
√ dx = (arcsin x2 )2
0 1 − x4 4 0
2 2
1 1 1 π
arcsin − (arcsin 0)2 =
4 2 4 144
192
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 213
holds with suitable real numbers aij , bij and cij . Hence, whenever the integral
b P (x)
a Q(x)
dx exists we have
b k
pi
b l qi
b
P (x) aij bij x + cij
dx = dx + dx. (13.19)
a Q(x) a (x − zi )j a (x2 + αi x + βi )j
i=1 j=1 i=1 j=1
193
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 214
A COURSE IN ANALYSIS
Expanding the right hand side and comparing coefficients we end up with
four linear equations for the four unknowns A, B, C, D. Note that Q̃ =
(4x − 3)(2x + 5)3 does not have leading coefficient 1 and it is not of type
(13.17). However
3 3
3 3 5 3 5
Q̃(x) = 4 x − 2 x+ = 32 x − x+
4 2 4 2
= 32Q(x)
and Q(x) has leading coefficient 1 and is of type (13.17). In general we can
find γ0 ∈ R such that for a polynomial Q̃(x) we get Q̃(x) = γ0 Q(x) where
Q(x) has leading coefficient 1 and is of type (13.17).
For practical purposes switching from Q̃ to γ0 Q is often not needed, but in
order to get in (13.17) uniqueness up to the order of factors it is needed. An
alternative way is to use in (13.17) for a general polynomial Q̃ the represen-
tation γ0 (x − z1 )p1 · · · (x − zk )pk (x2 + α1 x + β1 )q1 · · · (x2 + αl + βl )ql where γ0
is the leading coefficient of Q̃.
Here is a more simple example:
Write
implying
6 − x = 5A − 3B + x(2A + B)
or
5A − 3B = 6 and 2A + B = −1
3
which yields A = 11
and B = − 17
11
.
Hence
3 17
6−x
= 11 − 11 ,
(x − 3)(2x + 5) x − 3 2x + 5
194
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 215
and therefore
6−x 3 1 17 2
dx = dx − dx
(x − 3)(2x + 5) 11 x−3 2 · 11 2x + 5
3 17
= ln |x − 3| − ln |2x + 5| + c.
11 22
b dx
B. Let −1, 1 ∈
/ [a, b] and consider a 1−x 2 . We try
1 1 A B (A + B) + (A − B)x
2
= = + =
1−x (1 − x)(1 + x) 1−x 1+x 1 − x2
which leads to A + B = 1 and A − B = 0, i.e. A = B = 12 .
This implies
b
1 1 b 1 1 b 1
2
dx = dx + dx
a 1−x 2 a 1−x 2 a 1+x
b
b
1 1 1
= dx − dx
2 a 1+x a x−1
b
1
= (ln |x + 1| − ln |x − 1|)
2 a
b
1 x + 1
= ln .
2 x − 1 a
Problems
1. Find
n
1 1
(1 + k 2 )x k2 dx.
0 k=1
195
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 216
A COURSE IN ANALYSIS
196
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 217
7. For m, n ∈ N prove
⎧
π ⎨ 0, n = m
1
(cos nx)(cos mx)dx = 1, n=m>0
π −π ⎩
2, n=m=0
and
π
1
(sin nx)(cos mx)dx = 0.
π −π
197
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 218
A COURSE IN ANALYSIS
11. Evaluate the following integrals. Note that a change of variables and
integration by parts may need to be used.
4 √
a) 3 2t+1 dt;
0
π
x sin x
b) I := 2
dx.
0 1 + cos x
π2
Hint: derive the equality I = 2
− I.
12. Use partial fractions to find
x+1
dx.
x4 − x
(The result of Problem 8 b) may eventually become useful.)
13. For f, g : [a, b] −→ R being three times continuously differentiable
prove
b
b
(3) b b b
f (t)g (t)dt = f g |a − f g |a + f g|a − f (3) (t)g(t)dt.
a a
14. Prove that for g continuously differentiable and g(s) > 0 we have
g (s)
ds = 2 g(s).
g(s)
Now find
π
2 cos r
√ dr.
π
6
sin r
198
Part 2
Analysis in One
Dimension
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 221
+ :R × R−→R · :R × R−→R
(x, y) −→x + y (x, y) −→x · y.
(x + y) + z = x + (y + z); (14.1)
x + 0 = x; (14.2)
x + (−x) = 0; (14.3)
x + y = y + x; (14.4)
where (14.2) means that in R there exists an element 0 such that x+0 = x for
all x ∈ R, i.e. 0 is a neutral element with respect to addition. Further we
interpret (14.3) as follows: for every x ∈ R there exists an inverse element
−x with respect to addition.
For multiplication we have with x, y, z ∈ R the rules
(x · y) · z = x · (y · z); (14.5)
1 · x = x; (14.6)
x · x−1 = 1 for x = 0; (14.7)
x · y = y · x; (14.8)
201
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 222
A COURSE IN ANALYSIS
x · (y + z) = x · y + x · z. (14.9)
It turns out that there are many sets K with operations + and · satisfying
(14.4)-(14.9), we call each such algebraic object (K, +, ·) a (commutative)
field. It can be easily checked that all rational numbers form a field, as do
the complex numbers C. In algebra a lot of consequences of these axioms
can be learned. These consequences justify our usual calculations in Q, R,
or C. Here we take these consequences for granted.
For R and Q we also have axioms of order:
(x − y) + (y − z) = x − z > 0 or x > z.
Next we show x > y and a > 0 implies ax > ay. Since x − y > 0 and a > 0
it follows that a(x − y) > 0 or ax − ay > 0, i.e. ax > ay.
Of course we will continue to use the notation x ≤ y and x ≥ y as defined in
Part 1, Chapter 1.
A (commutative) field (K, +, ·) on which (14.10)-(14.12) hold is called an
ordered field. Both R and Q are ordered fields, but C is not. This follows
for example from the fact that i2 = −1. Indeed, (14.12) implies for x > 0
that x2 > 0, for x = 0 we have x2 = 0, and for x < 0 it follows that −x > 0,
202
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 223
D = (0, 1)
C = (1, 1)
(l, 0)
B = (1, 0)
A = (0, 0)
Figure 14.1
203
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 224
A COURSE IN ANALYSIS
take r1 = q+p
2
and then take instead of p the number r1 , now continue this
procedure. Doing this N-times we find rN = q + p−q 2N
. Thus given > 0 we
can find a rational number rN such that |q − rN | < . Just take p = q + 1
and N such that 2−N < .
So we face the following strange situation: not every point on the “line”
corresponds to a rational number but we can put into the gap between two
rational numbers as many rational numbers as we like.
The number l, l2 = 2, lies between two rational numbers. We can argue as
follows: the square of the length d of the side AF of the triangle AEF with
A = (0, 0), E = (4, 0) and F = (4, 3) is equal 42 + 32 = 52 , i.e. d = 5, but
l < d:
F
3
D C
1
B (l, 0) E
0
A 1 2 3 4 5
Figure 14.2
−1
Thus we have 0 < l < 5 implying that we can get as close as we wish to l
in terms of rational numbers. A word of caution: once again we have mixed
geometric arguments with algebraic ones.
We need to resolve these problems, but before we can do this we need more
knowledge about properties of the real numbers (if they exist). We continue
for a while to pretend as if we already have the real numbers at our disposal
and try to deduce new tools so that we are eventually in a position to establish
the existence of the real numbers.
First let us add a further axiom
204
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 225
205
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 226
A COURSE IN ANALYSIS
or Gn ≤ An .
An ≥ an An−1
n−1 ≥ an an−1 An−2
n−2
≥ an an−1 · . . . · a1 = Gnn ,
or Gn ≤ An .
n
n
12 n
12 n
12
|aj bj | a2k b2k = b2j a2k a2j b2k
k=1 k=1 k=1 k=1
n
n
b2j a2k + a2j b2k
k=1 k=1
≤ ,
2
which implies
12 n 12
n n
aj bj ≤ a2j b2j .
j=1 j=1 j=1
n
Proof. If k=1 (ak + bk )2 = 0 the statement is trivial.
207
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 228
A COURSE IN ANALYSIS
n
In the case k=1 (ak + bk )2 > 0 we find
n
n
n
(ak + bk )2 = (ak + bk )ak + (ak + bk )bk
k=1 k=1 k=1
n n
≤ |ak + bk ||ak | + |ak + bk ||bk |
k=1 k=1
n
12 n
12 n
12 n
12
2
≤ (ak + bk ) a2k + 2
(ak + bk ) b2k
k=1 k=1 k=1 k=1
n
12 ⎛ n 12 n 12 ⎞
= (ak + bk )2 ⎝ 2
ak + bk ⎠
2
implying (14.19).
Corollary 14.6. For real numbers a1 , . . . , ak , b1 , . . . , bk , c1 . . . , ck we have
n
12 n
12 n
12
(ak − bk )2 ≤ (ak − ck )2 + (ck − bk )2 . (14.20)
k=1 k=1 k=1
an = (1 + x)n ≥ 1 + nx.
208
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 229
Problems
1. Given x, y ∈ R, x < y. Prove the existence of z ∈ R such that x < z <
y.
2. Using the axioms of an ordered field prove that for x, y, z ∈ R:
a) x < 0 implies −x > 0;
b) x2 > 0 for all x ∈ R;
c) a < 0 and x < y implies ax > ay.
3. Prove that Archimedes’ axiom holds in Q.
4. Show that there is no element a ∈ Q such that a2 = 3.
5. Using Bernoulli’s inequality prove
2nn ≤ (n + 1)n for n ≥ 1,
then, by induction show that
n n
n! ≤ 2 .
2
6. Use mathematical induction to prove for xk ≥ 0, k ∈ N, and n ∈ N
that n n
(1 + xk ) ≥ 1 + xk .
k=1 k=1
209
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 230
A COURSE IN ANALYSIS
Now prove
n
n
12
1 √
√ |ak | ≤ a2k ≤ n max (|a1 |, . . . , |an |).
n k=1 k=1
210
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 231
211
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 232
A COURSE IN ANALYSIS
q k+2 = q k+1 + q k , k ≥ 2,
satisfies for k ≥ 2
i.e.
ak+n = A1 ak+n−1 + · · · + An an
aj = xj , j = 0, . . . , n − 1,
by looking at solutions of
q n − A1 q n−1 − A2 q n−2 − · · · − An = 0.
212
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 233
Definition 15.3. Let (an )n≥k be a sequence of real numbers. The sequence
is called convergent to a ∈ R if for every ε > 0 there exists N = N(ε) ∈ N
such that n ≥ N() implies
|an − a| < ε. (15.6)
If (an )n∈N converges to a we call a the limit of (an )n∈N and we write
lim an = a. (15.7)
n→∞
a
a− a+
Figure 15.1
If (an )n∈N converges to a, then given ε > 0, all elements an , n ≥ N(ε), will
lie in the interval (a − ε, a + ε). This is equivalent to the statement that for
every ε > 0 all but a finite number of the an ’s will lie in (a − ε, a + ε).
We call the interval (a − ε, a + ε), ε > 0, an ε-neighbourhood of a. Thus
the convergence of (an )n∈N to a means that for every ε > 0 all but finitely
many elements of the sequence lie in the corresponding ε-neighbourhood of
a.
213
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 234
A COURSE IN ANALYSIS
1 1
B. Consider the sequence n n∈N
. We claim lim = 0.
n→∞ n
Given ε > 0, let N(ε) ∈ N be such that N(ε) > 1ε . It follows that
1
− 0 = 1 < ε for all n ≥ N(ε).
n n
which is a contradiction.
n Hence no a ∈ R can be the limit of ((−1)n )n∈N .
n
D. The limit of n+1 n∈N is 1, i.e. lim n+1 = 1.
n→∞
Given ε > 0 we find
n n − (n + 1) 1
n + 1 − 1 = n + 1 = n + 1 < ε.
Hence, if we choose N(ε) = 1ε + 1, then for each n ≥ N(ε) we have
n
n + 1 − 1 < ε.
n
E. We have lim n = 0.
n→∞ 2
For n > 3 we know that n2 ≤ 2n . It follows that
n2 n 1
n
≤ 1 or n < for n > 3.
2 2 n
Let ε > 0 be given and take N(ε) > max{3, 1ε }. Now n ≥ N() implies
n n 1
n − 0 = n ≤ < ε.
2 2 n
A helpful observation is
214
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 235
Lemma 15.6. For a convergent sequence (an )n≥k of real numbers and any
m∈N
lim an = lim an+m (15.8)
n→∞ n→∞
holds.
Proof. First define bn := an+m . Since limn→∞ an = a exists, for every > 0
there exists N such that n ≥ N implies |a − an | < . However for these n we
have n + m ≥ N and therefore
|bn − a| = |an+m − a| < .
215
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 236
A COURSE IN ANALYSIS
Example 15.10. The sequence (an )n≥0 of all Fibonacci numbers is divergent
since we always have that an ≥ n for n ∈ N0 . For n = 0, 1 this is trivial. Now
suppose an ≥ n for all n ≤ N, we find aN +1 = aN + aN −1 ≥ N + N − 1 =
2N − 1 ≥ N + 1
Example 15.11. We want to study the geometric sequence (q n )n∈N .
A. If |q| < 1, then lim q n = 0.
n→∞
We know by Corollary 14.8 that for ε > 0 there exists N ∈ N such that
|q|N < ε. Now we find
|q n − 0| = |q n | = |q|n ≤ |q|N < ε
for all n ≥ N.
B. For q = 1 we have q n = 1 and we already know that
lim q n = lim 1 = 1.
n→∞ n→∞
C. For q = −1, i.e. q = (−1) , we have just shown that ((−1)n )n∈N is
n n
divergent.
D. For |q| > 1 it follows that (|q|n )n∈N , hence (q n )n∈N is unbounded, compare
Lemma 14.7. Therefore (q n )n∈N is divergent.
√ 1
Example 15.12. We claim that lim n n = lim n n = 1. For this we set
√ n→∞ n→∞
an := n n − 1. Given > 0 we need to find N() ∈ N such that n ≥ N()
implies |an | = an < The binomial theorem yields
n
n n j n n(n − 1) 2
n = (1 + an ) = an ≥ 1 + a2n = 1 + an .
j 2 2
j=0
216
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 237
Proof. Suppose that (an )n≥k has two limits a and a . Then given ε > 0, since
lim an = a, there exists N1 ∈ N such that |an − a| < 12 ε for n ≥ N1 . On
n→∞
the other hand, since we also have lim an = a there exists N2 such that
n→∞
|an − a | < 12 ε for n ≥ N2 . Thus it follows that, if N ≥ max{N1 , N2 }, then
Theorem 15.14 (Sum of convergent sequences). Let (an )n≥k and (bn )n≥k
be two convergent sequences with limits a and b, respectively, i.e. lim an = a
n→∞
and lim bn = b. Then the sequence (cn )n≥k , cn := an + bn , converges to a + b,
n→∞
i.e.
lim cn = a + b.
n→∞
ε
Proof. Given ε > 0. For 2
> 0 there exist N1 and N2 such that
ε
n ≥ N1 implies |a − an | < ,
2
and
ε
n ≥ N2 implies |b − bn | <
.
2
For N = max{N1 , N2 } we find that n ≥ N implies
217
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 238
A COURSE IN ANALYSIS
Proof. Put lim an = a and lim bn = b. We know that (an )n∈N is bounded,
n→∞ n→∞
hence with some K > 0 we have |an | ≤ K for all n ∈ N. But (bn )n∈N is
also a bounded sequence, and without loss of generality we may also assume
that |bn | ≤ K for all n ∈ N. In addition by Theorem 15.8 we also know that
|a| ≤ K and |b| ≤ K. The convergence of (an )n∈N and (bn )n∈N implies that
for ε > 0 there exists N1 , N2 ∈ N such that
ε ε
|an − a| < for n ≥ N1 , and |bn − b| < for n ≥ N2 .
2K 2K
Now, for all n ≥ N := max (N1 , N2 ) we find
|an · bn − a · b| = |an (bn − b) + (an − a) · b|
≤ |an ||bn − b| + |an − a||b|
ε ε
≤K· +K · = ε.
2K 2K
218
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 239
an lim an
lim = n→∞ .
n→∞ bn lim bn
n→∞
|b|
Proof. Since b := lim bn we find for ε := 2
> 0 a number N0 ∈ N such that
n→∞
|b|
|bn − b| < for n ≥ N0 .
2
Since |b| − |bn | ≤ |bn − b| for n ≥ N0 we have
|b| |b|
|b| − |bn | < or < |bn |,
2 2
the last statement being equivalent to |b1n | < |b|
2
. In particular for all n ≥ N0
we have bn = 0 and hence for these n the expression abnn always makes sense.
Next suppose that an = 1 for all n. Given ε > 0 there exists N1 ∈ N such
that
ε|b|2
|bn − b| < for n ≥ N1 .
2
Therefore, for n ≥ N := max{N0 , N1 } we find
1
− 1 = b − bn = 1 |b − bn | < 2 |bn − b|
bn b bn b |bn ||b| |b|2
2 ε|b|2
< 2· = ε.
|b| 2
Hence we have proved that
1 1
lim = .
n→∞ bn b
an 1
Since = an · , n ≥ N0 , the general case follows now from Theorem
bn bn
15.16.
219
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 240
A COURSE IN ANALYSIS
7n2 + 3n
Example 15.20. Consider the sequence an = , n ∈ N. We may
n2 − 2
7 + n3
write an = . Now, lim n1 = 0, implying that lim n12 = lim n1 · n1 = 0
1 − n22 n→∞ n→∞ n→∞
Theorem 15.21. Let (an )n≥k and (bn )n≥k be two convergent sequences and
suppose that an ≤ bn for all n ≥ k. Then we have lim an ≤ lim bn .
n→∞ n→∞
a−b
Proof. Suppose b := lim bn < a := lim an . For ε := 2
> 0 there exists
n→∞ n→∞
N1 , N2 ∈ N such that |an − a| < ε for n ≥ N1 and |bn − b| < ε for n ≥ N2 .
For n ≥ max{N1 , N2 } we find that
an > a − ε and bn < b + ε.
By the definition of ε we have
a−b a+b a−b
a−ε=a− = = b+ = b+ε
2 2 2
implying that bn < b + ε = a − ε < an which contradicts the assumption
an ≤ bn and so the theorem is proved.
Remark 15.22. A. In particular an ≥ 0 implies that limn→∞ an ≥ 0.
B. Note that in Theorem 15.21 we need not assume that an ≤ bn for all
n ≥ k. It is sufficient to assume that an ≤ bn for all N0 ≥ k, N0 ∈ N.
C. Note further that an < bn for all n ≥ k (or all n ≥ N0 , N0 ≥ k) does not
imply that
lim an < lim bn .
n→∞ n→∞
To see this take the sequence an = 0 for all n ∈ N and bn = n1 . For all n ∈ N
we know that an = 0 < n1 = bn , but lim an = 0 = lim bn .
n→∞ n→∞
Corollary 15.23. Suppose that (an )n∈N is a convergent sequence and that
with two numbers A and B we have A ≤ an ≤ B for all n ∈ N, n ≥ N0 .
Then
A ≤ lim an ≤ B.
n→∞
220
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 241
Problems
1. Let M be a countable set and f : M → R. Prove that we can arrange
F (M) as a sequence, i.e. f (M) = {ak ∈ R|k ∈ N} with suitable real
numbers ak .
Hint: recall that M is countable if and only if there exists a bijective
mapping g : N → M.
2. a) Let (an )n≥k be a sequence of real numbers such that an = a for
all n ≥ M ≥ k. Prove that (an )n≥k converges and find its limit.
b) Let (an )n≥k be a sequence with limit a.
Consider the sequence
cn , k ≤ n ≤ M − 1
bn :=
an , n≥M
for any choice of numbers cn , k ≤ n ≤ M −1, and any choice of M ≥ k.
Prove that lim bn = a.
n→∞
3. Let (an )n≥k be a sequence converging to 0 and let (bn )n≥k be a bounded
sequence. Show that lim (bn an ) = 0.
n→∞
221
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 242
A COURSE IN ANALYSIS
8. For k ∈ N prove:
1
a) lim k = 0;
n→∞ n
1
b) lim 1 = 0.
n→∞ n k
9. Use the theorems about limits and already proved results about limits
of sequences to find:
(n + 1)2 − n2
a) lim ;
n→∞ n
√ √
b) lim ( n + 1 − n);
n→∞
n
j=1 j
c) lim ;
n→∞ n2
n 2
j=1 j
d) lim ;
n→∞ n3
1 + 2 · 3n
e) lim ;
n→∞ 5 + 4 · 3n
n + 4n
f) lim .
n→∞ 5n
√
10. Prove that lim n a = 1 for a ≥ 1.
n→∞
222
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 243
12. Find n
ak ν k
lim k=0
m l
, ν ∈ N, ak , bl ∈ R.
l=0 bl ν
ν→∞
i.e.
1
lim n f x0 + − f (x0 ) = A.
n→∞ n
223
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 245
n
more generally, if (an )n≥l , then sn := ak . We call sn the nth partial sum
k=l
∞
of the (infinite) series ak . Thus we have a new sequence (sn )n∈N and
k=1
∞
note that at the moment ak is just a formal expression for this sequence.
k=1
However, it may happen that the sequence of the partial sums (sn )n∈N con-
∞
verges to some limit s. In this case we denote the limit also by ak . Thus
k=1
∞
the symbol ak will have two meanings: a formal expression for the se-
k=1 n
quence of partial sums ak and, if it exists, the limit of the sequence
k=1 n∈N
of partial sums.
Remark 16.1. Note that every sequence (an )n∈N has a representation as the
partial sums of a series, i.e. in a certain sense sequences and series are in a
one-to-one correspondence. Indeed, given (sn )n∈N define
an = sn − sn−1 .
n
Then sn = ak .
k=1
225
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 246
A COURSE IN ANALYSIS
∞
1
Example 16.3. Consider the series k(k+1)
. We then see that
k=1
n
n
1 1 1
sn = = −
k=1
k(k + 1) k=1 k k+1
n
n−1
1 1 1
=1+ − −
k=2
k k=1 k + 1 n + 1
1 n−1 1 n−1
1
=1− + −
n + 1 k=1 k + 1 k=1 k + 1
1 n
=1− = ,
n+1 n+1
n
∞
1
i.e. (sn )n∈N = n+1 n∈N
and therefore the series k(k+1)
converges and its
k=1
limit is given by
∞
1 n
= lim sn = lim = 1.
k=1
k(k + 1) n→∞ n→∞ n + 1
226
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 247
∞
The series xk is called the geometric series (with parameter or variable
k=0
x ∈ (−1, 1)). Now we prove: let x ∈ R, x = 1, then for all n ∈ N0 we have
n
1 − xn+1
xk = .
k=0
1−x
and further
n+1
n
1 − xn+1
xk = xk + xn+1 = + xn+1
k=0 k=0
1−x
1 − xn+1 (1 − x)xn+1
= +
1−x 1−x
n+1 n+1
1−x +x − xn+2
=
1−x
1 − xn+2
= ,
1−x
and the result follows by mathematical induction.
Remark 16.5. Let us change our point of view and consider the function
f : R \ {1} → R, f (x) = 1−x 1
. If |x| < 1 then Theorem 16.4 says that f
∞ k
has a representation by x → ∞ k
k=0 x in the
∞ ksense that f (x) = k=0 x for
|x| < 1. We say that for |x| < 1 the series k=0 x converges to the function
f . Note that f is defined on a much larger set than the series converges, i.e.
the series represents f only on a subset of the domain of f .
Example 16.6. A. The following holds
∞ ∞ k
−k 1 1 1 1 1
2 = = 1+ + + +··· = 1 = 2.
k=0 k=0
2 2 4 8 1− 2
B. We have
∞
∞ k
−k 1 1 1 1 1 2
(−2) = − = 1− + − ±··· = 1 = .
k=0 k=0
2 2 4 8 1 − −2 3
227
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 248
A COURSE IN ANALYSIS
n
n
Proof. With cn := al and dn := bl we have
l=k l=k
n
n
n
(al ± bl ) = al ± bl = cn ± dn ,
l=k l=k l=k
implying that
∞ n
n n
(al ± bl ) = lim (al ± bl ) = lim al ± bl
n→∞ n→∞
l=k l=k l=k l=k
n n
∞
∞
= lim al ± lim bl = al ± bl .
n→∞ n→∞
l=k l=k l=k l=k
228
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 249
∞
1
∞
1
But we will see later that the series k
is not convergent, hence k+1
does
k=1 k=1
not converge. Hence
∞
∞
1 1
−
k k+1
k=1 k=1
Proof. Suppose that lim an = +∞. There exists n0 ≥ k such that an > 0
n→∞
for all n ≥ n0 . In particular we have an = 0 for n ≥ n0 . Now, given
ε > 0, there exists n1 such that an > ε for n ≥ n1 which implies a1n < ε
1
229
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 250
A COURSE IN ANALYSIS
Proof. We only handle the case an > 0 for all n ≥ k. Let K > 0 be given.
Since lim an = 0 there exists N ∈ N such that |an | < ε := K1 for n ≥ N.
n→∞
Hence
1 1 1
= > = K for n ≥ N,
an |an | ε
1
i.e. lim = +∞. The case an < 0 follows in a similar way.
n→∞ an
Suppose that there exists κ > 0 such that infinitely many al satisfy al ≥ κ.
We claim that in this case (sn )n≥k diverges to +∞. Indeed, given K > 0 we
can find N0 ∈ N such that κN0 > K. Since al ≥ κ for infinitely many l ≥ k
there exists N1 ∈ N such that in the set {ak , . . . , aN1 } at least N0 elements
satisfy al ≥ κ. We introduce the set
Here we used the fact that M(N0 , N1 ) has at least N0 elements. The notation
l∈M (N0 ,N1 ) al is almost self-explaining: the summation is over all elements
of M(N0 , N1 ), i.e. we sum up all al with l ∈ M(N0 , N1 ).
Therefore for a series ∞ l=k al of non-negative numbers to converge the fol-
lowing must hold: for every > 0 there exists N() ∈ N such that n ≥ N()
implies an = |an | < , i.e. lim an = 0.
n→∞
230
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 251
Observe that the following two new concepts arose in the considerations
above:
- monotonicity of a sequence: m > n implies am ≥ an ;
- selecting a subsequence: for infinitely many l we have al ≥ κ, in other
words we can find a sequence of integers lj , j ∈ N, lj ≥ k, such that alj ≥ κ,
i.e. (alj )j∈N is a new sequence whose elements are elements of the sequence
(al )l∈N and alj ≥ κ for all j ∈ N.
Problems
1. Let Sn := n(n+1)(2n+1)
6
, n ∈ N, be the nth partial sum of a sequence
(an )n∈N . Find an .
∞ ∞
2. Let an ≤ bn , n ∈ N, and∞suppose that n=1 an and n=1 bn converge.
∞
Prove that n=1 an ≤ n=1 bn holds.
2 1 1
3. Use the fact that 4k 2 −1
= 2k−1
− 2k+1
to prove that
∞
1 1
= .
k=1
4k 2 −1 2
231
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 252
A COURSE IN ANALYSIS
∞ 1 (−1)k
7. a) Find k=0 2k
+ 3k
;
1
b) Under the assumption that lim ln(1 + ) = 0 show that
n→∞ n
∞
1 1
ln 1 − 2 = ln ;
k=1
k 2
∞ 1
c) Suppose k=1 k 2 = A. Prove that
∞
1 3A
= .
(2k − 1)2 4
k=1
3 2
2 +n , n ∈ N, diverges to +∞.
+2n −2
8. a) Prove that the sequence an = n15n
b) Prove that the sequence sin1 1 diverges to +∞. Hint: for
n n∈N
all x ∈ R, we have | sin x| ≤ |x|.
9. Construct sequences (an )n∈N and (bn )n∈N of real numbers such that
lim an = ∞, lim bn = 0 and
n→∞ n→∞
232
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 253
a1 a2 a3 a4 . . .
M
Figure 17.1
The distance between an and an+1 , i.e. an+1 − an > 0 must become smaller
and smaller. Indeed, suppose that for infinitely many nk ∈ N, k ∈ N, we have
ank +1 − ank ≥ η > 0. We claim that there must exist an N ∈ N such that
n ≥ N implies an+1 ≥ M which is a contradiction.
By Archimedes’ axiom, given η > 0 there exists N ∈ N such that Nη ≥
M + |a1 |. Since ank +1 − ank ≥ η for infinitely many nk there exists N1 ∈ N
such that for at least N elements l ∈ {1, . . . , N1 }, we have anl +1 − anl ≥ η.
Now for n ≥ N1 we find
an+1 − a1 = an+1 − an + an − an−1 + · · · + a2 − a1
n
= (aj+1 − aj )
j=1
N
≥ (anl +1 − anl ) ≥ Nη ≥ M + |a1 |
l=1
233
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 254
A COURSE IN ANALYSIS
limit exist in R?
The following definition is a more formal approach to the statement that the
“distance between elements of a sequence becomes smaller and smaller”.
Proof. Suppose that (an )n≥k converges to a. Given > 0 we can find N ∈ N
such that |an − a| < 2 for all n ≥ N. Thus for n, m ≥ N we get
|an − am | + |(an − a) − (am − a)| ≤ |an − a| + |am − a| < + = .
2 2
Now let (an )n≥k be a Cauchy sequence. For = 1 there exists N ∈ N such
that n ≥ N implies
|an | − |aN | ≤ |an − aN | < 1
which yields for all l ≥ k
234
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 255
ordered field satisfying Archimedes’ axiom, the completeness axiom and into
which we can embed Q as a dense subset.
Next we continue by proving some of the important consequences of the
completeness axiom. We start with the famous Theorem of Bolzano and
Weierstrass. We first need to consider the following definition
Definition 17.4. Given a sequence (an )n≥k let k ≤ n1 < n2 < n3 < . . .
be a strictly increasing sequence of integers (nl )l∈N . We call the sequence
(anl )l∈N = (an1 , an2 , . . . ) a subsequence of (an )n≥k .
Example 17.5. Consider the sequence ((−1)n )n∈N . The sequence ((−1)2k )k∈N
and the sequence ((−1)2k+1 )k∈N are subsequences of ((−1)n )n∈N . Indeed, in
the first case we take nk = 2k, in the second case we have nk = 2k + 1.
Note that (−1)2k = 1 and (−1)2k+1 = −1, i.e. ((−1)2k )k∈N = (1, 1, . . . ), and
((−1)2k+1 )k∈N = (−1, −1, . . . ). Thus while ((−1)n )n∈N is divergent, each of
the two subsequences is convergent, but they have different limits.
Theorem 17.6 (Bolzano-Weierstrass). Every bounded sequence
(an )n∈N0 in R has a convergent subsequence.
Proof. We proceed in three steps.
1. Since (an )n∈N0 is bounded there exist numbers A, B ∈ R such that A ≤
an ≤ B for all n ∈ N0 . Let us consider the interval [A, B] := {x ∈ R; A ≤
x ≤ B}. We will use the principle of mathematical induction to construct a
sequence [Ak , Bk ], k ∈ N0 , of intervals having the following properties:
i) In each interval [Ak , Bk ] there are infinitely many elements of (an )n∈N0 ;
ii) [Ak , Bk ] ⊂ [Ak−1 , Bk−1], k ≥ 1;
iii) Bk − Ak = 2−k (B − A).
We start by setting A0 = A and B0 = B. Now suppose that [Ak , Bk ] is
already constructed and has the properties i)-iii). Let M := Ak +B 2
k
be the
centre of the interval. Since in [Ak , Bk ] there are infinitely many elements
of (an )n∈N0 , either [Ak , M] or [M, Bk ] (or both) must contain infinitely many
elements of (an )n∈N0 . Now we define the interval
⎧
⎪
⎨[Ak , M], if [Ak , M] contains infinitely many
[Ak+1 , Bk+1 ] := elements of (an )n∈N0
⎪
⎩
[M, Bk ], otherwise.
Obviously [Ak+1 , Bk+1] satisfies i)-iii).
2. Now we define inductively a subsequence (ank )k∈N0 of (an )n∈N0 such that
235
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 256
A COURSE IN ANALYSIS
Remark 17.7. Clearly the Bolzano-Weierstrass theorem also holds for se-
quences (an )n≥k .
Example 17.9. A. The sequence ((−1)n )n∈N has two accumulation points,
namely +1 and −1, compare with Example 17.5. Thus while the limit of
a sequence is always unique, a sequence may have a lot of (even infinitely
many) accumulation points.
B. The sequence an = (−1)n + n1 , n ∈ N, also has the two accumulation
points +1 and −1. Indeed we have
1 1
lim a2n = lim (−1)2n + = lim 1 + = 1,
n→∞ n→∞ 2n n→∞ 2n
and
2n+1 1 1
lim a2n+1 = lim (−1) + = lim −1 + = −1.
n→∞ n→∞ 2n + 1 n→∞ 2n + 1
236
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 257
It is unbounded but has one accumulation point, namely 0 since lim a2n−1 =
n→∞
1
lim = 0.
n→∞ 2n−1
Lemma 17.10. A. If (an )n≥k converges, then the limit is the only accu-
mulation point of (an )n≥k , i.e. every subsequence of a converging sequence
converges to the same limit.
B. If a subsequence of a Cauchy sequence converges, the whole sequence con-
verges (to the same limit).
Proof. Part A is obvious. B. Let (ak )k∈N be a Cauchy sequence and suppose
that (akl )l∈N converges to a. For > 0 there exists N1 ∈ N such that l ≥ N1
implies |akl − a| < 2 . Since (akl ) is a Cauchy sequence there exists N2 ∈ N
such that n, m ≥ N2 implies |an − am | < 2 . Thus for l ≥ N1 and nl ≥ N2 we
find for all n ≥ N2
|an − a| ≤ |an − anl | + |anl − a| < .
237
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 258
A COURSE IN ANALYSIS
The next result resolves one of the problems discussed at the beginning of
this chapter.
Theorem 17.14. Every monotone and bounded sequence (an )n≥k is conver-
gent.
Set N := nl0 then for every n ≥ N there exists l ≥ l0 such that nl ≤ n < nl+1 .
If (an )n≥k is monotone increasing (decreasing) it follows that
238
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 259
Proof. Since An ≤ An+1 < Bn+1 ≤ Bn it follows that the sequence of left
end points (An )n∈N0 as well as the sequence of right end points (Bn )n∈N0 are
bounded. Since each of these sequences is monotone it is convergent. Denote
their limits by A and B, respectively. Clearly we have A ≤ B. If A = B
we are done. Suppose that A < B. Then [A, B] ⊂ In and there exists
n∈N0
x0 ∈ [A, B] such that A < x0 < B. But in this case An < x0 < Bn implying
0 < x0 − An < Bn − An
and
An − Bn < −Bn + x0 < 0
leading to
0 ≤ x0 − lim An ≤ lim (Bn − An ) = 0
n→∞ n→∞
and
0 = lim (An − Bn ) ≤ − lim Bn + x0 ≤ 0,
n→∞ n→∞
239
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 260
A COURSE IN ANALYSIS
240
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 261
Problems
1. a) Consider the sequence (sn )n∈N , sn := nj=1 1j . Prove that s2n −
sn > 12 and deduce that (sn )n∈N diverges to +∞.
j+1
b) Prove that the sequence (sn )n∈N , sn := nj=1 (−1)j , is a Cauchy
sequence.
2. Let (an )n∈N be a sequence of real numbers such that for all n ≥ N we
have |an − an+1 | < 2−n . Prove that (an )n∈N is a Cauchy sequence.
3. Let (an )n≥k , (bn )n≥k , and (cn )n≥k be sequences of real numbers such
that lim an = a, lim bn = b and for all n ≥ k we have an ≤ cn ≤ bn .
n→∞ n→∞
Prove that (cn )n≥k has a convergent subsequence.
√
n
4. Given the sequence n+1 . Show that this is a bounded decreasing
n∈N
sequence and deduce that its limit exists.
n 1
5. Consider the sequence k=0 k! n≥0 . Prove that this sequence is bounded
and deduce that it must have the limit ∞ 1
k=0 k! .
Hint: first show that k! ≥ 2 k−1
for k ∈ N.
6. Let (an )n∈N , an ≥ 0, be a sequence and assume that (an )n∈N has no
accumulation points. Prove that lim an = ∞.
n→∞
7. Give an example of a sequence (an )n∈N such that −2, 13 , 17 are accumu-
lation points of (an )n∈N and −3 ≤ an ≤ 19 for all n ∈ N.
xkn − a (k − 1)xkn + a
xn+1 := xn − = , n ∈ N0 .
kxk−1
n kxk−1
n
1 √
k
Prove that lim xn = a k = a. Hint: use the following steps:
n→∞
241
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 262
A COURSE IN ANALYSIS
iv) xkn ≥ a;
v) xn+1 ≤ xn .
n
9.* Prove that the sequence (an )n∈N , an = 1 + n1 , has the limit ∞ 1
j=1 j! .
We denote the limit by e where e is called the Euler number.
n n+1
10. Let an = 1 + n1 and bn = 1 + n1 . Prove that ([an , bn ])n∈N are
nested intervals with {e} = n∈N [an , bn ], and e is the Euler number.
242
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 263
p
Proof. Let sp := al be the pth partial sum. It follows that
l=k
n
sn − sm−1 = al ,
l=m
and the criterion is nothing but the statement that the sequence of partial
sums forms a Cauchy sequence.
∞
Theorem 18.2. If the series al converges then lim al = 0.
l=k l→∞
∞
Proof. If al converges, then by Theorem 18.1, for every ε > 0 it follows
n l=k
that al < ε provided n ≥ m ≥ N for some suitable N ∈ N. Putting
l=m
n = m we find that |al | < ε for all n ≥ N, i.e. lim al = 0.
l→∞
1
∞
Example 18.3. A. For |q| < 1 we know that k=0 q k = 1−q , i.e. the series
converges. Moreover, by Example 15.11.A we know that lim q k = 0.
k→∞
∞
k
B. The series (−1) diverges since the sequence
k=1
((−1)k )k∈N does not converge to 0.
243
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 264
A COURSE IN ANALYSIS
∞
Theorem 18.4. Let al be a series of non-negative numbers al ≥ 0. This
l=k
series converges if and only if it is bounded, i.e. the sequence of its partial
sums is bounded.
p
Proof. Since al ≥ 0 for all l ≥ k the sequence of partial sums sp = al is
l=1
monotone increasing and bounded, hence by Theorem 17.14 it is convergent.
∞
Conversely, if al is convergent the corresponding sequence of partial sums
l=1
must be bounded.
∞
1
Example 18.5. The harmonic series n
diverges.
n=1
Referring to Problem 1 a) in Chapter 17, we may argue that (sn )n≥1 is not a
Cauchy sequence, hence it cannot converge. We give here a further proof by
showing that the partial sums are unbounded. Consider the special partial
sums
2k+1 k
2p+1
1 1 1
s2k+1 := =1+ +
n=1
n 2 p=1 n=2p +1 n
1 1 1 1 1 1 1
=1+ + + + + + + +
2 3 4 5 6 7 8
⎛ ⎞
k+1
2
1⎠
···+⎝ .
k
n
n=2 +1
Therefore
∞ we find s2k+1 ≥ 1+ k2 implying that the partial sums are unbounded
and so n=1 n1 is divergent.
∞
1
∞
Remark 18.6. Note that n
is an example of a divergent series al with
n=1 l=k
lim al = 0. Hence the converse of Theorem 18.2 does not hold.
l→∞
244
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 265
∞
Example 18.7. For all k ∈ N, k > 1, the series 1
nk
converges. To see this
n=1
∞
1
we apply Theorem 18.4 and prove the boundedness of nk
for k > 1. For
n=1
p ∈ N such that N ≤ 2p+1 − 1 we find
N 2 p+1
−1
1 1
sN := k
≤
n=1
n n=1
nk
2p+1 −1
1 1 1
=1+ k
+ k +···+
2 3 n=2p
nk
p p q
1 1
≤ 2q q k =
q=1
(2 ) q=1
2k−1
∞
1 2k−1
≤ (2−k+1 )q = = .
q=1
1 − 2−k+1 2k−1 − 1
The next result is useful when dealing with alternating series, i.e. series
in which consecutive terms change sign.
Theorem 18.8 (Leibniz’s criterion for alternating series). Let (an )n∈N
be a monotone decreasing sequence of non-negative real numbers with lim an =
n→∞
∞
n
0. Then the series (−1) an converges.
n=1
k
Proof. Set sk := (−1)n an . Since
n=1
it follows that
s0 ≥ s2 ≥ s4 ≥ · · · ≥ s2k+2 ≥ . . .
and analogously, since
we find
s1 ≤ s2 ≤ s3 ≤ · · · ≤ s2k+3 ≤ . . .
245
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 266
A COURSE IN ANALYSIS
In addition we have
s2k+1 ≤ s2k
since s2k+1 − s2k = −a2k+1 ≤ 0.
The sequence (s2k )k∈N is monotone decreasing and bounded since s2k ≥ s1 .
By Theorem 17.14 it is convergent, hence
lim s2k = S
k→∞
lim s2k = S .
k→∞
Further we find
∞
i.e. S = S . Now we prove (−1)k ak = S. For this let ε > 0 be given.
k=1
Then there exists N1 (ε), N2 (ε) ∈ N such that |s2k − S| < ε for k ≥ N1 ,
|s2k+1 − S| < ε for k ≥ N2 . Thus for k ≥ max (2N1 , 2N2 + 1) we find
|sk − S| < ε.
∞
(−1)k
Example 18.9. A. The alternating harmonic series k
converges.
k=1
∞
(−1)k
(Also compare with Problem 1 b) in Chapter 17.) B. The series 2k+1
k=0
converges.
∞
Definition 18.10. A series ak of real numbers is called absolutely con-
k=1
∞
vergent if the series |ak | converges.
k=1
246
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 267
∞
Proof. Suppose ak is absolutely convergent. According to the Cauchy
k=1
∞
criterion applied to the series |ak |, for ε > 0 there exists a number N(ε)
k=1
such that n ≥ m ≥ N implies
n
|ak | < ε.
k=m
∞
i.e. the Cauchy criterion holds for ak which implies the convergence of
k=1
∞
ak by Theorem 18.1.
k=1
Remark 18.12. The alternating harmonic series shows that the converse of
Theorem 18.11 is not true: a convergent series need not be absolutely conver-
gent. Convergent series which are not absolutely convergent are sometimes
called conditionally convergent.
∞
Theorem 18.13 (Comparison test). Let ck be a convergent series of
k=1
non-negative real numbers ck ≥ 0. Further let (ak )k∈N be a sequence such
∞
that |ak | ≤ ck for all k ∈ N. Then the series ak converges absolutely.
k=1
Therefore we find
n
n
|ak | ≤ ck < ε for n ≥ m ≥ N,
k=m k=m
247
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 268
A COURSE IN ANALYSIS
The next two tests, the ratio test and the root test are very powerful tools.
We will use these tests in this part and later on when dealing with power
series, and also in the chapter on complex analysis.
∞
Theorem 18.14 (Ratio test). Let an be a series such that an = 0 for
n=0
all n ≥ N0 . Suppose that there exists ν, 0 < ν < 1, such that
an+1
an ≤ ν for all n ≥ N0 .
∞
Then the series an converges absolutely.
n=0
∞
Proof. The convergence of the series an does not depend on the first N0
n=0
terms. Now
an+1
an ≤ ν for all n > N0 ,
∞
implies that |aN0 +k | ≤ |aN0 |ν k . Since 0 < ν < 1 the series ν n converges,
n=N0
so by Theorem 18.13 the theorem is proved.
an+1
Corollary 18.15. Let (an )n∈N0 be a sequence and suppose that lim =
n→∞ a
∞ n
a < 1. Then the series n=0 an converges absolutely.
Proof. Since a < 1 there exists > 0 such that 0 < a + < 1. For this > 0
there exists N ∈ N such that n ≥ N implies
an+1
− a <
an
or
an+1
an < a + < 1
and the ratio test then gives the result.
Remark 18.16. A. Note that changing finitely many elements in a sequence
or series does not effect its convergence behaviour.
∞
B. The series ck in Theorem 18.13 is called a majorant of the series
k=1
248
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 269
∞
ak .
k=1
an+1 an+1
C. Note that the condition an < 1 or lim = 1 are not sufficient
n→∞ a
∞ n
for the (absolute) convergence of k=1 ak as the harmonic series
shows. Here
a n
an+1 n+1
an = n1 , hence an+1
an
= an = n+1n
< 1 as well as lim = lim =
∞ 1
n→∞ an n→∞ n + 1
1 and n=1 n diverges.
∞
n2
Example 18.17. The series converges.
n=0
2n
n2
If an = 2n
, then for n ≥ 3 we have
an+1 (n + 1)2 2n 1 1 2 1 1 2 8
an = 2n+1 n2 = 2 (1 + n ) ≤ 2 (1 + 3 ) = 9 < 1,
It follows that
√ r, n even
n
an = √
n
ar, n odd
n
therefore using Problem 10 in Chapter 15 we find lim |an | = |r| < 1 and
n→∞
taking Remark 18.19, i.e. Problem 12 b), into account we find that ∞ n=1 an
converges absolutely.
249
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 270
A COURSE IN ANALYSIS
Proof. For the interval [1, N] we choose the partition t1 = 1 < 2 < 3 < · · · <
N
N. Then the sum f (1) + · · · + f (N − 1) is the Riemann sum for 1 f (x)dx
with respect to this partition and the points ξj = tj = j, whereas the sum
N
f (2)+· · ·+f (N) is the Riemann sum for 1 f (x)dx with respect to the same
partition and the points ξj = tj+1 = j + 1. Since f is decreasing it follows
that
N
f (2) + · · · + f (N) < f (x)dx < f (1) + · · · + f (N − 1).
1
y = f (x)
f (1)
f (2)
.
.
.
f (6)
1 2 3 4 5 6
Figure 18.1
N
Since f ≥ 0 it follows that 1 f (x)dx is an increasing sequence. Now if
∞ N ∈N
N
n=1 f (n) converges then 1
f (x)dx is a bounded increasing sequence
N N ∈N
N
and therefore lim f (x)dx exists. Conversely if lim f (x)dx exists,
N →∞ 1 N →∞ 1
250
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 271
i.e. is finite, then the increasing sequence SN = N n=1 f (n) is bounded by
N
∞
lim f (x)dx + f (1) and hence n=1 f (n) converges.
N →∞ 1
∞
1 1
Example 18.22. A. The series diverges since x → is a
n=2
n ln(n) x ln(x)
N
1 N
decreasing function and dx = ln(ln(x)) 1 = ln (ln N) − ln (ln 2)
2 x ln(x)
∞
1
does not have a finite limit. However the series converges since
n=1
n(ln(n))2
N
1 1 1 N
x → is a decreasing function and dx = − =
x(ln(x)) 2
2 x(ln(x)) 2 ln(x) 2
1 1
− + converges.
ln N ln 2
B. For α > 1 the function x → x1α is on [1, ∞) positive and decreasing.
Further
N N
1 1
1−α 1
α
dx = x = (N 1−α − 1).
1 x 1 − α 1 1 − α
Since α > 1 it follows that
N
1 1 1
lim α
dx = lim (N 1−α − 1) =
N →∞ 1 x N →∞ 1−α α−1
implying the convergence of the series ∞ 1
n=1 nα for α > 1.
ln(n)
Lemma 18.23. For α > 0, we have lim = 0.
n→∞ nα
k
Proof. Putting n = ek , this is equivalent to αk → 0 as k → ∞.
∞
e
k
But is a convergent series by the ratio test:
k=1
eαk
ak+1 (k + 1)eαk k+1 1 1
= α(k+1) = α
→ α < 1.
ak e k k e e
k
Therefore by Lemma 9.14 → 0 as k → ∞.
eαk
251
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 272
A COURSE IN ANALYSIS
∞
Theorem 18.27. Let an be an absolutely convergent series with limit A,
n=0
∞
i.e. an = A. Then every rearrangement of this series also converges to A.
n=0
252
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 273
253
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 274
A COURSE IN ANALYSIS
The next result relates rational numbers to real ones from the point of view
of approximation.
Proof. We will show that every positive real number can be approximated
m
by dyadic rationals. Let r ∈ R, r > 0, be the form n with n ∈ Z and m ∈ N
2
and let
n
xn := al 2−l
l=−k
x < 2k+1 .
Then we put a−k = 1 and define al by
-
1 if r − xl−1 > 2−l
al = (18.2)
0 if r − xl−1 ≤ 2−l
From the construction it follows that
i.e.
|x − xn | ≤ 2−n for n ≥ −k,
∞
hence x = lim xn = an 2−n .
n→∞ n=−k
254
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 275
−m−1 1
≤ (b − 1)b = b−m .
1 − b−1
For > 0 we find N ∈ N such that m ≥ m ≥ −k implies |sm − sm | < ,
namely if b−m < for m ≥ N, and the result then follows.
Of central importance is
Theorem 18.33. Let b ∈ N where b ≥ 2 then every real number x ∈ R has
a representation as a b-adic fraction, i.e.
∞
x = sgn(x) an b−n
n=−k
255
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 276
A COURSE IN ANALYSIS
Proof. Again we may assume that x > 0. By Lemma 14.7 there exists l ∈ N0
such that x < bl+1 . Let k be the smallest non-negative integer such that
0 ≤ x < bk+1 .
we have
xm ≤ x < xm + b−m .
Since
0 = 0 · bk < 1 · bk < · · · < (b − 1)bk < b · bk = bk+1
is a partition of [0, bk+1 ] and since 0 ≤ x < bk+1 , there exists exactly one
non-negative integer 0 ≤ a−k ≤ b such that
x
Figure 18.2
Thus we have a starting point for induction. Next we suppose that all an for
n ≤ m are already constructed such that
xm ≤ x < xm + b−m .
256
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 277
∞
x= an b−n .
n=−k
(0, 1) = {xn |n ∈ N} .
257
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 278
A COURSE IN ANALYSIS
c = 0.c1 c2 c3 c4 c5 . . .
with -
1
1 if akk =
ck :=
2 if akk = 1.
In particular we have ck = akk for all k ≥ 1. By assumption there must
be some n ∈ N such that xn = c which would imply ann = cn . This is a
contradiction and the theorem is proved.
Remark 18.36. The procedure used in the proof of Theorem 18.35 is called
Cantor’s diagonalisation argument (or procedure). In fact it was used 15
years earlier by Paul du Bois-Reymond.
258
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 279
Figure 18.3
Problems
1 n+k
1. For > 0, find N ∈ N such that n ≥ N implies m k=1 2 < . Why
does this imply the Cauchy criterion holds for the series ∞ −k
k=0 2 ?
259
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 280
A COURSE IN ANALYSIS
∞
n=1 nα , α ∈ R;
1
a)
∞
b) 1
n=2 n(ln n)α , α ∈ R.
5. Test the following alternating series for convergence:
(−1)n−1
a) ∞ n=1 nα
, α ∈ R;
∞ (−1)n+1
b) n=1 2n−1 ;
(−1)n
c) ∞ n=2 n ln n .
∞ 2
8. Suppose
that (a n ) n∈N and (bn ) n∈N are two sequences such that n=1 an
and ∞ b
n=1 n
2
converge. Prove the (extended) Cauchy-Schwarz in-
equality
∞ 12 ∞ 12
∞ ∞
ak bk ≤ |ak bk | ≤ a2k b2k
k=1 k=1 k=1 k=1
260
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 281
9. Let (ak )k∈N be a sequence of real numbers. Prove that the series
∞ 1 |ak |
k=1 2k 1+|ak | converges. Furthermore, for two sequences (ak )k∈N and
(bk )k∈N of real numbers prove
∞
∞
∞
1 |ak + bk | 1 |ak | 1 |bk |
k
≤ k
+ .
2 1 + |ak bk | 2 1 + |ak | 2k 1 + |bk |
k=1 k=1 k=1
10. Use the ratio test or otherwise to investigate the convergence of the
following series:
a) ∞ 6 −n2
n=1 n e ;
∞ 4n2 +15n−3
b) n=1 2 3 ;
n (n+1) 2
∞ xk
c) k=0 k! , x ∈ R;
∞ k x2k
d) k=0 (−1) (2k)! , x ∈ R.
11. Prove the following: if for a sequence (an )n∈N of real numbers an+1
an
≥
∞
λ > 1 then the series n=1 an diverges. Use this result to show the
divergence of:
(−1)n 3n
a) ∞ n=1 n4
;
3
b) ∞ n√2
n=1 (n+3) 4n+15 .
261
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 282
A COURSE IN ANALYSIS
18. Prove that if D ⊂ R is a set which contains an open interval (a, b), i.e.
(a, b) ⊂ D, then D is not countable.
Hint: use the fact that the interval (0, 1) is not countable and construct
a bijective mapping f : (a, b) → (0, 1).
262
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 283
19 Point Sets in R
Functions or sequences map subsets of the real line onto subsets of the real
line. In order to understand this process better we need to acquire more
knowledge of subsets of the real line. This is a task which will accompany
us for some time and it is partly more abstract and formal than students are
used to at the beginning of their studies. However it is unavoidable in order
to gain a deeper understanding of mathematics.
We already know a certain class of subsets of R and we have seen its impor-
tance: intervals.
For a ≤ b we define the closed interval by
noting that [a, a] = {a} is a closed interval. For a < b we have the open
interval
(a, b) := {x ∈ R|a < x < b}, (19.2)
and for a < b we have two kinds of half-open intervals, namely
and
(a, b] := {x ∈ R|a < x ≤ b}. (19.4)
We extend these notions to infinite or unbounded intervals. For a ∈ R
we set
Moreover we define
R+ := [0, ∞), (19.9)
so that (0, ∞) = R+ \ {0} and we occasionally use
(−∞, ∞) := R, (19.10)
263
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 284
A COURSE IN ANALYSIS
Proof. First note that there is a need for a proof. At a first glance the
notion of an open interval is unrelated to the notion of an open set. But
of course we should expect some consistency in our notions. Therefore let
(a, b) ⊂ R be an open interval. We want to prove that for x ∈ (a, b) there
exists > 0 such that the open interval (x − , x + ) is a subset of (a, b), i.e.
(x − , x + ) ⊂ (a, b). For this choose := 12 min(x − a, b − x) > 0 and it
follows that (x − , x + ) ⊂ (a, b).
a x
x− x+ b
Figure 19.1
Note that the proof is also valid for (−∞, b) or (a, ∞), i.e. both are open
sets.
We next want to study some properties of open sets.
Proof. A. Assume that ∩Nν=1 Aν =∅, otherwise there is nothing to prove since
by definition ∅ is open. Let x ∈ N ν=1 Aν , thus x ∈ Aν for all ν = 1, . . . , N.
Since Aν is open there exists ν > 0 such that (x − ν , x + ν ) ⊂ Aν . For
:= min1≤ν≤N ν > 0 we find
N
N
x ∈ (x − , x + ) ⊂ (x − ν , x + ν ) ⊂ Aν ,
ν=1 ν=1
264
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 285
19 POINT SETS IN R
implying the openess of N ν=1 Aν .
B. Now let I = ∅ be any index set and for j ∈ I let Aj ⊂ R be open.
Consider
A := Aj := {x ∈ R|x ∈ Aj0 for some j0 ∈ I}, (19.11)
j∈I
and assume that at least one set Aj1 is non-empty, otherwise A = ∅ and
nothing remains to prove. Take x ∈ A, then for some j0 ∈ I we have x ∈ Aj0
exists an open interval (x − , x + ) ⊂ Aj0 which
and since Aj0 is open there
yields (x − , x + ) ⊂ j∈I Aj = A and the lemma is proved.
Example 19.4. A. If a1 < b1 < a2 < b2 then the two intervals (a1 , b1 ) and
(a2 , b2 ) are open and disjoint. Their union (a1 , b1 ) ∪ (a2 , b2 ) is open too but
it is not an interval anymore.
a1 b1 a2 b2
Figure 19.2
∞ 1
Moreover, the set n=1 (n − n
,n + n1 ) is open.
1 1
B. Consider the open intervals (1 − n+1 , 1 + n+1 ). Their intersection is given
by
∞
1 1
{1} = (1 − ,1 + )
n=1
n+1 n+1
(compare also with Problem 4). The set {1} does not contain an open inter-
val, hence we cannot expect that an infinite intersection of open sets is open.
C. The following type of construction will be used (in a modified form) quite
often. Let a < b and f : [a, b] → R be a function. Let > 0 and for
∈ [a, b] consider the open interval (f (x) − , f (x) + ) ⊂ R. It follows that
x
x∈[a,b] (f (x) − , f (x) + ) ⊂ R is an open
set. The image of f , i.e. f ([a, b]) is
a subset in R and clearly f ([a, b]) ⊂ x∈[a,b] (f (x) − , f (x) + ). Thus we can
consider f ([a, b]) as a subset of an open set and every y = f (x) ∈ f ([a, b])
is the centre of an open interval of length 2 entirely belonging to this open
set. Clearly f ([a, b]) does not have to be open, just consider f : [a, b] → R,
f (x) = c ∈ R for all x ∈ [a, b]. Then f ([a, b]) = {c} which is not open.
265
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 286
A COURSE IN ANALYSIS
Bν = Bν ,
ν=1 ν=1
N
N
and since Bν is open, Bν is open, and hence Bν is closed. For
ν=1 ν=1
an arbitrary collection Bj ⊂ R, j ∈ I, of closed sets we have
and therefore
Bj = Bj ,
j∈I j∈I
266
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 287
19 POINT SETS IN R
and since each Bjc is open it follows from Lemma 19.3 that Bj is open,
j∈I
hence Bj is closed.
j∈I
Remark 19.8. In Problem 1 we will prove that [a, b) and (a, b] are neither
open nor closed.
Example 19.9. A. A single point a ∈ R forms a closed set {a} since {a} =
((−∞, a) ∪ (a, ∞)) . This implies that any finite union of points a1 , . . . , aN
is closed:
N
{aν |ν = 1, . . . , N} = {aν }.
ν=1
∞Let aν ∈ R, ν ∈ N and assume for some δ > 0 that |aν − aν+1 | ≥ δ. Then
B.
v=1 {aν } is a closed set. (Compare with Problem 3).
267
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 288
A COURSE IN ANALYSIS
268
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 289
19 POINT SETS IN R
i.e. we do not change the upper bound. We repeat this process to obtain a
decreasing sequence of upper bounds and an increasing sequence of elements
belonging to D, and we will prove that they converge to the same limit. Our
demonstration uses mathematical induction:
We construct
i) a sequence x0 ≤ x1 ≤ x2 ≤ · · · of elements in D, and
ii) a sequence K0 ≥ K1 ≥ K2 ≥ · · · of upper bounds of D such that
Kn − xn ≤ 2−n r for all n ∈ N, r = K0 − x0 . (19.14)
Starting with x0 and K0 let us assume that x0 , . . . xn , ∈ D and K0 , . . . , Kn ,
upper bounds of D, are already constructed such that (19.14) holds. Define
Kn + xn
M := .
2
There are two possibilities: if M is an upper bound of D, we put xn+1 := xn
and Kn+1 := M; if M is not an upper bound of D, we put Kn+1 := Kn and
choose xn+1 ∈ D with xn+1 > M. In each case we have
xn ≤ xn+1 , Kn ≥ Kn+1 and Kn+1 − xn+1 ≤ 2−n−1 r.
The sequence (Kn )n∈N is monotone decreasing and bounded since x0 ≤ Kn ≤
K0 . Hence (Kn )n∈N0 converges to some K ∈ R. Since for x ∈ D we always
have x ≤ Kn , it follows that x ≤ lim Kn = K, i.e. K is an upper bound for
n→∞
D. To show that it is the least upper bound, suppose K < K. Then there
exists n0 ∈ N such that 2−n0 r < K − K , which yields
xn ≥ Kn − 2−n r ≥ K − 2−n r > K ,
so that K is not an upper bound. Hence K = sup D. Note that (19.14)
implies limn→∞ Kn = limn→∞ xn .
Example 19.16. A. For a closed interval [a, b], a ≤ b, we have sup[a, b] = b
and inf[a, b] = a.
B. For an open interval (a, b), a < b, we find sup(a, b) = b and inf(a, b) = a.
We show that b = sup(a, b). Clearly, b is an upper bound for (a, b). Suppose
that b < b. It follows that
a + b b + b
x := max , ∈ (a, b)
2 2
and b < x, hence b could not be an upper bound.
269
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 290
A COURSE IN ANALYSIS
Definition 19.19. Let (an )n∈N be a sequence of real numbers. We define its
limit superior by
270
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 291
19 POINT SETS IN R
and
lim inf an = inf A. (19.18)
n→∞
271
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 292
A COURSE IN ANALYSIS
In order to see (19.20) note that passing from (an )n∈N to (−an )n∈N is a re-
flection about 0 which reverses all order relations.
272
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 293
19 POINT SETS IN R
Finally we want to provide some results which are useful to know, but we
provide the proofs only in Appendix VIII. We start with
Proof. Both ∅ and R are open and closed. Indeed ∅ is open by definition,
hence ∅ = R is closed. However R = n∈N (−n, n) is the union of open
sets, hence open, implying that R is closed. Suppose that A is open and
closed, hence A is open and closed and the connected set R has the splitting
R = A ∪ A . Hence either A or A is empty, hence A is either R or ∅.
We finally have
Problems
1. Prove that for a < b the half-open interval [a, b) is neither open nor
closed.
273
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 294
A COURSE IN ANALYSIS
274
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 295
19 POINT SETS IN R
12. Let (an )n∈N and (bn )n∈N be two sequences and let λ > 0. Prove
a) lim supn→∞ (λan ) = λ lim supn→∞ an ;
b) lim supn→∞ (an + bn ) ≤ lim supn→∞ an + lim supn→∞ bn ;
c) lim supn→∞ (an + bn ) ≥ lim supn→∞ an + lim inf n→∞ bn ;
d) if limn→∞ bn = b, i.e. the limit exists, then
13. The set A := [0, 1] ∪ {2} ∪ (3, 4) ⊂ R is not an interval, hence not
connected. Give a splitting {O1 , O2} of A.
275
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 297
20 Continuous Functions
In Chapter 6 we encountered the concept of a continuous function, see Defi-
nition 6.9. This notion depends on the idea of a limit of a function (at some
point of its domain) which was introduced in Chapter 6. Recall: a function
f : D → R, D ⊂ R, has the limit a as y ∈ D approaches x if for every > 0
there exists δ > 0 such that 0 < |x − y| < δ implies |f (y) − a| < . First we
want to relate this definition to limits of sequences.
Proof. Suppose that for ε > 0 there exists δ > 0 such that 0 < |y − x| < δ,
y ∈ D, implies |f (y) − a| < ε. Let limn→∞ xn = x, xn ∈ D. Then there
exists N = N(δ) such that for n ≥ N(δ) it follows that |xn − x| < δ. By
assumption it follows that |f (xn ) − a| < ε for n ≥ N(δ) = N(δ(ε)), i.e.
limn→∞ f (xn ) = a.
Suppose now that for every sequence (xn )n∈N , xn ∈ D, with limn→∞ xn = x
it follows that limn→∞ f (xn ) = a. We have to prove that for every ε > 0
there exists δ > 0 such that 0 < |y − x| < δ implies |f (x) − a| < ε. Suppose
this does not hold. Then there exists ε > 0 such that for no value of δ > 0
do we have |f (y) − a| < ε for all y ∈ D with 0 < |y − x| < δ. Thus for every
n ∈ N there exists xn ∈ D such that
1
|xn − x| < and |f (xn ) − a| ≥ ε .
n
This implies that limn→∞ xn = x and therefore limn→∞ f (xn ) = a, but
|f (xn ) − a| ≥ ε for some ε > 0 which is a contradiction.
i) for every ε > 0 there exists δ = δ(ε) > 0 such that for y ∈ D the
condition 0 < |y − x| < δ implies |f (y) − f (x)| < ε;
277
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 298
A COURSE IN ANALYSIS
Proof. For ε := |f (x)| > 0 there exists δ > 0 such that y ∈ D and 0 <
|y − x| < δ implies |f (y) − f (x)| < ε. It follows that
278
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 299
20 CONTINUOUS FUNCTIONS
c1 D1 x D2 c2
Figure 20.1
If limy→x f (y) = a then in the case limy→x f |D1 (y) we are approaching x from
the left, i.e. y < x, and in the case of limy→x f |D2 (y) we are approaching x
from the right, i.e. x < y. This leads to
Definition 20.5. A. We say that f : D → R has a limit from the right
if for every sequence (xn )n∈N , xn ∈ D and xn > x, with limn→∞ xn = x it
follows that limn→∞ f (xn ) = a. We write
279
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 300
A COURSE IN ANALYSIS
implies |xk − x| < δ, thus given > 0 we find N ∈ N such that k ≥ N implies
|f (xk ) − a| < proving the lemma.
Example 20.8. A. Consider x → [x]. Then limx1 [x] = 1 and limx1 [x] =
0.
Indeed, for any sequence (xn )n≥0 , xn > 1 and limn→∞ xn = 1 it follows for n
sufficiently large that [xn ] = 1, if however xn → 1 and xn < 1 then [xn ] = 0
for n large.
B. Let P (x) = xk + a1 xk−1 + . . . + ak−1 x + a, k ≥ 1, be a polynomial. It
follows that
lim P (x) = ∞
x→∞
and
+∞ for k even
lim P (x) =
x→−∞ −∞ for k odd.
Proof. For x = 0 we write
a1 a2 ak
P (x) = xk g(x) = xk 1 + + 2 + ...+ k .
x x x
If x ≥ c := max (1, 2k|a1 |, . . . , 2k|ak |) it follows that
1
g(x) ≥ ,
2
hence for these x we have
1 k x
P (x) ≥ x ≥ .
2 2
Thus, if xn → ∞ then P (xn ) ≥ x2n → ∞, or limn→∞ P (xn ) = ∞. Since
P (−x) = (−1)k Q(x) = (−1)k (xk − a1 xk−1 + . . . + (−1)k−1 ak−1 + (−1)k ak the
second statement follows from the first.
280
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 301
20 CONTINUOUS FUNCTIONS
f +g, λf , f · g.
f
In addition, if g(x) = 0, then g
is also continuous at x.
281
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 302
A COURSE IN ANALYSIS
ξ := lim an = lim bn .
n→∞ n→∞
In addition
f (ξ) = lim f (an ) ≤ 0 ≤ lim f (bn ) = f (ξ),
n→∞ n→∞
so that f (ξ) = 0.
Remark 20.15. This result allows us to decide whether the equation f (x) =
0 has a solution in the domain [a, b] of f . Suppose that for some c1 ∈ [a, b]
we have f (c1 ) > 0 (f (c1 ) < 0) and for some c2 ∈ [a, b], c2 > c1 we have
f (c2 ) < 0 (f (c2 ) > 0), then f |[c1,c2 ] satisfies the conditions of Theorem 20.14
and hence f (x) = 0 must have a solution ξ ∈ [c1 , c2 ] ⊂ [a, b].
282
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 303
20 CONTINUOUS FUNCTIONS
283
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 304
A COURSE IN ANALYSIS
and
f (q) = inf{f (x) | x ∈ [a, b]} = min{f (x) | x ∈ [a, b]}.
Proof. We prove the result for the maximum. For the minimum we only have
to consider −f instead of f . Set
lim f (xn ) = A.
n→∞
284
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 305
20 CONTINUOUS FUNCTIONS
x y
Figure 20.2
Clearly y − ε2y , y + ε2y y∈K is an open covering of K. By the compactness of
ε ε ε ε
K we may take a finite subcovering y1 − y21 , y1 + y21 , . . . , yN − y2N , yN + y2N
of K. It follows that
N
εy εy
Bx := x − j,x+ j
j=1
2 2
(i) [a, b] ⊃ I1 ⊃ I2 ⊃ . . .
285
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 306
A COURSE IN ANALYSIS
of [a, b] and by Proposition 20.25 it contains a finite subcovering (Aνj )νj ∈IN
where IN is a finite subset of I ∪ {p}. If p ∈ IN , then, since K ∩ Ap = ∅, we
can remove Ap and we still have a finite covering of K.
Our first application of compactness is related to uniform continuity.
Definition 20.27. A function f : D → R is called uniformly continuous
on D if for every ε > 0 there exists δ > 0 such that for x, y ∈ D the inequality
|x − y| < δ implies |f (x) − f (y)| < ε.
Remark 20.28. A. The important difference of continuity on D and uniform
continuity lies in the fact that in the latter case δ is independent of x ∈ D.
B. If f : D → R is uniformly continuous on D, then it is obviously continuous
on D. However the converse is false.
Example 20.29. The function f : (0, 1] → R, x → x1 , is
continuous
on (0, 1].
p p2
Indeed, for p ∈ (0, 1] and ε > 0 it follows with δ := min , ε
2 2
that
1 1 x − p 2 |x − p| 2δ
|f (x) − f (p)| = − = ≤ < 2 ≤ ε,
x p xp p 2 p
286
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 307
20 CONTINUOUS FUNCTIONS
Take δ := 12 min (δx1 ,ε , . . . , δxN ,ε ). For |x − y| < δ it follows that for some
1 ≤ j ≤ N we have
δxj ,ε δxj ,ε
x ∈ xj − , xj +
2 2
and further
δxj ,ε
|xj − y| ≤ |x − y| + |x − xj | < δ + < δxj ,ε ,
2
and therefore
ε ε
|f (y) − f (x)| ≤ |f (y) − f (xj )| + |f (x) − f (xj )| < + =ε
2 2
287
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 308
A COURSE IN ANALYSIS
Since f −1 (yn ) ∈ [a, b], a subsequence (f −1 (ynk ))k∈N converges by the Bolzano-
Weierstrass theorem:
lim f −1 (ynk ) = c,
k→∞
−1
and |c − f (ynk )| ≥ ε. Further f (f −1 (ynk )) = ynk and the continuity of f
implies
y = lim ynk = lim f (f −1(ynk )) = f (c),
k→∞ k→∞
−1 −1
i.e. f (y) = f (f (c)) = c contradicting |c − f −1 (y)| ≥ ε and the theorem
is proved.
Problems
1. Let f : [a, b] → R, a < b, be a function. Prove that f is continuous
at x ∈ [a, b] if and only if for every sequence (xn )n∈N , xn ∈ (a, b),
converging to x the following holds
288
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 309
20 CONTINUOUS FUNCTIONS
fr := y→x
lim f (y) = f (x) and fl := y→x
lim f (y) exists.
y>x y<x
289
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 310
A COURSE IN ANALYSIS
and -
−f (x), if f (x) ≤ 0
f− (x) :=
0, if f (x) > 0.
11. Let D ⊂ R be a non-empty set and C(D) the set of all continuous
functions f : D → R.
a) Prove that C(D) with its natural operations forms an R-algebra.
b) Let a : D → R be a fixed continuous function and define Aop :
C(D) → C(D) by Aop u = au, i.e. Aop u(x) = a(x)u(x). Prove that Aop
is a linear operator on C(D).
290
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 311
20 CONTINUOUS FUNCTIONS
291
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 312
A COURSE IN ANALYSIS
292
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 313
21 Differentiation
Let D ⊂ R and f : D → R be a function. We know by examples that even
continuous functions may look rather complicated. Thus we may ask the
question whether it is possible to approximate locally a given function by a
simpler function. Obviously straight lines (considered as graphs of functions)
are the simplest functions on R . They are given by
ga,b : R → R
x → ax + b
with a, b ∈ R. We want to make our considerations for a moment more
complicated and relate our point of view to linear algebra. Given an n-
dimensional vector space (V, R) over the reals. A mapping A : V → V is
called linear if A(λx + μy) = λAx + μAy holds for all λ, μ ∈ R and x, y ∈ V .
Choosing a fixed basis in V we know that with respect to this basis A has a
representation as an n×n-matrix. Now, R is a real vector space of dimension
1 and taking 1 ∈ R as basis any matrix is just a real number.
Thus all linear mappings Aa : R → R have the matrix representation x →
Aa x = ax where a ∈ R represents Aa .
Therefore we may interpret a straight line as the graph of the composition
of two mappings: A linear mapping x → ax and a translation Tb : R → R,
x → x + b, i.e. we consider
Tb ◦ Aa : R → R
x → Tb (Aa x) = Tb (ax) = ax + b.
We call these mappings the affine mappings ha,b := Tb ◦ Aa , a, b ∈ R, on R.
Thus straight lines are the graphs of affine maps. More generally:
Definition 21.1. Let (V, R) be a vector space over R. We call F : V → V
an affine mapping if F x = Ax + b holds for a linear mapping A : V → V
and a vector b ∈ V .
Let us return to our original problem. Given f : D → R and x0 ∈ D. We
are looking for an affine mapping ha,b : R → R such that in a neighbourhood
of x0 the function x → ha,b (x) is a good approximation of x → f (x). In
particular we require f (x0 ) = ha,b (x0 ). Thus in a neighbourhood of x0 we
want to have that
293
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 314
A COURSE IN ANALYSIS
f (x0 ) = ax0 + b.
Thus
|f (x) − f (x0 ) − a(x − x0 )| should be small,
which leads to
f (x) − f (x0 )
− a |x − x0 |
x − x0
should be small. Now suppose that φx0 : D → R is a function such that
φx0 (x)
lim = 0. (21.1)
x→x0 x − x0
294
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 315
21 DIFFERENTIATION
such that
f (x) = f (x0 ) + c(x − x0 ) + φx0 (x). (21.6)
In this case we have c = f (x0 ).
Proof. Suppose that f is differentiable at x0 ∈ D and set c = f (x0 ). Defining
φx0 (x) := f (x) − f (x0 ) − f (x0 )(x − x0 )
we find
φx0 (x) f (x) − f (x0 )
lim = lim − f (x0 ) = 0,
x→x0 x − x0 x→x0 x − x0
and obviously
f (x) = f (x0 ) + f (x0 )(x − x0 ) + φx0 (x).
Now suppose that (21.6) holds with φx0 satisfying (21.5). Then we find
immediately that
f (x) − f (x0 ) φx0 (x)
lim − c = lim =0
x→x0 x − x0 x→x0 x − x0
or
f (x) − f (x0 )
lim = c, i.e. c = f (x0 ).
x→x0 x − x0
295
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 316
A COURSE IN ANALYSIS
y-axis
y0 = f (x0 )
y = f (x)
x x0 x-axis
Figure 21.1
296
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 317
21 DIFFERENTIATION
Note that the tangent is defined after we have introduced the notion of dif-
ferentiability. An important consequence of Theorem 21.3 is
Corollary 21.5. A function differentiable at x0 is continuous at x0 .
Proof. Since f is differentiable at x0 we have
f (x) = f (x0 ) + f (x0 )(x − x0 ) + φx0 (x)
φx0 (x)
where limx→x0 x−x0
= 0. In particular we have limx→x0 φx0 (x) = 0 which
leads to
297
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 318
A COURSE IN ANALYSIS
298
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 319
21 DIFFERENTIATION
g◦f :D →R
holds.
Let f : D → R be a differentiable function. Then f : D → R is again
a function and we may ask whether f is differentiable. If yes, we call the
derivative (f ) of f the second derivative of f and write simply f . By
induction we define higher order derivatives(if they exist)
k−1
dk (k) d d d (k−1)
k
f (x) := f (x) := k−1
f (x) = f (x).
dx dx dx dx
299
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 320
A COURSE IN ANALYSIS
C ∞ (I) := C k (I).
k∈N
Moreover we set
Cbk (I) := {u ∈ C k (I)|u(l) is bounded for l = 0, . . . , k},
recall u(0) := u.
Exercise 21.14. Show that for 0 ≤ k ≤ ∞ the set C k (I) with its natural
operations, i.e. pointwise addition and multiplication is an R algebra.
Note that functions that are differentiable need not have continuous deriva-
tives.
Example 21.15. Let
-
x2 sin( x1 ) , x = 0
f (x) := .
0 ,x = 0
Then, by the rules f is differentiable for all x = 0. In fact
1 1
f (x) = 2x sin − cos
x x
and we have
1
x2 sin x
−0 1
f (0) = lim = lim x sin = 0.
x→0 x x→0 x
But f (x) does not have a limit as x → 0.
300
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 321
21 DIFFERENTIATION
Problems
1. In the situation of Figure 21.1 find the straight line Sx0 ,x : R → R
passing through the points (x0 , f (x0 )) and (x, f (x)). Further prove
that if gx0 is the tangent line of f at x0 then limx→x0 Sx0 ,x (t) = gx0 (t).
k
dk (k) k (k−l)
(f · g)(x) = (f · g) (x) = f (x)g (l) (x).
dxk l
l=0
is differentiable.
301
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 322
A COURSE IN ANALYSIS
k=1
302
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 323
21 DIFFERENTIATION
303
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 325
Proof. If f is constant on (a, b), then any c will do. If not, suppose first that f
has a local maximum at x ∈ (a, b). Take ε > 0 such that (x−ε, x+ε) ⊂ (a, b)
and
f (y) ≤ f (x) for all y ∈ (x − ε, x + ε).
It follows that
f (y) − f (x)
f+ (x) = y→x
lim ≤0 (22.2)
y>0
y−x
and
f (y) − f (x)
f− (x) = y→x
lim ≥ 0. (22.3)
y<0
y−x
The differentiability of f implies now
Remark 22.3. A. Note that f (x) = 0 does not imply that f has a local
extreme value at x: take f (x) = x3 and x = 0.
B. The geometric interpretation of Theorem 22.2 is that at a local extreme
305
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 326
A COURSE IN ANALYSIS
we can apply Theorem 22.4, and hence there is x ∈ (a, b) such that h (x) = 0
i.e.
0 = h (x) = (f (b) − f (a))g (x) − (g(b) − g(a))f (x),
implying the theorem.
Theorem 22.5 is sometimes called the second or the generalised mean
value theorem. Setting g(x) = x in this theorem gives
306
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 327
f (x) f (x)
lim = lim (22.7)
x→x0 g(x) x→x0 g (x)
307
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 328
A COURSE IN ANALYSIS
Proof. Suppose first that x > x0 . By Theorem 22.5 applied to the interval
[x0 , x], there exists y, x0 < y < x, such that
f (y)
As x → x0 , x > x0 , it follows that y → x0 , thus if limy→x0 g (y)
exists, it is
equal to limx→x0 fg(x)
(x)
. A similar argument works when x < x0 .
Already in Part 1 we made use of these rules, see Theorem 11.5 and Example
11.6.
Sometimes we may have to use L’Hospital’s rule more than once.
and g(x) = x. Then f (0) = g(0) = 0 and both functions are differentiable at
f (x)
0. Now = x sin(x) → 0 as x → 0, but
g(x)
f (x) 1 1
= f (x) = 2x sin − cos
g (x) x x
308
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 329
f (x) f (x)
lim = lim (22.8)
x→∞ g(x) x→∞ g (x)
309
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 330
A COURSE IN ANALYSIS
Proof. We consider the case f (x) > 0, the second case goes analogously. By
assumption we have
f (y) − f (x)
f (x) = lim > 0.
y→x y−x
and
f (y) > 0 for x < y < x + ε.
Therefore f is strictly monotone decreasing in [x−ε, x] and strictly monotone
increasing in [x, x + ε] :
y-axis
x x-axis
Figure 22.1
310
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 331
h̃x0
g̃x0
x0 x
Figure 22.2
g̃x0 = {(t, gx0 (t))|gx0 (t) = f (x0 )t+f (x0 )−x0 f (x0 ), t ∈ R} = Γ(gx0 ) (22.10)
which we also interpret as the graph Γ(gx0 ) of the function t → gx0 (t),
gx0 (t) = f (x0 )t + f (x0 ) − x0 f (x0 ). In the case where f (x0 ) = 0 then
g̃x0 is a line parallel to the x-axis and the line ñx0 := {(x0 , t)|t ∈ R} is par-
allel to the y-axis and passes through (x0 , f (x0 )) and they are perpendicular
to each other. However, ñx0 is not the graph of a function. For f (x0 ) = 0
we can consider the straight line
1 x0
ñx0 = (t, nx0 (t)|nx0 (t) = − t + f (x0 ) + = Γ(nx0 ),
f (x0 ) f (x0 )
311
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 332
A COURSE IN ANALYSIS
312
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 333
and then
x0 − c1 + f (x0 )(f (x0 ) − c2 ) = 0,
or again if f (x0 ) = 0
1 + f 2 (x0 )
c1 = x0 − f (x0 ) , (22.14)
f (x0 )
and finally, if f (x0 ) = 0,
3
(1 + f 2 (x0 )) 2
r= . (22.15)
|f (x0 )|
The condition f (x0 ) = 0 is of course natural when assuming that locally we
can improve the approximation by a straight line. The circle
κx0 := {(x, y) ∈ R2 |(x − c1 )2 + (y − c2 )2 = r2 }
⎧ 2
⎨ 2
2
1 + f (x0 )
= (x, y) ∈ R x − x0 + f (x0 )
⎩ f (x0 )
2 ⎫
(1 + f (x0 ))3 ⎬
2 2
1 + f (x0 )
+ y − f (x0 ) − =
f (x0 ) |f (x0 )|2 ⎭
Problems
1. Use the generalised mean value theorem to prove that for f ∈ C 2 ([a, b])
satisfying f (a) = f (b) and f (a) = f (b) = 0 there exists x1 , x2 ∈ (a, b),
x1 = x2 , such that f (x1 ) = f (x2 ).
313
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 334
A COURSE IN ANALYSIS
4. For 0 < p < q and x > 0 use the mean value theorem to show
p q
x x
1+ < 1+ .
p q
Hint: apply the mean value theorem to y → ln(1 + y) on [0, xq ] and on
[ xq , xp ], x > 0.
b)
(ln x)β
lim = 0;
x→∞ xα
c)
lim xx = 1.
x→0
x>0
7. Let f ∈ C 2 (R) such that f (0) = 1, f (0) = 0, and f (0) = −1. Prove
that for any a ∈ R
x
a 2
− a2
lim f √ = e .
x→∞
x>0 x
314
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 335
dk f (t)
(−1)k ≥ 0. (22.16)
dtk
A function f ∈ C ∞ ((0, ∞)) is called a Bernstein function if f ≥ 0
and for all k ∈ N
dk f (t)
(−1)k ≤ 0. (22.17)
dtk
Prove that for a > 0 the function t → e−at is completely monotone and
the function t → 1 − e−at is a Bernstein function. Furthermore show
that t → tα , 0 < α ≤ 1, is another Bernstein function.
1 2
10. a) Determine all local extreme values of f (x) = x 3 (1 − x) 3 .
b) Find the maximum of f : R → R given by
1 1
f (x) = + .
1 + |x| 1 + |x − 1|
315
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 337
y-axis
f (x2 )
λf (x1 ) + (1 − λ)f (x2 )
f (x1 )
f (λx1 + (1 − λ)x2 )
317
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 338
A COURSE IN ANALYSIS
318
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 339
The basic definition of convexity does not require differentiability and not
even continuity, it is a geometric statement expressed by an inequality. If we
consider Figure 23.1 then inequality (23.1) says that for all x ∈ [x1 , x2 ] the
graph of f lies below the line segment connecting (x1 , f (x1 )) to (x2 , f (x2 )).
This line segment is the graph of the function
f (x1 ) − f (x2 )
g(t) = f (x1 ) + (t − x1 ), t ∈ [x1 , x2 ]. (23.2)
x1 − x2
Hence convexity means
f (x1 ) − f (x2 )
f (t) ≤ f (x1 ) + (t − x1 ) (23.3)
x1 − x2
for all t ∈ [x1 , x2 ].
Lemma 23.4. A function f : I → R is convex if and only if for any three
points x < z < y, x, y, z ∈ I, the inequalities
f (x) − f (z) f (x) − f (y) f (z) − f (y)
≤ ≤ (23.4)
x−z x−y z−y
hold.
f (x) − f (y)
f (z) ≤ f (y) + (z − y),
x−y
319
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 340
A COURSE IN ANALYSIS
or
f (x) − f (y)
f (z) − f (y) ≤ (z − y).
x−y
Taking into account that z − y < 0, we eventually arrive at
proving the second inequality in (23.4). Now suppose that (23.4) holds and
take z = αx + (1 − α)y to find
which yields
or
f (αx + (1 − α)y) ≤ αf (x) + (1 − α)f (y),
proving the convexity of f .
Theorem 23.5. Let I be an interval with end points a < b and let f : I → R
be a convex function. For every x ∈ (a, b) the function is differentiable from
the right and from the left.
Proof. Take x ∈ (a, b) and t1 , t2 ∈ I such that x < t1 < t2 . From (23.4) we
deduce
f (x) − f (t1 ) f (x) − f (t2 )
≤ ,
x − t1 x − t2
in other words, the function F : [x, b] ∩ I → R, F (t) = f (x)−f
x−t
(t)
, is monotone
increasing. Further, for x1 ∈ I with x1 < x it follows again by (23.4) that
320
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 341
f (x) − f (t)
lim F (t) = lim = f+ (x)
t→x
t>0
t→x
t>0
x−t
exists. Analogously we see that f is differentiable from the left, i.e. f− (x)
exists for x ∈ (a, b).
Proof. With the same argument as in the proof of Corollary 21.5 we deduce
that if f is differentiable from the right (left) at x ∈ (a, b) then f is continuous
from the right (left) at x. Hence being continuous from the right and from the
left, f must be continuous at x. (A more detailed proof is given in Problem
3.)
and
g(λx1 + (1 − λ)x2 ) ≤ λg(x1 ) + (1 − λ)g(x2 )
by adding and multiplying by α ≥ 0, respectively. Moreover, if limn→∞ fn (x) =
F (x) < ∞ exists for all x ∈ I we can pass to the limit in
321
July 28, 2015 9:26 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 342
A COURSE IN ANALYSIS
Remark 23.9. Note that by Corollary 23.6 convex functions provide us with
a class of functions for which the pointwise limit of sequences belonging to
this class is always continuous.
Proposition 23.10. Let I ⊂ R be an interval and J = ∅ an index set.
Suppose that for each j ∈ J a convex function fj : I → R is given. Then if
g(x) := sup{fj (x)|j ∈ J} < ∞ (23.5)
is finite for each x ∈ I, then g : I → R is convex.
Proof. Let > 0. There exists fj , j ∈ J, such that for all x1 , x2 ∈ I and
λ ∈ (0, 1) the following holds
fj (λx1 + (1 − λ)x2 ) ≥ g(λx1 + (1 − λ)x2 ) − ,
which implies by the convexity of f
g(λx1 + (1 − λ)x2 ) − ≤ fj (λx1 + (1 − λ)x2 )
≤ λfj (x1 ) + (1 − λ)fj (x2 )
≤ λg(x1 ) + (1 − λ)g(x2 ).
Since > 0 is arbitrary we eventually get
g(λx1 + (1 − λ)x2 ) ≤ λg(x1 ) + (1 − λ)g(x2 ).
322
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 343
1
1
1 1
{x ∈ R2 | ||x||2 = 1} {x ∈ R2 | ||x||1 = 1}
Figure 23.2 Figure 23.3
323
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 344
A COURSE IN ANALYSIS
{x ∈ R2 | ||x||∞ = 1}
Figure 23.4
and
||x||p ≥ 0 for all x ∈ Rn and xp = 0 if and only if x = 0 ∈ Rn . (23.9)
1 1
Theorem 23.15 (Hölder’s inequality). Let p, q ∈ (1, ∞), p
+ q
= 1. For
x, y ∈ Rn it follows that the inequality
n
|xν yν | ≤ xp yq (23.10)
ν=1
holds.
324
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 345
|xν · yν | 1 1
ξν ην
= ξνp ηνq ≤ +
xp yq p q
and summing over all ν we have
n
1 1 1
|xν · yν | ≤ + = 1
xp yq ν=1 p q
p
Now, for p > 1 and q = p−1 , i.e. p1 + 1q = 1, we consider z ∈ Rn , zν =
p−1
|xν + yν | , ν = 1, . . . , n. It follows that
325
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 346
A COURSE IN ANALYSIS
or p
zq = x + ypq .
Next we first apply the triangle inequality and then Hölder’s inequality to
obtain
n
n
n
|xν + yν ||zν | ≤ |xν zν | + |yν zν |
ν=1 ν=1 ν=1
≤ (xp + yp)zq .
xk − x < ε.
326
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 347
Problems
1. Prove that the convexity of f : I → R implies Jensen’s inequality:
for every m ∈ N, m ≥ 2, and any choice of points x1 , . . . , xm ∈ I and
all 0 ≤ λj ≤ 1, j = 1, . . . , m, such that λ1 + · · · + λm = 1 it follows that
f (λ1 x1 + · · · + λm xm ) ≤ λ1 f (x1 ) + · · · + λm f (xm ). (23.12)
Hint: use mathematical induction with respect to m.
327
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 348
A COURSE IN ANALYSIS
328
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 349
1 + ||x||2
≤ 2(1 + ||x − y||2). (23.14)
1 + ||y||2
and
|||x||| := max(||x||(1) , ||x||(2) )
two further norms are given on Rn .
b) Prove the converse triangle inequality
(1) (n)
10. Let (xk )k∈N , xk ∈ Rn , xk = (xk , . . . , xk ) be a sequence in Rn . Prove
that (xk )k∈N converges to x ∈ Rn , x = (x(1) , . . . , x(n) ), in the norm
|| · ||p , 1 ≤ p < ∞, if and only if
(j)
lim |xk − x(j) | = 0
k→∞
for 1 ≤ j ≤ n.
(1) (n)
11. Let (xk )k∈N , xk = (xk , . . . , xk ) ∈ Rn , be a sequence converging in the
norm || · ||p, p ∈ [1, ∞), to some x = (x(1) , . . . , x(n) ) ∈ Rn . Suppose that
|| · || is a further norm on Rn satisfying the inequality ||y|| ≤ c||y||p for
all y ∈ Rn with some c > 0. Prove that (xk )k∈N converges to x with
respect to || · ||.
329
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 351
331
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 352
A COURSE IN ANALYSIS
y-axis
1 2 x-axis
n n 1
Figure 24.1
The sequence (fn )n∈N\{1} converges pointwise on [0, 1] to f = 0. Indeed, for
x = 0 we have fn (x) = 0 for all n. Further, for every x ∈ (0, 1] there exists
N = N(x) ≥ 2 such that
2
≤ x for n ≥ N(x)
n
332
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 353
Proof. Suppose that (fn )n∈N converges uniformly to f. Then for ε > ε > 0
there exists N = N(ε ) such that
hence
sup |fn (x) − f (x)| ≤ ε < ε for all n ≥ N.
x∈K
Conversely, since
for all x ∈ K it follows that if sup |fn (y) − f (y)| < ε, then
y∈K
|fn (x) − f (x)| < ε for all x ∈ K.
If the set K is fixed we just write f ∞ instead of f K,∞. We call ||f ||∞ the
supremum norm or just the sup norm of f .
333
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 354
A COURSE IN ANALYSIS
||f ||K,∞ ≥ 0 and ||f ||K,∞ = 0 if and only if f (x) = 0 for all x ∈ K;
i.e. f is the 0-element in Mb (K; R); (24.5)
||λf ||K,∞ = |λ|||f ||K,∞ for λ ∈ R and f ∈ Mb (K; R); (24.6)
||f + g||K,∞ ≤ ||f ||K,∞ + ||g||K,∞ for all f, g ∈ Mb (K; R). (24.7)
Note that the triangle inequality implies the converse triangle inequality,
i.e.
||f ||∞,K − ||g||∞,K ≤ | ||f ||∞,K − ||g||∞,K | ≤ ||f − g||∞,K ,
compare with Lemma 2.9 or Problem 9 b) in Chapter 23. The next theorem
shows the importance of uniform convergence.
335
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 356
A COURSE IN ANALYSIS
and Lemma 24.9. To prove ||f ||∞ ≤ C note that for > 0 there exists N()
such that n ≥ N() implies by the converse triangle inequality that
or
||f ||∞ ≤ + ||fn ||∞ ≤ + C,
however > 0 was arbitrary which implies ||f ||∞ ≤ C.
In order to simplify matters, in the following we will only investigate uniform
convergence in Mb (K; R). As a first result we prove that the Cauchy criterion
holds for uniform convergence in Mb (K; R).
Theorem 24.11. A sequence (fn )n∈N , fn ∈ Mb (K; R), converges uniformly
with limit f ∈ Mb (K; R) if and only if for every > 0 there exists N() such
that n, m ≥ N() implies ||fn − fm ||∞ < .
Proof. Suppose that (fn )n∈N converges uniformly to f . For > 0 there exists
N() such that n ≥ N() implies ||f −fn ||∞ < 2 which yields for n, m ≥ N()
that
||fn − fm ||∞ = ||fn − f + f − fm ||∞ ≤ ||fn − f ||∞ + ||fm − f ||∞ < + = .
2 2
Conversely suppose that for > 0 there exists N() such that n, m ≥ N()
implies ||fn − fm ||∞ < . This gives for every x ∈ K and n, m ≥ N(),
336
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 357
which yields
||fn − f ||∞ = sup |fn (x) − f (x)| ≤ , (24.11)
x∈K
337
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 358
A COURSE IN ANALYSIS
it follows
We have seen in Example 24.7 that there are pointwise convergent sequences
of continuous functions which are not uniformly convergent and whose limit
is not continuous. If we combine Proposition 23.8 and Corollary 23.6 we
see that the pointwise limit of convex functions is continuous, i.e. uniform
convergence is not needed to get continuity. The argument is that con-
vex functions are continuous and that pointwise limits of convex functions
are convex. The next result gives a further example that using additional
information we sometimes get that pointwise convergence implies uniform
convergence.
Remark 24.15. A. This result also holds for sequences of decreasing func-
tions.
B. Note that we do not require fn to be continuous, i.e. we may have a
sequence of non-continuous functions converging uniformly to a continuous
function.
338
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 359
It turns out that uniform convergence of (fn )n∈N or of (fn )n∈N is not sufficient:
339
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 360
A COURSE IN ANALYSIS
it follows that (gn )n∈N converges uniformly to 0, i.e. the function x → 0 for
all x ∈ R.
It turns out that the pointwise convergence of (fn )n∈N and the uniform con-
vergence of (fn )n∈N will be sufficient to imply (24.14), but to prove this we
will need more tools.
Problems
1. Show that the sequence (gn )n∈N , gn : R → R, where
-
x
, if n is even
gn (x) = n1
n
, if n is odd
2. Prove that the pointwise limit of-the sequence (fn )n∈N , fn : [0, 1] → R,
1
1 2
, x=0
fn (x) = 1+(nx−1)2 , is f (x) = and deduce that the
0, x ∈ (0, 1],
convergence cannot be uniform.
340
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 361
341
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 363
f (b) Γ(f )
f (a)
a b
Figure 25.1
We take for granted that the area of a rectangle with vertices (a, 0), (b, 0),
(b, c), (a, c), a < b, b < c, is given by
A = (b − a)(c − b).
a b
Figure 25.2
Interpreting the line segment connecting (a, c) with (b, c) as the graph of the
function fc : [a, b] → R, fc (x) = c, we find for the area of this rectangle
A = fc (a)(b − a), in fact A = fc (ξ)(b − a) for every ξ ∈ [a, b]. Furthermore,
when looking at Figure 25.3
343
July 28, 2015 9:26 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 364
A COURSE IN ANALYSIS
c2
c3
c1
···
cn
a = x0 x1 x2 x3 · · · xn−1 xn = b
Figure 25.3
we of course agree that the area A is given by
n
A= ck (xk − xk−1 ) (25.1)
k=1
Now we have the obvious idea: in order to find the area bounded by Γ(f ), f :
[a, b] → R, f (x) ≥ 0, and the interval [a, b] as well as the line segment
connecting (a, 0) and (a, f (a)) and the line segment connecting (b, 0) and
(b, f (b)), see Figure 25.1, we approximate Γ(f ) by the graphs of piecewise
constant functions, see Figure 25.4, and try to pass to the limit.
···
t1 t2 t3 t4 t5 t6
a = x0 = t0 x1 x2 x3 · · b· = xn = tm
Figure 25.4
344
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 365
This idea eventually leads to a solution of the original problem, but we must
first overcome a few difficulties. One problem is that the area A(f ) we
are looking for is not yet defined. In fact only after we define a proper
approximation process for a (large) class of functions can we define the “area
under the graph” of a function. We therefore need to find for a class of
functions a way of approximating them with piecewise constant functions ϕ
such that the area A(ϕ) associated with ϕ by (25.2) converges to a quantity
A(f ) which we can interpret as the “area under Γ(f )”.
First let us consider piecewise constant functions.
Definition 25.1. Let [a, b], a < b, be a closed and bounded, hence compact
interval. We call a finite set of numbers or points a = x0 < x1 < · · · <
xn−1 < xn = b a partition Z of [a, b].
The set of all step functions on [a, b] is denoted by T [a, b]. Note that in
Definition 25.2 no statement about the values f (xk ), xk ∈ Z, is made, except
that they are real numbers.
345
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 366
A COURSE IN ANALYSIS
346
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 367
and
ψ(x) − ϕ(x) − |ψ(x) − ϕ(x)| ≤ for all x ∈ [a, b]. (25.6)
For x = a = t0 we have ϕ(x) = f (x) = ψ(x), hence ϕ(x) ≤ f (x) ≤ ψ(x). For
x ∈ (tk−1 , tk ] it follows that |x − tk | < δ and therefore
− < f (x) − f (tk ) < ,
2 2
or
ϕ(x) = ck = f (tk ) − < f (x) < f (tk ) + = ck = ψ(x),
2 2
i.e. ϕ(x) ≤ f (x) ≤ ψ(x) for all x ∈ [a, b].
347
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 368
A COURSE IN ANALYSIS
Now we define an integral for step functions ϕ ∈ T [a, b] with the aim to
extend it at least to all continuous functions on [a, b].
Note that the integral does not depend on the values f (tk ), k = 0, . . . , n.
However, the integral as defined by (25.9) seems to depend on Z, but ϕ can
be represented with respect to other partitions. So we need to prove that the
integral only depends on ϕ and not on the chosen partition to represent ϕ.
b
Lemma 25.8. The definition of a f (x)dx is independent of the choice of
partition representing ϕ.
and
cj = ci for ki−1 < j ≤ ki ,
implying
m
n
ki
n
cj (tj − tj−1 ) = cj (tj − tj−1 ) = ci (xi − xi−1 ).
j=1 i=1 j=ki−1 +1 i=1
348
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 369
and
b
b
(λϕ)(x)dx = λ ϕ(x)dx. (25.11)
a a
Proof. For (25.10) we need to represent ϕ and ψ with respect to the same
partition and then we can use as for the proofs of (25.11) and (25.12) the
fact that the summation process is additive, homogeneous and positivity
preserving, i.e. the sum of non-negative numbers is non-negative.
Corollary 25.10. Let ϕ, ψ ∈ T [a, b] and ϕ ≤ ψ, i.e. ϕ(x) ≤ ψ(x) for all
x ∈ [a, b] then we have
b
b
ϕ(x)dx ≤ ψ(x)dx. (25.13)
a a
349
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 370
A COURSE IN ANALYSIS
a b a b
Figure 25.5 Figure 25.6
This leads us to introduce the upper and lower integral as well as the Rie-
mann integral.
b ∗
b
f (x)dx := inf ϕ(x)dx|ϕ ∈ T [a, b] and ϕ ≥ f (25.14)
a
a
b
b
f (x)dx := sup ϕ(x)dx|ϕ ∈ T [a, b] and ϕ ≤ f . (25.15)
∗ a
a
b ∗
b
f (x)dx = f (x)dx. (25.16)
∗
a a
350
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 371
and call the left hand side of (25.17) the Riemann integral of f (over
[a, b]).
C. Let f : [a, b] → R be a non-negative Riemann integrable function. The
area A(f ) bounded by Γ(f ), the interval [a, b], the line segment connecting
(a, 0) and (a, f (a)) and the line segment connecting (b, 0) and (b, f (b)) is
defined by
b
A(f ) = f (x)dx.
a
(Compare with Definition 12.3.)
Note that from Definition 25.11.A we always have
b
b ∗
f (x)dx ≤ f (x)dx. (25.18)
∗
a a
Thus the aim is to determine the class of bounded functions where equality
holds in (25.18) and to discuss properties of the Riemann integral. First
however we give two examples.
Example 25.12. A. For f ∈ T [a, b] clearly we have
b
b ∗
b
f (x)dx = f (x)dx = f (x)dx,
∗ a
a a
351
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 372
A COURSE IN ANALYSIS
b ∗
b ∗
b ∗
(f + g)dx ≤ f dx + gdx, (subadditivity) (25.19)
a a a
and for λ ≥ 0
b ∗
b ∗
(λf )dx = λ f dx, (positive homogeneity). (25.20)
a a
∗
b ∗
Proof. Let us write f for f dx if it is clear what is meant, analogously
a
352
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 373
Using the definition of inf and sup, given > 0, we can find ψ, ϕ ∈ T [a, b],
ϕ ≤ f ≤ ψ, such that
b
b
b
b
f (x) ≤ ϕ(x)dx + and ψ(x)dx − ≤ f (x)dx,
a a 2 a 2 a
hence we have:
353
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 374
A COURSE IN ANALYSIS
Furthermore we have
Theorem 25.17. Every monotone function f : [a, b] → R is Riemann inte-
grable.
Proof. Suppose that f is increasing (for decreasing functions the proof goes
analogously). By xk := a + (b−a)
n
, k = 0, 1, . . . , n, an equidistant partition of
[a, b] is given. We now define the two step functions ϕ, ψ ∈ T [a, b] by
ϕ(x) :=f (xk−1 ), xk−1 ≤ x < xk ,
ψ(x) :=f (xk ), xk−1 ≤ x < xk ,
as well as ϕ(b) = ψ(b) = f (b). Since f is monotone increasing we find
ϕ ≤ f ≤ ψ. Furthermore we have
b
b n
n
ψ(x)dx − ϕ(x)dx = f (xk )(xk − xk−1 ) − f (xk−1 )(xk − xk−1 )
a a k=1 k=1
n n
b−a
= f (xk ) − f (xk−1 )
n k=1 k=1
b−a
= (f (b) − f (a)).
n
354
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 375
Now given > 0 we can find N ∈ N such that for n ≥ N it follows that
b−a b−a
(f (xn ) − f (x0 )) = (f (b) − f (a)) < ,
n n
i.e. we have
b b
ψ(x)dx − ϕ(x)dx <
a a
and by Theorem 25.15 the result follows.
Note that now we have two classes of integrable functions which are not
necessarily continuous: step functions and monotone functions.
Theorem 25.18. The set of all Riemann integrable functions f : [a, b] → R
form a real vector space. In addition we have for two Riemann integrable
functions f, g : [a, b] → R
b
b
f ≤ g implies f (x)dx ≤ g(x)dx. (25.25)
a a
and
∗
∗
∗
f= f, g= g implies that (f + g) = (f + g)
∗ ∗ ∗
as well as
b b b
(f (x) + g(x))dx = f (x)dx + g(x)dx.
a a a
By Theorem 25.13 and Corollary 25.14 we find for λ > 0
∗
∗
∗
λ f dx = (λf )dx ≤ (λf )dx = λ f dx,
∗
∗
so λf is integrable and λf dx = λ f dx = λ f dx. For λ < 0 we
use Corollary 25.14, in particular (25.23). Finally, (25.25) follows once we
355
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 376
A COURSE IN ANALYSIS
b
know that f ≥ 0 implies a f (x)dx ≥ 0. But for f ≥ 0 there always exists
ϕ ∈ T [a, b] such that 0 ≤ ϕ ≤ f , hence
b
b
0≤ f (x)dx = f (x)dx.
∗ a
a
Proof. By our assumptions, given ε > 0 there are step functions ϕ, ψ ∈ T [a, b]
such that ϕ ≤ f ≤ ψ and
b
(ψ − ϕ)(x)dx ≤ ε.
a
356
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 377
d p
x = pxp−1
dx
the mean value theorem yields
ψ p − ϕp ≤ p(ψ − ϕ),
note that xp−1 ≤ 1 for 0 ≤ x ≤ 1. Hence we find
b
b
p p
(ψ − ϕ )(x)dx ≤ p (ψ − ϕ)(x)dx < ε,
a a
b
1. It follows that |g|p(x)dx exists, but
a
b
b
b
p p 1 1
|g| (x)dx = |f (x)| · dx = |f (x)|p dx
sup f (t) sup f (t)
a a t t a
357
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 378
A COURSE IN ANALYSIS
b
b
f (x)φ(x)dx = f (ξ) φ(x)dx. (25.29)
a a
Proof. Define
m := inf{f (x); x ∈ [a, b]},
and
M := sup{f (x); x ∈ [a, b]}.
It follows that
mφ ≤ f φ ≤ Mφ,
hence
b
b
b
m φ(x)dx ≤ f (x)φ(x)dx ≤ M φ(x)dx.
a a a
b
b
f (x)φ(x)dx = μ φ(x)dx.
a a
358
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 379
When combining the mean value theorem and the fundamental theorem of
calculus we get a powerful tool to derive estimates. For this reason we post-
pone applications of the mean value theorem until the next chapter. However
we state a very useful and often applied consequence of (25.30).
Corollary 25.22. Let f ∈ C([a, b]) and h > 0 such that x, x + h ∈ [a, b] then
x+h
1
lim f (t)dt = f (x). (25.31)
h→0 h x
is called the Riemann sum of f with respect to the partition Z and points
ξk , k = 1, . . . , n.
As before we denote the mesh size of the partition Z by
359
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 380
A COURSE IN ANALYSIS
with mesh size less than or equal to δ, i.e. η(Z) ≤ δ, and any choice of
general points ξk ∈ [xk−1 , xk ]
b
n
f (x)dx − f (ξk )(xk − xk−1 ) ≤ ε
k=1
a
holds.
Proof. Given ε > 0 there are step functions φ, ψ ∈ T [a, b] such that
b
ε
φ ≤ f ≤ ψ and (ψ − φ)(x)dx ≤ .
2
a
Without loss of generality we may assume that φ and ψ are given with respect
to the same partition
Since f is bounded
360
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 381
which implies
b
b
b
ε ε
ϕ(x)dx − ≤ F (x)dx ≤ ψ(x)dx + .
2 2
a a a
or
b
b
| f (x)dx − F (x)dx| ≤ ε
a a
361
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 382
A COURSE IN ANALYSIS
b
b
| f (x)g(x)dx| ≤ |f (x)g(x)|dx ≤ f p gq . (25.36)
a a
n
p1 n
p1
≤ |f (ξk )|p (xk − xk−1 ) + |g(ξk )|p (xk − xk−1 ) +ε
k=1 k=1
⎛ ⎞ p1 ⎛ ⎞ 1p
b
b
≤⎝ |f (x)|p dx⎠ + ⎝ |g(x)|pdx⎠ + 2ε
a a
362
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 383
363
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 384
A COURSE IN ANALYSIS
bj
where with Ij = [aj , bj ] we write Ij
f (x)dx for aj
f (x)dx.
Rewriting (25.39) as
N
N
f (x)dx = f (x)dx (25.40)
j=1 Ij j=1 Ij
and
a
f (x)dx := 0. (25.42)
a
Problems
1. Prove that the product of two step functions is a step function, i.e.
ϕ, ψ ∈ T [a, b] implies ϕ · ψ ∈ T [a, b], and deduce that T [a, b] is an
algebra.
364
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 385
365
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 386
A COURSE IN ANALYSIS
b 1
b
b) Denote by −a f (x)dx := b−a a
f (x)dx the mean value of f :
(n)
[a, b] → R which we assume to be Riemann integrable. With xj :=
a + nj (b − a), j = 0, . . . , n, prove that
n
b 1 (n)
− f (x)dx = lim f (xj ).
a n→∞ n
j=1
1 1
with p
+ q
= 1, 1 < p.
integrable.
b) Prove that for two Riemann integrable functions f, g : [a, b] → R
and every > 0 we have
b
b
2 1 b
|f (x)g(x)|dx ≤ |f (x)| dx + |g(x)|2dx.
a a 4 a
366
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 387
367
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 389
f (t)dt = hf (ξh ).
x
x+h
1 1
F (x) = lim f (t)dt = lim hf (ξh ) = f (x).
h→0 h h→0 h
x
369
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 390
A COURSE IN ANALYSIS
b
f (x)dx = F (b) − F (a).
a
x
F0 (x) = f (t)dt, (a ∈ I fixed).
a
b
F0 (a) = 0 and F0 (b) = f (t)dt.
a
A useful notation is
b
f (x)dx = F |ba , (26.1)
a
Let us restate (with full proofs) some rules for integration that have already
been proved in Chapter 13.
370
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 391
b
b
f (x)g (x)dx = f · g|ba − f (x)g(x)dx
a a
(26.3)
b
= f (b)g(b) − f (a)g(a) − f (x)g(x)dx.
a
b
b
f (x)g(x)dx + f (x)g (x)dx = F |ba = f · g|ba .
a a
b
φ(b)
f (φ(t))φ(t)dt = f (x)dx. (26.4)
a φ(a)
b
φ(b)
f (φ(t))φ (t)dt = F ◦ φ|ba = F (φ(b)) − F (φ(a)) = f (x)dx.
a φ(a)
371
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 392
A COURSE IN ANALYSIS
i.e.
a a a a
h(t)dt = h(t)dt + h(s)ds = 2 h(t)dt.
−a 0 0 0
B. Since
a 0 a
g(t)dt = g(t)dt + g(t)dt
−a −a 0
0 a
we are done if we can show that −a
g(t)dt = − 0
g(t)dt. The change of
variable t → −s yields however
0
0
0
a
g(t)dt = − g(−s)ds = g(s)ds = − g(t)dt.
−a a a 0
372
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 393
or
a
F (x + c) − F (x) = F (a) − F (a − c) = f (t)dt = 0. (26.8)
a−c
Moreover we have
2π
a+2π
(1 + cos x)dx = (1 + cos x)dx = 2π = 0
0 a
for all a ∈ R. Hence we cannot expect Proposition 26.8 to hold for all periodic
functions. b+c a
B. From (26.8) it follows that b f (t)dt = 0 for all b ∈ R if a−c f (t)dt = 0
for an a ∈ R.
Let f : R → R be a a continuous function with period c > 0 and for some
a ∈ R set A := a−c f (t)dt, A need not be zero. The function fA : R →
R, x → fA (x) = f (x) − Ac is once again periodic with period c:
A A
fA (x + c) = f (x + c) − = f (x) − = fA (x).
c c
373
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 394
A COURSE IN ANALYSIS
Moreover it holds
a
a
A
fA (x)dx = (f (x) − )dx = A − A = 0.
a−c a−c c
Hence we may apply Proposition 26.8 to fA to find that
b+c
b+c
b+c
A
0= fA (t)dt = (f (x) − )dx = f (x)dx − A, (26.9)
b b c b
374
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 395
implying (26.16).
375
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 396
A COURSE IN ANALYSIS
we get the right inequality. To get the left inequality note that t → eat −1−at
is on [0, ∞) for all a ≥ 0 monotone increasing since dtd (eat − 1 − at) =
a(eat − 1) ≥ 0. For t = 0 we have eat − 1 − at|t=0 = 0, hence we find
0 ≤ eat − 1 − at
or
(1 + at) ≤ eat ,
i.e. (1 + at)e−at ≤ 1, which is equivalent to
0 ≤ 1 − (1 + at)e−at
or
at ≤ 1 + at − (1 + at)e−at ,
leading eventually to
at
≤ 1 − e−at .
1 + at
Example 26.15. For a ≥ 0 and t ≥ 0 it holds
at
e − 1 + at 1 2
≤ a t. (26.18)
t 2
376
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 397
Note that all these examples work along the same idea: we want to estimate
the difference f (x) − f (y) for a given function f . If we can identify f as a
primitive, i.e. for some function g we have
y
f (x) − f (y) = g(t)dt,
x
377
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 398
A COURSE IN ANALYSIS
proved.
We refer to Problem 9 in Chapter 24 for an example showing that uniform
convergence of (fn )n∈N and pointwise convergence of (fn )n∈N is not sufficient
for (26.23) to hold. Theorem 26.19 will become particularly powerful when
applied to series of functions.
378
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 399
Problems
1. Prove
a) If f ∈ C k ([a, b]) then all primitives of f belong to C k+1 ([a, b]), k ≥
0.
b) Every f ∈ C([a, b]) determines by its primitive a one-dimensional
affine subspace of C 1 ([a, b]).
2. Define on x C([a, b]) the mapping T : C([a, b]) → C([a, b]) by (T f )(x) :=
f (a) + a e−t f (t)dt, x ∈ [a, b]. We call g ∈ C([a, b]) a fixed point of
T if T g = g, i.e. (T g)(x) = g(x) for all x ∈ [a, b]. Prove that if
g ∈ C k ([a, b]) is a fixed point it must belong to C k+1 ([a, b]), hence
g ∈ C ∞ ([a, b]) = ∞ k
k=1 C ([a, b]).
3. For f ∈ C(R), f ≥ 0, define for the right half-open interval I = [a, b),
a, b ∈ R
b
μ(I) := f (t)dt and μ(∅) := 0.
a
c) For a0 ∈ R fixed define the function μa0 (x) := μ([a0 , x)). Show
that μa0 ∈ C 1 (R) and μa0 (x) = f (x).
4. Let f : [−a, a] → R
xbe a continuous function and suppose that for all
x ∈ (0, a] we have −x f (t)dt = 0. Prove that f is an odd function.
Hint: first prove that if g : [a, b] → R is continuous and if for all
379
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 400
A COURSE IN ANALYSIS
β
α < β, α, β ∈ [a, b], it follows that α
g(t)dt = 0 then g(t) = 0 for all
t ∈ [a, b].
y ρ − xρ ≤ ρ(y − x).
b) For − π4 ≤ x < y ≤ π
4
show that
2
y − x ≤ √ (sin y − sin x).
2
8. Let f ∈ C 1 ([a, b]) and suppose that f ⊥ f . Prove that this implies
|f (a)| = |f (b)|.
380
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 401
9. Prove that on C01 ([a, b]) := {f ∈ C 1 ([a, b])|f (a) = f (b) = 0} a norm is
given by ||f ||L2 . Hint: use Corollary 26.10 and Problem 6 in Chapter
25.
is increasing.
Hint: prove that (fn )n∈N forms a Cauchy sequence with respect to the
norm || · ||∞ and apply Theorem 24.11.
N k
12. Consider the sequence SN (x) := k=0 x defined on (−1, 1). Let
[a, b] ⊂ (−1, 1) be a compact interval. Prove that SN (x) as well as
SN
(x) converge uniformly on [a, b]. Deduce that for m ∈ N, m ≥ 2 it
follows that ∞
k m
k
= .
k=1
m (m − 1)2
381
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 403
as well as
a
u(a) = u0 + h(t)dt = u0 .
a
Note that we can “derive” the solution (27.3) to (27.2) by “integrating”
(27.2):
x
x
u (t)dt = h(t)dt
a a
383
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 404
A COURSE IN ANALYSIS
implying
x
u(x) − u(a) = h(t)dt,
a
or
x
x
u(x) = u(a) + h(t)dt = u0 + h(t)dt.
a a
Let us have a closer look at the integral on the left hand side. If we consider
z = u(t) as a new variable then the rule for integration by substitution,
compare with Proposition 26.6 or Theorem 13.12, yields
u(x)
x
g(z)dz = g(u(t))u(t)dt, (27.6)
u(a) a
which implies
u(x)
x
g(z)dz = h(t)dt. (27.7)
u(a) a
i.e.
G(u(x)) = H(x) + G(u0 ) − H(a). (27.9)
If we add the assumption that G has an inverse, we find
384
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 405
Next we suppose that G−1 is differentiable, for example the condition G (y) =
g(y) = 0 for all y ∈ R will be sufficient. It follows from (27.10) that
d d −1
u(x) = G (H(x) + G(u0 ) − H(a))
dx dx
d
= (G−1 ) (H(x) + G(u0 ) − H(a)) (H(x) + G(u0 ) − H(a))
dx
1
= −1 h(x)
G (G (H(x) + G(u0 ) − H(a)))
1
= h(x)
g (G−1 (H(x) + G(u0 ) − H(a)))
1
= h(x),
g(u(x))
or
d
g(u(x)) u(x) = h(x),
dx
i.e. by (27.10) we have indeed a solution to (27.4). We have added two new
conditions: G has an inverse and G−1 is differentiable. However, these two
conditions are not independent: if g(y) = 0 for all y ∈ R, it must be either
strictly positive or strictly negative. Since G is the primitive of g, i.e. G = g,
in the first case G is strictly monotone increasing and in the second case it is
strictly monotone decreasing. In each case however G has an inverse which is
differentiable. Thus we have proved the following existence and uniqueness
result:
Theorem 27.1. Let h : [a, b] → R, a < b, be a continuous function and
u0 ∈ R. Suppose that g : R → R is a continuous function and g(y) = 0 for
all y ∈ R. In this case the initial value problem
385
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 406
A COURSE IN ANALYSIS
Remark 27.2. Note that after we have derived a candidate for u as a solution
to (27.4) we have to verify that this function is indeed a solution. This is
typical for solving differential equations: to derive a formula for u we need
to do some calculations, but since we do not know what u is, we may not
be able to justify these calculations. Thus we pretend as if all steps in the
calculation are allowed, and once we have derived a formula we try (and we
have) to verify that this formula gives a solution.
Remark 27.3. Let us return to formula (27.8) (or (27.9)). In the case
that h and g are continuous, hence have a primitive, this formula makes
sense for any u : [a, b] → R, u(a) = u0 . Hence we can call a function, not
necessarily differentiable, a generalised solution to (27.4) if (27.8) holds. Non-
differentiable solutions of (partial) differential equations are of importance,
however we first need to understand more about differentiable solutions.
For solving differential equations, say the initial value problem (27.4), some-
times a rather formal approach is helpful. With y = u(x) we write (27.4)
as
dy
g(y) = h(x), (27.13)
dx
and
y0 = u(a). (27.14)
We now write (27.13) formally as
(of course we only need to include one constant). Thus we have a formal
algorithm:
dy
To solve g(u(x))u(x) = h (x), look at g(y) dx = h(x), and integrate g(y)dy =
h(x)dx, i.e. for the integration process the variables are separated and con-
sequently this method is called separation of variables. Once we have
separated the variables we try to evaluate the two integrals in (27.16) and
we then return to (27.8).
386
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 407
A COURSE IN ANALYSIS
or
d
3u2 (x) u(x) = cos x.
dx
Note that we cannot apply Theorem 27.1 since g(z) = 3z 2 has a zero at
z0 = 0. The short calculation above is still formal, we still need to specify for
which values of x it holds for. The problem are points where sin x = −1, i.e.
x = 3π
2
+ 2kπ, k ∈ Z. At these points u given by (27.20) is not differentiable.
However for x = 3π
2
+ 2kπ we have
du(x) 2 1 2
3u2 (x) = 3(1 + sin x) 3 · (1 + sin x)− 3 cos x = cos x
dx 3
du 3π
which implies that although dx
does not exist for x = 2
+ 2kπ we still have
2 du(x) 3π
lim 3u (x) = lim cos x = cos (= 0).
x→ 3π
2
+2kπ dx x→ 3π
2
+2kπ 2
1
With this interpretation we can claim that u(x) = (1 + sin x) 3 satisfies
3u2 u = cos on the entire real axis even if it is not differentiable at certain
points.
Example 27.5. Our aim is to find a solution to the initial value problem
du(x)
+ 3u(x) = 8, u(0) = 2. (27.21)
dx
We may write this equation as
dy
1 + (3y − 8)1 = 0, y = u(x), y0 = u(0),
dx
and formally we find
dy
= dx,
8 − 3y
or
dy
= dx + C,
8 − 3y
i.e.
1
− ln(8 − 3y) = x + C.
3
388
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 409
389
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 410
A COURSE IN ANALYSIS
Consequently we find
u(x)
d d
g(z)dz = (G(u(x)) − G(v(x)))
dx v(x) dx
= G (u(x))u (x) − G (v(x))v (x)
= g(u(x))u(x) − g(v(x))v (x).
Hence we have
Even in the case of Theorem 27.1 we will in general have no explicit formula
for the solution of (27.11) and of course this applies to (27.17) too. Neither
should we expect to find G or H explicitly, nor will we have an explicit
formula for G−1 . Nonetheless, for a function satisfying, say (27.17), we can
derive some properties. Here are some first observations. Suppose u : [a, b] →
R is differentiable and solves
390
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 411
u (x) = −f (x)u(x)
We can iterate this process until on the right hand side the k th derivative of
f appears, which is of course the case when forming u(k+1)
dk dk
u(k+1) (x) = u (x) = (−f (x)u(x))
dxk dxk
k
k (l)
=− f (x)u(k−l) (x),
l
l=0
391
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 412
A COURSE IN ANALYSIS
Problems
1. For c < 0 consider the function uc : R → R defined by
⎧ 2
x
⎪
⎨4, x>0
uc (x) := 0, c≤x≤0
⎪
⎩ (x−c)2
− 4 , x < c.
Prove further that every uc satisfies uc (2) = 1. Now deduce that the
initial value problem
v (x) = |v(x)|, x ∈ (2, 3), v(2) = 1,
392
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 413
ϕ(s)
c) ϕ (s) = tan s
, ϕ( π4 ) = π4 ;
393
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 415
exist for α, β ∈ I. b
B. The definition of a f (t)dt is independent of the choice of α since for
α, γ ∈ I it follows that
d
α
d
γ
d
f (t)dt = f (t)dt + f (t)dt = f (t)dt + f (t)dt.
c c α c γ
α
C. In the case that f |[a,α] or f |[α,b] is already integrable, i.e. a f (t)dt or
d
α
b
α
f (t)dt exist, then we need only require that lim f (t)dt or lim f (t)dt
d→b α c→a c
exist and we can define
b
α
d
f (t)dt := f (t)dt + lim f (t)dt (28.4)
a a d→b α
395
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 416
A COURSE IN ANALYSIS
and
b α b
f (t)dt := lim f (t)dt + f (t)dt, (28.5)
a c→a c α
respectively.
D. Of course it is possible to reduce all limits under consideration to limits
of the type
α
b−
lim f (t)dt or lim f (t)dt, (28.6)
→0 a+ →0 α
with > 0 such that a + , b − ∈ I.
This definition allows us already to integrate certain continuous functions
defined on open or half-open intervals, and even some unbounded functions
are included.
Example 28.3. Let R > 0 and 0 < α < 1. Consider the unbounded,
continuous function fα : (α, R] → R, x → x1α . For c ∈ (0, R] it follows that
R
R R
1 1 1 1 1−α
fα (x)dx = α
dx = · α−1 = R − c1−α .
c c x 1−α x c 1−α
Since 1 − α > 0 we find
R
1
lim fα (x)dx = R1−α ,
c→0 c 1−α
hence
R R
1 1
fα (x)dx = α
dx = R1−α . (28.7)
0 0 x 1−α
Example 28.4. The following holds
1
0
1−
dx dx dx
√ = lim √ + lim √
1−x 2 →0 −1+ 1−x 2 →0 0 1 − x2
−1
= − lim arcsin(−1 + ) + lim arcsin(1 − )
→0 →0
π π
=− − + = π,
2 2
i.e.
1
dx
√ = π. (28.8)
−1 1 − x2
396
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 417
or
π
−
2
2I = (ln(sin x) + ln(cos x))dx
π −
π −
2 2 sin 2x
= ln(sin x cos x)dx = ln dx
2
π −
π −
2 2
= ln(sin 2x)dx − ln 2 dx
π −
2 π
= ln(sin 2x)dx − ln 2( − + )
2
π −
2 π
= ln(sin 2x)dx − ln 2.
2
397
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 418
A COURSE IN ANALYSIS
implying
2
π−
π 1
2I = I − ln 2 − ln(sin t)dt + ln(sin t)dt ,
2 2 π−2
or
2 π−
π 1
I = − ln 2 − ln(sin t)dt + ln(sin t)dt .
2 2 π−2
Since lim (y α ln y) = 0 for any α > 0 we first note that lim ((sin 2) ln(sin 2)) =
y→0 →0
0, implying of course that lim ln(sin 2) = 0. Next we note that
→0
and
max | ln(sin t)| = − ln(sin 2),
t∈[π−2,π−]
exists we denote it by
∞
R
b
b
f (x)dx := lim f (x)dx g(x)dx = lim g(x)dx .
a R→∞ a −∞ R→∞ −R
(28.11)
398
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 419
and
b d
f (x)dx := lim f (x)dx (28.13)
α d→b α
exist, then we define
b
α
b
f (x)dx := f (x)dx + f (x)dx (28.14)
a a α
α
d
:= lim f (x)dx + lim f (x)dx.
c→a c d→b α
α b
Remark 28.7. As before, if one of the integrals a f (x)dx or α f (x)dx
exist as the Riemann integral of a continuous function defined on a compact
interval, then in (28.14) we need to consider only one limit.
1
Since lim = 0, note that α − 1 > 0, it follows that
R→∞ Rα−1
∞
R
dx dx 1 1 1
α
= lim α
= lim 1 − α−1 = .
1 x R→∞ 1 x R→∞ α − 1 R α−1
399
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 420
A COURSE IN ANALYSIS
400
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 421
Theorem 28.13. Suppose that f ≥ 0. Then for every α ∈ (a, b) the integral
b
α
f (x)dx converges if there exists a constant M > 0 such that for all β ∈
(α, b)
β
f (x)dx ≤ M (28.18)
α
holds.
β
Proof. Since f ≥ 0 the function β → α
f (x)dx is monotone increasing and
it is bounded, hence the limit
β
lim f (x)dx
β→b α
and consequently
t
y
f (x)dx < and f (x)dx < . (28.20)
s z
401
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 422
A COURSE IN ANALYSIS
1
1 1
dx = ln 1 − ln = ln
x
and the limit → 0 does not exist.
The following criterion is useful:
Theorem 28.17. Let g : I → R, g(x) ≥ 0, and suppose that I g(x)dx
converges. If |f (x)| ≤ g(x) for all x ∈ I then I f (x)dx converges absolutely.
Proof. We use once again the Cauchy criterion. Since the integral I g(x)dx
exists, the Cauchy criterion holds, so we can replace in (28.19) the function
|f | by g. Now we need to observe that
t
t
t
f (x)dx ≤ |f (x)|dx ≤ g(x)dx
s s s
as well as
y y y
f (x)dx ≤ |f (x)|dx ≤ g(x)dx.
z z z
402
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 423
exist. Indeed, if P (sin x, cos x) = N k l
k,l=0 Ak,l sin (ak x) cos (bl x) and α > 1
then
∞
P (sin x, cos x)
dx, r > 0,
r xα
exists. We only need to observe that
N
|P (sin x, cos x)| ≤ |Ak,l |
k,l=0
∞
and that for α >1 the integral r x1α dx converges.
∞
B. The integral 0 sinx x dx converges. We use the Cauchy criterion to show
this. Let 0 < s < t, then integration by parts gives
t t
t
sin x − cos x cos x
x
dx =
x − x2
dx
s s s
which implies
t
t
sin x 1 1 dx
dx ≤ + +
x s t x2
s
s t
1 1 1 2
= + + − = .
s t x s s
Thus given > 0, choose s0 ≥ 2 to find that for t > s > s0 it follows
t
sin x 2
dx ≤ < .
x s
s
Remark 28.20. Of course we can also use the integral test, Theorem 18.21,
to test improper integrals for convergence.
We now want to introduce one of the most important functions in mathemat-
ics and for this we need some preparation. For x ∈ R consider y → cos xy.
It follows that
2 2
1 sin 2x − sin x
cos(xy)dy = sin xy = .
1 x 1 x
Hence we have defined a new function (at least) on R \ {0} by
2
sin 2x − sin x
x → cos(xy)dy = .
1 x
403
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 424
A COURSE IN ANALYSIS
exists.
exists and is finite. Further, using that lim tx+1 e−t = 0 implies that for some
t→∞
N ∈ N the condition t ≥ N yields
1
tx−1 e−t ≤ , (28.23)
t2
we find for R > N that
R
N
R
tx−1 e−t dt = tx−1 e−t dt + tx−1 e−t dt
1 1 N
N
R
1
≤ tx−1 e−t dt + 2
dt
1 N t
N
1 1
= tx−1 e−t dt + − ≤ C(N) < ∞.
1 N R
404
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 425
R
As before we observe that R → 1
tx−1 e−t dt is monotone, bounded and
continuous, hence
R
∞
x−1 −t
lim t e dt = tx−1 e−t dt
R→∞ 1 1
exists. Thus
1
R
x−1 −t
Γ(x) := lim t e dt + lim tx−1 e−t dt
→0 R→∞ 1
is well defined.
405
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 426
A COURSE IN ANALYSIS
Since
R
Γ(1) = lim e−t dt = lim (1 − e−R ) = 1, (28.26)
R→∞ 0 R→∞
or
∞
∞
−x2 1 1 1 1
e dx = t− 2 e−t dt = Γ( ).
0 2 0 2 2
Since
∞
∞
−x2 2
e dx = 2 e−x dx
−∞ 0
406
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 427
or
ln F (λx + (1 − λ)y) ≤ ln(F (x)λ F (y)1−λ),
implying
F (λx + (1 − λ)y) ≤ F (x)λ f (y)1−λ. (28.30)
Theorem 28.29. The Γ-function is logarithmic convex.
∞
Proof. First we note that Γ(x) = 0 tx−1 e−t dt > 0 for x > 0. Next, for
x, y ∈ (0, ∞) and 0 < λ < 1 we set p := λ1 and q := 1−λ
1
, i.e. p1 + 1q = 1.
x−1 t y−1 t
Define f (t) = t p e− p and g(t) = t q e− q . For > 0 and R > Hölder’s
inequality yields
R
R p1
R 1q
p q
f (t)g(t)dt ≤ f (t) dt g(t) dt ,
but
x y
f (t)g(t) = t p + q −1 e−t = tλx+(1−λ)y−1 e−t ,
f (t)p = tx−1 e−t ,
g(t)q = ty−1 e−t ,
i.e. we find
R
R λ
R 1−λ
λx+(1−λ)y−1 −t x−1 −t y−1 −t
t e dt ≤ t e dt t e dt .
Problems
1. a) Let a < b. Prove that the improper integral
b
dx
α
a (x − a)
407
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 428
A COURSE IN ANALYSIS
diverges.
d) Show that
∞
a
e−ax cos(wx)dx = .
0 a2 + w 2
408
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 429
Find now
− 1
g(x) g(x)
lim dx + dx .
→0 −1 x x
409
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 430
A COURSE IN ANALYSIS
410
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 431
N
SfN (x) := fn (x). (29.1)
n=0
Then the series, i.e. the sequence (SfN )N ∈N0 of partial sums, converges abso-
lutely and uniformly on K to a function F : K → R.
∞
Proof. First we prove that fn (x) converges pointwise, i.e. for every x ∈ K,
n=0
∞
to some function F : K → R. Since |fn (x)| ≤ fn ∞ the series fn (x)
n=0
converges absolutely by the comparison test, see Theorem 18.13. Therefore
we can define for x ∈ K
∞
F (x) := fn (x),
n=0
∞
fn ∞ < ε for N ≥ Ñ(ε).
n=N +1
411
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 432
A COURSE IN ANALYSIS
∞
Example 29.2. The series cos nx
n2
converges uniformly on R since with
n=1
cos nx
fn (x) = n2
we have
cos nx 1
fn ∞ = sup 2 = 2
x∈R n n
and ∞
1
< ∞.
n=1
n2
Now we return to power and Taylor series.
Definition 29.3. Let (cn )n∈N be a sequence of real numbers and a ∈ R. We
call ∞
a
T(cn ) (x) := cn (x − a)n , x ∈ R, (29.3)
n=0
the (formal) power series associated with (cn )n∈N and centre a.
Most important of course is the question for which x = a the formal power
series T(ca n ) (x) converges.
Theorem 29.4. Let (cn )n∈N0 be a sequence of real numbers and a ∈ R. If
T(ca n ) (x) converges for some x1 = a, then it converges for all x ∈ R such that
|x−a| ≤ < |x1 −a|, i.e. it converges for all x ∈ [a−, a+], 0 < < |x1 −a|.
Moreover, the convergence is absolute and uniform on [a − , a + ] and the
same holds for the series
∞
a
T(nc n)
(x) = ncn (x − a)n−1 . (29.4)
n=1
412
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 433
413
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 434
A COURSE IN ANALYSIS
∞
and the series ncn (x − a)n−1 converges uniformly in [a − , a + ].
n=1
1 (k)
ck = f (a). (29.7)
k!
Proof. A repeated application of Corollary 29.7 yields first the existence of
all derivatives and then
∞
f (k) (x) = n(n − 1) · . . . · (n − k + 1)cn (x − a)n−k
n=k
414
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 435
Having Corollary 29.8 in mind, we note that the power series allow us to
define arbitrarily often differentiable functions. This opens the road to even-
tually prove the existence of the exponential function exp : R → R.
Proof. We set
∞
xk
exp(x) := . (29.8)
k=0
k!
First we claim that this power series converges for all x ∈ R. Indeed for
x ∈ R fixed we find xk+1
1
(k+1)!
xk = |x| ,
k+1
k!
|x| k
and therefore, if k ≥ 2|x| it follows that k+1 ≤ 2(k+1) < 12 , and the ratio
∞ xk
test implies the convergence of k=0 k! . Now we deduce from Theorem 29.4
that this convergence is uniform on every compact interval. Consequently,
by Corollary 29.7 we find
∞ ∞ ∞
xk−1 xk−1 xk
exp (x) = k = = = exp(x).
k=1
k! k=1
(k − 1)! k=0 k!
and exp(0) = 1.
We will see later how we can use power series to solve certain differential
equations (some examples are given in the Problems).
Our aim is to discuss Taylor’s formula and the Taylor series. The starting
point is the fundamental theorem of calculus.
Let I = [a, b] be an interval and f : (a, b) → R be of the class C 2 , i.e.
f ∈ C 2 ((a, b)), and suppose that f , f and f have continuous extensions on
[a, b]. Since f is a primitive of the continuous function f and since by the
fundamental theorem
x
f (x) = f (c) + f (t)dt, c, x ∈ (a, b)
c
415
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 436
A COURSE IN ANALYSIS
Moreover with
x
(1)
Mf := ||f ||[a,b],∞ and Rf,c (x) = f (t)dt
c
we have
(1)
|Rf,c (x)| ≤ Mf |x − c|. (29.11)
Here is the interpretation of these results: If |x − c| is small we can ap-
proximate f (x) by f (c), and we might get a better approximation by f (c) +
f (c)(x − c) :
|f (x) − f (c)| ≤ Mf |x − c|,
and
(x − c)2
|f (x) − (f (c) + f (c)(x − c))| ≤ Mf .
2
Recall that for |x − c| < and < 1 it follows that |x − c|2 < |x − c|. The
main question is whether we can get an even better approximation when
increasing the order of derivatives and iterating the above process.
416
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 437
417
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 438
A COURSE IN ANALYSIS
Thus we have
(n) (n+1)
f (x) = Tf,c (x) + Rf,c (x) (29.15)
with
(n+1) |x − c|n+1
|Rf,c (x)| ≤ Mf (n+1) (29.16)
(n + 1)!
where Mf (n+1) = ||f (n+1) ||[a,b],∞ .
k
(k) 1 j
Texp,0 (x) = x; (29.17)
j=0
j!
k l−1 1
(k) j=0 ( 2 − j) l √
Tg,0 (x) = x , g(x) = 1 + x; (29.18)
l=0
l!
k
(k) (−1)j−1 xj
Th,0 (x) = , h(x) = ln(1 + x); (29.19)
j=1
j
k
(2k+1) x2j+1
Tsin,0 (x) = (−1)j ; (29.20)
j=0
(2j + 1)!
k
(2k) x2j
Tcos,0 (x) = (−1)j . (29.21)
j=0
(2j)!
418
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 439
Proof. Let us suppose that x0 < x, the other case goes analogously. By the
mean value theorem for integrals we find
(n+1) 1 x
Rf,x0 (x) = (x − t)n f (n+1) (t)dt
n! x0
x
(n+1) (x − t)n (x − x) )n+!
=f (ξ) dt = f (n+1) (ξ) ,
x0 n! (n + 1)!
(2n+3) cos ξ
Rsin,0 (x) = (−1)n+1 x2n+3 for ξ ∈ [0, 2π]
(2n + 3)!
and therefore
(2n+1) |x|2n+3
sin x − Tsin,0 (x) ≤
(2n + 3)!
419
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 440
A COURSE IN ANALYSIS
1
Therefore for n = 1 and |x| ≤ we have already
10
1 5 1
(3)
sin x − Tsin,0 (x) ≤ · .
10 120
Finally we consider the Taylor formula as n goes to infinity.
Definition 29.16. Let f : (a, b) → R be an arbitrarily often differentiable
function and x0 ∈ (a, b). We call
∞
f (k) (x0 )
Tf,x0 (x) := (x − x0 )k (29.23)
k=0
k!
We claim that f ∈ C ∞ (R) and f (n) (0) = 0 for all n ∈ N0 . This implies that
Tf (x) = 0 for all x ∈ R, in particular Tf (x) converges for all x ∈ R but for
x = 0 we have Tf (x) = f (x). To prove our claim we show the existence of
polynomials pn such that
- 1
(n) pn ( x1 )e− x2 , x = 0
f (x) = .
0, x=0
420
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 441
thus
pn+1 (t) := −pn (t)t2 + 2pn (t)t3 .
But for x = 0
f (n) (x) − f (n) (0)
f (n+1) (0) = lim
x→0 x
1 − x12
pn ( x )e
= lim
x→0 x
2
= lim rpn (r)e−r = 0.
r→∞
Note that if Tf,x0 (x) converges to f (x) for some x = x0 , then in the interval
[x0 − ρ, x0 + ρ], 0 < ρ < |x − x0 |, the convergence is uniform.
We will encounter Taylor series (and power series) later on when discussing
functions of several real variables and most of all when treating complex-
valued functions of a complex variable.
We have introduced the exponential function now as a convergent power
series and we may ask whether we can prove the functional equation for exp,
i.e.
exp(x + y) = exp(x) exp(y), (29.24)
without using the fact that exp satisfies the initial value problem u =
u, u(0) = 1, compare with Lemma 9.7. The right hand side of (29.24) is
the product of two power series and we first want to discuss products of
infinite series. ∞
Let (a
n )n∈N0 and (bn )n∈N0 be two sequences of real numbers and A := n=0 an ,
B := ∞ m=0 mb the corresponding series which we assume
to converge.
∞ The
aim is to find conditions under which we can represent ( ∞ a
n=0 n ) ( m=0 bm )
as a series converging to A · B.
For two partial sums we have
N M
an bm = an bm
n=0 m=0 n,m
421
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 442
A COURSE IN ANALYSIS
and give
∞
Definition 29.19. Let ∞ n=0 an and m=0 bm be two series of real numbers
and define cn by (29.25). The Cauchy product of these series is given by
∞ ∞
n
cn := an−k bk . (29.26)
n=0 n=0 k=0
Remark 29.20. Sofar the definition does not include a statement about
convergence. Thus ∞ n=0 cn stands for the sequence of partial sums
N n
n=0 ( k=0 an−k bk ).
Theorem 29.21. Let A := lim An , An := nk=0 ak , and B := lim Bm ,
∞ n→∞ ∞ m→∞
Bm := m l=0 bl . If k=0 ak converges absolutely and k=0 bk converges, then
their Cauchy product converges to A · B, i.e.
∞ N
n
A·B = cn = lim an−k bk . (29.27)
N →∞
n=0 n=0 k=0
∞ ∞
In the case where l=0 bl converges absolutely, then n=0 cn is also absolutely
convergent.
∞
By assumption k=0 ak = A, and hence we are done if we can prove that
n
lim an−k (Bk − B) = 0.
n→∞
k=0
422
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 443
Given > 0 we can find N() such that for k ≥ N() we have |Bk − B| < .
For n > N() it follows that
n
N n
an−k (Bk − B) ≤ |an−k ||Bk − B| + |an−k ||Bk − B|
k=0 k=0 k=N +1
N
n
≤ max |Bk − B| |an−k | + |an−k |
k≤N
k=0 k=N +1
N
∞
≤ max |Bk − B| |an−k | + |ak |.
k≤N
k=0 k=0
∞
For n → ∞ it follows that an → 0 since n=0 an converges. Therefore it
follows that for every fixed N
N
lim |an−k | = 0.
n→∞
k=0
Hence
n ∞
0 ≤ lim sup an−k (Bk − B) ≤ |ak |,
n→∞
k=0 k=0
implying the convergence of ∞ n=0 cn to A · B. Now suppose that both series
converge absolutely. Then we get
M M n M n
|cn | = an−k bk ≤ |an−k ||bk |
n=0 n=0 k=0 n=0 k=0
M M ∞ ∞
= |an | |bk | ≤ |an | |bk |
n=0 k=0 n=0 k=0
∞
implying the absolute convergence of n=0 cn .
We now apply this result to exp in order to prove its functional equation.
423
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 444
A COURSE IN ANALYSIS
xn
∞ yk
Proof. We know that exp(x) = ∞ n=0 n! and exp(y) = k=0 k! and both
series converge absolutely. Therefore, by Theorem 29.21 we find
∞
n
xn−k y k
exp(x) exp(y) = .
n=0
(n − k)! k!
k=0
Problems
4
1. For n ∈ N0 consider the functions gn (x) = (1+x
x
4 )n defined on R. Prove
that -
∞
1 + x4 , x = 0
gn (x) =
n=0
0, x = 0.
Why does this series not converge uniformly?
3. For α ∈ R define n
α α−k+1
:= .
n k=1
k
Prove that for α ∈ N this is a binomial coefficient. Let gα : (−1, 1) → R,
(k)
gα (x) = (1 + x)α . Show that gα (0) = k! αk and find the Taylor
(n)
polynomial Tgα ,0 (x).
424
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 445
∞
4. Suppose that ∞ k
k=0 ak x and ∞
k=0 bk xk converge absolutely
∞ and uni-
formly on [−c, c]. Prove that then k=0 (ak +bk )x and k=0 (λak )xk , λ ∈
k
∞ xk
5. Given that ex = k=0 k! find the Taylor series of sinh and cosh.
6. Find the Taylor series about 0 of
1 1+x
f (x) = ln , |x| < 1.
2 1−x
7. For l ∈ N) we define the Bessel function of order l by
∞ 2n+l
(−1)n x2
Jl (x) := . (29.28)
n=0
n!(n + l)!
8. Justify
∞
1
= (−1)l t2l , |t| < 1,
1 + t2
l=0
and by using the identity
x
1
arctan x = dt
0 1 + t2
find Tarctan,0 (x).
425
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 446
A COURSE IN ANALYSIS
426
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 447
427
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 448
A COURSE IN ANALYSIS
Example 30.4. The infinite product ∞ 1 1
k=2 1 − k 2 converges to 2 . Indeed
we have
N
1 1 1 1
PN = 1− 2 = 1− 2 1 − 2 · ...· 1 − 2
k 2 3 N
k=2
2 − 1 32 − 1
2
N2 − 1
= · . . . ·
22 32 N2
(2 − 1)(2 + 1) (3 − 1)(3 + 1) (N − 1)(N + 1)
= 2 2
· ...·
2 3 N2
1N +1
= .
2 N
The latter follows easily by induction: for N = 2 we have (2−1)(2+1)
22
= 34 .
Furthermore
(2 − 1)(2 + 1) (3 − 1)(3 + 1) (N − 1)(N + 1) N(N + 2)
2 2
·...·
2 3 N2 (N + 1)2
1 N + 1 N(N + 2) 1N +2
= 2
= .
2 N (N + 1) 2N +1
Thus for N → ∞ we find
∞
1 1 N +1 1
1 − 2 = lim PN = lim = .
k N →∞ N →∞ 2 N 2
k=2
Since for the convergence of ∞ k=1 ck it is necessary that lim ck = 1 we may
k→∞
introduce ak := ck − 1, i.e. ck = 1 + ak , and consider ∞ k=1 (1 + ak ). Clearly
we have now the necessary condition lim ak = 0 and ak = −1 is excluded.
k→∞
Suppose that ak > −1, i.e. ck > 0. Then we find
N
N
ln PN = ln (1 + ak ) = ln(1 + ak ),
k=1 k=1
or
N
PN = exp ln(1 + ak ) .
k=1
∞
Since exp is continuous
∞ the
∞ convergence of k=1 ln(1
+ ak ) will imply the
∞
convergence
of k=1 ck = k=1 (1+ak ). Conversely, if k=1(1+ak ) converges
then ∞ k=1 ln(1 + ak ) converges too.
Thus we have proved
428
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 449
Remark 30.6. As in the case of series we can sharpen Lemma 30.5 slightly
by assuming that ak > −1 for all k ≥ N0 . Note that if ak ≤ −1 for some
finite values of k, k ≤ N0 , then for these k the terms ln(1 + ak )are not
∞
defined.
∞ We find however the equivalence of the convergence of k>N0 ck
and k=N0 +1 ln(1
+ ak ) and the convergence of this series also implies the
convergence of ∞ k=1 ck .
∞
Proof. We assume first that k=1 ck
converges
to 0. The Cauchy crite-
c=
N
rion applied to the convergent sequence k=1 ck states: for every > 0
N ∈N
and η > 0 there exists N(η, ) such that n > m > N(η, ) implies
n m
ck − ck < η
k=1 k=1
or
n η
c k − 1 < m .
| k=1 ck |
k=m+1
m
m
Since lim ck = c = 0 it follows that for m ≥ N0 we have | k=1 ck | ≥
m→∞
k=1
|c| 2
2
= 0. Hence for n > m > max(N0 , N()) we have with η = |c|
n η
ck − 1 < m ≤ .
| k=1 ck |
k=m+1
429
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 450
A COURSE IN ANALYSIS
which yields
1 3
n
< ck < , (30.4)
2 k=m 2
and in particular cl = 0 for l > N1 . Now let N > N1 fixed. For every
0 < < 12 there exists by assumption N() > N such that n > m > N ()
implies
n n
k=N ck
m − 1 = c − 1
k
k=N ck k=m
2
= |cm+1 · cm+2 · . . . · cn − 1| < ,
3
or
n m m 2
ck − ck < ck · < ,
3
k=N k=m N =k
N
where we used (30.4). Thus k=1 kc is a Cauchy sequence in R and
N ∈N
therefore convergent.
∞ ∞
Definition 30.8.
∞ The product k=1 c k = k=1 (1 + ak ) is called absolutely
convergent if k=1 (1 + |ak |) converges.
Proposition 30.9. If ∞ k=1 (1 + ak ) converges absolutely then it converges.
Proof. We aim to apply the Cauchy criterion, and for this we note that for
a1 , . . . , an ∈ R the following holds:
430
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 451
Moreover,
2n
(−1)k 3 2 5 4 1 1
1+ = · · · · ...· 1+ =1+ →1
k=2
k 2 3 4 5 2n 2n
and
2n−1
(−1)k
3 2 5 4 2n − 1 2n − 2
1+ · · · · ...·
= = 1.
k=2
k 2 3 4 5 2n − 2 2n − 1
∞ 1
However we already know that 1 + does not converge, hence
∞ k=2 k
(−1)k
k=2 1 + k
does not converge absolutely.
431
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 452
A COURSE IN ANALYSIS
We start by considering
π
2
Am := sinm xdx (30.6)
0
and claim
m−1
Am = Am−2 , for m ≥ 2. (30.7)
m
π
Clearly we have A0 = 2
and A1 = 1. In order to prove (30.7) note that
π
π
2 2
sinm xdx = sinm−1 x sin xdx
0 0
π
π 2 d
= sinm−1 x(− cos x)|02 − (sinm−1 x)(− cos x)dx
0 dx
π
2
= (m − 1) sinm−2 x cos2 xdx
0
π
2
= (m − 1) sinm−2 x(1 − sin2 x)dx
0
π
π
2 2
m−2
= (m − 1) sin xdx − (m − 1) sinm xdx,
0 0
or
Am = (m − 1)Am−2 − (m − 1)Am
which implies (30.7). Using (30.7) we find
(2n − 1) (2n − 3) 3 1 π
A2n = · ...· · · (30.8)
2n (2n − 2) 4 2 2
and
2n (2n − 2) 4 2
A2n+1 = ·...· · . (30.9)
(2n + 1) (2n − 1) 5 3
For x ∈ [0, π2 ], i.e. 0 ≤ sin x ≤ 1 we have
implying that
A2m+2 ≤ A2m+1 ≤ A2m . (30.10)
432
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 453
Since
A2m+2 2m + 1
lim = lim =1
m→∞ A2m m→∞ 2m + 2
A2m+1 2m · 2m · . . . · 4 · 2 · 2 2
= · ,
A2m (2m + 1)(2m − 1) · . . . · 3 · 3 · 1 π
i.e.
A2m+1 2 2m · 2m · . . . · 4 · 2 · 2
1 = lim = lim
m→∞ A2m π m→∞ (2m + 1)(2m − 1) · . . . · 3 · 3 · 1
or
m
∞
π 4n2 4n2
= lim = .
2 m→∞ n=1 4n2 − 1 n=1 4n2 − 1
Thus we have proved
For this we will study the Γ-function a bit more closely. We know that
on (0, ∞) the Γ-function is logarithmic convex, i.e. for 0 < λ < 1 and
x, y ∈ (0, ∞) we have
433
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 454
A COURSE IN ANALYSIS
or with (30.14)
n!(x + n)x−1
an (x) := ≤ Γ(x)
x(x + 1) · . . . · (x + n − 1)
(n − 1)!nx
≤ =: bn (x).
x(x + 1) · . . . · (x + n − 1)
Note that
bn (x) (n − 1)!nx (n − 1)!(n + x)nx (n + x)nx
= = =
an (x) n!(n + x)x−1 n!(n + x)x n(n + x)
which implies
bn (x) an (x)
lim = lim = 1.
n→∞ an (x) n→∞ bn (x)
Thus
an (x) Γ(x)
≤ ≤1
bn (x) bn (x)
Γ(x)
and for n → ∞ we find for 0 < x < 1 that ≤ 1 or
lim bn (x)
n→∞
(n − 1)!nx
Γ(x) = lim . (30.17)
n→∞ x(x + 1) · . . . · (x + n − 1)
Hence we have proved a product representation for Γ(x) provided that 0 <
x < 1. More generally we have
434
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 455
holds.
n
Proof. Since lim = 1 we deduce (30.18) for 0 < x < 1 from (30.17),
n→∞ x + n
and for x = 1 we find
n!n n!n n
= =
1(1 + 1) · . . . · (1 + n) (n + 1)! n+1
which yields
n
Γ(1) = 1 = lim
n→∞ n + 1
i.e. (30.18) holds for 0 < x ≤ 1. Now, if (30.18) holds for x ∈ (0, ∞) then it
holds also for y := x + 1 since
n!nx
Γ(y) = Γ(x + 1) = xΓ(x) = lim
n→∞ (x + 1) · . . . · (x + n)
n!ny−1
= lim
n→∞ y(y + 1) · . . . · (y + n − 1)
n!ny
= lim ,
n→∞ y(y + 1) · . . . · (y + n − 1)(y + n)
n
where we used lim = 1, and the theorem is proved.
n→∞ y + n
or equivalently
∞
−x2 1 1
e dx = Γ .
0 2 2
We note further that by (30.18) we find
√
1 n! n
Γ = lim 1
2 n→∞ (1 + 1 )(2 + 1 ) · . . . · (n + 1 )
2 2 2 2
435
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 456
A COURSE IN ANALYSIS
and √
1 n! n
Γ = lim
2 n→∞ (1 − 1 )(2 − 1 ) · . . . · (n − 1 )(n + 1 )
2 2 2 2
implying by Theorem 30.12 that
2
1 2n (n!)2
Γ = lim
2 n→∞ n + 1 (1 − 1 )(4 − 1 ) · . . . · (n2 − 1 )
2 4 4 4
n 2
n n
= 2 lim lim = π,
n→∞ n + 1 n→∞ k 2− 1
2 k=1 4
1 √
or Γ 2
= π which yields
or
∞
N
u(x) = (1 + vk (x)) = lim (1 + vk (x)). (30.22)
N →∞
k=1 k=1
N N
Thus u is the pointwise limit of k=1 uk = k=1 (1 + vk ) . We
N ∈N N ∈N
say that u has the product representation or product expansion (30.21)
or (30.22). Clearly, in order to check pointwise convergence of ∞
k=1 (1+vk (x))
we can use our previous criteria.
436
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 457
∞ x2
Example 30.15. The product k=1 1 − k2
converges for every x ∈ R
2 2
such that x = m , m ∈ N. Indeed we have ∞ absolute convergence since with
2 ∞
ak = − xk2 we have that k=1 |ak | = x2 1
k=1 k 2 converges. But now we can
∞ x2
extend k=1 1 − k2 to all x ∈ R by defining it to be 0 for x2 = m2 , m ∈ N.
x2
Thus ∞ k=1 1 − k 2 defines a function on R. Now it is even easier to see
∞ 2
that k=1 1 + xk2 converges for all x ∈ R.
Definition 30.16.
We call ∞ k=1 (1 + vk ) uniformly convergent to u if
N
the sequence k=1 (1 + vk ) converges on I uniformly to u.
N ∈N
The Cauchy criterion extends to uniform convergence in the usual way, i.e.
we have
Theorem 30.17. The product ∞ k=1 (1 + vk ) converges on I uniformly if for
every > 0 there exists N() ∈ N such that n ≥ N() and m ∈ N implies
that for all x ∈ I
n+m
(1 + v k (x)) − 1 < . (30.23)
k=n+1
Theorem 30.18. Suppose that the series ∞ k=1 |vk |
converges uniformly on
I and that all functions vk are continuous. Then ∞ k=1 (1 + vk ) converges
uniformly to a continuous function.
∞
∞ We know that the convergence of k=1 |vk (x)| implies the convergence
Proof.
of k=1(1 + vk (x)), see Propositions 30.10 and 30.9. Thus we can define a
function on I by
∞
u(x) := (1 + vk (x)).
k=1
Furthermore, there exists M ∈ N such that for all x ∈ I and all l ∈ N the
following holds
|vm+1 (x)| + |vm+2 (x)| + · · · + |vm+k (x)| < 1, (30.24)
∞
which is again a direct consequence of the uniform convergence of k=1 |vk |.
Consider now the product
∞
wM (x) := (1 + vk (x)) (30.25)
k=M +1
437
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 458
A COURSE IN ANALYSIS
n
with pn,M (x) := k=M +1 (1 + vk (x)), n ≥ M + 1, we find with pM,M (x) = 0
∞
wM (x) = (pk,M (x) − pk−1,M (x))
k=M +1
∞
= pk,M (x)vk+1 (x).
k=M +1
We claim that the series ∞k=M +1 pk,M (x)vk+1 (x) is uniformly convergent on
I. Indeed, for k ≥ M + 1 we have
where we used (30.24) and the fact that 1 + y ≤ ey for y ≥ 1. Thus we find
∞
∞
|pk,M (x)vk+1 (x)| ≤ |vk+1 (x)|
k=M +1 k=M +1
∞
and the uniform convergence of k=1 |vk (x)| togetherwith the Weierstrass
M-test yields the uniform and absolute convergence of ∞ k=M +1 pk,M (x)vk (x),
hence wM is a continuous function and the convergence in (30.25) is uniform.
Multiplying wM (x) by M k=1 (1 + vk (x)) does not change the continuity nor
the uniform convergence and the result follows.
Finally we want to establish a result for the derivative of an infinite product
of differentiable functions.
Theorem
∞ 30.19. Suppose∞ that vk : I → R, k ∈ N, is differentiable and that
∞ k=1 |vk | and k=1 |vk | converge uniformly. Then the product u(x) :=
both
k=1 (1 + vk (x)) is differentiable and we have
∞
u (x) ( ∞ (1 + vk (x))) vk (x)
= k=1
∞ = . (30.26)
u(x) k=1 (1 + vk (x)) k=1
1 + v k (x)
d (g1 · . . . · gM )
ln(g1 · . . . · gM ) =
dx g1 · . . . · gM
438
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 459
and
M M
d d gj
ln(g1 · . . . · gM ) = ln gj = ,
dx dx j=1 g
j=1 j
i.e.
M
(g1 · . . . · gM ) gj
= (30.27)
g1 · . . . · gM g
j=1 j
provided gj = 0 and gj > 0. (The case g < 0 and some gj <0 can also
be treated by looking at |g|.) Now, the uniform convergence of ∞
k=1 |vk (x)|
implies the existence of M ∈ N such that for all x ∈ I we have
∞
1
|vk (x)| < ,
k=M +1
2
1
in particular we have |vk (x)|
∞< 2 for k ≥ M + 1. This implies by Proposition
30.10 the convergence
of k=M +1(1 + vk (x)) and hence by Lemma 30.5 the
convergence of ∞ k=M +1 ln(1 + vk (x)). Since
|vk (x)| < 12 for k ≥ M + 1 it
follows that for these k we have 1+v1k (x) < 2 and consequently, since
∞
∞
d vk (x)
(ln(1 + vk (x))) = ,
dx 1 + vk (x)
k=M +1 k=M +1
the uniform convergence of ∞
k=1 |vk | implies the absolute and uniform con-
∞ v
vergence of k=|M +1 1+vk k . By Theorem 26.19 we now have
∞ ∞
d d vk
ln(1 + vk ) = .
dx dx 1 + vk
k=M +1 k=M +1
From
N
N
(1 + vk ) = exp ln(1 + vk )
k=M +1 k=M +1
we derive N
d N
dx k=M +1 (1 + vk ) vk
N =
k=M +1 (1 + vk ) k=M +1
1 + vk
439
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 460
A COURSE IN ANALYSIS
Example 30.20. For x ∈ (−1, 1) we know that F (x) = x ∞ k=1 1 −
x2
k2
∞ x2
converges uniformly, since k=1 k2 ≤ |x|2 ∞ 1
k=1 k 2 . Further we have
∞ 2
∞
x 1
= 2|x|
k2 k2
k=1 k=1
Problems
1. Prove
∞ k 3 −1
a) k=2 k 3 +1 = 23 ;
∞ 1
b) l=1 1+ l(l+2)
= 2.
2. Let ak ≥ 0, ak = 1 for k ∈ N. Show: the convergence of ∞ k=1 (1 − ak )
is equivalent to the convergence of ∞ a
k=1 k .
∞
3. Suppose that k=1 ak converges.
∞ 2
a) Prove that ∞ k=1 (1 + ak ) converges if and only if k=1 ak con-
verges.
∞
b) Prove that if ∞ 2
k=1 ak diverges then k=1 (1 + ak ) diverges.
440
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 461
4. Prove that if the infinite product ∞ k=1 (1+ak ) converges absolutely then
we can rearrange its factors and the corresponding product converges
to the same limit.
5. a) For |x| < 1 prove that ∞ 2k 1
k=1 (1 + x ) = 1−x .
441
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 463
Proof. From the functional equation and the normalisation we deduce for
n ∈ N the fact that G(n + 1) = n!. Now, for 0 < x ≤ 1 we find
or
i.e.
G(n + x) ≤ n!n(x−1) . (31.1)
Moreover,
n + 1 = x(n + x) + (1 − x)(n + x + 1),
i.e. n + 1 is a convex combination of n + x and n + x + 1, and this gives
443
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 464
A COURSE IN ANALYSIS
thus we have
n! = G(n + 1) ≤ G(n + x)(n + x)(1−x) . (31.2)
Combining (31.1) and (31.2) we find
Observing that
n!nx n!nx
≤ ≤ ,
x(x + 1) · . . . · (x + n − 1)n x(x + 1) · . . . · (x + n)
and by Theorem 30.13 we get for n → ∞
i.e. G(x) = Γ(x) for 0 < x < 1. But now G(x + 1) = xG(x) and Γ(x + 1) =
xΓ(x) imply G(x) = Γ(x) for all x > 0.
In the proof of the above result, Theorem 30.13 was quite important. Using
this result we can also give a representation of the Γ-function as an infinite
product involving the exponential function. Recall that the existence of the
Euler constant N
1
γ := lim − ln N (31.5)
N →∞
k=1
k
was proved in Theorem 18.24. Denote by Γn the function
nx n!
Γn (x) := , (31.6)
x(x + 1) · . . . · (x + n)
444
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 465
and
1 1 x x x
ex(ln n−1− 2 −...− n ) = nx e− 1 e− 2 · . . . · e− n ,
which implies
x x
1 1 e1 en
x(ln n−1− 12 −...− n )
Γn (x) = e · · . . . · . (31.10)
x 1 + x1 1 + nx
Theorem 31.2. For x > 0 the Γ-function has the Weierstrass product
representation
∞
e−γx e k
x
Γ(x) = . (31.11)
x k=1 1 + xk
445
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 466
A COURSE IN ANALYSIS
Proof. Since for x > 0 we have Γ(x) > 0 it follows that Γ is differentiable if
and only if ln Γ is differentiable. From (31.11) we derive
∞
x x
ln Γ(x) = −γx − ln x + − ln 1 + , (31.13)
k=1
k k
∞ x
and we know that the series k=1 k
− ln 1 + xk converges pointwise. Fur-
ther we note that
∞ ∞
x 1
∞
x 1 x
− ln 1 + = − = ,
k=1
k k k=1
k k+x k=1
k(k + x)
and this series converges uniformly on compact intervals in (0, ∞). Conse-
quently we have
∞ ∞
Γ (x) 1 x 1 1 !
(ln Γ(x)) = = −γ − + = −γ − + − ,
Γ(x) x k(k + x) x k k+x
k=1 k=1
(31.14)
which now yields also that Γ is arbitrarily often differentiable on (0, ∞).
Indeed we find for l ≥ 2
∞
dl−1 Γ (x) (−1)l (l − 1)!
= . (31.15)
dxl−1 Γ(x) k=0
(x + k)l
446
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 467
where
∞
1 1 1
ϑ(x) = t
− + e−xt dt. (31.22)
0 e −1 t 2
A proof of Theorem 31.5 is given in R. Beals and R. Wong [1]. Here we give
a proof of the Stirling formula for the factorial, or equivalently for Γ(n + 1).
As preparation we prove
Lemma 31.6. For k ∈ N we find ξk ∈ [k, k + 1] such that
k+1
1 1
ln xdx = (ln k + ln(k + 1)) + . (31.23)
k 2 12ξk2
447
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 468
A COURSE IN ANALYSIS
where
n−1
1 1
ηn = 1 − . (31.25)
12 k=1 ξk2
n
ln k
But e k=1 = n!, and therefore
448
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 469
exists, hence
c = lim cn = eη .
n→∞
We want to find c. We must have
c2n
c = lim ,
n→∞ c2n
which gives
N
12
4k 2 √ 2 · 4 · . . . · 2N
2 = 2 √
4k 2 − 1 3 · 5 · . . . · (2N − 1) · 2N + 1
k=1
1 22 · 42 · . . . · (2N)2
=
N + 12 2 · 3 · 4 · 5 · . . . · (2N − 1)(2N)
1 22N (N!)2
= ,
N + 12 (2N)!
which yields
22N (N!)2 √
c = lim √ = 2π, (31.28)
N →∞ N(2N)!
implying
n!
lim √ 1 = 1.
n→∞ 2πn(n+ 2 ) e−n
Thus we have proved
449
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 470
A COURSE IN ANALYSIS
n!
lim √ = 1. (31.29)
2πn(n+ 2 ) e−n
1
n→∞
450
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 471
1
Proof. For 0 < , η < 2
we find using integration by parts
1−η x
x+y−1 t
(1 − t) dt
1−t
x+y x 1−η 1−η x t x−1 1
= − (1−t)
x+y
t
1−t
+ x+y
(1 − t)x+y 1−t (1−t)2
dt
y x −η y (1−η)x 1−η x−1
= (1−) x+y x
+ x+y
t (1 − t)y−1 dt.
f (x + 1) = B(x + 1, y)Γ(x + 1 + y)
x
= B(x, y)(x + y)Γ(x + y)
x+y
= xB(x, y)Γ(x + y) = xf (x),
thus f satisfies the functional equation of the Γ-function. Further, for y > 0
fixed the function x → Γ(x + y) and x → B(x, y) are logarithmic convex.
For the Γ-function this is trivial, in the case of the beta-function we only
need to note that x → tx−1 (1 − t)y−1 is logarithmic convex and hence the
integral defining B(x, y) is a pointwise limit of logarithmic convex functions.
By Problem 12 c) in Chapter 28 it follows that x → B(x, y) is logarithmic
convex. Finally, Problem 12 a) in Chapter 28 shows that x → B(x, y)Γ(x+y)
is logarithmic convex. Since both results hold also for g(x) := ff (x)(1)
and
451
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 472
A COURSE IN ANALYSIS
f (x)
g(1) = 1 we deduce by Theorem 31.1 that the function f (1)
is the Γ-function,
i.e. we have
f (1)Γ(x) = B(x, y)Γ(x + y).
In order to find f (1) we note that
1
1
B(1, y) = (1 − t)y−1 dt =
0 y
and therefore
1
f (1) = B(1, y)Γ(1 + y) = yΓ(y) = Γ(y)
y
and it follows that
Γ(x)Γ(y)
B(x, y) = .
Γ(x + y)
where we used that t → t(1 − t)x−1 is symmetric with respect to the axis
t0 = 12 . Using the substitution s = 4t(1 − t) we obtain
1
1
B(x, x) = 2 sx−1 (1 − s)− 2 · 2−2x ds
0
1
1 1
=2 1−2x
sx−1 (1 − s)− 2 ds = 21−2x B(x, ).
0 2
Now we apply (31.34) to find
1
Γ(x)2 1−2x Γ(x)Γ 2
= B(x, x) = 2
Γ(2x) Γ x + 12
√
or using Γ 12 = π we arrive at the Legrendre duplication formula for
the Γ-function:
452
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 473
Γ(x + n)
Γ(x) := .
(x + n − 1)(x + n − 2) · . . . · (x + 1)x
453
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 474
A COURSE IN ANALYSIS
and we find
x x + 1 x x πx
x+1
1−x
πx
ϕ ϕ =Γ Γ 1− sin Γ Γ cos
2 2 2 2 2 2 2 2
c20 c20
= Γ(x)Γ(1 − x) sin πx = ϕ(x),
4 4
or x x + 1 c2 π
ϕ ϕ = 0 ϕ(x) = ϕ(x). (31.40)
2 2 4 16
For x ∈ R \ Z the function ϕ is arbitrarily often differentiable since the sine
and the Γ-functions are. The functional equation of the Γ-function yields
Γ(1 + x)
ϕ(x) = Γ(1 − x) sin πx
x
sin πx
= Γ(1 + x)Γ(1 − x)
x
∞
π 2k+1 x2k
= Γ(1 + x)Γ(1 − x) (−1)k ,
k=0
(2k + 1)!
and the series on the right hand side converges for all x ∈ R. Moreover,
as x → 0 the right hand side tends to π and is indeed an arbitrarily often
differentiable function. Thus the function
-
ϕ(x), x ∈ R \ Z
ϕ̃(x) :=
π, x∈Z
454
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 475
which yields
1 x 1 x+1
g + g = g(x). (31.41)
4 2 4 2
On [0, 1] the function g is continuous, hence bounded, say |g(x)| ≤ M on
[0, 1], which implies by (31.41)
1 x 1 x + 1 M
|g(x)| ≤ g + g ≤ 2 (31.42)
4 2 4 2
M
|g(x)| ≤ , (31.43)
2N
which due to the periodicity of g extends to all x ∈ R, thus we must have
d2
g(x) = 0 for all x ∈ R. Hence dx 2 ln ϕ̃(x) must be a linear function and
455
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 476
A COURSE IN ANALYSIS
Remark 31.15. From our derivation we can only conclude that (31.45) holds
∞ x2
for x ∈ R \ Z. But for x ∈ Z, one term in k=1 1 − k2 vanishes as does
sin πx, hence we can extend (31.45) to R.
When turning to complex-valued functions of a complex variable and intro-
ducing meromorphic functions we will return to the Γ-function and related
functions. In fact many of the formulae proved here will show their full power
in the complex setting.
Problems
1. Show that √
1 (2n)! π
Γ n+ = , n ∈ N.
2 4n n!
456
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 477
7. Find
∞
x5
dx.
0 (1 + x)7
8. Prove the following product representation of the Beta-function:
∞
x+y 1 + x+y
n
B(x, y) = .
xy n=1 1 + nx 1 + ny
457
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 479
Definition 32.1. A set A ⊂ R is called a null set if for every > 0 there
exists a denumerable number of bounded intervals In with end points an < bn
459
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 480
A COURSE IN ANALYSIS
such that ∞
A⊂ In and (bn − an ) ≤ . (32.1)
n∈N n=1
460
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 481
461
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 482
A COURSE IN ANALYSIS
1
From C0 we take away the open interval ,2
3 3
to obtain
$ % $ %
1 2 0 1 2 3
C1 := [0, 1] \ , = , ∪ , . (32.9)
3 3 3 3 3 3
In the next step we take away from 03 , 13 and 23 , 33 the open “middle inter-
val” of length 19 , i.e.
$ % $ %
0 1 1 2 2 3 7 8
C2 : = , \ , ∪ , \ , (32.10)
3 3 9 9 3 3 9 9
$ % $ % $ % $ %
0 1 2 3 6 7 8 9
= , ∪ , ∪ , ∪ , .
9 9 9 9 9 9 9 9
C := CN . (32.11)
N =0
So what can we say about C? First, since each set CN is closed by Lemma
19.7 it follows that C is closed too. Moreover, since C ⊂ [0, 1], the Cantor set
is bounded, hence by the Heine-Borel theorem, Theorem 20.26, it is compact.
Further, in the N th step we get from CN to CN +1 by removing 2N open
1
intervals of length 3N+1 . The total length of the removed intervals add up to
∞ ∞ N
2N 1 2 1 1
N +1
= = 2 = 1. (32.12)
N =0
3 3 N =0 3 31− 3
462
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 483
and
y = 0.00 . . . 00222 . . . ( first 2 is in position k + 1). (32.17)
Using this identification, in C1 we only find elements with first digit in the
ternary representation being either 0 or 2. In C2 we only find elements
belonging to C1 and with the second digit being either 0 or 2, and in CN
we only have elements from CN −1 with N th digit either 0 or 2. Thus x ∈ C
implies
∞
x= an 3−n , an ∈ {0, 2}. (32.18)
n=1
This result tells us that sets being large when judged by their cardinality
still can be small with respect to “length” or measure. Having these con-
siderations in mind we return to monotone functions. In the following we
consider monotone functions f defined on a compact interval [a, b] which are
bounded. If f is monotone decreasing then −f is monotone increasing and
hence when investigating the “smoothness” or “regularity” of a monotone
function we can confine ourselves to increasing functions. Let f : [a, b] → R
be a bounded increasing function. Since f is real-valued and increasing we
have of course f (a) ≤ f (x) ≤ f (b) < ∞, i.e. f is bounded, however some-
times we prefer to emphasise in this chapter the boundedness of f . From
Problem 6 in Chapter 20 we know that for x0 ∈ (a, b)
463
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 484
A COURSE IN ANALYSIS
and
f (x0 −) := x→x
lim f (x) = sup{f (x)|a ≤ x < x0 } (32.20)
0
x<x0
We call
[f ](x0 ) := f (x0 +) − f (x0 −) ≥ 0 (32.22)
the jump of f at x0 . In part b) of Problem 6 in Chapter 20 we have proved
that f can only have finitely many jumps larger than a given η > 0. Indeed,
since f is bounded, there exists n0 ∈ N such that
n0 η ≥ f (b) − f (a),
implying that an upper bound for the number of jumps of size larger than η
is the largest n ∈ N such that nη ≤ f (b) − f (a). This implies also, again see
Problem 6 in Chapter 20, that f can have at most countable many jumps,
i.e. outside a countable set f is continuous.
Suppose that f has countable jumps occurring at "xj , j ∈ N, a <#x1 < · · · <
xj−1 < xj , xj < b. For N ∈ N denote by SN := xN N
1 , . . . , xk(N ) the finite
subset of {xj |j ∈ N} corresponding to jumps of size larger than N1 . Clearly we
have SN ⊂ SN +1 and ∪N ∈N SN = {xj |j ∈ N}. For SN inequality (32.23) holds
464
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 485
k(N ) N
and the sequence j=1 [f ](xj ) is increasing. Since this sequence is
N ∈N
also bounded it converges and in the limit we obtain
∞
(f (a+) − f (a)) + [f ](xk ) + (f (b) − f (b−)) ≤ f (b) − f (a). (32.24)
k=1
Of interest is now
Theorem 32.7. Let f : [a, b] → R be a bounded increasing function and sj
its jump function. The function ϕf : [a, b] → R defined by
ϕf := f (x) − sf (x) (32.26)
is increasing and continuous.
Proof. Let a ≤ x < y ≤ b. We apply (32.24) to the interval [x, y] and obtain
sf (y) − sf (x) = f (y) − f (x), (32.27)
which implies ϕf (x) − ϕf (y) ≥ 0, i.e. ϕf is increasing. Further, passing in
(32.27) to the limit y → x we find
sf (x+) − sf (x) ≤ f (x+) − f (x),
but the definition of ϕf implies
f (x+) − f (x) ≤ sf (y) − sf (x),
which gives for y → x
f (x+) − f (x) ≤ sf (x+) − sf (x),
or f (x+) − f (x) = sf (x+) − sf (x), i.e. ϕf (x+) = ϕf (x). Analogously we
may prove ϕf (x−) = ϕ(x).
465
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 486
A COURSE IN ANALYSIS
The jump function s f is the pointwise and monotone limit of the sequence
(SN )N ∈N , SN (x) = xN [f ](xj ), which is an increasing step function on
j <x
[a, b]. Thus every monotone increasing function is the sum of continuous
increasing functions and a monotone limit of step functions.
Let f : [a, b] → R be a bounded function and a = x0 < x1 < · · · <
xn−1 < xn = b be a finite partition Z of [a, b] for which we write as be-
fore Z(x0 , . . . , xn ). We can now form
n−1
VZ (f ) := |f (xk+1 ) − f (xk )| . (32.28)
k=0
we denote the total variation of f , where the supremum is taken over all
(finite) partitions of [a, b].
B. We call f a function of bounded variation if V (f ) < ∞. The set of
all functions of bounded variation on [a, b] is denoted by BV ([a, b]).
Vab (f ) := V (f ), f : [a, b] → R.
which implies
|f (x)| ≤ |f (a)| + V (f ).
466
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 487
which yields
V (f ) ≤ κ(b − a).
467
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 488
A COURSE IN ANALYSIS
Theorem 32.14. The set BV ([a, b]) with the natural pointwise operations
forms an algebra. In particular for f, g ∈ BV ([a, b]) and λ ∈ R we have
f + g, λf, f · g ∈ BV ([a, b]).
Proof. Clearly we need only to prove that f + g and f · g belong to BV ([a, b])
if f, g ∈ BV ([a, b]). For this let Z(x0 , . . . , xn ) be a partition of [a, b]. Since
n
|f (xk ) + g(xk ) − f (xk−1 ) − g(xk−1 )|
k=1
n
n
≤ |f (xk ) − f (xk−1)| + |g(xk ) − g(xk−1 )|
k=1 k=1
VZ (f + g) ≤ VZ (f ) + Vz (g),
and then by taking the supremum over all partitions of [a, b] we get
V (f + g) ≤ V (f ) + V (g).
Furthermore we have
implying
V (f · g) ≤ ||g||∞V (f ) + ||f ||∞V (g).
Theorem 32.15. For f ∈ BV ([a, b]) there exists two monotone increasing
functions g, h : [a, b] → R such that f = g − h.
468
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 489
implying
−h(y) = f (y) − vf (y) ≤ f (x) − vf (x) = −h(x),
i.e. −h is decreasing as claimed.
exist for every x0 ∈ (a, b), so do the limits f (a+) and f (b−).
We have now the following situation: BV ([a, b]) is a vector space, in fact
an algebra, and every element in BV ([a, b]) is Riemann integrable. However
certain results that we have considered for continuous, integrable functions do
not hold, for example the fundamental theorem, or rules such as integration
by parts, since for this result we need differentiability. A natural question is
to which extent can we “rescue” these results, i.e. can we find an extension
of our theory of integration which will allow us to prove these results perhaps
with some generalised interpretation? It turns out that we can achieve this
however we will need the Lebesgue measure and we will take up this problem
in Volume 3.
We know that BV ([a, b])∪C([a, b]) is a subset of all Riemann integrable func-
tions. The following result gives a characterisation of a Riemann integrable
function a proof of which we will give in Volume 3.
is a null set.
469
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 490
A COURSE IN ANALYSIS
Problems
1. Define f : [0, 1] → R by f (0) = 0 and f (x) = x cos πx for 0 < x ≤ 1
and prove that f is continuous but not of bounded variation. Hint:
1 1
consider the partition 0 < 2k < 2k−1 < · · · < 13 < 12 < 1.
3. Suppose that g ∈ BV ([a, b]) and inf |g| > 0. Prove that 1g ∈ BV ([a, b]).
x
4. Let f ∈ C([a, b]) and F (x) := a f (t)dt, x ∈ [a, b]. Show that F ∈
b
BV ([a, b]) and V (F ) = a |f (t)|dt.
470
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 491
Appendices
471
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 493
p ¬p
T F
F T
Table A.I.1
473
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 494
A COURSE IN ANALYSIS
Our main task is to decide, i.e. to define, when these new statements are
true and when they are false. Here are the truth tables for conjunction,
disjunction and implication. Conjunction:
p q p∧q
T T T
T F F
F T F
F F F
Table A.I.2
Thus the conjunction p ∧ q is true if and only if both p and q are true.
Disjunction:
p q p∨q
T T T
T F T
F T T
F F F
Table A.I.3
474
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 495
Implication:
p q p =⇒ q
T T T
T F F
F T T
F F T
Table A.I.4
p q p ⇐⇒ q
T T T
T F F
F T F
F F T
Table A.I.5
Thus p is equivalent to q if both are true or both are false, but this is not
really what we mean when saying that p is equivalent to q. What we really
mean is the following
(p =⇒ q) ∧ (q =⇒ p), (A.I.1)
475
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 496
A COURSE IN ANALYSIS
i.e. p implies q and q implies p. Taking (A.I.1) as the definition for equiva-
lence, then we may introduce a new notation, namely
Note that the truth tables Tables A.I.1 and Tables A.I.3 imply
(p ∧ q) ⇐⇒ (q ∧ p) (A.I.3)
and
(p ∨ q) ⇐⇒ (q ∨ p). (A.I.4)
We want to study some of these compound statements in more detail. We
start with the negation of negation, i.e. ¬(¬p) with truth table
p ¬p ¬(¬p)
T F T
F T F
Table A.I.6
p ¬p p ∧ (¬p)
T F F
F T F
Table A.I.7
p ¬p p ∨ (¬p)
T F T
F T T
Table A.I.8
476
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 497
Thus the statement p ∨ (¬p) is always true, i.e. given any statement, either
p or ¬p is true, there is no other possibility. This fact is called the law
of the excluded middle or tertium non datur . But note: the law of the
excluded middle depends on the fact that any statement is only allowed to
be true or false. As soon as we allow a third option we cannot prove the law
of the excluded middle. When we take the negation of p ∨ (¬p) we get the
statement ¬(p ∨ (¬p)) which is always false. Therefore having Table A.I.7 in
mind we find
(¬(p ∨ (¬p))) ⇐⇒ (p ∧ (¬p)). (A.I.5)
We clearly apply the fact that two compound statements are equivalent if
they have identical truth tables.
Of interest are the two laws dealing with negation of conjunction and dis-
junctions. They are called de Morgan’s laws and they state
and
(¬(p ∨ q)) ⇐⇒ ((¬p) ∧ (¬q)). (A.I.7)
Note that (A.I.5) follows from (A.I.7) with ¬p instead of q. A further im-
portant conclusion we can make from the negation of the implication is
Thus instead of proving that p does not imply q, we may prove that p and
¬q are true. Since (A.I.8) implies
(p =⇒ q) ⇐⇒ ((¬p) ∨ q) (A.I.9)
i.e. instead of proving p implies q we may prove that ¬q implies ¬p. The
equivalence (A.I.10) is known as contra-position and a proof using ¬q =⇒
¬p instead of p =⇒ q is called a proof by contra-position. Combining (A.I.8)
with the law of the excluded middle we obtain a very powerful method for
proving statements: reductio ad absurdum or proof by contradiction. Here
is the method:
477
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 498
A COURSE IN ANALYSIS
i.e. if p implies q and q implies r, then p must imply r. In fact, most if not
all proofs rely on a finite number of applications of (A.I.11).
The following considerations are more involved and often cause some prob-
lems to begin with. We must learn to work with statements which include
quantifiers. To explain this in more detail we need to consider some set
theory.
Let X be a non-empty set. Often we need to consider for each x ∈ X a
statement p which depends on x. For this we write p(x), for example if
X = N the statement could be:
478
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 499
∀x ∈ X : p(x), (A.I.14)
∀x ∈ R ∃n ∈ N : n ≥ x.
Another example is
∃M ∈ R ∀x ∈ R : | sin x| ≤ M.
and
¬(∃x ∈ X : p(x)) ⇐⇒ (∀x ∈ X : ¬p(x)). (A.I.19)
Thus the negation of (A.I.17) is
⇐⇒ ∀M ∈ R : ¬(∀x ∈ R : | sin x| ≤ M)
⇐⇒ ∀M ∈ R (∃x ∈ R : | sin x| > M),
and since (A.I.17) is true, just take M = 1, the last statement is of course
false.
479
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 500
A COURSE IN ANALYSIS
480
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 501
X \ X = ∅ and X \ ∅ = X; (A.II.1)
X ∪ X = X and X ∩ X = X; (A.II.2)
X ∪ Y = Y ∪ X and X ∩ Y = Y ∩ X; (A.II.3)
The statements X ⊂ Y, X ∪ Y = Y, X ∩ Y = X are equivalent; (A.II.4)
The statements X ⊂ X ∪ Y and X ∩ Y ⊂ X are equivalent; (A.II.5)
X ⊂ Z and Y ⊂ Z if and only if X ∪ Y ⊂ Z; (A.II.6)
Z ⊂ X and Z ⊂ Y if and only if Z ⊂ X ∩ Y ; (A.II.7)
X ∪ (Y ∪ Z) = (X ∪ Y ) ∪ Z, i.e. X ∪ Y ∪ Z makes sense; (A.II.8)
X ∩ (Y ∩ Z) = (X ∩ Y ) ∩ Z, i.e. X ∩ Y ∩ Z makes sense; (A.II.9)
X ∪ (Y ∩ Z) = (X ∪ Y ) ∩ (X ∩ Z) and X ∩ (Y ∪ Z) = (X ∩ Y ) ∪ (X ∩ Z);
(A.II.10)
if X ⊂ E and Y ⊂ E, then
(X ) = X, (X ∪ Y ) = X ∩ Y , (X ∩ Y ) = X ∪ Y ; (A.II.11)
X ⊂ Y ⊂ E is equivalent to Y ⊂ X ; (A.II.12)
if X ⊂ E and Y ⊂ E then X ∩ Y = ∅ if and only if X ⊂ Y ; (A.II.13)
if X ⊂ E and Y ⊂ E then X ∪ Y = E if and only if X ⊂ Y, and Y ⊂ X;
(A.II.14)
X × Y = ∅ if and only if X = ∅ or Y = ∅; (A.II.15)
481
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 502
A COURSE IN ANALYSIS
For Z := X × Y we define
pr1 : Z→X pr2 : Z → Y
and
(x, y) → x (x, y) → y
482
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 503
H:X →Z
x → H(x) = G(F (x)).
F ◦ F −1 = idY
F −1 ◦ F = idX
483
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 504
A COURSE IN ANALYSIS
Families of Sets
In the following I and J are arbitrary index sets and (Ai )i∈I and (Bj )j∈J are
families of sets. We define the union and the intersection of such families by:
Ai := {x|x ∈ Ai for some i ∈ I};
i∈I
Ai = A1 ∪ A2 and Ai = A1 ∩ A2
i∈I i∈I
Ai ∪ Bj = (Aj ∪ Bj ); (A.II.44)
i∈I j∈J (i,j)∈I×J
484
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 505
Denumerable Sets
is countable. (Note that instead of N we may take any countable index set).
Next we want to give some hints on how to prove (in principle) statements
about sets and mappings when starting with the basics. There is a nat-
ural correspondence between certain logical operations and set theoretical
operations. Let us introduce the following statements
p: x∈X
q: x∈Y
then
x∈X ∩Y ⇐⇒ p ∧ q
x∈X ∪Y ⇐⇒ p ∨ q
x∈
/X ⇐⇒ ¬p
485
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 506
A COURSE IN ANALYSIS
x ∈ X ⇐⇒ ¬p.
pj : x ∈ X j
then
x∈ Xj =⇒ ∀j ∈ J : pj
j∈J
and
x∈ Xj ⇐⇒ ∃j ∈ J : pj .
j∈J
Now we may use truth tables to prove compound statements when finitely
many statements are involved. For example in order to prove the second
statement of (A.II.10), i.e.
X ∩ (Y ∪ Z) = (X ∩ Y ) ∪ (X ∩ Z)
we can look at
x ∈ X x ∈ Y x ∈ Z (x ∈ X) ∩ (x ∈ Y ∨ x ∈ Z) (x ∈ X ∧ x ∈ Y ) ∨ (x ∈ X ∧ x ∈ Z)
T T T T T
T T F T T
T F T T T
T F F F F
F T T F F
F T F F F
F F T F F
F F F F F
Table A.II.1
Since the last two columns coincide the two statements are equivalent, how-
ever
(x ∈ X) ∧ (x ∈ Y ∨ x ∈ Z) ⇐⇒ x ∈ X ∩ (Y ∪ Z)
and
(x ∈ X ∧ x ∈ Y ) ∨ (x ∈ X ∧ x ∈ Z) ⇐⇒ x ∈ (X ∩ Y ) ∪ (X ∪ Z).
486
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 507
Note: all statements about relations of sets given in our collection are state-
ments involving quantifiers, for example the above statement (A.II.10) is
equivalent to
In our proof we only considered the equivalence for a single x, but since x
was arbitrary this means that we proved it for all x ∈ X ∪ Y ∪ Z.
Although the method of truth tables will always provide a proof as long as
only finitely many statements are involved, it could be quite a time consuming
process to check all cases. For example to prove
one would have to complete a truth table with 16 rows. However, a short
and transparent proof is obtained by using step by step basic definitions and
simple rules for handling logical statements:
Since the pair (x, y) is arbitrary the statement (A.II.52) (which of course is
(A.II.18)) is proved. Similarly we can prove statements with quantifiers, for
example the first statement in (A.II.42):
Ai = Ai .
i∈I i∈I
We have
x∈ Ai ⇐⇒ x ∈
/ Ai
i∈I i∈I
487
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 508
A COURSE IN ANALYSIS
⇐⇒ ¬(∃i ∈ I : x ∈ Ai )
⇐⇒ ∀i ∈ I : ¬(x ∈ Ai )
⇐⇒ ∀i ∈ I : x ∈ Ai
⇐⇒ x ∈ Ai .
i∈I
The proofs for the statements listed above involving mappings are reduced
to statements for sets. For example the meaning of (A.II.24) is
y ∈ F (A ∩ B) =⇒ y ∈ F (A) ∩ F (B)
488
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 509
which is equivalent to
489
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 511
1 is a natural number.
P.A.2
P.A.3
491
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 512
A COURSE IN ANALYSIS
P.A.4
If n = m then n = m.
492
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 513
Theorem A.III.2. For every pair of natural numbers (n, m) there exists a
unique natural number denoted by add(n, m) such that
and
add(n, m ) = (add(n, m)) for all (n, m) ∈ N × N. (A.III.2)
add : N × N → N (A.III.3)
n + m := add(n, m) (A.III.4)
and the task is to prove using P.A.1-P.A.5 and Theorem A.III.2 only prop-
erties such as
(k + m) + n = k + (m + n) associativity,
or
n + m = m + n commutativity.
E. Landau in [7] gives a very systematical way of introducing N, addition
and the extension from N to Z as well as from Z to Q.
Finally we want to discuss how mathematical induction relates to the
Peano axioms. Recall that mathematical induction works as follows: suppose
that for n ∈ N a statement A(n) is given. If A(1) is true and if A(n) always
implies A(n + 1) then A(n) is true for all n ∈ N.
493
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 514
A COURSE IN ANALYSIS
Denote by M the set of all natural numbers such that A(n) is true, i.e.
M := {n ∈ N|A(n) is true}.
We have to prove
494
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 515
g2
h
β3 g1
β4 β2
β1
α3
α4 α2
α1
Figure A.IV.1
b γh a
c
α β
A c B
Figure A.IV.2
495
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 516
A COURSE IN ANALYSIS
Note that
α + β + γ = π, (A.IV.5)
and for the area of ABC we have
1
area(ABC) = hc · c (A.IV.6)
2
where hc is the height from C to AB. Clearly we have
1 1 1
hc c = hb b = ha a = area(ABC),
2 2 2
where hb and ha denote the heights from B to the side AC and A to the
side BC respectively. In the case of a right angled triangle ABC, see Figure
A.IV.3 we have Pythagoras’ theorem
a2 + b2 = c2 . (A.IV.7)
(Note that there is a slight abuse of notation here: a, b, c denote the sides in
ABC, whereas in (A.IV.7) we use the same symbols to denote the length of
these sides.)
C
·
b γ
a γ= π
2
α β
A c B
Figure A.IV.3
π
Note that we use the “continental” way to indicate an angle of size 2
i.e:
instead of
·
Symbol for a right angle
Figure A.IV.4
496
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 517
Figure A.IV.5
r=1
Segment AB
O A
Figure A.IV.6
497
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 518
A COURSE IN ANALYSIS
By the segment AB we mean the arc joining A and B, i.e. on ∂Cr (O). Often
we say that it is measured by the arc length. This definition of the size
of an angle implies the following correspondence:
π ∼ ◦ π ∼ ◦ π ∼ ◦ π ∼ ◦
= 30 , = 45 , = 60 , = 90 ,
6 4 3 2
3π ∼ 3π ∼
= 135◦ , π ∼
= 180◦, = 270◦, 2π ∼
= 360◦ .
4 2
For a circle Cr (O), see Figure A.IV.7, the length of the arc AB with angle
α is given by
length(AB) = rα (α measured by the arc length) (A.IV.10)
and the area of the sector OAB is given by
r2 α
area(OAB) = (α measured by the arc length). (A.IV.11)
2
B Segment AB
Sector OAB
A
α r
O
Figure A.IV.7
498
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 519
A. Trigonometric Functions
1. Symmetries
499
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 520
A COURSE IN ANALYSIS
⎧
x ⎨
1
2
(1 + cos x), −π ≤ x ≤ π
cos = (A.V.18)
2 ⎩ − 1 (1 + cos x), π ≤ x ≤ 3π
2
x sin x 1 − cos x
tan = = (A.V.19)
2 1 + cos x sin x
x sin x 1 + cos x
cot = = (A.V.20)
2 1 − cos x sin x
6. Sums
x±y x∓y
sin x ± sin y = 2 sin cos (A.V.21)
2 2
x+y x−y
cos x + cos y = 2 cos cos (A.V.22)
2 2
x+y y−x
cos x − cos y = 2 sin sin (A.V.23)
2 2
√ π
cos x ± sin x = 2 sin( ± x) (A.V.24)
4
sin(x ± y)
tan x ± tan y = (A.V.25)
cos x cos y
sin(z ± y)
cot x ± cot y = ± (A.V.26)
sin x sin y
cos(x − y)
tan x + cot y = (A.V.27)
cos x sin y
cos(x + y)
cot x − tan y = (A.V.28)
sin x cos y
500
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 521
7. Products
1
sin x sin y = (cos(x − y) − cos(x + y)) (A.V.29)
2
1
cos x cos y = (cos(x − y) + cos(x + y)) (A.V.30)
2
1
sin x cos y = (sin(x − y) + sin(x + y)) (A.V.31)
2
tan x + tan y
tan x tan y = (A.V.32)
cot x + cot y
cot x + cot y
cot x cot y = (A.V.33)
tan x + tan y
tan x + cot y
tan x cot y = (A.V.34)
cot x + tan y
8. Squares
sin2 x + cos2 x = 1 (A.V.35)
tan2 x 1
sin2 x = = (A.V.36)
1 + tan2 x 1 + cot2 x
1 cot2 x
cos2 x = = (A.V.37)
1 + tan2 x 1 + cot2 x
x 1
sin2 = (1 − cos x) (A.V.38)
2 2
x 1
cos2 = (1 + cos x) (A.V.39)
2 2
2
sin x 1 − cos2 x
tan2 x = = (A.V.40)
1 − sin2 x cos2 x
cos2 x 1 − sin2 x
cot2 x = = (A.V.41)
1 − cos2 x sin2 x
501
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 522
A COURSE IN ANALYSIS
9. Useful Values
x 0 π6 π
4
π
3
π
2
2π
3
3π
4
5π
6
π
◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦ ◦
0 30 45 √ 60
√ 90 120
√ 135
√ 150 180
1 1
sin x 0 √ 2 2√ 2
2 1 3 1 12 3 12 √2 1
2√
0
1 1 1 1 1 1
cos x 1 2 √3 2 2 √2 0 −√2 − 2 2− 2 √3 -1
tan x 0 13√ 3 1 √3 - − √3 -1 − 13√ 3 0
cot x - 3 1 13 3 0 − 13 3 -1 − 3 -
B. Hyperbolic Functions
1. Symmetries
sinh(−x) = − sinh x (A.V.42)
cosh(−x) = cosh x (A.V.43)
tanh(−x) = − tanh x (A.V.44)
coth(−x) = − coth x (A.V.45)
2. Addition Theorems
4. Half Arguments
⎧
x ⎨
1
2
(cosh x − 1), x ≥ 0
sinh = (A.V.54)
2 ⎩ − 1 (cosh x − 1), x < 0
2
'
x 1
cosh = (cosh x + 1 (A.V.55)
2 2
x cosh x − 1 sinh x
tanh = = (A.V.56)
2 sinh x cosh x + 1
x sinh x cosh x + 1
coth = = (A.V.57)
2 cosh x − 1 sinh x
5. Sums
1 1
sinh x ± sinh y = 2 sinh (x ± y) cosh (x ∓ y) (A.V.58)
2 2
1 1
cosh x + cosh y = 2 cosh (x + y) cosh (x − y) (A.V.59)
2 2
1 1
cosh x − cosh y = 2 sinh (x + y) sinh (x − y) (A.V.60)
2 2
sinh(x ± y)
tanh x ± tanh y = (A.V.61)
cosh x cosh y
6. Squares
cosh2 x − sinh2 x = 1 (A.V.62)
tanh2 x 1
sinh2 x = cosh2 x − 1 = 2 = 2 (A.V.63)
1 − tanh x coth x − 1
1 coth2 x
cosh2 x = sinh2 x + 1 = = (A.V.64)
1 − tanh2 x coth2 x − 1
sinh2 x cosh2 x − 1 1
tanh2 x = 2 = 2 = (A.V.65)
sinh x + 1 cosh x coth2 x
sinh2 x + 1 cosh2 x 1
coth2 x = 2 = 2 = (A.V.66)
sinh x cosh x − 1 tanh2 x
Note that we will see the relationship between hyperbolic and trigonometric
functions when we consider complex arguments later in this course.
503
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 525
• every set bounded from above (below) has a least (greatest) upper
(lower) bound (Theorem 19.14).
Without these results we cannot prove many others, hence the completeness
of R is key for our theory. Nonetheless there are at least two problems
with the axiom of completeness. Firstly, it looks quite artificial, an ad hoc
requirement which turns out to be useful. Secondly, while we may suppose
the axiom to hold, we have given no proof so far that an Archimedian ordered
field which is complete exists.
First we want to discuss an equivalent way of introducing the completeness
of R by choosing a different axiom as a starting point.
Axiom A
505
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 526
A COURSE IN ANALYSIS
Proof. Let x be the least upper bound of {xn |n ∈ N}. Given > 0 there
exists N ∈ N such that x − 2 < xN < x. Since (xn )n∈N is increasing it
follows for all n ≥ N that x − 2 ≤ xN ≤ xn ≤ x, or for all n ≥ N we have
0 ≤ x − xn ≤ 2 , i.e. |xn − x| < , implying the convergence of (xn )n∈N to
x.
i.e. (xkm )m∈N converges to c. Now Lemma 17.10.B implies the result.
Theorem A.VI.3 implies the equivalence of Axiom A (or Axiom A ) with the
Axiom of Completeness, and arguably Axiom A is more natural to accept.
It is possible to prove the equivalence of other statements to the Axiom of
Completeness, but we do not want to go into further detail.
506
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 527
and
[(n, m)] [(n , m )] := [(nn + mm , nm + mn )].
First we can prove that these definitions are independent of the represen-
tatives chosen. Moreover we can identify n ∈ N with [n + m, m], m ∈ N,
and we may define 0 := [n, n], as we may set −n for [m, n + m]. It takes
some work, but it is not difficult to see that N × N/Z with the operations
⊕ and forms a ring and we will use the standard notations from now on,
i.e. 0, 1, n, −n, n + m, n − m when working in Z. We do not want to go much
further into the details since we will do so when passing from Z to Q for
which we employ a similar construction.
On Z × N we define the relation
507
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 528
A COURSE IN ANALYSIS
Q := Z × N/Q ,
and
[(k, m)] [(l, n)] := [(kl, mn)]. (A.VI.4)
Note that mn = m + · · · + m (n summands), so we need only addition in N
(which we get from the Peano axioms) to define ⊕ and . First we need to
prove that our definitions are independent of the choice of representatives.
So let (k, m) Q (k , m ) and (l, n) Q (l , n ). We find
and
[(k , m )] ⊕ [(l , n )] = [(n k + l m , n m )].
However we have km = k m and ln = l n and therefore
nkn m + lmn m = n k nm + l m nm
= n (k m)n + (l n)mm = n (km )n + (ln )mm ,
implying
(nk + lm)n m = (n k + l m )nm,
or
(nk + lm, nm) Q (n k + l m , n m ),
508
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 529
i.e.
[(nk + lm, nm)] = [(n k + l m , n m )].
Analogously we can prove that (A.VI.4) is independent of the representatives.
The next task is to verify the field axioms for (Q, ⊕, ). For example we find
since
[(k, m)] ⊕ [(l, n)] = [(kn + lm, nm)]
and
[(l, n) ⊕ (k, m)] = [(lm + kn, mn)].
For n ∈ Z we identify [(n, 1)] with n, and since (0, 1) Q (0, m) for all m ∈ N
we can represent 0 by any pair of the type (0, m). Further, for n ∈ N we can
identify [n, n] with 1, indeed we get
For [(m, n)] = [(0, 1)] we can form its inverse of multiplication by [(n, m)]:
Thus, along these lines it is possible to prove that (Q, ⊕, ) is a field and
we can consider this field as a model of the rational numbers. We can also
introduce an order relation ≤ on (Q, ⊕, ) by
[(k, m)] ≤ [(l, n)] and [(l, n)] ≤ [(p, q)] implies [(k, m)] ≤ [(p, q)].
Moreover we find
[(0, 1)] ≤ [(k, m)] (A.VI.5)
509
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 530
A COURSE IN ANALYSIS
i.e. (xn )n∈N R (yn )n∈N and (yn )n∈N R (zn )n∈N implies (xn )n∈N R (zn )n∈N .
Hence we have proved that “R ” is an equivalence relation on C. Now we
consider
R := C/R , (A.VI.7)
the set of all equivalence classes of Cauchy sequence of rational numbers. On
C/R we introduce the following two operators:
and
[(xn )n∈N ] [(yn )n∈N ] := [(xn yn )n∈N ]. (A.VI.9)
510
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 531
First we need to show that these definitions are independent of the choice
of representatives. If (xn )n∈N R (xn )n∈N and (yn )n∈N R (yn )n∈N then it
follows immediately that (xn + yn )n∈N R (xn + yn )n∈N since
lim (xn + yn − (xn + yn )) = lim (xn − xn ) + lim (yn − yn ) = 0.
n→∞ n→∞ n→∞
Furthermore we know that (yn )n∈N and (xn )n∈N are bounded and
implies now that lim (xn yn − xn yn ) = 0, i.e. (xn yn )n∈N R (xn yn )n∈N .
n→∞
Next we claim that (C/R , ⊕, ) is a field. We will check only some of
the axioms and the reader is invited to check the remaining ones. For the
addition ⊕ we find for example
or
[(xn )n∈N ] ⊕ [(−xn )n∈N ] = [(xn − xn )n∈N ] = [0].
For the multiplication we have for example with [e] = [(en )n∈N ], en = 1 for
n ∈ N, that
[(xn )n∈N ] [e] = [(xn en )n∈N ] = [(xn )n∈N ].
More delicate is to prove that if [(xn )n∈N ] = [0], then we can find an inverse
with respect to the multiplication. We observe that if [(xn )n∈N ] = [0] there
exists δ ∈ Q, δ > 0, and N(δ) ∈ N such that |xn | ≥ δ for all n ≥ N(δ). If this
is not the case then (xn )n∈N has a subsequence (xnk )k∈N converging to zero,
and by Lemma 17.10.B we conclude that (xn )n∈N must converge to zero, i.e.
[(xn )n∈N ] = [0], which is a contradiction. (Note that the proof of Lemma
17.10.B works for Cauchy sequences in Q.) For (xn )n∈N ∈ C not equivalent
to (cn )n∈N , cn = 0 for all n ∈ N, we define
-
x−1
n , n ≥ N(δ)
x̃n :=
0, n < N(δ)
511
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 532
A COURSE IN ANALYSIS
where -
1, n ≥ N(δ)
xn x̃n =
0, n < N(δ),
which implies that (xn x̃n )n∈N (en )n∈N , en = 1 for n ∈ N. The remain-
ing axioms, in particular the associative laws and the distributivity law are
proved in a straightforward way along the lines as indicated above.
We want to define an order structure on C/R . We call (xn )n∈N ∈ C positive
if there exists δ ∈ Q, δ > 0, and N(δ) ∈ N such that xn ≥ δ for all n ≥ N(δ).
Again, our first task before looking at C/R is to prove that the definition is
independent of the representative. For this let (xn )n∈N ∈ C be positive and
(xn )n∈N ∈ C be equivalent to (xn )n∈N . Then there exists δ ∈ Q, δ > 0, and
N(δ) ∈ N such that xn ≥ δ for n ≥ Nδ . Further, since lim (xn − xn ) = 0 we
n→∞
can find Ñ (δ) ∈ N such that |xn − xn | < δ2 for all n ≥ Ñ(δ). This however
implies for n ≥ max(N(δ), Ñ(δ)) that xn > xn − δ2 ≥ δ2 and hence (xn )n∈N is
positive too. With [0] = [(cn )n∈N ], cn = 0 for all n ∈ N, we define
holds. Let [(xn )n∈N ] = [0]. We claim [(|xn |)n∈N ] ≥ [0]. If this is not the
case, then there exists a subsequence (xnk )k∈N of (xn )n∈N such that |xnk | < k1
implying by Lemma 17.10.B that (xn )n∈N R (cn )n∈N , cn = 0 for all n ∈ N.
Now, we know |xn | ≥ δ > 0 for n ≥ N(δ) and (xn )n∈N is a Cauchy sequence.
Thus there exists Ñ (δ) ∈ N such that n, m ≥ Ñ(δ) implies |xm − xn | < δ2 .
For m0 ≥ max(N(δ), Ñ(δ)) it follows from |xn | ≥ δ that either xm0 ≥ δ
512
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 533
513
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 534
A COURSE IN ANALYSIS
514
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 535
Corollary A.VI.6. For every x ∈ R and > 0 there exists x ∈ j(Q) such
that |x − x| < .
Proof. Let x ∈ R, i.e. x = [(qn )n∈N ] for a Cauchy sequence (qn )n∈N , qn ∈ Q.
By Theorem A.VI.5 we have lim j(qn ) = x, so given > 0 we choose N() ∈
n→∞
N such that |j(qn ) − x| < for n ≥ N() and it follows that |x − x| < for
x = j(qN ()+1 ).
515
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 536
A COURSE IN ANALYSIS
Proof. Let (xn )n∈R be a Cauchy sequence in R. We have to prove the exis-
tence of x ∈ R such that lim xn = x. Let (n )n∈N , n > 0, be a sequence in
n→∞
R such that lim n = 0. (Any sequence (ηn )n∈N , ηn ∈ Q, such that ηn > 0
n→∞
and lim ηn = 0 will induce on R such a sequence by n := j(ηn ).)
n→∞
For n ∈ N there exists qn ∈ Q such that
|j(qn ) − xn | < n .
Since lim n = 0 and (xn )n∈N is a Cauchy sequence, given > 0 we can find
n→∞
N() ∈ N such that n, m ≥ N() implies
n < , m < , |xn − xm | < ,
3 3 3
Given > 0, since lim n = 0 and lim qn = x, we can find N ∈ N such that
n→∞ n→∞
n ≥ N yields n < 2
and |qn − x| < 2 , or for n ≥ N
|xn − x| < + = ,
2 2
which implies lim xn = x.
n→∞
516
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 537
Our presentation follows that in K. Endl and W. Luh [3]. However we have
left some of the details to the reader (including the fact that R as constructed
is an Archimedian field). The reason why the proof is so long is partly because
we have a lot of structure on Q which needs to be transferred to R: algebraic
structures, order structure and convergence (topological structure).
Finally we want to mention a further possibility of constructing R from Q.
Let A, B ⊂ R be two non-empty sets such that A ∪ B = R. In addition we
require for all a ∈ A and b ∈ B that a < b. We call such a pair of subsets
of R a Dedekind cut and denote it by (A|B). Further we call any s ∈ R a
separating number for (A|B) if for all a ∈ A and b ∈ B we have a ≤ s ≤ b.
Equivalent to our axiom of completeness is:
Axiom D
517
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 539
lim inf (an + bn ) ≤ lim inf an + lim sup bn ≤ lim sup(an + bn ); (A.VII.7)
n→∞ n→∞ n→∞ n→∞
519
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 540
A COURSE IN ANALYSIS
1
lim sup an = ∞ if and only if lim inf = 0, (A.VII.13)
n→∞ an
1
lim inf an = ∞ if and only if lim sup = 0;
n→∞ n→∞ an
if 0 < lim inf an ≤ lim sup an < ∞ then (A.VII.14)
n→∞ n→∞
1 1
lim sup = ,
n→∞ an lim inf n→∞ an
1 1
lim inf = ;
n→∞ an lim supn→∞ an
520
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 541
a1 + · · · + an a1 + · · · + an
lim inf an ≤ lim inf ≤ lim sup ≤ lim sup an ;
n→∞ n→∞ n n→∞ n n→∞
521
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 543
and a half-open interval is of the type I ∪{c} where I is an open interval with
c being an end point. Thus if we can prove that the union of intersecting
connecting sets is connected and noting that {a}, a ∈ R, is trivially connected
we are done.
Our
claim is: let Aj ⊂ R, j ∈ J = ∅, be a family of connected sets such that
j∈J Aj =
∅. Then j∈J Aj is connected too.
Suppose
j∈J Aj is not connected
and let {O1 , O2 } be a non-trivial splitting
of j∈J Aj such that O1 ∩ j∈J Aj = ∅. Since Aj is connected it follows that
523
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 544
A COURSE IN ANALYSIS
Aj ∩ O2 = ∅
for all j ∈ A, implying O2 ∩ j∈J Aj = ∅ which is a contradic-
tion. Hence j∈J Aj does not have a non-trivial splitting and therefore it is
connected.
Next we prove Theorem 19.27 which states that every open set in R is a
denumerable union of disjoint open intervals.
Proof of Theorem 19.27. Let A ⊂ R be open and x ∈ A. Then there exists
δ > 0 such that (x, x + δ) ⊂ A and (x − δ, x) ⊂ A. Let b := sup{y|(x, y) ⊂ A}
and a := inf{z|(z, x) ⊂ A}. Clearly a < x < b and Ix := (a, b) is an open
interval containing x. We claim Ix ⊂ A. Take w ∈ Ix , and assume x < w < b,
the case a < w < x goes analogously. The definition of b implies the existence
of y > w such that (x, y) ⊂ A but w ∈ (x, y), so w ∈ A. Next we prove
that b ∈/ A (the fact that a ∈ / A goes analogously). Suppose b ∈ A. In
this case there would exist some > 0 such that (b − , b + ) ⊂ A, hence
(x, b + ) ⊂ A contradicting the definition of b. We consider now (Ix )x∈A .
Each x ∈ A belongs to some of these intervals, for example Ix , and each Ix is
contained in A, thus A = x∈A Ix . We want to prove that either Ix1 ∩Ix2 = ∅
or x1 = x2 . Let Ix1 and Ix2 , x1 , x2 ∈ A, say Ix1 = (α1 , β1 ) and Ix2 = (α2 , β2 ),
and suppose x ∈ (α1 , β1 ) ∩ (α2 , β2 ). In this case it follows that α2 < β1 and
α1 < β2 . But α2 ∈ / A hence α2 ∈ / (α1 , β1 ) and therefore α2 ≤ α1 . Since
α1 ∈/ A and hence α2 ∈ / (α2 , β2 ) we have α1 ≤ α2 , i.e. α1 = α2 . Similarly
we can prove β1 = β2 to get (α1 , β1 ) = (α2 , β2 ). Thus if Ix1 ∩ Ix2 = ∅ then
Ix1 = Ix2 . So we have already proved that A is the union of disjoint open
intervals. By Theorem 19.11 each of these intervals must contain a rational
number. But the rational numbers are countable and no rational number
can belong to two of these intervals. Hence we have at most countably many
open intervals, the union of which is A.
524
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 545
Z \ M = {x ∈ Z | x < 5} = {x ∈ Z | x ≤ 4}.
{x ∈ Z | 5x + 7 = 13} = φ.
525
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 546
A COURSE IN ANALYSIS
6
b) This is the same condition as in a), but now we seek rational solutions and 5 ∈ Q.
Therefore we have
6
{x ∈ Q | 5x + 7 = 13} = .
5
{x ∈ Z | 5x + 7 ≤ 13} = {x ∈ Z | x ≤ 1}.
7. a)
−7 27 18 −7 27 · 5 − 18 · 8
− =
3 8 5 3 8·5
−7 135 − 144
=
3 40
−7 9 7 9 7·3 21
= − = · = = .
3 40 3 40 40 40
b)
3 7 3·3 7 9+7
4 + 12 3·4 + 12 12
2 1 = 2·7 19 = 5
19 − 7 19·7 − 19·7 − 133
16 4
4 133 532
= − 12 3
5 = 5 = 3 · 5 = .
133 133
15
c)
42 − 33 16 − 27 11 1
= =− =− .
52 + 19 25 + 19 44 4
8. a)
526
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 547
Now
1 1
(a + b − c)(2a − 6b + 2c) = (a + b − c)(a − 3b + c)
4 2
1 2
= (a + ab − ac − 3ab − 3b + 3bc + ac + bc − c2 )
2
2
1
= (a2 − 3b2 − c2 − 2ab + 4bc)
2
and therefore the result is proved.
c)
d)
x3 − y 3 1 x y
− y 4 x2 3
− +
y−x y x y x
= −(x2 + xy + y 2 ) − yx + y 3 x3 − y 5 x
= x3 y 3 − x2 − 2xy − xy 5 − y 2 .
9.
1
12 6
8
9 11 − 29
5 − 7
7
8 3
−
3 4 222 84
1 72 30
− −
= 9 99 8 99 1135 35
3 · − 4
1 50 54 1·50·54
·99 · 35
=−9 82 = − 9·99·35
22
12 3
10.6 20 20
9·11·7 3·11·7 11·7
=− 22 = − 22 = − 22
3 3 1
20 10 10
=− =− =− ·
77 · 22 77 · 11 847
527
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 548
A COURSE IN ANALYSIS
10. a) b)
3 4
2 1 8 2 3 3 2 8 9
− +5 − −
3 2 9 5 8
= 125 64
19 19
8 1 40 40 40
= − + 512−1125 613
27 16 9 8000 200
8 · 16 − 27 + 40 · 3 · 16 = 19 =−
= 40
19
27 · 16 613
2021 =− .
= . 3800
432
11. a)
(a + b)3 − (b − a)2 (b + a)
4ab
a3 + 3a2 b + 3ab2 + b3 − (b2 − 2ab + a2 )(b + a)
=
4ab
a3 + 3a2 b + 3ab2 + b3 − b3 + a2 b + ab2 − a3
=
4ab
4a2 b + 4ab2
= = a + b;
4ab
b)
a 3 4
b − ab
a2 b 3
a3 b4
3 − 4 a7 − b 7
= b 2 3a = .
a b a6 b 6
√
225 a4 b6 a2 b3
12. a) 625 = 25; b) 49 = 15
7 ; c) (a+b)2 = a+b .
5
3
528
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 549
b) Note that
7 2 3
+ x≤ x
4 5 8
is equivalent to
2 3 7
x− x≤− ,
5 8 4
i.e.
1 7
x≤−
40 4
or
x ≤ −70.
Thus every x ∈ R satisfying x ≤ −70 solves this inequality.
−70
(x − 3) ≥ 0 and (x + 4) ≥ 0
or
(x − 3) ≤ 0 and (x + 4) ≤ 0
is true.
The first pair of inequalities imply
x≥3 and x ≥ −4
x≤3 and x ≤ −4
0
−4 3
z
14. The term xy is not well defined. For x = 2, y = 3, z = 2 we have
529
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 550
A COURSE IN ANALYSIS
1 1 b·d 1
+ = b−1 + d−1 = (b−1 + d−1 ) = (b−1 (b · d) + d−1 (b · d))
b d b·d b·d
1 1 d+b
= ((b−1 · b) · d + (d−1 · d)b) = (d + b) = .
b·d b·d b·d
b) We first show that (x−1 )−1 = x for x = 0. Since (x−1 )−1 x−1 = 1 and
−1
x · x = 1 it follows that
or
(x−1 )−1 x−1 · x = x · x−1 · x,
i.e.
(x−1 )−1 = x
1
and using fractions we get 1 = x. Now we find
x
−1 −1
a
b a c −1 a 1 a −1 1 a 1 ad
c = · = c· = ·c = · ·d= .
d b d b d b d b c bc
530
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 551
b
Now we wish to take the square root on the left hand side. If x + 2a ≥ 0 we have
no problem to find
b 1 2
x+ = b − 4ac,
2a 2a
or
b 1 2
x=− + b − 4ac.
2a 2a
b
b
b
b 2
If x + 2a ≤ 0, we know that − x + 2a = −x − 2a ≥ 0, however x + 2a =
b 2
−x − 2a . Thus we have
2
b 1
−x − = 2 (b2 − 4ac)
2a 4a
or
b 1 2
−x − = b − 4ac,
2a 2a
implying
b 1 2
x=− − b − 4ac.
2a 2a
Thus so long as b2 −4ac ≥ 0 we find that the solutions of the quadratic equation
ax2 + bx + c = 0 are
b 1 2
x1 = − + b − 4ac
2a 2a
and
b 1 2
x2 = − − b − 4ac.
2a 2a
b
If b2 = 4ac we have only one solution x1 = x2 = − 2a .
Chapter 2
1. Since A = {x ∈ X | x ∈
/ A} we have A = {e, f, g, h, i}. The set A ∩ C is given by
those elements belonging to both the sets A and C, hence,
A ∩ C = {c, d}.
Now we find
(A ∩ C) = {a, b, e, f, g, h, i}.
The set B \ C consists of every x ∈ X which belongs to B but does not belong to
C, so we find
B \ C = {b, h}.
Finally, since A ∪ B = {a, b, c, d, f, h}, we have
531
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 552
A COURSE IN ANALYSIS
532
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 553
( )
]( )[
−6 −5 −4 −3 −2 −1 0 1 2 7 3 4 5 6
3
( )
d) Since $
7 7
−2, = x ∈ R − 2 ≤ x <
3 3
and further $ %
3 15 3 15
, = x ∈ R ≤ x ≤
5 4 5 4
we find $ $ % $
7 3 15 3 7
−2, ∩ , = ,
3 5 4 5 3
which we can also see from the following:
[ ]
[ ) |
−3 −2 −1 0 3 1 2 7 3 15 4
5 3 4
[ )
x ∈ A ∩ B =⇒ x ∈ A
or we can write A ∩ B ⊂ A.
On the other hand x ∈ A implies x ∈ A or x ∈ B, i.e.
x ∈ A =⇒ x ∈ A ∪ B,
or we can write A ⊂ A ∪ B.
b) Let x ∈ (A \ B) ∩ B. Then x ∈ A \ B and x ∈ B, or
(x ∈ A ∧ x ∈
/ B) ∧ x ∈ B
x ∈ (A ∩ B) ∪ C
⇐⇒ (x ∈ A ∩ B) ∨ (x ∈ C)
⇐⇒ ((x ∈ A) ∧ (x ∈ B)) ∨ (x ∈ C)
⇐⇒ ((x ∈ A) ∨ (x ∈ C)) ∧ ((x ∈ B) ∨ (x ∈ C))
⇐⇒ x ∈ (A ∪ C) ∩ (B ∪ C) .
533
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 554
A COURSE IN ANALYSIS
5. We prove
a) =⇒ b) =⇒ c) =⇒ d) =⇒ e) =⇒ f) =⇒ a).
Now A ⊂ B means x ∈ A =⇒ x ∈ B. Therefore x ∈ A implies x ∈ A and x ∈ B,
hence x ∈ A ∩ B, i.e. A ⊂ A ∩ B. The inclusion A ∩ B ⊂ A has already been
proved (see Problem 3 a)). Thus A ⊂ B implies A ∩ B = A, i.e. a) implies b).
Next observe that A ∩ B = A is equivalent to (A ∩ B) = A , or
A ∪ B = A . Thus x ∈ B must imply x ∈ A , B ⊂ A , proving b) implies c).
Suppose now that B ⊂ A . We have already proved that A ⊂ B implies A ∩ B =
A. Thus B ⊂ A implies A ∩ B = B . Taking the complement on both sides
gives (A ∩ B ) = B, but (A ∩ B ) = A ∪ B, so we find A ∪ B = B and
therefore c) =⇒ d).
If A ∪ B = B then A ∪ B = A ∪ A ∪ B = X, since A ∪ A = X. Therefore we
have proved that d) implies e).
In order to prove that e) implies f) we only need to take complements: if B ∪ A = X
then (B ∪ A ) = X or B ∩ A = φ.
Finally, we show that A ∩ B = φ implies A ⊂ B. Note that A ∩ B = φ means
that x ∈ A and x ∈ B cannot hold, i.e. x ∈ A implies x ∈ B which is A ⊂ B of
course, i.e. we have proved that f) implies a).
6. We have:
a)
5 5
− = ;
8 8
b)
11
− 3 = 11 − 9 = 2 = 2 ;
3 3 3 3
c)
7 12 35 − 108 73 73
− = =− = ;
9 5 45 45 45
d)
| | − 3| − | − 5| | = |3 − 5| = | − 2| = 2;
√
e) a2 = |a|;
(note that a can be non-positive since a ∈ R).
534
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 555
1 1 1
(a + b − |a − b|) = (a + b − (b − a)) = 2a = a.
2 2 2
However, if b ≤ a we have b = min{a, b} and a − b ≥ 0 which gives |a − b| = a − b
and therefore
1 1 1
(a + b − |a − b|) = (a + b − (a − b)) = 2b = b.
2 2 2
Since a > 0, we deduce that
a2 + 1 ≥ 2a,
or
a2 − 2a + 1 = (a − 1)2 ≥ 0,
1
a+ ≥0 and (a − 1)2 ≥ 0.
a
8. We may use the triangle inequality (2.11). Therefore we can easily see that
|a − c| = |a − b + b − c| ≤ |a − b| + |b − c|.
For the second estimate we use the converse triangle inequality, i.e. (2.55), which
states for α, β ∈ R, that
||α| − |β|| ≤ |α − β|. Now with α = |a − b| and β = c we find
535
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 556
A COURSE IN ANALYSIS
9. a) For x ∈ R
8x − 11 > −24x + 89
is equivalent to
32x > 100
or
25
.
x>
8
Thus 8x − 11 > −24x + 89 holds for all x > 25 8 .
536
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 557
03 + 13 + 23 = 1 + 8 = 9
which is divisible by 9. If the statement holds for k, then we find for k + 1 that
Now the first term k 3 + (k + 1)3 + (k + 2)3 as well as the second term is divisible
by 9 and the result follows by mathematical induction.
b) For n = 0 we find
05 04 03 0
+ + − = 0 ∈ Z.
5 2 3 30
537
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 558
A COURSE IN ANALYSIS
we have a factorisation as required with Qn+1 (x, y) = xQn (x, y)+y n , and the result
follows.
b) For n = 1, the statement reduces to
(1 − 1)x + y 1 ≥ 1xn−1 y
538
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 559
xn+1 − y n x + y n+1 − xn y ≥ 0.
Note that
xn+1 − y n x + y n+1 − xn · y = xn (x − y) + y n (y − x)
= (xn − y n )(x − y).
3. a)
2
1 1 1 1 1 1
j
= −2 + −1 + 0 + 1 + 2
j=−2
2 2 2 2 2 2
1 1
= 22 + 2 + 1 + + = 7 34 .
2 4
b)
5
(ak − ak−2 ) = a2 − a2−2 + a3 − a3−2 + a4 − a4−2 + a5 − a5−2
k=2
= a2 − 1 + a3 − a + a4 − a2 + a5 − a3
= a5 + a4 − a − 1.
539
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 560
A COURSE IN ANALYSIS
c)
6
l+1 1+1 2+1 3+1
(−1)l = (−1)1 + (−1)2 + (−1)3
l 1 2 3
l=1
4+1 5+1 6+1
+ (−1)4 + (−1)5 + (−1)6
4 5 6
3 4 5 6 7
= −2 + − + − +
2 3 4 5 6
4 6 3 5 7
= −2− − + + +
3 5 2 4 6
68 47 −272 + 235 37
=− + = =− .
15 12 60 60
4. a) In both cases our formal proof will use induction. However before giving
the formal proofs let us rewrite the statement as
λ(a1 + . . . + aN ) = (λa1 + . . . + λaN )
and
(a1 + . . . + aN ) + (b1 + . . . + bN ) = (a1 + b1 ) + . . . + (aN + bN ),
thus we get a feeling for the content of these statements: the first is an extension of
the law of distributivity, the second follows as an extension of the commutativity
of addition. Here are the formal proofs:
for N = 1 we obviously have
1
1
λ aj = λa1 = (λaj ).
j=1 j=1
Now if
N
N
λ aj = (λaj )
j=1 j=1
540
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 561
If
N
N
N
aj + bj = (aj + bj )
j=1 j=1 j=1
Now if
n
1 n
=
(2k − 1)(2k + 1) 2n + 1
k=1
541
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 562
A COURSE IN ANALYSIS
b) For k = 1 we find
1
n · n! = 1 · 1! = 1 = (1 + 1)! − 1.
n=1
holds then
m+1
m
(a + (j − 1)d) = (a + (j − 1)d) + a + ((m + 1) − 1)d
j=1 j=1
1
= m(2a + (m − 1)d) + a + md
2
1 1
= · 2am + a + m(m − 1)d + md
2 2
1 1
= 2a(m + 1) + (m + 1)md
2 2
1
= (m + 1)(2a + md).
2
6. a)
2
2−k = 2−(−2) · 2−(−1) · 2−(0) · 2−1 · 2−2
k=−2
= 22 · 2 · 1 · 2−1 · 2−2 = 1.
542
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 563
b)
6
(j − 4) = (3 − 4)(4 − 4)(5 − 4)(6 − 4) = 0.
j=3
c)
5
j+2 3 4 5 6 7 1
= · · · · = .
j=1
j + 4 5 6 7 8 9 6
7. Again, as in Problem 4 a), we first rewrite the statement to understand its content:
N
N
(μaj ) + (νaj )
j=1 j=1
8. a)
7! = 1 · 2 · 3 · 4 · 5 · 6 · 7 = 5040
and
63! 60! 61 · 62 · 63
= = 61 · 62 · 63 = 238, 266.
60! 60!
b)
(n + 1)! − n! (n + 1)n! − n! ((n + 1) − 1)n!
= = = n!
n n n
543
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 564
A COURSE IN ANALYSIS
c)
(n + 1)! (n − 1)! · n · (n + 1)
= = n(n + 1).
(n − 1)! (n − 1)!
9. a) For n = 2 we find
2
2k − 1 1 3 3 1 4 6
= · = = 4 = .
2k 2 4 8 2 2 16
k=1
Now we want to show that the statement for n implies that for n + 1:
n+1
n
2k − 1 2k − 1 2(n + 1) − 1
=
2k 2k 2(n + 1)
k=1 k=1
1 2n 2n + 1
= 2n
2 n 2n + 2
1 (2n)! 2(2n + 1)
= 2(n+1) .
2 n!n! n + 1
Thus it remains to prove that:
(2n)! 2(2n + 1) 2(n + 1) (2(n + 1))!
= = .
n!n! n + 1 n+1 (n + 1)!(n + 1)!
Note that
(2n)!2(2n + 1) 2(2n + 1)!
=
n!n!(n + 1) (n + 1)!n!
and
(2(n + 1))! (2n + 1)!(2n + 2) 2(2n + 1)!(n + 1) 2(2n + 1)!
= = =
(n + 1)!(n + 1)! (n + 1)!(n + 1)! (n + 1)!(n + 1)! (n + 1)!n!
and the identity is now proved.
b) Since
1−1
1
k 0
1
k
1+ = 1+ =1
k k
k=1 k=1
the statement is true for n = 1. Now under the assumption that the statement
holds for n we get for n + 1 that:
n k n−1 k n
1 1 1
1+ = 1+ 1+
k k n
k=1 k=1
n n
nn 1 nn n + 1 (n + 1)n
= 1+ = =
n! n n! n n!
(n + 1)n (n + 1) (n + 1)n+1
= = ,
n!(n + 1) (n + 1)!
and the assertion is proved.
544
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 565
10. a)
4
4
(5x2 + 3y)4 = (5x2 )4−k (3y)k
k
k=0
4 2 4 4 2 3 4 2 2 2 4 2 3 4
= (5x ) + (5x ) (3y) + (5x ) (3y) + (5x )(3y) + (3y)4
0 1 2 3 4
= 625x8 + 1500x6 y + 1350x4 y 2 + 540x2 y 3 + 81y 4 .
b)
n
n n−k
(x − y)n = x (−y)k
k
k=0
n
k n
= (−1) xn−k y k .
k
k=0
545
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 566
A COURSE IN ANALYSIS
b) For k = 1 it is true that p > 1 for p ≥ 3 and also for k = 2 we have p2 > k 2
since p ≥ 3. (Note that p ≥ 2 is not sufficient to get the strict inequality.) Assume
pk > k 2 and k ≥ 2. Multiplying by p yields:
pk+1 > pk 2 ≥ 3k 2
3k 2 ≥ k 2 + 2k + 1
or
2k 2 ≥ 2k + 1
which is equivalent to
k 2 + (k − 1)2 ≥ 2,
which holds since k ≥ 2. Thus by mathematical induction the statement holds for
all k ≥ 2. The case k = 1 has already been proved.
c) Note that for k = 2, 3 and 4 the statement is false. For k = 5 we have:
25 = 32 > 25 = 52 .
2k+1 > 2k 2
1
1 √
√ = 1 ≤ 2 1 = 2.
j=1
j
Now under the assumption that the statement holds for N we find for N + 1 that:
N
+1 1 N √
1 1 1
√ = √ +√ ≤2 N+√
j=1
j j=1
j N +1 N +1
546
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 567
√ √
Now multiplying this inequality by N + 1+ N gives the equivalent statement:
√ √
N +1+ N √ √ √ √
√ ≤ 2( N + 1 − N )( N + 1 + N ) = 2
N +1
or '
N
1+ ≤ 2,
N +1
i.e. we need to justify the equivalent statement
'
N
≤1
N +1
N
which follows from N +1 < 1.
b) For k = 1 we find:
1
(2m)! = 2! = 2 ≥ ((1 + 1)!)1 = 2.
m=1
Suppose that
k
(2m)! ≥ ((k + 1)!)k .
m=1
which is equivalent to
((k + 2)!)k+1
(2(k + 1))! ≥
((k + 1)!)k
((k + 1)!(k + 2))k (k + 2)!
=
((k + 1)!)k
= (k + 2)k (k + 2)!
However note:
547
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 568
A COURSE IN ANALYSIS
or more formally
k
(2(k + 1))! = (k + 2)! (k + 2 + j).
j=1
Hence we conclude
(2(k + 1))! ≥ (k + 2)k (k + 2)!
14. We first prove (∗ ∗) for n = 2k . For this we use mathematical induction. The case
k = 1, i.e. n = 2 follows from
or
√ a1 + a2
a1 a2 ≤ .
2
Now suppose that (∗ ∗) holds for n = 2k−1 , i.e.
1 a1 + . . . + a2k−1
(a1 · . . . · a2k−1 ) 2k−1 ≤ .
2k−1
However we also have for the “next” 2k−1 terms the following estimate:
1 a2k−1 +1 + · . . . · +a2k
(a2k−1 +1 · . . . · a2k ) 2k−1 ≤ ,
2k−1
or equivalently
a + . . . + a k−1 2k−1
1 2
a1 · . . . · a2k−1 ≤
2k−1
and
a + . . . + a2k 2
k−1
2k−1 +1
a2k−1 +1 · . . . · a2k ≤ ,
2k−1
which gives
a + . . . + a k−1 a k−1 + . . . + a k 2k−1
1 2 2 +1 2
a1 · . . . · a2k ≤
2k−1 2k−1
or
1 (a1 + . . . + a2k−1 )(a2k−1 +1 + . . . + a2k )
(a1 · . . . · a2k ) 2k−1 ≤ .
2k−1 · 2k−1
The case k = 1 also gives:
1
(a1 + . . . + a2k−1 )(a2k−1 +1 + . . . + a2k ) ≤ (a1 + . . . + a2k )2 ,
4
548
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 569
implying
1 (a1 + . . . + a2k )2
(a1 · . . . · a2k ) 2k−1 ≤
2k · 2k
or
a1 + . . . + a2k
1
(a1 · . . . · a2k ) 2k ≤ .
2k
Now let n be any integer. Choose a k ∈ N such that 2k > n and introduce
n
1
aj := a := ak
n
k=1
for n < j < 2 . We may now apply the result for 2k when looking at
k
k
a + . . . + a + a + . . . + a 2k
1 n
a1 · . . . · an · a2 −n ≤ .
2k
aj
Now 2k ≤ a for every 1 ≤ j ≤ n and therefore we have
k k
−n
a1 · . . . · an · a2 ≤ a2
or
a1 · . . . · an ≤ an ,
15. First note that xn and an are defined by recursion.
Note that
1 c
xn := xn−1 + , n ∈ N, x0 = 1,
2 xn−1
and
c
an = , n ∈ N ∪ {0},
xn
implying that
1
(∗) xn = (xn−1 + an−1 ), n ∈ N.
2
All terms are non-negative and
x 2
n−1 + an−1
x2n = ≥ xn−1 an−1 .
2
hence
xn−1 an−1 xn−1 c c
xn ≥ = = = an .
xn xn xn−1 xn
Combining this with (∗) we find for n ∈ N
an ≤ xn+1 ≤ xn .
Therefore we deduce
xn
≥1
xn+1
and consequently
xn an
an+1 = ≥ an .
xn+1
Together we now have
an ≤ an+1 ≤ xn+1 ≤ xn .
549
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 570
A COURSE IN ANALYSIS
Chapter 4
1. a) By definition we have
A × B = {(x, y) | x ∈ A and y ∈ B}
and
B × A = {(x, y) | x ∈ B and y ∈ A}.
Therefore it follows that
A×B = {(3, 1), (4, 1), (5, 1), (6, 1), (3, 2), (4, 2), (5, 2), (6, 2), (3, 3), (4, 3), (5, 3), (6, 3)}
and
B×A = {(1, 3), (2, 3), (3, 3), (1, 4), (2, 4), (3, 4), (1, 5), (2, 5), (3, 5), (1, 6), (2, 6), (3, 6)}.
6 6
5 5
B×A
4 A×B 4
3 3
2 2
1 1
0 0
0 1 2 3 4 5 6 0 1 2 3 4 5
X × (Y ∪ Z) ={(1, 3), (1, 4), (1, 5), (1, 6), (1, 7), (2, 3), (2, 4), (2, 5), (2, 6), (2, 7), (3, 3),
(3, 4), (3, 5), (3, 6), (3, 7)}.
Finally, from
X × Z = {(1, 6), (2, 6), (3, 6), (1, 7), (2, 7), (3, 7)}
and
Y × Z = {(3, 6), (4, 6), (5, 6), (3, 7), (4, 7), (5, 7)}
we deduce
(X × Z) ∩ (Y × Z) = {(3, 6), (3, 7)}.
550
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 571
and
5
(Ij ×{j}) = ([1, 2]×{1})∪([2, 3]×{2})∪([3, 4]×{3})∪([4, 5]×{4})∪([5, 6]×{5}).
j=1
This gives:
6
5 6
5
({j} × Ij ) (Ij × {j})
j=1 j=1
5 5
4 4
3 3
2 2
1 1
0 0
0 1 2 3 4 5 6 0 1 2 3 4 5 6
551
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 572
A COURSE IN ANALYSIS
m − k = (m − n) + (n − k) = r1 p + r2 p = (r1 + r2 )p,
i.e. m ≡ k mod(p) implying the transitivity, and therefore we have proved that
m ≡ n mod(p) is an equivalence relation.
6. Again we have to prove that “∼” on Z × N is a reflexive, symmetric and transitive
relation. Now for (k, m) ∈ Z×N we see that km = mk implying (k, m) ∼ (m, k), i.e.
“∼” is reflexive. Also kn = lm is equivalent to lm = kn, i.e. (k, m) ∼ (l, n) if and
only if (l, n) ∼ (k, m) i.e. symmetry is proved. Further, suppose that (k, m) ∼ (l, n)
and (l, n) ∼ (p, q). It follows that kn = lm and lq = pn, implying lqkn = lmpn.
Now n ∈ N, hence n = 0 and therefore we find lqk = lmp. If l = 0, then it
follows that qk = mp or (k, m) ∼ (p, q). However l = 0 implies p = 0 and k = 0,
and therefore qk = 0 = mp, which proves the transitivity of “∼”, i.e. this is an
equivalence relation on Z × N.
7. a) Since φ is the only subset of φ we find that P(φ) = {φ}. Note that P(φ) = φ,
the set {φ} contains one element, the set φ.
b) We have
P({1, 2, 3}) = {φ, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3}}.
N
N
N N
= 1k 1N −k = (1 + 1)N = 2N .
k k
k=0 k=0
552
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 573
subsets with k elements of a set with N elements is νN,k = N
k . We want to find
νN +1,k . Two cases are trivial:
N +1 N +1
νN +1,0 = 1 = and νN +1,N +1 = 1 = .
0 N +1
but we are confined to real numbers, hence for 1 > x we have two solutions, for
x = 1 we have one solution and for x < 1 we have no solution. Therefore we cannot
define a mapping on R which maps x to the solution of y 2 − 2y + x = 0.
k
m
10. Let p(x) = aj xj and q(x) = l=0 bl x
l
and suppose that k ≤ m. Define for
j=0
m
j = k + 1, . . . , m the coefficients aj := 0 to get p(x) = aj xj . Now we define
j=0
m
m
m
p(x) + q(x) = aj xj + bj xj = (aj + bj )xj
j=0 j=0 j=1
553
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 574
A COURSE IN ANALYSIS
b) Since for all j ∈ N we have |x|2j = (x2 )j = x2j , it follows that f and p have
the same domain, namely R, and on R they coincide:
n
n
p(x) = a2j x2j = a2j |x|2j .
j=0 j=0
554
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 575
555
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 576
A COURSE IN ANALYSIS
4 y = x2 + 1
3
y>1
1 y=1
y<1
• • •
−2 −1 0 1 2
√ √
− y−1 y−1
3 •
−b
• 2
−a
1
h−1 ((a, b))
| |
−6 −5 −4 −3 −2 −1 1 2 3 4
A B −1
−2
It suggests that h−1 ((a, b)) is the interval (A, B) with A given by h(A) = a and B by
h(B) = b which implies A = 2a−6 and B = 2b−6. Thus h−1 ((a, b)) = (2a−6, 2b−6).
(ii)
a) Using previous knowledge we find
$ % $ % $ %
1 √ 1 1
f ,9 = y ∈ R | y = x, x ∈ ,9 = ,3 .
4 4 2
b) We have
1 8 5
g({1, 2, 3, 4}) = {g(1), g(2), g(3), g(4)} = 0, , , .
2 11 6
c) We have
h(N) = {y ∈ R | y = 2n , n ∈ N}.
556
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 577
Chapter 5
|x − 3| + 2 = |y − 3| + 2,
or
|x − 3| = |y − 3|.
|x − 3| = |3 + a − 3| = |a| = |3 − a − 3| = |y − 3|,
−1 0 1 2 3 4 5 6
2 2
= or 2y = 2x.
x y
This implies that x = y and therefore f2 is injective. Now let b ∈ (0, 2] and consider
the equation
2
b = f2 (x) = .
x
This equation has the unique solution x = 2b and for 0 < b ≤ 2 it follows that
1 ≤ x < ∞. Thus f2 is surjective and with the previous result it follows that f2 is
bijective.
The graph of f2 is as follows:
557
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 578
A COURSE IN ANALYSIS
0
0 1 2 3 4 5
implying x = y, i.e. f3 is injective. Now let b ∈ [0, 3] and consider the equation
√
x + 2 = b or x + 2 = b2 .
−2 −1 0 1 2 3 4 5 6 7
2. a) For p1 = 2 and q1 = 3 we have g pq11 = p1 + q1 = 5. Further for p2 = 4 and
q2 = 1 we have g pq22 = p2 + q2 = 5, and therefore g is not injective and therefore
it is not bijective either. Clearly,
g is surjective: given k ∈ Z then we take p = k − 1
p
and q = 1 to find g k−1 1 = g q = p + q = k − 1 + 1 = k.
b) Note that r maps pairs of real numbers into pairs of real numbers. Let
(x1 , y1 ) and (x2 , y2 ) be two pairs of real numbers and suppose that
The equality (y1 , x1 ) = (y2 , x2 ) means that y1 = y2 and x1 = x2 , hence the pairs
(x1 , y1 ) and (x2 , y2 ) are equal implying the injectivity of r. The surjectivity of r is
straightforward: given the pair (a, b) ∈ R × R then for (x, y) := (b, a) it follows that
558
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 579
3. a) Not taking into account domain and range (co-domain) problems, formally
(f ◦ g)(x) is given by
(f ◦ g)(x) = f (g(x))
= 5(g(x))2 − 2(g(x)) + 1
√ 2 √
=5 5+x −2 5+x+1
√
= 5x − 2 5 + x + 26.
Since g is defined for all x ≥ −5 and since f is defined for all real √
numbers it follows
that f ◦ g is defined on [5, ∞) and therefore (f ◦ g)(x) = 5x − 2 5 + x + 26 holds
for x ∈ [−5, ∞).
b) For D1 = R we can define f ◦ h by
(f ◦ h)(x) = f x4 + 2 = | x4 + 2 + 3| − 2.
implies f1 (x) = f1 (y) by the injectivity of f2 , and now the injectivity of f1 implies
x = y, i.e. f2 ◦ f1 is injective.
Now let g1 : D1 −→ F1 and g2 : D2 −→ F2 be two surjective mappings such that
g1 (D1 ) = D2 (= F1 ). Then g2 ◦ g1 : D1 −→ F2 is defined and for b ∈ F2 the
surjectivity of g2 implies the existence of a ∈ D2 such that g2 (a) = b. Since g1 is
surjective we know that g1 (D1 ) = F1 = D2 , thus given a ∈ D2 = F1 we find x ∈ D1
such that g1 (x) = a implying that
i.e. g2 ◦ g1 is surjective.
Now if f1 : D1 −→ F1 and f2 : D2 −→ F2 are injective and surjective, i.e. bijective,
and if f1 (D1 ) = D2 (= F1 ), then f2 ◦ f1 is also injective and surjective, i.e. it is
bijective.
559
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 580
A COURSE IN ANALYSIS
5. Since all mappings belonging to Aut(X) are bijective their composition is always
defined.
(i) Since the composition of mappings is associative the statement (f ◦ g) ◦ h =
f ◦ (g ◦ h) follows immediately.
(ii) Clearly f ◦g maps X to X. We need to prove that it is bijective. By Problem
4 we know however that the composition of bijective mappings is bijective, hence
f ◦ g ∈ Aut(X).
(iii) The map idX : X −→ X, x → idX (x) = x, belongs to Aut(X) and the
following holds for f ∈ Aut(X):
(f ◦ idX )(x) = f (idX (x)) = f (x) = idX (f (x)) = (idX ◦ f )(x).
560
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 581
8. First note that the range of h is a subset of R+ \ {0}, and therefore we can define
f ◦ h and g ◦ h on a suitable domain, namely the domain of h. By definition we
have
(f + g) ◦ h = f ◦ h + g ◦ h,
(f · g) ◦ h = (f ◦ h) · (g ◦ h),
1 1
◦h= ,
g g◦h
1 1 1
◦ h (x) = =
g g(h(x)) h(x) + |h(x) − 2|
1
=√ .
x2 + 2 + x2
|a| + a |a| − a
≥ 0 and ≥ 0.
2 2
Indeed, if a ≥ 0 then |a| = a and
|a| + a 2a
= =a≥0
2 2
561
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 582
A COURSE IN ANALYSIS
and
|a| − a a−a
= = 0.
2 2
But if a < 0 then |a| = −a and we find
|a| + a −a + a
= =0
2 2
as well as
|a| − a −2a
= = −a > 0 since a < 0.
2 2
Since for x ∈ X by definition we have f (x) ∈ R it follows that
|f (x)| + f (x) |f (x)| − f (x)
f + (x) = ≥0 and f − (x) = ≥ 0.
2 2
We call f + the positive part and f − the negative part of f . Note that the negative
part of f is a non-negative function.
Now it follows that
|f (x)| + f (x) |f (x)| − f (x)
f + (x) − f − (x) = − = f (x)
2 2
and
|f (x)| + f (x) |f (x)| − f (x)
f + (x) + f − (x) = + = |f (x)|.
2 2
10. a) We need to solve the equation
1
y= ;
1 + x2
or 1 + x2 = y1 , i.e. x2 = y1 − 1.
Since y ∈ (0, 1] it follows that y1 − 1 ≥ 0. Hence
' '
1 1−y
x= −1= .
y y
Thus we find the inverse function of f1 to be:
f1−1 : (0, 1] −→ [0, ∞)
'
1−y
y → .
y
b) We first sketch f2 :
3
0
0 1 2 3 4 5 6 7
562
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 583
f2 (x) = y.
For 0 ≤ x ≤ 1 we find
−x + 2 = y, i.e. x = 2 − y.
(x, y)
•
(0, 0)
−1 x = pr1 ((x, y)) 1
−1
563
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 584
A COURSE IN ANALYSIS
" (x, y) ∈ B
For
1
(0) we find that pr1 ((x,
# y)) = x. Denote the set
2 2
(x0 , y) | − 1 − x0 ≤ y ≤ 1 − x0 by A(x0 ) for x0 ∈ [−1, 1]. Then we find
pr1 (A(x0 )) = x0 .
Now for the circle S 1 = {(x, y) ∈ R2 | x2 + y 2 = 1} we find again that pr1 ((x, y)) =
x, see the following figure:
• (x, y)
For x0 ∈ [−1, 1] and(x0 , y) ∈ S 1 we find with y = ± 1 − x20 that only the
points x0 , ± 1 − x20 are mapped to x0 by pr1 . In both cases we have however
pr1 (B1 (0)) = pr1 (S 1 ) = [−1, 1].
b) We may rewrite R(g) as
R(g) = {(x, y) | x ∈ [0, 1] and g(x) = x2 + 1}
= {(x, x2 + 1) | x ∈ [0, 1]}.
This implies that
pr2 (R(g)) = {x2 + 1 | x ∈ [0, 1]}
= [1, 2],
i.e. we are dealing with the following situation:
2 pr2
pr2 (R(g))
R(g)
1
0
0 1 2
564
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 585
12. First we look at pr1 : X × Y −→ X, (x, y) → x. Now, by the very definition of the
pre-image we have for A ⊂ X
Clearly j is not injective but j(N) is countable. Indeed we know that {2k | k ∈ N}
is countable and the union of a countable set with a finite set is again countable.
14. We have to prove that ‘∼’ is symmetric, reflexive and transitive.
If f, g ∈ M (D; R) and f ∼ g then there exists a finite set Af,g = {x1 , . . . , xm } ⊂ D
such that f (x) = g(x) for x ∈ D \ Af,g . But for x ∈ D \ Af,g we also have
g(x) = f (x), i.e. f ∼ g implies that g ∼ f and ‘∼’ is symmetric. Since f (x) = f (x)
for all x ∈ D and by definition the empty set is finite it follows with Af,f = φ that
f (x) = f (x) for all x ∈ D \ Af,f , i.e. ‘∼’ is reflexive.
Finally, if f, g, h ∈ M (D; R) and f ∼ g as well as g ∼ h, we find sets Af,g and Ag,h
such that
i.e. f (x) = h(x) for x ∈ D \ Af,h implying the transitivity of ‘∼’. Therefore it
follows that ‘∼’ is an equivalence relation.
15. The mapping J is injective: if ((x, y), z) = ((x , y ), z ) then either z = z or (x, y) =
(x , y ). Hence at least one of the statements z = z , x = x , y = y is true which
implies that (x, y, z) = (x , y , z ). The mapping J is surjective: given (x, y, z) ∈
X × Y × Z, then ((x, y), z) ∈ (X × Y ) × Z and J(((x, y), z)) = (x, y, z). Hence J is
bijective.
565
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 586
A COURSE IN ANALYSIS
Chapter 6
1. Firstly, a general remark: in order to calculate limits using (6.18)−(6.20) we assume
that all the relevant assumptions hold. However, while doing these calculations it
is important that we can justify that all steps are correct.
a)
lim3 53 x2 − 12
7
x = lim3 53 x2 − lim3 127
x
x→ 4 (6.18) x→ 4 x→ 4
5 2 7
= lim 3 lim x − lim 12 lim x
(6.19) x→ 34 x→ 34 x→ 34 x→ 34
2
5 3 7 3 5 9 7 3 5·3 7 1
= · − · = · − · = − = .
3 4 12 4 3 16 12 4 16 16 2
1 − x2 (1 − x)(1 + x)
= =1+x
1−x 1−x
and therefore
1 − x2
lim = lim (1 + x) = lim 1 + lim x = 2.
x→1 1−x x→1 (6.18) x→1 x→1
c)
x3 − 4x2 + 7x − 13
lim
x→3 − 75 x2 + 1+x
1
2
27 − 36 + 21 − 13
=
− 635 + 10
1
566
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 587
2. The remark made at the beginning of the solution of Problem 1 also applies here.
a)
x2 − 2x + 5 lim (x2 − 2x + 5)
x→4
lim =
x→4 x−2 lim (x − 2)
x→4
lim x2 − lim 2x + lim 5
x→4 x→4 x→4
=
lim x − lim 2
x→4 x→4
16 − 8 + 5 13
= = ,
4−2 2
and we need to note that lim (x − 2) = 2 = 0.
x→4
b)
x2 − 9 (x − 3)(x + 3)
lim = lim
x→−3 (x + 5)(x + 3) x→−3 (x + 5)(x + 3)
lim (x − 3)
x−3 x→−3
= lim =
x→−3 x + 5 lim (x + 5)
x→−3
−6
= = −3.
2
x2 −9 x−3
We need to note that for x = −3 we have (x+5)(x+3) = x+5 , and that lim (x+ 5) =
x→−3
2 = 0.
3. For x = 3 we have
lim f (x) = lim (x3 − 22) = 27 − 22 = 5
x→3 x→3
and since 5 = lim f (x) = f (3) = 17, it follows that f is not continuous at x = 3.
x→3
567
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 588
A COURSE IN ANALYSIS
and applying part a) in particular that lim |x| = 0 it follows that lim f (x) = 0.
x→0 x→0
f (x) − f (x0 )
Recall f (x) = lim . Therefore for the limit we now find
x→x0 x − x0
f (x) − f − 21 f (x) − f − 12
lim 1 1 = lim
x→− 2 x − −2 x→− 12 x + 12
3 1 3 1 1 3 3
= lim 1 x− = − − = (−1) = −
x→− 2 4 2 4 2 2 4 4
thus f − 12 = − 43 .
1, x ∈ [0, 1]
χ[0,1] (x) =
0, x∈/ [0, 1].
568
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 589
•
1− •
) |(
1
Now, for x0 < 0 we find for x ∈ R such that |x − x0 | < δ and δ < |x0 |, in particular
x < x0 + δ < 0, that
implying χ[0,1] (x0 ) = 0. In a similar way we find that χ[0,1] is differentiable for
0 < x0 < 1: for x close to x0 and x ∈ (0, 1) we find
which gives χ[0,1] (x0 ) = 0. Moreover, for x0 > 1 and 1 < x it follows once again
that
χ[0,1] (x) − χ[0,1] (x0 )
=0
x − x0
hence χ[0,1] (x0 ) = 0.
Before we investigate the case x0 = 0 or x0 = 1, we make the following observation:
in order for
χ[0,1] (x) − χ[0,1] (x0 )
lim
x→x0 x − x0
to exist it is necessary that for all 0 < δ ≤ δ0 the function
569
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 590
A COURSE IN ANALYSIS
for some a ∈ R. Then for = 1 there exists δ̃ > 0 such that 0 < |x − x0 | < δ̃ implies
χ[0,1] (x) − χ[0,1] (x0 )
− a < 1.
x − x0
or
χ[0,1] (x) − χ[0,1] (x0 )
< 1 + |a|.
x − x0
570
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 591
0
0 1 2 3
|x + 2 − a| ≥ |x + 2| − |a|.
571
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 592
A COURSE IN ANALYSIS
or
3
< |a|
x+
2
but x > 1, implies |a| > 32 which is a contradiction. Thus g is not differentiable at
x0 = 2.
However g is continuous at x0 = 2. For this we need to prove that for every > 0
there exists δ > 0 such that 0 < |x − 2| < δ implies |g(x) − 1| < . Now for
−δ + 2 < x < 2 we have |g(x) − 1| = 0, hence every δ > 0 will work. Whereas for
2 < x < δ + 2 we find
|g(x) − 1| = |x2 − 3 − 1| = |x2 − 4| = |x + 2||x − 2|
and since we may assume without loss of generality that δ < 1 we find |x−2| < δ < 1
implies |x| ≤ 3 and therefore
|g(x) − 1| = |x + 2||x − 2| ≤ 4|x − 2|.
Thus for δ = 4 we find 0 < |x − 2| < δ implies
|g(x) − 1| ≤ 4|x − 2| < 4 · δ = 4 · =
4
proving the continuity of g at x0 = 2.
8. In the following we make use of (6.36), (6.37), (6.38), (6.40) and (6.42).
a)
d d 7 2 2
f (x) = x − 3
dx dx 5 x
7 1 14 6
= 2 · x − 2(−3) 4 = x+ 4.
5 x 5 x
b)
d t7 + 12t3 − 2 d 7 3 1
= (t + 12t − 2) ·
dt t5 dt t5
(6.38) d 1 d 1
= (t7 + 12t3 − 2) 5 + (t7 + 12t3 − 2)
dt t dt t5
1 1
= (7t6 + 12 · 3t2 ) 5 + (t7 + 12t3 − 2) −5 · 6
t t
24 10
= 2t − 3 + 6 .
t t
c)
⎛ ⎞
M
d d ⎝ −j ⎠
h(s) = js
ds ds j=1
M
M
d −j
= j (s ) = (−j 2 )s−j−1 .
j=1
ds j=1
572
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 593
9. The proof that χR+ is not differentiable at x0 = 0 follows in the same way as the
proof that χ[0,1] is not differentiable at x0 = 0, see Problem 6.
In order to investigate the differentiability of h : R −→ R, x → x2 f (x) = x2 χR+ (x)
we must consider the limit
h(x) − h(0)
lim .
x→0 x−0
Note that
h(x) − h(0) x2 χR+ (x) − 0
=
x−0 x−0
x2 χR+ (x)
= ≤ |x|.
|x|
Therefore, given > 0 we find for δ = that
0 < |x − 0| = |x| < δ implies
h(x) − h(0) h(x) − h(0)
− =
x−0 0 x − 0 ≤ |x| < δ = ,
which gives
d d n√ √ 1
hk (x) = (x x) = nxn−1 x + xn √
dx dx 2 x
1 n− 12 k k −1 k √ k−2
= n+ x = x2 = x .
2 2 2
Thus we have for all k ∈ N
d√ k k √ k−2
x = x .
dx 2
2. i)
d k k
f (x) = − (1 + x2 )− 2 −1 (2x)
dx 2
−k−2 −kx
= −kx(1 + x2 ) 2 = k+2 ;
(1 + x2 ) 2
573
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 594
A COURSE IN ANALYSIS
ii)
d 1 d 1
g(y) = ·
dy 2 1+ 1 dy y 4
y4
1 1
= −4 5
2 1+ 1 y
y4
2 −2
=− = ;
y5 1 + 1 y3 y4 + 1
y4
iii)
'
d z4 d z2
2
= √
dz 1+z dz 1 + z 2
2z d 1
= √ + z 2 (1 + z 2 )− 2
1 + z2 dz
2z 2 1 2 − 32
= √ + z − 2z(1 + z )
1 + z2 2
2 3
2z(1 + z ) − z z 3 + 2z
= 3 = 3 .
(1 + z 2 ) 2 (1 + z 2 ) 2
574
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 595
4. For f −1 we have
d −1 1
(f )(y) = −1
dy f (f (y)).
1
Since f (x) = kxk−1 and f −1 (y) = y k , we find
d −1 1 1
(f )(y) = −1 k−1
= k−1
dy k(f (y)) ky k
1 1
= y k −1 .
k
2 √ 1
Note that k = 21 we know that the inverse of x → x , x > 0 is y →
1for y = y2
d
and dy y 2 = 12 y − 2 = 2√
1
y , as we already know.
d 1 d √ 1 1 1 1−k 1 1
xk = k
x = x k −1 = x k = √ .
dx dx k k k k xk−1
i)
d d 1
f (s) = (1 + s2 ) k
ds ds
1 1 2s 1
= 2s (1 + s2 ) k −1 = (1 + s2 ) k −1
k k
2s
= 1 .
k(1 + s2 )1− k
575
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 596
A COURSE IN ANALYSIS
ii)
√
d d 1 + t4
g(t) = √
dt dt 5 1 + t6 + t8
d √ √ √ d 5
√
4 ( 5 1 + t6 + t 8 ) − 1 + t4 dt ( 1 + t6 + t 8 )
dt 1 + t
= √
( 5 1 + t6 + t8 )2
1 1
2t3 (1+t6 +t8 ) 5 (1+t4 ) 2 (6t5 +8t7 )
1 − 4
(1+t4 ) 2 5(1+t6 +t8 ) 5
= 2
(1 + t6 + t8 ) 5
iii)
⎛ ⎞ 1
1 1
d ⎝ u7 ⎠ d u 7 (1 + u4 ) 2
= 1
du 1+u2 du (1 + u2 ) 2
1+u4
d 1 1 1 1
d 1 1
du (u (1
7 + u4 ) 2 )(1 + u2 ) 2 − u 7 (1 + u4 ) 2 du (1 + u2 ) 2
=
1 + u2
1 − 67 4 12 1 1 1 1 1 1
7u (1 + u ) + u 7 2u3 (1 + u4 )− 2 (1 + u2 ) 2 − u 7 (1 + u4 ) 2 u(1 + u2 )− 2
=
1 + u2
(1 + u4 + 14u4 )(1 + u2 ) − 7u2 (1 + u4 )
= 6 1 3
7u 7 (1 + u4 ) 2 (1 + u2 ) 2
8u + 15u4 − 6u2 + 1
6
= 6 1 3 .
7u 7 (1 + u4 ) 2 (1 + u2 ) 2
576
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 597
ii)
3
d d (1 + s2 )− 2
g(s) =
ds ds (1 + s4 )5
d 3
= ((1 + s2 )− 2 (1 + s4 )−5 )
ds
3 5 3
= − · 2s(1 + s2 )− 2 (1 + s4 )−5 + (1 + s2 )− 2 (−5 · 4s3 (1 + s4 )−6 )
2
5
= (1 + s2 )− 2 (1 + s4 )−6 (−3s(1 + s4 ) + (1 + s2 )(−20s3 ))
−(20s5 + 3s4 + 20s3 + 3s)
= 5 .
(1 + s2 ) 2 (1 + s4 )6
7. A straightforward calculation using the chain rule and then the quotient rule gives
&
d p(x) 1 d p(x)
−2= −2
dx q(x) 2 p(x) − 2 dx q(x) q(x)
2t(2t + 3) + t2 − 1 5t2 + 6t − 1
= = .
3 3
2 (t2 − 1)(2t + 3) 2 2 (t2 − 1)(2t + 3) 2
9. Since (h ◦ f )−1 = f −1 ◦ h−1 we have to apply the chain rule to f −1 ◦ h−1 , thus
−1
d −1 d −1 dh
(f ◦ h−1 )(z) = f −1
(h (z)) (z)
dz dy dz
577
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 598
A COURSE IN ANALYSIS
with f (x) = y, h(y) = z, i.e. z = h(f (x)). Now using (7.7) we find
−1
dh 1 1
(z) = = −1
dz h (y) h (h (z))
and further
d −1 1 1
f (h−1 (z)) = −1 −1 = ,
dy f (f (h (z)) f ((h ◦ f )−1 (z))
which gives
d 1 1
((h ◦ f )−1 )(z) = .
dz f ((h ◦ f )−1 (z)) h (h−1 (z))
m
10. First note that d
dx p(x) = kak xk−1 . Now using the chain rule we find
k=1
i)
d
p(u(x)) = p (u(x))u (x)
dx
m
= u (x) kak (u(x))k−1 ;
k=1
ii)
d
u(p(x)) = u (p(x))p (x)
dx m m
k
=u ak x kak xk−1 ;
k=0 k=1
iii)
d 1 −1 d
= u(p(x))
dx u(p(x)) u(p(x))2 dx
m
k
u ak x m
−u (p(x)) k=0
= p (x) = − 2 · kak xk−1 .
u(p(x))2 m
k=1
u ak xk
k=0
Chapter 8
1. a) This follows straightforward from the definition of the composition of map-
pings and the boundedness of g. For x ∈ D1 set y := f (x) and observe
578
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 599
579
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 600
A COURSE IN ANALYSIS
d2 4 d 1
t + 1 = (2t3 (t4 + 1)− 2 )
dt2 dt
1 1 3
= 6t2 (t4 + 1)− 2 + 2t3 − · 4t3 (t4 + 1)− 2
2
6t2 (t4 + 1) − 4t6 3
= 3 = (2t6 + 6t2 )(t4 + 1)− 2
4
(t + 1) 2
580
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 601
|s|5
In order to find the second derivative of h(s) := s2 +4 we need to find the limit
h (s) − h (0)
lim .
s→0 s−0
Now for s ≥ 0 we have
h (s) − h (0) 3s5 + 20s3
=
s−0 (s2 + 4)2
581
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 602
A COURSE IN ANALYSIS
implying that
h (s) − h (0)
lim = 0,
s→0 s−0
|s|5
thus s → s2 +4 has a second derivative at s = 0 and this second derivative at 0 is 0.
4. With f (x) = u2 (x) + 1 and g(x) = (v 2 (x) + 1)−1 we find
d2 2 2 −1
d2
(u (x) + 1)(v (x) + 1) = (f (x)g(x))
dx2 dx2
= f (x)g(x) + 2f (x)g(x) + f (x)g (x).
Next we note
f (x) = 2u (x)u(x) a f (x) = 2u (x)u(x) + 2(u (x))2
as well as
2v (x)v(x)
g (x) = − = −2v (x)v(x)(v 2 (x) + 1)−2
(v 2 (x) + 1)2
and
582
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 603
where
Q(u, v)(x) =(2u (x)u(x) + 2(u (x))2 )(v 2 (x) + 1)2
− 8u (x)v (x)u(x)v(x)(v 2 (x) + 1)
+ (u2 (x) + 1)(−2v (x)v 3 (x) − 2v (x)v(x) − 10v (x)2 v(x)2 − 2v (x)2 .
1
= 2
x4 2(1+x2 )
1+x2 + 1+x2
(1 + x2 )2 x4 + 2x2 + 1
= =
x4 + 2x2 + 2 x4 + 2x2 + 2
1
=1− 4 .
x + 2x2 + 2
583
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 604
A COURSE IN ANALYSIS
7. We prove
dn 1 pn (x)
=
dxn 1 + x2 (1 + x2 )n+1
by induction. For n = 0 we have p0 (x) = 1. Now we calculate
dn+1 1 d pn (x)
=
dxn+1 1 + x2 dx (1 + x2 )n+1
with
pn+1 (x) = pn (x)(1 + x2 ) − 2(n + 1)xpn (x).
The degree of pn (x) is at most n and that of pn (x) is at most n − 1, therefore the
degree of pn+1 (x) is at most n + 1.
Now the estimate follows using Problem 2
n
d 1 pn (x) cn (1 + |x2 |) n2
=
dxn 1 + x2 (1 + x2 )n+1 ≤ (1 + x2 )n+1
cn
= n+2 .
(1 + x2 ) 2
584
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 605
8. a) By the definition of the absolute value, we know that |x3 | ≥ 0 for all x ∈ R
and |x|3 = 0 if and only if x = 0 implying that f (x) = |x|3 has a local minimum at
x0 = 0.
(Note that we did not use differential calculus as it is not necessary or helpful here.)
b) We first find g (s):
therefore
g (s) = (2s − 2)(2 + 3s2 )−1 + (s2 − 2s)(−1(2 + 3s2 )−2 6s)
(2s − 2)(2 + 3s2 ) − 6s(s2 − 2s)
=
(2 + 3s2 )2
2
6s + 4s − 4
= .
(2 + 3s2 )2
2(3s2 + 2s − 2) = 0,
3s2 + 2s − 2 = 0
which gives
1 1√
s1,2 = − ± 7.
3 3
In order to decide whether we have a local extreme value at s1 or s2 , and when we
do in order to find what type it is we make use of g (s).
d
g (s) = ((6s2 + 4s − 4)(2 + 3s2 )−2 )
ds
= (12s + 4)(2 + 3s2 )−2 + (6s2 + 4s − 4)(−2(2 + 3s2 )−3 (6s))
(12x + 4)(2 + 3x2 ) − 12x(6x2 + 4x − 4)
=
(2 + 3x2 )3
−36s3 − 36s2 − 24s + 8
= .
(2 + 3s2 )3
585
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 606
A COURSE IN ANALYSIS
while
1 1√ 8 2 22 10 √
s2 = − − 7, x22 = + , s32 = − − 7.
3 3 9 9 27 27
Therefore we find
−2u2 − u + 1
h (u) = √
1 − u2
which has zeroes for u1 = 12 and u2 = −1, but −1 ∈ / (−1, 1). Thus h may only have
a local extreme value at u1 = 12 .
Now
2u3 − 3u − 1
h (u) = 3
(1 − u2 ) 2
1
and for u0 = 2 we find
1 2 · 13 − 3 · 12 − 1 −9
h = 2 = 3 < 0.
2 1 2 32
1− 2 4 34 2
586
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 607
1
9. a) We may take, for example f : (−1, 1) −→ R, f (t) = 1+t . It follows that
1
(f ◦ g)(x) = 1+x 2 and we find immediately that
1 1
1 + x2 = 1 + x2 ≤ 1 for all x ∈ (−1, 1).
i.e.
h(x0 + c) ≥ h(y) for all y ∈ (− + x0 + c, x0 + c + ),
and with y0 := x0 + c ∈ (− + x0 + c, x0 + c + ) we have
|f (x) − f (y)| ≤ M |x − y|
where |f (z)| ≤ M for all z, f : [x, y] → R.
587
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 608
A COURSE IN ANALYSIS
or
'
1 11 1
− +1≤ ≤1+ ,
20 10 20
i.e.
'
19 11 21
≤ ≤ .
20 10 20
χn :R → R
x → χn (x) = χ[n,∞) (x)
Indeed,
if x < y < n then χn (x) = χ(y),
if x < n ≤ y then χn (x) = 0 < 1 = χn (y),
if n ≤ x < y then χn (x) = χn (y).
Since the sum of increasing functions is increasing (g(x) ≤ g(y) and f (x) ≤ f (y)
implies g(x) + f (x) ≤ g(y) + f (y)), it follows that XN is increasing.
[
6−
5− [ )
4− [ )
3− [ )
2− [ )
1 −[ )
| | | | | | |
0 1 2 3 4 5 6 7
588
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 609
5
1≤x<2: χn (x) = 2
n=0
5
2≤x<3: χn (x) = 3
n=0
5
3≤x<4: χn (x) = 4
n=0
5
4≤x<5: χn (x) = 5
n=0
5
5≤x : χn (x) = 6.
n=0
d
f (x) = (x(1 + ax2 )−1 )
dx
= (1 + ax2 )−1 + x(−1(1 + ax2 ))−2 (2ax))
1 + ax2 − 2ax2 1 − ax2
= 2 2
= .
(1 + ax ) (1 + ax2 )2
Since (1 + ax2 )2 > 0 for all x ∈ R we only need to look at 1 − ax2 . For x < − √1a
or x > √1a , it follows that 1 − ax2 < 0, hence in ( √1a , ∞) the function is strictly
589
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 610
A COURSE IN ANALYSIS
decreasing. For x ∈ 0, √1a we have 1 − ax2 > 0 and therefore in 0, √1a the
function is strictly increasing.
12. a) Let t1 , t2 ∈ (c, d) such that t1 < t2 . Since g is increasing it follows that
g(t1 ) ≤ g(t2 ). Now the fact that f is also increasing gives f (g(t1 )) ≤ f (g(t2 )), i.e.
f ◦ g is increasing.
b) Using the chain rule we find
and
(g ◦ f ) (x) = g (f (x))f (x).
In both cases, if f and g are positive functions, or if f and g are negative functions
we have that (f ◦ g) and (g ◦ f ) are non-negative functions, hence increasing.
13. By the mean value theorem applied to h = g − f there exists ξ ∈ (a, b) such that
However h(a) = g(a) − f (a) = 0 and therefore g(x) − f (x) = h(x) > 0 for all
x ∈ (a, b), i.e. f (x) < g(x) for all x ∈ (a, b).
Chapter 9
1. a) Recall that lim f (x) = ∞ means that for all M > 0 there exists N ∈ N
x→∞
such that x > N implies f (x) > M . Given M > 0 we have to√find N ∈ N such that
x > N implies√x5 − 5 > M , or x5 > M + 5. Hence for N := [ M + 5] + 1 it follows
for x > N = [ M + 5] + 1 that
√ 2
x2 − 5 > [ M + 5] + 1 − 5
√
= [M + 5] + 2 M + 5 − 4
√
= M + 4 + 2 M + 5 − 4 > M.
which is correct for say x > 1. Now for 0 ≤ l < k there exists Nl such that for
x > Nl
al l−k 1
x < .
ak 2k
Indeed, this is equivalent to
al
x k−l
> 2k
ak
590
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 611
for x > Nl , and this follows from Example 9.10. Therefore we see for Nk :=
max{N0 , . . . , Nk−1 } that x > Nk implies
k−1 k−1
al l−k al
1+ x ≥1− xl−k
ak ak
l=0 l=0
1 1
≥1−k· = .
2k 2
This now implies for x > Nk
ak k
p(x) ≥ x .
2
Again using Example 9.10 we deduce that given M > 0 there exists Ñ ∈ N such
that x > Ñ it follows that a2k xk > M . Hence for N = max{Ñ , Nk } it follows that
x > N implies p(x) > M or
lim p(x) = ∞.
x→∞
c) Note that
1 + a + ax2 1 a(1 + x2 ) 1
2
= 2
+ = + a,
1+x 1+x 1 + x2 1 + x2
therefore for > 0 we have to find N ∈ N such that x > N implies that
1 + a + ax2 1 1
− a = + a − a = < .
1 + x2 1 + x2 1 + x2
We can now continue as in Example 9.9 and take N = N () = 1 + 1, and see for
x > N () that
1 + a + ax2 1 1 1
=
1 + x2 − a 1 + x2 < x < 1 + 1 < .
591
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 612
A COURSE IN ANALYSIS
as well as
a0 = exp(0lna) = exp(0) = 1.
c) First we find
d 1 d 1 1
exp − = − (exp ) −
dt 1 + t2 dt 1 + t2 1 + t2
2t 1
= exp −
(1 + t2 )2 1 + t2
dn −x2 2
e = pn (x)e−x
dxn
592
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 613
dn+1 −x2 d 2
n+1
e = pn (x)e−x
dx dx
2 2
= pn (x)e−x − 2xpn (x)e−x
2 2
= (pn (x) − 2xpn (x))e−x = pn+1 (x)e−x .
The polynomial pn+1 (x) := pn (x) − 2xpn (x) has degree at most n + 1 since the
degree of pn (x) is at most n − 1 and that of −2xp(x) is at most n + 1.
d d 4
4 2
ln( s + 1 − s ) = ( s + 1 − s ) (ln )( s4 + 1 − s2 )
2
ds ds
√
2s3 − 2s s4 + 1 1
= √ √
s4 + 1 s4 + 1 − s2
√
2s3 − 2s s4 + 1
= √ .
s4 − s2 s4 + 1 + 1
593
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 614
A COURSE IN ANALYSIS
x 1 ax
lim = lim
x→∞ exp(ax) a x→∞ exp(ax)
1 y
= lim = ∞.
a y→∞ exp(y)
x
Here we used the fact that x → ∞ if and only if y = a → ∞.
b) For n ∈ N we have
ax ax
exp(ax) = exp + ···+
n
ax n
ax
= exp · . . . · exp (n-terms)
n n
and therefore it follows that
xn x x
lim = lim ·... ·
x→∞ exp(ax) x→∞ exp ax
n exp axn
x x
= lim · . . . · lim = ∞.
x→∞ exp ax
n
x→∞ exp ax
n
The following is important to note: we have not yet proved that if lim f (x) = ∞
x→∞
and lim g(x) = ∞ then it follows that lim (f (x)g(x)) = lim f (x) lim g(x) =
x→∞ x→∞ x→∞ x→∞
∞. Suppose that lim f (x) = ∞ and lim g(x) = ∞. Given M > 0 there exists
x→∞ x→∞√ √
N such that for x > N we have f (x)√> √ M and g(x) > M . Therefore for
x > N it follows that f (x) · g(x) > M M = M , i.e. we have proved that
lim f (x)g(x) = ∞. Finally we use the convention that (+∞) · (+∞) = +∞.
x→∞
8. Firstly we can use the considerations of Problem 1 b). Thus we first write for x = 0
m−1
bk
m k−m
p(x) = bm x 1+ x .
bm
k=0
If m is even we find further for K = lnM , K > 0, i.e. M > 1 given there exists
N ∈ N such that x > N implies
m−1
bk
m k−m k−m
p(−x) = bm x 1+ (−1) x
bm
k=0
≥ lnM.
exp(p(−x)) ≥ M
594
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 615
lim exp(xm ) = 0
x→−∞
0 ≤ exp(p(x)) ≤ expcxm
and therefore
0 ≤ lim exp(p(x)) ≤ lim exp(cxm ) = 0,
x→−∞ x→−∞
i.e.
lim exp p(x) = 0.
x→−∞
595
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 616
A COURSE IN ANALYSIS
1 x+y
(xy) 2 ≤ .
2
This estimate is equivalent to
or
2xy ≤ x2 + y 2
596
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 617
but
1 1 lnx + lny
ln(xy) 2 = ln(xy) = .
2 2
A function satisfying g x+y
2 ≤ g(x)+g(y)
2 is called convex in the sense of J.
Jensen or mid-point convex.
b) The mean value theorem gives
1 1
≤ lnx − lny ≤ .
x y
v
11. The logarithmic derivative of v is given by v. Thus we have
v (x)
= 1, v(0) = 1,
v(x)
1. a) For x ∈ R we have
597
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 618
A COURSE IN ANALYSIS
3. a) This limit does not exist. Suppose that it does and that it is equal to a, i.e.
for all > 0 there exists N () ∈ N such that x > N () implies that | sin x − a| < .
598
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 619
π 1√ π 2
cos − cos 0 = 2 − 1 = −2 sin
4 2 8
or
π 2 1 1√
sin = 1− 2 ,
8 2 2
√
i.e. sin π8 = 1
2 − 1
4 2.
b) By looking at the figure below and using elementary geometry we deduce
that sin π6 = 12
π
6
sin π6 OBA is an equilateral triangle,
| hence 2 sin π6 = 1.
0 1
599
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 620
A COURSE IN ANALYSIS
Since cos2 π
6 = 1 − sin2 π
6 =1− 1
4 = 3
4 we find that
π 1√
cos = 3.
6 2
c) First note that
π π π π π π π
sin = sin + = sin cos + cos sin
3 6 6 6 6 6 6
π π 1√
= 2 sin cos = 3
6 6 2
and
π π π π π π π
cos = cos + = cos cos − sin sin
3 6 6 6 6 6 6
1√ 1√ 1 1 1
= 3 3 − · = .
2 2 2 2 2
Therefore we find √
π sin π3 1
2 3 √
tan = = = 3.
3 cos π3 1
2
d) Since sin π6 = 1
we find
2
1 π π π π π π π
= sin = sin + = sin cos + cos sin
2 6 12 12 12 12 12 12
π π
= 2 sin cos
12 12
or
π π 1
sin cos = ,
12 12 4
which yields
1 π
(∗) π = 4 cos .
sin 12 12
Further we find that
1√ π π π π 2 π 2
3 = cos = cos + = cos − sin .
2 6 12 12 12 12
π 2
π 2
Since cos 12 + sin 12 = 1 it follows that
1√ π 2 π 2
3 = cos + cos −1
2 12 12
or '
π 1 1√
cos = + 3.
12 2 4
π
π cos π
Finally, since cot 12 = sin = 4 cos2 12
12
π , where we used (∗), we find
12
π 1 1√ √
cot =4 + 3 = 2 + 3.
12 2 4
600
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 621
b) Again the mean value theorem yields for some ξ between x and y
1
| tan x − tan y| = | tan ξ||x − y| = |x − y|
| cos2 ξ|
and since that in [−a, a] the function cos has its minimum at a we find
1
| tan x − tan y| ≤ |x − y|.
cos2 a
c) For the first statement we use a short induction argument. For n = 1 we
have | sin x| ≤ | sin x|. Now suppose that | sin nx| ≤ n| sin x|. Using the addition
theorems we find
or
The statement | sin ax| ≤ a| sin x| for all a > 0 and all x ∈ R is not correct. Take
a = 12 and x = π, then the claim is
π 1
1 = sin ≤ | sin π| = 0,
2 2
which of course is not correct.
601
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 622
A COURSE IN ANALYSIS
of f is (0, 1]. On [0, 1] the function sin is strictly increasing. Now, let a > 0. It
1 1
follows for x ∈ R that x + a = x and therefore 1+(x+a) 2 = 1+x2 . Thus either
1 1 1 1
sin 1+(x+a)2 < sin 1+x2 or sin 1+x2 < sin 1+(x+a)2 , but we never have
1
equality, hence x → sin 1+x 2 is not periodic.
8. a)
d
cos(ln(1 + x2 ))
dx
d
= ln(1 + x2 ) (cos )(ln(1 + x2 ))
dx
2x
=− sin(ln(1 + x2 )).
1 + x2
b)
d sin(tan t)
√
dt 1 − cos4 t
d √ d
√
dt sin(tan t) 1 − cos4 t − sin(tan t) dt 1 − cos4 t
=
1 − cos4 t
1
√ 3 1 4 − 12
4
cos2 t cos(tan t) 1 − cos t − sin(tan t) · (4 cos t)(sin t) 2 (1 − cos t)
=
1 − cos4 t
(cos(tan t))(cos t)(1 − cos t) − 2 sin(tan t) cos3 t sin t
4
= 3
(1 − cos4 t) 2
−(cos(tan t))(cos5 t) − 2(sin(tan t))(sin t)(cos3 t) + (cos(tan t))(cos t)
= 3 .
(1 − cos4 t) 2
c)
d √
arcsin 1 + cos s
ds
d√ √
= 1 + cos s (arcsin ) 1 + cos s
ds
− sin s 1
= √
2 1 + cos s 1 − √1 + cos s2
− sin s
= √ √ .
2 1 + cos s cos s
602
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 623
d)
d 2
arctan e−u cot u
du
d −u2 2
= e cot u (arctan ) e−u cot u
du
−u2 −u2 1 1
= −2ue cot u + e
sin2 u 1 + e−u2 cot u 2
2
(−2u(cot u) sin2 u + 1)e−u
= .
1 + e−2u2 cot2 u
n
9. We first find cos jt. For j ≥ 1 it follows that
j=1
t 1 1 1
cos jt · sin = sin j + t − sin j − t
2 2 2 2
and therefore
⎛ ⎞
n
⎝ t
cos jt⎠ sin
j=1
2
n
1 1 1
= sin j + t − sin j − t
2 j=1 2 2
1 1 t
= sin n + t − sin
2 2 2
or for t = 0 2n+1
n
1 sin 2 t
cos jt = −1 ,
j=1
2 sin 2t
i.e. for t = 0 we have
n
1 1 sin 2n+1
2 t
Cn (t) = + cos jt = .
2 j=1 2 sin 2t
603
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 624
A COURSE IN ANALYSIS
for all t ∈ − π2 , π2 . Since t → Cn (2t) is arbitrarily often differentiable
on − π2 , π2
it follows that Dn (.) is arbitrarily often differentiable on − π2 , π2 .
Chapter 11
∂D = {−1, 2, 3, 4, 5, 6}.
−1 0 1 2 3 4 5 6
[ | | ) • • [ ]
For −1 < x < 2, i.e. x ∈ (−1, 2) we take ε1 = 12 min{|x + 1|, |x − 2|} to find
that (−ε1 + x, x + ε1 ) ⊂ (−1, 2) ⊂ D, and for x ∈ (5, 6) we find with ε2 :=
1
2 min{|x − 5|, |x − 6|} that (−ε2 + x, x + ε2 ) ⊂ (5, 6) ⊂ D. The points −1, 2, 3, 4, 5, 6
are not internal points: every interval with centre x ∈ {−1, 2, 3, 4, 5, 6} will contain
points not belonging to D. This is almost the proof that ∂D = {−1, 2, 3, 4, 5, 6}.
For −1, 3, 4, 5, 6 ∈ D it follows immediately that these points belong to ∂D. For 2
we need to argue more carefully: if (−ε + 2, 2 + ε), 0 < ε < 1, is an open inteval with
centre 2, then (−ε + 2, 2) ⊂ D. Hence (−ε + 2, 2 + ε), 0 < ε < 1, always contains
points in D and D . However this also holds when we do not use the restriction
ε < 1.
2. a) In order for
(x2 − 1) (x2 + 4x) to be defined we need to have that
x2 − 1 x2 + 4x ≥ 0, i.e. (x2 − 1) ≤ 0 and (x2 + 4x) ≤ 0 or (x2 − 1) ≥ 0 and
(x2 + 4x) ≥ 0. Now x2 − 1 ≤ 0 if x ∈ [−1, 1] and x2 + 4x = x(x + 4) ≤ 0 if either
x ≤ 0 and x ≥ −4 or x ≥ 0 and x ≤ −4. Moreover x2 − 1 ≥ 0 if x ∈ R \ (−1, 1)
and x(x + 4) ≥ 0 if either x ≥ 0 and x ≥ −4 or x ≤ 0 and x ≤ −4. Together we
find that (x2 − 1)(x2 + 4x) ≥ 0 if
D = R+ \ {0, 1}.
c) As in part a) we note that the condition is that (sinh x) 1 − x4 is non-
negative. Now for x ≥ 0 we have sinh x ≥ 0 and for x < 0 we have
604
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 625
sinh x < 0. Further
for x ∈ [−1, 1] we have 1 − x4 ≥ 0 and in R \ [−1, 1] we find
that 1 − x4 < 0. Therefore it follows that
cos(arcsin x) cos(arcsin x)
cot(arcsin x) = =
sin(arcsin x) x
which implies
D = [−1, 1] \ {0}.
1 + cos πx −π sin πx
lim = lim
x→1 x2 − 2x + 1 x→1 2x − 2
and since sin π = 0 as is 2x − 2 for x = 1 we use the rules again to find
3y 2 − y + 5 6y − 1 6 3
lim = lim = lim = .
y→∞ 5y 2 − 6y − 3 y→∞ 10y + 6 y→∞ 10 5
1 1
d) We first rewrite the term sin2 u
− u2 as:
1 1 u2 − sin2 u u2 − sin2 u u2
2 − 2 = 2 = · .
sin u u 2
u sin u u 4
sin2 u
605
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 626
A COURSE IN ANALYSIS
u2 − sin2 u
and therefore it remains to find lim . We have
u→0 u4
u2 − sin2 u 2u − 2 sin u cos u
lim = lim
u→0 u4 u→0 4u3
2u − sin 2u
= lim
u→0 4u3
2 − 2 cos 2u
= lim
u→0 12u2
4 sin 2u 1 sin 2u 1
= lim = lim = .
u→0 24u 3 u→0 2u 3
lim = lim
x→∞ x2 x→∞ 2x 1 + x2 + ex2
x2 −x2
1+e x2 e 2xe + 1
= lim 2 = lim
x→∞ 1 + x2 + ex x→∞ ex2 e−x2 + x2 e−x2 + 1
2
2xe−x + 1
= lim 2 = 1.
x→∞ e −x + x2 e−x2 + 1
2x2 +12x−2
5. a) In order for f1 (x) = 1 to be defined we have to have x2 − 1 > 0.
15(x2 −1) 2
If x2 − 1 = 0 then we would be dividing by 0, note that the numerator is not 0 for
606
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 627
x = ±1. If x2 − 1 < 0 the square root is not defined. Hence the maximal domain
D1 of f1 is D1 = R \ [−1, 1].
On this domain f1 is neither even nor odd and f1 (x) = 0 if and only if x2 +6x−1 = 0,
i.e. for √ √
x1,2 = −3 ± 9 + 1 = −3 ± 10.
√ √ √
Since 3 < 10 < 4 the only root is x0 := −3 − 10, the number −3 + 10 does
not belong to D1 .
For x > 1 we have 2x2 + 12x − 2 > 0 and therefore
2x2 + 12x − 2
lim 1 = +∞.
x→1
x>0 15 (x2 − 1) 2
√ √
For x < −3 − 10 we know that 2x2 + 12x − 2 > 0 but for −3 − 10 < x < −1 we
have 2x2 + 12x − 2 < 0 which implies that
2x2 + 12x − 2
lim 1 = −∞.
x→−1
x<−1 15 (x2 − 1) 2
2
We claim that for x → ∞ the asymptote is x → 15 x. Indeed we find
2x2 +12x−2
1
15(x2 −1) 2 x2 + 6x − 1
2 = 1
15 x x · x 1 − x12 2
and for x → ∞ we have
x2 + 6x − 1 1 + x6 − x1
lim
1 = lim 1 = 1.
x→∞ 2 x→∞
x 1 − x12 2 1 − x12 2
2
The asymptote for x → −∞ is the function x → − 15 x, since for x < −1
2x2 +12x−2
15(x2 −1) 2
1
x2 + 6x − 1 x2 + 6x − 1
2 = 1 = 1
− 15 x −x(−x) 1 − x12 2 x2 1 − x12 2
and the result follows as before.
To find local extreme values we consider
d 2x2 + 12x − 2 2x3 − 2x − 12
1 = 3 .
dx 15 (x2 − 1) 2 15 (x2 − 1) 2
df1
The condition dx (x) = 0 is equivalent to
x3 − x − 6 = 0.
23 − 2 − 6 = 0.
607
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 628
A COURSE IN ANALYSIS
Now we look at
3
x − x − 6 : (x − 2) = x2 + 2x + 3.
x3 − 2x2
2x2 − x
2x2 − 4x
3x − 6.
Thus we have 3
x − x − 6 = (x − 2) x2 + 2x + 3 .
√
The zeroes of x2 + 2x + 3 are x1,2 = −1 ± 1 − 3, hence they are not real and
therefore only at x0 = 2 may we have local extreme values.
We know that lim f1 (x) = +∞ and lim f1 (x) = +∞, therefore we have a local
x→1 x→∞
x>1
minimum at x0 = 2.
This can be checked by looking at f1 (2):
d 2x3 − 2x − 12 −2x2 + 12x + 2
f1 (x) = 1 = 5
dx 15 (x2 − 1) 2 15 (x2 − 1) 2
and
−2 · 4 + 12 · 2 + 2 18 6
f1 (2) = √ = √ = √ > 0.
15 3 15 3 5 3
Note that f (2) = √2 . Finally we sketch the graph of f1
3
−9 −8 −7 −6 −5 −4 −3 −2 −1 1 2 3 4 5 6 7 8 9
−1
−2
−3
−4
−5
−6
608
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 629
s2 s 2
b) First we note that 1+s4 ∈ [0, 1) for all s ∈ R and therefore s → tan 1+s4
2 2
is defined on R. Secondly s → 1+s s s
4 is even implying that s → tan 1+s4 is even.
2s − 2s5 1
= .
(1 + s4 ) cos2 s2
1+s4
2s0 − 2s50 = 2s0 1 − s40 = 2s0 1 − s20 1 + s20
the real zeroes are s0 = 0, s1 = 1, s2 = −1, and due to symmetry we only need to
investigate the function at s0 = 0 and s1 = 1.
s2
We know that tan t ≥ 0 for t ∈ 0, π2 therefore the function s → tan 1+s 4 must
s2
have a minimum at s0 = 0. Further, for s → ∞ we know that lim tan = 0,
s→∞ 1 + s4
hence we have a maximum at x1 = 1, and by symmetry also at s2 = −1. The value
at x1 = 1 is tan 12 .
| • |
2
c) Since arsinh is defined on R and so is t → 1 − e−t , the maximal domain of
2
f3 is D3 = R. Further t → 1 − e−t is even, therefore f3 is even too, hence we need
to consider f3 only on [0, ∞).
2
The only zero of arsinh is 0, and 1 − e−t = 0 only for t = 0, therefore f3 has a zero
2
at 0. For t → ∞ it follows that −e−t tends to 0, hence
2
√
lim arsinh 1 − e−t = arsinh(1) = ln 1 + 2 .
t→∞
609
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 630
A COURSE IN ANALYSIS
hence on [0, ∞) arsinh is strictly monotone increasing and the only local extreme
value is at 0, which is equal to
2
arsinh 1 − e−0 = arsinh 0 = 0.
√
ln 1 + 2
−
610
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 631
where we used the fact that cosh is an even function and sinh is an odd function.
sinh x
d) Recall that tanh x = cosh x and therefore
sinh x sinh y
tanh x − tanh y cosh x − cosh y
= sinh x sinh y
1 − tanh x tanh y 1 − cosh x cosh y
sinh x cosh y−sinh y cosh x
cosh x cosh y
= cosh x cosh y−sinh x sinh y
cosh x cosh y
sinh x cosh x − sinh y cosh x sinh(x − y)
= = = tanh(x − y).
cosh x cosh y − sinh x sinh y cosh(x − y)
Chapter 12
1. a) We use the formula
n
(∗) S(f, Zn , ξ) = f (ξk )(tk − tk−1 )
k=1
k−1
and therefore with tk = 1 + n , k = 1, . . . , n + 1, we have
k−1 k−2 1
tk − tk−1 = 1 + −1− =
n n n
and
k−1 1 2k − 1
ξk = 1 + + =1+
n 2n 2n
which gives
n
S(f, Zn , ξ) = f (ξk )(tk − tk−1 )
k=1
n
1 2k − 1
= f 1+
n 2n
k=1
n 2 n
2 2k − 1 1 2k − 1
1+ − 1+ .
n 2n n 2n
k=1 k=1
611
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 632
A COURSE IN ANALYSIS
and
n n n n
1 2k − 1 1 1k 1 1
1+ = 1+ + −
n 2n n n n n n
k=1 k=1 k=1 k=1
1 1 n(n + 1) 1 1
= ·n+ + − n
n n 2n n n
3 1
= −
2 2n
therefore it follows that
14 2 3 1
S(f, Zn , ξ) = − 2− +
3 3n 2 2n
19 2 1
= − 2+ .
6 3n 2n
Note that
2 2
2 3 t2 2 4 2 1
(2t − t)dt = t − = · 8 − − +
1 3 2 1 3 2 3 2
32 − 12 − 4 + 3 19
= = ,
6 6
1 2
and the larger the n the closer 2n − 3n2 is to 0.
b) Again we wish to use ∗, therefore we note that
612
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 633
and
which implies
n
S(g, Zn , ξ) = g(ξj )(tj − tj−1 )
j=1
k
n
= g(ξj )(tj − tj−1 ) + g(ξj )(tj − tj−1 )
j=1 j=k+1
f (x)
|
|
|
| |
| |
|
······ ······
3 45 6
[a,b]=[a,tk ]∪[tk ,b]
3. a) The figure in the problem shows the graph of x → |x| on [−2, 1] and indicates
the two triangles. They are right angled triangles and by denoting the distance
between two points A1 , A2 in the plane by l(A1 , A2 ) we find
1 1
area(ABC) = l(A, B)l(A, C) = · 2 · 2 = 2
2 2
613
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 634
A COURSE IN ANALYSIS
and
1 1 1
area(BDE) = l(B, D)l(D, E) = · 1 · 1 =
2 2 2
therefore we find
1
5
|x|dx = .
−2 2
√
b) The figure in the problem shows the graph of r → g(r) = R2 − r2 , r ∈
[−R, R], and the area of the upper disc with radius R is 12 πR2 , hence we have
R
1
R2 − r2 dr = πR2 .
−R 2
4. a)
d
F (x) = (ln(cosh x))
dx
d
= cosh x (ln )(cosh x)
dx
1
= sinh x = tanh x,
cosh x
therefore
tanh xdx = ln cosh x.
b)
s
d a d eslna
F (s) = =
ds lna ds lna
lna slna
= e = as ,
lna
i.e.
as
as ds = .
lna
c)
d eu
F (u) = (sin 5u − 5 cos 5u)
du 26
eu eu
= (sin 5u − 5 cos 5u) + (5 cos 5u + 25 sin 5u)
26 26
eu sin 5u
= ,
26
and we find
eu sin 5u eu
du = (sin 5u − 5 cos 5u).
26 26
614
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 635
d)
d 1
F (r) = − cos r2 + 4r − 6
dr 2
1
= − (2r + 4) − sin r2 + 4r − 6
2
= (r + 2) sin r2 + 4r − 6
which gives
1
(r + 2) sin r2 + 4r − 6 dr = − cos r2 + 4r − 6 .
2
Chapter 13
1. We have
n
1 n
1
1 1
(1 + k 2 )x k2 dx = (1 + k 2 )x k2 dx
0 k=1 k=1 0
n
1
1
= (1 + k 2 ) x k2 dx
k=1 0
1
n
1
2 1+ k12
= (1 + k )
1 x
1 + k2
k=1 0
1
n
1
1
= (1 + k 2 ) k2 +1 x1+ k2
2
k=1 k 0
n
n(n + 1)(2n + 1)
= k2 = .
6
k=1
615
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 636
A COURSE IN ANALYSIS
b
c) If h ≥ 0 is integrable on [a, b] then h(t)dt ≥ 0. Now f (x) ≥ 0 on [−1, 0]
a
implies that f is monotone increasing on [−1, 0] and since f (−1) = 0 it follows that
0
f (x) ≥ 0 for all x ∈ [−1, 0] implying f (x)dx ≥ 0.
−1
1
3. The function x → 1 + 1+x2 sin x3 is odd:
1 3
1
1+ sin (−x) = 1 + sin − x3
1 + (−x)2 1 + x2
1
=− 1+ sin x3 .
1 + x2
a
Now note that for every odd function f : [−a, a] −→ R we have f (t)dt = 0.
−a
Indeed
a
a
0
f (t)dt = f (t)dt + f (t)dt
−a 0 −a
a
0
= f (t)dt − f (−t)dt
0 −a
a
0
= f (t)dt − f (s) · (−1)ds
0 a
a
0
a
a
= f (t)dt + f (s)ds = f (t)dt − f (s)ds = 0.
0 a 0 0
sin(2n+1)t
, t = 0, t ∈ − π2 , π2
Dn (t) := sin t
2n + 1, t = 0.
n
1 1
Dn (t) = + cos(2kt)
2 2
k=1
616
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 637
implying
π
π n
2 2 2 2
Dn (t)dt = 1+2 cos 2kt dt
π0 π 0
k=1
n
π
2
=1+2 cos 2ktdt
k=1 0
n
1 kπ
=1+ cos sds
k 0
k=1
kπ
n
1
= 1+ sin s = 1.
k
k=1 0
6. a) We have
π
4
π
4
π
ϑ cos ϑdϑ = ϑ sin ϑ|04 − sin ϑdϑ
0 0
π π
= ϑ sin ϑ|0 + cos ϑ|0 4 4
π 1√ 1√ 1√ π
= 2+ 2−1= 2 1+ − 1.
42 2 2 4
b) We have
2 2
2
x2 x 2
xln(2x + 1)dx = ln(2x + 1) − dx
1
2
2 1 2 2x +1
2
2
1 x2
= 2ln5 − ln2 − dx,
8 1
2
2x + 1
and we know
x2 1 1 1
(∗) = x− + ,
2x + 1 2 4 8x + 4
617
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 638
A COURSE IN ANALYSIS
9 1 1
= + ln20 − ln8,
16 8 8
which implies
2
1 9 1 1
xln(2x + 1)dx = 2ln5 − ln2 − − ln20 + ln8.
1
2
8 16 8 8
c) We have
1 1
m1
m cosh(ms) m 1
s sinh msds = s − cosh msds
0 m 0 m 0
m1
cosh 1 1
= − 2 sinh ms
m2 m 0
cosh 1 − sinh 1 e + e−1 − e − e−1
= =
m2 2m2
1
= 2 .
m e
d) We have
3
3 3
3
lnt 1 1 1 1
√ dt = t− 2 lntdt = 2t 2 lnt − 2t 2 dt
1 t 1 1 1 t
√
3
− 12
= 2 3ln3 − 2 t dt
1
√
1 3 √
= 2 3ln3 − 4t 2 = 4 + 2 3(ln3 − 2).
1
618
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 639
e) We have
π π
1 3 π 2r
e2r sin 3rdr = e2r sin 3r − e cos 3rdr
0 2 0 2 0
π
3 1 2r 3 π 2r
=− e cos 3r + e sin 3rdr
2 2 0 2 0
or
π
9 3
1+ e2r sin 3rdr = − −e2π − 1
4 0 4
which gives
π
2r 4 3 2π 3 e2π + 1
e sin 3rdr = · e +1 = .
0 13 4 13
or
π
1 n2
(∗) 1− 2 cos nx cos mxdx = 0.
π m −π
that
π
2 π
1 2
2= cos nx + sin2 nx dx = cos2 nxdx,
π −π π −π
which gives
π
1
cos2 nxdx = 1.
π −π
619
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 640
A COURSE IN ANALYSIS
n2
Since m2 > 0 it follows that
π
1
(sin nx)(cos mx)dx = 0.
π −π
Therefore let a = 0 and set D = b2 − 4ac. We consider the three cases D > 0,
D = 0, and D < 0.
D>0. It follows that
⎛ ⎞
2 √ 2
b D
at + bt + c = a ⎝ t +
2
− ⎠
2a 2a
√ √
b+ D b− D
=a t+ t+ .
2a 2a
√
−b± D
Over any interval which does not contain t1 , t2 , t1,2 = 2a the function t →
620
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 641
1
at2 +bt+c is integrable and we have
1 1
= √ √
at2 + bt + c b+ D
a t + 2a t + b−2a D
√a √a
1 D D
= √ − √
a t + b− D t + b+2a D
2a
1 1 1
=√ √ − √ ,
D t + b−2a D t + b+2a D
which implies
1 1 1√ 1 1√
dt = √ dt − √ dt
at2 + bt + c D t+ b−2a D D t+ b+2a D
√
1 2at + b − D
= √ ln √ .
D 2at − b + D
dx 1 x b
Since = arctan it follows by the substitution t + 2a = y that
x2 + α2 α α
⎛ ⎞
b
dt 1 2a t + 2a
= arctan ⎝ √ ⎠
at2 + bt + c a |D| |D|
2a
2 2at + b
= arctan .
|D| |D|
621
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 642
A COURSE IN ANALYSIS
9. We have
a a+c a+c
g(t)dt = g(s − c)ds = g(s)ds.
0 c c
10. In general, we will first determine a primitive and then evaluate it at the boundary
points of the integral
dy 1
a) With the change of variable y = lnx we find that dx = x, x = ey , and
consequently
dx dy −1 −1
= = 2 =
x(lnx)3 y3 2y 2(lnx)2
which gives
e2 e2
dx −1 −1 1
3
= 2 = 2 2
+
e x(lnx) 2(lnx) e 2(lne ) 2(lne)2
1 1 3
=− + = .
8 2 8
Thus we find
π
π
2 dt 1 1 t 2
= arctan tan
π
3
5 + cos t 2 2 2 π
3
1 1 π 1 1 π
= arctan tan − arctan tan
2 2 4 2 2 6
1 1 1 1
= arctan − arctan √ .
2 2 2 2 3
dν 2y
c) With arcsin y 2 = ν we find that dy = 1 giving
(1−y 4 ) 2
y arcsin y 2 1 ν2
dy = νdν =
1 − y4 2 4
1 2
= arcsin y 2
4
622
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 643
and therefore
1 1
√
y arcsin y 2
2 1 2 √2
dy = arcsin y 2
0 1 − y4 4 0
2
1 1 1 π 2 π2
= arcsin = = .
4 2 4 6 144
d) We have
ds ds 1 ds
√ = =
2
5 − 4s − s2 9 − (s + 2)2 3 1−( s+2
3 )
π 5
= − arcsin .
2 3
e) With x = sinh t we find dx
dt = cosh t and further
dx cosh tdt cosh t
3 = 3 = dt
2
(1 + x ) 2 (1+sinh 2 t) 2
(cosh t)3
sinh t x
= tanh t = = √ .
1 + sinh t2 1 + x2
Therefore we get
4 4
dx x
= √ = √4 − √1 .
1 + x 1
3
2 17 2
1 (1 + x2 ) 2
√ √
11. a) To find a primitive of t → 3 2t+1 we use the change of variable y = 2t + 1,
i.e. 2x + 1 = y 2 and dx
dy = y to get
√
3 2t+1 dt = 3y ydy,
623
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 644
A COURSE IN ANALYSIS
Thus we arrive at
√ √ √ 4
4 √ 2t + 1 3 2t+1
e 2t+1
2t+1
3 dt = −
0 ln3 (ln3)2
0
3 · 33 1 · 31 e3 e
= − − +
ln3 ln3 (ln3)2 (ln3)2
3
240 e +e
= − .
ln3 (ln3)2
b) The substitution x = π − y gives
π
π
x sin x (π − y) sin y
I= 2
dx = dy
0 1 + cos x 0 1 + cos2 y
π
π
sin y y sin y
=π 2y
dy − dy
0 1 + cos 0 1 + cos2 y
π
(cos )(y)
= −π 2
dy − I
0 1 + cos y
π
= −π arctan(cos y)|0 − I
π2
= − I,
2
or
π2
I= .
4
12. First we note
x4 − x = x x3 − 1 = x(x − 1) x2 + x + 1 .
We wish to find a, b, c, d ∈ R such that
x+1 x+1 a b cx + d
= = + +
x4 − x x(x − 1) (x2 + x + 1) x x − 1 x2 + x + 1
x3 (a + b + c) + x2 (b + d − c) + x(b − d) − a
= ,
x(x − 1) (x2 + x + 1)
i.e.
a+b+c=0
a+d−c=0
b−d=1
−a=1
which gives a = −1, b = 23 , c = 13 , d = − 31 . Hence we have
x+1 1 2 1 1 x−1
dx = − dx + dx + dx
x4 − x x 3 x−1 3 x2 + x + 1
2 1 x−1
= −ln|x| + ln|x − 1| + dx,
3 3 x2 + x + 1
624
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 645
and further
x−1 1 2x − 2
dx = dx
x2 + x + 1 2 x2 + x + 1
1 2x + 1 1 3
= dx − dx
2 x2 + x + 1 2 x2 + x + 1
1 3 2 2x + 1
= ln x2 + x + 1 − √ arctan √
2 2 3 3
1 2 √ 2x + 1
= ln x + x + 1 − 3 arctan √ ,
2 3
where we use the solution to Problem 8 b) for D < 0.
13. This formula follows by iterating integration by parts:
b
b
b
f (t)g (3) (t)dt = f g |a − f g (2) dt
a a
b
b b
= f g |a − f g |a −
f (t)g (t)dt
a
b
b b
= f g |a − f g |a + f (t)g (t)dt
a
b
b b b
= f g |a − f g |a + f g|a − f (3) (t)g(t)dt.
a
d
g (s)
14. Note that ds g(s) = √ and therefore we find
2 g(s)
g (s)
ds = 2 g(s).
g(s)
π
2 cos r
In the case of √ dr we now find
π
6
sin r
π2 √ π2
cos r
√ dr = 2 sin r π
π
6
sin r 6
√
= 2 − 2.
15. Integration by parts gives
π π
1 1 π
f (t) cos ntdt = (sin nt)f (t) − f (t) sin ntdt
−π n −π n −π
which implies
π
1 π
f (t) cos ntdt = f (t) sin ntdt
n
−π −π
1 π 2πM
≤ M dt = ,
n −π n
625
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 646
A COURSE IN ANALYSIS
where we used the fact that f (t) sin nt ≤ |f (t) sin nt| ≤ M.
16. We have
x x
1
t−n dt = t−n+1
1 −n + 1 1
1 1 1
= −
n − 1 n − 1 xn−1
which implies
x
1 1 1 1
lim t−n dt = − lim = .
x→∞ 1 n − 1 x→∞ n − 1 xn−1 n−1
626
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 647
1 n 1
(1 + ) ≥1+n =2
n n
or n
n 1 (n + 1)n
+ = ≥2
n n nn
implying
(∗) 2nn ≤ (n + 1)n .
We now use induction to prove
n n
n! ≤ 2 .
2
For n = 1 we have 1
1
1! ≤ 2 =1
2
627
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 648
A COURSE IN ANALYSIS
7. Suppose that
1 a1 + · · · + an
(∗) (a1 · . . . · an ) n ≤
n
holds and that n ≥ 2. If 0 < x ≤ 1 − n1 n
then x > 0 ≥ 1 + n(x − 1) or with x = 1 + y
we have (1 + y)n ≥ 1 + ny.
Thus we need to prove the case n ≥ 2 and x > 1 − n1 . In this case 1 + n(x − 1) > 0
and we may apply (∗) to a1 = 1 + n(x − 1) and a2 = · · · = an = 1 to find
n
1 + n(x − 1) + 1 + · · · + 1
xn =
n
≥ (1 + n(x − 1)) · 1 · . . . · 1 = 1 + n(x − 1)
628
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 649
which implies
x n x
(1 + ) ≤ (1 + )m for − x < n < m.
n m
n
n
12
1
√ |ak | ≤ a2k ,
n
k=1 k=1
and since
n
a2k ≤ n · max {a21 , . . . , a2n }
k=1
2
= n · max {|a1 |2 , . . . , |an |2 } = n · max {|a1 |, . . . , |an |}
we find
n
12
√
a2k ≤ n max {|a1 |, . . . , |an |}.
k=1
Chapter 15
2. a) We need to prove that for every > 0 there exists N () ∈ N such that
n > N () implies |an − a| < . For n ≥ M it follows however that
|an − a| = |a − a| = 0 <
implying lim an = a.
n→∞
b) Since lim an = a, for > 0 there exists N1 () ∈ N such that n ≥ N1 ()
n→∞
implies |an − a| < . Therefore, for n ≥ N () = max{N1 (), M } we have |bn − a| =
|an − a| < .
Both problems show that the first M (M could be very large) elements of a sequence
do not have any affect on the limit.
629
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 650
A COURSE IN ANALYSIS
3. Let |bn | ≤ M for all n ≥ k and given > 0 choose N () ∈ N such that n ≥ N ()
implies |an | < M . It follows for n ≥ N () that
|bn an | = |bn ||an | ≤ M |an | < M = ,
M
i.e.
lim |bn an | = 0.
n→∞
Now, since lim an = lim bn = a, given > 0 there exists N1 (), N2 () ∈ N such
n→∞ n→∞
that n ≥ N1 () implies |an − a| < and n ≥ N2 () implies |bn − a| < . Hence for
n ≥ max{N1 (), N2 ()}, we deduce that max{|an − a|, |bn − a|} < which implies
for n ≥ max{N1 (), N2 ()} that |cn − a| < .
b) We may look at (cn )n∈N , cn = (−1)n , which is a divergent sequence as we
know by Example 15.5.C. However −1 ≤ cn ≤ 1 for all n ∈ N and the sequence
(an )n∈N , an = −1 for all n ∈ N has limit a = −1. Also the sequence (bn )n∈N , bn = 1
for all n ∈ N has limit b = 1. Thus a < b, an ≤ cn ≤ bn but (cn )n∈N does not
converge.
5. a) Here we will use the converse triangle inequality
Since lim an = a yields for > 0 the existence of N () ∈ N such that n ≥ N ()
n→∞
implies |an − a| < , for these n, i.e. n ≥ N (), it follows that
i.e.
lim |an | = |a|.
n→∞
630
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 651
Therefore we find
1
max{an , bn } = (an + bn + |an − bn |)
2
and
1
min{an , bn } = (an + bn − |an − bn |),
2
or with cn := an + bn , dn := an − bn
1
max{an , bn } = (cn + |dn |)
2
and
1
min{an , bn } = (cn − |dn |).
2
Since lim cn = a + b and lim |dn | = |a − b| where for the last limit we have used
n→∞ n→∞
Problem 5 a), it follows that
1
lim max{an , bn } = (a + b + |a − b|) = max{a, b}
n→∞ 2
and
1
lim min{an , bn } = (a + b − |a − b|) = min{a, b}.
n→∞ 2
5 5
7. a) First we observe that n+6 − 0 = n+6 < n5 . Now, given > 0 we choose
N () ∈ N, N () > 5 , to find for n ≥ N () that
5
− 0 = 5 < 5 < ,
n + 6 n+6 n
5
which implies lim = 0.
n→∞ n+6
b) We have
4n 4 3 · 4n − 4(3n + 2)
− =
3n + 2 3 3(3n + 2)
−8 8 8
= = < .
9n + 6 9n + 6 9n
8 1 8000
Therefore, if 9n < 1000 , i.e. n > 9 , it follows that
4n 4 1
−
3n + 2 3 < 1000 .
1 1
8. a) Given > 0 for n > + 1 it follows that n < . For these n we find
1 1
nk ≤ n1 < , i.e. lim k = 0.
n→∞ n
631
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 652
A COURSE IN ANALYSIS
1
b) Since lim = 0, given > 0 we find N () ∈ N such that n ≥ N () implies
n→∞ n
1 k
n < , implying for these n that
1
1 − 0 = 11 < ,
nk nk
1
which proves lim 1 = 0.
n→∞ nk
9. a)
(n + 1)2 − n2 n2 + 2n + 1 − n2
lim = lim
n→∞ n n→∞ n
2n + 1 1
= lim = lim 2 + = 2.
n→∞ n n→∞ n
b) √ √ √ √
√ √ ( n + 1 − n)( n + 1 + n)
lim ( n + 1 − n) = lim √ √
n→∞ n→∞ n+1+ n
n+1−n 1
= lim √ √ = lim √ √ = 0,
n→∞ n+1+ n n→∞ n+1+ n
1 √ 1
where the latter follows from √n+1+ n
≤ √1n and lim √ = 0.
n→∞ n
c)
n n(n+1)
j=1 j 2
lim = lim
n→∞ n2 n→∞ n2
n2 + n 1 1 1
= lim = lim + = .
n→∞ 2n2 n→∞ 2 2n 2
d)
n n(n+1)(2n+1)
j=1 j2 6
lim = lim
n→∞ n3 n→∞ n3
2n3 + 3n2 + n 1 1 1 1
= lim = lim + + = .
n→∞ 6n3 n→∞ 3 2n 6n2 3
e)
1
1 + 2 · 3n 3n + 2
lim = lim 5
n→∞ 5 + 4 · 3n
3n + 4
n→∞
n
1
3 +2
= lim n
n→∞ 5 13 + 4
n
1
lim +2
n→∞ 3 2 1
= n = = .
1 4 2
lim 5 +4
n→∞ 3
632
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 653
f) n
n + 4n n 4
lim = lim n + lim =0
n→∞ 5n n→∞ 5 n→∞ 5
n
since lim q n = 0 for |q| < 1 and n
5n ≤ n
2n and lim n = 0 by Example 15.5.E.
n→∞ n→∞ 2
1
10. The case a = 1 is trivial. Let a > 1 then a n = 1 + bn for some bn > 0 and further
n
n n
a = (1 + bn )n = bkn > 1 + bn
k 1
k=0
= 1 + nbn
or
a−1
0 < bn = ,
n
implying lim bn = 0. Therefore
n→∞
√
n
1
lim a = lim a n = lim (1 + bn ) = 1.
n→∞ n→∞ n→∞
12. Since every polynomial has at most a finite number of real zeroes and since we are
only interested in limits we may assume in the following that for ν ≥ K none of
the polynomials under consideration has a root. Next we consider
n
n−1
ak ν k = an ν n + ν n ak ν k−n
k=0 k=0
and
m
m−1
bl ν l = bm ν l + ν m bl ν l−m
l=0 k=0
633
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 654
A COURSE IN ANALYSIS
n−1
1 an + ak ν k−n
= · k=0
m−1 ,
ν m−n bm + k=0 bl ν l−m
and we note n−1
an +ak ν k−n an
lim k=0
m−1 =
ν→∞ b +
m k=0 lb ν l−m b m
1
as well as lim = 0 for m > n, hence
ν→∞ ν m−n
n
ak ν k
lim k=0
m l
= 0 for m > n.
l=0 bl ν
ν→∞
634
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 655
implying n
an n−m ak ν k 3an n−m
ν ≤ k=0
m l
≤ ν
2bm l=0 b l ν 2b m
n
k a νk
which yields that k=0
m l is unbounded and therefore must diverge to +∞.
l=0 bl ν
an
If bm < 0 then we consider
− nk=0 ak ν k
m l
l=0 bl ν
which must tend to +∞ as ν → +∞, and hence
n
ak ν k
k=0
m l
l=0 bl ν
tends to −∞ as k → ∞.
n
j=1 bj
13. With bn := an − a we have to prove that if lim bj = 0 then lim = 0. For
j→∞ n→∞ n
m < n we have
b1 + · · · + bn b1 + · · · + bm bm+1 + bm+2 + · · · + bn
= +
n n n
and therefore
b1 + · · · + bn |b1 + · · · + bm | |bm+1 | + · · · + |bn |
≤ + .
n n n
Since lim bj = 0, given > 0 we can find m ∈ N such that |bj | < 2 for j > m,
j→∞
which implies
|bm+1 | + · · · + |bn | n−m
≤ < .
n n 2 2
Now we can choose N such that for n > N > m it follows that
|b1 + · · · + bm |
<
n 2
which eventually yields for n > N > m that
|b1 + · · · + bn |
< ,
n
i.e.
|b1 + · · · + bn |
lim = 0.
n→∞ n
14. Given x0 we take xn = x0 + n1 and we deduce that for > 0 there exists δ > 0 such
that |xn − x0 | = n1 < δ then
f (xn ) − f (x0 )
− A < .
xn − x0
635
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 656
A COURSE IN ANALYSIS
However
f (xn ) − f (x0 ) f (x0 + n1 ) − f (x0 )
−A= 1 −A
xn − x0 n
1
) − f (x0 )) − A.
= n(f (x0 +
n
1
Thus given > 0 we can find N () ∈ N such that n > N () ≥ δ implies
1
|n(f (x0 + ) − f (x0 )) − A| < ,
n
i.e.
1
lim (n(f (x0 + ) − f (x0 )) − A) = 0.
n→∞ n
Chapter 16
1. Given a sequence of partial sums (sn )n∈N we find the corresponding sequence
n
(an )n∈N with sn = k=1 ak by an = sn − sn−1 , hence
n(n + 1)(2n + 1) (n − 1)n(2(n − 1) + 1)
an = −
6 6
n
= (2n2 + 3n + 1 − (n − 1)(2n − 1))
6
n
= (2n2 + 3n + 1 − 2n2 + 2n + n − 1)
6
n
= · 6n = n2 .
6
n
Indeed we already know that k=1 k 2 = n(n+1)(2n+1)
6 .
n n
2. From an ≤ bn we deduce k=1 an ≤ k=1 bk which implies
∞
n
n
∞
ak = lim an ≤ lim bk = bk .
n→∞ n→∞
k=1 k=1 k=1 k=1
3.
∞
n
1 1
2
= lim
4k − 1 n→∞ 4k 2
−1
k=1 k=1
n
1 1 1
= lim −
n→∞ 2 2k − 1 2k + 1
k=1
n n−1
1 1 1 1
= lim 1+ − −
n→∞ 2 2k − 1 2k + 1 2n + 1
k=2 k=1
n−1 n−1
1 1 1 1
= lim 1 + − −
2 n→∞ 2k + 1 2k + 1 2n + 1
k=1 k=1
1 1 1
= lim 1 − = .
2 n→∞ 2n + 1 2
636
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 657
b) Note that e−nx = (e−x )n and for x < 0 we know that 0 < e−x < 1, hence
∞
1
e−nx = .
n=0
1 − e−x
c) We have
∞ k
∞ k
4 4 4
= −1−
7 7 7
k=2 k=0
1 11 7 11 16
= 4 − = − = .
1− 7
7 3 7 21
1
5. The condition for the convergence of the geometric series is y−2 < 1 or 1 < |y − 2|.
∞ 1
Thus if y > 3 or y < 1 the series k=0 (y−2)k converges with limit
∞
1 1 1
= = .
(y − 2)k 1 − (y − 2) 3−y
k=0
and
n
s̃n := (ak − ak+1 ) = (a1 − a2 ) + (a2 − a3 ) + · · · + (an − an+1 ) = a1 − an+1
k=1
Now we find
∞
(ak − ak−1 ) = lim sn = lim (an − a0 ) = lim an − a0
n→∞ n→∞ n→∞
k=1
and
∞
(ak+1 − ak ) = lim s̃n = lim (a1 − an+1 ) = a1 − lim an .
n→∞ n→∞ n→∞
k=1
637
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 658
A COURSE IN ANALYSIS
7. a) We know that
∞
1 1
= 1 =2
2k 1 − 2
k=0
and
∞
∞ k
(−1)k 1 1 3
= − = 1 =
3k 3 1− 3
4
k=0 k=0
which implies
∞
1 (−1)k 11
+ = .
2k 3k 4
k=0
Thus
∞
∞
1
ln 1 − 2 = ((ln(k + 1) − ln k) − (ln k − ln(k − 1)))
k
k=2 k=2
or
∞ ∞
3 1 1
= ,
4 k2 (2k − 1)2
k=1 k=1
638
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 659
i.e.
∞
1 3A
= .
(2k − 1)2 4
k=1
n3 +2n2 −2
8. a) For an = 15n2 +n it follows that
n3 1 + n2 − n23
an =
n2 (15 + n1 )
1 + n2 − n23
= n
15 + n1
n
≥ ,
16
where we used that 1 + n2 − n23 ≥ 1 and 15 + n1 ≤ 16. Now, given K ∈ R we only
have to choose N > 16K to have that n ≥ N implies
n N
an ≥ ≥ > K.
16 16
b) Using the hint we find
1 1 1
0 < sin ≤ or n ≤ .
n n sin n1
Now, given K > 0 choose N ∈ N such that N > K. Then it follows for n ≥ N that
1
≥ n ≥ N ≥ K,
sin n1
1
hence lim = +∞.
n→∞ sin n1
9. a) Take an = n2 and bn = n1 , then lim an = lim n2 = ∞ and lim bn =
n→∞ n→∞ n→∞
1 2 1
lim = 0. Moreover an · bn = n · n = n and lim (an bn ) = lim n = +∞.
n→∞ n n→∞ n→∞
b) Take an = −n2 and bn = n1 and we find lim an = −∞, lim bn = 0 and
n→∞ n→∞
21
lim (an bn ) = lim (−n ) = lim (−n) = −∞.
n→∞ n→∞ n n→∞
c) Take an = n and bn = nc , then lim an = ∞, lim bn = 0 and lim (an bn ) =
n→∞ n→∞ n→∞
c
lim (n · ) = c.
n→∞ n
Chapter 17
1. a) Since
2n
1
s2n − sn =
j=n+1
j
1 1 n 1
= + ···+ > =
n+1 2n 2n 2
639
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 660
A COURSE IN ANALYSIS
the first part follows in a straightforward way. However now the second part is
trivial: (sn )n∈N is not a Cauchy sequence, hence has no limit.
b) We will prove that
1
(∗) 0 < sn+m − sn <
n+1
which implies that (sn )n∈N is a Cauchy sequence: given > 0 take N > 1 to find
for n ≥ N that |sn+m − sn | < compare with Remark 17.2.
Next we prove (∗). If m is even we find
1 1 1 1 1 1
sn+m −sn = − + − +· · ·+ − >0
n+1 n+2 n+3 n+4 n+m−1 n+m
640
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 661
−A ≤ an ≤ cn ≤ bn ≤ B
implying that the sequence (cn )n∈N is bounded. By the Bolzano-Weierstrass theo-
rem it must contain a convergent subsequence.
√ √
n n
4. Since 0 < n+1 for all n ∈ N, once we know that n+1 is decreasing we can
√ n∈N
n
deduce that it converges. In order to prove that n+1 is decreasing we need
n∈N
to prove √ √
n n+1
≥ for all n ∈ N.
n+1 n+2
√
n+1 √ √
This is equivalent to n+2
n+1 ≥
√
n
which is equivalent to n(n + 2) ≥ n + 1(n + 1)
or
n(n + 2)2 ≥ (n + 1)3 ,
i.e.
n3 + 4n + 4n ≥ n3 + 3n2 + 3n + 1
which is a correct statement.
5. First we prove by induction for k ∈ N that k! ≥ 2k−1 . For k = 1 we have 1 ≥ 1
which is clearly true. Now we observe that
641
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 662
A COURSE IN ANALYSIS
7. We define ⎧
⎨ −2, n = 3k + 1, k ∈ N0
bn := − 1 , n = 3k + 2, k ∈ N0
⎩ 3
17, n = 3k, k ∈ N0 .
Since −3 < −2 < − 13 < 17 < 19 we have −3 ≤ an ≤ 19 for all n ∈ N. Further the
subsequence (a3k+1 )k∈N0 converges to −2, the subsequence (a3k+2 )k∈N0 converges
to − 13 and the subsequence (a3k )k∈N0 converges to 17.
8. Once we know that (xn )n≥0 converges to a limit lim xn = x > 0 we find
n→∞
k
k−1x +a
x= or kxk = (k − 1)xk + a
k xk−1
1 √
i.e. xk = a or x = a k = k a.
Now we prove that (xn )n≥0 is bounded from below and decreasing, hence it must
converge. We need the following steps:
i) We prove xn > 0. We know that x0 > 0 and a > 0. Since
(k − 1)xkn + a
xn+1 = ,
kxk−1
n
xk
n −a
iv) Since xn+1 = xn − kxk−1
we deduce that xkn ≥ a for all n ∈ N.
n
xk
n −a
v) Since xn ≥ 0 and kxk−1
≥ 0 it follows that
n
xkn − a
xn+1 = xn − ≤ xn .
kxk−1
n
1
Thus (xn )n∈N is decreasing and bounded below by a k . Therefore lim xn exists
n→∞
1
and it must be a k .
642
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 663
and therefore k n ∞
1 1 1
lim 1 + ≥ lim = .
k→∞ k n→∞
j=0
j! j=0
j!
i.e. n ∞
1 1
e = lim 1+ =
n→∞ n j=0
j!
n n+1
10. Since an = 1 + n1 < 1 + n1 = bn the intervals [an , bn ] are bounded, closed
and non-empty. We are done if we can prove that (an )n∈N is monotone increasing
and that (bn )n∈N is monotone decreasing. We then have a1 ≤ an ≤ bn ≤ b1 and in
addition
n+1 n
1 1
b n − an = 1+ − 1+
n n
n
1 1 1 1 4
= 1+ = an ≤ b 1 = ,
n n n n n
643
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 664
A COURSE IN ANALYSIS
hence bn − an → 0 as n → ∞.
For n ≥ 2 Bernoulli’s inequality yields
n
1 1 1
1− 2 > (1 − n) 2 = 1 − ,
n n n
and therefore n
n−1 n2 − 1 (n + 1)n (n − 1)n
< =
n n2 nn nn
or n−1 n
n n+1
< ,
n−1 n
i.e. for n ≥ 2
n−1 n−1 n n
1 n n+1 1
an−1 = 1 + = < = 1+ = an .
n−1 n−1 n n
Therefore (an )n∈N is strictly monotone increasing. Once again, using Bernoulli’s
inequality we find for n ≥ 2
n
1 n n 1
1+ 2 >1+ 2 >1+ 2 =1+ ,
n −1 n −1 n n
and further
n n −n
1 n2 n n+1
1+ < 2
=
n n −1 n−1 n
n −n
1 1
= 1+ 1+ ,
n−1 n
which yields
n+1 n
1 1
bn = 1+ < 1+ = bn−1 .
n n−1
Now we see that(bn )n∈N
nis strictly monotone decreasing and the result follows ,
1
recall that lim 1 + = e.
n→∞ n
Chapter 18
1. This problem helps to understand how to handle the Cauchy criterion better. With
l := n + m and j = n + 1 it follows that
m n+k
l i
1 1
= .
2 i=j
2
k=1
Thus we know that for > 0 there exists N ∈ N such that l ≥ j ≥ N implies
l 1 i l
i=j 2 = i=j 2−i < . Therefore the Cauchy criterion is satisfied.
644
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 665
645
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 666
A COURSE IN ANALYSIS
5. a) We need to consider the sequence n1α n∈N . Clearly for α ∈ R we have
1
nα ≥ 0. Moreover, for α ≥ 0 the sequence is decreasing, while for α < 0 it
1
is not decreasing, but only for α > 0 we have lim α = 0. Hence for α > 0
n→∞ n
∞ n+1
the series n=1 (−1) nα converges by Leibniz’s test, for α = 0 the series reduces
∞ k+1
to n=1 (−1) which we know to be divergent and for α < 0 it follows that
1 ∞ k+1
lim = +∞ therefore n=1 (−1) nα diverges in this case.
n→∞ nα
1 1 1 1
b) Clearly 2n−1 ≥ 0 for n ≥ 1, limn→∞ 2n−1 = 0 and 2(n+1)−1 = 2n+1 <
1 1
2n−1 , i.e. the sequence 2n−1 is decreasing and again Leibniz’s test gives the
n∈N
∞ (−1)n+1
convergence of n=1 2n−1 .
1 1 1
c) For n ≥ 2 the term n ln n is defined and positive. Since 0 < n ln n < n we
1
deduce that lim = 0. Moreover we claim that
n→∞ n ln n
1 1
<
(n + 1) ln(n + 1) n ln n
1
which follows from < n1 and ln(n+1)
n+1
1
< 1
ln n . Hence by Leibniz’s criterion we
∞ (−1)n
know the convergence of n=2 n ln n .
6. From the assumption it follows that
n
n
ak ≤ bk
k=1 k=1
and that for K ≥ 0 there exists N ∈ Nsuch that n ≥ Nimplies nk=1 ak ≥ K.
n ∞
This however gives that n ≥ N implies k=1 bk ≥ K, i.e. k=1 bk diverges.
7. a)
(−1)k k 2 k2 1
k 4 + 2k ≤ k 4 + 2k ≤ k 2
∞ ∞ k 2
and 1
k=1 k2 converges, hence k=1 (−1) k
k4 +2k converges.
b)
k! 1 · ...· k 1 2 3 k 2
= = · · ·... · ≤
kk k · ...· k k k k k k2
∞ ∞ k!
and since k=1 k22 converges we deduce that k=1 kk converges.
c) First we claim that ln(n + 1) ≤ n for all n ∈ N. Indeed for n = 1 we
have ln 2 ≤ 2 since ln e = 1 and ln is a monotone function. Further n + 1 ≤ en is
equivalent to ln(n + 1) ≤ n and if n + 1 ≤ en then we get n + 2 ≤ en + 1 ≤ en + en =
2en ≤ een = en+1 , thus we know that ln(n + 1) ≤ n for all n ∈ N and it follows that
ln(n + 1) ln(n + 1) 1
3
≤ 3
≤
3n + 7 3n 3n2
646
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 667
∞
and the convergence of n=1 3n1 2 implies the result.
d) Since | sin x| ≤ |x| we deduce sin 13 ≤ 13 implying the convergence of
∞ 1
∞ 1 n n
n=1 sin n3 since n=1 n3
∞
kx
e) For all x ∈ R we have cos 1
1+k2 ≤ 1+k2 and since
1
k=1 1+k2 converges it
∞ cos kx
follows that k=1 1+k2 converges for all x ∈ R
mx
f) For x ≤ 0 we know that emx ≤ 1 and therefore em4 ≤ m14 and the conver-
∞ ∞ mx
gence of m=1 m14 implies the convergence of m=1 em4 for x ≤ 0. However for
∞ mx
mx e
x > 0 the series m=1 em4 diverges since lim = 0. Indeed, applying the de
m→∞ m4
tx
l’Hospital rules four times to the function t → et4 we find
etx x4 etx x4 tx
lim 4
= lim = lim e = ∞.
t→∞ t t→∞ 4! t→∞ 4!
tx
Therefore there exists N ∈ N such that t ≥ N implies et4 ≥ 1, hence for m ∈ N it
mx
follows that m ≥ N implies em4 ≥ 1.
g)
∞ ∞
∞
x2 2 1 2 1
= x ≤ x < ∞,
l2 + x2 l2 + x2 l2
l=1 l=1 l=1
i.e. the series converges for all x ∈ R.
h) Since
n+5 n+5 1
√ ≥ √ ≥ √
(2n + 1) n + 3 2n n 2 n
and ∞
1
√
n=1
2 n
∞ n+5
diverges, the series n=1 (2n+1)√n+3 diverges too.
8. For n ∈ N we know both the Cauchy-Schwarz inequality (compare with Corollary
14.3)
n n 12 n 12
n
2 2
(∗) ak b k ≤ |ak bk | ≤ ak bk
k=1 k=1 k=1 k=1
and the Minkowski inequality (compare with Lemma 14.5)
n 12 n 12 n 12
2 2 2
(∗∗) |ak + bk | ≤ ak + bk .
k=1 k=1 k=1
∞ 2
∞
Now, since by the assumptions that < ∞ and k=1 b2k < ∞ we may take
k=1 ak
the limit as n → ∞ of (∗) and the right hand side of (∗∗) to obtain
∞ 12 ∞ 12
n n
2
ak b k ≤ |ak bk | ≤ ak b2k
k=1 k=1 k=1 k=1
647
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 668
A COURSE IN ANALYSIS
and 12 12 12
n
∞
∞
2
|ak + bk | ≤ a2k + b2k .
k=1 k=1 k=1
n n 12
Now we see that the partial sums k=1 |ak bk | and k=1 |ak + bk |2 are monotone
and bounded, hence
∞
∞
12 ∞
12
|ak bk | ≤ a2k b2k
k=1 k=1 k=1
and 12 12 12
∞
∞
∞
2
|ak + bk | ≤ a2k + b2k .
k=1 k=1 k=1
∞
Finally we
note that the (absolute) convergence of k=1 |ak bk | implies the conver-
gence of ∞ k=1 ak bk .
|ak | ∞
9. Since for every sequence (ak )k∈N the term 1+|a k|
is bounded by 1 and k=1 21k
∞ |ak |
converges, it follows immediately that k=1 21k 1+|a k|
converges. The inequality
∞
∞
∞
1 |ak + bk | 1 |ak | 1 |bk |
≤ +
2k 1 + |ak + bk | 2k 1 + |ak | 2k 1 + |bk |
k=1 k=1 k=1
2
(n + 1)6 e−(n+1)
lim =0
n→∞ n6 e−n2
648
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 669
∞ 2
implying the convergence of n=1 n6 e−n .
b) Note that
4n2 + 15n − 3 4n2 + 15n2 19
3 ≤ 3 = 3
2
n (n + 1) 2 2
n n2 n2
∞ 1 ∞ 4n2 +15n−3
and the series n=1 3 converges implying the convergence of n=1 3 .
n2 n2 (n+1) 2
c) For x ∈ R fixed we find
xk+1
(k+1)! k! x
xk
=x = .
k!
(k + 1)! k+1
k+1
x
(k+1)! xk
Thus lim xk = 0 and therefore ∞
k=0 k! converges for every x ∈ R.
k→∞
k!
649
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 670
A COURSE IN ANALYSIS
∞ n√2
3
implying the divergence of n=1 (n+3) 4n+15 .
1
12. ∞|an | ≥ 1, i.e. |an | ≥ 1, we deduce that (an )n∈N cannot have the limit
a) From n
or
(a − 1)|an | ≤ (n − 1)|an | − n|an+1 |.
Since a > 1 we find
0 < (n − 1)|an | − a|an+1 |,
or
(n − 1)|an | > n|an+1 |.
Hence the sequence (n|an+1 |)n∈N is strictly monotone and decreasing and bounded
from
∞ below by 0, implying its convergence. Therefore we deduce that the series
n=1 ((n − 1)|an | − n|an+1 |) which is a telescopic series converges, compare with
Chapter
∞ 16, Problem 6, and this implies, see Chapter 16, Problem 6 again, that
n=1 n | converges.
|a ∞Note that we can also prove: if for all n ≥ N we have
an+1 1
an ≥ 1 − n then n=1 an diverges.
650
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 671
an+1
implying lim n 1 − = 4 > 1 and therefore by Raabe’s criterion the
n→∞ an 3
series converges. Note that
an+1 2
lim = lim 9n + 6n + 1 = 1,
n→∞ an n→∞ 2
9n + 18n + 9
therefore the ratio test cannot give the result.
15. a) We note that
N
ln N
1 1 y
dx = e dy
2 x(ln x)α ln 2 ey y α
ln N ln N
1 1
1−α
= α
dy = y ,
ln 2 y 1−α ln 2
651
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 672
A COURSE IN ANALYSIS
N
∞
lim a+
n = lim a−
n = ∞.
n→∞ n→∞
n=1 n=1
∞
However, since n=1 an converges it follows that lim an = 0 implying that lim a+
n =
n→∞ n→∞
0 and lim a−
n = 0.
n→∞
Given A ∈ R and denote by (bn )n∈N the subsequence of all positive elements of
(an )n∈N and by (cn )n∈N the subsequence of all negative elements of (an )n∈N . Choose
n0 to be the smallest index such that
n0
bk > A,
k=1
652
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 673
∞
which is a rearrangement of k=1 ak . Moreover
⎛ ⎞
n0 n1 n2l+1 n2l+2
0≤S−⎝ bk − |ck | + · · · − |ck |⎠ < bk
k=1 k=1 k=n2l−1 +1 k=n2l−1 +1
and
n0 n1 n2l n2l+1
0≤ bk − |ck | + · · · + bk − S < |ck |.
k=1 k=1 k=n2l−2 +1 k=n2l−1 +1
Then a1 is the largest integer such that a1 b−1 ≤ x. If a1 , . . . , an−1 are already
known, then an is the largest integer such that
or
an ≤ xbn − a1 bn−1 − · · · − an−1 b.
We set
yn := xbn − a1 bn−1 − · · · − an−1 b
and find the following algorithm:
y1 := xb
a1 := [y1 ]
yn+1 := (yn − an )b
an+1 := [yn+1 ].
n 1 2 3 4
yn 27 4
7
8
7
2
7
an 0 0 1 0
653
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 674
A COURSE IN ANALYSIS
We may stop now as the results start to repeat, therefore we have the following
periodic expansion
1
= 0.001001 . . . (b = 2)
7
(ii) The case b = 7 is trivial
1
= 0.1 (b = 7)
7
(iii) b = 10
n 1 2 3 4 5 6 7
yn 10
7
30
7
20
7
60
7
40
7
50
7
10
7
an 1 4 2 8 5 7 1
654
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 675
√
2. The set Q ⊂ R cannot be closed since we know that for example 2 ∈ R is
an accumulation point of Q not belonging to Q. However Q R cannot be open
√⊂
either. If it was open then Q must be closed. The sequence n2 is a sequence
n∈N
in Q with accumulation point 0 ∈ Q, so Q is not closed and consequently Q is
not open in R.
3. It follows that {aν |ν ∈ R} is the complement of the set (−∞, a1 )∪(a1 , a2 )∪(a2 , a3 )∪
· · · , i.e. we have
∞
{aν |ν ∈ R} = (aν−1 , aν )
ν=1
∞
with a0 := −∞. Since ν=1 (aν−1 , aν ) is open as it is a union of open sets it follows
that {aν |ν ∈ R} is closed.
!
1 1
4. We claim that with Bν = ν+2 , 1 − ν+2 we have
Bν = (0, 1).
ν∈N
Clearly we have Bν ⊂ (0, 1)for all ν ∈ N, which implies ν∈N Bν ⊂ (0, 1).
Next we prove that (0, 1) ⊂ ν∈N Bν . For x ∈! (0, 1) we need to show the exis-
tence of ν0 ∈ N such that x ∈ ν01+2 , 1 − ν01+2 . However limν→∞ ν+2
1
= 0 and
1 1 1
limν→∞ 1 − ν+2 = 1, implying that for ν large enough ν+2 < x < 1 − ν+2 .
5. In general {aν |ν ∈ N} is not closed. A closed set must contain all of its accumulation
points and if a is not an element of {aν |ν ∈ N} then this set is not closed. However,
for a converging sequence (aν )ν∈N with limit a the set {aν |ν ∈ N} ∪ {a} is closed
since it contains all of its accumulation points.
6. Since A and B are bounded with some K1 , K2 > 0 we have −K1 ≤ a ≤ K1 for all
a ∈ A and −K2 ≤ b ≤ K2 for all b ∈ B. Consequently, for all a ∈ A and b ∈ B it
follows that
−(K1 + K2 ) ≤ a + b ≤ (K1 + K2 ),
i.e. A + B is bounded.
7. a) First we prove that (−3, 2) ∪ (4, 6) is open. Since the open interval (−3, 2)
and (4, 6) are open it follows that their union is open. Clearly (−3, 2) ∪ (4, 6) ⊂ M .
There are only three points, {4}, {6} and {10}, not belonging to (−3, 2) ∪ (4, 6) but
to M . For none of these points exists an open interval containing the point and
belonging entirely to M . Hence (−3, 2) ∪ (4, 6) is the largest open set contained in
M.
The set [−3, 2] ∪ [4, 6] ∪ {10} is closed since it is a finite union of the closed sets
[−3, 2], [4, 6] and {10}, note that {10} = ((−∞, 10)∪(10, ∞))c and (−∞, 10) as well
as (10, ∞) are open. Clearly M ⊂ [−3, 2] ∪ [4, 6] ∪ {10}. There are only two points,
{−3} and {2}, belonging to [−3, 2] ∪ [4, 6] ∪ {10} not belonging to M . However both
655
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 676
A COURSE IN ANALYSIS
⎧
⎪
⎪ 3, n=1
⎪
⎪
⎪
⎪
⎪
⎪
⎪
⎪ 0, n=2
⎪
⎪
⎨
5
an := 2 − n1 , n = 3k
⎪
⎪
⎪
⎪
⎪
⎪ n
⎪
⎪
⎪ n+1 , n = 3k + 1
⎪
⎪
⎪
⎩ 1
2 + n1 , n = 3k + 2
It holds
5 1 5
lim a3k = lim − =
k→∞ k→∞ 2 3k 2
3k + 1
lim a3k+1 = lim =1
k→∞ k→∞ 3k + 2
1 1 1
lim a3k+2 = lim + =
k→∞ k→∞ 2 3k + 2 2
and these are obviously all convergent subsequence of (an )n∈N . Hence (an )n∈N has
3 accumulation points. Further
656
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 677
3 5 1 5
0≤ ≤ − ≤ ≤ 3,
2 2 n 2
n
0≤ ≤1≤3
n+1
and
1 1 1 3
0≤ ≤ + ≤ ≤3
2 2 n 2
implying that sup{an |n ∈ N} = a1 = 3 and inf{an |n ∈ N} = a2 = 0. Note that in
our case the supremum is a maximum and the infimum is a minimum.
Of course we expect
5
lim sup an = lim a3k =
n→∞ k→∞ 2
and
1
lim inf an = lim a3k+2 = .
n→∞ k→∞ 2
Here comes the proof:
For n ≥ 3:
⎧ 5
⎪
⎪ 2 − n1 , n = 3k
⎪
⎪
⎨
5 1
sup{a | ≥ h} = 2 − n+1 , n = 3k + 1
⎪
⎪
⎪
⎪
⎩ 5 1
2 − n+2 , n = 3k + 1
which implies
5
lim (sup{a | ≥ n}) = ,
n→∞ 2
i.e.
5
lim sup an = .
n→∞ 2
657
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 678
A COURSE IN ANALYSIS
implying
1
lim (inf{a | ≥ n}) =
n→∞ 2
or
1
lim inf an = .
n→∞ 2
Since each sequence converges or diverges to −∞, the lim inf and lim sup is in each
case the limit.
10. Once Corollary 19.23 is at our disposal, the problem is trivial. Here we provide a
solution using the same idea as in the first part of Example 19.17. Clearly a is an
upper bound of {an |n ∈ N} and therefore sup{an |n ∈ N} ≤ a. Given > 0 we
can find N () such that a − an < for all n ≥ N (), note that since an ≤ a we do
not need to use the absolute value in this estimate. Thus for n ≥ N () we have
a − < an , implying that a − cannot be an upper bound.
11. We may assume that a is finite, for a = +∞ the statement is trivial. Suppose that
for some > 0 there exists infinitely many anl , l ∈ N, such that anl ≥ a + . Then
all accumulation points of the sequence (anl )l∈N are greater or equal to a + > a.
Hence (an )n∈N has a subsequence converging to a point larger than its limit superior
which is of course a contradiction.
12. Let a = lim supn→∞ an and b = lim supn→∞ bn . For > 0 we have an < a + and
bn < b + for all but finitely many n ∈ N. This implies λan < λa + λ for all but
finitely many n ∈ N and an + bn < a + b + 2 for all but finitely many n ∈ N, which
gives a) and b) respectively.
Now we apply b) to the sequences (an + bn )n∈N and (bn )n∈N to find with (19.20)
which yields
lim sup(an + bn ) ≥ lim sup an + lim inf bn ,
n→∞ n→∞ n→∞
658
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 679
holds for all sequences converging to x ∈ [a, b], xn ∈ [a, b]. Consider the function
-
1, x≥0
g(x) =
−1, x < 0.
g(x)
0 x
−1
1
1 1
For the sequence it holds that lim
2n n∈N = 0 and lim g = 1 =
n→∞ 2n n→∞ 2n
−1 −1
g(0). For the sequence 2n+1 it holds that lim g = −1 = g(0).
n∈(N ) n→∞ 2n + 1
Obviously, g is discontinuous.
659
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 680
A COURSE IN ANALYSIS
i.e. f is continuous at 0.
6. a) Suppose that f is increasing, the decreasing case goes analogously. Since
xν > x0 and lim xν = x0 it follows that x0 < xν ≤ xN , xN := max{xν |ν ∈
ν→∞
N}. Consequently (f (xν ))ν∈N is bounded from below by f (x0 ) and from above by
f (xN ). By the Bolzano-Weierstrass theorem (f (xν ))ν∈N has at least one converging
subsequence.
Let f (xνl1 ) and f (xνk2 ) be two converging subsequences. We want to
l∈N k∈N
660
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 681
show that they have the same limit. For νl1 there exists νk(l)
2
such that xνl1 ≥ xνk(l)
2
2 1
and for νk(l) exists νm(k) such that xνk(l)
2 ≥ xνm(k)
1 implying that
f (xνl1 ) ≥ f (xνk(l)
2 ) ≥ f (xνm(k)
1 )
and therefore lim f (xνl1 ) = lim f (xνk2 ). Thus all subsequences of (f (xν ))ν∈N con-
l→∞ k→∞
verge to the same limit.
b) We suppose again that f is monotone increasing. Let x0 ∈ I and (xν )ν∈N ,
xν ∈ I, xν > x0 , be a sequence converging from the right to x0 and (yν )ν∈N , yν ∈
I, yν < x0 , be a sequence converging from the left to x0 . Part a) implies that
(f (xν ))ν∈N has a limit f (x0 +) and (f (yν ))ν∈N has a limit f (x0 −). By monotonicity,
if f (x0 +) = f (x0 −) then both must coincide with f (x0 ). Denote by D(f, I) the set
of all points of discontinuity which f has in I. It follows that
For every x ∈ D(f, I) exists a rational number r(x) such that f (x−) < r(x) <
f (x+). The mapping x → r(x) maps D(f, I) injectively to Q, hence D(f, I) must
be denumerable.
7. Given f : I → R a monotone function and denote by D(f, I) the denumerable set
of its discontinuities, which we write as a monotone sequence x1 < x2 < x3 · · · Now
we define h : I → R as follows
-
f (x), x ∈ I\D(f, I)
h(x) =
f (xk +), xk ∈ D(f, I)
⏐ ⏐
Clearly f ⏐I\D(f,I) = h ⏐I\D(f,I) , so f and h coincide outside a countable set and
further lim h(x) = f (xk −) exists. Finally we have
x→xk ,x<xk
661
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 682
A COURSE IN ANALYSIS
and
f− (x) = − min(f (x), 0)
This implies immediately with part a) that for a continuous function f the function
f+ and f− are continuous. Now
if f (x) ≥ 0 then f+ (x) = f (x) and f− (x) = 0, and if f (x) ≤ 0 then −f− (x) =
f (x) = min(f (x), 0), and f+ (x) = 0. Thus in each case we get
This decomposition also implies that if f+ and f− are continuous then f is contin-
uous.
Finally we note
- -
f (x), f (x) ≥ 0 f+ (x), f (x) ≥ 0
|f (x)| = =
−f (x), f (x) ≤ 0 f− (x), f (x) ≤ 0
662
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 683
and the continuity of g follows with arguments similar to those given above. For
x ∈ (0, a] the function x → g(x) = f (x) − f (0) is continuous since f is. For
x ∈ [−a, 0) the function x → g(x) = −f (−x) + f (0) is continuous as composition
and sum of continuous functions.
For x = 0 we find
lim g(x) = lim (f (x)− f (0)) = 0 = lim (−f (−x)+ f (0)) = lim g(x)
x→0,x>0 x→0,x>0 x→0,x<0 x→0,x<0
g(x) = x
x0 x0 = h(x0 )
x0 x
663
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 684
A COURSE IN ANALYSIS
g(x0 ) = h(x0 ) + x0 − a
g(x0 ) = h(x0 ) + x0 − a = x0 .
13. a) We apply Theorem 20.14 to the function g − f . For h(x) := g(x) − f (x)
defined on [a, b] we have h(a) = g(a) − f (a) > 0 and h(b) = g(b) − f (b) < 0 implying
that for some x0 ∈ (a, b) it holds that h(x0 ) = 0 or g(x0 ) = f (x0 ).
1 π 3π
b) We apply part a) to g(x) = sin x and f (x) = 2+cos 4 x defined on [ 2 , 2 ].
14. a) Below is a picture of An = 12 n1 + n+1 1
, 12 n1 − n−1
1
1 1 1 1 1 1
2(n + n+1 ) 2(n + n−1 )
| ( | ) |
1 1 ↑ 1
n+1 n An n−1
This gives
( | )( | )( | )( | )( | )
1 1 1 1 1
n+2 n+1 n n−1 n−2
1
Aν0 ∪ Aν1 ∪ · · · ∪ AνN −1 ⊃ { | n ∈ N} ∪ {0},
n
664
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 685
3
x− 3 x x+ 4N 1
4N
The points 0, N1 , · · · , NN−1 , 1 give a partition of (0,1) and the distance of two neigh-
bouring points is N1 . Therefore, for N k
and k+1
N we find
4k + 1 4k + 3
U k ∩ U k+1 = , = ∅
N N 4N 4N
and
N
−3 4N + 3
(0, 1) ⊂ Uk = , .
N 4N 4N
k=0
Thus, (U k )k=0,··· ,N is indeed a finite subcovering of (0, 1). However, since (0, 1) is
N
open it cannot be compact. Finding a finite subcovering for a special open covering
is of course not sufficient for compactness.
17. a) For x ∈ K exists δx > 0 such that for y ∈ K and |x − y| < δx it follows that
|f (y) − f (x)| < f (x)
4 . The family of intervals (x − δx , x + δx ), x ∈ K, forms an open
665
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 686
A COURSE IN ANALYSIS
3f (xj )
0 < min < f (y).
1≤j≤N 4
Thus, given
√ > 0 choose δ = min(δ1 , ) to find for all x, y ∈ [−a, ∞) that |x− y| < δ
√
implies | x + a − y + a| < , i.e. f is uniformly continuous on [−a, ∞).
19. By uniform continuity of g, given > 0 there exists δ > 0 such that |f (x)−f (y)| ≤
for all x, y ∈ [a, b] such that |x − y| < δ. Now let a = x0 < x1 < · · · < xn = b be a
partition of [a, b] such that |xk −xk−1 | ≤ δ for k = 1, · · · , n. We define ϕ : [a, b] → R
as follows: on [xk , xk−1 ] we set
666
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 687
we restrict x, y to D ⊂ D, given > 0 we may still work with the same δ > 0 to
get the uniform continuity of f |D .
b) Since lim g(x) = A exists, we can define g̃ : [a, b] → R by
x→a,x>a
-
g(x), x ∈ (a, b]
g̃ :=
A, x = a.
f (x) − f (x0 )
a=
x − x0
and
f (x0 )x − f (x)x0
b= ,
x − x0
i.e.
f (x) − f (x0 ) f (x0 )x − f (x)x0
Sx0 ,x (t) = t+ .
x − x0 x − x0
The tangent line through (x0 , f (x0 )) is given by gx0 (t) = αt+β with gx0 (x0 ) = f (x0 )
and gx 0 (x0 ) = f (x0 ). This implies gx0 (t) = f (x0 )t + f (x0 ) − f (x0 )x0 = f (x0 )(t −
x0 ) + f (x0 ). Now we find
2. We use mathematical induction. For k = 1 the statement is just the well known
Leibniz’s rule
(f · g) (x) = f (x)g(x) + f (x)g (x)
Now suppose that
k
dk k (k−l)
(f · g)(x) = f (x)g (l) (x)
dxk l
l=0
667
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 688
A COURSE IN ANALYSIS
and consider
k
dk+1 d k (k−l)
(∗) (f · g)(x) = f (x)g (l) (x)
dxk+1 dx l
l=0
k
k d (k−l)
= (f (x) · g (l) (x))
l dx
l=0
k " #
k
= f (k+1−l) (x)g (l) (x) + f (k−l) (x)g (l+1) (x) .
l
l=0
The last term we now handle is the analogous term in the proof of the binomial
theorem, Theorem 3.9.
k k
k (k+1−l) k (k−l)
f (x)g (l) + f (x)g (l+1) (x)
l l
l=0 l=0
k
k (k+1−l)
= f (k+1) (x)g(x) + f (x)g (l) (x)
l
l=1
k
k−1
+ f (k−l) (x)g (l+1) (x) + f (x)g (k+1) (x)
l
l=0
k
k (k+1−l)
= f (k+1) (x)g(x) + f (x)g (l) (x)
l
l=1
k
k
+ f (k−(l−1)) (x)g (l) (x) + f (x)g (k+1) (x)
l−1
l=1
k
k + 1 (k+1) k k
= f (x)g(x) + + f (k+1−l) (x)g (l) (x)
0 l l+1
l=1
k+1 (l)
+ f (x)g (x)
k+1
k + 1
k+1
(∗∗) = f (k+1−l) (x)g (l) (x)
l
l=0
k k
where we used Lemma 3.8, i.e. k+1l = l−1 + l . Thus the general Leibniz’s rule
is proved by combining (∗) and (∗∗).
3. We need to prove for f, g ∈ C k (I) and λ, μ ∈ R that λf + μg ∈ C k (I) and f · g ∈
d df dg
C k (I). Now the linearity of the derivative, i.e. dx (λf + μg) = λ dx + μ dx implies
immediately the linearity of higher derivative, l ≤ k
l−1
dl d d
(λf + μg) = (λf + μg)
dxl dx dxl−1
668
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 689
d dl−1 dl−1
= λ l−1 f + μ l−1 g
dx dx dx
dl f dl g
l
=λ
+μ l,
dx dx
where as the general Leibniz’s rule, see Problem 2, yields for l ≤ j
l
dl
(f · g) = (lj )f (l−j) g (j)
dxl j=0
dl
and it follows dxl
(f g) ∈ C k−l (I).
4. For x = 0 and x = 1 the function is obviously differentiable. For being also
differentiable at x0 = 0 and x1 = 1 the function must be at these points continuous
and the right and the left derivative, i.e.
f (x) − f (x̃) f (x) − f (x̃)
lim and lim
x→x̃,x>x̃ x − x̃ x→x̃,x<x̃ x − x̃
must exist and coincide, x̃ ∈ {x0 , x1 }.
This yields
a · 0 + b = c · 02 + d · 0
a = 2c · 0 + d
1
c+d =1−
1
1
2c · 1 + d = 2
1
or b = 0, a = d, c + d = 0, 2c + d = 1 i.e. a = −1, b = 0, c = 1, d = −1.
5. Using the law of the logarithms
ln(a · b) = ln a + ln b
we find first
n
n
ln fk (x) = ln fk (x),
k=1 k=1
and consequently
n
n
ln fk (x) = ln fk (x)
k=1 k=1
which yields
n n
( k=1 fk (x)) fk (x)
n = ,
k=1 fk (x) fk (x)
k=1
g
where we used (ln g) = g .
669
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 690
A COURSE IN ANALYSIS
f (x) − f (0)
f (0) = lim = lim |x|h(x) = 0,
x→0 x−0 x→0
670
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 691
i.e. for all x0 ∈ R we have f (x0 ) = −f (−x0 ) which means that f is odd. Now, if
g : R → R is an odd function it follows for all x0 ∈ R that
g(x) − g(x0 )
g (x0 ) = lim
x→x0 x − x0
−g(−x) − (−g(−x0 ))
= lim
x→x0 x − x0
−g(y) + g(y0 )
= lim
y→y0 −y − (−y0 )
−(g(y) − g(y0 ))
= lim
y→y0 −(y − y0 )
g(y) − g(y0 )
= lim = g (y0 ) = g (−x0 ),
y→y0 y − y0
i.e. g is even.
b) Since f is a-periodic, we have
d d 2
(f (x) · f (x + a)) = f (x) = 2f (x)f (x)
dx dx
and
d
f (x)f (x + a) = f (x)f (x + a) + f (x)f (x + a),
dx
= f (x)f (x) + f (x)f (x + a)
or
f (x)f (x) = f (x)f (x + a).
By assumption, we have f (x) = 0, so we deduce for all x ∈ R, f (x) = f (x + a), i.e.
f is a-periodic too. Note that the assumption f (x) = 0 for x can be reduced when
assuming that f is continuous. In this case it would be sufficient for f (x) = 0 for
all x ∈ R\Q, or more generally for a dense set in R, a notion we will discuss later
on.
9. For |x| < 1 we find
N
1 − xN +1
xk =
1−x
k=0
which implies by differentiation
N
1 N xN +1 − (N + 1)xN
kxk−1 = 2
+ ,
(1 − x) (1 − x)2
k=1
671
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 692
A COURSE IN ANALYSIS
or
N
x N xN +2 − (N + 1)xN +1
kxk = + ,
(1 − x)2 (1 − x)2
k=1
and this identity holds also for x = 0. Hence for all |x| < 1 we find, since in this
case lim N xN +l = 0, l = 1, 2, · · · , that
N →∞
∞
N
x
kxk = lim kxk = .
N →∞ (1 − x)2
k=1 k=1
d 1 1 3
(1 + x2 )− 2 = − (2x)(1 + x2 )− 2
dx 2
−x
= 3
(1 + x2 ) 2
dk 1 Pk (x)
and P1 (x) = −x. Now suppose that dxk (1 + x2 )− 2 = 2k+1 and that the
(1+x2 ) 2
d 1 d 1
= Pk (x) 2k+1 + Pk (x) 2k+1
dx 2
(1 + x ) 2 dx (1 + x2 ) 2
Pk (x)(1 + x2 ) + Pk (x)(− 2k+1
2 )(2x)
= 2(k+1)+1
(1 + x2 ) 2
Since Pk (x) has degree less or equal to k, Pk (x) has degree less or equal to k − 1
and this implies that (1 + x2 )Pk (x) as well as xPk (x) has degree less or equal to
k + 1.
k
Now, if Pk (x) has degree less or equal to k, then we know that |Pk (x)| ≤ Ck (1+x2 ) 2 ,
which implies
k
dk 1 |Pk (x)| (1 + x2 ) 2
| k
(1 + x2 )− 2 | ≤ 2k+1 ≤ Ck 2k+1
dx (1 + x2 ) 2 (1 + x2 ) 2
672
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 693
1
= Ck k+1 .
(1 + x2 ) 2
b) Let f ∈ Cbm (R). First we note that for l ≤ m, since |f (l) (x)| ≤ Ml for some
Ml , it follows that
1
|f (l) (1 + x2 )− 2 | ≤ Ml .
1
Next, with g(x) = (1 + x2 )− 2 we find with part a),
kj
(j) kj 1 1
|g (x)| ≤ γj,kj 1 j
(1 + x2 ) 2 (1 + x2 ) 2
1 1
≤ γj,kj 1 jkj .
(1 + x2 ) 2 (1 + x2 ) 2
k1 km
1 g (1) (x) g (m) (x)
≤| cm,k1 ,··· ,kn f (k) ((1 + x2 )− 2 ) ···
1! m!
≤ ˜m,k1 ,··· ,km |g (1) (x)k1 | · · · |g (m) (x)|km
1 1
≤C 1
(1 + x2 ) 2 (1 + x2 ) jkj
1
=C m+1 .
(1 + x2 ) 2
Chapter 22
1. We may first apply Rolle’s theorem to ⏐ f which yields⏐the existence of x0 ∈ (a, b)
⏐ ⏐
such that f (x0 ) = 0. Now consider
⏐ f
⏐ 0 ] and f [x0 ,b] . Since f (a) = 0 and
[a,x
⏐ ⏐
f (b) = 0, both functions f [a,x0 ] and f [x0 ,b] satisfy the assumption of Rolle’s
theorem. Thus there exists x1 ∈ (a, x0 ) and x2 ∈ (x0 , b), hence x1 = x2 , such that
f (x1 ) = f (x2 ) = 0
2. By our assumption, we find for x, y ∈ (a, b), x = y, that
f (x) − f (y)
| | ≤ c|x − y|α ,
x−y
which implies that f is differentiable at x and
f (x) − f (y)
f (x) = lim =0
y→x x−y
for all x ∈ (a, b) which yields that f must be constant on (a.b). Clearly f is
continuous on [a, b], hence f is constant on [a,b].
673
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 694
A COURSE IN ANALYSIS
Remark: The condition |f (x) − f (y)| ≤ C|x − y|β , β > 0, is called the Hölder
condition. For β = 1 we recover the Lipschitz condition. The result proved
above says that if f is Hölder continuous, i.e. satisfies the Hölder condition with
exponent (Hölder exponent) β > 1, then f is constant. For 0 < β < 1 there are
non-trivial functions satisfying the Hölder condition.
3. Let x, y ∈ (a, b), x < y. We can find ξ ∈ (x, y) such that f (x) − f (y) = f (ξ)(x − y)
which gives
|f (x) − f (y)| = |f (ξ)||x − y| ≤ M |x − y|,
i.e. f is Lipschitz continuous and by Problem 19 in Chapter 20, f is uniformly
continuous.
4. We consider the function y → ln(1 + y) as the interval [0, xq ] and [ xq , xp ] for x > 0
and apply in both intervals the mean value theorem. Thus using for 0 < y1 < y2
the formula
1
ln(1 + y2 ) − ln(1 + y1 ) = (y2 − y1 ), ξ ∈ (y1 , y2 )
1+ξ
we find
x x 1 x x
ln 1 + = ln 1 + − ln 1 = , 0 < ξ0 <
q q 1 + ξ0 q q
and
x x 1 x x x x
ln 1 + − ln 1 + = − , < ξ1 < .
p q 1 + ξ1 p q q p
1 1
Thus, since 1+ξ0 > 1+ξ1we obtain
ln 1 + xq ln 1 + xp − ln 1 + xq
x > x x ,
q p − q
implying
x x x x x x
− ln 1 + > ln 1 + − ln 1 + ,
p q q q p q
which gives
x x x x
ln 1 + > ln 1 +
p q q p
or
x x
q ln 1 + > p ln 1 + ,
q p
i.e. p q
x x
ln 1 + < ln 1 +
p q
and this implies of course
p q
x x
1+ < 1+ .
p q
674
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 695
eαx
lim = ∞.
x→∞ xβ
= exp(0) = 1,
where we used
1
ln x x
lim x ln x = lim 1 = lim = lim (−x) = 0.
x→0 x→0 x→0 − 12 x→0
x x
7. We write x
a x ln f √ax
f √ =e
x
and note that x
a x ln f √ax
lim f √ = lim e
x→∞,x>0 x x→∞,x>0
675
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 696
A COURSE IN ANALYSIS
a
= exp lim x ln f √
x→∞,x>0 x
and further, by applying l’Hospital’s rule twice
√
a ln f (a y)
lim x ln f √ = lim
x→∞,x>0 x y→0 y
√
af (a y)
= lim √ √
y→0,y>0 2 yf (a y)
√
a2 f (a y) a2
= lim √ √ √ =− ,
y→0,y>0 2f (a y) + 2a yf (a y) 2
where we used f (0) = 1, f (0) = 0 and f (0) = −1. Hence we arrive at
x
a a2
lim f √ = e− 2 .
x→∞,x>0 x
8. Let x ∈ (a, b) and h > 0 such that x + h ∈ (a, b). It follows that f (x + h) − f (x) ≤ 0
and therefore
f (x + h) − f (x)
f (x) = lim ≤ 0.
h→0 h
9. Clearly for k = 0 we have e−at ≥ 0. Moreover for k ∈ N we find
dk −at
(e ) = (−a)k e−at = (−1)k ak e−at
dtk
implying that (−1)k (−1)k ak e−at = ak e−at ≥ 0, i.e. t → e−at , a > 0, is com-
pletely monotone.
Now, 1 − e−at , a > 0, t ≥ 0, is always non-negative and for k ∈ N we find
dk dk
k
1 − e−at = − k e−at
dx dx
k
d −at
and by the previous result follows that (−1)k dx k (1 − e ) ≤ 0.
k
d d
Finally, for α = 1 we have t ≥ 0 for t > 0 and dt (t) = 1 ≥ 0,, as well as dtk (t) = 0
for k ≥ 2. Thus t → t is a Bernstein function. For 0 < α < 1 we find first tα > 0
for α > 0 and t > 0 and further
dk α
(t ) = α(α − 1)(α − 2) · · · (α − k)tα−k
dtk
= α|α − 1||α − 2| · · · |α − k|(−1)k−1 tα−k ,
and we arrive at
dk α
(−1)k (t ) = (−1)α|α − 1||α − 2| · · · |α − k|tα−k ≤ 0.
dtk
676
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 697
10. a) We have
1
1 3 −x
f (x) =
9 (x2 (1 − x)) 13
provided x = 0 and x = 1. Thus f vanishes at x0 = 13 and for 0 < x < 13 we have
f (x) > 0 and for 13 < x < 1 we have f (x) < 0, i.e. approaching from the left of
the point x0 = 13 the function f is strictly increasing, and for x > 13 the function is
strictly decreasing, hence we must have a local maximum at x0 = 13 and the value
1
is f 13 = 433 .
For x = 0 and x = 1 the function f is not differentiable. For x ∈ (0, 1) we have
f (x) > 0 and further f (x) < 0 for x < 0, hence there is no local extreme value at
x = 0. However for x = 1 we find that f (1) = 0 and f (x) > 0 for x > 1 as well
as for x ∈ (0, 1), hence there is a local minimum at x = 1, see J. Kaczar and M.T.
Nowak [6, p. 298].
b) The function f has only strictly positive values and lim f (x) = lim f (x) =
x→∞ x→−∞
0. For x = 0 and x = 1 the function is not differentiable. Thus, to find the maxi-
mum we have to look at (−∞, 0), (0, 1) and (1, ∞) for a local maximum and compare
with f (0) = 32 and f (1) = 32 .
Now for x < 0 we find
1 1 1 1
f (x) = + = +
1−x 1−x+1 1−x 2−x
1 1
and f (x) = (1−x) 2 + (2−x)2 > 0, implying that f is on (−∞, 0) strictly increasing,
3
hence f (x) < 2 for x ∈ (−∞, 0).
For 0 < x < 1 we find
1 1
f (x) = +
1+x 2−x
and
1 1
f (x) = − 2
+
(1 + x) (2 − x)2
1
implying f 2 = 0. Since
2 2
f (x) = +
(1 + x)3 (2 − x)3
and therefore f 12 > 0, we find that f has a local minimum at 12 .
Finally, for x > 1 we have
1 1 1 1
f (x) = + = +
1+x 1+x−1 1+x x
and
1 1
f (x) = − − 2
(1 − x)2 x
Thus on (1, ∞) the function f is strictly decreasing. It follows that the global
maximum of f is 32 and it is attained at two points x0 = 0 and x1 = 1.
677
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 698
A COURSE IN ANALYSIS
provided g (x0 ) = 0.
√
With g(x) = 1 − x2 , x ∈ (−1, 1), we find
x
g (x) = − √ = 0
1 − x2
for x = 0. Thus
−x0 t x2
gx0 (t) = + 1 − x20 + 0 2
1 − x20 1 − x0
−x0 t + 1 − x20 + x20 −x0 t + 1
= =
1 − x20 1 − x20
and for x0 = 0
1 − x20 x0 1 − x20
nx0 (t) = t + 1 − x20 −
x0 x0
1 − x20
= t.
x0
For x0 = 0 the normal line is of course the abscissa. Since g is the upper half circle,
we expect the centre of curvature to be for all x0 ∈ (−1, 1) the origin. In general
we have for c = (c1 , c2 )
1 + g 2 (x0 )
c1 = x0 − g (x0 )
g (x0 )
and
1 + g 2 (x0 )
c2 = g(x0 ) + .
g (x0 )
−1
Since g (x0 ) = 3 we find
(1−x20 ) 2
x20 1
1 + g 2 (x0 ) = 1 + = ,
1 − x20 1 − x20
3
1 + g 2 (x0 ) (1 − x20 ) 2
= − =− 1 − x20 ,
g (x0 ) 1 − x20
678
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 699
1 + g 2 (x0 ) x0
g (x0 ) = − . − 1 − x2
0 = x0 ,
g (x0 ) 1 − x20
and consequently
c1 = x0 − x0 = 0
as well as
c2 = 1 − x20 − 1 − x20 = 0.
Finally, as radius of curvature we find
3 3
1 + g 2 (x0 ) 2 1 (1 − x20 ) 2
r= = 3 . = 1,
|g (x0 )| (1 − x20 ) 2 1
as we shall expect: the circle of curvature of a circle is the circle itself.
g̃x0
ñx0
−1 0 x0 1 x
679
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 700
A COURSE IN ANALYSIS
and
1 + f 2 (x0 )
c2 = f (x0 ) + ,
f (x0 )
and since
1 + f 2 (x0 ) 1 + x40
=
f (x0 ) 2x0
we get
1 x40 + 1 2x4 + x40 + 1 3x4 + 1
c1 = x0 + 2 . = 0 3 = 0 4 ,
x0 2x0 2x0 2x0
1 1 + x40 3 + x40
c2 = + = .
x0 2x0 2x0
Chapter 23
1. For m = 2 the statement is f (λ1 x1 + λ2 x2 ) ≤ λ1 f (x1 )+ λ2 f (x2 ), λ1 , λ2 ∈ [0, 1], λ1 +
λ2 = 1. Thus with λ := λ1 and λ2 := 1 − λ we recover the definition of convexity.
Now suppose (23.12) holds for some m ≥ 2. We want to prove that it also holds
for m + 1. For this, take points x1 , · · · , xm+1 ∈ I and λ1 , · · · , λm+1 ∈ [0, 1] with
m+1
j=1 λj = 1. Since for λm + λm+1 > 0
λm λm+1
λm xm + λm+1 xm+1 = (λm + λm+1 ) xm + xm+1 = λ̃m x̃n ,
λm + λm+1 λm + λm+1
2. We⏐ will prove more, namely that if I has end points a < b and a < a1 , b1 < b then
f ⏐[a ,b ] is Lipschitz continuous.
1 1
680
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 701
and
f (x) − f (y1 ) f (y) − f (x)
≤ ,
x − y1 y−x
hence
f (y1 ) − f (x1 ) f (y) − f (x)
≤ .
y1 − x1 y−x
Applying Lemma 23.4 once more first to x, y, x2 and then to y, x2 , y2 we arrive at
we first find |y − x0 | = δ, hence f (x0 ) ≤ f (y), and using the convexity of f we find
δ δ
f (x0 ) ≤ f (y) ≤ f (x) + 1 − f (x0 ),
|x − x0 | |x − x0 |
681
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 702
A COURSE IN ANALYSIS
i.e. x
1 xn 1
ln + ···+ ≥ ln(x1 . · · · .xn )
n n n
which yields
n
n
n1
1
xk ≥ xk .
n
k=1 k=1
b) Since
d2 1
(x ln x) = > 0
dx2 x
for x ∈ (0, ∞), we note that f is convex, and consequentially by convexity
x+y x+y x y
ln ≤ ln x + ln y,
2 2 2 2
or
x+y
(x + y) ln ≤ x ln x + y ln y.
2
3
e2
−1 0 +1 x
Now, for x ∈ [−1, 0] and 1 ≤ a ≤ 32 we have eax ≤ ex and for x ∈ [0, 1] and
3
1 ≤ a ≤ 32 we find eax ≤ e 2 x implying
- 3
e 2 x , x ∈ [0, 1]
sup fa (x) =
a∈[1, 32 ] ex , x ∈ [−1, 0].
682
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 703
d(x, y) = 0 then ||x − y||k = 0 for all k ∈ N, hence, since ||.||k is a norm, x = y.
Since for every norm ||x − y|| = ||y − x|| holds we also find that d is symmetric i.e.
d(x, y) = d(y, x).
1
Moreover, the monotonicity of f (t) = 1+t , t ≥ 0, implies
and it follows
∞
1
d(x, y) = f (||x − y||k )
2k
k=1
∞
1
≤ f (||x − z||k + ||z − y||k )
2k
k=1
∞
1 ||x − z||k + ||z − y||k
=
2k 1 + ||x − z||k + ||z − y||k
k=1
∞
∞
1 ||x − z||k 1 ||z − y||k
= +
2k 1 + ||x − z||k + ||z − y||k 2k 1 + ||x − z||k + ||z − y||k
k=1 k=1
∞
∞
1 ||z − y||k
1 ||x − z||k
≤ k
+
2 1 + ||x − z||k 2k 1 + ||z − y||k
k=1 k=1
683
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 704
A COURSE IN ANALYSIS
2
≥ 1 + ||y + z|| = 1 + ||y + z||,
√ √ √
where for the last estimate we need a+b ≤ a + b for a, b ≥ 0. Thus with
z = x − y we get
or
1 + ||x||
≤ 2(1 + ||x − y||).
1 + ||y||
and x = 0 implies x(1) + x(2) = 0, i.e. x(1) = 0 and x(2) = 0 implying
x = 0.
x + y = x + y(1) + x + y(2)
≤ x(1) + y(1) + x(2) + y(2)
= x(1) + x(2) + y(1) + y(2)
= x + y.
684
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 705
is obvious.
10. First we recall (xk )k∈N converges to x in .p if for every ε > 0 there exists N (ε)
1/p
n (j) (j) p
such that k ≥ N implies xk − xp = j=1 |xk − x | < ε which implies
(j) (j) (j)
immediately that k ≥ N yields xk − x < for j = 1, . . . , n, i.e. xk
k∈N
converges
to x(j) . Conversely, suppose that for every j = 1, . . . , n the sequence
(j)
xk converges to x(j) . Given > 0 we can find N () such that for j = 1, . . . , n
k∈N
(j)
we have that k ≥ N () implies for all j = 1, . . . , n that xk − x(j) < 1 which
np
gives
⎛ ⎞1
n p p
(j)
||xk − x||p = ⎝ xk − x(j) ⎠ < .
j=1
685
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 706
A COURSE IN ANALYSIS
11. Since lim ||xk − x||p = 0 we have that for every > 0 there exists N () ∈ N such
k→∞
that k ≥ N () implies ||xk − x||p < c . Consequentially given > 0 and N () chosen
as above we find
||xk − x|| ≤ c||xk − x||p < c · = ,
c
i.e. lim ||xk − x|| = 0.
k→∞
Chapter 24
1. For x ∈ R fixed the sequence (gn (x))n∈N converges clearly to 0. However sup |g2n (x)| =
x∈R
∞, thus we cannot expect ||gn − 0||∞ = ||gn ||∞ converging to 0, and therefore the
convergence is not uniform.
2. Once we have proved that f is the pointwise limit of (fn )n∈N we have also shown
that the convergence cannot be uniform. Each fn is continuous, f is not. But the
uniform limit of continuous function must be continuous.
1
Now, for x = 0 we have fn (0) = 12 , hence lim fn (0) = . If x = 0 then (nx − 1)2
n→∞ 2
1
diverges to +∞ and hence lim fn (x) − lim = 0.
n→∞ n→∞ 1 + (nx − 1)2
686
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 707
c)
Since continuous and arctan 0 = 0 we find for each x ∈ R that
arctan is
4x
lim arctan = 0. The mean value theorem implies
n→∞ x2 + n4
| arctan z − arctan y| ≤ |z − y|
d
since dx (arctan x) = 1+x
1
2 ≤ 1. Consequently we have
4x
sup arctan 2 − 0 ≤ sup 4x − 0 ≤ sup 4|x| = sup 4x .
x∈R x +n 4 2
x∈R x + n
4 x∈R x2 + n4 2
x>0 x + n
4
4x 4n2
But sup = which tends to 0 as n → ∞, thus (hn )n∈N , hn (x) =
x>0 x2+n 4 16n4 + n4
arctan x24x
+n4 , converges on R uniformly to 0.
1
d) Since | cos an x| ≤ 1 and lim α = 0 we find
n→∞ n
1
sup | cos(an x) − 0| = sup | cos(an x)| ≤ ,
x∈R x∈R nα
and once again we have uniform convergence.
4. Let x, y ∈ I, x < y. It follows that
5. Since for ck there exists a sequence (ck,n )n∈N of rational numbers ck,n converging to
ck , given > 0 we can find N () ∈ N such that n ≥ N () implies |ck − cn,k | < N +1 .
This implies since 0 ≤ x ≤ 1
N
|p(x) − pn (x)| = | (ck − cn,k )xk |
k=0
N
≤ |ck − cn,k |xk
k=0
N
< = ,
N +1
k=0
687
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 708
A COURSE IN ANALYSIS
= f (ξn ) − f (x)
for some ξn ∈ x1 , x + n1 , or
Now we use the uniform continuity of f : For > 0 we can find δ > 0 such that
|y − z| < δ implies |f (y) − f (z)| < . For δ we may find N ∈ N, N = N (), such
that n ≥ N () implies n1 < δ, and consequently for n ≥ N ()
1
sup |fn (x) − 0| = sup |fn (x)| = ,
x∈[−1,1] x∈[−1,1] 2n
and we obtain uniform convergence. Now for x = 0 we have fn (0) = 1 for all n,
whereas for x ∈ [−1, 1]\{0} we have
1 − n2 x2
lim = 0.
n→∞ (1 − n2 x2 )2
Since the pointwise limit is not continuous, the convergence of the derivative cannot
be uniform.
Chapter 25
1. Let ϕ ∈ T [a, b] be given with respect to the partition Zϕ (x0 , . . . , xn ) and ψ ∈
T [a, b] with respect to the partition Zψ (t0 , . . . , tm ). Denote the joint partition by
Z = Zϕ ∪ Zψ , Z = Z(y0 , . . . , yk ). For 1 ≤ l ≤ k it follows that ϕ|(yl−1 ,yl ) = cl and
ψ|(yl−1 ,yl ) = dl for some cl , dl ∈ R. Consequently (ϕ · ψ)|(yl−1 ,yl ) = cl · dl and hence
ϕ · ψ ∈ T [a, b].
688
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 709
2. Since f is Riemann integrable, given > 0 there exists step functions ϕ, ψ ∈ T [a, b]
b
such that ϕ ≤ f ≤ ψ and a (ψ − ϕ)dx < . Let ϕ be given with respect to Zϕ
and ψ with respect to Zψ . Let Z = Zϕ ∪ Zψ be the joint partition and Z̃ :=
Z ∪ {y1 , . . . , yN }. With respect to Z̃ we define two step functions
-
cj , x ∈ {y1 , . . . , yN }
ϕ̃(x) =
ϕ(x), x ∈ [a, b] \ {y1 , . . . , yN }
and -
cj , x ∈ {y1 , . . . , yN }
ψ̃(x) =
ψ(x), x ∈ [a, b] \ {y1 , . . . , yN }.
The latter equality follows when using Z̃ to calculate both integrals. Note that
for a step function represented with respect to Z = Z(t0 , . . . , tm ) the values at tj ,
1 ≤ j ≤ m, do not contribute to the integral.
Clearly g|(a,c) = 1 and g|(c,b) = 0 are continuous and the one-sided limits exist.
However at c we have
Therefore for the piecewise continuous function f : [a, b] → R there exists a partition
Z(x0 , . . . , xn ) of [a, b] such that f |(xk−1 ,xk ) is continuous with continuous extension
f¯k : [xk−1 , xk ] → R.
689
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 710
A COURSE IN ANALYSIS
¯ ¯
xk fk there exists step functions ϕk , ψk ∈ T [a, b] such that ϕk ≤ fk ≤ ψk and
For
xk−1 (ψk − ϕk )(x)dx < n . We define the step functions
-
f (xj ), x = xj , j = 0, . . . , n
ϕ(x) :=
ϕk (x), x ∈ (xk−1 , xk ), k = 1, . . . , n
and -
f (xj ), x = xj , j = 0, . . . , n
ψ(x) :=
ψk (x), x ∈ (xk−1 , xk ), k = 1, . . . , n.
It follows that ϕ ≤ f ≤ ψ and
b n
xk
(ψ − ϕ)(x)dx = (ψ − ϕ)(x)dx
a k=1 xk−1
n
xk n
= (ψk − ϕk )(x)dx < = ,
xk−1 n
k=1 k=1
690
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 711
b
which is a contradiction. It is now easy to show that ||f ||L1 = a |f (x)|dx is a norm
on C([a, b]). We need to prove:
i) ||f ||L1 ≥ 0 and ||f ||L1 = 0 if and only if f = 0, i.e. f is constant and has the
value 0;
ii) ||λf ||L1 = |λ|||f ||L1 ;
iii) ||f + g||L1 ≤ ||f ||L1 + ||g||L1 .
b b
Clearly a |f (x)|dx ≥ 0 and a |f (x)|dx = 0 if and only if f = 0 has just been
proved above. For λ ∈ R we have
b
b
||λf ||L1 = |λf (x)|dx = |λ| |f (x)|dx = |λ|||f ||L1 ,
a a
By Theorem 25.24 for > 0 and any partition Z(x0 , . . . , xn ) with mesh size less
than δ = δ() and points ξj ∈ [xj−1 , xj ] we have
b
(∗∗) S(f ) − f (x)dx <
a
where S(f ) denotes the Riemann sum for f with respect to Z and ξ1 , . . . , ξn . The
mesh size of Zn is
(n) (n) j j−1 b−a
xj − xj−1 = a + (b − a) − a − (b − a) = .
n n n
Hence, given > 0 we determine N ∈ N such that for n ≥ N it follows that b−a
n <δ
and now (∗∗) implies (∗).
b) Since
n
(n) b − a
Sn (f ) = f (xj )
j=1
n
and since
b
lim Sn (f ) = f (x)dx
n→∞ a
it follows that
⎛ ⎞
n
b
1 (n) 1 1 b
lim ⎝ f (xj )⎠ = lim Sn (f ) = f (x)dx = − f (x)dx.
n→∞ n j=1 n→∞ b − a b−a a a
691
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 712
A COURSE IN ANALYSIS
n
n
p1 n
q1
p q
|αk βk | ≤ |αk | |βk | .
k=1 k=1 k=1
(n) (n)
Now let Zn = (x0 , . . . , xn ) be a sequence of partitions of [a, b] whose mesh
size converges to 0. For the corresponding Riemann sums of f · g we find with
(n) (n)
ξj ∈ [xj−1 , xj ]
n
(n) (n) (n) (n)
|f (ξj )g(ξj )(xj − xj−1 )|
j=1
n
(n) (n) (n) 1 (n) (n) (n) 1
= |f (ξj )(xj − xj−1 )| p |g(xj )|q (xj − xj−1 )| q
j=1
⎛ ⎞ p1 ⎛ ⎞ 1q
n
n
(n) (n) (n) (n) (n) (n)
≤⎝ |f (ξj )|p (xj − xj−1 )⎠ ⎝ |g(ξj )|q (xj − xj−1 )⎠ .
j=1 j=1
p1
q1
b b b
p q
|f (x)g(x)|dx ≤ |f (x)| dx |g(x)| dx .
a a a
1 1
Note that if we agree that q = ∞ is the conjugate of p = 1, i.e. p + q = 1, then we
find
b
b
|f (x)g(x)|dx ≤ sup |g(x)| |f (x)|dx,
a x∈[a,b] a
or
b
|f (x)g(x)|dx ≤ ||f ||L1 ||g||∞ .
a
692
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 713
12
12
b b b
(∗) |f (x)g(x)|dx ≤ |f (x)|2 dx |g(x)|2 dx .
a a a
√
Further for A, B > 0, noting that A · B = 2A · √1 B, we have
2
2
1 √ 1 1 1 2
(∗∗) AB ≤ ( 2A)2 + B = A2 + B .
2 2 2 4
12
b
The result follows by applying (∗∗) to (∗) with A = a
|f (x)|2 dx and B =
12
b 2
a |g(x)| dx .
and
2
b b b
2 2
f (x) cos kx dx ≤ f (x) dx cos kx dx .
a a a
11. Let Z = Z(x0 , . . . , xn ) be a partition of [a, b] and consider the two Riemannian
sums
n n
h(xj )(xj − xj−1 ) and f (h(xj−1 ))(xj − xj−1 ).
j=1 j=1
693
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 714
A COURSE IN ANALYSIS
x −x
We set λj := j b−aj−1 and therefore 0 ≤ λj ≤ 1 and λ1 + · · · + λn = 1. By Problem
1 in Chapter 23 we find
⎛ ⎞ ⎛ ⎞
n n
1
f⎝ h(xj )(xj − xj−1 )⎠ = f ⎝ λj h(xj )⎠
b − a j=1 j=1
n
≤ λj f (h(xj ))
j=1
n
1
f (h(xj ))(xj − xj−1 ).
b − a j=1
If we replace now Z by a sequence (Zk )k∈N of partitions such that Zk+1 is a refine-
ment of Zk and for the mesh sizes we have η(Zk ) → 0 as k → ∞, then it follows
from
⎛ ⎞
nk nk
1 1
f⎝ h(xj )(xj − xj−1 )⎠ ≤ f (h(xj ))(xj − xj−1 )
b − a j=1 b − a j=1
and the continuity of f , recall that convex functions on an interval are continuous
in the interior, see Corollary 23.6, that
b
b
1 1
f h(t)dt ≤ f (h(t))dt.
b−a a b−a a
2n
1 1
2n n
First let us show that gn is continuous. Clearly gn [0, 1
( 1 1
)∪( n1 ,0] is continuous.
2n )∪ 2n , n
1
If x0 = 2n we have
lim gn (x) = lim1 4n2 x = 2n
1
x→ 2n x→ 2n
1 1
x< 2n x< 2n
and
lim gn (x) = lim1 (−4n2 x + 4n) = 2n,
1
x→ 2n x→ 2n
1 1
x> 2n x> 2n
694
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 715
1 1
i.e. gn is continuous at x0 = . At x0 = we find
2n n
and
lim gn (x) = lim1 0 = 0,
1
x→ n x→ n
1 1
x> n x> n
implying gn is continuous on [0, 1]. As indicated in the hint the integral is the area
of the triangle (0, 0), ( n1 , 0), ( 2n
1
, 2n), hence
1
1 1
gn (x)dx = · · 2n = 1.
0 2 n
thus
1
lim gn (x)dx = 1.
n→∞ 0
lim gn (x) = 0.
n→∞
For x = 0 or x = 1 this follows from the definition. Now let x ∈ (0, 1). Since x > 0
it follows that for some N it holds x < N1 and now, for n ≥ N it follows gn (x) = 0
1
implying that limn→∞ gn (x) = 0. Since 0 0dx = 0 we find in this case
1
1
1 = lim gn (x)dx = lim gn (x)dx = 0.
n→∞ 0 0
13. Since (fn )n∈N converges uniformly to f , given > 0 there exists N = N () ∈ N
such that x ∈ [a, b] and n ≥ N () implies |fn (x) − f (x)| < 2(b−a) . For any n we
have
fn − |fn − f | ≤ f ≤ fn + |fn − f |.
Now, since fn is Riemann integrable there exists step functions ϕn , ψn ∈ T [a, b]
b
such that ϕn ≤ fn ≤ ψn and a (ψn − ϕn )dx < 2 . Hence for n ≥ N () we find with
the step functions ϕn − 2(b−a) and ψn + 2(b−a) that
ϕn − ≤ f ≤ ψn +
2(b − a) 2(b − a)
and
b
b
ψn + − ϕn + dx = (ψn − ϕn )dx + < .
a 2(b − a) 2(b − a) a 2
695
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 716
A COURSE IN ANALYSIS
Hence we have proved that the uniform limit of a sequence of Riemann integrable
functions is Riemann integrable. Now it follows that
b
b b
f (x)dx − fn (x)dx = (f (x) − fn (x))dx
a a a
b
≤ |f (x) − fn (x)|dx ≤ (b − a)||fn − f ||∞
a
implying
b
b
lim fn (x)dx = f (x)dx.
n→∞ a a
Chapter 26
1. a) Let F be a primitive of f , i.e.
x
F (x) = f (t)dt + c.
a
Since F (x) = f (x) by Theorem 26.1, if f ∈ C k ([a, b]) then F ∈ C k ([a, b]) and F
is (k + 1)-times continuously differentiable.
b) Recall that if V is a R vector space a set Wa = a + W , a ∈ V and W ⊂ V
a subspace, is called an affine subspace of V . The dimension of Wa is that of W .
Clearly the constant function fc : [a, b] → R, fc (x) = c, forms a one-dimensional
subspace of C([a, b]), a basis for example is given by f1 , f1 (x) = 1. If f ∈ C([a, b])
then the set of all its primitives is given by
x
g : [a, b] → R| g(x) = f (t)dt + fc , c ∈ R
a
x
or with W := {fc |c ∈ R} ⊂ C 1 ([a, b]) and F ∈ C 1 ([a, b]), F (x) := a
f (t)dt, the set
of all primitives of f is the affine subspace F + W ⊂ C 1 ([a, b]).
2. Note that nothing is claimed about the existence of a fixed point. The statement
is that if T has a fixed point then the fixed point must belong to C ∞ ([a, b]).
Now, by Theorem 26.1 we have that T f is differentiable and (T f ) (x) = e−x f (x).
This implies that for a fixed point T g(x) = g(x) that g is in C 1 , i.e. a continuously
differentiable function. Therefore t → e−t g(t) is a C 1 function implying that T g
is a C 2 function. By induction it follows that if g = T g and g ∈ C k ([a, b]) then
g ∈ C k+1 ([a, b]) and therefore a fixed point belongs to C ∞ ([a, b]).
3. a) Let I1 = [a1 , b1 ) and I2 = [a2 , b2 ) and assume that a1 ≤ a2 . If b1 < a2
then I1 ∪ I2 is the union of two disjoint intervals. In the case that a2 ≤ b1 then
we either have I2 ⊂ I1 , namely if b2 ≤ b1 , hence I1 ∪ I2 = I1 , or, if b1 ≤ b2
then I1 ∪ I2 = [a1 , b2 ). Now, for finitely many right half-open intervals I1 , . . . , IN ,
Ij = [aj , bj ) we proceed by induction. The case N = 2 has just been proved.
We assume that I1 ∪ · · · ∪ IN −1 is the union of mutually disjoint right half-open
696
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 717
Now we claim that if for a continuous function g : [a, b] → R we have for all
β
α, β ∈ [a, b], α < β that α g(t)dt = 0 then g(t) = 0 for all t. Indeed, take t0 ∈ [a, b]
and h > 0 such that a ≤ t0 < t < t0 + h ≤ b, to find by our assumptions and by
the mean value theorem
t0 +h
0= g(t)dt = g(ξh )h, ξh ∈ [t0 , t0 + h].
t0
This implies that g(ξh ) = 0 and since limh→0 ξh = t0 the continuity of g implies that
g(t0 ) = 0. Therefore we deduce that f (−x)+ f (x) = 0 for all x, i.e. f (−x) = −f (x)
which implies that f is odd.
697
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 718
A COURSE IN ANALYSIS
5. a) We note that
y
y
y ρ − xρ = ρ tρ−1 dt ≤ ρ 1 dt = ρ(y − x).
x x
b) Since
y
cos t dt = sin y − sin x
x
6. For F we have
x
y
F (x) − F (y) = f (t)dt − f (t)dt
ax a
= f (t)dt
y
which implies
x
x
|F (x) − F (y)| = f (t)dt ≤ ||f ||∞
1dt = ||f ||∞ |x − y|.
y y
7. a) Since (f · g)(x) = 0 for all x ∈ [a, b], but f and g are not both zero, it follows
that f ⊥ g.
b) The product of an odd function and an even function
a is odd. For any odd
function h : [−a, a] → R we have, see Proposition 26.7.B, −a h(t)dt = 0.
c) Let g, h ∈ C([a, b]) such that f ⊥ g and f ⊥ h. For λ, μ ∈ R we find
b
b
b
f (x)(λg(x) + μh(x))dx = λ f (x)g(x)dx + μ f (x)h(x)dx = 0,
a a a
8. We start with
b
b
1 d 2 1 1
0= f (x)f (x)dx = (f (x))dx = f 2 (b) − f 2 (a),
a a 2 dx 2 2
698
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 719
||f ||L2 ≥ 0,
the fact that f (y) ≥ 0 for all y and β (x) ≥ 0 whereas α (x) ≤ 0, which follows
from the fact that β is increasing and α is decreasing, we find G (x) ≥ 0, hence G
is increasing.
11. For > 0 we can find N0 ∈ N such that for n, m ≥ N0 it follows that
|fn (x0 ) − fm (x0 )| <
2
and
|fn (t) − fm
(t)| < for all t ∈ [a, b].
2(b − a)
We now apply the mean value theorem to fn − fm to find
|x − t|
|(fn − fm )(x) − (fn − fm )(t)| < ≤
2(b − a) 2
|fn (x) − fm (x)| ≤ |(fn − fm )(x) − (fn − fm )(x0 )| + |fn (x0 ) − fm (x0 )| < ,
or n, m ≥ N0 implies
||fn − fm ||∞ < ,
i.e. (fn )n∈N is a Cauchy sequence with respect to || · ||∞ . Therefore it has a limit
f which is a continuous function f . Moreover fn converges pointwise to f . Denote
by f ∗ the uniform limit of (fn )n∈N . It follows that
x
fn (x) = fn (x0 ) + fn (t)dt
x0
699
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 720
A COURSE IN ANALYSIS
1 − xN +1
SN (x) =
1−x
implying that
1 − (N + 1)xN + N xN +1
SN (x) = .
(1 − x)2
∞ k 1
Moreover, for |x| < 1 we have k=0 x = 1−x , so we need to prove that for
[a, b] ⊂ (−1, 1)
1 1
sup SN (x) −
1 − x and sup SN (x) − (1 − x)2
x∈[a,b] x∈[a,b]
both tend to 0.
1 1
Denote by κ1 := supx∈[a,b] 1−x < ∞ and κ2 := supx∈[a,b] (1−x) 2 < ∞. It follows
that
N +1
SN (x) − 1 = |x| ≤ κ1 max(|a|N +1 , |b|N +1 )
1−x 1−x
hence
1
sup SN (x) − ≤ κ1 max(|a|N +1 , |b|N +1 )
x∈[a,b] 1 − x
1
and since |a| < 1 and |b| < 1 the uniform convergence of SN (x) to 1−x is proved.
Further
1 (N + 1)|x|N + N |x|N +1
SN (x) − =
(1 − x)
2 (1 − x)2
≤ κ2 (2N + 1) max(|a|N , |b|N ),
where we used that for |y| < 1 we have |y|N +1 < |y|N . Since limN →∞ (2N +1)|y|N =
1
0 for |y| < 1 it also follows that SN (x) converges uniformly to (1−x) 2 . Therefore we
have
∞
1 d k d
2
= x = lim SN (x)
(1 − x) dx dx N →∞
k=1
N ∞
d
= lim SN (x) = lim kxk−1 = kxk−1 .
N →∞ dx N →∞
k=1 k=1
700
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 721
1
For x = m, m ≥ 2, we find
∞ k−1 ∞
1 k 1 m2
k =m k
= 2 = 2 ,
m m 1− m1 m −1
k=1 k=1
∞
k m
i.e. k
= 2 .
m m −1
k=1
Chapter 27
1. For x ∈ (−∞, 0) ∪ (0, c) ∪ (c, ∞) the function uc is clearly differentiable. At x = 0
we find -
x
uc (x) − uc (0) , x>0
= 4
x−0 0, c ≤ x ≤ 0
implying
uc (x) − uc (0)
lim = 0,
x→0 x−0
and at x = c we have
-
uc (x) − uc (c) 0, c≥x≤0
= (x−c)
x−c − 4 , x ≤ c,
which yields
uc (x) − uc (c)
lim = 0,
x−c
x→0
i.e.
uc (x) = |uc (x)|.
Next we observe that uc (2) = 1 for all c < 0, implying that for all c < 0 a solution
to
v (x) = |v(x)|, v(2) = 1
is given by uc . Hence we have existence but not uniqueness.
2. The calculation is simple and goes as follows:
701
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 722
A COURSE IN ANALYSIS
u(x) = kx2
1
− = 2t + c
y
1
which gives y(t) = − 2t+c . Adjusting the initial value requires
1
y(0) = − = −1,
c
1
implying that y(t) = − 2t+1 is a candidate for a solution. We find y(0) = −1 and
further
2
y (t) = = 2y 2 (t).
(2t + 1)2
702
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 723
ϕ(s)
c) The differential equation ϕ (s) = tan s leads to
or
x5 = cos r + r sin r + c,
which yields
1
x(r) = (cos r + r sin r + c) 5 ,
and x( π2 ) = 1 implies
π π π 1
1 = (cos + sin + c) 5
2 2 2
which is solved by c = 1 − π2 . An easy calculation now shows that x(r) = (cos r +
1
r sin r + 1 − π2 ) 5 indeed solves the initial value problem: x( π2 ) = (cos π2 + π2 sin π2 +
1
1 − π2 ) 2 = 1, and
d π 1
(cos r + r sin r + 1 − ) 5
dr 2
1 π −4
= (cos r + r sin r + 1 − ) 5 (− sin r + sin r + r cos r)
5 2
1 π −4
= (cos r + r sin r + 1 − ) 5 (r cos r),
5 2
703
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 724
A COURSE IN ANALYSIS
or
b)
u(x)
d 1 1 1
dt = u (x) − v (x).
dx v(x) 1 + t2 1 + u2 (x) 1 + v 2 (x)
6. We have
u(x)
d
h(t)dt = h(u(x))u (x) − h(−u(x))(−u(x))
dx −u(x)
compare with Proposition 26.7.B, and therefore we must have for all x ∈ R that
u(x)
−u(x) h(t)dt = 0.
1
7. Since u2k ≥ 0 it follows from u = 1+u 2k that u is strictly monotone increasing, and
since u(0) = 1 we deduce that on [0, ∞) the function u is positive. Further we have
704
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 725
since u2k−1 (x) > 0 (which follows from u being strictly positive). Hence u is convex.
The fact that u is an arbitrarily often differentiable function follows as discussed at
the end of Chapter 27: we know that
1
u = g1 (u), g1 (t) =
1 + t2k
and
2kt2k−1
u = g2 (u), g2 (t) = .
(1 + t2k )3
Now we claim that u(n) = gn (u) with an arbitrarily often differentiable function gn .
For n = 1 (and n = 2) we know the result. Now if u(n) = gn (u) then
d
u(n+1) = gn (u) = gn (u) · u
dx
= gn (u)g1 (u)
Consequently we have
-
b 1
dx 1−α (b − a)1−α − 1−α
=
a− (x − a)α ln(b − a) − ln ,
705
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 726
A COURSE IN ANALYSIS
1 √1
For 0 < ≤ x ≤ 1 we have √ ≤ x
and therefore
x(2−x)
1
1
dx dx √
0≤ ≤ √ = 2 − 2 ,
x(2 − x) x
1
implying the convergence of the first integral. For 1 ≤ x ≤ 2 − we find √ ≤
x(2−x)
√1 which yields
2−x
2−
2−
dx dx √
0≤ ≤ √ = 2 − 2 .
1 x(2 − x) 1 2−x
2 dx
and hence the second integral converges too, i.e. 0
√ converges.
x(2−x)
The first integralconverges if and only if α > −1 but in this case the second integral
∞
diverges. Hence 0 xα dx will never converge.
d) A primitive of g(x) = e−ax cos(wx) is the function
e−ax
G(x) = − (a cos(wx) − w sin(wx))
a2 + w 2
and therefore
R
e−ax cos(wx)dx = G(R) − G(0).
0
2. First we note
β
|f (r)| (1 + r2 ) 2 2 β−α
α ≤ c0 α = c0 (1 + r )
2 .
2
(1 + r ) 2 (1 + r2 ) 2
It follows that
R
R
|f (r)| 1
α dr ≤ c0 α−β dr
0 (1 + r2 ) 2 0 (1 + r2 ) 2
1
R
1 1
= c0 α−β dr + c0 α−β dr,
0 (1 + r2 ) 2 1 (1 + r2 ) 2
706
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 727
and clearly the first integral on the right hand side exists for all α and β. If r ≥ 1
1 1
and β < α then α−β ≤ α−β α−β and since
(1+r 2 ) 2 2 2 r 2
R
R
−α+β 1 1−α+β
lim r dr = lim r
R→∞ 1 R→∞ 1−α+β 1
1 R1−α+β
= + lim
α − 1 − β R→∞ 1 − α + β
exists only for 1 − α + β < 0, i.e. β + 1 < α, it follows that for β + 1 < α the integral
∞ f (r)
0 (1+r 2 ) α
2
dr converges absolutely. Now if f is a polynomial of degree m we know
m ∞ f (r)
that |f (r)| ≤ c0 (1 + r2 ) 2 and therefore for m + 1 < α the integral 0 α dr
2 2 (1+r )
converges absolutely in this case. In the case where m + 1 ≥ α the integral must
diverge. We may assume that f (r) ≥ 0 for r ≥ R0 , otherwise we switch to −f .
From Example 11.4 we know that
f (r)
lim =1
r→∞ am r m
when am > 0 is the leading coefficient of f (r). Thus we can find R1 ≥ R0 such that
f (r)
r ≥ R1 implies am rm − 1 < 12 , or a2m rm ≤ f (r). Since for m + 1 ≥ α the integral
∞ am r m ∞ f (r)
α dr diverges it follows that α dr diverges.
R1 2 2
2(1+r ) 0 2 2 (1+r )
3. For k = 0 we find
1
1 0!
(1 − x)α dx = = .
0 α+1 α+1
Assuming that
1
k!
xk (1 − x)α dx =
0 (α + 1)(α + 2) · . . . · (α + k + 1)
we find when integrating by parts
1
1
d (1 − x)α+1
xk+1 (1 − x)α dx = xk+1 − dx
0 0 dx α+1
1
k+1
= xk (1 − x)α+1 dx
α+1 0
(k + 1) k!
= · ,
α + 1 (α + 2)(α + 3) · . . . · (α + k + 2)
where we have used that the boundary terms
1
(1 − x)α+1
xk+1 −
α+1 0
vanish.
707
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 728
A COURSE IN ANALYSIS
2 + a2
dx ≤ 2 + a2
dx ≤ c 0 x− 2 dx = 2c0 (1 − R− 2 )
1 x 1 x 1
∞ ln x
implying the convergence of 0 x2 +a2 dx.
5. For the first part we observe that since g is continuous and g(0) = 0 for some
η > 0 we have g(x) = 0 for x ∈ (−η, η). We may assume that g > 0 in (−η, η)
and consequently
there exists 0 < m ≤ M such that 0 < m ≤ g(x) ≤ M for
x ∈ − η2 , η2 . This implies for 0 < < η2 that
η
η
2 g(x) 2 1 η
dx ≥ m dx = m(ln − ln )
x x 2
η
g(x)
2 1
and therefore lim dx, and hence 0 g(x) x dx does not exist. The second
→0 x
integral goes analogously. Note that now we have x < 0, and since
−
−
g(x) (−g(x))
dx = − dx
− η2 x − η2 x
the estimate
− −
(−g(x)) 1
dx ≤ −M dx
− η2 x − η2 x
yields for → 0 the divergence of this integral, and hence the divergence of
0 g(x) g(x)
−1 x dx. Since g is even the function x → x , x = 0, is odd and therefore
−
1
g(x) g(x)
dx = − dx,
−1 x x
708
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 729
implying that
−
1
g(x) g(x)
(∗) lim dx + dx = 0.
→0 −1 x x
1 g(x)
Clearly, (∗) does not imply −1 x dx = 0 since we know that the latter integral
does not exist.
6. Suppose that α > 1 and limx→∞ xα f (x) = c0 . It follows that there exists R > 0
such that x ≥ R implies
|xα f (x)| − |c0 | ≤ |xα f (x) − c0 | < 1,
or
1 + |c0 |
|f (x)| ≤ ,
xα
R
implying for α > 1 the convergence of 1 |f (x)|dx and hence the convergence of
∞
0 |f (x)|dx.
Now suppose for c0 = 0 and α ≤ 1 that lim xα f (x) = c0 . We consider the
x→∞
case c0 > 0, the case c0 < 0 goes analogously. The existence of the limit implies
xα f (x) ≥ 0 for x ≥ R0 , i.e. f (x) ≥ 0 for x ≥ R0 , and consequently we can find
R1 ≥ R0 such that x ≥ R1 implies
c0
c0 − xα f (x) ≤ |c0 − xα f (x)| < ,
2
or for x ≥ R1
c0
≤ f (x)
2xα
implying
∞
∞
c0
α
dx ≤ f (x)dx,
R1 2x R1
but for α ≤ 1 the integral on the left hand side diverges. Note that in the second
case c0 = ∞ is allowed. Clearly, we can also apply these criteria to continuous
functions f : [a, ∞) → R.
ln x
7. a) Since lim x = ∞, by the second case in Problem 6 the integral
x→∞ 1+x
diverges.
b) Here we have two boundary points which can cause potential problems and
therefore we split the integral as follows:
∞
π
∞
1 − cos y 1 − cos y 1 − cos y
2
dy = 2
dy + dy.
0 y 0 y π y2
1 − cos y 1
Since lim = (use the rules of l’Hospital), it turns out that the first
y→0 y2 2
integral is a Riemann integral and not an improper integral. For the second integral
we observe that
3 1 − cos y
lim y 2 = 0,
y→∞ y2
709
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 730
A COURSE IN ANALYSIS
and the first part of Problem 6 gives the convergence of the integral.
c) The substitution t → −s gives
−1
∞
et e−s
dt = − ds
−∞ t 1 s
2 e−s −1et
and we need only to note that lim s = 0 to deduce that −∞ t dt con-
s→∞ s
verges.
Taking into account that sin kπ = 0 as well as | sin x| = | sin(x+π)|, the substitution
x = t + nπ yields
(n+1)π
π
sin x sin t
dx = dt.
x
nπ 0 t + nπ
1 1
Since for 0 ≤ t ≤ π it follows that t+nπ ≥ (n+1)π we find
π
π
sin t 1 2
dt ≥ sin tdt = ,
0 t + nπ (n + 1)π 0 π(n + 1)
which implies
∞ ∞
∞ sin x 2 21
x dx ≥ π(n + 1)
=
π n=1 n
,
0 n=0
∞ ∞ sin x
and since the series 1
diverges we have proved that the integral dx
n=1 n 0 x
diverges.
f (x) c0
9. Since lim = c0 > 0, for 2 > 0 there exists δ > 0 such that 0 < x − a < δ
x→a g(x)
implies
f (x) c0
g(x) − c0 < 2 ,
c0 f (x) 3c0
or 2 < g(x) < 2 , i.e.
c0 3c0
g(x) ≤ f (x) ≤ g(x).
2 2
This yields
a+δ
a+δ
a+δ
c0 3c0
g(x)dx ≤ f (x)dx ≤ g(x)dx,
2 a a 2 a
710
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 731
b b a+δ b
implying that f (x)dx exists if and only if a g(x)dx = a g(x)dx+ a+δ f (x)dx
a
f (x)
exists. In the case where lim = 0 we can still find for > 0 some δ > 0 such
x→a g(x)
that for 0 < x − a < δ it follows that
f (x) ≤ g(x),
b b a+δ b
implying that a f (x)dx converges if a g(x)dx = a g(x)dx + a+δ g(x)dx con-
f (x)
verges. Now, if lim = ∞ then for R > 0 there exists δ > 0 such that
x→a g(x)
Since x > 0 it follows that x − 1 > −1 and consequently, see Example 28.3,
12
lim tx−1 (1 − t)y−1 dt
→0
1
exists, implying the convergence of B(x, y) = 0 tx−1 (1 − t)y−1 dt. Substituting t
by 1 − s we find
0
1
B(x, y) = − (1 − s)x−1 sy−1 ds = sy−1 (1 − s)x−1 ds = B(y, x).
1 0
711
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 732
A COURSE IN ANALYSIS
This calculation has however a problem: we have not proved the substitution rule
for improper integrals. Thus we should start with
1−
1−
x−1 y−1 x−1 y−1
t (1 − t) dt = − (1 − s) s ds = sy−1 (1 − s)x−1 ds
1−
712
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 733
Chapter 29
1
1. For x = 0 we have 1+x4 < 1 and consequently
∞
∞
1 1
gn (x) = x4 4 )n
= x4 1
n=0 n=0
(1 + x 1 − 1+x 4
= 1 + x4 , x = 0.
However, for x = 0 we have gn (x) = 0 for all n, thus ∞ n=0 gn (0) = 0. If follows
that
∞ ∞
lim gn (x) = 1 = gn (0),
x→0
n=0 n=0
∞
i.e. ∞continuous for x = 0. Since all functions gn are continuous
n=0 gn (x) is not
the convergence of n=0 gn (x) cannot be uniform on any interval containing 0.
2. a) Since | sin kx| ≤ 1 for all x ∈ R and k ∈ N0 it follows that sinkαkx ≤ k1α and
∞ 1 ∞ sin kx
for α > 1 the series α=0 kα converges, hence α=0 kα converges absolutely
and uniformly.
n
b) We observe that for |x| ≤ 1 we have x 3 ≤ 13 and the convergence of
∞ 1 n2 n2
∞ xn
n=1 32 implies the absolute and uniform convergence of n=1 32 for |x| ≤ 1.
n n
1 1
∞ 1
c) Note that n2 +r 2 ≤ n2 for any r ∈ R and since n=1 n2 < ∞ it follows that
∞ 1
n=1 n2 +r 2 converges for all r ∈ R absolutely and uniformly.
3. For α = m ∈ N, m ≥ n, we find
n
m−k+1 m! m
= = .
k n!(m − n)! n
k=1
713
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 734
A COURSE IN ANALYSIS
α − k
Since lim = 1, we find for η such that |x| < η < 1 some N = N (η) with
k→∞ k + 1
the property that n ≥ N (η) implies ak+1
ak ≤ η < 1. Consequently, the series
∞
α k
x
k
k=0
∞ α k
converges for |x| < 1. It takes further effort to prove that k=0 k x is indeed the
Taylor series of gα , |x| < 1.
4. For N ∈ N we have
N
N
N
|(ak + bk )xk | ≤ |ak ||x|k + |bk ||x|k
k=0 k=0 k=0
∞ ∞
≤ |ak ||x|k + |bk ||x|k
k=0 k=0
as well as
N
N
∞
|(λak )xk | ≤ |λ| |ak ||x|k ≤ |λ| |ak ||x|k ,
k=0 k=0 k=0
which allows us in each case to pass to the limit as N → ∞. Once we have secured
absolute and uniform convergence, we may pass in the equalities
N
N
N
(ak + bk )xk = ak xk + bk xk
k=0 k=0 k=0
and
N
N
(λak )xk = λ ak xk
k=0 k=0
to the limit as N → ∞.
ex −e−x ex +e−x
5. We note that sinh x = 2 and cosh x = 2 , and therefore we find
∞ k
1 xk k
kx
sinh x = − (−1)
2 k! k!
k=0 k=0
∞ ∞
1 x2l
x2m−1
= +
2 (2l)! m=1 (2m − 1)!
l=0
∞ ∞
x2l x2m−1
2l
− (−1) − (−1)2m−1
(2l)! m=1 (2m − 1)!
l=0
∞
x2m−1
= ,
m=1
(2m − 1)!
714
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 735
and further
∞ ∞
1 xk k
kx
cosh x = + (−1)
2 k! k!
k=0 k=0
∞ ∞
1 x2l
x2m−1
= +
2 (2l)! m=1 (2m − 1)!
l=0
∞ ∞
x2l x2m−1
2l
+ (−1) + (−1)2m−1
(2l)! m=1 (2m − 1)!
l=0
∞
x2l
= .
(2l)!
l=0
7. For x ∈ R fixed we apply the ratio test to the series representing Jl (x) :
(−1)n+1 ( x2 )l+2(n+1) xl x2 x2n
(n+1)!(n+1+l)! 22 2l 22n (n+1)n!(n+1+l)(n+l)! x2 .
(−1)n ( x2 )l+2n = xl x2n
=
22 (n + 1)(n + 1 + l)
n!(n+l)!
2l 22n n!(n+l)!
Thus, in order to obtain the convergence of Jl (x) we need to assume that there
exists N ∈ N such that n ≥ N implies
x2
≤ τ < 1,
22 (n + 1)(n + 1 + l)
715
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 736
A COURSE IN ANALYSIS
x2 x2
≤ ,
22 (n + 1)(n + 1 + l) 2 2 n2
x2 1
thus if 22 n2 ≤ 4 (but any 0 < τ < 1 will do instead of 14 ) then we are done. Now
x2 1
≤ implies |x| ≤ n.
2 2 n2 4
x2 1
≤
22 (n + 1)(n + 1 + l) 4
∞
(−1)n xl+2n
Jl (x) =
n=0
2l+2n n!(n + l)!
∞
(−1)n (l + 2n)xl+2n−1
Jl (x) =
n=0
2l+2n n!(n + l)!
∞
(−1)n (l + 2n)(l + 2n − 1)xl+2n−2
Jl (x) =
n=0
2l+2n n!(n + l)!
∞ ∞
(−1)n xl+2n+2 (−1)n l2 xl+2n
(x2 − l2 )Jl (x) = l+2n
−
n=0
2 n!(n + l)! n=0 2l+2n n!(n + l)!
∞
(−1)n (l + 2n)xl+2n
xJl (x) =
n=0
2l+2n n!(n + l)!
∞
(−1)n (l + 2n)(l + 2n − 1)xl+2n
x2 Jl (x) =
n=0
2l+2n n!(n + l)!
716
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 737
∞
(−1)n xl+2n+2
=
n=0
2l+2n n!(n + l)!
∞ ∞
(−1)n xl+2n+2 (−1)n (4n(n + l))xl+2n
= +
n=0
2l+2n n!(n + l)! n=0 2l+2n n!(n + l)!
∞ ∞
(−1)n xl+2n+2 (−1)n 4xl+2n
= l+2n
+ l+2n
n=0
2 n!(n + l)! n=1 2 (n − 1)!(n + l − 1)!
∞
∞
(−1)n−1 xl+2n (−1)n 4xl+2n
= +
n=1
2l+2n−2 (n − 1)!(n − 1 + l)! n=1 2l+2n (n − 1)!(n + l − 1)!
∞
∞
(−1)n 4xl+2n (−1)n 4xl+2n
=− l+2n
+ l+2n
n=1
2 (n − 1)!(n − 1 + l)! n=1 2 (n − 1)!(n + l − 1)!
= 0.
∞
1
8. Since for |r| < 1 we have rn = we find with r = −t2
n=0
1−r
∞
∞
1 2 n
= (−t ) = (−1)n t2n .
1 + t2 n=0 n=0
x
∞
x
1
arctan x = dt = (−1)n t2n dt.
0 1 + t2 0 n=0
Since |x| < 1 implies |t| < 1 the series under the integral sign converges uniformly
and therefore we find by changing the order of summation and integration that
∞
x ∞
x2n+1
arctan x = (−1)n t2n dt = (−1)n .
n=0 0 n=0
2n + 1
717
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 738
A COURSE IN ANALYSIS
√
9. Since tan π6 = 3
3
we have by Problem 8
√ ∞
√ 2n+1
π 3 n 1 3
= arctan = (−1)
6 3 n=0
2n + 1 3
√ ∞ ∞
3 1 3n 1 (−1)n
= (−1)n = √ .
3 n=0 2n + 1 32n 3 n=0 (2n + 1)3n
∞
n
10. For n ∈ N0 we denote the nth partial sum of an by Sn := k=0 ak and further
m=0
∞
we set S−1 := 0, implying that an = Sn − Sn−1 and S := lim Sn = an . It
n→∞
∞ n=0
follows for |x| < 1 that by g : (−1, 1) → R, g(x) = n=0 an xn , a function is defined
which satisfies ∞
g(x) = (1 − x) Sn xn .
n=0
Now let > 0. Then there exists N = N () ∈ N such that n > N implies
∞
|S − Sn | < 2 . Further, since for |x| < 1 we have (1 − x) n=0 xn = 1, it follows for
0 < x < 1 that
∞
n
|g(x) − S| = (1 − x) (Sn − S)x
n=0
N
≤ (1 − x) |Sn − S| + .
n=0
2
Now, for this > 0 we can also find δ > 0 such that 1 − δ < x < 1 yields
N
N
(1 − x) |Sn − S| < δ |Sn − S| < ,
n=0 n=0
2
718
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 739
(−1)2l (−1)2l−1 2l + 1 − 2l
+ = .
2l 2l − 1 (2l)(2l − 1)
(−1)k∞
π
= arctan 1 = .
4 2k + 1
k=0
12. In both cases we use the Taylor formula with the Lagrange remainder term.
a) With some 0 < ϑ1 < 1 we have
x2 x3 x4 x5 1
ln(1 + x) = x − + − + ·
2 3 4 5 (1 + ϑ1 )5
x2 x3 x4
>x− + − ,
2 3 4
and for some 0 < ϑ2 < 1 we find
x2 x3 x4 1
ln(1 + x) = x − + − ·
2 3 4 (1 + ϑ2 x)4
x2 x3
<x− + .
2 3
√ x x2 x3 x4 1
1+x=1+ − + − ·
2 8 16 128 (1 + ϑ1 x) 72
x x2 x3
<1+ − + ,
2 8 16
and with some 0 < ϑ2 < 1 we get
√ x x2 x3 1
1+x=1+ − + ·
2 8 16 (1 + ϑ2 x) 52
x x2
>1+ − .
2 8
719
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 740
A COURSE IN ANALYSIS
therefore
2 ∞
∞
1
n m
= x x
1−x n=0 m=0
∞
= ck xk
k=0
where
k
ck = al bk−l
l=0
with al = 1, bl = 1 for all l, hence
k
ck = 1 = k + 1,
l=0
which implies
2 ∞
1
= (k + 1)xk .
1−x
k=0
b) We note that
∞ 2k
∞
∞
cos x 1 k x
= (cos x) = (−1) xl = ck xk
1−x 1−x (2k)! m=0
k=0 l=0
-
(−1)k
, n = 2k
where with an = (2k)! and bm = 1 it follows that
0, n = 2k − 1
k
k
ck = aj bk−j = aj .
j=0 j=0
For k = 2l we find
l
l
1
c2l = a2j = (−1)j
j=0 j=1
(2j)!
and for k = 2l + 1 we have
l
l
1
c2l+1 = a2j = (−1)j ,
j=0 j=1
(2j)!
720
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 741
However, Leibniz’s rule for higher order derivatives, see Corollary 21.12, gives
k
k (l)
(f · g)(k) (0) = f (0)g (k−l) (0)
l
l=0
and since
1 k 1 1
=
k! l l! (k − l)!
the result follows.
Chapter 30
1. Since
k3 − 1 (k − 1)(k 2 + k + 1)
3
=
k +1 (k + 1)(k 2 − k + 1)
(k − 1)((k + 1)2 − (k + 1) + 1)
=
(k + 1)(k 2 − k + 1)
we find for N ∈ N
N N
k3 − 1 (k − 1)((k + 1)2 − (k + 1) + 1)
3
=
k +1 (k + 1)(k 2 − k + 1)
k=2 k=2
N N
k − 1 (k + 1)2 − (k + 1) + 1
=
k+1 k2 − k + 1
k=2 k=2
2 (N + 1)2 − (N + 1) + 1
= ·
((N − 1) + 1)(N + 1) 4−2+1
2(N 2 + N + 1)
= .
3N (N + 1)
Thus we have
∞
N
k3 − 1 k3 − 1 2(N 2 + N + 1) 2
= lim = lim = .
k 3 + 1 N →∞ k 3 + 1 N →∞ 3N (N + 1) 3
k=2 k=2
721
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 742
A COURSE IN ANALYSIS
b) We note that
1 (l + 1)2
1+ =
l(l + 2) l(l + 2)
and therefore
N
N
1 (l + 1)2
1+ =
l(l + 2) l(l + 2)
l=1 l=1
N
N
l+1l+1
=
l l+2
l=1 l=1
N +1 2 2(N + 1)
= · = ,
1 N +2 N +2
which yields
∞
N
1 1
1+ = lim 1+
l(l + 2) N →∞ l(l + 2)
l=1 l=1
2(N + 1)
= lim = 2.
N →∞ N +2
(∗) (1 − b1 ) · . . . · (1 − bn ) ≥ 1 − (b1 + · · · + bn ).
n
n
Pn = (1 − ak ) = PN −1 (1 − ak ),
k=1 k=N
or
n
n
Pn 1
= (1 − ak ) ≥ 1 − ak > ,
PN −1 2
k=N k=N
Pn
implying that is bounded from below and since for n > N we have 0 <
PN −1
Pn
1 − an < 1 it follows that PN −1
is also monotone decreasing, hence it has a
n∈N
722
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 743
k=N
n
and the divergence of ∞ a
k=1 k implies now that lim (1 − ak ) = 0, which yields
n→∞
∞ k=N
that k=1 (1 − ak ) diverges to 0.
∞want to use Lemma 30.5 and hence we need a control on ln(1 + ak ). Since
3. We
k=1 ak converges, hence lim ak = 0, there exists N ∈ N such that for k ≥ N
k→∞
we have |ak | < 12 . Now we apply the Taylor formula with Lagrange remainder, see
Theorem 29.14, to ln(1 + x), |x| < 12 , to find
x2 1
ln(1 + x) = x − , 0 < |ξ| < ,
2(1 + ξ)2 2
or
2 1
< < 2.
9 2(1 + ξ)2
a) From the considerations made above it follows that for k ≥ N
2
ln(1 + ak ) = ak − ϑk a2k , < ϑk < 2.
9
∞ ∞ 2
If ∞ 2 2
k=1 ak converges, then k=1 ϑk ak ≤ 2 it follows that ∞
k=1 ak and k=1 ln(1+
∞ ∞
ak ) converges. If however k=1 (1 + ak ) converges then k=1 ln(1 + ak ) converges,
∞
implying first the convergence of k=1 ϑk a2k and since 29 < ϑk the convergence of
∞ 2
k=1 ak follows.
∞
b) Now suppose that k=1 a2k diverges. From our previous considerations we
deduce for k ≥ N
2
ak − ln(1 + ak ) > a2k ,
9
723
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 744
A COURSE IN ANALYSIS
∞
and since lim |ak | = 0 it follows that k=1 ln(1 + ak ) must diverge to −∞. Con-
k→∞
∞ ∞
sequently k=1 (1 + a k ) diverges and conversely, the divergence of k=1 (1 + ak ),
∞ ∞
i.e. the divergence of k=1 (1 + ak ), implies the divergence of k=1 a2k .
∞ ∞
4. If k=1 (1+ak ) converges absolutely, then it converges and consequently k=1 ln(1+
ak ) converges. Moreover, we must have lim ak = 0 thus for some N ∈ N it follows
k→∞
N −1 ∞
that ak > −1 if k ≥ N . Since ∞ k=1 (1 + ak ) = k=1 (1 + ak ) k=N (1 + ak ),
and a finite rearrangement cannot change the value of the infinite ∞product, we may
assume∞that ak > −1 for all k ∈ N. In this case, with P = k=1 ∞(1 + ak ) and
S = k=1 ln(1 + ak ) we have P = exp(S). If we can show that k=1 ln(1 + ak )
converges absolutely, then we can rearrange the series without changing its value,
see Theorem 18.27. But ∞the equality P = exp(S) then implies that we can also
rearrange the product k=1 (1 + ak ) without changing its ∞ value. Thus it remains
to prove that the absolute convergence∞ of the product k=1 (1 + ak ) implies the
absolute convergence
∞ of the series k=1 ln(1 + a k ). From Proposition 30.10 we
deduce that k=1 ak converges absolutely. Moreover, since lim ak = 0 we find
k→∞
| ln(1 + ak )|
lim = 1,
k→∞ |ak |
or 12 ≤ | ln(1+a
|ak |
k )|
≤ 2 for k sufficiently large implying the absolute convergence of
∞
k=1 ln(1 + ak ).
b) First we observe that for x = 0 the product has the value 1 and the right
hand side converges for x → 0 to 1. Now, for x = 2k ( π2 + lπ) we have cos 2xk = 0 as
well as sin 2xk = 0. Using sin(2ϕ) = 2 sin ϕ cos ϕ we find
x
x 1 sin 2j−1
cos = ,
2j 2 sin 2xj
and consequently
N
N x
x 1 sin 2j−1 sin x
cos = x = N .
j=1
2 j
j=1
2 sin 2 j 2 sin 2xN
724
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 745
sin x
Since 2N sin 2xN = x x
2N
we eventually get
2N
∞
N x
x 1 sin 2j−1 sin x
cos = lim x = lim N
2j N →∞ 2 sin j N →∞ 2 sin xN
j=1 j=1 2 2
sin x sin 2xN sin x
= lim x = .
x N →∞ 2N x
π
Finally, for x = 2 we derive
∞
π sin π2 2
cos = π = .
j=1
2j+1 2 π
Chapter 31
1. From Theorem 31.12, the Legrendre duplication formula, we find for n ∈ N
√
1 πΓ(2n)
Γ n+ = 2n−1
2 2 Γ(n)
√
π(2n − 1)!
= n 1
4 · 2 (n − 1)!
√
π(2n)! n
= · 1
4n n! ·
2 2n
√
π(2n)!
= .
4n n!
2. Using the substitution r = st we find
∞
∞ α
r 1
tα e−st dt = e−r dr
0 0 s s
∞
1 Γ(α + 1)
= α+1 rα e−r dr = .
s 0 sα+1
Note that we applied the change of variable formula to an improper integral. Mean-
while we have seen several times, in particular in the context of the Γ-function, how
R
to derive a result as the above one by looking first at tα e−st dt and then passing
∞
to the limit. For a function f : (0, ∞) → R such that F (s) := 0 f (t)e−st dt exists
we call F the Laplace transform of f .
3. The substitution s = − ln t, i.e. t = e−s yields
1 x−1
1
1
ln dt = (− ln t)x−1 dt
0 t 0
0
∞
−s
= s x−1
e ds = sx−1 e−s ds
∞ 0
= Γ(x).
725
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 746
A COURSE IN ANALYSIS
3
1 √
For x = 2 we find using Γ 2 = π
1 1 √
1 2 3 1 1 π
ln dt = Γ = Γ = ,
0 t 2 2 2 2
1
and for x = 2 we find
1 − 12
1 1 √
ln dt = Γ = π.
0 t 2
we have
∞
1 1
− = 1,
k k+1
k=1
and we find
Γ (1) = −γ.
Note, that if we can justify
∞
d d x−1
Γ(x) = t x=1
e−t dt
dx x=1 0 dx
we would obtain
∞
(ln t)e−t dt = −γ.
0
Γ (1)
5. a) We again use formula (31.14) to get with ψ(1) = Γ(1) = −γ that
∞
1 1 1
ψ(x) − ψ(1) = − + −
x k k+x
k=1
∞
1 1
=− − .
k+x k+1
k=0
b) Since
Γ(x + n) = (x + n − 1)(x + n − 2) · . . . · xΓ(x),
726
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 747
we have
and therefore
d
ψ(n + x) = ln Γ(x + n)
dx
1 1 1 d
= + + ··· + + ln Γ(x)
x+n−1 x+n−2 x dx
1 1
= + ···+ + ψ(x).
x x+n−1
6. Starting with
1
B(x, y) = tx−1 (1 − t)y−1 dt,
0
s
the substitution t = 1+s yields
1
tx−1 (1 − t)y−1 dt
0
∞ y−1
sx−1 s 1
= 1− ds
0 (s + 1)x−1 1+s (1 + s)2
∞
sx−1 1 1
= ds
0 (s + 1)x−1 (s + 1)y−1 (1 + s)2
∞
sx−1
= ds.
0 (s + 1)x+y
Thus
Γ(6)Γ(1) 5!0! 1
B(6, 1) = = = .
Γ(7) 6! 6
Therefore we have proved that
∞
x5 1
dx = .
0 (1 + x)7 6
727
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 748
A COURSE IN ANALYSIS
8. We apply Theorem 31.2 and Corollary 31.3 in combination with the formula
Γ(x)Γ(y)
B(x, y) =
Γ(x + y)
to find
∞ ∞ y
e−γx e k e−γy e k
x
B(x, y) = ·
x 1 + xk y 1 + yk
k=1 k=1
∞
x+y (x+y)
· (x + y)eγ(x+y) 1+ e− k
k
k=1
∞
x+y 1 + x+y
= x
k y .
xy 1 + k 1+ k
k=1
Chapter 32
1. For the partition Z, where x0 = 0, xj = 1j , j ∈ N2k and x2k+1 = 1 we find that
when j = 2l is even a typical term in the variation sum is
1 1 1 1 1
|f (xj ) − f (xj−1 )| = cos lπ − cos l − π = =
j j−1 2 j 2l
and if j = 2l + 1 is odd
1 1 1 1 1
|f (xj ) − f (xj−1 )| = cos l + π− cos lπ = = .
j 2 j−1 j−1 2l
This yields
k
11
VZ (f ) =
2 l
l=1
728
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 749
3. Let A = inf |g| and Z(x0 , . . . , xn ) a partition of [a, b]. It follows that
1 1 |g(xk−1 ) − g(xk )|
−
g(xk ) g(xk−1 ) = |g(xk )||g(xk−1 )|
which implies
n
1 1 1
VZ = −
g g(xk ) g(xk−1 )
k=1
n
|g(xk−1 ) − g(xk )|
=
|g(xk )||g(xk−1 )|
k=1
n
1 1
≤ 2
|g(xk ) − g(xk−1 )| = 2 VZ (g)
A A
k=1
1 1
and taking the supremum over all partitions Z we arrive at V g ≤ A2 V (g).
4. For a partition Z(x0 , . . . , xn ) of [a, b] we find
xk xk
|F (xk ) − F (xk−1 )| = f (t)dt ≤ |f (t)|dt
xk−1 xk−1
and therefore
n
n
xk
b
VZ (F ) = |F (xk ) − F (xk−1 )| ≤ |f (t)|dt = |f (t)|dt,
k=1 k=1 xk−1 a
b b
i.e. VZ (F ) ≤ a |f (t)|dt for all partitions Z implying that V (F ) ≤ a |f (t)|dt. Now
we prove the converse inequality. Let mk := min{|f (t)||t ∈ [xk−1 , xk ]}. By the
mean value theorem for the Riemann integral there exists ξk ∈ [xk−1 , xk ] such that
F (xk ) − F (xk−1 ) = f (ξk )(xk − xk−1 )
implying
|F (xk ) − F (xk−1 )| = |f (ξk )|(xk − xk−1 ) ≥ mk (xk − xk−1 )
and consequently
n
n
VZ (F ) = |F (xk ) − F (xk−1 )| ≥ mk (xk − xk−1 ).
k=1 k=1
where the last equality follows from Theorem 25.24 when observing that mk = f (ηk )
for some ηk ∈ [xk−1 , xk ].
729
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 750
A COURSE IN ANALYSIS
and therefore
m
m
|f (bj ) − f (aj )| ≤ κ(bj − aj ) < .
j=1 j=1
Thus with ||f ||∞ ≤ M, ||g||∞ ≤ M , given >0, choose δ > 0 such that for
(aj , bj ), j = 1, . . . , m and (aj , bj ) ⊂ [a, b], from m j=1 bj − aj < δ it follows that
m
m
j=1 |f (b j ) − f (a j )| < M and j=1 |g(b j ) − g(a j < M . This implies
)|
m
|(f · g)(bj ) − (f · g)(aj )| < + = .
j=1
M M
730
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 751
x
7. For f ∈ C([a, b]) ∪ BV ([a, b]) we define F by F (x) := a f (t)dt. Let x, y ∈ [a, b], it
follows that
y
y
|F (y) − F (x)| = f (t)dt ≤ |f (t)|dt ≤ M |x − y|
x x
for M = ||f ||∞ < ∞, note that continuous functions on a compact interval are
bounded as are functions of bounded variation. Thus F : [a, b] → R is Lipschitz
continuous and therefore absolutely continuous.
731
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 753
References
[1] Beals, R., and Wong, R., Special Functions. A Graduate Text. Cambridge Studies in
Advanced Mathematics, Vol. 126. Cambridge University Press, 2010.
[2] Dieudonné, J., Grundzüge der modernen Analysis, 2. Aufl. Logik und Grundlagen
der Mathematik Bd. 8. Friedrich Vieweg & Sohn, Braunschweig 1972.
[3] Endl. K., und Luh, W., Analysis I, 3. Aufl. Akademische Verlagsgesellschaft, Wies-
baden 1975.
[4] Garling, D.J.H., A Course in Mathematical Analysis, Vol. I. Foundations and Ele-
mentary Real Analysis. Cambridge University Press, Cambridge 2013.
[5] Heuser, H., Lehrbuch der Analysis. Teil 1. B.G. Teubner Verlag, Stuttgart 1980.
[6] Kaczor, W.J., and Nowak, M.T., Problems in Mathematical Analysis II. Students
Mathematical Library, Vol. 12. American Mathematical Society, Providence R.I.,
2001.
[7] Landau, E., Grundlagen der Analysis. Akademische Verlagsgesellschaft, Leipzig 1930.
[8] Lin, M., The AM-GM inequality and the CBS inequality are equivalent.The Mathe-
matical Intelligencer 34. 2 (2012), 6.
[9] Maligranda, L., The AM-GM inequality is equivalent to the Bernoulli inequality. The
Mathematical Intelligencer 34. 1 (2012), 1-2.
[11] Rudin, W., Principles of Mathematical Analysis, 3rd ed. McGraw-Hill International
Editions, Mathematical Series. McGraw-Hill Book Company, Singapore 1976.
[12] Schilling, R.L., Measures, Integration and Martingales. Cambridge University Press,
Cambridge 2005.
733
This page intentionally left blank
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 755
735
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 756
A COURSE IN ANALYSIS
Wallis, John(1616-1703).
Weierstrass, Karl Theodor Wilhelm (1815-1897).
736
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 757
Subject Index
737
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 758
A COURSE IN ANALYSIS
738
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 759
SUBJECT INDEX
739
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 760
A COURSE IN ANALYSIS
740
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 761
SUBJECT INDEX
741
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 762
A COURSE IN ANALYSIS
742
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 763
SUBJECT INDEX
743
July 21, 2015 11:31 World Scientific Book - 9.75in x 6.5in reduction˙9625 page 764
A COURSE IN ANALYSIS
tangent, 296
tangent function, 148
target set, 55, 66
Taylor expansion, 420
Taylor polynomials, 418
Taylor series, 420
Taylor’s formula, 417
telescopic series, 231
tertium non datur, 477
total variation, 466
744