Linear Algebra Via Complex Analysis: Alexander P. Campbell and Daniel Daners Revised Version September 13, 2012

Linear Algebra via Complex Analysis
Alexander P. Campbell and Daniel Daners

Revised Version September 13, 2012
Abstract
The resolvent (I A)
1
of a matrix A is naturally an analytic
function of C, and the eigenvalues are isolated singularities. We
compute the Laurent expansion of the resolvent about the eigenvalues
of A. Using the Laurent expansion, we prove the Jordan decomposi-
tion theorem, prove the Cayley-Hamilton theorem, and determine the
minimal polynomial of A. The proofs do not make use of determi-
nants, and many results naturally generalise to operators on Banach
spaces.
1 Introduction
The Jordan decomposition theorem for square matrices with coecients in
C is most commonly proved by means of algebraic methods. Every good
theorem has several proofs, which give dierent insights and generalise into
dierent directions. The aim of this exposition is to present an approach
using complex analysis. We derive the Jordan decomposition theorem, the
Cayley-Hamilton theorem, and the minimal polynomial from the Laurent
expansions about the eigenvalues of the matrix.
The approach is known to experts in operator theory and functional cal-
culus and is outlined in [5, Section I.5]. It shows unexpected connections be-
tween topics usually treated separately in undergraduate mathematics. We
rely on elementary properties of vector spaces and basic theorems of complex
analysis such as the Cauchy integral formula and Laurent expansions. These
theorems are valid for vector valued functions ; see [4, Sections 3.10 & 3.11].
They can also be applied entry by entry in a matrix or vector.
Let V be a nite dimensional normed vector space over C and let A: V
V be a linear operator. In the simplest case, we have V = C
n
with the Eu-
clidean norm, and A is a nn matrix with entries in C. We rst demonstrate
1
why it is natural to analyse the structure of the resolvent
(I A)
1
using complex analysis. In one dimension, A = a is a complex number, I = 1,
and the resolvent corresponds to (a)
1
. Expanding by a geometric series
about
0
,= a we get
( a)
1
=
1
a
=
1
(
0
) + (
0
a)
=
(
0
a)
1
1 + (
0
)(
0
a)
1
=
k=0
(1)
k
(
0
a)
(k+1)
(
0
)
k
(1.1)
if [(
0
a)
1
(
0
)[ < 1. For a linear operator A, there might be several
points for which (I A)
1
does not exist. We dene the resolvent set of A
by
(A) := C: I A is invertible (1.2)
and the spectrum of A by
(A) := C (A). (1.3)
For a matrix A, (A) is the set of eigenvalues because the rank-nullity the-
orem implies that ker(I A) = 0 if and only if I A is invertible.
Replacing a by I A in (1.1) and absolute value by operator norm (see
(2.1)) we might expect for
0
(A) that
(I A)
1
=
k=0
(1)
k
(
0
I A)
(k+1)
(
0
)
k
(1.4)
if
[
0
[ <
1
|(
0
I A)
1
|
. (1.5)
That is, if
0
(A), then (IA)
1
can be expanded in a power series about
0
with radius of convergence at least 1/|(
0
I A)
1
|. Hence (I A)
1
is
an analytic (holomorphic) function of (A), with Taylor series (1.4) at
0
(A), and (A) is open. In Section 2 we make these arguments rigorous.
A matrix has only nitely many eigenvalues, so they are isolated singu-
larities of the resolvent. Hence, it is natural to use Laurent expansions about
the eigenvalues to analyse the structure of the resolvent. If
1
, . . . ,
q
are the
distinct eigenvalues of A, then the expansion turns out to be
(I A)
1
=
m
j
1
k=1
N
k
j
(
j
)
k+1
+
P
j

j
+
k=0
(1)
k
B
k+1
j
(
j
)
k
, (1.6)
2
where P
j
is the projection parallel to W
i
:= ker(P
j
) onto the generalised
eigenspace associated with
j
, m
j
= dim(im(P
j
)) and N
j
is nilpotent with
im(N
j
) im(P
j
). Moreover,
j
(A[
W
j
) and B
j
= (
j
I A[
W
j
)
1
, which
is consistent with (1.4). Note that P
j
is the residue of (I A)
1
at
j
, so
P
j
=
1
2i
_
C
j
(I A)
1
d, (1.7)
where C
j
is a positively oriented circle about
j
, not containing any other
eigenvalue of A. Moreover,
N
j
=
1
2i
_
C
j
(I A)
1

j
d. (1.8)
The Laurent expansion (1.6) is the core of our exposition and is discussed in
Section 3.
In Section 4 we prove the Jordan decomposition A = D + N, where
D =
1
P
1
+ +
q
P
q
is diagonalisable, N := N
1
+ +N
q
is nilpotent, and
DN = ND. By (1.6) the eigenvalues
j
are poles of (I A)
1
of order
n
j
:= mink 1: N
k
j
= 0 m
j
. (1.9)
Hence A is diagonalisable if and only if all of its eigenvalues are simple poles.
The order of
j
as a pole of the resolvent is therefore a measure for how
far an operator is from being diagonalisable. We show in Section 5 that
p() =
k
j=1
(
j
)
n
j
is the minimal polynomial of A, and we prove the
Cayley-Hamilton theorem.
2 The resolvent as an analytic map
Let V be a nite dimensional normed vector space over C and let A: V V
be a linear operator. To deal with convergent series such as (1.4), we need
a metric or norm on the space of linear operators. We dene the operator
norm by
|A| := sup
x1
|Ax|. (2.1)
This number is nite for every linear operator on nite dimensional spaces.
Note that every nite dimensional space has a norm induced by the Euclidean
norm on C
n
and some isomorphism from V to C
n
. As all norms on nite
dimensional vector spaces are equivalent, it does not matter which one we
use; see [8, Sections II.13].
The expansion (1.4) was motivated by a geometric series. The counterpart
of the geometric series in operator theory is the Neumann series.
3
Proposition 2.1 (Neumann Series). Let B: V V be a linear operator
and let
r := limsup
n
|B
n
|
1/n
.
Then
k=0
B
k
converges if r < 1 and diverges if r > 1. Moreover, r |B|.
If the series converges, then (I B)
1
exists and
(I B)
1
=
k=0
B
k
. (2.2)
Proof. The root test for the absolute convergence of series implies that
n
k=0
B
k
converges if r < 1 and diverges if r > 1; see [1, Theorem 8.5]. The partial
sum
n
k=0
B
k
satises the identity
(I B)
n
k=0
B
k
=
_
n
k=0
B
k
_
(I B) =
n
k=0
B
k
k=0
B
k+1
= I B
n+1
. (2.3)
If
k=0
B
k
converges, then B
n+1
0, and letting n in (2.3)
(I B)
k=0
B
k
=
_

k=0
B
k
_
(I B) = I.
To pass to the limit in (2.3), we use the continuity of multiplication (compo-
sition) of linear operators on V . Hence I B is invertible and (2.2) holds.
Since |B
n
| |B|
n
, we have r |B|. Hence (2.2) holds if |B| < 1.
We can now justify the power series expansion (1.4).
Theorem 2.2 (analyticity of resolvent). The resolvent set (A) is open and
the map (I A)
1
is analytic on (A). If
0
(A), then the power
series expansion (1.4) is valid whenever satises (1.5).
Proof. We use a calculation similar to (1.1) with a replaced by A. The
diculty is that we need to show that (I A) is invertible for close to
0
, so we cannot start with (I A)
1
. In the spirit of (1.1) we write
I A = (
0
I A) +(
0
)I =
_
I +(
0
)(
0
I A)
1
_
(
0
I A) (2.4)
and then show that we can invert. The rst term in (2.4) is of the form I B
with B := (
0
)(
0
I A)
1
. By Proposition 2.1, I B is invertible if
|(
0
)(
0
I A)
1
| = [
0
[|(
0
I A)
1
| < 1,
4
which is equivalent to (1.5). Hence if satises (1.5), then by Proposition 2.1
_
I + (
0
)(
0
I A)
1
_
1
=
k=0
(1)
k
(
0
I A)
k
(
0
)
k
.
We can therefore invert (2.4) to get (1.4).
We next prove that (A) ,= by giving an operator theory version of a
simple proof of the fundamental theorem of algebra from [7]. The proof relies
only on the Cauchy integral formula and a decay estimate for (I A)
1
.
Having proved that (A) ,= , it makes sense to dene the spectral radius
spr(A) := sup[[ : (A)
of A.
Theorem 2.3. If A is a linear operator on a nite dimensional vector space
over C, then (A) ,= . Moreover, spr(A) = limsup
n
|A
n
|
1/n
and for
[[ > spr(A) we have the Laurent series expansion
(I A)
1
=
k=0
A
k
k+1
. (2.5)
Proof. Let r := limsup
n
|A
n
|
1/n
and note that
limsup
n
_
_
_
A
n
n
_
_
_
1/n
=
1
[[
limsup
n
|A
n
|
1/n
=
r
[[
.
By Proposition 2.1, the series
k=0
A
k
/
k
converges if [[ > r and diverges
if [[ < r. Moreover, (2.5) holds for [[ > r because
(I A)
1
=
1
_
I
1
A
_
1
=
1
k=0
A
k
k
=
k=0
A
k
k+1
.
Hence (A) if [[ > r and (2.5) is the Laurent expansion of (I A)
1
about zero in that region. Because the Laurent expansion is valid in the
largest annulus about zero in (A), either (A) = or there exists
0
(A)
with [
0
[ = r. Hence r = spr(A) if (A) ,= .
It remains to prove that (A) ,= . As r |A| we get from (2.5) that
|(I A)
1
|
1
[[
k=0
|A|
k
[[
k
=
1
[[
1
1 |A|[[
1
=
1
[[ |A|
(2.6)
5
for C with [[ > |A|. Suppose that (A) = C. As (I A)
1
is
analytic on C, the Cauchy integral formula yields
A
1
=
1
2i
_
||=R
(I A)
1
d
for all R > 0. Using the decay estimate (2.6), we obtain
|A
1
|
2R
2
|(I A)
1
|
R

1
R |A|
for all R > |A|. Letting R , we see that |A
1
| = 0 which is impossible.
Hence (A) ,= as claimed.
3 The Laurent expansion about an eigenvalue
We have established that the resolvent is an analytic function on (A) and
know that the eigenvalues are isolated singularities of the resolvent. The
centerpiece of our exposition is the Laurent expansion of (I A)
1
about
an eigenvalue
0
(A).
Theorem 3.1. Let
0
(A). Then there exist operators P
0
, N
0
and B
0
so
that for in a neighbourhood of
0
(I A)
1
=
k=1
N
k
0
(
0
)
k+1
+
P
0

0
+
k=0
(1)
k
B
k+1
0
(
0
)
k
. (3.1)
Moreover, the operators P
0
, N
0
and B
0
have the following properties:
(i) P
2
0
= P
0
, that is, P
0
is a projection;
(ii) N
0
P
0
= P
0
N
0
= N
0
;
(iii) B
0
P
0
= P
0
B
0
= 0;
(iv) spr(N
0
) = 0;
(v) AP
0
= P
0
A =
0
P
0
+ N
0
;
(vi) (
0
I A)B
0
= B
0
(
0
I A) = I P
0
.
We defer the proof of the theorem to Section 6 and now discuss some
consequences. We show that N
0
is nilpotent and deduce that every eigenvalue
of A is a pole of the resolvent.
6
Remark 3.2. By (ii) im(N
0
) im(P
0
), so there exists n
0
m
0
:= dim(im(P
0
))
dimV < so that
ker(N
0
) ker(N
0
) ker(N
2
0
) ker(N
n
0
0
) = ker(N
n
0
+1
0
) = . . . (3.2)
with proper inclusions up to the n
0
-th power, and then equality; see [2,
Prop. 8.5 & 8.6]. If we show that N
n
0
0
= 0, then N
0
is nilpotent, N
n
0
1
0
,= 0,
and n
0
is the order of
0
as a pole of (I A)
1
. Then N
0
: im(N
n
0
0
)
im(N
n
0
0
) is invertible. We know from (iv) that zero is the only eigenvalue of
N
0
. Hence im(N
n
0
0
) = 0 as claimed, as otherwise N
0
restricted to im(N
n
0
0
)
has a non-zero eigenvalue by Theorem 2.3. Further note that (3.2) also
implies that P
0
, N
0
, N
2
0
, . . . , N
n
0
1
0
is linearly independent.
Next we discuss the structure of the regular and singular parts of the
Laurent expansion.
Remark 3.3. Since P
0
is a projection we have the direct sum decomposition
V = im(P
0
) ker(P
0
).
Choose bases of im(P
0
) and ker(P
0
) to form a basis of V . With respect to
that basis, P
0
can be written as a block matrix
P
0
=
_
I 0
0 0
_
.
Similarly, with respect to the basis introduced, (ii) and (v) of the theorem
imply that N
0
and A are block matrices of the form
N
0
=
_
N 0
0 0
_
and A =
_
0
I + N 0
0 A
0
_
.
In particular, A[
ker(P
0
)
= A
0
and (vi) shows that
0
(A
0
) with B
0
[
ker(P
0
)
=
(
0
I A
0
)
1
. Moreover, by (iii), B
0
[
im(P
0
)
= 0, so B
0
is of the form
B
0
=
_
0 0
0 (
0
I A
0
)
1
_
.
In particular, the regular part of the Laurent expansion (3.4) is consistent
with (1.4), and coincides with the power series expansion of the resolvent
(I A
0
)
1
about
0
. Further note that the singular part of the Laurent
expansion is trivial on ker(P
0
) and the regular part is trivial on im(P
0
), so
the singular and regular parts live on complementary subspaces.
The above remark proves the following corollary.
7
Corollary 3.4. Let
0
be an eigenvalue of A and let P
0
, N
0
and let B
0
be
as in Theorem 3.1. If m
0
:= dim(im(P
0
), then
P
0
(I A)
1
= (I A)
1
P
0
=
P
0

0
+
m
0
k=1
N
k
0
(
0
)
k+1
(3.3)
is the singular part of the Laurent expansion (3.1), and is valid for all
C
0
. Moreover,
(I P
0
)(I A)
1
= (I A)
1
(I P
0
) =
k=0
(1)
k
B
k+1
0
(
0
)
k
(3.4)
is the regular part of the Laurent expansion (3.1) and valid for in a neigh-
bourhood of
0
. Moreover,
(i) V = im(P
0
) ker(P
0
) is a direct sum;
(ii)
0
is the only eigenvalue of A: im(P
0
) im(P
0
);
(iii)
0
I A: ker(P
0
) ker(P
0
) is invertible.
In Theorem 3.1(v) we already see how the Jordan decomposition arises
from the Laurent expansion about
0
since
0
P
0
is diagonalisable on im(P
0
)
and N
0
is nilpotent. The following proposition is useful to prove uniqueness
of the Jordan decomposition.
Proposition 3.5. Suppose that A = D + N, where D and N are such that
DN = ND and spr(N) = 0. Then (A) = (D) and
(I A)
1
=
k=0
N
k
(I D)
(k+1)
(3.5)
uniformly with respect to in compact subsets of (A). If
0
(A), then
P
0
=
1
2i
_
C
r
(I A)
1
d =
1
2i
_
C
r
(I D)
1
d, (3.6)
where C
r
is a circle centred at
0
not containing any other eigenvalues of A.
Proof. If (D), then
I A = I D N = (I D)
_
I (I D)
1
N
_
. (3.7)
8
By assumption (I D)N = N(I D). Applying (I D)
1
from the left
and from the right we get N(I D)
1
= (I D)
1
N, and so
_
N(I D)
1
_
n
= N
n
(I D)
n
for all n N. Therefore, if K (D) is compact, then there exists M > 0
such that for all n N and K
_
_
_
N(I D)
1
_
n
_
_
1/n
|N
n
|
1/n
|(I D)
1
| M|N
n
|
1/n
.
As spr(N) = 0, for every > 0 there exists n
0
N such that, if n > n
0
and
K, then
_
_
_
N(I D)
1
_
n
_
_
1/n
M|N
n
|
1/n
< . (3.8)
In particular spr
_
N(I D)
1
_
= 0. By Proposition 2.1, we can invert (3.7)
to get (3.5) and (A). The convergence is uniform on K because of (3.8).
If (A), then D = A N has the same structure with AN = NA, so
we can interchange the roles of D and A and conclude that (D). This
proves that (A) = (D).
If
0
(A), then by the uniform convergence of (3.5) on the compact
set C
r
we have that
P
0
=
1
2i
_
C
r
(I A)
1
d =
1
2i
k=0
N
k
_
C
r
(I D)
(k+1)
d. (3.9)
From the Taylor series expansion (1.4)
d
k
d
k
(I D)
1
= k!(I D)
(k+1)
,
so (I D)
(k+1)
has primitive (I D)
k
on (A) for all k 1. Hence,
all integrals in (3.9) vanish except for the rst one, and (3.9) reduces to
(3.6).
4 The Jordan decomposition theorem
In the previous section we looked at the Laurent expansion about a single
eigenvalue of A. Here we look at the expansions about all distinct eigenvalues
1
, . . . ,
q
of A and use them to derive the Jordan decomposition theorem.
For j = 1, . . . , q we look at the projections P
j
given by (1.7). We choose C
j
to
be mutually disjoint positively oriented circles centred at
j
, not containing
any other eigenvalues.
9
Proposition 4.1. For j = 1, . . . , q let P
j
be the projection dened by (1.7).
Then
I = P
1
+ + P
q
=
1
2i
_
C
R
(I A)
1
d, (4.1)
where C
R
is a circle of radius R > spr(A) centred at zero. Moreover,
V = im(P
j
) im(P
2
) im(P
q
), (4.2)
and this direct sum completely reduces A. Finally, for j = 1, . . . , q,
im(P
j
) = ker(
j
I A)
m
j
, (4.3)
where m
j
= dim(im(P
j
)).
Proof. As R > spr(A) the circle C
R
encloses all eigenvalues. By the residue
theorem and the Laurent expansion (2.5), we get
q
j=1
P
j
=
q
j=1
1
2i
_
C
j
(I A)
1
d =
1
2i
_
C
R
(I A)
1
d
=
1
2i
k=0
_
C
R
A
k
k+1
d =
I
2i
_
C
R
1
d = I
as all terms in the series are zero except the one with k = 0. We next show
that P
j
is a projection parallel to P
k
if k ,= j. We have
(2i)
2
P
j
P
k
=
_
C
j
(I A)
1
d
_
C
k
(I A)
1
d
=
_
C
j
_
C
k
(I A)
1
(I A)
1
dd.
Using the resolvent identity from Proposition 6.1(ii) below we get
(2i)
2
P
j
P
k
=
_
C
j
_
C
k
(I A)
1
(I A)
1

dd
=
_
C
j
(I A)
1
_
C
k
1

dd
_
C
k
(I A)
1
_
C
j
1

dd = 0,
since the circle C
j
is outside C
k
and vice versa. This completes the proof of
(4.2). The fact that the direct sum reduces A follows from Corollary 3.4.
To prove (4.3) note that Theorem 3.1(v) implies that (A
j
I)
m
j
P
j
=
N
m
j
j
= 0 and so im(P
j
) ker(
j
I A)
m
j
. By Corollary 3.4 (A
j
I)
m
j
(I
P
j
) is injective on ker(P
j
), so ker(
j
I A)
m
j
im(P
j
), proving (4.3).
10
From Corollary 3.4 we know that A: im(P
j
) im(P
j
) has
j
as its only
eigenvalue. This motivates the following denition.
Denition 4.2. We call im(P
j
) the generalised eigenspace associated with
the eigenvalue
j
and m
j
= dim(im(P
j
)) the algebraic multiplicity of
j
.
The identity (4.3) ensures that Denition 4.2 agrees with the usual deni-
tion of the generalised eigenspace. We now derive a formula for the resolvent
in terms of N
j
and P
j
similar to a partial fraction decomposition of a rational
function.
Theorem 4.3. For every (A) we have the representation
(I A)
1
=
q
j=1
_
P
j

j
+
m
j
1
k=1
N
k
j
(
j
)
k+1
_
.
Proof. Using Corollary 3.4 and Proposition 4.1, for every (A)
(I A)
1
= (I A)
1
q
j=1
P
j
=
q
j=1
(I A)
1
P
j
=
q
j=1
_
P
j

j
+
m
j
1
k=1
N
k
j
(
j
)
k+1
_
as claimed.
An operator D is called diagonalisable if for some direct sum decompo-
sition D acts by scalar multiplication on each subspace. These scalars are
the eigenvalues of D. We are now ready to prove the Jordan decomposition
theorem.
Theorem 4.4 (Jordan decomposition). Let V be a nite dimensional vector
space over C and A: V V a linear operator. Then there exists a diago-
nalisable operator D and a nilpotent operator N such that A = D + N and
DN = ND. If
1
, . . . ,
q
are the distinct eigenvalues of A, then
D =
q
j=1
j
P
j
and N =
q
j=1
N
j
, (4.4)
where P
j
and N
j
are as dened before. In particular D and N are uniquely
determined by A. Finally, A is diagonalisable if and only if all eigenvalues
of A are simple poles of (I A)
1
.
11
Proof. By Theorem 3.1(v) AP
j
=
j
P
j
+ N
j
for j = 1, . . . , q and therefore
by Proposition 4.1
A = A
q
j=1
P
j
=
q
j=1
AP
j
=
q
j=1
(
j
P
j
+ N
j
) =
q
j=1
j
P
j
+
q
j=1
N
j
.
Hence if we dene D and N as in (4.4), then A = D +N. It is clear that D
is diagonalisable. By Theorem 3.1(ii) and Proposition 4.1 it follows that
P
k
N
j
= P
k
P
j
N
j
=
kj
N
j
= N
j
P
k
,
so the direct sum (4.2) reduces N. In particular, N is nilpotent since each
N
j
is nilpotent and also DN = ND since this is the case on im(P
j
).
To show the uniqueness of the decomposition, assume that A =

D +
N with

D diagonalisable,

N nilpotent and

D

N =

N

D. Proposition 3.5
implies that (

D) = (A) and that the spectral projections are equal. Hence
1
, . . . ,
q
are the eigenvalues of

D. As

D is diagonalisable,

DP
j
=
j
P
j
and
NP
j
= AP
j

j
P
j
= N
j
for j = 1, . . . , q. Hence

D = D and

N = N as
claimed.
The last assertion of the theorem follows since N = 0 if and only if
N
j
= 0 in the Laurent expansion (1.6) for all j = 1, . . . , q, which means that
all eigenvalues are simple poles.
To obtain the Jordan canonical form for matrices, it is sucient to con-
struct a basis of im(P
j
) such that the matrix representation of AP
j
consists
of Jordan blocks; see e.g. [2, Theorem 8.47]. For many purposes the full
Jordan canonical form is not needed as examples in [6] show.
5 The Cayley-Hamilton theorem and the min-
imal polynomial
If p() = a
n
n
+ a
n1
n1
+ + a
0
is a polynomial, we dene
p(A) = a
n
A
n
+ a
n1
A
n1
+ + a
1
A + a
0
I.
The Cayley-Hamilton theorem asserts that p
A
(A) = 0 if p
A
() := det(I A)
is the characteristic polynomial of A. We start by nding a representation
of p(A) reminiscent of the Cauchy integral formula.
Lemma 5.1. If p() = a
n
n
+ a
n1
n1
+ + a
0
is a polynomial, then
p(A) =
1
2i
_
C
R
p()(I A)
1
d,
12
where C
R
is a positively oriented circle centred at zero with radius R >
spr(A).
Proof. By the linearity of integrals, it is sucient to consider p() =
k
.
Using (4.1), we get from the Cauchy integral theorem that
2iA
k
=
_
C
R
A
k
(I A)
1
d =
_
C
R
_
I (I A)
_
k
(I A)
1
d
=
k
=0
(1)
_
k
__
C
R
k
(I A)
1
d =
_
C
R
k
(I A)
1
d
as required.
Theorem 4.3 allows us to derive a formula for p(A).
Theorem 5.2. If p is a polynomial and p
(k)
its k-th derivative, then
p(A) =
q
j=1
_
p(
j
)P
j
+
m
j
1
k=1
p
(k)
(
j
)
k!
N
k
j
_
. (5.1)
Proof. From Lemma 5.1 and Theorem 4.3
p(A) =
1
2i
_
C
R
p()(I A)
1
d =
1
2i
q
j=1
_
C
j
p()(I A)
1
d
=
1
2i
q
j=1
_
_
C
j
p()

j
dP
j
+
m
j
1
k=1
_
C
j
p()
(
j
)
k+1
dN
k
j
_
. (5.2)
By the Cauchy integral formula
p(
j
) =
1
2i
_
C
j
p()

j
d
and therefore
p
(k)
(
j
) =
k!
2i
_
C
j
p()
(
j
)
k+1
d.
Substitution into (5.2) yields (5.1).
We are now ready to prove the Cayley-Hamilton theorem.
Corollary 5.3 (Cayley-Hamilton). If p
A
is the characteristic polynomial of
A, then p
A
(A) = 0.
13
Proof. The characteristic polynomial is given by p
A
() =
q
j=1
(
j
)
m
j
,
where m
j
is the algebraic multiplicity of the eigenvalue
j
. Hence p
(k)
A
(
j
) = 0
for 0 k m
j
1 and the representation (5.1) ensures that p
A
(A) = 0.
The monic polynomial p of smallest degree such that p(A) = 0 is called
the minimal polynomial of A. According to Theorem 5.2 it is the polynomial
p of smallest degree with
p(A) =
q
j=1
_
p(
j
)P
j
+
n
j
1
k=1
p
(k)
(
j
)
k!
N
k
j
_
= 0, (5.3)
where n
j
is the order of
j
as a pole of the resolvent given by (1.9). By
Remark 3.2 and Proposition 4.1, the set of operators
P
j
: 1 j q N
k
j
: 1 k n
j
, 1 j q
is linearly independent, so (5.3) holds if and only if p
(k)
(
j
) = 0 for all
j = 1, . . . , q and 0 k n
j
1. This proves the following theorem.
Theorem 5.4 (minimal polynomial). Let A be a matrix over C with distinct
eigenvalues
1
, . . .
q
. Then the minimal polynomial of A is given by
p() =
q
j=1
(
j
)
n
j
,
where n
j
is the order of
j
as a pole of (I A)
1
.
The preceding theorem shows that the minimal polynomial determines
the order of the poles of the resolvent and vice versa.
6 Computation of the Laurent expansion
In this section we prove Theorem 3.1 on the Laurent expansion of the resol-
vent about
0
(A). The arguments in this section do not use that
0
is an
eigenvalue, nor that dim(V ) < . We rst need some elementary properties
of the resolvent.
Proposition 6.1. If (I A)
1
and (I A)
1
exist, then
(i) A(I A)
1
= (I A)
1
A = (I A)
1
I;
(ii) (I A)
1
(I A)
1
= ( )(I A)
1
(I A)
1
;
14
(iii) (I A)
1
(I A)
1
= (I A)
1
(I A)
1
.
Proof. For (i) we write
A(I A)
1
=
_
I (I A)
_
(I A)
1
= (I A)
1
I
and similarly
(I A)
1
A = (I A)
1
_
I (I A)
_
= (I A)
1
I.
For (ii) we note that
(I A)
_
(I A)
1
(I A)
1
_
(I A)
= (I A) (I A) = ( )I.
Applying (I A)
1
from the left and (I A)
1
from the right, we get (ii).
Finally, (iii) follows from (ii) by interchanging the roles of and .
Property (ii) is often referred to as the resolvent identity. It corresponds
to the partial fraction decomposition

( a)( a)
=
1
a

1
a
.
The Laurent series about
0
representing (I A)
1
is
(I A)
1
=
n=
B
n
(
0
)
n
, (6.1)
where
B
n
=
1
2i
_
C
r
(I A)
1
(
0
)
n+1
d (6.2)
and C
r
is a positively oriented circle of radius r centred at
0
, not enclosing
any other eigenvalue of A; see [3, Theorem 8.3.1] or [4, Section 3.11]. We
next prove some recursion relations between the B
n
. The aim is to be able
to express all B
n
in terms of B
2
, B
1
and B
0
.
Lemma 6.2. The coecients B
n
in (6.1) satisfy the relation
B
m
B
n
= B
n
B
m
=
_
_
B
n+m+1
if n, m 0,
B
n+m+1
if n, m < 0,
0 otherwise.
(6.3)
Moreover,
AB
n
= B
n
A =
_
B
n1
+
0
B
n
if n ,= 0,
B
n1
+
0
B
n
I if n = 0.
(6.4)
15
0
r
s
C
s
C
r
Figure 6.1: The circles C
r
and C
s
about 0
Proof. By replacing A by
0
I A, we may assume that
0
= 0. Let C
r
and
C
s
be circles of radius 0 < r < s, both centred at zero such that C
s
does not
enclose any other eigenvalue of A as shown in Figure 6.1. Then
(2i)
2
B
n
B
m
=
__
C
r
(I A)
1
n+1
d
___
C
s
(I A)
1
m+1
d
_
=
_
C
r
_
C
s
(I A)
1
(I A)
1
n+1
m+1
dd.
In the spirit of a partial fraction decomposition, we use the resolvent identity
from Proposition 6.1(ii) to get
(2i)
2
B
n
B
m
=
_
C
r
_
C
s
(I A)
1
(I A)
1
( )
n+1
m+1
dd
=
_
C
s
(I A)
1
m+1
_
C
r
1
n+1
( )
dd
_
C
r
(I A)
1
n+1
_
C
s
1
m+1
( )
dd.
(6.5)
If n, m 0, then we use the partial fraction decompositions
1
n+1
( )
=

n+1
(
n+1
n+1
)
n+1
n+1
( )
=
1
n+1
( )

n
k=0
1
nk+1
1
k+1
(6.6)
and
1
m+1
( )
=
1
m+1
( )
+
m
k=0
1
mk+1
1
k+1
(6.7)
to evaluate the inner integrals. Note that C
s
is outside the circle C
r
.
Using (6.6) if n 0 and the Cauchy integral theorem if n < 0 we get
1
2i
_
C
r
1
n+1
( )
d =
_
_
_
n+1
if n 0,
0 if n < 0.
16
For the other integral note that if m 0, then both = 0 and = are
singularities enclosed by C
s
. Hence, using (6.7) and the residue theorem, we
obtain
1
2i
_
C
s
1
m+1
( )
d =
1
m+1

1
m+1
= 0.
If m < 0, then only = is a singularity, and by the Cauchy integral formula
1
2i
_
C
s
1
m+1
( )
d =
1
m+1
.
Hence if m, n 0, then the second of the inner integrals on the right hand
side of (6.5) is zero, and the other is 2i
(n+1)
. Therefore
B
n
B
m
=
1
2i
_
C
s
(I A)
1
m+n+2
d = B
n+m+1
.
If m, n < 0, then the rst of the inner integrals on the right hand side of
(6.5) is zero, and the other is 2i
(m+1)
and therefore
B
n
B
m
=
1
2i
_
C
r
(I A)
1
m+n+2
d = B
n+m+1
.
If n 0 and m < 0, then a similar argument shows that B
n
B
m
= B
n+m+1
B
m+n+1
= 0. In the remaining case both inner integrals in (6.5) are zero, so
B
n
B
m
= 0. From (6.5) it is also clear that B
n
B
m
= B
m
B
n
. To prove (6.4)
we use Proposition 6.1(i) to conclude that
AB
n
=
1
2i
_
C
r
A(I A)
1
n+1
d =
1
2i
_
C
r
(I A)
1
I
n+1
d
=
1
2i
_
C
r
(I A)
1
n
d
I
2i
_
C
r
1
n+1
d = B
n1
I
2i
_
C
r
1
n+1
d.
This completes the proof of the lemma since the last integral is zero if n ,= 0,
and is 2i if n = 0.
Proof of Theorem 3.1. To prove Theorem 3.1 we set
P
0
:= B
1
and N
0
:= B
2
.
First we deduce from (6.3) that
P
2
0
= B
1
B
1
= B
11+1
= B
1
= P
0
17
which proves (i). Similarly, applying (6.3) again we get (ii) since
P
0
N
0
= B
1
B
2
= B
12+1
= B
2
= N
0
and B
1
and B
2
commute. Similarly we get (iii) since by (6.3)
P
0
B
0
= B
1
B
0
= 0 = B
0
B
1
= B
0
P
0
.
We next use induction to show that B
n
= (1)
n
B
n+1
0
. This is obvious for
n = 0, so assume that n 1. Then by (6.3) and the induction assumption
B
n+1
= B
n+0+1
= B
n
B
0
= (1)
n
B
n+1
0
B
0
= (1)
n+1
B
(n+1)+1
0
as claimed. In a similar manner, we prove that B
n
= B
n2
2
= N
n2
0
for n
2. Again this is obvious for n = 2. By (6.3) and the induction assumption
B
(n+1)
= B
n2+1
= B
n
B
2
= B
n2
2
B
2
= B
(n+1)2
2
= N
(n+1)2
0
as claimed. Hence (3.1) follows. To prove (iv) note that from (6.2)
|N
n
0
| = |B
n+2
| r
n+2
sup
||=r
|(I A)
1
| Kr
n+2
for some constant K > 0 depending on r. The constant K is nite since the
circle [[ = r is compact and the resolvent is continuous. Hence
spr(N
0
) r lim
n
(Kr
2
)
1/n
= r. (6.8)
As we can choose r as small as we like, we conclude that spr(N
0
) = 0.
Finally, note that (v) and (vi) are special cases of (6.4) for n = 1 and
n = 0, respectively.
References
[1] H. Amann and J. Escher, Analysis I, Birkhauser Verlag, Basel, 2005.
[2] S. Axler, Linear Algebra Done Right, second edition, Springer-Verlag, New
York, 1997.
[3] E. Hille, Analytic Function Theory, Vol. 1, Ginn and Company, Boston, 1959.
[4] E. Hille and R. S. Phillips, Functional Analysis and Semi-Groups, revised edi-
tion. American Mathematical Society, Providence, R. I., 1957,
[5] T. Kato, Perturbation Theory for Linear Operators, second edition. Springer-
Verlag, Berlin, 1976.
18
[6] R. R. London and H. P. Rogosinski, Decomposition theory in the teaching of
elementary linear algebra, Amer. Math. Monthly 97 (1990) 478485, http:
//dx.doi.org/10.2307/2323830.
[7] A. R. Schep, A simple complex analysis and an advanced calculus proof of the
fundamental theorem of algebra, Amer. Math. Monthly 116 (2009) 6768.
[8] A. E. Taylor and D. C. Lay, Introduction to Functional Analysis, second edition.
John Wiley & Sons, New York-Chichester-Brisbane, 1980.
Alexander Campbell
School of Mathematics and Statistics, The University of Sydney, NSW 2006, Aus-
tralia
campbell@maths.usyd.edu.au
Daniel Daners
School of Mathematics and Statistics, The University of Sydney, NSW 2006, Aus-
tralia
daniel.daners@sydney.edu.au
19

Linear Algebra Via Complex Analysis: Alexander P. Campbell and Daniel Daners Revised Version September 13, 2012

Încărcat de

Informații document

Descriere originală:

Titlu original

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

Linear Algebra Via Complex Analysis: Alexander P. Campbell and Daniel Daners Revised Version September 13, 2012

Încărcat de

Drepturi de autor:

Formate disponibile

Linear Algebra via Complex Analysis

Alexander P. Campbell and Daniel Daners

S-ar putea să vă placă și