Documente Academic
Documente Profesional
Documente Cultură
' I
~ t a nordr University f StaGfo~d, Calif. 94305, u . s ~ . Centre dlAutomatique Ecole Nationale, SupCrieure des Mines , Paris, FRANCE
and
' -'
C
;-
.. ..
-
.*
Lectures delivered at CENTRO INTERNAZIONALE- MATEMATICO ESTIVO (C.I.M.E.) Seminar on Controllability and Observability, Fondazione Guglielmo Marconi, Pontecchio Marconi (Bologna, ITALY), from July 1 through July 9, 1968.
*
-
. --
-.
-.
. . --C1
. -
8
LL
'p.///yq?
(Nara(ca OR
OR i o NUMBER)
(CATEGORY)
(PAGES)
--
C o r s o t e n u t o a S a s s o M a r c o n i (Bologna) d a l
1 a1 9
Luglio
1968
b
3 .
'
TABLE OF CONTENTS
5 1 5
Standardization of definitions and "classical" results. 23 Definition of states via Nerode equivalence classes. Modules induced by linear input/output maps. Cyclicity and related questions. Transfer functions. Abstract construction of realizations. Construction of realizations. Theory of partial realizations. General theory of 05servability. Historical notes. References. 35 43 59
78
R . E. Kalman
INTRODUCTION The theory of controllability and observability has been developed, one might almost say reluctantly, in response to problems generated by technological science, especially in areas related to control, communication, and computers. It seems that the first
1 5 , by 99
KALMAN k960b-c1.
scattered results before this time (see Section 12 for some historical comments and references), and one h g h t confidently assert today that
some ofthe main results have been discovered, more or less independently, in every country which has reached an advanced stage of "development" and it is certain that these same results will be rediscovered again in still more places as other countries progress on the road to development.
..
I
this field, we ought not hesitate to make some guesses of the significance of what has been accomplished.
(i) The use of the concepts of controlla5ility and observability to study nonclassical questions in optimal control and optimal estimation theory, sometimes as basic hypotheses securing existence, more often as seemingly technical colditionswhich allow a sharper statement of results or shorter proofs. (ii) Interaction between the concepts of controllability and
R.E. Kalman
as: formulation and solution of the problem of realization,
canonical forms, decomposition of systems. The first of these topics is older and has been studied primarily from the point of view of analysis, although the basic lemma (2.7 ) is purely algebraic. The second group of topics may be viewed as "blowing up" the ideas inherent in the basic lemma (2.7 ), resulting in a more and more strictly algebraic point of view. There is active research in both areas.
R. E. Kalman
The mathe-
THEOREM A.
constant, n
i s completely controllable.
The c e n t r a l s p e c i a l case is t r e a t e d i n g r e a t d e t a i l by KAI;MAN, F U 3 , and ARBIB [1969, Chapter 2, Theorem 5.101; f o r a proof of t h e
O IA general case with background comments, r e f e r t o W NT M [ 19673.
As
IIRRIREM B .
lability in a real, (continuous-time, or discrete-time), finitedimensional, constant, linear dynamical system is equivalent to
is not yet known to be valid outside the linear area, but as an intuitive prescription it has been rather usef'ul in guiding systemtheoretic research. The problems involved here are those of fomulation rather than proof. The basic difficulties seem to point toward category theory. System-theoretic
duality, like the categoric one, is concerned with "revers'ing arrows". See Section 1 for a modern discussion of these points 0
and a precise version of Theorem B . Partly as a result of the questions raised by Theorem B and partly because of the algebraic techniques needed to prove Theorem
THEOREM C
R. E, Kalman
F%tm se
one part, that which is completely controllzble and completely observable, is involved in the input/output behavior of the system. The proof given by W M [1962] applies to nonconsta~tsystems only under the severe restriction that the dimensions of the subspace of all control.lable and all unobservable states is constant
on the whole real line. The result represented by Theorem C is far from
definitive, however, since finite-dimensional linear, 2cnconsta>t systems admit at least four differez: canonical decompositic.na: it is
possible and f u t ' l to dualize the notions of controllability rifu and observability, thereby arriving at -properties, present'ljr four called reachability and controllability as well as constructibility" and observability. (see Section 2 definitions.)
the first list with a property from the second list gives a canonical decomposition result analogous to meorem C. The complexity of
fd 1$ the situation was first revealed by \IEISS z i KAIWLN [ 9 ] ; this paper contributed to a revival of interest (with hopes of success)
'AWEISS [1969]uses "determinabilityl1instead of constructibility. The new terminology used in these lectures is not yet entirely standard.
R . E. Kalman
canonical structure theorkm, and in fact necessary to Mly clarify the phrase "involved in the input/output behavior of the system1: is the last basic result:
THEORE24 D .
impulse-response matrix W
dimensional, linear dynamical system, there exists a 3eal,'continuoustime, finite-dimensional, linear dynanical system (a) realizes W:
which
is equal to W;
sa'tisfying ( ) ; a
c)
fd ()
is rcompletely controllable and compbete1.yobservable.; is uniquely determined (modulo the choice of a basis
gar its state space) by reauirenlent t o g e t b r with ('b) or, independently, by ( ) together with a
(4
In short, for any W as described above, there is an "essentially uniq-iiel1 of the same "type1'which satisfies ( ) through (c) a
determined by (a)
+ (b) - (a) + or
(c)
R . E. Kalman
CQROmY
2.
The first general discussion of t h e s i t u a t i o n with a n equival e n t statement of Theorem D i s due t o WT [1963b, Theorems 7 ! and 8 . 3
h his
an e x p l i c i t statement of Corollaries 1 zr.d 2, although they a r e implied by t h e general algorithm given i n Section
7. An edited
These r e s u l t s are' of great importalrce i n 'engineering system theory since they r e l a t e methods based on t h e Laplace transform (using the t r a n s f e r fbnction of the systen) m d t h e time-damin n methods based on input/output data (the m t r i x W) t o %he s t a t e variable ( d y n m i c a l system) methods devzloped i n 1955-1960.' In
f a c t , by Corollary 1 it follows t h a t the t~:o methods ?met yield i d e n t i c a l r e s u l t s ; f o r instance, s t a r t i n g with a constant impulseresponse matrix W, propert~r(c) implies thaz t h e existence
of a s t a b l e control l a y i s always assured by v i r t u e of Theorem A. Thus it i s only a f t e r t h e development represented by Theorems A-D t h a t a rigorous j u s t i f i c a t i o n i s obtained f o r t h e i n t u i t i v e design methods used i n c o n t r o l engineering. A s with
he or em
C, c e r t a i n forinulationzl d i f f'iculties a r i s e
d y d c a l system".
w")
[1966],
and KAINAN; a precise formulation and modernized proof of Theorem D in the weighting pattern case was given recently by KALMAN, FAD, and ARBIB [1969, Chapter 1 , Section 13.] A completely general 0 discussion of what is meant by a "minimal realizationn of a nonconstant impulse-response matrix involves many technical complications due to the fact that such a minimal realization does not exist in the class of linear differential equations with "nice1' coefficient f'unctions. For the current status of this probleia, consult especially DESOER and V A W Y A [19671,SILVERMAIT and MEADOWS
[ 1 6 1 KUMAN, FAI;B, and ARBIB [ 1969, Chapter 1 , Section 131 and 99, 0
WEISS [ 1 6 1 99
From the standpoint of the present lectures, by far the most interesting consequence of Theorem D is its influence, via efforts to arrive at a definitive proof of Corollary 1 on the development , of the algebraic stream of system theory. The first proof of this
important result (in the special case of disklnct eigenvalues) is that of GILBERT [1963]. Immediately afterwards, a general proof This proof, strictly
R . E.Kalman
Using the classical theory of invariant factors, KALEJL9N [1965a] succeeded in showing that the solution of the minimal realization problem can be efTectively reduced to the classical invariantfactor algorithm.
since it strongly suggests the now standard module theoretic approach, but it does not lead to a simple proof of Corollary 1 and is not a practical method of computation.
The best known proof of Corollary 1 was obtained in 1 6 by 95 B. L Ho, with the aid of a remarkable algorithm, which is equally important ' .
from a theoretical and computational viewpoint. The early formulation of the algorithm was described by HO and MIXAN [ 9 6 , with 16] later refinements discussed in HO and KALMAN [lg6gl, KALhIAI?, FALB, and ARBIB [ 1969, Chapter 1 , Section 1 1 and K A W [ 1969~1. 0 1 Almost simultaneously with the work of B L Ho, the basic results . . were discovered independently also by YOULA and TISSI [1966]and by SILVERM!!q
and centers around.the theory of Hankel matrices; however, many ofthe results just referenced seem to be f'undamentally new. This field is currently in a very active stage of develogment. We shall discuss the essential ideas involved in Sections 8-9. Ma-ny other topics, especially Silvemanlsgeneralization of the algorithm to nonconstant systems unfortunately cannot be covered due to lack of time,
fl. E,Kalman
Acknowledgment
skies and hospitality of Italy; along with Bolognese food p1a.yed a subsidiary but vital part in the success of this important gathering of scientists.
R. E. Kalman
topological
(t'le r e a l s ) on a s e t
X,
where X
is
t h e positions and momenta of t h e planets regarded as point masSes and a r e completely determined by t h e laws of gravitation, i.e., they do
not contain any terms t o account f o r "external" forces t h a t may a c t on t h e system. Interesting as t h i s notation of a dynamical system may be (and i s l ) i n pure mathematics, it i s much too limited f o r the study of those dynamical systems which a r e of contemporary i n t e r e s t . There
a r e a t l e a s t three d i f f e r i n t way& i n which t h e c l a s s i c a l concept must be generalized: (i) The t i n e s e t of the system i s not necessarily r e s t r i c t e d
x E X
t h e "passage of time" but a l s o by inputs which a r e o r c6&3Te pulated t o bring about a desired type of behavior;
. J&
mani-
R. E. Kalman
(iii)
The generalization of t h e time s e t i s of minor i n t e r e s t t o us here. The notions of input and output, however, a r e exceedingly
f'undamental; i n fact, c o n t r o l l a b i l i t y i s r e l a t e d t o t h e input and observability t o the output. With respect t o dynamical systems i n
t h e c l a s s i c a l sense, neither c o n t r o l l a b i l i t y nor observability a r e meaningful concepts. A much more detailed discussion of dynamical systems i n t h e modern sense, together with r a t h e r detailed precise definitions, w i l l be found i n KALMAN, FALB, and ARBIB [1969, Chapter
1 1.
From here on, w w i l l use t h e term I'dynamical system1' exclusively e i n t h e modern sense (we have already done so i n t h e ~ n t r o d u c t i o n ) . The following symbols w i l l have a fixed meaning throughout the paper :
T
U
= =
= = =
time set, s e t of input values, s t a t e set, s e t of outpv-t values, input functions, t r a n s i t i o n map,
(1.1)
cp =
q = readout map.
The following assumptio~s i l l always apply (otherwise t h e s e t s w above a r e arbitrary) :
R . E. Kalman
T = an ordered subset of t h e r e a l s
52
=
, ? l -
+ U such t h a t
i s undefined outside some
JU
CT
dependent on w;
nJu,
t h e r e i s a function
w
on
JU and
later,
w i l l be equal t o
Z. = -
(ordered)
11
a b e l i a n group of integers;
U, X, Y, 52
w i l l be l i n e a r spaces;
unde-
s i d e a f i n i t e i n t e r v a l " w i l l mean t h e same as " f i n i t e sequences". The most general notion of a dyimnical system f o r our present needs i s given by t h e followirlg
(1.3)
DEFINITION.
i s a compo'site object
T, U, R, X, Y
c o n s j s t i n g of t h e maps (a -s above) :
cp:
defined on t h e s e t s
T X T X X X R - ,
X,
X,
q(t;
7,
: (t; 7,
w)
c 3
x, 0 )
undefined whenever t -
> T; Y:
( t , x) I-+ ~ ( tx. ) , s a t i s f y t ~ h e following asbumptions: t
7:
T X X -
The t r a n s i t i o n
&
R. E. Kalman
if -
= cul on
7, X,
W)
[T, t ] ,
then f o r a l l x, m').
s E [T, t ]
q(s;
q(s;
7,
The d e f i n i t i o n of a dynamical system on t h i s l e v e l of g e n e r a l i t y should be regarded only a s a scaffolding f o r t h e terminology; i n t e r e & i n g mathematics begins only a f t e r f u r t h e r hypotheses a r e made. instance, it i s usually necessary t o endow t h e s e t s For and
*4 -
T, U, R, X,
Y with a topology
and
9 be continuous.
(1.7)
MAEIIPLE.
T = 5 = reals, -
c o n s i s t only of t h e nowhere-
and
entirely;
f o r - t, all q(t;
T, X,
E T
and w r i t e it a s
W )
~ ' (- t T),
and
t h a t is, a f'unction of
alone.
Check (1.4-5);
in
and
x*(s+ t )
(xis)*t. be continuous.
(x, t ) w x - t
(1.8)
INTERPRETATION.
t h a t it axiomatizes t h e notion of s t a t e .
R. E Kalman
a r u l e f o r s t a t e t r a n s i t i o n s (the f'unction )
T)).The
map
cp i s verbalized
x
as follows:
time
T
''an input a,
i n state
at
produces t h e s t a t e
~ ( tT,; x, a )
a t time
d e f i n i t i o n of an input f'unction
convenience; by (1.6) only equivalence classes of inputs agreeing over [T, t ] enter i n t o the determination of
cp(t;
T,
X,
a).
at
system C.
11
definitions without detailed explanation. The following conventions w i l l remain i n force thkoughout t h e lectures whenever t h e ,-inear case i s discussed:
. .
S= 2
Y
a l l continuous functions
g -
Rm -
which A n i s h out-
K = fixed f i e l d (arbitrary),
R. E. Kalman
U =
Z + -
p ,
X = I , Y = K*, ?
il = a l l f'unctions
t h e i r arguments
(F(*), G ( * ) , H ( - ) )
F ):
0 )
I - {n 1 - ,
matrices over
g) g), -
G (
g -
-3
H .) : R ( -
-+
R). C
i n t h e folllowine,
where
t E= R,
x E
gn, ~ ( 5EFf, )-
an& -
y(t) E
$. -
i n t o a well-defined dynamical
system i n t h e sense of Definition (1.3)~ it i s necessary t o r e c a l l t h e basic f a c t s about f i n i t e systems of ordinary l i n e a r d i f f e r e n t i a l equations with continuous coefficients.
cP
(t,
2): .
R n
X R X
-+
n
5) -
t o be the family o f
matrix solutions of t h e l i n e a r d i f f e r e n t i a l
R . E.Kalman
e
ion
g. It i s called t h e
i s of c l a s s
c1
i n both arguments.
i s e x p l i c i t l y given by
r)x
It i s instructive t o v e r i f y t h a t
I n view of t h e c l a s s i c a l terminology "linear d i f f e r e n t i a l equat i o n s with constant coefficients1; we introduce t h e nonstandard (1.15)
\
DEFINITION.
is a triple
(F, G, H)
of
R. E. Kalman
n X'n, n Xm, p
where
or
-,H)
(.8 11)
Z
is equal to the.dimension of X z
,
as a vector space.
2 .
I n t h i s section, w s h a l l be mainly i n t e r e s t e d i n f i n i t e e dimensional l i n e a r dynamical systems, although t h e f i r s t two d e f i n i t i o n s w i l l be quite general. Let
There e x i s t s a s t a t e cp(t;
X3C, &)
9 and an input
=
&
such t h a t
T,
X3C
f o r a11 t,
X3C
and CM a s 0
have a d d i t i v e structure,
(2.1)
i s controllable i f f .
co
there exists a
t E T and an w
(both
and -t -
may depend
I n words:
*to 0 co.
an event i s c o a t r o l l a b l e i f f it can be t r a n s f e r r e
x)
to
( t , 0)
as the graph of a
[ T t 1. ,
--------------6The
t e c h n i c a l wor'd i f f means i f and only i f .
R.'E. Kalman
. Consider now a reflection of this graph about T This .
is an - sE T
(T, x ) )
(both s and
- -
may depend on
such that
We emphasize: controllability and reachability are entirely different concepts. A strzking example of this fact is encountered below in Proposition ( . 6 . 42) We shall now review briefly some well-known criteria for and relations'between reachability and control.lability in linear systems.
(2-3)
),
A
an event for -
(T,
x)
is
R, s < T, where -
for -
some -
t E R, t > -
T,
where -
The original proof of (b) is in KALMAN [1960b]; both cases are treated in detail in KALMAN, FALB, aad LWIB [ 1 6 , Chzpter 2 99 ,
R . E. Kalman
Section
21.
Note t h a t i f
G(*)
i s i d e n t i c a l l y zero on
G ( = ) i s identically
(-
w,
T)
we cannot have c o n t r o l l a b i l i t y .
w(r, t )
So we have
= t ( 2 ~
- t,
r).
(2.4)
PROPOSITION.
x)
i s reachable
an evsnt
(.) 25
THEORXM.
a state x
R -
... ) c $;
' ,
26
R. E. Kalman
d e f i n i t i o n of reachable s t a t e s of
COROmY.
i s "coardinate- free" :
s t a t e s of space
is
for
It
"every event
(7,
x), T = fixed,
x E XE
i s reachable",
or
event i n Z
i s reachable",
i s completely
..., P I G )
= n.
+-
Xz
- 3 is
R , E.Kalman
COROLLARY 2 .
(w. Hahn)
A constant system
C =
(F, G, -)
is completely reachable if and only if there is no nonzero eigenvector of F which is orthogonal to (every column vector of) G . Finally, let us note that, far from being a technical condition, (2-5) has a direct system-theoretic interpretation, as follows:
(.1 21)
of a real, continuousG, -)
time, n-dimensional, linear, constant dynamical system C = (I?, may be written as a direct sum
The aubsystem
a state x = (xl, x ) E 2
= 0.
Hence, by By construc-
R. E. K a l m a n
G
belongs t o X. 1
The F-invariance of
implies t h a t
Note t h a t
X2
i s not i n t r i n s i c a l l y defined
"(0, x2)
More precisely:
the
bear i n mind f o r the algebraic development which follows a f t e r t h i s section and a l s o i n t h e . d e f i n i t i o n of observability and c o n s t r u c t i b i l i t y beiow. chosen i n such a way t h a t I n general, the d i r e c t s m cannot be u F 2 = 0. 1
While condition (2.8) has been frequently used a s a technical requirement i n the solution of various optimal control problems i n t h e l a t e 1950's, it was only i n 1959-60 t h a t t h e r e l a t i o n between (2.8) and system t h e o r e t i c questions w a s c l a r i f i e d by KADKN [ l g a b - c ]
(see Section
it i s not, at l e a s t i n problems i n which control entby (2.12) control problems s t a t e d f o r respect t o the i n t r i n s i c subspace
because,
a r e n o n t r i v i a l only with
x 1
R, E.. Kalman
The hypothesis "constant" i s by no means e s s e n t i a l f o r Proposition (2.11), but w must forego f u r t h e r comments here. e
For l a t e r purposes, w s t a t e some f a c t s here f o r d i s c r e t e e t h e , constant l i n e a r systems analogous t o those already developed f o r t h e i r continuous-time counterparts. The proofs a r e s t r a i g h t -
.
PROPOSITION.
(2.14)
A state
x of a real, discrete-time,
G, -)
n-dimensional,
x span(G, FG,
..., F"-lG).
Thus such a system i s completely reachable i f and only i f (2.8) holds. PROPOSITION.
A state x
of t h e system C
described
x E span (F-'G,
P
..., F-"G) ,
$ X
where
-k F G
(2.18)
IX:
giy
gi
= column v e c t ~ r of
GI.
PROPOSITION.
f 0,
Note also that Propositions (2.11) and its proof continue to be correct, without any modification, when Itcontinuous-time" is replaced by "discrete-time". Now we turn to a discussion of observability. The original definition of observability by KALMCLNr [1960b, Definition ( . 3 ] 52) was concocted in such a way as to take advanThe conceptual problems surround-
ing duality are easy to handle in the linear case but are still by no means fuLly understood in the nonlinear case (see Section
1) 0.
here only the linear case and even then we shall use the-underlying idea of vector-space duality in a rather ad-hoc fashion. The reader wishing to do so can easily turn our rernarks into a strictly dual treatment of facts (2.1) -(2.12) with the aid of the setup intrduced in Section 1 . 0
(.9 21)
in a real, continuous-
-,H(*))
for all s E [ T,
m).
*In the older literature, starting with KALM&N [1960b, Definition (5.23)1, it is this concept which is called "observability". By hindsight, the present choice of words seems to be more natural to the writer.
R. E. Kalman
~(rr)$(u, T)X = 0
for a l l u E (-
m,
r]. the
(The d e f i n i t i o n
obvious but i s i n f a c t strongly swgested by s t a t i s t i c a l f i l t e r i n g theory (see Section 10). I n any case, Definition (2.21) comple-
ments Definition (2.20) i n exactly t h e same way as Definition (2.1) complements Definition (2.2)
-,H(*))
an event
(T, x)
is
x E kernel 2 ( ~ ,t )
g, t -
t r, where
(b)
x kernel ~ ( s T) ,
s E
g, -
< T,
where
R . E.Kalman
PROOF.
E kernel ~ ( r ,t) @
Hs@(, ()~s
rx )
Part
REMARK.
and let us indulge (only temporarily) in abusas of language of the following sort:*
(, T
x = unreachable )
< *
x E kernel $(T, t)
observable
x range ;(T,
t)
for some t > T. From these relations we can easily deduce the so-called "duality rules"; that is, problems involving observability (or constructibility) are converted into problems involving reachability (or controllability) in a suitably defined dual system. See KAINAN, FUB, and ARBIB [ 1 6 , Chapter 2 Proposition (6.12) 99 ,
0 discussion in Section 1 .
( . ) "dualizes" to the 27
(.3 22)
(F,
--------------11
- , H)
*All this would be s-t;rictlycorrect if we agreed to replace direct sum" in Proposition ( . 1 and its counterpart ( . 5 by 21) 22) "orthogonal.direct sun"; b ~ t this would be an arbitrary convention which, while convenieat, hzs no natural system-theoretic justifica'(2.. tion. ,Reread~erna+%. 13)
R. E Ealman
(.4 22)
rank
(a',
FH, ''
..., (F*)"-%')
n .
By duality, com2lete constructibility in a continuous-time system is equivalent to observability; in a discrete-time system this is not true in general but it is true when det F It is easy to see also that (2.11) "dualizes" to: (2.25)
0 .
of a real, continuous-
-,H)
PROOF.
X1
. [ 1962I
R. E. K a l m a n
.
The main motivation for the succeeding
3.
A c l a s s i c a l dynamical system i s e s s e n t i a l l y t h e a c t i o n of t h e
time s e t
(= r e a l s ) .on t h e s t a t e s X.
I n other words, t h e
(R + u s u a l -
'
on X;
t h e a b e l i a n s t r u c t u r e on
t h e r e corresponds a n ( a s s o c i a t i v e
St.
The idea t h a t
always admits such a s t r u c t u r e was apparently overlooked u n t i l t h e l a t e 1950's when it became fashionable i n automata theory This seens t o be t h e "right9' way (school of SCHUTZENBERGER). of t r a n s l a t i n g t h e i n t u i t i v e notion of dynamics i n t o mthematics, and it w i l l be fundamental i n our succeeding investigations.
(3.1)
T = time s e t integers.
Since w s h a l l be only i n t e r e s t e d i n constant systems from e here on, we s h a l l adopt t h e following normalization convention:*
*In t h e d i s c w d t i m e nonconstant case, we would have t o deal with Z copies of St, each normalized with respect t o a d i f f e r e n t p a r t i c u l a r value of T E Z. -
R E. Kalman
N element of o
i s defined f o r
t > T = 0.
Icul
I n view of (3.2))
.
of
cu by
.
= max
Ct E Z: - -
cu
is n ~ t defined for-any s
< t).
t h e ( l e f t ) s h i f t operator
>0 -
i n Z_ by -
Note t h a t t h e d e f i n i t i o n of t i o n (3.2).
If
Ju
nJut
= empty f o r
o, cut E R,
we define t h e Join
of
cu
and cut
as t h e function
, cut
by cu
+ cut.
X Q
+ R,
c a l l e d concatenation, defined by ..
i s well defined.
i s a semigroup with va
.-.---I---
R. E.Kalman
systems, whose evolution was strongly influenced by problems in communications and automata theory: input/output map a discrete-time constant
clearly imply "constancyt1(implicitly, however, this is clear from the normalization assumption ( . ) on 32
a. )
-: f
R+
..
: m w
( ( ( - *)= f f, y* , 4q
( ( 1 ~ ( 2 1 , .* = * ~1,
)*
y2, ()
...
--------------*Observe that xow is the strict analog of the notation xt customary in topological dynamics. The action of w on x satis= (XU) ov in view of (1.5) fies XO(UOV)
R. E. Kalman
input
w
which stops a t
t = 0.
This d e f i n i t i o n expresses c a u s a l i t y more f o r c e f u l l y and incorporates copstanc:r, operator f o r any provided w define t h e ( l e f t ) s h i f t e So,
u
T
on I'
> 0, -
T Z, -
Note:
the operator c
uQ
0,
the
operator
" l d i s ~ a r d t h e~ term ~ ( 1 ) . s~
f,
A discrete-time, -
i s any map
such t h a t
i s commut&tive. W s q t -- f e hat
i s line ----- a r
i f f it i s a K-vector -------
L I
R. E. Kalman
system to experiments in which only input/outgut data is available. This point of view is related to experimental physics the same w y as the classical notion of a dynamlcal system is related a
-0 6
Newtonian (axiomatic) physics. The basic question which motivates much of what will follow
(.1 31)
and
I?)
in a mathematically consistent, rigorous, and natural m y , the properties of the system C which is supposed to underlie the given input/output r r q f? This suggests immediately the following fundamental concept: (.2 31) DEFINITION. A fixed dynamical system C (internal
definition, as in Section 1 is a realization of a fixed input/ ) output map fo iff f0 = f z 0 the input/output map of Zo.
In view of the notations of Section 1 plus the special convention (3.6), the explicit form of the realization condition is simply that
for all
u ,
SZ.
The symbol
brium state in which Lo remins, by definition, until the application of w. (~ater simply take we
to be 0 ) .
R; E.Kalman
To solve the realization problem, the critical step is to induce a definition of X (of some
zo)
It is rather surprising that this step turns out to be trivial, on the abstract level.
$2
into
Then - cu
-f
U1
f(cu0.v)
There are
I T -
f(utov)
for all
n.
reasons (which are scattered throughout the literature and concea-. trated especially strongly in KALMCLN, FAB, and ARBIB [ 9 9 ) for 16] using this as the
(.5 31)
=
MAIN DEFINITION.
-fJ
denoted as Xf = - { ( ) u f: u E O , )
l
input/output map f
Let us veri*
sense:
R. E.Kalman
(3.16)
PROPOSITION.
such t h a t
(a) Zf
realizes
f;
PROOF.
Cf,
given
f.
W e
C
C f
by (b) by
Further, w define t h e e
on t h e l e f t of
! )
x as - (a)
f.
by
e Again, t h i s map i s well defined since w can take special case i n (3.14). Then
V =.
PI.. as a
. i s correct.
(3.19)
COMMENTS. In automata theory,
Cf
i s known a s t h e
Clearly, any two
f.
(This observation
is a weak version of Theorem D of the Introduction; here "uniqueness" means "moiiulo a pernutation of the labels of eleme~tsin the set Xf".) Eiotice also that Ef is comgletely reachable of Xf
,,
every eLement x =
cut
( u ) ~ . As to observability of Cp
see Section 1 . 0
K. E Kalman
It i s assumed t h a t t h e reader i s conversant with modern algebra (especially: a b e l i a n groups, commutative rings, f i e l d s , modules, t h e r i n g
of polynomials i n one variabletand t h e theory of elementary d i v i s o r s ) , on t h e l e v e l of, say, VAN DER WAERDEN, LANG ZARISKt and S M E [1958, Vol. A UL on d a t e s from
119651, W [ 19651 o r
11.
1965 or l a t e r .
A l l systems C
(F, G, H)
a r e discrete-time, l i n e a r ,
K
(4.2)
state set
FCTNDAMENTAL T E R M O LINEAR S S E T E R . HO E F YT M H O Y
The n a t u r a l
output map
over a fixed f i e l d
admits t h e s t r u c t u r e of a f i n i t e l y
.
K[Z]
(4-3)
C M E T. O MNS
C,
Since t h e r i n g
w i l l be seen t o be r e l a t e d
t o ' t h e inputs t o
It t u r n s out, however, t h a t t h i s a c t i o n of
from combining t h e concatenation product i n
on X,
which r e s u l t s
with t h e d e f i n i t i o n of
R. E, Kalman
Our theorem a s s e r t s t h e
X.
This
dynamics, but depends c r i t i c a l l y on t h e additive structure on R and on t h e l i n e a r i t y of f. The relevant multiplication i s not
(noncommutative) concatenation but (commutative) convolution (because convolution i s t h e natural product i n K[z]) ; dynamics i s thereby r e s t a t e d i n such a way t h a t t h e t o o l s of commutative algebra become applicable. (4.30)), I n a c e r t a i n r a t h e r d e f i n i t e sense (see a l s o Remark
of t h e Laplace transformation, especially a s regards t h e practices developed i n e l e c t r i c a l engineering i n t h e U.S. during the 1950's.
The proof of Theorem (4.2) consists i n a long sequence of canonic a l constructions and the v e r i f i c a t i o n t h a t everything i s well defined and works a s needed. , I n view of (4.1) and the conventions made i n Section 1 R be viewed a s a K-vector space and w ( t ) = 0 and a l l cu E R. ~ ( t= 0 ) (a) m y
f o r almost a l l t E Z_ -
for a l l t
R
> 0. As a r e s u l t , we have t h a t :
a s a K-vector space. Let us exhibit t h e isomor-
P[z]
phism e x p l i c i t l y a s follows:
By (3.2
The isomorphism
R. E,Kalman
I n t h e sequel, w e
and
co as an m-vector polynomial.
(b)
i2
action of
by scalar multiplication a s
-: K[z] X R
where
R:
(n, w)
a.w
(4.5)
( a . K[z], j
1 ,
..., m).
m e product of
product i n ~ [ z ] . W e write t h e scalar product on t h e l e f t , t o avoid any confusion with notation (3.6 ) axioms a r e verified;
R
position,
(c)
i2
t h e action of t h e s h i f t operator
z.
cR i s represented
by m u l t i p l i c a t i o n by
R. E. Kalman
(d)
Each element of
I'
i s a formal power s e r i e s i n
z 1 In fact,
-.
-
as an abstract representation of
t E Z; -
By (3.8 ) and
(4 . ) y ( t ) I,
E KP
f o r each
not defined) f o r
nonzero terms; t h e r e i s no question of convergence and t h e right-hand side of (4.7) i s t o be interpreted s t i c t l y a9gebraically a s a formal power series. that (e) Since Y(Q)
we can say a l s o .
K~[[Z,-~I]
-1 w i t h coefficients i n
0
K ~ ) consisting
first t e r ~ .
has t h e structure of a
multi~lication defined a s
: ~ [ z X I? ]
r:
('IT,.
Y)
H8 - Y
= 'IT(G~)Y.
z -1 by a polynomial i n
z. The verification of
R E Kalman
(g)
.. .
.f
i s a ~ [ z ]homomorphism.
G?/kernel f . . This i s an easy but highly nontrivial lemma, connecting Nerode equivalence with the module structure on R. immediate consequence of the formula The proof i s an
I n fact, by K-linearity of
f,
.(4;9) implies
for a l l v E Q
f(0ov)
i f and only i f
f(m1ov)
k f ( a -0)
= f(zk*ml) f o r a l l k
> 0 in -
Z. -
lemma identifies X f
module Q/kernel f .
K[z]
quotient
["If = m
..
..
kernel f ; [ellp
then
it i s clear t h a t Xf
since R
as a K[z]-moduie i s generated by
el
i t s e l f i s generated by
...
is
...., [e,lf,
Note also
e m
(see (4.6)).
R.E Kalrnan
(4.11) define f.
REMARK.
From the point of view of homological algebra [MAC LANE Since every f r e e module i s
Zf
i s clearly a
f
K 21-subnodule of [
and so X
and
zf' a r e
isomorphic a l s o a s
known t h a t Exercise 21
Xf
I?
Zf:
[elf
f(m)
exhibits
but its fill implications a r e not y e t understood a t present. There i s an easy counterpart of Theorem (4.2) which concerns a dynamical systen given i n " i n t e r n a l " form: (4.12) PROPOSITION. The s t a t e s e t
of every discrete-time,
finite-dimensional, l i n e a r , constant dynamical system C = (F, G, -) admits t h e s t r u c t u r e of a PROOF. K-vector space. K[z]-module.
X =
By
d e f i n i t i o n (see (1.10)),
K"
i s already a
W make it i n t o a e
K[z]-module by defining
R E Kalman
.' .'
(.4 41)
COMMENT.
F :
C provided by f
[] Proposition (3.16)) the state set X can be treated as a K z module irrespective as to whether X is constructed from f
(X =
xf)
(X =
x~).
Thus
the K[z]-module
and the "internalI1definitions of a dynamical system. Henceforth we shall talk about a (discrete-time, linear, constant dynamical) system
C
5.
We shall now give some examples of using module-theoretic language to express standard facts encountered before.
(.5 41)
Fz
PROPOSITION.
I f
is the state-module of C ,
-
the map
is given by X ->
X: x H z - x .
PROOF.
3.
If X
Xf
3,
f
since x 0 ()
x1 ()
R.' E. Kalman
and we get
So t h e a s s e r t i o n i s again verified. N w we can replace Proposition (2.14) by the much more elegant o
(4.16)
PROPOSITION.
A system C = (F, G, -)
i s completely reachable
3.
. i s expressible a s
xE
I n view of (4.15),
t h i s i s t h e same a s requiring t h a t
x be expressible
The reachable s t a t e s of
C a r e precisely
G.
The statement t h a t
X
C.
assume t h a t
i n f i n i t e l y many generators, t h a t is, i n f i n i t e l y many input channels, seems r a t h e r bizzare a t present. (4.19) PRQPOSITION. The system Xf PROOF.
i s completely reachable.
a state
x = [ Elf
i s reached by
(4.20)
PROPOSITION. PROOF.
The system Xf
i s completely observable.
~ ( [ mf]) = f (u) = 0
iff
m E [OIf,
Xf
is 0 E Xfr
Let us generalize t'no l a s t r e s u l t t o obtain a m0dLl.e-theoretic c r i t e r i ~ n f o r com2lete observability. doing t h i s . There a r e two t e c h n i c a l l y d i f f e r e n t ways of
submodule (see Ccrollary (4.17)) i s a quotient module. observability via t h e "dual1' system
( F I , Ht, -)
associated with
(F,
-,H).
-,H)
K[ 21-module
and K-homomorphism
H:
% -t
Y .'K =
R. E. Kalman
t o a K[z]-homomorphism
by s e t t i n g
can say t h a t
%/kernel
Ti
i s t h e module of observable s t a t e s of
A system C = (F,
- ,
H)
%/kernel
3.
(4.22)
COROLLARY.
The observable s t a t e s of
a r e t o be identified.
%/kernel
E .
(4.23)
TR I OO Y E MN L G .
3=$
of a sub-
E )
of
%,
that i s
If
"5
we say t h a t
R. E. Kalman
(4.24) between
CORRESPoNDE3lCETHEOFZ31.
Thereisabijectivecorrespondence
and t h e equivalence c l a s s of modulo a
K[z]-homomorphisms
f: 0
5.
7.
DEFINITION.
HI,
The K-linear
C.
dual of
C = (F, G, H) i s
The s t a t e s of
G I ) ( 1 = matrix transpositton).
a r e c a l l e d c o s t a t e s of
%* of
Z*
may be given t h e
s t r u c t u r e of
module, as follows:
( i ) as a vector spece
X2,
i s t h e dual of
product i n
i s defined by
(4.26~)
REMARK.
W cannot define e
Xz*
has a t r i v i a l D-dual.
(3, z) /K[ z 1) . K(
Section
4, No. 81.
R , E. Icalman
N w w verify e a s i l y t h e follo-+ring o e dual statements of (4.16-17):
(4.2'7)
PROPOSITION.
A system C = (F,
-,H)
i s completely observable
if and only if
H generates I
%*.
C*
(4.28)
COROLLARY.
a r e precisely
t h e reachable s t a t e s of
C, *
%*
generated bx H I
W have eliminated the abuse of languzge incurred by talking e about t'observzble s t a t e s t ' tbzough introduction of t h e new notion of "observable COstatesl'. The full explication of why t h i s i s necessary
(as well a s natural) i s postponed u n t i l Section 10. The preceding simple f a c t s depend only on t h e notion of a module and are immediate once w recognize the f a c t t h a t e
F may be eliminated F
m of an R-module
(R = a r b i t r a r y such t h a t
M
r a m = 0.
m i s free.
Similarly,
is
M has torsion.
If
L C M
i s any subset of
K ,
(r:
t h e annihilator
of
is the set
r-I = 0
for a l l
I E L); R.
it follows immediately t h a t AL
i s an i d e a l i n
Note a l s o t h a t
R.E.Kalman
the statement "M that is a torsion mo6uleV does not imply in general
(~ounterexample: take
an M which is not finitely generated.) Coupling these notions with the special fact that, for us,
R = Kz, []
(.9 42)
is a torsion K[z]-module.
COROLLkUY.
I f
See ( . 8 . 11)
Sufficiency. By assumption X is-finitelygenerated by, say, q nonzero elements necessarilg the columiis of G )
... .
Hence
of XL
is a princi-
d t h Y. E K.. f1
If
..., q.
= 0 - contr y to assumption.
R. E, Kalman
by the simpler.one
F x u z-x. If :
This means (by the usual definition of the minimal polynomial in matrix theory or more generally in linear algebra) that xE
qF annihilates every
I3
so that F z
is a torsion K[z]-module.
Notice, from the second half of the proof, that the notion of a minimal polynomial can be extended from K-linear algebra to K[z]-modules. In fact, the same argument gives us also the well-known
(.0 43)
qM given by
+ = "MR.
X is finitely generated with
then -
(.1 43)
COROLLARY. If a K[z]-module
(as K-vector space) - q~deg qx. The fact that Lf is completely reachable and is
<
therefore generated.by m vectors allows us to estimate the dimension of Ef by (4.31) knowing only deg
Xf itself.
(Knowing Xf
z-x, etc
.r
In other words, the module-theoretic setup considerably enhances the content of Proposition (3.16)
(.3 43)
C
w
PROPOSITION.
is
Suppose for
% = K[.zl- Then
the input 5
followed
is free.
( . ) is due to the 42
fact that through it we can apply to iinear dynamical systems the well-known
(4.34)
qi1qiTl, i =
2 ,
..., q,
< m. ,
Various proofs of this theorem are referenced in IWiMAN) FALB, and ARBIB Note:
VM
q. l
One important consequence of this theorem (others in Section 7 ) is that it gives us the most general situation when torsion module C.
is not a we
( . 6) 43
reachable and completely controllable if its K[z]-module m-dimensional components (i. e, . s > 0 in (4.35 ) )
X has any
.
most of our
( . 7) 43
REMARK.
be regarded as a deep examination of Proposition ( . 4 ) 21) coments apply equally well to (2.7),
the same algebraic condition ( . ) In fact, the only remaining 28. thing to be "algebraizedn is the notion of "continuous-time". We
shall not do this here. Once this last step is taken, the algebraization
5. .CYCLICITYAND RELAW
QUESTIONS
generated by one
e E :
with gecerztx m
is isomorphic with the quotient ring 8/~, viewe6 es sn 3-r3dule. This result is much more i~teresting when, as in our case, Ii is not only commutative and a principal-iseal dozain, but specifically [] the polynomial ring K z . So let X be a cyclic K[z]-module with generator g and let
A = qgK[z],
g
where
is the min-1
g
g .
is a minimal
polynomial also for X. Write $ = $ = p. Io view of (5.l, ) g x X " ~[z]j$~[z]. Let us recall sone features of the ring K[z]/?l.~[z]: (i)
?r
E K[z]
Write these as
[T] or [n$.
Multiplication is 6eiin.d
as
[I
is either a
=
or a divisor of zero.
T,
In fect,
Jr
is a
[TI
R. E. Kalman
u n i t i n K[z]
( t h a t is, (n, )
K)
Then
m + T$
so t h a t
(7J,
= 1 (u, T K C ~ ] )
[a]
= 0
i s t h e inverse of
In].
unit i n K[z],
then both
[n].[$/@] = [(n/Q)$] = 0 .
i s a prime i n
KI z ]
(that
ts,
an irreducible poly-
K),
then
K[z]/$Jc[z] i s a f i e l d .
i n algebraic number theory. Since it i s awkward t o compute with equivalence classes s h a l l often prefer t o work with t h e standard representative of namely a polynomial mined by [TI of l e a s t degree i n [ I . [n], .we
[a],
? i s uniquely deterHenceforth
"
< deg
*.
will
always be used i n t h i s sense. The next kwo assertions a r e M e d i a t e : (5 -2) tothe K[z]/$x[
PROPOSITION.
K[z ]/vK[ z 1 a s a
- K-vector
&
space i s isomorphic
21
i s a l s o isomorphic t o
9'") as a
K z]-module, provided [
i.r
(8.1)
n$.
h .
. (5 03)
then -
PROPOSITION.
dim C = deg $.
If
Xx
R. E. Kalman
KC
21-module i s a d i r e c t s m of c y c l i c u
KC
z ] -modules.
By combining
,.-
(5-4)
factors
PROPOSITION.
I f
then
Xz
..
dim Z
=
Q -
deg
(F, g) where g i s an
XF,
t h e module induced by
Let
x,(z)
det
(21
- F),
then
XF.
71 f o r d e t a i l e d
in
$.
[For cansistency,
5 (z) = ~ (n+l)
( z .I)
"dn) a as
Hence
e' "" n are a basis for $ as .a K-vector space. With l respect to this basis, the K-homomorphism
his
d?gA
R .E, Kalman
z-e = 1
2 %
(4 (z)og,
By definition, g = e n
Conversely, suppose (? & have the matrix representation (5.6-7) I , with respect to some basis in 3Sn. Then (by direct computation)
the rank condition (2.8) is satisfied and therefore (F, g) is completely reachable in both the continuous-time and discretetime cases (Propositions (2.7) and ( . 6 ) 21). We have now proved:
(5 -8)
is completely reachable
(.) 56
*
n z
(5.7).
Given an arbitrary n-th degree polynomial
(.) 59
hz ()
=
COZOLLARY.
+ p1zn-1
.:. + Pn
in
Kz, []
(, g F )
is completely reachable.
R . E.Kalman
is completely reachable.
With respect to the same basis (5.5) which exhibits the canonical forms
(5.6-?) , define
'
is not completely
reachable. Then, recalling Proposition ( . 2 (which is an 21) algebraic consequence of ( . ) and hence equally valid for both 28 continuous-time and discrete-time), Since 2 2 the polynomial deg dim X > 0 and so is also
2
=
is an F-invariant subspace of X
K",
5
22
X. 2)
In view of ( . 2 , 21)
R. E, Kalman
(5.11)
n.
There i s a b i j e c t i o n between
+
... + p,
3
e:
t-tp: x ( J ) . ~ I + l . . g
such t h a t
(j = 1 . , n and
K z] and K-homomorphisms [
X
defined
as i n -(5.5))
x t+ in
zox
l?(x)
l?(x) corresponds t o
gIlx
(5.10).
l?
-)
z t o z*
Since t h e vectors
X(')*g,
..., X(").g
form a W e
? ,
a' i s c l e a r l y a well-defined
K-homomorphism.
formally as an element of
Ki z ]
is a
[ 5 1 = ( 5: 5.g = x)
Unless
pj
t h a t t h i s choice of
j-= , 1
or 3'
j = 1 ,
.
-
n.
W prove f i r s t e for
implies
h(')(z
j.
I) = ~ ( " ( z )
By definition,
...) n
-t-
1 Use induction on .
4)(z - 8 ) '
= X(')
n. E. Kalman
So the open-loop/closed-loop transformation is essentially a change in the canonical basis, provided X is cyclic. It is interesting that the x(') have long been known in
present (very natural) use in module theory seems to be new. -Theorem' ( . 1 b y be viewed as the central special case 51)
--
of Theorem A of the Introduction. Let us restate the latter in precise form as follows:
(..) 513
hz = zn + Bz n-1 () 1
* o m
'&
, in
~[z], K
= arbitrary field.
There exists an n X m matrix L over K such that if and only if (F, G is completely reachable. )
kGL, = h
For some timz, this result had the status of a well-known folk theorem, considered to be a straightforward consequence of ( . ) 59. has been discovered independently by many people. The latter
( first heard 1
of it in 1958, proposed as a conjecture by J E Bertram and proved . . soon arterwards by the so-called root-locus method.) Indeed, the
t
passage from (5.11) to (5.13) is primarily a technical problem. A proof of (5.13) was given by LAKGET~OP [ 9 ] and subseq~eritly 1& simplified by W O W ? [1967]. T'e first proof was (~n~ecessarily) very long, but the second proof is also unsatisfactory; since it-dependson arguments using a splitting field of
*The School.
R.E.Kalman
and fail when K is a finite field. We shall use this situation
as an excuse to illustrate the power of the module-theoretic approach and to give a proof of (5.13) valid for arbitrary fields. The procedure of LANGENHOP and WONHAM rests on the following fact, of which we give a module-theoretic proof:
LEMMA.
Let
be cyclic* and an -
m = 1 is trivial. The general case mounts to the following. Consider the submodule Y of X =
gp
-**,
%-1
of G. In view of (5.15))
Y is cyclic. By the
inductive hypotinesis, we are given the existence of a cyclic generator of Y of the form g = a.g + + a,_l-%-l, ai E K. Y 11 We must prove: for suitable a p E K the vector a-gy + , is a cyclic generator for X .
...
(see (4.13))
By hypothesis,
gX.
By c y c l i c i t y we have t h e representations
f a r suitable
a, f3 E K t h e polynomial
+ &I
is a unit i n
K z]/$K[ z ] [
This,
where
Q, 1
..., Qr
mod Q
i n K[z]
a r e t h e unique priloe f a c t o r s of
.
zero.
Let
classes
Then no p a i r
Q
i = 1 ,
..
can be annihilates
) ,
t h a t is,
XI
yei
= ~ [ z ] g K[z]% ~
whence
i s a proper sub-
contradicting t h e f a c t t h a t
(F, G) i s completely
If a l l the
pi
.. f Ti
0,
so
is a unit in
So l e t
K[ Z]/%K[ a],
= 1 Then .
t h e condition
?i + bi =
Since
K
eliminates a t most
values of
f3 f r o n consideration.
i s i n f i n i t e by
f3 which s a t i s f y (5.16)
.
a E
An e s s e n t i a l p a r t of t h e lemma i s t h e s t i p u l a t i o n t h a t
The hypothesis
II
p.
F = cyclic
H. E. Kalman
that is, the lemnz is trivleUy true for some a E K~[Z] since
g~
= Ga.
between vector-space structure and module structure, and for this reason the lemma is nontrivial. 'As a-matter ofaf&ct, the l e m a is false when K = finite field. The simplest counterexample is provided when ( . 2 rules out a single nonzero value of f3, 51) out thereby ruling
p.
COWE-LE. Let K = 2/2Z, that is, the ring of
( 5 017)
Consider
where the
( X X
= I ,
hence
X is cyclic. Notice also that gl generates X @ X3 1 Cnerates X2 @ X A cyclic generator for X is 3'
while g 2
R E Kalman
.' .
Conditions
a-i+ p.0
a-0 + p-1
# o # o
(mod
x) 1, (mod x) 2,
- ~322.
claimed to be true in the continuous-time case under the hypc'.:lesis of complete controllability.
(2)
(F = nilpotent, G
is 3-GLindependent of
controllabilityllis replaced by I1complet reachabilitytl, e this modification being immaterial in the continuous-time case.
R. E. Kalman
(4)
reflected in module-theoretic properties is true, then (5.13) , should hold without assumptions concerning K because the principal module-theoretic fact, that K z] [
=
independent of the specific choice of K . We now proceed to establlsh Theorem (5.13). nypotheses on K will turn out to be irrelevant. PROOF OF (5.13). Necessity is proved exactly as in ( . ) 58. once we have proved it That is, special
(.8 51)
IZMMA.
. Let
g2.
JI
induces a
or
and Z = ~ [ z l2 g
take an
case I Y /Iz = 0 that is, x = Y Z ~n (5.11) . , . . . such that J x = 0 -for all x E Z Replacing z by () .
i will change the ?
z ,
= z
z .
= X
.Now v e r i f y t h a t
Hence
gl
+ P2
as a
K[z ,
1-m~dule.
Cas2 2.
Y ~ = z : W
f:
0.
L2t
W.
37 11-ypothesis,
E ~ [ z ] such t h a t
w = 5 g2 and therefore, by
q E ~ [ z ] such t h a t
Y,
there is also a
S*g2 = w = tl'gl*
0.
1 - 1 50%
q
5) we
a r e done because
generates Y ,
I n t h e n o n t r i v i a l case,
u n i t (mod
3) TO show: .
a s a - K[ , z
on X
such t h a t
X
r , = u n i t (mod l
x,),
X ,
nomial of
1-module.
(5.19)
deg X = n ,
SVslEMK4.
Let
X be a fixed element of X
K[ z ]
with
FX t h e companion matrix of Fp g
given by (5.6),
Fx -
Then
q E K[z]
i s a u n i t modulo
X
is -
Fx
R. E. Kalman
(5.20) S B E bA . Same notations a s i n (5.19). U L Ml . Write i n (5.5)).
7
Then
7, .x(j'(z) J=1 J
(x(')
-defined -
i s a u n i t rno&ul. X
i f and if - only-
where
i s t h e column vector
PROOF.
Since
X( 1 )
...
< n,
the n-tuple FX
( ,
...
i s uhipuely determined by
q.
By d e f i n i t i o n
z : x H z = x relative
..., en
given by
(5.5)
Similarly,
n - j
m: e
J
,
qag in
*x
relativetothesamebasis
el.'" ' ?
en'
Recall
R , E. Kalman
that By
' ; i e g
generates
iff
( F B ( F ~ ) ~ ) complete reachable. ~ is
The r e s t
.
Same notations a s i n (5.19) g & (5.20). Given n-vector (5.'22), t h e r e e x i s t s a polynomial
such t h a t (5.21) i s s a t i s f i e d .
PROOF. Let
numbers
N
ql,
q2,
...
and determine t h e f i r s t
c o e f f i c i e n t s of
X by t h e r u l e
ar)
reduce t h e matrix i n (5.21) t o t h e d i r e c t sum of two t r i a n g u l a r m t r i c e s , each with nonzero elements on i t s diagonal. I n view of (5.12), always choose a new
'$
xt
qt = u n i t mod X. ,.
R. E. Kalman
z, '
direct sun? is now with respect to the K-madule structure of X . from Y to X polynomial X , (.2, 51)
'cjr
settin;
81 Z' = 9.
2 ,
On Y ,
t, l
qt
BY
5 is replaced by some 5 ,
such that
in W induces a structure
~[z*].
g2
is The
It should be c l e a r t h a t Theorem (5.13) i s not a purely modulet h e o r e t i c r e s u l t , but depends on the interplay between module theory, vector-spaces, and elimination theory ( v i a (5.21)). the fact t h a t
1
t o X,
i n t h e proof of Case 2, i s a t y p i c a l vector-space argument.** There a r e many open (or forgotten) r e s u l t s concerning cyclic modules which are of i n t e r e s t i n system theory. For instance, it
i s easy t o show t h a t an
polynomial
Y
n X n
r e a l matrix i s cyclic i f f a c e r t a i n
Y E g[zl, -
..., zn2 ]
i s nonzero a t
det
F;
t h e polynomial
i n t h e same ring,
Y
W must not terminate t h i s discussion without pointing out e another consequence of c y c l i c i t y which transcends the module framework. Since X = c y c l i c with generator g
i s isomorphic with
K[ z 1/xgK[ 2 1,
If
X
= irreducible, then X i s even a f i e l d . Hence, i n particula,r, g has a galois group. N one has ever given a &vnamical interpret%o
X
.. .
I
t i o n of t h i s galois grou2.
f a c t s i n t h e theory of dyraraical systems which have never been examined t h e +mica1 point of view. For some r e l a t e d comments i n the
d from
mw i
R. E ,Kalman
PREAMBLE.
ing (especially i n e l e c t r i c a l engineering i n t h e United S t a t e s during 1940-1960) t h a t seeks t o phrase a l l r e s u l t s of t h e theory of l i n e a r constant dynamical systems i n t h e language of t h e Laplace transform. Textbooks i n t h i s a r e a o f t e n t r y t o motivate t h e i r biased p o i n t of view by claiming t h a t "the Laplace transform reduces the a n a l y t i c a l problem of solving a d i f f e r e n t i a l equation t o an algebraic problem". When d i r e c t e d t o a mathematician, such claims a r e highly misleading
because t h e mthematical ideas of t h e Laplace transform a r e never i n f a c t used. are The ideas which - a c t u c l l y used belong t o c l a s s i c a l properties of r a t i o n a l functions, t h e More importantly,
t h e word "algebraict1 i s used i n engineering i n an archaic sense and t h e a c t u a l (modern) algebraic content of engineering education and p r a c t i c e as r e l a t e d t o l i n e a r systems i B very meager. For example,
t h e c r u c i a l concept of t h e t r a n s f e r function i s usually introduced v i a h e u r i s t i c argumerits based on linea,rity o r "defined" purely formally a s "the r a t i o of Laplace transforms of t h e output over t h e input". To
and purely algebraic gadget, requires a d r a s t i c a l l y new point of view, which i s now a t hand a s t h e machinery s e t up i n Sections 3-5. e s s e n t i a l idea of our present treatment was f i r s t published i n
The
KALMAN [ 19653I.
R . E. Kalman
The first purpose of this section is to give an intrinsically algebraic definition of the transfer f'unction associated with a discrete-time, constant, linear input/output map (see ~efinition(3.10)) Since the applications of transfer functions are standard, we shall not develop them in detail, but we do want to emphasize their role in relating the classical invariant factor theorem for polynomial matrices to the corresponding module theorem
(.4. 43)
(see lemma
..., m,
ej defined by
(.), 46)
(The scalar product on the right is that in the K[z]-module defined in Section 4.) By definition of I, ?
I',
as
each f( e .) is a formal J 1 with vanishing first term. We shall try to power series in z
represent these formal power series by ratios of polynomials (which we shall call transfer functions%.) and then we can replace formula ( . ) 61 by a certain specially defined product of a ratio of polynomials by a polynomial. Some algebraic sophistication will be needed to find the correct rules of calculations. These "rules" will consititute a rigorous (and simple) version of Heaviside s so-called llcalculusll. There are no conceptual complications of any sort.
R', E. Kalman
Let a
Xf = n/kernel f
be t h e s t a t e s e t of
regarded a s
~ [ ~ ] - m ~ d u lW. assume t h a t ee
Xf
i s a t o r s i o n module with n o n t r i v i a l
m i n i m a l polynomial
Jr.
Then, f o r eac3 j = 1 ,
..., m
we have
r,
J
f ( e .)
Jr
is
In - t u i t i- v e l y-, ----
f (ej) = Oj/).
a s t h e formal d i v i s i o n of
by
Jr
i n t o ascending powers of
0.
z-1
Check t h a t t h e c o e f f i c i e n t of
zO i s always
Verify by comptation
z ' ~ . Write
.J
( z ) = z n ( )
Then
E ~ [ z - l ] K[ [z-'I] C
and (6.2')
becomes
Moreover, t h e
0-th
c b e f f i c i e n t of
is
R. E. Kalman
t h a t t h e leading c o e f f i c i e n t of
$
is
l),
hence
i s a unit i n
K[ [ zI -] '
and t h e r e f o r e
Note t h a t (6.3) and (6.3') a c t u a l l y give s l i g h t l y d i f f e r e n t d e f i n i e f ( e . ) , depending on whether w use a t r a n s f e r function with J -1 ( ~ 0 t h respect t o t h e v a r i a b l e z or z notations have been used t i o n s of
i n t h e engineering l i t e r a t u r e . )
preferable.
ig
1 .)
h he
K[ [ z-'I
There i s a b i j e c t i v e
f: R + I '
KC z 1-homomorphisrs
with minimal
where
Q.E K [' J
21,
&g
denomi.nator of
j .
< Geg I,
and -
I i s t h e l e a s t common
corres-
r a t h e r than with
itself.
Zf
A
=
!Thus, f o r instance,
dim Zf
dim X
f;
Z,
qZ = i e a s t c o m n denominator of
=
minimal polynoraial of
fZ.
R. E. Kalman
RENARK.
the natural
r e a l i z a t i o n o f Z,
,
Z
i s completely reachable a s
Very few
t h e o r e t i c a l r e s u l t s could be conclusively established by t h i s route u n t i l t h e conceptual foundations of t h e theory of reachability and observability were developed. The preceding r e s u l t s may be r e s t a t e d as t'rulestt whereby t h e values 03 f may be computed using
Z.
W have i n fact, e
f(cu) = Z a ,
\q1lere
$Z consisting of t h e numerators of Z with cu, reduce t o minimaldegree polynomials modulo $ and then divide as i n Method 1 above. formally by
( t h a t is,
(*h)/$,
same as above, but do not reduce modulo
$.
are
thrown away i n t h e d e f i n i t i o n of
R. E. Kalman
of t h e s c a l a r product i n for
P,
Xf =
kernel
f.)
.
h his
is
i s given by
zf
= H(ZI
F)-'G.
z = d/dt
or from (1.17) by s e t t i n g
x ( t + 1 ) = z x ( t ) .) via t h e formula
is
(ZI - F ) - ~ =
where
J "
) (z) ,
deg 9,
The matrix
1898, 9 1
upon s e t t i n g w = F,
T =
JrF,
+ Zf
(6.8).
See
the
conventions-rule (6.7))
.I ..
R. E'. Kalman
aspect of t h e prablem i s t o show how t o r e l a t e t h e module invariant f a c t o r s occuring i n t h e s t r u c t u r e thsorem (4.34) t o t h e c l a s s i c a l f a c t s concerning t h e invariant f a c t o r s of a polynomial matrix.
(6.9)
Let
P - p X m be a Then
where A
and -
are -
p X p
and -
and
II
diag (Al,
i = 1 ,
..., q - 1,
P.
and
= rank P.
The
a r e c a l l e d the invariant f a c t o r s of
As anyone would expect, t h e r e i s a correspondence between the module s t r u c t u r e theorem ( b .34) and t h e matrix s t r u c t u r e theorem (6.9) and, i n p a r t i c u l a r , between t h e respective invariant f a c t o r s and Al,
...,
11-
..., .
ing CURTIS and REINXR 11962, $13.31 who a l s o give a proof of (6.9). P O F O (4.34) RO F onto
.
I +
given by
p:
a r e t h e standard gi generate
b a s i s elements of Clearly,
M.
R~/N,
where
It can be proved t h a t
" Ra
i s a f r e e s~bmoduleof
R ~ , with a b a s i s of a t most
<m R.
elements.
of
as
C pij*ei, pij E
Apply
A
Define fk
=
e =
Za
w . .. By (6.10-ll), e , ij 1
C cij*fi,
8
-1 B ,
?i*9i. Hence
Jri
= hi
and r
rank P = J.
(.2 61)
$Z
THEOREM.
...
1 ,
..., q.
Then the
invariant factors of
% are
where
1= r
+ 1, ...,
EKlO7.
JrI A. for 1
q = rank
-t
%:
o,
[U]~.
~~uiialentl~,
(c, A,
Define
= D
-I
Y where ,
Then AY = 0 ,
(zw = $)
0 and W ,
matrices with this property. So the columns of the-matrix W tute a basis for kernel p.
(6.13)
REN4RK.
The preceding proof remains correct, without any det C det D = units , The former
rather than in K z . []
representation follows trivially from the latter but may be easier to compute
.
REMARK. Theorem (6.12) shows how to compute the invariant from those of $ . We must define the invariant Z (because of the
(.4 61)
factors of XZ
x) Z.
where
Jri
are
the denominators of the scalar transfer function A of all common factors. Theorems
of invariant factors in dynamical systems. Nor is it convenient to o'h deduce all properties of matrix-invariant factors frmte representation
R . E Kalman
theorem (6.9).
It i s i n t e r e s t i n g t h a t t h e sharpened r e s u l t s w present e
~ e A, t
orizaf,ion d o G i n R.
V, W
AIB
(read: A -
B)
i f f t h e r e a r e matrices
B = VAW.
(over
- R,
of appropriate s i z e s )
such t h a t
This i s of course j u s t t h e usual d e f i n i t i o n of "divide" i n a ring, specialized t o t h e noncommutative r i n g of matrices. The following r e s u l t [MUTH 1899, Theorems IIIa-b, p. 521 shows t h a t i n case of p r i n c i p a l - i d e a l domains t h e correspondence between matrices and t h e i r i n v a r i a n t f a c t o r s preserves t h e divide r e l a t i o n ( i s "flunctcrial" with respect t o "divide") : (6.17)
THEOREM.
Let
1
be e, principal-idealdomain.
- AIB Then
Sufficiency.
By hypothesis, t h e r e i s a
(diagonal) such t h a t
i l 3 = 4 Hence .
R. E .Kalman
Necessity. This is just the following
( 1) 6 8
hi(^).
(6.S)
mx.
definition of invariant factors: if A . A = greatest common factor of () J all j X j minors of a matrix A, with A~(A) = 1 then , A.1(A) = A~(A)/A~-~(A) Of course, this definition can be shown to be equivalent (over principal-ideal donains) to that implied by ( . ) 69.
(.0 62)
z 1 2 (read: Z1 divides 1Z
such that Z = V W. 1 Z
2
z2)
over z 1
2
~[z]
I i I .) z
for all i.
(6.21)
THEOREM.
PROOF.
z lz2 l
qi(zl)1 qi(zg)
t() ,z
given by ( . 5 . 61)
R. E. Kalman
(6.22)
DEFINITION.
Ell.$
(read: -
C can be simulated by 1
c*)
iff T I T y
This d e f i n i t i o n i s a l s o f'unctorially r e l a t e d t o t h e d e f i n i t i o n of "divide" over a p r i n c i p a l i d e a l domain R standard r e s u l t : (6.23) R-modules. of because of t h e following
THEOREM.
Then Y
Let
if
if - and only
Sufficiency. xl,
..?,
Take both
and Y
i n canonical
X,
xr(x)
generating t h e c y c l i c pieces of if
i
Y, 1
Yr(x)
(with yi = O
> r ( ~ ) those )
of
Y.
The
assignment
w ( i ( ~ ) / i ( ~ ) x idefines a monomorphism
X,
that
X.
Similarly, t h e exhibiting
defines an epimorphism X -+ Y
as
X.
(2e ed.),
4, Exercise 81).
where
be a submodule of
X.
By (4.34),
L/N
L, N
a r e f r e e R-modules.
By a c l a s s i c a l isomorphism
N/N,
theorem, and M
where
L 3M 3 N
R E ,Kalman
From t h e last relation, that, f o r any R-module
X
<
and therefore
wk(x)
Since Y
i d e a l generated by
(n: r ( n ~ ) k} C
.
is
i s a submodule of
n X
f o r a l l n R,
it follows that
a submodule of
X.
PROOF.
Immediate from t h e f a c t t h a t
zc
i s a submodule
of
P R I D E C O ~ S I T I O N THEOREM FOR L ~
I DYNAMICAL S Y S ~ . ~
divides
2'
qi(zl)
.Zz
d i v i d e s qi(Z2)
for a l l 2
i.
can be simulated by Zz
1
(6.23),
since
qi(Z) = ti(ZZ)
R. E.Kalman
(6.26)
1-TATION.
The d e f i n i t i o n of
Zll Z2
means, i n syste&
a r e t o be " r e ~ o d e d ~ ~h e o r i g i n a l input t:
B(Z)U~
u ,
i s replaced by
'
an input mg =
and t h e output y2
i s replaced by an output
Z2 w i l l a c t l i k e
I n view of t h e d e f i n i t i o n of a
Z1 = AZ B 2
i s always s a t i s f i e d whenever
A, B a r e replaced by
(reduced modulo @ ) This means t h a t the z2 coding operations can be c a r r i e d out physically given a delay of
x,
d = deg qz u n i t s o f time (or more). NO feedback i s involved i n coding, 2 i t i s merely necessary t o s t o r e t h e d l a s t elements of t h e input and
output sequences. Hence, i n view of Theorem (6.25) and Corollary (6.24),
w can say t h a t it i s possible t o a l t e r t h e meal behavior of a e system C2 a r b i t r a r i l y by external coding involving delay but not feedback i f and only i f t h e invariant f a c t o r s of t h e desired external behavior behavior ( z ~ ) a r e d i v i s o r s of invariant f a c t o r s of the e x t e ~ a l (Z
) of t h e given system. The invariant f a c t o r s may be z2 c a l l e d t h e PRDES of l i n e a r systems: they represent t h e atoms of system
behalfior which cannot be simulated from smaller u n i t s using a r b i t r a r y but feedback-free coding. I n fact, there i s a close (bot not isomorphic)
full treat-
R. E. Kalman
.
Let, us recall the
where
pi: R + X , :
pf =
epimorphism while
monor ~orphism.
rvf
(7-1)
is completeiy observable.
These facts set up a "f'unctor" between system-theoretic notions and algebra which characterize Xf uniquely. Consequently, it is desirable
(7 2)
f: SZ +
R. E. Kalman
of K[z]-homomorphisms.
The
K[z]-module
is
module of the realization. A realization is canonical iff it is completely reachable and completely observable, that is, . p surjective and
L
is injective.
= R,
p =
ln l~
REMARK.
( . ) In fact, 72.
) define C = (F, G, H by
G =
H =
p
L
lcul
1. 1
~(1).
It is easily verified that these rules will define a system with f x = f Given any such C, , . it is also clear that the rules
. define a factorization of f Hence the correspondence between (3.12) and - (7.2) is bijective. The quickest way to exploit the algebraic consequences of our definition (7.2) i s via the following arrow-theoretic fact:
R . E ,Kalman
Y,
and -6
If -
(7 6 )
?
Since every K z 1-module is automatical-lyalso a K-vector space, [
( j .6)
shows that the two state sets are K-isomorphic, that is, have the same dimension as vector spaces. The fact that they are also K[ z1-isomorphic implies, via Theorem (4.34), that they kiave the same invariant factors.
We have already employed the convention that (in view of the bijection
between f and Z ) the invariant factors of f and Xf are to be , f
R. E.Kalman
identified. In view of
(7 -7)
canonical realizations of a fixed. f have isomorphic state modules. The state module of a canonical realization is uniquely characterized (up to isomorphism) by its invariant factors, which may be also viewed as those of f .
( -8) 7
PROPOSITION.
I f
realization f,
- dim X then
. class of all realizations of f This result has-beenused in some of the literature to justify the terminology "minimal realizationn as equivalent to "canonical realization". We shall see in Section 9 that the two notions are not always equivalent; we prefer to view tion and ( . ) as a derived fact. 78
(7 9 )
claimed
REMARK.
Theorem
G, H) and
2=
X such that
R'. E. Kalman
If we identify X and X then A is simply a basis change and it follows that the class of all matrix triples which are canonical with the general linear realizations of a fixed f is isomorphicgroup over X . The actual computation of a canonical realization, that is, of the abstract Nerode equivalence classes [wIf, require a consider-
able mount of applied-mathematical machinery, which will be developed in the next section. The critical hypthesis is the existence of a factorization of f such that dim X
.-
zation, it is possible to obtain a canonical one by a process of reduction. More precisely, we have
(.1 71)
THEORZM.
of
a canonical realization of f;
PROOF. The reachable states Xr =image ir
are a submodule
Hence
X ,
xr/xr x0
"
is a subpuotient of X .
vie the following comutative diagram, where the j's and p s '
canonical injections and projectionsI .
are
R, E .Kalman
(7.12)
for all i (recall also Corollary ( . 4 ) 62) not enough since the
are invariants of module isomorphisms and not i isomorphisms of the commutative diagram ( . ) 72.
The preceding discussion should be kept in mind to gain an overview of the 'algorfthmsto be developed in the next sections.
R. E. Kalman
8.
CONSTRUCTION O REALIZATIONS F
N w w s h a l l develop and generalize the basic algorithm, o r i g i n a l l y o e o due t o B. L. H (see HO and KALWU [1966]), f o r computing a c k o n i c a l r e a l i z a t i o n C = (F, G, H) of a given input/output map f. Most of
D +
r
f
Using only
the K-linearity of
where the
(k > 0) a r e p
K.
C realizes
*Note t o Printer:
(8.3)
is satisfied.
issically,
with .A. We denote by H the p X V block submatrix of g appear-PIv ing in'the upper left-hand corner of H.
=U,v=
> 1 . =I
R. E. Kalman
COROLLARY.
=p, v
(A) -
i s constant f o r a l l p V - ,
sufficiently
dim C =
t h e claim of t h e proposition i s
Assume therefol-e t h a t
dim C
<
Then
by t h e d e f i n i t i o n (8.3) of a r e a l i z a t i o n .
and rank
It i s c l e a r t h a t
rank R
=v
-P
a r e at most
n = dimC.
< -
Our next objective i s t h e proof of t h e converse of t h e corollary. This can be done i n s e v e r a l ways. The o r i g i n a l proof i s due t o HO and XALMAN 11.9661;
similar r e s u l t s were obtained independently and concurreztly by YOULA and TISSI [ 19661 a s well a s by S m M N [ 19661. I R A L T o d i f f e r e n t proofs w
a r e analyzed and compared i n K A Z , ? , FALB, and ARBIB [1969, Chapter 10, Section
111.
A l l proofsdepend on c e r t a i n f i n i t e n e s s arguments.
W e
R,. E. Kalman ,
(.) 86'
DEFINITIOPJ.
the sequence
= min
{at:
rank
gI,;v.=rank ga,+i ,
= 1 2 , ,
... ) <
A"
= {min
...
)<me
and A"
H. -
The equivalence of the two conditions is immediate from the equality of the row rank and column rank of a finite matrix. The proof of the following result (not needed in the sequel) is left for the redder as an exercise in familiarizing himself with the special pattern,,ofthe elements of a Hankel matrix:
, -
are
( . ) is the existence of a finite-dimensional representation S and 86 Z of the shift operator uA acting on a sequence A. The I1operand" will be the Hankel'matrix associated with a given A; As we shall see soon, this representation of the shift operator induces a rule for
R. E. Kalman
DEFIhTTIOX;
on an infinite sequence
A is given by -
(8-8)
MAIN LEMMA.
sequence A has finite length if and only if the shift operator cH has finite-dimensional left and right matrix representations. fiecisely:
1 X J1 '
ht
and -
h"
A' t.
PROOF.
which satatfsfies ( . ) 89
J1lg:
R. E. Kalman
for a l l j = 0 of
...
(where Z
PV
i s the
(p, V ) th
element block
z). -
ZK+1,2 I f
rank lIK+l,
for a l l
0, 1 ,
... ;
Hence the of
the general case follows by repetition of the same argument. existence of the claimed
g -
A"
are proved
A",
l)th
block column of
ly I ,
i s l i n e a r l y dependent on the
zp,
moreover, t h i s So there
m matrices Z
...
j = 0, 1 ,
... .
X
Nw define o
Z_ t o be an -
Atl
Zi
just 'defined:
R. E, Kalman
dual argument. Now we have enough material on hand to prove the strong version
of Corollary ( . ): 85
(.2 81)
!EEOREM.
in infinite sequence A l -
has a finite-dimensional
rea.lization of dimension n if and only i ' the associated Hankel f mfttrix H has finite length -
= (),I,
A) ".
PROOF. Sufficiency. Let gh", be a A" X 1 block column matrix whose first block element is an m X m unit matrix and
-A -
R. E. Kalman
Then, f o r a l l k
> 0, -
comput 2
(8.9).
By d e f i n i t i o n of
cA and
d, -
t h e last
( , l)thelement of 1
i s a r e a l i z a t i o n of
A. ;ary (8.5).
Necessity.
, -
from canohical.
.
4 has f i n i t e W note e
(8.14)
order N
DEFINITIONo An i n f i n i t e sequence
3 -
i s an extension of
of ( t h e i n i t i a l p a r t o f ) an i n f i n i t e sequence
4 iff -
Bk
for
k=l,...,
N. (At, A")
(8.15)
T E RN HOE .
N i n f i n i t e sequence of f i n i t e length o N
> At with
A".
PROOF.
N of
Suppose
A, -
( A t , A"),
N > At =
+ A".
By (8.8),
&
and
3. -
R. E. Kalman
m e sequence A -
i s uniquely determined by
acting on ~ A l , y , ( $ )
_B
i s uniquely determined by
ZB
zAl,
At,
(B) -
from t h e r i g h t . Moreover,
a r e equal by hypothesis on N.
and
Hht,Vt(~)& = . .
on t h e 2nd,
~&,l,Aw
( B) =
..., N-th
Using only t h i s f a c t
I n fact, w have e
.(8.16)
sequence
B. L HO1sREALIZATIONALGORITHM. C o n s i d e r a n y i n f i n i t e .
The -
R. E. Kalman
At, A".
n = rank Ri,,A,,,
and
..
pht
( iii) Compute
where R cm are idempotent "editing" matrices corresponding to the pJ operations "retain only the first p rows" and "retain only the first m columns".
We claim the
(.9 81)
(At, A ) ",
'
PROOF. If C defined by (8.17-18) a realization of 4, is then it is certainly canonical: by ( . ) C has minimal dimension i 84 n
78. the class of all realizations of A and so it is canonical by ( . ) The required verification is interesting. First, drop all subscripts. Observe that - = QCRP is a pseudo-inverse of H, that H# -
R. E.Kalman
is,
E#H --
5. -
Then, by d e f i n i t i o n of
F, G, H,
'2nd
H# , -
,.
m columns of
COlrlMENT.
c -
which i s j u s t
*lfk?
as required.
Z_: -
we simply
2 -A1,
A" (
here (7.11)). Notice a l s o t h a t the proof of (8.19) does not require (8.12) but depends ( j u s t l i k e t h e l a t t e r ) on d i r e c t use of (8.8).
R. E , Kalman
" has 4
finite length". Of
course, this can be done only on the basis of certain special hypotheses on A, (i i) given in advance. (~xamples (i) :
%=0
for all k
>
q;
Fortunately, the difficulty is only apparent, for the preceding developments can be sharpened further:
FlIMWENTAI; THEOREM OF LINEAR REALIZATION THEORY.
Consider
any infinite sequence A and the corresponding Hankel matrix H. Suppose there exist integers ' a r i k
-81,
g l a l J , l
(4) -
= =
rank ra*
gI
A
($1,
(A) =
g l e l l l , I +
of A -
of order
+ I"
with
At
A = = -81, A" =
a"
A.
(8.8),
Define an extension
A -
of A of order -
-81
EM by
R. E Kalman
(8.25)
FINAL COI4ElC.
is - that is presPcrves the block,structureof the data of the problem. course, one can obtain parallel results by treating ordinary matrix, disregarding its block-Iia&el
zJ,,a,,
as an
structure. Such a
described explicitly by SILVEN/IAN [ 1 6 1 and SILVEiRljM and MEADOFIS [ 1 6 1 96 99 There does not seem to be any obvious computation~l advantage associated with the second method.
9.
it i s concerned
-A= Let -
( A ~ , A2,
... )
be an i n f i n i t e
K.
A dynamical
sequence of system
A' =
C = (F, G, H)
i s a p a r t i a l r e a l i z a t i o n of order
of
iff =
&G
for
k = 0,1,
..., r.
A = (A~, =S
sequence A, s
..., As),
> r. -
terms of
A a partial -
The concepts of canonical p a r t i s l r e a l i z a t i o n and minimal p a r t i a l r e a l i z a t i o n w i l l be understood i n exactly the same sense a s f o r a complete realization. W warn t n e reader, however, t h a t now these e
R,E ,Kalman
have i n f i n i t e l y many inequivalent minimil p a r t i a l r e a l i z a t i o n s i f
r
i s sufficiently s m a l l .
According t o t h e Main Theorem (8.21) of t h e theory of r e a l i z a tions, t h e minimal p a r t i a l r e a l i z a t i o n problem has a unique solution whenever t h e rank condition (8.22) i s satgsfied. I f t h e length
of t h e
p a r t i a l sequence i s prescribed a p r i o r i , it may w e l l happen . t h a t (8.22) does not hold. realization sequence of W a t o do? ht Clearly, i f w have a minimal p a r t i a l e
(F, G, H)
A
of order
r w can exten3 t h e p a r t i a l e
=r
on which t h i s r e a l i z a t i o n i s based t o an i n f i n i t e
-
sequence canonically r e a l i z e d by
(F, G, H)
simply by s e t t i n g
=r
i s equivalent t o t h e determination of a l l
=r
such t h a t t h e extended
( i) f i n i t e - dircens -?ci
( i i ) h t s dimension i s minimal i n t h e c l a s s of a11 extensions.
It i s t r i v i a l t o p r o v e . t h a t finite-dimensional extensions e x i s t
f o r any p a r t i a l sequence (of f i n i t e length). Hence t'ne problem i s immediately The
reduced t o determining extensions which have minimal dimension. solution of t h i s l a t t e r problem c o n s i s t s of two steps.
F i r s t , w show e
R. E. Kalman
below by an examination of the Hankel array defined by the partial sequence. Second, and this is rather surprising, we show that the lower bound can be actually attained. For further details, especially the characterization of equivalence classes of the minimal partial and 1970bI. realizations, see KALMAN [ 1969~
(.) 93
lI(Ir)
of ,a partial .
sequence A we mean that r X r block Hankel matrix whose (i, j th ) =r block is Ai+ if - i + j - 1 < r and undefined otherwise. =
=T.
' P
...,
A ;
(l'<,p r) - _< -
(9-4)
.
--a
PROPOSITION. & J e -
Hankel array of A which are linearly independent of the rows .=r abave them. Then the dimension of &realization of A is at least =r
n0(A 1. =r
PROOF. The rank o ' f% Haiikel matrix of an infinite realization
, -
by Progosition
. .This implies "filling in" the blank spaces in the Hankel array of A . Regardless of how =r
..
R . E. Kzlman
, -
define:
kt >
At
ht (br)
every
i s l i n e a r l y dependent on t h e
hl'($)
> A"
every
block column of
E.(_A ) - -r
i s l i n e a r l y dependent on t h e columns t o t h e
l e f t of it. Every p a r t i a l sequence A sequence
- may be extended t o an i n f i n i t e r
n0 ( =r ) A
for a l l
@> Al(&),
V >
A " ( =r A)
i s satisfied.
.PROOF,
At.
A"
i s trivial. in
It suffices t o show, f o r a r b i t r a r y r, how t o s e l e c t AHl .. such a way t h a t t h e numbers A',, A", -and n0 remain constant.
'
-and exmine i n t u n a l l t h e
...,
h?th
block rows i n
R. E. Kalman
of Aptl using t h i s linear dependence ( t h a t is, w make the f i r s t e AH a l l zeros). This choice of the f i r s t row of
Afil
row of
no(A ) ), =r Eirentually
the f i r s t row of some block row w i l l become linearly dependent on those above it, except when row of
A'
AHl
.. Ar
or
...
rows
Ar+l
A'!
=r when AHl
i s f i l l e d in. Artl
rank
=
!iHl,l~
ra*
gr-l,
=l, r
rank H =r,2'
rank IE
rank^ r =2, --
ra*
gl, r+lnot
---------------
*Of course, uov linear depenaence i n the f i r s t step does imply t h a t the corresponding row of A*l. w i l l be a l l zeros.
R. E.Kalman
With t h e a i d of t h i s simple but subtle observation, t h e problem
Ar
n (&r).
=r
has dimension
.
A
A' = A ~ =r) (A
and
A" = A"(A )
=r
I f
r - At(&) + A?(A ) =r
>
i s unique.
=r
At, A"
and
It f o l l o ~ ? sh a t t h e minimal, p a r t i a l r e a l i z a t i o n i s unique i f t
2 -
h t =r) + A"(A ) ( ~ =r
(the
At(A ) . + An(A ) =r =r
I n v i e w a t h e theorem, w a r e j u s t i f i e d i n c a l l i n g t h e integer f e
. .
*A siailar r e s u l t was obtained shultaneously and independently by T. . Tether (stanford d5.s sertz%ion, 1969)
- 118 REMARK.
R. E Kalman
A , and t
AV
of the partial sequence. We warn, however, that this result does not generalize to all invariants of the minimal realization. For
instance, one cannot determine from A how many cyclic pieces a =r minimal realization of A will have: some minimal realizations =r
COROLLARY. Suppose n1 =r) is the number of independent (A columns of the Hankel array of A (defined analogously with =r
If %(A=r) > no(A ) then, using the Main Theorem, =r we get a contradiction to the fact that the rank of any Hankel matrix
PROOF.
of an infinite sequence is lower bound for the dimension of any realization (~roposition 8 4) ( . ). If
then extending A =r
to any l A + , we contradict the fact that rank IIAlyA,, at least Slxl is equal to
no(^ ). =r
In other words, the characteristic property of rank, that counting rank by row or column dependence y f e l f s iden6ical results, is preserved even for incomplete Hankel arrays. It is useful to check a simple case which illustrates some of --vvn ' the technicalities of the proof of the Main Lemma.
(9.k)
r
X
EXAMPLE.
The dimension of ( , 0 0 ,
1\t
..., 0, A ~ ) is precisely
= A" = r.
R. E.Kalman
In this concluding section, we wish to discuss the problem of observability in a rather general setting: we will not assume linearity, at least in the beginning. This is an ambitious program and leads to many more problems than results. Still, I think it is interesting to give some indication of the difficulties which are conceptual as well as mathematical. This discussion can also K A 1969a, ~
For the latter, the original papers are [KALMAN 1960a, 1963a] and
R. E. Kalman
The reader interested i n further d e t a i l s and a m~dernexposition i s referred especially t o t h e monograph of K A W [i.96913]. W s h a l l examine here only one aspect of t h i s theory (which e does not involve
s rr rj
of t h e "duality principle" between reachability and observability. ~ This principle was formally s t a t e d f o r t h e first time 5y lCALMAN [ 1 9 6 0 I, but t h e pertinent discussion i n t h i s paper i s limited t o t h e l i n e a r case and
i s somewhat ad-hoc.
now possible t o develop a completely general approach t o t h ? ltduality principle1'. W s h a l l do t h i s and, as a by-product, w s h a l l obtain e e
W s h a l l introduce a general notion of t h e "dual" system, and e use it t o replace the problem of observability by an equivalent problem of reachability. I n keeping with t h e point of view of t h e
and dualize
(rather than
z).
The c o n s t r u c t i b i l i t y
and
(K = a r b i t r a r y f i e l d ) and r e c a l l t h e d e f i n i t i o n of the s h i f t
operators
o
n and
on R z
and
but ignore, u n t i l l a t e r , t h e
KE
zl-
and I'.
R. E Kalman
we shall mean any map f which commutes with the shift that is,
&erators,
E R applied
t = 0 from
since cut
is Nerode-equivalent to
fc'.() (uo)l
=
o,
iff
fLo)l (u.()
Giving V E R
giving various values of f(cue ) ( ) (namely those corresponding 1 2 to the sequences @, Vr, zV r + Vr-1' V, z z V and
R. E. Kalman
chosen a t w i l l )
If
The problem i s f'urther complicated by t h e f a c t t h a t it may make a difference whether o r not w have a f r e e choice of e
V.
K A W ,
t h a t t h e problem i s posed on a purely s e t - t h e o r e t i c l e v e l and does not lend i t s e l f t o t h e introduction of more r e f i n e d s t r u c t u r a l assumptions. W s h a l l t h e r e f o r e reformulate t h e problem i n such e
I
t
V
0.
at.. 0
(no l o s s of
after
t =
i s given simply a s
For
t h e moment, t h i s
c l a s s w i l l be t h e c l a s s of a l l functions.
Thus
?
A
( a l l functions of
l ?
-t
K)
.
y
An element y
-f
I '
R. E. Kalman
a number i n t h e f i e l d K.
then
If
resulted from t h e s t a t e
[mIf,
and, by d e f i n i t i o n of
h
This suggests
hx E 2
i s an 03servable costate
i f f there i s a
j; P ;E
such t h a t w have i d e n t i c a l l y f o r a l l e
["If
is,
at
rule Y ;
t o t h e output sequence
r e s u l t i n g from x.
Note,
carefully, t h a t t h i s d e f i n i t i o n subsumes ( 5 ) a fixed choice of t h e c l a s s of f'unctions denoted by t h e circumflex, and ( i i ) a fixed input sequence,after t . .
=
(here
0).
For c e r t a i n purposes,
it
tions.
f.
with ,the point of view adopted e a r l i e r (see Section 4) i n an ad-hoc fashion. Also, t h e vague requirement t o "determine xtl used i n
R. E ,Kalman
(10.1) i s now replaced by a precise notion which can be manipulated (via the actual definition of the circumflex) t o express limitations on the algorithms t h a t w may apply t o the output sequence of the e system. The requirement "every costate i s observable" can be often replaced by a much simpler one. For instance, i f
X
i s a vector
space, it i s enough t o know t h a t !'every linear costate i s observable1' or even just t h a t "every element of some dual basis i s an observable costate"; i f
X
f:
R +T'.
First, w adopt (again with respect t o a fixed interpretation of the e circumflex) : (10.3)
DEFINITION.
f: G !
+T '
i s the map
Note t h a t
i s well-defined,
under dualization.
.A
c.
R. E.Kalman
: -Both
z -I
If
i s constant, so i s
f.
PROOF.
( z
( z-'03)
( f (m) )
(def. of (def. of (f
?),
up),
;(z-f(m))
= ;(f(z.m))
h A
i s constant),
=
=
h
?),
u) 6,
and s o we see t h a t
commutes with
as t h e input/output map
h
r,
I n other words, it i s necessary t o check t h a t t h e notion of time i s a l s o i n h e r i t e d under dualization. I n general, t h i s does not appear
h
I".
Here
R. E.Kalman
A
(10.5)
HYPOTHESIS.
Every f'unction
- 2 in
sa t i sf i e s the (dependent on
f i n i t e n e s s condition:
There i s an i n t e g e r
IF]
r)
implies
= ?(6).
rh
at y
Assuming (10.5))
it i s immediate t h a t
admits a concatenation
Let
C
b e a n a r b i t r a r y c o n s t a n t input/output
i t s dual.
- Then
each observable c o s t a t e of
(relative t o
A
satisf'yinq (10.5))
f,
and conversely.
for a l l
r.
Npw
i s l i n e a r (!);
i n f a c t , d i r e c t use of
the definition o f
So
?of
and
31
i:
=hey define t h e
s t a t e s of
which a r e observable
relative t o the
.
A
f is
completely reachable i s not q u i t e t h e same a s saying t h a t t h e canonical r e a l i z a t i o n of on t h e choice of f i s completely observable because t h e . l a % t e r depends
L .
I '
f.
Moreover, Theorem (10.7) does not give any i n d i c a t i o n how "big" and it may c e r t a i n l y happen t h a t t h e observability problem f o r r m h more d i f f i c u l t 'than t h e r e a c h a b i l i t y problem. be i l l u s t r a t e d later by some examples.
f
Xi: i s
is
These matters w i l l
N w we deduce t h e o r i g i n a l form of t h e d u a l i t y p r i n c i p l e from o Theorem (10.7). The e s s e n t i a l point i s t h a t (10.5) holds automati-
( A l l t h e underlying s e t s ssitb t h e
R. E Kalman
The following f a c t s a r e well known: (10.9) PROPOSITION. Then:
- * Let
denote d u a l i t y i n t h e sense of
K-vector spaces.
2 K-linear,
A
constant, f i n i t e Then:
dimensional.
Suppose f u r t h e r t h a t
h
means K-linear d u a l i t y .
(i)
i s K-linear and constant, t h a t i s , a K[Z-'1-homomorphism F ) and finite-dimensional. P are isomorphic with t h e i s observable.
The reachable s t a t e s of
hence every c o s t a t e of
Xf
The f a c t t h a t ' I
i s K-linear implies, by ( 1 0 . 3 ) ~
f
A
always implies t h a t of
f,
by Proposition (10.4).
is
& the
K[z]-linear
dual of t h e
K[z]-homomorphism
cannot be simplified.
F a r e isomorphic with
$,
t h e K-linear dual of
Xf.
This
9.
It s u f f i c e s t o prove
d ?
(10.n)
x o [hf(zi-x)]
3'
i = 0,1,
...
x,
and
1 ,
..., m}
i s isomorphic with
Then x = 0, by
f
xX~.
every
h(x) = 0.
f is
(A)
t h a t they generate . x :
so t h a t everything i s proved.
implies t h a t
i s a K[ z-'1 -homomorphism.
h
system a t
-k
due t o input
i s given by t h e assignment
k-th
H ,E. Kalman
(10.12) that
RFSICLRK.
hf
i s replaced by
-t (hence
I n terms of t h e p i c t o r i a l d e f i n i t i o n of a system, t h i s statement simply amounts t o I1reversing the d i r e c t i o n s of t h e arrowsv, which i s t h e "right1' way t o define d u a l i t y i n t h e most general mathematical context, namely i n category theory. W would expect e
y e t beczuse t h e c o r r e c t categories t o be considered i n t h e study of dynamical systems have not y e t been determined.
It i s l i k e l y t h a t
Consider t h e system C
defined by
with
i s precisely
. -.
If
it. Consequently, t h e x ' s a r e isomorphic with t h e Nerode equivalence c l a s s e s induced by fz. So C- cannot be .reduced. fz i s If
Relative t o
'IA
= -flmctions", everyccostate of
on a fixed f i n i t e subset of t h e
Thus:
either
does
not define a dy-nmical system o r not a l l c o s t a t e s a r e observable. N w l e t us replace t h e s e t o with t h e r a t i o n a l s . f o r determining x: [O, 1) by i t s i n t e r s e c t i o n
It i s c l e a r t h z t t h e r e i s now a f i n i t e algorithm
w simply apply the r e s u l t s of p a r t i a l r e a l i z a e (we take
..., E2(x)0
= Z =2
and t h e
as a ratio
Z$[2]--whlch x
However,
strict sense since there is no way of knowing when the algorithm has stopped. In other words, given an arbitrary costate x -there exists .. A A no fixed rule y; such that the application of y~ to yx gives ;
A
xx ()
for all x.
A<
results of the partial-realization algorithm will give an approximation to the value of : ( X ) which always converges in a finite
(but a priori unknown) number of steps as more values of the output sequence are observed. In short, the costate-determination algorithm - f s certain pseudo-random elements in it and therefore cannot be fi described through the machinery of deterministic dynamical systems.
R. E. Kalman
1 . HISTORICAL COMMENTS 1
It is not an exaggeration to say that the entire theory of linear, constant (and here, discrete-time) dynamical systems can be viewed as
KEzI)
to study a constant
square matrix (see ( . 3 ) has been "standard" since the 19208s under 41)
att~ibutesto KRYLOV [1931] the idea of computing the characteristic polynomial of a square matrix A by choosing a random vector b and commting successively b, Ab, A , %
...
h he
method
However, the
41) 21) merger of ( . 3 with ( . 5 , which is the essential idea in the algebraic theory of linear systems, was done explicitly first in KAINAN [1965b].
81 comments in XALMAN [1960c, pp. 481, 483, 4 4 and in KAIXAN, HO, and
IWENDRA
development of modern control theory cannot be separated from the development of the concept of controllability; moreover, the technological problems of the 1950's and even earlier had a major influence on the genesis of mathematical ideas (just as the latter have led to many new technological applications of control in the 19601s).
R. E. Kalman
The writer developed the mathematical definition of controllability with applications to control theory, during the first part of 1959. (~gpublishedcourse notes at Johns Hopkins University, 1958/59. ) These
Formal presenta-
1959, see tions of the results were made in Mexico Citg (~e~tember,
KALMlLN [1960b]), University of California at Berkeley (~pril,1969, see
K . [1960d]),
and Moskva (~une,1960, see KAZlMAN [1960c3), and in As
scientific lectures on many other concurrent occasions in the U.S. 'far as the writer is aware, a conscious and explicit definition of controllability which combines a control-theoretic wording with a
precise mathematical criterion was first given in the above references. There are of course many instances of similar ideas arising in related contexts. Perhaps the comments below can be used as the starting point of a more detailed examination of the situation in a seminar in the history of ideas.
The following is the chain of the writer's own ideas culminating in the publications mentioned above:
---------------
*It is sometimes claimed in the mathematical literature of optimal control theory that this cannot be done with a linear system. This is false; the correct statement is "cannot be done with a linear controller producing control functions which are continuous (and not merely piecewise continuousI ) in time." Such a restriction is completely'irrelevant from the technological. point of view. As a matter of fact, computer-controlled systems have been proposed and built for many years on the basis of linear, time-optimal control.
R. E Kalman .
discrete-time time-optimal control of systems with bounded control and linear continuous-time dynamics
[KAIM.N,
1957]
[1954]descend
directly from earlier (and very well known) engineering research on time-optimal control. (The main references in
KIIl;MAN
[19541 are:
McDONALD [ 1 5 1 HOPKIN 1 9 1 , BOGNER and KAZDA [ 1 5 1, as well as a 90, 151 94 research report included in
~~ [1955].)
1944,
although the latter's work was unfortunately not widely known before 1957. During the same time, there was much interest in the same problems in other countries; see, for instance, FELOBAUM [1953] and UTTLEY and HAMMOND
( 0 1, '' 4)
...:. [it was] rigorously proved that the intuition which led to
the formulation of the [engineering] theory [quoted above] was indeed correct. " TSIEN1s survey [ 19541 contains a lengthy account of this state
R. E.Kalman
of affairs and was ready by many.-. We emphasize': none of this extensive literature contains even a hint of the algebraic considerations related to controllability.
(2-3) The critical insight gained and recorded in KA.LlIAN [1957] is
the following: the solution of the discrete-time time-optimal control problem is equivalent to expressing the state as a linear combination of a certain vector sequence (related to control and dynamics) with coefficients bounded by 1 in absolute value, the coefficients being the values of the optimal control sequence. The linear independence of the first n vectors of the sequence guarantees that every point in a neighborhood of zero can be moved to the origin in at most n steps (hence the terminology of "complete controllabilityw); and the condition for this is identical with ( . 7 (stated in KALMAN [1957] 21) and KAIMAN and BERTRAM [1958] only for the case det F
0 and m = 1. )
R. E.Kalman
of a formal machinery f o r t h e d e f i n i t i o n of c o n t r o l l a b i l i t y during t h e
of minimal-time optimal control, t h e researches begun by KAI;MAN [1954, 19571 and IWA14N and BERTRAM [ 19581 eventually evolved intoi what has come t o be c a l l e d t h e mathematical theory of c o n t r o l l a b i l i t y (of l i n e a r systems)
.
1 P0NTRYAGIN.and his: school i n t h e USSR developed t h e i r
t r a n s l a t i o n of 1956.)
l e s s of i t s particular mathemtical. style, must contain ingredients related t o controllability. So it i s i n t e r e s t i n g t o examine how
e x p l i c i t l y the c o n t r o l l a b i l i t y condition appears i n t h e work of PONTRYAGIN and r e l a t e d research. GAMKRELIDZE [1957, $2; 1958 $1, $21 c a l l s t h e time optimal control problem associated with t h e system
..., nn%)
jl
t h e s p e c i a l case of (2.8) f o r
m = 1)
He then proves :
&
t h e "degenerate" case t h e problem e i t h e r reduces t o a simpler one o r t h e motion cannot be influenced by t h e control function u( )
All
>
1 ,
GAMIWZLIDZE [ 1958,
a s t h e condition
(11.4)
..., A"-%.
1)
f o r every column bi E B,
but he does not show t h a t t h i s generalized condition of "nondegeneracy" f o r (11.3) i n h e r i t s t h e i n t e r e s t i n g characterization p o v e d f o r "nondegeneracy" i n t h e case of (11.1). I n fact, condition (11.4) i s much too strong
t o prove t h i s ; t h e correct condition i s (2.8), t h a t i s , complete controllability. I n other wards, i n GP!dKRELIDZE1s work (11.4) plays
t h e r o l e of a t e c h n i c a l condition f o r eliminating "degeneracy" (actually, lack of uniqueness) from a p a r t i c u l a r optimal control problem and i s n o t , e x p l i c i t l y r e l a t e d t o the more basic notion of complete c o n t r o l l a b i l i t y . Neither
CtAMI(REL1Dm
( l l . 4 ) a s a property df t h e dynamical system (11.3), but employ (11.4) only i n r e l a t i o n t o t h e p a r t i c u l a r problem of time-optimal control. See
of the time-optimal controls and calls such systems llnomal't. The assmtion of some kind of "nondegeneracy" conditio~ yas unavoidable in the early phases of research on the time-
3
5
IC
4 1 by a 1
in the special case when A has real eigenvalues [~kreorem 121. ROSE
uses determinants closely related to the now familiar lemmas in cantrol-
kbility theory but he, too, fails to formulate controllability as a concept independent of the time-optimal control problem.
b
m
to a kind of classification of controllability properties o ' nonconstant r In fact, the standard notion of a normal family of extremals
=systems.
(14, 1.)
Normality is
used in the calculus of variations mainly as a 'hondege;eracl' condition. is importan: to note that the "nondegeneracy" conditions lemployed in optima.3 cori-cru~ and the calculus
0 1
--------------
*The use of the word "normal1' I a S A L U [ 1 6 1 for ( 1 . ) only by 90 - 1 4 is accidentally coincident with the earlier use of the "normal" in the c u l u s of variations
-.
a 4
R.E.Kalman
With suitable formulation, however, the basic .:esults of time-optimal control theory continue to hold without the assumption of complete controllability. The same is not true, however, of the four kinds of theorems mentioned in the Intorduction, and therefore these results are more relevant to the story of controllability than the time-optimal control discussed above. There is a considerable body of literature relevant to controllability theory which is quite independent of control theory. For instance, the
treatment of a reachability condition in partial differential equations goes back at least to CHOW [1940]but perhaps it is fairer to at+ribute it to Caratheodoryts well-known approach to entropy via the nonintegrability condition. The current status of these ideas as related to controllability is reviewed by WEISS [1969, Section
9 . An independent 1
and very explicit study of reachability is due to ROXIN [19601;unfortunately, his examples were purely geometric and therefore the paper did.cot help in clarifying the celebrated condition ( . ) 28
The
Wronskian determinant of the classical theory of ordinary differential equations with variable coefficients also has intersections with controllability theory, as pointed out recently with considerable success bpSILVERMAN 1 9 6 . Vany problems in control theory were misunderstood 161 or even incorrectly solved before the advent of controllability theory.
'4
,PIms-
R. E. Kalman
his
clah applies of course also to the nonlinear controllability results obtained via the Pfaffian method.)
(2) The historical development of controllability was heavily
the discussion of the relationship between transfer functions and differential equations and in questiohs relating to the .four theorems of the Introduction.
(4)
ekension to more elaborate algebraic structures. For a survey of the historical background of observability, which would take us too far afield here, the reader should consult
R . E. Kalman
[1965]
A common framework for automata theory and control theory, SIAM J. Contr., 3206-222.
Multiplication induced in the state space of an act, Math. System Theorn 13305-314.
The minimal realization of a nbnanticipative impulse response matrix, SII~M ~ppl.~ath.,J. 15:754-764.
E. G. GILBERT
[ 19631
B. L. HO and R. E. KAL!! Effective construction of linear state-variable models from input/output functions, Regelungstechnik, 1 :545-548. 4
[ 19691
.
The realization of linear, constant input/output maps, . I Complete realizations, SIAM J. Contr., to appear. .
16 1 95
[1960a]
-
A new approach to linear filtering and prediction problems, J Basic Engr. (~rans.ASME), 82~:35-45. .
[1960bl
Contributions to the theory of optimal control, Bol. Soc. Mat. Mexicana, 5:102-119.
[1960c]
143 -
On the general theory of control systems, Proc 1st I ' C Congress, Moscow; Butterworths, London. FA
'
192 161
[1963a]
Algebraic structure of linear dynamical systems. I The . Module of C, Proc. Nat. Acad. Sci. (USA), 5 : 4 1503-1508.
Algebraic aspects of the theory of dynamical systems, in ~ifferentdal-~~uations Dyn&cal and Systems, J K. Hale . and J. P LaSalle ( d . , pp. 133-146, Academic Press. . es) [1969al On multilinear machines, J. Comp. and System Sci., to appear. Dynamic Prediction and Filtering Theory, Springer, to appear. [1969c]
[ 1970a]
On partial realizations of a linear input/output map, Guillemin Anniversary Volume, Holt, Winston and Rinehart. Observability in mltilinear system, to appear.
. and R. S. BUCY R. E W ?
New results in linear prediction and filtering theory, J Basic Engr. (~rans. . ASME, Ser. D, ) 83~:95-100.
R E FdWlAN, P L FB . . . . &
and M. A. ARBIB
[1 6 1 95
Algebra, Addison-Mesley
S MAC LANE .
[ 19631
Homology, Springer
I. A. ,
wm
Controllability of nonlinear processes, SIAM J . control, 3:78-90.
[1 6 1 95
E F. MOORE .
[ 19561
Gedanken-experiments on sequential machines, in Automsta Studies, C. E Shannon and J. McCarthy ( d . , pp. 129-153, . es) Princeton University Press.
[1 6 1 96
Representation and realization of time-variable linear systems, Doctoral dissertation, Columbia University.
L. M. S I L V E W and H. E MEADONS .
[1 6 1 99
H WEBER .
19 1 88
Lehrbuch der Algebra, Vol. 1 2nd Edi-iion, reprinted by , Chelsea, Nzw York.
f 19691
[1 6 1 97
On pole assignment in multi-input controllable linear systems, IEEE Trans. Auto. Contr , AC-12:600-665.
A. M. YAGMM
[ 19621
-u nF
The synthesis of l i n e a r dynamical systems from prescribed weighting patterns, SIAM J. Appl. Math., 14:527-549.
M. A. ARBIB
[1 6 1 95
A common framework for automata theory and control theory, ?JAMIAM.Contr , 3 :206-222. J .
.-
a 11:ll8-127.
Differential equations with a discontinuous forcing term, doctoral dissertation, Princeton University. C CARATHEODORY .
[ 19331
# D
Uber die Einteilung der Variationsprobleme von Lagrange nach Klassen, Comm. Mat. Helv., 5:l-19.
W. L. CHOW
143 90
.*
Uber Systeme von linearen partiellen Differentialgleicl erster Ordnung, Math. Annalen, :98-105.
[ 19571 [ 19581
On the theory of optimal processes in linear systems 16 -1 1: (in Russian), Dokl. Akad. Nauk SSSR, - g 1 . The theory of optimal processes in linear systems (in Russian), Izvestia Akad. Nauk SSSR, 2:449-474.
[ 19591
Discussion of a paper by Bergen and Ragazzini, Trans. Analysis and design principles of second and higherorder saturating servomechanisms, Trans. AIEE, - 11:29L-310. 7 4 Optimal nonlinear control of saturating systems by intermittent control, IRE WESCON Convention Record, 1 IV:133-135
Contributions to the theory of optimal control, Bol. Soc. Mat. Mexicans, -: 0 - 1 . 51219 On the general theory of control systems, Proc. 1st IFAC Congress, Moscow; Butterworths, London. Lecture notes on control system theory (by M. Athans and G ~endaris), Univ. of Calif. at Berkeley. . Blathematdcal description of linear dynamical systems, . 1 : SIAM J 'Contr , - 152-192.
Algebraic structure of linear dynamical systems. I The . Madule of ,C, Proc. Nat. Acad. Sci. (USA), 54:1503-1508.
R E KPUqATJand J E BERTRAM . . .
[ 1958'1
General synthesis procedure for computer control of single and ~ulti-loop linear systems, Trans, AIEE, 12 3-7 I:6?2-609.
R. E W Z A . N , Y C . .
. HO and K. KCRENDRA
i963I
On the numerical solution of the equation by which the frequency of small oscillations is determined in technical problems (in Russian), Izv. Akad. Nauk SSSR Ser. Fix.-Mat., k:491-539.
The time-optimal control problem, Contr. Nonlinear Oscillations, Vol. 5, Princeton Univ. Press.
Nonlinear techniques for improving servo performance, Proc Nat Electronics Conf (USA), 6:400-421.
. .
11 51 9 1
R OLDENBURGER .
[ 19571 [1 6 1 96
- 527-546. 79:
[1 5 98
N J ROSE . . Theoretical aspects of limit control, Report Stevens Institute of Tech., Hoboken, N.J.
459,
E. ROXIN
190 161
.
Reachable zones in autonomous differential systems, . Bol. Soc. Mat. Mexicana, 5:125-135.
[1 6 1 99
119541
The stabilization of on-off controlled servomechanisms, in Autsmatic and Manuzl Control, Academic Press.