Documente Academic
Documente Profesional
Documente Cultură
ABB ALSTOM Power Technology Centre , Cambridge Road, Leicester, LE8 6LH,
U.K.
b
To cite this article: G. P. Liu , S A. Billings & V Kadirkamanathan (2000) Nonlinear system identification
using wavelet networks, International Journal of Systems Science, 31:12, 1531-1541, DOI:
10.1080/00207720050217304
To link to this article: http://dx.doi.org/10.1080/00207720050217304
Internationa l Journal of Systems Science, 2000, volume 31, number 12, pages 1531 1541
1.
Introduction
Nonlinear system identi cation consists of model structure selection and parameter estimation. The rst problem is concerned with selecting a class of mathematical
operator as a model. The second is concerned with an
estimation algorithm based on input output data from
the process, a class of models to be identi ed and a
suitable identi cation criterion. A number of techniques
have been developed in recent years for model selection
and parameter estimation of nonlinear systems.
Forward and backward regression algorithms were analysed in Leontaritis and Billings (1987). Stepwise regression was used in Billings and Voon (1986), and a class of
orthogonal estimators was discussed in Korenberg et al.
(1988). Algorithms which save memory and allow fast
computations have also been proposed in Chen and
Wigger (1995) . Methods to determine a priori structural
identi ability of a model have also been studied (Ljung
and Glad 1994). A survey of existing techniques of nonlinear system identi cation prior to the 1980s is given in
Billings (1980), a survey of the structure detection of
input output nonlinear systems is given in Haber and
Unbehauen (1990), and a recent survey of nonlinear
International Journal of Systems Science ISSN 0020 7721 print/ISSN 1464 5319 online # 2000 Taylor & Francis Ltd
http://www.tandf.co.uk/journals
1532
G. P. Liu et al.
new basis for representing functions. Wavelet-based networks (or simply wavelet networks) are inspired by both
the feedforward neural networks and wavelet decompositions and have been introduced for the identi cation of
nonlinear static systems (Zhang and Benveniste 1992).
But little attention has been paid to identi cation of
nonlinear dynamical systems using wavelet networks
(Coca and Billings 1996, Liu et al. 1998, 1999a).
This paper presents a wavelet network-based identi cation scheme for nonlinear dynamical systems. Two
kinds of wavelet networks are studied: xed and variable
wavelet networks. The former is used for the case where
the estimation accuracy is assumed to be achieved by a
known resolution scale. But, in practice, this assumption
is not realistic because the nonlinear function to be identi ed is unknown and the system operating point may
change with time. Thus, variable wavelet networks are
introduced to deal with this problem. The basic principle
of the variable wavelet network is that the number of
wavelets in the network can either be increased or
decreased over time according to a design strategy in
an attempt to avoid over tting or under tting. In
order to model unknown nonlinearities, the variable
wavelet network starts with a lower resolution scale,
and then increases or reduces this according to the
novelty of the observation. Because the novelty of the
observation is tested, it is ideally suited for on-line identi cation problems. The objective behind the development is to gradually approach the appropriate network
complexity that is su cient to provide an approximation to the system nonlinearities and which is consistent
with the observation s received.
The parameters of the wavelet network are adjusted
by adaptation laws developed using a Lyapunov synthesis approach. The identi cation algorithm is performed over the network parameters by taking
advantage of the decomposition and reconstruction
algorithms of a multiresolution decomposition when
the resolution scale changes in the variable wavelet network. Combining the wavelet network and Lyapunov
synthesis techniques, the identi cation algorithm developed for continuous dynamical nonlinear systems guarantees the stability of the whole identi cation scheme
and the convergence of both the parameters and estimation errors. A simulated example shows the operation of
the proposed identi cation scheme.
2.
Wavelet networks
Wavelets are a class of functions which have some interesting and special properties. Some basic concepts about
orthonormal wavelet bases will be introduced initially.
Then the wavelet series representation of one-dimensional and multi-dimensional functions will be considered. Finally, wavelet networks are introduced.
k f k2 : h f ; f i1=2
f x1 ; x2 ; . . . ; xn
1
k2Z g
j 6 i
which satisfy
Wj \ Wi f1g;
of L2 R , where denotes the direct sum. These subspaces have the following properties:
(1) . . . V1 V0 V1 . . .;
!
[
(2) closeL2
Vj L2 R ;
j2Z
\
(3)
Vj f1g;
j2Z
1533
f x
(5) f x 2 Vj () f 2x 2 Vj1 ; j 2 Z .
Hence, the sequence of subspaces Vj is nested, as
described by property (1). Property (2) shows that
every function f in L2 R can be approximated as
closely as desirable by its projections, denoted by Pj f
in Vj . But, by decreasing j, the projections Pj f could
have arbitrarily small energy, as guaranteed by property
(3). The most important intrinsic property of these
spaces is that more and more variations of Pj f are
removed as j ! 1. In fact, these variations are
peeled o, level by level in decreasing order of the rate
of variations and stored in the complementary subspaces
Wj as in property (4). This process can be made very
e cient by an application of property (5).
If and are compactly supported, they give a local
description, at dierent scales j, of the considered function. The wavelet series representation of the one-dimensional function f x is given by
X
XX
f x
aj0 k j0 k x
bjk jk x
7
jj0 k2Z
k2Z
where
aj0 k h f x; j0 k xi
bjk h f x; jk xi
The wavelet series representation can easily be generalized to any dimension n. For the n-dimensional case
x x1 ; x2 ; . . . ; xn , we introduce the scaling function
U x x1 x2 . . . xn
10
and
the
2n 1
mother
wavelets
W i x,
n
i 1; 2; . . . ; 2 1, are obtained by substituting some
xj s by xj in equation (10). Then the following
family is an orthonormal basis in L2 R n :
2
jk x; . . . ; W
2n 1
xg
jk
for j j0 , j 2 Z , k k1 ; k2 ; . . . ; kn 2 Z
U
11
, and
jn=2
U 2 j x1 k1 ; 2j x 2 k2 ; . . . ; 2j xn kn
jk x 2
12
i
jk x
2jn=2 W
j
j
j
i 2 x1 k1 ; 2 x2 k2 ; . . . ; 2 xn kn
13
n
1
X X 2X
jj0 k2Z
bi
jk W
i1
i
jk x
14
aj0 k h f x; U
j0 k xi
15
bi
jk h f x; W
i
jk xi
16
a j0 k U
j0 k x
k2Aj0
N X 2X
1
X
bi
jk W
jj0 k2B j i1
i
jk x
17
1
jk x; W
j0 k x
N!1;Aj0 ;B j0 ;...;B N !Z
jk x 2j=2 2j x k;
j0 k x; W
k2Z
a j0 k U
j0 k x 2j0 =2 2j0 x k;
fU
k f x f^xk2 0
18
19
3.
Consider the multi-input multi-state (MIMS) continuous dynamical system described by,
r1
x_ fx; u;
x0 x0 ;
d1
20
where u 2 <
is the input vector, x 2 <
is the state
vector and f f1 ; f2 ; . . . ; fd T 2 Ld2 R n is an
unknown nonlinear function vector, where n r d. It
is also assumed that u; x are in compact sets.
Following the structure (17) of wavelet networks, at
the resolution 2N1 , the estimation ^fx; u of the func-
1534
G. P. Liu et al.
^
ex x x
~
Aj0 k A*j0 k Aj0 k
A j0 k U
k2Aj0
j0 k x; u
n
N X 2X
1
X
Bi
jk W
jj0 k2Bj i1
d
where Aj0 k ; Bi
are the wavelet coe cient vectors,
jk 2 R
the scaling function U j0 k x; u and the wavelet functions
i
W jk x; u are similarly de ned as U x and W j x, respectively, by replacing x with x; u.
Here, it is assumed that the number N and the vector
sets Aj0 ; B j are given. So, the wavelet network (21) for
the estimation of the nonlinear function fx; u is called
a xed wavelet network. Based on the estimation ^fx; u
by the xed wavelet network, the nonlinear function
fx; u can be expressed by
X
fx; u
A*j0 k U j0 k x; u
k2Aj0
n
N X 2X
1
X
i*
Bjk W
jj0 k2Bj i1
i
jk x; u
"N
22
i
jk x; ui
25
^x0 x0
26
~ i W
B
jk
i
jk x; u
jj0 k2B j i1
"N
30
N X 2X
1
X
29
k2Aj0
28
~ i Bi* Bi
B
jk
jk
jk
i
jk x; u
21
27
N
1
X
2 X 2X
~ i T B
~ i
B
jk
jk
j k2B i1
jj
0
31
where P fpij g 2 R dd is chosen to be a positive de nite matrix so that the matrix Q PA AT P is also a
positive de nite matrix, and j0 and j are positive constants which will appear in the parameter adaptation
laws, also referred to as the adaptation rates.
The rst derivative of the Lyapunov function V with
respect to time t is
2 X ~_ T ~
V_ e_ Tx Pex eTx P_ex
A A
j0 k2A j0 k j0 k
j0
N
1
X
2 X 2X
T ~ i
~_ i
B
jk Bjk
j
jj
k2B i1
0
32
N
1
X
2 X 2X
T ~ i
T ~ i
~_ i
B
jk Bjk j e x PBjk W
j k2B i1
jj
0
2eTx P"N
i
jk x; u
33
*
Because A*j0 k and Bi
are constant vectors,
jk
i
_A
_
~ j k A_ j k and B
~ jk B_ i . If there is no modelling
jk
0
0
error, i.e., "N 0, Aj0 k and Bi
jk can simply be estimated
by the following adaptation laws:
A_ j0 k j0 eTx PU
i
B_ jk j eTx PW
j0 k x; u
i
jk x; u
34
35
In the presence of a modelling error "N , several algorithms can be applied to ensure the stability of the whole
1535
and
eTx Pz 0g
36
F ; M fz : kzk M
and
eTx Pz < 0g
37
or
where z 2 R d , is a function and M is a positive constant. Here, in order to avoid parameter drift in the
presence of modelling error, the application of the projection algorithm (Goodwin and Mayne, 1987) gives the
following adaptive laws for the parameter estimates Aj0 k
i
and Bjk :
8
j0 eTx PU j0 k
>
>
>
>
>
< if Aj0 k 2 F U j0 k ; Mj0 k
A_ j0 k
38
>
T
2 T
>
>
e
PU
k
M
e
PA
U
A
j x
j0
j0
j0 k x
j0 k j0 k j0 k
>
>
: 0
if Aj0 k 2 F U j0 k ; Mj0 k
B_ i
jk
8 T
i
>
j ex PW jk
>
>
>
>
>
>
if Bi
<
jk 2 F W
i
i
jk ; Mjk
>
>
i
i
i
>
eT PW jk j Mjk 2 eTx PBjk W
>
>
> j x
>
:
i
i
i
if Bjk 2 F W jk ; Mjk
i i
jk Bjk
39
Aj0 k 0 2 F U
i
j0 k ; M j0 k
[ F U
j0 k ; M j0 k
i
i
i
i
jk ; M jk [ F W jk ; Mjk ;
i
Aj0 k and Bjk are con ned to the
j0 k ; Mj0 k [ F U j0 k ; Mj0 k
Bjk 0 2 F W
then the vectors
and
F U
F W
i
i
jk ; M jk
[ F W
sets
i
i
jk ; Mjk ;
d X
d
X
eTx Qe x 2
d X
d
X
i1 j1
i1 j1
40
!2
<
d
d
X
X
jpij j
H ex :
qi jexi j
:
qi
i1
j1
d
d
X
X
i1
j1
!2 9
=
2
pij
qi ;
41
4.
1536
G. P. Liu et al.
^fx; u ^f N x; u
_
1
X 2X
Bi
Nk W
k2B N i1
i
Nk x; u
n
1
X 2X
k2BN 1 i1
BN1k W
i
x; u
N1k
43
for
jexi j ei ;
i 1; 2; . . . ; n; t 2 R
44
iU t ei ;
for
i 1; 2; . . . ; n 45
42
iU t e U t iU 0
46
iL t e L t iL 0
47
"U t
48
!
d
d
d
X
X
jp1j j X
jp2j j
jpdj j
49
;
; . . . ;
q1
q2
qn
j1
j1
j1
It can also be deduced from the set H "N t that the
upper bound "U t and the lower bound "L t are given
by
(
d
X
jpij j
"L t min
i1;2;...;d
qi
j1
0
d
X
k1
d
X
j1
pkj
!2
9
11=2 11
>
1 A C L =
A i t
>
qi qk
;
50
1537
i1;2;...;d
(
d
X
jpij j
j1
k2Aj1
qi
9
11=2 11
!2
>
=
d
d
X
X
1 A C
U
@
pkj
A
i
i
>
qi qk
;
k1 j1
0
51
Aj1k U
j1k x; u
k2A j2
53
k2Z
where ck and dki are known as the two scale reconstruction sequences. Secondly, any scaling function U 2x in
V1 can alternatively be written using the scaling function
U x in V0 and wavelet function W x in W0 as
X
U 2x l
al2k U x k
k2Z
n
2X
1
bi
l2k W
i1
i x
54
k2B j2 i1
k2Aj1
where
k2Bj1 i1
Bi
W
j2k
i
x; u
j2k
56
ANk U
k2AN
Nk x; u
n
1
X 2X
Bi
Nk W
k2BN i1
i
Nk x; u
Bi
Nk
57
!
n
2X
1
X
i
i
ANk 0
cl2k AN1l
dl2k BN1l
58
i1
l2A N
BNk 0 0
59
j2k x; u
1
X 2X
k2Z
p
X i
W i x 2
dk U 2x k
Aj2k U
Bj1k W
i
x; u
j1k
55
1
X 2X
k2BN2 i1
Bi
W
N2k
i
x; u
N2k
60
BN2k 0
l2BN2
bl2k ANl
62
1538
G. P. Liu et al.
Simulation
x_ u xu xe0:5x
u2
63
Figure 1.
64
1 x; u
B4 xu
65
2 x; u
xB4 u
66
3 x; u
xu
67
68
^ at the resolution 20 .
The state error x(t) - x(t)
Figure 2.
1539
Figure 3.
^ at the resolution 21 .
The state error x(t) - x(t)
1540
G. P. Liu et al.
Figure 4.
6.
Conclusions
1541