Documente Academic
Documente Profesional
Documente Cultură
PREDICTIVE CONTROL
FOR NONLINEAR SYSTEMS
University of Strathclyde
Glasgow Scotland, U.K
Introduction
Model Predictive Control (MPC) is one of the
most popular advanced control techniques
The MPC algorithms are well established for
linear systems
Recent developments extended this methodology
to the Non-linear systems control
Techniques developed at the University of
Strathclyde are presented
Linear Quadratic Gaussian Predictive
Control
LQGPC
Preview Control
The algorithm was first presented by Tomizuka M. and D.E. Whitney. The algorithm
uses the LQG approach to optimisation and a stochastic model for the reference signal
beyond the preview horizon.
Performance index:
R
|
J ( t ) = S LMb y( t + j + 1 ) r( t + j + 1 )g b y( t + j + 1 ) r( t + j + 1 )g
M
T
|T j = 0 N
+ u( t + j )T u( t + j ) }
Reference generator: RN ( t + 1 ) = N RN ( t ) + N ( t + N )
Preview Control
LM x( t + 1 ) OP = ( t + 1 ) = L A OP ( t ) + LM BOP u( t ) + LMG OP LM w( t ) OP
MN xR ,N ( t + 1 )PQ M
MN N PQ MN PQ MN N PQ MN ( t + N )PQ
A B G
C
LM y( t ) OP = LMC OP LM x( t ) OP + LM OPv( t )
MN RN ( t )PQ MN PQ MN xR ,N ( t )PQ MN PQ
b
J t = Y ( t ) R( t ) gT e bY ( t ) R( t )g + bU ( t )gT u bU ( t )g
x ( t +1 ) = Ax ( t ) + Bu( t ) + Gw ( t )
Starting with the state-space model:
y( t ) = Dx ( t ) + v ( t )
The model:
xt +1 = f ( xt ) + g ( xt )ut + Gt
yt = h( xt )
Linear State Dependent form of the model:
xt +1 = A( xt ) xt + B( xt )ut + Gt
yt = C ( xt ) xt
Assumption on the parameterisation of the
model:
( A( xt ), B ( xt ) ) is controllable
x ,u
System representation
Simplified notation:
At = A( xt ), Bt = B( xt ), Ct = C ( xt )
ut , ut +1 ,..., ut + N 1 xt +1 , xt + 2 ,..., xt + N
u (t ) = K (t 1) x(t )
Use current control prediction u(t) and the model re-calculated at time
instant t (with the state x(t)) to obtain future state prediction x(t+1).
The NLQG algorithm
The state prediction x(t+1) together with the state feedback gain K(t-1)
from previous iteration of the algorithm is used for a calculation of the
future control prediction u(t+1).
u (t ) = K (t 1) x(t )
Use P(k + 1) to calculate the feedback control gain and calculate the
current control.
The NLQG algorithm
Use P(k + 1) to calculate the feedback control gain and calculate the
current control.
u (t ) = K (t ) x(t )
Calculated current control is used for the plant input signal manipulation.
The NLQG cost function
infinite
J t = E lim
h
T 2
1
T h
Jt (
finite
+ Jt
)
The NLQG cost function
t + n 1 (k )Qc ( k ) x(k ) + uT ( k ) Rc u ( k )
T
x
J t finite =
k =t + u ( k ) M x ( k ) + x ( k ) M u ( k )
T T T
c c
And is minimised by Difference Riccati Equation with border condition
given by the solution of ARE.
Remarks
Plant model
1.7
( )
atan x p ,2 (t )
0 0.01
x p (t + 1) = x p ,2 (t ) x p (t ) + u (t ) + p (t )
0.3 0
0 1
yh (t ) = y (t ) = [1 0] x p (t )
Reference model
xr (t + 1) = [1] xr (t ) + [0] r (t )
rh (t ) = [1] xr (t )
Results
output
1.6
S DRE
P ropos ed algorithm
1.4
Cons tant gain feedbac k
1.2
0.8
0.6
0.4
0.2
-0.2
0 10 20 30 40 50 60 70
Results
c ontrol
2
S DRE
P ropos ed algorithm
1.5
Cons tant gain feedbac k
0.5
-0.5
-1
-1.5
-2
-2.5
0 10 20 30 40 50 60 70
The NLQGPC control law derivation
Re-written state equation:
xt +1 = At xt + tU t , N + Gt
where
t = [ Bt , 01 , 02 ,..., 0 N ]
xt +1 = At xt + tU t , N + Gt
Yt +1, N = t , N At xt + St , N U t , N + Gt , N t , N 1
Introduce notation:
Qk = k T E k , M k = k T E Sk , N , Rk = S k , N T E S k , N + U
Final form:
1 t +Th Q M k k
J t = E lim k T
Uk,N T k
M T + J 0
Th 2Th k =t k Rk U k , N
NLQGPC control law derivation - Algorithm 1
( ) ( )
1
Ut,N = Tt Pt , t + Rt Tt Pt , t + M tT t
( )( ) (M )
1
Pt , = Qt + Tt Pt , t M t + Tt Pt , t Rt + Tt Pt , t t
T
+ t T Pt , t
boundary condition Pt + N = Pt + N ,
iterated backwards for k = t + N 1, t + N 1,..., t + 1
( )
t1+1 = t1 0.3 sin t1 + t2 + gt1
0.3 ( ) + + g
3
t2+1 = t2 1
t t t
2
yt = t
1 ( )
0.3 sin 1
t
1
t +1 = t1 + 0 + g 0
t
t 1 t 0 g
( )
2
0 1 0.3 t2
yt = [1 0] t + [ 0]t ,
Example
3.5
2.5
1.5
output
4
2
Alg. 2, g=0.01: J=23.5602
Alg. 1, g=0.01: J=23.9984
1 SDRE, g=0.01: J=26.9404
0
0 5 10 15 20 25
output
4
2
Alg. 2, g=0.1: J=22.4603
1 Alg. 1, g=0.1: J=22.852
SDRE, g=0.1: J=25.7435
0
0 5 10 15 20 25
Advantages & Disadvantages of NLQGPC
Advantages:
1. Controls based on solutions to the NLQGPC
have been shown to offer high performance.
2. Less computational burden than other non-linear
predictive control techniques.
Disadvantages:
1. Since NLQGPC utilizes the Riccati equation, it
is an unconstrained predictive control technique.
2. Like SDRE, NLQGPC doesnt guarantee closed-
loop global stability.
Dealing with constraints in NLQGPC
The input constraints can be approximated by means of
smooth limiting functions, and then included into the
dynamics of the plant in a state-dependent state-space form.
0 u u
Improving stability via Satisficing
The non-linear
dynamical
model of the F-8
fighter aircraft:
0.2
0.1
-0.1
-0.2
0 2 4 6 8 10 12 14
Time
Elevator Deflection
0.04
0.02
0
u (rad)
-0.02
u 0.05236 rad
-0.04
-0.06
-0.08
0 2 4 6 8 10 12 14
Time
NLQGPC
Low Guarantee of
Computational Robustness &
Burden Asymptotic
Stability
Non-Linear Generalized Minimum
Variance Control
Contents
Introduction
Simulation example
Introduction
The GMV control law has similar characteristics to LQG design in some cases and
is much simpler to implement
However, when the control weighting tends to zero the control law reverts to the
initial algorithm of strm, which is unstable for non-minimum phase processes.
Introduction
Aim: introduce a GMV controller for nonlinear, multivariable,
possibly time-varying processes
+ +
Nonlinear
plant d
Reference Controller
r + e u m +
Wr C0 W
- +
y
Plant subsystems Wd
d
u m y
W1k W0k z k +
Control Output
Nonlinear Linear Delay
J NGMV = E [ 02 (t )]
with 0 (t ) = Pc e ( t ) + ( Fc u ) ( t )
( PcWk + Fk ) u = Fk ( Fk1PcWk + I ) u
K c = Fk1Pc .
PID-based initial design
Consider the delay-free plant Wk and assume a PID controller KPID exists to stabilize
the closed-loop system.
Then a starting point for the weighting choice that will ensure the operator
( PcWk + Fk ) is stably invertible is
Pc = K PID , Fk = 1
To demonstrate this selection reasonable consider scalar case and let controller
( k0 + k1 + k2 ) ( k0 + 2k2 ) z 1 + k2 z 2
K c = k0 +
k1
1 z 1
( 1
+ k2 1 z = ) 1 z 1
Assume the PID gains are positive numbers, with small derivative gain. Then
simple to confirm if Fk = 1 the Pcn term is minimum phase and has real
zeros.
Relationship to the Smith Predictor
The optimal controller can be expressed in a similar form to that of a Smith
Predictor. This provides a new nonlinear version of the Smith Predictor.
Compensator d
Reference Plant
r _ u
+ ( Ap Pcd )1G0Yf 1 + 1
Fck
+ y
_ _ + W
+
Wk
F0Y f1
mk
( Ap Pcd )1G0Yf 1Dk
Dk
-
+
Smith Predictor form of NGMV controller
The system may be redrawn and the compensator rearranged as shown below.
This structure is essential if Pc includes an integrator.
Compensator
Disturbance
Plant
Reference Output
u
0 - + y
+ Ap1G0Y f1 Fck1 Pcd1 W +
r - +
Wk
Pcn
Dk
p
_
+
Comments
This last structure is intuitively reasonable. With no plant-model mismatch, the
control is not due to feedback but involves an open-loop stable compensator.
1
The nonlinear inner-loop has weightings Fck Pc acting like an inner-loop
controller. If weightings are chosen to be of usual form this will represent a filtered
PID controller.
Stability: Under the given assumptions the resulting Smith system is stable. This
follows because the plant is stable, the inner-loop is stable and there are only
stable terms in the input block.
Feedback + Feedforward + Tracking Control
future reference 0 (t ) = Pc e ( t ) + ( Fc u ) ( t )
information
Measurable Un-measurable
+ +
Error Control
Pc
weighting Fc weighting
r C2
Reference
Disturbance
Wr yf Wd Wd0
+ + + models
r C1
+
Nonlinear
Setpoint Controller plant d1 d0
+ + u m + +
Ww Hf C0 W y
w - e0 + + +
Scaling
Hf
Feedback gain/dynamics
Feedback, Tracking and Feedforward Control
Signal Generation Control Modules
yf Total
Measured
disturbance (G P 1 1 i 1 1
0 0 d D f 1 z G1P1d Dd 1 )W 1
di Wd 3
u
f
disturbance
ur + +
d
Reference r (t + p ) G2 Prd1Er1
Nonlinear
plant
+ u Output
+ - +
Setpoint Hf w G0 P0 d D f 11
1
Fck1 W
- e + + m + y
Wk
F0Y f1
Models:
1 1 0.1
1 z 1 0 1 z 1 0 1 0.5 z 1 0
Wr = Wd = z 6 Wd 0 =
0 1 0 1 0.1
0
1 z 1 1 z 1 1 0.5 z 1
Output 1 setpoint
2 NGMV Control 1
NGMV+FF 1.5
NGMV+FF+TR
1.5
1
1
0.5
0.5
0 0
0 20 40 60 80 100 120 140 160 180 200 0 20 40 60 80 100 120 140 160 180 200
Output 2 Control 2
2 1.5
NGMV
1.5 NGMV+FF
1 NGMV+FF+TR
1
0.5
0.5
0 0
0 20 40 60 80 100 120 140 160 180 200 0 20 40 60 80 100 120 140 160 180 200
0
0
-1 -1
0 20 40 60 80 100 120 140 160 180 200 0 20 40 60 80 100 120 140 160 180 200
Output 2 Control 2 NGMV
3 1.5 NGMV+FF
2 NGMV+FF+TR
1
1 0.5
0 0
-1 -0.5
-2
0 20 40 60 80 100 120 140 160 180 200 -1
0 20 40 60 80 100 120 140 160 180 200