Documente Academic
Documente Profesional
Documente Cultură
R = RS() (1)
p = Rv (2)
= J
1
S()J + J
1
n (3)
v = S()v +
1
m
f +
1
m
R
T
b (4)
where R
3
and v R
3
denote the angular and linear
velocities expressed in {B}, m and J R
33
represent
the quadrotors mass and moment of inertia, f and n R
3
denote the external force and torque expressed in {B}, and
b R
3
is the external force disturbance expressed in {I}.
The map S(.) yields a skew symmetric matrix that veries
S(x)y = x y, for x and y R
3
.
Adopting the standard simplied model for quadrotors [4],
[10], such that torques can be generated in any direction and
the actuation force is always aligned with the body z axis,
the external force can be written as
f = Tu
3
+ mgR
T
u
3
(5)
where u
3
=
_
0 0 1
T
, T R is the sum of the thrust
contributions from the four rotors and g is the gravitational
acceleration. With full torque control, the Euler equations of
motion (3) can be reduce to the integrator form = =
[
1
,
2
,
3
]
T
, using the input transformation
n = J + S()J. (6)
The quadrotor is thus an underactuated vehicle, with more
degrees of freedom than actuation variables. In this particular
case, only a scalar force actuation T is available, which must
be complemented by the torque actuation vector n to control
the translational motion of the vehicle.
III. PROBLEM FORMULATION
Let the desired trajectory be described by a function of
time p
d
(t) R
3
, which is assumed to be of class C
4
. In what
follows, time dependence will be omitted to lighten notation.
The control objective consists of designing a control law for
the quadrotor inputs T and n, which ensures the convergence
of the position p to p
d
with the largest possible basin of
attraction.
Due to the underactuated nature of the vehicle, the desired
attitude cannot be arbitrarily selected. From (4) and (5), we
can verify that the equilibrium for trajectory tracking satises
R
d
T
d
u
3
= mgu
3
+b m p
d
. (7)
Consequently, the desired rotation matrix R
d
is automat-
ically prescribed up to a rotation about the body z axis
(R
d
R
z
()T
d
u
3
= R
d
T
d
u
3
), which is in agreement with the
fact that we are only constraining three degree of freedoms
when four control inputs are available. The symmetry ex-
hibited by the quadrotor vehicle dictates the nature of this
particular degree of freedom, given that rotations about the
body z axis play no part in directing the thrust vector.
We start by considering the tracking problem assuming
that the external force b is known and unbounded actuation
is available. The solution to this problem is then extended to
provide bounded actuation and nally to account for external
force disturbances.
IV. CONTROLLER DESIGN
Let the position error vector e
1
be given by
e
1
p p
d
. (8)
For tracking to be attained, the controller must drive this error
to zero and we therefore dene the rst tentative Lyapunov
function as the squared norm of the error e
1
V
1
1
2
e
T
1
e
1
. (9)
Computing the time derivative V
1
, we can write
V
1
= W
1
(e
1
) + k
1
e
T
1
(e
1
+
1
k
1
e
1
) (10)
where e
1
= Rv p
d
, W
1
(e
1
) = k
1
e
T
1
e
1
, and k
1
> 0.
Similarly to W
1
, W
i
(), i {2, 3, 4} is used in the sequel
to denote a quadratic function of its arguments that can
be made positive denite by appropriately selecting a set
of positive coefcients k
j
, j {1, . . . , i}. Applying the
standard backstepping procedure, we dene a second error
and Lyapunov function and compute its time derivatives to
obtain
e
2
e
1
+
1
k
1
e
1
(11)
V
2
1
2
2
i=1
e
T
i
e
i
(12)
V
2
= W
2
(e
1
, e
2
) + k
1
k
2
e
T
2
_
e
2
+
1
k
2
1
k
2
e
1
_
(13)
where e
1
=
1
m
(Rf +b) p
d
.
At this point, if full force control were available, a feed-
back law for f could be applied to yield
V
2
= W
2
(e
1
, e
2
)
and thus solve the tracking problem. In the present case,
rotation motion is required to align the force vector in a
suitable direction. Two courses of action can be taken with
regard to the thrust actuation. One can rst set a control law
for thrust and then follow the backstepping process, or, one
can work through the backstepping process using a generic
thrust until a control law that combines torque actuation and
a higher order derivative of the thrust can be dened. The rst
approach leads to an aggressive time-scale separation of the
system dynamics, as observed for the control law proposed
in [10]. In this paper, we follow the latter approach, which
is also used in [3] and [2].
We apply the backstepping procedure twice more to obtain
the errors
e
3
e
2
+
1
k
2
1
k
2
e
1
(14)
e
4
e
3
+ c
1
e
(3)
1
(15)
and associated Lyapunov functions V
3
1
2
3
i=1
e
T
i
e
i
and
V
4
1
2
4
i=1
e
T
i
e
i
. The constant c
1
is given by c
1
=
1
k
2
1
k
2
(k
1
k
2
+k
3
)
and e
(3)
1
denotes the third-order time derivative
of e
1
. Dening the complete error state vector
e [e
T
1
e
T
2
e
T
3
e
T
4
]
T
(16)
we can write
V
4
=
1
2
e
T
e (17)
V
4
= W
4
(e) +e
T
4
_
h(e, T,
T, R, , p
(4)
d
) + RM(T) u
_
(18)
where
h(e, T,
T, R, , p
(4)
d
) =
c
1
m
RS()(S()Tu
3
+ 2
Tu
3
)
c
1
p
(4)
d
+ k
1
k
2
(e
3
e
2
) + (k
1
k
2
+ k
3
+ k
4
)e
4
(19)
and the matrix M and the vector of inputs u are given by
M(T) =
c
1
m
_
0 0 T
0 T 0
1 0 0
_
(20)
and
u =
_
T
1
2
T
(21)
respectively. From the denition of M in (20) and
V
4
in (18),
it is easy to observe that as long as the thrust T remains
positive, we can dene a control law for u that guarantees
convergence of all the errors to zero. As an initial controller,
we consider the following feedback law
u = M
1
(T)R
T
h(e, T,
T, R, , p
(4)
d
) (22)
to obtain the closed-loop error system
e = Ke (23)
where K R
1212
depends on the coefcients k
i
.
Lemma 1: Consider the closed-loop system (23), which
results from applying the controller (22) and arbitrary
3
to
the quadrotor system described by (1)-(6) and expressing it
in error coordinates. For an appropriate choice of coefcients
k
i
, (23) has an asymptotically stable equilibrium point at the
origin.
Proof: Consider V
4
dened in (17) as a candidate Lya-
punov function for the closed-loop error system. Substituting
(22) in (18) yields
V
4
= W
4
(e) = k
1
e
T
1
e
1
k
1
(k
2
1)e
T
2
e
2
k
3
e
T
3
e
3
k
4
e
T
4
e
4
+ k
1
(e
3
+e
4
)
T
(e
2
e
1
). (24)
There is a suitable choice of coefcients k
i
that makes
V
4
a negative denite function. Using standard Lyapunov
arguments, we have that the origin of (23) is asymptotically
stable.
A. Bounding the position error
To enforce bounds on the actuation, different questions
need to be addressed. First, the reference trajectories should
be such that tracking can be achieved while keeping the
actuators within their limits of operation. For that purpose,
we not only assume that p
d
(t) is a class C
4
function, but
also impose bounds on its derivatives. Second, the control
law depends linearly on the errors e
2
, e
3
, and e
4
. This is
a critical limitation mainly because all these errors depend
on the position error e
1
, which can be arbitrarily large as
opposed to velocities and accelerations that are automatically
bounded by the operation limits of the vehicle.
To overcome this difculty, we consider the same con-
trol law as in (22) but redene the error vector e =
[e
T
1
e
T
2
e
T
3
e
T
4
]
T
such that e
1
remains unchanged whereas e
2
,
e
3
, and e
4
become functions of a bounded version of e
1
.
More specically, the velocity error e
2
is redened as
e
2
(e
1
) +
1
k
1
e
1
(25)
where is the saturation function given by
(x)
_
x if x p
max
p
max
x
x
if x > p
max
(26)
and this change is propagated to e
3
and e
4
according to (14)
and (15), respectively.
To show that the proposed controller continues to be a sta-
bilizing one, we start by considering the function V
4
=
1
2
e
T
e
as in the previous case and recompute its derivative given the
new denition of e. For small position errors e
p
< p
max
,
both the Lyapunov function and controls remain unchanged
and asymptotic stability can thus be proven. For large errors,
it can be shown that
V
4
k
1
e
1
(p
max
e
2
) W
5
(e
2
, e
3
, e
4
). (27)
Once again, a suitable choice of coefcients, ensures that
W
5
is positive denite, yielding a negative denite derivative
for V
4
when e
2
< p
max
. From this analysis, we can
only conclude that the system converges when the initial
conditions satisfy e(0) < p
max
. A less conservative result
can be obtained by showing that e
2
is ultimately bounded
by p
max
. The following theorem shows that this is in fact
the case.
Theorem 2: Let the quadrotor kinematics and dynamics
be described by (1)-(6), and consider the transformation to
error coordinates e
1
, e
2
, e
3
, e
4
given by (8), (25), (14),
(15), respectively. For an appropriate choice of coefcients
k
i
, the closed-loop system that results from applying the
controller (22) and arbitrary
3
to the error system has an
asymptotically stable equilibrium point at the origin.
Proof: Consider the Lyapunov function V
1
2
4
i=2
e
T
i
e
i
. Straightforward computations show that, for
e
1
p
max
, the following upper bound for
V
holds
k
1
((k
2
1) e
2
p
max
) e
2
+ k
1
(e
3
+e
4
)
T
e
2
(k
3
e
3
k
1
p
max
) e
3
(k
4
e
4
k
1
p
max
) e
4
(28)
whereas for e
1
> p
max
,
V
is given by
= k
1
k
2
e
T
2
e
2
k
3
e
T
3
e
3
k
4
e
T
4
e
4
+ (e
2
+e
3
+e
4
)
T
1
p
max
e
1
S((e
1
))
2
e
2
. (29)
For [e
T
2
, e
T
3
, e
T
4
] > p
max
,
V
b.
The external disturbances only affect the rigid body system
through (4). As such, the backstepping process is in all
similar to the one presented in the previous section up to
the denition of e
3
given in (14). Since the exact value of
b is no longer available, we redene e
3
to depend on the
estimate
b instead of on b, so that
e
3
e
2
+
1
k
2
1
k
2
(
1
m
Rf +
b p
d
) = e
2
+
1
k
2
1
k
2
( e
1
b).
(30)
Keeping the denition for e
4
given in (15) and adding a
quadratic term in
b to V
4
, we have
V
4
1
2
e
T
e +
1
2k
1
b
T
b (31)
V
4
= W
4
(e) +e
T
4
_
h(.) + RM(T) u +
1
k
2
1
k
2
b
_
+
1
k
2
1
k
2
e
T
3
b +
1
k
1
b
T
_
b + (e
2
+e
3
+e
4
)
_
(32)
where is a positive denite matrix and the matrix M, the
vector of controls u and the vector h are dened in (20),
(21), and (19), respectively.
Given the expression for
V
4
, we dene the estimator
update law
b = (e
2
+e
3
+e
4
) (33)
and the control law
u = M(T)
1
R
T
_
h(e, T,
T, R, , p
(4)
d
) +
1
k
2
1
k
2
b
_
(34)
leading to the Lyapunov function derivative
V
4
= k
1
e
T
1
e
1
k
1
(k
2
1)e
T
2
e
2
k
3
e
T
3
e
3
k
4
e
T
4
e
4
+ k
1
(e
3
+e
4
)
T
(e
2
e
1
) +
1
k
2
1
k
2
e
T
3
(e
2
+e
3
+e
4
) (35)
and the closed-loop system
e = K
b
e + [0 I
3
I
3
I
3
]
T 1
k
1
b (36)
b = [0 I
3
I
3
I
3
]e. (37)
We can now state the following result.
Lemma 3: For an appropriate choice of coefcients k
i
,
the closed-loop system (36)-(37) has an asymptotically stable
equilibrium point at the origin.
Proof: There is a set of coefcients k
i
such that
V
4
is a negative semi-denite function, which implies that the
origin is stable. Applying LaSalles invariance principle, we
have that (e,
b = 0.
Redening the velocity error e
2
to depend on the bounded
position error (e
1
) as in (25), we can follow the same steps
as in Section IV-A to analyze the stability of the closed-loop
system.
Theorem 4: Let the quadrotor kinematics and dynamics
be described by (1)-(6) and consider the transformation to
the error coordinates e
1
, e
2
, e
3
, e
4
given by (8), (25), (30),
(15), respectively. For an appropriate choice of coefcients
k
i
and arbitrary torque
3
, the closed-loop system (36)-(37)
that results from applying the controller (34) and the update
law (33) has an asymptotically stable equilibrium point at
the origin.
Proof: Using V
=
1
2
4
i=2
e
T
i
e
i
+
1
2k
1
b
T
b as a
Lyapunov function and noting that its derivative is negative
semi-denite for all [e
T
2
e
T
3
e
T
4
] p
max
leads to the result
that there exists a nite time T such that e
2
(t) < p
max
,
for t > T. Once e
2
(t) < p
max
we can use the Lyapunov
function (31) and note that its derivative is negative semi-
denite to prove stability of the closed-loop system and
convergence of e and
b to zero.
The proposed control law (22) can only be applied if the
thrust force never reaches zero. A conservative estimate of
the initial states for which the thrust never crosses the origin
can be obtained using (30) together with the bounds for
the errors and estimation derived from the negative semi-
deniteness of (35). In short, we can obtain the following
lower bound for |T(t)|
|T(t)| mg b m
_
_
_p
(3)
d
(t)
_
_
_
m(
_
2
max
() + 2
2k
2
1
k
2
)V
1/2
4
(0). (38)
If the initial conditions and desired trajectories are such
that the lower bound for |T(t)| is positive, then the thrust
T(t) that results from applying the proposed control law is
guaranteed to take only positive values.
V. EXPLORING THE EXTRA DEGREE OF FREEDOM
As specied in (7), if tracking of the desired trajectory
is accomplished then the rotation matrix is automatically
prescribed up to a rotation about the body z axis. This is
corroborated by the control law (34) obtained in the previous
section, which leaves the torque
3
=
3
free. We can
thus use this extra degree of freedom to meet an additional
heading control objective.
As a rst approach, we can simply enforce the conver-
gence of
3
to zero, applying for example the control law
3
= k
3
. A more involved goal that is still attainable is to
ensure that the vehicle tracks the reference trajectory with no
sideslip angle or equivalently with zero velocity component
along the body y axis. Combining (7), which describes the
trajectory tracking equilibrium, with the additional constraint
[0 1 0]
T
v
d
= v
yd
= 0 (39)
we can obtain the following expression for the desired
rotation matrix
R
d
=
_
S(r
3d
)
2
p
d
S(r
3d
) p
d
S(r
3d
) p
d
S(r
3d
) p
d
r
3d
_
(40)
whose third column is given by
r
3d
=
mgu
3
+b m p
d
mgu
3
+b m p
d
. (41)
Clearly, (40) is not well-dened for S(r
3d
) p
d
= 0. When this
is the case, any pair (r
1d
,r
2d
) such that R
d
= [r
1d
r
2d
r
3d
]
SO(3) yields v
yd
= 0. Since the control law for tracking
already ensures that r
3
converges to r
3d
, to drive v
y
to zero,
we need only have convergence of r
2
to r
2d
or equivalently
of r
T
2d
r
2
to one.
Using (40), straightforward but rather lengthy computa-
tions provide expressions for both
3d
and its time derivative
3d
. Gathering all these elements, we arrive at the following
PD-like control law for
3
3
= l
2
(
3
3d
+ l
1
r
T
2d
r
1
) +
3d
d
dt
(r
T
2d
r
1
) (42)
with l
1
> 0 and l
2
> 0. Using the Lyapunov function
W = (
3
3d
+l
1
r
T
2d
r
1
)
2
and computing its time derivative
along the systems trajectories, we can immediately conclude
that
W < 0 and so
3
converges to
3d
l
1
r
T
2d
r
1
. Clearly
this is not enough to guarantee that r
2
is approaching
r
2d
, however (42) gives some indication of providing this
convergence since the term l
1
r
T
2d
r
1
opposes growth in the
angular distance between r
2
and r
2d
. More formally, we can
dene the states
3
3d
+ l
1
r
T
2d
r
1
R (43)
1 r
T
2d
r
2
[0, 2] (44)
and verify that, at the tracking equilibrium given by e = 0
and = 0, the dynamic system for is described by
= (2 ) (45)
and the origin of (45) is asymptotically stable. If we consider
the closed-loop quadrotor system that results from applying
the control laws (22) and (42) and express it in error
coordinates with state (e, , ), (45) can be thought of as
the quadrotor zero dynamics. We can therefore conclude that
the overall closed-loop system has an asymptotically stable
equilibrium point at the origin, given that e = 0 and = 0
are asymptotically stable and so is the origin of the zero
dynamics. Determining the respective region of attraction
requires further investigation and is left for future work.
As in Section IV-B, assuming that unknown disturbances
are present, we can substitute b by m
b
x
b
y
bz
Fig. 3. Force disturbance estimates
to the continuous but nonsmooth reduction of the desired
velocity. All actuation signals are bounded and do not vary
signicantly from their nominal values during the simulation,
as shown in Fig. 5.
VII. CONCLUSIONS
This paper presented a state feedback solution to the
problem of stabilizing an underactuated quadrotor vehicle
along a predened trajectory using bounded thrust and torque
actuations in the presence of constant force disturbances. A
Lyapunov function for the system was derived using back-
stepping techniques and an adaptive estimator was introduced
so as to compensate for the force disturbance. The bounds on
the actuation result from the boundedness of the Lyapunov
function and depend on the initial conguration errors.
REFERENCES
[1] E. Frazzoli, M. Dahleh, and E. Feron, Trajectory tracking control
design for autonomous helicopters using a backstepping algorithm,
in Proc. American Control Conference, vol. 6, 2000, pp. 41024107.
[2] R. Mahony and T. Hamel, Robust trajectory tracking for a scale
model autonomous helicopter, International Journal of Robust and
Nonlinear Control, vol. 14, no. 12, pp. 10351059, 2004.
[3] , Image-based visual servo control of aerial robotic systems
using linear image features, IEEE Transactions on Robotics, vol. 21,
no. 2, pp. 227239, 2005.
[4] T. J. Koo and S. Sastry, Output tracking control design of a helicopter
model based on approximate linearization, in Proc. IEEE Conference
on Decision and Control, 1998, pp. 36353640.
[5] R. Skjetne and T. Fossen, On integral control in backstepping: Anal-
ysis of different techniques, in Proc. American Control Conference,
vol. 2, 2004, pp. 18991904.
[6] M. Krstic, I. Kanellakopoulos, and P. V. Kokotovic, Nonlinear and
Adaptive Control Design. Wiley, 1995.
[7] A. Teel, Global stabilization and restricted tracking for multiple
integrators with bounded controls, Systems & Control Letters, vol. 18,
no. 3, pp. 165171, 1992.
0 2 4
20
0
20
40
e
1
0 2 4
5
0
5
e
2
0 2 4
5
0
5
e
3
0 2 4
5
0
5
e
4
Time (s)
5 10 15 20 25 30
10
0
10
20
e
11
e
12
e
13
5 10 15 20 25 30
1
0.5
0
0.5
e
21
e
22
e
23
5 10 15 20 25 30
1
0.5
0
0.5
e
31
e
32
e
33
5 10 15 20 25 30
0.2
0
0.2
Time (s)
e
41
e
42
e
43
Fig. 4. Trajectory tracking errors
0 2 4
40
45
50
55
60
F
o
r
c
e
(
N
)
Time (s)
0 2 4
10
5
0
5
10
T
o
r
q
u
e
(
N
m
)
Time (s)
5 10 15 20 25 30
48
49
50
51
52
53
Time (s)
5 10 15 20 25 30
1
0.5
0
0.5
1
Time (s)
T
n
1
n
2
n
3
Fig. 5. Actuation
[8] R. Freeman and L. Praly, Integrator backstepping for bounded
controls and control rates, IEEE Transactions on Automatic Control,
vol. 43, no. 2, pp. 258262, 1998.
[9] F. Mazenc and A. Iggidr, Backstepping with bounded feedbacks,
Systems & Control Letters, vol. 51, no. 3-4, pp. 235245, 2004.
[10] A. Isidori, L. Marconi, and A. Serrani, Robust nonlinear motion
control of a helicopter, IEEE Transactions on Automatic Control,
vol. 48, no. 3, pp. 413 426, 2003.
[11] S. P. Bhata and D. S. Bernstein, A topological obstruction to con-
tinuous global stabilization of rotational motion and the unwinding
phenomenon, Systems and Control Letters, vol. 39, no. 1, pp. 6370,
Jan. 2000.