Sunteți pe pagina 1din 8

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 22, NO.

1, JANUARY 2014 299

Time-Delayed Output Feedback Bilateral Teleoperation With Force


Estimation for n-DOF Nonlinear Manipulators
John M. Daly and David W. L. Wang, Member, IEEE

Abstract— This brief presents a novel bilateral teleoperation models of the master and slave dynamics are used. The master
algorithm for n degree of freedom nonlinear manipulators manipulator is controlled with a standard impedance controller.
connected through time delays. Central to this approach is the Proof of stability is given, but it requires that both the human
use of second-order sliding mode unknown input observers for
estimating the external forces, removing the need for both velocity and the environment be passive in order to show stability [5].
and force sensors. This leads to a lower-cost hardware setup that This approach requires force sensors.
provides all of the advantages of a position-force teleoperation This brief presents a novel bilateral teleoperation algorithm
algorithm. Stability is guaranteed considering the presence of for n-DOF nonlinear manipulators that provide the benefits
time delays. Numerical and experimental results are presented. of a position-force architecture in terms of transparency and
Index Terms— Bilateral teleoperation, force estimation, sliding force tracking, but does not require the use of force sensors.
mode control, sliding mode observers, time delays. The work presented here extends our earlier work presented
in [6] and [7]. The work in [6] was developed for linear
I. I NTRODUCTION 1-DOF systems, while [7] presents our preliminary results for
nonlinear n-DOF teleoperation systems. This brief builds upon
B ILATERAL teleoperation involves the ability to control
a remote manipulator and to sense the forces acting on
the robot in the remote environment. One of the major issues
the results of [7] by extending the closed-loop stability results
for both delay-dependent and delay-independent stability, and
with time-delayed bilateral teleoperation is that of stability. performing a thorough simulation study.
Several bilateral teleoperation architectures exist, including In Section II, the bilateral teleoperation system, along with
the position-position and position-force architectures. An issue associated controllers and observers, is presented. Section III
with the position-position approach is that differences between presents the stability analysis of each of the master and
the master and slave position may be experienced as large slave systems, while Section IV develops stability results for
reaction forces by the operator, even when the slave may the entire closed loop. Numerical simulations are presented
be operating completely in free motion [1]. Niemeyer and in Section V, and an experimental verification is given in
Slotine [2] have developed a well known variant of this Section VI. Conclusions and areas for future work are given
architecture based on wave variables and ensuring passivity in Section VII.
of the closed loop. The position-force architecture involves
transmitting the master position to the slave side and then a II. P ROBLEM F ORMULATION
measurement of the slave environment force back from the Although similar to [8], which deals with linear 1-DOF
slave to the master side. The advantages of this architecture systems and requires measurement of positions, velocities,
are perfect force tracking when the slave is in contact with and external forces, the algorithm presented here is developed
an environment and a better perception of the system in free for nonlinear n-DOF manipulators and requires only position
motion [3]. measurements. A sliding mode controller is used at the slave
Polushin et al. [4] have recently proposed an approach to side to ensure a desired closed-loop impedance and tracking
bilateral teleoperation for n degree of freedom (DOF) nonlin- of the delayed master trajectory. A computed torque method
ear manipulators over a network with unknown and varying impedance controller at the master side is used to give the
time delays. It requires the assumption that the environment master a desired impedance and to apply the reflected slave
is a passive operator in order to guarantee stability. Garcia- environment force back to the master. Robot position mea-
Valdovinos et al. [5] proposed the use of a second-order surements drive observers that estimate both the state and
sliding-mode controller for the slave in a bilateral teleoperation the external forces. The control algorithm in this brief is
system with a constant but unknown time delay. Linear 1-DOF designed in the manipulator Cartesian space and the slave
Manuscript received August 31, 2011; revised September 3, 2012; accepted
environment is modeled as an n-DOF system acting on the
December 23, 2012. Manuscript received in final form January 19, 2013. Date slave end effector. In this brief, it is assumed that time delays
of publication February 14, 2013; date of current version December 17, 2013. are constant. Provided that an upper bound on the time delays
Recommended by Associate Editor M. Fujita.
J. Daly is with JSI Telecom, Kanata, ON K2M 1X3, Canada (e-mail:
in the system is known, a buffer may be implemented in
jmdaly@gmail.com). software to ensure that incoming signals arrive to the controller
D. Wang is with the Department of Electrical and Computer Engi- at a known constant rate.
neering, University of Waterloo, Waterloo, ON N2K2E4, Canada (e-mail:
dwang@uwaterloo.ca).
Consider the following master manipulator dynamics in task
Color versions of one or more of the figures in this paper are available space:
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TCST.2013.2242329 Ẋ m1 = X m2 (1)
1063-6536 © 2013 IEEE
300 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 22, NO. 1, JANUARY 2014

−1
 
Ẋ m2 = M̄m (X m1 ) − h̄ m (X m1 , X m2 ) + Fm + Fh (2) The estimate of the environmental force acting on the slave
is obtained from the equivalent output injection term z s2 i as
where X m1 ∈ R n is the vector of positions and X m2 ∈
R n is the vector of velocities. The matrices M̄m (X m1 ) and F̂e = − M̄s ( X̂ s1 )z s2 eq (12)
h̄ m (X m1 , X m2 ) are defined in [9]. The vectors Fm ∈ R n and where the i th element of vector z s2 is defined analogously
Fh ∈ R n represent the input forces and the external forces to the i th element of z m2 , and M̄s is defined analogously to
applied by the human, respectively. Defining the slave states the master. The extension of the single-input, single-output
similarly, the slave state space representation of the dynamics (SISO) observers in [10] to MIMO observers is presented in
is given as the Appendix.
Ẋ s1 = X s2 (3)
−1
  B. Control Laws
Ẋ s2 = M̄s (X s1 ) − h̄ s (X s1 , X s2 ) + Fs − Fe (4)
A signal x(t) delayed by T1 and T2 s, respectively, is
where the vectors Fs ∈ and Fe ∈
Rn Rn
represent the input represented as
forces and the external forces applied by the environment, x d1 (t) ≡ x(t − T1 ), x d2 (t) ≡ x(t − T2 ).
respectively.
Similarly, a signal delayed by two time delays, T1 and T2 , is
represented as
A. Master and Slave Observers
x dd (t) ≡ x(t − T1 − T2 ).
Observers are used at both the master and slave sites. They
are based on the observer developed in [10], but are designed The master control law is a computed torque method
for the n-DOF case. They make use of the super-twisting controller to decouple and linearize each DOF in the task
second-order sliding-mode algorithm. For the case of the n- space. The outer loop controller is specified as
DOF bilateral teleoperation system, multiple-input, multiple- Fm = M̄m ( X̂ m1 )vm + h̄ m ( X̂ m1 , X̂ m2 ) − F̂h . (13)
output (MIMO) observers are developed. The observer for the
master manipulator is given as The inner impedance controller, to provide each degree of
freedom with the desired impedance characteristics, is given
˙
X̂ m1 = X̂ m2 + z m1 (5) as
  −1
 
˙ −1
vm = M̃m −B̃m X̂ m2 − K̃m X̂ m1 + F̂h − F̂e
d2
X̂ m2 = M̄m ( X̂ m1 ) − h̄ m ( X̂ m1 , X̂ m2 ) + Fm + z m2 (6) (14)

where z m1 ∈ R n and z m2 ∈ R n . The i th element of vector z m1 where M̃m ∈ R n×n is the diagonal constant matrix that
is given as specifies the desired mass characteristic for each degree of
freedom. The desired mass for the i th DOF is given by the i th
z m 1i = λmi |X m 1i − X̂ m 1i |1/2sign(X m 1i − X̂ m 1i ) (7) diagonal of M̃m . The matrices B̃m ∈ R n×n and K̃m ∈ R n×n
are also diagonal constant matrices representing the desired
and the i th element of vector z m2 is given as
damping and stiffness values for each DOF.
z m 2i = αmi sign(X m 1i − X̂ m 1i ) (8) In order to present the slave controller, define the master-
slave position and velocity tracking error as e r1 = X s1 −
where λmi and αmi are constants. Conditions on these constants X m1 d1 ∈ R n and e r2 = X s2 − X m2 d1 ∈ R n . The controller
are given in the Appendix. Note that the human force exerted is designed in order to give each DOF of the end effector a
on the end effector does not appear at all in the observer. desired impedance characteristic. Controlling the mass, spring,
Regardless of this, finite time convergence of the state esti- and damping characteristics of the end effector allows the
mates is achieved, and an estimate of the human force is operator to tune the system based on the particular application.
obtained as well. It is this force estimate that is used in the The desired impedance model for the slave end effector is
control law. The human force estimate is obtained from the expressed as
equivalent output injection term z m 2i as  
Fe (s) K̃s
F̂h = M̄m ( X̂ m1 )z m2 eq (9) Zs (s) = = − M̃s s + B̃s + (15)
Ve (s) s
where z m2 eq represents a filtering operation on z m2 in order where Fe (s) represents the Laplace transform of the envi-
to obtain the equivalent output injection signal [11]. The ronmental force, Ve (s) represents the Laplace transform of
equations for the slave observer are expressed as the tracking error velocity, and the matrices M̃s , B̃s , and
˙ K̃s are defined as in the master controller, but for the slave
X̂ s1 = X̂ s2 + z s1 (10) impedances. This impedance model gives rise to the following
 
˙ −1 n-DOF equation of dynamics:
X̂ s2 = M̄s ( X̂ s1 ) − h̄ s ( X̂ s1 , X̂ s2 ) + Fs + z s2 (11)
I = M̃s ė r2 + B̃s e r2 + K̃s e r1 + Fe = 0. (16)
where z s1 ∈ R n and z s2 ∈ R n . The i th element of vectors
z s1 and z s2 are defined analogously to the i th elements of z m1 This equation is obtained from (15) by taking an inverse
and z m2 , respectively. Laplace transform. The slave controller is designed to ensure
DALY AND WANG: TIME-DELAYED OUTPUT FEEDBACK BILATERAL TELEOPERATION WITH FORCE ESTIMATION 301

that the slave closed-loop dynamics have mass, spring, and Theorem 3.1: Consider the master and slave manipulators
damping characteristics as specified in (16). Additionally, the connected bilaterally through a time delay of T1 s from the
controller will ensure that the slave asymptotically tracks the master to the slave and T2 s from the slave to the master,
delayed master trajectory in the absence of contact with the with master control law (13) and (14), master side observer
environment, with tracking error dynamics given by (16). In (5), (6), and slave sliding mode control law (20) with slave
order to ensure that this desired impedance characteristic is side observer (10), (11). Then, there exists a sliding mode
satisfied, a sliding surface for the slave controller is defined controller gain Kg = kg I, where
as  t


s= M̃s−1 I(τ ) dτ = 0. (17) kg ≥ 2 n max αsi + max αmi + εg (21)
i i
0
Once the slave sliding mode controller has driven the system for any εg > 0, and observer gains λm , αm , λs , αs such that
trajectories to s = 0, then (16) will be satisfied and the the state estimates recover the true state in finite time, and the
slave manipulator will have the desired closed-loop dynamics. master and slave manipulators have the desired closed-loop
However, this brief examines output feedback control. An dynamics.
output feedback version of the end effector dynamics that Proof: See [12].
yields the desired impedance model is defined as Theorem 3.1 guarantees stability of each of the master and
slave manipulators. The next section addresses the issue of
Î = M̃s ê˙ r2 + B̃s ê r2 + K̃s ê r1 + F̂e = 0 (18) closed-loop stability under a variety of environmental condi-
d1 d1 tions.
where ê r1 = X̂ s1 − X̂ m1 and ê r2 = X̂ s2 − X̂ m2 . Now, the
sliding surface in the output feedback case is defined as
 t IV. S TABILITY OF THE T ELEOPERATOR S YSTEM IN
C ONTACT W ITH THE E NVIRONMENT
ŝ = M̃s−1 Î(τ ) dτ = 0. (19)
0 Having guaranteed stability for each of the master and slave
Then, the slave side sliding mode controller is given as manipulators with their associated observers and controllers,
 it remains to show that the entire closed loop can be stabilized
Fs = − M̄s ( X̂ s1 ) M̃s−1 K̃s X̂ s1 + M̃s−1 B̃s X̂ s2 in the presence of time delays. This section will present
−1 closed-loop stability results for two situations. The first case
− M̄s ( X̂ s1 ) h̄ s ( X̂ s1 , X̂ s2 )
looks at the situation where the system may be stabilized
d1
+ (M̃m K̃m − M̃s−1 K̃s ) X̂ m1
−1
for any size of delay, and the delay need not be known
−1 d1 a priori, provided that the environment is a linear spring
+ (M̃m B̃m − M̃s−1 B̃s ) X̂ m2
damper. The second case presents a delay-dependent stability
−1 −1 d1 d1
− (M̃m − M̄m ( X̂ m1 )) F̂h result for situations involving the same environment at the
−1 dd slave side. Delay-independent stability for nonlinear finite-gain
+ M̃m F̂e + M̃s−1 F̂e + z s2 eq
 stable environments was presented in [7].
− z m2 eq d1 + Kg sign(ŝ) (20)
A. Delay-Independent Stability With a Linear Environment
where Kg = kg I ∈ R n × n and kg is a scalar whose value will
be specified later. This case examines the situation where the slave end effec-
To summarize, for master and slave manipulators connected tor is in constant contact with a linear environment. This envi-
bilaterally through a time delay of T1 s from the master to the ronment is modeled as a set of n spring-dampers in connection
slave and T2 s from the slave to the master, the system may with each DOF of the end effector. The environment is located
be controlled using the master control law (13) and (14) with at X s1 = Xc and is described by the model
master side observer (5), (6), and slave sliding mode control Fe = Ke (X s1 − Xc ) + Be X s2 (22)
law (20) with slave side observer (10), (11).
where Ke ∈ Rn × n is a diagonal matrix of spring constants and
III. S TABILITY OF E ACH OF THE M ASTER AND Be ∈ R n×n is a diagonal matrix of damping constants. When
S LAVE P LANTS this environment is in contact with the slave manipulator, the
equilibrium point of the slave system is no longer the origin.
In order to show the stability of this system, the following
In order to perform a stability analysis, a new state is defined
assumption is made.
to express the slave as a system with an equilibrium at the
Assumption 3.1: The external forces acting on both master
origin. Define a new state as
and slave are bounded for all time with some known upper
 −1
bounds. In particular, the environment system that generates X s1  = X s1 − K̃s + Ke K e Xc . (23)
the forces on the teleoperator is a finite-gain L 2 stable system,
and the human dynamics are not considered. Applying this transformation to the slave plant ensures that
The next theorem shows that the master and slave con- the equilibrium point of the system in the new states is at the
trollers and observers ensure that each plant has the desired origin.
closed-loop dynamics. Additionally, the slave manipulator will In this section, the Small Gain Theorem is used to develop
asymptotically track the delayed master trajectory. sufficient conditions to ensure closed-loop stability regardless
302 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 22, NO. 1, JANUARY 2014

of the time delays. A transfer function representation of the slave parameters, the maximum delay for which the system
master system is used. The master system has as inputs the is stable may be found. With this result, the designer may
sum of the human force and the delayed environmental force. determine if the particular parameters chosen will maintain
Define this input signal as e1 = Fh − Fe (t − T2 ) ∈ R n . stability for the delay considered in a particular case.
It produces as outputs: position, velocity, and acceleration. This result applies for linear systems with commensurate
T
Define the master output as Ym = [X m1 T , X m2 T , Ẋ m2 ]T . delays. In a teleoperation system, this is achievable as it is
The slave system receives e2 = Ym (t − T1 ) ∈ R as input.
3n always possible to put buffers in the system on each side so
The environmental forces acting on the slave end effector are that incoming signals may be released to each of the master
taken as its output, that is, Ys = Ke X s1  + Be X s2 . The master and slave at a known time interval. In this section, it is assumed
transfer function matrix G m (s) ∈ C 3n × n can be found as that the environment is a linear set of spring-dampers having
⎡ ⎤ the form (22).
G m1 (s)
The theorem from [13] to be presented is valid for linear
G m (s) = ⎣ G m2 (s) ⎦ (24)
retarded function differential equation (RFDE) systems of the
G m3 (s)
form
where 
m

ẋ(t) = A0 x(t) + Ak x(t − kτ ), τ ≥0 (30)
1
G m1 (s) = diag (25) k=1
m s 2 + bmi s + kmi
mi
where x(t) ∈ R n and Ai ∈ R n×n for all i = 0, . . . , m.
s
G m2 (s) = diag (26) The main delay-dependent stability result, found in [13], is
mm s 2 + bmi s + kmi
i
now presented.
s2
G m3 (s) = diag (27) Theorem 4.2: Suppose that (30) is stable at τ = 0, and let
mmi s 2 + bmi s + kmi q = rank(Am ). Furthermore, define
for i = 1 . . . n, where mmi , bmi , and kmi are the i th diagonals ⎧
⎪ θi
⎪ min1≤k≤n ki if λi (G( j ωki ), H( j ωki )) =
of M̃m , B̃m , and K̃m , respectively. This transfer function matrix ⎪
⎪ ω

⎪ k
takes one input vector, the sum of the applied human force and ⎨ e− j θk for some ωki ∈ (0, ∞),
i

the delayed slave side force, and has three output vectors— τ̄i : =

⎪ θki ∈ [0, 2π]
master position, velocity, and acceleration. The time delays in ⎪
⎪ ∞ if ρ(G( j ω), H( j ω)) > 1


both the feedforward and feedback paths may be combined ⎩
∀ω ∈ (0, ∞)
with each transfer function matrix to yield the following
transfer matrices: where

G m d1 (s) = G m (s)e−T1 s , G s d2 (s) = G s (s)e−T2 s . (28) ⎡ ⎤


0 I ··· 0
Stability of the feedback interconnection of G m d1 (s) and ⎢ .. .. . . . ⎥

G(s) : = ⎢ . . . .. ⎥

G s d2 (s) will now be shown. ⎣ 0 0 ··· I ⎦
Assumption 4.1: An upper bound on the ∞-norm of the −(sI − A0 ) A1 · · · Am−1
slave + environment system G s (s) is known.
H(s) : = diag(I, . . . , I, −Am ).
Theorem 4.1: Consider the master system connected
through time delays to slave system. Define Then
1
γs+e = τ̄ : = min τ̄i .
||G s (s)||∞ 1≤i≤q+n(m−1)
where ||G s (s)||∞ is the L 2 norm of the slave + environment Equation (30) is stable for all τ ∈ [0, τ̄ ), but becomes unstable
system. If the master impedance parameters are chosen as at τ = τ̄ .
 Note that the notation λi ( A, B) represents the i th general-
1 3 
mmi > , kmi = mmi , bmi = 2kmi mmi (29) ized eigenvalue of two square matrices A and B. Also
γs+e 2
then the closed-loop teleoperator system is L 2 stable, indepen- ρ( A, B) : = min{|λ| | det( A − λB) = 0}.
dent of delay.
Proof: See [12]. This theorem requires a frequency sweeping test. Strictly
speaking, the theorem requires the evaluation of generalized
eigenvalues continuously between 0 rad/s and ∞ rad/s. In
B. Delay-Dependent Stability With a Linear Environment practice, a numerical approach is used [13]. The frequency
While the results of the previous section show that sufficient axis is broken up into a grid, and at each grid point the
conditions may be found to stabilize the closed-loop system generalized eigenvalues of G( j ωki ) and H( j ωki ) are computed.
for any delay, those choices may be overly conservative for If ρ(G( j ω), H( j ω)) > 1 for all ω, the system has a delay
small delays. This section makes use of a result in [13] to margin of ∞. If this is not the case, the computation results
determine stability of the closed-loop system for a particular in pairs (ωki , θki ) that will provide estimates of the delay margin
range of delays. That is, for a given choice of the master and τ̄i . The smallest τ̄i will be the delay margin for the system. The
DALY AND WANG: TIME-DELAYED OUTPUT FEEDBACK BILATERAL TELEOPERATION WITH FORCE ESTIMATION 303

pairs are found in the cases where the generalized eigenvalues


have magnitude one. At these points, the phase θki is found,
since the generalized eigenvalues are complex in general.
To make use of Theorem 4.2 for the bilateral teleoperation
system, the closed-loop master and slave dynamics must be
expressed as a set of RFDEs of the form (30). Substituting
the environment dynamics into the slave dynamics and per-
forming the change of variable (23) to shift the equilibrium
of the slave + environment dynamics to the origin yields the
following matrices for the RFDE (30) (note that A0 and A1 Fig. 1. Position of the slave (solid line) and master (dashed line) end effectors
are shown at the bottom of the page): with the slave in free motion.
⎡ ⎤
0 0 00
⎢ −M̃−1 Ke −M̃−1 Be 0 0 ⎥
A2 = ⎢ ⎣
m m ⎥
0 0 0 0⎦
0 0 00
where the state of the RFDE system is defined as x(t) =
[X s1 T , X s2 T , X m1 T , X m2 T ]T . For n-DOF master and slave
manipulators, Ai ∈ R 4n × 4n for all i = 0, . . . , 2. In this
system, τ represents the delay from the master to the slave
and the same delay from the slave to the master. Some terms
are delayed by 2τ because the environment force is sent from Fig. 2. Actual (solid line) and estimated (dashed line) human force applied
the slave to the master, and then back from the master to the to the master manipulator.
slave. This accounts for the matrix A2 .
With these matrices defined, the delay-dependent stability of
fifth-order Runge–Kutta solver with sample period
the closed-loop teleoperation system may be checked accord-
Ts = 10−5 s to show the ideal performance of the
ing to the conditions of Theorem 4.2.
algorithm.
V. S IMULATION R ESULTS
The performance of the algorithm is now examined through A. Simulation Results With Slave in Free Motion
a simulation study. The master and slave manipulators are Fig. 1 shows the master and slave end effector trajectories
chosen as standard 2-DOF nonlinear serial link manipulators. for the first DOF. Signals for the second DOF are similar. The
Both the master and slave have the same dynamic parameters slave manipulator tracks the delayed version of the master
of mm1 = mm2 = ms1 = ms2 = 1 kg for the mass and trajectory, as desired. Both the master and slave observers
lm = ls = 1 m for the length of the links. In this simulation, perform properly, as expected. The master external force for
the desired impedance characteristics for each manipulator are the first DOF, along with its estimate, is shown in Fig. 2.
chosen as, M̃m = 22I, B̃m = 32I, K̃m = 22I, M̃s = I, The external force is estimated quite well. There are some
B̃s = 4I, and K̃s = 4I. These values are chosen to ensure chattering type effects from the switching, but this tends to be
that the closed-loop system is stable independent of delay. no worse than the noise that would be produced from a strain
The master and slave force estimates are obtained with 30-Hz gauge for force sensing.
first-order low-pass filters.
The first simulation shows the teleoperation system B. Simulation Results With Slave in Contact
when the slave is in free motion. The second simulation
Fig. 3 shows the master and slave end effector positions
shows the situation with the slave in contact with an
over time for the first DOF. The results for the second DOF are
environment, modeled as a linear spring-damper attached
similar. Due to the definition of the slave desired impedance
to each slave end effector DOF. Time delays of 0.25 s in
model, one does not see asymptotic tracking when the slave
each direction were used. The system was simulated using a
is in contact with an environment. The slave, when coupled

⎡ ⎤
0 I 0 0
⎢−M̃−1 (K̃s + Ke ) −M̃−1 (B̃s + Be ) 0 0 ⎥
A0 = ⎢

s s ⎥

0 0 0 I
0 0 −1 −1
−M̃m K̃m −M̃m B̃m
⎡ ⎤
0 0 0 0
⎢ 0 0 M̃s−1 K̃s −1 K̃ M̃−1 B̃
− M̃m −1 B̃ ⎥
− M̃m
A1 = ⎢

m s s m⎥

0 0 0 0
−1 −1 B
−M̃m Ke −M̃m 0 0
e
304 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 22, NO. 1, JANUARY 2014

Fig. 3. Position of the slave (solid line) and master (dashed line) end effectors
with the slave in contact. Fig. 5. Position of the master (solid line) and slave (dashed line) manipulators
in the first experiment.

Fig. 4. Actual (solid line) and estimated (dashed line) environment force
applied to the slave manipulator.

Fig. 6. Estimate of the environmental torque acting on the slave in the first
experiment.
with the environment, has a much smaller gain than the slave
system in free motion. Fig. 4 shows the external environmental
force acting on the slave and its estimate for the first DOF.
the observers, the switching signals were passed through
The force estimates are quite accurate.
5-Hz second-order low-pass filters.
For the slave environment, a stiff metal structure was used.
VI. E XPERIMENTAL R ESULTS Closed-loop impedance parameters were chosen experimen-
The experiments presented in this section validate that the tally to ensure stable behavior in contact. A fifth-order Runge–
algorithm is practically feasible and has merit as a real- Kutta algorithm was used to integrate the dynamics. The initial
world approach. This algorithm is implemented on two 3-DOF experimental run contains no time delays.
robot manipulators connected to a PC through data acquisition In the first experiment, the closed-loop impedance para-
hardware. For each manipulator, the two joints that actuate meters for the master and slave are chosen as M̃m = 7,
the five bar linkage were rigidly locked.1 The base joint of B̃m = 14, K̃m = 0, M̃s = 7, B̃s = 14, and K̃s = 7. The
each manipulator was actuated and used as the only degree of slave sliding mode controller gain is chosen to be Kg = 3.
freedom in the experiments. The slave control signal was passed through a linear second-
Dynamic models of the base DOF for each manipulator order 100-Hz low pass filter before being applied to the slave
were developed prior to running the experiments. For each manipulator. The environment was placed on the slave side at
robot, the base DOF was modeled as a mass-damper with an angular position of roughly −0.27 rad.
Coulomb friction. Due to the limitations of the hardware Fig. 5 shows the master and slave positions during the run.
used, the sample time in the experiments is limited to As expected, the slave tracks the master when in free motion,
Ts = 5 × 10−4 s. While this sample period is relatively but not when in contact with the environment. The estimate of
small for the manipulators, it is much larger than the sample the external torque acting on the slave is given in Fig. 6. This
period used in simulation. This, along with measurement torque estimate is transmitted to the master manipulator, where
noise, contributes to more chattering in the sliding mode it can be felt by the operator. When the slave is not in contact
observer estimates. As a result, the switching components with the environment, there is some nonzero torque estimate.
in the observers and the slave controller were replaced by This is as a result of running the observers at a relatively large
saturation functions to implement boundary layers. The time step as well as the use of boundary layers.
slave control signal was passed through a second-order A second experiment was performed, introducing time
low-pass filter before being applied to the plant. Without the delays of 0.5 s in each direction. The master closed-loop
use of the filter, the chattering became too great, causing impedance parameters were chosen as M̃m = 22, B̃m = 32,
too much power draw through the motor amplifier power and K̃m = 22. The slave closed-loop impedance parameters
supply. In order to obtain the estimated force signals from were chosen as M̃s = 7, B̃s = 42, and K̃s = 63. The
master impedance parameters were chosen such that the
1 The available hardware permitted only the use of 1 DOF per robot. closed-loop system would be stable independent of delay
DALY AND WANG: TIME-DELAYED OUTPUT FEEDBACK BILATERAL TELEOPERATION WITH FORCE ESTIMATION 305

A theoretical analysis of the observer-controller combina-


tions on each side of the teleoperator has shown stability
for each of the manipulators. Closed-loop stability of the
whole teleoperator system under time delays was guaran-
teed for various slave side environments. One avenue for
future work on the theoretical development is to perform
a closed-loop stability analysis on the system where the
assumptions on the human and the environment have been
relaxed.
It was found that, in simulation with a sufficiently fast
Fig. 7. Position of the master (solid line) and slave (dashed line) manipulators sample frequency, the algorithm performed quite well for cases
in the second experiment. where the slave is both in and out of contact with an environ-
ment. With the use of boundary layers in the slave controller
and the observers, as well as with some filtering of the slave
control signal, a feasible and practical implementation of this
algorithm can be achieved. The experimental results suggested
that this algorithm is stable through contact transitions at
the slave side. This is one aspect to be examined from a
theoretical standpoint in future work. Further future work on
this algorithm will examine the use of second-order sliding-
mode controllers for the manipulators, thereby eliminating the
application of a discontinuous switching signal directly to the
plant.

Fig. 8. Estimate of the human torque applied to the master in the second A PPENDIX
experiment.
The observers used in this brief are based on SISO observers
presented in [10]. Here, MIMO observers are designed, and a
by the Small Gain Theorem. The slave control signal was corollary is developed to prove that the SISO observers may
filtered with a second-order low-pass 25-Hz filter. The same be extended to MIMO observers while still ensuring finite time
environment as the previous experiment was placed on the convergence of the observer error dynamics to zero.
slave side. Consider the n-DOF plant
Fig. 7 shows the master and the slave trajectories. The
ẋ1 = x2 (31)
system remains stable both in and out of contact with the
environment, and the slave tracks the delayed master trajectory ẋ2 = f (t, x1, x2 , u) + ξ (t, x1 , x2 , u) (32)
when in free motion. The estimate of the torque applied to
where x1 ∈ R n , x2 ∈ R n , u ∈ R n , and y = x1. Here,
the master manipulator is given in Fig. 8. During periods of
f (t, x1 , x2 , u) represents the known plant dynamics, while
time when the slave is in contact with the environment, the
ξ (t, x1, x2 , u) represents the unknown inputs/dynamics. In this
operator must apply more torque to the master to compensate
case, this term represents the unknown force input. Also,
for the environment torque that is fed back from the slave.
f ∈ R n may be expressed as
This shows the effectiveness of the force feedback portion of ⎡ ⎤
the algorithm. f1 (t, x1, x2 , u)
⎢ .. ⎥
f (t, x1 , x2 , u) = ⎣ . ⎦
VII. C ONCLUSION f n (t, x1 , x2 , u)
This brief presented a novel bilateral teleoperation algorithm
and equivalently for ξ (t, x1, x2 , u) ∈ R n .
for n-DOF nonlinear manipulators connected through time
Now consider the following observer for (31) and (32):
delays. It provides the benefits of a position-force teleoperation
architecture while only requiring position measurements, x̂˙ 1 = x̂2 + z 1 (33)
giving the advantages of both the position-position and
position-force architectures while removing their respective x̂˙ 2 = f (t, x1 , x̂2 , u) + z 2 (34)
disadvantages. By using unknown input sliding mode where x̂1 ∈ R n and x̂2 ∈ R n are the position and velocity
observers, both the external forces and plant states were esti- estimates. As well, z 1 ∈ R n and z 2 ∈ R n . The i th element of
mated exactly with finite time convergence, thus eliminating vector z 1 is given as
the need for velocity and force sensors. The output feedback
controllers for each of the master and slave decouple and z 1i = λi |x 1i − x̂ 1i |1/2 sign(x 1i − x̂ 1i ) (35)
linearize the dynamics in the Cartesian space, allowing one to
and the i th element of vector z 2 is given as
specify desired impedance characteristics for each end effector
DOF. z 2i = αi sign(x 1i − x̂ 1i ). (36)
306 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 22, NO. 1, JANUARY 2014

It is assumed that x̂1(0) = x1 (0) and x̂2 (0) = 0. Now define R EFERENCES
the term F(t, x1, x2 , x̂2 , u) ∈ R n as [1] D. A. Lawrence, “Stability and transparency in bilateral teleoper-
ation,” IEEE Trans. Robot. Autom., vol. 9, no. 5, pp. 624–637,
F(t, x1 , x2 , x̂2 , u) = f (t, x1 , x2 , u) − f (t, x1 , x̂2 , u) Oct. 1993.
+ξ (t, x1, x2 , u) (37) [2] G. Niemeyer and J.-J. E. Slotine, “Telemanipulation with time delays,”
Int. J. Robot. Res., vol. 23, pp. 873–890, Sep. 2004.
and assume that the inequality [3] M. Tavakoli, A. Aziminejad, R. V. Patel, and M. Moallem, “Enhanced
transparency in haptics-based master-slave systems,” in Proc. Amer.
|Fi (t, x1 , x2 , x̂2 , u)| < fi+ (38) Control Conf., Jul. 2007, pp. 1455–1460.
[4] I. Polushin, P. Liu, C. Lung, and G. On, “Position-error based
holds over the operational domain. Provided that the plant schemes for bilateral teleoperation with time delay: Theory and
experiments,” J. Dynamic Syst., Meas., Control, vol. 132, no. 3,
being stabilized does not have a finite escape time and the pp. 031008-1–031008-11, 2010.
controller that has been designed will stabilize the plant in [5] L. G. Garcia-Valdovinos, V. Parra-Vega, and M. A. Arteaga, “Observer-
the full-state feedback case, one may choose the observer based higher-order sliding mode impedance control of bilateral teleoper-
dynamics to be fast enough so that the state estimates are ation under constant unknown time delay,” in Proc. IEEE/RSJ Int. Conf.
Intell. Robot. Syst., Oct. 2006, pp. 1692–1699.
recovered before the plant trajectories leave some chosen [6] J. M. Daly and D. W. L. Wang, “Bilateral teleoperation using unknown
area [10]. Let αi and λi satisfy the following inequalities, for input observers for force estimation,” in Proc. Amer. Control Conf.,
every element i : Jun. 2009, pp. 89–95.
[7] J. M. Daly and D. W. L. Wang, “Time-delayed bilateral teleop-
αi > fi+ (39) eration with force estimation for n-dof nonlinear robot manipula-
 tors,” in Proc. IEEE/RSJ Int. Conf. Intell. Robot. Syst., Oct. 2010,
2 (αi + fi+ )(1 + pi ) pp. 3911–3918.
λi > . (40)
αi − fi+ (1 − pi ) [8] H. C. Cho, J. H. Park, K. Kim, and J.-O. Park, “Sliding-mode-
based impedance controller for bilateral teleoperation under varying
time delay,” in Proc. IEEE Int. Conf. Robot. Autom., Apr. 2001,
where pi is some chosen constant such that 0 < pi < 1. Then, pp. 1025–1030.
one can state the following corollary. [9] M. W. Spong and M. Vidyasagar, Robot Dynamics and Control. New
Corollary 7.1: Suppose that the parameters for the observer York, USA: Wiley, 1989.
(33) and (34) are selected according to the above conditions [10] J. Davila, L. Fridman, and A. Levant, “Second-order sliding-mode
observer for mechanical systems,” IEEE Trans. Autom. Control, vol. 50,
(39) and (40) for α and λ, and that condition (38) holds over no. 11, pp. 1785–1789, Nov. 2005.
the operational domain of the plant. Then, the variables of the [11] S. Drakunov and V. Utkin, “Sliding mode observers. tutorial.” in Proc.
observer converge in finite time to the states of the system, 34th Conf. Decision Control, Dec. 1995, pp. 3376–3378.
i.e., ( x̂1, x̂2 ) → (x1 , x2 ). Further, the i th component of the [12] J. M. Daly, “Output feedback bilateral teleoperation with force estima-
tion in the presence of time delays,” Ph.D. dissertation, Dept. Comput.
unknown vector ξ (t, x1, x2 , u) may be recovered in finite time Sci., Univ. Waterloo, Waterloo, ON, Canada, Apr. 2010.
as αi sign(x 1i − x̂ 1i )eq . [13] K. Gu, V. L. Kharitonov, and J. Chen, Stability of Time-Delay Systems.
Proof: See [12]. New York, USA: Birkhauser, 2003.

S-ar putea să vă placă și