Sunteți pe pagina 1din 7

A Stochastic Model Predictive Control Approach for Driver-Aided

Intersection Crossing With Uncertain Driver Time Delay


Alexander Katriniok, Stefan Kojchev, Erjen Lefeber, Henk Nijmeijer

Abstract— We investigate the problem of coordinating scheme relies on vehicle-to-vehicle (V2V) communication.
human-driven vehicles in road intersections without any traffic The driver is treated as an uncertain part of the control
lights or signs by issuing speed advices. The vehicles in the loop as his reaction to a speed advice might deviate from
intersection are assumed to move along an a priori known path
arXiv:1903.12497v1 [math.OC] 29 Mar 2019

and to be connected via vehicle-to-vehicle communication. The the expected one. This driver reaction is modeled as a
challenge arises with the uncertain driver reaction to a speed proportional controller, which follows speed advices with a
advice, especially in terms of the driver reaction time delay, as certain bounded offset, while the proportional gain and the
it might lead to unstable system dynamics. For this control speed offset are treated as uncertainties. However, the driver
problem, a distributed stochastic model predictive control reaction time has been neglected in this first study.
concept is designed which accounts for driver uncertainties. By
optimizing over scenarios, which are sequences of independent B. Main Contribution and Outline
and identically distributed samples of the uncertainty over the
prediction horizon, we can give probabilistic guarantees on We extend our previous work in [9] by additionally
constraint satisfaction. Simulation results demonstrate that the accounting for an uncertain but constant driver reaction
scenario-based approach is able to avoid collisions in spite of time, i.e., an uncertain but constant time delay. The control
uncertainty while the non-stochastic baseline controller is not.
problem is to provide smooth driver speed advices for safe
I. I NTRODUCTION and efficient intersection crossing, even for an uncertain time
delay. With the time delay, the open-loop prediction model
Automating road intersections is a frequently discussed
might become unstable which is unfavorable in an MPC
control problem, especially in the context of fully automated
setting. In addition, the uncertain system response to a speed
vehicles (AV) [1]. The potential of automation to increase
advice varies in a much wider range which complicates the
traffic flow, improve safety and reduce fuel consumption
calculation of smooth and convenient speed advices.
is significant. With a solution in place, one day we might
To solve the control problem, a scenario-based approach
even be able to dispense all traffic lights and signs. This
[10], [11] is pursued which draws independent and identi-
contribution focuses on an intermediate solution, which
cally distributed (i.i.d.) samples from a bounded uncertainty
aims at issuing speed advices to the driver, to achieve safe
set over the prediction horizon, referred to as scenarios.
intersection crossing without any traffic lights or signs.
Essentially, every sample reflects a potential realization of
A. Related Work the driver uncertainty. Ultimately, optimization is carried
out over all scenarios subject to constraints that need to be
For the coordination of AVs in intersections, various so-
satisfied for every scenario. With this methodology, we can
lutions have been proposed, e.g., those based on multi-agent
give probabilistic guarantees on constraint satisfaction and
systems [2], hybrid system theory [3], virtual platooning [4]
eventually on collision avoidance. To account for unstable
and model predictive control (MPC) [5], [6], [7].
system dynamics, a state feedback gain is introduced which
When discarding the assumption of fully automated vehi-
ensures stability for every uncertainty realization. Simulation
cles, [8] presents a robust MPC approach for determining
results finally prove that the scenario-based approach is able
safe gaps in the crossing traffic to pass the intersection
to avoid collisions when the driver reaction is uncertain while
and to optimize fuel efficiency. Thereby, no communication
the baseline MPC (neglecting uncertainty) is not. Hereafter,
is available and only the human driven ego-vehicle can
we mainly focus on a proof of concept while a real time
be controlled. Uncertainties in the motion of surrounding
solution of the control problem is part of ongoing research.
vehicles are covered by the robust approach. In [9], the
The paper is organized as follows. Section II outlines the
authors have proposed a distributed scenario-based MPC
MPC prediction model with a focus on the driver time delay
approach to orchestrate vehicles in intersections by issuing
extension. Section III continues with the design of a feedback
speed advices to the driver such that collisions between
gain to prestabilize the MPC prediction model. Then, the
vehicles (or more generally agents) are avoided and traffic
distributed scenario-based MPC scheme is introduced in
flow is optimized. For information exchange, the control
section IV before section V finally proves its efficacy.
A. Katriniok is with Ford Research & Innovation Center, 52072 Aachen,
Germany, de.alexander.katriniok@ieee.org. C. Notation
S. Kojchev, E. Lefeber and H. Nijmeijer are with the Dynamics and Con- The predicted value of variable x at the future time step
trol Group, Department of Mechanical Engineering, TU Eindhoven, 5600
MB Eindhoven, The Netherlands, S.Koychev@alumnus.tue.nl, k + j is referred to as x(k+j|k) . Moreover, [x]i refers to the
A.A.J.Lefeber@tue.nl, H.Nijmeijer@tue.nl. i-th entry of vector x while N+ is the set of positive integers.
II. V EHICLE AND D RIVER R EACTION M ODEL W [3] s[1]
To handle the complexity of intersection scenarios, we rely
on the following assumptions: L[3]
Assumption 1 (Intersection Scenarios): A1. Only sin-
gle intersection scenarios are considered; A2. The desired
[3] [2] [2] [1]
route of every agent is a priori known; A3. All vehicles are sc,2 , sc,3 sc,1 , sc,2
s[2]
human-driven; A4. Besides the driver reaction to a speed [2]
dc,1
advice, there are no further uncertainties; A5. All vehicles
[1] [3]
are equipped with V2V communication; A6. No commu- sc,3 , sc,1
nication failures occur; A7. Data that has been transmitted
after optimization at time k is available to all other agents
at time k + 1. A8. Vehicle kinematic states are measurable. current
position
A. Vehicle Kinematics
Vehicle kinematics of every agent i ∈ A with
A , {1, . . . , NA } is formulated in terms of its acceleration s[3]
[i]
ax , velocity v [i] and path coordinate s[i] in the agent’s Fig. 1: Schematic of the conflict resolution problem.
reference frame with respect to the vehicle’s geometric [i]
center, see Fig. 1. The origin s[i] = 0 of agent i’s reference Moreover, ∆vd is treated as an unmeasurable but bounded
[i]
frame coincides with the first collision point sc,l with agent additive uncertainty, i.e.,
l ∈ A along his path coordinate s[i] . In case, agent i is [i]
∆vd ∈ [∆v d , ∆v d ]
[i] [i]
(4)
not in conflict with any other agent, the origin refers to his [i] [i]
initial position. The time evolution of velocity and position is with vref + ∆vd ≥ 0 to avoid negative speed advices.
represented as a double integrator while drivetrain dynamics C. Resulting Prediction Model
are modeled as a first order lag element, i.e.,
 [i]   1     The driver reaction-vehicle model with input and state
[i] 1
a x
− [i] 0 0 ax [i]
[i]
delay is obtained when replacing ax,ref in (1) with (2), i.e.,
d  [i]   Tax   [i]   Tax  [i]
v = 1 0 0  v +  0  ax,ref (1) [i] [i] [i] [i] [i]
dt [i] ẋ[i] [i] [i] [i]
s 0 1 0 s[i] 0 v (t) = A1 xv (t) + A2 xv (t − τd ) + B vref (t − τd )
[i] [i]
| {z
[i]
} | {z } | {z } | {z }
[i] [i] [i]
+ E [i] ∆vd (t − τd ) (5)
Av xv Bv uv
[i] [i] [i] [i] [i] [i] [i] [i]
where Tax denotes the dynamic drivetrain time constant and with A1 , Av , A2 , −Kd Bv [0 1 0], B [i] , K d Bv
[i] [i] [i] [i]
ax,ref the demanded acceleration. and E , Kd Bv . To be applied in the MPC framework,
B. Driver Reaction Model a discrete-time formulation of the model is required. In case
[i] [i] [i]
[i] of τd = 0, we discretize (A1 + A2 , B [i] , E [i] ) using zero-
We assume the driver to receive a speed advice vref
order hold discretization as in [9].
from the MPC controller and to translate this advice into [i]
[i] For τd > 0, the continuous-time system (5) does not have
a vehicle acceleration demand ax,ref . Generally, we build
a finite dimensional discrete-time representation [13]. We
upon our approach in [9], in which the driver is modeled
[i] therefore digitalize the driver, i.e., we assume the driver to
as a proportional controller with gain Kd > 0 and a [i]
[i] sample the vehicle speed respectively the deviation from vref
bounded offset ∆vd to the speed advice. The latter takes through a (digital) speedometer and to keep his acceleration
into consideration that the driver might not be able to exactly demand constant between two sampling steps. This assump-
follow a speed advice. We extend this driver reaction model tion translates in adding a zero-order hold element between
[i]
by means of a driver reaction time τd ≥ 0 to perceive the the driver reaction model (2) and vehicle kinematics (1), see
speed advice and react accordingly [12] — corresponding Fig. 2. Thus, we are able to discretize both subsystems sepa-
[i]
to a time delay. The demanded vehicle acceleration ax,ref , rately and gain a finite dimensional state space representation
issued by the driver of agent i as a reaction on the speed [i] [i] [i] [i]
for any τd ∈ [τ d , τ d ]. We define τd , T [i] Ts − τ̃d as an
[i]
[i]
advice vref , can then be stated as [i]
integer multiple T ∈ N0 of the sampling time Ts minus
[i]
[i] [i]  [i] [i] [i]
ax,ref (t) = Kd vref (t − τd ) + ∆vd (t − τd )
[i]
(2) some remaining fraction 0 ≤ τ̃d < Ts of the time delay.
[i]
[i] [i]  With (2), we can state the demanded acceleration ax,ref at
− v (t − τd ) . the future time instance tk+T [i] in dependence of the delayed
[i] [i] [i] [i]
During controller synthesis and system operation, we cannot signals vref , ∆vref and v [i] at time tk+T [i] − τd = tk + τ̃d ,
be certain about the driver parameters. Therefore, we assume i.e.,
the driver model to be subject to an unmeasureable but
h
[i] [i] [i] [i] [i] [i]
[i] [i] ax,ref (tk+T [i] ) , Kd vref (tk + τ̃d ) + ∆vd (tk + τ̃d )
bounded parametric uncertainty θ[i] , (Kd , τd ) with i
[i]
[i] [i] [i] [i] [i]
Kd ∈ [K d , K d ], τd ∈ [τ d , τ d ].
[i]
(3) − v [i] (tk + τ̃d ) (6)
[i] [i] [i] [i]
where v [i] (tk + τ̃d ) , ΓA x[i]
v (tk ) + ΓB ax,ref (tk ), OPTIMIZER MPC [i] driver reaction+vehicle
Z τ̃d[i] ∆vd [i]
[i] [i] ax,ref
[i]
ΓA
 [i] [i] [i]
, 0 1 0 eAv τ̃d , ΓB , 0 1 0
   [i]
eAv s ds Bv[i] . δvref vref [i] [i]
v [i]
0 ZOH Gθ,d (s) ZOH Gv (s)
[i]
Before ax,ref (tk+T [i] ) is eventually applied to the ve- − −
hicle, it is delayed by the driver reaction time for v [i] driver vehicle
T [i] − 1 time steps. By introducing the time delay states
[i] [i] [i]
xτ (tk ) , [ax,ref (tk+T [i] −1 ), . . . , ax,ref (tk )]> , we can sum- [i] v [i]
marize those observations as Kv
 
(6)
x[i]
τ (tk+1 ) =   [i] (7) prestab. gain
IT [i] −1 0T [i] −1,1 xτ (tk )
where IT [i] −1 is the (T [i] − 1) × (T [i] − 1) identity Fig. 2: Control loop with digitalized driver, MPC controller
matrix. With the discrete-time vehicle kinematics model and prestabilizing gain.
[i] [i] [i] RT [i] [i]
( Āv , eAv Ts , B̄v , 0 s eAv s ds Bv ), the time evolu- III. S TABILIZING F EEDBACK G AIN
[i]
tion of the vehicle states xv can be stated as Before further outlining the controller design, we briefly
[i] [i]
x[i] [i] [i]
v (tk+1 ) = Āv xv (tk ) + B̄v ax,ref (tk ). (8) focus on how to handle unstable system dynamics, which
might occur when a driver time delay is present. Without
By augmenting the state vector with the time delay states [i]
time delay, i.e., for τd = 0 it can easily be proven that
[i] [i]
xτ,k , xτ (tk ) and using the vehicle velocity as control [i]
[i]
the plant model with control input vref and output v [i]
output, the resulting discrete-time linear system Σθ evolves is strictly stable for every realization of the driver gain
as [i] [i]
( [i] Kd > 0. When considering a driver reaction time τd > 0,
[i] [i] [i] [i] [i] [i]
[i] xk+1 = Aθ xk + Bθ uk + Eθ wk though, the prediction model might become unstable which
Σθ , [i] [i]
(9) is numerically unfavorable when it should be applied in the
yk = C [i] xk
MPC framework [14]. In this case, a common approach
with [i]

[i] [i]
 is to design a prestabilizing state feedback gain Kθ and
Āv 03,T [i] −1 B̄v eventually apply the prestabilized plant model for prediction
[i]
Aθ = −Kd[i] ΓA [i] [i] [i] 
01,T [i] −1 −Kd ΓB  , (10)

purposes [14]. This way, the control input can be written as
0T [i] −1,3 IT [i] −1 0T [i] −1,1 [i] [i] [i] [i]
  uk = Kθ xk + δuk (12)
03,1
[i]  [i] [i] [i] [i]
Bθ =  Kd[i]  , Eθ = Bθ , C [i] = [0 1 0 01,T [i] ]
 where δuk , δvref,k is the new corrective control input that
is determined by the MPC controller.
0T −1,1
With the MPC as discrete-time controller, the feedback
[i] [i],> [i],> [i]
where xk , [xv,k , xτ,k ]> refers to the state gain Kθ needs to be designed in the discrete-time domain
[i] [i] [i] [i]
vector, uk , vref (tk + τ̃d ) to the control input, as well. The main objective is to determine Kθ such that
[i] [i] [i] [i] [i]
wk , ∆vd (tk + τ̃d ) to the additive disturbance and the closed-loop system Aθ + Bθ Kθ is Schur stable for
[i]
yk = v [i] (tk ) to the system output. The attentive reader all realizations of the uncertainty θ[i] . For this purpose,
[i]
[i] we implement a proportional feedback controller Kv in
might have noticed that the initial condition xτ (t0 )
[i] accordance to Fig. 2.
depends on v [i] (t0 + τ̃d − nTs ) with n = 1, . . . , T [i] .
The discrete-time single-input single-output (SISO) driver
Therefore, in the MPC implementation, we measure and [i]
reaction+vehicle model in Fig. 2 with control input vref and
store the velocity v [i] with a frequency which is high [i]
control output v , can be represented in the z-domain as
enough to obtain an appropriate initial condition. As the [i] [i] [i] [i] [i]
Gθ (z) = (Gd,θ (z) Gv (z))/(1 + Gd,θ (z) Gv (z)) where
MPC is run with the fixed sample time Ts , we know that [i] [i]
[i] [i] [i] Gd,θ (z) and Gv (z) denote the discrete-time counterparts of
vref (tk + τ̃d − nTs ) = vref (tk − nTs ) holds for n ∈ N0 .
the continuous-time driver reaction and vehicle kinematics
D. Distances Between Agents transfer functions.
The distance between two agents i, l ∈ A is defined We utilize the Nyquist criterion to design a proportional
[i]
according to [5]. If two agents are potentially in conflict, feedback gain Kv < 0 on the vehicle velocity to stabilize
[i] [l] [i] [i] [i]
they share a common collision point sc,l respectively sc,i Gθ (z). Thus, we need to ensure that 1+Kv Gθ (z) has only
along their respective path coordinate. Otherwise, we define [i]
[i] [l]
zeros inside the unit disc. It can be proven that if Gθ (z) is
sc,l = sc,i = ∞. This way, the distance between agent [i] [i] [i]
strictly stable, Gθ (z)/(1 + Kv Gθ (z)) is strictly stable for
i and l is defined as the sum of the absolute distances [i] [i]
[i] [i] [l] [l] all Kv < 0. If Gθ (z) is unstable, we obtain a lower bound
dc,l , |s[i] − sc,l | and dc,i , |s[l] − sc,i | to the agents’ joint [i] [i] [i]
collision point, when existing, and infinite otherwise, i.e., K [i] [i]
v (θ ) and an upper bound K v (θ ) on Kv to ensure
[i]
( stability for the uncertainty realization θ . When considering
[i] [l] [i] [l]
[i] |s[i] − sc,l | + |s[l] − sc,i | , sc,l , sc,i 6= ∞ [i] [i] [i]
the parametric uncertainty θ = (K d , τ d ) with maximum
dl = (11)
∞ , otherwise. gain and maximum time delay, we get the largest lower and
[i]
smallest upper bound on Kv for every possible realization B. Scenario Model Predictive Control
[i] [i] [i] [i]
of θ[i] . We choose K [i]
v (θ ) ≤ Kv ≤ K v (θ ) and finally Hereafter, we outline the most important steps of
[i]
obtain the state feedback gain Kθ as Algorithm 1.
[i]
Kθ , 0 −Kv[i] 0 01,T [i] .
 
(13) 1) Scenario Generation: During scenario generation, K
different scenarios are sampled, each representing a poten-
Fig. 3 illustrates for the system in section V and 2000 tial driver reaction in terms of the parametric uncertainty
samples of the uncertainty θ[i] that the maximum absolute [i] [i] [i]
θ[i] = (Kd , τd ) and the additive uncertainty ∆vd . We as-
[i] [i] [i]
eigenvalue |λ(Aθ + Bθ Kθ )| of every prestabilized system sume that the driver does not change his general reaction over
is less than one, i.e., every prestabilized system is stable. the prediction horizon. The deviation from the recommended
speed, though, is considered to be time-varying over this
Max. abs. eigenvalue |λ(Aθ + Bθ Kθ )|

[i,κ] [i] [i]


interval, see [9]. To this end, we keep Kd ∈ [K d , K d ]
[i]

[i,κ] [i] [i]


1 and τd ∈ [τ d , τ d ] constant over the prediction horizon
[i]

[i,κ]
for scenario κ ∈ K while the velocity offset ∆vd,(k+j|k)
[i] [i]
is sampled from the interval [∆v d , ∆v d ] for κ ∈ K and
[i]

0.9
j = 0, . . . , N − 1.
We eventually gain the following sampled system model
0.8 for every scenario κ ∈ K (indicated by the superscript κ)
( [i,κ] [i,κ] [i,κ] [i,κ] [i] [i,κ] [i,κ]
[i,κ] xk+1 = Aθ,Kθ xk + Bθ δuk + Eθ wk
Σθ , [i,κ] [i,κ]
0.7 yk = C [i] xk (14)
2 [i,κ] [i,κ] [i,κ] [i]
1.6 1.2 where Aθ,Kθ , Aθ + Bθ Kθ is the closed-loop matrix
1.2 1 1.1 by applying feedback gain (13). Following (6), the time delay
0.8 0.9 [i,κ]
0.4 0.6 0.7 0.8 τd is sampled from a continuous interval which eventually
[i] 0 0.5 allows us to apply the stochastic MPC theory in [10].
Time delay τd [s] [i]
Driver gain Kd [s−1 ] Moreover, referring to (6), the initial condition of the time
[i] [i]
Fig. 3: Max. abs. eigenvalue of the prestabilized system in delay states xτ,0 depends on the uncertain speed offset ∆vd .
[i]
section V in dependence of Kd and τ [i] (2000 samples). This implies that we also need to sample the initial condition
[i,κ] [i]
x0 as xτ,0 is neither measurable nor observable.
IV. D ISTRIBUTED S TOCHASTIC O PTIMAL C ONTROL 2) Scenario Cost: The control objectives for every agent
A. Control Problem and Algorithm i ∈ A can be stated as follows: 1) the velocity v [i] of
[i]
The distributed control problem to solve can generally be every agent i should follow the set speed vset , being the
stated as follows: minimum of the driver selected speed (usually the speed
Problem 1: In spite of the parametric and additive un- limit) and some situation dependent upper bound (e.g., in
[i]
certainties (3)-(4), optimize the driver speed advice vref for curves); 2) driver speed advices should be smooth, as such
every agent i such that collisions are avoided, the agent’s step changes should be small; 3) longitudinal accelerations
cost function is minimized and constraints are satisfied. should be minimized for efficient driving and 4) jerk should
be minimized for the sake of comfort. We phrase these
The general idea of scenario-based MPC is to minimize,
objectives in terms of the following quadratic objective
for every agent i, an average cost over scenarios κ ∈ K,
function in dependence of scenario κ ∈ K
K , {1, . . . , K}, which are generated by drawing i.i.d. sam- N
ples of the uncertainty over the prediction horizon, subject [i,κ] [i]
J [i,κ] (x0 ,δu(·|k) ) , Q[i]
X [i] [i,κ]
(vset,(k+j|k) − v(k+j|k) )2
to constraints that need to be satisfied for every scenario j=1
[10]. Algorithm 1 sketches the algorithm which is applied N −1
to coordinate the agents in the intersection. [i,κ],2
X
+ R[i] ∆u(k+j|k) (15)
Algorithm 1: Scenario MPC at time k, Agent i ∈ A j=0
[l]
1) Receive data via V2V: Receive distances di,(·|k) to N N
collision points from all agents l 6= i. [i,κ],2 [i] [i,κ],2
X X
+ Sa[i] ax,(k+j|k) + S∆a ∆ax,(k+j|k)
2) Scenario Generation: Sample K scenarios. j=1 j=1
3) Scenario Cost: Determine an average scenario cost. where x0
[i,κ]
= xk
[i,κ]
denotes the sampled initial condition
4) Scenario Constraints: Impose input, state and safety [i] [i] [i]
at time k, δu(·|k) = [δu(k|k) , . . . , δu(k+N −1|k) ]> the
constraints for every scenario.
vector of corrective control actions over the prediction
5) Scenario Optimization: Solve a single OCP which [i,κ] [i,κ] [i,κ]
horizon of length N , ∆u(k+j|k) = u(k+j|k) − u(k+j−1|k)
optimizes over K scenarios s.t. scenario constraints.
[i] the step change of the resulting control input
6) Broadcast data via V2V: Broadcast distances dl,(·|k) . [i,κ] [i] [i,κ] [i]
[i],?
u(k+j|k) = Kθ x(k+j|k) + δu(k+j|k) while Q[i] > 0,
7) Apply Control: Apply first element u(k|k) . Go to 1). [i] [i]
R[i] > 0, Sa > 0 and S∆a > 0 are positive weights.
3) Scenario Constraints: Besides control objectives, lo- To avoid the necessity to transmit the trajectories of every
[l]
cal agent constraints and, most important, constraints for scenario, every agent l computes the distance dc,i,(k+j|k)
global collision avoidance need to be accommodated as well. to its collision point based on the center of the interval
[l,κ] [l,κ]
In terms of local agent constraints, the speed advice [minκ∈K {s(k+j|k) }, maxκ∈K {s(k+j|k) }]. The length of this
[i,κ] [i,κ]
vref plus speed offset ∆vd should be constrained for [l]
interval, denoted as ∆L(k+j|k) , is leveraged to increase the
every agent i ∈ A such that only positive speeds are [l]
recommended and a driver selected upper speed bound (close safety distance of agent i. In the end, only dc,i,(k+j|k) and
[l]
to the speed limit) is accounted for. This claim is formulated ∆L(k+j|k) need to be transmitted to the other agents [9].
[i,κ]
as a constraint on the resulting control input u(k+j|k) for 4) Scenario Optimization: We have decomposed the
j = 0, . . . , N − 1 and every scenario κ ∈ K, i.e., control problem by separating the local cost functions and
[i,κ] [i,κ] constraints while collision avoidance constraints are only
u(k+j|k) ∈ U(k+j|k) ∀κ ∈ K, with (16) imposed on agents with lower priority. Summarizing, the
n
[i,κ] [i,κ] [i] local OCPs can be cast as
U(k+j|k) , u ∈ R | 0 ≤ u + ∆vd,(k+j|k) ≤ v (k+j|k)
[i] [i,κ] [i] [i]
o Distributed Scenario OCP, every agent i ∈ A solves:
∧ u = Kθ x(k+j|k) + δu(k+j|k) , δu(k+j|k) ∈ R K
[i] 1 X [i,κ] [i,κ] [i]
where v (k+j|k) is an appropriately selected upper bound. minimize J (x0 , δu(·|k) ) (21)
δu
[i] K
Through the state constraint (·|k) κ=1

[i,κ] [i]
n subject to system dynamics (14)
x(k+j|k) ∈ X(k+j|k) , x ∈ R3 | a[i]x ≤ [x]1 ≤ ax
[i]
(17)
o safety constraints (20)
[i]
∧ 0 ≤ [x]2 ≤ v (k+j|k) input (16) & state constraints (17), (18).
for j = 1, . . . , N and every scenario κ ∈ K, we also bound Scenario OCP (21) is a non-convex quadratically constrained
the actual velocity and constrain the vehicle acceleration to quadratic program (QCQP). To solve the QCQP, we apply
accommodate physical vehicle limitations and safe driving. the penalty convex-concave procedure [15] as local method
[i]
Following [9], we impose the lower bound v mean on the which iteratively solves a convex quadratic problem (QP).
mean velocity over the prediction horizon when approaching [i],? [i] [i,1] [i],?
Finally, the control input uk = Kθ xk + δu(k|k) is
a certain distance to the intersection, i.e., [i] [i,1] [i] [i,K]
applied to the plant where Kθ xk = . . . = Kθ xk .
N
1  [i] X [i,κ] 
vk + v(k+j|k) ≥ v [i]
mean , ∀κ ∈ K. (18) C. Constraint Violation Probability
N +1 j=1 For a centralized scenario MPC scheme, [10], [16] have
Particularly, we claim that the prediction horizon at least proven that scenario constraints are satisfied with a certain
covers the coordinate interval where potential collisions probability that depends on the number K of scenarios. In
might occur with other agents. This way, convergence and this work, the uncertainties of every agent i ∈ A are assumed
feasibility of the distributed control scheme is ensured [9]. to be independent from each other, such that sampling can be
To ascertain collision avoidance, we follow our approach carried out independently as well. Consequently, the theory
in [9]. Essentially, collision avoidance constraints need to be in [10] also holds for our distributed setup. According to
satisfied jointly, thus requiring a certain consensus among [10], the most relevant criterion to guarantee closed-loop
agents. To enforce consensus, we introduce time-invariant constraint satisfaction with a certain probability is the first
priorities on the agents that are determined once and held predicted step constraint violation probability at time step
constant during the maneuver. Therefore, we define an in- k + 1. With the parametric uncertainty θ[i] , the additive
[i]
jective prioritization function γ : A → N+ which assigns (uncertain) disturbance ∆vd and a control input vector of
a unique priority to every agent, where a lower value dimension one, we obtain the following upper bound on the
corresponds to a higher priority. We specify the prioritized first predicted step (and as such closed-loop) state constraint
[i]  [i]
conflict set Ac,γ , l ∈ A | l 6= i ∧ γ(l) < γ(i) ∧ sc,l 6= ∞ violation probability for an arbitrary scenario κ
containing the agents l ∈ A which have a joint collision
n
[i,κ] [i]
o 1
P x(k+1|k) ∈/ X(k+1|k) ≤ . (22)
point with agent i but a higher priority. Safety constraints 1+K
can thus be phrased as Given that only the worst case scenarios are broadcasted
[i,κ] [i] [i] to the other agents (see section IV-B.3), collision avoidance
dl,(k+j|k) ≥ dsafe,l,(k+j|k) , ∀l ∈ Ac,γ (19) constraints might be violated with an even lower probability.
for j = 1, . . . , N and κ ∈ K with an appropriate safety With the driver being eventually in charge of vehicle control,
[i]
distance dsafe,l,(k+j|k) . Ultimately, only the agent with lower we consider a probabilistic guarantee on collision avoidance
priority has to impose this safety constraint. With definition to be appropriate for the given application.
[i]
(11) of dl,(k+j|k) , we can recast (19) in the form [9] V. S IMULATION R ESULTS
[i,κ] [i] [i] [l] A. Simulation Setup
(s(k+j|k) − sc,l )2 ≥ (dsafe,l,(k+j|k) − dc,i,(k+j|k) )2 , (20)
[i] [i] [l] To assess the validity of the stochastic approach, an urban
∀l ∈ Ac,γ : dsafe,l,(k+j|k) > dc,i,(k+j|k) .
four way intersection scenario with four agents passing the
B. Discussion of Results
In Fig. 5, the i-th row illustrates the motion trajectories of
agent i for the stochastic MPC scheme. The three respective
columns, highlight from left to right: 1) the agent’s path
coordinate trajectory along with the trajectories of conflicting
agents; 2) the agent’s actual (colored solid), maximum (solid
black) and minimum mean velocity (dashed gray) together
with the speed advice (colored dashed), the tolerated speed
offset (colored patch) and the set speed (dashed black); 3) the
actual vehicle acceleration. When agent i imposes a safety
Fig. 4: Intersection scenario: Four straight passing agents. constraint with respect to agent l, a colored polygon indicates
intersection straight is considered, see Fig. 4. Every agent the coordinate interval over time that must not be entered by
has a length of L[i] = 4.87 m, a width of W [i] = 1.85 m the trajectory of agent i. The fifth row (scenario MPC) and
and a dynamic drivetrain time constant of Tax = 0.3 s.
[i] sixth row (baseline MPC) show a closer insight into the time
Parameter sets that represent a typical human driver are interval when agents are close to each other and collisions
hard to determine. For the driver gain (in s−1 ) and the might occur. For reasons of brevity, we do not illustrate any
speed offset (in m/s), we choose similar parameter intervals motion trajectories for the baseline control scheme besides
like in our previous study [9]. The time delay (in s) is those in the sixth row.
selected in accordance to studies on platooning [12]. In In the given scenario, agent 3, having the lowest priority,
the simulation model, the following parameters are used: crosses the intersection first by speeding up from 11.1 m/s
[1] [1] [1] [2]
Kd = 0.55, τd = 1.5, ∆vd = 0.5, Kd = 1.0, τd = 1.9,
[2] to 12.6 m/s with a moderate acceleration of 1 m/s2 to avoid
[2] [3] [3] [3] collisions with agent 1 and agent 4. Evidently, the non-
∆vd = −0.3, Kd = 0.7, τd = 1.8, ∆vd = −0.4,
[4] [4] [4] convex problem formulation bears the advantage to let agents
Kd = 0.9, τd = 1.9, ∆vd = 0.2. During simulation,
[i] cross in a sequence which is different from their priorities.
∆vd is varied periodically in a step-wise manner by adding After agent 3, agent 1 passes the intersection without the
an offset which is bounded by the interval [−0.2, 0.2]. need to account for any other agent as he owns the highest
For the stochastic MPC scheme, the uncertainty intervals priority. Agent 2 crosses the intersection after agent 1 by
[i] [i]
are selected as: K d = 0.5 s−1 , K d = 1.2 s−1 (driver reducing his speed to 9.3 m/s with a maximum deceleration
[i] [i]
gain bounds); τ d = 0 s, τ d = 2 s (time delay bounds); of 1.7 m/s2 . Finally, agent 4, who exhibits the second lowest
[i] [i]
∆v d = −1 m/s and ∆v d = 1 m/s (speed offset bounds). priority, crosses the intersection after agent 2. To safely avoid
[i] a collision with agent 2, agent 4 needs to slow down to 8 m/s
Given those intervals, the feedback gain Kv ∈ [−1, −0.45]
ensures a stable prediction model. As a trade-off between with a maximum deceleration of 2.3 m/s2 . For every agent,
the resulting settling time and damping, we have chosen it is evident that the corresponding speed advices are very
[i] smooth and easy to follow for a human driver. Even despite a
Kv = −0.59. Moreover, K = 99 scenarios are generated
for optimization — with (22), this implies an upper constraint time delay of up to 1.9 s, it can be recognized that the state
violation probability bound of 1%. and input trajectories satisfy constraints and do not show
The performance of the stochastic scheme is compared any noticeable oscillations. Without an appropriate feedback
with a baseline MPC controller which only exploits a single gain, simulation studies have shown that severe oscillations
[1]
realization of the driver parameters: Kd = 0.6, τd = 1.2,
[1] and unsmooth speed advices might occur.
[2] [2] [3] [3] [4] The last two rows in Fig. 5 finally provide evidence
Kd = 0.8, τd = 1.5, Kd = 0.9, τd = 1.2, Kd = 1.1,
[4] [i] that the scenario MPC scheme is able to avoid collisions
τd = 1.0 while ∆vd is set to zero for every agent.
between agents while the baseline MPC scheme, neglecting
For both MPC regimes, the remaining parameters are set
uncertainties, is not. Although agent 2 does not violate safety
equally, i.e., using a sample time of 0.25 s and a horizon
constraints for both strategies, agent 3 collides with agent 1
length of N = 40 (i.e., a preview time of 10 s). With this
and agent 4 with agent 2 in case of the baseline controller.
horizon length, the settling time is covered for 93% of
We can conclude that, despite uncertainties, the stochastic
all sampled system models (based on a 5000 sample anal-
MPC scheme satisfies all our requirements.
ysis) which has turned out to be sufficient. Furthermore,
the following weights are applied: Q[i] = 0.5, R[i] = 20, VI. CONCLUSION AND FUTURE WORK
[i] [i]
Sa = 5, S∆a = 1. While agent 3 has selected a set speed of We have proposed a distributed stochastic MPC scheme
11.1 m/s, all other agents apply the speed limit of 13.9 m/s which provides speed advices to the driver in order to allow
as set speed. The driver selected upper velocity bound v [i] for safe and efficient intersection crossing without any traffic
is set 10% (i.e., 1.39 m/s) higher than the speed limit. lights or signs. As an extension of our previous works, we
[i]
Longitudinal accelerations are bounded by ax = −7 m/s2 include the driver reaction time delay as an additional para-
[i]
and ax = 4 m/s2 . Moreover, agent priorities are fixed as in metric uncertainty in our control concept. Simulation results
[9]: γ(1) = 1, γ(2) = 2, γ(3) = 4, γ(4) = 3. To solve OCP provide evidence that state, input and collision avoidance
(21), qpOASES [17] is utilized as QP solver. constraints are satisfied in spite of uncertainty. Future work
Position [m] Velocity [m/s] Acceleration [m/s2 ]
Agent 1 (Prio. 1) max. velocity
100 15 0.2
50 10
0 0
−50 5 Act.
−100 Ref.
Set spd.
−0.2
min. mean velocity
100 0
Agent 2 (Prio. 2) max. velocity 2
50 15
1
0 10 0
−50 Act.
5 −1
−100 Ref.
Set spd. min. mean velocity −2
0
100 Agent 3 (Prio. 4) max. velocity 1
15
50 0.5
10 0
0
Act.
−50 5 Ref.
−0.5
Set spd. min. mean velocity −1
−100 0
100 max. velocity 2
Agent 4 (Prio. 3) 15
50 1
0 10 0
−50 Act. −1
−100 5 Ref. min. mean
Set spd. velocity −2
−150 0
0 5 10 15 20 0 5 10 15 20 0 5 10 15 20
20
Scenario MPC

20 Agent 2 Time [s] Agent 3 Time [s] Agent 4 Time [s]


0 20
0
−20 0
−40 −20 −20
8 10 12 10 12 14 10 11 12 13
Baseline MPC

Agent 2 20 Agent 3 20 Agent 4


20
COLLISION
0 0 0 COLLISION
−20
−40 −20 −20
8 10 12 10 9 11 12 9 10 11 12
Time [s]
Fig. 5: Scenario MPC is able to avoid collisions among agents in spite of uncertainty while Baseline MPC is not.
aims at reducing the computational effort of the stochastic [8] G. Schildbach, M. Soppert, and F. Borrelli, “A collision avoidance
OCP and at verifying results in experimental tests. system at intersections using Robust Model Predictive Control,” in
IEEE Intelligent Vehicles Symposium, 2016, pp. 233–238.
R EFERENCES [9] A. Katriniok, S. Kojchev, E. Lefeber, and H. Nijmeijer, “Distributed
Scenario Model Predictive Control for Driver Aided Intersection
[1] L. Chen and C. Englund, “Cooperative intersection management: A Crossing,” in European Control Conference, 2017, pp. 1746–1752.
survey,” IEEE Transactions on Intelligent Transportation Systems, [10] G. Schildbach, L. Fagiano, C. Frei, and M. Morari, “The Scenario
vol. 17, no. 2, pp. 570–586, 2016. Approach for Stochastic Model Predictive Control with Bounds on
[2] H. Kowshik, D. Caveney, and P. R. Kumar, “Provable Systemwide Closed-Loop Constraint Violations,” Automatica, vol. 50, no. 12, pp.
Safety in Intelligent Intersections,” IEEE Transactions on Vehicular 3009 – 3018, 2014.
Technology, vol. 60, no. 3, pp. 804–818, 2011. [11] M. Farina, L. Giulioni, and R. Scattolini, “Stochastic linear Model
[3] M. R. Hafner, D. Cunningham, L. Caminiti, and D. D. Vecchio, Predictive Control with chance constraints - A review,” Journal of
“Cooperative collision avoidance at intersections: Algorithms and ex- Process Control, vol. 44, pp. 53 – 67, 2016.
periments,” IEEE Transactions on Intelligent Transportation Systems, [12] M. Treiber, A. Kesting, and D. Helbing, “Delays, inaccuracies and
vol. 14, no. 3, pp. 1162–1175, Sept 2013. anticipation in microscopic traffic models,” Physica A: Statistical
[4] A. I. M. Medina, N. v. d. Wouw, and H. Nijmeijer, “Automation of a T- Mechanics and its Applications, vol. 360, no. 1, pp. 71 – 88, 2006.
intersection Using Virtual Platoons of Cooperative Autonomous Vehi- [13] K. J. Astrom and B. Wittenmark, Computer Controlled Systems:
cles,” in IEEE International Conference on Intelligent Transportation Theory and Design, 3rd ed. Prentice Hall, 1996.
Systems, 2015, pp. 1696–1701. [14] J. Maciejowski, Predictive Control with Constraints. Prentice Hall,
[5] A. Katriniok, P. Kleibaum, and M. Joševski, “Distributed Model Harlow, 2002.
Predictive Control for Intersection Automation Using a Parallelized [15] T. Lipp and S. Boyd, “Variations and extension of the convex–concave
Optimization Approach,” in IFAC World Congress, vol. 50, no. 1, 2017, procedure,” Optimization and Engineering, vol. 17, no. 2, pp. 263–287,
pp. 5940–5946. 2016.
[6] G. R. de Campos, P. Falcone, R. Hult, H. Wymeersch, and J. Sjöberg, [16] L. Fagiano, G. Schildbach, M. Tanaskovic, and M. Morari, “Scenario
“Traffic coordination at road intersections: Autonomous decision- and Adaptive Model Predictive Control of Uncertain Systems,” in IFAC
making algorithms using model-based heuristics,” IEEE Intelligent Conference on Nonlinear MPC, vol. 48, no. 23, 2015, pp. 352–359.
Transportation Systems Magazine, vol. 9, no. 1, pp. 8–21, 2017. [17] H. Ferreau, C. Kirches, A. Potschka, H. Bock, and M. Diehl,
[7] J. Shi, Y. Zheng, Y. Jiang, M. Zanon, R. Hult, and B. Houska, “qpOASES: A parametric active-set algorithm for quadratic program-
“Distributed control algorithm for vehicle coordination at traffic in- ming,” Mathematical Programming Computation, vol. 6, no. 4, pp.
tersections,” in European Control Conference, 2018, pp. 1166–1171. 327–363, 2014.

S-ar putea să vă placă și