Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Pursuit-Evasion Games with Multi-Pursuer:a decomposition approach.
Adriano Festa
(join work with Richard B. Vinter)
EEE Department, IC London.
27th November 2012
G. Castelnuovo, Sapienza Universita di Roma
Festa-Vinter Multi-Pursuer Differential Games
Outline of the talk
1 Starting example: Surge Tank Control
2 Pursuit-Evasion games
3 A decomposition technique vs high dimensionality
4 Numerical Tests
5 Concluding Remarks
Festa-Vinter Multi-Pursuer Differential Games
The Surge Tank Control Problem
Falugi, Kountouriotis, Vinter. Differential Games Controllers ThatConfine a System to a Safe Region in the State Space, WithApplications to Surge Tank Control. IEEE Trans. Automat. Contr.57(11): 2778-2788 (2012)
Festa-Vinter Multi-Pursuer Differential Games
The Surge Tank Control Problem
dx(t)dt
= f (x(t),a(t)) + σ(x(t))b(t)
=
[0 10 0
]x(t) +
[01
](−a(t) + b(t)) .
The constraints that the surge tank must neither overflow orempty are expressible (in normalized units) as
−1 < x1(t) < +1 .
so x ∈ Ω := (−1,1)× R.Permitted tolerances on the Max Rate of Change of Outflow(MROC) index are captured by the additional constraint on theoutflow:
−1 ≤ a(t) ≤ +1 .
Festa-Vinter Multi-Pursuer Differential Games
Surge Tank as a Differential Game
A := {a(.) : [0,∞)→ R | a(t) ∈ [−1,1]} .B := {b(.) : [0,∞)→ R} .
The space Φ of closed loop controls for the a player is
Φ := {non-anticipative mappings φ(.) : B → A} .
The Differential game is: find
v(x) = supφ∈Φ
infb∈B
J (x , φ(b(.)),a(.))
where the payoff function is
J(x ,a,b) :=∫ τx
0
(12|b(t)|2 + θ
)dt .
with θ ≥ 0 (design parameter) and τx first exit time from Ω.
Festa-Vinter Multi-Pursuer Differential Games
Safe Region for the Surge Tank
Festa-Vinter Multi-Pursuer Differential Games
Safe Region for the Surge Tank
Festa-Vinter Multi-Pursuer Differential Games
Safe Region for the Surge Tank
Festa-Vinter Multi-Pursuer Differential Games
Associated Optimal Control Problems
The set Ω = {x | − 1 < h(x) < +1} can be represented
Ω = Ω1 ∩ Ω2where
Ω1 = {x |h(x) < +1}, Ω2 = {x | − 1 < h(x)}
For every x ∈ Ωj , j = 1,2, consider the optimal control problem
(P jx )
vj(x) = infb∈B
∫ τx0 (
12 |b(t)|
2 + θ)dtẏ(t) = f (y(t),aj) + σ(y(t))b(t) a.e. on [0, τx )y(0) = x , hj(y(τx )) = 0 .
aj is frozen at value of a, driving away from boundary of Ωj
v(x) = min{v1(x), v2(x)} ∀x ∈ Ω
Festa-Vinter Multi-Pursuer Differential Games
Comparison with min-max methods
Figure: Value function using a minmax technique (left) anddecomposition procedure (right), θ = 10.
Festa-Vinter Multi-Pursuer Differential Games
Pursuit-Evasion games
The dynamic system is modelled as{y ′(t) = −g(y(t))a(t) + h(y(t))b(t) + l(y(t))y(0) = x
(1)
where y(t) ∈ Ω ⊂ RN is the state, and a and b are the controls.
We assumeg : Ω→ RNh : Ω→ RN are continuous
A,B are compact metric spaces(2)
and, for some constant L,
|g(x)− g(y)|+ |h(x)− h(y)| ≤ L|x − y | ∀x , y ∈ RN . (3)
Festa-Vinter Multi-Pursuer Differential Games
Pursuit-Evasion games
We take as admissible control
A := {a : [0,+∞[→ A measurable } (4)
B := {b : [0,+∞[→ B measurable } (5)
and we consider only a ∈ A, b ∈ B.We are given a closed set T ⊆ RN , and define
tx (a,b) :={
min{t : yx (t ; a,b) ∈ T }+∞ if yx (t ; a,b) /∈ T ∀t .
(6)
the first player ”a” wants to minimize the time of hitting, and thesecond player ”b” wants to maximize the same cost.
Festa-Vinter Multi-Pursuer Differential Games
PE games
We re-normalize these costs by the nonlinear transformation
ψ(u) :={
1− e−u if u < +∞1 if u = +∞. (7)
and consider the discounted cost functional
J(x ,a,b) = ψ(tx (a,b)) =∫ tx
0e−sds. (8)
Festa-Vinter Multi-Pursuer Differential Games
PE games
We need the notion of nonanticipating strategy; for the firstplayer is
Γ := {α : B → A : t > 0,b(s) = b̃(s) for all s ≤ timplies α[b](s) = α[b̃](s) for all s ≤ t} (9)
for the second player is defined in the analogous way is ∆.
The lower and the upper values for the game are
v(x) := supβ∈∆
infa∈A
J(x ,a, β[a]) = infα∈Γ
supb∈B
J(α[b],b) (10)
the fact that lower and upper value are coincident is due to thenature of the dynamics and the cost functional.
Festa-Vinter Multi-Pursuer Differential Games
PE games
the value function v(x) is the viscosity solution of the followingHamilton-Jacobi-Isaacs equation{
v(x) + H(x ,Dv(x)) = 0 x ∈ Ω \ Tv(x) = 0 x ∈ ∂T (11)
where
H(x ,p) := maxa∈A
minb∈B{−(g(x)a− h(x)b + l(x)) · p} − 1
= minb∈B
maxa∈A{−(g(x)a− h(x)b + l(x)) · p} − 1. (12)
Festa-Vinter Multi-Pursuer Differential Games
Why the value function?
Solving this equation, and therefore getting the value functionof the game, we can get the optimal behavior for every playerfrom the starting point x0 as
a(t) = S(yx0(t))S(z) ∈ argmaxa∈A minb∈B
{−(−g(x)a + h(x)b + l(x)) · Dv(x)}
(13)b(t) = W (yx0(t))W (z) ∈ argminb∈B maxa∈A
{−(−g(x)a + h(x)b + l(x)) · Dv(x)}.
(14)
Festa-Vinter Multi-Pursuer Differential Games
Example 1
We consider the pursuit-evasion game with two pursuers p1,p2and one evader e where all the agents are free to move in the1D space with various velocities.
p′1 =23a1
p′2 = a2e′ = b2p1(0) = p01p2(0) = p02e(0) = e0
(15)
where a1,a2,b ∈ B(0,1) = [−1,1], p1,p1,e ∈ R.
Capture happens when mini∈{1,2} |pi − e| ≤ r with somer ≥ 0.We underline that this problem is in a space of dimensionthree.
Festa-Vinter Multi-Pursuer Differential Games
Example 1- reduced dynamics
We get the reduced dynamicsy ′1 = −
23a1 +
b2
y ′2 = −a2 +b2
y1(0) = p01 − e0
y2(0) = p02 − e0
(16)
here we have a1,a2,b ∈ B(0,1), y1, y2 ∈ [0,+∞] and
T := {(y1, y2) ∈ R2 : mini∈{1,2}
|yi | ≤ r}. (17)
The HJI equation associated to the problem isv(x) + max
a1,a2∈Aminb∈B
{−(−23a1 +
b2 ,−a2 +
b2 ) · Dv(x)
}= 1
x ∈ [0,+∞]2 \ Tv(0) = 0 x ∈ ∂T
(18)Festa-Vinter Multi-Pursuer Differential Games
A decomposition technique vs hightdimensionality
I need some additional Hypotheses:
We assumeA = Bn(0, ρa1)× Bn(0, ρa2)× ...× Bn(0, ρaM ),B = [Bn(0, ρb)]m,g(x)ρa − h(x)ρb − |l(x)| > 0, ∀x ∈ Ω.
(19)
where with [Bn(0, ρb)]m we mean the space
{(b1,b2, ...,bn︸ ︷︷ ︸,b1, ...,bn, ...,b1, ...,bn)︸ ︷︷ ︸m times
∈ RN : |(b1, ...,bn)| = ρb}
Festa-Vinter Multi-Pursuer Differential Games
Under these Hypotheses we can show
PropositionH(x ,p) is convex with respect to the variable p.
H(x ,p) = g(x) maxa∈A{a · p} − h(x) max
b∈B{−b · p} − l(x) · p − 1
Figure: in this case N = 2,m = 2,n = 1.
Festa-Vinter Multi-Pursuer Differential Games
Decomposition
We consider the following class of problems, withi ∈ I := {1, ...m} ⊂ N,{
u(xi) + H (x ,Dui(x)) = 0 x ∈ Ω \ Tiui(x) = 0 x ∈ Ti
(20)
where T := ∪iTi .
Theorem
Let assume the standard Hypotheses and (19).Then we have that
u(x) := min{ui ; i ∈ I} (21)
is the unique value function of the P-E game.
Festa-Vinter Multi-Pursuer Differential Games
Useful? Example 1
T := T1 ∪ T2 with Ti := {(x1, x2) ∈ R2 : |yi | ≤ r};
the second equation of the dynamics doesn’t affect thedecomposed problem.
v(x1, x2) + min
a1∈Amaxb∈B
{−(23a1 −
b2 ) ·
∂∂x1
v(x1, x2)}
= 1
x1 ∈ (r ,+∞]v(0) = 0 x2 ∈ [0, r ]∂∂x2
v(x1, x2) = 0(22)
this is rather easy to solve. We get
v(x1, x2) = re−6(x1−r)−e−6(x1−r)+1 and u(x1, x2) = −6(x − r)(1− r)(23)
Festa-Vinter Multi-Pursuer Differential Games
Useful? Example 1
in the same way if we consider T2
v(x1, x2) = re−2(x1−r)−e−2(x1−r)+1 and u(x1, x2) = −2(x − r)(1− r).(24)
the Hypotheses of the Theorem are satisfied:g(x)ρa − h(x)ρb =
(23 −
12 ,1−
12
)= (16 ,
12) > 0
u(x1, x2) ={−6(x − r)(1− r) if x1 ≤ 13x2 +
23 r
−2(x2 − r)(1− r) if x1 > 13x2 +23 r .
(25)
Festa-Vinter Multi-Pursuer Differential Games
Example 1 - value function
In this way, solving two simpler problems (of dimention 1).We get the solution of the original one.
Festa-Vinter Multi-Pursuer Differential Games
Sketches of the proof I
DefinitionWe say that v : Ω→ R is semiconcave on the open convex setω if there exists a constant C ≥ 0 such that
λv(x) + (1− λ)v(y) ≤ v (λx + (1− λ)y) + 12
Cλ(1− λ)|x − y |2
(26)for all x , y ∈ Ω and λ ∈ [0,1].
We can show that in our case, every solution of a decomposedproblem is semicancave.Moreover we have the following property of s.c. functions. Said
D∗v(x) ={
p ∈ RN : p = limn→+∞
Dv(xn), xn → x}
we have D+v(x) = coD∗v(x).Festa-Vinter Multi-Pursuer Differential Games
Sketches of the proof II
We want to show that u = min{ui ; i ∈ I} is the viscosity solutionof the HJI equation associated to the PE game.
The minimum of a family of supersolution is alwayssupersolution.
We prove that u is subsolution, too.We know now that the propriety of semiconcavity is preservedtaking the minimum of a class of semiconcave functions, so
D+u(x) = coD∗u(x) ⊆ co {D∗ui(x)|i ∈ I}⊆ co {coD∗ui(x)|i ∈ I} = co
{D+ui(x)|i ∈ I
}(27)
Festa-Vinter Multi-Pursuer Differential Games
Sketches of the proof III
This implies, that a p ∈ D+u(x) and Λ = {λi , i ∈ Is.t .∑
i λi = 1}
p = (λ1, λ2...)·(p1,p2, ...) =∑
i
λipi (λ1, λ2...) ∈ Λ,pi ∈ D+ui(x)
(28)we know that for every pi ∈ D+ui(x) ,H(x ,pi) ≤ 0 for everyi ∈ I.
H(x ,p) = H
(x ,∑
i
λipi
)≤∑
i
λiH(x ,pi) ≤ 0. (29)
This shows that u(x) is a subsolution and conclude the proof.
Festa-Vinter Multi-Pursuer Differential Games
Decomposition technique for PE games
TheoremWe call I := {1,2, ...,m} and T := {yi ∈ Rn : min |yi | ≤ r}.Said vi : RN × I → R, xi = (xni , ..., xn(i+1)−1),
vi(xi) + max
ai∈Aminb∈B
{−fi(xi ,ai ,b) · ∂∂xi vi(xi)
}= 1
xi ∈ [0,+∞]n \ Tivi(xi) = 0 xi ∈ Ti∂∂x vi(x) = 0
(30)with fi(xi ,ai ,b) = −g(xi)ai(t) + h(xi)b(t) + l(xi).We have that the value function of the PE game is
u(x) := log(
1−mini∈I
vi(x)).
Festa-Vinter Multi-Pursuer Differential Games
Tag-Chase: example 1
(case1 c = 0.95)
Festa-Vinter Multi-Pursuer Differential Games
mov1.mpgMedia File (video/mpeg)
Tag-Chase: example 2
(case2 c = 0.95)
Festa-Vinter Multi-Pursuer Differential Games
mov2.mpgMedia File (video/mpeg)
Tag-Chase: example 3
(case3 c = 0.95)
Festa-Vinter Multi-Pursuer Differential Games
mov3.mpgMedia File (video/mpeg)
Remarks
The key idea to preserve convexity of the Hamiltonian is:
there is a “specular” behavior of the two players
the player who minimize “dominates” the other one
There is a generalization of this result, but until now, we needthe semiconcavity of the decomposed problems.(it’s a strong and inconvenient assumption)
Thank you.
Festa-Vinter Multi-Pursuer Differential Games
Remarks
The key idea to preserve convexity of the Hamiltonian is:
there is a “specular” behavior of the two players
the player who minimize “dominates” the other one
There is a generalization of this result, but until now, we needthe semiconcavity of the decomposed problems.(it’s a strong and inconvenient assumption)
Thank you.
Festa-Vinter Multi-Pursuer Differential Games
Remarks
The key idea to preserve convexity of the Hamiltonian is:
there is a “specular” behavior of the two players
the player who minimize “dominates” the other one
There is a generalization of this result, but until now, we needthe semiconcavity of the decomposed problems.(it’s a strong and inconvenient assumption)
Thank you.
Festa-Vinter Multi-Pursuer Differential Games