Article

An Analytic and Numerical Investigation of a Differential Game

Aviv Gibali and Oleg Kelis
1 Department of Mathematics, ORT Braude College, Karmiel 2161002, Israel
2 The Center for Mathematics and Scientific Computation, University of Haifa, Mt. Carmel, Haifa 3498838, Israel
3 Department of Mathematics, The Technion—Israel Institute of Technology, Haifa 3200003, Israel
* Author to whom correspondence should be addressed.
Submission received: 2 March 2021 / Revised: 12 April 2021 / Accepted: 15 April 2021 / Published: 17 April 2021
(This article belongs to the Special Issue Advances in Analysis and Control of Systems with Uncertainties)

Abstract

In this paper we present a singular, zero-sum, linear-quadratic differential game. One of the main features of this game is that the weight matrix of the minimizer’s control cost in the cost functional is singular. Due to this singularity, the game cannot be solved by applying either the Isaacs MinMax principle or the Bellman–Isaacs equation approach. As an application, we introduce an interception differential game with an appropriately regularized cost functional and develop a dual representation of it. By deriving the variational derivatives of this regularized cost functional, we apply Popov’s approximation method and show that the numerical results coincide with the dual representation.

1. Introduction

In this paper, we present a zero-sum differential game with linear dynamics and a quadratic cost functional. Such games appear in many areas of control theory, for example, robust controllability [1], pursuit-evasion [2,3,4,5], robust tracking [6] and robust investment [7], to name but a few.
The singularity of such a game is caused by the weight matrix of the minimizer’s control cost: the problem of minimizing the variational Hamiltonian with respect to the minimizer’s control has either infinitely many solutions or no solution at all. This makes the game challenging, since it cannot be solved by well-known approaches such as the Isaacs min-max principle [8] or the Bellman–Isaacs equation [8,9,10,11].
Known techniques in the literature involve higher-order optimality conditions, but their practical use is quite limited and not general enough [2,5]. Regularization approaches under additional assumptions have been studied by several researchers; see, e.g., [12]. Other related results include [13,14,15].
In [16], the differential game is tackled by introducing a cost functional containing the minimizer’s control cost. A regularization approach is then considered, yielding an auxiliary differential game with partial cheap control of the minimizer. While differential games with total cheap control of at least one of the players have been studied extensively (see, e.g., [1,6,17,18,19,20]), partial cheap control of at least one of the players has been studied by only a few authors [16].
In our recent work [21], a saddle-point reformulation of the zero-sum singular differential game was studied, and two gradient methods were presented and analyzed. That work considered a slightly more general game than [16], and a pursuit–evasion game illustrated the applicability of the numerical methods.
Following the above and continuing the work of [21], the objectives of this work are as follows:
  • Introduce an appropriate cost functional that includes the relative lateral separation in addition to the relative lateral velocity.
  • For the above functional, develop a dual representation and present its variational derivatives.
  • Present numerical calculations of the dual representation that yield the interception.
  • Validate the above via Popov’s approximation method.
The paper is organized as follows. We first recall some basic definitions and results in Section 2. In Section 3, an interception game is considered. In Section 4, the dual representation of the game’s cost functional is derived, followed by a numerical validation of the double-projection methods for finding saddle points in Section 5. Final conclusions are given in Section 6.

2. Preliminaries

A standard linear zero-sum differential game consists of the constraint
$$\frac{dz(t)}{dt} = A z(t) + B u(t) + C v(t), \qquad z(0) = z_0, \qquad t \in [0, t_f],$$
and a cost functional
$$J(u,v) \triangleq z^T(t_f) F z(t_f) + \int_0^{t_f} \Big[ z^T(t) D z(t) + u^T(t) G_u u(t) - v^T(t) G_v v(t) \Big]\, dt,$$
to be minimized by $u$ and maximized by $v$. The parameters involved in (1) and (2) are: $t_f$, a given final time moment; $T$, the transpose; $E^n$, the $n$-dimensional Euclidean space; $z(t) \in E^n$, the state vector; $u(t) \in E^r$ ($r \le n$) and $v(t) \in E^s$, the players’ square-integrable controls; $A$, $B$ and $C$, given matrices, where $B$ has full rank. Moreover, $z_0 \in E^n$ is a given initial vector; $F$, $D$ and $G_u$ are given positive semi-definite symmetric matrices; and $G_v$ is a positive definite symmetric matrix.
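For readers who wish to experiment with (1) and (2), the following minimal sketch (ours, not part of the original analysis) approximates the state equation by a forward-Euler step and the cost functional by a Riemann sum for scalar controls; the function names, the grid size and the sample data are illustrative assumptions only.

```python
import numpy as np

def evaluate_game(A, B, C, D, F, Gu, Gv, z0, u, v, tf, N=2000):
    """Forward-Euler approximation of dz/dt = A z + B u + C v, z(0) = z0,
    and a Riemann-sum approximation of
    J = z(tf)^T F z(tf) + int_0^tf [ z^T D z + Gu u^2 - Gv v^2 ] dt
    for scalar controls u(t), v(t) (illustrative sketch only)."""
    dt = tf / N
    t = np.linspace(0.0, tf, N + 1)
    z = np.array(z0, dtype=float)
    J = 0.0
    for k in range(N):
        uk, vk = u(t[k]), v(t[k])
        J += dt * (z @ D @ z + Gu * uk ** 2 - Gv * vk ** 2)  # running cost
        z = z + dt * (A @ z + B * uk + C * vk)               # Euler step
    J += z @ F @ z                                           # terminal cost
    return z, J

# Hypothetical data in the spirit of Section 3 (not the paper's exact runs).
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([0.0, 1.0])
C = np.array([0.0, 1.0])
D = np.zeros((2, 2))
F = 0.5 * np.eye(2)
z_tf, J = evaluate_game(A, B, C, D, F, Gu=0.01, Gv=4.0, z0=[0.0, 1.0],
                        u=lambda t: -0.5, v=lambda t: 0.1, tf=4.0)
print(z_tf, J)
```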
Next we recall several definitions.
Definition 1.
The differential control game (1) and (2) is called singular if all or some of the coordinates of the minimizer’s control are singular, that is, $G_u = 0$ or
$$G_u = \mathrm{diag}\big(g_{u_1}, \ldots, g_{u_q}, \underbrace{0, \ldots, 0}_{r-q}\big), \qquad g_{u_j} > 0, \quad j = 1, \ldots, q.$$
Definition 2.
The control game (1) and (2) is called regular if, for a small enough $\varepsilon > 0$, the cost functional (2) has one of the following structures:
$$J_\varepsilon(u,v) = z^T(t_f) F z(t_f) + \int_0^{t_f} \Big[ z^T(t) D z(t) + \varepsilon^2 u^T(t) u(t) - v^T(t) G_v v(t) \Big]\, dt,$$
or
$$J_\varepsilon(u,v) = z^T(t_f) F z(t_f) + \int_0^{t_f} \Big[ z^T(t) D z(t) + u^T(t) \big(G_u + E\big) u(t) - v^T(t) G_v v(t) \Big]\, dt,$$
where
$$G_u + E = \mathrm{diag}\big(g_{u_1}, \ldots, g_{u_q}, \underbrace{\varepsilon^2, \ldots, \varepsilon^2}_{r-q}\big).$$
Such regular cheap/partial cheap control games were analyzed in [12,16,22,23] using the study of the Riccati matrix differential equation for a finite-horizon game, and of the Riccati matrix algebraic equation for an infinite-horizon game.
In this work we study an interception game, which is a special regular differential game, in which the associated regularized cost functional is minimized with respect to $u$ and maximized with respect to $v$. This suggests exploring the equivalent saddle-point reformulation.
Consider the Hilbert space $L^2[0,t_f]$ and let $Q \subseteq L^2[0,t_f]$ and $R \subseteq L^2[0,t_f]$ be two closed, convex and bounded sets of admissible controls. Then, solving (1)–(4) is equivalent to solving the following min-max problem:
$$\min_{u \in Q} \max_{v \in R} J_\varepsilon(u,v),$$
where J ε is continuous, convex–concave (convex in u and concave in v) and differentiable. See [24] for further details.
A saddle-point reformulation of the min-max problem (7) is formulated as finding a point $(u^*, v^*) \in Q \times R$ such that
$$J_\varepsilon(u^*, v) \le J_\varepsilon(u^*, v^*) \le J_\varepsilon(u, v^*)$$
for all $v \in R$ and $u \in Q$.
Another known relationship, under the above assumptions on $J_\varepsilon$, is the following variational inequality reformulation. The saddle-point problem (8) is equivalent to finding a point $(u^*, v^*) \in Q \times R$ such that
$$\left\langle \begin{pmatrix} \dfrac{\delta J_\varepsilon}{\delta u}(u^*, v^*) \\[4pt] -\dfrac{\delta J_\varepsilon}{\delta v}(u^*, v^*) \end{pmatrix}, \begin{pmatrix} u \\ v \end{pmatrix} - \begin{pmatrix} u^* \\ v^* \end{pmatrix} \right\rangle \ge 0 \quad \text{for all } (u,v) \in Q \times R,$$
where $\langle \cdot, \cdot \rangle$ is an appropriate inner product and $\delta J_\varepsilon / \delta u$ and $\delta J_\varepsilon / \delta v$ are the variational derivatives of the functional $J_\varepsilon$, as will be explained later (they can be thought of as partial derivatives in the case of real functions).
Saddle-point problems (as well as variational inequalities) stand at the core of many real-world applications in convex programming, game theory and many more instances; see, e.g., Rockafellar [25]. In [21] we considered two gradient methods for solving saddle-point problems, the Arrow–Hurwicz–Uzawa algorithm [26] and Korpelevich’s extragradient method [27]; see also [28,29,30,31,32] and the many references therein.
These methods use gradients in each of their update rules. Thus we recall the variational/functional derivative definition next (see, for example, [33]).
Definition 3.
Consider an integral functional $J(x(t))$ of an argument $x(t)$. The variational/functional derivative of $J(x(t))$ with respect to $x(t)$, $\delta J / \delta x : [0, t_f] \to \mathbb{R}^n$, is defined by
$$\delta J(x) \cdot h = \int_0^{t_f} \frac{\delta J}{\delta x}(t)\, h(t)\, dt.$$
Recall that for a function $f : \mathbb{R}^n \to \mathbb{R}$, the gradient $\nabla f$ is defined by
$$\frac{d}{d\epsilon} f(x + \epsilon h)\Big|_{\epsilon = 0} = \nabla f(x) \cdot h.$$
The variational derivative works similarly; that is,
$$\frac{d}{d\epsilon} J(x + \epsilon h)\Big|_{\epsilon = 0} = \delta J(x) \cdot h,$$
and with (9), we get
$$\frac{d}{d\epsilon} J(x + \epsilon h)\Big|_{\epsilon = 0} = \int_0^{t_f} \frac{\delta J}{\delta x}(t)\, h(t)\, dt.$$
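This defining relation also gives a simple numerical check: the left-hand side can be approximated by central finite differences and compared with the integral of a candidate variational derivative against h. A minimal sketch (ours, with an illustrative quadratic functional) follows.

```python
import numpy as np

t = np.linspace(0.0, 4.0, 401)           # grid on [0, t_f] with t_f = 4 (illustrative)

def J(x):
    """Illustrative functional J(x) = ( int_0^tf x(t) dt )^2."""
    return np.trapz(x, t) ** 2

def dJ_dx(x):
    """Its variational derivative: (dJ/dx)(t) = 2 * int_0^tf x(s) ds, constant in t."""
    return 2.0 * np.trapz(x, t) * np.ones_like(t)

x = np.sin(t)                            # point at which we differentiate
h = np.cos(t)                            # direction of variation
eps = 1e-6
lhs = (J(x + eps * h) - J(x - eps * h)) / (2.0 * eps)  # d/de J(x + e h) at e = 0
rhs = np.trapz(dJ_dx(x) * h, t)                        # int (dJ/dx)(t) h(t) dt
print(lhs, rhs)                                        # the two values should agree
```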
Remark 1.
In a more general setting, where the zero-sum differential cost functional is not differentiable, subdifferentials/subgradients are needed; see [24,25,34] and references therein.
Now we recall the Arrow–Hurwicz–Uzawa method [26] and Korpelevich’s extragradient method [27], which we applied in our previous paper [21]. For simplicity we present the algorithms for solving variational inequalities, and clearly the translation to min-max and saddle-point problems can be easily derived.
Let $H_1, H_2$ be two real Hilbert spaces and let $F : H_1 \times H_2 \to \mathbb{R}$ be a bifunction with partial derivatives $\nabla_u F$ and $\nabla_v F$. Choose an arbitrary starting point $(u_0, v_0) \in Q \times R$ and a step size $\alpha > 0$. Given the current iterate $(u_k, v_k)$, the Arrow–Hurwicz–Uzawa update rule is formulated as follows:
$$u_{k+1} = P_Q\big(u_k - \alpha \nabla_u F(u_k, v_k)\big), \qquad v_{k+1} = P_R\big(v_k + \alpha \nabla_v F(u_k, v_k)\big),$$
where $P_Q$ and $P_R$ are the orthogonal projection operators onto the sets $Q$ and $R$, respectively.
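A minimal sketch (ours) of one Arrow–Hurwicz–Uzawa iteration for discretized controls is given below; the box projection standing in for P_Q and P_R and the step size are illustrative assumptions.

```python
import numpy as np

def project_box(x, lo=-5.0, hi=5.0):
    """Orthogonal projection onto the box [lo, hi]^n, an illustrative stand-in
    for the projections P_Q and P_R onto closed, convex and bounded sets."""
    return np.clip(x, lo, hi)

def ahu_step(u, v, grad_u, grad_v, alpha):
    """One Arrow-Hurwicz-Uzawa iteration: projected gradient descent in u
    and projected gradient ascent in v, both at the current iterate."""
    u_next = project_box(u - alpha * grad_u(u, v))
    v_next = project_box(v + alpha * grad_v(u, v))
    return u_next, v_next
```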
A related modification with weaker convergence assumptions is Korpelevich’s method, in which an additional mid-point and the corresponding gradients are computed; hence its name, the extragradient method:
$$\begin{aligned} \bar{u}_k &= P_Q\big(u_k - \alpha \nabla_u F(u_k, v_k)\big), & \bar{v}_k &= P_R\big(v_k + \alpha \nabla_v F(u_k, v_k)\big),\\ u_{k+1} &= P_Q\big(u_k - \alpha \nabla_u F(\bar{u}_k, \bar{v}_k)\big), & v_{k+1} &= P_R\big(v_k + \alpha \nabla_v F(\bar{u}_k, \bar{v}_k)\big). \end{aligned}$$
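The corresponding one-iteration sketch (ours; the projection operators and the step size are supplied by the caller) makes the predictor–corrector structure explicit:

```python
def extragradient_step(u, v, grad_u, grad_v, proj_Q, proj_R, alpha):
    """One Korpelevich extragradient iteration: a predictor step to the
    mid-point (u_bar, v_bar), then a corrector step from (u, v) using the
    gradients evaluated at that mid-point (sketch)."""
    u_bar = proj_Q(u - alpha * grad_u(u, v))
    v_bar = proj_R(v + alpha * grad_v(u, v))
    u_next = proj_Q(u - alpha * grad_u(u_bar, v_bar))
    v_next = proj_R(v + alpha * grad_v(u_bar, v_bar))
    return u_next, v_next
```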
Although convergence of the extragradient method is guaranteed under weaker assumptions than for the Arrow–Hurwicz–Uzawa method, each iteration still requires two evaluations of $\nabla F = (\nabla_u F, \nabla_v F)$ and two projections onto $Q$ and $R$. One step in the direction of simplifying the extragradient method with respect to the double projections is Censor et al.’s [30,31] subgradient extragradient method. In this method, the second orthogonal projection is replaced by an easily computed projection onto some constructible set $T_k$:
$$\begin{aligned} \bar{u}_k &= P_Q\big(u_k - \alpha \nabla_u F(u_k, v_k)\big), & \bar{v}_k &= P_R\big(v_k + \alpha \nabla_v F(u_k, v_k)\big),\\ u_{k+1} &= P_{T_k}\big(u_k - \alpha \nabla_u F(\bar{u}_k, \bar{v}_k)\big), & v_{k+1} &= P_{T_k}\big(v_k + \alpha \nabla_v F(\bar{u}_k, \bar{v}_k)\big). \end{aligned}$$
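The set T_k is not specified above; one common construction, taken from [30,31], is a half-space supported at the mid-point, whose projection has a closed form. The following sketch (our reading of that construction, shown for the u-component only) is illustrative:

```python
import numpy as np

def project_halfspace(x, a, y):
    """Projection onto the half-space T = { w : <a, w - y> <= 0 }:
    P_T(x) = x - max(0, <a, x - y>) / ||a||^2 * a  (if a != 0)."""
    s = float(np.dot(a, x - y))
    if s <= 0.0:
        return x.copy()
    return x - (s / float(np.dot(a, a))) * a

def subgradient_extragradient_u_update(u, u_bar, grad_at_current, grad_at_bar, alpha):
    """u-part of one subgradient extragradient iteration: the second projection
    is onto the half-space T_k supported at u_bar (our reading of [30,31])."""
    a = (u - alpha * grad_at_current) - u_bar   # normal of T_k = { w : <a, w - u_bar> <= 0 }
    return project_halfspace(u - alpha * grad_at_bar, a, u_bar)
```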
To avoid the extra evaluation of $\nabla F$ in each iteration, Popov [35] proposed the following modification, which introduces a so-called “leading” point:
$$\begin{aligned} \bar{u}_k &= P_Q\big(u_k - \alpha \nabla_u F(u_k, v_k)\big), & \bar{v}_k &= P_R\big(v_k + \alpha \nabla_v F(u_k, v_k)\big),\\ u_{k+1} &= P_Q\big(\bar{u}_k - \alpha \nabla_u F(u_k, v_k)\big), & v_{k+1} &= P_R\big(\bar{v}_k + \alpha \nabla_v F(u_k, v_k)\big). \end{aligned}$$
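A minimal sketch (ours) of one iteration of this update, read exactly as displayed above: the gradients are evaluated once at the current iterate and reused for both the leading point and the next iterate.

```python
def popov_step(u, v, grad_u, grad_v, proj_Q, proj_R, alpha):
    """One iteration of the Popov-type update displayed above: the gradients
    are evaluated once at (u, v) and reused both for the leading point
    (u_bar, v_bar) and for the next iterate (sketch)."""
    gu = grad_u(u, v)
    gv = grad_v(u, v)
    u_bar = proj_Q(u - alpha * gu)        # leading point
    v_bar = proj_R(v + alpha * gv)
    u_next = proj_Q(u_bar - alpha * gu)   # step taken from the leading point
    v_next = proj_R(v_bar + alpha * gv)
    return u_next, v_next
```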
A standard assumption, which we also use, for the convergence of the above methods is the so-called boundedness of the derivatives, which means that the functional derivatives of $J_\varepsilon(u_k, v_k)$ with respect to $u$ and $v$ are uniformly bounded; i.e., there is a constant $M > 0$ such that
$$\left\| \frac{\delta J_\varepsilon(u_k, v_k)}{\delta u}(t) \right\| \le M, \qquad \left\| \frac{\delta J_\varepsilon(u_k, v_k)}{\delta v}(t) \right\| \le M,$$
for all $k \ge 0$.
Since the introduction of the above methods, many modifications and extensions have been offered using various techniques—inertial, hybrid, viscosity and more; see, e.g., [36,37] and the references therein.

3. Interception Game

In this section we consider a particular singular problem (1)–(4), namely, $n = 2$, $r = 1$, $s = 1$. The matrices of coefficients in (1) and (2) are
$$A = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \qquad D = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}, \qquad F = \begin{pmatrix} f_1 & 0 \\ 0 & f_2 \end{pmatrix},$$
$$B^T = (0, 1), \qquad C^T = (0, 1), \qquad G_v = g,$$
with the scalars $g, f_1, f_2 > 0$.
The initial position $z_0$ is
$$z_0^T = (0, 1).$$
The system (1) subject to the data (15), (16) has the following form:
$$\frac{dz_1(t)}{dt} = z_2(t), \qquad \frac{dz_2(t)}{dt} = u(t) + v(t).$$
The solution of (18) with initial position (17) has the following integral form:
$$z(t) = \big(z_1(t), z_2(t)\big)^T = M_t M_0^{-1} z(0) + \int_0^t M_t M_s^{-1} f_s\, ds,$$
where
$$M_t = \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}$$
is a fundamental matrix solution of the corresponding homogeneous system
$$\frac{dz_1(t)}{dt} = z_2(t), \qquad \frac{dz_2(t)}{dt} = 0,$$
and
$$f_s = \begin{pmatrix} 0 \\ u(s) + v(s) \end{pmatrix}.$$
Thus, the analytical solution (after some technical calculations in (19) with (20) and (22)) can be written as
$$z_1(t) = t + \int_0^t (t - s)\big(u(s) + v(s)\big)\, ds,$$
$$z_2(t) = 1 + \int_0^t \big(u(s) + v(s)\big)\, ds.$$
The system (18), with (17) is a linearized kinematic model of a planar engagement between two vehicles—an interceptor (pursuer) and a target (evader) where both vehicles are directly controlled by their lateral accelerations u ( t ) = a p ( t ) and v ( t ) = a e ( t ) , respectively. The coordinates of the state vector z ( t ) = ( z 1 ( t ) , z 2 ( t ) ) are the relative lateral separation and the relative lateral velocity of the vehicles. The basic schematic view of the planar engagement geometry is shown in Figure 1, where:
  • The x-axis of the coordinate system is aligned with the initial line of sight;
  • The points ( x p , y p ) , ( x e , y e ) are the current coordinates;
  • The origin is collocated with the initial pursuer position;
  • V p and V e are the velocities;
  • a p and a e are the lateral accelerations;
  • φ_p and φ_e are the respective aspect angles between the velocity vectors and the reference line of sight;
  • y = y_e − y_p is the relative lateral separation normal to the initial line of sight;
  • r is the current range between the vehicles;
  • The line-of-sight angle λ is the angle between the current and initial lines of sight.
More details of such an engagement can be found, for instance, in [38,39].
The behavior of each player in this singular game is evaluated by the following regularized cost functional:
$$J_\varepsilon(u,v) = f_1 z_1^2(t_f) + f_2 z_2^2(t_f) + \int_0^{t_f} \big( \varepsilon^2 u^2(t) - g v^2(t) \big)\, dt.$$
The cost functional (25) has to be minimized by the pursuer u ( t ) and maximized by the evader v ( t ) .
The game consisting of the dynamics (1), with the data (15) and (16), initial condition (17) and cost functional (25), is called the interception differential game.
By combining the above data and definitions and substituting in the functional (25) we obtain:
$$J_\varepsilon(u,v) = f_1 \left( t_f + \int_0^{t_f} (t_f - t)\big(u(t) + v(t)\big)\, dt \right)^2 + f_2 \left( 1 + \int_0^{t_f} \big(u(t) + v(t)\big)\, dt \right)^2 + \int_0^{t_f} \big( \varepsilon^2 u^2(t) - g v^2(t) \big)\, dt.$$
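Before turning to the variational derivatives, the following minimal sketch (ours; the grid, the trapezoidal quadrature and the sample controls are illustrative assumptions) shows how (23), (24) and the regularized functional (26) can be evaluated for sampled controls, using the parameter values that appear later in Section 5.

```python
import numpy as np

def interception_state_and_cost(u, v, t, f1=0.5, f2=0.5, g=4.0, eps=0.1):
    """Evaluate z1(tf), z2(tf) from (23)-(24) and the regularized cost (26)
    for controls sampled on the grid t (trapezoidal quadrature; sketch).
    The default parameter values are those used in Section 5."""
    tf = t[-1]
    w = u + v
    z1_tf = tf + np.trapz((tf - t) * w, t)
    z2_tf = 1.0 + np.trapz(w, t)
    J = f1 * z1_tf ** 2 + f2 * z2_tf ** 2 + np.trapz(eps ** 2 * u ** 2 - g * v ** 2, t)
    return z1_tf, z2_tf, J

t = np.linspace(0.0, 4.0, 801)
u = -1.0 + 0.37 * t          # illustrative controls, roughly of the shape reported in Table 2
v = 0.0025 - 0.0009 * t
print(interception_state_and_cost(u, v, t))
```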
For the numerical validation of the saddle-point approximation of this functional, we next develop the appropriate variational derivatives.
Theorem 1.
The variational derivatives of (26) are given by:
$$\frac{\delta J_\varepsilon}{\delta u} = 2 f_1 (t_f - t) \left( t_f + \int_0^{t_f} (t_f - s)\big(u(s) + v(s)\big)\, ds \right) + 2 f_2 \left( 1 + \int_0^{t_f} \big(u(s) + v(s)\big)\, ds \right) + 2 \varepsilon^2 u(t),$$
and
$$\frac{\delta J_\varepsilon}{\delta v} = 2 f_1 (t_f - t) \left( t_f + \int_0^{t_f} (t_f - s)\big(u(s) + v(s)\big)\, ds \right) + 2 f_2 \left( 1 + \int_0^{t_f} \big(u(s) + v(s)\big)\, ds \right) - 2 g v(t).$$
Proof. 
Our functional $J_\varepsilon$ has two functions of $t$ as arguments ($u$ and $v$), and we have to find the variational derivatives with respect to both $u$ and $v$. In view of the above arguments, we have
$$\int_0^{t_f} \frac{\delta J_\varepsilon}{\delta u}\, h_1(t)\, dt = \delta J_\varepsilon(u) \cdot h_1 = \frac{\partial}{\partial \epsilon_1} J_\varepsilon(u + \epsilon_1 h_1, v + \epsilon_2 h_2)\Big|_{(\epsilon_1, \epsilon_2) = (0,0)},$$
and
$$\int_0^{t_f} \frac{\delta J_\varepsilon}{\delta v}\, h_2(t)\, dt = \delta J_\varepsilon(v) \cdot h_2 = \frac{\partial}{\partial \epsilon_2} J_\varepsilon(u + \epsilon_1 h_1, v + \epsilon_2 h_2)\Big|_{(\epsilon_1, \epsilon_2) = (0,0)}.$$
Equations (29) and (30) will render the functional derivatives $\delta J_\varepsilon / \delta u$ and $\delta J_\varepsilon / \delta v$, respectively. We start with (29):
$$\begin{aligned}
&\frac{\partial}{\partial \epsilon_1} J_\varepsilon(u + \epsilon_1 h_1, v + \epsilon_2 h_2)\Big|_{(\epsilon_1, \epsilon_2) = (0,0)} \\
&\quad = \frac{\partial}{\partial \epsilon_1} f_1 \left( t_f + \int_0^{t_f} (t_f - t)\big(u(t) + \epsilon_1 h_1(t) + v(t) + \epsilon_2 h_2(t)\big)\, dt \right)^2 \Bigg|_{(\epsilon_1, \epsilon_2) = (0,0)} \\
&\qquad + \frac{\partial}{\partial \epsilon_1} f_2 \left( 1 + \int_0^{t_f} \big(u(t) + \epsilon_1 h_1(t) + v(t) + \epsilon_2 h_2(t)\big)\, dt \right)^2 \Bigg|_{(\epsilon_1, \epsilon_2) = (0,0)} \\
&\qquad + \frac{\partial}{\partial \epsilon_1} \int_0^{t_f} \Big( \varepsilon^2 \big(u(t) + \epsilon_1 h_1(t)\big)^2 - g \big(v(t) + \epsilon_2 h_2(t)\big)^2 \Big)\, dt \Bigg|_{(\epsilon_1, \epsilon_2) = (0,0)} \\
&\quad = 2 f_1 \left( t_f + \int_0^{t_f} (t_f - t)\big(u(t) + v(t)\big)\, dt \right) \int_0^{t_f} (t_f - t) h_1(t)\, dt \\
&\qquad + 2 f_2 \left( 1 + \int_0^{t_f} \big(u(t) + v(t)\big)\, dt \right) \int_0^{t_f} h_1(t)\, dt + \int_0^{t_f} 2 \varepsilon^2 u(t) h_1(t)\, dt \\
&\quad = \int_0^{t_f} \left[ 2 f_1 (t_f - t) \left( t_f + \int_0^{t_f} (t_f - s)\big(u(s) + v(s)\big)\, ds \right) + 2 f_2 \left( 1 + \int_0^{t_f} \big(u(s) + v(s)\big)\, ds \right) + 2 \varepsilon^2 u(t) \right] h_1(t)\, dt.
\end{aligned}$$
The above expression together with (10) yields
$$\int_0^{t_f} \frac{\delta J_\varepsilon}{\delta u}\, h_1(t)\, dt = \int_0^{t_f} \left[ 2 f_1 t_f (t_f - t) + 2 f_1 (t_f - t) \int_0^{t_f} (t_f - s)\big(u(s) + v(s)\big)\, ds + 2 f_2 \left( 1 + \int_0^{t_f} \big(u(s) + v(s)\big)\, ds \right) + 2 \varepsilon^2 u(t) \right] h_1(t)\, dt.$$
It follows that the variational derivative of J ε with respect to u is given by
$$\frac{\delta J_\varepsilon}{\delta u} = 2 f_1 t_f (t_f - t) + 2 f_1 (t_f - t) \int_0^{t_f} (t_f - s)\big(u(s) + v(s)\big)\, ds + 2 f_2 \left( 1 + \int_0^{t_f} \big(u(s) + v(s)\big)\, ds \right) + 2 \varepsilon^2 u(t).$$
With some simplifications, we obtain (27); in a similar way we derive (28), and the proof is complete. □
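For the numerical experiments, (27) and (28) have to be evaluated for sampled controls; a minimal sketch (ours; grid and quadrature are illustrative) is given below.

```python
import numpy as np

def variational_derivatives(u, v, t, f1=0.5, f2=0.5, g=4.0, eps=0.1):
    """Evaluate (27) and (28) on the grid t for sampled controls u, v
    (trapezoidal quadrature; the default parameters are those of Section 5)."""
    tf = t[-1]
    w = u + v
    z1_tf = tf + np.trapz((tf - t) * w, t)   # bracket of the f1-term, cf. (23)
    z2_tf = 1.0 + np.trapz(w, t)             # bracket of the f2-term, cf. (24)
    dJ_du = 2.0 * f1 * (tf - t) * z1_tf + 2.0 * f2 * z2_tf + 2.0 * eps ** 2 * u
    dJ_dv = 2.0 * f1 * (tf - t) * z1_tf + 2.0 * f2 * z2_tf - 2.0 * g * v
    return dJ_du, dJ_dv
```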

4. Duality Representation

We consider the functional (26) with the constants $f_1, f_2, g > 0$ and a small parameter $\varepsilon > 0$, and present its dual representation.
Theorem 2.
The dual representation of the functional (26) is given by:
$$u^*(t) = -\frac{1}{2\varepsilon^2}\, h^T(t)\, \lambda^*,$$
$$v^*(t) = \frac{1}{2g}\, h^T(t)\, \lambda^*,$$
where
$$\lambda^* = \arg\max_{\lambda \in E^2} \left( \lambda^T d - \frac{1}{4} \lambda^T G \lambda \right),$$
or, equivalently, in scalar form,
$$\lambda^* = \arg\max_{\lambda \in E^2} \left( \lambda_1 t_f + \lambda_2 - \frac{1}{4} \left( \frac{\lambda_1^2}{f_1} + \frac{\lambda_2^2}{f_2} \right) - \frac{1}{4} \left( \frac{1}{\varepsilon^2} - \frac{1}{g} \right) \left( \frac{\lambda_1^2 t_f^3}{3} + \lambda_1 \lambda_2 t_f^2 + \lambda_2^2 t_f \right) \right),$$
where all the involved quantities are defined in the proof.
Proof. 
Along the lines of [40], let us calculate the program maximin:
$$\rho = \max_{v(\cdot) \in L^2[0,t_f]}\ \min_{u(\cdot) \in L^2[0,t_f]} J_\varepsilon.$$
Using [41],
$$z^T R z = \max_{\lambda \in E^n} \left( \lambda^T z - \frac{1}{4} \lambda^T R^{-1} \lambda \right),$$
where $R$ is a symmetric positive definite matrix. Note that
$$f_1 z_1^2(t_f) + f_2 z_2^2(t_f) = z^T(t_f) F z(t_f)$$
in (25), where $z = (z_1, z_2)^T$ and
$$F = \begin{pmatrix} f_1 & 0 \\ 0 & f_2 \end{pmatrix}.$$
Then,
$$J_\varepsilon = \max_{\lambda \in E^2} \varphi(\lambda, u(\cdot), v(\cdot)),$$
where
$$\varphi(\lambda, u(\cdot), v(\cdot)) = \lambda^T z(t_f) - \frac{1}{4} \lambda^T F^{-1} \lambda + \int_0^{t_f} \big( \varepsilon^2 u^2(t) - g v^2(t) \big)\, dt,$$
$$F^{-1} = \begin{pmatrix} 1/f_1 & 0 \\ 0 & 1/f_2 \end{pmatrix}.$$
Consequently,
$$\rho = \max_{v(\cdot)}\ \min_{u(\cdot)}\ \max_{\lambda \in E^2} \varphi(\lambda, u(\cdot), v(\cdot)).$$
Then (40) becomes:
$$\begin{aligned}
\varphi(\lambda, u(\cdot), v(\cdot)) &= \lambda_1 \left( t_f + \int_0^{t_f} (t_f - t)\big(u(t) + v(t)\big)\, dt \right) + \lambda_2 \left( 1 + \int_0^{t_f} \big(u(t) + v(t)\big)\, dt \right) \\
&\quad - \frac{1}{4} \left( \frac{\lambda_1^2}{f_1} + \frac{\lambda_2^2}{f_2} \right) + \int_0^{t_f} \big( \varepsilon^2 u^2(t) - g v^2(t) \big)\, dt \\
&= \lambda_1 t_f + \lambda_2 - \frac{1}{4} \left( \frac{\lambda_1^2}{f_1} + \frac{\lambda_2^2}{f_2} \right) + \varphi_u(\lambda, u(\cdot)) + \varphi_v(\lambda, v(\cdot)),
\end{aligned}$$
where
$$\varphi_u(\lambda, u(\cdot)) \triangleq \int_0^{t_f} \Big[ \big( \lambda_1 (t_f - t) + \lambda_2 \big) u(t) + \varepsilon^2 u^2(t) \Big]\, dt,$$
$$\varphi_v(\lambda, v(\cdot)) \triangleq \int_0^{t_f} \Big[ \big( \lambda_1 (t_f - t) + \lambda_2 \big) v(t) - g v^2(t) \Big]\, dt.$$
In (42), the operations of maximum over $\lambda \in E^2$ and minimum over $u(\cdot) \in L^2[0,t_f]$ commute. Therefore,
$$\rho = \max_{\lambda \in E^2}\ \max_{v(\cdot)}\ \min_{u(\cdot)} \varphi(\lambda, u(\cdot), v(\cdot)) = \max_{\lambda \in E^2} \left[ \lambda_1 t_f + \lambda_2 - \frac{1}{4} \left( \frac{\lambda_1^2}{f_1} + \frac{\lambda_2^2}{f_2} \right) + \min_{u(\cdot)} \varphi_u(\lambda, u(\cdot)) + \max_{v(\cdot)} \varphi_v(\lambda, v(\cdot)) \right].$$
The inner minimizer and maximizer in (46) are
$$u^*(t, \lambda) = -\frac{\lambda_1 (t_f - t) + \lambda_2}{2 \varepsilon^2},$$
$$v^*(t, \lambda) = \frac{\lambda_1 (t_f - t) + \lambda_2}{2 g}.$$
Substituting (47) and (48) into (43),
$$\begin{aligned}
\chi(\lambda) = \varphi(\lambda, u^*(\cdot, \lambda), v^*(\cdot, \lambda)) &= \lambda_1 t_f + \lambda_2 - \frac{1}{4} \left( \frac{\lambda_1^2}{f_1} + \frac{\lambda_2^2}{f_2} \right) + \varphi_u(\lambda, u^*(\cdot, \lambda)) + \varphi_v(\lambda, v^*(\cdot, \lambda)) \\
&= \lambda^T d - \frac{1}{4} \lambda^T G \lambda,
\end{aligned}$$
where
$$d = (t_f, 1)^T, \qquad G = F^{-1} + \mu \int_0^{t_f} h(t) h^T(t)\, dt, \qquad \mu = \frac{1}{\varepsilon^2} - \frac{1}{g}, \qquad h(t) = (t_f - t, 1)^T.$$
Thus,
$$G = \begin{pmatrix} \dfrac{1}{f_1} + \dfrac{\mu t_f^3}{3} & \dfrac{\mu t_f^2}{2} \\[6pt] \dfrac{\mu t_f^2}{2} & \dfrac{1}{f_2} + \mu t_f \end{pmatrix}.$$
Observe that the above is associated with the theory of symmetric matrices [42]. Moreover, note that the studied interception differential game is solvable if the matrix G given by (54) is positive definite. Hence, the desired result has been obtained. □
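Since χ(λ) = λᵀd − ¼λᵀGλ is a concave quadratic when G is positive definite, its maximizer is obtained from ∇χ(λ) = d − ½Gλ = 0, i.e., λ* = 2G⁻¹d. The following minimal sketch (ours) carries out this computation and builds the optimal controls of Theorem 2 with the parameter values of Section 5; the resulting magnitudes are comparable with the ε = 0.1 rows of Table 1 and Table 2.

```python
import numpy as np

def dual_solution(f1=0.5, f2=0.5, g=4.0, eps=0.1, tf=4.0):
    """Maximize chi(lam) = lam^T d - (1/4) lam^T G lam in closed form
    (lam* = 2 G^{-1} d) and return lam*, chi(lam*) and the optimal controls
    u*(t), v*(t) of Theorem 2 (sketch)."""
    mu = 1.0 / eps ** 2 - 1.0 / g
    G = np.array([[1.0 / f1 + mu * tf ** 3 / 3.0, mu * tf ** 2 / 2.0],
                  [mu * tf ** 2 / 2.0,            1.0 / f2 + mu * tf]])
    d = np.array([tf, 1.0])
    lam = 2.0 * np.linalg.solve(G, d)
    chi = lam @ d - 0.25 * lam @ G @ lam            # equals d^T G^{-1} d
    u_star = lambda t: -(lam[0] * (tf - t) + lam[1]) / (2.0 * eps ** 2)
    v_star = lambda t: (lam[0] * (tf - t) + lam[1]) / (2.0 * g)
    return lam, chi, u_star, v_star

lam, chi, u_star, v_star = dual_solution(eps=0.1)
print(lam, chi)                  # compare with the eps = 0.1 row of Table 1
print(u_star(0.0), v_star(0.0))  # compare with the eps = 0.1 row of Table 2 at t = 0
```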

5. Numerical Validation

In this section we examine the numerical behavior of the double-projection methods described in Section 2 (Arrow–Hurwicz–Uzawa [26], Korpelevich’s extragradient [27] and Popov [35]). We show that, under the boundedness-of-the-derivatives assumption (14), the results as ε → 0⁺ coincide with the dual development of the previous section. We choose g = 4, f_1 = f_2 = 0.5, t_f = 4 and present the results for ε = 0.1, 0.01, 0.001. Observe that Table 1, Table 2 and Table 3 and Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6 show that for ε → 0⁺, the values z_1(4) and z_2(4) go to zero, meaning that the relative lateral velocity z_2 and the relative lateral separation z_1 of the vehicles tend to zero at the end of the game, and the objective of the interception control is achieved. Observe also that the graphs of the inner minimizer and maximizer in Figure 2 coincide with the results in Table 2. Moreover, the values of the cost functional (26) decrease to 0.
Since we barely noticed any major differences between the numerical methods, we decided to present only the results of Popov’s method [35].
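For completeness, the following self-contained sketch (ours; the step size, grid, iteration count and the box standing in for the admissible sets Q and R are illustrative assumptions, not the configuration used in the paper) runs the Popov-type iteration of Section 2 on the discretized functional (26), with the variational derivatives (27) and (28) as gradients, and reports z_1(4), z_2(4) and the cost; the printed values are comparable with the ε = 0.1 row of Table 3.

```python
import numpy as np

f1 = f2 = 0.5; g = 4.0; tf = 4.0; eps = 0.1            # parameter values of Section 5
t = np.linspace(0.0, tf, 401)

def grads(u, v):
    """Variational derivatives (27)-(28) on the grid (trapezoidal quadrature)."""
    w = u + v
    z1_tf = tf + np.trapz((tf - t) * w, t)
    z2_tf = 1.0 + np.trapz(w, t)
    du = 2 * f1 * (tf - t) * z1_tf + 2 * f2 * z2_tf + 2 * eps ** 2 * u
    dv = 2 * f1 * (tf - t) * z1_tf + 2 * f2 * z2_tf - 2 * g * v
    return du, dv, z1_tf, z2_tf

proj = lambda x: np.clip(x, -5.0, 5.0)   # box stand-in for the projections onto Q and R
alpha = 0.002                            # illustrative step size
u = np.zeros_like(t)
v = np.zeros_like(t)

for k in range(20000):
    du, dv, _, _ = grads(u, v)
    u_bar = proj(u - alpha * du)         # leading point
    v_bar = proj(v + alpha * dv)
    u = proj(u_bar - alpha * du)         # Popov-type update: same gradients reused
    v = proj(v_bar + alpha * dv)

du, dv, z1_tf, z2_tf = grads(u, v)
J = f1 * z1_tf ** 2 + f2 * z2_tf ** 2 + np.trapz(eps ** 2 * u ** 2 - g * v ** 2, t)
print(z1_tf, z2_tf, J)                   # compare with the eps = 0.1 row of Table 3
```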

6. Conclusions

In this work we presented a singular, zero-sum, linear-quadratic differential game in which the weight matrix of the minimizer’s control cost in the cost functional is singular. As an application we focused on an interception differential game and introduced a regularized cost functional; we examined its dual representation and validated it via numerical schemes for finding saddle points.

Author Contributions

All authors contributed equally to the following: conceptualization, methodology, formal analysis, writing—original draft preparation. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The study does not report any data.

Acknowledgments

The authors would like to thank the referees for their comments on the manuscript, which helped in improving earlier versions of this paper. Moreover, we acknowledge Vladimir Turetsky’s useful help regarding the duality representation.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Turetsky, T.; Glizer, V.Y. Robust state-feedback controllability of linear systems to a hyperplane in a class of bounded controls. Optim. Theory Appl. 2004, 123, 639–667.
  2. Forouhar, K. Singular Differential Game Numerical Techniques and Closed Loop Strategies; University of California Press: Los Angeles, CA, USA, 1980.
  3. Glizer, V.Y.; Shinar, J. On the structure of a class of time-optimal trajectories. Optim. Control Appl. Methods 1993, 14, 271–279.
  4. Shinar, J.; Glizer, V.Y. Application of receding horizon control strategy to pursuit-evasion problems. Optim. Control Appl. Methods 1995, 16, 127–141.
  5. Simakova, E.N. Differential pursuit game. Autom. Remote Control 1967, 28, 173–181.
  6. Turetsky, T.; Glizer, V.Y.; Shinar, J. Robust trajectory tracking: Differential game/cheap control approach. Int. J. Syst. Sci. 2014, 45, 2260–2274.
  7. Hu, Y.; Oksendal, B.; Sulem, A. Singular mean-field control games with applications to optimal harvesting and investment problems. arXiv 2014, arXiv:1406.1863v1.
  8. Isaacs, R. Differential Games; John Wiley and Sons: New York, NY, USA, 1967.
  9. Basar, T.; Olsder, G.J. Dynamic Noncooperative Game Theory; Academic Press: London, UK, 1992.
  10. Bryson, A.E., Jr.; Ho, Y.-C. Applied Optimal Control; Taylor & Francis Group: New York, NY, USA, 1975.
  11. Krasovskii, N.N.; Subbotin, A.I. Game-Theoretical Control Problems; Springer: New York, NY, USA, 1988.
  12. Shinar, J.; Glizer, V.Y.; Turetsky, V. Solution of a singular zero-sum linear-quadratic differential game by regularization. Int. Game Theory Rev. 2014, 16, 14–32.
  13. Amato, F.; Pironti, A. A Note on singular zero-sum linear quadratic differential games. In Proceedings of the 33rd Conference on Decision and Control, Lake Buena Vista, FL, USA, 14–16 December 1994; IEEE Publishing: New York, NY, USA, 1994; pp. 1533–1535.
  14. Stoorvogel, A.A. The singular zero-sum differential game with stability using H∞ control theory. Math. Control Signals Syst. 1991, 4, 121–138.
  15. Stoorvogel, A.A. The H∞ Control Problem: A State Space Approach; University of Michigan Press: Ann Arbor, MI, USA, 2000.
  16. Glizer, V.Y.; Kelis, O. Solution of a zero-sum linear quadratic differential game with singular control cost of minimiser. Control Decis. 2015, 2, 155–184.
  17. Glizer, V.Y. Asymptotic solution of zero-sum linear-quadratic differential game with cheap control for the minimizer. Nonlinear Differ. Equ. Appl. 2000, 7, 231–258.
  18. Petersen, I.R. Linear-quadratic differential games with cheap control. Syst. Control Lett. 1986, 8, 181–188.
  19. Starr, A.W.; Ho, Y.-C. Nonzero-sum differential games. Optim. Theory Appl. 1969, 3, 184–206.
  20. Turetsky, T.; Glizer, V.Y. Robust solution of a time-variable interception problem: A cheap control approach. Int. Game Theory Rev. 2007, 9, 637–655.
  21. Gibali, A.; Kelis, O. Gradient methods for solving zero-sum linear-quadratic differential games. Appl. Anal. Optim. 2018, 2, 237–252.
  22. Glizer, V.Y.; Kelis, O. Solution of a singular infinite horizon zero-sum linear-quadratic differential game: A regularization approach. In Proceedings of the IEEE 23rd Mediterranean Conference on Control and Automation (MED2015), Torremolinos, Spain, 16–19 June 2015; pp. 390–397.
  23. Glizer, V.Y.; Kelis, O. Singular infinite horizon zero-sum linear-quadratic differential game: Saddle-point equilibrium sequence. Numer. Algebra Control Optim. 2017, 7, 1–20.
  24. Goebel, R. Convexity in zero-sum differential games. Control Optim. 2002, 40, 1491–1504.
  25. Rockafellar, R.T. Convex Analysis; Princeton University Press: Princeton, NJ, USA, 1970.
  26. Arrow, K.J.; Hurwicz, L.; Uzawa, H. Studies in Linear and Non-Linear Programming; Stanford University Press: Stanford, CA, USA, 1958.
  27. Korpelevich, G.M. The extragradient method for finding saddle points and other problems. Ekon. Mat. Metod. 1976, 12, 747–756.
  28. Antipin, A.S. On a method for convex programs using a symmetrical modification of the Lagrange function. Ekon. Mat. Metod. 1976, 12, 1164–1173.
  29. Censor, Y.; Gibali, A.; Reich, S. Strong convergence of subgradient extragradient methods for the variational inequality problem in Hilbert space. Optim. Methods Softw. 2011, 26, 827–845.
  30. Censor, Y.; Gibali, A.; Reich, S. The subgradient extragradient method for solving variational inequalities in Hilbert space. Optim. Theory Appl. 2011, 148, 318–335.
  31. Censor, Y.; Gibali, A.; Reich, S. Extensions of Korpelevich’s extragradient method for solving the variational inequality problem in Euclidean space. Optimization 2012, 61, 1119–1132.
  32. Zaslavski, A.J. Numerical Optimization with Computational Errors; Springer: Cham, Switzerland, 2016.
  33. Gelfand, I.M.; Fomin, S.V. Calculus of Variations; Dover Publications, Inc.: Mineola, NY, USA, 2000.
  34. Nedić, A.; Ozdaglar, A. Subgradient methods for saddle-point problems. Optim. Theory Appl. 2009, 142, 205–228.
  35. Popov, L.D. A modification of the Arrow-Hurwicz method for finding saddle points. Math. Notes 1980, 28, 845–848.
  36. Malitsky, Y.V.; Semenov, V.V. An extragradient algorithm for monotone variational inequalities. Cybern. Syst. Anal. 2014, 50, 271–277.
  37. Thong, D.V.; Li, X.H.; Dong, Q.L. An inertial Popov’s method for solving pseudomonotone variational inequalities. Optim. Lett. 2021, 15, 757–777.
  38. Gutman, S.; Leitmann, G. Optimal strategies in a neighborhood of a collision course. AIAA J. 1976, 14, 1210–1212.
  39. Shinar, J. Solution techniques for realistic pursuit-evasion games. In Control and Dynamic Systems; Academic Press: New York, NY, USA, 1981; Volume 17, pp. 63–124.
  40. Shinar, J.; Glizer, V.Y.; Turetsky, V.; Ianovsky, E. Solvability of linear-quadratic differential games associated with pursuit-evasion problems. Int. Game Theory Rev. 2008, 10, 481–515.
  41. Bellman, R. Introduction to Matrix Analysis; McGraw-Hill Book Co., Inc.: New York, NY, USA; Toronto, ON, Canada; London, UK, 1960.
  42. Crasmareanu, M. The determinant inner product and the Heisenberg product of Sym(2). Int. Electron. J. Geom. 2021, 14, 145–156.
Figure 1. Geometry of the interception game.
Figure 2. u*(t) and v*(t).
Figure 3. The relative lateral separation z_1(t) for varying ε.
Figure 4. A zoom of z_1(4) for varying ε.
Figure 5. The relative lateral velocity z_2(t) for varying ε.
Figure 6. A zoom of z_2(4) for varying ε.
Table 1. Duality.

ε        λ1*                      λ2*                       χ(λ*)
0.1      0.007417428195           −0.00977333591            0.009948188435
0.01     0.00007499156434         −0.00009997687968         0.00009999468885
0.001    7.499991563 × 10⁻⁷       −9.999976875 × 10⁻⁷       9.999994680 × 10⁻⁷

Table 2. The inner minimizer and maximizer for varying ε.

ε        u*(t)                                     v*(t)
0.1      −0.9948188435 + 0.3708714098 · t          0.002487047109 − 0.0009271785245 · t
0.01     −0.9999468885 + 0.3749578217 · t          0.00002499867221 − 0.000009373945540 · t
0.001    −0.9999994690 + 0.3749995782 · t          2.499998672 × 10⁻⁷ − 9.374989455 × 10⁻⁸ · t

Table 3. Numerical calculations for varying ε.

ε        z1(4)             z2(4)             Cost Functional (26)
0.1      0.007417429       −0.009773335      0.009948188431
0.01     0.000074991       −0.000099977      0.00009999468883
0.001    7.49 × 10⁻⁷       −0.000001         9.999994685 × 10⁻⁷