Cheap Control in a Non-Scalarizable Linear-Quadratic Pursuit-Evasion Game: Asymptotic Analysis

Turetsky, Vladimir; Glizer, Valery Y.

doi:10.3390/axioms11050214

Open AccessFeature PaperArticle

Cheap Control in a Non-Scalarizable Linear-Quadratic Pursuit-Evasion Game: Asymptotic Analysis

by

Vladimir Turetsky

^1,*,†

and

Valery Y. Glizer

^2,†

¹

Department of Mathematics, Ort Braude College of Engineering, 51 Snunit Str., P.O. Box 78, Karmiel 2161002, Israel

²

The Galilee Research Center for Applied Mathematics, Ort Braude College of Engineering, 51 Snunit Str., P.O. Box 78, Karmiel 2161002, Israel

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Axioms 2022, 11(5), 214; https://0-doi-org.brum.beds.ac.uk/10.3390/axioms11050214

Submission received: 27 March 2022 / Revised: 25 April 2022 / Accepted: 27 April 2022 / Published: 5 May 2022

(This article belongs to the Special Issue Singularly Perturbed Problems: Asymptotic Analysis and Approximate Solution)

Download

Browse Figures

Versions Notes

Abstract

:

In this work, a finite-horizon zero-sum linear-quadratic differential game, modeling a pursuit-evasion problem, was considered. In the game’s cost function, the cost of the control of the minimizing player (the minimizer/the pursuer) was much smaller than the cost of the control of the maximizing player (the maximizer/the evader) and the cost of the state variable. This smallness was expressed by a positive small multiplier (a small parameter) of the square of the

L_{2}

-norm of the minimizer’s control in the cost function. Parameter-free sufficient conditions for the existence of the game’s solution (the players’ optimal state-feedback controls and the game value), valid for all sufficiently small values of the parameter, were presented. The boundedness (with respect to the small parameter) of the time realizations of the optimal state-feedback controls along the corresponding game’s trajectory was established. The best achievable game value from the minimizer’s viewpoint was derived. A relation between solutions of the original cheap control game and the game that was obtained from the original one by replacing the small minimizer’s control cost with zero, was established. An illustrative real-life example is presented.

Keywords:

linear-quadratic differential game; cheap control; singular (degenerate) differential game; pursuit-evasion game

1. Introduction

A cheap control problem is an extremal control problem where a control cost of at least one of the decision makers is much smaller than a state cost in at least one cost function of the problem. Cheap control problems appear in many topics of optimal control and differential game theories. For example, such problems appear in the following topics: (1) regularization of singular optimal controls (see, e.g., [1,2,3,4]); (2) limitation analysis for optimal regulators and filters (see, e.g., [5,6,7]); (3) extremal control problems with high gain control in dynamics (see, e.g., [8,9]); (4) inverse optimal control problems (see, e.g., [10]); (5) robust optimal control of systems with uncertainties/disturbances (see, e.g., [11,12]); (6) guidance problems (see, e.g., [13,14]).

The Hamilton boundary-value problem and the Hamilton–Jacobi–Bellman–Isaacs equation, associated with the cheap control problem by solvability (control optimality) conditions, are singularly perturbed because of the smallness of the control cost.

In the present paper, we considered one class of cheap control pursuit-evasion differential games. Cheap control differential games have been studied in a number of works in the literature (see, e.g., [4,11,12,15,16] and references therein). In most of these studies, the case where a state cost appeared in the integral part of the cost function was treated. This feature allowed (subject to some additional condition on the state cost) the use of the boundary function method [17] for an asymptotic analysis of the corresponding singularly perturbed Hamilton–Jacobi–Bellman–Isaacs equation. Moreover, the time realization of the optimal state-feedback control with the small cost had an impulse-like behaviour, meaning it was unbounded as the control cost tended to zero. To the best of our knowledge, cheap control games, where the time realization of the state-feedback optimal control with the small cost remains bounded as this cost tends to zero, were considered only in a few works and only for specific problem settings. Thus in [13], a pursuit-evasion problem, modeled by a linear-quadratic zero-sum differential game with time-invariant four-dimensional dynamics and scalar controls of the players, was considered. In this game, the control cost of the pursuer was assumed to be small. Moreover, the integral part of the game’s cost function did not contain the state cost. By a linear state transformation, this cheap control game was converted to a scalar linear-quadratic cheap control game. In this scalar game, the time realization of the optimal state-feedback pursuer’s control against a bang–bang evader’s control was analyzed. Sufficient conditions for the boundedness of this time realization for all sufficiently small values of the pursuer’s control cost were derived. In [14], a similar problem was solved in the case where the control costs of both the pursuer and evader were small and had the same order of smallness. In [11], a more general pursuit-evasion problem was studied. This problem was modeled by a linear-quadratic zero-sum differential game with time-dependent six-dimensional dynamics. The controls of both the pursuer and evader were scalar. The costs of these controls were small and had the same order of smallness. The state cost was absent in the integral part of the game’s cost function. This game also allowed a transformation to a scalar linear-quadratic cheap control game. In this scalar game, the time realization of the optimal state-feedback pursuer’s control against an open-loop bounded evader’s control was analyzed. Sufficient conditions, guaranteeing that the time realization satisfied given constraints for all sufficiently small values of the controls’ costs, were obtained. In [12], a robust tracking problem, modeled by a linear-quadratic zero-sum differential game with time-dependent n-dimensional (

n \geq 1

) dynamics, was analyzed. The controls of both minimizing and maximizing players were vector-valued. The costs of these controls were small and had the same order of smallness. For this game, the limit behaviour of the state-dependent part of the cost function, generated by the optimal state-feedback control of the minimizing player (the minimizer) and any

L_{2}

-bounded open-loop control of the maximizing player (the maximizer), was studied. Sufficient conditions, providing the tendency to zero of this part of the cost function as the small controls’ costs approached zero (the exact tracking), were derived. Subject to these conditions, necessary conditions for the boundedness of the time realization of the optimal state-feedback minimizer’s control for all sufficiently small values of the controls’ costs were obtained.

In the present work, we studied a much more general cheap control linear-quadratic zero-sum differential game than those in [11,13,14]. For this game, an asymptotic analysis of its solution was carried out in the case where the small control’s cost of the minimizer tended to zero. In particular, the asymptotic behavior of the time realizations of both players’ optimal state-feedback controls along the corresponding (optimal) trajectory of the game was analyzed. The boundedness of these time realizations was established for all sufficiently small values of the minimizer’s control cost. Moreover, in contrast to the results of the work [12], the conditions for such boundedness were sufficient and they were not restricted by any other specific conditions, such as the exact tracking in [12].

Also in the present work, we considered one more linear-quadratic zero-sum differential game. This game was obtained from the original cheap control game by replacing the small control cost of the minimizer with zero. This new game was called a degenerate game and was similar to the continuous/discrete time system obtained from a singularly perturbed system by replacing a small parameter of singular perturbation with zero. The relation between the original cheap control game and the degenerate game was established.

This paper is organised as follows. In Section 2, the problems of the paper (the cheap control differential game and the degenerate differential game) are rigorously formulated, main definitions and some preliminary results are presented and the objectives of the paper are stated. In Section 3, the solution of the cheap control differential game is obtained and the asymptotic analysis of this solution is carried out. Section 4 is devoted to deriving the solution of the degenerate differential game. In addition, some relations between the solution of the cheap control differential game and the degenerate differential game are established in this section. In Section 5, based on the theoretical results of the paper, one interception problem in 3D space was studied. Conclusions of the paper are presented in Section 6.

2. Preliminaries and Problem Statement

Consider the controlled system

\dot{x} = A (t) x + B (t) u + C (t) v, x (t_{0}) = x_{0}, t \in [t_{0}, t_{f}],

(1)

where

x \in R^{n}

,

u \in R^{r}

and

v \in R^{s}

are the state, the pursuer’s control and the evader’s control, respectively;

t_{0}

is an initial time moment;

t_{f}

is a final time moment; the matrix-valued functions

A (t)

,

B (t)

and

C (t)

of appropriate dimensions are continuous for

t \in [t_{0}, t_{f}]

. The controls

u (t)

and

v (t)

are assumed to be measurable bounded functions for

t \in [t_{0}, t_{f}]

.

The target set is a linear manifold

T_{x} = \{x \in R^{n} : D x + d = 0\},

(2)

where D is a prescribed

m \times n

-matrix (

m < n

) and

d \in R^{m}

is a prescribed vector. The objective of the pursuer is to steer the system onto a target set at

t = t_{f}

, whereas the evader desires to avoid hitting the target set by exploiting feedback strategies

u (t, x)

and

v (t, x)

, respectively.

Let us consider the set

U_{x}

of all functions

u = u (t, x) : [0, t_{f}] \times R^{n} \to R^{r}

, which are measurable w.r.t.

t \in [0, t_{f}]

for any fixed

x \in R^{n}

and satisfy the local Lipschitz condition w.r.t.

x \in R^{n}

uniformly in

t \in [0, t_{f}]

. Similarly, we consider the set

V_{x}

of all functions

v = v (t, x) : [0, t_{f}] \times R^{n} \to R^{s}

, which are measurable w.r.t.

t \in [0, t_{f}]

for any fixed

x \in R^{n}

and satisfy the local Lipschitz condition w.r.t.

x \in R^{n}

uniformly in

t \in [0, t_{f}]

.

Definition 1.

Let us denote by

U_{x}

the set of all functions

u (t, x) \in U_{x}

satisfying the following conditions:

(1_{u x})

the initial-value problem (1) for

u (t) = u (t, x)

and any fixed

v (t) \in L_{2} ([0, t_{f}], R^{s})

has the unique absolutely continuous solution

x_{u} (t)

,

t \in [0, t_{f}]

;

(2_{u x})

u (t, x_{u} (t)) \in L_{2} ([0, t_{f}], R^{r})

.

Also, let us denote by

V_{x}

the set of all functions

v (t, x) \in V_{x}

satisfying the following conditions:

(1_{v x})

the initial-value problem (1) for

v (t) = v (t, x)

and any fixed

u (t) \in L_{2} ([0, t_{f}], R^{r})

has the unique absolutely continuous solution

x_{v} (t)

,

t \in [0, t_{f}]

;

(2_{v x})

v (t, x_{v} (t)) \in L_{2} ([0, t_{f}], R^{s})

.

In what follows, the set

U_{x}

is called the set of all admissible state-feedback controls (strategies) of the pursuer, while the set

V_{x}

is called the set of all admissible state-feedback controls (strategies) of the evader.

Below, two differential games modeling this conflict situation are formulated.

2.1. Cheap Control Differential Game

The first is the Cheap Control Differential Game (CCDG) with the dynamics (1) and the cost function

{\tilde{J}}_{α β} (u, v) = | D x (t_{f}) {+ d |}^{2} + α \int_{t_{0}}^{t_{f}} | u (t) |^{2} d t - β \int_{t_{0}}^{t_{f}} {| v (t) |}^{2} d t,

(3)

where

| x |

denotes the Euclidean norm of the vector x;

α, β > 0

are the penalty coefficients for the players’ control expenditure, and

α

is assumed to be small. The objectives of the pursuer and the evader were to minimize and to maximize the cost function (3) by

u (\cdot) \in U_{x}

and

v (\cdot) \in V_{x}

, respectively.

The CCDG (1), (3) is a zero-sum linear-quadratic differential game (see, e.g., [18,19,20,21,22]).

Definition 2.

Let

u (t, x)

,

(t, x) \in [t_{0}, t_{f}] \times R^{n}

, be any given admissible pursuer strategy, i.e.,

u (\cdot) \in U_{x}

. Then, the value

{\tilde{J}}_{α β}^{u} (u (\cdot); t_{0}, x_{0}) = sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} {\tilde{J}}_{α β} (u (\cdot), v (t)),

(4)

calculated along the corresponding trajectories of the system (1), is called the guaranteed result of the strategy

u (\cdot)

in the CCDG.

The value

{\tilde{J}}_{α β}^{u *} (t_{0}, x_{0}) = inf_{u (\cdot) \in U_{x}} {\tilde{J}}_{α β}^{u} (u (\cdot); t_{0}, x_{0})

(5)

is called the upper value of the CCDG.

If the infimum value (5) is attained for

{\tilde{u}}_{α β}^{0} (t, x) \in U_{x}

, i.e.,

inf_{u (\cdot) \in U_{x}} {\tilde{J}}_{α β}^{u} (u (\cdot); t_{0}, x_{0}) = min_{u (\cdot) \in U_{x}} {\tilde{J}}_{α β}^{u} (u (\cdot); t_{0}, x_{0})

and

{\tilde{u}}_{α β}^{0} (t, x) = arg min_{u (\cdot) \in U_{x}} {\tilde{J}}_{α β}^{u} (u (\cdot); t_{0}, x_{0}),

(6)

the strategy

{\tilde{u}}_{α β}^{0} (t, x)

is called the optimal strategy of the pursuer in the CCDG.

Definition 3.

Let

v (t, x)

,

(t, x) \in [t_{0}, t_{f}] \times R^{n}

, be any given admissible evader strategy, i.e.,

v (\cdot) \in V_{x}

. Then, the value

{\tilde{J}}_{α β}^{v} (v (\cdot); t_{0}, x_{0}) = inf_{u (t) \in L_{2} ([t_{0}, t_{f}], R^{r})} {\tilde{J}}_{α β} (u (t), v (\cdot)),

(7)

calculated along the corresponding trajectories of the system (1), is called the guaranteed result of the strategy

v (\cdot)

in the CCDG.

The value

{\tilde{J}}_{α β}^{v *} (t_{0}, x_{0}) = sup_{v (\cdot) \in V_{x}} {\tilde{J}}_{α β}^{v} (v (\cdot); t_{0}, x_{0})

(8)

is called the lower value of the CCDG.

If the supremum value (8) is attained for

{\tilde{v}}_{α β}^{0} (t, x) \in V_{x}

, i.e.,

sup_{v (\cdot) \in V_{x}} {\tilde{J}}_{α β}^{v} (v (\cdot); t_{0}, x_{0}) = max_{v (\cdot) \in V_{x}} {\tilde{J}}_{α β}^{v} (v (\cdot); t_{0}, x_{0})

and

{\tilde{v}}_{α β}^{0} (t, x) = arg max_{v (\cdot) \in V_{x}} {\tilde{J}}_{α β}^{v} (v (\cdot); t_{0}, x_{0}),

(9)

the strategy

{\tilde{v}}_{α β}^{0} (t, x)

is called the optimal strategy of the evader in the CCDG.

Definition 4.

If

{\tilde{J}}_{α β}^{u *} (t_{0}, x_{0}) = {\tilde{J}}_{α β}^{v *} (t_{0}, x_{0}) ≜ {\tilde{J}}_{α β}^{0} (t_{0}, x_{0}),

(10)

then it is said that the CCDG has the game value

{\tilde{J}}_{α β}^{0}

.

2.2. Singular (Degenerate) Differential Game

In this game the dynamics were the same as in the CCDG, i.e., (1), while the cost function of this game was obtained from (3) by replacing

α

with zero:

{\tilde{J}}_{β} (u, v) = | D x (t_{f}) {+ d |}^{2} - β \int_{0}^{t_{f}} {| v (t) |}^{2} d t .

(11)

The differential game (1), (11) is called the Singular Differential Game (SDG).

Remark 1.

The sets of all admissible state-feedback controls (strategies) of the pursuer and the evader in the SDG are the same as in the CCDG, i.e.,

U_{x}

and

V_{x}

, respectively. The guaranteed results

{\tilde{J}}_{β}^{u} (u (\cdot); t_{0}, x_{0})

and

{\tilde{J}}_{β}^{v} (v (\cdot); t_{0}, x_{0})

of any given strategies

u (\cdot) \in U_{x}

and

v (\cdot) \in V_{x}

in the SDG are defined similarly to (4) and (7), respectively. Namely,

{\tilde{J}}_{β}^{u} (u (\cdot); t_{0}, x_{0}) = sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} {\tilde{J}}_{β} (u (\cdot), v (t)),

(12)

{\tilde{J}}_{β}^{v} (v (\cdot); t_{0}, x_{0}) = inf_{u (t) \in L_{2} ([t_{0}, t_{f}], R^{r})} {\tilde{J}}_{β} (u (t), v (\cdot)) .

(13)

The upper

{\tilde{J}}_{β}^{u *} (t_{0}, x_{0})

and lower

{\tilde{J}}_{β}^{v *} (t_{0}, x_{0})

values of the SDG are defined similarly to (5) and (8), respectively. Namely,

{\tilde{J}}_{β}^{u *} (t_{0}, x_{0}) = inf_{u (\cdot) \in U_{x}} {\tilde{J}}_{β}^{u} (u (\cdot); t_{0}, x_{0}),

(14)

{\tilde{J}}_{β}^{v *} (t_{0}, x_{0}) = sup_{v (\cdot) \in V_{x}} {\tilde{J}}_{β}^{v} (v (\cdot); t_{0}, x_{0}) .

(15)

If

{\tilde{J}}_{β}^{u *} (t_{0}, x_{0}) = {\tilde{J}}_{β}^{v *} (t_{0}, x_{0}) ≜ {\tilde{J}}_{β}^{*} (t_{0}, x_{0}),

(16)

then

{\tilde{J}}_{β}^{*} (t_{0}, x_{0})

is called the value of the SDG.

Definition 5.

The sequence of state-feedback controls

{{\tilde{u}}_{β, k} (\cdot)}

,

{\tilde{u}}_{β, k} (\cdot) \in U_{x}

,

(k = 1, 2, . . .)

, is called minimizing in the SDG if

lim_{k \to \infty} {\tilde{J}}_{β}^{u} ({\tilde{u}}_{β, k} (\cdot); t_{0}, x_{0}) = {\tilde{J}}_{β}^{u *} (t_{0}, x_{0}) .

(17)

If there exists

{\tilde{u}}_{β}^{*} (t, x) \in U_{x}

, for which the upper value of the SDG is attained, this state-feedback control is called an optimal state-feedback control of the pursuer in the SDG:

{\tilde{u}}_{β}^{*} (t, x) = arg min_{u (\cdot) \in U_{x}} {\tilde{J}}_{β}^{u} (u (\cdot); t_{0}, x_{0}) .

(18)

Definition 6.

The sequence of state-feedback controls

{{\tilde{v}}_{β, k} (\cdot)}

,

{\tilde{v}}_{β, k} (\cdot) \in V_{x}

,

(k = 1, 2, . . .)

, is called maximizing in the SDG if

lim_{k \to \infty} {\tilde{J}}_{β}^{v} ({\tilde{v}}_{β, k} (\cdot); t_{0}, x_{0}) = {\tilde{J}}_{β}^{v *} (t_{0}, x_{0}) .

(19)

If there exists

{\tilde{v}}_{β}^{*} (t, x) \in V_{x}

, for which the lower value of the SDG is attained, this state-feedback control is called an optimal state-feedback control of the evader in the SDG:

{\tilde{v}}_{β}^{*} (t, x) = arg max_{v (\cdot) \in V_{x}} {\tilde{J}}_{β}^{v} (v (\cdot); t_{0}, x_{0}) .

(20)

Remark 2.

Since the cost function (11) of the SDG does not contain a quadratic control cost of u, its solution (if it exists) cannot be obtained either by the Isaacs’s MinMax principle or by the Bellman–Isaacs equation method (see [23]). This justified calling this game singular. The CCDG could be considered as a singularly perturbed SDG, whereas the SDG was a degenerate CCDG.

2.3. Reduction of the Games

Let

Φ (t, τ)

be the transition matrix of the homogeneous system

\dot{x} = A (t) x

. By applying the state transformation

z = D Φ (t_{f}, t) x + d,

(21)

the system (1) is reduced to

\dot{z} = H_{1} (t) u + H_{2} (t) v, z (t_{0}) = z_{0}, t \in [t_{0}, t_{f}],

(22)

where

m \times r

and

m \times s

matrices

H_{1} (t)

and

H_{2} (t)

are

H_{1} (t) = D Φ (t_{f}, t) B (t), H_{2} (t) = D Φ (t_{f}, t) C (t),

(23)

z_{0} = D Φ (t_{f}, t_{0}) x_{0} + d .

(24)

Due to (21), for the reduced system (22), the cost functions (3) and (11) of the CCDG and SDG become

J_{α β} = | z (t_{f}) |^{2} + α \int_{t_{0}}^{t_{f}} | u (t) |^{2} d t - β \int_{t_{0}}^{t_{f}} {| v (t) |}^{2} d t,

(25)

and

J_{β} = | z (t_{f}) |^{2} - β \int_{t_{0}}^{t_{f}} {| v (t) |}^{2} d t,

(26)

respectively.

The games (22), (25) and (22), (26) are called the Reduced Cheap Control Differential Game (RCCDG) and the Reduced Singular Differential Game (RSDG), respectively.

Let us consider the set

U_{z}

of all functions

u = u (t, z) : [0, t_{f}] \times R^{m} \to R^{r}

, which are measurable w.r.t.

t \in [0, t_{f}]

for any fixed

z \in R^{m}

and satisfy the local Lipschitz condition w.r.t.

z \in R^{m}

uniformly in

t \in [0, t_{f}]

. Similarly, we consider the set

V_{z}

of all functions

v = v (t, z) : [0, t_{f}] \times R^{m} \to R^{s}

, which are measurable w.r.t.

t \in [0, t_{f}]

for any fixed

z \in R^{m}

and satisfy the local Lipschitz condition w.r.t.

z \in R^{m}

uniformly in

t \in [0, t_{f}]

.

Definition 7.

Let us denote by

U_{z}

the set of all functions

u (t, z) \in U_{z}

satisfying the following conditions:

(1_{u z})

the initial-value problem (22) for

u (t) = u (t, z)

and any fixed

v (t) \in L_{2} ([0, t_{f}], R^{s})

has the unique absolutely continuous solution

z_{u} (t)

,

t \in [0, t_{f}]

;

(2_{u z})

u (t, z_{u} (t)) \in L_{2} ([0, t_{f}], R^{r})

.

In addition, let us denote by

V_{z}

the set of all functions

v (t, z) \in V_{z}

satisfying the following conditions:

(1_{v z})

the initial-value problem (22) for

v (t) = v (t, z)

and any fixed

u (t) \in L_{2} ([0, t_{f}], R^{r})

has the unique absolutely continuous solution

z_{v} (t)

,

t \in [0, t_{f}]

;

(2_{v x})

v (t, z_{v} (t)) \in L_{2} ([0, t_{f}], R^{s})

.

In what follows, the set

U_{z}

is called the set of all admissible state-feedback controls (strategies) of the pursuer in both games RCCDG and RSDG, while the set

V_{z}

is called the set of all admissible state-feedback controls (strategies) of the evader in both games RCCDG and RSDG.

Remark 3.

Based on Definition 7, the guaranteed results

J_{α β}^{u} (u (\cdot); t_{0}, z_{0})

and

J_{α β}^{v} (v (\cdot); t_{0}, z_{0})

of any given strategies

u (\cdot) \in U_{z}

and

v (\cdot) \in V_{z}

in the RCCDG are defined similarly to (4) and (7), respectively. The upper

J_{α β}^{u *} (t_{0}, z_{0})

and lower

J_{α β}^{v *} (t_{0}, z_{0})

values of the RCCDG are defined similarly to (5) and (8), respectively. The optimal state-feedback controls of the pursuer

u_{α β}^{0} (t, z)

and the evader

v_{α β}^{0} (t, z)

,

(t, z) \in [0, t_{f}] \times R^{m}

, are defined similarly to (6) and (9), respectively. The value of the RCCDG

J_{α β}^{0} (t_{0}, z_{0})

is defined similarly to (10).

Remark 4.

Based on Definition 7, the guaranteed results

J_{β}^{u} (u (\cdot); t_{0}, z_{0})

and

J_{β}^{v} (v (\cdot); t_{0}, z_{0})

of any given strategies

u (\cdot) \in U_{z}

and

v (\cdot) \in V_{z}

in the RSDG are defined similarly to (12) and (13), respectively. The upper

J_{β}^{u *} (t_{0}, z_{0})

and lower

J_{β}^{v *} (t_{0}, z_{0})

values of the RSDG are defined similarly to (14) and (15), respectively. The minimizing sequence

{u_{β, k} (\cdot)}

,

u_{β, k} (\cdot) \in U_{z}

,

(k = 1, 2, . . .)

, and the optimal state-feedback control

u_{β}^{*} (t, z)

of the pursuer in the RSDG are defined similarly to (17) and (18), respectively. The maximizing sequence

{v_{β, k} (\cdot)}

,

v_{β, k} (\cdot) \in V_{z}

,

(k = 1, 2, . . .)

, and the optimal state-feedback control

v_{β}^{*} (t, z)

of the evader in the RSDG are defined similarly to (19) and (20), respectively. The value of the RSDG

J_{β}^{*} (t_{0}, z_{0})

is defined similarly to (16).

Remark 5.

If

u_{α β}^{0} (t, z)

and

v_{α β}^{0} (t, z)

are the optimal strategies of the pursuer and the evader in the RCCDG, then the strategies

u_{α β}^{0} (t, D Φ (t_{f}, t) x + d) and v_{α β}^{0} (t, D Φ (t_{f}, t) x + d),

(27)

are optimal strategies of the pursuer and the evader in the CCDG.

If

{u_{β, k} (t, z)}_{k = 1}^{+ \infty}

and

{v_{β, k} (t, z)}_{k = 1}^{+ \infty}

are the minimizing sequence and the maximizing sequence in the RSDG, then the sequences

{\{u_{β, k} (t, D Φ (t_{f}, t) x + d)\}}_{k = 1}^{+ \infty} and {\{v_{β, k} (t, D Φ (t_{f}, t) x + d)\}}_{k = 1}^{+ \infty}

(28)

are minimizing and maximizing sequences in the SDG. Moreover, if

u_{β}^{*} (t, z)

and

v_{β}^{*} (t, z)

are the optimal strategies of the pursuer and the evader in the RSDG, then the strategies

u_{β}^{*} (t, D Φ (t_{f}, t) x + d) and v_{β}^{*} (t, D Φ (t_{f}, t) x + d),

(29)

are optimal strategies of the pursuer and the evader in the SDG.

2.4. Objectives of the Paper

In this paper, we investigated the asymptotic behaviour of the solution to the RCCDG and the relation between the RCCDG and the RSDG solutions. In particular, the objectives of the paper were:

(1): to establish the boundedness of the time realizations $u_{α β}^{0} (t) = u_{α β}^{0} (t, z_{α β}^{0} (t))$ , $v_{α β}^{0} (t) = v_{α β}^{0} (t, z_{α β}^{0} (t))$ of the RCCDG optimal strategies along the corresponding trajectory $z_{α β}^{0} (t)$ of (22) for $α \to 0$ ;
(2): to establish the best achievable RCCDG value from the pursuer’s point of view:

$J_{best}^{0} (t_{0}, z_{0}) \overset{▵}{=} inf_{α \in (0, α^{0}]} J_{α β}^{0} (t_{0}, z_{0}),$

(30)

where $α^{0} > 0$ is some sufficiently small number;
(3): to obtain the RSDG value, and establish the limiting relation between the values of the RCCDG and the RSDG:

$lim_{α \to 0} J_{α β}^{0} (t_{0}, z_{0}) = J_{β}^{*} (t_{0}, z_{0});$

(31)
(4): to construct the RSDG pursuer’s minimizing sequence ${\{u_{β, k} (\cdot)\}}_{k = 1}^{+ \infty}$ and the evader’s optimal state-feedback control $v_{β}^{*} (\cdot)$ based on the RCCDG solution.

3. The RCCDG Solution and Its Asymptotic Properties

By virtue of [19,20,21,22], we obtained the RCCDG solution:

J_{α β}^{0} (t_{0}, z_{0}) = z_{0}^{T} R_{α β} (t_{0}) z_{0},

(32)

u_{α β}^{0} (t, z) = - \frac{1}{α} H_{1}^{T} (t) R_{α β} (t) z,

(33)

v_{α β}^{0} (t, z) = \frac{1}{β} H_{2}^{T} (t) R_{α β} (t) z,

(34)

where the matrix-valued function

R_{α β} (t)

is the solution of the Riccati matrix differential equation

\dot{R} = R Q_{α β} (t) R, R (t_{f}) = I_{m}, t \in [t_{0}, t_{f}],

(35)

Q_{α β} (t) = \frac{1}{α} H_{1} (t) H_{1}^{T} (t) - \frac{1}{β} H_{2} (t) H_{2}^{T} (t),

(36)

H^{T}

denotes a transposed matrix and

I_{m}

is the unit

m \times m

-matrix.

The solution of (35) is readily obtained:

R_{α β} (t) = S_{α β}^{- 1} (t), t \in [t_{0}, t_{f}],

(37)

if and only if the matrix

S_{α β} (t) = I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ

(38)

is invertible for all

t \in [t_{0}, t_{f}]

.

Thus, the RCCDG is solvable if and only if

det (S_{α β} (t)) \neq 0, t \in [t_{0}, t_{f}] .

(39)

Condition S. The system (22) is controllable with respect to

u (t)

at any interval

[t, t_{f}]

,

t \in [t_{0}, t_{f})

.

Remark 6.

By using the t-dependent controllability gramians

G_{1} (t) = \int_{t}^{t_{f}} H_{1} (τ) H_{1}^{T} (τ) d τ, t \in [t_{0}, t_{f}),

(40)

Condition S can be rewritten [18] as

det G_{1} (t) > 0, t \in [t_{0}, t_{f}) .

(41)

The following statement is a direct consequence of (Theorem 3.1 [24]).

Proposition 1.

Let Condition S hold. Then, for any

β > 0

there exists

\tilde{α} = \tilde{α} (β)

such that the condition (39) holds for all

α > 0

satisfying

α \leq \tilde{α} .

(42)

Let

z_{α β}^{0} (t)

denote the optimal motion of (22) for

u = u_{α β}^{0} (t, z)

,

v = v_{α β}^{0} (t, z)

.

Proposition 2.

Let Condition S hold. Then, there exists the bounded limit function

\tilde{z} (t) = lim_{α \to 0} z_{α β}^{0} (t), t \in [t_{0}, t_{f}],

(43)

which is independent of β. Moreover

lim_{α \to 0} z_{α β}^{0} (t_{f}) = \tilde{z} (t_{f}) = 0 .

(44)

Proof.

Let

α > 0

satisfy (42). By substituting the optimal strategies (33) and (34) into the system (22), due to (36), (37) and (38), the dynamics become

\dot{z} = - Q_{α β} (t) R_{α β} (t) z .

(45)

Define

y ≜ R_{α β} (t) z = {(I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} z .

(46)

Then,

\dot{y} = - {(I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} (- Q_{α β} (t)) {(I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} z +

{(I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} (- Q_{α β} (t)) {(I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} z = 0,

(47)

yielding

y (t) \equiv c = const, t \in [t_{0}, t_{f}] .

(48)

For

t = t_{0}

,

y (t_{0}) = c = {(I_{m} + \int_{t_{0}}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} z_{0} .

(49)

Thus, due to (46) and (48), the solution

z_{α β}^{0} (t)

of (45) is

z_{α β}^{0} (t) = (I_{m} + \int_{t}^{t_{f}} Q_{α β} (τ) d τ) {(I_{m} + \int_{t_{0}}^{t_{f}} Q_{α β} (τ) d τ)}^{- 1} z_{0} .

(50)

Due to (36) and (40),

z_{α β}^{0} (t) = (I_{m} + \frac{1}{α} G_{1} (t) - \frac{1}{β} \int_{t}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) d τ) {(I_{m} + \frac{1}{α} G_{1} (t_{0}) - \frac{1}{β} \int_{t_{0}}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) d τ)}^{- 1} z_{0} .

(51)

By factoring

\frac{1}{α}

out of both matrices, (51) becomes

z_{α β}^{0} (t) = (α I_{m} - \frac{α}{β} \int_{t}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) d τ + G_{1} (t)) {(α I_{m} - \frac{α}{β} \int_{t}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) d τ + G_{1} (t_{0}))}^{- 1} z_{0} .

(52)

Since the gramian

G_{1} (t_{0})

is non-singular, the limit (43) is readily calculated for

t \in [t_{0}, t_{f}]

:

lim_{α \to 0} z_{α β}^{0} (t) = G_{1} (t) G_{1}^{- 1} (t_{0}) z_{0} ≜ \tilde{z} (t) .

(53)

For

t = t_{f}

, (51) is

z_{α β}^{0} (t_{f}) = α {(α I_{m} - \frac{α}{β} \int_{t_{0}}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) d τ + G_{1} (t_{0}))}^{- 1} z_{0},

(54)

and

lim_{α \to 0} z_{α β}^{0} (t_{f}) = 0 .

(55)

Since

G_{1} (t_{f}) = 0

, (53) yields

\tilde{z} (t_{f}) = 0 .

(56)

Equations (55) and (56) prove (44). This completes the proof of the proposition. □

Proposition 3.

Let Condition S hold. Then the time realizations

u_{α β}^{0} (t) = u_{α β}^{0} (t, z_{α β}^{0} (t))

,

v_{α β}^{0} (t) = v_{α β}^{0} (t, z_{α β}^{0} (t))

of the optimal strategies (33)–(34) are bounded for

α \to 0

.

Proof.

By substituting (50) into (33), by using (36) and (40), and by factoring

\frac{1}{α}

out of the matrix, the time realization of the RCCDG optimal minimizer’s strategy is

u_{α β}^{0} (t) = - H_{1}^{T} (t) {(α I_{m} + G_{1} (t_{0}) - \frac{α}{β} \int_{t_{0}}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) (τ) d τ)}^{- 1} z_{0} .

(57)

Thus, for any

β > 0

, there exists the bounded limit function

lim_{α \to 0} u_{α β}^{0} (t) = - H_{1} (t) G_{1}^{- 1} (t_{0}) z_{0} ≜ \tilde{u} (t), t \in [t_{0}, t_{f}] .

(58)

Similarly, the time realization the RCCDG optimal maximizer’s strategy is

v_{α β}^{0} (t) = \frac{α}{β} H_{2}^{T} (t) {(α I_{m} + G_{1} (t_{0}) - \frac{α}{β} \int_{t_{0}}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) (τ) d τ)}^{- 1} z_{0} .

(59)

yielding

lim_{α \to 0} v_{α β}^{0} (t) = 0 ≜ \tilde{v} (t), t \in [t_{0}, t_{f}] .

(60)

□

Proposition 4.

Let Condition S hold. Then the feedback strategies (33) and (34) are well defined for

α = 0

for all

(t, z) \in [t_{0}, t_{f}) \times R^{m}

.

Proof.

Similarly to (57), by factoring

\frac{1}{α}

from the gain of the strategy (33),

u_{α β}^{0} (t, z) = - H_{1}^{T} (t) {(α I_{m} + G_{1} (t) - \frac{α}{β} \int_{t}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) (τ) d τ)}^{- 1} z,

(61)

which is well defined for

α = 0

,

(t, z) \in [t_{0}, t_{f}) \times R^{m}

:

lim_{α \to 0} u_{α β}^{0} (t, z) = \tilde{K} (t) z ≜ \tilde{u} (t, z),

(62)

where

\tilde{K} (t) = - H_{1}^{T} (t) G_{1}^{- 1} (t) .

(63)

Similarly to (59),

v_{α β}^{0} (t, z) = \frac{α}{β} H_{2}^{T} (t) {(α I_{m} + G_{1} (t) - \frac{α}{β} \int_{t}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) (τ) d τ)}^{- 1} z,

(64)

yielding

lim_{α \to 0} v_{α β}^{0} (t, z) = 0 ≜ \tilde{v} (t, z),

(65)

for all

(t, z) \in [t_{0}, t_{f}) \times R^{m}

. □

Remark 7.

Due to (40), the gain (63) of the limit feedback

\tilde{u} (t, z)

is infinite for

t \to t_{f}

:

lim_{t \to t_{f}} | | \tilde{K} (t) | | = \infty,

(66)

where

| | \cdot | |

is the Euclidean norm of a matrix.

Remark 8.

The limit motion

\tilde{z} (t)

given in (53) is generated by the limit feedback strategies

\tilde{u} (t, z)

and

\tilde{v} (t, z))

given in (62) and (65), respectively. Moreover, their time realizations along

\tilde{z} (t)

are equal to

\tilde{u} (t)

and

\tilde{v} (t)

given in (58) and (60), respectively:

\tilde{u} (t, \tilde{z} (t)) = \tilde{u} (t), \tilde{v} (t, \tilde{z} (t)) = \tilde{v} (t) .

(67)

Proposition 5.

Let Condition S hold. Then for any

β > 0

, the RCCDG game value satisfies

lim_{α \to 0} J_{α β}^{0} (t_{0}, z_{0}) = 0 .

(68)

Moreover, all the terms of the optimal cost function (25) tend to zero for

α \to 0

:

lim_{α \to 0} {| z_{α β}^{0} (t_{f}) |}^{2} = 0,

(69)

lim_{α \to 0} (α \int_{t_{0}}^{t_{f}} {| u_{α β}^{0} (t) |}^{2} d t) = 0,

(70)

lim_{α \to 0} (β \int_{t_{0}}^{t_{f}} {| v_{α β}^{0} (t) |}^{2} d t) = 0 .

(71)

Proof.

By factoring

\frac{1}{α}

from the matrix

R_{α β} (t)

,

J_{α β}^{0} (t_{0}, z_{0}) = α z_{0}^{T} {(α I_{m} + G_{1} (t_{0}) - \frac{α}{β} \int_{t_{0}}^{t_{f}} H_{2} (τ) H_{2}^{T} (τ) (τ) d τ)}^{- 1} z_{0} .

(72)

Since the matrix

G_{1} (t_{0})

is non-singular, (72) directly leads to (68).

The limiting Equation (69) is the consequence of (55); (70) holds, because, due to Proposition 3, the limit time realization of the minimizer’s optimal strategy is bounded; (71) follows from (60). □

Corollary 1.

Let Condition S hold. Then,

J_{best}^{0} (t_{0}, z_{0}) = 0 .

(73)

Proof.

First of all, let us note that, due to Remark 6, the matrix

G_{1} (t_{0})

is positive definite. Therefore, using (72), we can conclude the following. There exists a positive number

α^{0} \leq \tilde{α}

such that, for all

α \in (0, α^{0}]

,

J_{α β}^{0} (t_{0}, z_{0}) \geq 0 .

(74)

This inequality, along with the equality (68), directly yields the statement of the corollary. □

4. RSDG Solution

Lemma 1.

Let Condition S hold. Then, there exists a positive number

α_{0} < \tilde{α}

, such that for all

α \in (0, α_{0}]

the guaranteed result

J_{β}^{u} (u_{α β}^{0} (\cdot); t_{0}, z_{0})

of the pursuer’s state-feedback control

u_{α β}^{0} (t, z)

in the RSDG satisfies the inequality

0 \leq J_{β}^{u} (u_{α β}^{0} (\cdot); t_{0}, z_{0}) \leq a α,

(75)

where

a > 0

is some value independent of α.

Proof.

First of all, let us remember that

u_{α β}^{0} (t, z)

is the optimal pursuer’s control in the RCCDG, and this control is given by Equation (33). Taking into account Remark 4 and Equation (26), the guaranteed result of this control in the RSDG is calculated as follows:

\begin{matrix} J_{β}^{u} (u_{α β}^{0} (\cdot); t_{0}, z_{0}) = sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} J_{β} (u_{α β}^{0} (\cdot), v (\cdot)) \\ = sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} (| z (t_{f} |^{2} - β \int_{t_{0}}^{t_{f}} | v (t) |^{2} d t) \end{matrix}

(76)

along trajectories of the system

\dot{z} = H_{1} (t) u_{α β}^{0} (t, z) + H_{2} (t) v (t), t \in [t_{0}, t_{f}], z (0) = z_{0} .

(77)

For any

v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})

, we have the inequality

| z (t_{f}) |^{2} - β \int_{t_{0}}^{t_{f}} | v (t) |^{2} d t \leq | z (t_{f} |^{2} + α \int_{t_{0}}^{t_{f}} | u_{α β}^{0} (t, z) |^{2} d t - β \int_{t_{0}}^{t_{f}} {| v (t) |}^{2} d t

(78)

along trajectories of the system (77). Therefore,

\begin{matrix} 0 \leq sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} (| z (t_{f} |^{2} - β \int_{t_{0}}^{t_{f}} | v (t) |^{2} d t) \leq \\ sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} (| z (t_{f} |^{2} + α \int_{t_{0}}^{t_{f}} | u_{α β}^{0} (t, z) |^{2} d t - β \int_{t_{0}}^{t_{f}} | v (t) |^{2} d t) . \end{matrix}

(79)

Since

u_{α β}^{0} (t, z)

is the optimal state-feedback control in the RCCDG, then using the form of the cost function in this game (see Equation (25)) and the definition of the value in this game (see Remark 3), we directly have

sup_{v (t) \in L_{2} ([t_{0}, t_{f}], R^{s})} (| z (t_{f} |^{2} + α \int_{t_{0}}^{t_{f}} | u_{α β}^{0} (t, z) |^{2} d t - β \int_{t_{0}}^{t_{f}} | v (t) |^{2} d t) = J_{α β}^{0} (t_{0}, z_{0}) .

(80)

Remember that

J_{α β}^{0} (t_{0}, z_{0})

is the RCCDG value given by Equation (32).

Further, using Equations (76), (80) and the inequality (79), we obtain immediately

0 \leq J_{β}^{u} (u_{α β}^{0} (\cdot); t_{0}, z_{0}) \leq J_{α β}^{0} (t_{0}, z_{0}) .

(81)

Now, the statement of the lemma directly follows from Equation (72) and the inequality (81). □

Consider the following admissible state-feedback control of the maximizing player (the evader) in the RSDG:

{\bar{v}}^{0} (t, z) \equiv 0, (t, z) \in [t_{0}, t_{f}] \times R^{m} .

(82)

Lemma 2.

Let Condition S hold. Then, the guaranteed result

J_{β}^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0})

of

{\bar{v}}^{0} (t, z)

in the RSDG is

J^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0}) = 0 .

(83)

Proof.

Substituting

v (t) = {\bar{v}}^{0} (t, z)

into the system (22) and the cost function (26) yields the following system and cost function:

\dot{z} = H_{1} (t) u, z (t_{0}) = z_{0}, t \in [t_{0}, t_{f}],

(84)

\bar{J} (u (\cdot)) = J_{β} (u (\cdot), {\bar{v}}^{0} (\cdot)) = {| z (t_{f}) |}^{2} .

(85)

Therefore,

J^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0})

is the infimum value with respect to

u (t) \in L_{2} ([t_{0}, t_{f}], R^{r})

of the cost function (85) along trajectories of the system (84), i.e.,

J^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0}) = inf_{u (\cdot) \in L_{2} ([t_{0}, t_{f}], R^{r})} \bar{J} (u (\cdot)) .

(86)

The optimal control problem (84) and (85) is singular (see, e.g., [3]), and the value (86) can be derived similarly to this work. To do this, first, we replaced approximately the singular problem (84) and (85) with the regular optimal control problem consisting of the system (84) and the new cost function

{\bar{J}}_{α} (u (\cdot)) \overset{▵}{=} | z (t_{f}) |^{2} + α \int_{t_{0}}^{t_{f}} {| u (t) |}^{2} d t

(87)

to be minimized by

u (\cdot) \in L_{2} ([t_{0}, t_{f}], R^{r})

along trajectories of the system (84). In (87),

α > 0

is a small parameter of the regularization.

For any given

α > 0

, the problem in (84), (87) is a linear-quadratic optimal control problem. By virtue of the results of [25], we directly have that the solution (the optimal control) of this problem is

{\bar{u}}_{α}^{0} (t) = - (1 / α) H_{1}^{T} (t) {\bar{R}}_{α} (t) {\bar{z}}_{α} (t)

, and the optimal value of its function has the form

{\bar{J}}_{α}^{0} = {\bar{J}}_{α} ({\bar{u}}_{α}^{0} (\cdot)) = z_{0}^{T} {\bar{R}}_{α} (t_{0}) z_{0},

(88)

where the

m \times m

-matrix-valued function

{\bar{R}}_{α} (t)

is the solution of the terminal-value problem

{\dot{\bar{R}}}_{α} = \frac{1}{α} {\bar{R}}_{α} H_{1} (t) H_{1}^{T} (t) {\bar{R}}_{α}, t \in [t_{0}, t_{f}], {\bar{R}}_{α} (t_{f}) = I_{m},

(89)

the vector-valued function

{\bar{z}}_{α} (t)

is the solution of the initial-value problem

{\dot{\bar{z}}}_{α} = - \frac{1}{α} H_{1} (t) H_{1}^{T} (t) {\bar{R}}_{α} (t) {\bar{z}}_{α}, t \in [t_{0}, t_{f}], z (t_{0}) = z_{0} .

(90)

Using Remark 6, we obtain the unique solution of the problem (89) as follows:

{\bar{R}}_{α} (t) = {(I_{m} + \frac{1}{α} G_{1} (t))}^{- 1}, t \in [t_{0}, t_{f}],

(91)

where the

m \times m

-matrix-valued function

G_{1} (t)

is given in Remark 6 (see (40) for

t \in [t_{0}, t_{f}]

).

Substituting (91) into (88), we obtain after some rearrangement

{\bar{J}}_{α}^{0} = α z_{0}^{T} {(α I_{m} + G_{1} (t_{0})}^{- 1} z_{0},

(92)

yielding the following inequality for all sufficiently small

α > 0

:

0 \leq {\bar{J}}_{α}^{0} \leq c α,

(93)

where

c > 0

is some value independent of

α

.

Using Equation (88) and inequality (93), we obtain for all sufficiently small

α > 0

:

\begin{matrix} 0 \leq inf_{u (\cdot) \in L_{2} ([t_{0}, t_{f}], R^{r})} \bar{J} (u (\cdot)) \leq \bar{J} ({\bar{u}}_{α}^{0} (\cdot)) \leq {\bar{J}}_{α} ({\bar{u}}_{α}^{0} (\cdot)) = {\bar{J}}_{α}^{0} \leq c α, \end{matrix}

yielding

\begin{matrix} 0 \leq inf_{u (\cdot) \in L_{2} ([t_{0}, t_{f}], R^{r})} \bar{J} (u (\cdot)) \leq c α . \end{matrix}

The latter implies immediately

inf_{u (\cdot) \in L_{2} ([t_{0}, t_{f}], R^{r})} \bar{J} (u (\cdot)) = 0

which, along with Equation (86), proves the statement of the lemma. □

Theorem 1.

Let Condition S hold. Then, the RSDG value

J_{β}^{*} (t_{0}, z_{0})

exists and

J_{β}^{*} (t_{0}, z_{0}) = 0 .

(94)

Proof.

Let

J_{β}^{u *} (t_{0}, z_{0})

and

J_{β}^{v *} (t_{0}, z_{0})

be the upper and lower values of the RSDG, respectively. Then, due to the definitions of these values (see Remark 4), we have

J_{β}^{u *} (t_{0}, z_{0}) \leq J_{β}^{u} (u_{α β}^{0} (\cdot); t_{0}, z_{0}), α \in (0, α_{0}],

(95)

J_{β}^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0}) \leq J_{β}^{v *} (t_{0}, z_{0}),

(96)

J_{β}^{v *} (t_{0}, z_{0}) \leq J_{β}^{u *} (t_{0}, z_{0}) .

(97)

Now, using the equality (83) and the inequalities (75), (95)–(97) yield

\begin{matrix} 0 = J_{β}^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0}) \leq J_{β}^{v *} (t_{0}, z_{0}) \leq J_{β}^{u *} (t_{0}, z_{0}) \\ \leq J_{β}^{u} (u_{α β}^{0} (\cdot); t_{0}, z_{0}) \leq a α, α \in (0, α_{0}] . \end{matrix}

(98)

The latter implies

0 \leq J_{β}^{v *} (t_{0}, z_{0}) \leq J_{β}^{u *} (t_{0}, z_{0}) \leq a α, α \in (0, α_{0}] .

(99)

From (99), for

α \to 0

, we directly have

J_{β}^{v *} (t_{0}, z_{0}) = J_{β}^{u *} (t_{0}, z_{0}) = 0

, which proves the theorem. □

Corollary 2.

Let Condition S hold. Then,

J_{best}^{0} (t_{0}, z_{0}) = J_{β}^{*} (t_{0}, z_{0}) .

(100)

Proof.

The statement of the corollary directly follows from Theorem 1 and Equation (73). □

Corollary 3.

Let Condition S hold. Then, the limit Equality (31) is valid.

Proof.

The statement of the corollary is a direct consequence of Equations (68) and (94). □

By

{α_{k}}_{k = 1}^{+ \infty}

, we denote a sequence of numbers, satisfying the following conditions: (I)

α_{k} \in (0, α_{0}]

,

(k = 1, 2, . . .)

; (II)

{lim}_{k \to + \infty} α_{k} = 0

.

Theorem 2.

Let Condition S hold. Then, the sequence of the pursuer’s state-feedback controls

{\{u_{α_{k} β}^{0} (t, z)\}}_{k = 1}^{+ \infty}

is the minimizing sequence in the RSDG. The state-feedback control

{\bar{v}}^{0} (t, z)

, given by (82), is the optimal evader’s strategy in the RSDG.

Proof.

From the chain of the equality and the inequalities (98) we obtain

lim_{k \to + \infty} J_{β}^{u} (u_{α_{k} β}^{0} (\cdot); t_{0}, z_{0}) = J_{β}^{u *} (t_{0}, z_{0}),

(101)

meaning the validity of the first statement of the theorem.

Similarly, we have

J_{β}^{v} ({\bar{v}}^{0} (\cdot); t_{0}, z_{0}) = J_{β}^{v *} (t_{0}, z_{0}),

(102)

which implies the validity of the second statement of the theorem. □

Remark 9.

It should be noted that the optimal evader’s strategy

{\bar{v}}^{0} (t, z)

in the RSDG coincides with the limit (as

α \to 0

) of the optimal evader’s strategy in the RCCDG for all

(t, z) \in [t_{0}, t_{f}) \times R^{m}

(see Proposition 4 and Equation (65)). Also, it should be noted that the limit (as

k \to + \infty

) of the minimizing sequence

{\{u_{α_{k} β}^{0} (t, z)\}}_{k = 1}^{+ \infty}

in the RSDG is

\bar{u} (t, z)

for all

(t, z) \in [t_{0}, t_{f}) \times R^{m}

(see Proposition 4 and Equations (62) and (63)). However, the function

\bar{u} (t, z)

does not belong to the set

U_{z}

. Therefore, this function does not belong to the set

U_{z}

, i.e., it is not an admissible pursuer’s state-feedback control in the RSDG.

5. Example: Interception Problem in Three-Dimensional Space

5.1. Engagement Model and Its Reduction

Consider the engagement in 3D space of two flying vehicles (the interceptor or the pursuer and the target or the evader), which has similar geometry to that considered in [26,27]. In contrast to [26,27], we assumed that both the pursuer and the evader have first-order dynamics controllers. Two mutually perpendicular control channels could have different time constants:

τ_{p_{1}}

,

τ_{p_{2}}

for the pursuer’s controller and

τ_{e_{1}}

,

τ_{e_{2}}

for the evader’s one.

The equations of motion were written down in the line-of-sight coordinate system where the axis X was the initial line-of-sight, the plane

X Y

was the collision plane determined by the initial line-of-sight and the target’s velocity vectors and the plane

X Z

was normal to

X Y

.

Let

(X_{p}, Y_{p}, Z_{p})

and

(X_{e}, Y_{e}, Z_{e})

be the coordinates of the interceptor (the pursuer) and the target (the evader), respectively. The relative separations in the Y and Z-directions were

Y = Y_{p} - Y_{e}

and

Z = Z_{p} - Z_{e}

. By linearization along the initial line-of-sight, the equations of motion were written down in the form (1) where the state vector was

x = {(Y, \dot{Y}, {\ddot{Y}}_{p}, {\ddot{Y}}_{e}, Z, \dot{Z}, {\ddot{Z}}_{p}, {\ddot{Z}}_{e})}^{T},

(103)

the players’ control vectors (lateral acceleration commands) were

u = {(u_{1}, u_{2})}^{T}

(for the pursuer) and

v = {(v_{1}, v_{2})}^{T}

(for the evader); the final time

t_{f}

was the time of achieving the zero distance between the players along the axis X. The matrices in (1) were

A (t) \equiv [\begin{matrix} 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & - 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & - 1 / τ_{p_{1}} & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & - 1 / τ_{e_{1}} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & - 1 \\ 0 & 0 & 0 & 0 & 0 & 0 & - 1 / τ_{p_{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & - 1 / τ_{e_{2}} \end{matrix}],

(104)

B (t) \equiv [\begin{matrix} 0 & 0 \\ 0 & 0 \\ 1 / τ_{p_{1}} & 0 \\ 0 & 0 \\ 0 & 0 \\ 0 & 0 \\ 0 & 1 / τ_{p_{2}} \\ 0 & 0 \end{matrix}], C (t) \equiv [\begin{matrix} 0 & 0 \\ 0 & 0 \\ 0 & 0 \\ 1 / τ_{e_{1}} & 0 \\ 0 & 0 \\ 0 & 0 \\ 0 & 0 \\ 0 & 1 / τ_{e_{2}} \end{matrix}] .

(105)

In the pursuit problem, the target set was

x_{1} = Y = 0, x_{5} = Z = 0

, meaning that in (2),

D = [\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 \end{matrix}], d = [\begin{matrix} 0 \\ 0 \end{matrix}] .

(106)

Thus, in this example,

n = 8

,

r = s = m = 2

.

The transition matrix of the homogeneous system was readily obtained as

Φ (t_{f}, t) = [\begin{matrix} Φ_{1} (t_{f}, t, τ_{p_{1}}, τ_{e_{1}}) & O_{4} \\ O_{4} & Φ_{1} (t_{f}, t, τ_{p_{2}}, τ_{e_{2}}) \end{matrix}],

(107)

where

O_{4}

is the zero

4 \times 4

matrix,

Φ_{1} (t_{f}, t, τ_{p}, τ_{e}) = [\begin{matrix} 1 & t_{f} - t & - h (t, τ_{p}) & h (t, τ_{e}) \\ 0 & 1 & - τ_{p} (1 - e^{- ϑ (t, τ_{p})}) & τ_{e} (1 - e^{- ϑ (t, τ_{e})}) \\ 0 & 0 & e^{- ϑ (t, τ_{p})} & 0 \\ 0 & 0 & 0 & e^{- ϑ (t, τ_{e})} \end{matrix}],

(108)

ϑ (t, τ) ≜ \frac{t_{f} - t}{τ},

(109)

h (t, τ) ≜ τ^{2} (e^{- ϑ (t, τ)} + ϑ (t, τ) - 1) .

(110)

Then, by applying the transformation (21) with D and d as in (106), the original system was reduced to the two-dimensional system of the form (22), where

H_{1} (t) = [\begin{matrix} - h (t, τ_{p_{1}}) & 0 \\ 0 & - h (t, τ_{p_{2}}) \end{matrix}], H_{2} (t) = [\begin{matrix} h (t, τ_{e_{1}}) & 0 \\ 0 & h (t, τ_{e_{2}}) \end{matrix}] .

(111)

Explicitly, the system (22) became

\begin{matrix} {\dot{z}}_{1} & = & - h (t, τ_{p_{1}}) u_{1} + h (t, τ_{e_{1}}) v_{1}, z_{1} (t_{0}) = z_{0_{1}}, t \in [t_{0}, t_{f}], \\ {\dot{z}}_{2} & = & - h (t, τ_{p_{2}}) u_{2} + h (t, τ_{e_{2}}) v_{2} z_{2} (t_{0}) = z_{0_{2}}, t \in [t_{0}, t_{f}] . \end{matrix}

(112)

5.2. Reduced Cheap Control Game

In this example, the RCCDG cost function (25) is

J_{α β} = z_{1}^{2} (t_{f}) + z_{2}^{2} (t_{f}) + α \int_{t_{0}}^{t_{f}} [u_{1}^{2} (t) + u_{2}^{2} (t)] d t - β \int_{t_{0}}^{t_{f}} [v_{1}^{2} (t) + v_{2}^{2} (t)] d t .

(113)

Due to (111), the gramian (40) is calculated as

G_{1} (t) = [\begin{matrix} \int_{t}^{t_{f}} h^{2} (η, τ_{p_{1}}) d η & 0 \\ 0 & \int_{t}^{t_{f}} h^{2} (η, τ_{p_{2}}) d η \end{matrix}],

(114)

and

det G_{1} (t) = (\int_{t}^{t_{f}} h^{2} (η, τ_{p_{1}}) d η) (\int_{t}^{t_{f}} h^{2} (η, τ_{p_{2}}) d η) .

(115)

For all

τ > 0

, we have that

h (t, τ) > 0

,

t \in [t_{0}, t_{f})

, and

h (t_{f}, τ) = 0 .

Therefore, the condition (41), and, consequently, Condition S hold.

Due to the symmetry of the matrices (111), the matrix (37) is also symmetric:

R_{α β} (t) = [\begin{matrix} r_{{α β}_{1}} (t) & 0 \\ 0 & r_{{α β}_{2}} (t) \end{matrix}],

(116)

where

r_{{α β}_{i}} (t) = \frac{1}{1 + \frac{1}{α} \int_{t}^{t_{f}} h^{2} (η, τ_{p_{i}}) d η - \frac{1}{β} \int_{t}^{t_{f}} h^{2} (η, τ_{e_{i}}) d η}, i = 1, 2 .

(117)

Thus, the RCCDG is solvable if

1 + \frac{1}{α} \int_{t}^{t_{f}} h^{2} (η, τ_{p_{i}}) d η - \frac{1}{β} \int_{t}^{t_{f}} h^{2} (η, τ_{e_{i}}) d η > 0, t \in [t_{0}, t_{f}], i = 1, 2 .

(118)

Similarly to [24], it is proved that the solvability condition (118) yields the value

\tilde{α}

in (42) as

\tilde{α} = min {{\tilde{α}}_{1}, {\tilde{α}}_{2}},

(119)

where

{\tilde{α}}_{i} = {\tilde{α}}_{i} (β) = \{\begin{matrix} μ_{i} (β) β, & β < \int_{t_{0}}^{t_{f}} h^{2} (η, τ_{e_{i}}) d η, \\ + \infty, & β \geq \int_{t_{0}}^{t_{f}} h^{2} (η, τ_{e_{i}}) d η, \end{matrix} i = 1, 2,

(120)

μ_{i} (β) = \frac{1}{max_{t \in [t_{0}, {\bar{t}}_{i}]} F_{i} (t, β)}, i = 1, 2,

(121)

F_{i} (t, β) = \frac{\int_{t}^{{\bar{t}}_{i} (β)} h^{2} (η, τ_{e_{i}}) d η}{\int_{t}^{t_{f}} h^{2} (η, τ_{p_{i}}) d η}, i = 1, 2,

(122)

the moments

{\bar{t}}_{i} (β) \in (t_{0}, t_{f})

,

i = 1, 2

, satisfy

\int_{{\bar{t}}_{i} (β)}^{t_{f}} h^{2} (η, τ_{e_{i}}) d η = β, i = 1, 2 .

(123)

By using (32)–(34) and (116), the solution of the game (112) and (113) is

J_{α β}^{0} (t_{0}, z_{0}) = r_{1} (t_{0}) z_{0_{1}}^{2} + r_{2} (t_{0}) z_{0_{2}}^{2},

(124)

u_{α β}^{0} (t, z) = - \frac{1}{α} {(h (t, τ_{p_{1}}) r_{1} (t) z_{1}, h (t, τ_{p_{2}}) r_{2} (t) z_{2})}^{T},

(125)

v_{α β}^{0} (t, z) = \frac{1}{β} {(h (t, τ_{e_{1}}) r_{1} (t) z_{1}, h (t, τ_{e_{2}}) r_{2} (t) z_{2})}^{T} .

(126)

Let us consider the numerical example for

t_{0} = 0

s,

t_{f} = 3

s,

β = 0.1

,

τ_{p_{1}} = τ_{p_{2}} = 0.1

s,

τ_{e_{1}} = 0.15

s,

τ_{e_{2}} = 0.2

s. For these parameters,

β = 0.1 < \int_{t_{0}}^{t_{f}} h^{2} (η, τ_{e_{1}}) d η = \int_{0}^{3} h^{2} (η, 0.15) d η = 0.1737,

(127)

β = 0.1 < \int_{t_{0}}^{t_{f}} h^{2} (η, τ_{e_{2}}) d η = \int_{0}^{3} h^{2} (η, 0.2) d η = 0.293 .

(128)

In this example, the moments, defined by (123), are

{\bar{t}}_{1} = 0.4792

s,

{\bar{t}}_{2} = 0.8443

s (see Figure 1).

In Figure 2, the functions

F_{i} (t, β)

, given by (122), are shown for

t \in [t_{0}, {\bar{t}}_{i}]

,

i = 1, 2

. It is seen that these functions were decreasing. Therefore,

μ_{i} = \frac{1}{F_{1} (0, β)} = 1.1035, μ_{2} = \frac{1}{F_{2} (0, β)} = 0.4214 .

(129)

Due to (119) and (120),

\tilde{α} = β min {μ_{1}, μ_{2}} = 0.04214

.

In Figure 3 and Figure 4, the components of the optimal trajectories

z_{α β}^{0} (t)

are shown for decreasing values of

α < \tilde{α}

, along with the components of the corresponding limiting function

\tilde{z} (t)

. It is clearly seen that the optimal trajectories tended to

\tilde{z} (t)

for

α \to 0

, and

z_{α β}^{0} (t_{f})

tended to zero.

The respective components of time realizations of the optimal strategies

u_{α β}^{0} (\cdot)

and

v_{α β}^{0} (\cdot)

, along with the components of the corresponding limiting functions

\tilde{u} (t)

and

\tilde{v} (t)

, are depicted in Figure 5, Figure 6, Figure 7 and Figure 8, respectively. It is seen that the time realizations of the optimal strategies tended to the corresponding limiting functions for

α \to 0

, remaining bounded.

The game value

J_{α β}^{0} (t_{0}, z_{0})

is depicted in Figure 9 as a function of

α

. It is seen that it tended to zero for

α \to 0

.

The respective terminal and integral terms of the cost function are shown in Figure 10 and Figure 11, respectively. It is seen that all components of the optimal cost tended to zero for

α \to 0

.

Remark 10.

From Equation (125), it was seen that the small control cost of the interceptor yielded the high gain in its optimal state-feedback control. This important feature of the interceptor’s optimal state-feedback control increased considerably the ability of the interceptor to capture the target. One more important feature of the interceptor’s optimal state-feedback control was that the time realization of this control along the optimal interception’s trajectory and, especially, the trajectory itself, were bounded while the small parameter α tended to zero. Both aforementioned features of the interceptor’s state-feedback control, obtained by solution of the cheap control game, were extremely important in various real-life situations of a capture of a maneuverable flying target by a maneuverable flying interceptor. It should be noted that if the small control cost of the interceptor tended to zero, the ability of the interceptor to capture the target increased tending to the best achievable result, which was the zero-miss distance at the end of the interception.

6. Conclusions

In this paper, a pursuit-evasion problem, modeled by a finite-horizon linear-quadratic zero-sum differential game, was considered. In the game’s cost function, the penalty coefficient for the minimizing player’s control expenditure was a small value

α > 0

. Thus, the considered game was a zero-sum differential game with a cheap control of the minimizing player. By the proper state transformation, the initially formulated game was converted to a smaller Euclidean dimension differential game, called the reduced game. This game, also was a cheap control game and it was treated in the sequel of the paper. Due to the game’s solvability conditions, the solution of the reduced cheap control game was converted to the solution of the terminal-value problem for the matrix Riccati differential equation. Sufficient condition for the existence of the solution to this terminal-value problem in the entire interval of the game’s duration was presented, and the solution of this terminal-value problem was obtained. Using this solution, the value of the reduced cheap control game, as well as the optimal state-feedback controls of the minimizing player (the pursuer) and the maximizing player (the evader), were derived. The trajectory of the game, generated by the optimal players’ state-feedback controls, (the optimal trajectory), was obtained. The limits of the optimal trajectory, as well as of the time realizations of the players’ optimal state-feedback controls along the optimal trajectory, for

α \to 0

were calculated. By this calculation, the boundedness of the optimal trajectory and the corresponding time realizations of the players’ optimal state-feedback controls for

α \to 0

were shown. The limit of the game value for

α \to 0

also was calculated, yielding the best achievable game value from the pursuer’s viewpoint. Along with the cheap control game, its degenerate version was considered. This version was obtained from the cheap control game by setting there formally

α = 0

, yielding the new zero-sum linear-quadratic pursuit-evasion game. This new game was singular, because it could not be solved either by the Isaacs’s MinMax principle or by the Bellman–Isaacs equation method. For this singular game, the notion of the pursuer’s minimizing sequence of state-feedback controls (instead of the pursuer’s optimal state-feedback control) was proposed. It was established that the

α

-dependent pursuer’s optimal state-feedback control in the cheap control game constituted the pursuer’s minimizing sequence of state-feedback controls (as

α \to 0

) in the singular game. It was shown that the limit of this minimizing sequence was not an admissible pursuer’s state-feedback control in the singular game. However, the evader’s optimal state-feedback control and the value of the singular game coincided with the limits (for

α \to 0

) of the evader’s optimal state-feedback control and the value, respectively, of the cheap control game. Based on the theoretical results of the paper, the interception problem in 3D space, modeled by a zero-sum linear-quadratic game with the eight-dimensional dynamics, was studied. Similarly to the theoretical part of the paper, the case of the small penalty coefficient

α > 0

for the pursuer’s (interceptor’s) control expenditure in the cost function was considered. By proper linear state transformation, the original cheap control game was reduced to the new cheap control game with the two-dimensional dynamics. The asymptotic behaviour of the solution to this new game for

α \to 0

was analyzed.

Author Contributions

The authors contributed equally to this article. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bell, D.J.; Jacobson, D.H. Singular Optimal Control Problems; Academic Press: Cambridge, MA, USA, 1975. [Google Scholar]
Kurina, G.A. A degenerate optimal control problem and singular perturbations. Soviet Math. Dokl. 1977, 18, 1452–1456. [Google Scholar]
Glizer, V.Y. Stochastic singular optimal control problem with state delays: Regularization, singular perturbation, and minimizing sequence. SIAM J. Control Optim. 2012, 50, 2862–2888. [Google Scholar] [CrossRef]
Shinar, J.; Glizer, V.Y.; Turetsky, V. Solution of a singular zero-sum linear-quadratic differential game by regularization. Int. Game Theory Rev. 2014, 16, 1–32. [Google Scholar] [CrossRef]
Kwakernaak, H.; Sivan, R. The maximally achievable accuracy of linear optimal regulators and linear optimal filters. IEEE Trans. Autom. Control 1972, 17, 79–86. [Google Scholar] [CrossRef] [Green Version]
Braslavsky, J.H.; Seron, M.M.; Mayne, D.Q.; Kokotović, P.V. Limiting performance of optimal linear filters. Automatica 1999, 35, 189–199. [Google Scholar] [CrossRef]
Seron, M.M.; Braslavsky, J.H.; Kokotović, P.V.; Mayne, D.Q. Feedback limitations in nonlinear systems: From Bode integrals to cheap control. IEEE Trans. Autom. Control 1999, 44, 829–833. [Google Scholar] [CrossRef] [Green Version]
Kokotović, P.V.; Khalil, H.K.; O’Reilly, J. Singular Perturbation Methods in Control: Analysis and Design; Academic Press: London, UK, 1986. [Google Scholar]
Young, K.D.; Kokotović, P.V.; Utkin, V.I. A singular perturbation analysis of high-gain feedback systems. IEEE Trans. Autom. Control 1977, 22, 931–938. [Google Scholar] [CrossRef]
Moylan, P.J.; Anderson, B.D.O. Nonlinear regulator theory and an inverse optimal control problem. IEEE Trans. Autom. Control 1973, 18, 460–465. [Google Scholar] [CrossRef]
Turetsky, V.; Glizer, V.Y. Robust solution of a time-variable interception problem: A cheap control approach. Int. Game Theory Rev. 2007, 9, 637–655. [Google Scholar] [CrossRef]
Turetsky, V.; Glizer, V.Y.; Shinar, J. Robust trajectory tracking: Differential game/cheap control approach. Int. J. Systems Sci. 2014, 45, 2260–2274. [Google Scholar] [CrossRef]
Turetsky, V.; Shinar, J. Missile guidance laws based on pursuit—Evasion game formulations. Automatica 2003, 39, 607–618. [Google Scholar] [CrossRef]
Turetsky, V. Upper bounds of the pursuer control based on a linear-quadratic differential game. J. Optim. Theory Appl. 2004, 121, 163–191. [Google Scholar] [CrossRef]
Petersen, I.R. Linear-quadratic differential games with cheap control. Syst. Control Lett. 1986, 8, 181–188. [Google Scholar] [CrossRef]
Glizer, V.Y. Asymptotic solution of zero-sum linear-quadratic differential game with cheap control for the minimizer. NoDEA Nonlinear Diff. Equ. Appl. 2000, 7, 231–258. [Google Scholar] [CrossRef]
Vasil’eva, A.B.; Butuzov, V.F.; Kalachev, L.V. The Boundary Function Method for Singular Perturbation Problems; SIAM Books: Philadelphia, PA, USA, 1995. [Google Scholar]
Bryson, A.; Ho, Y. Applied Optimal Control; Hemisphere: New York, NY, USA, 1975. [Google Scholar]
Zhukovskii, V.I. Analytic design of optimum strategies in certain differential games. I. Autom. Remote Control 1970, 4, 533–536. [Google Scholar]
Krasovskii, N.N.; Subbotin, A.I. Game-Theoretical Control Problems; Springer: New York, NY, USA, 1988. [Google Scholar]
Basar, T.; Olsder, G.J. Dynamic Noncooperative Game Theory; Academic Press: London, UK, 1992. [Google Scholar]
Petrosyan, L.A.; Zenkevich, N.A. Game Theory; World Scientific Publishing Company: Singapore, 2016. [Google Scholar]
Isaacs, R. Differential Games; John Wiley: New York, NY, USA, 1965. [Google Scholar]
Turetsky, V. Robust route realization by linear-quadratic tracking. J. Optim. Theory Appl. 2016, 170, 977–992. [Google Scholar] [CrossRef]
Kalman, R.E. Contributions to the Theory of Optimal Control. Bol. Soc. Mat. Mex. 1960, 5, 102–119. [Google Scholar]
Shinar, J.; Gutman, S. Three-Dimensional Optimal Pursuit and Evasion with Bounded Controls. IEEE Trans. Autom. Control 1980, 25, 492–496. [Google Scholar] [CrossRef]
Shinar, J.; Medinah, M.; Biton, M. Singular surfaces in a linear pursuit-evasion game with elliptical vectograms. J. Optim. Theory Appl. 1984, 43, 431–458. [Google Scholar] [CrossRef]

Figure 1. Moments

{\bar{t}}_{i} (β)

.

Figure 1. Moments

{\bar{t}}_{i} (β)

.

Figure 2. Functions

F_{i} (t, β)

.

Figure 2. Functions

F_{i} (t, β)

.

Figure 3. Trajectories

z_{{α β}_{1}}^{0} (t)

and limiting function

{\tilde{z}}_{1} (t)

.

Figure 3. Trajectories

z_{{α β}_{1}}^{0} (t)

and limiting function

{\tilde{z}}_{1} (t)

.

Figure 4. Trajectories

z_{{α β}_{2}}^{0} (t)

and limiting function

{\tilde{z}}_{2} (t)

.

Figure 4. Trajectories

z_{{α β}_{2}}^{0} (t)

and limiting function

{\tilde{z}}_{2} (t)

.

Figure 5. Time realizations

u_{{α β}_{1}}^{0} (t)

and limiting function

{\tilde{u}}_{1} (t)

.

Figure 5. Time realizations

u_{{α β}_{1}}^{0} (t)

and limiting function

{\tilde{u}}_{1} (t)

.

Figure 6. Time realizations

u_{{α β}_{2}}^{0} (t)

and limiting function

{\tilde{u}}_{2} (t)

.

Figure 6. Time realizations

u_{{α β}_{2}}^{0} (t)

and limiting function

{\tilde{u}}_{2} (t)

.

Figure 7. Time realizations

v_{{α β}_{1}}^{0} (t)

and limiting function

{\tilde{v}}_{1} (t)

.

Figure 7. Time realizations

v_{{α β}_{1}}^{0} (t)

and limiting function

{\tilde{v}}_{1} (t)

.

Figure 8. Time realizations

v_{{α β}_{2}}^{0} (t)

and limiting function

{\tilde{v}}_{2} (t)

.

Figure 8. Time realizations

v_{{α β}_{2}}^{0} (t)

and limiting function

{\tilde{v}}_{2} (t)

.

Figure 9. The game value.

Figure 10. The terminal term of the cost function.

Figure 11. Integral terms of the cost function.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Turetsky, V.; Glizer, V.Y. Cheap Control in a Non-Scalarizable Linear-Quadratic Pursuit-Evasion Game: Asymptotic Analysis. Axioms 2022, 11, 214. https://0-doi-org.brum.beds.ac.uk/10.3390/axioms11050214

AMA Style

Turetsky V, Glizer VY. Cheap Control in a Non-Scalarizable Linear-Quadratic Pursuit-Evasion Game: Asymptotic Analysis. Axioms. 2022; 11(5):214. https://0-doi-org.brum.beds.ac.uk/10.3390/axioms11050214

Chicago/Turabian Style

Turetsky, Vladimir, and Valery Y. Glizer. 2022. "Cheap Control in a Non-Scalarizable Linear-Quadratic Pursuit-Evasion Game: Asymptotic Analysis" Axioms 11, no. 5: 214. https://0-doi-org.brum.beds.ac.uk/10.3390/axioms11050214

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cheap Control in a Non-Scalarizable Linear-Quadratic Pursuit-Evasion Game: Asymptotic Analysis

Abstract

1. Introduction

2. Preliminaries and Problem Statement

2.1. Cheap Control Differential Game

2.2. Singular (Degenerate) Differential Game

2.3. Reduction of the Games

2.4. Objectives of the Paper

3. The RCCDG Solution and Its Asymptotic Properties

4. RSDG Solution

5. Example: Interception Problem in Three-Dimensional Space

5.1. Engagement Model and Its Reduction

5.2. Reduced Cheap Control Game

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI