
Necessary Optimality Conditions for a Class of Control Problems with State Constraint

Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, AGH University of Science and Technology, al. Mickiewicza 30, 30-059 Kraków, Poland
* Author to whom correspondence should be addressed.
Submission received: 23 November 2020 / Revised: 7 January 2021 / Accepted: 13 January 2021 / Published: 18 January 2021
(This article belongs to the Special Issue Optimal Control Theory)

Abstract

An elementary approach to a class of optimal control problems with a pathwise state constraint is proposed. Based on spike variations of control, it yields simple proofs and constructive necessary conditions, including some new characterizations of optimal control. Two examples are discussed.

1. Introduction

Necessary optimality conditions for control problems with pathwise state constraints have been widely studied since the beginnings of optimal control theory [1], and this domain of research remains active today [2]. Most of the existing approaches may be divided into two streams [3]. The first one, characterized by the use of classical methods of analysis with often heuristic proofs, yields results of limited generality (see the review [4], and [3]). The other is based on the abstract theory of infinite-dimensional optimization, and its results encompass a wide class of problems, with rigorous but demanding proofs (see [5,6,7,8,9]). However, these results are difficult to verify in practice, because the characterizations of the adjoint variables and multiplier functions are too general. Generally, the existing approaches are hardly constructive, in the sense that they give no sufficient indication of how to improve a nonoptimal control.
We propose an elementary approach to necessary optimality conditions for problems in Mayer form with free final state and a scalar state constraint. The controls are scalar functions, and nontangentiality is assumed at all entry and exit points. As is well known, the proof of the minimum principle with free final state and without pathwise state constraints can be made elementary and simple by considering the cost increment caused by a single spike variation of control. Our first purpose is to show that a similar proof technique may be effective when a pathwise state constraint is present, with the difference that, additionally, a coordinated pair of spikes is used. A second purpose is to extend the known results for state constraints of index one, mainly to nonregular problems in which the optimal control and the corresponding state trajectory may at the same time take values on the boundaries of their respective admissible sets. In particular, we allow for discrete sets of admissible control values. From a conceptual point of view, this work also offers a clear geometrical interpretation of the results. On the practical side, an advantage of our approach is that the obtained conditions are readily verifiable and constructive: if they are not fulfilled, a gradient optimization procedure can be indicated and initialized which guarantees an improvement of the control, up to numerical precision (as in the method of Monotone Structural Evolution [10]). Of course, the other approaches clearly prevail in a wider perspective, when problems of greater complexity are also taken into account. They then produce optimality conditions that can be effectively used in optimal control computations (see [11,12,13,14]).
Consider a control system described by a state equation
$$\dot x(t) = f(x(t), u(t)), \qquad t \in [0,T], \qquad x(0) = x_0, \qquad x(t) \in \mathbb{R}^n, \tag{1}$$
with a given initial condition $x_0$ and a given time horizon T. The controls $u : [0,T] \to \mathbb{R}$ are piecewise continuous functions of time, taking values in a given set U; that is, they belong to $PC(0,T;U)$.¹ The function $f : \mathbb{R}^n \times \mathbb{R} \to \mathbb{R}^n$ is of class $C^1$ in both of its arguments. We make the general assumption that all solutions of (1) appearing in the sequel are well defined on the whole time interval $[0,T]$. The state is subject to a scalar pathwise constraint,
$$g(x(t)) \le 0, \qquad t \in [0,T]. \tag{2}$$
The function $g : \mathbb{R}^n \to \mathbb{R}$ is of class $C^2$, and $\nabla g(\xi) \ne 0$ if $g(\xi) = 0$. We assume $g(x_0) \le 0$. A performance index (or cost),
$$Q(u) = q(x(T)),$$
is minimized on the trajectories of (1). The function $q : \mathbb{R}^n \to \mathbb{R}$ is of class $C^1$.
For a control $u \in PC(0,T;U)$, let x be the corresponding solution of the initial value problem (1). The control u is admissible if the trajectory x satisfies the state constraint (2). The control u is optimal if it is admissible and minimizes the cost Q over the set of all admissible controls. A boundary interval of u is defined as any nonempty, right-open interval of time in which $g(x(t)) \equiv 0$. Any nonempty, right-open interval of time such that $g(x(t)) < 0$ for every t in that interval is nonboundary. If $[t_1, t_2[$ is an inclusion-maximal boundary interval of u and $t_1 > 0$, then $t_1$ is called an entry point of u. If $t_2 < T$, then $t_2$ is an exit point. Denote
$$\dot g(\xi, v) = \nabla g(\xi)^{\mathrm T} f(\xi, v) \qquad \text{for } \xi \in \mathbb{R}^n, \ v \in \mathbb{R}. \tag{3}$$
The derivative of the function $t \mapsto g(x(t))$ along the trajectories of (1) is equal to $\dot g(x(t), u(t))$.
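This is a one-line verification, combining the chain rule with the state equation (1):
$$\frac{\mathrm d}{\mathrm d t}\, g(x(t)) = \nabla g(x(t))^{\mathrm T}\, \dot x(t) = \nabla g(x(t))^{\mathrm T}\, f(x(t), u(t)) = \dot g(x(t), u(t)).$$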
For admissible controls we introduce the concept of verifiability, aiming to distinguish the controls to which the spike technique of (non)optimality verification, developed below, can be effectively applied. Let u be an admissible control with the corresponding state trajectory x. We call this control verifiable if
(i)
it has a finite number of inclusion-maximal boundary intervals,
(ii)
if $g(x(t)) = 0$ for a certain t, then t belongs to the closure of some boundary interval of u,²
(iii)
the conditions of nontangentiality
$$\dot g(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-)) > 0, \qquad \dot g(x(t_{\mathrm{ex}}), u(t_{\mathrm{ex}})) < 0 \tag{4}$$
hold at all entry points $t_{\mathrm{en}}$ and all exit points $t_{\mathrm{ex}}$ of u,
(iv)
there is an open set $X_u \subset \mathbb{R}^n$ containing all points $x(t)$ such that $g(x(t)) = 0$, and there is a $C^1$ function $w : X_u \to \mathbb{R}$ such that
$$\big(w(\xi) \in U \ \text{ and } \ \dot g(\xi, w(\xi)) = 0\big) \qquad \text{if} \qquad \big(\xi \in X_u \ \text{ and } \ g(\xi) \le 0\big).$$
Note that if (4) is true, then the functions u and $t \mapsto \dot g(x(t), u(t))$ are discontinuous at $t_{\mathrm{en}}$ and $t_{\mathrm{ex}}$. Claim (iv) may be regarded as a weakened form of the assumption that the state constraint (2) is of index (or order) one (cf. [3,4,8]). From the implicit function theorem it follows that if $\xi \in X_u$, $g(\xi) \le 0$ and $\partial_2 \dot g(\xi, w(\xi)) \ne 0$, then
$$\nabla w(\xi) = -\frac{\partial_1 \dot g(\xi, w(\xi))}{\partial_2 \dot g(\xi, w(\xi))} = -\frac{\partial_1 f(\xi, w(\xi))\, \nabla g(\xi) + \nabla^2 g(\xi)\, f(\xi, w(\xi))}{\nabla g(\xi)^{\mathrm T}\, \partial_2 f(\xi, w(\xi))}.$$
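The first equality is obtained by implicit differentiation: since $\dot g(\xi, w(\xi)) = 0$ identically in the indicated set, the gradient of the left-hand side vanishes,
$$0 = \partial_1 \dot g(\xi, w(\xi)) + \partial_2 \dot g(\xi, w(\xi))\, \nabla w(\xi),$$
and solving for $\nabla w(\xi)$ gives the claimed expression.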
For any verifiable control u, define a function $F : [0,T] \times \mathbb{R}^n \to \mathbb{R}^n$,
$$F(t, \xi) = \begin{cases} f(\xi, w(\xi)), & t \in \Theta_u, \ \xi \in X_u,\\ f(\xi, u(t)), & \text{elsewhere}, \end{cases}$$
where $\Theta_u$ denotes the union of all boundary intervals of u. Obviously, the corresponding state trajectory satisfies $\dot x(t) = F(t, x(t))$ for almost all $t \in [0,T]$.
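As a computational side note, the closed-loop form $\dot x = F(t, x)$ suggests a direct way of simulating a verifiable control. The sketch below is our illustration, not part of the paper's formal development; the functions f, w, the boundary-interval indicator and the open-loop control are problem-specific assumptions supplied by the user.

```python
import numpy as np

# Minimal simulation sketch: x' = F(t, x), with F as defined above.
# `f`, `w`, `in_theta_u` and `u` are problem-specific assumptions.
def simulate(f, w, in_theta_u, u, x0, T, n_steps=10_000):
    dt = T / n_steps
    x = np.array(x0, dtype=float)
    traj = [x.copy()]
    for k in range(n_steps):
        t = k * dt
        v = w(x) if in_theta_u(t) else u(t)  # boundary feedback vs. open loop
        x = x + dt * f(x, v)                 # explicit Euler step
        traj.append(x.copy())
    return np.array(traj)
```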

2. The One-Spike Control Variation and Trajectory Variation

Let u be a verifiable control, and x the corresponding state trajectory. Denote
$$U_t = \begin{cases} \{v \in U : \dot g(x(t), v) < 0\}, & g(x(t)) = 0,\\ U, & g(x(t)) < 0. \end{cases}$$
For any $\tau \in [0,T[$, any $v \in U_\tau$, and any sufficiently small $\varepsilon > 0$, we shall define a control $u_\varepsilon \in PC(0,T;U)$. We also define $x_\varepsilon$ as the solution of the initial value problem
$$\dot x_\varepsilon(t) = f(x_\varepsilon(t), u_\varepsilon(t)), \qquad t \in [0,T], \qquad x_\varepsilon(0) = x_0.$$
We put $u_\varepsilon(t) = u(t)$ if $t < \tau$, and $u_\varepsilon(t) = v$ if $\tau \le t < \tau + \varepsilon$. To define $u_\varepsilon(t)$ for $t \ge \tau + \varepsilon$, suppose first that $\tau \notin \Theta_u$. Then
(i)
$u_\varepsilon(t) = w(x_\varepsilon(t))$ if $g(x_\varepsilon(t_1)) = 0$ for some $t_1 \le t$ and u has no exit points in $[t_1, t]$,
(ii)
$u_\varepsilon(t) = u(t_{\mathrm{en}}^-)$ if $g(x(t)) = 0$ and $g(x_\varepsilon(t)) < 0$, where $t_{\mathrm{en}}$ is the greatest entry point of u less than or equal to t,
(iii)
$u_\varepsilon(t) = u(t)$ otherwise.
Let now τ belong to θ, an inclusion-maximal boundary interval of u, and let $t \ge \tau + \varepsilon$. Then $u_\varepsilon(t) = w(x_\varepsilon(t))$ for $t \in \theta$, and (i), (ii), (iii) are valid for $t \notin \theta$.
The spike variation of control is the difference $u_\varepsilon - u$. Note that the control $u_\varepsilon$ is admissible for every sufficiently small positive ε.
Lemma 1.
The trajectory increment $\Delta x = x_\varepsilon - x$ satisfies
$$\Delta x(t) = \varepsilon\, \delta x(t) + o(\varepsilon) \tag{5}$$
for every $t \in [\tau, T]$, where the trajectory variation $\delta x : [\tau, T] \to \mathbb{R}^n$ is absolutely continuous except, possibly, at the entry points of u, and independent of ε. For almost every $t \in [\tau, T]$,
$$\delta\dot x(t) = \partial_2 F(t, x(t))^{\mathrm T}\, \delta x(t); \tag{6}$$
moreover,
$$\delta x(\tau) = f(x(\tau), v) - f(x(\tau), u(\tau)), \tag{7}$$
and at every entry point $t_{\mathrm{en}} > \tau$ of u,
$$\delta x(t_{\mathrm{en}}^+) = Z(t_{\mathrm{en}})^{\mathrm T}\, \delta x(t_{\mathrm{en}}^-). \tag{8}$$
Here
$$Z(t_{\mathrm{en}}) = I - \frac{\nabla g(x(t_{\mathrm{en}}))\, \Delta f(t_{\mathrm{en}})^{\mathrm T}}{\nabla g(x(t_{\mathrm{en}}))^{\mathrm T}\, \Delta f(t_{\mathrm{en}})}, \qquad \Delta f(t_{\mathrm{en}}) = f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-)) - f(x(t_{\mathrm{en}}), w(x(t_{\mathrm{en}}))). \tag{9}$$
Proof. 
If the control u has no entry points in $]\tau, T[$, the lemma is obviously true by virtue of the classical theorems on ordinary differential equations. Suppose that u has exactly one entry point $t_{\mathrm{en}}$ in $]\tau + \varepsilon, T[$. From the mentioned theorems it directly follows that (5) holds in the time interval $[\tau, t_{\mathrm{en}}[$, with an absolutely continuous function δx satisfying (7) and (6) in that interval. We shall prove that the relationships (5) and (6) may be extended to the whole interval $[\tau, T]$, with the function δx absolutely continuous in $]t_{\mathrm{en}}, T]$. To this end, let us first notice that for every sufficiently small ε > 0 the control $u_\varepsilon$ has an entry point $t_{\mathrm{en}}^\varepsilon = t_{\mathrm{en}} + \varepsilon\, \delta t_{\mathrm{en}} + o(\varepsilon)$, where $\delta t_{\mathrm{en}}$ is a real number independent of ε. This follows from the verifiability of u (see (4)) and from the construction of $u_\varepsilon$. Let $\Delta t_{\mathrm{en}} = t_{\mathrm{en}}^\varepsilon - t_{\mathrm{en}}$. To fix attention, assume $\Delta t_{\mathrm{en}} > 0$. We then have
$$x(t_{\mathrm{en}}^\varepsilon) = x(t_{\mathrm{en}}) + f(x(t_{\mathrm{en}}), w(x(t_{\mathrm{en}})))\, \Delta t_{\mathrm{en}} + o(\Delta t_{\mathrm{en}}),$$
$$x_\varepsilon(t_{\mathrm{en}}^\varepsilon) = x_\varepsilon(t_{\mathrm{en}}) + f(x_\varepsilon(t_{\mathrm{en}}), u_\varepsilon(t_{\mathrm{en}}))\, \Delta t_{\mathrm{en}} + o(\Delta t_{\mathrm{en}}) = x_\varepsilon(t_{\mathrm{en}}) + f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))\, \Delta t_{\mathrm{en}} + o(\Delta t_{\mathrm{en}}),$$
as $f(x_\varepsilon(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-)) = f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-)) + \partial_1 f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))^{\mathrm T}\, \Delta x(t_{\mathrm{en}}) + o(\Delta x(t_{\mathrm{en}}))$. Hence
$$\Delta x(t_{\mathrm{en}}^\varepsilon) = \Delta x(t_{\mathrm{en}}) + \Delta f(t_{\mathrm{en}})\, \Delta t_{\mathrm{en}} + o(\Delta t_{\mathrm{en}}). \tag{10}$$
By the definition of entry points, $g(x(t_{\mathrm{en}})) = g(x_\varepsilon(t_{\mathrm{en}}^\varepsilon)) = 0$. Thus
$$g(x_\varepsilon(t_{\mathrm{en}}^\varepsilon)) = g\big(x(t_{\mathrm{en}}) + \Delta x(t_{\mathrm{en}}) + f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))\, \Delta t_{\mathrm{en}} + o(\Delta t_{\mathrm{en}})\big) = \nabla g(x(t_{\mathrm{en}}))^{\mathrm T} \big(\Delta x(t_{\mathrm{en}}) + f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))\, \Delta t_{\mathrm{en}}\big) + o(\Delta t_{\mathrm{en}}) = 0,$$
and so
$$\Delta t_{\mathrm{en}} = -\frac{\nabla g(x(t_{\mathrm{en}}))^{\mathrm T}\, \Delta x(t_{\mathrm{en}})}{\nabla g(x(t_{\mathrm{en}}))^{\mathrm T}\, f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))} + o(\Delta t_{\mathrm{en}}).$$
Substituting this into (10), we obtain
$$\Delta x(t_{\mathrm{en}}^\varepsilon) = \left(I - \frac{\Delta f(t_{\mathrm{en}})\, \nabla g(x(t_{\mathrm{en}}))^{\mathrm T}}{\nabla g(x(t_{\mathrm{en}}))^{\mathrm T}\, f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))}\right) \Delta x(t_{\mathrm{en}}) + o(\Delta t_{\mathrm{en}}).$$
As $\nabla g(x(t_{\mathrm{en}}))^{\mathrm T}\, f(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-)) = \nabla g(x(t_{\mathrm{en}}))^{\mathrm T}\, \Delta f(t_{\mathrm{en}})$ and $\Delta x(t_{\mathrm{en}}) = \varepsilon\, \delta x(t_{\mathrm{en}}^-) + o(\varepsilon)$, we get
$$\Delta x(t_{\mathrm{en}}^\varepsilon) = \varepsilon\, Z(t_{\mathrm{en}})^{\mathrm T}\, \delta x(t_{\mathrm{en}}^-) + o(\varepsilon).$$
Defining $\delta x(t_{\mathrm{en}}^+)$ by (8), we arrive at the extension of (5) and (6) to $[\tau, T]$, by virtue of the same classical theorems on differential equations. For $\Delta t_{\mathrm{en}} < 0$, an analogous argument leads to the same result. The proof can be easily generalized to an arbitrary finite number of entry points. □

3. The Adjoint Function and the One-Spike Necessary Optimality Condition

As in Section 2, let u be a verifiable control and x the corresponding solution of (1). With every such control we associate an adjoint function $\psi : [0,T] \to \mathbb{R}^n$, defined as a solution of the adjoint equation
$$\dot\psi(t) = -\partial_2 F(t, x(t))\, \psi(t), \tag{11}$$
absolutely continuous except at the entry points of u and satisfying the final condition
$$\psi(T) = \nabla q(x(T)). \tag{12}$$
At every entry point $t_{\mathrm{en}}$, $\psi(t_{\mathrm{en}}) = \psi(t_{\mathrm{en}}^+)$ and
$$\psi(t_{\mathrm{en}}^-) = Z(t_{\mathrm{en}})\, \psi(t_{\mathrm{en}}^+). \tag{13}$$
Let τ be an arbitrary point of $[0,T[$, and δx the trajectory variation determined in Lemma 1. It is easy to notice that the function $t \mapsto \psi(t)^{\mathrm T} \delta x(t)$ is constant on the whole time interval $[\tau, T]$. Indeed, its derivative $\dot\psi(t)^{\mathrm T} \delta x(t) + \psi(t)^{\mathrm T} \delta\dot x(t)$ equals zero at every t where ψ and δx are differentiable, and at the entry points $t_{\mathrm{en}} > \tau$ we have, by virtue of (13) and (8), $\psi(t_{\mathrm{en}}^-)^{\mathrm T} \delta x(t_{\mathrm{en}}^-) = \psi(t_{\mathrm{en}}^+)^{\mathrm T} Z(t_{\mathrm{en}})^{\mathrm T} \delta x(t_{\mathrm{en}}^-) = \psi(t_{\mathrm{en}}^+)^{\mathrm T} \delta x(t_{\mathrm{en}}^+)$. Thus,
$$\psi(\tau)^{\mathrm T} \big(f(x(\tau), v) - f(x(\tau), u(\tau))\big) = \nabla q(x(T))^{\mathrm T}\, \delta x(T).$$
Define the pre-Hamiltonian $H : \mathbb{R}^n \times \mathbb{R}^n \times \mathbb{R} \to \mathbb{R}$, $H(\psi, x, u) = \psi^{\mathrm T} f(x, u)$, and its increment
$$\Delta H(\tau, v) = H(\psi(\tau), x(\tau), v) - H(\psi(\tau), x(\tau), u(\tau)) \tag{14}$$
for any $v \in U$ and $\tau \in [0,T]$ (note that ΔH is only defined for a uniquely predetermined control u). We can now express the value of the cost on the control $u_\varepsilon$ defined in Section 2:
$$Q(u_\varepsilon) = q(x_\varepsilon(T)) = Q(u) + \varepsilon\, \nabla q(x(T))^{\mathrm T}\, \delta x(T) + o(\varepsilon) = Q(u) + \varepsilon\, \Delta H(\tau, v) + o(\varepsilon).$$
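The first-order expansion above is easy to check numerically. The following toy computation is our illustration, not from the paper: a scalar system $\dot x = -x + u$ with $q(x) = \tfrac{1}{2}x^2$, $u \equiv 0$ and no active state constraint (so that $U_t = U$), comparing the finite-difference cost increment with $\Delta H(\tau, v)$.

```python
import numpy as np

T, tau, v, n = 1.0, 0.3, 1.0, 200_000
dt = T / n

def cost(eps):
    # integrate x' = -x + u_eps, with a spike of value v and width eps at tau
    x = 1.0
    for k in range(n):
        t = k * dt
        u = v if tau <= t < tau + eps else 0.0
        x += dt * (-x + u)
    return 0.5 * x * x

# adjoint: psi' = psi (since df/dx = -1), psi(T) = q'(x(T)) = exp(-T)
psi_tau = np.exp(-T) * np.exp(tau - T)
delta_H = psi_tau * v          # H(psi, x, v) - H(psi, x, 0) = psi * v

eps = 1e-3
print((cost(eps) - cost(0.0)) / eps, "should approach", delta_H)
```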
A sufficient condition for the existence of spike variations which improve the cost is a straightforward consequence.
Lemma 2.
Assume that $\tau \in [0,T[$, $v \in U_\tau$, and $\Delta H(\tau, v) < 0$. Then the control $u_\varepsilon$ is admissible and $Q(u_\varepsilon) < Q(u)$ for every sufficiently small ε > 0.
A theorem on optimal control, of the minimum principle type, follows from Lemma 2.
Theorem 1.
Assume that the control u is optimal. Then:
(i)
$\Delta H(t, v) \ge 0$ for every $t \in [0,T]$ and every $v \in U_t$,
(ii)
the function $[0,T] \ni t \mapsto \chi(t) = H(\psi(t), x(t), u(t))$ is constant.
Proof. 
Conclusion (i) is a direct consequence of Lemma 2. Conclusion (ii) for the nonboundary intervals is proved exactly as in the classical proofs of the minimum principle without pathwise state constraints. In the interior of every boundary interval, the function χ is of class $C^1$ with the derivative identically zero. The continuity of χ at entry points readily follows from (13) and (9). Let now $t_{\mathrm{ex}}$ be an exit point of u. By (i) and (4), there is a δ > 0 such that $\Delta H(t, u(t_{\mathrm{ex}})) \ge 0$ for all $t \in [t_{\mathrm{ex}} - \delta, t_{\mathrm{ex}}[$, and $\Delta H(t, w(x(t_{\mathrm{ex}}))) \ge 0$ for all $t \in ]t_{\mathrm{ex}}, t_{\mathrm{ex}} + \delta]$. The continuity of χ at $t_{\mathrm{ex}}$ is shown by limit passages: $t \to t_{\mathrm{ex}}^-$ for the first of these inequalities, and $t \to t_{\mathrm{ex}}^+$ for the second. □
Corollary 1.
Assume that the control u is optimal and $g(x(t)) = 0$ for some $t \in [0,T]$. Then $\Delta H(t, v) \ge 0$ or $\dot g(x(t), v) \ge 0$ for every $v \in U$.

4. The Two-Spike Necessary Optimality Condition

For a verifiable control u and the corresponding state trajectory x, we shall define a two-spike control variation. Let θ be an inclusion-maximal boundary interval of u. For any quintuple $\tau_1, \tau_2, v_1, v_2, \eta$ such that $\tau_1, \tau_2 \in \theta$, $\tau_1 < \tau_2$, $v_1, v_2 \in U$, $\eta > 0$, and for any sufficiently small ε > 0, we define a control $u_\varepsilon \in PC(0,T;U)$ (not to be confused with the control $u_\varepsilon$ defined in Section 2) and the corresponding state trajectory $x_\varepsilon$. We put $u_\varepsilon(t) = u(t)$ if $t < \tau_1$, $u_\varepsilon(t) = v_1$ if $\tau_1 \le t < \tau_1 + \varepsilon$, $u_\varepsilon(t) = v_2$ if $\tau_2 \le t < \tau_2 + \eta\varepsilon$, and $u_\varepsilon(t) = w(x_\varepsilon(t))$ for any other t in θ. Points (i), (ii) and (iii) of the definition in Section 2 apply to all the remaining values of t. The two-spike control variation is the difference $u_\varepsilon - u$.
The control u may sometimes be improved even if it fulfills the necessary optimality condition (i) of Theorem 1. We shall now give conditions sufficient for the existence of a two-spike control variation in θ that is admissible and guarantees a cost improvement.
Lemma 3.
Assume that $\tau_1, \tau_2 \in \theta$, $\tau_1 < \tau_2$, $v_1, v_2 \in U$, $\eta > 0$, and
$$\dot g(x(\tau_1), v_1) < 0, \tag{15}$$
$$\dot g(x(\tau_1), v_1) + \eta\, \dot g(x(\tau_2), v_2) < 0, \tag{16}$$
$$\Delta H(\tau_1, v_1) + \eta\, \Delta H(\tau_2, v_2) < 0. \tag{17}$$
Then for every sufficiently small ε > 0 the control $u_\varepsilon$ is admissible, and $Q(u_\varepsilon) < Q(u)$.
Proof. 
It follows from the definition of $u_\varepsilon$ and the inequalities (15), (16) that for every sufficiently small ε > 0 the function $t \mapsto g(x_\varepsilon(t))$ is negative in the time interval $]\tau_1, \sup\theta]$ and constant in the intervals $[\tau_1 + \varepsilon, \tau_2]$ and $[\tau_2 + \eta\varepsilon, \sup\theta]$:
$$g(x_\varepsilon(t)) = \varepsilon\, \dot g(x(\tau_1), v_1) + o(\varepsilon), \qquad t \in [\tau_1 + \varepsilon, \tau_2],$$
$$g(x_\varepsilon(t)) = \varepsilon\, \big(\dot g(x(\tau_1), v_1) + \eta\, \dot g(x(\tau_2), v_2)\big) + o(\varepsilon), \qquad t \in [\tau_2 + \eta\varepsilon, \sup\theta].$$
From this we infer that the control $u_\varepsilon$ is admissible for all sufficiently small ε > 0. Reasoning similarly as in Section 3 and using the adjoint function defined therein, we estimate the value of the performance index on the control $u_\varepsilon$:
$$Q(u_\varepsilon) = Q(u) + \varepsilon\, \big(\Delta H(\tau_1, v_1) + \eta\, \Delta H(\tau_2, v_2)\big) + o(\varepsilon).$$
Hence, (17) is a sufficient condition for the two-spike control variation to reduce the cost for every sufficiently small ε > 0. □
Lemma 4.
Assume that $\tau_1, \tau_2 \in \theta$, $\tau_1 < \tau_2$, $v_1, v_2 \in U$, and
$$\dot g(x(\tau_1), v_1) < 0, \quad \Delta H(\tau_1, v_1) \ge 0, \qquad \dot g(x(\tau_2), v_2) \ge 0, \quad \Delta H(\tau_2, v_2) < 0. \tag{18}$$
Assume also that if $\dot g(x(\tau_2), v_2) > 0$, then
$$\frac{\Delta H(\tau_1, v_1)}{\dot g(x(\tau_1), v_1)} > \frac{\Delta H(\tau_2, v_2)}{\dot g(x(\tau_2), v_2)}. \tag{19}$$
Under these assumptions there is an η > 0 such that for every sufficiently small ε > 0 the control $u_\varepsilon$ is admissible, and $Q(u_\varepsilon) < Q(u)$.
Proof. 
We shall show that the assumptions of Lemma 3 follow from the assumptions of Lemma 4. Denote
$$\eta_1 = -\frac{\Delta H(\tau_1, v_1)}{\Delta H(\tau_2, v_2)} \qquad \text{and} \qquad \eta_2 = -\frac{\dot g(x(\tau_1), v_1)}{\dot g(x(\tau_2), v_2)} \ \text{ for } \dot g(x(\tau_2), v_2) > 0.$$
Of course $\eta_1 \ge 0$ and $\eta_2 > 0$. The inequality (15) is obvious. In view of (18), (17) is true for every $\eta > \eta_1$. If $\dot g(x(\tau_2), v_2) = 0$, then (16) holds for every η > 0, and the assumptions of Lemma 3 are satisfied. If $\dot g(x(\tau_2), v_2) > 0$, then the inequality (16) holds for $\eta < \eta_2$. We thus have a two-sided bound on η: $\eta_1 < \eta < \eta_2$. The interval of admissible values of η is nonempty if $\eta_1 < \eta_2$, and this inequality follows from (19). □
By contradicting the sufficient nonoptimality conditions of Lemma 4 we obtain new necessary conditions of optimality.
Theorem 2 (main result).
Assume that the control u is optimal and verifiable, and has a boundary interval θ. Let also $t_1, t_2 \in \theta$, $t_1 \le t_2$, and $v_1, v_2 \in U$. Under these assumptions:
(i)
if $\dot g(x(t_1), v_1) < 0$ and $\Delta H(t_1, v_1) = 0$, then $\Delta H(t_2, v_2) \ge 0$,
(ii)
if $\dot g(x(t_2), v_2) = 0$ and $\Delta H(t_2, v_2) < 0$, then $\dot g(x(t_1), v_1) \ge 0$,
(iii)
if $\dot g(x(t_1), v_1) < 0$ and $\Delta H(t_2, v_2) < 0$, then $\dot g(x(t_2), v_2) > 0$ and
$$\frac{\Delta H(t_1, v_1)}{\dot g(x(t_1), v_1)} \le \frac{\Delta H(t_2, v_2)}{\dot g(x(t_2), v_2)}. \tag{20}$$
Proof. 
Let first $t_1 < t_2$. Suppose, contrary to (i), that $\dot g(x(t_1), v_1) < 0$, $\Delta H(t_1, v_1) = 0$ and $\Delta H(t_2, v_2) < 0$. From Corollary 1, $\dot g(x(t_2), v_2) \ge 0$. By Lemma 4, this contradicts the assumption that u is optimal. The implication (ii) is proved similarly. Let $\dot g(x(t_2), v_2) = 0$, $\Delta H(t_2, v_2) < 0$ and $\dot g(x(t_1), v_1) < 0$. By Corollary 1, $\Delta H(t_1, v_1) \ge 0$ and the assumptions of Lemma 4 are fulfilled (with $\tau_1 := t_1$ and $\tau_2 := t_2$). To prove (iii), assume that $\dot g(x(t_1), v_1) < 0$ and $\Delta H(t_2, v_2) < 0$. It follows from Corollary 1 that $\Delta H(t_1, v_1) \ge 0$ and $\dot g(x(t_2), v_2) \ge 0$. If $\dot g(x(t_2), v_2) = 0$, the assumptions of Lemma 4 hold. If $\dot g(x(t_2), v_2) > 0$, the inequality opposite to (19), that is (20), is true.
Let now $t_1 = t_2 = \tau_1$. The proof goes similarly; however, we additionally have to use a simple observation (rc): the functions $t \mapsto \dot g(x(t), v)$ and $t \mapsto \Delta H(t, v)$ are right-continuous in $[0,T[$ for every $v \in U$. Let $\dot g(x(\tau_1), v_1) < 0$, $\Delta H(\tau_1, v_1) = 0$ and $\Delta H(\tau_1, v_2) < 0$. By virtue of (rc) and Corollary 1, there is a $\tau_2 > \tau_1$ such that $\Delta H(\tau_2, v_2) < 0$ and $\dot g(x(\tau_2), v_2) \ge 0$. Lemma 4 then gives a contradiction. To prove (ii), assume $\dot g(x(\tau_1), v_2) = 0$, $\Delta H(\tau_1, v_2) < 0$ and $\dot g(x(\tau_1), v_1) < 0$. By (rc) and Corollary 1, $\Delta H(\tau_1, v_1) \ge 0$ and there is a $\hat\tau \in \theta$, $\hat\tau > \tau_1$, such that $\Delta H(\tau_2, v_2) < 0$ and $\dot g(x(\tau_2), v_2) \ge 0$ for every $\tau_2 \in\, ]\tau_1, \hat\tau]$. If $\dot g(x(\tau_2), v_2) = 0$ for some $\tau_2 \in\, ]\tau_1, \hat\tau]$, Lemma 4 again yields a contradiction. If $\dot g(x(\tau_2), v_2) > 0$ for every $\tau_2 \in\, ]\tau_1, \hat\tau]$, then (19) holds for every $\tau_2 > \tau_1$ sufficiently close to $\tau_1$, since $\dot g(x(\tau_2), v_2) \to 0$ as $\tau_2 \to \tau_1^+$, and so Lemma 4 gives a contradiction. We shall now prove (iii). Let $\dot g(x(\tau_1), v_1) < 0$ and $\Delta H(\tau_1, v_2) < 0$. It follows from (rc) and Corollary 1 that $\Delta H(\tau_1, v_1) \ge 0$ and there is a $\hat\tau \in \theta$, $\hat\tau > \tau_1$, such that $\Delta H(\tau_2, v_2) < 0$ and $\dot g(x(\tau_2), v_2) \ge 0$ for every $\tau_2 \in\, ]\tau_1, \hat\tau]$. If $\dot g(x(\tau_2), v_2) = 0$ for some $\tau_2 \in\, ]\tau_1, \hat\tau]$, a contradiction follows from Lemma 4. Similarly, Lemma 4 gives a contradiction if for some $\tau_2 \in\, ]\tau_1, \hat\tau]$ the relationships $\dot g(x(\tau_2), v_2) > 0$ and (19) are fulfilled. In consequence, the inequalities $\dot g(x(\tau_2), v_2) > 0$ and (20) with $t_1 := \tau_1$ and $t_2 := \tau_2$ hold true for every $\tau_2 \in\, ]\tau_1, \hat\tau]$. By (rc), $\Delta H(\tau_2, v_2) \to \Delta H(\tau_1, v_2) < 0$ and $\dot g(x(\tau_2), v_2) \to \dot g(x(\tau_1), v_2) \ge 0$ as $\tau_2 \to \tau_1^+$.
If $\dot g(x(\tau_1), v_2) = 0$, then (19) is true for all $\tau_2 > \tau_1$ sufficiently close to $\tau_1$. We have thus come to a contradiction. Hence $\dot g(x(\tau_1), v_2) > 0$, and the inequality (20) holds by virtue of (rc). □

5. A Geometrical Interpretation and a Minimum Condition

Let u be a verifiable control with a boundary interval θ. The corresponding state and adjoint trajectories are denoted by x and ψ, respectively. Define a family of sets
$$C_t = \{\,y \in \mathbb{R}^2 : y_1 = \dot g(x(t), v),\ y_2 = \Delta H(t, v),\ v \in U\,\}, \qquad t \in \theta.$$
It readily follows from this definition that $0 \in C_t$ for every $t \in \theta$. In the sequel we implicitly assume that $|C_t| > 1$.
We shall now characterize the properties of the sets $C_t$ which result from control optimality. For an arbitrary nonzero vector $y \in \mathbb{R}^2$, define $\arg y$ as the angle between $\mathrm{col}(1, 0)$ and y, measured anticlockwise and taking values in the interval $]-\pi, \pi]$. Let also
$$\phi_{\min}(t) = \inf\{\arg y : y \in C_t \setminus \{0\}\}, \qquad \phi_{\max}(t) = \sup\{\arg y : y \in C_t \setminus \{0\}\}$$
for every $t \in \theta$. Corollary 1 says that if u is optimal, then $C_t$ has no points in quadrant III of the coordinate system $y_1 y_2$. The following theorem is a straightforward consequence of that corollary and of Theorem 2.
Theorem 3.
Assume that the control u is optimal. Then
(i)
$-\tfrac{1}{2}\pi \le \phi_{\min}(t) \le \phi_{\max}(t) \le \pi$ for every $t \in \theta$,
(ii)
$\phi_{\max}(t_1) - \phi_{\min}(t_2) \le \pi$ for every pair $t_1, t_2 \in \theta$ such that $t_1 \le t_2$.
From this it easily follows that if the control u is optimal and $\phi_{\max}(t) = \phi_{\min}(t) + \pi$ for every $t \in \theta$, then $\tfrac{1}{2}\pi \le \phi_{\max}(t) \le \pi$ for every $t \in \theta$, and the function $\phi_{\max}$ is nondecreasing in θ.
It proves useful to describe the consequences of control optimality in terms of the straight lines supporting the sets $C_t$ at zero. This allows an easier verification of the necessary conditions of optimality, as well as expressing a partial optimality criterion as a minimum condition imposed on the extended pre-Hamiltonian. We say that a straight line is a supporting line of $C_t$ at the origin (SLO) if it is given by $\rho^{\mathrm T} y = 0$ with $\rho, y \in \mathbb{R}^2$, $\rho \ne 0$, and $\rho^{\mathrm T} y \ge 0$ for every $y \in C_t$. Generally, the set $C_t$ may have many SLOs, whether the control is optimal or not. If u is optimal, then every set $C_t$, $t \in \theta$, has an SLO with $\rho \ge 0$. The set $C_t$ has a unique SLO if and only if $\phi_{\max}(t) = \phi_{\min}(t) + \pi$.
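In computations, the quantities $\phi_{\min}$, $\phi_{\max}$ and the quadrant-III test of Corollary 1 are conveniently evaluated on a sampled set $C_t$. The sketch below is our illustration, assuming a finite control set U (as in Example 1 below), with the points of $C_t$ supplied as arrays of $\dot g$ and ΔH values:

```python
import numpy as np

def C_t_points(gdot_vals, dH_vals):
    # pairs (y1, y2) = (gdot(x(t), v), DeltaH(t, v)) over v in U, origin removed
    return [(a, b) for a, b in zip(gdot_vals, dH_vals) if (a, b) != (0.0, 0.0)]

def phi_min_max(gdot_vals, dH_vals):
    # arg y measured anticlockwise from col(1, 0), values in ]-pi, pi]
    args = [np.arctan2(b, a) for a, b in C_t_points(gdot_vals, dH_vals)]
    return min(args), max(args)

def violates_corollary_1(gdot_vals, dH_vals):
    # optimality excludes quadrant III: gdot < 0 together with DeltaH < 0
    return any(a < 0 and b < 0 for a, b in C_t_points(gdot_vals, dH_vals))
```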
The equality $\phi_{\max}(t) = \phi_{\min}(t) + \pi$ occurs in two practically important situations (mutually nonexclusive). One of them, in which the right-hand side of the system Equation (1) is affine in control, will be discussed in Section 7. Here we consider the other situation, in which $C_t$ has a tangent at the origin. A sufficient condition for that reads
$$u(t) \in \operatorname{int} U \qquad \text{and} \qquad |\partial_2 \dot g(x(t), u(t))| + |\partial_2 \Delta H(t, u(t))| > 0. \tag{21}$$
Under this condition, the tangent has the equation $\rho^{\mathrm T} y = 0$ with
$$\rho_1 = -\partial_2 \Delta H(t, u(t)), \qquad \rho_2 = \partial_2 \dot g(x(t), u(t)).$$
Of course, if $C_t$ has both an SLO and a tangent at the origin, they coincide.
Suppose u is optimal and $\phi_{\max}(t) = \phi_{\min}(t) + \pi$ for every $t \in \theta$. Then every set $C_t$, $t \in \theta$, has a unique SLO. The SLO is vertical if $\phi_{\max}(t) = \tfrac{1}{2}\pi$; if $\phi_{\max}(t) > \tfrac{1}{2}\pi$, the SLO equation may be written as $y_2 = p(t)\, y_1$ with a nonpositive directional coefficient $p(t) = \tan \phi_{\max}(t)$. If, additionally, the condition (21) is fulfilled with $\partial_2 \dot g(x(t), u(t)) \ne 0$, then
$$p(t) = \lim_{\substack{v \to u(t) \\ v \ne u(t)}} \frac{\Delta H(t, v)}{\dot g(x(t), v)} = \frac{\partial_2 \Delta H(t, u(t))}{\partial_2 \dot g(x(t), u(t))} = \frac{\partial_3 H(\psi(t), x(t), u(t))}{\partial_2 \dot g(x(t), u(t))}. \tag{22}$$
The function p thus defined is nondecreasing in all that part of θ where it is determined.
Define the extended pre-Hamiltonian $\hat H(\psi, p, x, v) = \psi^{\mathrm T} f(x, v) - p\, \dot g(x, v)$. If the control u is optimal and the function p is determined as above in all of the interval θ, then the following minimum condition is straightforward from the properties of the SLO:
$$\hat H(\psi(t), p(t), x(t), u(t)) \le \hat H(\psi(t), p(t), x(t), v) \qquad \forall v \in U, \ \forall t \in \theta. \tag{23}$$
This necessary optimality condition is similar to the minimum condition of indirect adjoining. We postpone a discussion of relations with the classical results to Section 9.

6. Example 1

In this example we apply the above necessary conditions to verify the optimality of two controls, the first of which is optimal while the second is not. We show that the nonoptimality is easily detected. The control system is described by the state equations
$$\dot x_1 = 2 - \tfrac{1}{2} x_1 + (x_1 - 2)\, x_2 + a u (b - u), \qquad \dot x_2 = u,$$
with the initial conditions $x_1(0) = x_2(0) = 1$. The set of admissible control values consists of three elements, $U = \{-1, 0, 1\}$. The state is subject to the pathwise constraint $g(x) = x_2 - 1 \le 0$. The cost to be minimized is given by $Q(u) = \tfrac{1}{2}(x_1(T) - 6)^2 + \tfrac{1}{2}\, x_2(T)^2$. We take $a = 0.125$, $b = 1.025$, $T = 4$.
Let u be a verifiable control, and x and ψ, respectively, the corresponding state trajectory and adjoint function. Let us write the pre-Hamiltonian
$$H(\psi, x, u) = \psi_1 \big(2 - \tfrac{1}{2} x_1 + (x_1 - 2)\, x_2 + a u (b - u)\big) + \psi_2\, u$$
and the adjoint equations in the nonboundary intervals of time:
$$\dot\psi_1 = (\tfrac{1}{2} - x_2)\, \psi_1, \qquad \dot\psi_2 = (2 - x_1)\, \psi_1.$$
The adjoints satisfy the final conditions $\psi_1(T) = x_1(T) - 6$, $\psi_2(T) = x_2(T)$. As $\dot g(x, u) = u$, we have $w(x) = 0$. In consequence, the state and adjoint equations in the boundary intervals take the form
$$\dot x_1 = \tfrac{1}{2} x_1, \qquad \dot x_2 = 0,$$
$$\dot\psi_1 = -\tfrac{1}{2} \psi_1, \qquad \dot\psi_2 = (2 - x_1)\, \psi_1.$$
To determine the behavior of the adjoint function at entry points, we calculate the matrix (9):
$$Z(t_{\mathrm{en}}) = \begin{bmatrix} 1 & 0 \\ a\,(u(t_{\mathrm{en}}^-) - b) & 0 \end{bmatrix}.$$
Hence, by (13), $\psi_1(t_{\mathrm{en}}^-) = \psi_1(t_{\mathrm{en}}^+)$ and $\psi_2(t_{\mathrm{en}}^-) = a\,(u(t_{\mathrm{en}}^-) - b)\, \psi_1(t_{\mathrm{en}}^+)$. In accordance with (14), $\Delta H(t, v) = (a b\, \psi_1(t) + \psi_2(t))\, v - a\, \psi_1(t)\, v^2$ in every boundary interval.
It is evident that the nontangentiality conditions (4) are fulfilled at all entry and exit points. The optimality of u should be verified with Theorem 1(i) in the whole interval [0, T], and additionally with Theorem 2 or 3 in the boundary intervals. Every set $C_t$ introduced in Section 5 consists of three points, $C_t = \{y(t), 0, z(t)\}$, where $y_1(t) = -1$, $y_2(t) = \Delta H(t, -1)$, $z_1(t) = +1$, and $z_2(t) = \Delta H(t, +1)$. If the control u is optimal, then it follows from part (i) of Theorem 3 that $y_2(t) \ge 0$ for every $t \in \theta$, and from part (ii) that $-y_2(t_1) \le z_2(t_2)$ for every pair $t_1, t_2 \in \theta$ such that $t_1 \le t_2$. By Lemmas 2 and 4, u is nonoptimal if $y_2(t) < 0$ for some $t \in \theta$, or if $-y_2(t_1) > z_2(t_2)$ for some $t_1, t_2 \in \theta$, $t_1 \le t_2$.
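These two tests are easily mechanized. The sketch below is our illustration, assuming that the adjoints $\psi_1, \psi_2$ have already been sampled on θ (by backward integration of the adjoint equations with the jump conditions above):

```python
import numpy as np

a, b = 0.125, 1.025

def boundary_tests(psi1, psi2):
    # DeltaH(t, v) = (a*b*psi1 + psi2) v - a*psi1 v^2 on the boundary interval
    dH = lambda v: (a * b * psi1 + psi2) * v - a * psi1 * v**2
    y2, z2 = dH(-1.0), dH(1.0)
    ok_i = bool(np.all(y2 >= 0.0))                  # Theorem 3(i): y2 >= 0
    suffix_min = np.minimum.accumulate(z2[::-1])[::-1]
    ok_ii = bool(np.all(-y2 <= suffix_min))         # Theorem 3(ii): -y2(t1) <= z2(t2), t1 <= t2
    return ok_i, ok_ii
```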
Example 1a.
A numerically computed approximation of the optimal control and optimal state trajectory is presented in Figure 1. The control has discontinuities at $s_1 = 0.49483839$, $s_2 = 0.99973751$, $s_3 = 1.4945759$ and $s_4 = 3.1366809$. Figure 2 shows the corresponding adjoint trajectory. Let us verify the necessary conditions of optimality. Figure 3 shows that the condition of Theorem 1(i) is fulfilled. In the boundary interval $\theta = [s_3, s_4[$ we additionally have to verify the conditions of Theorem 2 or 3. It can be seen in Figure 3 that $\Delta H(t, -1) > 0$ for all $t \in \theta$. Thus, implications (i) and (ii) of Theorem 2 are vacuously true for all $v_1, v_2, t_1, t_2$ satisfying the assumptions of the theorem, and so is (iii) except in the case where $v_1 = -1$, $v_2 = +1$ and $\Delta H(t_2, +1) < 0$. In that case, $\dot g(x(t_2), v_2) = 1$ and (20) reads $-\Delta H(t_1, -1) \le \Delta H(t_2, +1)$. As can be checked by inspection, (20) holds for all $t_1, t_2 \in \theta$, $t_1 \le t_2$, and so (iii) is true. Alternatively and equivalently, we can use Theorem 3. We see in Figure 4 that $-\tfrac{1}{2}\pi < \phi_{\min}(t) < \phi_{\max}(t)$ and $\phi_{\max}(t) - \pi < 0$ for every $t \in \theta$, so conclusion (i) of Theorem 3 is true. It can also be seen that $\phi_{\max}(t_1) - \pi < \phi_{\min}(t_2)$ for every $t_1 \in \theta$ and $t_2 \in [t_1, s_4[$, and so conclusion (ii) holds too. Thus, the necessary optimality conditions of Theorems 1, 2 and 3 are satisfied.
Example 1b.
Consider the nonoptimal control
$$u(t) = \begin{cases} 0, & 0 \le t < s_1,\\ -1, & s_1 \le t \le T, \end{cases}$$
where $s_1 = 3.2429037$. It is plotted in Figure 5 together with the corresponding state trajectory. The adjoint function is depicted in Figure 6. As follows from Figure 7, the necessary optimality condition of Theorem 1(i) is satisfied; in consequence, there are no one-spike variations described in Section 2 which guarantee an improvement of the cost. Let us now check the conditions of Theorem 3 in the boundary interval $\theta = [0, s_1[$. To this end we define $\underline\phi_{\min}(t) = \inf\{\phi_{\min}(s) : t < s < s_1\}$ and $\overline\phi_{\max}(t) = \sup\{\phi_{\max}(s) : 0 \le s < t\}$ for $t \in \theta$. The inequalities $\phi_{\min}(t) \le \phi_{\max}(t) \le \pi$ in conclusion (i) of Theorem 3 directly follow from the definitions. Figure 8 shows that $\underline\phi_{\min}(t) > -\tfrac{1}{2}\pi$, and the more so $\phi_{\min}(t) > -\tfrac{1}{2}\pi$, for every $t \in \theta$. Let us rewrite conclusion (ii) of the theorem in the equivalent form $\overline\phi_{\max}(t) - \pi \le \underline\phi_{\min}(t)$ for all $t \in \theta$. Figure 8 shows that this inequality holds only in $[t^*, s_1[$, with $t^* > 0$. This proves that the control u is not optimal and that there are two-spike variations in the boundary interval (defined in Section 4) which yield a cost reduction for any sufficiently small positive value of the parameter ε. A closer analysis of the conditions of Lemma 4 shows that the difference $\tau_2 - \tau_1$, that is, the distance between the spikes in such a variation, cannot be arbitrarily small.

7. The Control Affine Case

Consider the system (1) with the function f affine in control, $f(x, u) = a(x) + b(x)\, u$. Many of the results obtained so far may then be significantly simplified, or even strengthened. In this section, u stands for a certain verifiable control, x for the corresponding state trajectory, and ψ for the corresponding adjoint. We also denote
$$\alpha_1(t) = \nabla g(x(t))^{\mathrm T}\, b(x(t)), \qquad \alpha_2(t) = \psi(t)^{\mathrm T}\, b(x(t)).$$
In consequence we have $\Delta H(t, v) = \alpha_2(t)\,(v - u(t))$, where $t \in [0,T]$ and $v \in U$. The formula (3) in every boundary interval θ of u may be written as $\dot g(x(t), v) = \alpha_1(t)\,(v - u(t))$, for $t \in \theta$ and $v \in U$. The equality (9) simplifies to
$$Z(t_{\mathrm{en}}) = I - \frac{\nabla g(x(t_{\mathrm{en}}))\, b(x(t_{\mathrm{en}}))^{\mathrm T}}{\alpha_1(t_{\mathrm{en}})}. \tag{24}$$
It follows from (13) that the left-hand limit of the switching function $\alpha_2$ equals zero at every entry point $t_{\mathrm{en}}$: $\alpha_2(t_{\mathrm{en}}^-) = 0$. If the control u is optimal, then $\alpha_2$ also vanishes at every exit point $t_{\mathrm{ex}}$: $\alpha_2(t_{\mathrm{ex}}) = 0$. Indeed, $\alpha_2(t_{\mathrm{ex}})\, u(t_{\mathrm{ex}}^-) = \alpha_2(t_{\mathrm{ex}})\, u(t_{\mathrm{ex}})$ by Theorem 1(ii), and $u(t_{\mathrm{ex}}^-) \ne u(t_{\mathrm{ex}})$ by (4).
We shall now formulate the results of Section 3 for the control affine case, beginning with Lemma 2.
Lemma 5.
Assume that $\tau \in [0,T[$, $v \in U_\tau$, and $\alpha_2(\tau)(v - u(\tau)) < 0$. Then for every sufficiently small ε > 0, the control $u_\varepsilon$ defined in Section 2 is admissible and $Q(u_\varepsilon) < Q(u)$.
Theorem 1(i) takes the following form.
Theorem 4.
Assume that the control u is optimal. Then $\alpha_2(t)(v - u(t)) \ge 0$ for every $t \in [0,T]$ and every $v \in U_t$.
Corollary 2.
Assume that u is optimal and $t \in [0,T]$. Then the following implications hold:
(i)
if ($g(x(t)) < 0$ and $\alpha_2(t) > 0$) or ($t \in \Theta_u$, $\alpha_1(t) > 0$ and $\alpha_2(t) > 0$), then min U exists and $u(t) = \min U$,
(ii)
if ($g(x(t)) < 0$ and $\alpha_2(t) < 0$) or ($t \in \Theta_u$, $\alpha_1(t) < 0$ and $\alpha_2(t) < 0$), then max U exists and $u(t) = \max U$.
From here until the end of this section, u is a verifiable control with a boundary interval θ. Let us pass to the results of Section 4. For all $t \in [0,T]$ such that $\alpha_1(t) \ne 0$, define
$$p(t) = \frac{\alpha_2(t)}{\alpha_1(t)}. \tag{25}$$
Note that this is an extension of the function p given by (22). The following lemma is an immediate consequence of Lemma 4.
Lemma 6.
Assume that $\tau_1, \tau_2 \in \theta$, $\tau_1 < \tau_2$, $v_1, v_2 \in U$, and
$$\alpha_1(\tau_1)(v_1 - u(\tau_1)) < 0, \qquad \alpha_2(\tau_1)(v_1 - u(\tau_1)) \ge 0,$$
$$\alpha_1(\tau_2)(v_2 - u(\tau_2)) \ge 0, \qquad \alpha_2(\tau_2)(v_2 - u(\tau_2)) < 0.$$
Let also $p(\tau_1) > p(\tau_2)$ if $\alpha_1(\tau_2) \ne 0$. Then there is an η > 0 such that for every sufficiently small ε > 0 the control $u_\varepsilon$ defined in Section 4 is admissible and $Q(u_\varepsilon) < Q(u)$.
The question arises how to choose η in the construction of $u_\varepsilon$ under the assumptions of Lemma 6. It follows from the proof of Lemma 4 that if $\alpha_1(\tau_2) = 0$, then η may be any number greater than $\eta_1$, whereas if $\alpha_1(\tau_2) \ne 0$, then $\eta_1 < \eta < \eta_2$, with
$$\eta_1 = -\frac{\alpha_2(\tau_1)(v_1 - u(\tau_1))}{\alpha_2(\tau_2)(v_2 - u(\tau_2))}, \qquad \eta_2 = -\frac{\alpha_1(\tau_1)(v_1 - u(\tau_1))}{\alpha_1(\tau_2)(v_2 - u(\tau_2))}. \tag{26}$$
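For a numerical implementation, (26) translates directly into a bracket for η. The helper below is our illustration; the arguments are the sampled values of $\alpha_1$, $\alpha_2$, $u$ at $\tau_1$, $\tau_2$ and the spike values $v_1$, $v_2$, assumed to satisfy the hypotheses of Lemma 6.

```python
def eta_interval(a1_1, a2_1, u1, v1, a1_2, a2_2, u2, v2):
    # a1_i, a2_i: alpha_1, alpha_2 at tau_i; u_i = u(tau_i); v_i: spike values
    eta1 = -a2_1 * (v1 - u1) / (a2_2 * (v2 - u2))    # lower bound, from (17)
    if a1_2 * (v2 - u2) == 0.0:
        return eta1, float("inf")                    # any eta > eta1 admissible
    eta2 = -a1_1 * (v1 - u1) / (a1_2 * (v2 - u2))    # upper bound, from (16)
    return eta1, eta2                                # choose eta1 < eta < eta2
```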
If the control u has nonextremal values in the boundary interval θ , a simple consequence follows from Lemma 6.
Corollary 3.
Assume that $v, \hat v \in U$ exist with $v < u(t) < \hat v$ and $\alpha_1(t)\, \alpha_2(t) < 0$ for every $t \in \theta$. Let also the function p be strictly decreasing in θ. Then there exist $\tau_1, \tau_2 \in \theta$, $v_1, v_2 \in U$ and η > 0 such that the control $u_\varepsilon$ defined in Section 4 is admissible and $Q(u_\varepsilon) < Q(u)$ for every sufficiently small ε > 0.
The following theorem is a straightforward consequence of Theorem 2.
Theorem 5.
Assume that the control u is optimal, $t_1, t_2 \in \theta$, $t_1 \le t_2$, and $v_1, v_2 \in U$. Under these assumptions:
(i)
if $\alpha_1(t_1)(v_1 - u(t_1)) < 0$ and $\alpha_2(t_1) = 0$, then $\alpha_2(t_2)(v_2 - u(t_2)) \ge 0$,
(ii)
if $\alpha_1(t_2) = 0$ and $\alpha_2(t_2)(v_2 - u(t_2)) < 0$, then $\alpha_1(t_1)(v_1 - u(t_1)) \ge 0$,
(iii)
if $\alpha_1(t_1)(v_1 - u(t_1)) < 0$ and $\alpha_2(t_2)(v_2 - u(t_2)) < 0$, then $\alpha_1(t_2) \ne 0$ and $p(t_1) \le p(t_2) < 0$.
Corollary 4.
Assume that u is optimal, $v_1, v_2 \in U$, and $\alpha_1(t)(v_1 - u(t)) < 0$ and $\alpha_2(t)(v_2 - u(t)) < 0$ for every $t \in \theta$. Then the function p is negative and nondecreasing in θ.
The analysis of Section 5 applied to the control affine case leads to the following conclusions. Every set $C_t$ is included in a certain straight line in $\mathbb{R}^2$ passing through the origin. If $\alpha_1(t)^2 + \alpha_2(t)^2 > 0$, this line has the parametric equations $y_1(s) = \alpha_1(t)\, s$, $y_2(s) = \alpha_2(t)\, s$, $s \in \mathbb{R}$. If $|C_t| = 2$, then $\phi_{\max}(t) = \phi_{\min}(t)$, and if $|C_t| > 2$, then either $\phi_{\max}(t) = \phi_{\min}(t) + \pi$ or $\phi_{\max}(t) = \phi_{\min}(t)$. Theorem 3 remains unchanged.
Suppose that the control u is optimal. We then have by Theorem 3 that for every $t \in \theta$ the set $C_t$ has an SLO given by $\rho^{\mathrm T} y = 0$ with $\rho \ge 0$. If $\alpha_1(t)^2 + \alpha_2(t)^2 > 0$, we can put $\rho_1 = |\alpha_2(t)|$ and $\rho_2 = |\alpha_1(t)|$. Let us now assume that for every $t \in \theta$ there are $v, \hat v \in U$ such that $v < u(t) < \hat v$, and in consequence $\phi_{\max}(t) = \phi_{\min}(t) + \pi$. Let also $\tfrac{1}{2}\pi < \phi_{\max}(t) < \pi$. It then follows from the reasoning in Section 5 that the equality $y_2 = p(t)\, y_1$ holds for every $t \in \theta$ and every $y \in C_t$, with the function p (25) negative and nondecreasing in the interval θ.
Finally, note that the minimum condition on the extended pre-Hamiltonian (23) is trivially satisfied with equality (independently of whether the control u is optimal or not).

8. Example 2: The Pendulum on a Cart

We shall now consider a problem with the right-hand side of the state Equation (1) affine in control. The system is described by the state equations
$$\dot x_1 = x_3, \qquad \dot x_2 = x_4,$$
$$\dot x_3 = f_3(x, u) = \frac{u - x_4^2 \sin x_2 + \sin x_2 \cos x_2}{1 + \sin^2 x_2}, \qquad \dot x_4 = f_4(x, u) = \frac{(u - x_4^2 \sin x_2)\cos x_2 + 2 \sin x_2}{1 + \sin^2 x_2}. \tag{27}$$
The initial state $x(0) = x_0$ and the time horizon T are fixed. The performance index
$$Q(u) = \tfrac{1}{2}\, x(T)^{\mathrm T} x(T)$$
is minimized subject to control bounds and a pathwise state constraint,
$$u_{\min} \le u(t) \le u_{\max}, \qquad g(x(t)) = x_3(t) - x_{3\max} \le 0, \qquad t \in [0,T].$$
We write the pre-Hamiltonian
$$H = \psi_1 x_3 + \psi_2 x_4 + \psi_3 f_3 + \psi_4 f_4$$
and the adjoint equations in the nonboundary time intervals,
$$\dot\psi_1 = 0, \qquad \dot\psi_2 = -\psi_3\, \frac{\partial f_3}{\partial x_2} - \psi_4\, \frac{\partial f_4}{\partial x_2}, \qquad \dot\psi_3 = -\psi_1, \qquad \dot\psi_4 = -\psi_2 + \frac{2 x_4 \sin x_2\, (\psi_3 + \psi_4 \cos x_2)}{1 + \sin^2 x_2},$$
where
$$\frac{\partial f_3}{\partial x_2} = \frac{\cos 2x_2 - x_4^2 \cos x_2 - f_3 \sin 2x_2}{1 + \sin^2 x_2},$$
$$\frac{\partial f_4}{\partial x_2} = \frac{2 \cos x_2 - u \sin x_2 - x_4^2 \cos 2x_2 - f_4 \sin 2x_2}{1 + \sin^2 x_2}.$$
The adjoint function satisfies the final condition $\psi(T) = x(T)$. The state equations in the boundary intervals are obtained by the substitution $u = w(x) = (x_4^2 - \cos x_2) \sin x_2$ in (27), which gives
$$\dot x_1 = x_3, \qquad \dot x_2 = x_4, \qquad \dot x_3 = 0, \qquad \dot x_4 = \sin x_2.$$
Hence the pre-Hamiltonian and the adjoint equations in the boundary intervals read
$$H = \psi_1 x_3 + \psi_2 x_4 + \psi_4 \sin x_2,$$
$$\dot\psi_1 = 0, \qquad \dot\psi_2 = -\psi_4 \cos x_2, \qquad \dot\psi_3 = -\psi_1, \qquad \dot\psi_4 = -\psi_2.$$
At every entry point $t_{\mathrm{en}}$ the jump condition (13) is valid with the matrix (24), whence $\psi_i(t_{\mathrm{en}}^-) = \psi_i(t_{\mathrm{en}}^+)$, $i = 1, 2, 4$, and $\psi_3(t_{\mathrm{en}}^-) = -\cos x_2(t_{\mathrm{en}})\, \psi_4(t_{\mathrm{en}}^+)$. We further compute
$$\alpha_1(t) = \big(1 + \sin^2 x_2(t)\big)^{-1},$$
$$\alpha_2(t) = \big(\psi_3(t) + \psi_4(t) \cos x_2(t)\big)\,\big(1 + \sin^2 x_2(t)\big)^{-1},$$
and from (25), $p(t) = \psi_3(t) + \psi_4(t) \cos x_2(t)$ for every $t \in [0,T]$. Note that $\alpha_1(t)$ is always positive, and that $p(t)$ and $\alpha_2(t)$ have the same sign.
Assume that the control u is optimal. It follows from Corollary 2 that for every t in any nonboundary interval of u,
$$u(t) = \begin{cases} u_{\min}, & \alpha_2(t) > 0,\\ u_{\max}, & \alpha_2(t) < 0. \end{cases}$$
Let now θ be a boundary interval of u. We infer from Corollary 2 that if $t \in \theta$, then $u(t) = u_{\min}$ or $\alpha_2(t) \le 0$. If these relations are not satisfied at some $t \in \theta$, then the control u can be improved in accordance with Lemma 5, by means of a spike control variation described in Section 2. We deduce from Theorem 5 that if $t_1, t_2 \in \theta$, $t_1 \le t_2$, and $u(t_1) > u_{\min}$, then
(i)
$u(t_2) = u_{\max}$ if $\alpha_2(t_1) = 0$ and $\alpha_2(t_2) < 0$,
(ii)
$u(t_2) = u_{\min}$ if $\alpha_2(t_1) = 0$ and $\alpha_2(t_2) > 0$,
(iii)
$p(t_1) \le p(t_2) < 0$ if $\alpha_2(t_2) \ne 0$ and $u_{\min} < u(t_2) < u_{\max}$.
If some of the necessary conditions of Theorem 5 or Corollary 4 are not fulfilled, then, as follows from Lemma 6, the control u can be improved with the use of a two-spike control variation (Section 4).
We shall now numerically analyze two cases, taking $x_0 = \mathrm{col}(0.4, 3.5, 1, 1.1)$, $u_{\min} = -4$, $u_{\max} = 4$, $T = 1.5$, and $x_{3\max} = 1$.
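A simulation sketch of this setup follows (our illustration; forward Euler with the data as printed above, and with the switching structure of Example 2a below supplied as an example control law):

```python
import numpy as np

def f(x, u):
    x1, x2, x3, x4 = x
    d = 1.0 + np.sin(x2) ** 2
    f3 = (u - x4**2 * np.sin(x2) + np.sin(x2) * np.cos(x2)) / d
    f4 = ((u - x4**2 * np.sin(x2)) * np.cos(x2) + 2.0 * np.sin(x2)) / d
    return np.array([x3, x4, f3, f4])

def w(x):
    # boundary-holding feedback: makes x3' = 0 on g(x) = 0
    return (x[3] ** 2 - np.cos(x[1])) * np.sin(x[1])

def simulate(u_of_tx, x0, T=1.5, n=15_000):
    dt, x = T / n, np.array(x0, dtype=float)
    traj = [x.copy()]
    for k in range(n):
        x = x + dt * f(x, u_of_tx(k * dt, x))
        traj.append(x.copy())
    return np.array(traj)

# control structure of Example 2a (u_min = -4, u_max = 4)
s1, s2, s3 = 0.20359164, 0.36709680, 1.1925492
def u_2a(t, x):
    if t < s1: return -4.0
    if t < s2: return 4.0
    if t < s3: return w(x)
    return -4.0

# e.g. simulate(u_2a, [0.4, 3.5, 1.0, 1.1])   # x0 as printed above
```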
Example 2a.
The optimal control in the considered problem is of the form
$$u(t) = \begin{cases} u_{\min}, & 0 \le t < s_1,\\ u_{\max}, & s_1 \le t < s_2,\\ w(x(t)), & s_2 \le t < s_3,\\ u_{\min}, & s_3 \le t \le T, \end{cases}$$
where $s_1 = 0.20359164$, $s_2 = 0.36709680$, $s_3 = 1.1925492$, with $Q(u) = 2.6850568$. Figure 9 shows the control u, the switching function $\alpha_2$, and the function p (25). It is easy to see that the necessary optimality conditions of Theorems 4 and 5 are fulfilled in the whole interval [0, T]. The optimal state trajectory is depicted in Figure 10. Notice the cusps of $x_3$ at $t_{\mathrm{en}} = s_2$ and $t_{\mathrm{ex}} = s_3$, indicating that u satisfies the nontangentiality conditions and is verifiable.
Example 2b.
Consider a verifiable but nonoptimal control with a boundary interval,
$$u(t) = \begin{cases} w(x(t)), & 0 \le t < s_1,\\ u_{\min}, & s_1 \le t \le T, \end{cases} \qquad s_1 = 1.1825443.$$
The corresponding value of the cost is $Q(u) = 2.7408438$. Figure 11 presents the control u, the switching function $\alpha_2$, and the function p (25). It can be seen that the necessary conditions of Theorem 4 and Corollary 2 are satisfied, which means that there are no one-spike control variations described in Section 2 guaranteeing an improvement of the cost. Figure 12 shows the state trajectory. The plot of $x_3$ has a cusp at the exit point $t_{\mathrm{ex}} = s_1$, and so the control u is verifiable. We can also see in Figure 11 that in the time interval [0, 0.2948] the function p is decreasing; hence it is possible to construct a two-spike control variation in that interval (according to Section 4) which reduces the cost. In order to verify this numerically, consider also Figure 13, which presents a contour plot of the difference $p(\tau_2) - p(\tau_1)$. Let, for instance, $\tau_1 = 0$, $\tau_2 = 0.2948$, $v_1 = -4$, $v_2 = 4$ (red cross on the left y-axis). By (26), the parameter η may take an arbitrary value from the interval ]0.5410, 0.7134[; we choose $\eta = 0.713$. Figure 14 demonstrates the dependence of the cost increment on the width ε of the first spike. The greatest improvement takes place at $\varepsilon \approx 0.073$. For $\varepsilon = 0.073$, Figure 15 shows the state trajectory, and Figure 16 an enlargement of the plot of $x_{3\varepsilon}$.

9. Connections with Some Classical Results

There are essential connections between some of our results presented in Section 5 and Section 7 and certain classical results obtained by the so-called indirect adjoining method, dating back to the works of R.V. Gamkrelidze, A.E. Bryson, H. Maurer, D.H. Jacobson, and many others (see [1,3,4,5]). As we have no space to discuss all the similarities and analogies that can be found in the vast literature, we shall concentrate on one representative theorem due to H. Maurer [3]. We shall use a reduced version of that theorem, specialized to the case of a state constraint of order one, a verifiable control, fixed initial state and free final state.
Consider the optimal control problem formulated in Section 1, with the additional assumption that U is a closed interval with nonempty interior. Define
$$H^1(x, u, \lambda^1, \eta^1) = (\lambda^1)^{\mathrm T} f(x, u) + \eta^1\, \dot g(x, u), \qquad \lambda^1 \in \mathbb{R}^n, \ \eta^1 \in \mathbb{R}.$$
Theorem 6 ([3], Theorem 5.1).
Let u be a verifiable optimal control and x the corresponding state trajectory. Suppose that f and g are of class $C^2$, and let $\partial_2 \dot g(x(t), u(t)) \ne 0$ and $u(t) \in \operatorname{int} U$ for every t in any boundary interval. Additionally, assume that there are finitely many entry points. Then there exist a number $\lambda_0 \ge 0$ and functions $\lambda^1 : [0,T] \to \mathbb{R}^n$, $\eta^1 : [0,T] \to \mathbb{R}$ such that
$$\dot\lambda^1 = -\partial_1 H^1(x, u, \lambda^1, \eta^1) = -\partial_1 f(x, u)\, \lambda^1 - \eta^1\, \partial_1 \dot g(x, u), \tag{28}$$
$$\lambda^1(T) = \lambda_0\, \nabla q(x(T)). \tag{29}$$
The following jump condition holds at every entry point $t_{\mathrm{en}}$,
$$\lambda^1(t_{\mathrm{en}}^+) = \lambda^1(t_{\mathrm{en}}^-) - \beta^1(t_{\mathrm{en}})\, \nabla g(x(t_{\mathrm{en}})), \qquad \beta^1(t_{\mathrm{en}}) \ge 0, \tag{30}$$
and $\lambda^1$ is continuous at every exit point. The function $\eta^1$ satisfies $\eta^1(t)\, g(x(t)) = 0$ on $[0,T]$ and is a $C^1$ function in the interior $]t_1, t_2[$ of every boundary interval, given by
$$\eta^1(t) = -\frac{\lambda^1(t)^{\mathrm T}\, \partial_2 f(x(t), u(t))}{\partial_2 \dot g(x(t), u(t))}. \tag{31}$$
Moreover, $\eta^1(t) \ge 0$ and $\dot\eta^1(t) \le 0$ for $t_1 < t < t_2$. It also holds for a.e. $t \in [0,T]$ that
$$\min_{v \in U} H^1(x(t), v, \lambda^1(t), \eta^1(t)) = H^1(x(t), u(t), \lambda^1(t), \eta^1(t)) = \mathrm{const}. \tag{32}$$
Let us first notice that if $\lambda^1$ in (31) is identical with the adjoint ψ, then the multiplier $\eta^1$ is equal to the function $-p$, with p given by (22) in Section 5. The function $-p$, similarly to $\eta^1$, is nonnegative and nonincreasing in every boundary interval. The adjoint Equation (28) is then identical with (11) almost everywhere. Indeed, in every boundary interval the Equation (11) takes the form
$$\dot\psi = -\partial_1 f(x, w(x))\, \psi - \nabla w(x)\, \partial_2 f(x, w(x))^{\mathrm T} \psi.$$
By virtue of Section 1 and (22),
$$\nabla w(x) = -\frac{\partial_1 \dot g(x, u)}{\partial_2 \dot g(x, u)}, \qquad p = \frac{\psi^{\mathrm T}\, \partial_2 f(x, u)}{\partial_2 \dot g(x, u)}.$$
Hence if $\eta^1 = -p$, then $\eta^1\, \partial_1 \dot g(x, u) = \nabla w(x)\, \partial_2 f(x, w(x))^{\mathrm T} \psi$. The final condition (29) coincides with (12) if $\lambda_0 = 1$. This last equality is ensured in [3] by special regularity conditions. Further, it is evident that the jump condition (13) can be written in the form
$$\psi(t_{\mathrm{en}}^-) = \psi(t_{\mathrm{en}}^+) - \frac{\Delta H(t_{\mathrm{en}}, u(t_{\mathrm{en}}^-))}{\dot g(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))}\, \nabla g(x(t_{\mathrm{en}})).$$
Thus, in this case (30) is identical with (13) if
$$\beta^1(t_{\mathrm{en}}) = -\frac{\Delta H(t_{\mathrm{en}}, u(t_{\mathrm{en}}^-))}{\dot g(x(t_{\mathrm{en}}), u(t_{\mathrm{en}}^-))} \ge 0.$$
In the control affine case discussed in Section 7, the condition (13) is readily transformed to
$$\psi(t_{\mathrm{en}}^-) = \psi(t_{\mathrm{en}}^+) - p(t_{\mathrm{en}}^+)\, \nabla g(x(t_{\mathrm{en}})),$$
with p determined by (25). Hence we have $\beta^1(t_{\mathrm{en}}) = -p(t_{\mathrm{en}}^+) \ge 0$ by Corollary 4. We skip the proof of the sign of the inequality in the general case, which is more complicated. The identity of the adjoints $\lambda^1$ and ψ entails the equivalence between the minimum conditions (32) and (23). In conclusion, the results obtained in this work for the case where, in the boundary intervals, $u(t) \in \operatorname{int} U$, $\phi_{\max}(t) = \phi_{\min}(t) + \pi$, and $p(t)$ is given by (22) or (25), are in agreement with Theorem 5.1 in [3].
Finally, note that this work's approach does not require that the optimal control in the boundary intervals take values in the interior of U, whereas that assumption is essential in [3]. Also, in contrast to [3], we give an explicit representation of the jump of the adjoint function (Section 3; see also [11,12]).

Author Contributions

Conceptualization, methodology, investigation, writing—A.K. and M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing not applicable.

Acknowledgments

The authors thank the two anonymous reviewers for their comments that helped to improve the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Pontryagin, L.S.; Boltyanskii, V.G.; Gamkrelidze, R.V.; Mishchenko, E.F. The Mathematical Theory of Optimal Processes; Nauka: Moscow, Russia, 1961 (in Russian; first English-language edition: Wiley & Sons: New York, NY, USA, 1962).
2. Karamzin, D.; Pereira, F.L. On a Few Questions Regarding the Study of State-Constrained Problems in Optimal Control. J. Optim. Theory Appl. 2019, 180, 235–255.
3. Maurer, H. On the Minimum Principle for Optimal Control Problems with State Constraints; Universität Münster: Münster, Germany, 1979.
4. Hartl, R.F.; Sethi, S.P.; Vickson, R.G. A survey of the maximum principles for optimal control problems with state constraints. SIAM Rev. 1995, 37, 181–218.
5. Arutyunov, A.V.; Karamzin, D.Y.; Pereira, F.L. The maximum principle for optimal control problems with state constraints by R.V. Gamkrelidze: Revisited. J. Optim. Theory Appl. 2011, 149, 474–493.
6. Bonnans, J.F. Course on Optimal Control. Part I: The Pontryagin Approach (version of 21 August 2019). Available online: http://www.cmap.polytechnique.fr/~bonnans/notes/oc/ocbook.pdf (accessed on 30 December 2020).
7. Bourdin, L. Note on Pontryagin Maximum Principle with Running State Constraints and Smooth Dynamics—Proof Based on the Ekeland Variational Principle; University of Limoges: Limoges, France, 2016. Available online: https://arxiv.org/pdf/1604.04051v1.pdf (accessed on 30 December 2020).
8. Dmitruk, A.; Samylovskiy, I. On the relation between two approaches to necessary optimality conditions in problems with state constraints. J. Optim. Theory Appl. 2017, 173, 391–420.
9. Vinter, R. Optimal Control; Birkhäuser: Boston, MA, USA, 2000.
10. Korytowski, A.; Szymkat, M. On convergence of the Monotone Structural Evolution. Control Cybern. 2016, 45, 483–512.
11. Bonnans, J.F. The shooting approach to optimal control problems. IFAC Proc. Vol. 2013, 46, 281–292.
12. Bonnans, J.F.; Hermant, A. Well-posedness of the shooting algorithm for state constrained optimal control problems with a single constraint and control. SIAM J. Control Optim. 2007, 46, 1398–1430.
13. Chertovskih, R.; Karamzin, D.; Khalil, N.T.; Pereira, F.L. Regular path-constrained time-optimal control problems in three-dimensional flow fields. Eur. J. Control 2020, 56, 98–106.
14. Cortez, K.; de Pinho, M.R.; Matos, A. Necessary conditions for a class of optimal multiprocess with state constraints. Int. J. Robust Nonlinear Control 2020, 30, 6021–6041.
¹ $PC(0,T;U)$ is the space of all functions $[0,T] \to U$ which have a finite number of discontinuities, are right-continuous in $[0,T[$, left-continuous at T, and have a finite left-hand limit at every point.
² Controls leading to state trajectories with boundary touch points are not verifiable.
Figure 1. Optimal control (left scale) and optimal state trajectory (right scale).
Figure 2. Adjoint trajectory.
Figure 3. Verifying the conditions of Theorems 1 and 2.
Figure 4. Verifying the conditions of Theorem 3.
Figure 5. Control (left scale) and state trajectory (right scale).
Figure 6. Adjoint trajectory.
Figure 7. Verifying the condition of Theorem 1(i).
Figure 8. Verifying the conditions of Theorem 3.
Figure 9. Optimal control (left scale), switching function and p (right scale).
Figure 10. Optimal state trajectory.
Figure 11. Control (left scale), switching function and p (right scale).
Figure 12. State trajectory.
Figure 13. Contour plot of $p(\tau_2) - p(\tau_1)$.
Figure 14. Cost increment vs. width of the first spike.
Figure 15. State trajectory $x_\varepsilon$.
Figure 16. Blow-up of $x_{3\varepsilon}$.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
