Abnormality and Strict-Sense Minimizers That Are Not Extended Minimizers

Fusco, Giovanni; Motta, Monica

doi:10.3390/math12070943

Open AccessFeature PaperArticle

Abnormality and Strict-Sense Minimizers That Are Not Extended Minimizers

by

Giovanni Fusco

^†

and

Monica Motta

^*,†

Department of Mathematics “Tullio Levi-Civita”, Università degli Studi di Padova, Via Trieste, 63, 35121 Padova, Italy

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2024, 12(7), 943; https://0-doi-org.brum.beds.ac.uk/10.3390/math12070943

Submission received: 21 February 2024 / Revised: 19 March 2024 / Accepted: 20 March 2024 / Published: 22 March 2024

(This article belongs to the Special Issue Variational Problems and Applications, 2nd Edition)

Download Versions Notes

Abstract

:

We consider a constrained optimal control problem and an extension of it, in which the set of strict-sense trajectories is enlarged. Extension is a common procedure in optimal control used to derive necessary and sufficient optimality conditions for the original problem from the extended one, which usually admits a minimizer and has a more regular structure. However, this procedure fails if the two problems have different infima. Therefore, it is relevant to identify such situations. Following on from earlier work by Warga but adopting perturbation techniques developed in nonsmooth analysis, we investigate the relation between the occurrence of an infimum gap and the abnormality of necessary conditions. For the notion of a local minimizer based on control distance and an extension, including the impulsive one, we prove that (i) a local extended minimizer that is not a local minimizer of the original problem, and (ii) a local strict-sense minimizer that is not a local minimizer of the extended problem both satisfy the extended maximum principle in abnormal form. The main novelty is result (ii), as until now, it has only been shown that a strict-sense minimizer that is not an extended minimizer is abnormal for an ‘averaged version’ of the maximum principle.

Keywords:

optimal control problems; maximum principle; state constraints; gap phenomena; impulsive optimal control

MSC:

49K15; 34K45; 49N25

1. Introduction

It is common practice in the fields of the calculus of variations and optimal control to extend the space of solutions for problems that cannot be solved in, say, an ordinary space, or if the solution is difficult to find, even with numerical approximation. This process, known as extension, involves compactifying and regularizing the problem, resulting in a more manageable structure and the possibility of obtaining necessary and sufficient conditions for optimality. However, in order for an extension to be well posed, it is fundamental that the infimum value achievable in the original problem coincides with that of the extended problem. Otherwise, the extended problem will not provide any useful information about the original problem, which is the only one whose strategies we actually want or can implement. So, for instance, determining an extended minimizer and the solution to the Hamilton–Jacobi equation associated with the extended problem (analytically or using numerical methods) is useful only if from them, we can derive a quasi-optimal control and the value function for the original problem, respectively. Clearly, this is only possible if there is no gap between infima. However, in the presence of endpoint and state constraints, a gap often occurs, even in situations where the set of strict-sense original solutions is

L^{\infty}

-dense in the set of extended paths. In particular, this problem arises when all strict-sense solutions close to an extended trajectory that satisfies the constraints, for instance, a local minimizer, fail to meet them in turn. Criteria for avoiding an infimum gap have, therefore, been extensively investigated in the literature. In the calculus of variations, for example, it is well known that, in the absence of suitable coercivity assumptions, the minimum of an integral cost over Lipschitz-continuous functions with assigned initial and final points may not exist or may be greater than the minimum assumed in the largest set of absolutely continuous functions. In this context, the gap issue is called the Lavrentiev phenomenon, and it is still widely studied (see, e.g., [1] and the comprehensive bibliography therein). As far as optimal control is concerned, a classical extension involves relaxation, obtained by either convexifying the set of admissible velocities or introducing relaxed controls that take values in a set of probability measures. Another extension is the impulsive one, in which a non-coercive problem with unbounded controls, i.e., where minimizing sequences of solutions may have increasing velocities and tend in the limit to discontinuous paths, is extended by admitting functions of bounded variation as solutions. A detailed description of these well-known extensions and a wide bibliography can be found, e.g., in [2,3]. The gap phenomenon has also been studied extensively in optimal control, often in correspondence with necessary optimality conditions, known in the literature as the Pontryagin Maximum Principle (see [4,5] for its original formulation and applications). In particular, starting from the seminal work by Warga [6] in the early 1970s, criteria for excluding an infimum gap for different problems and extensions have been expressed in terms of normality conditions for some versions of the Pontryagin Maximum Principle, where normality means that all sets of multipliers have the cost multiplier different from zero (see, e.g., [7,8,9,10,11,12,13]).

This paper focuses on the connection between the presence of an infimum gap, at least in a local sense, and a nonsmooth version of the maximum principle satisfied in abnormal form, i.e., not normal, for the following problem

(P)

and its extension

(P_{e})

.

Given

T > 0

and

{\overset{ˇ}{x}}_{0} \in R^{n}

, we introduce the constrained control system

\begin{matrix} \dot{y} (t) = F (t, y (t), ω (t), α (t)) for a . e . t \in [0, T], y (0) = {\overset{ˇ}{x}}_{0}, \end{matrix}

(1)

\begin{matrix} h (t, y (t)) \leq 0 \forall t \in [0, T], y (T) \in T . \end{matrix}

(2)

Here,

T

is a closed subset of

R^{n}

, which we call the target;

h : R \times R^{n} \to R

is the state constraint function; and

F : R \times R^{n} \times \bar{V} \times A \to R^{n}

is the dynamics function, where the compact subset

A \subset R^{q}

and the bounded subset

V \subset R^{m}

are the sets of control values. Indeed, with

\bar{V}

denoting the closure of V, let us define the sets

A

,

V

, and

W

of admissible control functions as follows

A : = L^{1} ([0, T], A) V : = L^{1} ([0, T], V) W : = L^{1} ([0, T], \bar{V}) .

We call an extended process any triple

(ω, α, y) \in A \times W \times W^{1, 1} ([0, T], R^{n})

that satisfies the dynamic constraint (1). If

ω \in V

in particular, then we refer to

(ω, α, y)

as a strict-sense process. Any process (either extended or strict-sense) that additionally fulfills the endpoint and the state constraint in (2) is said to be feasible. The sets of feasible strict-sense and feasible extended processes are denoted by

Γ_{s}

and

Γ_{e}

, respectively.

Given a cost function

Ψ : R^{n} \to R

, we introduce the strict-sense optimal control problem

minimize Ψ (y (T)) over (ω, α, y) \in Γ_{s} (P_{s})

and the extended optimal control problem

minimize Ψ (y (T)) over (ω, α, y) \in Γ_{e} . (P_{e})

Note how the controls

α

and

ω

play different roles, given that only the control set

V

, to which

ω

belongs, is extended. The opportunity to consider both arises from applications. For example, in impulsive problems, it is common that only certain control components can take values in an unbounded set. In this case, which we clarify in Section 4,

ω

represents these components while

α

represents the remaining ones (see, e.g., the model example in [14]). Incidentally, this distinction is reflected in the hypotheses on the dynamics

F

, which require continuity in

α

and, instead, a form of uniform continuity for both

F

and its Clarke-generalized Jacobian

D_{x} F

in the variable

ω

, as specified in Section 2.

Since

Γ_{s} \subseteq Γ_{e}

, it immediately follows that

{inf}_{Γ_{e}} Ψ (y (T)) \leq {inf}_{Γ_{s}} Ψ (y (T))

. In fact, this inequality might be strict, in which case we say that there is an infimum gap. In order to introduce the notion of a local infimum gap, for any pair of extended processes

(ω, α, y)

,

(ω^{'}, α^{'}, y^{'})

, we define the following control distance:

d ((ω, α, y), (ω^{'}, α^{'}, y^{'})) : = {∥ ω - ω^{'} ∥}_{L^{1} (0, T)} + ℓ ({t \in [0, T] : α (t) \neq α^{'} (t)}),

where ℓ is the Lebesgue measure. Hence, a feasible strict-sense [resp., extended] process

(\bar{ω}, \bar{α}, \bar{y})

is a local minimizer for

(P_{s})

[resp.,

(P_{e})

] if there exists some

δ > 0

such that

Ψ (\bar{y} (T)) \leq Ψ (y (T))

for any

(ω, α, y)

in

Γ_{s}

[resp.,

Γ_{e}

], satisfying

d ((\bar{ω}, \bar{α}, \bar{y}), (ω, α, y)) \leq δ

.

We distinguish the following two types of local infimum gaps according to whether we focus on the strict-sense problem or the extended problem:

Type-E local infimum gap, when the cost of a local minimizer $(\bar{ω}, \bar{α}, \bar{y})$ of $(P_{e})$ is strictly smaller than the infimum of $(P_{s})$ in a $d$ -neighborhood of $(\bar{ω}, \bar{α}, \bar{y})$ ,
Type-S local infimum gap, if a local minimizer of $(P_{s})$ is not a local minimizer of $(P_{e})$ .

Assuming the hypotheses provided in Section 2, and with reference to the maximum principle of Definition 3 below, our main results are the following:

(i): If at $(\bar{ω}, \bar{α}, \bar{y}) \in Γ_{e}$ , there is a type-E local infimum gap, then $(\bar{ω}, \bar{α}, \bar{y})$ satisfies the maximum principle in abnormal form, i.e., for a set of multipliers with cost multiplier equal to zero;
(ii): If $(\bar{ω}, \bar{α}, \bar{y}) \in Γ_{s}$ is a local minimizer of $(P)$ , then it satisfies the same maximum principle as the extended problem. If, in addition, at $(\bar{ω}, \bar{α}, \bar{y}) \in Γ_{s}$ , there is a type-S local infimum gap, then it is an abnormal extremal.

We emphasize that the choice of the distance

d

above, which plays a fundamental role in the proof of these results, represents a novelty compared to the works we quoted above. In fact, in these papers, they always consider

L^{\infty}

local minimizers, where the

L^{\infty}

distance between the trajectories is used instead of

d

.

Furthermore, in Section 4, we illustrate a relevant application of the above results to impulsive optimal control. There are significant examples in aerodynamics [14,15], mechanics [16,17], and biology [18,19] where the evolution of the involved variables can be modeled as a control system, in which controls can reach very high intensity in a very short time interval, resulting in an abrupt change in the state of the system. The impulsive extension is, therefore, a limit problem, in which the previous controls and trajectories are replaced with their (suitably defined) limits. It is worth emphasizing that in the above-mentioned applications to real-world problems, impulsive controls are only idealizations of the original controls so results in relation to the impulsive problem are of interest only if they provide information on the original problem, namely only if no gap of any type occurs.

As already mentioned, Warga was the first to study the correlation between the presence of an infimum gap and the validity of the maximum principle in abnormal form for a classical extension through relaxation in the measure of the controls. Specifically, he announced the result for a type-S

L^{\infty}

-local infimum gap in his early paper [6], which focused on state constraint-free optimal control problems with smooth data. Then, in his monograph [13], Warga proved the relationship between the gap and abnormality for a type-E

L^{\infty}

-local infimum gap in optimal control problems with state constraints (see also [11]). His subsequent work [12] extended this result to include nonsmooth data, utilizing the results in [20]. Vinter and Palladino [10] proved the above-mentioned correlation in the case of both type-E and type-S

L^{\infty}

-local infimum gaps for the classical extension through convex relaxation of a class of nonsmooth state-constrained optimal control problems, which subsumed those considered by Warga and under less restrictive hypotheses on data. Their techniques differed significantly from those of Warga, reflecting different approaches to the maximum principle. In more detail, the method adopted in [11,12,20] involved constructing approximating cones to reachable sets and using set separation arguments, whereas the technique adopted in [10] utilized perturbation and penalization procedures as well as Ekeland’s variational principle. When applied to nonsmooth optimal control problems, it is difficult to compare these methods as they require different assumptions on the dynamics and target. More importantly, they give rise to distinct abnormality conditions. Indeed, following Warga’s method, these conditions involve the use of ‘derivative containers’ as generalized gradients from [12], whereas the second method relies on Clarke’s version of the maximum principle, in which subdifferentials are considered (see [21,22]). More recently, following the latter approach, results similar to those in [10] were established in [7] for the impulsive extension of optimal control problems with unbounded dynamics and state constraints (see also the references therein). Additionally, in [8,9], an abstract extension including both relaxation and impulsive extension as special cases was addressed. In particular, in [7,8], for the first time, we also provided sufficient conditions for the nondegeneracy of the abnormality condition related to a type-E

L^{\infty}

-local infimum gap.

However, besides considering

L^{\infty}

-local minimizers, all these works focused primarily on the type-E local infimum gap. Specifically, apart from Warga’s initial work, the type-S local infimum gap was only studied in [10] for the extension through convexification of the dynamics and in [9] for a more general extension. In both papers, the results were not entirely satisfactory; however, it was shown that a strict-sense

L^{\infty}

-local minimizer that is not also an extended local minimizer satisfies, in abnormal form, an ‘averaged version’ of the maximum principle, which is much less informative than the actual maximum principle.

In this paper, for the extension under consideration, on the one hand, we fill the gap in the previous literature regarding the results obtained for the type-E and type-S local infimum gaps by showing that in both cases, the local minimizer is abnormal for the maximum principle associated with the extended problem. On the other hand, we extend the previous results for the type-E

L^{\infty}

-local infimum gap to the case of the local minimizer based on the distance

d

described above. Note that from the continuity property of the input-output map associated with the control system, it follows that the present results imply the previous ones. With regard to the techniques used, we are inspired by the approach proposed in [10], as generalized to the case of an abstract extension in [9]. In particular, this allows us to consider rather weak assumptions, including nonsmooth dynamics and state constraint functions, and a target that is simply a closed set (see Section 2).

This paper is organized as follows. In Section 2, we present the notations used, some useful definitions, and precise assumptions. In Section 3, we rigorously introduce the concepts of type-E and type-S local infimum gaps and state our main results, which are proved in Section 5. Section 4 is devoted to applying these results to the impulsive extension of a control-affine system with unbounded controls. We also give an example. Section 6 contains some concluding remarks.

2. Notations and Basic Assumptions

2.1. Notations and Preliminaries

Given

T > 0

and

X \subseteq R^{k}

, we denote by

W^{1, 1} ([0, T], X)

,

L^{1} ([0, T], X)

, and

L^{\infty} ([0, T], X)

the sets of absolutely continuous functions, Lebesgue integrable functions, and essentially bounded functions defined on

[0, T]

and taking values in X, respectively. We do not write domains and codomains when the meaning is clear, and we adopt

{∥ \cdot ∥}_{L^{1} (0, T)}

,

{∥ \cdot ∥}_{L^{\infty} (0, T)}

, or

{∥ \cdot ∥}_{L^{1}}

,

{∥ \cdot ∥}_{L^{\infty}}

to denote the

L^{1}

and the ess-sup norm, respectively. Moreover,

ℓ (X)

,

co (X)

,

\bar{X}

, and

\partial X

denote the Lebesgue measure, the convex hull, the closure, and the boundary of X, respectively. Given a closed set

C \subseteq R^{k}

and a point

z \in R^{k}

, we define the distance of z from

C

as

d_{C} (z) : = {min}_{y \in C} | z - y |

. For any

a, b \in R

, we set

a \lor b : = max {a, b}

. We employ

N B V^{+} ([0, T], R)

to denote the set of monotone non-decreasing, real-valued functions

μ

on

[0, T]

of bounded variation, vanishing at the point 0 and right continuous on

] 0, T [

. Each

μ \in N B V^{+} ([0, T], R)

defines a Borel measure on

[0, T]

, denoted by

μ

; its total variation is indicated by

{∥ μ ∥}_{T V}

or

μ ([0, T])

; and its support is denoted by spt

(μ)

. If

(μ_{i}) \subset N B V^{+} ([0, T], R)

, we say that

μ_{i} ⇀^{*} μ \in N B V^{+} ([0, T], R)

if

\int_{[0, T]} ψ μ_{i} (d t) \to \int_{[0, T]} ψ μ (d t)

for any continuous map

ψ : [0, T] \to R

.

Let us present some notions from nonsmooth analysis (see [21,22] for more details). A set

K \subseteq R^{k}

is a cone if, given

k \in K

and

a > 0

, then

a k \in K

. Let

C

be a closed subset of

R^{k}

, and let

\bar{x} \in C

. Then, the limiting normal cone

N_{C} (\bar{x})

of

C

at

\bar{x}

is given by

N_{C} (\bar{x}) : = \{η \in R^{k} : \exists (x_{i}, η_{i}) \subset C \times R^{k} s . t . (x_{i}, η_{i}) \to (\bar{x}, η), \underset{x \to x_{i}}{lim sup} \frac{η_{i} \cdot (x - x_{i})}{| x - x_{i} |} \leq 0 \forall i\} .

Let

H : R^{k} \to R

be a lower semicontinuous map, and let

\bar{z} \in R^{k}

. Then, the limiting subdifferential of H at

\bar{z}

is

\partial H (\bar{z}) : = \{ξ : \exists ξ_{i} \to ξ, z_{i} \to \bar{z} s . t . \underset{z \to z_{i}}{lim sup} \frac{ξ_{i} \cdot (z - z_{i}) - H (z) + H (z_{i})}{| z - z_{i} |} \leq 0 \forall i\} .

If

k = h + l

and

\bar{z} = (\bar{x}, \bar{y}) \in R^{h} \times R^{l}

,

\partial_{x} H (\bar{x}, \bar{y})

and

\partial_{y} H (\bar{x}, \bar{y})

denote the partial limiting subdifferential of H at

(\bar{x}, \bar{y})

with respect to x, y, respectively. When H is differentiable,

\nabla H

is the usual gradient operator, and

\nabla_{x} H

,

\nabla_{y} H

denote the partial derivatives of H. If H is also locally Lipschitz continuous, the hybrid subdifferential of H at

\bar{z} \in R^{k}

is

\partial^{>} H (\bar{z}) : = co \{ξ : \exists {(z_{i})}_{i} \subset diff (H) ∖ {\bar{z}} s . t . z_{i} \to \bar{z}, H (z_{i}) > 0 \forall i, \nabla H (z_{i}) \to ξ\},

where

diff (H)

is the set of differentiability points of H. Finally, if

U : R^{k} \to R^{l}

is a locally Lipschitz-continuous map and

\bar{z} \in R^{k}

, then

D U (\bar{z})

stands for the Clarke-generalized Jacobian, given by

D U (\bar{a}) : = co \{ξ : \exists {(z_{i})}_{i} \subset diff (U) ∖ {\bar{z}} s . t . z_{i} \to \bar{z} and \nabla U (z_{i}) \to ξ\},

where

\nabla U

refers to the Jacobian matrix of U. If

k = h + l

and

\bar{z} = (\bar{x}, \bar{y}) \in R^{h} \times R^{l}

,

D_{x} U (\bar{x}, \bar{y})

,

D_{y} U (\bar{x}, \bar{y})

denote the Clarke-generalized Jacobian of U at

(\bar{x}, \bar{y})

with respect to x, y, respectively. We recall that the following relation holds:

q \cdot D U (z) = co \partial (q \cdot U) (z) \forall (z, q) \in R^{k + k} .

(3)

2.2. Basic Assumptions

Now, we present the hypotheses we assume throughout this paper. In the following,

(\bar{ω}, \bar{α}, \bar{y})

is a feasible process, which we refer to as the reference process. Moreover, for a given

θ > 0

, the set

Σ_{θ} \subset R^{1 + n}

is defined as

Σ_{θ} : = \{(t, x) \in R \times R^{n} : t \in [0, T], x \in \bar{y} (t) + θ B\} .

H1.

The Borel set

A \subset R^{q}

is compact, and the Borel set

V \subset R^{m}

is bounded. Moreover, there exists a sequence

{(V_{i})}_{i}

of closed subsets of V satisfying

V_{i} \subseteq V_{i + 1} \forall i, ⋃_{i = 1}^{+ \infty} V_{i} = V .

H2.

The cost function Ψ is Lipschitz continuous on a neighborhood of

\bar{y} (T)

. The target

T \subseteq R^{n}

is closed. The state constraint function h is upper semicontinuous, and for some

K_{h} > 0

, it satisfies

| h (t, x) - h (t, x^{'}) | \leq K_{h} | x - x^{'} | for any (t, x), (t, x^{'}) \in Σ_{θ} .

H3.

For all

(x, w, a) \in {x \in R^{n} : (t, x) \in Σ_{θ} for some t \in [0, T]} \times \bar{V} \times A

, the map

F (\cdot, x, w, a)

is Lebesgue measurable on

[0, T]

. Moreover, for some

k \in L^{1} ([0, T], [0, + \infty [)

, one has

| F (t, x, w, a) | \leq k (t), | F (t, x^{'}, w, a) - F (t, x, w, a) | \leq k (t) | x^{'} - x |,

(4)

for all

(t, x, w, a)

,

(t, x^{'}, w, a) \in Σ_{θ} \times \bar{V} \times A

. Furthermore, there exists a continuous increasing function

φ : [0, + \infty [\to [0, + \infty [

vanishing at 0 and satisfying, for all

(t, x, a) \in Σ_{θ} \times A

, the following relations

\begin{matrix} | F (t, x, w^{'}, a) - F (t, x, w, a) | \leq k (t) φ (| w^{'} - w |) \forall w^{'}, w \in \bar{V}, \\ D_{x} F (t, x, w^{'}, a) \subseteq D_{x} F (t, x, w, a) + k (t) φ (| w^{'} - w |) B \forall w^{'}, w \in \bar{V} . \end{matrix}

Remark 1.

Hypothesis (H1) holds whenever V is a relatively open set. Moreover, we observe that if (H1) is satisfied, then

V

is a dense subset of

W

in the

L^{1}

-norm. In particular, for any

\bar{ω} \in W

and any

ε > 0

, there exists an integer

i_{ε}

for which

d_{H} (V_{i}, \bar{V}) < ε / T

for every

i \geq i_{ε}

, where

d_{H} (V_{i}, \bar{V})

stands for the Hausdorff distance between

V_{i}

and

\bar{V}

. Therefore, as a consequence of the selection theorem [23] (Theorem 2, p. 91), it is possible to find a measurable function

ω_{ε} (t) \in {proj}_{V_{i_{ε}}} (ω (t))

a.e., satisfying

∥ ω_{ε} - \bar{ω} ∥_{L^{1}} \leq T {∥ ω_{i} - ω ∥}_{L^{\infty}} \leq T d_{H} (V_{i}, \bar{V}) \leq ε .

Remark 2.

A sufficient condition for (H3) to be satisfied is that

F (t, x, w, a) = F_{1} (t, x, a) + F_{2} (t, x, w, a),

provided

F_{1}

and

F_{2}

meet relation (4), and

F_{2}

is continuous on the compact domain

Σ_{θ} \times \bar{V} \times A

and continuously differentiable with respect to the state variable. Hypothesis (H3) still holds if, for some integer

d \geq 1

, the dynamics function has the following control-polynomial structure

F (t, x, w, a) : = f (t, x, a) {(w_{1})}^{d} + \sum_{k = 1}^{d} (\sum_{2 \leq j_{1} \leq \dots \leq j_{k} \leq m} g_{j_{1}, \dots, j_{k}}^{k} (t, x) w_{j_{1}} \dots w_{j_{k}} {(w_{1})}^{d - k}),

provided f is continuous and locally Lipschitz continuous in

(t, x)

uniformly with respect to a and all the maps

g_{j_{1}, \dots, j_{k}}^{k}

are locally Lipschitz continuous.

3. Type-E or Type-S Local Infimum Gap and Abnormality

In this section, we first introduce the precise definitions of the two types of local infimum gaps we may encounter, depending on whether the process we consider is a local minimum of the extended or the strict-sense problem. Then, in Theorem 1 we establish our main result, namely that the presence of any kind of local infimum gap implies the abnormal extremal condition described in the second part of the section.

3.1. Type-E and Type-S Local Infimum Gaps

As already mentioned in Section 1, for any pair of extended processes

z = (ω, α, y)

,

\hat{z} = (\hat{ω}, \hat{α}, \hat{y})

, we consider the distance

d (z, \hat{z}) : = {∥ ω - \hat{ω} ∥}_{L^{1}} + ℓ \{t \in [0, T] : α (t) \neq \hat{α} (t)\} .

(5)

Moreover,

Γ_{s}

and

Γ_{e}

are the sets of feasible strict-sense and feasible extended processes, respectively.

Definition 1

(Local minimizer). Let

\tilde{Γ}

and

(\tilde{P})

denote

Γ_{e}

and

(P_{e})

or

Γ_{s}

and

(P_{s})

, respectively. A process

\bar{z} : = (\bar{ω}, \bar{α}, \bar{y}) \in \tilde{Γ}

is a local Ψ-minimizer for problem

(\tilde{P})

if, for some

δ > 0

, one has

Ψ (\bar{y} (T)) = inf \{Ψ (y (T)) : z = (ω, α, y) \in \tilde{Γ}, d (z, \bar{z}) < δ\} .

The process

\bar{z}

is a Ψ-minimizer for problem

(\tilde{P})

if

Ψ (\bar{y} (T)) = inf_{\tilde{Γ}} Ψ (y (T))

.

Remark 3.

Under hypothesis (H3), for each extended control

(ω, α) \in W \times A

in a suitable

d

-neighborhood of the reference control

(\bar{ω}, \bar{α})

, there is one and only one solution

y : = y [ω, α]

of (1). Furthermore, the input-output map

(ω, α) \mapsto y [ω, α]

from

W \times A

to

C^{0}

is continuous in this neighborhood, provided

W \times A

is endowed with the distance

d

and

C^{0}

is endowed with the distance induced by the sup-norm. Consequently, if the process

\bar{z}

is an

L^{\infty}

-local minimizer, meaning that

\bar{z}

reaches the minimum over processes

z = (ω, α, y)

with

∥ y - \bar{y} ∥_{L^{\infty}} < δ

for some

δ > 0

, then it is also a local minimizer according to Definition 1. In general, the contrary is not true. This makes the results in [8,9] concerning

L^{\infty}

-local minimizers not directly applicable to the present case.

It is now natural to provide the definitions of the local infimum gaps, depending on whether the reference process is extended or strict-sense.

Definition 2

(Infimum gaps). Let

Ψ : R^{n} \to R

be a continuous function.

(i): If $\bar{z} : = (\bar{ω}, \bar{α}, \bar{y}) \in Γ_{e}$ and for some $δ > 0$ , it holds that

$Ψ (\bar{y} (T)) < inf \{Ψ (y (T)) : z = (ω, α, y) \in Γ_{s}, d (z, \bar{z}) < δ\},$

we say that at $\bar{z}$ , there is a type-E local Ψ-infimum gap. If ${z = (ω, α, y) \in Γ_{s}, d (z, \bar{z}) < δ} = \emptyset$ , we set $inf \{Ψ (y (T)) : z = (ω, α, y) \in Γ_{s}, d (z, \bar{z}) < δ\} = + \infty$ .
(ii): If $\bar{z} : = (\bar{ω}, \bar{α}, \bar{y}) \in Γ_{s}$ is a local Ψ-minimizer for problem $(P_{s})$ , which is not a local Ψ-minimizer for problem $(P_{e})$ , i.e., $\forall ε > 0$ $\exists (ω, α, y) \in Γ_{e}$ satisfying

$Ψ (y (T)) < Ψ (\bar{y} (T)) and d (z, \bar{z}) < ε,$

we say that at $\bar{z}$ , there is a type-S local Ψ-infimum gap.
(iii): We say that there is a Ψ-infimum gap if $inf_{Γ_{e}} Ψ (y (T)) < inf_{Γ_{s}} Ψ (y (T)) .$

In cases where

Ψ

can easily be inferred from the context, we write infimum gap in place of

Ψ

-infimum gap.

Remark 4.

Given the continuity of the input-output map associated with control system (1), it is easy to see that the notion of the type-E local Ψ-infimum gap at

\bar{z}

does not depend on the cost function Ψ, as it is equivalent to the fact that

\{z = (ω, α, y) \in Γ_{s} : d (z, \bar{z}) < δ\} = \emptyset for some δ > 0

(6)

(see [8], Proposition 2.1). If

\bar{z}

satisfies (6), we say that it is an isolated process.

3.2. Main Results

We introduce a nonsmooth version of the Pontryagin maximum principle for

(P_{e})

, and we provide the notions of normal and abnormal extremals. Then, we establish a link between the abnormality and occurrence of a gap phenomenon.

Definition 3

(Pontryagin maximum principle). Let

\bar{z} : = (\bar{ω}, \bar{α}, \bar{y}) \in Γ_{e}

, and let hypotheses (H1)–(H3) be satisfied. We say that

\bar{z}

is a Ψ-extremal or satisfies the Pontryagin maximum principle if there exists a path

p \in W^{1, 1} ([0, T], R^{n})

,

γ \geq 0

,

μ \in N B V^{+} ([0, T], R)

, and a Borel-measurable and μ-integrable function

m : [0, T] \to R^{n}

satisfying the following conditions:

\begin{matrix} {∥ p ∥}_{L^{\infty}} + {∥ μ ∥}_{T V} + γ \neq 0, \end{matrix}

(7)

\begin{matrix} - \dot{p} (t) \in co \partial_{x} \{q (t) \cdot F (t, \bar{y} (t), \bar{ω} (t), \bar{α} (t))\} a . e . t \in [0, T]; \end{matrix}

(8)

\begin{matrix} - q (T) \in γ \partial Ψ (\bar{y} (T)) + N_{T} (\bar{y} (T)); \end{matrix}

(9)

\begin{matrix} \begin{matrix} for a . e . t \in [0, T], one has \\ q (t) \cdot F (t, \bar{y} (t), \bar{ω} (t), \bar{α} (t)) = max_{(w, a) \in \bar{V} \times A} q (t) \cdot F (t, \bar{y} (t), w, a); \end{matrix} \end{matrix}

(10)

\begin{matrix} m (t) \in \partial_{x}^{>} h (t, \bar{y} (t)) μ - a . e . t \in [0, T]; \end{matrix}

(11)

\begin{matrix} s p t (μ) \subseteq {t \in [0, T] : h (t, \bar{y} (t)) = 0}, \end{matrix}

(12)

where

q (t) : = \{\begin{matrix} p (t) + \int_{[0, t [} m (t^{'}) μ (d t^{'}) t \in [0, T [, \\ p (T) + \int_{[0, T]} m (t^{'}) μ (d t^{'}) t = T . \end{matrix}

We say that a Ψ-extremal

\bar{z}

is normal if all sets of multipliers

(p, γ, μ, m)

, as described above, have

γ > 0

. Conversely, we say that

\bar{z}

is abnormal when it is not normal. Clearly, abnormal Ψ-extremals do not depend on Ψ so we refer to them simply as abnormal extremals.

Remark 5.

By the start of the 1970s, it was commonly acknowledged that efforts to expand the usefulness of existing necessary conditions were being hindered by a common problem: a dearth of methods for examining the characteristics of nonsmooth functions and sets with nonsmooth boundaries. One approach to extending the celebrated Pontryagin Maximum Principle [5] in this direction is to use nonsmooth analysis, a branch of analysis that investigates precisely how to locally approximate functions that are non-differentiable and sets with a non-differentiable boundary. The maximum principle above is based on this approach, developed by Clarke and collaborators, for which we refer to the books [21,22].

Theorem 1.

Let

\bar{z} : = (\bar{ω}, \bar{α}, \bar{y}) \in Γ_{e}

and assume that hypotheses (H1)–(H3) hold. Then, consider the following statements:

(i): If $\bar{z}$ is a local Ψ-minimizer for $(P_{e})$ , then $\bar{z}$ is a Ψ-extremal. If at $\bar{z}$ , there is a type-E local Ψ-infimum gap, then $\bar{z}$ is an abnormal extremal;
(ii): If $\bar{z} \in Γ_{s}$ is a local Ψ-minimizer for $(P_{s})$ , then $\bar{z}$ is a Ψ-extremal. If at $\bar{z}$ , there is a type-S local Ψ-infimum gap, then $\bar{z}$ is an abnormal extremal.

The proof of Theorem 1 is given in Section 5.

The main novelty of Theorem 1 is statement (ii), concerning the case where

\bar{z}

is a local minimizer of the original problem but not of the extended one. Indeed, in the previous literature (see [9,10]), it was proven that in such cases, an

L^{\infty}

-local minimizer

\bar{z}

is an abnormal extremal only for an ‘averaged version’ of the maximum principle, meaning that the adjoint Equation (8) was replaced with the following weaker differential inclusion

- \dot{p} (t) \in co \{⋃_{(w, a) \in \bar{V} \times A} \partial_{x} (q (t) \cdot F (t, \bar{y} (t), w, a))\} a . e . t \in [0, T],

in which all information on optimal control is lost. Incidentally, note that the difference between the two adjoint equations still holds even if

F

is

C^{1}

in the state variable.

Remark 6.

It is worth mentioning that, despite hypothesis (H1) implying that

V

is a dense subset of

W

in the

L^{1}

-norm, it has been well known since the earliest work by Warga [12] and Kaskovz [11] that, in general, if only this latter condition is satisfied, the link between the gap and abnormality established in Theorem 1 may fail (see, e.g., the example in [24], Section 9).

A straightforward corollary of Theorem 1 is that the normality of an extremal turns out to be sufficient for any type of local infimum gap not to occur.

Theorem 2.

Let

\bar{z} : = (\bar{ω}, \bar{α}, \bar{y}) \in Γ_{e}

, and assume that hypotheses (H1)–(H3) hold. Then, consider the following statements:

(i): If $\bar{z}$ is a local Ψ-minimizer for $(P_{e})$ , which is a normal Ψ-extremal, at $\bar{z}$ , there is no type-E local Ψ-infimum gap. If, in addition, $\bar{z}$ is a Ψ-minimizer for $(P_{e})$ , then there is no Ψ-infimum gap;
(ii): If $\bar{z} \in Γ_{s}$ is a local Ψ-minimizer for $(P_{s})$ , which is a normal Ψ-extremal, at $\bar{z}$ , there is no type-S local Ψ-infimum gap, namely $\bar{z}$ is a local Ψ-minimizer for $(P_{e})$ as well.

4. An Application: The Impulsive Extension

In this section, we describe how the previous results can be used to investigate the gap phenomenon in a case relevant to applications: the impulsive extension of an optimal control problem with endpoint and state constraints. We also provide an example of an impulsive problem in which both a type-E and a type-S local infimum gap occur, and we explicitly show the abnormality condition in this case.

4.1. An Impulsive Optimization Problem

Let us consider the following free end-time optimization problem with unbounded, control-affine dynamics:

(P) \{\begin{matrix} minimize Ψ (S, x (S), v (S)) \\ over S > 0, u \in L^{1} ([0, S], U), (x, v) \in W^{1, 1} ([0, S], R^{n + 1}), s . t . \\ (\dot{x} (s), \dot{v} (s)) = (f (s, x (s)) + \sum_{j = 1}^{m} g_{j} (s, x (s)) u^{j} (s)), |u (s)|) a . e . s \in [0, S], \\ (x (0), v (0)) = ({\overset{ˇ}{x}}_{0}, 0), \\ h (s, x (s)) \leq 0 for all s \in [0, S], (S, x (S)) \in T^{*}, v (S) \leq K, \end{matrix}

in which

U \subseteq R^{m}

,

T^{*} \subset R^{1 + n}

,

f : R^{1 + n} \to R^{n}

,

g_{j} : R^{1 + n} \to R^{n}

for any

j = 1, \dots, m

,

Ψ : R^{1 + n + 1} \to R

, and

h : R^{1 + n} \to R

.

We make the following assumptions on the data:

H4.

K \in] 0, + \infty [

(i.e., K might be

+ \infty

); the (unbounded) set of control values U is a closed cone; the target

T^{*}

is a closed set; and the dynamics functions f,

g_{j}

, the constraint function h, and the cost function Ψ are locally Lipschitz continuous.

Note that

v (s)

, sometimes called fuel or energy, is simply the

L^{1}

-norm of the control function u on

[0, s]

. Assuming, as usual, that the function

v \mapsto Ψ (s, x, v)

is merely monotone nondecreasing, this problem is non-coercive, i.e., there are no conditions that prevent a minimizing sequence of trajectories from having increasing velocities and converging to a discontinuous path. It is well known that it is possible to embed the original problem

(P)

into the space-time or extended problem

(P_{e})

below, where the time becomes a new state variable and the trajectories are reparameterizations of the limits of the graphs of the trajectories of

(P)

in the

L^{\infty}

-norm [25,26,27,28] (we recall that

(P)

can be analyzed using a distributional approach, meaning that u is substituted by a Radon measure, only if the coefficients

g_{i}

are autonomous and commute, i.e., the Lie brackets

[g_{i}, g_{j}]

are equal to 0 for any

i, j = 1, \dots, m

(see, e.g., [25,29])):

(P_{e}) \{\begin{matrix} minimize Ψ (y^{0} (T), y (T), ν (T)) \\ over T > 0, (ω^{0}, ω) \in W (T), (y^{0}, y, ν) \in W^{1, 1} ([0, T], R^{1 + n + 1}), s . t . \\ {\dot{y}}^{0} (t) = ω^{0} (t) a . e . t \in [0, T], \\ \dot{y} (t) = f (y^{0} (t), y (t)) ω^{0} (t) + \sum_{j = 1}^{m} g_{j} (y^{0} (t), y (t)) ω^{j} (t) a . e . t \in [0, T], \\ \dot{ν} (t) = |ω (t)| a . e . t \in [0, T], \\ (y^{0}, y, ν) (0) = (0, {\overset{ˇ}{x}}_{0}, 0), (y^{0} (T), y (T), ν (T)) \in T^{*} \times] - \infty, K], \\ h (y^{0} (t), y (t)) \leq 0 for all t \in [0, T], \end{matrix}

where

W (T) : = L^{1} ([0, T], W)

, with W the set of control values given by

W : = \{(w^{0}, w) \in [0, + \infty [\times U : w^{0} + | w | = 1\} .

Let

(S, u, x, v)

be an original process, i.e., it satisfies the dynamics constraint together with the initial condition of problem

(P)

, and let

σ : [0, S] \to [0, + \infty [

be defined as follows

σ (s) : = s + v (s) for any s \in [0, S] .

We observe that

(T, ω^{0}, ω, y^{0}, y, ν) : = (σ (S), {\dot{y}}^{0}, (u \circ y^{0}) {\dot{y}}^{0}, σ^{- 1}, x \circ y^{0}, v \circ y^{0})

results in an extended process, i.e., it satisfies the dynamics constraint together with the initial condition of problem

(P_{e})

, and

ω^{0} = {\dot{y}}^{0} > 0

a.e. Actually, the map that associates with each original process an extended process with

ω^{0} > 0

a.e. turns out to be a bijection, so that problem

(P)

is in correspondence with the strict-sense problem

(P_{s})

, namely the optimal control problem that arises when in

(P_{e})

, we limit ourselves to consider strict-sense processes only, i.e., extended processes with

ω^{0} > 0

a.e. Therefore, the extension involves allowing the control variable

ω^{0}

to vanish on some non-trivial intervals contained in

[0, T]

. There,

y^{0}

remains constant, whereas y evolves instantaneously according to

\dot{y} = \sum_{j = 1}^{m} g_{j} (y^{0}, y) ω^{j} (t)

. This is the reason why

(P_{e})

, despite being an ordinary optimal control problem with controls taking values in compact sets, is usually labeled as the impulsive extension of

(P)

. Indeed, problem

(P_{e})

is also equivalent to another generalization of

(P)

where the controls are vector-valued measures and the trajectories are bounded variation paths [14,30,31,32,33,34].

Adopting the terminology of the present paper, we say that an extended or strict-sense process

(T, ω^{0}, ω, y^{0}, y, ν)

is feasible [resp. an original process

(S, u, x, v)

is feasible] if it additionally fulfills all the endpoint and the state constraint of

(P_{e})

[resp.

(P)

]. The sets of feasible original, feasible extended, and feasible strict-sense processes are denoted by

Γ^{*}

,

Γ_{e}

, and

Γ_{s}

, respectively. Given

z = (T, ω^{0}, ω, y^{0}, y, ν)

and

\hat{z} = (\hat{T}, {\hat{ω}}^{0}, \hat{ω}, {\hat{y}}^{0}, \hat{y}, \hat{ν}) \in Γ_{e}

, we define the distance:

d_{imp} (z, \hat{z}) : = | T - \hat{T} | + ∥ (ω^{0}, ω) - ({\hat{ω}}^{0}, \hat{ω}) ∥_{L^{1} (0, T \land \hat{T})} .

(13)

Note that

d_{imp}

is equivalent to the distance obtained by replacing

T \land \hat{T}

with

T \lor \hat{T}

in the

L^{1}

-norm (possibly extending the controls to

R

constantly equal to 0), as

∥ (ω^{0}, ω) - ({\hat{ω}}^{0}, \hat{ω}) ∥_{L^{1} (0, T \lor \hat{T})} - ∥ (ω^{0}, ω) - ({\hat{ω}}^{0}, \hat{ω}) ∥_{L^{1} (0, T \land \hat{T})} \leq M | T - \hat{T} |

for some constant

M > 0

. At this point, the definitions of the local minimizer and type-E and type-S local

Ψ

-infimum gaps (see Definitions 1 and 2) can be easily adapted to the impulsive extension by replacing the distance

d

defined in (5) with the distance

d_{imp}

given in (13). The unmaximized Hamiltonian associated with problem

(P_{e})

above is given by

H (s, x, p_{0}, p, π, w^{0}, w) : = p_{0} w^{0} + p \cdot (f (s, x) w^{0} + \sum_{j = 1}^{m} g_{j} (s, x) w^{j}) + π | ω |

for all

(s, x, p_{0}, p, π, w^{0}, w) \in R^{1 + n + 1 + n + 1} \times W

.

Definition 4.

We say that

(\bar{T}, {\bar{ω}}^{0}, \bar{ω}, {\bar{y}}^{0}, \bar{y}, \bar{ν}) \in Γ_{e}

is a Ψ-extremal if there exists a path

(p_{0}, p) \in W^{1, 1} ([0, \bar{T}], R^{1 + n})

,

γ \geq 0

,

π \leq 0

,

μ \in N B V^{+} ([0, \bar{T}], R)

, and Borel-measurable and μ-integrable functions

(m_{0}, m) : [0, \bar{T}] \to R^{1 + n}

satisfying the following conditions:

\begin{matrix} ∥ p_{0} ∥_{L^{\infty}} + {∥ p ∥}_{L^{\infty}} + μ ([0, \bar{T}]) + γ \neq 0 \end{matrix}

(14)

\begin{matrix} - ({\dot{p}}_{0}, \dot{p}) (t) \in co \partial_{s, x} H ({\bar{y}}^{0} (t), \bar{y} (t), q_{0} (t), q (t), π, {\bar{ω}}^{0} (t), \bar{ω} (t)) a . e . t \end{matrix}

(15)

\begin{matrix} - (q_{0} (\bar{T}), q (\bar{T}), π) \in γ \partial Ψ ({\bar{y}}^{0} (\bar{T}), \bar{y} (\bar{T}), \bar{ν} (\bar{T})) + N_{T^{*} \times] - \infty, K]} ({\bar{y}}^{0} (\bar{T}), \bar{y} (\bar{T}), \bar{ν} (\bar{T})) \end{matrix}

(16)

\begin{matrix} \begin{matrix} H ({\bar{y}}^{0} (t), \bar{y} (t), q_{0} (t), q (t), π, {\bar{ω}}^{0} (t), \bar{ω} (t)) \\ = max_{(w^{0}, w) \in W} H ({\bar{y}}^{0} (t), \bar{y} (t), q_{0} (t), q (t), π, w^{0}, w) = 0 a . e . t \end{matrix} \end{matrix}

(17)

\begin{matrix} (m_{0}, m) (t) \in \partial_{s, x}^{>} h ({\bar{y}}^{0} (t), \bar{y} (t)) μ - a . e . t \end{matrix}

(18)

\begin{matrix} spt (μ) \subseteq {t \in [0, \bar{T}] : h ({\bar{y}}^{0} (t), \bar{y} (t)) = 0}, \end{matrix}

(19)

where

(q_{0}, q) : [0, \bar{T}] \to R^{1 + n}

is given by

(q_{0}, q) (t) : = \{\begin{matrix} (p_{0}, p) (t) + \int_{[0, t [} (m_{0}, m) (t^{'}) μ (d t^{'}) t \in [0, \bar{T} [, \\ (p_{0}, p) (\bar{T}) + \int_{[0, \bar{T}]} (m_{0}, m) (t^{'}) μ (d t^{'}) t = \bar{T} . \end{matrix}

Moreover, if

γ \partial_{ν} Ψ ({\bar{y}}^{0} (\bar{T}), \bar{y} (\bar{T}), \bar{ν} (\bar{T})) = 0

and

\bar{ν} (\bar{T}) < K

, then

π = 0

. Furthermore, if

{\bar{y}}^{0} (0) < {\bar{y}}^{0} (\bar{T})

, then (14) can be strengthened with

{∥ p ∥}_{L^{\infty}} + μ ([0, \bar{T}]) + γ \neq 0 .

(20)

We say that

(\bar{T}, {\bar{ω}}^{0}, \bar{ω}, {\bar{y}}^{0}, \bar{y}, \bar{ν})

is normal if all sets of multipliers

(p_{0}, p, γ, π, μ, m_{0}, m)

, as described above, have

γ > 0

. Conversely, we say that

(\bar{T}, {\bar{ω}}^{0}, \bar{ω}, {\bar{y}}^{0}, \bar{y}, \bar{ν})

is abnormal when it is not normal.

From Theorem 1, we deduce the following result.

Theorem 3.

Let

\bar{z} : = (\bar{T}, {\bar{ω}}^{0}, \bar{ω}, {\bar{y}}^{0}, \bar{y}, \bar{ν}) \in Γ_{e}

, and assume that hypothesis (H4) holds. Then, consider the following statements:

(i): If $\bar{z}$ is a local Ψ-minimizer for $(P_{e})$ , then $\bar{z}$ is a Ψ-extremal. If at $\bar{z}$ , there is a type-E local Ψ-infimum gap, then $\bar{z}$ is an abnormal extremal;
(ii): If $\bar{z} \in Γ_{s}$ is a local Ψ-minimizer for $(P_{s})$ , then $\bar{z}$ is a Ψ-extremal. If at $\bar{z}$ , there is a type-S local Ψ-infimum gap, then $\bar{z}$ is an abnormal extremal.

Proof.

The impulsive extended problem

(P_{e})

has a free end time, so the results of the previous sections concerning fixed end-time problems do not apply straightforwardly. However, through a standard time-rescaling procedure that applies to free end-time problems with Lipschitz-continuous time dependence, we can embed problem

(P_{e})

into a fixed end-time optimization problem, satisfying all the assumptions of Theorem 1 and for which, for example,

\bar{z}

is still a local minimizer if it was so for

(P_{e})

. Precisely, let

W : = W (\bar{T})

,

D : = L^{1} ([0, \bar{T}], [- 1 / 2, 1 / 2])

, and consider the rescaled problem:

(P_{e}^{r}) \{\begin{matrix} minimize Ψ (y^{0} (\bar{T}), y (\bar{T}), ν (\bar{T})) \\ over (ω^{0}, ω) \in W, d \in D, (y^{0}, y, ν) \in W^{1, 1} ([0, \bar{T}], R^{1 + n + 1}), s . t . \\ {\dot{y}}^{0} (t) = (1 + d (t)) ω^{0} (t) a . e . t \in [0, \bar{T}], \\ \dot{y} (t) = (1 + d (t)) F (y^{0} (t), y (t), ω^{0} (t), ω (t)) a . e . t \in [0, \bar{T}], \\ \dot{ν} (t) = (1 + d (t)) |ω (t)| a . e . t \in [0, \bar{T}], \\ (y^{0}, y, ν) (0) = (t_{1}, {\overset{ˇ}{x}}_{0}, 0), \\ h (y^{0} (t), y (t)) \leq 0 for all t \in [0, \bar{T}], (y^{0} (\bar{T}), y (\bar{T}), ν (\bar{T})) \in T^{*} \times] - \infty, K], \end{matrix}

where, for any

(t, x, w^{0}, w) \in R^{1 + n} \times W

, we have the set

F (t, x, w^{0}, w) : = f (t, x) w^{0} + \sum_{j = 1}^{m} g_{j} (t, x) w^{j} .

Any element

(ω^{0}, ω, d, y^{0}, y, ν)

satisfying all constraints in

(P_{e}^{r})

is referred to as a feasible rescaled extended process. If

ω^{0} > 0

a.e., then

(ω^{0}, ω, d, y^{0}, y, ν)

is called a feasible rescaled strict-sense process. For any pair of feasible rescaled extended processes

ζ : = (ω^{0}, ω, d, y^{0}, y, ν)

,

\hat{ζ} : = ({\hat{ω}}^{0}, \hat{ω}, \hat{d}, {\hat{y}}^{0}, \hat{y}, \hat{ν})

, we define the distance as

d^{r} (ζ, \hat{ζ}) : = {∥ (ω^{0}, ω, d) - ({\hat{ω}}^{0}, \hat{ω}, \hat{d}) ∥}_{L^{1} (0, \bar{T})} .

Let us associate the (feasible) rescaled process

\bar{ζ} : = ({\bar{ω}}^{0}, \bar{ω}, \bar{d} = 0, {\bar{y}}^{0}, \bar{y}, \bar{ν})

with the given reference process

\bar{z} = (\bar{T}, {\bar{ω}}^{0}, \bar{ω}, {\bar{y}}^{0}, \bar{y}, \bar{ν})

. From a straightforward application of the chain rule and standard calculations, we deduce that for any

δ > 0

, there exists some

ε \in] 0, δ [

such that with each feasible rescaled extended process

ζ : = ({\tilde{ω}}^{0}, \tilde{ω}, \tilde{d}, {\tilde{y}}^{0}, \tilde{y}, \tilde{ν})

satisfying

d^{r} (ζ, \bar{ζ}) < ε

, using the time change

τ (s) = \int_{0}^{s} \frac{d s^{'}}{1 + \tilde{d} (s^{'})}, s \in [0, \bar{T}],

we can associate the following feasible extended process

z = (T, ω^{0}, ω, y^{0}, y, ν) : = (τ (\bar{T}), ({\tilde{ω}}^{0}, \tilde{ω}, {\tilde{y}}^{0}, \tilde{y}, \tilde{ν}) \circ τ) .

satisfying

d_{imp} (z, \bar{z}) < δ

. Moreover,

Ψ (({\tilde{y}}^{0}, \tilde{y}, \tilde{ν}) (\bar{T})) = Ψ ((y^{0}, y, ν) (T))

.

As a consequence, if

\bar{z}

is a local

Ψ

-minimizer for

(P_{e})

, then

\bar{ζ}

is a local

Ψ

-minimizer for

(P_{e}^{r})

, at which there is a type-E local infimum gap as soon as at

\bar{z}

, there is a type-E local infimum gap. At this point, the proof of Theorem 3 can be derived by applying Theorem 1 to the rescaled problem. We omit the details, which follow the same line as the proofs in [22] (Theorem 8.7.1), and [8] (Theorem 4.1). □

Remark 7.

Using similar arguments to those in [8], what we have done in this section can be easily generalized to control-polynomial impulsive problems, by which we mean that the dynamics of the original problem

(P)

can be replaced with

(\dot{x}, \dot{v}) (t) = (f (t, x) + \sum_{k = 1}^{d} (\sum_{1 \leq j_{1} \leq \dots \leq j_{k} \leq m} g_{j_{1}, \dots, j_{k}}^{k} (t, x) u^{j_{1}} \dots u^{j_{k}}), {|u|}^{d}) a . e . t,

where d is an integer

\geq 1

. This generalization may be relevant for some applications to Lagrangian mechanics, where dynamics are usually control-polynomial with a degree of

d = 2

(see [17]).

4.2. An Example

The following example tells us that both a type-S local infimum gap and a type-E local infimum gap may occur. Moreover, we exhibit sets of abnormal multipliers, which exist in accordance with Theorem 3.

Consider the optimization problem with scalar, unbounded controls:

(P) \{\begin{matrix} minimize | x^{1} (1) - 1 | \\ over u \in L^{1} ([0, 1], [0, + \infty [), (x^{1}, x^{2}) \in W^{1, 1} ([0, 1], R^{2}) s . t . \\ ({\dot{x}}^{1} (s), {\dot{x}}^{2} (s)) = (u (s), 2) a . e . s \in [0, 1], \\ (x^{1}, x^{2}) (0) = (- 1, - 1), x^{2} (1) = 1, \int_{0}^{1} u (s) d s \leq 3, \\ h (x^{1} (s), x^{2} (s)) : = 1 - | x^{1} (s) | \lor | x^{2} (s) | \leq 0 for all s \in [0, 1] . \end{matrix}

Let

W : = \{(w^{0}, w) \in [0, + \infty [\times [0, + \infty [: w^{0} + w = 1\}

. Then, the space-time extension of the above problem is given by

(P_{e}) \{\begin{matrix} minimize | y^{1} (T) - 1 | \\ over T > 0, (ω^{0}, ω) \in L^{1} ([0, T], W), (y^{0}, y^{1}, y^{2}, ν) \in W^{1, 1} ([0, T], R^{4}) s . t . \\ ({\dot{y}}^{0}, {\dot{y}}^{1}, {\dot{y}}^{2}, \dot{ν}) (t) = (ω^{0}, ω, 2 ω^{0}, ω) (t) a . e . t \in [0, T], \\ (y^{0}, y^{1}, y^{2}, ν) (0) = (0, - 1, - 1, 0), y^{0} (T) = 1, y^{2} (T) = 1, ν (T) \leq 3, \\ h (y^{1} (t), y^{2} (t)) = 1 - | y^{1} (t) | \lor | y^{2} (t) | \leq 0 for all t \in [0, T] . \end{matrix}

Type-S local infimum gap. Let

\bar{z} : = (\bar{T}, {\bar{ω}}^{0}, \bar{ω}, {\bar{y}}^{0}, {\bar{y}}^{1}, {\bar{y}}^{2}, \bar{ν})

be the following strict-sense process, where

\bar{T} = 1

, the control

({\bar{ω}}^{0}, \bar{ω})

is given by the constant pair

({\bar{ω}}^{0}, \bar{ω}) (t) = (1, 0) \forall t \in [0, 1],

and

({\bar{y}}^{0}, {\bar{y}}^{1}, {\bar{y}}^{2}, \bar{ν}) (t) = (t, - 1, - 1 + 2 t, 0) \forall t \in [0, 1] .

It is easy to see that

\bar{z}

, which corresponds to the process of

(P)

associated with the control

\bar{u} \equiv 0

, is trivially a strict-sense minimizer, as

({\bar{y}}^{0}, {\bar{y}}^{1}, {\bar{y}}^{2}, \bar{ν})

is the unique feasible strict-sense trajectory. However,

\bar{z}

is not a local minimizer for the extended problem

(P_{e})

. Indeed, let us fix

ε > 0

sufficiently small, and let us consider the extended process

z_{ε} = (T_{ε}, ω_{ε}^{0}, ω_{ε}, y_{ε}^{0}, y_{ε}^{1}, y_{ε}^{2}, ν_{ε})

, where

T_{ε} = 1 + ε

and

(ω_{ε}^{0}, ω_{ε})

is given by

(ω_{ε}^{0}, ω_{ε}) (t) : = \{\begin{matrix} (1, 0) if t \in [0, 1] \\ (0, 1) if t \in] 1, 1 + ε], \end{matrix}

so that one has

(y_{ε}^{0}, y_{ε}^{1}, y_{ε}^{2}, ν_{ε}) (t) = \{\begin{matrix} (t, - 1, - 1 + 2 t, 0) if t \in [0, 1] \\ (1, - 2 + t, 1, t - 1) if t \in] 1, 1 + ε] . \end{matrix}

For any

ε > 0

, this is the description in the state space of a discontinuous state trajectory

(x_{ε}^{1}, x_{ε}^{2})

for problem

(P)

, which first reaches the point

(- 1, 1)

using the control

u = 0

and then jumps to the position

(- 1 + ε, 1)

with an impulse. Note that

{\bar{z}}_{ε}

is a feasible extended process that satisfies

d_{imp} (z_{ε}, \bar{z}) = | T_{ε} - \bar{T} | + ∥ (ω_{ε}^{0}, ω_{ε}) - ({\bar{ω}}^{0}, \bar{ω}) ∥_{L^{1} (0, 1 \land (1 + ε))} = ε

whose cost is strictly less than the cost corresponding to

\bar{z}

because it holds that

| y_{ε}^{1} (1 + ε) - 1 | = 2 - ε < 2 = | {\bar{y}}^{1} (1) - 1 | .

Thus, by the arbitrariness of

ε > 0

, at

\bar{z}

, there is a type-S local infimum gap. Indeed, a set of abnormal multipliers corresponding to

\bar{z}

is given by

(p_{0}, p, γ, π, μ, m_{0}, m)

, where

γ = π = 0

,

p_{0} \equiv 0

,

μ \equiv 0

,

p = (p_{1}, p_{2}) \equiv (0, 1)

,

m_{0} \equiv 0

, and

m (t) = (m_{1}, m_{2}) (t) \in \partial^{>} h ({\bar{y}}^{1} (t), {\bar{y}}^{2} (t))

for any

t \in [0, 1]

.

Type-E local infimum gap. Now consider the following extended process

\hat{z} : = ({\hat{ω}}^{0}, \hat{ω}, {\hat{y}}^{0}, {\hat{y}}_{1}, {\hat{y}}_{2}, \hat{ν})

, where

\hat{T} = 3

and

({\hat{ω}}^{0}, \hat{ω})

is given by

({\hat{ω}}^{0}, \hat{ω}) (t) : = \{\begin{matrix} (1, 0) t \in [0, 1] \\ (0, 1) t \in] 1, 3], \end{matrix}

so that one has

({\hat{y}}^{0}, {\hat{y}}^{1}, {\hat{y}}^{2}, \hat{ν}) (t) = \{\begin{matrix} (t, - 1, - 1 + 2 t, 0) t \in [0, 1] \\ (1, - 2 + t, 1, t - 1) t \in] 1, 3] . \end{matrix}

It is easy to see that

\hat{z}

is a minimizer for

(P_{e})

as it is feasible, and its corresponding cost is equal to zero. Moreover, at

\hat{z}

, there is a type-E local infimum gap since

\bar{z}

defined in the previous step is the unique feasible strict-sense process. Indeed, a set of abnormal multipliers corresponding to

\hat{z}

is given by

(p_{0}, p, γ, π, μ, m_{0}, m)

, where

γ = π = 0

,

p_{0} \equiv 0

,

μ ({0}) = 2

,

μ (] 0, 1]) = 0

,

p = (p_{1}, p_{2}) \equiv (- 2, 0)

,

m_{0} \equiv 0

,

m (0) = (m_{1}, m_{2}) (0) = (1, 0)

, and

m (t) = (m_{1}, m_{2}) (t) \in \partial^{>} h ({\bar{y}}^{1} (t), {\bar{y}}^{2} (t))

for any

t \in] 0, 1 [

.

5. Proof of Theorem 1

First, we point out that by utilizing standard cutoff procedures, we may assume. without loss of generality. that hypotheses (H2) and (H3) hold, replacing

Σ_{θ}

with

R^{1 + n}

. In the proofs, we utilize extended trajectories lying in an

L^{\infty}

-tube around the reference trajectory

\bar{y}

, and the control functions take values in compact sets. Therefore, the input-output map

(ω, α) \mapsto y [ω, α]

associated with (1) is well defined and continuous (actually, uniformly continuous).

5.1. Proof of Statement (i)

If

\bar{z}

is a local

Ψ

-minimizer for

(P_{e})

, the fact that it is an extremal can be easily derived from [22] (Theorem 9.3.1). Proving that whenever there is a type-E local infimum gap at

\bar{z}

, it is an abnormal extremal, instead requires a careful adaptation of the reasoning employed in the proof in [8] (Theorem 2.1), where the same result was obtained for the notion of a type-E local infimum gap, in which the distance

d

between the controls was replaced with the

L^{\infty}

-distance of the trajectories. Specifically, the proof is structured as follows. In the first step, we construct a sequence of optimization problems

({\hat{P}}_{i})

over strict-sense processes with the controls taking values in

V_{i} \times A

, where

V_{i}

is as in (H1) and the cost function penalizes processes that violate the endpoint and the state constraint. Hence, we build another sequence of optimal control problems, say

(P_{i})

, by suitably perturbing

({\hat{P}}_{i})

. Finally, by applying the Ekeland principle, we find a sequence

(z_{i})

of minimizers for

(P_{i})

that converges to the reference process

\bar{z} = (\bar{ω}, \bar{α}, \bar{y})

. In the second step of the proof, we write the necessary conditions satisfied by each

z_{i}

, whereas in the third step, we pass to the limit in these conditions, obtaining a set of abnormal multipliers for

\bar{z}

.

Step 1. Define the function

Φ : R^{n + 1} \to R

, given by

Φ (x, c) : = d_{T} (x) \lor c

and for any

y \in W^{1, 1} ([0, T], R^{n})

we set

J (y) : = Φ (y (T), max_{t \in [0, T]} h (t, y (t))) .

Let

{(ε_{i})}_{i}

be a sequence converging to 0, and let

{(ρ_{i})}_{i}

be such that

ρ_{i}^{2} = sup {J (y) : z = (ω, α, y) \in Γ_{s}, d (z, \bar{z}) \leq ε_{i}} .

By the uniform continuity of the input-output map and the Lipschitz continuity of

Φ

, it follows that

{lim}_{i \to + \infty} ρ_{i}^{2} = 0

. Moreover,

ρ_{i} > 0

as soon as i is sufficiently large, as

\bar{z}

is an isolated process by Remark 4.

By (H1) and Remark 1, for any i, there exists a closed subset

V_{ε_{i}} \subset V

and a control

{\hat{ω}}_{i} \in V_{ε_{i}} : = L^{1} ([0, T], V_{ε_{i}})

such that

∥ {\hat{ω}}_{i} - \bar{ω} ∥_{L^{1}} \leq ε_{i}

. Hence, let

{\hat{z}}_{i} = ({\hat{ω}}_{i}, {\hat{α}}_{i}, {\hat{y}}_{i})

be such that

{\hat{α}}_{i} \equiv \bar{α}

and

{\hat{y}}_{i} = y [{\hat{ω}}_{i}, {\hat{α}}_{i}]

. As a consequence,

{\hat{z}}_{i}

is a

ρ_{i}^{2}

-minimizer for the optimization problem (

{\hat{P}}_{i}

), given by

({\hat{P}}_{i}) \{\begin{matrix} Minimize J (y) \\ over z = (ω, α, y) \in Γ^{i} \end{matrix}

where

Γ^{i} : = {(ω, α, y) \in V_{ε_{i}} \times A \times W^{1, 1} ([0, T], R^{n}) satisfying (1)} .

It is easy to show that if we equip

Γ^{i}

with the distance

d

, it turns out to be a complete metric space. Accordingly, by applying Ekeland’s variational principle, we deduce that there exists

z_{i} = (ω_{i}, α_{i}, y_{i}) \in Γ^{i}

, which is a minimizer for the optimal control problem

(P_{i})

, given by

(P_{i}) \{\begin{matrix} Minimize J (y) + ρ_{i} \int_{0}^{T} [| ω (t) - ω_{i} (t) | + ϑ_{i} (t, α (t))] d t \\ over z = (ω, α, y) \in Γ^{i}, \end{matrix}

where

ϑ_{i} : [0, T] \times A

is defined as

ϑ_{i} (t, a) : = \{\begin{matrix} 0 if a = α_{i} (t) \\ 1 otherwise . \end{matrix}

Moreover, one has

d (z_{i}, {\hat{z}}_{i}) \leq ρ_{i}

so

d (z_{i}, \bar{z}) \leq ρ_{i} + ε_{i} \to 0

. In particular, it holds that

ω_{i} \to \bar{ω} in L^{1}, ℓ ({t \in [0, T] : α_{i} (t) \neq \bar{α} (t)}) \to 0 .

(21)

Furthermore, since the input-output map

(ω, α) \mapsto y [ω, α]

is continuous, one has

y_{i} \to \bar{y} in L^{\infty}, {\dot{y}}_{i} ⇀ \dot{\bar{y}} weakly in L^{1} .

(22)

By the previous convergence analysis and, since

\bar{z}

is isolated, one has

J (y_{i}) > 0

for any i. Therefore, possibly passing to a subsequence, for any i, we have

either d_{T} (y_{i} (T)) > 0 or c_{i} : = {max}_{t \in [0, T]} h (t, y_{i} (t)) > 0 .

(23)

Step 2. From the above reasoning, it follows that

(z_{i}, c_{i}) = (ω_{i}, α_{i}, y_{i}, max_{t \in [0, T]} h (t, y_{i} (t)))

is a minimizer for the optimal control problem

(Q_{i})

, given by

(Q_{i}) \{\begin{matrix} Minimize (d_{T} (y (T)) \lor c (T)) + ρ_{i} \int_{0}^{T} [| ω (t) - ω_{i} (t) | + ϑ_{i} (t, α (t))] d t \\ over (ω, α, y, c) \in V_{ε_{i}} \times A \times W^{1, 1} ([0, T], R^{n + 1}) satisfying \\ (\dot{y} (t), \dot{c} (t)) = (F (t, y (t), ω (t), α (t)), 0) a . e . t \in [0, T], \\ y (0) = {\overset{ˇ}{x}}_{0}, \\ \tilde{h} (t, y (t), c (t)) : = h (t, y (t)) - c (t) \leq 0 \forall t \in [0, T] . \end{matrix}

Possibly passing to a subsequence, only one of the following two cases occurs:

\begin{matrix} Case (a) : c_{i} > 0 for any i . \\ Case (b) : c_{i} \leq 0 for any i . \end{matrix}

Let us first analyze Case (a). Since from

h (t, y_{i} (t)) - c_{i} > 0

it follows that

h (t, y_{i} (t)) > 0

, one has

\partial_{x, c}^{>} \tilde{h} (t, x, c) = \partial_{x}^{>} h (t, x) \times {- 1}

. Moreover, by the max rule for subdifferentials (see, e.g., [22] (Section 5)), if

(β_{i}^{1}, β_{i}^{2}) \in \partial Φ (y_{i} (T), c_{i})

, there exists

σ_{i}^{1}

,

σ_{i}^{2} \geq 0

such that

σ_{i}^{1} + σ_{i}^{2} = 1

,

β_{i}^{1} \in σ_{i}^{1} (\partial d_{T} (y_{i} (T)) \cap \partial B)

and

β_{i}^{2} = σ_{i}^{2}

. Furthermore,

σ_{i}^{1} = 0

[resp.

σ_{i}^{2} = 0

] whenever

d_{T} (y_{i} (T)) < d_{T} (y_{i} (T)) \lor c_{i}

[resp.

c_{i} < d_{T} (y_{i} (T)) \lor c_{i}

]. Thanks to the above reasoning, if we write the necessary conditions of the maximum principle satisfied by the minimizer

(z_{i}, c_{i})

, we deduce that there exists

(p_{i}, π_{i}) \in W^{1, 1} ([0, T], R^{n + 1})

,

λ_{i} \geq 0

,

μ_{i} \in N B V^{+} ([0, T], R)

,

σ_{i}^{1}

,

σ_{i}^{2} \geq 0

such that

σ_{i}^{1} + σ_{i}^{2} = 1

and a Borel-measurable and

μ_{i}

-integrable map

m_{i} : [0, T] \to R^{n}

satisfying conditions (i)′–(vi)′ below:

(i)′: $∥ p_{i} ∥_{L^{\infty}} + λ_{i} + μ_{i} ([0, T]) + {∥ π_{i} ∥}_{L^{\infty}} = 1$ ;
(ii)′: $- {\dot{p}}_{i} (t) \in co \partial_{x} {q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), α_{i} (t))$ and ${\dot{π}}_{i} (t) = 0$ for a.e. $t \in [0, T]$ ;
(iii)′: $- q_{i} (T) \in λ_{i} σ_{i}^{1} (\partial Φ (y_{i} (T)) \cap \partial B)$ , $π (0) = 0$ , $- π (T) + μ_{i} ([0, T]) = λ_{i} σ_{i}^{2}$ ;
(iv)′: $m_{i} (t) \in \partial_{x}^{>} h (t, y_{i} (t))$ $μ_{i}$ -a.e. $t \in [0, T]$ ;
(v)′: spt $(μ_{i}) \subset {t \in [0, T] : h (t, y_{i} (t)) - c_{i} = 0}$ ;
(vi)′: $\int_{0}^{T} q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), α_{i} (t)) d t$
$\geq \int_{0}^{T} [q_{i} (t) \cdot F (t, y_{i} (t), ω (t), α (t)) - ρ_{i} λ_{i} (| ω_{i} (t) - ω (t) |) + ϑ_{i} (t, α (t))] d t$
$\geq \int_{0}^{T} [q_{i} (t) \cdot F (t, y_{i} (t), ω (t), α (t)) - ρ_{i} λ_{i} (1 + diam (\bar{V}))] d t$
for any $(ω, α) \in V_{ε_{i}} \times A$ ,

where

diam (\bar{V})

is the diameter of the compact set

\bar{V}

and

q_{i} : [0, T] \to R^{n}

is defined as

q_{i} (t) : = \{\begin{matrix} p_{i} (t) + \int_{[0, t]} m_{i} (t^{'}) μ_{i} (d t^{'}) if t \in [0, T [, \\ p_{i} (T) + \int_{[0, T]} m_{i} (t^{'}) μ_{i} (d t^{'}) if t = T . \end{matrix}

(24)

From (ii)′ and (iii)′, we deduce that

π_{i} \equiv 0

and

μ_{i} ([0, T]) = λ_{i} σ_{i}^{2}

. Since

∥ m_{i} ∥_{L^{\infty}} \leq K_{h}

, from (iii)′, we also have

λ_{i} σ_{i}^{1} = | q_{i} (T) | \leq ∥ p_{i} ∥_{L^{\infty}} + K_{h} μ_{i} ([0, T])

. By summing up these relations and (i)′, we obtain

2 ∥ p_{i} ∥_{L^{\infty}} + (2 + K_{h}) μ_{i} ([0, T]) + λ_{i} \geq 1 + λ_{i} σ_{i}^{1} + λ_{i} σ_{i}^{2},

which implies

∥ p_{i} ∥_{L^{\infty}} + μ_{i} ([0, T]) \geq \frac{1}{2 + K_{h}}

. By rescaling the multipliers, one obtains

∥ p_{i} ∥_{L^{\infty}} + μ_{i} ([0, T]) = 1

and

λ_{i} \geq 2 + K_{h}

.

If instead, Case (b) occurs, then

d_{T} (y_{i} (T)) > 0

for any i by (23). Hence, for

δ > 0

small, the process

(z_{i}, c_{i} + δ)

is still a minimizer for

(Q_{i})

, and

h (t, y_{i} (t)) - (c_{i} + δ) < 0

for all

t \in [0, T]

. If we also write in this case the necessary conditions of optimality satisfied by the minimizer

(z_{i}, c_{i} + δ)

, we deduce the existence of

p_{i} \in W^{1, 1} ([0, T], R^{n})

and

λ_{i} > 0

, fulfilling relations (i)′–(vi)′ above for

μ_{i} \equiv 0

,

σ_{i}^{2} = 0

(hence,

σ_{i}^{1} = 1

). Indeed, if it were

λ_{i} = 0

, then

q_{i} (T) = p_{i} (T) = 0

, so the linearity of the adjoint equation (ii)′ implies

p_{i} \equiv 0

, contradicting (i)′. In this case, from (iii)′, we deduce

0 < λ_{i} = | q_{i} (T) | \leq ∥ p_{i} ∥_{L^{\infty}}

. By summing up this relation with (i)′, we obtain

2 ∥ p_{i} ∥_{L^{\infty}} + λ_{i} > 1 + λ_{i}

, which implies

∥ p_{i} ∥_{L^{\infty}} > \frac{1}{2}

. By rescaling the multipliers, we have

∥ p_{i} ∥_{L^{\infty}} = 1

and

λ_{i} \leq 2 \leq 2 + K_{h}

.

Step 3. For both Case (a) and Case (b), we have proved that for any i, there exists

p_{i} \in W^{1, 1} ([0, T], R^{n})

,

μ_{i} \in N B V^{+} ([0, T], R)

, and a Borel-measurable and

μ_{i}

-integrable map

m_{i} : [0, T] \to R^{n}

satisfying relations (i)–(vi) below:

(i): $∥ p_{i} ∥_{L^{\infty}} + μ_{i} ([0, T]) = 1$ ;
(ii): $- {\dot{p}}_{i} (t) \in co \partial_{x} {q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), α_{i} (t))$ a.e. $t \in [0, T]$ ;
(iii): $- q_{i} (T) \in [0, 2 + K_{h}] (\partial Φ (y_{i} (T)) \cap \partial B)$ ;
(iv): $m_{i} (t) \in \partial_{x}^{>} h (t, y_{i} (t))$ $μ_{i}$ -a.e. $t \in [0, T]$ ;
(v): spt $(μ_{i}) \subset {t \in [0, T] : h (t, y_{i} (t)) - c_{i} = 0}$ ;
(vi): $\int_{0}^{T} q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), α_{i} (t)) d t$
$\geq \int_{0}^{T} [q_{i} (t) \cdot F (t, y_{i} (t), ω (t), α (t)) - ρ_{i} (2 + K_{h}) (1 + diam (\bar{V}))] d t$
for any $(ω, α) \in V_{ε_{i}} \times A$ ,

where

q_{i} : [0, T] \to R^{n}

is given by (24). Employing a standard convergence analysis (see [7] for more details), we deduce the existence of

(p, μ) \in W^{1, 1} ([0, T], R^{n}) \times N B V^{+} ([0, T], R)

and a Borel-measurable and

μ

-integrable map

m : [0, T] \to R^{n}

satisfying, up to a subsequence, the following conditions:

\begin{matrix} μ_{i} ⇀^{*} μ, m_{i} (t) μ_{i} (d t) ⇀^{*} m (t) μ (d t), \\ p_{i} \to p in L^{\infty}, q_{i} \to q in L^{1}, {\dot{p}}_{i} ⇀ \dot{p} weakly in L^{1} . \end{matrix}

(25)

Therefore, using (22) and passing to the limit in conditions (i), (iv), and (v), we obtain

\begin{matrix} {∥ p ∥}_{L^{\infty}} + μ ([0, T]) = 1, m (t) \in \partial_{x}^{>} h (t, \bar{y} (t)) μ - a . e . t \in [0, T], \\ spt (μ) \subset {t \in [0, T] : h (t, \bar{y} (t)) = 0} . \end{matrix}

Moreover, using the basic properties of subdifferentials and the fact that

\partial d_{T} (x) = N_{T} (x) \cap B

for any

x \in T

(see [22]), by (iii), we deduce that

- q (T) \in N_{T} (\bar{y} (T)),

where

q : [0, T] \to R^{n}

is given by

q (t) : = \{\begin{matrix} p (t) + \int_{[0, t]} m (t^{'}) μ (d t^{'}) if t \in [0, T [ \\ p (T) + \int_{[0, T]} m (t^{'}) μ (d t^{'}) if t = T . \end{matrix}

Let us now derive the adjoint Equation (8). Let

Ω_{i} : = {t \in [0, T] : α_{i} (t) = \bar{α} (t)}

, so that

ℓ (Ω_{i}) \to 0

by (21). Using (3) and hypothesis (H3), for a.e.

t \in Ω_{i}

, we obtain

\begin{matrix} (- {\dot{p}}_{i} & (t), {\dot{y}}_{i} (t)) \in (co \partial_{x} {q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), \bar{α} (t))}, F (t, y_{i} (t), ω_{i} (t), \bar{α} (t))) \\ \subseteq (q_{i} (t) \cdot D_{x} F (t, y_{i} (t), \bar{ω} (t), \bar{α} (t)) + | q_{i} (t) | k (t) φ (| ω_{i} (t) - \bar{ω} (t) |) B, \\ F (t, y_{i} (t), \bar{ω} (t), \bar{α} (t)) + k (t) φ (| ω_{i} (t) - \bar{ω} (t) |) B) \\ \subseteq (co \partial_{x} {q (t) \cdot F (t, y_{i} (t), \bar{ω} (t), \bar{α} (t)), F (t, y_{i} (t), \bar{ω} (t), \bar{α} (t))}) + r_{i} (t) B \end{matrix}

where, since

∥ q_{i} ∥_{L^{\infty}} \leq {∥ p_{i} ∥}_{L^{\infty}} + K_{h} μ_{i} ([0, T]) \leq 1 + K_{h}

, the map

r_{i} : [0, T] \to R

is given by

r_{i} (t) = | q_{i} (t) - q (t) | k (t) + 2 (1 + K_{h}) k (t) φ (| ω_{i} (t) - \bar{ω} (t) |) .

By the continuity of

φ

, (21), and (25), we deduce that, up to a subsequence,

r_{i} (t) \to 0

for a.e.

t \in [0, T]

. Moreover, it holds that

| r_{i} (t) | \leq 2 (1 + K_{h}) (1 + φ (diam (\bar{V}))) k (t) \in L^{1} .

Hence, by the dominated convergence theorem,

r_{i} \to 0

in

L^{1}

(in particular,

φ (| ω_{i} - \bar{ω} |) \to 0

in

L^{1}

). From the compactness of trajectories theorem (see [22], Theorem 2.5.3), it follows that for a.e.

t \in [0, T]

, it holds that

(- \dot{p} (t), \dot{\bar{y}} (t)) \in (co \partial_{x} {q (t) \cdot F (t, \bar{y} (t), \bar{ω} (t), \bar{α} (t))}, F (t, \bar{y} (t), \bar{ω} (t), \bar{α} (t)))

Now, we conclude the proof by demonstrating (10). Let

(ω, α) \in W \times A

and, as a consequence of hypothesis (H1), let

{(v_{i})}_{i} \subset V

satisfy

v_{i} \in V_{ε_{i}}

for each i, and

∥ ω - v_{i} ∥_{L^{1}} \leq ε_{i} ↓ 0

. Condition (vi) implies that

\int_{0}^{T} q_{i} (t) \cdot {\dot{\bar{y}}}_{i} (t) d t \geq \int_{0}^{T} [q_{i} (t) \cdot F (t, y_{i} (t), v_{i} (t), α (t)) - ρ_{i} (1 + diam (\bar{V})) (2 + K_{h})] d t

Up to a subsequence, the term on the right in the above relation converges to

\int_{0}^{T} [q (t) \cdot F (t, \bar{y} (t), ω (t), α (t))] d t

by the dominated convergence theorem. At the same time, it holds that

\int_{0}^{T} q_{i} (t) \cdot {\dot{y}}_{i} (t) d t = \int_{0}^{T} q (t) \cdot \dot{\bar{y}} (t) d t + \int_{0}^{T} (q_{i} (t) - q (t)) \cdot \dot{y_{i}} (t) d t + \int_{0}^{T} q (t) \cdot ({\dot{y}}_{i} (t) - \dot{\bar{y}} (t)) d t .

But now the second term on the right tends to zero by the dominated convergence theorem, whereas the third one converges to zero because of (22) and since q is bounded. Therefore, we have proved that for any

(ω, α) \in W \times A

, one has

\int_{0}^{T} q (t) \cdot \dot{\bar{y}} (t) d t \geq \int_{0}^{T} q (t) F (t, \bar{y} (t), ω (t), α (t)) d t .

From a measurable selection theorem, (10) immediately follows.

5.2. Proof of Statement (ii)

Let

\bar{z} = (\bar{ω}, \bar{α}, \bar{y}) \in Γ_{s}

be a local

Ψ

-minimizer for

(P_{s})

. We can derive that it is an extremal of the Pontryagin maximum principle from [22] (Theorem 9.3.1). In particular, the maximality condition (10) still holds with the maximum taken over

\bar{V} \times A

since we assume that the dynamics function is continuous with respect to the w-variable.

If

\bar{z}

is a local

Ψ

-minimizer for

(P_{s})

, which is not a local

Ψ

-minimizer for

(P_{e})

, then, on the one hand, there exists

δ > 0

such that

Ψ (\bar{y} (T)) \leq Ψ (y (T))

for any

z = (ω, α, y) \in Γ_{s}

such that

d (z, \bar{z}) \leq 2 δ

. On the other hand, taken

{(ε_{i})}_{i} \subset] 0, δ [

with

ε_{i} ↓ 0

, for each i, there exists some

z_{i} = (ω_{i}, α_{i}, y_{i}) \in Γ_{e}

such that

d (z_{i}, \bar{z}) \leq ε_{i} < δ

and

Ψ (y_{i} (T)) < Ψ (\bar{y} (T))

. Hence, for any

z = (ω, α, y) \in Γ_{s}

such that

d (z_{i}, z) \leq δ

, one has

d (z, \bar{z}) \leq 2 δ

, so by construction, we have

Ψ (y_{i} (T)) < Ψ (\bar{y} (T)) \leq Ψ (y (T)) .

Since the strict-sense process z is arbitrary, this proves that at

z_{i}

, there is a type-E local infimum gap for any i. Hence, by Theorem 1.(i), for any i, there exists

p_{i} \in W^{1, 1} ([0, T], R^{n})

,

μ_{i} \in N B V^{+} ([0, T], R)

, and a Borel-measurable and

μ_{i}

-integrable map

m_{i} : [0, T] \to R^{n}

satisfying conditions (i)–(vi) below:

(i): $∥ p_{i} ∥_{L^{\infty}} + μ_{i} ([0, T]) = 1$ ;
(ii): $- {\dot{p}}_{i} (t) \in co \partial_{x} {q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), α_{i} (t))$ a.e. $t \in [0, T]$ ;
(iii): $- q_{i} (T) \in N_{T} (y_{i} (T))$ ;
(iv): $m_{i} (t) \in \partial_{x}^{>} h (t, y_{i} (t))$ $μ_{i}$ -a.e. $t \in [0, T]$ ;
(v): spt $(μ_{i}) \subset {t \in [0, T] : h (t, y_{i} (t)) - c_{i} = 0}$ ;
(vi): $q_{i} (t) \cdot F (t, y_{i} (t), ω_{i} (t), α_{i} (t)) = max_{(w, a) \in \bar{V} \times A} q_{i} (t) \cdot F (t, y_{i} (t), w, a)$ a.e. t,

where

q_{i} : [0, T] \to R^{n}

is as in (24). We observe that our construction implies

d (z_{i}, \bar{z}) \to 0

, so (21) and (22) hold true. We can thus conclude the proof employing a standard convergence analysis similar to that in Step 3 of the proof of Theorem 1.(i).

6. Concluding Remarks

In this paper, we investigate infimum gap phenomena that may occur when we pass from an optimal control problem with nonsmooth data, endpoint, and state constraints to an extended version of it in a framework that includes the impulsive extension of a class of non-coercive problems with unbounded dynamics. In particular, we consider type-E and type-S local infimum gaps. In the former, an extended minimizer has a cost that is strictly smaller than the infimum cost over close feasible strict-sense processes. In the latter, a local strict-sense minimizer does not locally minimize the extended problem. Following on from Warga’s previous research but utilizing more recent perturbation techniques from nonsmooth analysis, which allow us to obtain results for non-differentiable data and an arbitrary closed set as the target, we prove that whenever there is either a type-E or a type-S local infimum gap at a process for a notion of local minimizer based on the control distance

d

defined in (5), it satisfies a nonsmooth constrained version of the Pontryagin maximum principle in abnormal form. In contrast to previous results, where there was an ‘asymmetry’ between the necessary abnormality conditions derived for type-E and type-S local infimum gaps, for the extension under consideration, we obtain the same condition for both.

As a corollary, we provide sufficient conditions in the form of a normality test for the absence of local infimum gap phenomena. Although a normality test for gap avoidance might seem completely theoretical and hardly verifiable, it can actually be very useful because in certain situations, normality follows from easily verifiable criteria. These criteria take the form of constraint and endpoint qualification conditions for normality and have been extensively explored in the literature (see, e.g., [35,36,37,38] and the references therein). As shown in [7] (see also the references therein), where several explicit conditions for normality in control-affine impulsive extensions were presented, these criteria are generally weaker than those previously established for directly determining the absence of a gap.

The framework introduced in this paper may have implications for future infimum gap research in several directions. On the one hand, it may be the starting point for some generalizations, including the following: (i) Determining a higher-order maximum principle for local minimizers of the strict-sense problem and proving that in the case of a type-S local infimum gap, abnormality of the higher-order conditions also occurs. So far, results of this kind are only known for extended minimizers and type-E infimum gaps, limited to the impulsive extension case (see [39]). (ii) Exploring infimum gap phenomena for the impulsive extension of optimal control problems involving control-affine systems with time delays. Necessary optimality conditions for such systems were recently established in [40]. We point out that this line of research, conducted in collaboration with R.Vinter, could have important implications for many applications modeled as a sort of impulsive problem with delays, where impulses may occur only at some prescribed instants. For instance, applications in fed-batch fermentation [41,42] and in the impulsive control of delayed neural networks [43].

Another interesting problem might be to consider different extension procedures for classes of control systems not considered in this paper (such as distributed parameters systems or multistage problems).

Author Contributions

Conceptualization, G.F. and M.M.; Writing—original draft, G.F. and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the INdAM-GNAMPA Project 2023, CUP E53C22001930001, and by PRIN 2022, Prot. 2022238YY5, CUP C53D23002370006.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Mariconda, C. Non-occurrence of gap for one-dimensional non-autonomous functionals. Calc. Var. Partial Differ. Equ. 2023, 62, 55. [Google Scholar] [CrossRef]
Bardi, M.; Capuzzo-Dolcetta, I. Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations; Systems & Control: Foundations & Applications; Birkhäuser: Boston, MA, USA, 1997; 570p. [Google Scholar]
Bressan, A.; Piccoli, B. Introduction to the Mathematical Theory of Control; AIMS Series on Applied Mathematics, 2; American Institute of Mathematical Sciences (AIMS): Springfield, MO, USA, 2007. [Google Scholar]
Krotov, V.F.; Burkeev, V.Z.; Gurman, V.I. New Variational Methods in Flight Dynamics; Translate from Russian; Israel Program for Scientific Translations: Jerusalem, Israel, 1971. [Google Scholar]
Pontryagin, L.S.; Boltyanskii, V.G.; Gamkrelidze, R.V.; Mishchenko, E.F. The Mathematical Theory of Optimal Processes; Translate from Russian; Interscience Publishers John Wiley & Sons, Inc.: New York, NY, USA; London, UK, 1962. [Google Scholar]
Warga, J. Normal Control Problems have no Minimizing Strictly Original Solutions. Bull. Am. Math. Soc. 1971, 77, 625–628. [Google Scholar] [CrossRef]
Fusco, G.; Motta, M. No Infimum Gap and Normality in Optimal Impulsive Control Under State Constraints. Set-Valued Var. Anal. 2021, 29, 519–550. [Google Scholar] [CrossRef]
Fusco, G.; Motta, M. Nondegenerate abnormality, controllability, and gap phenomena in optimal control with state constraints. SIAM J. Control Optim. 2022, 60, 280–309. [Google Scholar] [CrossRef]
Fusco, G.; Motta, M. Strict sense minimizers which are relaxed extended minimizers in general optimal control problems. In Proceedings of the 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, 13–15 December 2021. [Google Scholar]
Palladino, M.; Vinter, R.B. When are minimizing controls also minimizing extended controls? Discret. Contin. Dyn. Syst. 2015, 35, 4573–4592. [Google Scholar] [CrossRef]
Kaśkosz, B. Extremality, controllability, and abundant subsets of generalized control systems. J. Optim. Theory Appl. 1999, 101, 73–108. [Google Scholar] [CrossRef]
Warga, J. Controllability, extremality, and abnormality in nonsmooth optimal control. J. Optim. Theory Appl. 1983, 41, 239–260. [Google Scholar] [CrossRef]
Warga, J. Optimal Control of Differential and Functional Equations; Academic Press: New York, NY, USA, 1972. [Google Scholar]
Karamzin, D.Y.; de Oliveira, V.A.; Pereira, F.L.; Silva, G.N. On the properness of an impulsive control extension of dynamic optimization problems. ESAIM Control Optim. Calc. Var. 2015, 21, 857–875. [Google Scholar] [CrossRef]
Azimov, D.; Bishop, R. New trends in astrodynamics and applications: Optimal trajectories for space guidance. Ann. N. Y. Acad. Sci. 2005, 1065, 189–209. [Google Scholar] [CrossRef]
Aldo, B. Hyper-impulsive motions and controllizable coordinates for Lagrangean systems. Atti Accad. Naz. Lincei Mem. 1991, XIX, 197–246. [Google Scholar]
Bressan, A.; Rampazzo, F. Moving constraints as stabilizing controls in classical mechanics. Arch. Ration. Mech. Anal. 2010, 196, 97–141. [Google Scholar] [CrossRef]
Catllá, A.; Schaeffer, D.; Witelski, T.; Monson, E.; Lin, A. On spiking models for synaptic activity and impulsive differential equations. SIAM Rev. 2008, 50, 553–569. [Google Scholar] [CrossRef]
Gajardo, P.; Ramirez, C.H.; Rapaport, A. Minimal time sequential batch reactors with bounded and impulse controls for one or more species. SIAM J. Control Optim. 2008, 47, 2827–2856. [Google Scholar] [CrossRef]
Warga, J. Optimization and controllability without differentiability assumptions. SIAM J. Control Optim. 1983, 21, 837–855. [Google Scholar] [CrossRef]
Clarke, F.H. Optimization and Nonsmooth Analysis; Wiley-Interscience: New York, NY, USA, 1983. [Google Scholar]
Vinter, R.B. Optimal Control; Birkhäuser: Boston, MA, USA, 2000. [Google Scholar]
Aubin, J.-P.; Cellina, A. Differential inclusions. Set-valued maps and viability theory. In Fundamental Principles of Mathematical Sciences; Grundlehren der Mathematischen Wissenschaften; Springer: Berlin/Heidelberg, Germany, 1984; Volume 264. [Google Scholar]
Palladino, M.; Rampazzo, F. A geometrically based criterion to avoid infimum gaps in optimal control. J. Differ. Equ. 2020, 269, 10107–10142. [Google Scholar] [CrossRef]
Bressan, A.; Rampazzo, F. On differential systems with vector-valued impulsive controls. Boll. Un. Mat. Ital. B 1988, 2, 641–656. [Google Scholar]
Miller, B.M. The method of discontinuous time substitution in problems of the optimal control of impulse and discrete-continuous systems. Avtomat. Telemekh. 1993, 12, 3–32. (In Russian) [Google Scholar]
Rishel, R.W. An extended Pontryagin principle for control systems whose control laws contain measures. SIAM J. Control 1965, 3, 191–205. [Google Scholar] [CrossRef]
Warga, J. Variational problems with unbounded controls. J. Soc. Ind. Appl. Math. Ser. A Control 1965, 3, 424–438. [Google Scholar] [CrossRef]
Hájec, O. Book review: Differential systems involving impulses. Bull. Am. Math. Soc. 1985, 12, 272–279. [Google Scholar] [CrossRef]
Miller, B.M.; Rubinovich, E.Y. Impulsive Control in Continuous and Discrete-Continuous Systems; Kluwer Academic/Plenum Publishers: New York, NY, USA, 2003. [Google Scholar]
Sarychev, A. Nonlinear systems with impulsive and generalized function controls. In Nonlinear Synthesis; Progress in Systems and Control Theory; Birkhäuser: Boston, MA, USA, 1991; Volume 9, pp. 244–257. [Google Scholar]
Wolenski, P.; Žabić, S. A sampling method and approximation results for impulsive systems. SIAM J. Control Optim. 2007, 46, 983–998. [Google Scholar] [CrossRef]
Arutyunov, A.; Dykhta, V.; Pereira, L.F. Necessary conditions for impulsive nonlinear optimal control problems without a priori normality assumptions. J. Optim. Theory Appl. 2005, 124, 55–77. [Google Scholar] [CrossRef]
Arutyunov, A.V.; Karamzin, D.Y.; Pereira, F.L. State constraints in impulsive control problems: Gamkrelidze-like conditions of optimality. J. Optim. Theory Appl. 2015, 166, 440–459. [Google Scholar] [CrossRef]
Arutyunov, A.V.; Karamzin, D.Y. A survey on regularity conditions for state-constrained optimal control problems and the non-degenerate maximum principle. J. Optim. Theory Appl. 2020, 184, 697–723. [Google Scholar] [CrossRef]
Fontes, F.A.C.C.; Frankowska, H. Normality and nondegeneracy for optimal control problems with state contraints. J. Optim. Theory Appl. 2015, 166, 115–136. [Google Scholar] [CrossRef]
Frankowska, H.; Tonon, D. Inward pointing trajectories, normality of the maximum principle and the non occurrence of the Lavrentieff phenomenon in optimal control under state constraints. J. Convex Anal. 2013, 20, 1147–1180. [Google Scholar]
Lopes, S.O.; Fontes, F.A.C.C.; de Pinho, M.d.R. On constraint qualifications for nondegenerate necessary conditions of optimality applied to optimal control problems. Discret. Contin. Dyn. Syst. 2011, 29, 559–575. [Google Scholar] [CrossRef]
Motta, M.; Palladino, M.; Rampazzo, F. Unbounded Control, Infimum Gaps, and Higher Order Normality. SIAM J. Control Optim. 2022, 60, 1436–1462. [Google Scholar] [CrossRef]
Fusco, G.; Motta, M. Impulsive optimal control problems with time delays in the drift term. arXiv 2023, arXiv:2307.12806. [Google Scholar]
Gao, C.X.; Li, K.Z.; Feng, E.M.; Xiu, Z.L. Nonlinear impulsive system of fed- batch culture in fermentative production and its properties. Chaos Soliton Fract. 2006, 28, 271–277. [Google Scholar] [CrossRef]
Xiu, Z.L.; Song, B.H.; Sun, L.H.; Zeng, A.P. Theoretical analysis of effects of metabolic overflow and time delay on the performance and dynamic behavior of a two-stage fermentation process. Biochem. Eng. J. 2002, 11, 101–109. [Google Scholar] [CrossRef]
Li, X.; Cao, J.; Daniel, W.C.H. Impulsive Control of Nonlinear Systems with Time-Varying Delay and Applications. IEEE Trans. Cybern. 2020, 50, 2661–2673. [Google Scholar] [CrossRef] [PubMed]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fusco, G.; Motta, M. Abnormality and Strict-Sense Minimizers That Are Not Extended Minimizers. Mathematics 2024, 12, 943. https://0-doi-org.brum.beds.ac.uk/10.3390/math12070943

AMA Style

Fusco G, Motta M. Abnormality and Strict-Sense Minimizers That Are Not Extended Minimizers. Mathematics. 2024; 12(7):943. https://0-doi-org.brum.beds.ac.uk/10.3390/math12070943

Chicago/Turabian Style

Fusco, Giovanni, and Monica Motta. 2024. "Abnormality and Strict-Sense Minimizers That Are Not Extended Minimizers" Mathematics 12, no. 7: 943. https://0-doi-org.brum.beds.ac.uk/10.3390/math12070943

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Abnormality and Strict-Sense Minimizers That Are Not Extended Minimizers

Abstract

1. Introduction

2. Notations and Basic Assumptions

2.1. Notations and Preliminaries

2.2. Basic Assumptions

3. Type-E or Type-S Local Infimum Gap and Abnormality

3.1. Type-E and Type-S Local Infimum Gaps

3.2. Main Results

4. An Application: The Impulsive Extension

4.1. An Impulsive Optimization Problem

4.2. An Example

5. Proof of Theorem 1

5.1. Proof of Statement (i)

5.2. Proof of Statement (ii)

6. Concluding Remarks

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI