Neural Network and Extended State Observer-Based Model Predictive Control for Smooth Braking at Preset Points in Autonomous Vehicles

Chen, Jianlin; Xu, Yang; Zheng, Zixuan

doi:10.3390/drones8060273

Open AccessArticle

Neural Network and Extended State Observer-Based Model Predictive Control for Smooth Braking at Preset Points in Autonomous Vehicles

by

Jianlin Chen

^1,2

,

Yang Xu

^3,*

and

Zixuan Zheng

⁴

¹

School of Civil Aviation, Northwestern Polytechnical University, 127 Youyi West Street, Xi’an 710072, China

²

Yangtze River Delta Research Institution, Northwestern Polytechnical University, 27 Zigang Road, Taicang 215400, China

³

School of Aeronautics and Astronautics, Tiangong University, 399 Binshui West Road, Tianjin 300387, China

⁴

School of Astronautics, Northwestern Polytechnical University, 127 Youyi West Street, Xi’an 710072, China

^*

Author to whom correspondence should be addressed.

Drones 2024, 8(6), 273; https://0-doi-org.brum.beds.ac.uk/10.3390/drones8060273

Submission received: 27 April 2024 / Revised: 17 June 2024 / Accepted: 17 June 2024 / Published: 20 June 2024

(This article belongs to the Special Issue Advances in Modeling, Estimation, and Control of Intelligent Transportation Systems)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we explore the problem of smooth braking at preset points in autonomous vehicles using model predictive control (MPC) with a receding horizon extended state observer (RHESO) and a neural network (NN). An NN-based modeling method is proposed to intuitively describe the relationship between vehicle speed and the vehicle controllers (brake and throttle), and establish a dynamic model of autonomous vehicles. A sufficient condition is put forward to guarantee the convergence of the proposed NN. Furthermore, a composite MPC strategy based on RHESO is designed, which optimizes a given cost function over the receding horizon while mitigating the effects of modeling inaccuracies and disturbances. Additionally, easily verifiable conditions are provided to ensure the autonomous driving vehicles’ uniform boundedness. Numerically illustrative examples are given to demonstrate the effectiveness of the proposed approach.

Keywords:

model predictive control; extended state observer; neural network; autonomous vehicle; preset point smooth braking

1. Introduction

Generally, non-stationary braking in vehicles poses major safety hazards, which may cause vehicle imbalance and accidents [1], resulting in uncomfortable riding experiences for passengers [2], damaging brake system components, increasing repair and maintenance costs [3], and even posing a threat to pedestrians on the road [4].

Unlike manual braking, the emergence of autonomous vehicles (AVs) makes it easier for vehicles to brake smoothly and accurately. Accuracy and smoothness are the two key performance criteria of smooth braking in AVs, meaning low tracking errors can be achieved without aggressive braking/throttle operations. Furthermore, smooth and precise braking in AVs has potential applications in many fields, such as bus passenger pick-up and drop-off [5], truck loading and unloading of goods at preset points [6], and passenger elevator docking with airplanes [7]. In these application scenarios, AVs must stop smoothly and precisely at preset points to ensure comfortable riding experiences for passengers and avoid collisions. Moreover, smooth braking at preset points in AVs significantly improves both parking efficiency and parking space utilization.

Smooth braking at preset points in AVs is a typical position and speed (PS) tracking control problem, which is also a dynamic control problem [8,9]. Existing PS tracking methods can be broadly categorized into three groups. The first group of methods does not consider vehicle dynamics or disturbance models. Control commands are solely generated based on tracking errors, employing designs such as proportional–integral–derivative (PID) controllers and their variants [10,11]. The disadvantage is that control precision heavily depends on adjustable parameters with high conservation and poor portability. The second method type relies on precise dynamic models that overlook disturbances to better predict PS information [12]. In [13], the authors analyzed a smooth braking torque control strategy for a brushless DC motor. In [14], a preview of a servo-loop speed control algorithm was proposed to achieve smooth, accurate, and computationally inexpensive speed tracking for AVs. Furthermore, cooperative braking control was introduced to ensure that AVs maintained safe spacing and stopped rapidly, smoothly, and accurately at the designated target positions with zero velocity [15]. These studies range from torque control based on brushless DC motor models to designing smooth and accurate braking methods using brushless DC motor models to designing smooth and accurate braking methods based on vehicle kinematic models, achieving various conditions for smooth and accurate braking of autonomous vehicles from hardware to software levels. However, precise dynamic models of AVs are difficult to obtain, and the real-time performance of control calculations using these models is often poor. A common drawback of the first two method types is their inability to handle the inherent constraints of AVs, such as speed limits, front-wheel deviation limits, and control input constraints

Different from the first two groups of methods, the third type of control strategy is based on model predictive control (MPC), which effectively handles various constraints. By minimizing the difference between the actual and predicted PS, the optimal control signal can be generated through repeated online optimization [16,17]. Under the MPC framework, there are schemes that use dynamic models of AVs [18,19], as well as schemes that use kinematic models of AVs [20,21]. Note that schemes using dynamic models for tracking PS constitute a servo control problem [22,23], with the advantage of directly tracking speed as a state variable. However, the disadvantage is that accurate model parameters are hard to obtain, and the real-time solution of the control signal cannot be guaranteed due to strong nonlinearity. In contrast, the scheme using kinematic models of AVs essentially addresses a speed planning problem [24], with the advantage of easily obtaining precise model parameters and ensuring real-time control signal solutions; the disadvantage is that the planned speed must be tracked by the servo layer, which leads to delays in speed tracking.

To maintain the high tracking accuracy of servo control while reducing the optimization load, the best approach is to combine the strengths of both the kinematic and dynamic models to establish a new dynamic model that can effectively handle the servo control of PS. One of the most direct methods is to add the speed variable as a state variable into the kinematic model. Correspondingly, throttle and brake openings are used as control inputs to manage the AVs’ speed servo control. However, due to unknown nonlinearities and disturbances, the relationship between the throttle (or brake) opening and AVs’ speed is challenging to determine. As an excellent tool for approximating nonlinear functions, neural networks (NNs) can describe the relationship between AVs’ speed and the throttle (or brake) opening [25,26]. There is significant research on using NNs to approximate unknown nonlinear system models, but few studies have applied NNs to autonomous vehicle modeling.

In addition, distractions such as pedestrians in vehicles, uneven road surfaces, and the coupling between equipment inside the vehicle often affect the accurate tracking of AVs’ PS. To minimize the impact of these disturbances, it is necessary to design disturbance observers for autonomous vehicles. The existing observers are primarily divided into two categories. The first type of observer is designed using the dynamic characteristics of the disturbance, which usually requires prior knowledge of the dynamic model parameters of the disturbance [27,28,29,30]. Another type of disturbance observer is the extended state observer (ESO); it can estimate the disturbance with a bounded rate of change and it has a wider application range [31,32,33]. However, these studies mainly focus on the final observation accuracy of the disturbance observer, while neglecting the stationarity of the disturbance observation process (i.e., the disturbance observation process may have a significant overshoot). One of the goals of this paper is to comprehensively consider the disturbance estimation process and smooth braking, as well as realize smooth and accurate braking at preset points in AVs while canceling the disturbance effects.

This paper aims to address the problem of smooth braking at preset points in AVs with a receding horizon extended state observer (RHESO)-based composite MPC strategy. The main contributions of this paper are as follows:

(1): A novel extended kinematic model of AVs is established using a proposed NN such that the relationship between the AVs’ speed and throttle (and brake) opening is more clearly described;
(2): A novel composite MPC strategy based on the RHESO is put forward, which is optimized over the receding horizon while eliminating the effects of the model’s inaccuracy and disturbance;
(3): Easily verifiable sufficient conditions are established to ensure the recursive feasibility of the MPC scheme and the stability of the closed-loop system.

Finally, the notation used in this article are presented in Table 1.

2. Problem Formulations and Preliminaries

2.1. System Model

A general vehicle kinematic model is given as follows:

\begin{matrix} \dot{ξ_{c}} (t) = [\begin{matrix} \dot{x} (t) \\ \dot{y} (t) \\ \dot{θ} (t) \end{matrix}] = [\begin{matrix} v (t) cos (θ (t)) \\ v (t) sin (θ (t)) \\ ω (t) \end{matrix}] \end{matrix}

(1)

where

ξ_{c} (t) \in R^{3}

indicates the state vector,

x (t) \in R

and

y (t) \in R

are the reference coordinates on the rear axle center in a geodetic coordinate system,

θ (t) \in R

denotes the heading angle,

u_{c} (t) = {[ω (t) v (t)]}^{T} \in R^{2}

is the control input with the linear speed

v (t) \in R

, the angular speed is

ω (t) ≜ v (t) \frac{tan (δ (t))}{L} \in R

, where

L \in R

is the vehicle wheelbase, and

δ (t) \in R

is the wheel deflection angle. The vehicle model is presented in Figure 1.

Generally, the inner and outer ring structures are used to control the intelligent driving vehicle. The function of the outer loop is to generate the desired longitudinal speed

v (t)

and lateral speed

ω (t)

according to the desired trajectory, while the function of the inner loop is to obtain the real control input signal, i.e., the throttle opening (or brake opening) and steering wheel angle, by tracking the desired longitudinal speed

v (t)

and lateral speed

ω (t)

, respectively.

Note that if the desired longitudinal speed

v (t)

and lateral speed

ω (t)

change rapidly, the tracking of

v (t)

and

ω (t)

in the inner loop may experience hysteresis and inaccuracies. Under such conditions, it becomes challenging to achieve smooth braking at preset points for AVs. Since the variation in longitudinal velocity during braking typically exceeds that of lateral velocity, it can be assumed that

ω (t) = 0

and

θ (t) = \bar{θ}

with

\bar{θ}

being a known parameter. Considering these characteristics, this paper proposes a neural network (NN)-based longitudinal kinematic model, as follows:

\begin{matrix} \dot{ξ} (t) = [\begin{matrix} \dot{x} (t) \\ \dot{y} (t) \\ \dot{v} (t) \end{matrix}] = [\begin{matrix} v (t) cos (\bar{θ}) \\ v (t) sin (\bar{θ}) \\ W^{T} σ (u_{tb} (t)) + ε (t) \end{matrix}], \end{matrix}

(2)

where the extended dynamic model

\dot{v} (t) = W^{T} σ (u_{tb} (t)) + ε (t)

is based on an NN to be designed;

W \in R^{3 \times 1}

is an unknown ideal matrix consisting of weight and bias;

σ (\cdot)

is a vector of smooth activation functions;

ε (t)

is the external disturbance satisfying

∥ ε (t) ∥ \leq \bar{ε}

and

∥ \dot{ε} (t) ∥ \leq \tilde{ε}

with

\bar{ε}

, and

\tilde{ε}

being known positive constants;

u_{tb} (t) ≜ {[v (t) u_{o} (t) 1]}^{T}

is the input signal of the NN, and

u_{o} (t)

is the longitudinal speed control variable, where

\begin{matrix} u_{o} (t) = \{\begin{matrix} u_{to} (t), & if u_{to} (t) \in (0, 1], u_{bo} (t) = 0 \\ u_{bo} (t), & if u_{bo} (t) \in (0, 9], u_{to} (t) = 0 \\ 0, & if u_{to} (t) = 0, u_{bo} (t) = 0 \end{matrix} \end{matrix}

with

u_{to} (t)

and

u_{bo} (t)

being the throttle opening and brake opening, respectively.

As stated in [34], it is assumed that there exists a known constant,

ϵ_{W}

, such that

∥ W ∥ \leq ϵ_{W}

. In addition, according to [35], the gradient of

σ (\cdot)

is assumed to satisfy

∥ \nabla σ (\cdot) ∥ \leq ϵ_{σ}

with

ϵ_{σ}

being a positive constant. For simplicity, we denote

σ (t) ≜ σ (u_{tb} (t))

.

Note that the AV is driven by the control input

u (t) ≜ u_{o} (t)

in the form of a segment constant in each sampling interval

[i T, (i + 1) T)

,

i \in Z_{\geq 0}

, with sampling period T (in seconds). Based on (2), a discrete-time AV kinematic model is given as follows:

\begin{matrix} ξ (k + 1) = [\begin{matrix} x (k) \\ y (k) \\ v (k) \end{matrix}] + [\begin{matrix} T v (k) cos (\bar{θ}) \\ T v (k) sin (\bar{θ}) \\ T W^{T} σ (k) + T ε (k) \end{matrix}] . \end{matrix}

(3)

In the following, the control framework diagram of this paper is presented in Figure 2. In Figure 2, the control framework is divided into offline and online parts. The offline part is used to collect data on the speed and acceleration of AVs under different throttle and brake openings and to train NN parameters using the collected data to prepare for subsequent system control. The online part is a closed-loop control loop, including the RHESO and MPC that need to be designed later.

2.2. Estimation of the Unknown NN Parameter

The ideal NN parameter W is an unknown matrix that exists in theory, but an estimated matrix

{\hat{W}}^{*}

that is close enough to W can be obtained through estimation methods. Therefore, an adaptive estimator is designed to obtain the estimation

{\hat{W}}^{*}

.

Consider the following adaptive estimator:

\begin{matrix} \hat{v} (k + 1) = v (k) + T {\hat{W}}^{T} (k) σ (k), \end{matrix}

where

\hat{v} (k)

is the estimation of

v (k)

;

\hat{W} (k)

is the estimation of W at time instant k. Subtracting

v (k + 1)

from

\hat{v} (k + 1)

leads to the following:

\begin{matrix} \hat{v} (k + 1) - v (k + 1) = T ({\hat{W}}^{T} (k) - W^{T}) σ (k) - T ε (k), \end{matrix}

which yields the following:

\begin{matrix} \frac{1}{T} (\hat{v} (k + 1) - v (k + 1)) = ({\hat{W}}^{T} (k) - W^{T}) σ (k) - ε (k) . \end{matrix}

(4)

Denote

\begin{matrix} e (k + 1) ≜ \frac{1}{T} (\hat{v} (k + 1) - v (k + 1)), \tilde{W} (k) ≜ \hat{W} (k) - W, \end{matrix}

then, (4) can be rewritten as follows:

\begin{matrix} e (k + 1) = {\tilde{W}}^{T} (k) σ (k) - ε (k) . \end{matrix}

To minimize

E (k) = \frac{1}{2} e^{T} (k + 1) e (k + 1)

, using the Levenberg–Marquardt algorithm [36], the tuning law of

\hat{W} (k)

is designed as follows:

\begin{matrix} \hat{W} (k + 1) = & \hat{W} (k) - \frac{Λ}{1 + σ^{T} (k) σ (k)} {(\frac{\partial E (k)}{\partial \hat{W} (k)})}^{T} = \hat{W} (k) - \frac{Λ σ (k) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)}, \end{matrix}

(5)

where

Λ

is a positive constant to be designed.

In the following, a theorem is given to demonstrate that the proposed tuning law (5) can make the estimation

\hat{W} (k)

converge to a neighborhood of ideal value W.

Theorem 1.

If

σ (k)

in (5) is persistently exciting and

0 < Λ < 1

holds, then

\tilde{W} (k)

is uniformly bounded to the set

W

where

W ≜ {\tilde{W} | ∥ \tilde{W} ∥^{2} \leq \bar{κ}}

with the following:

\begin{matrix} \bar{κ} ≜ \frac{1}{κ} {(1 + \frac{1}{\sqrt{1 - Λ}})}^{2} {\bar{ε}}^{2}, κ ≜ \underset{̲}{λ} (σ ({[0 0 1]}^{T}) σ^{T} ({[0 0 1]}^{T})) . \end{matrix}

Proof.

We select a Lyapunov candidate function, as follows:

\begin{matrix} V (\tilde{W} (k)) = {\tilde{W}}^{T} (k) Λ^{- 1} \tilde{W} (k), \end{matrix}

thus, we have the following:

\begin{matrix} V (\tilde{W} (k + 1)) - V (\tilde{W} (k)) \\ = & - \frac{e (k + 1) σ^{T} (k) \tilde{W} (k)}{1 + σ^{T} (k) σ (k)} - \frac{{\tilde{W}}^{T} (k) σ (k) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)} + \frac{e (k + 1) σ^{T} (k) Λ σ (k) e^{T} (k + 1)}{{(1 + σ^{T} (k) σ (k))}^{2}} \\ = & - \frac{2 e (k + 1) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)} - \frac{e (k + 1) ε^{T} (k)}{1 + σ^{T} (k) σ (k)} - \frac{ε (k) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)} \\ + \frac{σ^{T} (k) Λ σ (k) e (k + 1) e^{T} (k + 1)}{{(1 + σ^{T} (k) σ (k))}^{2}} \\ \leq & - \frac{e (k + 1) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)} + \frac{ε (k) ε^{T} (k)}{1 + σ^{T} (k) σ (k)} + \frac{σ^{T} (k) Λ σ (k) e (k + 1) e^{T} (k + 1)}{{(1 + σ^{T} (k) σ (k))}^{2}} \\ \leq & - (1 - \frac{σ^{T} (k) Λ σ (k)}{1 + σ^{T} (k) σ (k)}) \frac{e (k + 1) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)} + \frac{ε (k) ε^{T} (k)}{1 + σ^{T} (k) σ (k)} \\ \leq & - (1 - Λ) \frac{e (k + 1) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)} + \frac{ε (k) ε^{T} (k)}{1 + σ^{T} (k) σ (k)} . \end{matrix}

(6)

Select a proper

Λ

, such that

0 < Λ < 1

. By letting

V (\tilde{W} (k + 1)) - V (\tilde{W} (k)) < 0

, we can obtain the following:

\begin{matrix} \frac{ε (k) ε^{T} (k)}{1 + σ^{T} (k) σ (k)} < (1 - Λ) \frac{e (k + 1) e^{T} (k + 1)}{1 + σ^{T} (k) σ (k)}, \end{matrix}

which indicates the following:

\begin{matrix} ε^{T} (k) ε (k) < (1 - Λ) e^{T} (k + 1) e (k + 1) . \end{matrix}

(7)

This means that

V (\tilde{W} (k))

will remain convergent until condition (7) is not satisfied. Due to

e (k + 1) \in R

and

ε (k) \in R

, we have the following:

\begin{matrix} \frac{1}{1 - Λ} ε^{2} (k) < {({\tilde{W}}^{T} (k) σ (k) - ε (k))}^{2}, \end{matrix}

which yields the following:

\begin{matrix} {(1 + \frac{1}{\sqrt{1 - Λ}})}^{2} ε^{2} (k) < {\tilde{W}}^{T} (k) σ (k) σ^{T} (k) \tilde{W} (k) . \end{matrix}

(8)

To ensure that inequality (8) always holds, we require the following:

\begin{matrix} {(1 + \frac{1}{\sqrt{1 - Λ}})}^{2} {\bar{ε}}^{2} < \underset{̲}{λ} (σ (k) σ^{T} (k)) {\tilde{W}}^{T} (k) \tilde{W} (k) . \end{matrix}

(9)

Considering

σ (k) = σ (u_{tb} (k))

, we have the following:

\begin{matrix} κ ≜ \underset{̲}{λ} (σ (k) σ^{T} (k)) = \underset{̲}{λ} (σ ({[0 0 1]}^{T}) σ^{T} ({[0 0 1]}^{T})) . \end{matrix}

Then, (9) can be rewritten as follows:

\begin{matrix} \frac{1}{κ} {(1 + \frac{1}{\sqrt{1 - Λ}})}^{2} {\bar{ε}}^{2} < {\tilde{W}}^{T} (k) \tilde{W} (k) . \end{matrix}

Denote

W ≜ {\tilde{W} (k) | {∥ \tilde{W} (k) ∥}^{2} \leq \bar{κ}}

with

\bar{κ} ≜ \frac{1}{κ} {(1 + \frac{1}{\sqrt{1 - Λ}})}^{2} {\bar{ε}}^{2}

. Based on the fact that

σ (k)

is persistently exciting [37], it can be obtained that

\tilde{W} (k)

would converge to the set

W

. Therefore,

\tilde{W} (k)

is uniformly bounded. This completes the proof. □

From Theorem 1, it can be obtained that

\tilde{W} (k)

will finally converge to the set

W

. Therefore, if there exists

\tilde{W} (k_{s}) \in W

for

k_{s} \in Z_{\geq 0}

, then

\tilde{W} (k_{s} + 1) \in W

holds. Hence, it is easy to obtain the following:

\begin{matrix} ∥ \tilde{W} (k_{s} + 1) - \tilde{W} (k_{s}) ∥ = & ∥ \hat{W} (k_{s} + 1) - \hat{W} (k_{s}) ∥ \leq ∥ \tilde{W} (k_{s} + 1) ∥ + ∥ \tilde{W} (k_{s}) ∥ \leq 2 \sqrt{\bar{κ}} . \end{matrix}

(10)

Inequality (10) can be used as a termination condition of the tuning law in (5). When (10) is satisfied,

{\hat{W}}^{*} = \hat{W} (k_{s})

can be obtained.

In the following, the process of approximating AVs’ dynamics using a NN is given in Algorithm 1.

Algorithm 1 first collects data on changes in AV velocity and acceleration under different throttle and brake opening conditions. Secondly, these data are divided into inputs and outputs to train the NN. Thirdly, the NN structure, loss function, and activation function are designed. Finally, the NN is trained until its parameters converge. At this stage,

{\hat{W}}^{*}

is obtained and the approximate dynamic model of the AV can be obtained.

Remark 1.

The process of training

\hat{W} (k)

involves first tuning the throttle and brake, that is, recording the impact of the throttle and brake on speed. Then, the collected data are adopted for training to obtain

{\hat{W}}^{*}

. Therefore, the process of training the NN parameter is completed offline.

2.3. Linearization Model and Constraints of the AV

Note that (3) can be rewritten in the following linear form:

\begin{matrix} ξ (k + 1) = & A ξ (k) + B u (k) + D d (k), \end{matrix}

(11)

where

\begin{matrix} A = & [\begin{matrix} 1 & 0 & T cos (\bar{θ}) \\ 0 & 1 & T sin (\bar{θ}) \\ 0 & 0 & 1 + T {\hat{W}}_{1}^{*} a_{σ} \end{matrix}], B = [\begin{matrix} 0 \\ 0 \\ T {\hat{W}}_{2}^{*} b_{σ} \end{matrix}] = D, \\ d (k) = & \frac{1}{{\hat{W}}_{2}^{*} b_{σ}} ({\hat{W}}^{* T} σ (k) - e (k + 1) - {\hat{W}}_{1}^{*} a_{σ} v (k)) - u (k), \end{matrix}

where

{\hat{W}}_{i}^{*}

(i = 1, 2, 3)

indicates the i-th element of

{\hat{W}}^{*}

; and

a_{σ}

and

b_{σ}

represent the linearization coefficients of

σ (k)

with

u_{o} (k) = 0

. Assume that

| d (k) | \leq \bar{d}

with

\bar{d} > 0

being a known parameter.

Algorithm 1 Algorithm for approximating AVs’ dynamics using a NN

Brake data collection:

1. Divide

V_{max}

into n equal parts for AVs with the maximum speed of

V_{max}

. Denote

a_{i} = i \frac{V_{max}}{m}

, where

(i = 1, 2, \dots, m)

is the step size for each increase in velocity. Divide the maximum brake opening, 9, equally into n parts, and the size of each brake opening is

\frac{9}{n}

. Denote

b_{j} = j \frac{9}{n}

, where

(j = 1, 2, \dots, n)

denotes the step size for each increase in the brake opening value. Set

i = 1

and

j = 1

. Go to step 2.

2. Set the AV to travel at velocity

a_{i}

. Then, apply the AV brake with the brake opening of

b_{j}

. Go to step 3.

3. Collect the data of the velocity

v (a_{i}, b_{j})

and acceleration

a (a_{i}, b_{j})

from the start of braking until the AV comes to a complete stop. Go to step 4.

4. If

i + 1 \leq m

,

i = i + 1

. Go to step 2. Otherwise, go to step 5.

5. If

j + 1 \leq \frac{9}{n}

, set

i = 1

and

j = j + 1

. Go to step 2. Otherwise, go to step 6.

Throttle data collection:

6. Divide

V_{max}

into p equal parts for AVs with the maximum speed of

V_{max}

. Denote

c_{i} = i \frac{V_{max}}{p}

,

(i = 0, 1, 2, \dots, p)

as the step size for each increase in velocity. Divide the maximum throttle opening 1 equally into q parts, and the size of each brake opening is

\frac{1}{q}

. Denote

d_{j} = j \frac{1}{q}

,

(j = 1, 2, \dots, q)

as the step size for each increase in the brake opening value. Set

i = 0

and

j = 1

. Go to step 7.

7. Set the AV to travel at a fixed velocity

c_{i}

and accelerate at a throttle opening of

d_{j}

. Go to step 8.

8. Collect the data of the velocity

v (c_{i}, d_{j})

and acceleration

a (c_{i}, d_{j})

from the beginning of acceleration until the AV reaches

V_{max}

. Go to 9.

9. If

i + 1 \leq p

,

i = i + 1

. Go to step 7. Otherwise, go to step 10.

10. If

j + 1 \leq \frac{1}{q}

, set

i = 1

and

j = j + 1

. Go to step 7. Otherwise, go to 11.

Training and testing the NN:

11. Set the input of the NN as

b_{j}

and

v (a_{i}, b_{j})

, with the output being

a (a_{i}, b_{j})

. Alternatively, the input of NN can be set as

c_{i}

and

v (c_{i}, d_{j})

, with the output being

a (c_{i}, d_{j})

. Go to step 12.

12. Extract

70 %

of these data as training data, and the remaining data as test data for backup. Go to step 13.

13. Define the NN structure, loss function, and activation function

σ (\cdot)

. Using the training data obtained in step 12, the NN is trained until the network parameters converge. Go to step 13.

14. Using the NN obtained in step 12. If the loss meets the requirements, it indicates successful training. Terminate the algorithm and export the NN parameters

{\hat{W}}^{*}

. If the loss does not meet the requirements, reset the training parameters and retrain. Go to step 13.

Considering the driving safety of the AV, its state and control input are subject to the following constraints:

\begin{matrix} ξ (k) \in & X ≜ {ξ (k) : {\tilde{h}}_{i, min, ξ} \leq ξ_{i} (k) \leq {\tilde{h}}_{i, max, ξ}}, \end{matrix}

(12)

\begin{matrix} u (k) \in & U ≜ {u (k) : {\tilde{h}}_{min, u} \leq u (k) \leq {\tilde{h}}_{max, u}}, \end{matrix}

(13)

where

ξ_{i} (k)

(i = 1, 2, 3)

indicates the i-th element of

ξ (k)

;

{\tilde{h}}_{i, min, ξ}

and

{\tilde{h}}_{i, max, ξ}

represent the given minimum and maximum values of

ξ_{i} (k)

, respectively; and

{\tilde{h}}_{min, u}

and

{\tilde{h}}_{max, u}

denote the given minimum and maximum values of

u (k)

, respectively. Then, (12) and (13) can be rewritten as follows:

\begin{matrix} ξ (k) \in & X ≜ {ξ (k) : b_{ξ}^{T} ξ (k) \leq h_{ξ}}, \end{matrix}

(14)

\begin{matrix} u (k) \in & U ≜ {u (k) : b_{u}^{T} u (k) \leq h_{u}}, \end{matrix}

(15)

where

b_{ξ}

,

b_{u}

,

h_{ξ}

, and

h_{u}

are matrices obtained from (12) and (13).

2.4. The Design of RHESO

In this subsection, a RHESO is designed to accurately estimate the unknown disturbance

d (k)

, as follows:

\begin{matrix} m (k + 1) = & (I - L_{d} D) (m (k) + L_{d} ξ (k)) - L_{d} (A ξ (k) + B u (k)) + b (k), \end{matrix}

(16a)

\begin{matrix} \hat{d} (k) = & m (k) + L_{d} ξ (k), \end{matrix}

(16b)

where

m (k) \in R

is an intermediate variable;

\hat{d} (k) \in R

is an estimation of

d (k)

;

L_{d} \in R^{1 \times 3}

is an observer gain; and

b (k) \in R

is a decision variable to be designed. As a result, both

L_{d}

and

b (k)

need to be designed subsequently.

Furthermore, system (11) can be extended to the following system:

\begin{matrix} ξ (k + 1) = & A ξ (k) + B u (k) + D d (k), \end{matrix}

(17a)

\begin{matrix} d (k + 1) = & d (k) + h (k), \end{matrix}

(17b)

where

h (k) \in R

is an unknown bounded nonlinear term satisfying

| h (k) | \leq H

, which is equivalent to

b_{h}^{T} h (k) \leq h_{h}

with

b_{h} = [1 - 1]

and

h_{h} = {[H H]}^{T}

.

Considering

e_{d} (k) = d (k) - \hat{d} (k)

, we obtain the following:

\begin{matrix} e_{d} (k + 1) = & (I - L_{d} D) e_{d} (k) - b (k) + h (k) . \end{matrix}

(18)

To restrict the transient performance of the RHESO (16a) and (16b), the following constraint of

e_{d} (k)

is given as follows:

\begin{matrix} e_{d} (k) \in E ≜ {e_{d} (k) : - {\tilde{h}}_{e_{d}} \leq e_{d} (k) \leq {\tilde{h}}_{e_{d}}} \end{matrix}

(19)

where

{\tilde{h}}_{e_{d}} > 0

is a known parameter. Then, (19) can be rewritten as follows:

\begin{matrix} e_{d} (k) \in E = {e_{d} (k) : b_{e_{d}}^{T} e_{d} (k) \leq h_{e_{d}}} \end{matrix}

(20)

where

b_{e_{d}} = [1 - 1]

and

h_{e_{d}} = {[{\tilde{h}}_{e_{d}} {\tilde{h}}_{e_{d}}]}^{T}

.

Remark 2.

Different from the traditional ESO, the RHESO proposed in (16a) and (16b) allows us to obtain the optimal estimation by optimizing the decision variable

b (k)

. Furthermore, it is possible to avoid significant overshoot caused by the estimation error

e_{d} (k)

in terms of the constraint (19), thereby ensuring that system (11) does not experience performance degradation due to a large estimation error

e_{d} (k)

before convergence.

2.5. Control Input and System Reconstruction

In this subsection, the control input is designed for system (11). Firstly, the control input of system (11) is designed as follows:

\begin{matrix} u (k) = \{\begin{matrix} K ξ (k) + c (k) - V \hat{d} (k), & ξ (k) \notin X_{T} \\ K ξ (k) - V \hat{d} (k), & ξ (k) \in X_{T} \end{matrix} \end{matrix}

(21)

where

V ≜ {[0 1]}^{T}

;

X_{T}

is a terminal set to be designed;

K \in R^{2 \times 3}

is the feedback gain to be designed; and

c (k) \in R^{2}

is the decision variable to be designed.

Secondly, based on (11), (18) and (21), the extended state system is obtained as follows:

\begin{matrix} \overset{ˇ}{η} (k + 1) = {\tilde{A}}_{η} \overset{ˇ}{η} (k) + {\tilde{B}}_{η} u_{η} (k) + d_{η} (k) + ϖ (k), \end{matrix}

(22)

where

\begin{matrix} u_{η} (k) & = K_{η} \overset{ˇ}{η} (k) + c_{η} (k) - {[\hat{d} (k) 0]}^{T}, \\ \overset{ˇ}{η} (1) & ≜ [\begin{matrix} ξ (1) \\ 0 \end{matrix}], \overset{ˇ}{η} (k) ≜ [\begin{matrix} ξ (k) \\ e_{d} (k - 1) \end{matrix}], {\tilde{A}}_{η} ≜ [\begin{matrix} A & 0 \\ 0 & I \end{matrix}], \\ {\tilde{B}}_{η} & ≜ [\begin{matrix} B & 0 \\ 0 & - I \end{matrix}], d_{η} (k) ≜ [\begin{matrix} D d (k) \\ 0 \end{matrix}], K_{η} ≜ [\begin{matrix} K & 0 \\ 0 & L_{d} D \end{matrix}], \end{matrix}

(23)

\begin{matrix} ϖ (k) & ≜ [\begin{matrix} 0 \\ h (k - 1) \end{matrix}], k \in Z [2, + \infty), \end{matrix}

(24)

\begin{matrix} c_{η} (k) & = [\begin{matrix} c (k) \\ b (k - 1) \end{matrix}], k \in Z [2, + \infty) . \end{matrix}

(25)

Then, we obtain the following:

\begin{matrix} F \overset{ˇ}{η} (k + 1) = & ({\tilde{A}}_{η} + {\tilde{B}}_{η} K_{η}) \overset{ˇ}{η} (k) + {\tilde{B}}_{η} c_{η} (k) + ϖ (k) = {\tilde{Φ}}_{η} \overset{ˇ}{η} (k) + {\tilde{B}}_{η} c_{η} (k) + ϖ (k) \end{matrix}

(26)

where

\begin{matrix} F & ≜ [\begin{matrix} I & - D \\ 0 & I \end{matrix}], {\tilde{Φ}}_{η} ≜ [\begin{matrix} Φ & 0 \\ 0 & Φ_{e_{d}} \end{matrix}], Φ ≜ A + B K, Φ_{e_{d}} ≜ I - L_{d} D . \end{matrix}

Since F is a full rank matrix, (26) is converted as follows:

\begin{matrix} \overset{ˇ}{η} (k + 1) = & (F^{- 1} {\tilde{A}}_{η} + F^{- 1} {\tilde{B}}_{η} K_{η}) \overset{ˇ}{η} (k) + F^{- 1} {\tilde{B}}_{η} c_{η} (k) + F^{- 1} ϖ (k) \\ = & (A_{η} + B_{η} K_{η}) \overset{ˇ}{η} (k) + B_{η} c_{η} (k) + F^{- 1} ϖ (k) \\ = & Φ_{η} \overset{ˇ}{η} (k) + B_{η} c_{η} (k) + F^{- 1} ϖ (k), \end{matrix}

(27)

where

Φ_{η} = A_{η} + B_{η} K_{η}

,

A_{η} = F^{- 1} {\tilde{A}}_{η}

and

B_{η} = F^{- 1} {\tilde{B}}_{η}

. According to (14), (15) and (19), the constraints of

\overset{ˇ}{η} (k)

and

c_{η} (k)

are obtained as follows:

\begin{matrix} \overset{ˇ}{η} (k) & ≜ {\overset{ˇ}{η} (k) : b_{η}^{T} \overset{ˇ}{η} (k) \leq h_{η}}, \\ c_{η} (k) & ≜ {c_{η} (k) : {\tilde{b}}_{u}^{T} \overset{ˇ}{η} (k) + b_{c_{η}}^{T} c_{η} (k) \leq h_{c_{η}} + h_{d} (k)} \end{matrix}

where

\begin{matrix} b_{η} & ≜ [\begin{matrix} b_{ξ} & 0 \\ 0 & b_{e_{d}} \end{matrix}], {\tilde{b}}_{u} ≜ [\begin{matrix} K^{T} b_{u} & 0 \\ 0 & 0 \end{matrix}], b_{c_{η}} ≜ [\begin{matrix} b_{u} & 0 \\ 0 & 0 \end{matrix}], \\ h_{η} & ≜ [\begin{matrix} h_{ξ} \\ h_{e_{d}} \end{matrix}], h_{c_{η}} ≜ [\begin{matrix} h_{u} \\ 0 \end{matrix}], h_{d} (k) ≜ [\begin{matrix} b_{u}^{T} \hat{d} (k) \\ 0 \end{matrix}] . \end{matrix}

Based on

| h (k) | \leq H

, the constraint of

ϖ (k)

is obtained as follows:

\begin{matrix} ϖ (k) \in W ≜ {ϖ (k) : b_{ϖ}^{T} ϖ (k) \leq h_{ϖ}}, \end{matrix}

where

\begin{matrix} b_{ϖ} ≜ [\begin{matrix} 0 & 0 \\ 0 & b_{h} \end{matrix}], h_{ϖ} ≜ [\begin{matrix} 0 \\ {\bar{h}}_{h} \end{matrix}], {\bar{h}}_{h} ≜ [\begin{matrix} H \\ H \end{matrix}] . \end{matrix}

The nominal system of (27) is denoted as follows:

\begin{matrix} η (k + 1) = Φ_{η} η (k) + B_{η} c_{η} (k), \end{matrix}

where

η (k)

is the nominal state of

\overset{ˇ}{η} (k)

.

Denoting

e_{η} (k) ≜ \overset{ˇ}{η} (k) - η (k)

, we have the following:

\begin{matrix} e_{η} (k + 1) = Φ_{η} e_{η} (k) + F^{- 1} ϖ (k) . \end{matrix}

(28)

Then, constraints of

η (k)

and

c_{η} (k)

are obtained as follows:

\begin{matrix} η (k) & \in M ≜ {η (k) : b_{η}^{T} η (k) \leq h_{η} - b_{η}^{T} e_{η} (k)}, \\ c_{η} (k) & \in N ≜ {c_{η} (k) : {\tilde{b}}_{u}^{T} η (k) + b_{c_{η}}^{T} c_{η} (k) \leq h_{c_{η}} - ({\tilde{b}}_{u}^{T} e_{η} (k) - h_{d} (k))} . \end{matrix}

2.6. The MPC Scheme

To achieve smooth braking at preset points in AVs under constraints, an MPC scheme is proposed in this subsection. Firstly, the cost function of system (27) is denoted as follows:

\begin{matrix} J (\hat{ȷ} (k), {\hat{c}}_{η} (k)) \\ ≜ & \sum_{s = 0}^{N - 1} (∥ \hat{η} {(k + s | k) ∥}_{Q}^{2} + {∥ K_{η} \hat{η} (k + s | k) + {\hat{c}}_{η} (k + s | k) ∥}_{R}^{2}) + {∥ \hat{η} (k + N | k) ∥}_{P}^{2}, \end{matrix}

where

\hat{η} (k + s | k)

is the predictive state with

\hat{η} (k | k) = \overset{ˇ}{η} (k)

;

{\hat{c}}_{η} (k + s | k)

is the predictive decision variable;

\hat{ȷ} (k) ≜ {[{\hat{η}}^{T} (k | k) {\hat{η}}^{T} (k + 1 | k) \dots {\hat{η}}^{T} (k + N | k)]}^{T}

is the predictive state sequence;

{\hat{c}}_{η} (k) ≜ {[{\hat{c}}_{η}^{T} (k | k) {\hat{c}}_{η}^{T} (k + 1 | k) \dots {\hat{c}}_{η}^{T} (k + N - 1 | k)]}^{T}

is the predictive decision variable sequence with

{\hat{c}}_{η} (k | k) ≜ c_{η} (k)

;

Q > 0

and

R > 0

are the weighting matrices with appropriate dimensions;

P > 0

is the terminal penalty matrix to be designed; and N is the predictive horizon. Then, the optimization problem of MPC is denoted as follows:

Prob 1:

\begin{matrix} ({\hat{ȷ}}^{*} (k), {\hat{c}}_{η}^{*} (k)) ≜ arg min J (\hat{ȷ} (k), {\hat{c}}_{η} (k)) \end{matrix}

subject to

\begin{matrix} \hat{η} (k + s + 1 | k) = Φ_{η} \hat{η} (k + s | k) + B_{η} {\hat{c}}_{η} (k + s | k), \end{matrix}

(29)

\begin{matrix} b_{η}^{T} \hat{η} (k + s | k) \leq h_{η} - ϕ (k + s | k), s \in Z [0, N - 1], \end{matrix}

(30)

\begin{matrix} {\tilde{b}}_{u}^{T} \hat{η} (k + s | k) + b_{c_{η}}^{T} {\hat{c}}_{η} (k + s | k) \leq h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + s | k), s \in Z [0, N - 1], \end{matrix}

(31)

\begin{matrix} \hat{η} (k + N | k) \in X_{T} (k + N | k), \end{matrix}

(32)

where

\begin{matrix} ϕ (k + s | k) ≜ \sum_{j = 0}^{s - 1} max_{ϖ (k + s - 1 - j | k) \in W} b_{η}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + s - 1 - j | k), ϕ (k | k) ≜ 0, \\ χ (k + s | k) ≜ \sum_{j = 0}^{s - 1} max_{ϖ (k + s - 1 - j | k) \in W} {\tilde{b}}_{u}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + s - 1 - j | k), χ (k | k) ≜ 0, \\ e_{η} (k + s + 1 | k) = Φ_{η} e_{η} (k + s | k) + F^{- 1} ϖ (k + s | k), e_{η} (k | k) = 0, \\ X_{T} (k + N | k) ≜ \hat{X} (k + N | k) \cap \hat{C} (k + N | k), \\ \hat{X} (k + N | k) ≜ {\hat{η} (k + N | k) : b_{η}^{T} \hat{η} (k + N | k) \leq h_{η} - ϕ (k + N | k)}, \\ \hat{C} (k + N | k) ≜ {\hat{η} (k + N | k) : {\tilde{b}}_{u}^{T} \hat{η} (k + N | k) \leq h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + N | k)}, \\ {\hat{ȷ}}^{*} (k) ≜ {[{\hat{η}}^{* T} (k | k) {\hat{η}}^{* T} (k + 1 | k) \dots {\hat{η}}^{* T} (k + N | k)]}^{T}, \\ {\hat{c}}_{η}^{*} (k) ≜ {[{\hat{c}}^{* T} (k | k) {\hat{c}}^{* T} (k + 1 | k) \dots {\hat{c}}^{* T} (k + N - 1 | k)]}^{T} . \end{matrix}

Remark 3.

The polynomial complexity of the proposed RHESO-based MPC consists of three parts: (1) the polynomial complexity of the RHESO; (2) the polynomial complexity of Prob 1; and (3) the polynomial complexity of the input signal. The polynomial complexity of the RHESO is obtained as follows:

\begin{matrix} O ((I - L_{d} D) (m (k) + L_{d} ξ (k)) - L_{d} (A ξ (k) + B u (k)) + b (k)) + O (m (k) + L_{d} ξ (k)) \\ = & O (24) \end{matrix}

The polynomial complexity of Prob 1 is obtained as follows:

\begin{matrix} O (\sum_{s = 0}^{N - 1} (∥ \hat{η} {(k + s | k) ∥}_{Q}^{2} + {∥ K_{η} \hat{η} (k + s | k) + {\hat{c}}_{η} (k + s | k) ∥}_{R}^{2}) + {∥ \hat{η} (k + N | k) ∥}_{P}^{2}) \\ = & O (52 N + 31) \end{matrix}

The polynomial complexity of the input signal is obtained as follows:

\begin{matrix} O (K ξ (k) + c (k) - V \hat{d} (k)) = O (13) \end{matrix}

Then the polynomial complexity of the proposed RHESO-based MPC is obtained as follows:

\begin{matrix} O (24) + O (52 N + 31) + O (13) = O (52 N + 31) \end{matrix}

Definition 1

([38]). If for all

{\bar{Ξ}}_{1} > 0

, there exists a constant

{\bar{Ξ}}_{2} ({\bar{Ξ}}_{1})

such that we have the following:

\begin{matrix} ∥ \overset{ˇ}{η} (0) ∥ \leq {\bar{Ξ}}_{1} \Rightarrow ∥ \overset{ˇ}{η} (k) ∥ \leq {\bar{Ξ}}_{2} ({\bar{Ξ}}_{1}) \end{matrix}

holds for all

\overset{ˇ}{η} (0) \in R^{n}

and

k \in Z_{\geq 0}

, then system (27) is said to be uniformly bounded.

3. Results

In this part, a lemma is presented firstly to demonstrate the feasibility of Prob 1, and then a theorem is presented to verify the uniformly bounded stability of the closed-loop system.

Lemma 1.

If a feasible solution exists for Prob 1 at time instant k, then it also has at least one feasible solution at time instant

k + 1

.

Proof.

Denote a candidate decision variable sequence

{\tilde{c}}_{η} (k + 1)

as

\begin{matrix} {\tilde{c}}_{η} (k + 1) \\ ≜ & [\begin{matrix} {\tilde{c}}_{η} (k + 1 | k + 1) & \dots & {\tilde{c}}_{η} (k + N | k + 1) \end{matrix}] = [\begin{matrix} {\hat{c}}_{η}^{*} (k + 1 | k) & \dots & {\hat{c}}_{η}^{*} (k + N - 1 | k) & 0 \end{matrix}], \end{matrix}

where

{\tilde{c}}_{η} (k + 1 + s | k + 1)

is a candidate decision variable for

s \in Z [0, N - 1]

. Then, there exists

\begin{matrix} \hat{η} (k + 1 | k + 1) & = Φ_{η} \hat{η} (k | k) + B_{η} {\hat{c}}_{η}^{*} (k | k) + F^{- 1} ϖ (k) . \end{matrix}

(33)

Considering the predictive dynamics of

\overset{ˇ}{η} (k + 1)

using

{\tilde{c}}_{η} (k + 1)

at time instant

k + 1

, we have the following:

\begin{matrix} \tilde{η} (k + 1 + s + 1 | k + 1) = & Φ_{η} \tilde{η} (k + 1 + s | k + 1) + B_{η} {\tilde{c}}_{η} (k + 1 + s | k + 1) \\ + F^{- 1} ϖ (k + 1 + s | k + 1), \end{matrix}

(34)

where

\tilde{η} (k + 1 + s + 1 | k + 1)

is the feasible state for

s \in Z [0, N]

.

Combining (29), (33) and (34), we have the following:

\begin{matrix} \tilde{η} (k + s | k + 1) = {\hat{η}}^{*} (k + s | k) + Φ_{η}^{s - 1} F^{- 1} ϖ (k) . \end{matrix}

(35)

Denote

\begin{matrix} \tilde{ȷ} (k + 1) = [\begin{matrix} \tilde{η} (k + 1 | k + 1) & \dots & \tilde{η} (k + 1 + N | k + 1) \end{matrix}] \end{matrix}

as the feasible predictive state sequence. To guarantee that Prob 1 is feasible at time instant

k + 1

, the following conditions should be satisfied:

\begin{matrix} b_{η}^{T} \tilde{η} (k + 1 + s | k + 1) \leq h_{η} - ϕ (k + 1 + s | k + 1), s \in Z [0, N] \end{matrix}

(36)

\begin{matrix} {\tilde{b}}_{u}^{T} \tilde{η} (k + 1 + s | k + 1) & + b_{c_{η}}^{T} {\tilde{c}}_{η} (k + 1 + s | k + 1) \leq h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} \\ - χ (k + 1 + s | k + 1), s \in Z [0, N - 1] \end{matrix}

(37)

\begin{matrix} \tilde{η} (k + 1 + N | k + 1) \in X_{T} (k + 1 + N | k + 1) \end{matrix}

(38)

In the following, conditions (36)–(38) are proved one by one. Firstly, the proof of condition (36) is given. Assume that Prob 1 is feasible at time instant k. Then, the constraint (30) holds, which indicates the following:

\begin{matrix} b_{η}^{T} {\hat{η}}^{*} (k + 1 + s | k) \leq h_{η} - ϕ (k + 1 + s | k) \end{matrix}

(39)

for

s \in Z [0, N - 2]

. Combining (35) and (39), the following exists:

\begin{matrix} b_{η}^{T} (\tilde{η} (k + 1 + s | k + 1) - Φ_{η}^{s} F^{- 1} ϖ (k)) \leq h_{η} - ϕ (k + 1 + s | k), \end{matrix}

which yields

\begin{matrix} b_{η}^{T} \tilde{η} (k + 1 + s | k + 1) \\ \leq h_{η} - ϕ (k + 1 + s | k) + b_{η}^{T} Φ_{η}^{s} F^{- 1} ϖ (k) \\ = h_{η} - \sum_{j = 0}^{s - 1} max_{ϖ (k + s - 1 - j | k) \in W} b_{η}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + s - 1 - j | k) \\ + b_{η}^{T} Φ_{η}^{s} F^{- 1} (ϖ (k) - max_{ϖ (k + s - j | k) \in W} ϖ (k + s - j | k)) \\ \leq h_{η} - \sum_{j = 0}^{s - 1} max_{ϖ (k + s - 1 - j | k) \in W} b_{η}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + s - 1 - j | k) = h_{η} - ϕ (k + 1 + s | k + 1) \end{matrix}

for

s \in Z [0, N]

. Therefore, condition (36) is satisfied for

s \in Z [0, N]

.

Secondly, the proof of condition (37) is given. By using the candidate variable sequence

{\tilde{c}}_{η} (k + 1)

, we have that

{\tilde{c}}_{η} (k + 1 + s | k + 1) = {\hat{c}}_{η} (k + 1 + s | k)

. Based on (31) and (35), we have the following:

\begin{matrix} {\tilde{b}}_{u}^{T} (\tilde{η} (k + 1 + s | k + 1) - Φ_{η}^{s} F^{- 1} ϖ (k)) + b_{c_{η}}^{T} {\hat{c}}_{η} (k + 1 + s | k) \\ \leq & h_{c_{η}} - | h_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + 1 + s | k), \end{matrix}

which indicates the following:

\begin{matrix} {\tilde{b}}_{u}^{T} \tilde{η} (k + 1 + s | k + 1) + b_{c_{η}}^{T} {\tilde{c}}_{η} (k + 1 + s | k + 1) \\ \leq h_{c_{η}} - | h_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + 1 + s | k) + {\tilde{b}}_{u}^{T} Φ_{η}^{s} F^{- 1} ϖ (k) \\ = h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - \sum_{j = 0}^{s - 1} max_{ϖ (k + s - 1 - j | k) \in W} {\tilde{b}}_{u}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + s - 1 - j | k) \\ + {\tilde{b}}_{u}^{T} Φ_{η}^{s} F^{- 1} (ϖ (k) - max_{ϖ (k + s - j | k) \in W} ϖ (k + s - j | k)) \\ \leq h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - \sum_{j = 0}^{s - 1} max_{ϖ (k + s - 1 - j | k) \in W} {\tilde{b}}_{u}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + s - 1 - j | k) \\ = h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + 1 + s | k + 1) \end{matrix}

for

s \in Z [0, N - 2]

. For

s = N - 1

, there exists

{\tilde{c}}_{η} (k + N | k + 1) = 0

. Based on (32), (35) and

{\tilde{c}}_{η} (k + N | k + 1) = 0

, it is easy to obtain the following:

\begin{matrix} {\tilde{b}}_{u}^{T} (\tilde{η} (k + N | k + 1) - Φ_{η}^{N - 1} F^{- 1} ϖ (k)) \leq h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + N | k), \end{matrix}

which means that we have the following:

\begin{matrix} {\tilde{b}}_{u}^{T} \tilde{η} (k + N | k + 1) \\ \leq & h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + N | k) + {\tilde{b}}_{u}^{T} Φ_{η}^{N - 1} F^{- 1} ϖ (k) \\ = & h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - \sum_{j = 0}^{N - 2} max_{ϖ (k + N - 2 - j | k) \in W} {\tilde{b}}_{u}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + N - 2 - j | k) \\ - {\tilde{b}}_{u}^{T} Φ_{η}^{N - 1} F^{- 1} max_{ϖ (k + N - 1 - j | k) \in W} ϖ (k + N - 1 - j | k) + {\tilde{b}}_{u}^{T} Φ_{η}^{N - 1} F^{- 1} ϖ (k) \\ \leq & h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - \sum_{j = 0}^{N - 2} max_{ϖ (k + N - 2 - j | k) \in W} {\tilde{b}}_{u}^{T} Φ_{η}^{j} F^{- 1} ϖ (k + N - 2 - j | k) \\ = & h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + N | k + 1) . \end{matrix}

Therefore, condition (37) is satisfied for

s \in Z [0, N - 1]

.

Thirdly, the proof of condition (38) is proved, as

X_{T} (k + 1 + N | k + 1) ≜ \hat{X} (k + 1 + N | k) \cap \hat{C} (k + 1 + N | k + 1)

. In terms of (36), it follows that

\tilde{η} (k + 1 + N | k + 1) \in \hat{X} (k + 1 + N | k)

. Furthermore, based on (32), it is easy to obtain that

\tilde{η} (k + 1 + N | k + 1) \in \hat{C} (k + 1 + N | k)

. Therefore,

\tilde{η} (k + 1 + N | k + 1) \in X_{T} (k + 1 + N | k + 1)

is satisfied, such that condition (38) holds.

As a result, if Prob 1 is feasible at time instant k, then Prob 1 is also feasible at time instant

k + 1

. This completes the proof. □

Theorem 2.

If

Ξ > 0

and

Υ_{η}

exist, which make the linear matrix inequality (LMI),

\begin{matrix} [\begin{matrix} - Ξ & Ξ A_{η}^{T} + Υ_{η} B_{η}^{T} & Υ_{η} & Ξ \\ A_{η} Ξ^{T} + B_{η} Υ_{η}^{T} & - Ξ & 0 & 0 \\ Υ_{η}^{T} & 0 & - R^{- 1} & 0 \\ Ξ^{T} & 0 & 0 & - Q^{- 1} \end{matrix}] < 0 \end{matrix}

(40)

then system (22) is uniformly bounded and

K_{η} = Υ_{η}^{T} Ξ^{- 1}

with

Ξ^{- 1} = P

. Furthermore, the state of system (22) converges to the set

S ≜ {\overset{ˇ}{η} (k) : ∥ \overset{ˇ}{η} (k) ∥_{Q}^{2} \leq \bar{σ}}

, where

\begin{matrix} \bar{σ} ≜ & \sum_{s = 1}^{N - 1} \{{\bar{λ}}^{2} (Q) Ω^{2} (s) + 2 {\bar{λ}}^{2} (Q) Ω (s) F (s) + {\bar{λ}}^{2} (R) Ξ^{2} (s) + 2 {\bar{λ}}^{2} (R) Ξ (s) Ψ (s)\} \\ + {\bar{λ}}^{2} (P) Ω^{2} (N) + \bar{λ} (P) Ω (N) π (N), \end{matrix}

with

\begin{matrix} Ω (s) ≜ max_{ϖ (k) \in W} ∥ Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥, F (s) ≜ ∥ h_{η} - ϕ (k + s | k) ∥, \\ Ξ (s) ≜ max_{ϖ (k) \in W} ∥ K_{η} Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥, Ψ (s) ≜ ∥ h_{c_{η}} - | h_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + s | k) ∥ . \end{matrix}

Proof.

Considering the optimal solution of Prob 1 is obtained at time instant k, it can be concluded that Prob 1 has at least one feasible solution at time instant

k + 1

. Denote

J (\tilde{ȷ} (k + 1), {\tilde{c}}_{η} (k + 1))

as the feasible cost of Prob 1 at time instant

k + 1

with

\tilde{ȷ} (k + 1)

and

{\tilde{c}}_{η} (k + 1)

given in Lemma 1.

The optimal cost of Prob 1 at time instant k is denoted as

J ({\hat{ȷ}}^{*} (k), {\hat{c}}_{η}^{*} (k))

. Choose

Δ J (k) ≜ J (\tilde{ȷ} (k + 1), {\tilde{c}}_{η} (k + 1)) - J ({\hat{ȷ}}^{*} (k), {\hat{c}}_{η}^{*} (k))

. Split

Δ J (k) = Δ_{1} + Δ_{2} + Δ_{3}

with

\begin{matrix} Δ_{1} = & \sum_{s = 1}^{N - 1} \{∥ \tilde{η} {(k + s | k + 1) ∥}_{Q}^{2} - ∥ {\hat{η}}^{*} {(k + s | k) ∥}_{Q}^{2} + {∥ K_{η} \tilde{η} (k + s | k + 1) + {\tilde{c}}_{η} (k + s | k + 1) ∥}_{R}^{2} \\ - ∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} {(k + s | k) ∥}_{R}^{2}\}, \\ Δ_{2} = & ∥ \tilde{η} (k + T_{s} | k + 1) ∥_{Q}^{2} + ∥ K_{η} \tilde{η} (k + T_{s} | k + 1) ∥_{R}^{2} + {∥ \tilde{η} (k + 1 + T_{s} | k + 1) ∥}_{P}^{2} \\ - ∥ {\hat{η}}^{*} (k + T_{s} | k) ∥_{P}^{2}, \\ Δ_{3} = & - ∥ {\hat{η}}^{*} {(k | k) ∥}_{Q}^{2} - {∥ K_{η} {\hat{η}}^{*} (k | k) + {\hat{c}}_{η}^{*} (k | k) ∥}_{Q}^{2} . \end{matrix}

For

Δ_{1}

, we have the following:

\begin{matrix} Δ_{1} = & \sum_{s = 1}^{N - 1} \{∥ \tilde{η} {(k + s | k + 1) ∥}_{Q}^{2} - ∥ {\hat{η}}^{*} (k + s | k) ∥_{Q}^{2} + ∥ K_{η} ({\hat{η}}^{*} (k + s | k) + Φ_{η}^{s - 1} F^{- 1} ϖ (k)) \\ + {\tilde{c}}_{η} {(k + s | k + 1) ∥}_{R}^{2} - {∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} (k + s | k) ∥}_{R}^{2}\} \\ = & \sum_{s = 1}^{N - 1} \{∥ \tilde{η} {(k + s | k + 1) ∥}_{Q}^{2} - ∥ {\hat{η}}^{*} (k + s | k) ∥_{Q}^{2} + ∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} (k + s | k) \\ + K_{η} Φ_{η}^{s - 1} F^{- 1} {ϖ (k) ∥}_{R}^{2} - {∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} (k + s | k) ∥}_{R}^{2}\} \\ \leq & \sum_{s = 1}^{N - 1} \{(∥ \tilde{η} {(k + s | k + 1) ∥}_{Q} - {∥ {\hat{η}}^{*} (k + s | k) ∥}_{Q}) (∥ \tilde{η} {(k + s | k + 1) ∥}_{Q} + {∥ {\hat{η}}^{*} (k + s | k) ∥}_{Q}) \\ + (∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} (k + s | k) + K_{η} Φ_{η}^{s - 1} F^{- 1} {ϖ (k) ∥}_{R} \\ - ∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} {(k + s | k) ∥}_{R}) \times (∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} (k + s | k) \\ + K_{η} Φ_{η}^{s - 1} F^{- 1} {ϖ (k) ∥}_{R} + ∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} {(k + s | k) ∥}_{R})\} \\ \leq & \sum_{s = 1}^{N - 1} \{\bar{λ} (Q) ∥ Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥ \times (2 \bar{λ} (Q) ∥ {\hat{η}}^{*} (k + s | k) ∥ + \bar{λ} (Q) ∥ Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥) \\ + \bar{λ} (R) ∥ K_{η} Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥ (2 \bar{λ} (R) ∥ K_{η} {\hat{η}}^{*} (k + s | k) + {\hat{c}}_{η}^{*} (k + s | k) ∥ \\ + \bar{λ} (R) ∥ K_{η} Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥)\} \\ \leq & \sum_{s = 1}^{N - 1} \{\bar{λ} (Q) ∥ Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥ (2 \bar{λ} (Q) ∥ h_{η} - ϕ (k + s | k) ∥ + \bar{λ} (Q) ∥ Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥) \\ + \bar{λ} (R) ∥ K_{η} Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥ (2 \bar{λ} (R) ∥ h_{c_{η}} - {[({\tilde{h}}_{e_{d}} + \bar{d}) 0]}^{T} - χ (k + s | k) ∥ \\ + \bar{λ} (R) ∥ K_{η} Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥)\} \\ \leq & \sum_{s = 1}^{N - 1} \{{\bar{λ}}^{2} (Q) Ω^{2} (s) + 2 {\bar{λ}}^{2} (Q) Ω (s) F (s) + {\bar{λ}}^{2} (R) Ξ^{2} (s) + 2 {\bar{λ}}^{2} (R) Ξ (s) Ψ (s)\}, \end{matrix}

where

\begin{matrix} Ω (s) ≜ max_{ϖ (k) \in W} ∥ Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥, F (s) ≜ ∥ h_{η} - ϕ (k + s | k) ∥, \\ Ξ (s) ≜ max_{ϖ (k) \in W} ∥ K_{η} Φ_{η}^{s - 1} F^{- 1} ϖ (k) ∥, Ψ (s) ≜ ∥ h_{c_{η}} - | {\tilde{h}}_{e_{d}} + \bar{d} | {[b_{u} 0]}^{T} - χ (k + s | k) ∥ . \end{matrix}

For

Δ_{2}

, the following can be obtained:

\begin{matrix} Δ_{2} = & ∥ \tilde{η} {(k + N | k + 1) ∥}_{Q}^{2} + ∥ K_{η} \tilde{η} {(k + N | k + 1) ∥}_{R}^{2} + {∥ \tilde{η} (k + 1 + N | k + 1) ∥}_{P}^{2} \\ - ∥ {\hat{η}}^{*} {(k + N | k) ∥}_{P}^{2} + ∥ \tilde{η} {(k + N | k + 1) ∥}_{P}^{2} - {∥ \tilde{η} (k + N | k + 1) ∥}_{P}^{2} \\ \leq & (∥ \tilde{η} {(k + N | k + 1) ∥}_{P}^{2} - ∥ {\hat{η}}^{*} {(k + N | k) ∥}_{P}^{2}) - ∥ \tilde{η} {(k + N | k + 1) ∥}_{\bar{Q}}^{2} \\ \leq & ∥ \tilde{η} (k + N | k + 1) - {\hat{η}}^{*} {(k + N | k) ∥}_{P} (∥ \tilde{η} {(k + N | k + 1) ∥}_{P} + ∥ {\hat{η}}^{*} (k + N | k) ∥_{P}) \\ - ∥ \tilde{η} {(k + N | k + 1) ∥}_{\bar{Q}}^{2} \\ \leq & - ∥ \tilde{η} {(k + N | k + 1) ∥}_{\bar{Q}}^{2} + \bar{λ} (P) ∥ Φ_{η}^{N - 1} F^{- 1} ϖ (k) ∥ \\ \times (2 \bar{λ} (P) ∥ {\hat{η}}^{*} (k + N | k) ∥ + \bar{λ} (P) ∥ Φ_{η}^{N - 1} F^{- 1} ϖ (k) ∥), \end{matrix}

where

- \bar{Q} = Φ_{η}^{T} P Φ_{η} + Q + K_{η}^{T} R K_{η} - P

and

π (N) ≜ min {F (N), Ψ (N)}

. Letting

- \bar{Q} < 0

, we obtain the following:

\begin{matrix} Φ_{η}^{T} P Φ_{η} + Q + K_{η}^{T} R K_{η} - P < 0 \end{matrix}

which can be equivalently converted to LMI (40), such that K and

L_{d}

can be obtained by solving LMI (40). Thus,

∥ \tilde{η} {(k + N | k + 1) ∥}_{\bar{Q}}^{2} > 0

, and we have the following:

\begin{matrix} Δ_{2} \leq {\bar{λ}}^{2} (P) Ω^{2} (N) + \bar{λ} (P) Ω (N) π (N) . \end{matrix}

For

Δ_{3}

, there exists

Δ_{3} \leq - {∥ {\hat{η}}^{*} (k | k) ∥}_{Q}^{2}

. Combining

Δ_{1}

,

Δ_{2}

, and

Δ_{3}

, we obtain the following:

\begin{matrix} Δ J (k) \leq - ∥ {\hat{η}}^{*} {(k | k) ∥}_{Q}^{2} + \bar{σ} . \end{matrix}

with

\begin{matrix} \bar{σ} ≜ & \sum_{s = 1}^{N - 1} \{{\bar{λ}}^{2} (Q) Ω^{2} (s) + 2 {\bar{λ}}^{2} (Q) Ω (s) F (s) + {\bar{λ}}^{2} (R) Ξ^{2} (s) + 2 {\bar{λ}}^{2} (R) Ξ (s) Ψ (s)\} \\ + {\bar{λ}}^{2} (P) Ω^{2} (N) + \bar{λ} (P) Ω (N) π (N) . \end{matrix}

According to the optimality principle, it is easy to obtain the following:

\begin{matrix} J ({\hat{ȷ}}^{*} (k + 1), {\hat{c}}_{η}^{*} (k + 1)) - J ({\hat{ȷ}}^{*} (k), {\hat{c}}_{η}^{*} (k)) \leq & J (\tilde{ȷ} (k + 1), {\tilde{c}}_{η} (k + 1)) - J ({\hat{ȷ}}^{*} (k), {\hat{c}}_{η}^{*} (k)) \\ \leq & - ∥ {\hat{η}}^{*} {(k | k) ∥}_{Q}^{2} + \bar{σ} \\ = & - ∥ \overset{ˇ}{η} {(k) ∥}_{Q}^{2} + \bar{σ} . \end{matrix}

Therefore,

J ({\hat{ȷ}}^{*} (k), {\hat{c}}_{η}^{*} (k))

can be used as the Lyapunov function. As a result,

\overset{ˇ}{η} (k)

can converge to the set

S

as

k \to + \infty

, suggesting that system (22) is uniformly bounded, such that the uniform boundedness of system (11) and error system (18) can also be guaranteed. □

Remark 4.

Compared to existing smooth and precise braking methods [13,14,15], the RHESO-based MPC proposed in this paper has the following advantages: (1) A new neural network-based vehicle kinematic model is proposed, which simulates the dynamic characteristics of vehicles through neural networks to supplement the shortcomings of the kinematic model in accurately describing the throttle and brake control on the vehicle speed; (2) the proposed RHESO can estimate the unmodeled dynamics and potential disturbances in AV kinematic modeling, providing preliminary theoretical support for the AV to achieve smooth and accurate preset point braking; and (3) design an AV speed control method based on MPC, taking into account the inherent speed and control constraints of AV to enable smooth and accurate braking at preset points.

4. Numerical Example

An electric AV model provided by the CarSim software is employed to obtain the associated data of the throttle/brake on vehicle speed and acceleration. Therefore, the corresponding approximate dynamic model can be further obtained by using the proposed NN. Partial parameters of the electric AV used in this example are given in Table 2.

As this paper focuses on longitudinal control, the driving angle

\bar{θ} = 60^{°}

is selected in (2). The external disturbance is selected as

ε (t) = sin (t + 0.2)

. Other simulation parameters are given in Table 3.

The activation function vector and the weight matrix for the NN are reconstructed as follows:

\begin{matrix} σ (t) = {[v (t) u_{o} (t) 1]}^{T}, {\hat{W}}^{*} = & {[- 0.21 - 1.58 1.09]}^{T} . \end{matrix}

The matrices

L_{d}

, K, and P are obtained by solving LMI (40) as follows:

\begin{matrix} L_{d} = & [0 0 - 6.1], K = [1.4 2.6 3.6], \\ P = & [\begin{matrix} 2576.1 & - 1376.5 & 18.3 & 0 \\ - 1376.5 & 1003.7 & 33.9 & 0 \\ 18.3 & 33.9 & 43.9 & 0 \\ 0 & 0 & 0 & 20.9 \end{matrix}] . \end{matrix}

The initial state of the electric AV is selected as follows:

\begin{matrix} ξ (0) = {[- 15.1 - 26.2 8]}^{T} . \end{matrix}

To display the effectiveness of the proposed RHESO-based MPC for achieving smooth braking at preset points in AVs, its numerical simulation result is compared with those obtained by the traditional ESO-based MPC, active disturbance rejection control (ADRC), and robust MPC (RMPC). The simulation results are displayed in Figure 3, Figure 4, Figure 5 and Figure 6. Figure 3 shows the position response of the electric AV using the proposed RHESO-based MPC, as well as traditional ESO-based MPC, ADRC, and RMPC. It can be clearly seen that the position components of the AV, i.e.,

x (t)

and

y (t)

, enable it to converge well to the origin when the proposed RHESO-based MPC is employed. However, if the traditional ESO-based MPC, ADRC, and RMPC are adopted, the AV’s position components

x (t)

and

y (t)

cannot converge to the origin. Similarly, Figure 4 exhibits the velocity response of the electric AV obtained by the proposed RHESO-based MPC, traditional ESO-based MPC, ADRC, and RMPC. The simulation results in Figure 3 and Figure 4 reveal that the proposed RHESO-based MPC allows for achieving smooth braking at designated locations in the electric AV. On the contrary, smooth braking at designated positions in the electric AV is difficult to achieve if the traditional ESO-based MPC, ADRC, and RMPC are applied. Figure 5 plots the control input of the electric AV. Furthermore, the history of estimation errors obtained by the proposed RHESO and the traditional ESO are shown in Figure 6. Note that the error convergence speed of the traditional ESO is faster than that of the proposed RHESO, but to some extent, the existence of optimized estimation errors makes the electric AV’s braking process smoother and more accurate, which is another advantage of the proposed RHESO-based MPC.

In the following, simulation results under another set of parameters are shown. The initial conditions are given as follows:

\begin{matrix} ξ (0) = {[- 30 - 17.3 18]}^{T}, \bar{θ} = 30^{\circ}, ε (t) = 5 sin (0.2 t) + cos (t) . \end{matrix}

Figure 7, Figure 8, Figure 9 and Figure 10 show the impacts of the proposed RHESO-MPC and traditional ESO-based MPC, ADRC, and RMPC on AV when the initial distance and initial speed are relatively large. From Figure 7, Figure 8, Figure 9 and Figure 10, it is easy to observe that the RHESO-MPC outperforms traditional ESO-based MPC, ADRC, and RMPC in terms of position control accuracy and speed convergence accuracy when both the initial position and speed increase. As a conclusion, from Figure 3, Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10, the effectiveness of the proposed method is verified.

5. Conclusions

This paper analyzed the problem of smooth braking at preset points in AVs. Given the difficulty of obtaining dynamic model parameters for AVs and the limitations of using kinematic models for servo control, an NN was designed to approximate the velocity order model within the dynamic model, enabling the use of throttle/brake openings as control signals. A new extended kinematic model of AVs was proposed by integrating the proposed NN model with the traditional kinematic model, which facilitates longitudinal speed servo control. An RHESO-based MPC method was also proposed to achieve smooth braking at preset points in AVs. Furthermore, sufficient conditions were established to guarantee the recursive feasibility of the optimization problem and the practical stability of the closed-loop system. Finally, the effectiveness of the proposed method was validated through an illustrative numerical simulation, demonstrating that the proposed extended kinematic model can effectively control velocity and position and that the RHESO-based MPC method ensures smooth braking at preset points.

Author Contributions

Conceptualization, J.C.; methodology, Y.X.; software, Y.X.; validation, J.C., Y.X. and Z.Z.; formal analysis, J.C.; investigation, Y.X.; resources, Y.X.; data curation, Z.Z.; writing—original draft preparation, Y.X.; writing—review and editing, J.C.; supervision, Z.Z.; project administration, J.C.; funding acquisition, J.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by the Shaanxi Province Natural Science Basic Research Plan 2023-JC-QN-0007, Taicang Basic Research Plan TC2023JC02, and the Fundamental Research Funds for the Central Universities.

Data Availability Statement

Data used in this paper are presented in the Section 4.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AV	autonomous vehicle
MPC	model predictive control
NN	neural network
RHESO	receding horizon extended state observer
PS	position and speed
PID	proportional–integral–derivative
ESO	extended state observer

References

Fu, Y.; Li, C.; Yu, F.R.; Luan, T.H.; Zhang, Y. A Decision-making strategy for vehicle autonomous braking in emergency via deep reinforcement learning. IEEE Trans. Veh. Technol. 2020, 69, 5879–5888. [Google Scholar] [CrossRef]
Guo, J.; Hu, P.; Wang, R. Nonlinear coordinated steering and braking control of vision-based autonomous vehicles in emergency obstacle avoidance. IEEE Trans. Intell. Transp. Syst. 2016, 17, 3230–3240. [Google Scholar] [CrossRef]
Wahid, S.M.S. Automotive brake wear: A review. Environ. Sci. Pollut. Res. 2018, 25, 174–180. [Google Scholar] [CrossRef] [PubMed]
Rasouli, A.; Tsotsos, J.K. Autonomous vehicles that interact with pedestrians: A survey of theory and practice. IEEE Trans. Intell. Transp. Syst. 2020, 21, 900–918. [Google Scholar] [CrossRef]
Kong, X.; Li, M.; Tang, T.; Tian, K.; Matias, L.M.; Xia, F. Shared subway shuttle bus route planning based on transport data analytics. IEEE Trans. Autom. Sci. Eng. 2018, 15, 1507–1520. [Google Scholar] [CrossRef]
Serrano, C.; Delorme, X.; Dolgui, A. Scheduling of truck arrivals, truck departures and shop-floor operation in a cross-dock platform, based on trucks loading plans. Int. J. Prod. Econ. 2017, 194, 102–112. [Google Scholar] [CrossRef]
Zhang, Z.; Ding, M.; Ding, Y.; Ma, G. Research on special vehicle detection and passenger elevator docking behavior recognition in intelligent monitoring. In Proceedings of the 6th International Conference on Information Science, Computer Technology and Transportation, Xishuangbanna, China, 26–28 November 2021. [Google Scholar]
Gerdes, J.C.; Hedrick, J.K. Vehicle speed and spacing control via coordinated throttle and brake actuation. Control Eng. 1997, 5, 1607–1614. [Google Scholar] [CrossRef]
Attia, R.; Orjuela, R.; Basset, M. Combined longitudinal and lateral control for automated vehicle guidance. Vehicle Syst. Dyn. 2014, 52, 261–279. [Google Scholar] [CrossRef]
Santis, R.M.D. A novel PID configuration for speed and position control. J. Dyn. Syst. Meas. Control 1994, 116, 542–549. [Google Scholar] [CrossRef]
Ioannou, P.; Xu, Z.; Eckert, S.; Clemons, D.; Sieja, T. Intelligent cruise control: Theory and experiment. In Proceedings of the IEEE Conference on Decision and Control, San Antonio, TX, USA, 15–17 December 1993; pp. 1885–1890. [Google Scholar]
Chen, Y.; Wang, J. Adaptive vehicle speed control with input injections for longitudinal motion independent road frictional condition estimation. IEEE Trans. Veh. Technol. 2011, 60, 839–848. [Google Scholar] [CrossRef]
Cao, Y.; Shi, T.; Niu, X.; Li, X.; Xia, C. A smooth torque control strategy for brushless DC motor in braking operation. IEEE Trans. Energy Convers. 2018, 33, 1443–1452. [Google Scholar] [CrossRef]
Xu, S.; Peng, H.; Song, Z.; Chen, K.; Tang, Y. Accurate and smooth speed control for an autonomous vehicle. In Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, Suzhou, China, 26–30 June 2018; pp. 26–30. [Google Scholar]
Liu, Y.; Xu, B.; Ding, Y. Convergence analysis of cooperative braking control for interconnected vehicle systems. IEEE Trans. Intell. Transp. Syst. 2017, 18, 1894–1906. [Google Scholar] [CrossRef]
Wu, X.; Wei, C.; Tian, H.; Wang, W.; Jiang, C. Fault-tolerant control for path-following of independently actuated autonomous vehicles using tube-based model predictive control. IEEE Trans. Intell. Transp. Syst. 2022, 23, 20282–20297. [Google Scholar] [CrossRef]
Yang, H.; Wang, Z.; Xia, Y.; Zuo, Z. EMPC with adaptive APF of obstacle avoidance and trajectory tracking for autonomous electric vehicles. ISA Trans. 2023, 135, 438–448. [Google Scholar] [CrossRef] [PubMed]
Luan, Z.; Zhang, J.; Zhao, W.; Wang, C. Trajectory tracking control of autonomous vehicle with random network delay. IEEE Trans. Veh. Technol. 2020, 69, 8140–8150. [Google Scholar] [CrossRef]
Nguyen, A.T.; Rath, J.; Guerra, T.M.; Palhares, R.; Zhang, H. Robust set-invariance based fuzzy output tracking control for vehicle autonomous driving under uncertain lateral forces and steering constraints. IEEE Trans. Intell. Transp. Syst. 2021, 22, 5849–5860. [Google Scholar] [CrossRef]
Wang, X.; Sun, W. Trajectory tracking of autonomous vehicle: A differential flatness approach with disturbance-observer-based control. IEEE Trans. Intell. Veh. 2023, 8, 1368–1379. [Google Scholar] [CrossRef]
Maghenem, M.; Loría, A.; Panteley, E. A cascades approach to formation-tracking stabilization of force-controlled autonomous vehicles. IEEE Trans. Autom. Control 2018, 63, 2662–2669. [Google Scholar] [CrossRef]
Xu, S.; Li, S.E.; Peng, H.; Cheng, B.; Zhang, X.; Pan, Z. Fuel-saving cruising strategies for parallel HEVs. IEEE Trans. Veh. Technol. 2016, 65, 4676–4686. [Google Scholar] [CrossRef]
Xu, S.; Li, S.E.; Cheng, B.; Li, K. Instantaneous feedback control for a fuel-prioritized vehicle cruising system on highways with a varying slope. IEEE Trans. Intell. Transp. Syst. 2017, 18, 1210–1220. [Google Scholar] [CrossRef]
Wu, J.; Zhou, H.; Liu, Z.; Gu, M. Ride comfort optimization via speed planning and preview semi-active suspension control for autonomous vehicles on uneven roads. IEEE Trans. Veh. Technol. 2020, 69, 8343–8355. [Google Scholar] [CrossRef]
Patil, O.S.; Le, D.M.; Greene, M.L.; Dixon, W.E. Lyapunov-derived control and adaptive update laws for inner and outer layer weights of a deep neural network. IEEE Control Syst. Lett. 2022, 6, 1855–1860. [Google Scholar] [CrossRef]
Chai, R.; Tsourdos, A.; Savvaris, A.; Chai, S.; Xia, Y.; Chen, C.L.P. Design and implementation of deep neural network-based control for automatic parking maneuver process. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 1400–14143. [Google Scholar] [CrossRef] [PubMed]
Guo, L.; Chen, W. Disturbance attenuation and rejection for systems with nonlinearity via DOBC approach. Int. J. Robust Nonlinear Control 2005, 15, 109–125. [Google Scholar] [CrossRef]
Cui, Y.; Qiao, J.; Zhu, Y.; Yu, X.; Guo, L. Velocity-tracking control based on refined disturbance observer for gimbal servo system with multiple disturbances. IEEE Trans. Ind. Electron. 2022, 69, 10311–10321. [Google Scholar] [CrossRef]
Liu, Y.; Wang, H.; Guo, L. Composite robust H_∞ control for uncertain stochastic nonlinear systems with state delay via a disturbance observer. IEEE Trans. Autom. Control 2018, 63, 4345–4352. [Google Scholar] [CrossRef]
Xie, Y.; Qiao, J.; Yu, X.; Guo, L. Dual-disturbance observers-based control for a class of singularly perturbed systems. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 2423–2434. [Google Scholar] [CrossRef]
Zhao, L.; Zhang, B.; Yang, H.; Wang, Y. Observer-based integral sliding mode tracking control for a pneumatic cylinder with varying loads. IEEE Trans. Syst. Man Cybern. Syst. 2020, 50, 2650–2658. [Google Scholar] [CrossRef]
Chang, S.; Wang, Y.; Zuo, Z.; Yang, H. Fixed-time formation control for wheeled mobile robots with prescribed performance. IEEE Trans. Control Syst. Technol. 2022, 30, 844–851. [Google Scholar] [CrossRef]
Xu, H.; Zhang, J.; Yang, H.; Xia, Y. Extended state functional observer-based event-driven disturbance rejection control for discrete-time systems. IEEE Trans. Cybern. 2022, 52, 6949–6958. [Google Scholar] [CrossRef]
Jagannathan, S. Neural Network Control of Nonlinear Discrete-Time Systems; CRC Press: Boca Raton, FL, USA, 2006. [Google Scholar]
Hornik, K.; Stinchcombe, M.; White, H. Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Netw. 1990, 3, 551–560. [Google Scholar] [CrossRef]
Li, M.; Wu, H.; Wang, Y.; Handroos, H.; Carbone, G. Modified Levenberg-Marquardt algorithm for backpropagation neural network training in dynamic model identification of mechanical systems. J. Dyn. Syst. Meas. Control 2017, 139, 031012. [Google Scholar] [CrossRef]
Tao, G. Adaptive Control Design and Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2003. [Google Scholar]
Peuteman, J.; Aeyels, D.; Sepulchre, R. Boundedness properties for time-varying nonlinear systems. SIAM J. Control Optim. 2000, 39, 1408–1422. [Google Scholar] [CrossRef]

Figure 1. The vehicle model.

Figure 2. The control framework diagram.

Figure 3. Curves of position components obtained by four methods.

Figure 4. Curves of velocity components obtained by four methods.

Figure 5. Curves of the control input obtained by four methods.

Figure 6. Curves of the estimation error obtained by two methods.

Figure 7. Curves of position components obtained by four methods.

Figure 8. Curves of velocity components obtained by four methods.

Figure 9. Curves of the control input obtained by four methods.

Figure 10. Curves of the estimation error obtained by two methods.

Table 1. Notation used in this paper.

$R$	Set of Real Numbers
$Z_{\geq 0}$	Set of nonnegative integers
$Z^{+}$	Set of positive integers
$Z [a, b]$	The positive integer set ${a, a + 1, \dots, b}$
$A \in R^{m \times n}$	The dimensions of A are $m \times n$
$A > 0$	A is a positive definite matrix
$A < 0$	A is a negative definite matrix
I	The unit matrix with appropriate dimensions
$∥ \cdot ∥$	The Euclidean norm
$0$	The matrix full of 0 with appropriate dimensions
$\bar{λ} (A)$	The maximum eigenvalue of A
$\underset{̲}{λ} (A)$	The minimum eigenvalue of A
$x (k + s \| k)$	The prediction of x at the time instant $k + s$ from the time instant k

Table 2. Partial parameters of the electric AV.

Parameter	Value
Sprung mass	1270 kg
Wheelbase	2910 mm
Vehicle height	1730 mm
Vehicle width	2082 mm
Roll inertia	536.6 kg· $m^{2}$
Pitch inertia	1536.7 kg· $m^{2}$
Yaw inertia	1536.7 kg· $m^{2}$
Brake torque at front wheel	250 N· $m / Mpa$
Brake torque at rear wheel	150 N· $m / Mpa$

Table 3. Parameters used in the simulation.

Parameter	$Λ$	$κ$	$\bar{ε}$	H
Value	0.5	1.19	3.5	0
Parameter	${\hat{W}}_{1}^{*}$	${\hat{W}}_{2}^{*}$	$a_{σ}$	$b_{σ}$
Value	−0.21	−1.58	1	1
Parameter	${\tilde{h}}_{1, min, ξ}$	${\tilde{h}}_{1, max, ξ}$	${\tilde{h}}_{2, min, ξ}$	${\tilde{h}}_{2, max, ξ}$
Value	−100 m	100 m	−100 m	100 m
Parameter	${\tilde{h}}_{3, min, ξ}$	${\tilde{h}}_{3, max, ξ}$	${\tilde{h}}_{min, u}$	${\tilde{h}}_{max, u}$
Value	0 m/s	50 m/s	0	9
Parameter	N	Q	R	T
Value	10	$150 I$	I	0.1 s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, J.; Xu, Y.; Zheng, Z. Neural Network and Extended State Observer-Based Model Predictive Control for Smooth Braking at Preset Points in Autonomous Vehicles. Drones 2024, 8, 273. https://0-doi-org.brum.beds.ac.uk/10.3390/drones8060273

AMA Style

Chen J, Xu Y, Zheng Z. Neural Network and Extended State Observer-Based Model Predictive Control for Smooth Braking at Preset Points in Autonomous Vehicles. Drones. 2024; 8(6):273. https://0-doi-org.brum.beds.ac.uk/10.3390/drones8060273

Chicago/Turabian Style

Chen, Jianlin, Yang Xu, and Zixuan Zheng. 2024. "Neural Network and Extended State Observer-Based Model Predictive Control for Smooth Braking at Preset Points in Autonomous Vehicles" Drones 8, no. 6: 273. https://0-doi-org.brum.beds.ac.uk/10.3390/drones8060273

Article Menu

Neural Network and Extended State Observer-Based Model Predictive Control for Smooth Braking at Preset Points in Autonomous Vehicles

Abstract

1. Introduction

2. Problem Formulations and Preliminaries

2.1. System Model

2.2. Estimation of the Unknown NN Parameter

2.3. Linearization Model and Constraints of the AV

2.4. The Design of RHESO

2.5. Control Input and System Reconstruction

2.6. The MPC Scheme

3. Results

4. Numerical Example

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI