1. Introduction
Gases that trap heat in the atmosphere are called greenhouse gas (
), which lead to global warming. Based on data published by the US EPA, carbon dioxide is the main component of
, accounting for 79%. Human activities are responsible for almost all of the increase in greenhouse gases in the atmosphere over the last 150 years. Transportation is the primary source of greenhouse gas emissions in the US [
1]. Based on the data in 2020, over 90% of the fuel used for transportation is petroleum based, which includes primarily gasoline and diesel. Therefore, the fuel economy of the vehicle is critical to help to reduce
emissions and mitigate global warming.
Nowadays, electric vehicles are one of the promising solutions to reduce
emissions [
2,
3,
4,
5]. In general, electric vehicles can be classified into three types [
2,
6,
7]: (1) battery electric vehicles (BEVs), which use the battery to store energy; (2) fuel cell electric vehicles (FCEVs), which use hydrogen and oxygen to generate electricity; (3) hybrid and plug-in hybrid electric vehicles (HEVs/PHEVs), which have two powertrains, one is driven by the internal combustion engine, and the other one is driven by the electric motor (e-Motor). In this paper, we focus on PHEVs.
Due to the flexibility of the two powertrains, a PHEV can have four different operating modes: (1) engine-only mode, (2) e-Motor-only mode, (3) combined power or blended mode, and (4) regenerative braking mode. The blended mode generally involves a control strategy to distribute torque between the ICE and e-Motor so that each of them can operate in its optimal performance region. In addition, this mode can be further classified into two modes, which are charge-depletion (CD) mode and charge-sustaining (CS) mode [
8]. During the CD mode, the vehicle is driven mostly by the e-Motor until the battery is discharged to the pre-set value. In CS mode, however, there are constraints on the battery state of charge (
) and
needs to be maintained within a certain range. An appropriate optimal power management strategy between different operating modes and real-time torque distribution between the ICE and the e-Motor is a key to optimizing fuel consumption or fuel economy.
In the past decades, extensive research on power management strategies [
9,
10,
11] has been conducted by many researchers. Some researchers have focused on optimizing the fuel consumption and
emissions for a PHEV with different methods, and these methods have been classified and summarized by many researchers [
11,
12,
13]. In general, these methods can be classified into two categories: offline power management strategies and online power management strategies.
The offline power management strategies require prior knowledge of the global information to calculate the optimal solution. Consequently, these methods cannot be implemented in real-time. Dynamic programming (DP) is one of the optimal methods in this area and has been widely used for the analysis of sequential decision-making problems [
14,
15,
16]. In general, DP solves a global optimization problem by breaking it into several sub-problems, then it will search different control inputs from the final state, and examine the control sets to get the minimum cost as the optimal final solution. However, DP is very computationally intensive and it requires prior knowledge of the entire drive cycle information which means this method cannot be implemented in real-time on a vehicle. Hence, DP simulation results will be used as an optimal reference control strategy for comparison in this paper.
As opposed to offline power management strategies, online power management strategies can be applied to real-time problems, and many methods have been studied and developed. These methods can be further classified into two categories: rule-based methods and optimization-based methods [
11,
12,
13]. Fuzzy logic belongs to the rule-based methods. Besides fuzzy logic, meta-heuristic methods have been applied to many mechanical design problems and readers can find some comprehensive reviews in [
17,
18]. Model predictive control (MPC) and equivalent consumption minimization strategy (ECMS) belong to the optimization-based methods. In this paper, NN belongs to the rule-based controller since it learns the control policy of DP.
MPC relies on prediction models to obtain a control action by solving an online optimization problem over a finite horizon. The main advantage of the MPC strategy is its ability to handle constraints on states, inputs, and outputs, and thereby take system limitations into account. This allows for operating a system closer to the input and state-space boundaries, a property that could be exploited to enhance profitability [
19,
20]. With these advantages, the MPC algorithm is widely used in industry. The main disadvantage of MPC is that it is often too slow to apply to systems with rapid dynamics [
19,
20].
ECMS was presented by Paganelli and tries for instantaneous optimization, taking battery
into account [
21,
22,
23]. Additionally, it has been further developed by many researchers in recent years. ECMS is a common strategy in this area, and it can be implemented for real-time problems. However, ECMS suffers from the lack of generality in the cost function and strongly depends on the equivalence factor [
24,
25,
26]. Since we will only compare the performance of two city drive cycles, it is easy for us to tune the equivalence factor and take the benefits from ECMS. Therefore, the ECMS strategy is applied in this paper to compare with DP and ANNs.
Nowadays, machine learning is studied by many researchers in different research fields, and various machine learning-based controllers have been developed, such as the deep learning-based inverse model for internal combustion engine fuel economy control [
27,
28] and the PHEV power management strategy [
29,
30]. ANNs are used in this paper to learn the DP solution so as to generate a real-time solution for torque split without requiring knowledge of the drive cycle. The main idea of the ANN supervisory controller in this paper is that an ANN controller is trained from optimized torque splits obtained from offline techniques applied to the existing drive cycles, then it can generate a control policy so that the controller can obtain the solutions for arbitrary drive cycles in real-time to mimic the offline control algorithms. It has lots of advantages, such as the controller not being limited by the specific driving conditions which means the ANN controller can be used for different countries or different driving habits, and the training set can also be modified to retrain the model, which makes the ANN model more adaptable.
In [
29], two ANN models were developed for two input conditions, with or without trip information, under the CD-CS mode. However, the performance of their controller in CD-CS mode is not significant compared to the default mode. Similar results were found in our previous study [
31]. Furthermore, we found that for highway drive conditions, there are few benefits to using the blended-CD mode and blended-CS mode since very few start-stops will occur in this drive condition and the vehicle mostly operates in an almost constant-velocity region. So the ICE-only mode for CS or e-Motor-only mode for CD is likely to be more beneficial to highway conditions. However, urban driving conditions can potentially benefit from the blended-CS mode. In the blended-CS mode, the battery
is maintained and the PHEV acts like a normal HEV. In addition, we found 20%
threshold to switch from CD mode to CS mode under the city drive conditions has great potential benefits for the total
emissions with 10%
as the lower boundary to protect the battery [
31].
So, in this paper, we continued and extended our previous study to focus on urban driving conditions with a low initial
condition of 20%. The lower and upper
boundaries are 10% and 30%, respectively. The main motivation for using ANN in this research is to leverage machine learning to replicate the DP algorithm under urban city drive conditions for online real-time problems. Compared to [
29], fewer inputs and a different output are selected in our research presented here. The ANN inputs and output selection will be introduced in
Section 3.3. Furthermore, our ANN controller does not require any trip information which means there is only one controller needed for the whole power management instead of two separate controllers based on different inputs. In addition, our ANNs’ output (torque split) is more directly implementable on real vehicles since the torque split is straightforward to obtain the desired engine torque and e-Motor torque, which can be sent to the engine controller and e-Motor controller to convert them into fuel injection and current output signals. We applied and set the DP controller as the baseline and several ANN controllers were developed. Two completely different urban driving conditions were used for the comparison with the DP solution as well as the ECMS method. Our results show that ANN can mimic DP very well, even under different urban conditions.
On the other hand, ANN also has some apparent disadvantages, such as its black-box nature. Therefore, finding an efficient method to train artificial neural networks is very important for researchers in this field, which is one of the motivations and contributions of this study. In this paper, we developed several ANN supervisory controllers with different hyper-parameters to replicate DP results. A total of 31 city drive cycles with over 30,000 data points are used to train and validate the ANN controllers. We studied the effects of hyper-parameters of the ANN on the results for the city drive conditions and observed a general rule that more than two hidden layers in the ANN is a more efficient way to obtain an ANN model that has better training MSE.
2. Vehicle Modeling
As part of the EcoCAR2 competition, a traditional fuel-powered vehicle, a Chevrolet Malibu 2013, was modified into a PHEV with parallel through-the-road (PTTR) architecture in which the internal combustion engine drives the front wheels, and the rear wheels are driven by an electric motor. The two powertrains can work independently of each other but are connected in parallel, through the road, as the front and rear wheels rotate at the same speed (in no-slip conditions). Since our vehicle model simply combines the torques from the two powertrains (ICE and e-Motor), the results are independent of specific vehicle architecture, as long as the vehicle is set up as a parallel hybrid vehicle. The vehicle specifications are listed in
Table 1.
Since the test vehicle has two parallel powertrains, ICE and e-Motor, the total
emissions can be expressed as the sum of fuel
emissions and electricity
emissions:
In a PHEV,
emissions are generated during the burning of fossil fuel in the ICE, the creation of the fuel, and the production of electricity. Thus, the well-to-wheel (WTW)
emissions, which include the emissions during the creation of the energy and its application process, make more sense to be used as the metric for
emissions evaluation instead of the ICE exhaust
emissions. Taking that into consideration, the fuel
emissions and electricity
emissions can be expressed as:
where
is the fuel consumption,
is the electricity energy consumption, and
and
are the coefficients of diesel WTW
emissions and electricity WTW
emissions, respectively. The values are taken from the Argonne National Laboratory’s Greet Model [
32].
Based on the vehicle speed which is given by the drive cycle, the traction load required at the wheel can be modeled as:
where
is the total traction force required at the wheel,
is the resistance force,
v is the vehicle velocity,
A,
B, and
C are the loss coefficients that are taken from EPA dynamometer testing data [
33],
is the inertia force, and
is the mass of the vehicle.
Since both ICE and e-Motor contribute to the traction force, the equation can be further expressed as:
where
is the engine torque at the wheels,
is the e-Motor torque at the wheels,
is the radius of the wheel, and
is the required torque at the wheels.
The torque split ratio between the ICE and e-Motor will determine the
and
. It is defined as Equation (
8). The torque split is constrained within the range from −1 to 1. The torque split and its corresponding operation mode are shown in
Table 2. A negative torque split value means that the engine is providing more torque than the vehicle required, and the e-Motor is charging the battery pack. At regeneration mode, the required torque is negative, and the e-Motor will absorb energy from braking.
The angular velocities of the wheel, the ICE, and the e-Motor can be obtained from:
where
,
,
are the angular velocities of the vehicle wheels, ICE, and e-Motor, respectively.
,
,
are the engine differential gear ratio, transmission gear ratio, and e-Motor differential gear ratio, respectively.
The transmission gear number can be calculated from the transmission model based on the transmission shift map for the 6T40 GM transmission [
34]. Additionally, ICE torque and e-Motor torque can be expressed as:
where
and
are the ICE output torque and e-Motor output torque,
i =
or
, and min(
), max(
) are the minimum and maximum output torques due to the mechanical limitations of the ICE and the e-Motor which are obtained from tests under various speeds of the ICE and e-Motor.
To calculate the fuel
emissions, it is first necessary to model the fuel consumption. Based on dynamometer tests, the fuel consumption is approximated as a second-order polynomial function of the engine speed and engine torque [
31]:
where the units of
,
, and
are
/stroke, rpm, and Nm, respectively;
are tuned parameters as listed in
Table 3.
The e-Motor
emissions are calculated from the energy consumption of electricity,
:
Besides
emissions,
is the other metric used for the analysis and comparison of each power management strategy. The
of the battery can be expressed as:
where
is the estimated
value at time
t,
is the battery current,
is the nominal battery capacity and
is the time step.
The discharging current and previously determined values, , are used to estimate current , (t + ).
To calculate the
, it is assumed that the energy transfer efficiency from battery electrical energy to e-Motor mechanical energy or vice versa is constant. Thus,
where
is the energy transfer efficiency and a 10% loss is modeled for the accessory losses,
is the voltage of the battery with 300 V at all times for simplicity.
4. Results Comparison
The UDDS and NYCC-LD urban drive cycles, which are not included in the training set, are used for validation by comparing the results with DP and ECMS. The UDDS drive cycle simulates an urban route of 12.07 km (7.5 miles) with frequent stops, and the maximum speed is 91.25 km/h (56.7 mph). Additionally, the average speed of UDDS is 31.5 km/h (19.6 mph). The NYCC-LD also simulates low-speed urban driving with frequent stops, but with a shorter total distance of 1.89 km (1.18 miles), a lower maximum speed of 44.6 km/h (27.7 mph), and a lower average speed of 11.4 km/h (7.1 mph) compared to UDDS drive cycle. The drive cycles’ speed profiles are shown in
Figure 8.
There are five metrics for the comparison: fuel consumption (liter), fuel
emissions (gram), electricity
emissions (gram), total
emissions which is the sum of fuel and electricity
emissions, and the terminal
status. DP is the baseline for the result comparison. In general, the closer the controller’s result is to DP’s, the better the controller is.
Figure 9 shows
results of the UDDS drive cycle for each control strategy. Fuel consumption, fuel
emissions, electricity
emissions, and total
emissions are summarized in
Table 4.
Since we set the DP’s terminal to the same 20% as the initial , ANNs replicate DP’s under urban driving conditions. Therefore, the electricity emissions of ANNs should be virtually zero. The very small negative or positive emissions value will only occur when the terminal is slightly higher or lower than the preset 20% . If the terminal is slightly lower than 20%, which means the battery slightly discharges and the electrical energy is consumed, it will cause a very small positive electrical emission. While in the case of terminal slightly higher than 20%, which means the battery is slightly charged, it will have a very small negative electrical emissions value since it stores the electricity energy instead of consuming it.
From
Figure 9, all the ANN controllers show a similar
behavior to DP. ECMS has a slightly higher average
value compared to DP and ANNs. ANN controllers generate 6.22% less total
emissions than that of ECMS, as shown in
Table 4. In addition, all the controllers’
s are within 0.1% deviation compared to DP.
Figure 10 shows the
comparison results of NYCC-LD. Fuel consumption, fuel
emissions, electricity
emissions, and total
emissions are listed in
Table 5.
From
Figure 10, ANN controllers show similar results to UDDS and all the ANN controllers show similar
behavior compared to DP. However, ECMS shows a different
behavior compared to DP and ANNs, with a 6.6% deviation. Moreover, ANN controllers have 12.1% less total
emissions than ECMS at least under this drive cycle, as shown in
Table 5.
In order to check the robustness of each control strategy, UDDS and NYCC-LD drive cycles are repeated 10 times.
Figure 11 and
Figure 12 are the
results of UDDS and NYCC-LD, respectively. Additionally, the results are listed in
Table 6 and
Table 7, respectively. In
Figure 12, ECMS shows a different
to DP and ANNs, and its total
emission is 10.06% higher than the other controllers under the UDDS 10 times drive cycle. This shows similar trends in UDDS and NYCC-LD which illustrates the significance of the equivalence factor value selection. In
Table 7, ANN2-8-8, ANN5, and ANN1-32 have lower total
emissions because of the higher final
at the end of the drive cycle which means they stored more electrical energy for future use. Overall, the ANN controllers still show similar
behavior and total
emissions to DP which is consistent with the UDDS and NYCC-LD performance. Furthermore, all ANN controllers have the same sum of total
emissions in all four driving cycles in this section compared to DP. However, ANN5 has the best performance among the ANN controllers if we take
constraints into consideration. ANN2-8-32 comes next, then ANN2-8-8. This also indicates that multiple hidden layers may help to improve the ANN’s performance.