Short Term Traffic Flow Prediction of Urban Road Using Time Varying Filtering Based Empirical Mode Decomposition

Wang, Yanpeng; Zhao, Leina; Li, Shuqing; Wen, Xinyu; Xiong, Yang

doi:10.3390/app10062038


¹ College of Traffic and Transportation, Chongqing Jiaotong University, Chongqing 400074, China

² College of Mathematics and Statistics, Chongqing Jiaotong University, Chongqing 400074, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(6), 2038; https://0-doi-org.brum.beds.ac.uk/10.3390/app10062038

Submission received: 13 February 2020 / Revised: 4 March 2020 / Accepted: 11 March 2020 / Published: 17 March 2020

(This article belongs to the Special Issue New Trends of Sustainability in Civil Engineering and Architecture Ⅱ)


Abstract

Short-term traffic flow prediction is essential for real-time traffic guidance. However, short-term traffic volume data are strongly nonlinear and non-stationary, so traditional methods rarely give satisfactory results. To address this, this paper develops a hybrid method based on time varying filtering based empirical mode decomposition (TVF-EMD) and the least square support vector machine (LSSVM). Specifically, TVF-EMD is first used to handle the non-stationarity in the original data by decomposing them into several subseries. An LSSVM model is then established for each subseries to capture the linear and nonlinear characteristics embedded in the data, and the subseries predictions are summed to obtain the final forecast. Finally, case studies based on two groups of data measured at an arterial road intersection are used to evaluate the proposed method. The experimental results indicate that it outperforms the other models considered. For example, compared with the LSSVM model, the average improvements achieved by the proposed method in mean absolute error, mean relative percentage error, root mean square error and root mean square relative error are 7.397, 15.832%, 10.707 and 24.471%, respectively.

Keywords:

short-term traffic flow prediction; decomposition; TVF-EMD; LSSVM; forecasting accuracy

1. Introduction

As one of the key technologies of real-time traffic signal control, traffic assignment, route guidance, and other functions in the intelligent transportation system, short-term traffic flow prediction has always been the research focus. Its forecasting accuracy plays a decisive role in improving the performance of the intelligent transportation system [1]. For pursuing higher accuracy, a variety of spatio-temporal forecasting methods have been developed [2,3]. Among them, the temporal forecasting methods are widely used and have attracted more and more attention in the recent decades. Generally, these methods are roughly divided into three categories, i.e., statistical theoretical models, intelligent models, and hybrid models.

Statistical methods mainly include time-series models (e.g., autoregressive integrated moving average (ARIMA) and seasonal autoregressive integrated moving average (SARIMA)) [4,5,6], the Kalman filtering model [7,8], and the history average model [9]. Among them, time-series models have been widely applied to traffic volume prediction. For example, Kumar et al. [4] developed a SARIMA model to predict traffic flow, in which the model order was determined from the autocorrelation and partial autocorrelation functions. The results showed that the model achieved satisfactory forecasting accuracy. Zhao et al. [5] proposed a short-term traffic flow forecasting model that combined the ARIMA model with the space-time characteristics of the expressway network to improve forecasting accuracy. Wang et al. [6] adopted an ARIMA model to forecast traffic time-series data and obtained satisfactory results. In general, statistical models are simple, convenient and easy to apply. However, they usually overlook the random interference, strong non-stationarity and nonlinearity hidden in traffic data.

Unlike the statistical methods, intelligent models usually perform better in capturing the nonlinear relationship between input and output. These models include the artificial neural network (ANN) [10,11,12,13,14], support vector machine (SVM) [15,16] and least square SVM (LSSVM) [17]. Wang et al. [16] proposed a model integrating the wavelet function and the SVM model to forecast the target data, which improved the forecasting results. Luo et al. [18] presented a hybrid optimization algorithm combining particle swarm optimization (PSO) and the genetic algorithm to find the optimal parameters of LSSVM, which effectively improved the model’s accuracy and convergence speed. Shang et al. [19] introduced a proportion coefficient to combine the advantages of the Gaussian kernel function and the polynomial kernel function, and the forecasting results showed that the resulting model was effective and practicable. These intelligent models do not require a special model architecture and are highly adaptable, especially for nonlinear data. However, they may suffer from slow convergence and over-fitting.

To obtain more accurate and stable prediction, many scholars have introduced a variety of hybrid models which could combine the advantages of different models. The hybrid models can be commonly divided into four types: decomposition-based methods, weighting-based methods, parameter optimization-based methods, and error correction-based methods [20]. In recent years, the decomposition-based methods have become the research focus [21]. This kind of hybrid model could use the data processing models to address the nonlinear and non-stationary features in the data, and thus the forecasting accuracy could be enhanced.

The widely used decomposition algorithms include wavelet decomposition, empirical mode decomposition (EMD), and ensemble empirical mode decomposition (EEMD). Among them, wavelet decomposition [22] is a multi-scale signal analysis method for non-stationary signals. However, its performance usually relies on the choice of wavelet basis function. EMD, on the other hand, can filter the signal adaptively [22]. With this method, different features in the original sequence are filtered out step by step, and the resulting subseries are regarded as intrinsic mode functions (IMFs). Unfortunately, its decomposition process may suffer from mode mixing and the end effect. EEMD was developed by adding many Gaussian white noise samples to EMD [23]. However, its noise amplitude and ensemble number are difficult to determine. Nevertheless, these decomposition algorithms have been successfully applied to traffic flow prediction. For example, Duo et al. [24] proposed a hybrid forecasting method for short-term traffic volume based on EMD and an improved SVM. The forecasting results verified that EMD could improve accuracy significantly. Tang et al. [25] adopted a hybrid model for traffic volume prediction that combines EEMD and SVM. The results showed that this model outperformed the single SVM. Tian et al. [26] presented a hybrid prediction model based on the improved complete EEMD (ICEEMDAN) algorithm, the kernel online sequential extreme learning machine (KOSELM), and the ARIMA model, which improved the forecasting accuracy significantly. Despite these applications, the decomposition-based methods still face various challenges.

To further enhance the accuracy of traffic volume prediction, new methods for handling short-term traffic volume data are needed. This paper introduces the time varying filtering based empirical mode decomposition (TVF-EMD) algorithm [27], which describes the time-varying characteristics of data and alleviates mode mixing. Specifically, TVF-EMD is first adopted to decompose the short-term traffic volume data into multiple subsequences. Secondly, an LSSVM model is built for each subsequence, and the subsequence predictions are combined into the final prediction. On this basis, five evaluation indexes, namely the mean absolute error, mean relative percentage error, root mean square error, root mean square relative error and equal coefficient, are used to systematically evaluate the forecasting results. Meanwhile, the proposed method is compared with other forecasting models, including EMD-LSSVM, LSSVM, and ARIMA. Finally, some conclusions are provided.

The rest of this paper is organized as follows: In Section 2, TVF-EMD and LSSVM are briefly discussed. Simultaneously, the structure and procedure of the proposed method are described in detail; In Section 3, two case studies are performed and the effectiveness of the proposed method is analyzed and discussed; In Section 4, some conclusions are summarized.

2. Methods

TVF-EMD is a data decomposition algorithm that can be used to reduce the nonlinear and non-stationary components in short-term traffic volume data. LSSVM, in turn, performs well in describing short-term traffic volume data with nonlinear and non-stationary characteristics. This paper combines the advantages of the two and builds a hybrid forecasting model, TVF-EMD-LSSVM. The notation used in this section is summarized in Appendix A.

2.1. Time Varying Filtering Based Empirical Mode Decomposition

EMD is an adaptive signal processing method that decomposes a signal into a series of IMFs and a non-zero mean residual [28], as shown in Equation (1):

$$x(t) = \sum_{i=1}^{N} imf_{i}(t) + r(t) \tag{1}$$

where $imf_{i}(t)$ is the $i$th IMF, $i = 1, 2, \dots, N$, and $r(t)$ is the residual. The EMD screening process can be divided into five steps, as shown in Appendix B.

An IMF should satisfy the following conditions: (i) the number of extrema and the number of zero crossings must either be equal or differ at most by one; (ii) the local mean of the upper and lower envelopes is zero. However, these requirements have two limitations: (i) the stopping criterion is too rigid for the actual screening process; (ii) the second requirement may not hold at a low sampling rate [27]. As a result, mode mixing occurs during decomposition. To overcome this weakness of EMD, Li et al. [27] proposed the TVF-EMD screening method, which replaces the IMF with a local narrow-band signal. The local narrow-band signal is similar to an IMF and also provides a Hilbert spectrum with physical significance. The screening process is carried out by time-varying filtering in three steps: (i) estimation of the local cut-off frequency; (ii) calculation of the local mean function; (iii) judgement of the residual signal.

2.1.1. Estimation of the Local Cut-Off Frequency

In the TVF-EMD method, the B-spline approximation filter is chosen as the time-varying filter. It uses polynomial splines to approximate the signal and can be represented as:

$$g_{m}^{n}(t) = {[p_{m}^{n} * x]}_{↓ m} * b_{m}^{n}(t) \tag{2}$$

where ${[\cdot]}_{↓ m}$ is the down-sampling operation; $p_{m}^{n}$ is a pre-filter with $p_{m}^{n} = {[{({[b_{m}^{n} \times b_{m}^{n}]}_{↓ m})}^{-1}]}_{↑ m} \times b_{m}^{n}$ and $b_{m}^{n}(t) = β^{n}(t / m)$; $β^{n}(t)$ denotes the B-spline function; $n$ stands for the B-spline order; $m$ represents the node; $t$ is time; and $*$ represents the convolution operation.

According to Equation (2), the node $m$ determines the local cut-off frequency of the B-spline time-varying filter. In practice, the nodes are not known in advance, so the local cut-off frequency must be estimated from the input signal before the B-spline time-varying filter is constructed. The specific process is provided in Appendix C.

2.1.2. Calculation of Local Mean Function

After obtaining the local cut-off frequency $φ_{bis}'(t)$, the signal $h(t)$ can be obtained by

$$h(t) = \cos\left[\int φ_{bis}'(t)\,dt\right] \tag{3}$$

Taking the extreme time points $(\{t_{\min}\}, \{t_{\max}\})$ of $h(t)$ as the nodes $m$, the time-varying filter can be constructed by B-spline approximation, and its cut-off frequency is consistent with $φ_{bis}'(t)$. Subsequently, the B-spline approximation filter is applied to the input signal, and the result is recorded as $m(t)$.

2.1.3. Judgement of the Residual Signal

Since the definition of the local narrow-band signal is closely related to the instantaneous bandwidth, TVF-EMD uses the following relative criterion to check whether a signal is instantaneously narrow-band:

$$θ(t) = \frac{B_{Loughlin}(t)}{φ_{avg}(t)} \tag{4}$$

For a given bandwidth threshold $ξ$, if $θ(t) \leq ξ$, the signal can be viewed as a narrow-band signal. Here, the weighted average instantaneous frequency $φ_{avg}(t)$ and the Loughlin instantaneous bandwidth $B_{L}(t)$ (written $B_{Loughlin}(t)$ in Equation (4)) can be calculated by:

$$φ_{avg}(t) = \frac{a_{1}^{2}(t) φ_{1}'(t) + a_{2}^{2}(t) φ_{2}'(t)}{a_{1}^{2}(t) + a_{2}^{2}(t)} \tag{5}$$

$$B_{L}(t) = \sqrt{\frac{{a_{1}'(t)}^{2} + {a_{2}'(t)}^{2}}{a_{1}^{2}(t) + a_{2}^{2}(t)} + \frac{a_{1}^{2}(t) a_{2}^{2}(t) {(φ_{1}'(t) - φ_{2}'(t))}^{2}}{{(a_{1}^{2}(t) + a_{2}^{2}(t))}^{2}}} \tag{6}$$
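The narrow-band check of Equations (4) to (6) can be illustrated with a short Python sketch that evaluates the criterion at a single time instant. The function names and argument names are hypothetical, the inputs are assumed to have been estimated already as described in Appendix C, and any threshold passed as `xi` is an illustrative value rather than the one used in the experiments.

```python
import math


def weighted_average_frequency(a1, a2, dphi1, dphi2):
    """Equation (5): amplitude-weighted mean of the two instantaneous frequencies."""
    return (a1 ** 2 * dphi1 + a2 ** 2 * dphi2) / (a1 ** 2 + a2 ** 2)


def loughlin_bandwidth(a1, a2, da1, da2, dphi1, dphi2):
    """Equation (6): Loughlin instantaneous bandwidth at one time instant."""
    power = a1 ** 2 + a2 ** 2
    am_term = (da1 ** 2 + da2 ** 2) / power                      # amplitude variation
    fm_term = a1 ** 2 * a2 ** 2 * (dphi1 - dphi2) ** 2 / power ** 2  # frequency spread
    return math.sqrt(am_term + fm_term)


def is_narrow_band(a1, a2, da1, da2, dphi1, dphi2, xi):
    """Equation (4): the signal is locally narrow-band when theta <= xi."""
    theta = (loughlin_bandwidth(a1, a2, da1, da2, dphi1, dphi2)
             / weighted_average_frequency(a1, a2, dphi1, dphi2))
    return theta <= xi
```

With two equal-amplitude components whose frequencies are 1 and 3 and whose amplitudes are constant, the average frequency is 2, the bandwidth is 1, and the ratio is 0.5, so the signal passes the check only for thresholds of at least 0.5.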

2.2. Least Square Support Vector Machine

After decomposition by TVF-EMD, an LSSVM model is built for each subseries. LSSVM improves on the standard SVM by replacing the inequality constraints with equality constraints, so that the quadratic programming problem is transformed into the problem of solving a set of linear equations [29].

Consider a data set $D = \{(x_{i}, y_{i})\}, i = 1, \dots, k$, where $x_{i} \in R^{g}$ is the input, $g$ is the dimension of $x_{i}$, which can be determined by minimizing the root mean square error of the values output by the training part [20], and $y_{i} \in R$ is the corresponding output. Assume that the training part $\{(x_{1}, y_{1}), (x_{2}, y_{2}), \dots, (x_{k-g}, y_{k-g})\}$ is composed of $k - g$ data pairs and that the corresponding output is $y_{i} = x(i + g)$, $i = 1, 2, \dots, k - g$. The regression function can then be written as follows:

$$f(x) = ω^{T} ψ(x) + d \tag{7}$$

where $ψ(\cdot)$ denotes a non-linear function; $ω$ represents a weight vector; $d$ is an offset. The parameters $ω$ and $d$ can be obtained by solving the following optimization problem:

$$\begin{aligned} \min_{ω, d, q} J_{1}(ω, q) &= μ E_{W} + ς E_{D} = \frac{1}{2} μ ω^{T} ω + \frac{1}{2} ς \sum_{i=1}^{k-g} q_{i}^{2} \\ \text{s.t.} \quad y_{i} &= ω^{T} ψ(x_{i}) + d + q_{i}, \quad i = 1, \dots, k - g \end{aligned} \tag{8}$$

where $q_{i}$ denotes the error variable; $μ$ and $ς$ denote variable parameters; $E_{W} = \frac{1}{2} ω^{T} ω$; $E_{D} = \frac{1}{2} \sum_{i=1}^{k-g} q_{i}^{2} = \frac{1}{2} \sum_{i=1}^{k-g} {[y_{i} - (ω^{T} ψ(x_{i}) + d)]}^{2}$. To solve the above optimization problem, the Lagrange function is constructed as shown in Equation (9):

$$L(ω, d, q, α) = J_{1}(ω, q) - \sum_{i=1}^{k-g} α_{i} \{ω^{T} ψ(x_{i}) + d + q_{i} - y_{i}\} \tag{9}$$

where $α_{i}$ is the Lagrange multiplier. According to the Karush-Kuhn-Tucker (KKT) conditions, the optimal solution satisfies:

$$\begin{cases} \dfrac{\partial L}{\partial ω} = 0 \to ω = \sum_{i=1}^{k-g} α_{i} ψ(x_{i}) \\ \dfrac{\partial L}{\partial d} = 0 \to \sum_{i=1}^{k-g} α_{i} = 0 \\ \dfrac{\partial L}{\partial q_{i}} = 0 \to α_{i} = γ q_{i} \\ \dfrac{\partial L}{\partial α_{i}} = 0 \to ω^{T} ψ(x_{i}) + d + q_{i} - y_{i} = 0 \end{cases} \tag{10}$$

where $γ = ς / μ$ denotes the penalty coefficient. After eliminating $q_{i}$ and $ω$, the original optimization problem becomes

$$\begin{bmatrix} 0 & L^{T} \\ L & Ω + \frac{1}{γ} I \end{bmatrix} \begin{bmatrix} d \\ α \end{bmatrix} = \begin{bmatrix} 0 \\ Y \end{bmatrix} \tag{11}$$

where $Ω$ is the kernel matrix with entries $Ω_{ij} = {ψ(x_{i})}^{T} ψ(x_{j}) = K(x_{i}, x_{j})$; $L = {[1, \dots, 1]}^{T}$; $Y = {[y_{1}, \dots, y_{k-g}]}^{T}$. After $α$ and $d$ are found from Equation (11), the LSSVM regression model becomes:

$$f(x) = \sum_{i=1}^{k-g} α_{i} K(x, x_{i}) + d \tag{12}$$

where $K(x, x_{i})$ is the kernel function, which needs to meet Mercer’s conditions. Common kernel functions include the RBF kernel, the sigmoid kernel and the polynomial kernel. The RBF kernel, also called the Gaussian kernel, has strong nonlinear learning ability with few parameters, which makes it one of the most effective kernel functions. Therefore, the RBF kernel function is selected in this paper. It can be expressed as

$$K(x, x_{i}) = \exp\left[-{\| x - x_{i} \|}^{2} / (2 σ^{2})\right], \quad σ > 0 \tag{13}$$

where $σ$ denotes the kernel function parameter. When applying the LSSVM model with the RBF kernel function, the choice of the parameter $σ$ and the penalty coefficient $γ$ determines the model’s learning and generalization capabilities. Thus, it is vital to search for the most suitable parameters.
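To make Equations (11) to (13) concrete, the following minimal Python sketch assembles the linear system and solves it for $α$ and $d$. It is an illustration under stated assumptions rather than the implementation used in the experiments: the function names (`rbf_kernel`, `solve_linear`, `lssvm_fit`, `lssvm_predict`) are hypothetical, the solver is a plain Gaussian elimination chosen only to keep the example dependency-free, and the values of `gamma` and `sigma` passed by a caller are illustrative rather than tuned.

```python
import math


def rbf_kernel(x, z, sigma):
    """Gaussian (RBF) kernel of Equation (13) for two equal-length vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def solve_linear(A, b):
    """Solve A u = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the caller's matrix is left untouched.
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    u = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(M[r][c] * u[c] for c in range(r + 1, n))
        u[r] = (M[r][n] - tail) / M[r][r]
    return u


def lssvm_fit(X, y, gamma, sigma):
    """Assemble and solve the system of Equation (11); return (alpha, d)."""
    n = len(X)
    A = [[0.0] * (n + 1) for _ in range(n + 1)]
    for i in range(n):
        A[0][i + 1] = 1.0                 # first row: L^T
        A[i + 1][0] = 1.0                 # first column: L
        for j in range(n):
            A[i + 1][j + 1] = rbf_kernel(X[i], X[j], sigma)
        A[i + 1][i + 1] += 1.0 / gamma    # (1 / gamma) I on the diagonal
    solution = solve_linear(A, [0.0] + list(y))
    return solution[1:], solution[0]


def lssvm_predict(x, X, alpha, d, sigma):
    """Evaluate the regression function of Equation (12) at one input x."""
    return sum(a * rbf_kernel(x, xi, sigma) for a, xi in zip(alpha, X)) + d
```

Two properties follow directly from Equation (11) and serve as convenient sanity checks on any solver: the multipliers $α_{i}$ sum to zero, and the training residual $y_{i} - f(x_{i})$ equals $α_{i} / γ$.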

2.3. The Proposed Method

Based on the above discussion, a hybrid model that combines TVF-EMD and LSSVM is developed to improve the forecasting accuracy. First, the TVF-EMD method is applied to the non-stationary and nonlinear traffic volume series, which yields multiple subsequences called narrow-band signals. Then, an LSSVM model is established for each subsequence. Finally, the prediction results of the subsequences are summed to generate the final forecasting result. The specific process of the TVF-EMD-LSSVM model is shown in Figure 1, and the steps are as follows:

Step 1: Preprocess the original traffic volume data by repairing erroneous and missing values to obtain the experimental data;

Step 2: Decompose the data into several subsequences $\{c_{j}(1), \dots, c_{j}(k)\}$, $j = 1, \dots, M + 1$, by the TVF-EMD algorithm;

Step 3: Divide each subseries into two parts, the training part $\{x'(1), \dots, x'(k)\}$ and the test part $\{x'(k + 1), \dots, x'(k + N)\}$;

Step 4: Establish the LSSVM model to predict the $(k + 1)$th value $\hat{c}_{j}(k + 1)$ of each subsequence, and sum these values to get the forecasting value $\hat{x}(k + 1)$;

Step 5: After updating the training set to $\{x'(2), \dots, x'(k + 1)\}$, repeat Step 2 to Step 4 to obtain the next prediction. Continue to predict one step ahead until the prediction task is completed.
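The control flow of Steps 2 to 5 can be sketched in Python as a rolling one-step-ahead loop. This is only a structural illustration: `moving_average_split` is a stand-in that is NOT TVF-EMD, `persistence` is a stand-in that is NOT an LSSVM, and all function names are hypothetical. The sketch assumes that the window is re-decomposed at every step, as Step 5 suggests, and that training pairs follow the lag structure $y_{i} = x(i + g)$ of Section 2.2.

```python
def make_lagged(series, g):
    """Build training pairs: input x(i), ..., x(i+g-1) and output y_i = x(i+g)."""
    X = [series[i:i + g] for i in range(len(series) - g)]
    y = [series[i + g] for i in range(len(series) - g)]
    return X, y


def moving_average_split(window, width=3):
    """Stand-in for TVF-EMD (NOT the real algorithm): remainder plus trend."""
    trend = []
    for i in range(len(window)):
        lo = max(0, i - width + 1)
        trend.append(sum(window[lo:i + 1]) / (i + 1 - lo))
    remainder = [x - t for x, t in zip(window, trend)]
    return [remainder, trend]  # the components sum back to the window


def persistence(X, y, last_input):
    """Stand-in for a fitted LSSVM (NOT a learner): repeat the latest value."""
    return last_input[-1]


def rolling_forecast(series, k, n_test, g, decompose, fit_predict):
    """One-step-ahead forecasts of series[k], ..., series[k + n_test - 1]."""
    forecasts = []
    for step in range(n_test):
        window = series[step:step + k]           # Step 5: slide the training window
        total = 0.0
        for component in decompose(window):      # Step 2: decompose the window
            X, y = make_lagged(component, g)     # Step 3: training pairs
            total += fit_predict(X, y, component[-g:])  # Step 4: predict c_j(k+1)
        forecasts.append(total)                  # Step 4: sum over components
    return forecasts
```

Replacing the two stand-ins with a real decomposition routine and a trained regressor leaves the loop unchanged, which is the main point of separating `decompose` and `fit_predict` from the rolling logic.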

3. Case Study

3.1. Data Description

The data collection A (including 2016 samples) was measured from the intersection entrance A of an arterial road in the main urban area of Chongqing and the location is shown in Figure 2. The statistical interval was 5 min, as shown in Figure 3. Two-thirds of the data were used to train the model, and the rest were used to test the performance of the built model. Table 1 summarizes the characteristics of data collection A. It could be observed that this dataset had strong volatility.

3.2. Data Processing

Many factors affect prediction accuracy, such as data quality, data characteristics, and model selection. Among them, the quality of the traffic volume data is one of the main factors [26]. Therefore, the handling of abnormal data, including missing and erroneous data, is crucial in traffic volume prediction [30]. To repair abnormal data, the adjacent completion method is adopted, as shown in Equation (14):

$$x(t) = [x(t - w) + x(t - w + 1) + \dots + x(t - 1)] / w \tag{14}$$

where $w$ denotes the number of data to be repaired.
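A minimal Python sketch of the adjacent completion rule in Equation (14) is given below. It assumes that abnormal samples have already been flagged as `None`, that each flagged sample is replaced by the mean of the `w` samples immediately before it, and that previously repaired values may be reused; the function name `repair_adjacent` is hypothetical.

```python
def repair_adjacent(values, w):
    """Replace each None with the mean of the w values immediately before it."""
    repaired = list(values)
    for t, value in enumerate(repaired):
        if value is None:
            if t < w:
                raise ValueError("not enough earlier samples to repair index %d" % t)
            # Mean of x(t-w), ..., x(t-1), as in Equation (14).
            repaired[t] = sum(repaired[t - w:t]) / w
    return repaired
```

For example, `repair_adjacent([10.0, 12.0, 14.0, None, 16.0], 3)` fills the gap with 12.0, the mean of the three preceding counts.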

3.3. Evaluation Criteria

In order to analyze and evaluate the forecasting performance of the proposed model, five commonly used evaluation indexes including mean absolute error (MAE), mean relative percentage error (MRPE), root mean square error (RMSE), root mean square relative error (RMSRE) and equal coefficient (EC) were used in the study [26,29]. Their specific definitions are given by:

$$MAE = \frac{1}{n} \sum_{i=1}^{n} | y_{i} - \hat{y}_{i} | \tag{15}$$

$$MRPE = \frac{1}{n} \sum_{i=1}^{n} \left| \frac{y_{i} - \hat{y}_{i}}{y_{i}} \right| \tag{16}$$

$$RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} {(y_{i} - \hat{y}_{i})}^{2}} \tag{17}$$

$$RMSRE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} {\left(\frac{y_{i} - \hat{y}_{i}}{y_{i}}\right)}^{2}} \tag{18}$$

$$EC = 1 - \frac{\sqrt{\sum_{i=1}^{n} {(y_{i} - \hat{y}_{i})}^{2}}}{\sqrt{\sum_{i=1}^{n} {(y_{i})}^{2}} + \sqrt{\sum_{i=1}^{n} {(\hat{y}_{i})}^{2}}} \tag{19}$$

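Here, $y_{i}$ and $\hat{y}_{i}$ denote the observed and predicted values, and $n$ is the number of test samples. Smaller values of MAE, MRPE, RMSE, and RMSRE indicate higher accuracy, and the closer the EC value is to one, the more accurate the prediction.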
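The five indexes in Equations (15) to (19) can be computed with the short Python sketch below. The function name `evaluate` and the dictionary keys are hypothetical, the relative indexes are returned as fractions rather than percentages, and the observed values are assumed to be non-zero so that the relative errors are defined.

```python
import math


def evaluate(y_true, y_pred):
    """Return MAE, MRPE, RMSE, RMSRE and EC for paired observed/predicted values."""
    n = len(y_true)
    errors = [y - p for y, p in zip(y_true, y_pred)]
    relative = [e / y for e, y in zip(errors, y_true)]  # requires y != 0
    sq_error_sum = sum(e * e for e in errors)
    return {
        "MAE": sum(abs(e) for e in errors) / n,
        "MRPE": sum(abs(r) for r in relative) / n,
        "RMSE": math.sqrt(sq_error_sum / n),
        "RMSRE": math.sqrt(sum(r * r for r in relative) / n),
        "EC": 1.0 - math.sqrt(sq_error_sum) / (
            math.sqrt(sum(y * y for y in y_true))
            + math.sqrt(sum(p * p for p in y_pred))
        ),
    }
```

Multiplying the returned MRPE and RMSRE by 100 gives values on the percentage scale used in the result tables.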

3.4. Prediction Results and Analysis

3.4.1. TVF-EMD-LSSVM Model Prediction

According to the forecasting process of the proposed model in Section 2.3, TVF-EMD is used to decompose the experimental data A into 10 subsequences, as shown in Figure 4.

By constructing training and test sets for each subsequence, the LSSVM model is built to predict them. The dimension parameter was determined by minimizing the root mean square error of the output value in the training part [20]. Moreover, the optimal penalty coefficient and kernel function parameters of each subsequence were determined by the optimization function. Finally, the traffic volume prediction value was obtained by accumulating the forecasting results of the subsequences.

3.4.2. Comparison and Analysis of Forecasting Results

To illustrate the performance of the proposed method, three additional forecasting models, namely the ARIMA model, the LSSVM model, and the EMD-LSSVM model, were used for comparison. The procedures for the LSSVM, ARIMA, and EMD-LSSVM models were similar to the forecasting process in Section 3.4.1. The evaluation indexes of the four models are shown in Table 2 and the corresponding prediction results are shown in Figure 5. From these comparisons, the main observations are as follows:

Compared with the other three models, the proposed model had better forecasting performance: its MAE, MRPE, RMSE, RMSRE, and EC were 1.721, 3.969%, 2.974, 6.797%, and 0.9956, respectively. Specifically, in Figure 5, the red line represents the prediction result of the proposed model, while the blue line represents the true value. Their comparison indicates that the proposed method could capture the time-varying characteristics of the actual traffic volume well. From Table 2, the forecasting accuracy of the proposed model was higher than that of the EMD-LSSVM model: MAE, MRPE, RMSE, and RMSRE were reduced by 2.654, 5.991%, 2.831, and 8.464%, respectively, and EC was increased by 0.0174. The reason could be that the TVF-EMD algorithm uses time-varying filtering, which can describe the time-varying characteristics of the data and also alleviates the mode mixing problem of the EMD algorithm.
Compared with the single models, the decomposition-based forecasting methods had higher forecasting accuracy. For example, the MAE, MRPE, RMSE, RMSRE, and EC of the LSSVM model were 8.131, 17.871%, 10.801, 27.674%, and 0.9336, respectively, which is evidently less accurate than the proposed method. Relative to LSSVM, the EMD-LSSVM model reduced MAE, MRPE, RMSE, and RMSRE by 3.756, 7.911%, 4.996, and 12.413%, respectively, and increased EC by 0.0306. These results could be attributed to the high non-stationarity and nonlinearity embedded in the original data, which the decomposition methods address effectively.
The MAE, MRPE, RMSE, RMSRE, and EC of the ARIMA model were 8.284, 17.977%, 11.01, 27.25% and 0.9322, respectively. Compared with ARIMA, the LSSVM model had MAE, MRPE, and RMSE lower by 0.153, 0.106%, and 0.209 and EC higher by 0.0014, while its RMSRE was 0.424% higher. The reason could be that the nonlinear features hidden in the original data were more significant than the linear ones, so the linear ARIMA model could not capture the characteristics well. Therefore, it had the lowest forecasting accuracy overall.

3.5. Additional Case

To further test the stability of the proposed model, another group of data (data collection B) was used. These data (2016 samples) were measured at intersection entrance B of an arterial road in Chongqing, as shown in Figure 2 and Figure 6. Table 3 provides the relevant information about them. For simplicity, only the error indexes are given in Table 4, and the results are shown graphically in Figure 7. From Table 4 and Figure 7, the main results are as follows:

TVF-EMD was better than EMD in dealing with data nonlinearity and non-stationarity. The results show that the forecasting accuracy of the TVF-EMD based method was higher than that of the EMD based method.
The hybrid models could take advantage of the strengths of each component model. The results show that the forecasting accuracy of the hybrid models was higher than that of the single models.
The ARIMA model usually performs well for data with significant linear features. However, for short-term traffic volume data with highly nonlinear characteristics, the LSSVM model may have better forecasting performance.

4. Conclusions

In practice, short-term traffic volume data are usually strongly nonlinear and non-stationary, so traditional methods struggle to provide satisfactory forecasting results. To improve forecasting performance, a novel hybrid model combining the TVF-EMD algorithm and LSSVM is developed in this study. Two case studies based on data measured at an intersection are provided to evaluate the performance of the proposed method. The main conclusions are summarized as follows:

TVF-EMD has a more positive impact than EMD on forecasting accuracy. As a recently improved decomposition method, TVF-EMD can describe the time-varying characteristics (e.g., non-stationarity and nonlinearity) hidden in the data through time-varying filtering, and the problems of end effect and mode mixing may be well addressed.

The forecasting accuracy of the hybrid models is higher than that of the single models. Generally, a hybrid model can combine the advantages of its component models. In this paper, the strength of TVF-EMD in processing non-stationary and nonlinear data is combined with the strong ability of LSSVM to address nonlinear problems.

The innovation of this paper is the introduction of the TVF-EMD algorithm as a data processing method, which alleviates the mode mixing problem of the original EMD algorithm. To further improve the forecasting performance, some future tasks should be carried out. For example, the combination of the proposed method with probabilistic prediction models should be studied; multi-step ahead prediction should be developed; and the application of the proposed method in other fields, such as wind speed prediction and solar radiation prediction, should also be explored.

Author Contributions

Conceptualization, Y.W. and L.Z.; investigation, Y.X.; methodology, Y.W. and X.W.; software, L.Z.; validation, Y.W., L.Z. and S.L.; writing—original draft, Y.W.; writing—review and editing, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Science and Technology Research Project Fund of Chongqing Education Commission (Grant NO. KJ1705136, KJ1600512).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Notation Illustration

Parameters and variables
Symbol	Meaning	Symbol	Meaning
$υ$	Dimension	$ξ$	Bandwidth
$A(t)$	Instantaneous amplitude	$x(t)$	Input signal
$φ'(t)$	Instantaneous frequency	$a_{i}(t)$	Amplitude of the i-th component
$φ_{bis}'(t)$	Local cut-off frequency	${[\cdot]}_{↓ m}$	Down-sampling operation
$β^{n}(t)$	B-spline function	$n$	B-spline order
$t$	Time	$*$	Convolution operation
$φ_{avg}(t)$	Average instantaneous frequency	$B_{L}$	Instantaneous bandwidth
$ω$	Weight vector	$d$	Offset
$μ$	Variable parameter	$ς$	Variable parameter
$γ$	Penalty coefficient	$σ$	Kernel function parameter
$φ(t)$	Instantaneous phase	$ψ(\cdot)$	Non-linear function
$φ_{i}(t)$	Phase of the i-th component	$q_{i}$	Error variable
$p_{m}^{n}$	Pre-filter	$α_{i}$	Lagrange multiplier
$m$	Node	$x_{i}$	Input

Appendix B. The Screening Process of EMD

Step 1: Find the local maxima and minima of $x(t)$;

Step 2: Calculate the mean $m(t)$ of the upper and lower envelopes, $m(t) = (u(t) + l(t)) / 2$. The upper envelope $u(t)$ and the lower envelope $l(t)$ are obtained by cubic spline interpolation;

Step 3: Extract $h(t) = x(t) - m(t)$ and judge whether $h(t)$ satisfies the conditions of an IMF. If not, treat $h(t)$ as the original sequence and repeat the above steps;

Step 4: After $n$ rounds of screening, an $h_{n}(t)$ that satisfies the conditions of an IMF is obtained and recorded as $c_{1}(t) = h_{n}(t)$. Then calculate the residual component $r_{1}(t) = x(t) - c_{1}(t)$;

Step 5: Repeat the above steps for $r_{1}(t)$ to get all the IMFs.
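One screening pass (Steps 1 to 3) can be sketched in Python as follows. To keep the example dependency-free, the envelopes are built by piecewise-linear interpolation instead of the cubic splines named in Step 2, so this is a simplified illustration and not the EMD implementation used in the paper; the function names are hypothetical.

```python
def local_extrema(x):
    """Step 1: index lists of the interior local maxima and minima of x."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] >= x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] <= x[i + 1]:
            minima.append(i)
    return maxima, minima


def envelope(x, knots):
    """Piecewise-linear curve through (i, x[i]) for i in knots, flat beyond the ends."""
    out = []
    for t in range(len(x)):
        if t <= knots[0]:
            out.append(x[knots[0]])
        elif t >= knots[-1]:
            out.append(x[knots[-1]])
        else:
            # Find the pair of knots that brackets t and interpolate between them.
            right = next(j for j, kt in enumerate(knots) if kt >= t)
            t0, t1 = knots[right - 1], knots[right]
            weight = (t - t0) / (t1 - t0)
            out.append((1 - weight) * x[t0] + weight * x[t1])
    return out


def sift_once(x):
    """Steps 2-3: subtract the envelope mean m(t) from x(t) and return (h, m)."""
    maxima, minima = local_extrema(x)
    if not maxima or not minima:
        raise ValueError("signal has too few extrema to build envelopes")
    upper = envelope(x, maxima)
    lower = envelope(x, minima)
    mean = [(u + l) / 2.0 for u, l in zip(upper, lower)]
    h = [xi - mi for xi, mi in zip(x, mean)]
    return h, mean
```

A full decomposition would call `sift_once` repeatedly until the IMF conditions hold, record the result as $c_{1}(t)$, and continue on the residual as in Steps 4 and 5.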

Appendix C. The Construction of the B-Spline Time-Varying Filter

Step 1: The Hilbert transform is used to calculate the instantaneous amplitude $A(t)$ and instantaneous frequency $φ'(t)$ of the input signal $x(t)$:

$$A(t) = \sqrt{{x(t)}^{2} + {\hat{x}(t)}^{2}} \tag{A1}$$

$$φ'(t) = d(\arctan[\hat{x}(t) / x(t)]) / dt \tag{A2}$$

where $\hat{x}(t)$ denotes the Hilbert transform of the signal.

Step 2: Locate the local maxima $\{t_{\max}\}$ and local minima $\{t_{\min}\}$ of $A(t)$. For a multicomponent signal, the analytic signal can be expressed as the sum of two signals:

$$z(t) = A(t) e^{jφ(t)} = a_{1}(t) e^{jφ_{1}(t)} + a_{2}(t) e^{jφ_{2}(t)} \tag{A3}$$

where $φ(t)$ stands for the instantaneous phase, $φ(t) = \arctan[\hat{x}(t) / x(t)]$.

Therefore, the following equations can be obtained:

$$A^{2}(t) = a_{1}^{2}(t) + a_{2}^{2}(t) + 2 a_{1}(t) a_{2}(t) \cos[φ_{1}(t) - φ_{2}(t)] \tag{A4}$$

$$\begin{aligned} φ'(t) = {} & \frac{1}{A^{2}(t)} \big( φ_{1}'(t) (a_{1}^{2}(t) + a_{1}(t) a_{2}(t) \cos[φ_{1}(t) - φ_{2}(t)]) \\ & + φ_{2}'(t) (a_{2}^{2}(t) + a_{1}(t) a_{2}(t) \cos[φ_{1}(t) - φ_{2}(t)]) \big) \\ & + \frac{1}{A^{2}(t)} \big( a_{1}'(t) a_{2}(t) \sin[φ_{1}(t) - φ_{2}(t)] - a_{2}'(t) a_{1}(t) \sin[φ_{1}(t) - φ_{2}(t)] \big) \end{aligned} \tag{A5}$$

In Equations (A4) and (A5), $a_{i}(t)$ and $φ_{i}(t)$ are the amplitude and phase of the $i$th component, respectively. Assuming that a local minimum of $A(t)$ is attained at $t_{\min}$, it satisfies Equation (A6):

$$\cos[φ_{1}(t_{\min}) - φ_{2}(t_{\min})] = -1 \tag{A6}$$

Then, Equations (A7) and (A8) can be obtained by substituting Equation (A6) into Equations (A4) and (A5):

$$A(t_{\min}) = | a_{1}(t_{\min}) - a_{2}(t_{\min}) | \tag{A7}$$

$$\begin{aligned} φ'(t_{\min}) A^{2}(t_{\min}) = {} & φ_{1}'(t_{\min}) a_{1}^{2}(t_{\min}) - φ_{1}'(t_{\min}) a_{1}(t_{\min}) a_{2}(t_{\min}) \\ & + φ_{2}'(t_{\min}) [a_{2}^{2}(t_{\min}) - a_{1}(t_{\min}) a_{2}(t_{\min})] \end{aligned} \tag{A8}$$

Since $A(t_{\min})$ is a local minimum of $A(t)$, setting $A'(t_{\min}) = 0$ gives Equation (A9):

$$a_{1}'(t_{\min}) - a_{2}'(t_{\min}) = 0 \tag{A9}$$

Thus, the minimum value of $A(t)$ can be obtained by solving Equations (A4) to (A9). The maximum value of $A(t)$ can be determined similarly.

Step 3: Calculate

a_{1} (t)

and let,

\begin{array}{l} β_{1} (t) = | a_{1} (t) - a_{2} (t) | \\ β_{2} (t) = a_{1} (t) + a_{2} (t) \end{array}

(A10)

Thus, the Equation (A11) can be obtained from Equation (A5).

\begin{array}{l} β_{1} (t_{\min}) = A (t_{\min}) = | a_{1} (t_{\min}) - a_{2} (t_{\min}) | \\ β_{2} (t_{\max}) = A (t_{\max}) = a_{1} (t_{\max}) + a_{2} (t_{\max}) \end{array}

(A11)

Because a_{1} (t) and a_{2} (t) change slowly, β_{1} (t) and β_{2} (t) can be obtained by interpolating the point sets A ({t_{\min}}) and A ({t_{\max}}), respectively. a_{1} (t) and a_{2} (t) are then given by Equation (A12).

\begin{array}{l} a_{1} (t) = [β_{1} (t) + β_{2} (t)] / 2 \\ a_{2} (t) = [β_{2} (t) - β_{1} (t)] / 2 \end{array}

(A12)
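Step 3 can be sketched in a few lines of Python. The example below is only an illustration: the extrema values are made up, piecewise-linear interpolation is an assumption chosen for brevity (the appendix does not state which interpolation scheme is used), and the helper names `interp_linear` and `amplitudes` are hypothetical.

```python
def interp_linear(xs, ys, x):
    """Piecewise-linear interpolation through (xs, ys); xs must be increasing.

    Values outside [xs[0], xs[-1]] are clamped to the end points.
    """
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    for i in range(len(xs) - 1):
        x0, x1 = xs[i], xs[i + 1]
        if x0 <= x <= x1:
            return ys[i] + (ys[i + 1] - ys[i]) * (x - x0) / (x1 - x0)


def amplitudes(beta1, beta2):
    """Equation (A12): recover a1(t), a2(t) from beta1(t), beta2(t).

    Assumes a1(t) >= a2(t), so that |a1 - a2| = a1 - a2 in Equation (A10).
    """
    return (beta1 + beta2) / 2.0, (beta2 - beta1) / 2.0


# Hypothetical extrema of A(t): times and values of local minima and maxima.
t_min, A_min = [0.25, 0.75, 1.25], [1.5, 1.6, 1.7]
t_max, A_max = [0.0, 0.5, 1.0, 1.5], [2.5, 2.6, 2.7, 2.8]

t = 0.6
beta1 = interp_linear(t_min, A_min, t)  # beta1(t), interpolated from the minima
beta2 = interp_linear(t_max, A_max, t)  # beta2(t), interpolated from the maxima
a1, a2 = amplitudes(beta1, beta2)
print(a1, a2)
```

By construction, the recovered values satisfy a_{1} + a_{2} = β_{2} and a_{1} - a_{2} = β_{1} at the evaluation time.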

Step 4: Calculate φ_{1}' (t) and φ_{2}' (t). Let

\begin{array}{l} η_{1} (t) = φ_{1}' (t) [{a_{1}}^{2} (t) - a_{1} (t) a_{2} (t)] + φ_{2}' (t) [{a_{2}}^{2} (t) - a_{1} (t) a_{2} (t)] \\ η_{2} (t) = φ_{1}' (t) [{a_{1}}^{2} (t) + a_{1} (t) a_{2} (t)] + φ_{2}' (t) [{a_{2}}^{2} (t) + a_{1} (t) a_{2} (t)] \end{array}

(A13)

From Equation (A5), we have

\begin{array}{l} η_{1} (t_{\min}) = φ' (t_{\min}) A^{2} (t_{\min}) \\ \quad = φ_{1}' (t_{\min}) [a_{1}^{2} (t_{\min}) - a_{1} (t_{\min}) a_{2} (t_{\min})] \\ \quad + φ_{2}' (t_{\min}) [a_{2}^{2} (t_{\min}) - a_{1} (t_{\min}) a_{2} (t_{\min})] \\ η_{2} (t_{\max}) = φ' (t_{\max}) A^{2} (t_{\max}) \\ \quad = φ_{1}' (t_{\max}) [a_{1}^{2} (t_{\max}) + a_{1} (t_{\max}) a_{2} (t_{\max})] \\ \quad + φ_{2}' (t_{\max}) [a_{2}^{2} (t_{\max}) + a_{1} (t_{\max}) a_{2} (t_{\max})] \end{array}

(A14)

Since a_{1} (t), a_{2} (t), φ_{1}' (t) and φ_{2}' (t) change slowly, η_{1} (t) and η_{2} (t) can be obtained by interpolating the point sets φ' ({t_{\min}}) A^{2} ({t_{\min}}) and φ' ({t_{\max}}) A^{2} ({t_{\max}}), respectively. Thus, φ_{1}' (t) and φ_{2}' (t) can be calculated by solving Equation (A13), which gives Equation (A15).

\begin{array}{l} φ_{1}' (t) = \frac{η_{1} (t)}{2 {a_{1}}^{2} (t) - 2 a_{1} (t) a_{2} (t)} + \frac{η_{2} (t)}{2 {a_{1}}^{2} (t) + 2 a_{1} (t) a_{2} (t)} \\ φ_{2}' (t) = \frac{η_{1} (t)}{2 {a_{2}}^{2} (t) - 2 a_{1} (t) a_{2} (t)} + \frac{η_{2} (t)}{2 {a_{2}}^{2} (t) + 2 a_{1} (t) a_{2} (t)} \end{array}

(A15)
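The step from Equation (A13) to Equation (A15) is easier to follow once η_{1} (t) and η_{2} (t) are factored. The following worked derivation is a sketch of that intermediate step, assuming a_{1} (t) \neq a_{2} (t) and that both amplitudes are nonzero; the time argument is dropped for brevity.

```latex
\begin{aligned}
η_{1} &= (a_{1} - a_{2}) \, (a_{1} φ_{1}' - a_{2} φ_{2}'), \\
η_{2} &= (a_{1} + a_{2}) \, (a_{1} φ_{1}' + a_{2} φ_{2}'), \\
\frac{η_{2}}{a_{1} + a_{2}} + \frac{η_{1}}{a_{1} - a_{2}} &= 2 a_{1} φ_{1}', \\
\frac{η_{2}}{a_{1} + a_{2}} - \frac{η_{1}}{a_{1} - a_{2}} &= 2 a_{2} φ_{2}'.
\end{aligned}
```

Dividing the last two lines by 2 a_{1} and 2 a_{2}, respectively, reproduces the two expressions in Equation (A15).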

Step 5: Calculate the local cut-off frequency φ_{bis}' (t) as follows:

φ_{bis}' (t) = \frac{φ_{1}' (t) + φ_{2}' (t)}{2} = \frac{η_{2} (t) - η_{1} (t)}{4 a_{1} (t) a_{2} (t)}

(A16)
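Equations (A13), (A15) and (A16) can be checked against each other at a single time instant. The Python sketch below uses made-up amplitudes and phase rates and hypothetical function names; it builds η_{1} and η_{2} from Equation (A13), recovers the phase rates with Equation (A15), and evaluates the local cut-off frequency with Equation (A16).

```python
def eta_from_phase_rates(a1, a2, dphi1, dphi2):
    """Equation (A13): eta1(t) and eta2(t) at one time instant."""
    eta1 = dphi1 * (a1 ** 2 - a1 * a2) + dphi2 * (a2 ** 2 - a1 * a2)
    eta2 = dphi1 * (a1 ** 2 + a1 * a2) + dphi2 * (a2 ** 2 + a1 * a2)
    return eta1, eta2


def phase_rates_from_eta(a1, a2, eta1, eta2):
    """Equation (A15); requires a1 != a2 and both amplitudes nonzero."""
    dphi1 = eta1 / (2 * a1 ** 2 - 2 * a1 * a2) + eta2 / (2 * a1 ** 2 + 2 * a1 * a2)
    dphi2 = eta1 / (2 * a2 ** 2 - 2 * a1 * a2) + eta2 / (2 * a2 ** 2 + 2 * a1 * a2)
    return dphi1, dphi2


def local_cutoff(a1, a2, eta1, eta2):
    """Equation (A16): the mean of the two instantaneous frequencies."""
    return (eta2 - eta1) / (4 * a1 * a2)


# Hypothetical values at one time instant.
a1, a2 = 2.0, 0.5
dphi1, dphi2 = 30.0, 10.0

eta1, eta2 = eta_from_phase_rates(a1, a2, dphi1, dphi2)
rec1, rec2 = phase_rates_from_eta(a1, a2, eta1, eta2)
cutoff = local_cutoff(a1, a2, eta1, eta2)

print(eta1, eta2)   # 82.5 162.5
print(rec1, rec2)   # recovers the phase rates 30 and 10
print(cutoff)       # 20.0, the mean of the two phase rates
```

Note that Equation (A16) needs only η_{1}, η_{2} and the product a_{1} a_{2}, so the cut-off frequency can be evaluated without computing φ_{1}' (t) and φ_{2}' (t) separately.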

Step 6: Rearrange φ_{bis}' (t) to address the problem of signal intermittence.

References

Zhang, H.; Wang, X.; Cao, J.; Tang, M.; Guo, Y. A hybrid short-term traffic flow forecasting model based on time series multifractal characteristics. Appl. Intell. 2018, 48, 2429–2440.
Liu, Y.; Liu, Z.; Vu, H.L.; Lyu, C. A spatio-temporal ensemble method for large-scale traffic state prediction. Comput. Civ. Infrastruct. Eng. 2020, 35, 26–44.
Min, W.; Wynter, L. Real-time road traffic prediction with spatio-temporal correlations. Transp. Res. Part C Emerg. Technol. 2011, 19, 606–616.
Kumar, S.V.; Vanajakshi, L. Short-term traffic flow prediction using seasonal ARIMA model with limited input data. Eur. Transp. Res. Rev. 2015, 7, 1–9.
Zhao, Z.; Chen, W.; Yue, H.; Liu, Z. A novel short-term traffic forecast model based on travel distance estimation and ARIMA. In Proceedings of the 2016 Chinese Control and Decision Conference (CCDC), Yinchuan, China, 28–30 May 2016.
Wang, Y.; Li, L.; Xu, X. A piecewise hybrid of ARIMA and SVMs for short-term traffic flow prediction. Lect. Notes Comput. Sci. 2017, 493–502.
Wang, Y.; Papageorgiou, M. Real-time freeway traffic state estimation based on extended Kalman filter: A general approach. Transp. Res. Part B Methodol. 2005, 39, 141–167.
Okutani, I.; Stephanedes, Y.J. Dynamic prediction of traffic volume through Kalman filtering theory. Transp. Res. Part B 1984, 18, 1–11.
Williams, B.M.; Durvasula, P.K.; Brown, D.E. Urban Freeway Traffic Flow Prediction Application of Seasonal Autoregressive Integrated. Transp. Res. Rec. 1998, 1644, 132–141.
Kandil, N.; Wamkeue, R.; Saad, M.; Georges, S. An efficient approach for short term load forecasting using artificial neural networks. Int. J. Electr. Power Energy Syst. 2006, 28, 525–530.
Ishak, S.; Kotha, P.; Alecsandru, C. Optimization of Dynamic Neural Network Performance for Short-Term Traffic Prediction. Transp. Res. Rec. 2003, 45–56.
Vlahogianni, E.I.; Karlaftis, M.G.; Golias, J.C. Optimized and meta-optimized neural networks for short-term traffic flow prediction: A genetic approach. Transp. Res. Part C Emerg. Technol. 2005, 13, 211–234.
Zhu, J.Z.; Cao, J.X.; Zhu, Y. Traffic volume forecasting based on radial basis function neural network with the consideration of traffic flows at the adjacent intersections. Transp. Res. Part C Emerg. Technol. 2014, 47, 139–154.
Kumar, K.; Parida, M.; Katiyar, V.K. Short term traffic flow prediction in heterogeneous condition using artificial neural network. Transport 2015, 30, 397–405.
Wu, C.H.; Wei, C.C.; Su, D.C.; Chang, M.H.; Ho, J.M. Travel time prediction with support vector regression. IEEE Conf. Intell. Transp. Syst. Proc. ITSC 2003, 2, 1438–1442.
Wang, J.; Shi, Q. Short-term traffic speed forecasting hybrid model based on Chaos-Wavelet Analysis-Support Vector Machine theory. Transp. Res. Part C Emerg. Technol. 2013, 27, 219–232.
Cong, Y.; Wang, J.; Li, X. Traffic Flow Forecasting by a Least Squares Support Vector Machine with a Fruit Fly Optimization Algorithm. Procedia Eng. 2016, 137, 59–68.
Luo, C.; Huang, C.; Cao, J.; Lu, J.; Huang, W.; Guo, J.; Wei, Y. Short-Term Traffic Flow Prediction Based on Least Square Support Vector Machine with Hybrid Optimization Algorithm. Neural Process. Lett. 2019, 50, 2305–2322.
Shang, Q.; Lin, C.; Yang, Z.; Bing, Q.; Zhou, X. Short-term traffic flow prediction model using particle swarm optimization-based combined kernel function-least squares support vector machine combined with chaos theory. Adv. Mech. Eng. 2016, 8, 1–12.
Jiang, Y.; Huang, G. Short-term wind speed prediction: Hybrid of ensemble empirical mode decomposition, feature selection and error correction. Energy Convers. Manag. 2017, 144, 340–350.
Shang, Q.; Lin, C.; Yang, Z.; Bing, Q.; Zhou, X. A hybrid short-term traffic flow prediction model based on singular spectrum analysis and kernel extreme learning machine. PLoS ONE 2016, 11, 1–25.
Labate, D.; Foresta, F.L.; Occhiuto, G.; Morabito, F.C.; Lay-Ekuakille, A.; Vergallo, P. Empirical mode decomposition vs. wavelet decomposition for the extraction of respiratory signal from single-channel ECG: A comparison. IEEE Sens. J. 2013, 13, 2666–2674.
Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. A Math. Phys. Eng. Sci. 1998, 454, 903–995.
Duo, M.; Qi, Y.; Lina, G.; Xu, E. A short-term traffic flow prediction model based on EMD and GPSO-SVM. In Proceedings of the 2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference, Chongqing, China, 25–26 March 2017.
Tang, J.; Chen, X.; Hu, Z.; Zong, F.; Han, C.; Li, L. Traffic flow prediction based on combination of support vector machine and data denoising schemes. Phys. A Stat. Mech. Appl. 2019, 534, 120642.
Tian, X.; Yu, D.; Xing, X.; Wang, S.; Wang, Z. Hybrid short-term traffic flow prediction model of intersections based on improved complete ensemble empirical mode decomposition with adaptive noise. Adv. Mech. Eng. 2019, 11, 1–15.
Li, H.; Li, Z.; Mo, W. A time varying filter approach for empirical mode decomposition. Signal Process. 2017, 138, 146–158.
Zhang, X.; Liu, Z.; Miao, Q.; Wang, L. An optimized time varying filtering based empirical mode decomposition method with grey wolf optimizer for machinery fault diagnosis. J. Sound Vib. 2018, 418, 55–78.
Jiang, Y.; Zhao, N.; Peng, L.; Liu, S. A new hybrid framework for probabilistic wind speed prediction using deep feature selection and multi-error modification. Energy Convers. Manag. 2019, 199, 111981.
Li, L.; Su, X.; Wang, Y.; Lin, Y.; Li, Z.; Li, Y. Robust causal dependence mining in big data network and its application to traffic flow predictions. Transp. Res. Part C Emerg. Technol. 2015, 58, 292–307.

Figure 1. The procedure of time varying filtering based empirical mode decomposition and least square support vector machine (TVF-EMD-LSSVM).

Figure 2. Location of the intersection in Chongqing.

Figure 3. Traffic volume time series (A).

Figure 4. The pictures of TVF-EMD decomposition results (A).

Figure 5. The predictions of different methods (A).

Figure 6. Traffic volume time series (B).

Figure 7. The predictions of different methods (B).

Table 1. Data characteristics (A).

Data Resource	Mean	Variance	Maximum	Minimum	Skewness	Kurtosis	Non-Stationarity
A	70.184	20,710	188	3	0.0183	1.6495	Strong

Table 2. The comparison results of different methods (A).

Model	MAE	MRPE	RMSE	RMSRE	EC
LSSVM	8.131	17.871%	10.801	27.674%	0.9336
EMD-LSSVM	4.375	9.96%	5.805	15.261%	0.9642
TVF-EMD-LSSVM	1.721	3.969%	2.974	6.797%	0.9816
ARIMA	8.284	17.977%	11.01	27.25%	0.9322

Table 3. Data characteristics (B).

Data Resource	Mean	Variance	Maximum	Minimum	Skewness	Kurtosis	Non-Stationarity
B	68.8284	2200.5	235	2	0.2587	2.0441	Strong

Table 4. The comparison results of different methods (B).

Model	MAE	MRPE	RMSE	RMSRE	EC
LSSVM	9.281	20.415%	14.787	33.153%	0.8201
EMD-LSSVM	5.93	13.228%	8.405	20.675%	0.8983
TVF-EMD-LSSVM	0.898	2.653%	1.20	5.089%	0.9855
ARIMA	9.584	20.364%	15.762	32.269%	0.808
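The average improvements over LSSVM quoted in the abstract can be recomputed from Tables 2 and 4. The short Python sketch below assumes that "average improvement" means the mean, over data sets A and B, of the difference between the LSSVM error and the TVF-EMD-LSSVM error; this reading is an assumption, and the variable and function names are illustrative.

```python
# Error metrics copied from Table 2 (data A) and Table 4 (data B).
# Tuple order: MAE, MRPE (%), RMSE, RMSRE (%).
METRICS = ("MAE", "MRPE", "RMSE", "RMSRE")
LSSVM = {"A": (8.131, 17.871, 10.801, 27.674),
         "B": (9.281, 20.415, 14.787, 33.153)}
TVF_EMD_LSSVM = {"A": (1.721, 3.969, 2.974, 6.797),
                 "B": (0.898, 2.653, 1.20, 5.089)}


def average_improvement(baseline, proposed):
    """Mean over data sets of (baseline error - proposed error), per metric."""
    result = {}
    for i, name in enumerate(METRICS):
        diffs = [baseline[key][i] - proposed[key][i] for key in baseline]
        result[name] = sum(diffs) / len(diffs)
    return result


improvement = average_improvement(LSSVM, TVF_EMD_LSSVM)
for name, value in improvement.items():
    print(f"{name}: {value:.4f}")
```

Under this reading, the computed values agree with the figures in the abstract (7.397, 15.832%, 10.707 and 24.471%) to within rounding.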

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

