Next Article in Journal
An Analytical Model for the Tension-Shear Coupling of Woven Fabrics with Different Weave Patterns under Large Shear Deformation
Previous Article in Journal
A Robot Arm and Image-Guided Navigation Assisted Surgical System for Maxillary Repositioning in Orthognathic Surgery: A Phantom Skull-Based Trial
Previous Article in Special Issue
Non-Intrusive Load Disaggregation by Convolutional Neural Network and Multilabel Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Hybrid Double Forecasting System of Short Term Power Load Based on Swarm Intelligence and Nonlinear Integration Mechanism

School of Statistics, Dongbei University of Finance and Economics, Dalian 116025, China
*
Author to whom correspondence should be addressed.
Submission received: 5 January 2020 / Revised: 18 February 2020 / Accepted: 19 February 2020 / Published: 24 February 2020
(This article belongs to the Special Issue Artificial Neural Networks in Smart Grids)

Abstract

:
Accurate and reliable power load forecasting not only takes an important place in management and steady running of smart grid, but also has environmental benefits and economic dividends. Accurate load point forecasting can provide a guarantee for the daily operation of the power grid, and effective interval forecasting can further quantify the uncertainty of power load on this basis to provide dependable and precise load information. However, most of the previous work focuses on the deterministic point prediction of power load and rarely considers the interval prediction of power load, which makes the prediction of power load not comprehensive. In this study, a new double hybrid load forecasting system including point forecasting module and interval forecasting module is developed, which can make up for the shortcomings of incomplete analysis for the existing research. The point forecasting module adopts a nonlinear integration mechanism based on Back Propagation (BP) network optimized by Multi-objective Evolutionary Algorithm based on Decomposition (MOEA/D) to improve the accuracy of point prediction. A fuzzy clustering interval prediction method based on different data feature classification is successfully proposed which provides an effective tool for load uncertainty analysis. The experiment results show that the system not only has a good effect in accurately predicting power load, but also can analyze the uncertainty of the power load, which can be used as an effective technology of power system planning.

1. Introduction

Power load forecasting is the foundation and key task of management and control of power system [1]. It is often applied in energy supervise, unit commitment and load control [2]. High precision load prediction ensures the secure and steady operation of power system [3]. Therefore, it is essential to enhance the deliverability and prediction precision of smart grid [4]. However, due to many indeterminate reasons such as climate variation, economy growth, public activities and national decisions, the accuracy of power load forecast often fails to achieve the expected results [5]. In view of this, all countries in the world are looking for effective load forecast methods to enhance the accuracy of load forecasting [6].
In addition, the combination of ultra-short term load forecast (USTLF), short term load forecast (STLF), medium and long term load forecast (LTLF) is significant to the safe and economic operation of power system [7]. USTLF and STLF are the necessary basis of power grid dispatching, and reducing the error of STLF is an effective method to strengthen the supervising level of power system [8]. Accurate prediction of power load can save a lot of time to manage the power grid and avoid major changes [9]. Therefore, it is essential to establish a load prediction model with high forecast ability. For the sake of achieve precise and steady STLF, researchers have adopted a lot of methods, including (a) statistical method, (b) artificial intelligence method and (c) hybrid method [10].
In the early stage of power load forecasting, traditional statistical methods are often applied, including some conventional forecasting methods, such as regression method [11], exponential smoothing [12], Autoregressive Moving Average Model (ARMA) [13], Autoregressive Integrated Moving Average (ARIMA) [14,15], seasonal ARIMA [16], grey forecasting model (GM) [17], etc. These models could obtain power load prediction, but due to their own limitations, they cannot achieve the expected forecasting accuracy. In order to overcome these limitations, more and more effective load forecasting approaches have been put forward. In recent years, the prediction model based on artificial intelligence is gradually springing up in electric load prediction [18]. At the moment, the algorithms of artificial intelligence mainly include artificial neural network (ANN) [19,20], support vector machine (SVM) [21], multi-layer perceptron (MLP) and radial basis function (RBF). Although better than the traditional methods, they still unable to fit the current complicated and variable power load characteristics well to achieve satisfactory accuracy due to the defects of a single prediction method [22]. For example, the artificial neural network is easy to fall into local optimization, over fitting and low convergence rate [23]. This has led to the largely establishment of integrating and hybrid models, which are composed of several single models and can achieve better prediction performance [24].
Xiaobo Zhang et al. [25] successfully proposed a new power load prediction model, CS-SSA-SVM, which integrated singular spectrum analysis (SSA), support vector machine (SVM) and cuckoo search (CS) algorithm. This model can significantly enhance the effectiveness of power load forecast. Dong, Y. et al. [26] developed a short-term load prediction model using a unit for feature learning named Pyramid System and recurrent neural networks, and it can greatly increase the stabilization and safety of the smart grid. Wang, R. et al. [27] proposed a new power load forecasting system by combining data preprocessing, hybrid optimized algorithm and certain individual conventional prediction methods, which conquers the shortcomings of individual conventional prediction model and obtains a single model optimization with higher prediction accuracy than traditional forecasting model.
Another problem of power load forecasting is that the research direction is relatively single. Specifically, most of the previous analysis only focuses on the point prediction of load, and rarely considers the load interval prediction together to prediction modeling and analysis. This is not enough to meet the needs of engineering applications, or to ensure the reliability of the power system. Probability interval prediction can display more messages, and its results can help managers to implement appropriate policies. However, the research on interval modeling and prediction is still lacking. At present, the main research direction of uncertainty quantification is mainly statistical methods, including quantile regression [28], bootstrap method [29], kernel density estimation [30], etc. in addition, there are interval prediction methods based on artificial neural network, including lower upper bound estimation method (LUBE) and so on [31].
Table 1 summarizes the existing point prediction and interval prediction methods and models, and evaluates the advantages and disadvantages of these methods.
For point forecasting, from the traditional statistical model to the artificial neural network model, and even for the recently developed hybrid model, the prediction accuracy has been continuously improved, but the models still have a lot of space for improvement. According to the nonlinear characteristics of load, this paper proposes a method of nonlinear combination of single prediction model, and uses swarm intelligence optimization algorithm to optimize the model parameters to further improving the prediction effect. For interval forecasting, no matter quantile regression method [32], bootstrap method [33], kernel density estimation method [34] or LUBE method, these methods have their own advantages and disadvantages that are difficult to overcome. In conclusion, the interval prediction method is not uniform and further research and investigation are needed according to the existing knowledge to obtain more effective results [35]. Therefore, based on the hypothesis of distribution, this study develops a new architecture of interval prediction, which is better than most single model interval prediction architectures.
According to the review of the above literature and methods, the major contribution of this article is to design a hybrid double prediction system, including two parts: point forecasting and interval forecasting, which make up for the shortcomings of the existing research. Specifically, the double prediction system includes the preprocessing module based on Improved Complete Ensemble Empirical Mode Decomposition (ICEEMDAN), the prediction module based on nonlinear combination model, the interval prediction module and the evaluation module. As a new signal processing technology, ICEEMDAN decomposes and reconstructs the power load sequence to get a clear time sequence. The nonlinear combination model is an effective prediction model proposed in this study. Among it, Extreme Learning Machine (ELM) [36], RBF, Elman Neural Network (ENN) and ARIMA [37] are selected as the basic models of the combination model, and the prediction results of these four models are nonlinear aggregated by BP [38] neural network. BP network is very sensitive to the selection of parameters, which directly affects the validity of point prediction and interval prediction. Therefore, in order to find the optimal parameters in BP model, Multi-objective Evolutionary Algorithm based on Decomposition (MOEA/D) is developed effectively. Finally, an interval prediction method based on fuzzy clustering is established, which does not need the hypothesis of distribution and model. Therefore, the interval structure has a strong anti-interference ability to the abnormal values in the interval data. In addition, to testify the ability of the designed prediction architecture, we select 9 indicators to verify the accuracy of the prediction, and implement a series of discussion to judge the effectiveness of the prediction system.
The leading innovations of the forecasting system are summed up bellow:
(1) A new double forecasting system for power load is established in this paper, which is successfully combined of point forecasting module and interval forecasting module. The purpose of the system is to improve the accuracy of load point prediction, and effectively analyze the uncertainty of power load data.
(2) A nonlinear combination method based on the BP algorithm optimized by MOEA/D is proposed to better improve the forecasting performance of the system. To obtain the optimal combination pattern of each model, the nonlinear aggregation mechanism based on BP is adopted to combine the models and eliminate the inherent defects of individual model and linear combination. In particular, MOEA/D algorithm is used to search the best parameters of BP, which further improves the prediction accuracy.
(3) An interval prediction method based on fuzzy clustering is established, which provides an effective tool for load uncertainty analysis. The interval forecasting method determines the upper and lower bounds of the power load prediction value to quantify the uncertainty information of the load, and provide more comprehensive reference information for the operation risk decision makers of power system.
The rest of this paper is: Section 2 shows the methods applied in the proposed double forecasting architecture. In Section 3, the double forecasting system is established. Section 4 introduces the experimental data and displays the experimental results. Section 5 provides further discussion. Finally, the conclusion is given in Section 6.

2. Knowledge and Tools of Model Preparation

In constructing our model, several methods are chosen as the best choice, and they are combined to improve the ability of the model. Here we introduce three main methods named improved complete ensemble empirical mode decomposition (ICEEMDAN), multi-objective evolutionary algorithm based on decomposition (MOEA/D) and Fuzzy C-Means (FCM) clustering algorithm in detail.
The first is improved complete ensemble empirical mode decomposition (ICEEMDAN).
Being an effective data processing method, ICEEMDAN is proposed by Colominas, Schlotthauer and Torres [39] in 2014. It disintegrates the actual series into some intrinsic mode functions (IMF) and a residual from high frequency to low frequency. CEEMDAN has been proposed to restore the integrity property of EMD.
Nevertheless, CEEMDAN is still worthy of improvement in mode and signal decomposition. In ICEEMDAN, F k ( ) represents the operator figures out the k-th mode achieved by EMD, ω ( i ) is the formation of Gaussian white noise that has zero mean and unit variance, and x ( i ) = x + ω ( i ) . M ( ) is the operator and it can produce the local average of the signal. Then E 1 ( z ) = z M ( z ) can be obtained. The general steps of ICEEMDAN are as follows:
Step 1. The first residue is obtained by calculating the local averages of I formations x ( i ) = x + β 0 E 1 ( ω ( i ) ) (i = 1, …, I) using EMD:
r 1 = M ( x ( i ) ) ,   ( i   =   1 , ,   I )
In the formula, β 0 = ε 0 S ( x ) / S ( E 1 ( ω ( i ) ) ) , and ε 0 is the reciprocal of the expected signal-to-noise ratio between the first additional noise and the analytical signal. S ( ) is the operator which can calculate the standard deviation of the signal.
Step 2. The first mode is calculated at the first stage (k = 1):
d ˜ 1 = x r 1
Step3. Calculate the k-th residual (k = 2, …, K):
r k = M ( r k 1 + β k 1 E k ( ω ( i ) ) )
β k = ε 0 S ( r k )
Step4. The second mode is defined, where the local average of the formations r 1 + β 1 E 2 ( ω ( i ) ) is obtained by estimating the second residue. The second mode is
d ˜ 2 = r 1 r 2 = r 1 M ( r 1 + β 1 E 2 ( ω ( i ) ) )
Step 5. Computing the k-th mode
d ˜ k = r k 1 r k
Step 6. Return back to Step 3 and prepare for the next k.
The second is Multi-objective Evolutionary Algorithm Based on Decomposition (MOEA/D).
Recently, decomposition-based multi-objective evolutionary algorithm (MOEA/D) proposed by Zhang Qingfu et al. [40] has attracted more and more researchers’ interest because of its concise and effective characteristics, and many theoretical and practical achievements have emerged. The MOEA/D algorithm is introduced below.
A multi-objective optimization problem (MOP) with M objectives and N decision variables can be expressed as follows:
Mininize F ( x ) = ( f 1 ( x ) , f 2 ( x ) , , f m ( x ) ) Subject to x Ω
In the formula, Ω R n is decision space, and the decision vector x = { x 1 , x 2 , , x n } Ω is a candidate solution of MOP. Here, the objective function F ( x ) : x R m includes M conflicting object functions with continuous real values f 1 ( x ) , f 2 ( x ) , , f m ( x ) , and Ω is described as
Ω = { x R n | f j ( x ) 0 ,   j = 1 , , m }
where Rm represents the target space.
The pareto dominance relation of individuals is as follows: if there are decision vectors U and V, and satisfy the following two conditions at the same time, we call U dominance V:
(i) If and only if f i ( u ) f i ( v ) , for every i { 1 , , m } .
(ii) There exists at least one index j { 1 , , m } make f i ( u ) > f i ( v ) .
In this case V is said to be dominated by U, which can be denoted by u v , and among are dominant relations.
If there is no point x Ω that makes F ( x ) dominate F ( x * ) , the point x * Ω is Pareto optimal. That is, there is only one best set of compromise solutions called the non-dominated (not dominated by all other solutions). The value of Pareto optimization solution in decided space and target space is defined as Pareto solution set (PS) and Pareto frontier (PF).
MOEA/D has strong search ability for continuous optimization, combinatorial optimization and PS complex problems. The principle of the algorithm is:
If a multi-objective optimal problem similar to Equation (7) and a weight vector λ = ( λ 1 , , λ m ) are given, and the given weight vector satisfies i = 1 m λ i = 1 , λ i 0 , i = 1 , 2 , , m . MOEA/D based on Tchebycheff decomposition uses this weight vector to optimize a MOP into several sub-problems by the following methods.
m i n x Ω g t c ( x | λ j , z * ) = min x Ω max 1 i m { λ i j | f i ( x ) z i * }
where z * = ( z 1 * , z 2 * , , z m * ) ( i . e . ,   z i * < min { f i ( x ) | x Ω } , 1 , 2 , , m ) is the ideal point, and λ j = ( λ 1 j , , λ m j ) T . By solving multiple sub-problems with different weight vectors in Equation (9), Pareto optimal solution set with good diversity can be obtained [41].
As is known that g t c is continuous of λ , so if λ i is close to λ j , the g t c ( x | λ i , z * ) solution must close to the g t c ( x | λ j , z * ) solution. Hence, a useful tool of g t c ( x | λ j , z * ) optimization is information about g t c with weight vectors near the λ i .
In the algorithm MOEA/D, the population is made up of the optimal solution of the sub-problem currently found. Each sub-problem maintains a list of neighbors, which preserves sub-problems with weight vectors similar to the sub-problem. Therefore, under the assumption of continuity, two neighbor sub-problems should have similar optimal solutions. In each generation of MOEA/D, each sub-problem is optimized applying only the message of its neighbor sub-problems.
As for each generation t, MOEA/D using the Tchebycheff holds [42]:
(1) A point group x 1 , , x N N , in which the xi is the present solution for the i-th sub-problem.
(2) F V 1 , , F V N , where F V i = F ( x i ) for each i = 1 , , N .
(3) The best value zi for objective fi now and z = ( z 1 , , z m ) T .
(4) An external population (EP), applying to store non-dominated solutions found during the search.
The pseudo code of MOEA/D is described as follows:
Algorithm: MOEA/D.
Input:
  • MOP-multi-objective optimization problem
  • N-the number of the MOEA/D subproblems
  • λ1, …, λN-a uniform distribution of N weight vectors
  • T-the number of the weight vectors in the neighborhood of each weight vector
  • max_gen-the maximum number of generations
Output:
  • EP-external population
Setup:
  • Set EP =
  • gen = 0
Step 1: Initialization
  • /*Initialize an primary internal population uniformly randomly.*/
  P0 = {x1, , xN} and FVi = F(xi)
  • /*Initialize z = (z1, , zn)T by a specific problem method. */
  • /* Calculate the Euclidean distance between any two weight vectors, and then calculate the closest T weight vectors to each weight vector.*/
  • i = 1 , , N , set B(i) = {i1, , iT}
  λi1, …, λiT represent the T closest weight vectors to λi
Step 2: Updating
  • WHILE (t< max_gen) DO
  • FOR EACHi = 1,…, NDO
  /* Genetic operators */
  /* Randomly select two indexes k, l from B(i), and then generate a new solution y from xk and xl by using genetic operators. */
  • FOR EACHj = 1,…, nDO
  /*Update of z.*/
  if zj < fj(y), then set zj = fj(y)
  END FOR
  • FOR EACH index j B ( i ) DO
  /*Update of neighboring solutions.*/
  if g t e ( y | λ j , z ) g t e ( x j | λ j , z * ) , then set xj = y and FVj = F(yj).
  END FOR
  • /*Update of EP.*/
 /*Remove from EP all the vectors dominated by F(y). Add F(y)to EP if no vector in EP dominate F(y). */
  END FOR
  • t = t+1
  END WHILE
  • RETURN EP
The third is Fuzzy C-Means (FCM) clustering algorithm.
FCM clustering algorithm divides the sample points in the sample space X = { x 1 , x 2 , , x n } into c (c > 1) classes, and the degree of each sample point xi belonging to the k-th ( 1 k c ) class is expressed as uik. The fuzzy clustering of sample space X is represented by fuzzy matrix U = ( u i k ) n × c , and U satisfies the following conditions:
{ u i k [ 0 , 1 ] k = 1 c u i k = 1
The objective function is defined as:
J m ( U , V ) = i = 1 n k = 1 c u i k m d i k 2 ( x i , v k )
In the formula, V = { v 1 , , v k , , v c } , vk is the k-th clustering center, d 2 ( x i , v k ) is the distance measurement function between xi and vk, and m is the fuzzy weighted index. In order to get the best fuzzy c partition of dataset X, the solution (U,V) that makes Jm(U,V) the smallest need to be obtained. This can be achieved by the following steps [43]:
Step 1: Initialization. Input dataset {xi, i = 1,2,…,n}, clustering number c, fuzzy weighted index m( m R > 1 ), maximum number of iterations T and threshold ε. The membership matrix U(t) (t = 0, t is the iteration numbers) is initialized randomly and satisfies the Formula (10).
Step 2: Update the clustering center and membership matrix.
v k ( t + 1 ) = i = 1 n ( u i k ( t ) ) m x i i = 1 n ( u i k ( t ) ) m , k
u i k ( t + 1 ) = [ 1 / d 2 ( x i , v k ( t + 1 ) ) ] 1 / ( m 1 ) j = 1 c [ 1 / d 2 ( x i , v j ( t + 1 ) ) ] 1 / ( m 1 ) , k , i
Step 3: If max i , k u i k ( t ) u i k ( t + 1 ) < ε or t > T, the category of xi is signed as z i = arg   max 1 k c ( u i k ) . Otherwise, t = t + 1 and go to step 2.

3. Construction of Power Load Double Prediction System

In this paper, a double forecasting system of power load is established. The double forecasting system means a forecasting system that integrates point forecasting and interval forecasting. The relationship between point forecasting and interval forecasting is similar to point estimation and interval estimation in statistics. The flow of the system we designed is to carry out point prediction first, get the result of point prediction, then construct a suitable confidence interval according to the result of point prediction to carry out interval prediction. The forecasting system includes two modules: point forecasting module and interval forecasting module. The following is the system construction process, and the system structure is shown in Figure 1.

3.1. Point Forecasting Module

In this section, we successfully put forward a new type of nonlinear hybrid point forecasting model, using RBF, ELM, ENN and ARIMA, as well as BP network, MOEA/D algorithm and nonlinear combination mechanism to achieve high-precision and more stable load point prediction results. Considering the good prediction performance of BP network, this paper takes BP network as the method of nonlinear combination.
The point prediction module of the designed system is composed of four steps. The details are as follows:
Step 1: Power load data preprocessing.
For the sake of removing the noise and further collect helpful information from the power load sequence, we use ICEEMDAN technology to disintegrate the original sequence, and rebuild the smooth time series. Specifically, the original sequence is decomposed into some IMFs. IMFs with higher frequency are eliminated to filter the time series. Here we remove IMF1, and the remaining IMFs are rebuilt to get the final series.
Step 2: Single model predicting.
In this study, we first use single models to predict the points to obtain a preliminary impression of each model. Then the higher prediction accuracy model, RBF, ELM, ENN and ARIMA, are chosen as member prediction models to build the combination model. RBF, ELM and ENN are used to handle the non-linear features of load, while ARIMA has a good effect on discerning linear characteristics of load data. Say concretely, we divide 1488 load data into training set train and testing set test, where train includes 1152 data and test includes 336 data. The rolling forecasting strategy is employed, which adopts the original data of the previous five periods to forecast the next period. The input and output structure of train is shown in Equations (14) and (15). Using the RBF, ELM and ENN which trained by train to predict the load of the testing set, the prediction sequences predict1, predict2 and predict3 are obtained, respectively. Similarly, ARIMA is used to obtain the prediction sequence. predicti (i = 1,…,4), including 336 forecasting values are taken as the input datasets, which are inputted into the BP model.
i n p u t 1 = [ x ( 1 ) x ( 2 ) x ( l ) x ( 2 ) x ( 3 ) x ( l + 1 ) x ( n l ) x ( n l + 1 ) x ( n 1 ) ]
o u t p u t 1 = [ x ( l + 1 ) x ( l + 2 ) x ( n ) ]
In the Equations (14) and (15), n is the number of train and l is look-back time lag, and x(k) is the power load value at time k. For example, take x(1) to x(l) as input and x(l + 1) as output; next, take x(2) to x(l + 1) as input and x(l + 2) as output, to train the structure of RBF, ELM and ENN. Here, we set l = 5.
Step 3: Nonlinear combination model constructing.
For the sake of obtaining the combination model, a nonlinear decision-making method on the basis of BP neural network optimized by MOEA/D is proposed to achieve the best result.
Before introducing the nonlinear integration, the linear integration method is first introduced. Linear integration is to sum up the prediction results by linear weighting the prediction results of n single prediction models. Suppose yt is the actual value, predictit (i = 1, 2, ..., n) is the prediction result at time t, and n is the number of prediction models. wi represents the weight of the i-th prediction model, and i = 1 n w i = 1 . Therefore, the final prediction results can be calculated as follows
y ^ t = i = 1 n w i p r e d i c t i t , t = 1 , 2 , , m
Considering the complexity of power load and the characteristic of different single models, we choose nonlinear weight combination forecasting to compensate for the shortage of linear integration. In this study, as an important application of nonlinear integration, BP is applied to the integration of a single prediction component for final prediction. Specially, BP neural network is a complex nonlinear black box operation, and a large number of internal nodes in it are connected with each other, which can be used as an arbitrary function approximation mechanism. We regard the trained BP neural network as the weight integration of each single model, so this combination weight can also be regarded as nonlinear integration. However, BP parameters have an important impact on the prediction results and it is difficult to determine. Therefore, as an advanced swarm intelligence algorithm, MOEA/D is used to improve BP parameters to improve forecasting performance.
In particular, the predict1, predict2, predict3 and predict4, which contains 336 power load prediction values obtained from the RBF, ELM, ENN and ARIMA models mentioned in Step 2, are the basic data of the BP model, where the first 240 values are considered as the training set and the remaining 96 values are taken as the testing set. Then the 96 prediction values obtained by BP are the forecasting results of our proposed model. It is worth noting that it is difficult to find the weights and thresholds of neurons in each layer of BP network, so MOEA/D is introduced to search for the best weights and thresholds of neural network, which solves this problem to a certain extent. The input and output structure of BP network training is shown in Equations (17) and (18).
i n p u t 2 = [ p r e d i c t 1 p r e d i c t 2 p r e d i c t 3 p r e d i c t 4 ] = [ x ^ 1 ( 1 ) x ^ 2 ( 1 ) x ^ 3 ( 1 ) x ^ 4 ( 1 ) x ^ 1 ( 2 ) x ^ 2 ( 2 ) x ^ 3 ( 2 ) x ^ 4 ( 2 ) x ^ 1 ( N ) x ^ 2 ( N ) x ^ 3 ( N ) x ^ 4 ( N ) ]
o u t p u t 2 = [ x ( 1 ) x ( 2 ) x ( N ) ] T
In which N is 336, and x ^ 1 ( k ) , x ^ 2 ( k ) , x ^ 3 ( k ) , x ^ 4 ( k ) is the k-th load value predicted by RBF, ELM, ENN and ARIMA, respectively. By input and output, the optimized BP neural network can be trained.
Step 4: Power load point prediction.
According to the established nonlinear prediction model, the rolling forecasting technique is applied for multi-step forecasting, and the final prediction results are obtained. The evaluation index is calculated by using the prediction result and test, and then the performance of the model is effectively evaluated.
Specially, multi-step forecasting means forecasting multiple load values in the future. A time index t is the forecast origin and a positive integer l is the forecast horizon. It can be assumed that the time index t is exactly the time point that we are in, and our target is to obtain the forecasting value y ^ t + l (l ≥ 1). l = 1,2,3 corresponds to 1-step, 2-steps and 3-steps, respectively. Figure 2 shows the data usage of multi-step forecasting.

3.2. Interval Forecasting Module

Interval forecasting is obtaining the value interval of future load by certain forecasting methods, which shows the possible fluctuation range of future load. If accurate load point prediction can guarantee the daily operation of power grid, then effective interval prediction can further quantify the uncertainty of power load and provide reliable and accurate load information. The interval forecasting method of this system is developed according to point prediction, which is an interval prediction method based on fuzzy system. The principle of the interval forecasting method proposed in this study is to classify the point forecasting results according to different kinds of load data and construct different but adaptive intervals. The classification is based on the real load data, and judge which category the prediction results belong to, and then construct a specific confidence interval for this category, so as to get the predicted load interval. The three steps of interval forecasting module are shown as:
Step 1: Data classification.
The training set train of load data is clustered into several classes by FCM clustering method. Assume that the data in each category follows the same normal distribution. Therefore, we can get some interval classes F 1 , F 2 , , F k . The mean value and variance of each category are calculated respectively to prepare for the interval construction next.
Step 2: Load interval estimation.
The confidence degree of each category interval is 95%. According to the mean and variance of each category data, the corresponding confidence interval is constructed. Different categories have different width of unified prediction interval. This process of constructing different adaptive intervals according to different data characteristics is also one of the innovations of this model. According to the testing set test point prediction results in the point prediction module, identify the category F that each prediction value falls into. Then, according to the constructed confidence interval of each category, the prediction interval of each prediction value is calculated as:
[ x i z 1 α / 2 s j n j , x i + z 1 α / 2 s j n j ]
where xi is point prediction value, j is the category number of xi, sj is standard deviation of category j, nj is the data number of category j.
Step 3: Sorting out the prediction results.
According to the prediction interval of the above points, the final interval estimation of the power load is obtained.

4. Experiments and Analysis

This section introduces the application of the double forecast model and several comparison models, and divides the comparison into three experimental demonstrations. The operating environment of the experiment is: 2.60 GHz CPU, 4.00 GB RAM, Windows 7 and Matlab R2016A. Considering the random factors, to guarantee the reliability of the final results, 20 experiments are carried out each time, and the average value is taken, respectively.

4.1. Dataset Description

From July 1, 2019 to July 31, 2019, power load data were collected from New South Wales, Queensland, South Australia and Tasmania, including four weeks of power load data in this paper. Electricity demand is collected every 30 min, with a total of 1488 data points and 48 data points per day. Among them, the data from July 1, 2019 to July 27, 2019 were chosen as training set of selected model, 27 days in total, including 1296 data points; the data from July 28, 2019 to July 31, 2019 were used as testing set, 4 days in total, including 192 data points. At the same time, the last three days data are used to decide the network structure of BP model. Training set and testing set select identical rolling forecasting technique and output one-step, two-step and three-step prediction results. The power load data from July 1, 2019 to July 31, 2019 and its statistical indicators which are minimum, maximum, mean and std., are shown in Table 2. The distribution condition of areas and dataset are presented in Figure 3.
As shown in the Table 2, the statistics of all samples, training sets and testing sets of three sites are similar. The data set is reasonable and can be chosen to testify the supreme ability of the proposed model.

4.2. System Evaluation

The evaluation indexes of the designed double prediction system are introduced, including 5 indexes of point prediction and 4 indexes of interval prediction.

4.2.1. Point Forecasting Evaluation

Generally speaking, the evaluation criteria are not unique for the prediction system. Hence, this paper uses five common evaluation standards to assess the point forecasting performance of the proposed model and other comparative models. The five indexes include mean absolute percentage error (MAPE), root mean square error (RMSE), mean absolute error (MAE), direction change (DC) and the index of agreement of forecasting results (IA). Among these indexes, the smaller the values of MAPE, RMSE and MAE, the larger the values of DC and IA, the better the prediction performance. See Table 3 for details of the four indicators.

4.2.2. Interval Forecasting Evaluation

For interval prediction, we select four evaluation indexes, which are forecasting interval coverage probability (FICP), forecasting interval normalized average width (FINAW), mean width of the constructed PIs (MPI) and Accumulated width deviation (AWD). Table 4 shows the specific definitions of four indices.
Specifically, FICP is the forecasting interval coverage probability of the test data set, which is the main evaluation index of interval prediction. It indicates the coverage effect of the obtained confidence interval to the actual value. Given the confidence level, if FICP is at least greater than or equal to 1-alpha, the constructed interval is valid; otherwise, the constructed interval is invalid. FINAW is the normalized average width of the forecasting interval of the testing set. The cost of reducing the width is diminishing the possibility of expected target covering; increasing the coverage requires increasing the width of the interval, so FICP and FINAW are essentially contradictory. MPI represents the average width of the obtained interval. AWD is the accumulated width deviation of testing dataset, which could be obtained by calculating the relative deviation degree. The cumulative sum of AWDi can measure the relative deviation degree. See Table 4 for the specific description of the formula.

4.3. Diebold-Mariano Test

To verify the designed hybrid model owns better forecasting ability than compared models, an effective verification method called Diebold-Mariano (DM) test proposed by Diebold FX and Mariano RS is adopted. The theory of DM test is introduced first.
Considering the significance level α , zero hypothesis H0 indicates the predictive effectiveness of the developed model and the comparison model are not significantly different. The meaning of H1 is contrasted with H0. The relevant formulas are shown as:
H 0 : E [ L ( e r r i 1 ) ] = E [ L ( e r r i 2 ) ]
H 1 : E [ L ( e r r i 1 ) ] E [ L ( e r r i 2 ) ]
In the formula, L represents the loss function of prediction error. erri1 and erri2 are the error sequence predicted by selected model.
In addition, the statistics of DM test can be defined in the following ways:
DM = i = 1 n ( L ( e r r i 1 ) L ( e r r i 2 ) ) S 2 / n s 2
in which S2 is the estimate of the variance of d i = L ( e r r i 1 ) L ( e r r i 2 ) . Assuming a certain significance level α , the obtained value DM is in comparison with that of z α / 2 . Once DM statistics exceed the interval [ z α / 2 , z α / 2 ], H0 can be rejected. This shows the predictive performance of the established model and that of the comparative model are significantly different, which means that H1 will be accepted.

4.4. Results and Analysis of Point Forecasting

To testify the performance of the point prediction module, two experiments are conducted in this part: experiment I and experiment II. The main purpose of experiment I is to prove the good ability of the nonlinear combination model in the point forecasting, so as to reasonably verify the superiority of the proposed model. In addition, experiment I proves the necessity of data preprocessing. In the same way, to prove the rationality and superiority of the ICEEMDAN technology selected in this work, it is compared with other commonly used data preprocessing technology, and this is the content of experiment II. The detailed analysis of each experiment is as follows.

4.4.1. Experiment I: Comparison with Individual Models

In this work, all experimental datasets are trialed to assess the effectiveness of the point prediction module, while three comparisons are designed. In comparison (a), the proposed model is compared with the four data preprocessed models, ICEEMDAN-RBF, ICEEMDAN-ELM, ICEEMDAN-ENN and ICEEMDAN-ARIMA, in order to analyze the advantages of the combination model using the nonlinear combination method. In comparison (b), four ICEEMDAN-based models are compared with single models RBF, ELM, ENN and ARIMA, respectively. In comparison (c), the effectiveness of the designed forecasting model is further tested by using the traditional model SVR and Generalized Regression Neural Network (GRNN) as comparison methods. The predicted results are displayed in Table 5 and Figure 4, and the comparison consequences are as follows.
(1) For comparison (a), the developed hybrid nonlinear model has the best ability in one to three step load forecast in four datasets, whose error index was superior to other models. For instance, in one-step forecasting, the MAPE of the established model is about 0.6577%, 0.7373%, 2.0714% and 1.3288%, while the prediction accuracy of the other ICEEMDAN-based model is 0.1% to 2% lower than that of the established model. For the two and three step prediction, the proposed model is also better than other models in four sites.
(2) For comparison (b), comparing ICEEMDAN-ENN and ICEEMDAN-ARIMA models with ARIMA and ENN without data preprocessing, it can be found that data preprocessing is very important to enhance the ability of load forecasting. For site 1, the MAPE of ENN and ARIMA are higher than that of ICEEMDAN-ENN and ICEEMDAN-ARIMA in one, two and three step prediction. The accuracy of ICEEMDAN-RBF and ICEEMDAN-ELM which are not shown in the table are also improved. For site 2, site 3 and site 4, and Figure 4, the situation is similar.
(3) For comparison (c), it can be seen from five indexes MAPE, MAE, RMSE, IA and DC of Table 5, the proposed model is more accurate than other individual models, such as SVR and GRNN. In addition to the proposed hybrid prediction model, the single model with high prediction accuracy is ARIMA and ENN. Therefore, we choose ARIMA and ENN as the model used in combination models, and the same circumstance as RBF. ARIMA is a linear model, therefore it can show that the power load data has certain linear characteristics, so it is a wise choice to take ARIMA into account in the proposed model. Additionally, although the prediction accuracy of BP is not shown, the experimental results show that it has relatively good prediction effect, so BP is selected as the model used in nonlinear combination.

4.4.2. Experiment II: Tests of Data Preprocessing Methods

This experiment is aiming at comparing the effectiveness of ICEEMDAN selected in this system with other common data preprocessing methods, which includes EMD, EEMD, CEEMD, SSA and WD. Therefore, the point forecasting models on the basis of different data preprocessing methods are EMD-based model, EEMD-based model, CEEMD-based model, SSA-based model and WD-based model. These models only use different decomposition method in the data preprocessing stage. Through the experiment, we can test whether the proposed prediction model is reasonable, and can also find the best method to remove the noises to improve the prediction effectiveness.
The results obtained by models using different data preprocessing approaches are shown in Table 6. Figure 5 shows a clearer and more intuitive comparison. The conclusion can be drawn that the model on the basis of the ICEEMDAN decomposition technology has much better performance than other decomposition based prediction models. Say concretely, for site 2, the MAPE value of ICEEMDAN-based proposed model is 0.7373%, 1.8561% and 1.8413% for three steps, which is 0.1 to 4 percentage points higher than that of EMD-based model, EEMD-based model, CEEMD-based model, SSA-based model and WD-based model. Of all the benchmark models, EMD-based model is the worst. Compared with other models, the MAE, RMSE, IA and DC of the one to three step prediction of the developed model are also improved to different extent, which further shows the superiority of the data preprocessing method selected by this hybrid model.
Remark 2.
Experiment I and experiment II focus on proving the advantages of the proposed point forecast module, and the results show that the designed point forecasting model is a very promising power load forecasting method. It can also prove that the combination of data preprocessing technology, optimization algorithm and nonlinear combined method could successfully solve the difficulties of load prediction through appropriate prediction methods.

4.5. Results and Analysis of Interval Prediction (Experiment III)

Base on the point power load forecasting, the probability interval forecasting could show more load information. In this part, we develop a method based on fuzzy clustering, which carries out interval forecasting on the basis of point forecasting. In addition, four datasets are applied in this experiment. To verifying the ability of the designed interval forecast module, we use the all the compared model of point prediction, and also use multi-step prediction to verify the interval predicted results. The results of the interval predicted model and other models are shown in Table 7. Due to the limited space, we only display the results of site 2 and site 3. We set the confidence interval to 90% to assess the effectiveness of the interval predicted model.
(1) For site 2, the best values of all indexes in all models are obtained by the proposed prediction model. For proposed model, the coverage probability of forecasting interval (FICP) is 98.96% in one-step, 79.06% in two-step and 78.95% in three-step. The average width of the interval is 356.9044, 333.2484 and 355.3731 in three steps according to MPI. Compared with the absolute value of power load, the interval width obtained is relatively accurate. AWD is 0.0002, 0.0587 and 0.0554 for three steps, shows the deviation degree of the constructed interval is small. All indexes reflect that the predicted interval of proposed model is qualified. In contrast, for the FICP of single prediction model, none of the predictions is better than proposed model. Although the ICEEMDAN-ELM has the same value of FICP as proposed model in one-step, it is largely lower than two and three-step.
(2) By combining FINAW with FICP, for the proposed model, when the FICP value is very high, FINAW is relatively small, which also shows the superiority of the developed model. In the one and two step forecast, the AWD of most other benchmark models is more than ten times of the developed model in one-step, and they are much larger in two and three step. This reflects the less deviation of the developed model. These four indexes fully reflect the superior forecasting ability of the developed model. The same conclusion can be drawn for site 3 in Table 7.
(3) At the same time, in order to intuitive show the comparison results, the results of the designed model and comparison models are pictured in Figure 6. The conclusions are consistent with Table 7, providing intuitive evidence for verifying the superior ability of the proposed system in the load interval forecasting. As shown in Figure 6, compared with other models, the proposed model has more accuracy interval forecast results. Obviously, the prediction range not only covers most of the load values, but also is narrowest among all models. This shows that the designed model is more stable than others. As a result, the designed model has greater advantages for three experimental datasets.
Remark 1.
The same as the comparison model used in point forecast, 13 different competition models based on four datasets and multi-step interval forecast are compared. The results show the designed interval model is better than all the comparison models. Due to the excellent ability of the designed interval prediction module based on fuzzy clustering, it is a very promising interval prediction method of power load.

5. Discussions

For the sake of discussing the experiment conclusions in detail and reduce the error of power load forecasting, the validity of the established model, the combination mechanism of combination model and the practical application in the power system are discussed.

5.1. DM Test

The validity of the model is verified by DM test by all other models comparing with the proposed hybrid forecasting model. Based on DM test theory, the zero hypothesis is that the prediction results of both models is no significant difference, while the alternative hypothesis is contrast. We chose two scales with alpha of 0.1 and 0.05 as the criteria to judge the significance of the results, among which Z0.05/2 = 1.96 and Z0.1/2 = 1.645. Table 8 displays the DM statistics result and averages for the four datasets.
It can be seen that most of the DM test values calculated by the developed model and the above comparison model are larger than the upper limit of 5% significance level. However, for some results of ICEEMDAN-RBF, ICEEMDAN-ARIMA and CEEMD-based model as well as WD-based model, the results do not show significant differences with the proposed model. Therefore, it can be considered to reject the zero hypothesis at the level of 10% significance. For example, the DM test statistic of ICEEMDAN-ARIMA model in site 4 is 1.7554 for one-step, which is not significantly differ from the developed model at 5% significance level, but significantly differ from the developed model at 10% significance level. At the 10% level of significance, almost all the distinctions between the designed model and the benchmark model are significant. There are a few models whose results indicate that the difference between the compared model and the proposed model are not significant, but the indicators such as MAPE show that the proposed model still own the best ability. Therefore, it can be proved that the designed hybrid double forecasting model is preferable to other models.

5.2. Performance Testing of Optimization Algorithms

This section first introduce the parameter settings of BP network and MOEA/D algorithm, then implement the convergence testing of metaheuristic algorithms.

5.2.1. Parameter Settings

The artificial intelligence algorithm BP is used to combined the power load results. In BP neural network, the weights and thresholds of input, hidden and output layer occupy an important position in network performance. In order to effectively determine the connection weight and node threshold, we choose the MOEA/D algorithm to optimize its parameters. The parameters of BP and MOEA/D are shown in Table 9 and Table 10, respectively.

5.2.2. Convergence Testing of Optimization Algorithms

To discuss the performance of MOEA/D algorithm, different population size numbers are selected to test ability under four test functions, and two multi-objective optimization algorithms, Multi-objective Grey Wolf Optimization (MOGWO) and Multi-objective Dragonfly Algorithm (MODA), are selected as the compared model. Table 11 shows the details of the four test functions. Through the comparison of different optimization methods, it is proved that the prediction ability of MOEA/D is better than that of other multi-objective algorithms. A total of 20 experiments are carried out in each case and the average value is obtained. The calculation results of each index are shown in Table 12.
We choose two performance indexes of optimization algorithm as the criteria to evaluate the performance of optimization algorithm, which are Inverted Generational Distance (IGD) index and Spread index. In addition, the running time of different algorithms is compared. In particular, IGD is an indicator of the convergence condition of the algorithm, and its result can be used to judge the robustness and stability of the algorithm. If the IGD value is smaller, the ability of the algorithm is better. In Pareto set, Spread is usually used to evaluate the distribution of solutions. If SP is equal to 0, all non-dominant solutions are equidistant.
The final simulated results are shown in Table 12. Considering all the algorithms, when the population size is 100, 200, 300 and 500, respectively, the larger the population size has the better the convergence effect. For MOGWO, too large of a population leads to over fitting of data, which makes the algorithm worse. Compared with different algorithms, MOEA/D has the best performance for ZDT1, ZDT2, ZDT3 and ZDT4. The IDG of MOEA/D algorithm is far less than that of other algorithms, which shows the MOEA/D algorithm has the best convergence performance, and MODA is the second best optimal algorithm. The convergence effect of MOGWO algorithm is much worse than other algorithms. For Spread, MOEA/D has the best allocation performance. The elapsed time of MOEA/D algorithm is significantly lower than the other two algorithms, which shows that MOEA/D is undoubtedly the fastest and best algorithm in terms of working efficiency.

5.3. Combination Mechanism of Combined Model

For the sake of verifying the effectiveness of the designed nonlinear combination mechanism MOEA/D-BP, a simple average strategy and a linear combination mechanism are selected as the comparison in this study. Among them, the simple average strategy computes the mean value of the prediction results of each model, while the linear combination mechanism uses the multi-objective algorithm MOEA/D as the weight determination method to get the final prediction results. The compared consequences between the developed model and the other two methods are represented in Table 13.
Specially, the simple average method is to use the simple average formula under statistical sense to calculate the final predicted value. The method formula is briefly introduced as follows:
f = f 1 + f 2 + f 3 + f 4 4
where fi is the prediction results of the corresponding model. The linear combination of the models is the weighted combination of the results of the four single models, and a final prediction value is obtained. The weights are determined by the multi-objective optimization algorithm, which increases the intelligence of the method.
The effects of each combination mechanism are compared based on five point forecasting error measurement rules and four interval error forecasting measurement rules. The result shows that the forecasting effectiveness of the nonlinear combination model is more accurate than that of the simple average method and the linear combination mechanism, regardless of the sites and forecasting steps. The linear combination mechanism is often more effective than the simple average strategy. In other words, the simple average strategy is the worst. Therefore, the developed nonlinear combination mechanism MOEA/D-BP has successfully improved the forecasting effectiveness of power load.

5.4. Practical Application of Load Forecasting To a Power System

Load forecasting is of great significance for how to improve the stability and reliability of power grid. Accurate forecasting results can play a decisive role in the safety and stability of power network operation. Point forecast represents the possible situation of load in a future period, while interval forecast can reflect the possible range of load. The result of load forecasting is directly reflected in the power grid planning, and the details are as follows [44].

5.4.1. Application of Load Point Forecasting

Power supply load calculation needs load forecasting. According to the results of load forecasting, we can calculate the power supply load of each voltage level. The power supply load is the premise of calculating the power balance and the basis of determining the newly added variable capacitance. The network load is generally calculated according to the voltage level, which refers to the load provided by the public transformer of the same voltage level. In order to allocate the capacity of distribution network reasonably, it is necessary to predict and analyze the distribution of power supply load of each voltage level network.
Point load forecasting can be applied to high voltage power grid planning. After receiving power from the upper level grid or power supply, the high-voltage grid can directly supply power to the high-voltage users, or provide power to the lower level medium voltage grid, which is the link between the transmission network and the medium voltage grid. Load forecasting is directly related to substation capacity demand and distribution in high voltage power grid planning. Substation capacity demand is to determine the number and capacity of main transformer according to the prediction results of network load and the value of capacity load ratio. The distribution of substation is determined according to the load density of spatial load forecasting.
Point load forecasting can also be applied to medium voltage power grid planning. In the medium voltage network planning, the load forecasting results are directly related to the distribution and transformation planning and line scale planning. The planning of medium voltage distribution is mainly to determine the capacity and distribution points of the new distribution transformer. The medium voltage line planning is mainly based on the load forecasting results to determine the number of lines to meet the growing demand for power supply [45].

5.4.2. Application of Load Interval Forecasting

Analyzing the existing load forecasting methods, it is found that a large number of methods are deterministic load point forecasting results. In fact, because there are various uncertain factors in the power system, the decision making must face a certain degree of risk, so the uncertainty of power demand must be considered in the decision-making. The results of traditional deterministic forecasting methods can not reflect the uncertainty of demand, and interval forecasting can meet this objective requirement.
Interval forecasting transmits more information than point forecasting. The result of interval forecasting is a series of interval value, and this interval corresponds to a certain level of probability confidence level, which can describe the possible range of future forecasting results. According to the results of interval forecasting, the power system decision makers can better understand the fluctuation range of future load changes, and better understand the uncertainty and risk factors that may exist in the future load when carrying out production planning, system safety analysis and other work, so as to make more reasonable decisions in time. According to the upper and lower bounds of interval prediction, the rotating reserve capacity of power system can be arranged, so as to improve the economic benefits of power system operation. Interval load forecasting can also meet the optimal unit combination, economic scheduling and optimal power flow of the power system dispatching department, which is conducive to improving the utilization rate of power generation equipment and the effectiveness of economic scheduling. Therefore, the analysis of power system load change and the study of power load interval forecasting method are helpful for decision makers to better grasp the change of data in power grid planning and other aspects, so as to achieve more scientific analysis and evaluation.

6. Conclusions

Precise and dependable power load forecasting not only takes an important place in power management and operation of smart grid, but also own environmental advantages as well as economic and social benefits. However, due to the complicated fluctuation of power load, its further development and utilization are greatly limited, and even may endanger the dispatching and management of power system. Most of the previous work focused on the deterministic point prediction of power load, seldom considered the other important aspect which is the interval prediction of power load, and this situation makes the prediction of power load not comprehensive.
In order to fully mine and evaluate the deterministic and uncertain characteristics of power load, this study successfully developed a double forecast system, which makes up for the shortcomings of the existing research. The system is divided into two parts: the point forecasting module based on nonlinear combination and the interval forecasting module based on fuzzy clustering. It is of great importance to comprehensively discuss the predictability and modeling of load. Different from the previous work, this paper effectively designs BP neural network based on MOEA/D optimization as a new nonlinear combination mechanism, obtains the final prediction results, further improves the accuracy of point prediction, and improves the final prediction ability. On the basis of improving the prediction accuracy, the load data is divided into different categories based on fuzzy clustering, and then different intervals are constructed according to the prediction data of different categories. This method constructs different intervals according to different characteristics of data, which is an effective interval prediction method. Finally, a large number of experiments are carried out by using the quantitative index, which proves the effectiveness and superiority of the system. In addition, because the designed system has good performance, it can also be used in other load forecasting, wind power forecasting, economic forecasting and other fields.

Author Contributions

Funding acquisition, P.J.; writing—original draft, Y.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number71573034).

Conflicts of Interest

The authors declare that there are no conflict of interest regarding the publication of this paper.

Nomenclature

SVRSupport Vector RegressionMAEMean Absolute Error
GRNNGeneralized Regression Neural NetworkMAPEMean Absolute Percentage Error
ENNElman Neural NetworkRMSERoot Mean Square Error
ELMExtreme Learning MachineIAIndex of Agreement
RBFRadial Basis Function ModelDCDirectional Change
BPBack Propagation Neural NetworkFICPForecasting Interval Coverage Probability
ARAutoregressive ModelFINAWForecasting Interval Normalized Average Width
ARMAAutoregressive Moving Average ModelAWDAccumulated Width Deviation
ARIMAAutoregressive Integrated Moving AverageMPIAverage Width of the Constructed PIs
EMDEmpirical Mode DecompositionMODAMulti-objective Dragonfly Algorithm
EEMDEnsemble Empirical Mode DecompositionMOGWOMulti-objective Grey Wolf Optimization
CEEMDComplementary Ensemble Empirical Mode DecompositionMOEA/DMulti-objective Evolutionary Algorithm based on Decomposition
SSASingular Spectrum AnalysisMOEA/D-BPOptimizing BP with MOEA/D
WDWavelet Domain DenoisingICEEMDAN-RBFRBF using ICEEMDAN preprocessed data
ICEEMDANImproved Complete Ensemble Empirical Mode DecompositionICEEMDAN-ELMELM using ICEEMDAN preprocessed data
ICEEMDAN-ENNENN using ICEEMDAN preprocessed data
ZDTTest functions for multi-objective algorithmICEEMDAN-ARIMAARIMA using ICEEMDAN preprocessed data

References

  1. Zhang, X.; Wang, J. A novel decomposition-ensemble model for forecasting short-term load-time series with multiple seasonal patterns. Appl. Soft Comput. 2018, 65, 478–494. [Google Scholar] [CrossRef]
  2. Jiang, P.; Liu, Z. Variable weights combined model based on multi-objective optimization for short-term wind speed forecasting. Appl. Soft Comput. 2019, 82, 105587. [Google Scholar] [CrossRef]
  3. Hsu, Y.; Tung, T.; Yeh, H.; Lu, C. Two-Stage Artificial Neural Network Model for Short-Term Load Forecasting. IFAC Pap. 2018, 51, 678–683. [Google Scholar] [CrossRef]
  4. Chen, Y.; Kloft, M.; Yang, Y.; Li, C.; Li, L. Mixed kernel based extreme learning machine for electric load forecasting. Neurocomputing 2018, 312, 90–106. [Google Scholar] [CrossRef]
  5. Xiao, L.; Shao, W.; Wang, C.; Zhang, K.; Lu, H. Research and application of a hybrid model based on multi-objective optimization for electrical load forecasting. Appl. Energy 2016, 180, 213–233. [Google Scholar] [CrossRef]
  6. Yu, F.; Xu, X. A short-term load forecasting model of natural gas based on optimized genetic algorithm and improved BP neural network. Appl. Energy 2014, 134, 102–113. [Google Scholar] [CrossRef]
  7. Wang, J.; Du, P.; Lu, H.; Yang, W.; Niu, T. An improved grey model optimized by multi-objective ant lion optimization algorithm for annual electricity consumption forecasting. Appl. Soft Comput. J. 2018. [Google Scholar] [CrossRef]
  8. Yang, W.; Wang, J.; Tong, N. A hybrid forecasting system based on a dual decomposition strategy and multi-objective optimization for electricity price forecasting. Appl. Energy 2019, 235, 1205–1225. [Google Scholar] [CrossRef]
  9. Guo, Z.; Zhou, K.; Zhang, X.; Yang, S. A deep learning model for short-term power load and probability density forecasting. Energy 2018, 160, 1186–1200. [Google Scholar] [CrossRef]
  10. Dong, Y.; Ma, X.; Ma, C.; Wang, J. Research and Application of a hybrid forecasting model based on data decomposition for electrical load forecasting. Energies 2016, 9, 1050. [Google Scholar] [CrossRef] [Green Version]
  11. Akay, D.; Atak, M. Grey prediction with rolling mechanism for electricity demand forecasting of Turkey. Energy 2007, 32, 1670–1675. [Google Scholar] [CrossRef]
  12. Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C. Time Series Analysis Forecasting and Control, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 1994. [Google Scholar]
  13. Huang, S.J.; Shih, K.R. Short-term load forecasting via ARMA model identification including non-Gaussian process considerations. IEEE Trans. Power Syst. 2003, 18, 673–679. [Google Scholar] [CrossRef] [Green Version]
  14. Xu, X.Y.; Qi, Y.Q.; Hua, Z.S. Forecasting demand of commodities after natural disasters. Expert Syst. Appl. 2012, 37, 4313–4317. [Google Scholar] [CrossRef]
  15. Chen, J.F.; Wang, W.M.; Huang, C.M. Analysis of an adaptive time-series autoregressive moving-average (ARMA) model for short-term load forecasting. Electric Power Syst. Res. 1995, 34, 187–196. [Google Scholar] [CrossRef]
  16. Felice, M.D.; Alessandri, A.; Catalano, F. Seasonal climate forecasts for medium-term electricity demand forecasting. Appl. Energy 2015, 137, 435–444. [Google Scholar] [CrossRef]
  17. Zhou, P.; Ang, B.W.; Poh, K.L. A trigonometric Grey prediction approach to forecasting electricity demand. Energy 2006, 31, 2839–2847. [Google Scholar] [CrossRef]
  18. Wang, J.; Niu, T.; Lu, H.; Guo, Z.; Yang, W.; Du, P. An analysis-forecast system for uncertainty modeling of wind speed: A case study of large-scale wind farms. Appl. Energy 2018, 211, 492–512. [Google Scholar] [CrossRef]
  19. Methaprayoon, K.; Lee, W.J.; Rasmiddatta, S.; Liao, J.R.; Ross, R.J. Multistage artificial neural network short-term load forecasting engine with front-end weather forecast. IEEE Trans. Ind. Appl. 2007, 43, 1410–1416. [Google Scholar] [CrossRef]
  20. Mori, H.; Yuihara, A. Deterministic annealing clustering for ANN-based short term load forecasting. IEEE Trans. Power Syst. 2001, 16, 545–551. [Google Scholar] [CrossRef]
  21. Li, R.; Dong, Y.; Zhu, Z.; Li, C.; Yang, H. A dynamic evaluation framework for ambient air pollution monitoring. Appl. Math. Model. 2018. [Google Scholar] [CrossRef]
  22. Hao, Y.; Tian, C. The study and application of a novel hybrid system for air quality early-warning. Appl. Soft Comput. 2018. [Google Scholar] [CrossRef]
  23. Bo, H.; Nie, Y.; Wang, J. Electric Load Forecasting Use a Novelty Hybrid Model on the Basic of Data Preprocessing Technique and Multi-Objective Optimization Algorithm. IEEE Access 2020, 8, 13858–13874. [Google Scholar] [CrossRef]
  24. Wang, J.; Niu, T.; Lu, H.; Yang, W.; Du, P. A Novel Framework of Reservoir Computing for Deterministic and Probabilistic Wind Power Forecasting. IEEE Trans. Sustain. Energy 2019, 11, 337–349. [Google Scholar] [CrossRef]
  25. Zhang, X.; Wang, J.; Zhang, K. Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by Cuckoo search algorithm. Electr. Power Syst. Res. 2017, 146, 270–285. [Google Scholar] [CrossRef]
  26. Dong, Y.; Wang, J.; Wang, C.; Guo, Z. Research and Application of Hybrid Forecasting Model Based on an Optimal Feature Selection System—A Case Study on Electrical Load Forecasting. Energies 2017, 10, 490. [Google Scholar] [CrossRef] [Green Version]
  27. Wang, R.; Wang, J.; Xu, Y. A novel combined model based on hybrid optimization algorithm for electrical load forecasting. Appl. Soft Comput. 2019, 82, 105548. [Google Scholar] [CrossRef]
  28. Wang, H.Z.; Wang, G.B.; Li, G.Q.; Peng, J.C.; Liu, Y.T. Deep belief network based deterministic and probabilistic wind speed forecasting approach. Appl. Energy 2016, 182, 80–93. [Google Scholar] [CrossRef]
  29. Errouissi, R.; Cardenas-Barrera, J.; Meng, J.; Castillo-Guerra, E.; Gong, X.; Chang, L. Bootstrap prediction interval estimation for wind speed forecasting. In Proceedings of the 2015 IEEE Energy Conversion. Congress and Exposition (ECCE), Montreal, QC, Canada, 20–24 September 2015; pp. 1919–1924. [Google Scholar] [CrossRef]
  30. Juban, J.; Siebert, N.; Kariniotakis, G.N. Probabilistic short-term wind power forecasting for the optimal management of wind generation. IEEE Lausanne Power Tech. 2007, 2007, 683–688. [Google Scholar] [CrossRef]
  31. Khosravi, A.; Nahavandi, S.; Creighton, D.; Atiya, A.F. Lower upper bound estimation method for construction of neural network-based prediction intervals. IEEE Trans. Neural Netw. 2011. [Google Scholar] [CrossRef]
  32. Yan, J.; Liu, Y.; Han, S.; Wang, Y.; Feng, S. Reviews on uncertainty analysis of wind power forecasting. Renew. Sustain. Energy Rev. 2015, 52, 1322–1330. [Google Scholar] [CrossRef]
  33. Samuels, T. Bootstrapping and resampling. Anaesthesia 2017, 72, 271–272. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Yang, W.; Wang, J.; Lu, H.; Niu, T.; Du, P. Hybrid wind energy forecasting and analysis system based on divide and conquer scheme: A case study in China. J. Clean. Prod. 2019, 222, 942–959. [Google Scholar] [CrossRef] [Green Version]
  35. Moghram, I.; Rahman, S. Analysis and evaluation of five short-term load forecasting techniques. IEEE Trans. Power Syst. 1989, 198. [Google Scholar] [CrossRef]
  36. Hao, Y.; Tian, C. A novel two-stage forecasting model based on error factor and ensemble method for multi-step wind power forecasting. Appl. Energy 2019, 238, 368–383. [Google Scholar] [CrossRef]
  37. Liu, Z.; Jiang, P.; Zhang, L.; Niu, X. A combined forecasting model for time series: Application to short-term wind speed forecasting. Appl. Energy 2019, 259, 114137. [Google Scholar] [CrossRef]
  38. Zhang, X.; Wang, J.; Gao, Y. A hybrid short-term electricity price forecasting framework: Cuckoo search-based feature selection with singular spectrum analysis and SVM. Energy Econ. 2019. [Google Scholar] [CrossRef]
  39. Colominas, M.A.; Gastón, S.; María, T. Improved complete ensemble EMD: A suitable tool for biomedical signal processing. Biomed. Signal Process. Control 2014, 14, 19–29. [Google Scholar] [CrossRef]
  40. Zhang, Q.; Li, H. MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition. IEEE Trans. Evol. Comput. 2008, 11, 712–731. [Google Scholar] [CrossRef]
  41. Deb, K.; Kalyanmoy, D. Multi-Objective Optimization Using Evolutionary Algorithms; Wiley & Sons: Hoboken, NJ, USA, 2001; Volume 2, p. 509. [Google Scholar]
  42. Ishibuchi, H.; Sakane, Y.; Tsukamoto, N.; Nojima, Y. Simultaneous use of different scalarizing functions in MOEA/D. In Proceedings of the 12th Annual Conference on Genetic and Evolutionary Computation (GECCO ’10), Portland, Oregon, 7–11 July 2010. [Google Scholar] [CrossRef]
  43. Bezdek, J.C.; Ehrlich, R.; Full, W. FCM: The fuzzy c-means clustering algorithm. Comput. Geosci. 1984, 10, 191–203. [Google Scholar] [CrossRef]
  44. Xia, C.; Zhang, M.; Cao, J. A hybrid application of soft computing methods with wavelet SVM and neural network to electric power load forecasting. J. Electr. Syst. Inf. Technol. 2017. [Google Scholar] [CrossRef]
  45. Badri, A.; Ameli, Z.; Birjandi, A.M. Application of Artificial Neural Networks and Fuzzy logic Methods for Short Term Load Forecasting. Energy Procedia 2012, 14, 1883–1888. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The flowchart of load forecasting by double hybrid model. The model construction steps are recorded in the upper right, and the methods displayed in the specific model are divided into five parts. D and E part show the results of point forecasting and interval forecasting, respectively.
Figure 1. The flowchart of load forecasting by double hybrid model. The model construction steps are recorded in the upper right, and the methods displayed in the specific model are divided into five parts. D and E part show the results of point forecasting and interval forecasting, respectively.
Applsci 10 01550 g001
Figure 2. Data selection scheme for multi-step forecasting. In the red box are the data selection methods of 1-step, 2-step and 3-step forecasting, respectively.
Figure 2. Data selection scheme for multi-step forecasting. In the red box are the data selection methods of 1-step, 2-step and 3-step forecasting, respectively.
Applsci 10 01550 g002
Figure 3. Description of observations in four sites. On the upper part is the geographic location obtained by data points. The lower part of the graph shows the divide method of training set and testing set.
Figure 3. Description of observations in four sites. On the upper part is the geographic location obtained by data points. The lower part of the graph shows the divide method of training set and testing set.
Applsci 10 01550 g003
Figure 4. The multi-step prediction ability of all models in Experiment I for site 1. The center of the picture is a one-step prediction time series figure. Around it are the error diagrams of eight comparison models and the proposed model.
Figure 4. The multi-step prediction ability of all models in Experiment I for site 1. The center of the picture is a one-step prediction time series figure. Around it are the error diagrams of eight comparison models and the proposed model.
Applsci 10 01550 g004
Figure 5. The multi-step forecast ability in Experiment II for site 2. The picture is divided into three parts: one to three steps time series forecasting chart, three steps error comparison bar chart and one-step forecasting error point chart.
Figure 5. The multi-step forecast ability in Experiment II for site 2. The picture is divided into three parts: one to three steps time series forecasting chart, three steps error comparison bar chart and one-step forecasting error point chart.
Applsci 10 01550 g005
Figure 6. The final interval results of some models in multi-step forecasting in Experiment III for site 3. The figure shows the interval forecast results of the proposed model, singular spectrum analysis (SSA)-based model, Improved Complete Ensemble Empirical Mode Decomposition (ICEEMDAN)- Autoregressive Integrated Moving Average (ARIMA) model and Generalized Regression Neural Network (GRNN) single model, respectively. In addition, the figure shows the relationship between the interval forecasting band and the real value.
Figure 6. The final interval results of some models in multi-step forecasting in Experiment III for site 3. The figure shows the interval forecast results of the proposed model, singular spectrum analysis (SSA)-based model, Improved Complete Ensemble Empirical Mode Decomposition (ICEEMDAN)- Autoregressive Integrated Moving Average (ARIMA) model and Generalized Regression Neural Network (GRNN) single model, respectively. In addition, the figure shows the relationship between the interval forecasting band and the real value.
Applsci 10 01550 g006
Table 1. Summary of point prediction and interval prediction models.
Table 1. Summary of point prediction and interval prediction models.
MethodAdvantageDisadvantage
Point Forecasting
Statistical models (AR, ARIMA, fractional-ARIMA.)
  • Consider information from historical data
  • High calculation efficiency
  • Few adjustable model parameters
  • Can mine the linear law of data effectively
  • Cannot mine the nonlinear law of data
  • Stable data only
  • Assuming the interference sequence is white noise is necessary
Artificial neural network
  • Can fit the non-linear patterns of data, and has good generalization ability and self-learning ability
  • The calculation process is complex
  • High requirements on the number of training samples
  • Fall into local optimum easily
Hybrid model
  • Integrate the advantages of each single model
  • Improvement of prediction accuracy
  • Improved spaces in each part of the model still exist
  • Non-linear patterns of data cannot be further mined by linear combination
Interval Forecasting
Quantile regression
  • Be able to deal with heterogeneity
  • High tolerance for outliers
  • Consider the whole distribution
  • Ability to capture tail features of distribution
  • High training data set requirements
  • Large amount of calculation
  • Results are easily discarded
Bootstrap methods
  • Avoid possible discarding in quantile regression
  • Very effective when handling small samples
  • Poor performance in handling large samples
  • Large amount of computation
Kernel density estimation
  • Fast construction prediction interval
  • Strict distribution assumption
LUBE
  • Avoid the data distribution hypothesis
  • High calculation efficiency
  • Model coefficient is easy to adjust
  • The objective function is complex
  • Traditional mathematical methods cannot optimize the objective function
Table 2. Structure of four selected datasets in Australia.
Table 2. Structure of four selected datasets in Australia.
SiteSamplesNumbersStatistical Indicator
MeanStd.MaxMin
Site 1All samples14888568.511124.3211379.766372.63
Training12968556.831114.8811379.766372.63
Testing1928647.371186.1211107.046697.94
Site 2All samples14885935.55738.007884.644318.74
Training12965954.18737.867884.644318.74
Testing1925809.81728.397337.134707.27
Site 3All samples14881395.97296.052229.02611.41
Training12961379.04292.242178.39611.41
Testing1921510.24297.052229.02997.92
Site 4All samples14881231.81138.651628.73910.65
Training12961223.45135.831611.11910.65
Testing1921288.19144.561628.73987.02
Power load of New South Wales, Queensland, South Australia and Tasmania from 1 July to 31 July 2019.
Table 3. Point forecasting evaluation metrics.
Table 3. Point forecasting evaluation metrics.
IndicatorDefinitionEquation
MAPEMean Absolute Percentage Error MAPE = 1 N i = 1 N | y i y ^ i y i | 100 %
MAEMean Absolute Error MAE = 1 N i = 1 N | y i y ^ i |
RMSERoot Mean Square Error RMSE = 1 N i = 1 N ( y i y ^ i ) 2
DCDirectional Change DC = 100 N 1 t = 1 N 1 a t
IAIndex of agreement IA = 1 i = 1 N ( y i y ^ i ) 2 / i = 1 N ( | y ^ i y ¯ | | y i y ¯ | ) 2
Among the formula, y i and y ^ i is the true and predicted value. N represents the testing set number. In addition, ai is the directional factor, and is calculated as a t = { 0 , o t h e r w i s e 1 , i f ( ( y ( t + 1 ) y ( t ) ) ( y ^ ( t + 1 ) y ( t ) ) ) > 0 .
Table 4. Interval prediction evaluation metrics.
Table 4. Interval prediction evaluation metrics.
IndicatorDefinitionEquation
FICPForecasting interval coverage probability FICP = 1 N i = 1 N c i × 100 %
FINAWForecasting interval normalized average width FINAW = 1 N i = 1 N U i L i y max y min × 100 %
MPIMean width of the constructed PIs MPI = 1 N i = 1 N | U i L i |
AWDAccumulated width deviation AWD = 1 N i = 1 N AWD i
Among the formula, Ui and Li represent the upper limit and lower limit of forecasting interval, respectively. ci is the number of the truth value contained in constructed interval. N represents the testing set number. ymax and ymin are the maximum and minimum of the targets in the whole prediction process. In addition, the calculation expression of AWDi is AWD = 1 N i = 1 N A W D i , and AWDi is the width deviation of construction interval of each sample.
Table 5. Abilities of the established model and single models.
Table 5. Abilities of the established model and single models.
SiteModelOne-StepTwo-StepThree-Step
MAPEMAERMSEIADCMAPEMAERMSEIADCMAPEMAERMSEIADC
Site 1SVR1.9099156.7013187.91950.99440.82202.5219210.3356258.51800.98940.74743.0868262.8112341.31880.98170.6614
GRNN1.2373107.3457140.29390.99640.92152.1193183.9606244.34080.98850.86323.1821276.0676368.40480.97270.7778
RBF1.3330 120.0048 162.0430 0.9951 0.9058 2.5730 227.8929 303.8217 0.9819 0.8053 5.7891 518.6760 671.9120 0.8625 0.6143
ELM0.8640 75.3729 101.7329 0.9981 0.9476 2.8427 246.4113 327.8275 0.9733 0.8304 6.0739 518.3696 663.3588 0.9629 0.7570
ENN0.938781.4666108.15640.99790.94762.1680188.5787247.67810.98860.80533.4611299.2232375.49250.97270.6614
ARIMA1.2105103.8960137.74560.99650.90582.5009224.8970300.27670.98390.75262.4838223.0066292.30590.98470.7672
ICEEMDAN-RBF0.736765.473094.16110.99840.94762.2596198.6571278.74250.98620.83685.2163471.45791091.95320.81480.7937
ICEEMDAN-ELM0.854374.472198.48470.99830.94242.7422239.9533319.75600.98100.75795.6887494.3489636.18000.92150.6190
ICEEMDAN-ENN0.921681.0992105.16460.99800.93191.8371159.6742204.52020.99230.85793.0383261.0713330.91700.97880.6984
ICEEMDAN-ARIMA1.064994.4410123.49990.99730.92672.3297209.7987281.92990.98570.78952.3985215.6049282.76380.98560.7831
Proposed Model0.657756.779180.10180.99880.93721.6540143.5974192.17160.99320.84741.7361151.6523200.29430.99260.8730
Site 2SVR3.6548212.2143250.97210.97800.67543.8775228.6832273.74030.97370.63683.5513208.0377261.83310.97340.6085
GRNN1.521386.3305117.67540.99320.83772.3967135.6083189.39530.98220.69473.3451188.7230274.16980.96180.6508
RBF0.9823 58.2252 76.4982 0.9973 0.9058 3.3178 195.5145 263.8302 0.9576 0.7862 7.1590 416.3507 603.8911 0.6753 0.4778
ELM0.9063 53.5149 68.9417 0.9977 0.9058 2.9227 170.9014 228.5369 0.9705 0.8253 6.0964 353.5843 484.5720 0.8766 0.6571
ENN1.028360.511880.10600.99690.92672.1192125.0372168.72740.98630.80533.5815209.5433279.50270.96110.6667
ARIMA2.2121125.7167146.76450.99010.74872.4574146.4694189.35420.98300.73682.4456146.0298189.52410.98300.7407
ICEEMDAN-RBF0.972957.354680.51290.99690.89013.1682185.6588279.81240.96390.72637.2699412.7709950.58170.69260.5661
ICEEMDAN-ELM0.887552.532366.62240.99790.91102.8790169.0021227.71030.97550.73685.8342340.6032450.38270.90430.6349
ICEEMDAN-ENN0.981558.700277.62690.99710.90051.9832115.6628155.76120.98820.85263.1668183.2636247.55780.96920.7354
ICEEMDAN-ARIMA1.010560.309679.70030.99700.88482.3534139.8655181.76110.98420.77372.3018136.8617181.24630.98430.7672
Proposed Model0.737343.342355.63320.99850.93191.8561110.1146146.89690.98930.80001.8413109.4865144.41270.98970.8307
Site 3SVR18.1694250.6877267.68870.86190.575916.2465226.7004250.18550.87110.584214.4738204.6135234.39310.87900.5503
GRNN3.455853.607171.47460.98380.73305.944593.0097121.88830.95030.53688.4107133.1739176.05790.88960.4709
RBF4.7792 72.5550 105.3551 0.9017 0.7671 15.8938 241.5826 305.6250 0.8735 0.5163 18.0374 270.8042 333.5527 0.8372 0.3619
ELM2.5036 38.1800 53.4851 0.9917 0.8482 4.5972 69.6088 90.9124 0.9746 0.5895 6.8086 102.6562 132.6552 0.9411 0.4233
ENN2.590639.549154.80270.99120.81685.253380.6505109.29070.96260.56848.0953123.4767165.82620.90660.4550
ARIMA3.311950.777666.88330.98600.75394.684369.460598.55520.97210.69474.683369.431498.70340.97220.6984
ICEEMDAN-RBF4.172270.1981187.82320.90750.801015.8023220.4719256.59420.86290.594716.2465226.7004250.18550.87110.5842
ICEEMDAN-ELM2.210533.560151.59230.99220.88487.0790108.0376149.78400.92970.473712.7588193.4990266.79740.76560.4233
ICEEMDAN-ENN2.430737.218253.89560.99140.82724.920675.847699.29840.96930.56327.2839112.3960147.70660.92590.4233
ICEEMDAN-ARIMA2.448136.685253.64280.99170.84824.425066.379190.28810.97600.68954.422266.504590.69950.97590.6931
Proposed Model2.071430.961844.25470.99450.86913.606354.351475.73870.98280.74743.702854.737372.09330.98470.7720
Site 4SVR3.938749.367754.00310.96790.64405.455470.119878.73840.92840.57894.403456.517370.10770.94260.6296
GRNN1.674922.133330.17840.98810.84822.656835.334847.60120.96830.68953.625848.261363.03780.94020.6243
RBF2.7267 168.5272 237.3737 0.9866 0.7898 4.9468 289.4248 391.8644 0.9526 0.7295 9.3387 429.3665 579.0957 0.9044 0.6844
ELM1.4988 19.6842 26.1392 0.9919 0.8264 4.1464 55.3052 77.8929 0.9656 0.7295 8.0347 107.8908 151.7349 0.9279 0.6481
ENN1.577720.772727.29600.99080.82203.084141.047456.37620.95780.65264.607161.417083.68010.89860.6138
ARIMA1.775923.505931.36900.98730.80632.746336.207448.62300.97150.71052.753336.315048.68650.97140.7143
ICEEMDAN-RBF2.5025208.7714257.02990.98950.74744.7572285.1412335.27670.95970.55798.9365489.6447531.07560.90600.5947
ICEEMDAN-ELM1.376118.090823.99220.99270.84294.120854.667872.73030.92470.65267.429698.1502129.97490.75420.6032
ICEEMDAN-ENN1.518120.077227.74230.99040.81682.623734.979647.81120.96940.70003.754350.263168.49130.93040.6296
ICEEMDAN-ARIMA1.419118.597124.45870.99270.81152.527833.455246.07660.97400.69472.540133.627446.15560.97390.6931
Proposed Model1.328817.310922.26510.99390.83252.111627.145235.14270.98450.74742.240129.298739.59000.98060.7249
Table 6. The models with different data preprocessing techniques.
Table 6. The models with different data preprocessing techniques.
SiteModelOne-StepTwo-StepThree-Step
MAPEMAERMSEIADCMAPEMAERMSEIADCMAPEMAERMSEIADC
Site 1EMD-based model7.9921703.2179848.00340.79830.59168.0933697.2798827.36830.81120.57897.9109691.9289832.91870.81170.6190
EEMD-based model6.4768550.6402680.48570.89250.61787.0479612.2662730.68380.86570.61585.2011459.5873598.19090.92480.6561
CEEMD-based model2.0549176.5654227.75460.99040.80632.0285175.3443224.95820.99080.82112.1103180.5625228.82690.99010.7989
SSA-based model1.9070168.8455234.24890.98970.85861.9845173.4462231.29320.99010.84741.9283168.1976228.00820.99050.8677
WD-based model0.658257.088477.95110.99890.95291.732997.4880132.73050.99680.91051.7491152.1173195.52540.99310.7989
Proposed Model0.657756.779180.10180.99880.93721.6540143.5974192.17160.99320.84741.7361151.6523200.29430.99260.8730
Site 2EMD-based model5.3342320.8286436.03180.87920.67545.2566317.6441426.51150.88530.73685.8824356.6128504.22900.82310.6984
EEMD-based model2.3126133.3762166.13390.98650.74352.1817125.8414255.67550.98810.71583.1988127.6659204.15060.96690.7302
CEEMD-based model1.568591.9520121.01010.99300.85861.9680122.0495168.28340.96340.79682.5498110.6276159.97720.97300.7930
SSA-based model1.596093.8540115.79240.99350.80631.9178120.0527167.58940.97630.82632.5884112.8322154.93220.97370.7824
WD-based model0.794646.771160.52760.99830.91101.9386116.6306157.22160.98340.80423.2482132.8212172.42810.98580.7302
Proposed Model0.737343.342355.63320.99850.93191.8561110.1146146.89690.98930.80001.8413109.4865144.41270.98970.8307
Site 3EMD-based model5.461385.9224119.75610.95360.73825.131080.7809107.17740.96430.70005.691389.9809123.27270.94910.6931
EEMD-based model3.707955.882676.54990.98220.79064.054461.463879.38190.98090.76324.834976.8613109.74420.96260.7037
CEEMD-based model3.991363.7212106.03440.96490.81684.423469.8325108.05330.96350.76324.650573.2335109.60050.96250.7513
SSA-based model3.834457.270971.34120.98500.77494.125846.864959.03530.98980.83684.261959.185473.74290.98010.7989
WD-based model2.200133.124144.22240.99430.87434.488268.536893.90260.97300.62633.905558.959379.12660.98490.7460
Proposed Model2.071430.961844.25470.99450.86913.606354.351475.73870.98280.74743.702854.737372.09330.98470.7720
Site 4EMD-based model3.0164164.0450197.05000.98640.66492.4541135.9545177.99120.98850.69472.8497158.4661196.58240.98530.6825
EEMD-based model2.0505112.7215145.81500.99230.78532.3454127.9509166.08140.98970.78422.3412126.2956169.41550.98930.7725
CEEMD-based model1.406555.950477.33500.99780.90052.2781125.7992159.20660.99080.73682.3741130.5010164.78060.99010.6984
SSA-based model2.9273158.8158203.46840.98430.74872.5953140.1998191.12800.98630.76842.4590136.6107176.64790.98880.7143
WD-based model1.388518.231625.03510.99210.83772.177228.634438.94300.98050.72632.567130.686340.05570.97150.7072
Proposed Model1.328817.310922.26510.99390.83252.111627.145235.14270.98450.74742.240129.298739.59000.98060.7249
Table 7. The designed model comparing with all other models for site 2 and site 3.
Table 7. The designed model comparing with all other models for site 2 and site 3.
SiteModelOne-StepTwo-StepThree-Step
FICPFINAWAWDMPIFICPFINAWAWDMPIFICPFINAWAWDMPI
Site 2SVR0.55730.13830.1098482.56480.53400.13270.1310483.86630.60530.13130.1340443.8932
GRNN0.80210.12880.0468331.09030.62300.12700.1767315.30020.53160.12460.2991297.3550
ENN0.94790.13480.0055363.88980.75920.13220.0725366.48380.54210.12530.2361339.8606
ARIMA0.95310.13140.0034355.76320.68590.12970.0730388.33550.71580.12830.0820384.4358
ICEEMDAN-RBF0.96350.12960.0076366.60680.68060.12570.1734441.82820.56320.10080.39721005.4671
ICEEMDAN-ELM0.98960.13540.0005364.46630.59160.12750.1506407.43730.45260.13720.3171520.7903
ICEEMDAN-ENN0.97400.13440.0035355.52770.73300.12850.0844351.92590.47370.12230.2831339.3854
ICEEMDAN-ARIMA0.95830.13090.0037354.38020.66490.12640.1009387.42350.68420.12710.1005389.6756
EMD-based model0.37500.12730.9095281.91690.35600.12930.5799269.88170.40530.11662.2804300.4038
EEMD-based model0.63020.13010.1391311.35770.67020.12940.1106343.89370.72630.13970.1007345.9396
CEEMD-based model0.84380.13880.0371362.41710.83250.13550.0333347.87790.83680.13820.0364350.5409
SSA-based model0.85420.13770.0572345.74670.82720.13680.0324355.38230.84740.13960.0250355.5278
WD-based model0.98440.13290.0002349.53550.82200.12820.0344342.38390.66320.13190.1068333.2383
Proposed Model0.98960.13050.0002356.90440.79060.12460.0587333.24840.78950.12690.0554355.3731
Site 3SVR0.13020.13201.0036209.50360.17280.12960.9299200.21020.23160.13610.7603207.3294
GRNN0.69790.13270.0908145.74290.49210.13550.3627149.09680.36320.13490.7105138.6840
ENN0.84900.13420.0582169.39760.56540.13330.2129163.68300.46320.12680.4586155.3926
ARIMA0.86980.13890.0386175.02060.66490.12720.1579170.74870.67370.12510.1720166.9475
ICEEMDAN-RBF0.84900.10940.1037297.09920.70160.00040.1732359.65310.64210.00100.3749819.3448
ICEEMDAN-ELM0.90100.13680.0475163.60250.47120.13540.3762161.37930.28420.12140.8042181.5007
ICEEMDAN-ENN0.85420.12980.0509158.83480.56540.12900.2501154.39470.43160.13430.5467154.2944
ICEEMDAN-ARIMA0.86460.13830.0397174.31030.71730.13190.1549177.15930.71050.13300.1534178.5047
EMD-based model0.52600.13540.2949151.72760.54450.14360.2268151.79520.54740.13900.2904147.9657
EEMD-based model0.74480.13350.1169158.33640.70680.13190.1259146.18600.58420.13140.2563137.7404
CEEMD-based model0.73960.13790.2067150.04180.69630.13440.1944156.74980.66320.12490.2152149.5520
SSA-based model0.64580.13390.1313145.73130.68060.14280.0744158.44970.72110.13980.0708159.1189
WD-based model0.92710.13380.0182160.79550.62300.13370.1790141.42220.75260.12500.1136160.1570
Proposed Model0.93750.13190.0171163.11460.69630.13040.1033143.13010.70530.13000.1055154.2024
Table 8. Statistics value of DM test for experimented models.
Table 8. Statistics value of DM test for experimented models.
ModelSite 1Site 2Site 3Site 4Average
1-Step2-Step3-Step1-Step2-Step3-Step1-Step2-Step3-Step1-Step2-Step3-Step1-Step2-Step3-Step
SVR10.24213.79905.445012.00947.74896.625620.230314.842913.651015.426211.48617.271414.47709.46928.2482
GRNN6.70052.70046.33445.83392.69354.59385.10005.54637.28803.69343.32654.55055.33193.56675.6916
ENN4.60880.70966.61493.83290.76335.49662.31364.00466.69164.65953.61815.04333.85372.27395.9616
ARIMA6.04544.23484.58158.64863.08213.74925.00902.55163.35443.81523.00782.42185.87953.21913.5267
ICEEMDAN-RBF2.27263.40491.90433.99484.00351.69981.61681.02722.83501.67112.78434.93042.38892.80502.8423
ICEEMDAN-ELM3.39184.55827.66652.68564.45446.84131.28455.80377.19992.52866.29387.52472.47275.27767.3081
ICEEMDAN-ENN4.33952.83647.62153.66372.22466.43661.98595.65247.06684.00984.93886.25043.49973.91306.8438
ICEEMDAN-ARIMA5.06714.82084.71533.85963.87304.44821.68763.30022.70031.93443.78623.35573.13723.94513.8049
EMD-based model12.510312.232412.19998.41576.80387.85716.74233.69244.55512.76783.04273.76677.60906.44287.0947
EEMD-based model7.883310.43978.83328.88330.58321.42245.03270.25603.79982.31942.79701.99476.02963.51904.0125
CEEMD-based model8.54832.05891.92906.55682.06202.19975.01883.00533.54581.87962.38831.23855.50092.37862.2283
SSA-based model6.11501.54861.55397.23911.94792.71825.38032.27931.59682.53173.28602.34485.31652.26552.0534
WD-based model0.55093.15240.49012.00422.94324.37721.00973.68310.00901.84571.37563.13931.35262.78852.0039
“‗” refers to 10% significance level; “̲” refers to 5% significance level. Considering the significance level α , zero hypothesis H0 indicates the predictive effectiveness of the proposed model and the comparison model are not significantly different. The meaning of H1 indicates the predictive effectiveness of the proposed model and the comparison model are significantly different. The relevant formulas are: H 0 : E [ L ( e r r i 1 ) ] = E [ L ( e r r i 2 ) ] , H 1 : E [ L ( e r r i 1 ) ] E [ L ( e r r i 2 ) ] . In the formula, L represents the loss function of prediction error. Errori1 and Errori2 are the error sequence predicted by selected model.
Table 9. Default settings of Back Propagation (BP).
Table 9. Default settings of Back Propagation (BP).
Experimental ParametersValue setting
Training functionTRAINLM
Adapt learning functionLEARNGDM
Performance functionMSE
Number layers3
Inputs layer nodes number (layer 1)4
Hidden layer nodes number (layer 2)5
Output layer nodes number (layer 3)1
Activation function hidden layerTANSIG
Activation function output layerPURELIN
Epochs100
Table 10. Default settings of Multi-objective Evolutionary Algorithm based on Decomposition (MOEA/D).
Table 10. Default settings of Multi-objective Evolutionary Algorithm based on Decomposition (MOEA/D).
Experimental ParametersValue Setting
iterations100
Population size100
Optimizing parameter dimension4
Neighbor size26
Crossover rate2
Simulated binary crossover parameter2
Polynomial variation parameter5
Table 11. Test functions details of multi-objective optimization algorithm.
Table 11. Test functions details of multi-objective optimization algorithm.
Test FunctionObjective Function ExpressionConstraint ScopeCharacteristic
ContinuityConvexity
ZDT1 f 1 ( x ) = x 1 ; f 2 ( x ) = g ( 1 f 1 / g ) ; g ( x ) = 1 + 9 i = 2 n x i / ( n 1 ) [0,1]
ZDT2 f 1 ( x ) = x 1 ; f 2 ( x ) = g ( 1 ( f 1 / g ) 2 ) ; g ( x ) = 1 + 9 i = 2 n x i / ( n 1 ) [0,1]
ZDT3 f 1 ( x ) = x 1 ; f 2 ( x ) = g ( 1 f 1 / g ( f 1 / g ) sin ( 10 π f 1 ) ) ; g ( x ) = 1 + 9 i = 2 n x i / ( n 1 ) [0,1]
ZDT4 f 1 ( x ) = x 1 ; f 2 ( x ) = g ( x ) ( 1 f 1 / g ) ; g ( x ) = 91 + i = 2 n ( x i 2 10 cos ( 4 π x i ) ) x 1 [ 0 , 1 ] x 2 , , x n [ 5 , 5 ]
Table 12. Results of MOEA/D and compared algorithms Multi-objective Grey Wolf Optimization (MOGWO), Multi-objective Ant Lion Optimizer (MOALO) and Multi-objective Dragonfly Algorithm (MODA).
Table 12. Results of MOEA/D and compared algorithms Multi-objective Grey Wolf Optimization (MOGWO), Multi-objective Ant Lion Optimizer (MOALO) and Multi-objective Dragonfly Algorithm (MODA).
Test FunctionPopulationMOEA/DMOGWOMODA
IGDSpreadTimeIGDSpreadTimeIGDSpreadTime
ZDT11000.00130.57393.10970.01930.8866116.09570.02501.038420.4854
2000.00120.48516.17520.01300.9015245.12880.00191.549468.4797
3000.00090.71069.00260.01920.9386401.74270.00591.4796153.8985
5000.00060.551515.31530.02270.9087815.35930.00171.6583366.6784
ZDT21000.00140.87033.18830.01920.9375116.48610.01151.381823.7629
2000.00090.46006.04960.01480.9921240.52540.00461.482676.2996
3000.00080.61349.28650.03691.1163410.12650.00361.2181135.2485
5000.00060.574115.39780.02481.0642848.28620.00181.6289363.9572
ZDT31000.00130.52023.16620.02571.139781.44490.01371.336318.3297
2000.00080.54526.13860.02751.1883143.52050.01421.508957.9230
3000.00100.67839.08590.02841.2127206.65380.01441.4316116.3454
5000.00050.750415.58360.02861.1296351.91250.01411.3629301.9324
ZDT41000.00150.52183.16090.01650.9105118.77870.01071.122416.4013
2000.00080.44196.06500.02670.9716247.06660.00281.411066.0034
3000.00060.44759.14210.03271.0000406.45410.00261.4141335.4201
5000.00100.634415.36490.02371.0559858.89850.00531.3491344.9575
Table 13. Three different combination mechanisms.
Table 13. Three different combination mechanisms.
ForecastingIndexCombination MechanismSite 1Site 2Site 3Site 4
1-Step2-Step3-Step1-Step2-Step3-Step1-Step2-Step3-Step1-Step2-Step3-Step
Point forecasting indexMAPESimple average0.83252.61165.24870.88032.77805.44602.21546.817711.96861.36933.94876.9071
Linear combination0.75231.95063.06210.88222.27483.65322.29145.34077.22751.36813.05623.8838
Proposed nonlinear model0.65771.65401.73610.73731.85611.84132.07143.60633.70281.32882.11162.2401
MAESimple average72.5631228.5527455.727952.1120163.0551317.708933.6322104.0366181.636417.986652.363091.3075
Linear combination65.4888170.9959268.404852.3353133.6672210.505734.756581.4093110.498817.917240.323251.6062
Proposed nonlinear model56.7791143.5974151.652343.3423110.1146109.486530.961854.351454.737317.310927.145229.2987
RMSESimple average96.5555304.1764584.019966.3370219.8825420.320751.4845143.8811249.270923.879769.6797120.9681
Linear combination633.9588693.0493620.0500354.4644409.8790375.808551.6760112.2067147.128624.189454.321669.3954
Proposed nonlinear model80.1018192.1716200.294355.6332146.8969144.412744.254775.738772.093322.265135.142739.5900
IASimple average0.99830.98280.93330.99790.97710.91560.99220.93520.79240.99280.93140.7827
Linear combination0.99860.99040.96900.99780.98430.94570.99220.96140.92670.99270.96090.9287
Proposed nonlinear model0.99880.99320.99260.99850.98930.98970.99450.98280.98470.99390.98450.9806
DCSimple average0.94240.78420.63490.90580.73680.64020.84290.49470.43920.80630.64210.6085
Linear combination0.95290.81580.79370.90580.79470.68250.86910.58420.48680.84290.67890.6720
Proposed nonlinear model0.93720.84740.87300.93190.80000.83070.86910.74740.77200.83250.74740.7249
Interval forecasting indexFICPSimple average0.98960.73820.42110.98960.58640.43160.90630.52880.53160.80730.49210.3421
Linear combination0.98960.84290.66840.98960.75390.57370.90100.67020.43680.82810.58120.4579
Proposed nonlinear model0.98440.86390.84740.98960.79060.78950.93750.69630.70530.85940.70680.6947
FINAWSimple average0.13730.13970.12140.13720.12690.12980.13710.13430.12440.12610.12450.1293
Linear combination0.13810.13520.13030.13080.13520.12770.12970.00140.00230.13110.12440.1283
Proposed nonlinear model0.13330.13140.13120.13050.12460.12690.13190.13040.13000.13300.12750.1246
AWDSimple average0.00170.09180.38660.00050.14300.35380.04500.35760.74010.02590.37110.8820
Linear combination0.00140.02980.18330.00050.08090.22040.04150.24510.48240.02950.24050.4074
Proposed nonlinear model0.00050.03440.04200.00020.05870.05540.01710.10330.10550.01490.10420.1169
MPISimple average617.1405682.9169678.9992369.6635402.1000469.9174164.4780159.5414175.057777.976181.859199.3020
Linear combination633.9588693.0493620.0500354.4644409.8790375.8085163.5420163.5335139.865484.147085.524878.2974
Proposed nonlinear model573.6699584.0110577.3195356.9044333.2484355.3731163.1146143.1301154.202479.908178.439179.3207

Share and Cite

MDPI and ACS Style

Jiang, P.; Nie, Y. A Hybrid Double Forecasting System of Short Term Power Load Based on Swarm Intelligence and Nonlinear Integration Mechanism. Appl. Sci. 2020, 10, 1550. https://0-doi-org.brum.beds.ac.uk/10.3390/app10041550

AMA Style

Jiang P, Nie Y. A Hybrid Double Forecasting System of Short Term Power Load Based on Swarm Intelligence and Nonlinear Integration Mechanism. Applied Sciences. 2020; 10(4):1550. https://0-doi-org.brum.beds.ac.uk/10.3390/app10041550

Chicago/Turabian Style

Jiang, Ping, and Ying Nie. 2020. "A Hybrid Double Forecasting System of Short Term Power Load Based on Swarm Intelligence and Nonlinear Integration Mechanism" Applied Sciences 10, no. 4: 1550. https://0-doi-org.brum.beds.ac.uk/10.3390/app10041550

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop